




Preview text:
PHẦN 1: BLAST
1. TÌM KIẾM TRÌNH TỰ TƯƠNG ĐỒNG VỚI CÁC TRÌNH TỰ BÊN DƯỚI: NM_001083955.1
WP_015138959.1 XM_003094848.1 1CF7_D NM_001033981.3 EU283339.1 FN543431.1 1FKA_A NM_175000.2 XM_003118058.1 AI612609.1 GI: 2300735 M11886 U13699 U09579 L29511
2. Ứng dụng tin sinh học với mục tiêu bên dưới:
a) Dùng công cụ BLAST tìm kiếm trình tự tương đồng phù hợp với thông tin của CAA86734 là gen gì?
b) Dùng công cụ BLAST tìm kiếm trình tự AJ427289 có bao nhiêu acid amin?
3. Một gen/protein được quản lý bằng mã số như sau: NP_001004376.1
a) Tìm kiếm 2 bài báo liên quan đến trình tự gen của bạn được xuất bản trong
năm 2021 trên Pubmed và Sciencedirect.
b) Tìm kiếm trình tự nucleotide và protein theo mã số định dạng FASTA.
c) Tìm kiếm các trình tự tương đồng với trình tự gen của bạn.
4. Dùng BLAST tìm hiểu về gen hay protein có trình tự tương đồng cao nhất với các sequences sau: SEQ1:
AGCACGGGTGCAACCATGGTGCTGTCCGCTGCTGACAAGAACAACGTCAAGGGCATCTTCACCAAAATCG
CCGGCCATGCTGAGGAGTATGGCGCCGAGACCCTGGAAAGGATGTTCACCACCTACCCCCCAACCAAGAC
CTACTTCCCCCACTTCGATCTGTCACACGGCTCCGCTCAGATCAAGGGGCACGGCAAGAAGGTAGTGGCT
GCCTTGATCGAGGCTGCCAACCACATTGATGACATCGCCGGCACCCTCTCCAAGCTCAGCGACCTCCATG
CCCACAAGCTCCGCGTGGACCCTGTCAACTTCAAACTCCTGGGCCAATGCTTCCTGGTGGTGGTGGCCAT
CCACCACCCTGCTGCCCTGACCCCGGAGGTCCATGCTTCCCTGGACAAGTTCTTGTGCGCCGTGGGCACT
GTGCTGACCGCCAAGTACCGTTAAGACGGCACGGTGGCTAGAGCTGGGGCCAACCCATCGCCAGCCCTCC
GACAGCGAGCAGCCAAATGAGATGAAATAAAATCTGTTGCATTTGTGCTCC SEQ2:
TCTGAGAATGGAGCACCTAGTATTGAAAGTCCTTGCTTTTGACTTGGCTGCACCAACAGTAAATCAGTTC
CTTACCCAGTACTTCCTGCACCTGCAGCCTGCAAACTGTAAGGTTGAAAGCTTAGCAATGTTTTTGGGAG
AACTGAGTTTGATAGATGCTGACCCGTACCTTAAGTACCTGCCTTCACTCATTGCTGGAGCTGCCTTCCA
CTTGGCTCTCTACACAGTCACAGGACAGAGCTGGCCTGAGTCATTGGCACAACAGACTGGATATACCCTG
GAGAGTCTTAAGCCTTGTCTTGTGGACCTTCACCAGACCTACCTCAAAGCGCCACAACATGCCCAACAGT
CAATACGGGAAAAGTACAAGCATTCAAAATATCACAGTGTTTCTCTTCTCAACCCACCAGAGACACTAAG
TGTGTGAGTGAAAGACTGCCGGCTTTGTTTGAAACAGGAGTCGCTCGGAGTCCATGCTGTACAGGTTTTA
TGTCGGGTTTTAAGTTCACAATCACTTCTGAATGTAGATGGTATAGCACAGAC SEQ3: NGCCGTAGGGCCCTACGGCT SEQ4:
GACAGAGTCACCATCACTTGCCGGGCCAGTCAGAGTATTAGTAGCTGGTTGGCCTGGTATCAGCAGAAAC
CAGGGAAAGCCCCTAAGCTCCTGATCTATAAGGCGTCTAGTTTAGAAAGTGGGGTCCCATCAAGGTTCAG
CGGCAGTGGATCTGGGACAGAATTCACTCTCACCATCAGCAGCCTGCAGCCTGATGATTTTGCAACTTAT
TACTGCCAACAGTATAATAGTTATCCGGACGTTC SEQ5:
CTACCAGTCTCGCGCCATAGCACTTAACCTCTAGACTACTGAGCCGGCAGGCATCCAACGG
TGTTAATCTCTAGGAAGGTAATACGTTTTACCCATTAATGGCAACATGGTTATTCCTCGTTTATGGGACT
TTTTCACGATTACCATCCGTGTGTACATTTCAAGGTATCTGTGGGTAAATGAAAGACTTGTAATTAGAAA
AACTTCACTTTGTTCCACTAATCCTGTTTTGCTTTGTTAAAAGATTAATAATCTGAGGTAACTGCCGAAG
GTTTGATCCAATATTTCAGAAAAAGATCATTTATGAAGAGCTAGTTTTTTTGACATTATCATTAGTTATA
CCGATTGATAAAGTATTTCCCAATCAATACACAAATATTGCAAATTAATGTCTCTTAATGGACGAAGCGT
AAATTTGGCAGCAAAGTTACCATACAATACATCTTAGGTAATCTGATAACGTTCCCAATATACTAAGGTG
AAAGATTATGAAGGTCATCGTGCCTGAGGAGGCAGAAACATTACTGGCATATGACCAGTTTTTTTTACTG
CGTGTAATCCATATCTACGTAACTAAAACACAAGGTTTAATCAATTGAGACCTAAAATTCAATCAATCAC
GATAGTCTCTCAAGGTTACATAGTTACGGTCAGAATGTTGATTGGTCGATTTGAATTGAGCCAATAAGCT
TGTATTGAGAGTTAGACACATCTGTATGGCTGGCCATGCTTTCCATAGTTTTCTCCTGTACGTATAAAGT
GCTTAAATTTAGAGGACGCATTGATCTCATCGTCTTAAGTTACTTGGATTTCAAATTGTTTACATCCTTT
TAATTAAACGAATATGTTGTTACCAAATCAACCTGCTCCTGATTTTGAAGGTACTGCTGTTATTGGCACG
GAATTACGTCCAATTAGTTTGAGTCAATTTCAAGGAAAATATGTGTTACTGGTATTTTATCCACTTGACT
TGTAAGTCAAATGTACTTTTCATTTCTCAACAATTTTACAATACTTCACGTAATTAAATTAACACACTAA
CGTTTCCTTCAGCACTTTTGTTTGTCCCACGGAACTAATTGCATTCAGTGAAAGAGCTGCTGAGTTTCAA
TCTAGAGGATGTCAAGTAATCGCATGCTCAACTGATTCAGTTTATGCTCATTTGGCATGGACAAAATTGG
ATCGCAAAGCTGGTGGTTTGGGGCAAATGAATATACCTTTGCTGTCCGATAAAAACCTAAGGATATCACG
AGCGTACGAGGTTCTTGACGAACAGGAAGGTCATGCATTCAGGTTCGTTTTATTAATTGATGAACACATT
TTCTGCTATATTTATCATGTTGCAAATGACACGCATGTGTGTTATCATATTACAAAAGTCTAACGTCGTG
TCATACTAAGTAGTGTTGGTAAGCCTCGAATTTTCAACCCGAATATTGTATACGGCTCAGTAGTTAGAGC
ATACTATTTATTCAGTTGGTATGTTGAATAGCTATTCATTCAACTAGGTATTCGTTGACCATATAGGTCA
ACATGATGTTGGATGCACTTGTAATAGTCACTTATTTTCCAACACCATTGCATAACTAATTAGTTAGTTA
CCTTTCACTTGGGTTTTTCGATAGCGTTCCAGACAGTCAT SEQ6:
ATTGAAGTAACGCTACTCCGGGACAATCTTTAGGAGTCAGGATGGCTGAAGACATCAAGACTAAAATCAA
GAACTACAAAACTGCCCCCTTTGACAGCCGCTTCCCCAACCAGAACCAGACTAAGAACTGTTGGCAGAAC
TACCTGGACTTCCACCGCTGTGAGAAGGCAATGACGGCCAAGGGGGGTGATGTCTCCGTGTGTGAGTGGT
ACCGGCGTGTGTACAAGTCCCTCTGTCCCGTGTCATGGGTCTCAGCCTGGGATGACCGCATAGCTGAAGG
CACATTTCCTGGGAAGATCTGACCTGGCTCCGCCACCTCTCCTCTGTTCTTTGTCTTTCTCCCCGGATAG
AAAAGGGGGACCTCAGCATATGATGGTCCTTACCCTGGGACCCTGAATCATGATGCAACTACTAATA SEQ7:
GCTGAACCCAGTGATTTTCTGGCTCAGTTTCCTGAAGTCCCTTGTCCCAGTTGAA
GAACCCATAGCCTTCGGTGGCAAGCTGAAGAACCCACTCCAAGTTGTCCTGGTGGCCACCCACGCTGACA
TCATGAATGTTCCTCGACCGGCTGGAGGCGAGTTTGGATATGACAAAGACACATCGTTGCTGAAAGAGAT
TAGGAACAGGTTTGGAAATGATCTTCACATTTCAAATAAGCTGTTTGTTCTGGATGCTGGGGCTTCTGGG
TCAAAGGACATGAAGGTACTTCGAAATCATCTGCAAGAAATACGAAGCCAGATTGTTTCGGTCTGTCCTC
CCATGACTCACCTGTGTGAGAAAATCATCTCCACGCTGCCTTCCTGGAGGAAGCTCAATGGACCCAACCA
GCTGATGTCGCTGCAGCAGTTTGTGTACGACGTGCAGGACCAGCTGAACCCCCTGGCCAGCGAGGAGGAC
CTCAGGCGCATTGCTCAGCAGCTCCACAGCACAGGCGAGATCAACATCATGCAAAGTGAAACAGTTCAGG
ACGTGCTGCTCCTGGACCCCCGCTGGCTCTGCACAAACGTCCTGGGGAAGTTGCTGTCCGTGGAGACCCC
ACGGGCGCTGCACCACTACCGGGGCCGCTACACCGTGGAGGACATCCAGCGCCTGGTGCCCGACAGCGAC
GTGGAGGAGCTGCTGCAGATCCTCGATGCCATGGACATCTGCGCCCGGGACCTGAGCAGCGGGACCATGG
TGGACGTCCCAGCCCTGATCAAGACAGACAACCTGCACCGCTCCTGGGCTGATGAGGAGGACGAGGTGAT
GGTGTATGGTGGCGTGCGCATCGTGCCCGTGGAACACCTCACCCCCTTCCCATGTGGCATCTTTCACAAG
GTCCAGGTGAACCTGTGCCGGTGGATCCACCAGCAAAGCACAGAGGGCGACGCGGACATCCGCCTGTGGG
TGAATGGCTGCAAGCTGGCCAACCGTGGGGCCGAGCTGCTGGTGCTGCTGGTCAACCACGGCCAGGGCAT
TGAGGTCCAGGTCCGTGGCCTGGAGACGGAGAAGATCAAGTGCTGCCTGCTGCTGGACTCGGTGTGCAGC
ACCATTGAGAACGTCATGGCCACCACGCTGCCAGGGCTCCTGACCGTGAAGCATTACCTGAGCCCCCAGC
AGCTGCGGGAGCACCATGAGCCCGTCATGATCTACCAGCCACGGGACTTCTTCCGGGCACAGACTCTGAA
GGAAACCTCACTGACCAACACCATGGGGGGGTACAAGGAAAGCTTCAGCAGCATCATGTGCTTCGGGTGT
CACGACGTCTACTCACAGGCCAGCCTCGGC SEQ8:
TGTAACGTGGTGGCCATCCCTGGGAATGCAAGCAGGGATGCAGTCTGCACGTCCAC
GTCCCCCACCCGGAGTATGGCCCCAGGGGCAGTACACTTACCCCAGCCAGTGTCCACACGATCCCAACAC
ACGCAGCCAACTCCAGAACCCAGCACTGCTCCAAGCACCTCCTTCCTGCTCCCAATGGGCCCCAGCCCCC
CAGCTGAAGGGAGCACTGGCGACTTCGCTCTTCCAGTTGGACTGATTGTGGGTGTGACAGCCTTGGGTCT
ACTAATAATAGGAGTGGTGAACTGTGTCATCATGACCCAGGTGAAAAAGAAGCCCTTGTGCCTGCAGAGA
GAAGCCAAGGTGCCTCACTTGCCTGCCGATAAGGCCCGGGGTACACAGGGCCCCGAGCAGCAGCACCTGC
TGATCACAGCGCCGAGCTCCAGCAGCAGCTCCCTGGAGAGCTCGGCCAGTGCGTTGGACAGAAGGGCGCC
CACTCGGAACCAGCCACAGGCACCAGGCGTGGAGGCCAGTGGGGCCGGGGAGGCCCGGGCCAGCACCGGG
AGCTCAGATTCTTCCCCTGGTGGCCATGGGACCCAGGTCAATGTCACCTGCATCGTGAACGTCTGTAGCA
GCTCTGACCACAGCTCACAGTGCTCCTCCCAAGCCAGCTCCACAATGGGAGACACAGATTCCAGCCCCTC
GGAGTCCCCGAAGGACGAGCAGGTCCCCTTCTCCAAGGAGGAATGTGCCTTTCGGTCACAGCTGGAGACG
CCAGAGACCCTGCTGGGGAGCACCGAAGAGAAGCCCCTGCCCCTTGGAGTGCCTGATGCTGGGATGAAGC
CCAGTTAACCAGGCCGGTGTGGGCTGTGTCGTAGCCAAGGTGGGCTGAGCCCTGGCAGGATGACCCTGCG
AAGGGGCCCTGGTCCTTCCAGGCCCCCACCACTAGGACTCTGAGGCTCTTTCTGGGCCAAGTTCCTCTAG
TGCCCTCCACAGCCGCAGCCTCCCTCTGACCTGCAGGCCAAGAGCAGAGGCAGCGGGTTGTGGAAAGCCT
CTGCTGCCATGGTGTGTCCCTCTCGGAAGGCTGGCTGGGCATGGACGTTCGGGGCATGCTGGGGCAAGTC
CCTGACTCTCTGTGACCTGCCCCGCCCAGCTGCACCTGCCAGCCTGGCTTCTGGAGCCCTTGGGTTTTTT
GTTTGTTTGTTTGTTTGTTTGTTTGTTTCTCCCCCTGGGCTCTGCCCCAGCTCTGGCTTCCAGAAAACCC CAGCATCCTTTTCTGCAGAGGGGCTTT SEQ9:
TTAAAAGCTTTATACAATAACGATTGAGTGATTATAAGAGCTGGCGGGGGAATGTTAAGAGGATG
ATAGGGAGCTAAGTTTAACAGAACAATTCACCTCTTTATCTTGTGACACCTACGAGCGCATCAATTCTGT
AATTGAAAAATAAAGTGCATATTTGCAGCAGCTGTACTCTCTTCAGGCTGCAAGGAGGCTTTTCCTCCCG
GTAGGCTTGATTTGCATTTCACTTTCACTTTCGTGGCTGGAAACTTTCTACCCACGTAGTGAGGCTAGAG
GAGCCACCTAAAGCTGGGGCTTGACGAAGCCGGGACCGGGACCCGATCTCCACATATGCCCGGACTTCTT
CTGCGGCCGGGTTCAGGAGTCAAAGAGGCGGGGAGACCTGCGCGACGCTGCCCCGCCCTGCGCCCGCTTC
CTCCAATGTATGCTCTAGGGGGCGGGCCTCGCGGGGAGCATGGACACGATTGGCCCTAAAGTCTTCCCCG
CAAGGCCGTGGGCTGGACAGCGTGGTGACGTCGCAACGCGGCGCAGGGTGAGAGCGCGCGCTTGCGGACG
CGGCGGCATTAAACGGTTGCAGGCGTAGAGAGTGGTCGTTGTCTTTCTAGGTCTCAGCCGGTCGTCGCGA
CGTTCGCCCGCTCGCTCTGAGGCTCCTGAAGCCGAAACTAGCTAGACTTTCCTCCTTCCCGCCTGCCTGT
AGCGGCGTTGTTGCCACTCCGCCACCATGTTCGAGGCGCGCCTGGTCCAGGGCTCCATCCTCAAGAAGGT
GTTGGAGGCACTCAAGGACCTCATCAACGAGGCCTGCTGGGATATTAGCTCCAGCGGTGTAAACCTGCAG
AGCATGGACTCGTCCCACGTCTCTTTGGTGCAGCTCACCCTGCGGTCTGAGGGCTTCGACACCTACCGCT
GCGACCGCAACCTGGCCATGGGCGTGAACCTCACCAGGTGAGCCTCGCGCCCCGGGAAGCCGCCCCGGCC
CGCCTGCACCTCCGGCTGTGGCGAGCGCTTCGAGCCTAGCCCTCATTGGCTGGCGTGGGCATCCAGAGCT
TCTCATTGGCCTGCACGCAGTGGTGGGGCCCAAGCTGAGATGAGCGGTTACGGAAAAGCCCGCGCTGGCT
GCTGCGCGAACCTGCTTTTTCGCGCCAAAGTCACAAAGCGGGTGGTGGCGGGAAAATCAAGGGTTTTTCC
GCAGTGCCAGGAACACTGTTCCAGGGACTCTTTGCTCACTAAACCTGTTGGCCTTGAATGGACGCTTTAG
CTGTGGCTTTCTTGTTTCTGAGACGGTCTCGGTCTCGGTGTGTTGCCCGGGCTGGTCTCCAACTTCTGGG
CTCAAGCGATCCTCCCGGCTCAGTCGCGTCGACTTTAAATGCTTTATAATGCCCTTGCGAGAAATGTGGC
AGCCTGTCATCCTACTTAGTGGTAGGAGATTGTTTCTATCCAGAAGGGACACTGCTGGTGGTATTTTAGT ATAAATACTGCCAGATGCGTCCAAAACG SEQ10:
GTTTGAGGGCTGGCGCTGTGTGGACCGTGACTTCTGCGCCAACATCCTCAGCGCCG
AGAGCAGCGACTCCGAGGGGTTTGTGATCCACGACGGCGAGTGCATGCAGGAGTGCCCCTCGGGCTTCAT
CCGCAACGGCAGCCAGAGCATGTACTGCATCCCTTGTGAAGGTCCTTGCCCGAAGGTCTGTGAGGAAGAA
AAGAAAACAAAGACCATTGATTCTGTTACTTCTGCTCAGATGCTCCAAGGATGCACCATCTTCAAGGGCA
ATTTGCTCATTAACATCCGACGGGGGAATAACATTGCTTCAGAGCTGGAGAACTTCATGGGGCTCATCGA
GGTGGTGACGGGCTACGTGAAGATCCGCCATTCTCATGCCTTGGTCTCCTTGTCCTTCCTAAAAAACCTT
CGCCTCATCCTAGGAGAGGAGCAGCTAGAAGGGAATTACTCCTTCTACGTCCTCGACAACCAGAACTTGC
AGCAACTGTGGGACTGGGACCACCGCAACCTGACCATCAAAGCAGGGAAAATGTACTTTGCTTTCAATCC
CAAATTATGTGTTTCCGAAATTTACCGCATGGAGGAAGTGACGGGGACTAAAGGGCGCCAAAGCAAAGGG
GACATAAACACCAGGAACAACGGGGAGAGAGCCTCCTGTGAAAGTGACGTCCTGCATTTCACCTCCACCA
CCACGTCGAAGAATCGCATCATCATAACCTGGCACCGGTACCGGCCCCCTGACTACAGGGATCTCATCAG
CTTCACCGTTTACTACAAGGAAGCACCCTTTAAGAATGTCACAGAGTATGATGGGCAGGATGCCTGCGGC
TCCAACAGCTGGAACATGGTGGACGTGGACCTCCCGCCCAACAAGGACGTGGAGCCCGGCATCTTACTAC
ATGGGCTGAAGCCCTGGACTCAGTACGCCGTTTACGTCAAGGCTGTGACCCTCACCATGGTGGAGAACGA
CCATATCCGTGGGGCCAAGAGTGAGATCTTGTACATTCGCACCAATGCTTCAGTTCCTTCCATTCCCTTG
GACGTTCTTTCAGCATCGAACTCCTCTTCTCAGTTAATCGTGAAGTGGAACCCTCCCTCTCTGCCCAACG
GCAACCTGAGTTACTACATTGTGCGCTGGCAGCGGCAGCCTCAGGACGGCTACCTTTACCGGCACAATTA
CTGCTCCAAAGACAAAATCCCCATCAGGAAGTATGCCGACGGCACCATCGACATTGAGGAGGTCACAGAG
AACCCCAAGACTGAGGTGTGTGGTGGGGAGAAAGGGCCTTGCTGCGCCTGCCCCAAAACTGAAGCCGAGA
AGCAGGCCGAGAAGGAGGAGGCTGAATACCGCAAAGTCTTTGAGAATTTCCTGCACAACTCCATCTTCGT
GCCCAGACCTGAAAGGAAGCGGAGAGATGTCATGCAAGTGGCCAACACCACCATGTCCAGCCGAAGCAGG
AACACCACGGCCGCAGACACCTACAACATCACCGACCCGGAAGAGCTGGAGACAGAGTACCCTTTCTTTG
AGAGCAGAGTGGATAACAAGGAGAGAACTGTCATTTCTAACCTTCGGCCTTTCACATTGTACCGCATCGA
TATCCACAGCTGCAACCACGAGGCTGAGAAGCTGGGCTGCAGCGCCTCCAACTTCGTCTTTGCAAGGACT
ATGCCCGCAGAAGGAGCAGATGACATTCCTGGGCCAGTGACCTGGGAGCCAAGGCCTGAAAACTCCATCT
TTTTAAAGTGGCCGGAACCTGAGAATCCCAATGGATTGATTCTAATGTATGAAATAAAATACGGATCACA
AGTTGAGGATCAGCGAGAATGTGTGTCCAGACAGGAATACAGGAAGTATGGAGGGGCCAAGCTAAACCGG
CTAAACCCGGGGAACTACACAGCCCGGATTCAGGCCACATCTCTCTCTGGGAATGGGTCGTGGACAGATC
CTGTGTTCTTCTATGTCCAGGCCAAAACAGGATATGAAAACTTCATCCATCTGATCATCGCTCTGCCCGT
CGCTGTCCTGTTGATCGTGGGAGGGTTGGTGATTATGCTGTACGTCTTCCATAGAAAGAGAAATAACAGC
AGGCTGGGGAATGGAGTGCTGTATGCCTCTGTGAACCCGGAGTACTTCAGCGCTGCTGATGTGTACGTTC
CTGATGAGTGGGAGGTGGCTCGGGAGAAGATCACCATGAGCCGGGAACTTGGGCAGGGGTCGTTTGGGAT
GGTCTATGAAGGAGTTGCCAAGGGTGTGGTGAAAGATGAACCTGAAACCAGAGTGGCCATTAAAACAGTG
AACGAGGCCGCAAGCATGCGTGAGAGGATTGAGTTTCTCAACGAAGCTTCTGTGATGAAGGAGTTCAATT
GTCACCATGTGGTGCGATTGCTGGGTGTGGTGTCCCAAGGCCAGCCA