Aceofspades25

A HERV-K ERV that originated in the common ancestor of humans and chimpanzees

Feb 18th, 2015
  1. A HERV-K ERV that originated in the common ancestor of humans and chimpanzees
  2.  
  3. ATTAT <- TSR (part of the original sequence that was duplicated upon insertion)
  4.  
  5. GTGGGGAAAAGCAAGAGAGATCAGATTGTCACTGTGTCTGTGTAGAAAGAAGTAGACATG <- 3' LTR
  6. GGAGACTCCATTTTGTTATGTACTAAGAAAAATTCTTCTGCCTTGAGATTCTGTGACCTT
  7. ACCCCCAACCCCGTGCTCTCTGAAACATGTGCTGTGTCAACTCAGAGTTAAATGGATTAA
  8. GGGCGGTGCAAGATGTGCTTTGTTAAACAGATGCTTGAAGGCAGCATGCTCCTTAAGAGT
  9. CATCACCACTCCCTAATCTCAAGTACCCAGGGACACAAAAACTGCGGAAGGCCGCAGGGA
  10. CCTCTGCCTAGGAAAGCCAGATATTGTCCAAGCTTTCTCCCCATGTGATAGTCTGAAATA
  11. CGGCCTCGTGGGAAGGGAAAGACCTGACCGTCCCCCAGCCCGACACCCGTAAAGGGTCTG
  12. TGCTGAGGAGGATTAGTATAAGAGGAAGGCATACCTCTTGCAGTTGAGACAAGAGGAAGG
  13. CATTGGTCTCCTGCCCGTCCCTGGGCAATGGAATGTCTCGGTATAAAACCCGATTGTACG
  14. TTCCATCTACTGAGATAGGGAAAAACCGCCTTAGGGCTGGAGGTGGGACATGCAGGCAGC
  15. AATACTGCTTTGTAAAGCATTGAGATGTTTATGTGTATGCATATCTAAAAGCACAGCACT
  16. TAATCCTTTACCTTGTCTATGATGCAAAGACCTTTGTTCACGTGTTTGTCTGCTGACCCT
  17. CTCCCCACAATTGTCTTGTGACCCTGACACATCCCCCTCTCGGAGAAACACCCACGAATG
  18. ATCAATAAATACTAAGGGAACTCAGAGGCTGGCGGGATCCTCCATATGCTAAACGCTGGT
  19. CCCCCGGGTCCCCTTATTTCTTTCTCTATACTTTGTCTCTGTGTCTCTTTCTTTCCTAAG
  20. TCTCTTGTTCCACCTTACGAGAAACACCCACAGGTGTGGAGGGGCAACCCACCCCTTCAT
  21.  
  22. TCTGGTGCCCAACGTGGAGGCTTTTCTCTAGGGTGAAGGTACGCTCGAGCGTGGTCATTG <- Protein coding sequences start here
  23. AGGACAAGTCGATGAGAGATCCCGAGTACCTCTACAGTCAGCCGTACGGTAAGCTTGTGC
  24. GCTCGGAAGAAGCTAGGGTGATAATGGGGCAAACTAAAAGTAAAACTAAAAGTAAATATG
  25. CCTCTTATCTCAGCTTTATTAAAATTCTTTTAAAAAGAGGGGAAGTTAAAGTATCTACAA
  26. AAAATCTAATCAAGCTATTTCAAATAATAGAACAATTTTGCCCATGGTTTCCAGAACAAG
  27. GAACTTTAGATCTAAAAGATTGGAAAAGAATTGGTAAGGAACTAAAACAAGCAGGTAGGA
  28. AGGGTAATATCATTCCACTTACAGTATGGAATGATTGGGCCACTATTAAAGCAGCTTTAG
  29. AACCATTTCAAACAGAAGAAGATAGCGTTTCAGTTTCTGATGCCCCTGGAAGCTGTATAA
  30. TAGATTGTAATGAAAAGACAAGGAAAAAATCCCAGAAAGAGACGGAAAGTTTACATTGCG
  31. AATATGTAGCAGAGCCGGTAATGGCTCATTCAACGCAAAATGTTGACTATAATCAATTAC
  32. AGGAAGTGATATATCCTGAAACGTTAAAATTAGAAGGAAAAGGTCCAGAATTAGTGGGGC
  33. CATCAGAGTCTAAACCACGAGGGCCAAGTCCTCTTCCAGCAGGTCAGGTGCCCGTAACAT
  34. TATAACCTCAAACGCAGGTTAAAGAAAATAAGACCCAACCGCCAGTAGTTTATCAATACT
  35. GGCCACCGGCTGAACTTCAGTATCGGCCACCCCCAGAAAGTCAGTATGGATATCCAGGAA
  36. TGCCCCCAGCACCACAGGGCAGGGCGCCATACCCTCAGCCACCCACTAGGAGACTTAATC
  37. CTACGGCACCACCTAGTAGACAGGGTAGTGAATTACATGAAATTATTGATAAATCAAGAA
  38. AGGAAGGAGATACTGAGGCGTGGCAATTCCCAGTAATGTTAGAACCGATGCCACCTGGAG
  39. AAGAAGCCCAAGAGGGAGAGCCTCTCACAGTTGAGGCCAGATACAAGTCTTTTTCGATAA
  40. AAATGCTAAAAGATATGAAGGAGGGAGTAAAACAGTATGGACCCAACTCCCCTTATATGA
  41. GGACATTATTAGATTCCACTGCTCATGGACATAGACTCATTCCTTATGATTGGGAGATTC
  42. TGGCAAAATCGTCTCTCTCACCCTCTCAATTTTTACAATTTAAGACTTGGTGGATTGATG
  43. GGGTACAAGAACAGGTCCGAAGAAATAGGGCTGCCAATCCTCCAGTTAACATAGATGCAG
  44. ATCAACTATTAGGAACAGGTCAAAATTGGAGTACTATTAGTCAACAAGCATTAATGCAAA
  45. ATGAGGCCATTGAGCAAGTTAGAGCTATCCGCCTTAGAGCCTGGGAAAAAATCCAAGACC
  46. CAGGAAGCGCCTGCCCCTCATTTAATACAGTAAGACAAGGTTCAAAAGAGCCCTACCCTG
  47. ATTTTGTGGCAAGGCTCCAAGATGTTGCTCAAAAGTCAATTGCCAATGAAAAAGCCCGTA
  48. AGGTCATAGTGGAGTTGATGGCATATGAAAACGCCAATCCTGAGTGTCAATCAGCCATTA
  49. AGCCATTAAAAGGAAAGGTTCCTGCAGGATCAGATGTAATCTCAGAATACGTAAAAGCAT
  50. GTGATGGAATCGGAGGAGCTATGCATAAAGCTATGCTTATGGCTCAAGCAATAACAGGAG
  51. TTGCTTTAGGAGGACAAGTTAGAACATTTGGAGGAAAATGTTATAATTGTGGTCAAATTG
  52. GTCATTTAAAAAAGAATTGCCCAGTCTTAAATAAACAGAATATAACTACTCAAGCTACTA
  53. CCACAACAGGTAGAGAGCCACCTGACTTATGTCCAAGATGTAAAAAAGGAAAACATTGGG
  54. CTAGTCAATGTCGTTCTAAATTTGACAAAAATGGGAAACCATTGTCAGGAAACGAGCAAA
  55. GGGGCCAGCCTCAGGCCCCACAACAAACTGGGGCATTCCCAATTCAGCCATTTGTTCCTC
  56. AGGGTTTTCAGGGACAACAACCCCCACTGTCCCAAGTGTTTCAGGGAATAAGCCAGTTAC
  57. CACAATACAACAATTGTCCCCCGCCACAAGCGGCAGTGCAGCAGTAGATTTATGTACTAT
  58. ACAAGCAGTCTCTCTGCTTCCAGGGGAGCCCCCGCAAAAAATCCCCACAGGGGTATATGG
  59. CCCACTGCCTGAGGGGACTGTAGGACTAATCTTGGGAAGATCAAGTCTAAATCTAAAAGG
  60. AGTTCAAATTCATACTGGTGTGGTTGATTCAGACTATAAAGGCGAAATTCAATTGGTTAT
  61. TAGCTCTTCAATTCCTTGGAGTGCCAGTCCAGGAGACAGGATTGCTCAATTATTACTCCT
  62. GCCATATATTAAGGTTGGAAATAGTGAAATAAAAAGAACAGGAGGGTTTGGAAGCACTGA
  63. TCCGACAGGAAAGGCTGCATATTGGGCAAGTCAGGTCTCAAAGAACAGACGTGTGTAAGG
  64. CCATTATTCAAGGAAAACAGTTTGAAGGGTTGGTAGACACTGGAGCAGATGTCTCTATCA
  65. TTGCTTTAAATCAGTGGCCAAAAAATTGGCCTAAACAAAAGGCTGTTACAGGACTTGTCG
  66. GCATAGGCACAGCCTCAGAAGTGTATCAAAGTACTGAGATTTTACATTGCTTAGGGCCAG
  67. ATAAGAAAGTACTGTTCAGCCAATGATTACTTCAATTCCTCTTAATCTGTGGGGTCGAAA
  68. TTTATTACAACAATGGGGTGCGGAAATCACCATGCCTGCTCCATTATATAGCCCCACGAG
  69. TCAAAAAATCATGACCAAGATGGGATATATACCAGGAAAGGGACTAGGAAAAAATGAAGA
  70. TGGCATTAAAGTTCCAGTTGAGGCTAAAATAAATCAAGAAAGAGAAGGAATAGGGTATCC
  71. TTTTTAGGGGCGGCCACTGTAGAGCCTCCTAAACCCATACCATTAACTTGGAAAACAGAA
  72. AAACCGGTGTGGGTAAATCAGTGGCCTCTACCAAAACAAAAACTGGAGGCTTTACATTTA
  73. TTAGCAAATGAACAGTTAGAAAAGGGTCATATTGAGCCTTCATTCTCGCCTTGGAATTCT
  74. CCTGTGTTTGTAATTCAGAAGAAATCAGGCAAATGGCGTATGTTAACTGACTTTAGGGCC
  75. GTAAACGCCGTAATTCAACCCATGGGGCCTCTCCAACCTGGGTTGCCCTCTCTGGCCATG
  76. ATCCCAAAAGACTGGCCTTTAATTATAATTGATCTAAAGGATTGCTTTTTTACCATCCCT
  77. CTGGCGGAGCAGGATTGCGAAAAATTTGCCTTTACTATACCAGCCATAAATAATAAAGAA
  78. CCAGCCACCAGGTTTCAGTGGAAAGTGTTACCTCAGGGAATGCTTACTAGTCCAACTATT
  79. TGTCAGACTTTTGTAGGTCGAGCTCTTCAAACAGTTAGAGACAAGTTTTCAGACTGTTAT
  80. ATTATTCATTATATTGATGATATTTTATGTGCTGCAGAAACGAGAGATAAATTAATTGAC
  81. TGTTACACATTTCTGCAAGCAGAGGTTGCCAACGCAGGACTGGCAATAGCATCTGATAAG
  82. ATCCAAACCTCTACTCCTTTTCATTATTTAGGGATGCAGATAGAAAATAGAAAAATTAAG
  83. CCACAAAAAATAGAAATAAGAAAAGACACATTAAAAACACTAAATGATTTTCAAAAATTG
  84. CTGGGAGATATTAATTGGATTCGGCCAACTCTAGGCATTCCTACTTATGCCACGTCAAAT
  85. TTGTTCTCTATCTTAAGAGGAGACTCAGACTTAAATAGTAAAAGAATGTTAACCCCAGAG
  86. GCAACAAAAGAAATTAAATTAGTGGAAGAAAAAATTCAGTCAGCACAAATAAATAGAATA
  87. GATCCCTTAGCCCCACTCCAACTTTTGATTTTTGCCACTGCACATTCTCCAACAGGCATC
  88. ATTATTCAAAATACTGATCTTGTGGAGTGGTCATTCCCTCCTCACAGTACAGTTAAGACT
  89. TTTACATTGTACTTGGATCAAATAGCTACATTAATTGGTCAGACAAGATTACGAATAATA
  90. AAATTATGTGGCAATGACCCAGACAAAATAGTTGTCCCTTTAACCAAGGAACAAGTTAGA
  91. CAAGGCTTTATCAATTCTGGTGCATGGCAGATTGGTCTTGCTAATTTTGTGGGAATTATT
  92. GATAATCATTACCCAAAAACAAAAATCTTCCAGTTCTTAAAATTGACTACTTGGATTCTA
  93. CCTAAAATTACCAGACGTGAACCTTTAGAAAATGCTCTAACAGTATTTACTGATGGTTCC
  94. AGCAATGGAAAAGCGGCTTACACAGGGCCGAAAGAACGAGTAATCAAAACTCCGTATCAA
  95. TCGGCTCAAAGAGCAGAGTTGGTTGCAGTCATTACAGTGTTACAAGATTTTGACCAACCT
  96. ATCAATATTATATCAGATTCTGCATATGTAGTACAGGCTACAAGGGATGTTGAGACAGCT
  97. CTAATTAAATATAGCATGGATGATCAGTTAAACCAGCTATTCAATTTATTACAACAAACT
  98. GTAAGAAAAAGAAATTTCCCATTTTATATTACTCATATTCGAGCACACACTAATTTACCA
  99. GGGCCTTTGACTAAAGCAAATGAACAAGCTGACTTACTGGTATCATCTGCATTCATAAAA
  100. GCACAAGAACTTCATGCTTTGACTCACGTAAATGCAGCAGGATTAAAAAACAAATTTGAT
  101. GTCACATGGAAATAGGCAAAAGATATTGTACAACATTGCACCCAGTGTCAAGTCTTACAC
  102. CTGCCCACTCAAGAGGCAGGAGTTAATCCCAGAGGTCTGTGTCCTAATGCATTATGGCAA
  103. ATGGATGTCACGCATGTACCTTCATTTGGAAAATTATCACATGTTCATGTAACCGTTGAT
  104. ACTTATTCACATTTCATATGGGCAACTTGCCAAACAGGAGAAAGTACTTCCCATGTTAAA
  105. AAACATTTATTGTCTTGTTTTGCTGTAATGGGAGTTCCAGAAAAAATCAAAACTGACAAT
  106. GGACCAAGATATTGTAGTAAAGCTTTCCAAAAATTCTTAAGTCAGTGGAAAATTTCACAT
  107. ACAACAGGAATTCCTTATAATTCCCAAGGACAGGCCATAGTTGAAAGAACTAATAGAACA
  108. CTCAAAACTCAATTAGTTAAACAAAAAGAAGGGGGAGACAGTAAGGAGTGTACCACTCCT
  109. CAGATGCAACTTAATCTAGCACTCTATACTTTAAATTTTTAAAACATTTATAGAAATCAG
  110. ACTACTACTTCTGTAGAACAACATCTTTCTGGTAAAAAGAACAGCCCACATGAAGGAAAA
  111. CTAATTTGGTGGAAAGATAATAAAAATAAGACATGGGAAATAGGGAAGGTGATAACGTGG
  112. GGGAGAGGTTTTGTTTGTGTTTCACCAGGAGAAAATCAGCTTCCTGTTTGGATACCCACT
  113. AAACATTTGAAGTTCTACAATGAACCCATCGGAGATGCAAAGAAAAGGGCCTCCACAGAG
  114. ATGGTAACACCAGTCACATGGATGGATAATCCTATAGAAGTATATGTTAATGATAGTATG
  115. CGTACCTGGCCCCACAGATGATCGCTGCCCTGCCAAACCTGAGGAAGAAGGGATGATGAT
  116. AAATATTTCCATTGGGTATTGTTATCCTCCTATTTGCCTAGGGAGAGCACCAGGATGTTT
  117. AATGCCTGCAGTCCAAAATTGGTTGGTAGAAGTACCTACTGTCAGTCCCATCAGTAGATT
  118. CACTTATCACATGGTAAGCGGGATGTCACTCAGGCCACGGGTAAATTATTTACAAGACTT
  119. TTCTTATCAAAGATCATTAAAATTTAGACCTAAAGGGAAACCTTGCCCCAAGGAAATTCC
  120. CAAAGAATCAAAAAATACAGAAGTTTTAGTTTGGGAAGAATGTGTGGCCAATAGTGTGGT
  121. GATATTACGAAACAATGAATTCGGAACTATTATAGATTGGGCACCTCGAGGTCAATTCTA
  122. CCACAATTGCTCAGGACAAACTCAGTCGTGTCCAAGTGCACAAGTGAGTCCAGCTGTTGA
  123. TAGCGACTTAACAGAAAGTCTAGACAAACGTAAGCATAAAAAATTGCAGTCTTTCTACCC
  124. TTGAGAATGGGGAGAAAAAGGAATCCCTACCCCAAGACCAAAAATAATAAGTCCTGTTTC
  125. TGGTCCTGAACATCCAGAATTATGGAGGCTTACTGTGGCCTCACACCACATTAGAATTTG
  126. GTCTGGAAATCAAACTTTATAAACAAGAGATCGTAAGCCATTTTATACTATCCACCTATA
  127. TTCCAATCCAACGGTTCCTTTACAAAGTTGCATAAAGCCCCCTTATATGCTAGTTGTAGG
  128. AAATATAGTTATTAAACCAGACTCTCAAACTATAACCTGTGAAAATTGTAGATTGTTTAC
  129. TTGCATTGATTCAACTTTTAATTGGCACCACCGTATTCTGCTGGTGAGAGCAAGAGAGGG
  130. CGTGTGGATCCCTGTGTCCATGGACCGACCATGGGAGGCCTTGCCATCCGTCCATATTTT
  131. GACTGAAGTATTAAAAGGTGTTTTAAATAGATCCAAAAGATTCGTTTTTACTTTAATTGC
  132. AGTGATTATGGGATTAATTGCAGTCACAGCTACGGCTGCTGTAGCAGGAGCTGCATTGCA
  133. CTCTTCTGTTCAGTCGGTAAACTTTTTTAATGATTGGCAAAAAAATTCTACAAGATTGTG
  134. GAATTCACAATCTAGTATTGATCAAAAATTGGCAAATCAAATTAATGATCTTAGACAAAC
  135. TGTCATTTGGATGGGAGACAGACTCATGAGCTTAGAACATCGTTTCCAGTTACAGTGTGA
  136. CTGGAATACGTCAGATTTTTGTATTACACCCCAAATTTATGAGTCTGAGCATCACTGGGA
  137. CATGGTTAGACGCCATCTACAGGGAAGAGAAGATAATCTCACTTTAGACATTTCCAAATT
  138. AAAAGAATAAATTTTCGAAGCATCAAAAGCCCATTTAAATTTGGTGCCAGCAACTGAGGC
  139. AACTGCAGGAGTTGCTGATGGCCTCGCAAATCTTAACCCTGTCAATTGGGTTAAGACCAT
  140. CGGAAGTACTACAATTATAAATCTCATATTAATCCTTGTGTGCCTGTTTTGTCTGTTGTT
  141. AGTCTGCAGGTGTACCCAACAGCTCTGAAGAGACAGCGACCATCGAGAACGGGCCATGAT
  142. GACGATTGCGGTTTTGTCGAAAAGGGGGAAAT
  143.  
  144. GTGGGGAAAAGCAAGAGAGATCAGATTGTCACTGTGTCTGCGTAAAAAGAAGTAGACATG <- 5' LTR
  145. GGAGACTCCATTTTGTTATGTACTAAGAAAAATTCTTCTGCCTTGAGATTCTGTGACCTT
  146. ACCCCCAACCCCGTGCTCTCTGAAACATGTGCTGTGTCAACTCAGAGTTAAATGGATTAA
  147. GGGCGGTGCAAGATGTGCTTTGTTAAACAGATGCTTGAAGGCAGCATGCTCCTTAAGTCA
  148. TCACCACTCCCTAATCTCAAGTACCCAGGGACACAAAAACTGCGGAAGGCCGCAGGGACC
  149. TCTGCCTAGGAAAGCCAGATATTGTCCAAGCTTTCTCCCCATGTGATAGTCTGAAATACG
  150. GCCTCGTGGGAAGGGAAAGACCTGACCATCCCCCAGCCCGACACCCGTAAAGGGTCTGTG
  151. CTGAAGAGGATTAGTATAAGAGGAAGGCATACCTCTTGCAGTTGAGACAAGAGGAAGGCA
  152. TCGGTCTCCTGCCCGTCCCTGGGCAATGGAATGTCTCGGTATAAAACCCGATTGTACGTT
  153. CCATCTACTGAGATAGGGAAAAACCGCCTTAGGGCTGGAGGTGGGACATGCAGGCAGCAA
  154. TACTGCTTTGTAAAGCATTGAGATGTTTATGTGTATGCATATCTAAAAGCACAGCACTTA
  155. ATCCTTTACCTTGTCTATGATGCAAAGACCTTTGTTCACGTGTTTGTCTGCTGACCCTCT
  156. CCCCACAATTGTCTTGTGACCCTGACACATCCCCCTCTCGGAGAAACACCCACGAATGAT
  157. CAATAAATACTAAGGGAACTCAGAGGCTGGCGGGATCCTCCATATGCTAAACGCTGGTCC
  158. CCCGGGTCCCCTTATTTCTTTCTCTATACTTTGTCTCTGTGTCTCTTTCTTTCCTAAGTC
  159. TCTCGTTCCACCTTACGAGAAACACCCACAGGTGTGGAGGGGCAACCCACCCCTTCAA
  160.  
  161. ATTAT <- TSR (part of the original sequence that was duplicated upon insertion)
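
The annotations in the listing (TSR, 3' LTR, 5' LTR) can be checked mechanically. The sketch below is only an illustration, not part of the original paste: the names `parse_listing`, `count_mismatches` and `EXCERPT` are made up here, and the excerpt holds just the two TSR lines and the first 60 bases of each LTR. It splits the listing into labelled blocks, confirms that the two flanking TSRs are identical, and counts position-by-position differences between the two LTR openings. The full LTRs appear to differ slightly in length, so a comparison of the whole LTRs would need a real alignment rather than this simple count.

```python
import re

# One listing line: optional "N. " line number, a run of bases,
# and an optional "<- note" annotation.
LINE_RE = re.compile(
    r"^\s*(?:\d+\.\s*)?([ACGTN]*)\s*(?:<-\s*(.*?))?\s*$", re.IGNORECASE
)


def parse_listing(text):
    """Split an annotated listing into (label, sequence) blocks.

    Blocks are separated by blank lines. The label is the first "<-" note
    in the block, with any parenthetical explanation dropped.
    """
    blocks, label, parts = [], None, []
    for raw in text.splitlines():
        m = LINE_RE.match(raw)
        if m is None:
            continue  # not a sequence line (for example a title)
        bases, note = m.group(1).upper(), m.group(2)
        if not bases:
            # Blank separator: close the current block, if any.
            if parts:
                blocks.append((label, "".join(parts)))
            label, parts = None, []
            continue
        if note and label is None:
            label = note.split("(")[0].strip()
        parts.append(bases)
    if parts:
        blocks.append((label, "".join(parts)))
    return blocks


def count_mismatches(a, b):
    """Count differing positions between two equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences differ in length; align them first")
    return sum(x != y for x, y in zip(a, b))


# Excerpt only: the TSR lines and the first line of each LTR from the listing.
EXCERPT = """\
ATTAT <- TSR (part of the original sequence that was duplicated upon insertion)

GTGGGGAAAAGCAAGAGAGATCAGATTGTCACTGTGTCTGTGTAGAAAGAAGTAGACATG <- 3' LTR

GTGGGGAAAAGCAAGAGAGATCAGATTGTCACTGTGTCTGCGTAAAAAGAAGTAGACATG <- 5' LTR

ATTAT <- TSR (part of the original sequence that was duplicated upon insertion)
"""

blocks = parse_listing(EXCERPT)
tsrs = [seq for label, seq in blocks if label == "TSR"]
ltrs = {label: seq for label, seq in blocks if label and label.endswith("LTR")}

tsr_match = len(tsrs) == 2 and tsrs[0] == tsrs[1]
ltr_diffs = count_mismatches(ltrs["3' LTR"], ltrs["5' LTR"])

print("flanking TSRs identical:", tsr_match)
print("mismatches in first 60 LTR bases:", ltr_diffs)
```

The parser accepts lines with or without the leading line numbers, so the same function should work on the listing as shown above or on the raw paste.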