Query: CNT0036050
Subject: sp|P04146|COPIA_DROME Copia protein OS=Drosophila melanogaster GN=GIP

transposon TNT 1-94 OS=Nicotiana tabacum PE=2 SV=1

 Score =  331 bits (848), Expect = 1e-91,   Method: Compositional matrix adjust.
Identities = 194/566 (34%), Positives = 282/566 (49%), Gaps = 56/566 (9%)

Query: 1121 EPSSYAEASSGPHAADWRKAMEEEMESQRDNKTWELAAPPPGVRLLANRWVYKLKPQPGG 1180
EP S E S P KAM+EEMES + N T++L P G R L +WV+KLK + G
Sbjct: 810 EPESLKEVLSHPEKNQLMKAMQEEMESLQKNGTYKLVELPKGKRPLKCKWVFKLK-KDGD 868

Query: 1181 AP--RFKARLVVKGFAQREGIDYSEVFAPTSRYVSXXXXXXXXXXXGLSLHQMDVKTAFL 1238
R+KARLVVKGF Q++GID+ E+F+P + S L + Q+DVKTAFL
Sbjct: 869 CKLVRYKARLVVKGFEQKKGIDFDEIFSPVVKMTSIRTILSLAASLDLEVEQLDVKTAFL 928

Query: 1239 NGDLDEELWMQQPQGFEVXXXXXXXXXXXXXXXXXXXXXXXXHRSVPLACRLLKSVYGLK 1298
+GDL+EE++M+QP+GFEV + C+L KS+YGLK
Sbjct: 929 HGDLEEEIYMEQPEGFEVAGKKH------------------------MVCKLNKSLYGLK 964

Query: 1299 QAPRCWYRKLSEELGGLGFTPATADPAL-FVRHDEAGPVYVLVHVDDLLIAAGCSAQLAA 1357
QAPR WY K + + +DP + F R E + +L++VDD+LI +A
Sbjct: 965 QAPRQWYMKFDSFMKSQTYLKTYSDPCVYFKRFSENNFIILLLYVDDMLIVGKDKGLIAK 1024

Query: 1358 VKAAIGKCFEVRDLGEASTYLGMEIKRDPSTGDILLQQRRYVNELLQRHGMTDAKPRSLP 1417
+K + K F+++DLG A LGM+I R+ ++ + L Q +Y+ +L+R M +AKP S P
Sbjct: 1025 LKGDLSKSFDMKDLGPAQQILGMKIVRERTSRKLWLSQEKYIERVLERFNMKNAKPVSTP 1084

Query: 1418 LPAGTRVLAASEQQPVLDDGG-----PYRSLIGGLNYVAVSTRPDIAYALSVLARHMAAP 1472
L AG L+ +++ G PY S +G L Y V TRPDIA+A+ V++R + P
Sbjct: 1085 L-AGHLKLSKKMCPTTVEEKGNMAKVPYSSAVGSLMYAMVCTRPDIAHAVGVVSRFLENP 1143

Query: 1473 TKAHLALATGVLRYLKHTVDMGLRFXXXXXXXXXXXXXXXXXXXXXXYDAGAFVGYCDAD 1532
K H +LRYL+ T L F GY DAD
Sbjct: 1144 GKEHWEAVKWILRYLRGTTGDCLCFGGSDP---------------------ILKGYTDAD 1182

Query: 1533 WAGDPNTRRSQTAFLFALGKTVVSWCSQQQRTVXXXXXXXXXXXXXXXXXXXLWLRKLAS 1592
AGD + R+S T +LF +SW S+ Q+ V +WL++
Sbjct: 1183 MAGDIDNRKSSTGYLFTFSGGAISWQSKLQKCVALSTTEAEYIAATETGKEMIWLKRFLQ 1242

Query: 1593 DLGLRSGAVVIRCDSQGALSLARNPIASSPLSKHIDIQHHLXXXXXXXXXXXXXYCPTEQ 1652
+LGL V+ CDSQ A+ L++N + + +KHID+++H T +
Sbjct: 1243 ELGLHQKEYVVYCDSQSAIDLSKNSMYHAR-TKHIDVRYHWIREMVDDESLKVLKISTNE 1301

Query: 1653 MIADALTKALPEAKFFFCRAAMGVST 1678
AD LTK +P KF C+ +G+ +
Sbjct: 1302 NPADMLTKVVPRNKFELCKELVGMHS 1327



Score = 177 bits (448), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 108/306 (35%), Positives = 152/306 (49%), Gaps = 13/306 (4%)

Query: 709 HRRLGHVGWHSLMQMVNGSLVTGLDVDLSALSQAAESVCSTCVEAKAASSPFPDSSSEPQ 768
H+R+GH+ L + SL+ S C C+ K F SS
Sbjct: 426 HKRMGHMSEKGLQILAKKSLI-------SYAKGTTVKPCDYCLFGKQHRVSFQTSSERKL 478

Query: 769 QPLALAHSDVCGPMPIMGRGGSRYFITLLDDATGVSAVRMLTTREHAGEALQEMIVQLEN 828
L L +SDVCGPM I GG++YF+T +DDA+ V +L T++ + Q+ +E
Sbjct: 479 NILDLVYSDVCGPMEIESMGGNKYFVTFIDDASRKLWVYILKTKDQVFQVFQKFHALVE- 537

Query: 829 CHPGGGKLRNLRSDNGGEYRSEELQQWLRERGTVQQFSAPYMPQQNGAAERLNRTLMDRT 888
G KL+ LRSDNGGEY S E +++ G + + P PQ NG AER+NRT++++
Sbjct: 538 -RETGRKLKRLRSDNGGEYTSREFEEYCSSHGIRHEKTVPGTPQHNGVAERMNRTIVEKV 596

Query: 889 RAMLFDAALPSSFWPEAVTYASHLRNLX-XXXXXXXTPWEALTGVKPDISSLRTFGCRVY 947
R+ML A LP SFW EAV A +L N P T + S L+ FGCR +
Sbjct: 597 RSMLRMAKLPKSFWGEAVQTACYLINRSPSVPLAFEIPERVWTNKEVSYSHLKVFGCRAF 656

Query: 948 VTLPADQRSKLSPRADVGTYLGLQRNSAAYRVM--VGGKVVVSRDVRFDE-DVRGPASRL 1004
+P +QR+KL ++ ++G YR+ V KV+ SRDV F E +VR A
Sbjct: 657 AHVPKEQRTKLDDKSIPCIFIGYGDEEFGYRLWDPVKKKVIRSRDVVFRESEVRTAADMS 716

Query: 1005 AGVPFG 1010
V G
Sbjct: 717 EKVKNG 722

Query: CNT0036050
Subject: sp|P25600|YCH4_YEAST Putative transposon Ty5-1 protein YCL074W

PE=1 SV=3

 Score =  300 bits (768), Expect = 2e-81,   Method: Compositional matrix adjust.
Identities = 179/601 (29%), Positives = 282/601 (46%), Gaps = 61/601 (10%)

Query: 1085 QLFDEDSDDDETPPLAPPSDDEDSYG--VSTATTAGVGEPSSYAEASSGPHAADWRKAME 1142
++ + S+ +T P +++++S V A T P+S+ E + W +A+
Sbjct: 852 EIINRRSERLKTKPQISYNEEDNSLNKVVLNAHTIFNDVPNSFDEIQYRDDKSSWEEAIN 911

Query: 1143 EEMESQRDNKTWELAAPPPGVRLLANRWVYKLKPQPGGAP-RFKARLVVKGFAQREGIDY 1201
E+ + + N TW + P ++ +RWV+ +K G P R+KARLV +GF Q+ IDY
Sbjct: 912 TELNAHKINNTWTITKRPENKNIVDSRWVFSVKYNELGNPIRYKARLVARGFTQKYQIDY 971

Query: 1202 SEVFAPTSRYVSXXXXXXXXXXXGLSLHQMDVKTAFLNGDLDEELWMQQPQGFEVXXXXX 1261
E FAP +R S L +HQMDVKTAFLNG L EE++M+ PQG
Sbjct: 972 EETFAPVARISSFRFILSLVIQYNLKVHQMDVKTAFLNGTLKEEIYMRLPQGISCNSDN- 1030

Query: 1262 XXXXXXXXXXXXXXXXXXXHRSVPLACRLLKSVYGLKQAPRCWYRKLSEELGGLGFTPAT 1321
C+L K++YGLKQA RCW+ + L F ++
Sbjct: 1031 -------------------------VCKLNKAIYGLKQAARCWFEVFEQALKECEFVNSS 1065

Query: 1322 ADPALFV--RHDEAGPVYVLVHVDDLLIAAGCSAQLAAVKAAIGKCFEVRDLGEASTYLG 1379
D +++ + + +YVL++VDD++IA G ++ K + + F + DL E ++G
Sbjct: 1066 VDRCIYILDKGNINENIYVLLYVDDVVIATGDMTRMNNFKRYLMEKFRMTDLNEIKHFIG 1125

Query: 1380 MEIKRDPSTGDILLQQRRYVNELLQRHGMTDAKPRSLPLPA--GTRVLAASEQQPVLDDG 1437
+ I + I L Q YV ++L + M + S PLP+ +L + E D
Sbjct: 1126 IRI--EMQEDKIYLSQSAYVKKILSKFNMENCNAVSTPLPSKINYELLNSDE-----DCN 1178

Query: 1438 GPYRSLIGGLNYVAVSTRPDIAYALSVLARHMAAPTKAHLALATGVLRYLKHTVDMGLRF 1497
P RSLIG L Y+ + TRPD+ A+++L+R+ + VLRYLK T+DM L F
Sbjct: 1179 TPCRSLIGCLMYIMLCTRPDLTTAVNILSRYSSKNNSELWQNLKRVLRYLKGTIDMKLIF 1238

Query: 1498 XXXXXXXXXXXXXXXXXXXXXXYDAGAFVGYCDADWAGDPNTRRSQTAFLFALGK-TVVS 1556
+GY D+DWAG R+S T +LF + ++
Sbjct: 1239 KKNLAF------------------ENKIIGYVDSDWAGSEIDRKSTTGYLFKMFDFNLIC 1280

Query: 1557 WCSQQQRTVXXXXXXXXXXXXXXXXXXXLWLRKLASDLGLR-SGAVVIRCDSQGALSLAR 1615
W +++Q +V LWL+ L + + ++ + I D+QG +S+A
Sbjct: 1281 WNTKRQNSVAASSTEAEYMALFEAVREALWLKFLLTSINIKLENPIKIYEDNQGCISIAN 1340

Query: 1616 NPIASSPLSKHIDIQHHLXXXXXXXXXXXXXYCPTEQMIADALTKALPEAKFFFCRAAMG 1675
NP + +KHIDI++H Y PTE +AD TK LP A+F R +G
Sbjct: 1341 NP-SCHKRAKHIDIKYHFAREQVQNNVICLEYIPTENQLADIFTKPLPAARFVELRDKLG 1399

Query: 1676 V 1676
+
Sbjct: 1400 L 1400



Score = 139 bits (351), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 87/292 (29%), Positives = 139/292 (47%), Gaps = 10/292 (3%)

Query: 709 HRRLGHVGWHSLMQMVNGSLVTGLDVDLSALSQAAESVCSTCVEAKAASSPFP--DSSSE 766
H R GH+ L+++ ++ + + L+ L + E +C C+ K A PF +
Sbjct: 419 HERFGHISDGKLLEIKRKNMFSDQSL-LNNLELSCE-ICEPCLNGKQARLPFKQLKDKTH 476

Query: 767 PQQPLALAHSDVCGPMPIMGRGGSRYFITLLDDATGVSAVRMLTTREHAGEALQEMIVQL 826
++PL + HSDVCGP+ + YF+ +D T ++ + Q+ + +
Sbjct: 477 IKRPLFVVHSDVCGPITPVTLDDKNYFVIFVDQFTHYCVTYLIKYKSDVFSMFQDFVAKS 536

Query: 827 ENCHPGGGKLRNLRSDNGGEYRSEELQQWLRERGTVQQFSAPYMPQQNGAAERLNRTLMD 886
E H K+ L DNG EY S E++Q+ ++G + P+ PQ NG +ER+ RT+ +
Sbjct: 537 E-AH-FNLKVVYLYIDNGREYLSNEMRQFCVKKGISYHLTVPHTPQLNGVSERMIRTITE 594

Query: 887 RTRAMLFDAALPSSFWPEAVTYASHLRNLX---XXXXXXXTPWEALTGVKPDISSLRTFG 943
+ R M+ A L SFW EAV A++L N TP+E KP + LR FG
Sbjct: 595 KARTMVSGAKLDKSFWGEAVLTATYLINRIPSRALVDSSKTPYEMWHNKKPYLKHLRVFG 654

Query: 944 CRVYVTLPADQRSKLSPRADVGTYLGLQRNSAAYRVMVGGKVVVSRDVRFDE 995
VYV + Q K ++ ++G + N V K +V+RDV DE
Sbjct: 655 ATVYVHIKNKQ-GKFDDKSFKSIFVGYEPNGFKLWDAVNEKFIVARDVVVDE 705

Query: CNT0036050
Subject: sp|P92519|M810_ARATH Uncharacterized mitochondrial protein AtMg00810

OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=TY5A PE=5 SV=2

 Score =  145 bits (366), Expect = 5e-36,   Method: Compositional matrix adjust.
Identities = 98/335 (29%), Positives = 147/335 (43%), Gaps = 47/335 (14%)

Query: 1231 MDVKTAFLNGDLDEELWMQQPQGFEVXXXXXXXXXXXXXXXXXXXXXXXXHRSVPLACRL 1290
MDV TAFLN +DE ++++QP GF R+ L
Sbjct: 1 MDVDTAFLNSTMDEPIYVKQPPGF------------------------VNERNPDYVWEL 36

Query: 1291 LKSVYGLKQAPRCWYRKLSEELGGLGFTPATADPALFVRHDEAGPVYVLVHVDDLLIAAG 1350
+YGLKQAP W ++ L +GF + L+ R GP+Y+ V+VDDLL+AA
Sbjct: 37 YGGMYGLKQAPLLWNEHINNTLKKIGFCRHEGEHGLYFRSTSDGPIYIAVYVDDLLVAAP 96

Query: 1351 CSAQLAAVKAAIGKCFEVRDLGEASTYLGMEIKRDPSTGDILLQQRRYVNELLQRHGMTD 1410
VK + K + ++DLG+ +LG+ I + S GDI L + Y+ + +
Sbjct: 97 SPKIYDRVKQELTKLYSMKDLGKVDKFLGLNIHQS-SNGDITLSLQDYIAKAASESEINT 155

Query: 1411 AKPRSLPLPAGTRVLAASEQQPVLDDGGPYRSLIGGLNYVAVSTRPDIAYALSVLARHMA 1470
K PL + + P L D PY+S++G L + A + RPDI+Y +S+L+R +
Sbjct: 156 FKLTQTPLCNSKPLFETT--SPHLKDITPYQSIVGQLLFCANTGRPDISYPVSLLSRFLR 213

Query: 1471 APTKAHLALATGVLRYLKHTVDMGLRFXXXXXXXXXXXXXXXXXXXXXXYDAGAFVGYCD 1530
P HL A VLRYL T M L++ A YCD
Sbjct: 214 EPRAIHLESARRVLRYLYTTRSMCLKYRSGSQL--------------------ALTVYCD 253

Query: 1531 ADWAGDPNTRRSQTAFLFALGKTVVSWCSQQQRTV 1565
A + S ++ L V+W S++ + V
Sbjct: 254 ASHGAIHDLPHSTGGYVTLLAGAPVTWSSKKLKGV 288

Query: CNT0036050
Subject: sp|P0C2J3|YL21B_YEAST Transposon Ty2-LR1 Gag-Pol polyprotein

OS=Arabidopsis thaliana GN=AtMg00810 PE=4 SV=1

 Score =  131 bits (330), Expect = 6e-32,   Method: Composition-based stats.
Identities = 76/230 (33%), Positives = 118/230 (51%), Gaps = 26/230 (11%)

Query: 1336 VYVLVHVDDLLIAAGCSAQLAAVKAAIGKCFEVRDLGEASTYLGMEIKRDPSTGDILLQQ 1395
+Y+L++VDD+L+ + L + + F ++DLG +LG++IK PS + L Q
Sbjct: 1 MYLLLYVDDILLTGSSNTLLNMLIFQLSSTFSMKDLGPVHYFLGIQIKTHPS--GLFLSQ 58

Query: 1396 RRYVNELLQRHGMTDAKPRSLPLPAGTRVLAASEQQPVLDDGGPYRSLIGGLNYVAVSTR 1455
+Y ++L GM D KP S PLP ++ + P D +RS++G L Y+ + TR
Sbjct: 59 TKYAEQILNNAGMLDCKPMSTPLPLKLNSSVSTAKYP---DPSDFRSIVGALQYLTL-TR 114

Query: 1456 PDIAYALSVLARHMAAPTKAHLALATGVLRYLKHTVDMGLRFXXXXXXXXXXXXXXXXXX 1515
PDI+YA++++ + M PT A L VLRY+K T+ GL
Sbjct: 115 PDISYAVNIVCQRMHEPTLADFDLLKRVLRYVKGTIFHGLYIHKNSKLNVQ--------- 165

Query: 1516 XXXXYDAGAFVGYCDADWAGDPNTRRSQTAFLFALGKTVVSWCSQQQRTV 1565
+CD+DWAG +TRRS T F LG ++SW +++Q TV
Sbjct: 166 -----------AFCDSDWAGCTSTRRSTTGFCTFLGCNIISWSAKRQPTV 204

Query: CNT0036050
Subject: sp|P25384|YC21B_YEAST Transposon Ty2-C Gag-Pol polyprotein

OS=Saccharomyces cerevisiae (strain ATCC 204508 / S288c)
GN=TY2B-LR1 PE=3 SV=1

 Score = 95.5 bits (236), Expect = 4e-18,   Method: Compositional matrix adjust.
Identities = 74/282 (26%), Positives = 126/282 (44%), Gaps = 13/282 (4%)

Query: 708 LHRRLGHVGWHSLMQMVNGSLVTGLDVDLSALSQAAESVCSTCVEAKAASSPFPDSS--- 764
+HR LGH + S+ + + + VT L S A+ C C+ K+ S
Sbjct: 594 IHRMLGHANFRSIQKSLKKNAVTYLKESDIEWSNASTYQCPDCLIGKSTKHRHVKGSRLK 653

Query: 765 -SEPQQPLALAHSDVCGPMPIMGRGGSRYFITLLDDATGVSAVRMLTTR--EHAGEALQE 821
E +P H+D+ GP+ + + YFI+ D+ T V L R E
Sbjct: 654 YQESYEPFQYLHTDIFGPVHHLPKSAPSYFISFTDEKTRFQWVYPLHDRREESILNVFTS 713

Query: 822 MIVQLENCHPGGGKLRNLRSDNGGEYRSEELQQWLRERGTVQQFSAPYMPQQNGAAERLN 881
++ ++N ++ ++ D G EY ++ L ++ RG ++ + +G AERLN
Sbjct: 714 ILAFIKNQF--NARVLVIQMDRGSEYTNKTLHKFFTNRGITACYTTTADSRAHGVAERLN 771

Query: 882 RTLMDRTRAMLFDAALPSSFWPEAVTYASHLRNLXXXXXXXXTPWE--ALTGVKPDISSL 939
RTL++ R +L + LP+ W AV +++ +RN + + L G+ DI+++
Sbjct: 772 RTLLNDCRTLLHCSGLPNHLWFSAVEFSTIIRNSLVSPKNDKSARQHAGLAGL--DITTI 829

Query: 940 RTFGCRVYVTLPADQRSKLSPRADVGTYLGLQRNSAAYRVMV 981
FG V V + SK+ PR G L RNS Y + +
Sbjct: 830 LPFGQPVIVN-NHNPDSKIHPRGIPGYALHPSRNSYGYIIYL 870