BLASTX nr result
ID: Stemona21_contig00017519
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Stemona21_contig00017519 (773 letters)

Database: ./nr
37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo]   189   9e-46
emb|CAN69982.1| hypothetical protein VITISV_027150 [Vitis vinifera]  189   1e-45
emb|CBI17376.3| unnamed protein product [Vitis vinifera]             186   8e-45
emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera]  186   1e-44
ref|XP_004488407.1| PREDICTED: uncharacterized protein LOC101502...  183   6e-44
gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma ...  176   8e-42
gb|EOY19088.1| Uncharacterized protein TCM_043787 [Theobroma cacao]  176   1e-41
gb|EMJ15392.1| hypothetical protein PRUPE_ppa021406mg [Prunus pe...  176   1e-41
gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobrom...  175   2e-41
gb|EXC31837.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notab...  174   4e-41
gb|EOX94203.1| DNA/RNA polymerases superfamily protein [Theobrom...  174   4e-41
gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobrom...  172   1e-40
prf||1510387A retrotransposon del1-46                                170   4e-40
ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208...  169   1e-39
gb|ADN34141.1| ty3-gypsy retrotransposon protein [Cucumis melo s...  169   1e-39
ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203...  165   1e-38
emb|CAN69016.1| hypothetical protein VITISV_016361 [Vitis vinifera]  164   2e-38
ref|XP_006347259.1| PREDICTED: uncharacterized protein LOC102584...  164   4e-38
gb|EXB73268.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notab...
163 5e-38 gb|EOY19083.1| DNA/RNA polymerases superfamily protein [Theobrom... 161 2e-37 >gb|ADN33767.1| gag protease polyprotein [Cucumis melo subsp. melo] Length = 871 Score = 189 bits (480), Expect = 9e-46 Identities = 106/249 (42%), Positives = 150/249 (60%), Gaps = 8/249 (3%) Frame = -3 Query: 723 QGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLDLHCDVLHP 544 QGRVFA + ++ G P+ +ALVLFD+G+SHSF+S V L + LH Sbjct: 609 QGRVFATNRTEAEKAGTVVTGTLPVLGHYALVLFDSGSSHSFISSAFVSHARLEVEPLHH 668 Query: 543 PLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLSSHSAILAC 373 L+V +P G+ M + +AC +++A H V L+V+ M+ +D MDWL+++ A + C Sbjct: 669 VLSVSTPSGECMLSKEKVKACQIEIAGHVIEVTLIVLDMLDFDVILGMDWLAANHASIDC 728 Query: 372 HSSRVTLKLPCGTAFSFRAVRNMPVP----GWKSKHSEDQYFGSLLASVVET-EQESSLD 208 VT P +F F+ + +P ++ Q +LASVV+T E + SL Sbjct: 729 SRKEVTFNPPSMASFKFKGGGSKSLPQVISAIRASKLLSQGTWGILASVVDTREADVSLS 788 Query: 207 CLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRELRKQLDDL 28 PV+R+YPD+FP+ELPGLPP REVEF+I+L G PIS P+RMAPAEL+EL+ QL +L Sbjct: 789 SEPVVRDYPDVFPEELPGLPPHREVEFAIELEPGTVPISRAPYRMAPAELKELKVQLQEL 848 Query: 27 LTKGLIRPS 1 L KG IRPS Sbjct: 849 LDKGFIRPS 857 >emb|CAN69982.1| hypothetical protein VITISV_027150 [Vitis vinifera] Length = 1495 Score = 189 bits (479), Expect = 1e-45 Identities = 108/264 (40%), Positives = 158/264 (59%), Gaps = 8/264 (3%) Frame = -3 Query: 768 RGRGSVRPSLPEPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSD 589 RGRG RP+ GRVFA+ D ++EG+ ++S+W VLFDTGA+HSF+S Sbjct: 409 RGRG--RPAA------GRVFALTPTEPXEDALLVEGMILVYSTWVRVLFDTGATHSFISA 460 Query: 588 QVVRSLDLHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX- 412 +L L + + L + SP+G +D++C+ CV+ LAD VDL ++ M GYD Sbjct: 461 SCANALGLKSERVENLLLIESPMGTNSRVDRICKGCVITLADRALNVDLRILDMTGYDVI 520 Query: 411 --MDWLSSHSAILACHSSRVTLKLPCGTAFSFRAVRNMPVPGWKSKHSEDQYF--GSL-- 250 MDWL+ + A++ CH R+ LP G F + + +P +S GS+ Sbjct: 521 LGMDWLAVYRAVIDCHRRRIIFCLPEGFEVCFVGGKCVSLPFSQSDPCYQYVLRKGSINF 580 Query: 249 
LASVVETEQ-ESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRM 73 LA + E+ + + +PV+R++ D+FPDELPGLPP RE +FSI++ G PIS+ P+RM Sbjct: 581 LACLRGKEKAQKDITEIPVVRKFQDVFPDELPGLPPHREFDFSIEVYPGTDPISVSPYRM 640 Query: 72 APAELRELRKQLDDLLTKGLIRPS 1 AP EL+EL+ QLD+LL +G IRPS Sbjct: 641 APLELKELKTQLDELLGRGFIRPS 664 >emb|CBI17376.3| unnamed protein product [Vitis vinifera] Length = 1567 Score = 186 bits (472), Expect = 8e-45 Identities = 105/264 (39%), Positives = 155/264 (58%), Gaps = 8/264 (3%) Frame = -3 Query: 768 RGRGSVRPSLPEPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSD 589 RGRG GRVFA+ D ++EG+ ++S+W VLFDTGA+HSF+S Sbjct: 393 RGRGR--------QAAGRVFALTPTEPEEDALLVEGMILVYSTWVRVLFDTGATHSFISA 444 Query: 588 QVVRSLDLHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX- 412 +L L + + L + SP+G +D++C+ CV+ LAD VDL ++ M GYD Sbjct: 445 SCANALGLKSERVENLLLIESPMGTNSRVDRICKGCVITLADRALNVDLRILDMTGYDVI 504 Query: 411 --MDWLSSHSAILACHSSRVTLKLPCGTAFSFRAVRNMPVPGWKSKHSEDQYF--GSL-- 250 MDWL+ + A++ CH R+ LP G F + + +P +S GS+ Sbjct: 505 LGMDWLAVYRAVIDCHRRRIIFCLPEGFEVCFVGGKCVSLPFSQSDPCYQYVLRKGSINF 564 Query: 249 LASVVETEQ-ESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRM 73 LA + E+ + + +P++R++ D+FPDELPGLPP RE +FSI++ G PIS+ P+RM Sbjct: 565 LACLRGKEKAQKDITEIPMVRKFQDVFPDELPGLPPHREFDFSIEVYPGTDPISVSPYRM 624 Query: 72 APAELRELRKQLDDLLTKGLIRPS 1 AP EL+EL+ QLD+LL +G IRPS Sbjct: 625 APLELKELKTQLDELLGRGFIRPS 648 >emb|CAN66189.1| hypothetical protein VITISV_006047 [Vitis vinifera] Length = 1573 Score = 186 bits (471), Expect = 1e-44 Identities = 103/259 (39%), Positives = 151/259 (58%), Gaps = 3/259 (1%) Frame = -3 Query: 768 RGRGSVRPSLPEPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSD 589 RGRG RP+ GRVFA+ D ++EG+ ++S+W VLFDTGA+HSF+S Sbjct: 494 RGRG--RPAA------GRVFALTPTEPDKDALLVEGMILVYSTWVRVLFDTGATHSFISA 545 Query: 588 QVVRSLDLHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX- 412 +L L + + L + SP+G +D++C+ CV+ LAD VDL ++ M GYD Sbjct: 546 SCANALGLKSERVENLLLIESPMGTNSRVDRICKGCVITLADRALNVDLRILDMTGYDVI 605 Query: 411 
--MDWLSSHSAILACHSSRVTLKLPCGTAFSFRAVRNMPVPGWKSKHSEDQYFGSLLASV 238 MDWL+ + A++ CH R+ LP G + P + + + L Sbjct: 606 LGMDWLAVYRAVIDCHRRRIIFCLPEG-------FESDPCYRYVLRKGSINFLACLRG-- 656 Query: 237 VETEQESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAEL 58 + + + + +PV+R++ D+FPDELPGLPP RE +FSI++ G PIS+ P+RMAP EL Sbjct: 657 -KEKAQKDITEIPVVRKFQDVFPDELPGLPPHREFDFSIEVYPGTDPISVSPYRMAPLEL 715 Query: 57 RELRKQLDDLLTKGLIRPS 1 +EL+ QLD+LL KG IRPS Sbjct: 716 KELKTQLDELLGKGFIRPS 734 >ref|XP_004488407.1| PREDICTED: uncharacterized protein LOC101502180 [Cicer arietinum] Length = 1235 Score = 183 bits (464), Expect = 6e-44 Identities = 111/265 (41%), Positives = 151/265 (56%), Gaps = 8/265 (3%) Frame = -3 Query: 771 GRGRGSVRPSLPEPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVS 592 GRG G R +P Q +VFA+ + A++ GI + S A VLFD GA+HSFVS Sbjct: 528 GRGFGG-RGQIPAERGQAQVFALTRQDAQTCNAVVTGILSICSRDAHVLFDLGATHSFVS 586 Query: 591 DQVVRSLDLHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX 412 L L PL V +P+G + V R C + + F VDLVV+ +I +D Sbjct: 587 SWFATRLGKCSSSLEEPLVVATPVGGNLLAKSVYRCCDITIDGKVFSVDLVVIDLIDFDV 646 Query: 411 ---MDWLSSHSAILACHSSRVTLKLPCGTAFSFRAVR----NMPVPGWKSKHSEDQYFGS 253 MDWL+ H A L CH V ++P + FSF+ R + + + + + Sbjct: 647 ILGMDWLAFHHATLDCHDKVVKFEIPGQSVFSFQGERCWVPHNQILALAASKLMRRGCQA 706 Query: 252 LLASVVETE-QESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHR 76 +A V +T+ E L+ +P+ E+PD+FP+ELPGLPP RE+EFSIDL PISIPP+R Sbjct: 707 YIALVRDTQVAEEKLEKIPIACEFPDVFPEELPGLPPDREIEFSIDLVPNTHPISIPPYR 766 Query: 75 MAPAELRELRKQLDDLLTKGLIRPS 1 MAPA+L+ELR+QL DLL KG IRPS Sbjct: 767 MAPAKLKELREQLQDLLDKGFIRPS 791 >gb|EOY20371.1| Gag protease polyprotein-like protein [Theobroma cacao] Length = 665 Score = 176 bits (446), Expect = 8e-42 Identities = 97/257 (37%), Positives = 149/257 (57%), Gaps = 8/257 (3%) Frame = -3 Query: 747 PSLPEPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLD 568 PS P+ T RVFAV + + G LF A VL D+G+ S+VS D Sbjct: 369 PSRPQTRTSTRVFAVTEDEAQVRPGAVTGTISLFDKDAYVLIDSGSDRSYVSTTFASIAD 428 Query: 567 
LHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLS 397 + L + + +PLG+++ + R C V++ + EF DL+ ++++ +D MDWL+ Sbjct: 429 RNLSPLEEEIVIHTPLGEKLVRNSCYRDCGVRVGEEEFRGDLIPLEILDFDLILGMDWLT 488 Query: 396 SHSAILACHSSRVTLKLPCGTAFSF----RAVRNMPVPGWKSKHSEDQYFGSLLASVVET 229 +H A + C V L+ G F R + + + K+ + + + LA V++T Sbjct: 489 AHRANVDCFRKEVVLRNSKGAEIVFVGKRRVLPSCVISAIKASKLVQKGYSTYLAYVIDT 548 Query: 228 -EQESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRE 52 ++E L+ +P++ E+PD+FPD+LPGLPP RE+EF IDL G +PISIPP+RMAPAEL+E Sbjct: 549 SKREPKLEDVPIVSEFPDVFPDDLPGLPPDRELEFPIDLLSGTAPISIPPYRMAPAELKE 608 Query: 51 LRKQLDDLLTKGLIRPS 1 L+ QL +L+ KG IRPS Sbjct: 609 LKVQLQELVDKGFIRPS 625 >gb|EOY19088.1| Uncharacterized protein TCM_043787 [Theobroma cacao] Length = 649 Score = 176 bits (445), Expect = 1e-41 Identities = 98/257 (38%), Positives = 149/257 (57%), Gaps = 8/257 (3%) Frame = -3 Query: 747 PSLPEPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLD 568 PS P+ T RVFAV + + GI LF A VL D+G+ S+VS D Sbjct: 126 PSRPQTRTSTRVFAVTEDEAQVRPRAVTGIMSLFDKDAYVLIDSGSDRSYVSTTFASIAD 185 Query: 567 LHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLS 397 + L + + +PLG+++ + R C V++ + EF DL+ ++++ +D MDWL+ Sbjct: 186 RNLSPLEEEIVIHTPLGEKLVRNSCYRDCGVRVGEEEFRGDLIPLEILDFDLILGMDWLT 245 Query: 396 SHSAILACHSSRVTLKLPCGTAFSF----RAVRNMPVPGWKSKHSEDQYFGSLLASVVET 229 +H A + C V L+ G F R + + + K+ + + + LA V++T Sbjct: 246 AHRANVDCFRKEVVLRNSEGAEIVFVGKRRVLPSCVISAIKASKLVQKGYPTYLAYVIDT 305 Query: 228 EQ-ESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRE 52 + E L+ +P++ E+PD+FPD+LPGLPP RE+EF IDL G +PISIPP+RMAPAEL+E Sbjct: 306 SKGEPKLEDVPIVSEFPDVFPDDLPGLPPDRELEFPIDLLPGTAPISIPPYRMAPAELKE 365 Query: 51 LRKQLDDLLTKGLIRPS 1 L+ QL +L+ KG IRPS Sbjct: 366 LKVQLQELVDKGFIRPS 382 >gb|EMJ15392.1| hypothetical protein PRUPE_ppa021406mg [Prunus persica] Length = 881 Score = 176 bits (445), Expect = 1e-41 Identities = 93/227 (40%), Positives = 138/227 (60%), Gaps = 3/227 (1%) Frame = -3 Query: 672 
MIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLDLHCDVLHPPLTVLSPLGQRMTLDQV 493 ++EG F +F+ WA +LFD GA++SF+S L L + + L V SPLG+ + ++++ Sbjct: 198 VVEGTFLIFNLWARILFDPGATNSFISVSFASILGLEYEFMKSSLMVGSPLGKCVEVNKL 257 Query: 492 CRACVVKLADHEFVVDLVVMKMIGYDX---MDWLSSHSAILACHSSRVTLKLPCGTAFSF 322 CR+C ++++DH+ VVDL+++K + YD MD L+ AIL C VT+ L G FSF Sbjct: 258 CRSCTIEISDHKLVVDLMILKFMVYDVILGMDMLTQFKAILDCGKKMVTMTLSKGKTFSF 317 Query: 321 RAVRNMPVPGWKSKHSEDQYFGSLLASVVETEQESSLDCLPVIREYPDLFPDELPGLPPV 142 RN P + Y + + + L+ +P+++ + D+F +EL GLPP Sbjct: 318 YGNRNTCKPKSILDNRNCNYLRLIAHMANKDLGDPKLELIPIVKNFNDVFLEELSGLPPK 377 Query: 141 REVEFSIDLPEGVSPISIPPHRMAPAELRELRKQLDDLLTKGLIRPS 1 EVEFSI + G SPISIPPHRMAP EL+EL+ QL +L KG IRP+ Sbjct: 378 GEVEFSIVIYPGTSPISIPPHRMAPTELKELKTQLQELERKGFIRPT 424 >gb|EOY00215.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1537 Score = 175 bits (443), Expect = 2e-41 Identities = 99/257 (38%), Positives = 148/257 (57%), Gaps = 8/257 (3%) Frame = -3 Query: 747 PSLPEPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLD 568 PS P+ T RVFAV + + G LF A VL D+G+ S+VS D Sbjct: 366 PSRPQTCTATRVFAVTEDEARVRPGAVTGTMSLFDKDAYVLIDSGSDRSYVSTTFASITD 425 Query: 567 LHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLS 397 + L + V +PLG+++ + R C V++ + EF DL+ ++++ +D MDWL+ Sbjct: 426 RNLSPLEEEIVVHTPLGEQLIRNTCYRDCGVRVGEEEFRGDLIPLEILDFDLILGMDWLT 485 Query: 396 SHSAILACHSSRVTLKLPCGTAFSF----RAVRNMPVPGWKSKHSEDQYFGSLLASVVET 229 +H A L C V L+ G F R + + + K+ + + + LA V++T Sbjct: 486 THRANLDCFRKEVVLRNSEGAEIVFVGERRVLPSCVISAIKASKLVQKGYPTYLAYVIDT 545 Query: 228 EQ-ESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRE 52 + E L+ +P++ E+PD+FPD+LPG+PP RE+EF IDL G +PISIPP+RMAPAEL+E Sbjct: 546 SKGEPKLEDVPIVSEFPDVFPDDLPGIPPNRELEFPIDLLPGTAPISIPPYRMAPAELKE 605 Query: 51 LRKQLDDLLTKGLIRPS 1 L+ QL DL+ KG IRPS Sbjct: 606 LKAQLQDLVDKGFIRPS 622 >gb|EXC31837.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notabilis] Length = 1088 Score = 174 bits (440), Expect = 4e-41 Identities = 105/269 (39%), 
Positives = 142/269 (52%), Gaps = 13/269 (4%) Frame = -3 Query: 768 RGRGSVRPSLPEPSTQGRVFAVGQPV-----EYPDRAMIEGIFPLFSSWALVLFDTGASH 604 RGRG + G+ +AV + D ++++G + SWA VLFDTGA+H Sbjct: 205 RGRGQKNKG----KSHGQAYAVTSTATPGRGQQADHSVVDGTILVSHSWAQVLFDTGATH 260 Query: 603 SFVSDQVVRSLDLHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMI 424 SF+S L L D PPLT+ +P+G + + C + L DH DL V+ M Sbjct: 261 SFISMLFASVLQLSVDTHDPPLTLSTPMGGIAEVSMIRSPCCIVLGDHRLSADLFVLPMA 320 Query: 423 GYDX---MDWLSSHSAILACHSSRVTLKLPCGTAFSFRAVRNMPVPGWKSK-----HSED 268 G+D MDWLS + A + C+ RVTL G ++A P K Sbjct: 321 GFDVILGMDWLSKYHATVDCYRRRVTLLTKNGQVIDYQAKTGAVTPSPVLKACIGGRKNL 380 Query: 267 QYFGSLLASVVETEQESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISI 88 + G + A E+E S +P++ ++ D+FP ELPGLPP RE+EF IDL G SPISI Sbjct: 381 ESLGMVFALGGESEANDS-SYVPIVDDFQDVFPSELPGLPPDREIEFCIDLVPGTSPISI 439 Query: 87 PPHRMAPAELRELRKQLDDLLTKGLIRPS 1 P+RMAPA+ ELRKQL L+ KG IRPS Sbjct: 440 APYRMAPAKNVELRKQLQKLMEKGFIRPS 468 >gb|EOX94203.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1336 Score = 174 bits (440), Expect = 4e-41 Identities = 97/257 (37%), Positives = 148/257 (57%), Gaps = 8/257 (3%) Frame = -3 Query: 747 PSLPEPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLD 568 PS P+ T RVFAV + + + G LF A VL D+G+ S+VS Sbjct: 316 PSRPQTRTSTRVFAVTEDEAWVRPGAVTGTMSLFDKDAYVLIDSGSDRSYVSTTFASIAA 375 Query: 567 LHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLS 397 + L + + +PLG+++ + R C V++ + EF DL+ +K++ +D MDWL+ Sbjct: 376 RNLSPLEEEIVIHTPLGEKLVRNSCYRDCGVRVGEEEFRGDLIPLKILDFDLILGMDWLT 435 Query: 396 SHSAILACHSSRVTLKLPCGTAFSF----RAVRNMPVPGWKSKHSEDQYFGSLLASVVET 229 +H A + C V L+ G F R + + + K+ + + + LA V++T Sbjct: 436 THRANVDCFRKEVVLRNSEGAEIVFVGKHRVLPSCVISAIKASKLVQKGYPTYLAYVIDT 495 Query: 228 EQ-ESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRE 52 + E L+ +P++ E+PD+FPD+LPGLPP RE+EF IDL G +PISIPP+RMAPAEL+E Sbjct: 496 SKGEPKLEDVPIVSEFPDVFPDDLPGLPPDRELEFPIDLLPGTAPISIPPYRMAPAELKE 555 Query: 51 LRKQLDDLLTKGLIRPS 1 L+ QL +L+ KG 
IRPS Sbjct: 556 LKVQLQELVDKGFIRPS 572 >gb|EOY00082.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1515 Score = 172 bits (436), Expect = 1e-40 Identities = 95/257 (36%), Positives = 148/257 (57%), Gaps = 8/257 (3%) Frame = -3 Query: 747 PSLPEPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLD 568 PS P+ T RVFAV + + G LF A VL D+G+ S+VS V +D Sbjct: 353 PSRPQTRTSTRVFAVTEDEAQVRPGAVTGTMSLFDKDAYVLIDSGSDRSYVSTTFVSIVD 412 Query: 567 LHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLS 397 + L + + +PLG+++ + R C V++ + EF DL+ ++++ +D MDWL+ Sbjct: 413 RNLSPLEEEIVIHTPLGEKLVRNSCYRDCGVRVGEEEFRGDLIPLEILDFDLILGMDWLT 472 Query: 396 SHSAILACHSSRVTLKLPCGTAFSF----RAVRNMPVPGWKSKHSEDQYFGSLLASVVET 229 +H A + C + L+ G F R + + + K+ + + + LA V++T Sbjct: 473 AHRANVDCFRKEIVLRNSEGAEIVFVGKRRVLPSCVISAIKASKLVQKGYSTYLAYVIDT 532 Query: 228 EQ-ESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRE 52 + E L+ + ++ E+PD+FPD+LPGLPP RE+EF IDL G +PISIPP+RMAP EL+E Sbjct: 533 SKGEPKLEDVSIVSEFPDVFPDDLPGLPPDRELEFPIDLLPGTAPISIPPYRMAPTELKE 592 Query: 51 LRKQLDDLLTKGLIRPS 1 L+ QL +L+ KG IRPS Sbjct: 593 LKVQLQELVDKGFIRPS 609 >prf||1510387A retrotransposon del1-46 Length = 1443 Score = 170 bits (431), Expect = 4e-40 Identities = 91/227 (40%), Positives = 137/227 (60%), Gaps = 3/227 (1%) Frame = -3 Query: 672 MIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLDLHCDVLHPPLTVLSPLGQRMTLDQV 493 MI I +FSS VL DTG++HSF++ ++++ L++ L L+V+SP+G ++QV Sbjct: 334 MIRSILSIFSSLCHVLIDTGSTHSFITPRIIKMLEIPVQPLGYILSVISPIGTSTFVNQV 393 Query: 492 CRACVVKLADHEFVVDLVVMKMIGYDX---MDWLSSHSAILACHSSRVTLKLPCGTAFSF 322 C+ C++ + + E VDL+++ + D MDWL+++ +L C S +VT LP F F Sbjct: 394 CKGCMITIGNQELTVDLIILDLEDPDILLGMDWLAAYHVVLDCFSKKVTFHLPGIPEFHF 453 Query: 321 RAVRNMPVPGWKSKHSEDQYFGSLLASVVETEQESSLDCLPVIREYPDLFPDELPGLPPV 142 + Y SL + + T S D ++REY ++FPD+LPGLPP Sbjct: 454 HGETQHTFFPTFTHQPNLSYLASLASEINITP---STDLSLIVREYINVFPDDLPGLPPP 510 Query: 141 REVEFSIDLPEGVSPISIPPHRMAPAELRELRKQLDDLLTKGLIRPS 1 RE+EF I+L G SPISI P+ MAP+EL+EL++QL+DLL KG 
IR S Sbjct: 511 REIEFQINLLPGTSPISITPYHMAPSELQELKEQLEDLLNKGFIRGS 557 >ref|XP_004153883.1| PREDICTED: uncharacterized protein LOC101208523, partial [Cucumis sativus] Length = 804 Score = 169 bits (428), Expect = 1e-39 Identities = 98/252 (38%), Positives = 147/252 (58%), Gaps = 8/252 (3%) Frame = -3 Query: 732 PSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLDLHCDV 553 P +G +FA + ++ G P+ +AL LFD+G+SHSF+S V L Sbjct: 394 PPQRGTIFATSRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVKP 453 Query: 552 LHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLSSHSAI 382 L L+V +P G+ M + +AC +++A V L+V+ + +D MD L+++ A Sbjct: 454 LDYVLSVSTPSGEIMLSKEKIKACKIEIAGRVLDVTLLVLDIRDFDVILGMDLLATNHAS 513 Query: 381 LACHSSRVTLKLPCGTAFSFRAVRNMPVP----GWKSKHSEDQYFGSLLASVVET-EQES 217 + C V P ++F F+ V + +P K+ Q S+LASVV+T E E+ Sbjct: 514 IDCSRKEVVFSPPTESSFKFKGVGTVVLPKVISAMKASKLLSQGTWSILASVVDTREDET 573 Query: 216 SLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRELRKQL 37 SL PV+REYPD+FP++LPGLPP RE++F+I+L +PIS P+RMAPAEL+EL+ QL Sbjct: 574 SLTSEPVVREYPDVFPEDLPGLPPHREIDFAIELEPDTTPISRAPYRMAPAELKELKVQL 633 Query: 36 DDLLTKGLIRPS 1 +LL KG I+PS Sbjct: 634 QELLDKGFIQPS 645 >gb|ADN34141.1| ty3-gypsy retrotransposon protein [Cucumis melo subsp. 
melo] Length = 1359 Score = 169 bits (427), Expect = 1e-39 Identities = 94/236 (39%), Positives = 141/236 (59%), Gaps = 8/236 (3%) Frame = -3 Query: 723 QGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLDLHCDVLHP 544 QG+VFA + ++ G P+ +ALVLFD+G SHSF+S V L + LH Sbjct: 264 QGKVFATNKTEAERASTVVTGTLPVLGHYALVLFDSGFSHSFISSAFVLHARLEVEPLHH 323 Query: 543 PLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLSSHSAILAC 373 L+V +P G+ M + +AC +++A H V L+V+ M+ +D MDWL+++ A + C Sbjct: 324 VLSVSTPFGECMLSKEKVKACQIEIAGHVIEVTLLVLDMLDFDVILGMDWLAANHASIDC 383 Query: 372 HSSRVTLKLPCGTAFSFR--AVRNMP--VPGWKSKHSEDQYFGSLLASVVET-EQESSLD 208 + P F F+ R++P + ++ Q S+LASVV+T E + SL Sbjct: 384 SRKEIAFNPPSMANFKFKEEGSRSLPKVISAMRASKLLSQGIWSILASVVDTREVDVSLS 443 Query: 207 CLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRELRKQ 40 P++R+YPD+FP+ELPGLPP RE+EF+I+L G PIS P+RMAPAEL+EL+K+ Sbjct: 444 SKPMVRDYPDVFPEELPGLPPHREIEFAIELELGTVPISRAPYRMAPAELKELKKK 499 >ref|XP_004154396.1| PREDICTED: uncharacterized protein LOC101203289 [Cucumis sativus] Length = 655 Score = 165 bits (418), Expect = 1e-38 Identities = 94/244 (38%), Positives = 143/244 (58%), Gaps = 8/244 (3%) Frame = -3 Query: 732 PSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLDLHCDV 553 P +G +FA + ++ G P+ +AL LFD+G+SHSF+S V L + Sbjct: 395 PPQRGTIFATNRSEAEKAGTVVTGTLPVLGHFALTLFDSGSSHSFISSLFVTHACLEVEP 454 Query: 552 LHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLSSHSAI 382 L L+V +P G+ M + +AC +++A V L+V+ M +D MDWL+++ A Sbjct: 455 LDYVLSVSTPSGEIMLSKEKIKACEIEIAGRVLDVTLLVLDMRDFDVILGMDWLATNHAS 514 Query: 381 LACHSSRVTLKLPCGTAFSFRAVRNMPVP----GWKSKHSEDQYFGSLLASVVET-EQES 217 + C V P ++F F+ V + +P K+ +Q S+LASVV+T E E+ Sbjct: 515 IDCSRKEVVFSPPTASSFKFKGVGTVVLPKVISAMKASKLLNQGTWSILASVVDTREGET 574 Query: 216 SLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRELRKQL 37 SL PV+REYPD+FP++LPGLPP RE++F+I+L +PIS P+RMAP EL+EL+ QL Sbjct: 575 SLTSEPVVREYPDVFPEDLPGLPPHREIDFAIELEPDTTPISRAPYRMAPVELKELKIQL 634 Query: 36 DDLL 25 +LL Sbjct: 635 QELL 638 
>emb|CAN69016.1| hypothetical protein VITISV_016361 [Vitis vinifera] Length = 1043 Score = 164 bits (416), Expect = 2e-38 Identities = 95/260 (36%), Positives = 147/260 (56%), Gaps = 4/260 (1%) Frame = -3 Query: 768 RGRGSVRPSLP----EPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHS 601 +G G+ R P + + QGR +A+G A++EG+ FS+WA VLFD GA+HS Sbjct: 11 QGEGNFRQGKPGGKSKQAQQGRFYAIGSQ-NAESNALVEGMLLCFSTWAHVLFDPGATHS 69 Query: 600 FVSDQVVRSLDLHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIG 421 F+S LD+ LH L V +P+G ++ VC ACV+ + E +DLV++ + Sbjct: 70 FISASFASMLDIEFVPLHCSLCVETPMGGKVETKWVCHACVLYIGGLEVTMDLVLLDISS 129 Query: 420 YDXMDWLSSHSAILACHSSRVTLKLPCGTAFSFRAVRNMPVPGWKSKHSEDQYFGSLLAS 241 +D + + +S + ++ + R +P +RN+ W K + +L Sbjct: 130 FDKVTFQTSSGSYMSFYGDRRLTFIPL--------IRNLD-DKWSRKDGRHYFLFNLKG- 179 Query: 240 VVETEQESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAE 61 E ++ +++DC+P++ E+ D+FP ELP LPP RE++FSI+L G PISI P+RMA E Sbjct: 180 --EGKKMTTIDCIPMVCEFADVFPKELPCLPPHREMDFSIELYPGTDPISIAPYRMAXVE 237 Query: 60 LRELRKQLDDLLTKGLIRPS 1 L+EL QL +L TKG IRPS Sbjct: 238 LKELNIQLQELQTKGFIRPS 257 >ref|XP_006347259.1| PREDICTED: uncharacterized protein LOC102584611 [Solanum tuberosum] Length = 1107 Score = 164 bits (414), Expect = 4e-38 Identities = 96/232 (41%), Positives = 143/232 (61%), Gaps = 8/232 (3%) Frame = -3 Query: 672 MIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLDLHCDVLHPPLTVLSPLGQRMTLDQV 493 +I G L A VLFD G++ S+VS V L + + L P+ V +P+G+ + +DQ+ Sbjct: 45 VITGTLLLCHQPATVLFDPGSTFSYVSFYFVPRLGMRSESLAEPVHVSTPVGESLVVDQI 104 Query: 492 CRACVVKLADHEFVVDLVVMKMIGYDX---MDWLSSHSAILACHSSRVTLKLPCGTAFSF 322 R+C+V + + VDL+++ M+ +D MDWLS + A+L ++ VTL +P + + Sbjct: 105 LRSCLVTIQCCDTRVDLILLDMVDFDVILGMDWLSPYHAVLDFYAKTVTLAMPGISPVLW 164 Query: 321 RAVRNMPVPGWKSKHSEDQYFGS----LLASVVETEQE-SSLDCLPVIREYPDLFPDELP 157 ++ + G S + S LA V + +E SS+D +PV+RE+ D+FP +LP Sbjct: 165 QSAYSHTPTGIISFMRARRLVASGCLAYLAYVRDVSREGSSVDSVPVVREFADVFPTDLP 224 Query: 156 GLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRELRKQLDDLLTKGLIRPS 1 GLPP R+++FSI+L G 
PISIPP+RMAPAELREL QL+DLL KG IRPS Sbjct: 225 GLPPERDIDFSIELEPGTRPISIPPYRMAPAELRELSVQLEDLLGKGFIRPS 276 >gb|EXB73268.1| Transposon Ty3-I Gag-Pol polyprotein [Morus notabilis] Length = 605 Score = 163 bits (413), Expect = 5e-38 Identities = 99/237 (41%), Positives = 142/237 (59%), Gaps = 10/237 (4%) Frame = -3 Query: 687 YPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLDLHCDVLHPPLTVLSPLGQRM 508 + + ++EG+ P+ S+A VLFDTGA++SFV V+ L L D L + + SPLG R+ Sbjct: 227 HAENPVVEGMIPISHSFARVLFDTGATNSFVYTTFVKILGLKPDDLETSMFISSPLG-RV 285 Query: 507 TLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLSSHSAILACHSSRVTLKLPCG 337 + VCR+CV+ + + DL+++ M +D MDWL + AI+ CH RVTL Sbjct: 286 EVTSVCRSCVITIESEKLKADLIILPMNQFDVVLGMDWLLRYGAIVDCHRMRVTLTTGSD 345 Query: 336 TAFSFRAVRNMPVPGWKSKHS----EDQYFGSLLASVVETEQ---ESSLDCLPVIREYPD 178 T +++ N +HS ++ S L S +E E E +++ +PV+ EY D Sbjct: 346 TTITYQGGVNPVTEEQLLRHSVGGRQNLACFSFL-SALEGESGIVEENVE-VPVVDEYAD 403 Query: 177 LFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRELRKQLDDLLTKGLIR 7 +FPDELPGLPP RE+EF IDL +PISI P+RMA AE++ELRKQL +L KG IR Sbjct: 404 VFPDELPGLPPDREIEFCIDLLPETAPISIAPYRMASAEMKELRKQLGELAEKGFIR 460 >gb|EOY19083.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 906 Score = 161 bits (408), Expect = 2e-37 Identities = 94/257 (36%), Positives = 142/257 (55%), Gaps = 8/257 (3%) Frame = -3 Query: 747 PSLPEPSTQGRVFAVGQPVEYPDRAMIEGIFPLFSSWALVLFDTGASHSFVSDQVVRSLD 568 PS P+ T RVFA+ + + G LF A VL D+G+ S+VS D Sbjct: 399 PSRPQTRTATRVFAMTEDEAQVRPGAVTGTMSLFDKDAYVLIDSGSDRSYVSTTFASITD 458 Query: 567 LHCDVLHPPLTVLSPLGQRMTLDQVCRACVVKLADHEFVVDLVVMKMIGYDX---MDWLS 397 + L + V +PLG+++ + R C V++ + EF DL+ ++++ +D MDWL+ Sbjct: 459 RNLSPLEEEIVVHTPLGEQLIRNTCYRDCGVRVGEEEFRGDLIPLEILDFDLILGMDWLT 518 Query: 396 SHSAILACHSSRVTLKLPCGTAFSF----RAVRNMPVPGWKSKHSEDQYFGSLLASVVET 229 +H A + C V L+ G F R + + + K + + + LA V++T Sbjct: 519 AHRANVDCFRKEVVLRNSEGAEIVFVGERRVLPSYVISAIKVSKLVQKGYPTYLAYVIDT 578 Query: 228 EQ-ESSLDCLPVIREYPDLFPDELPGLPPVREVEFSIDLPEGVSPISIPPHRMAPAELRE 52 + E L+ +P++ E+ D+FPD LP +PP 
RE+EF IDL PISIPP+RMAPAEL+E Sbjct: 579 SKGEPKLEDVPIVSEFSDVFPDNLPRIPPNRELEFPIDLLPSTVPISIPPYRMAPAELKE 638 Query: 51 LRKQLDDLLTKGLIRPS 1 L+ QL DL+ KG IRPS Sbjct: 639 LKAQLQDLVDKGFIRPS 655