BLASTX nr result
ID: Forsythia23_contig00004312
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00004312
         (1190 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_007213082.1| hypothetical protein PRUPE_ppa021229mg [Prun...   427   e-162
ref|XP_007044250.1| DNA/RNA polymerases superfamily protein [The...   434   e-160
ref|XP_007050046.1| DNA/RNA polymerases superfamily protein [The...   419   e-157
ref|XP_007200265.1| hypothetical protein PRUPE_ppa015000mg [Prun...   417   e-156
ref|XP_009787832.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   404   e-153
ref|XP_007022574.1| DNA/RNA polymerases superfamily protein [The...   398   e-151
gb|AAO45752.1| pol protein [Cucumis melo subsp. melo]                 409   e-151
ref|XP_010026793.1| PREDICTED: uncharacterized protein LOC104417...   416   e-150
ref|XP_007224141.1| hypothetical protein PRUPE_ppa016115mg [Prun...   392   e-149
gb|AEV42258.1| hypothetical protein [Beta vulgaris]                   410   e-149
ref|XP_007010875.1| DNA/RNA polymerases superfamily protein, put...   392   e-149
ref|XP_012487798.1| PREDICTED: uncharacterized protein LOC105800...   407   e-149
ref|XP_008798775.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   406   e-148
gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum ...   420   e-148
ref|XP_007032220.1| Retrotransposon protein, putative [Theobroma...   392   e-148
ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605...   410   e-148
gb|AAT38724.1| Putative retrotransposon protein, identical [Sola...   420   e-148
emb|CAN77801.1| hypothetical protein VITISV_031477 [Vitis vinifera]   397   e-146
ref|XP_007032152.1| Retrotransposon protein, putative [Theobroma... 
388 e-146 emb|CAA73042.1| polyprotein [Ananas comosus] 396 e-146 >ref|XP_007213082.1| hypothetical protein PRUPE_ppa021229mg [Prunus persica] gi|462408947|gb|EMJ14281.1| hypothetical protein PRUPE_ppa021229mg [Prunus persica] Length = 1194 Score = 427 bits (1099), Expect(2) = e-162 Identities = 204/263 (77%), Positives = 235/263 (89%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 +T+RN+YPLPRIDDLFDQL+GA VFSKIDLRSGYHQL++R+ D+PKTAFR+RYGHYEFLV Sbjct: 327 ITVRNRYPLPRIDDLFDQLKGAKVFSKIDLRSGYHQLRVREEDMPKTAFRTRYGHYEFLV 386 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPAAFMDLMNRV YLDRFVIVFIDDILVYS+S++ H +HL ++L+TLR +Q Sbjct: 387 MPFGLTNAPAAFMDLMNRVFRRYLDRFVIVFIDDILVYSKSQKAHMKHLNLVLRTLRRRQ 446 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKC+FWLD+V FLGHVISAEGIYVDP K EAV+ W RPT+VTE+RSFLGLAGYYR Sbjct: 447 LYAKFSKCQFWLDRVSFLGHVISAEGIYVDPQKIEAVVNWLRPTSVTEIRSFLGLAGYYR 506 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFVEGFS IA PLT LTRK KFVWS++C++SF ELK RL +AP+L LP FVIYSD Sbjct: 507 RFVEGFSTIAAPLTYLTRKGVKFVWSDKCEESFIELKTRLTTAPVLALPDDSGNFVIYSD 566 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS+QGLGCVLMQ+G+VIAYASRQ Sbjct: 567 ASQQGLGCVLMQHGRVIAYASRQ 589 Score = 172 bits (436), Expect(2) = e-162 Identities = 85/133 (63%), Positives = 94/133 (70%), Gaps = 1/133 (0%) Frame = -2 Query: 403 KHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLEL 224 KHELNYP HDLELAAVV ALKIWRHYLYGE CQIFTDHKSLKY+F QK+LNLRQRRWLEL Sbjct: 592 KHELNYPVHDLELAAVVFALKIWRHYLYGETCQIFTDHKSLKYLFTQKELNLRQRRWLEL 651 Query: 223 IKDYDCRIDYHPGKANVVAD-XXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVL 47 IKDYDC I++HPG+ANVVAD L E RKL L GA+L Sbjct: 652 IKDYDCTIEHHPGRANVVADALSRKSSGSIAYLRGRYLPLMVEMRKLRIGLDVDNQGALL 711 Query: 46 AHFQVRPTLIDRV 8 A VRP L++R+ Sbjct: 712 ATLHVRPVLVERI 724 >ref|XP_007044250.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508708185|gb|EOY00082.1| 
DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1515 Score = 434 bits (1116), Expect(2) = e-160 Identities = 202/263 (76%), Positives = 236/263 (89%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 +TI+NKYPLPRIDDLFDQL+GATVFSK+DLRSGYHQL+I++ DVPKTAFR+RYGHYEFLV Sbjct: 639 MTIKNKYPLPRIDDLFDQLQGATVFSKVDLRSGYHQLRIKEQDVPKTAFRTRYGHYEFLV 698 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPAAFMDLMNRV HPYLD+FVIVFIDDILVYSR +EHA HL+I+LQTLR++Q Sbjct: 699 MPFGLTNAPAAFMDLMNRVFHPYLDKFVIVFIDDILVYSRDNDEHAAHLRIVLQTLRERQ 758 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL +VVFLGH++S GIYVDP K EA+++WE+P VTE+RSFLGLAGYYR Sbjct: 759 LYAKFSKCEFWLQEVVFLGHIVSRTGIYVDPKKVEAILQWEQPKTVTEIRSFLGLAGYYR 818 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFV+GFS +A PLT LTRK KFVW + C+ F+ELKNRL SAP+LTLP G+ F++YSD Sbjct: 819 RFVQGFSLVAAPLTRLTRKGVKFVWDDVCENRFQELKNRLTSAPVLTLPVNGKGFIVYSD 878 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS+ GLGCVLMQ+ KV+AYASRQ Sbjct: 879 ASKLGLGCVLMQDEKVVAYASRQ 901 Score = 159 bits (403), Expect(2) = e-160 Identities = 80/135 (59%), Positives = 95/135 (70%), Gaps = 1/135 (0%) Frame = -2 Query: 403 KHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLEL 224 +HE NYPTHDLELAAVV ALKIWRHYLYGE C+IFTDHKSLKY+ QK+LNLRQRRWLEL Sbjct: 904 RHEANYPTHDLELAAVVFALKIWRHYLYGEHCRIFTDHKSLKYLLTQKELNLRQRRWLEL 963 Query: 223 IKDYDCRIDYHPGKANVVAD-XXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVL 47 IKDYD IDYH GKANVVAD E + L QL G++L Sbjct: 964 IKDYDLVIDYHLGKANVVADALSRKSSSSLAALQSCYFPALIEMKSLGVQLRNGEDGSLL 1023 Query: 46 AHFQVRPTLIDRVRE 2 A+F VRP+L++++++ Sbjct: 1024 ANFIVRPSLLNQIKD 1038 >ref|XP_007050046.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508702307|gb|EOX94203.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1336 Score = 419 bits (1076), Expect(2) = e-157 Identities = 197/263 (74%), Positives = 231/263 
(87%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 +TI+NKYPLPRIDD+FDQL+GATVFSK++LRSGYHQL+I++ DV KT FR+RYGHYEFLV Sbjct: 602 MTIKNKYPLPRIDDIFDQLQGATVFSKVNLRSGYHQLRIKEQDVLKTEFRTRYGHYEFLV 661 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPA FMDLM+RV HPYLD+FVIVFIDDILVY R +EHA HL+I+LQTLR++Q Sbjct: 662 MPFGLTNAPATFMDLMSRVFHPYLDKFVIVFIDDILVYLRDNDEHAAHLRIVLQTLRERQ 721 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL +VVFLGHV+S GIYVDP K EA+++WE+P VTE+RSFLGLAGYYR Sbjct: 722 LYAKFSKCEFWLQEVVFLGHVVSRTGIYVDPKKVEAILQWEQPKTVTEIRSFLGLAGYYR 781 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFV+GFS IA PLT LTRK KFVW + C+ F+ELKNRL AP+LTLP G+ FV+YSD Sbjct: 782 RFVQGFSLIAAPLTRLTRKGVKFVWDDVCENRFQELKNRLTFAPVLTLPVNGKGFVVYSD 841 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS+ GLGCVLMQ+ KV+AYASRQ Sbjct: 842 ASKLGLGCVLMQDEKVVAYASRQ 864 Score = 165 bits (417), Expect(2) = e-157 Identities = 81/135 (60%), Positives = 96/135 (71%), Gaps = 1/135 (0%) Frame = -2 Query: 403 KHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLEL 224 +HE NYPTHDLELAAVV ALKIWRHYLYGE CQIFTDHKSLKY+ QK++NLRQRRWLEL Sbjct: 867 RHEANYPTHDLELAAVVFALKIWRHYLYGEHCQIFTDHKSLKYLLTQKEINLRQRRWLEL 926 Query: 223 IKDYDCRIDYHPGKANVVAD-XXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVL 47 IKDYD IDYHPGKANVVAD E + L QL G++L Sbjct: 927 IKDYDLVIDYHPGKANVVADALSRKSSSSLAALQNCYFPALIEMKSLRVQLRNGEDGSLL 986 Query: 46 AHFQVRPTLIDRVRE 2 A+F VRP+L++++++ Sbjct: 987 ANFIVRPSLLNQIKD 1001 >ref|XP_007200265.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica] gi|462395665|gb|EMJ01464.1| hypothetical protein PRUPE_ppa015000mg [Prunus persica] Length = 1493 Score = 417 bits (1071), Expect(2) = e-156 Identities = 201/263 (76%), Positives = 229/263 (87%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VTIRN+YPLPRIDDLFDQL+GA FSKIDLRSGYHQL+IR+ D+P TA R+RYGHYEFLV Sbjct: 625 
VTIRNRYPLPRIDDLFDQLKGAKYFSKIDLRSGYHQLRIREEDIPNTALRTRYGHYEFLV 684 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPAAFMDLMNRV PYLD FVIVFIDDILVYS++ E H +HL+++L+TLR KQ Sbjct: 685 MPFGLTNAPAAFMDLMNRVFRPYLDHFVIVFIDDILVYSQTLEGHKKHLRVVLRTLRRKQ 744 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKC+FWLD VVFLGHVISAEGIYVDP K EA++ W + T+VTE+RSFLGLAGYYR Sbjct: 745 LYAKFSKCQFWLDIVVFLGHVISAEGIYVDPQKVEAIVNWVQSTSVTEIRSFLGLAGYYR 804 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFVEGFS IA PLT LTRK+ F W+EEC++SF+ELK RL +AP+L LP FVIYSD Sbjct: 805 RFVEGFSSIAAPLTRLTRKDIAFEWTEECEQSFQELKKRLTTAPVLALPDNAGNFVIYSD 864 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS QGLGCVLMQ+ +VIAYASRQ Sbjct: 865 ASLQGLGCVLMQHDRVIAYASRQ 887 Score = 166 bits (419), Expect(2) = e-156 Identities = 81/133 (60%), Positives = 92/133 (69%), Gaps = 1/133 (0%) Frame = -2 Query: 403 KHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLEL 224 KHE NYP HDLELAAVV ALKIWRHYLYGE CQIFTDHKSLKY F Q++LN+RQRRWLEL Sbjct: 890 KHEQNYPVHDLELAAVVFALKIWRHYLYGETCQIFTDHKSLKYFFTQRELNMRQRRWLEL 949 Query: 223 IKDYDCRIDYHPGKANVVAD-XXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVL 47 IKDYDC I+Y+PG+ANVVAD L E RK +L T G +L Sbjct: 950 IKDYDCTIEYYPGRANVVADALSRKTTGSLTHLRTTYLPLLVELRKDGVELEMTQQGGIL 1009 Query: 46 AHFQVRPTLIDRV 8 A VRP L++R+ Sbjct: 1010 ASLHVRPILVERI 1022 >ref|XP_009787832.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC104235718, partial [Nicotiana sylvestris] Length = 1156 Score = 404 bits (1039), Expect(2) = e-153 Identities = 185/263 (70%), Positives = 234/263 (88%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VTI+NKYPLPRIDDLFDQL+GA++FSKIDLRSGY+QL++R+ DVPKTAFR+RYGHYEFLV Sbjct: 675 VTIKNKYPLPRIDDLFDQLKGASLFSKIDLRSGYYQLRVREQDVPKTAFRTRYGHYEFLV 734 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPAAFMDLMNRV PYLD+FV+VFIDDILVYS++RE+H +H++I++Q L+++Q Sbjct: 
735 MPFGLTNAPAAFMDLMNRVFKPYLDQFVVVFIDDILVYSKNREDHDKHIRIVMQILKERQ 794 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAK SKCEFWL++V FLGH++S+EG+ VDP K +A++ W+ P TE+RSFLGLAGYYR Sbjct: 795 LYAKLSKCEFWLNEVAFLGHIVSSEGVKVDPSKIQAIVDWKLPKTPTEIRSFLGLAGYYR 854 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFV+GFS IA PLT L K++KFVW ++C +SF +LK+ L APIL+LP+ G+++V+YSD Sbjct: 855 RFVKGFSIIASPLTKLLGKDAKFVWDDKCQESFEKLKSLLTQAPILSLPAEGKDYVVYSD 914 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS +GLGCVLMQ GKVIAYASR+ Sbjct: 915 ASHRGLGCVLMQEGKVIAYASRK 937 Score = 165 bits (418), Expect(2) = e-153 Identities = 80/133 (60%), Positives = 97/133 (72%) Frame = -2 Query: 400 HELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLELI 221 HELNYPTHDLELAA+V AL IWRHYLYGEKC IFTDHKSLKY+ QK+LNLRQRRWLELI Sbjct: 941 HELNYPTHDLELAAIVFALTIWRHYLYGEKCHIFTDHKSLKYLGTQKELNLRQRRWLELI 1000 Query: 220 KDYDCRIDYHPGKANVVADXXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVLAH 41 KDYDC IDYHPG+ANVVAD L E R ++ LS ++G+++ + Sbjct: 1001 KDYDCTIDYHPGEANVVAD--ALSRNSLACLTLSPLPLLLELRAMNVCLSFNSNGSIITN 1058 Query: 40 FQVRPTLIDRVRE 2 QV+ L+++V+E Sbjct: 1059 LQVKLVLLEQVQE 1071 >ref|XP_007022574.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] gi|508722202|gb|EOY14099.1| DNA/RNA polymerases superfamily protein [Theobroma cacao] Length = 1502 Score = 398 bits (1023), Expect(2) = e-151 Identities = 189/263 (71%), Positives = 225/263 (85%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VT++NKYPLPRIDDLFDQL+GA FSKIDLRSGYHQL+IR+ D+PK AF++RYGHYEFLV Sbjct: 700 VTVKNKYPLPRIDDLFDQLQGAQCFSKIDLRSGYHQLRIRNEDIPKIAFQTRYGHYEFLV 759 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 M FGLTNAPAAFMDLMNRV PYLD+FV+VFIDDIL+YS+SREEH +HLKI+LQ LR+ + Sbjct: 760 MSFGLTNAPAAFMDLMNRVFKPYLDKFVVVFIDDILIYSKSREEHEQHLKIVLQILREHR 819 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL+ V FLGHV+S EGI VD K 
EAV KW RPT+VTE+RSF+GLAGYYR Sbjct: 820 LYAKFSKCEFWLESVAFLGHVVSKEGIQVDTKKIEAVEKWPRPTSVTEIRSFVGLAGYYR 879 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFV+ FS+I PLT LTRK++KF WS+ C+ SF +LK L +AP+L+LP ++++ D Sbjct: 880 RFVKDFSKIVAPLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSLPQGTGGYMVFCD 939 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS GLGCVLMQ+GKVIAYASRQ Sbjct: 940 ASGVGLGCVLMQHGKVIAYASRQ 962 Score = 164 bits (416), Expect(2) = e-151 Identities = 80/140 (57%), Positives = 98/140 (70%), Gaps = 6/140 (4%) Frame = -2 Query: 403 KHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLEL 224 +HE NYP HDLE+AA+V ALKIWRHYLYGE C+I+TDHKSLKYIF Q+DLNLRQRRW+EL Sbjct: 965 RHEHNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMEL 1024 Query: 223 IKDYDCRIDYHPGKANVVADXXXXXXXXXXXXXXXXXSLFCEFRKLHA------QLSATT 62 +KDYDC I YHPGKANVVAD R++H+ +L Sbjct: 1025 LKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSL--VREIHSLGDIGVRLEVAE 1082 Query: 61 SGAVLAHFQVRPTLIDRVRE 2 + A+LAHF+VRP L+DR++E Sbjct: 1083 TNALLAHFRVRPILMDRIKE 1102 >gb|AAO45752.1| pol protein [Cucumis melo subsp. 
melo] Length = 923 Score = 409 bits (1051), Expect(2) = e-151 Identities = 194/263 (73%), Positives = 226/263 (85%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VT++N+YPLPRIDDLFDQL+GATVFSKIDLRSGYHQL+I+D DVPKTAFRSRYGHY+F+V Sbjct: 55 VTVKNRYPLPRIDDLFDQLQGATVFSKIDLRSGYHQLRIKDEDVPKTAFRSRYGHYQFIV 114 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 M FGLTNAPA FMDLMNRV +LD FVIVFIDDIL+YS++ EH EHL+++LQTLRD + Sbjct: 115 MSFGLTNAPAVFMDLMNRVFREFLDTFVIVFIDDILIYSKTEAEHEEHLRMVLQTLRDNK 174 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL +V FLGHV+S G+ VDP K EAV W RP+ V+EVRSFLGLAGYYR Sbjct: 175 LYAKFSKCEFWLKQVSFLGHVVSKAGVSVDPAKIEAVTGWTRPSTVSEVRSFLGLAGYYR 234 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFVE FSRIA PLT LTRK + FVWS+ C+ SF+ LK +LV+AP+LT+P FVIYSD Sbjct: 235 RFVENFSRIATPLTQLTRKGAPFVWSKACEDSFQTLKQKLVTAPVLTVPDGSGNFVIYSD 294 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS++GLGCVLMQ GKV+AYASRQ Sbjct: 295 ASKKGLGCVLMQQGKVVAYASRQ 317 Score = 154 bits (388), Expect(2) = e-151 Identities = 77/131 (58%), Positives = 85/131 (64%) Frame = -2 Query: 400 HELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLELI 221 HE NYPTHDLELAAVV ALKIWRHYLYGEK QIFTDHKSLKY F QK+LN+RQRRWLEL+ Sbjct: 321 HEQNYPTHDLELAAVVFALKIWRHYLYGEKIQIFTDHKSLKYFFTQKELNMRQRRWLELV 380 Query: 220 KDYDCRIDYHPGKANVVADXXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVLAH 41 KDYDC I YHPGKANVVAD L + + + LA Sbjct: 381 KDYDCEILYHPGKANVVADALSRKVSHSAALITRQAPLHRDLERAEIAVLVGAVTMQLAQ 440 Query: 40 FQVRPTLIDRV 8 V+PTL R+ Sbjct: 441 LTVQPTLRQRI 451 >ref|XP_010026793.1| PREDICTED: uncharacterized protein LOC104417177 [Eucalyptus grandis] Length = 1753 Score = 416 bits (1068), Expect(2) = e-150 Identities = 196/263 (74%), Positives = 232/263 (88%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VTI+NKYPLPRIDDLFDQL+GA++FSKIDLR+GYHQL+I+ D+PK+AFR+RYGHYEF V Sbjct: 629 
VTIKNKYPLPRIDDLFDQLQGASIFSKIDLRTGYHQLRIKKEDIPKSAFRTRYGHYEFTV 688 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPAAFMDLMNRV YLD+FVIVFIDDILVYSRS E+H +HL+I+LQTLRD + Sbjct: 689 MPFGLTNAPAAFMDLMNRVFKEYLDQFVIVFIDDILVYSRSSEDHEKHLRIVLQTLRDHE 748 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL +V FLGHVIS EGI VDP K EAV+ W RPT VTE+RSFLGLAGYYR Sbjct: 749 LYAKFSKCEFWLTRVAFLGHVISGEGISVDPAKIEAVINWPRPTTVTEIRSFLGLAGYYR 808 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFVEGFSR+A P+T L +K KFVW+++C+ SF+ELK++L +AP+LT+PS F IYSD Sbjct: 809 RFVEGFSRLASPMTRLLKKEEKFVWTDKCENSFQELKHKLTTAPVLTIPSGPGGFEIYSD 868 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS +GLGCVLMQ+G+V+AYASRQ Sbjct: 869 ASFKGLGCVLMQHGRVVAYASRQ 891 Score = 144 bits (363), Expect(2) = e-150 Identities = 63/79 (79%), Positives = 72/79 (91%) Frame = -2 Query: 400 HELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLELI 221 HELNYPTHDLELAA++ ALKIWRHYL GE+ QIFTDH+SLKY+F QK+LN+RQRRW+EL+ Sbjct: 895 HELNYPTHDLELAAIIFALKIWRHYLCGERFQIFTDHQSLKYLFSQKELNMRQRRWMELL 954 Query: 220 KDYDCRIDYHPGKANVVAD 164 KDYDC I YHPGKAN VAD Sbjct: 955 KDYDCEILYHPGKANKVAD 973 >ref|XP_007224141.1| hypothetical protein PRUPE_ppa016115mg [Prunus persica] gi|462421077|gb|EMJ25340.1| hypothetical protein PRUPE_ppa016115mg [Prunus persica] Length = 1269 Score = 392 bits (1008), Expect(2) = e-149 Identities = 192/263 (73%), Positives = 221/263 (84%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 +T+RN+YPLPRIDDLFDQL+GA VFSKIDLRSGYHQL++R+ DV KTAFR+RYGHYEFLV Sbjct: 437 ITVRNRYPLPRIDDLFDQLKGAKVFSKIDLRSGYHQLRVREEDVTKTAFRTRYGHYEFLV 496 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPAAFMDLMNRV YLDRFVIVF+DDILVYS+S++ H +HL ++L+TLR +Q Sbjct: 497 MPFGLTNAPAAFMDLMNRVFRRYLDRFVIVFVDDILVYSKSQKAHMKHLNLVLRTLRRRQ 556 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 
LYAKFSKC+FWLD V FLGHVISAEGIYVDP K EAV+ W RPT+VTE+RSFLGLA YYR Sbjct: 557 LYAKFSKCQFWLDIVSFLGHVISAEGIYVDPQKIEAVVNWLRPTSVTEIRSFLGLARYYR 616 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFVEGFS IA PLT LTRK KFVWS++C+++F P FVIYSD Sbjct: 617 RFVEGFSTIAAPLTYLTRKGVKFVWSDKCEETF---------------PDDSGNFVIYSD 661 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS+QGLGCVLMQ+G+VIAYASRQ Sbjct: 662 ASQQGLGCVLMQHGRVIAYASRQ 684 Score = 166 bits (419), Expect(2) = e-149 Identities = 82/132 (62%), Positives = 92/132 (69%) Frame = -2 Query: 403 KHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLEL 224 KHELNYP HDLELAAVV ALKIWRHYLYGE CQIFT HKSLKY+F QK+LNLRQRRWLEL Sbjct: 687 KHELNYPVHDLELAAVVFALKIWRHYLYGETCQIFTYHKSLKYLFTQKELNLRQRRWLEL 746 Query: 223 IKDYDCRIDYHPGKANVVADXXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVLA 44 IKDYDC I++HPG+ANVVAD + RKL L GA+LA Sbjct: 747 IKDYDCTIEHHPGRANVVADA-------------------LKMRKLRVGLDVDNQGALLA 787 Query: 43 HFQVRPTLIDRV 8 VRP L++R+ Sbjct: 788 TLHVRPVLVERI 799 >gb|AEV42258.1| hypothetical protein [Beta vulgaris] Length = 1553 Score = 410 bits (1053), Expect(2) = e-149 Identities = 189/263 (71%), Positives = 230/263 (87%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VT++NKYPLPRIDDLFDQL+GA +FSKIDLRSGYHQL+I D D+PKTAFR+RYGHYEF V Sbjct: 654 VTVKNKYPLPRIDDLFDQLQGAGMFSKIDLRSGYHQLRIVDHDIPKTAFRTRYGHYEFTV 713 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPA FMDLMNR+ PYLD+FV+VFIDDIL+YS+++EEH +HL++ILQTLRD Q Sbjct: 714 MPFGLTNAPAVFMDLMNRIFRPYLDKFVVVFIDDILIYSKNKEEHEDHLRVILQTLRDNQ 773 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL++V FLGH IS EG+ VDP K +AV +W P NVT++RSFLGLAGYYR Sbjct: 774 LYAKFSKCEFWLERVSFLGHFISKEGVLVDPAKIKAVSEWPTPKNVTDIRSFLGLAGYYR 833 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFV+ FS+IA P+T+L +K+ +F W+E+ +K+F+ LK RL SAP+LTLP+ E + +YSD Sbjct: 834 
RFVKDFSKIAKPMTNLMKKDCRFTWNEDSEKAFQTLKERLTSAPVLTLPNGNEGYDVYSD 893 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS+ GLGCVLMQNGKVIAYASRQ Sbjct: 894 ASKNGLGCVLMQNGKVIAYASRQ 916 Score = 147 bits (371), Expect(2) = e-149 Identities = 76/135 (56%), Positives = 88/135 (65%), Gaps = 3/135 (2%) Frame = -2 Query: 400 HELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLELI 221 +E+NYPTHDLELAA+V ALKIWRHYLYG C+IFTDHKSLKYIF QKDLN+RQRRWLELI Sbjct: 920 YEVNYPTHDLELAAIVFALKIWRHYLYGVTCRIFTDHKSLKYIFTQKDLNMRQRRWLELI 979 Query: 220 KDYDCRIDYHPGKANVVADXXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAV--- 50 KDYD I YH GKANVVAD L EF +L Q+ G V Sbjct: 980 KDYDLDIQYHEGKANVVADALSRKSSHSLNTLVVADKLCEEFSRL--QIEVVHEGEVERL 1037 Query: 49 LAHFQVRPTLIDRVR 5 L+ + P ++ +R Sbjct: 1038 LSALTIEPNFLEEIR 1052 >ref|XP_007010875.1| DNA/RNA polymerases superfamily protein, putative [Theobroma cacao] gi|508727788|gb|EOY19685.1| DNA/RNA polymerases superfamily protein, putative [Theobroma cacao] Length = 1347 Score = 392 bits (1007), Expect(2) = e-149 Identities = 186/246 (75%), Positives = 216/246 (87%) Frame = -1 Query: 1139 QLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLVMPFGLTNAPAAFMDLMN 960 +L+GATVFSK+DLRSGYHQL+I++ DVPKT FR+RYGHYEFLV+PFGLTNAPAAFMDLMN Sbjct: 510 ELKGATVFSKVDLRSGYHQLRIKEQDVPKTTFRTRYGHYEFLVIPFGLTNAPAAFMDLMN 569 Query: 959 RVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQLYAKFSKCEFWLDKVVF 780 RV HPYL +FVIVFIDDILVYSR +EHA HL+I+LQTLR+KQLYAKFSKCEFWL +VVF Sbjct: 570 RVFHPYLGKFVIVFIDDILVYSRDNDEHAAHLRIVLQTLREKQLYAKFSKCEFWLQEVVF 629 Query: 779 LGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYRRFVEGFSRIALPLTSLT 600 LGHV+S GIYVDP K EA+++WE+P VTE+RSFLGLAGYYRRFV+GFS IA PLT LT Sbjct: 630 LGHVVSRTGIYVDPKKVEAILQWEQPKTVTEIRSFLGLAGYYRRFVQGFSLIAAPLTRLT 689 Query: 599 RKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSDASRQGLGCVLMQNGKVI 420 RK KFV + C+ F+ELKNRL SAP+LTLP G+ FV+YSDAS+ GLGCVLMQ+ KV+ Sbjct: 690 RKGVKFVCDDVCENRFQELKNRLTSAPVLTLPVNGKGFVVYSDASKLGLGCVLMQDEKVV 749 Query: 419 AYASRQ 402 AYASRQ Sbjct: 750 AYASRQ 755 Score = 164 bits (416), 
Expect(2) = e-149 Identities = 83/135 (61%), Positives = 97/135 (71%), Gaps = 1/135 (0%) Frame = -2 Query: 403 KHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLEL 224 +HE NYPTHDLELAAVV ALKIWRHYLYGE C+IFTDHKSLKY+ QK+LNLRQRRWLEL Sbjct: 758 RHEANYPTHDLELAAVVFALKIWRHYLYGEHCRIFTDHKSLKYLLTQKELNLRQRRWLEL 817 Query: 223 IKDYDCRIDYHPGKANVVAD-XXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVL 47 IKDYD IDYHPGKANVVAD S E + L QL G+VL Sbjct: 818 IKDYDLVIDYHPGKANVVADALSRKSSSSLAALQSCYFSALIEMKSLGVQLRNGEDGSVL 877 Query: 46 AHFQVRPTLIDRVRE 2 A+F VRP+L++++++ Sbjct: 878 ANFIVRPSLLNQIKD 892 >ref|XP_012487798.1| PREDICTED: uncharacterized protein LOC105800996 [Gossypium raimondii] Length = 847 Score = 407 bits (1045), Expect(2) = e-149 Identities = 193/263 (73%), Positives = 223/263 (84%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VTI+NKYPLPRIDDLFDQL+GATVFSKIDLRSGY+QL++++ DVPKTAF +RYGHYEFLV Sbjct: 438 VTIKNKYPLPRIDDLFDQLKGATVFSKIDLRSGYYQLRVKESDVPKTAFNTRYGHYEFLV 497 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPA FMDLMNR+ PYLDRFV+VFIDDILVYSR EHAEHL+I+LQ LR+K+ Sbjct: 498 MPFGLTNAPAVFMDLMNRIFRPYLDRFVVVFIDDILVYSRDENEHAEHLRIVLQILREKK 557 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL +V FLGH++SAEGI VDP K A++ W P NV+EVRSFLGLAGYYR Sbjct: 558 LYAKFSKCEFWLREVGFLGHIVSAEGIRVDPSKISAIVNWSPPKNVSEVRSFLGLAGYYR 617 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFV+GFS IA P+T L +K+ KF W+E+C +SF LK L AP+L P G+EF+IYSD Sbjct: 618 RFVQGFSMIASPMTRLLQKDVKFEWTEKCQQSFDRLKELLTKAPVLVQPESGKEFIIYSD 677 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS GLGCVLMQ KVIAYASRQ Sbjct: 678 ASLNGLGCVLMQEDKVIAYASRQ 700 Score = 150 bits (378), Expect(2) = e-149 Identities = 76/124 (61%), Positives = 88/124 (70%) Frame = -2 Query: 400 HELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLELI 221 HE NYP HDLELA +V ALKIWRHYLYGEKC I+TDHKSLKY+ QKDLNLRQRRWLELI Sbjct: 704 
HERNYPVHDLELATIVFALKIWRHYLYGEKCHIYTDHKSLKYLMTQKDLNLRQRRWLELI 763 Query: 220 KDYDCRIDYHPGKANVVADXXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVLAH 41 K+YD IDYHPGKANVVAD SLF R ++A+LS T G+++A Sbjct: 764 KNYDLVIDYHPGKANVVAD------------ALSRKSLFA-LRAMNARLSLTDDGSIIAE 810 Query: 40 FQVR 29 + + Sbjct: 811 TKAK 814 >ref|XP_008798775.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103713573 [Phoenix dactylifera] Length = 1757 Score = 406 bits (1043), Expect(2) = e-148 Identities = 195/263 (74%), Positives = 225/263 (85%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VT+ NKYPLPRIDDLFDQL+GA +FSK+DLRSGYHQL+IR D+PKTAFR+RYGHYEFLV Sbjct: 543 VTVWNKYPLPRIDDLFDQLQGAQIFSKLDLRSGYHQLRIRAEDIPKTAFRTRYGHYEFLV 602 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPAAFMDLMNRV YLD+FV+VFIDDILVYS+S +EH EHL+I+LQTLR+ + Sbjct: 603 MPFGLTNAPAAFMDLMNRVFKSYLDQFVVVFIDDILVYSKSPQEHEEHLRIVLQTLRENK 662 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LY K KCEFWL+ + FLGHVIS +GI VDP K EAV+ W RPTNV+EVRSFLG+AGYYR Sbjct: 663 LYGKLQKCEFWLNSITFLGHVISKDGISVDPKKVEAVVDWSRPTNVSEVRSFLGMAGYYR 722 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFVEGFS IA+PL+ LT+K KF W ++C++SF+ELK RLV APIL LPS F IYS Sbjct: 723 RFVEGFSHIAMPLSRLTQKQVKFEWKKDCEQSFQELKRRLVIAPILALPSETGGFSIYSX 782 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS +GLGCVLMQN KVIAYASRQ Sbjct: 783 ASHKGLGCVLMQNEKVIAYASRQ 805 Score = 149 bits (377), Expect(2) = e-148 Identities = 73/134 (54%), Positives = 92/134 (68%), Gaps = 1/134 (0%) Frame = -2 Query: 406 DKHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLE 227 + +ELNYPTHDLELAAV+ ALKIWRH+LYGE C+IFTDHKSLKYI+ QK+LNLRQRRWLE Sbjct: 807 EPYELNYPTHDLELAAVIFALKIWRHFLYGEHCEIFTDHKSLKYIYTQKELNLRQRRWLE 866 Query: 226 LIKDYDCRIDYHPGKANVVAD-XXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAV 50 L+KDYD I+YHP KANVVAD ++ + + ++ Sbjct: 867 LLKDYDLTINYHPEKANVVADVLSRKSSNKLAALITIQRNILFDLERYEIEVRLHDPQVX 926 
Query: 49 LAHFQVRPTLIDRV 8 LA+ V+PTLI+R+ Sbjct: 927 LANLMVQPTLIERI 940 Score = 255 bits (652), Expect = 4e-65 Identities = 123/195 (63%), Positives = 152/195 (77%), Gaps = 10/195 (5%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VT+RNKYPLPRIDDLFDQL+GA +FSK+DL SGYHQL+I+ D+PKTAFR+RYGHYEFLV Sbjct: 1389 VTVRNKYPLPRIDDLFDQLQGAQIFSKLDLCSGYHQLRIKAEDIPKTAFRTRYGHYEFLV 1448 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAPAAFMDLMNRV PYLD+ V+VFIDDILVYS+S +EH EHL+I+LQTLR+ + Sbjct: 1449 MPFGLTNAPAAFMDLMNRVFKPYLDQVVVVFIDDILVYSKSPQEHEEHLRIVLQTLRENK 1508 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKW----------ERPTNVTEVR 681 LY K KCEFWL+ + FLGHVIS +GI VDP K E V+ + PT+ E+ Sbjct: 1509 LYGKLQKCEFWLNSITFLGHVISKDGISVDPKKNEKVIAYASRQLKSYELNYPTHDLELA 1568 Query: 680 SFLGLAGYYRRFVEG 636 + + + +R ++ G Sbjct: 1569 AVIFASKIWRHYLYG 1583 Score = 147 bits (372), Expect(2) = 8e-34 Identities = 73/133 (54%), Positives = 91/133 (68%), Gaps = 1/133 (0%) Frame = -2 Query: 400 HELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLELI 221 +ELNYPTHDLELAAV+ A KIWRHYLYGE C+IFTDHKSLKYI+ QK+LNLRQRRWLEL+ Sbjct: 1556 YELNYPTHDLELAAVIFASKIWRHYLYGEHCEIFTDHKSLKYIYTQKELNLRQRRWLELL 1615 Query: 220 KDYDCRIDYHPGKANVVAD-XXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVLA 44 KDYD I+YHP KANVVAD ++ + + ++ LA Sbjct: 1616 KDYDFTINYHPEKANVVADALSRKSSNKLAALITTQRNILFDLERYEIEVRLHDPQLRLA 1675 Query: 43 HFQVRPTLIDRVR 5 + V+PTLI+R++ Sbjct: 1676 NLMVQPTLIERIK 1688 Score = 25.0 bits (53), Expect(2) = 8e-34 Identities = 12/22 (54%), Positives = 15/22 (68%) Frame = -1 Query: 467 SRQGLGCVLMQNGKVIAYASRQ 402 S+ G+ +N KVIAYASRQ Sbjct: 1531 SKDGISVDPKKNEKVIAYASRQ 1552 >gb|AAT38744.1| Putative gag-pol polyprotein, identical [Solanum demissum] Length = 1515 Score = 420 bits (1079), Expect(2) = e-148 Identities = 199/263 (75%), Positives = 227/263 (86%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VTI+NKYPLPRIDDLFDQL+GAT 
FSKIDLRSGYHQL++R+ D+PKTAFR+RYGHYEFLV Sbjct: 720 VTIKNKYPLPRIDDLFDQLQGATCFSKIDLRSGYHQLRVRERDIPKTAFRTRYGHYEFLV 779 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 M FGLTNAPAAFMDLMNRV PYLD FVI+FIDDIL+YSR+ E+HA HL+ +LQTL+DK+ Sbjct: 780 MSFGLTNAPAAFMDLMNRVFRPYLDMFVIIFIDDILIYSRNEEDHASHLRTVLQTLKDKE 839 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL V FLGH++S +GI VD K EAV W RPT+ TE+RSFLGLAGYYR Sbjct: 840 LYAKFSKCEFWLKSVAFLGHIVSGDGIKVDTRKIEAVQNWPRPTSPTEIRSFLGLAGYYR 899 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFVEGFS IA PLT LT+K KF WSE C+KSF+ELK RL++AP+LTLP + V+Y D Sbjct: 900 RFVEGFSSIASPLTKLTQKTGKFQWSEACEKSFQELKKRLITAPVLTLPEGTQGLVVYCD 959 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 ASR GLGCVLMQNGKVIAYASRQ Sbjct: 960 ASRIGLGCVLMQNGKVIAYASRQ 982 Score = 135 bits (341), Expect(2) = e-148 Identities = 62/79 (78%), Positives = 67/79 (84%) Frame = -2 Query: 400 HELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLELI 221 HE NYPTHDLELA VV ALK+WRHYLYG IFTDHKSL+Y+ QK+LNLRQRRWLEL+ Sbjct: 986 HEKNYPTHDLELAVVVFALKLWRHYLYGVHVDIFTDHKSLQYVLTQKELNLRQRRWLELL 1045 Query: 220 KDYDCRIDYHPGKANVVAD 164 KDYD I YHPGKANVVAD Sbjct: 1046 KDYDLSILYHPGKANVVAD 1064 >ref|XP_007032220.1| Retrotransposon protein, putative [Theobroma cacao] gi|508711249|gb|EOY03146.1| Retrotransposon protein, putative [Theobroma cacao] Length = 1480 Score = 392 bits (1006), Expect(2) = e-148 Identities = 188/263 (71%), Positives = 223/263 (84%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VT++NKYPLPRIDDLFDQL+GA FSKIDLRSGYHQL+IR+ D+PKTAFR+RYGHYEFLV Sbjct: 705 VTVKNKYPLPRIDDLFDQLQGAQCFSKIDLRSGYHQLRIRNEDIPKTAFRTRYGHYEFLV 764 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 M FGLTNAPAAFMDLMNRV PYLD+FV+VFIDDIL+YS+SREEH +HLKI+LQ LR+ + Sbjct: 765 MSFGLTNAPAAFMDLMNRVFKPYLDKFVVVFIDDILIYSKSREEHEQHLKIVLQILREHR 824 Query: 830 
LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL++V FLGHV+S EGI VD K EAV KW R T+VTE+RSF+GLAGYYR Sbjct: 825 LYAKFSKCEFWLERVAFLGHVVSREGIQVDTKKIEAVEKWPRSTSVTEIRSFVGLAGYYR 884 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFV+ FS+I LT LTRK++KF WS+ C+ SF +LK L +AP+L+L + ++ D Sbjct: 885 RFVKDFSKIVALLTKLTRKDTKFEWSDACENSFEKLKACLTTAPVLSLLQGTGGYTVFCD 944 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS GLGCVLMQ+GKVIAYASRQ Sbjct: 945 ASGVGLGCVLMQHGKVIAYASRQ 967 Score = 163 bits (412), Expect(2) = e-148 Identities = 79/140 (56%), Positives = 97/140 (69%), Gaps = 6/140 (4%) Frame = -2 Query: 403 KHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLEL 224 +HE NYP HDLE+AA+V ALKIWRHYLYGE C+I+TDHKSLKYIF Q+DLNLRQ RW+EL Sbjct: 970 RHEQNYPIHDLEMAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQHRWMEL 1029 Query: 223 IKDYDCRIDYHPGKANVVADXXXXXXXXXXXXXXXXXSLFCEFRKLHA------QLSATT 62 +KDYDC I YHPGKANVVAD R++H+ +L Sbjct: 1030 LKDYDCTILYHPGKANVVADALSRKSMGSLAHISIGRRSL--VREIHSLGDIGVRLEVAE 1087 Query: 61 SGAVLAHFQVRPTLIDRVRE 2 + A+LAHF+VRP L+DR++E Sbjct: 1088 TNALLAHFRVRPILMDRIKE 1107 >ref|XP_006366848.1| PREDICTED: uncharacterized protein LOC102605741 [Solanum tuberosum] Length = 823 Score = 410 bits (1053), Expect(2) = e-148 Identities = 199/263 (75%), Positives = 224/263 (85%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VTIRNKYP+PRIDDLFDQL+GA++FSKIDLRSGYHQLK+R D+PKTAFR+RYGHYEFLV Sbjct: 181 VTIRNKYPIPRIDDLFDQLQGASIFSKIDLRSGYHQLKVRVEDIPKTAFRTRYGHYEFLV 240 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 M FGLTNAPAAFMDLMN V PYLD FVIVFIDDIL+YSRS+E+H HL+I+L L++K+ Sbjct: 241 MSFGLTNAPAAFMDLMNGVFRPYLDSFVIVFIDDILIYSRSKEKHEHHLRIVLGILKEKK 300 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL V FLGHV+S EGI VDP K EAV W RP +VTE+RSFLGLAGYYR Sbjct: 301 LYAKFSKCEFWLSSVAFLGHVVSKEGIMVDPKKIEAVRDWVRPASVTEIRSFLGLAGYYR 360 Query: 650 
RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFVEGFS IA PLT LT+K F WS+EC+ SF++LK L +APILTLP GE FV+Y D Sbjct: 361 RFVEGFSSIASPLTRLTQKEVVFQWSDECEVSFQKLKTLLTTAPILTLPVEGEGFVVYCD 420 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 ASR GLGCVLMQ GKVIAYASRQ Sbjct: 421 ASRIGLGCVLMQKGKVIAYASRQ 443 Score = 144 bits (364), Expect(2) = e-148 Identities = 65/79 (82%), Positives = 71/79 (89%) Frame = -2 Query: 400 HELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLELI 221 HE NYP HDLELAAVV ALKIWRHYLYG C++FTDH+SL+YIFDQ+DLNLRQRRWLEL+ Sbjct: 447 HEKNYPIHDLELAAVVFALKIWRHYLYGVHCEVFTDHRSLQYIFDQRDLNLRQRRWLELL 506 Query: 220 KDYDCRIDYHPGKANVVAD 164 KDYD I YHPGKANVVAD Sbjct: 507 KDYDMTILYHPGKANVVAD 525 >gb|AAT38724.1| Putative retrotransposon protein, identical [Solanum demissum] Length = 1602 Score = 420 bits (1079), Expect(2) = e-148 Identities = 199/263 (75%), Positives = 227/263 (86%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VTI+NKYPLPRIDDLFDQL+GAT FSKIDLRSGYHQL++R+ D+PKTAFR+RYGHYEFLV Sbjct: 726 VTIKNKYPLPRIDDLFDQLQGATCFSKIDLRSGYHQLRVRERDIPKTAFRTRYGHYEFLV 785 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 M FGLTNAPAAFMDLMNRV PYLD FVI+FIDDIL+YSR+ E+HA HL+ +LQTL+DK+ Sbjct: 786 MSFGLTNAPAAFMDLMNRVFRPYLDMFVIIFIDDILIYSRNEEDHASHLRTVLQTLKDKE 845 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL V FLGH++S +GI VD K EAV W RPT+ TE+RSFLGLAGYYR Sbjct: 846 LYAKFSKCEFWLKSVAFLGHIVSGDGIKVDTRKIEAVQNWPRPTSPTEIRSFLGLAGYYR 905 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFVEGFS IA PLT LT+K KF WSE C+KSF+ELK RL++AP+LTLP + V+Y D Sbjct: 906 RFVEGFSSIASPLTKLTQKTGKFQWSEACEKSFQELKKRLITAPVLTLPEGTQGLVVYCD 965 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 ASR GLGCVLMQNGKVIAYASRQ Sbjct: 966 ASRIGLGCVLMQNGKVIAYASRQ 988 Score = 134 bits (337), Expect(2) = e-148 Identities = 62/79 (78%), Positives = 66/79 (83%) Frame = -2 Query: 400 
HELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLELI 221 HE NYPTHDLELA VV ALK+WRHYLYG IFTDHKSL+Y+ QK LNLRQRRWLEL+ Sbjct: 992 HEKNYPTHDLELAVVVFALKLWRHYLYGVHVDIFTDHKSLQYVLTQKALNLRQRRWLELL 1051 Query: 220 KDYDCRIDYHPGKANVVAD 164 KDYD I YHPGKANVVAD Sbjct: 1052 KDYDLSILYHPGKANVVAD 1070 >emb|CAN77801.1| hypothetical protein VITISV_031477 [Vitis vinifera] Length = 855 Score = 397 bits (1021), Expect(2) = e-146 Identities = 192/265 (72%), Positives = 222/265 (83%), Gaps = 2/265 (0%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLK--IRDVDVPKTAFRSRYGHYEF 1017 VT+RNKYPLP+IDDLFDQL+G VFSKIDLRSGYHQL+ +R DVPKTAFR+RYGHYEF Sbjct: 55 VTVRNKYPLPQIDDLFDQLQGTCVFSKIDLRSGYHQLRLRVRSEDVPKTAFRTRYGHYEF 114 Query: 1016 LVMPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRD 837 LVMPFGLTNAPAAF+DLMNRV PYLD V+VFIDDILVYS+SREEH +L I+LQTLRD Sbjct: 115 LVMPFGLTNAPAAFIDLMNRVFKPYLDPSVVVFIDDILVYSKSREEHERYLTIVLQTLRD 174 Query: 836 KQLYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGY 657 KQLYAK KCEFWLDKV FL H+++ +GI VDP K + V W RP VTE+RSFLGL Y Sbjct: 175 KQLYAKLKKCEFWLDKVSFLRHMVTKDGIXVDPGKVDVVSNWRRPNTVTEIRSFLGLXXY 234 Query: 656 YRRFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIY 477 YRRF+EGFS+IALPLT LT+K KF WS +C++SF+ELK RLV+APILT+PS FV+Y Sbjct: 235 YRRFIEGFSKIALPLTRLTQKGVKFEWSNDCERSFQELKKRLVTAPILTIPSXSGGFVVY 294 Query: 476 SDASRQGLGCVLMQNGKVIAYASRQ 402 SDAS QG GCVLMQ+ KV+A AS+Q Sbjct: 295 SDASHQGWGCVLMQHXKVVACASKQ 319 Score = 151 bits (382), Expect(2) = e-146 Identities = 73/132 (55%), Positives = 92/132 (69%) Frame = -2 Query: 400 HELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLELI 221 +E NYPTHDLELAAVV ALKIWRH+L+GE +IFTDHKSLKY+F QK+LN+R RW+EL+ Sbjct: 323 YERNYPTHDLELAAVVFALKIWRHFLFGETYEIFTDHKSLKYLFSQKELNMRXGRWIELL 382 Query: 220 KDYDCRIDYHPGKANVVADXXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVLAH 41 KDYDC I YHPGKANVVAD L + R L + SGA++A+ Sbjct: 383 KDYDCIIQYHPGKANVVAD----------ALSRKSRQLLEDLRSLQVHMRVLDSGALVAN 432 Query: 40 FQVRPTLIDRVR 5 F+V+P L+ R++ Sbjct: 
433 FKVQPDLVGRIK 444 >ref|XP_007032152.1| Retrotransposon protein, putative [Theobroma cacao] gi|508711181|gb|EOY03078.1| Retrotransposon protein, putative [Theobroma cacao] Length = 1263 Score = 388 bits (997), Expect(2) = e-146 Identities = 185/263 (70%), Positives = 222/263 (84%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VT++NKYPLPRIDDLFDQL+ A FSKIDLRSGYHQL+IR+ D+PKTAFR+RYGHYEFLV Sbjct: 417 VTVKNKYPLPRIDDLFDQLQRAQCFSKIDLRSGYHQLRIRNEDIPKTAFRTRYGHYEFLV 476 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 M FGLTNAPAAFMDLMNRV PYLD+F++VFIDDIL+YS+SR+EH +HLKI+LQ L++ Q Sbjct: 477 MSFGLTNAPAAFMDLMNRVFKPYLDKFMVVFIDDILIYSKSRKEHEQHLKIVLQILKEHQ 536 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LYAKFSKCEFWL+ V FLGHV+S +GI VD K EAV KW RPT+VTE+RSF+GLAGYYR Sbjct: 537 LYAKFSKCEFWLESVAFLGHVVSKDGIQVDSKKIEAVEKWPRPTSVTEIRSFVGLAGYYR 596 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFV+ FS+I PLT LT K++KF WS+ + SF +LK L AP+L+LP ++++ D Sbjct: 597 RFVKDFSKIVAPLTKLTCKDAKFEWSDAYENSFEKLKACLTIAPVLSLPQGTRGYMVFCD 656 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 ASR GLGCVLMQ+GKVIAYAS Q Sbjct: 657 ASRVGLGCVLMQHGKVIAYASSQ 679 Score = 160 bits (405), Expect(2) = e-146 Identities = 80/138 (57%), Positives = 96/138 (69%), Gaps = 4/138 (2%) Frame = -2 Query: 403 KHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLEL 224 +HE NYP H+LE+AA+V ALKIWRHYLYGE C+I+TDHKSLKYIF Q+DLNLRQRRW+EL Sbjct: 682 RHEQNYPIHNLEIAAIVFALKIWRHYLYGETCEIYTDHKSLKYIFQQRDLNLRQRRWMEL 741 Query: 223 IKDYDCRIDYHPGKANVVAD-XXXXXXXXXXXXXXXXXSLFCEFRKL---HAQLSATTSG 56 +KDYDC I YHPGKANVVAD SL E L L + Sbjct: 742 LKDYDCTILYHPGKANVVADAFSRKSMGSLAHISTGRRSLVKEIHSLGDIGVHLEVAETN 801 Query: 55 AVLAHFQVRPTLIDRVRE 2 A+LAHF+VRP L+D+++E Sbjct: 802 ALLAHFRVRPILMDKIKE 819 >emb|CAA73042.1| polyprotein [Ananas comosus] Length = 871 Score = 396 bits (1017), Expect(2) = e-146 Identities = 189/263 (71%), Positives = 221/263 
(84%) Frame = -1 Query: 1190 VTIRNKYPLPRIDDLFDQLRGATVFSKIDLRSGYHQLKIRDVDVPKTAFRSRYGHYEFLV 1011 VTI+NKYPLPRIDDLFDQL+G+ V+SKIDL+SGYHQLKI+ DV KTAFR+RYGHYEF V Sbjct: 79 VTIKNKYPLPRIDDLFDQLQGSCVYSKIDLQSGYHQLKIKPEDVSKTAFRTRYGHYEFAV 138 Query: 1010 MPFGLTNAPAAFMDLMNRVLHPYLDRFVIVFIDDILVYSRSREEHAEHLKIILQTLRDKQ 831 MPFGLTNAP AFMDLMNRV PYLDRFV+VFIDDILVYSRS +H EHL+I+LQ LR+K+ Sbjct: 139 MPFGLTNAPTAFMDLMNRVFKPYLDRFVVVFIDDILVYSRSDADHEEHLRIVLQVLREKE 198 Query: 830 LYAKFSKCEFWLDKVVFLGHVISAEGIYVDPIKTEAVMKWERPTNVTEVRSFLGLAGYYR 651 LY K KCEFWL +V FLGH+IS GI VDP K EA+ W R T+VTE+RSFLGLAGYYR Sbjct: 199 LYVKLKKCEFWLREVAFLGHLISGSGIAVDPKKIEAIKDWPRLTSVTEIRSFLGLAGYYR 258 Query: 650 RFVEGFSRIALPLTSLTRKNSKFVWSEECDKSFRELKNRLVSAPILTLPSPGEEFVIYSD 471 RFVE F++++ PLT LT K KF+W++ C++SF+ELK RL +APILTLP G +V+YSD Sbjct: 259 RFVERFAKLSTPLTRLTHKGVKFIWNDACERSFQELKQRLTTAPILTLPVAGAGYVVYSD 318 Query: 470 ASRQGLGCVLMQNGKVIAYASRQ 402 AS GLGCVLMQ+ KVIAYASRQ Sbjct: 319 ASLNGLGCVLMQDDKVIAYASRQ 341 Score = 152 bits (384), Expect(2) = e-146 Identities = 74/135 (54%), Positives = 94/135 (69%), Gaps = 1/135 (0%) Frame = -2 Query: 403 KHELNYPTHDLELAAVVLALKIWRHYLYGEKCQIFTDHKSLKYIFDQKDLNLRQRRWLEL 224 ++E NYPTHDLELAAVV ALK+WRHYLYGE+C+++TDHKSLKY+F QK+LNLRQRRWLEL Sbjct: 344 EYEKNYPTHDLELAAVVFALKLWRHYLYGERCEVYTDHKSLKYLFTQKELNLRQRRWLEL 403 Query: 223 IKDYDCRIDYHPGKANVVAD-XXXXXXXXXXXXXXXXXSLFCEFRKLHAQLSATTSGAVL 47 +KDYD I YHPGKANVVAD L + ++L ++ + L Sbjct: 404 LKDYDLTILYHPGKANVVADALSRKSMENLAMHVVTQPRLIEQMKRLELEIVTPDTPMRL 463 Query: 46 AHFQVRPTLIDRVRE 2 V+PTL+DR++E Sbjct: 464 MTLVVQPTLLDRIKE 478