BLASTX nr result
ID: Akebia24_contig00039259
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00039259 (859 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006586476.1| PREDICTED: uncharacterized protein LOC102659...    75   1e-26
ref|XP_007018745.1| RNA-directed DNA polymerase, putative [Theob...    79   1e-26
ref|XP_006595400.1| PREDICTED: uncharacterized protein LOC100801...    77   5e-26
ref|XP_006597965.1| PREDICTED: protein NYNRIN-like, partial [Gly...    76   7e-26
ref|XP_007028775.1| RNA-directed DNA polymerase (Reverse transcr...    77   7e-26
ref|XP_003528166.1| PREDICTED: uncharacterized protein LOC100792...    76   9e-26
ref|XP_006605017.1| PREDICTED: uncharacterized protein LOC102660...    75   9e-26
ref|XP_006582089.1| PREDICTED: uncharacterized protein LOC102667...    75   9e-26
ref|XP_006584253.1| PREDICTED: uncharacterized protein LOC100812...    76   1e-25
ref|XP_006591199.1| PREDICTED: uncharacterized protein LOC102663...    75   1e-25
ref|XP_003544290.1| PREDICTED: uncharacterized protein LOC100815...    76   1e-25
ref|XP_006584201.1| PREDICTED: uncharacterized protein LOC100789...    75   2e-25
ref|XP_007038597.1| Retrotransposon, unclassified-like protein [...    78   2e-25
ref|XP_006604068.1| PREDICTED: uncharacterized protein LOC102660...    74   2e-25
ref|XP_007025429.1| RNA-directed DNA polymerase (Reverse transcr...    76   2e-25
gb|AAQ82037.1| gag/pol polyprotein [Pisum sativum]                     72   3e-25
ref|XP_007050215.1| Uncharacterized protein TCM_003960 [Theobrom...    75   7e-25
ref|XP_007036486.1| RNA-directed DNA polymerase (Reverse transcr...    73   7e-25
emb|CAN75930.1| hypothetical protein VITISV_038505 [Vitis vinifera]    75   9e-25
emb|CAN76756.1| hypothetical protein VITISV_012606 [Vitis vinifera]    77   2e-24

>ref|XP_006586476.1| PREDICTED: uncharacterized protein LOC102659780, partial [Glycine max] Length = 1680 Score = 75.1 bits (183), Expect(2) = 1e-26 Identities = 48/160 (30%), Positives = 76/160 (47%), Gaps = 2/160 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQW 560 + K+ T WN+DCQE F RIK+ L+ P + ++ + GC +G + Sbjct: 914 LRKNQTDRWNEDCQEAFGRIKKCLMNPPVLMPPVPGRPLILYMTILDESMGCMLGQHDES 973 Query: 561 D*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYLL 734 K + + + ALV+A+ +L Y+L WLI+K D ++Y+ Sbjct: 974 GKKERAVYYLSKKFTTCEMNYSLLERTCCALVWASHRLRQYMLSHTTWLISKMDPVKYIF 1033 Query: 735 NWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P LTG +ARW + L EF++ KAI G A+ D L + Sbjct: 1034 EKPALTGRIARWQVLLSEFDIVYVTQKAIKGSALADYLAQ 1073 Score = 72.0 bits (175), Expect(2) = 1e-26 Identities = 40/97 (41%), Positives = 57/97 (58%), Gaps = 1/97 (1%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSY-LKVNLLKCTFGVTVGK 202 +F DM+H +S+ + +H+ L+ + E R K Y L++N KCTFGV GK Sbjct: 796 LFHDMMHQEIEVYVDDIIAKSKSEEEHLVNLRKLFE--RLKKYQLRLNPAKCTFGVKSGK 853 Query: 203 FLGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V KEIE+DP +KAI EM P +QV+ F+ Sbjct: 854 LLGFVVSQKEIEVDPEKVKAILEMPEPRTERQVRGFL 890 Score = 60.5 bits (145), Expect = 8e-07 Identities = 38/110 (34%), Positives = 59/110 (53%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI L+ P+ +L K Q ++ +K N LM P+ G+P Sbjct: 892 RLNYIARFISQLTAICEPLFKLLRKNQTDRWNEDCQEAFGRIKKCLMNPPVLMPPVPGRP 951 Query: 496 LLLYLSSIDNAMGVLLA-HENNG-IEKPIYYLSKVLLTIENRYSCIECLC 639 L+LY++ +D +MG +L H+ +G E+ +YYLSK T E YS +E C Sbjct: 952 LILYMTILDESMGCMLGQHDESGKKERAVYYLSKKFTTCEMNYSLLERTC 1001 >ref|XP_007018745.1| RNA-directed DNA polymerase, putative [Theobroma cacao] gi|508724073|gb|EOY15970.1| RNA-directed DNA
polymerase, putative [Theobroma cacao] Length = 1685 Score = 79.0 bits (193), Expect(2) = 1e-26 Identities = 42/98 (42%), Positives = 58/98 (59%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S + DH LK + E R K LK+N KCTFGVT GK Sbjct: 767 LFHDMMHKEIEVYVDDMIAKSHTERDHTVNLKKLFERLR-KFQLKLNPAKCTFGVTSGKL 825 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFIEK 319 LGF+V K IE+DP ++AI+E+ PP K+V+ F+E+ Sbjct: 826 LGFIVSEKGIEVDPDKIRAIQELPPPKTQKEVRGFLER 863 Score = 68.2 bits (165), Expect(2) = 1e-26 Identities = 47/158 (29%), Positives = 71/158 (44%), Gaps = 8/158 (5%) Frame = +3 Query: 405 WNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQWD*KANLLP 584 WN++CQ FD+IKE L P K ++ + GC +G + K Sbjct: 893 WNEECQIAFDKIKEYLTNPPVLMPPTVEKPLILYLTVNRNSMGCVLGQHDETGMKER--- 949 Query: 585 **GASHNRK*IFMY*VSML--------ALVFATQKL*HYVLEQIVWLITKTDLIRYLLNW 740 A + FM S AL + Q+L Y+L WL+ K D I+Y+ Sbjct: 950 ---AVYYLSKKFMEYESKYSALEKMCCALAWTAQRLRQYMLYHTTWLVAKLDPIKYIFEK 1006 Query: 741 PMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P L+G +ARW + L E+++ + K+I G AI D L + Sbjct: 1007 PCLSGRIARWQVLLSEYDIVYVSQKSIKGSAIADFLAD 1044 Score = 57.4 bits (137), Expect = 7e-06 Identities = 42/119 (35%), Positives = 63/119 (52%), Gaps = 11/119 (9%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKM---------QLLYGMMTAKKYLIESRKNC*NLLT 468 ++ YI RFI L+ K PI +L K Q+ + + K+YL N Sbjct: 863 RLNYIARFISQLTCKCDPIFKLLRKRDPGEWNEECQIAFDKI--KEYLT-------NPPV 913 Query: 469 LMYPMEGKPLLLYLSSIDNAMGVLLA-HENNGI-EKPIYYLSKVLLTIENRYSCIECLC 639 LM P KPL+LYL+ N+MG +L H+ G+ E+ +YYLSK + E++YS +E +C Sbjct: 914 LMPPTVEKPLILYLTVNRNSMGCVLGQHDETGMKERAVYYLSKKFMEYESKYSALEKMC 972 >ref|XP_006595400.1| PREDICTED: uncharacterized protein LOC100801012 [Glycine max] Length = 1799 Score = 77.0 bits (188), Expect(2) = 5e-26 Identities = 47/159 (29%), Positives = 80/159 (50%), Gaps = 3/159 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKP-LDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQ 557 + KD +W +DCQ+ FD IK LL+P + GR + +++ GC +G + 
Sbjct: 1025 LRKDQGVVWTEDCQKAFDSIKNYLLEPPILIPPVEGRPLIMYLTVLE-DSMGCVLGQQDE 1083 Query: 558 WD*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYL 731 K +++ + + AL +A ++L HY++ WLI+K D I+Y+ Sbjct: 1084 TGRKEHVIYYLSKKFTDCESRYSLLEKTCCALAWAAKRLRHYMINHTTWLISKMDPIKYI 1143 Query: 732 LNWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDML 848 P LTG +ARW + L E++++ + KAI G + D L Sbjct: 1144 FEKPALTGRIARWQMLLSEYDIEYRTQKAIKGSVLADHL 1182 Score = 68.2 bits (165), Expect(2) = 5e-26 Identities = 38/96 (39%), Positives = 54/96 (56%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S + +H+ L + + R K L++N KCTFGV GK Sbjct: 907 LFHDMMHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLR-KYQLRLNPNKCTFGVRSGKL 965 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP +KAI+EM P KQV+ F+ Sbjct: 966 LGFIVSQKGIEVDPDKVKAIREMPVPQTEKQVRGFL 1001 Score = 57.4 bits (137), Expect = 7e-06 Identities = 36/110 (32%), Positives = 58/110 (52%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI +++ PI +L K Q + +K + L+ P+EG+P Sbjct: 1003 RLNYISRFISHMTATCGPIFKLLRKDQGVVWTEDCQKAFDSIKNYLLEPPILIPPVEGRP 1062 Query: 496 LLLYLSSIDNAMGVLLAH--ENNGIEKPIYYLSKVLLTIENRYSCIECLC 639 L++YL+ ++++MG +L E E IYYLSK E+RYS +E C Sbjct: 1063 LIMYLTVLEDSMGCVLGQQDETGRKEHVIYYLSKKFTDCESRYSLLEKTC 1112 >ref|XP_006597965.1| PREDICTED: protein NYNRIN-like, partial [Glycine max] Length = 1084 Score = 75.9 bits (185), Expect(2) = 7e-26 Identities = 48/160 (30%), Positives = 77/160 (48%), Gaps = 2/160 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQW 560 + K+ T WN+DCQE F RIK+ L+ P + ++ + GC +G + Sbjct: 355 LRKNQTDRWNEDCQEAFGRIKKCLMNPPVLMPPVPGRPLILYMTIIDESMGCMLGQHDES 414 Query: 561 D*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYLL 734 K + + + ALV+A+ +L Y+L WLI+K D ++Y+ Sbjct: 415 GKKERAVYYLSKKFTTCEMNYSLLERTCCALVWASHRLRQYMLSHTTWLISKMDPVKYIF 474 Query: 735 NWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P LTG 
+ARW + L EF++ KAI G A++D L + Sbjct: 475 EKPALTGRIARWQVLLSEFDIVYVTQKAIKGSALVDYLAQ 514 Score = 68.9 bits (167), Expect(2) = 7e-26 Identities = 39/97 (40%), Positives = 56/97 (57%), Gaps = 1/97 (1%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSY-LKVNLLKCTFGVTVGK 202 +F DM+H +S+ + +H+ L+ + E R K Y L++N KCTFGV GK Sbjct: 237 LFHDMMHQEIEVYVDDIIAKSKSEEEHLVNLRKLFE--RLKKYQLRLNPAKCTFGVKSGK 294 Query: 203 FLGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP +KAI EM P +QV+ F+ Sbjct: 295 LLGFIVSQKGIEVDPEKVKAILEMPEPGTERQVRGFL 331 Score = 60.5 bits (145), Expect = 8e-07 Identities = 38/110 (34%), Positives = 59/110 (53%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RF+ L+ P+ +L K Q ++ +K N LM P+ G+P Sbjct: 333 RLNYIARFLSQLTAICEPLFKLLRKNQTDRWNEDCQEAFGRIKKCLMNPPVLMPPVPGRP 392 Query: 496 LLLYLSSIDNAMGVLLA-HENNG-IEKPIYYLSKVLLTIENRYSCIECLC 639 L+LY++ ID +MG +L H+ +G E+ +YYLSK T E YS +E C Sbjct: 393 LILYMTIIDESMGCMLGQHDESGKKERAVYYLSKKFTTCEMNYSLLERTC 442 >ref|XP_007028775.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H, putative [Theobroma cacao] gi|508717380|gb|EOY09277.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H, putative [Theobroma cacao] Length = 1560 Score = 77.0 bits (188), Expect(2) = 7e-26 Identities = 41/96 (42%), Positives = 57/96 (59%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S + DH LK + E R K LK+N +KCTFGVT GK Sbjct: 672 LFHDMMHKEIEVYVDDMIAKSHTERDHTVNLKKLFERLR-KFQLKLNPVKCTFGVTSGKL 730 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP ++AI+E+ PP K+V+ F+ Sbjct: 731 LGFIVSEKGIEVDPDKIRAIQELPPPKTQKEVRGFL 766 Score = 67.8 bits (164), Expect(2) = 7e-26 Identities = 47/158 (29%), Positives = 71/158 (44%), Gaps = 8/158 (5%) Frame = +3 Query: 405 WNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQWD*KANLLP 584 WN++CQ FD+IKE L P K ++ + GC +G + K Sbjct: 798 
WNEECQIAFDKIKEYLTNPPVLIPPTVEKPLILYLTVNKNSMGCVLGQHDETGKKER--- 854 Query: 585 **GASHNRK*IFMY*VSML--------ALVFATQKL*HYVLEQIVWLITKTDLIRYLLNW 740 A + FM S AL + Q+L Y+L WL+ K D I+Y+ Sbjct: 855 ---AVYYLSKKFMEYESKYSALEKMCCALAWTAQRLRQYMLYHTTWLVAKLDPIKYIFEK 911 Query: 741 PMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P L+G +ARW + L E+++ + K+I G AI D L + Sbjct: 912 PCLSGRIARWQVLLSEYDIVYVSQKSIKGSAIADFLAD 949 >ref|XP_003528166.1| PREDICTED: uncharacterized protein LOC100792217 [Glycine max] Length = 2265 Score = 76.3 bits (186), Expect(2) = 9e-26 Identities = 47/159 (29%), Positives = 79/159 (49%), Gaps = 3/159 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKP-LDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQ 557 + KD +W +DCQ+ FD IK LL+P + GR + +++ GC +G + Sbjct: 1491 LRKDQGVVWTEDCQKAFDSIKNYLLEPPILIPPVEGRPLIMYLTVLE-DSMGCVLGQQDE 1549 Query: 558 WD*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYL 731 K + + + + AL +A ++L HY++ WLI+K D I+Y+ Sbjct: 1550 TGRKEHAIYYLSKKFTDCESRYSLLEKTCCALAWAAKRLRHYMINHTTWLISKMDPIKYI 1609 Query: 732 LNWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDML 848 P LTG +ARW + L E++++ + KAI G + D L Sbjct: 1610 FEKPALTGRIARWQMLLSEYDIEYRTRKAIKGSVLADHL 1648 Score = 68.2 bits (165), Expect(2) = 9e-26 Identities = 38/96 (39%), Positives = 54/96 (56%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S + +H+ L + + R K L++N KCTFGV GK Sbjct: 1373 LFHDMMHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLR-KYQLRLNPNKCTFGVRSGKL 1431 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP +KAI+EM P KQV+ F+ Sbjct: 1432 LGFIVSQKGIEVDPDKVKAIREMPVPQTEKQVRGFL 1467 Score = 57.8 bits (138), Expect = 5e-06 Identities = 36/110 (32%), Positives = 58/110 (52%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI +++ PI +L K Q + +K + L+ P+EG+P Sbjct: 1469 RLNYISRFISHMTATCGPIFKLLRKDQGVVWTEDCQKAFDSIKNYLLEPPILIPPVEGRP 1528 Query: 496 LLLYLSSIDNAMGVLLAH--ENNGIEKPIYYLSKVLLTIENRYSCIECLC 639 L++YL+ ++++MG +L E E 
IYYLSK E+RYS +E C Sbjct: 1529 LIMYLTVLEDSMGCVLGQQDETGRKEHAIYYLSKKFTDCESRYSLLEKTC 1578 >ref|XP_006605017.1| PREDICTED: uncharacterized protein LOC102660537 [Glycine max] Length = 1533 Score = 75.1 bits (183), Expect(2) = 9e-26 Identities = 48/160 (30%), Positives = 76/160 (47%), Gaps = 2/160 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQW 560 + K+ T WN+DCQE F RIK+ L+ P + ++ + GC +G + Sbjct: 793 LRKNQTDRWNEDCQEAFGRIKKCLMNPPVLMPPVPGRPLILYMTILDESMGCMLGQHDES 852 Query: 561 D*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYLL 734 K + + + ALV+A+ +L Y+L WLI+K D ++Y+ Sbjct: 853 GKKERAVYYLSKKFTTCEMNYSLLERTCCALVWASHRLRQYMLSHTTWLISKMDPVKYIF 912 Query: 735 NWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P LTG +ARW + L EF++ KAI G A+ D L + Sbjct: 913 EKPALTGRIARWQVLLSEFDIVYVTQKAIKGSALADYLAQ 952 Score = 69.3 bits (168), Expect(2) = 9e-26 Identities = 39/97 (40%), Positives = 56/97 (57%), Gaps = 1/97 (1%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSY-LKVNLLKCTFGVTVGK 202 +F DM+H +S+ + +H+ L+ + E R K Y L++N KCTFGV GK Sbjct: 675 LFHDMMHQEIEVYVDDIIAKSKSEEEHLVNLRKLFE--RLKKYQLRLNPAKCTFGVKSGK 732 Query: 203 FLGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP +KAI EM P +QV+ F+ Sbjct: 733 LLGFVVSQKGIEVDPEKVKAILEMPEPRTERQVRGFL 769 Score = 60.5 bits (145), Expect = 8e-07 Identities = 38/110 (34%), Positives = 59/110 (53%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI L+ P+ +L K Q ++ +K N LM P+ G+P Sbjct: 771 RLNYIARFISQLTAICEPLFKLLRKNQTDRWNEDCQEAFGRIKKCLMNPPVLMPPVPGRP 830 Query: 496 LLLYLSSIDNAMGVLLA-HENNG-IEKPIYYLSKVLLTIENRYSCIECLC 639 L+LY++ +D +MG +L H+ +G E+ +YYLSK T E YS +E C Sbjct: 831 LILYMTILDESMGCMLGQHDESGKKERAVYYLSKKFTTCEMNYSLLERTC 880 >ref|XP_006582089.1| PREDICTED: uncharacterized protein LOC102667778, partial [Glycine max] Length = 1095 Score = 75.1 bits (183), Expect(2) = 9e-26 Identities = 48/160 (30%), Positives = 76/160 (47%), Gaps = 2/160 
(1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQW 560 + K+ T WN+DCQE F RIK+ L+ P + ++ + GC +G + Sbjct: 355 LRKNQTDRWNEDCQEAFGRIKKCLMNPPVLMPPVPGRPLILYMTILDESMGCMLGQHDES 414 Query: 561 D*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYLL 734 K + + + ALV+A+ +L Y+L WLI+K D ++Y+ Sbjct: 415 GKKERAVYYLSKKFTTCEMNYSLLERTCCALVWASHRLRQYMLSHTTWLISKMDPVKYIF 474 Query: 735 NWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P LTG +ARW + L EF++ KAI G A+ D L + Sbjct: 475 EKPALTGRIARWQVLLSEFDIVYVTQKAIKGSALADYLAQ 514 Score = 69.3 bits (168), Expect(2) = 9e-26 Identities = 39/97 (40%), Positives = 56/97 (57%), Gaps = 1/97 (1%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSY-LKVNLLKCTFGVTVGK 202 +F DM+H +S+ + +H+ L+ + E R K Y L++N KCTFGV GK Sbjct: 237 LFHDMMHQEIEVYVDDIIAKSKSEEEHLVNLRKLFE--RLKKYQLRLNPAKCTFGVKSGK 294 Query: 203 FLGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP +KAI EM P +QV+ F+ Sbjct: 295 LLGFVVSQKGIEVDPEKVKAILEMPEPRTERQVRGFL 331 Score = 60.5 bits (145), Expect = 8e-07 Identities = 38/110 (34%), Positives = 59/110 (53%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI L+ P+ +L K Q ++ +K N LM P+ G+P Sbjct: 333 RLNYIARFISQLTAICEPLFKLLRKNQTDRWNEDCQEAFGRIKKCLMNPPVLMPPVPGRP 392 Query: 496 LLLYLSSIDNAMGVLLA-HENNG-IEKPIYYLSKVLLTIENRYSCIECLC 639 L+LY++ +D +MG +L H+ +G E+ +YYLSK T E YS +E C Sbjct: 393 LILYMTILDESMGCMLGQHDESGKKERAVYYLSKKFTTCEMNYSLLERTC 442 >ref|XP_006584253.1| PREDICTED: uncharacterized protein LOC100812063 [Glycine max] Length = 2036 Score = 76.3 bits (186), Expect(2) = 1e-25 Identities = 47/159 (29%), Positives = 79/159 (49%), Gaps = 3/159 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKP-LDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQ 557 + KD +W +DCQ+ FD IK LL+P + GR + +++ GC +G + Sbjct: 1545 LRKDQGVVWTEDCQKAFDSIKNYLLEPPILIPPVEGRPLIMYLTVLE-DSMGCVLGQQDE 1603 Query: 558 WD*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYL 731 K + + + + 
AL +A ++L HY++ WLI+K D I+Y+ Sbjct: 1604 TGRKEHAIYYLSKKFTDCESRYSLLEKTCCALAWAAKRLRHYMINHTTWLISKMDPIKYI 1663 Query: 732 LNWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDML 848 P LTG +ARW + L E++++ + KAI G + D L Sbjct: 1664 FEKPALTGRIARWQMLLSEYDIEYRTQKAIKGSVLADHL 1702 Score = 67.8 bits (164), Expect(2) = 1e-25 Identities = 38/96 (39%), Positives = 54/96 (56%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S + +H+ L + + R K L++N KCTFGV GK Sbjct: 1427 LFHDMMHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLR-KYQLRLNPNKCTFGVRSGKL 1485 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP +KAI+EM P KQV+ F+ Sbjct: 1486 LGFIVSQKGIEVDPDKVKAIREMPIPQTEKQVRGFL 1521 Score = 57.8 bits (138), Expect = 5e-06 Identities = 36/110 (32%), Positives = 58/110 (52%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI +++ PI +L K Q + +K + L+ P+EG+P Sbjct: 1523 RLNYISRFISHMTATCGPIFKLLRKDQGVVWTEDCQKAFDSIKNYLLEPPILIPPVEGRP 1582 Query: 496 LLLYLSSIDNAMGVLLAH--ENNGIEKPIYYLSKVLLTIENRYSCIECLC 639 L++YL+ ++++MG +L E E IYYLSK E+RYS +E C Sbjct: 1583 LIMYLTVLEDSMGCVLGQQDETGRKEHAIYYLSKKFTDCESRYSLLEKTC 1632 >ref|XP_006591199.1| PREDICTED: uncharacterized protein LOC102663869, partial [Glycine max] Length = 1095 Score = 74.7 bits (182), Expect(2) = 1e-25 Identities = 48/160 (30%), Positives = 76/160 (47%), Gaps = 2/160 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQW 560 + K+ T WN+DCQE F RIK+ L+ P + ++ + GC +G + Sbjct: 355 LRKNQTDRWNEDCQEAFGRIKKCLMNPPVLIPPVPGRPLILYMTILDESMGCMLGQHDES 414 Query: 561 D*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYLL 734 K + + + ALV+A+ +L Y+L WLI+K D ++Y+ Sbjct: 415 GKKERAVYYLSKKFTTCEMNYSLLERTCCALVWASHRLRQYMLSHTTWLISKMDPVKYIF 474 Query: 735 NWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P LTG +ARW + L EF++ KAI G A+ D L + Sbjct: 475 EKPALTGRIARWQVLLSEFDIVYVTQKAIKGSALADYLAQ 514 Score = 69.3 bits (168), Expect(2) = 1e-25 Identities = 
39/97 (40%), Positives = 56/97 (57%), Gaps = 1/97 (1%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSY-LKVNLLKCTFGVTVGK 202 +F DM+H +S+ + +H+ L+ + E R K Y L++N KCTFGV GK Sbjct: 237 LFHDMMHQEIEVYVDDIIAKSKSEEEHLVNLRKLFE--RLKKYQLRLNPAKCTFGVKSGK 294 Query: 203 FLGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP +KAI EM P +QV+ F+ Sbjct: 295 LLGFVVSQKGIEVDPEKVKAILEMPEPRTERQVRGFL 331 Score = 58.9 bits (141), Expect = 2e-06 Identities = 37/110 (33%), Positives = 59/110 (53%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI L+ P+ +L K Q ++ +K N L+ P+ G+P Sbjct: 333 RLNYIARFISQLTAICEPLFKLLRKNQTDRWNEDCQEAFGRIKKCLMNPPVLIPPVPGRP 392 Query: 496 LLLYLSSIDNAMGVLLA-HENNG-IEKPIYYLSKVLLTIENRYSCIECLC 639 L+LY++ +D +MG +L H+ +G E+ +YYLSK T E YS +E C Sbjct: 393 LILYMTILDESMGCMLGQHDESGKKERAVYYLSKKFTTCEMNYSLLERTC 442 >ref|XP_003544290.1| PREDICTED: uncharacterized protein LOC100815788 [Glycine max] Length = 2270 Score = 75.9 bits (185), Expect(2) = 1e-25 Identities = 47/159 (29%), Positives = 79/159 (49%), Gaps = 3/159 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKP-LDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQ 557 + KD +W +DCQ+ FD IK LL+P + GR + +++ GC +G + Sbjct: 1496 LRKDQGVVWTEDCQKAFDSIKNYLLEPPILIPPVEGRPLIMYLTVLE-DSMGCVLGQQDE 1554 Query: 558 WD*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYL 731 K + + + + AL +A ++L HY++ WLI+K D I+Y+ Sbjct: 1555 TGRKEHAIYYLSKKFTDCESRYSLLEKTCCALAWAAKRLRHYMINHTTWLISKMDPIKYI 1614 Query: 732 LNWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDML 848 P LTG +ARW + L E++++ + KAI G + D L Sbjct: 1615 FEKPALTGRIARWQMLLSEYDIKYRTQKAIKGNVLADHL 1653 Score = 67.8 bits (164), Expect(2) = 1e-25 Identities = 38/96 (39%), Positives = 54/96 (56%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S + +H+ L + + R K L++N KCTFGV GK Sbjct: 1378 LFHDMMHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLR-KYQLRLNPNKCTFGVRSGKL 1436 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 
313 LGF+V K IE+DP +KAI+EM P KQV+ F+ Sbjct: 1437 LGFIVSQKGIEVDPDKVKAIREMPIPQTEKQVRGFL 1472 Score = 57.8 bits (138), Expect = 5e-06 Identities = 36/110 (32%), Positives = 58/110 (52%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI +++ PI +L K Q + +K + L+ P+EG+P Sbjct: 1474 RLNYISRFISHMTATCGPIFKLLRKDQGVVWTEDCQKAFDSIKNYLLEPPILIPPVEGRP 1533 Query: 496 LLLYLSSIDNAMGVLLAH--ENNGIEKPIYYLSKVLLTIENRYSCIECLC 639 L++YL+ ++++MG +L E E IYYLSK E+RYS +E C Sbjct: 1534 LIMYLTVLEDSMGCVLGQQDETGRKEHAIYYLSKKFTDCESRYSLLEKTC 1583 >ref|XP_006584201.1| PREDICTED: uncharacterized protein LOC100789592 [Glycine max] Length = 1177 Score = 75.1 bits (183), Expect(2) = 2e-25 Identities = 47/159 (29%), Positives = 78/159 (49%), Gaps = 3/159 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKP-LDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQ 557 + KD +W DCQ+ FD IK LL+P + GR + +++ GC +G + Sbjct: 403 LRKDQGVVWTKDCQKAFDSIKNYLLEPPILIPPVEGRPLIMYLTVLE-DSMGCVLGQQDE 461 Query: 558 WD*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYL 731 K + + + + AL +A ++L HY++ WLI+K D I+Y+ Sbjct: 462 TGRKEHAIYYLSKKFTDCESRYSLLEKTCCALAWAAKRLRHYMINHTTWLISKMDPIKYI 521 Query: 732 LNWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDML 848 P LTG +ARW + L E++++ + KAI G + D L Sbjct: 522 FEKPALTGRIARWQMLLSEYDIEYRTQKAIKGSVLADHL 560 Score = 68.2 bits (165), Expect(2) = 2e-25 Identities = 38/96 (39%), Positives = 54/96 (56%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S + +H+ L + + R K L++N KCTFGV GK Sbjct: 285 LFHDMMHKEIEVYVDDMIVKSGTEEEHVEYLLKMFQRLR-KYQLRLNPNKCTFGVRSGKL 343 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP +KAI+EM P KQV+ F+ Sbjct: 344 LGFIVSQKGIEVDPDKVKAIREMPVPQTEKQVRGFL 379 Score = 58.2 bits (139), Expect = 4e-06 Identities = 36/110 (32%), Positives = 58/110 (52%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI +++ PI +L K Q + +K + L+ 
P+EG+P Sbjct: 381 RLNYISRFISHMTATCGPIFKLLRKDQGVVWTKDCQKAFDSIKNYLLEPPILIPPVEGRP 440 Query: 496 LLLYLSSIDNAMGVLLAH--ENNGIEKPIYYLSKVLLTIENRYSCIECLC 639 L++YL+ ++++MG +L E E IYYLSK E+RYS +E C Sbjct: 441 LIMYLTVLEDSMGCVLGQQDETGRKEHAIYYLSKKFTDCESRYSLLEKTC 490 >ref|XP_007038597.1| Retrotransposon, unclassified-like protein [Theobroma cacao] gi|508775842|gb|EOY23098.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1609 Score = 77.8 bits (190), Expect(2) = 2e-25 Identities = 44/98 (44%), Positives = 58/98 (59%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 IF DM+H +S+ +H LK + E CR K L++N LKC FGVT G+F Sbjct: 1024 IFHDMMHDFMEDYVDDIVVKSKKAFNHFEDLKKVFERCR-KYNLRMNPLKCAFGVTAGRF 1082 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFIEK 319 LGFMV K I++DPT +KAI+ M P KQ+KS + K Sbjct: 1083 LGFMVHRKGIDVDPTKIKAIQSMPSPMNQKQLKSLLGK 1120 Score = 65.5 bits (158), Expect(2) = 2e-25 Identities = 50/168 (29%), Positives = 83/168 (49%), Gaps = 5/168 (2%) Frame = +3 Query: 369 YNAHINKDATFIWNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS 548 + A + K FIW + Q+ F++IK+ L P + K ++ G + Sbjct: 1138 FQALLKKGVPFIWGEPQQQAFEKIKKILTSPATMIMPIKGKPMMLYLTSTPYSIGALLVQ 1197 Query: 549 *KQWD*K-----ANLLP**GASHNRK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKT 713 + K + L G+ N + + LALV+ TQKL HY L + ++TK+ Sbjct: 1198 EMDGEEKPVYYISRCLH--GSELNYPPMEKH---CLALVYTTQKLRHYFLAHKLIIVTKS 1252 Query: 714 DLIRYLLNWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVEY 857 D I++LL+ P+L+G +A+W + L EF+V KAI QA+ D+L + Sbjct: 1253 DPIKFLLSKPVLSGRVAKWLLLLGEFDVSVVQPKAIKSQALSDLLAYF 1300 Score = 63.5 bits (153), Expect = 1e-07 Identities = 38/109 (34%), Positives = 64/109 (58%), Gaps = 1/109 (0%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIK-MQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGK 492 K+ YI RFIP L + +P +L K + ++G + + + +K + T++ P++GK Sbjct: 1120 KVSYIRRFIPALGEIIVPFQALLKKGVPFIWGEPQQQAFE-KIKKILTSPATMIMPIKGK 1178 Query: 493 PLLLYLSSIDNAMGVLLAHENNGIEKPIYYLSKVLLTIENRYSCIECLC 639 P++LYL+S ++G LL E +G EKP+YY+S+ L E Y 
+E C Sbjct: 1179 PMMLYLTSTPYSIGALLVQEMDGEEKPVYYISRCLHGSELNYPPMEKHC 1227 >ref|XP_006604068.1| PREDICTED: uncharacterized protein LOC102660493, partial [Glycine max] Length = 1094 Score = 73.6 bits (179), Expect(2) = 2e-25 Identities = 47/160 (29%), Positives = 75/160 (46%), Gaps = 2/160 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQW 560 + K+ T WN+DCQE F RIK+ L+ P + ++ + GC +G + Sbjct: 355 LRKNQTDRWNEDCQEAFGRIKKCLMNPPVLMPPVPGRPLILYMTILDESMGCMLGQHDES 414 Query: 561 D*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYLL 734 K + + + ALV+A+ +L Y+L WLI+K D ++Y+ Sbjct: 415 GKKERAVYYLSKKFTTCEMNYSLLERTCCALVWASHRLRQYMLSHTTWLISKMDPVKYIF 474 Query: 735 NWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P LTG +ARW + L EF++ K I G A+ D L + Sbjct: 475 EKPALTGRIARWQVLLSEFDIVYVTQKTIKGSALADYLAQ 514 Score = 69.3 bits (168), Expect(2) = 2e-25 Identities = 39/97 (40%), Positives = 56/97 (57%), Gaps = 1/97 (1%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSY-LKVNLLKCTFGVTVGK 202 +F DM+H +S+ + +H+ L+ + E R K Y L++N KCTFGV GK Sbjct: 237 LFHDMMHQEIEVYVDDIIAKSKSEEEHLVNLRKLFE--RLKKYQLRLNPAKCTFGVKSGK 294 Query: 203 FLGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP +KAI EM P +QV+ F+ Sbjct: 295 LLGFVVSQKGIEVDPEKVKAILEMPEPRTERQVRGFL 331 Score = 60.5 bits (145), Expect = 8e-07 Identities = 38/110 (34%), Positives = 59/110 (53%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI L+ P+ +L K Q ++ +K N LM P+ G+P Sbjct: 333 RLNYIARFISQLTAICEPLFKLLRKNQTDRWNEDCQEAFGRIKKCLMNPPVLMPPVPGRP 392 Query: 496 LLLYLSSIDNAMGVLLA-HENNG-IEKPIYYLSKVLLTIENRYSCIECLC 639 L+LY++ +D +MG +L H+ +G E+ +YYLSK T E YS +E C Sbjct: 393 LILYMTILDESMGCMLGQHDESGKKERAVYYLSKKFTTCEMNYSLLERTC 442 >ref|XP_007025429.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H-like protein [Theobroma cacao] gi|508780795|gb|EOY28051.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease 
H-like protein [Theobroma cacao] Length = 1630 Score = 76.3 bits (186), Expect(2) = 2e-25 Identities = 41/96 (42%), Positives = 56/96 (58%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S + DH LK + E R K LK+N KCTFGVT GK Sbjct: 1012 LFHDMMHKEIEVYVDDMIAKSHTERDHTVNLKKLFERLR-KFQLKLNPAKCTFGVTSGKL 1070 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP ++AI+E+ PP K+V+ F+ Sbjct: 1071 LGFIVSEKGIEVDPDKIRAIQELPPPKTQKEVRGFL 1106 Score = 66.6 bits (161), Expect(2) = 2e-25 Identities = 47/158 (29%), Positives = 71/158 (44%), Gaps = 8/158 (5%) Frame = +3 Query: 405 WNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQWD*KANLLP 584 WN++CQ FD+IKE L P K ++ + GC +G + K Sbjct: 1138 WNEECQIAFDKIKEYLTNPPVLMPPTVGKPLILYLTVNKDSMGCVLGQHDETGKKER--- 1194 Query: 585 **GASHNRK*IFMY*VSML--------ALVFATQKL*HYVLEQIVWLITKTDLIRYLLNW 740 A + FM S AL + Q+L Y+L WL+ K D I+Y+ Sbjct: 1195 ---AVYYLSKKFMEYESKYSALEKMCCALAWTAQRLRQYMLYHTTWLVAKLDPIKYIFEK 1251 Query: 741 PMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P L+G +ARW + L E+++ + K+I G AI D L + Sbjct: 1252 PCLSGRIARWQVLLSEYDLVYVSQKSIKGSAIADFLAD 1289 Score = 57.4 bits (137), Expect = 7e-06 Identities = 42/119 (35%), Positives = 63/119 (52%), Gaps = 11/119 (9%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKM---------QLLYGMMTAKKYLIESRKNC*NLLT 468 ++ YI RFI L+ K PI +L K Q+ + + K+YL N Sbjct: 1108 RLNYIARFISQLTCKCDPIFKLLRKRDPGEWNEECQIAFDKI--KEYLT-------NPPV 1158 Query: 469 LMYPMEGKPLLLYLSSIDNAMGVLLA-HENNGI-EKPIYYLSKVLLTIENRYSCIECLC 639 LM P GKPL+LYL+ ++MG +L H+ G E+ +YYLSK + E++YS +E +C Sbjct: 1159 LMPPTVGKPLILYLTVNKDSMGCVLGQHDETGKKERAVYYLSKKFMEYESKYSALEKMC 1217 >gb|AAQ82037.1| gag/pol polyprotein [Pisum sativum] Length = 2262 Score = 72.4 bits (176), Expect(2) = 3e-25 Identities = 50/161 (31%), Positives = 79/161 (49%), Gaps = 3/161 (1%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKP-LDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQ 557 + K+ WNDDCQ+ FD+IKE L KP + GR + +S+ GC +G + Sbjct: 1486 
LRKNQAIKWNDDCQKAFDKIKEYLQKPPILIPPVPGRPLIMYLSVTE-NSMGCVLGRHDE 1544 Query: 558 WD*KANLLP**GASHN--RK*IFMY*VSMLALVFATQKL*HYVLEQIVWLITKTDLIRYL 731 K + + + + AL +A ++L Y+L LI+K D ++Y+ Sbjct: 1545 SGRKEHAIYYLSKKFTDCETRYSLLEKTCCALAWAARRLRQYMLNHTTLLISKMDPVKYI 1604 Query: 732 LNWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P LTG +ARW + L E+++Q + KAI G + D L E Sbjct: 1605 FEKPALTGRVARWQMILTEYDIQYTSQKAIKGSILSDYLAE 1645 Score = 70.1 bits (170), Expect(2) = 3e-25 Identities = 38/96 (39%), Positives = 56/96 (58%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S+ + +H+ L+ + + R K L++N KCTFGV GK Sbjct: 1368 LFHDMMHKEIEVYVDDMIAKSQTEEEHLVNLQKLFDRLR-KFKLRLNPNKCTFGVRSGKL 1426 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+DP +KAI+EM P KQV+ F+ Sbjct: 1427 LGFIVSEKGIEVDPAKVKAIQEMPEPKTEKQVRGFL 1462 Score = 61.2 bits (147), Expect = 5e-07 Identities = 39/110 (35%), Positives = 60/110 (54%), Gaps = 2/110 (1%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQLLYGMMTAKKYLIESRKNC*NLLTLMYPMEGKP 495 ++ YI RFI +L+ PI +L K Q + +K + ++ L+ P+ G+P Sbjct: 1464 RLNYIARFISHLTATCEPIFKLLRKNQAIKWNDDCQKAFDKIKEYLQKPPILIPPVPGRP 1523 Query: 496 LLLYLSSIDNAMGVLLA-HENNG-IEKPIYYLSKVLLTIENRYSCIECLC 639 L++YLS +N+MG +L H+ +G E IYYLSK E RYS +E C Sbjct: 1524 LIMYLSVTENSMGCVLGRHDESGRKEHAIYYLSKKFTDCETRYSLLEKTC 1573 >ref|XP_007050215.1| Uncharacterized protein TCM_003960 [Theobroma cacao] gi|508702476|gb|EOX94372.1| Uncharacterized protein TCM_003960 [Theobroma cacao] Length = 2336 Score = 74.7 bits (182), Expect(2) = 7e-25 Identities = 40/98 (40%), Positives = 56/98 (57%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S + DH LK + E R K LK+N KCTFGV GK Sbjct: 1940 LFHDMMHKEIEVYVDDMIAKSHTERDHTVNLKKLFERLR-KFQLKLNPAKCTFGVISGKL 1998 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFIEK 319 LGF+V K IE+DP ++AI+E+ PP K+V+ F+ + Sbjct: 1999 LGFIVSEKGIEVDPDKIRAIQELPPPKTQKEVRGFLRR 2036 Score = 66.6 bits (161), Expect(2) = 7e-25 
Identities = 47/158 (29%), Positives = 71/158 (44%), Gaps = 8/158 (5%) Frame = +3 Query: 405 WNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQWD*KANLLP 584 WN++CQ FD+IKE L P K ++ + GC +G + K Sbjct: 2066 WNEECQIAFDKIKEYLTNPPVLVPLTVGKPLILYLTVNKNSMGCVLGQHDETGKKER--- 2122 Query: 585 **GASHNRK*IFMY*VSML--------ALVFATQKL*HYVLEQIVWLITKTDLIRYLLNW 740 A + FM S AL + Q+L Y+L WL+ K D I+Y+ Sbjct: 2123 ---AVYYLSKKFMEYESKYSALEKMCCALAWTAQRLRQYMLYHTTWLVAKLDPIKYIFEK 2179 Query: 741 PMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P L+G +ARW + L E+++ + K+I G AI D L + Sbjct: 2180 PCLSGRIARWQVLLSEYDIVYVSQKSIKGSAIADFLAD 2217 >ref|XP_007036486.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H [Theobroma cacao] gi|508773731|gb|EOY20987.1| RNA-directed DNA polymerase (Reverse transcriptase), Ribonuclease H [Theobroma cacao] Length = 857 Score = 73.2 bits (178), Expect(2) = 7e-25 Identities = 40/96 (41%), Positives = 55/96 (57%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S + DH LK + E R K LK+N KCTFGVT GK Sbjct: 175 LFHDMMHKEIEVYVDDMITKSHTERDHTVNLKKLFERLR-KFQLKLNPAKCTFGVTSGKL 233 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LGF+V K IE+D ++AI+E+ PP K+V+ F+ Sbjct: 234 LGFIVSEKGIEVDQDKIRAIQELPPPKTQKEVRGFL 269 Score = 68.2 bits (165), Expect(2) = 7e-25 Identities = 47/158 (29%), Positives = 72/158 (45%), Gaps = 8/158 (5%) Frame = +3 Query: 405 WNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQWD*KANLLP 584 WN++CQ FD+IKE L P K ++ + GC +G + K Sbjct: 301 WNEECQIAFDKIKEYLTNPPVLMPPTVGKPLILYLTVNKNSMGCVLGQHDETGKKER--- 357 Query: 585 **GASHNRK*IFMY*VSML--------ALVFATQKL*HYVLEQIVWLITKTDLIRYLLNW 740 A + FM S AL + Q+L Y+L WL+ K D I+Y+ Sbjct: 358 ---AVYYLSKKFMEYESKYSALEKMCCALAWTAQRLRQYMLYHTTWLVAKLDPIKYIFEK 414 Query: 741 PMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDMLVE 854 P L+G +ARW + L E+++ + K+I G AI+D L + Sbjct: 415 PCLSGRIARWQVLLSEYDIVYVSQKSIKGSAIVDFLAD 452 Score = 59.3 bits (142), Expect = 2e-06 Identities = 43/119 (36%), Positives = 
63/119 (52%), Gaps = 11/119 (9%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKM---------QLLYGMMTAKKYLIESRKNC*NLLT 468 ++ YI RFI L+ K PI +L K Q+ + + K+YL N Sbjct: 271 RLNYIARFISQLTCKCDPIFKLLRKRDPGEWNEECQIAFDKI--KEYLT-------NPPV 321 Query: 469 LMYPMEGKPLLLYLSSIDNAMGVLLA-HENNGI-EKPIYYLSKVLLTIENRYSCIECLC 639 LM P GKPL+LYL+ N+MG +L H+ G E+ +YYLSK + E++YS +E +C Sbjct: 322 LMPPTVGKPLILYLTVNKNSMGCVLGQHDETGKKERAVYYLSKKFMEYESKYSALEKMC 380 >emb|CAN75930.1| hypothetical protein VITISV_038505 [Vitis vinifera] Length = 2157 Score = 75.5 bits (184), Expect(2) = 9e-25 Identities = 52/163 (31%), Positives = 80/163 (49%), Gaps = 7/163 (4%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKPLDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQW 560 + K+ +WNDDCQ F++IKE LL P R+ ++ GC + Sbjct: 1444 LRKNQPTVWNDDCQFAFEKIKEYLLSPPVLVPPTPRRPLLLYLSVSDMALGCMLAQIDDL 1503 Query: 561 D*KANLLP**GASHNRK*IFMY*VSM-------LALVFATQKL*HYVLEQIVWLITKTDL 719 + + + K + Y + LALV+AT++L HY+ E V LI++ D Sbjct: 1504 GKERAIY------YLSKRMLEYEMRYVMIERLCLALVWATRRLRHYMTEYSVHLISRLDP 1557 Query: 720 IRYLLNWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDML 848 +RYL + P LTG L RW + L EF++Q + K+I G + D L Sbjct: 1558 LRYLFDRPALTGRLMRWLVLLTEFDIQYVSQKSIKGSIVADHL 1600 Score = 65.5 bits (158), Expect(2) = 9e-25 Identities = 37/96 (38%), Positives = 52/96 (54%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S DH+ L+ E R K L++N KCTFGVT GK Sbjct: 1326 LFHDMMHRDVEVYVDDMIVKSRGRADHLDALERFFERIR-KFRLRLNPKKCTFGVTSGKL 1384 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LG MV + IE+DP +KAI +M P K+++ F+ Sbjct: 1385 LGHMVSERGIEVDPDKIKAILDMPAPKTEKEIRGFL 1420 Score = 60.1 bits (144), Expect = 1e-06 Identities = 44/116 (37%), Positives = 59/116 (50%), Gaps = 8/116 (6%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQ-------LLYGMMTAKKYLIESRKNC*NLLTLM 474 ++ YI RFI L+ PI +L K Q + K+YL+ L+ Sbjct: 1422 RLQYISRFIARLTDICEPIFRLLRKNQPTVWNDDCQFAFEKIKEYLLSPP-------VLV 1474 Query: 475 
YPMEGKPLLLYLSSIDNAMGVLLAH-ENNGIEKPIYYLSKVLLTIENRYSCIECLC 639 P +PLLLYLS D A+G +LA ++ G E+ IYYLSK +L E RY IE LC Sbjct: 1475 PPTPRRPLLLYLSVSDMALGCMLAQIDDLGKERAIYYLSKRMLEYEMRYVMIERLC 1530 >emb|CAN76756.1| hypothetical protein VITISV_012606 [Vitis vinifera] Length = 1195 Score = 77.0 bits (188), Expect(2) = 2e-24 Identities = 58/161 (36%), Positives = 83/161 (51%), Gaps = 5/161 (3%) Frame = +3 Query: 381 INKDATFIWNDDCQEVFDRIKEELLKP-LDFNVSYGRKTFVIISIFH*QRNGCFIGS*KQ 557 + K+ +WNDDCQ F++IKE LL P + GR F+ +S+ GC + Q Sbjct: 431 LRKNQPTVWNDDCQIAFEKIKEYLLSPPVLVPPMPGRPLFLYLSVSD-MALGCMLA---Q 486 Query: 558 WD*KANLLP**GASHNRK*IFMY*VSM----LALVFATQKL*HYVLEQIVWLITKTDLIR 725 D S M V + LALV+AT++L HY+ E V LI++ D +R Sbjct: 487 LDDSGKERAIYYLSKRMLEYEMRYVMIERMCLALVWATRRLRHYMTEYSVCLISRLDPLR 546 Query: 726 YLLNWPMLTGCLARWSIKLVEFNVQCKN*KAILGQAILDML 848 YL + P LTG L RW + L EF++Q + K+I G + D L Sbjct: 547 YLFDRPALTGRLMRWLVLLTEFDIQYVSQKSIKGSIVADHL 587 Score = 63.2 bits (152), Expect(2) = 2e-24 Identities = 36/96 (37%), Positives = 51/96 (53%) Frame = +2 Query: 26 IFGDMLHXXXXXXXXXXXXRSEIDPDHITILK*ILE*CREKSYLKVNLLKCTFGVTVGKF 205 +F DM+H +S DH+ L+ E R K L++N KCTFGVT GK Sbjct: 313 LFHDMMHRDVEVYVDDMIVKSRGRADHLDALERFFERIR-KFRLRLNPKKCTFGVTSGKL 371 Query: 206 LGFMVKYKEIEIDPTNLKAIKEMYPPTFVKQVKSFI 313 LG MV + IE+DP +K I +M P K+++ F+ Sbjct: 372 LGHMVSDRGIEVDPDKIKVILDMPVPKTEKEIRGFL 407 Score = 63.2 bits (152), Expect = 1e-07 Identities = 44/116 (37%), Positives = 60/116 (51%), Gaps = 8/116 (6%) Frame = +1 Query: 316 KILYICRFIPNLSKKALPIMHILIKMQ-------LLYGMMTAKKYLIESRKNC*NLLTLM 474 ++ YI RFI L+ PI +L K Q K+YL+ L+ Sbjct: 409 RLQYISRFIARLTDICEPIFRLLRKNQPTVWNDDCQIAFEKIKEYLLSPP-------VLV 461 Query: 475 YPMEGKPLLLYLSSIDNAMGVLLAH-ENNGIEKPIYYLSKVLLTIENRYSCIECLC 639 PM G+PL LYLS D A+G +LA +++G E+ IYYLSK +L E RY IE +C Sbjct: 462 PPMPGRPLFLYLSVSDMALGCMLAQLDDSGKERAIYYLSKRMLEYEMRYVMIERMC 517