BLASTX nr result
ID: Rehmannia31_contig00028612
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia31_contig00028612 (797 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PIM97453.1| hypothetical protein CDL12_30077 [Handroanthus im... 141 3e-34 ref|XP_012850054.1| PREDICTED: uncharacterized protein LOC105969... 131 9e-31 ref|XP_012828530.1| PREDICTED: uncharacterized protein LOC105949... 129 3e-30 ref|XP_012846702.1| PREDICTED: uncharacterized protein LOC105966... 128 8e-30 ref|XP_012844821.1| PREDICTED: uncharacterized protein LOC105964... 128 8e-30 ref|XP_012847850.1| PREDICTED: uncharacterized protein LOC105967... 128 8e-30 ref|XP_012828505.1| PREDICTED: uncharacterized protein LOC105949... 128 1e-29 ref|XP_012858045.1| PREDICTED: uncharacterized protein LOC105977... 128 1e-29 ref|XP_012853187.1| PREDICTED: uncharacterized protein LOC105972... 127 1e-29 ref|XP_012844111.1| PREDICTED: uncharacterized protein LOC105964... 127 3e-29 ref|XP_012855480.1| PREDICTED: uncharacterized protein LOC105974... 127 3e-29 gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] 126 3e-29 ref|XP_012850055.1| PREDICTED: uncharacterized protein LOC105969... 126 5e-29 gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] 126 5e-29 gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theob... 125 1e-28 ref|XP_020549796.1| uncharacterized protein LOC110012043 [Sesamu... 124 3e-28 gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao] 123 4e-28 gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] 123 4e-28 gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] 121 2e-27 gb|PKU62688.1| Putative ribonuclease H protein [Dendrobium caten... 116 2e-27 >gb|PIM97453.1| hypothetical protein CDL12_30077 [Handroanthus impetiginosus] Length = 983 Score = 141 bits (355), Expect = 3e-34 Identities = 79/238 (33%), Positives = 126/238 (52%), Gaps = 2/238 (0%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLNFLV 180 + W W RND K+R + F RII+ V ++ + L+NK F W+G + +A L + Sbjct: 745 ICWHIWEARNDAKYRYICFSARRIIFKVRQHIQHIILTNKLTFRHWRGDTAVAQALGQHI 804 Query: 181 PKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPAV 354 P +K++ V+W KP G KLNT ++ + G + AF + Sbjct: 805 PLPPPRKSMAVWWSKPKPGEWKLNTDGAAKRSTCRAGAGGILHDHTGMSILAFTHYLGLG 864 Query: 355 NSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKHKF 534 +SLEAE+ A+ G+ + G + WIE+DS V +I++ + G W++ H I+ Sbjct: 865 SSLEAELTAIHRGLYLCVAKGFT-KIWIEMDSQLAVQLIQSDQHGSWKIHHLLDGIRQLM 923 Query: 535 KDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + + I+H FREGN PAD+LAN+ CD Q + E P + L GL+R+D+ +P+FR Sbjct: 924 RHFPIRISHIFREGNRPADFLANMACDLQYSGEIKPQDFTRTLIGLLRMDRWGLPSFR 981 >ref|XP_012850054.1| PREDICTED: uncharacterized protein LOC105969824 [Erythranthe guttata] Length = 1805 Score = 131 bits (329), Expect = 9e-31 Identities = 77/240 (32%), Positives = 120/240 (50%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLN-FL 177 + W+ W+ RND KH+ + + II+ V ++ L +N W G H+A L + Sbjct: 1563 ILWYLWIARNDSKHKDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYY 1622 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 ++ V W P G +KLNT +IR E + AFQ A Sbjct: 1623 RVRTPTLTPHRVVWLPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFQERISA 1682 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ +V ++ + GHW LQ I++ Sbjct: 1683 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVVVRLLSHTDEGHWSLQSSLTAIRN 1740 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1741 SLSTLEYRITHIYREGNKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1800 >ref|XP_012828530.1| PREDICTED: uncharacterized protein LOC105949758 [Erythranthe guttata] Length = 1245 Score = 129 bits (325), Expect = 3e-30 Identities = 76/240 (31%), Positives = 119/240 (49%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLN-FL 177 + W+ W+ RND KH+ + + II+ V ++ L +N W G H+A L + Sbjct: 1003 ILWYLWIARNDSKHKDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYY 1062 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 ++ V W P G +KLNT +IR E + AF A Sbjct: 1063 RVRTPTLTPHRVVWLPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISA 1122 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ +V ++ + GHW LQ I++ Sbjct: 1123 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVVVRLLSHTDQGHWSLQSSLTAIRN 1180 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1181 SLSTLEYKITHIYREGNKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1240 >ref|XP_012846702.1| PREDICTED: uncharacterized protein LOC105966658 [Erythranthe guttata] Length = 1233 Score = 128 bits (322), Expect = 8e-30 Identities = 76/240 (31%), Positives = 118/240 (49%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLN-FL 177 + W+ W+ RND KH+ + + II+ V ++ L +N W G H+A L + Sbjct: 991 ILWYLWIARNDSKHKDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYY 1050 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 ++ V W P G +KLNT +IR E + AF A Sbjct: 1051 RVRTPTLTPHRVVWLPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISA 1110 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ V ++ + GHW LQ I++ Sbjct: 1111 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRN 1168 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1169 SLSTLEYRITHIYREGNKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1228 >ref|XP_012844821.1| PREDICTED: uncharacterized protein LOC105964855 [Erythranthe guttata] Length = 1237 Score = 128 bits (322), Expect = 8e-30 Identities = 76/240 (31%), Positives = 118/240 (49%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLN-FL 177 + W+ W+ RND KH+ + + II+ V ++ L +N W G H+A L + Sbjct: 995 ILWYLWIARNDSKHKDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYY 1054 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 ++ V W P G +KLNT +IR E + AF A Sbjct: 1055 RVRTPTLTPHRVVWLPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISA 1114 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ V ++ + GHW LQ I++ Sbjct: 1115 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRN 1172 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1173 SLSTLEYRITHIYREGNKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1232 >ref|XP_012847850.1| PREDICTED: uncharacterized protein LOC105967783 [Erythranthe guttata] Length = 1298 Score = 128 bits (322), Expect = 8e-30 Identities = 76/240 (31%), Positives = 118/240 (49%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLN-FL 177 + W+ W+ RND KH+ + + II+ V ++ L +N W G H+A L + Sbjct: 1056 ILWYLWIARNDSKHKDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYY 1115 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 ++ V W P G +KLNT +IR E + AF A Sbjct: 1116 RVRTPTLTPHRVVWLPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISA 1175 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ V ++ + GHW LQ I++ Sbjct: 1176 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRN 1233 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1234 SLSTLEYRITHIYREGNKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1293 >ref|XP_012828505.1| PREDICTED: uncharacterized protein LOC105949732 [Erythranthe guttata] Length = 1237 Score = 128 bits (321), Expect = 1e-29 Identities = 75/240 (31%), Positives = 118/240 (49%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLN-FL 177 + W+ W+ RND KH+ + + II+ V ++ L +N W G H+A L + Sbjct: 995 ILWYLWIARNDSKHKDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYY 1054 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 ++ V W P G +KLNT +IR E + AF A Sbjct: 1055 RVRTPTLTPHRVVWLPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISA 1114 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ + ++ + GHW LQ I++ Sbjct: 1115 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVAIRLLSHMDQGHWSLQSSLTAIRN 1172 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1173 SLSTLEYRITHIYREGNKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1232 >ref|XP_012858045.1| PREDICTED: uncharacterized protein LOC105977287 [Erythranthe guttata] Length = 1237 Score = 128 bits (321), Expect = 1e-29 Identities = 75/240 (31%), Positives = 118/240 (49%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLN-FL 177 + W+ W+ RND KH+ + + II+ V ++ L +N W G H+A L + Sbjct: 995 ILWYLWIARNDSKHKDITVRASSIIYRVIQHIKILHQTNLLSADSWTGIPHVAESLGLYY 1054 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 ++ + V W P G +KLNT +IR E + AF A Sbjct: 1055 RVRTPTLRPHRVVWLPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISA 1114 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ V ++ + GHW L I++ Sbjct: 1115 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVAVRLLSHTDQGHWSLHSSLTAIRN 1172 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1173 SLSTLEYRITHIYREGNKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1232 >ref|XP_012853187.1| PREDICTED: uncharacterized protein LOC105972756 [Erythranthe guttata] Length = 1285 Score = 127 bits (320), Expect = 1e-29 Identities = 75/240 (31%), Positives = 117/240 (48%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLN-FL 177 + W+ W+ RND KH+ + + II+ V ++ L + W G H+A L + Sbjct: 1043 ILWYLWIARNDSKHKDITVRASSIIYRVIQHIRILHQTKLLSADSWTGIPHVAESLGLYY 1102 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 ++ V W P G +KLNT +IR E + AF A Sbjct: 1103 RVRTPTLTPYRVVWLPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAILAFHERISA 1162 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ V ++ + GHW LQ I++ Sbjct: 1163 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRN 1220 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1221 SLSTLEYRITHIYREGNTVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1280 >ref|XP_012844111.1| PREDICTED: uncharacterized protein LOC105964144 [Erythranthe guttata] Length = 1237 Score = 127 bits (318), Expect = 3e-29 Identities = 76/240 (31%), Positives = 117/240 (48%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLN-FL 177 + W+ W+ RND KH+ + + II V ++ L +N W G H+A L + Sbjct: 995 ILWYLWIARNDSKHKDITVRASSIINRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYY 1054 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 ++ V W P G +KLNT +IR E + AF A Sbjct: 1055 RVRTPTLTPHRVVWLPPDPGWMKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISA 1114 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ V ++ + GHW LQ I++ Sbjct: 1115 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRN 1172 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1173 SLSSLEYRITHIYREGNKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1232 >ref|XP_012855480.1| PREDICTED: uncharacterized protein LOC105974867 [Erythranthe guttata] Length = 1393 Score = 127 bits (318), Expect = 3e-29 Identities = 75/240 (31%), Positives = 117/240 (48%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLN-FL 177 + W+ W+ RND KH+ + + II+ V ++ L + W G H+A L + Sbjct: 1151 ILWYLWIARNDSKHKDITVRASSIIYRVIQHIRILHQTKLLSADSWTGIPHVAESLGLYY 1210 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 ++ V W P G +KLNT +IR E + AF A Sbjct: 1211 RVRTPTLTPHRVVWLPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAILAFHERISA 1270 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ V ++ + GHW LQ I++ Sbjct: 1271 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRN 1328 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1329 SLSTLEYRITHIYREGNTVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1388 >gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 126 bits (317), Expect = 3e-29 Identities = 75/237 (31%), Positives = 118/237 (49%), Gaps = 1/237 (0%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLNFLV 180 + WF WVERND KHR LG R++W V + +LSL + WKG IA ++ Sbjct: 675 ILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIL 734 Query: 181 PKSSVKKAILVFWDKPPVGCLKLNT-XXXXXXXXXXXXXVIRNDRGEVLRAFQAHFPAVN 357 S+ + W KP G KLN ++R+ G ++ F + N Sbjct: 735 QAESLAPPKVFSWHKPTTGEFKLNVDGSAKHSHNAAGGGILRDHAGVMVFGFSENLGIQN 794 Query: 358 SLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKHKFK 537 SL+AE+ AL G+ IL R R WIE+D+++++ +++ G +++ + ++ Sbjct: 795 SLQAELLALYRGL-ILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLS 853 Query: 538 DKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 +H FREGN AD+LAN G + Q Q F + GKL+G++R+D+ P R Sbjct: 854 HFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQ--GKLRGMLRLDQTSFPYVR 908 >ref|XP_012850055.1| PREDICTED: uncharacterized protein LOC105969825 [Erythranthe guttata] Length = 1331 Score = 126 bits (316), Expect = 5e-29 Identities = 76/240 (31%), Positives = 116/240 (48%), Gaps = 4/240 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLNFLV 180 + W+ W+ RND KH+ + + II+ V ++ L +N W G H+A L Sbjct: 1089 ILWYLWIARNDSKHKDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHMAESLGLYY 1148 Query: 181 PKSS-VKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPA 351 + V W P G +KLNT +IR E + AF A Sbjct: 1149 RVGTPTLTPHRVVWLPPDPGWVKLNTNGARRASTQIAAIGGIIRGSDAEAIVAFHERISA 1208 Query: 352 VNSLEAEVKALAMGID-ILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 +S+ AE+ ALA G+ ++ R T R WIE+D+ V ++ + GHW LQ I++ Sbjct: 1209 PSSIAAELAALASGLRFVIQRQFT--RVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRN 1266 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 + ITH +REGN AD LANLGC + + F + LP ++ +IR+D++ P+FR Sbjct: 1267 SLSTLEYRITHIYREGNKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1326 >gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 126 bits (316), Expect = 5e-29 Identities = 75/237 (31%), Positives = 118/237 (49%), Gaps = 1/237 (0%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLNFLV 180 + WF WVERND KHR LG R++W V + +LSL + WKG IA + Sbjct: 2016 ILWFLWVERNDAKHRNLGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIF 2075 Query: 181 PKSSVKKAILVFWDKPPVGCLKLNT-XXXXXXXXXXXXXVIRNDRGEVLRAFQAHFPAVN 357 S+ + W KP +G KLN ++R+ GE++ F + N Sbjct: 2076 QAESLAPPKVFSWHKPSLGEFKLNVDGSAKQSHNAAGGGILRDHAGEMVFGFSENLGTQN 2135 Query: 358 SLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKHKFK 537 SL+AE+ AL G+ IL R R WIE+D+++++ +++ G +++ + ++ Sbjct: 2136 SLQAELLALYRGL-ILCRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLS 2194 Query: 538 DKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 +H FREGN AD+LAN G + Q Q F + GKL+G++ +D+ P R Sbjct: 2195 HFSFRFSHIFREGNQAADFLANRGHEHQNLQVFTVAQ--GKLRGMLCLDQTSFPYVR 2249 >gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 125 bits (313), Expect = 1e-28 Identities = 79/235 (33%), Positives = 116/235 (49%), Gaps = 2/235 (0%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLNFLV 180 +FWF WVERND KHR LG +RIIW + L KL WKG IA H F Sbjct: 1100 IFWFVWVERNDAKHRDLGMYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLDIAIHWGFNF 1159 Query: 181 PKSSVKKAILVFWDKPPVGCLKLNT--XXXXXXXXXXXXXVIRNDRGEVLRAFQAHFPAV 354 + + ++ W KP +G LKLN V+R+ G ++ F +F Sbjct: 1160 AQERQARPKIINWIKPLIGELKLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQ 1219 Query: 355 NSLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKHKF 534 NSL+AE+ AL G+ + + R WIEVD+ ++ MI+N G +++Q+ I+ Sbjct: 1220 NSLQAELLALHRGLCLCMEYNV-SRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCL 1278 Query: 535 KDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIP 699 + V I+H REGN AD+L+ G Q F + G+L+G V+++ P Sbjct: 1279 QVISVRISHIHREGNQAADFLSKHGHTHQNLHVF--TEAQGELRGRTLVNRVEHP 1331 >ref|XP_020549796.1| uncharacterized protein LOC110012043 [Sesamum indicum] Length = 1116 Score = 124 bits (310), Expect = 3e-28 Identities = 73/225 (32%), Positives = 106/225 (47%), Gaps = 4/225 (1%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLNFLV 180 + WF W+ERND KHR F ERI W VH ++ S WKG +A + + Sbjct: 628 ILWFGWLERNDVKHRNKNFNSERIKWKVHQHIVTTFKSKTTKRINWKGDRFVAKSMGLEL 687 Query: 181 PKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX--VIRNDRGEVLRAFQAHFPAV 354 K +V W KP +G +K+NT + R++ G V+ AF Sbjct: 688 GSQYKPKIKIVKWTKPELGWIKINTDGASKGNPGRAGAGGIARDEEGAVILAFYEVLGET 747 Query: 355 NSLEAEVKALAMGIDILSRFGTEG--RWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKH 528 N+ AEV AL + + TE R WIEVD+ ++ +++ + HW L+H I+ Sbjct: 748 NNTFAEVFALFKALQLCQ---TENIPRIWIEVDANCILHLVQQSEKAHWPLKHMLTHIRL 804 Query: 529 KFKDKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKL 663 K + ITH +REGN ADYLANL C ++ + G+L Sbjct: 805 MLKKVEYKITHIYREGNKAADYLANLACSTNSSKLLRGEEIQGQL 849 >gb|EOX96783.1| Uncharacterized protein TCM_005954 [Theobroma cacao] Length = 1134 Score = 123 bits (309), Expect = 4e-28 Identities = 72/237 (30%), Positives = 119/237 (50%), Gaps = 1/237 (0%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLNFLV 180 + WF W+ERND KHR G +R+IW + +L + WKG + IA L F Sbjct: 897 ICWFLWLERNDAKHRHTGLYPDRVIWRTMKHCRQLYDGSLLQQWQWKGDTDIAAMLGFSF 956 Query: 181 PKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXX-VIRNDRGEVLRAFQAHFPAVN 357 P +++W KP +G KLN V+R+ G+++ F + N Sbjct: 957 PPQQHASPQIIYWKKPSIGEYKLNVDGSSRNGLHAATGGVLRDHTGKLIFGFSENIGPCN 1016 Query: 358 SLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKHKFK 537 SL+AE++AL G+ + E + WIE+D++A + +I+ K G + +++ I+ Sbjct: 1017 SLQAELRALLRGLLLCKERHIE-KLWIEMDALAAIQLIQPSKKGPYDIRYLLESIRMCLS 1075 Query: 538 DKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 ++H FREGN ADYL+N G Q F + G+L G++++D++ +P R Sbjct: 1076 SFSYRLSHTFREGNKAADYLSNEGHKHQNLCVF--TEAQGQLHGMLKLDRLNLPYVR 1130 >gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 123 bits (309), Expect = 4e-28 Identities = 76/235 (32%), Positives = 117/235 (49%), Gaps = 1/235 (0%) Frame = +1 Query: 7 WFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLNFLVPK 186 WF WVERND KHR LG RI+W + + +LSL + WKG IA Sbjct: 2016 WFLWVERNDAKHRNLGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQA 2075 Query: 187 SSVKKAILVFWDKPPVGCLKLNT-XXXXXXXXXXXXXVIRNDRGEVLRAFQAHFPAVNSL 363 S+ + W KP +G KLN V+R+ G ++ F + NSL Sbjct: 2076 ESLPPPKVFPWHKPSIGEFKLNVDGSAKLSQNAAGGGVLRDHAGVMVFGFSENLGIQNSL 2135 Query: 364 EAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKHKFKDK 543 +AE+ AL G+ IL R R WIE+D+ +++ +++ + G +++ + I+ Sbjct: 2136 QAELLALYRGL-ILCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRYLLVSIRQLLSHF 2194 Query: 544 DVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 ++H FREGN AD+LAN G + Q Q + GKL+G++R+D+ +P R Sbjct: 2195 SFRLSHIFREGNQAADFLANRGHEHQSLQVVTVAQ--GKLRGMLRLDQTSLPYVR 2247 >gb|EOY34747.1| Uncharacterized protein TCM_042327 [Theobroma cacao] Length = 1014 Score = 121 bits (304), Expect = 2e-27 Identities = 72/234 (30%), Positives = 118/234 (50%), Gaps = 1/234 (0%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLNFLV 180 + WF W+ERND KHR LG +R++W + L +L + WKG + IA F + Sbjct: 779 ICWFLWLERNDAKHRHLGMYSDRVVWKIMKVLRQLQDGSLLKKWQWKGDTDIAAMWGFTL 838 Query: 181 PKSSVKKAILVFWDKPPVGCLKLNT-XXXXXXXXXXXXXVIRNDRGEVLRAFQAHFPAVN 357 P + ++ W KP G KLN ++R+ G ++ F + N Sbjct: 839 PLKIRESPQIIHWVKPVTGEYKLNVDGSSRHNQSAATGGLLRDHTGTLVFGFSENIGPSN 898 Query: 358 SLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKHKFK 537 SL+AE++AL G+ +L + + WIE+D++ ++ MI+ K G +++ I+ Sbjct: 899 SLQAELRALLRGL-LLCKDRNIEKLWIEMDALVVIQMIQQSKKGSHDIRYLLASIRKCLS 957 Query: 538 DKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIP 699 I+H FREGN AD+L+N G Q Q S GKL G++++D++ +P Sbjct: 958 FFSFRISHIFREGNQAADFLSNKGHTHQNLQVI--SEAQGKLHGMLKLDRLNLP 1009 >gb|PKU62688.1| Putative ribonuclease H protein [Dendrobium catenatum] Length = 287 Score = 116 bits (291), Expect = 2e-27 Identities = 73/237 (30%), Positives = 115/237 (48%), Gaps = 1/237 (0%) Frame = +1 Query: 1 VFWFTWVERNDCKHRGLGFKGERIIWNVHHYLSKLSLSNKFDFSIWKGFSHIANHLNF-L 177 + W+ W RN KH + +I V + + +L +N +K H+AN L Sbjct: 50 ICWYLWDSRNASKHDNMVMNVLNVIAKVKNKILQLYGANLIKVESFKNCIHVANEFGLNL 109 Query: 178 VPKSSVKKAILVFWDKPPVGCLKLNTXXXXXXXXXXXXXVIRNDRGEVLRAFQAHFPAVN 357 + +K L+ W P VGC KLNT +IR+ G+V+ AF +N Sbjct: 110 NLHPTTRKDKLIHWRLPVVGCFKLNTDGSYNKVNARCGGIIRDYSGKVVEAFAGPSSGIN 169 Query: 358 SLEAEVKALAMGIDILSRFGTEGRWWIEVDSMALVWMIKNRKIGHWRLQHDFLRIKHKFK 537 +++AEV +L G+ + G W +EVD+M L+ I+ + + + IKH Sbjct: 170 AIQAEVDSLLYGVQLCLTLGLYNIW-VEVDAMLLIHYIEGKTSLNPSNFYKIRDIKHCLS 228 Query: 538 DKDVVITHNFREGNAPADYLANLGCDEQRTQEFDPSNLPGKLKGLIRVDKMRIPAFR 708 I+H REGNA AD LA LGC +F+ +LP ++KGLI++D++ +P R Sbjct: 229 LISYYISHIMREGNAVADGLAKLGCSLTGFFDFNDHSLPREIKGLIKLDQLGLPYIR 285