BLASTX nr result
ID: Perilla23_contig00009450
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Perilla23_contig00009450 (1523 letters) Database: ./nr 77,306,371 sequences; 28,104,191,420 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_012858045.1| PREDICTED: uncharacterized protein LOC105977... 166 4e-38 ref|XP_012828530.1| PREDICTED: uncharacterized protein LOC105949... 165 9e-38 ref|XP_012850054.1| PREDICTED: uncharacterized protein LOC105969... 165 1e-37 ref|XP_012847850.1| PREDICTED: uncharacterized protein LOC105967... 165 1e-37 ref|XP_012846702.1| PREDICTED: uncharacterized protein LOC105966... 165 1e-37 ref|XP_012844821.1| PREDICTED: uncharacterized protein LOC105964... 165 1e-37 ref|XP_012828505.1| PREDICTED: uncharacterized protein LOC105949... 164 2e-37 ref|XP_012844111.1| PREDICTED: uncharacterized protein LOC105964... 163 3e-37 ref|XP_012850055.1| PREDICTED: uncharacterized protein LOC105969... 163 4e-37 ref|XP_012843177.1| PREDICTED: uncharacterized protein LOC105963... 163 4e-37 ref|XP_012855480.1| PREDICTED: uncharacterized protein LOC105974... 162 6e-37 ref|XP_012853187.1| PREDICTED: uncharacterized protein LOC105972... 162 1e-36 ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobrom... 117 2e-23 ref|XP_012850129.1| PREDICTED: uncharacterized protein LOC105969... 115 8e-23 ref|XP_007010390.1| Retrotransposon, unclassified-like protein [... 115 1e-22 ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobrom... 115 1e-22 ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobrom... 113 4e-22 ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobrom... 113 4e-22 ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobrom... 111 2e-21 ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobrom... 107 4e-20 >ref|XP_012858045.1| PREDICTED: uncharacterized protein LOC105977287 [Erythranthe guttatus] Length = 1237 Score = 166 bits (421), Expect = 4e-38 Identities = 98/284 (34%), Positives = 136/284 (47%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H H HI ++PCLI W+ W RND KH Sbjct: 949 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKH 1008 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II++V +H+ L + L W + + V P LR V W Sbjct: 1009 KDITVRASSIIYRVIQHIKILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLRPHRVVW 1068 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN DG+ S Q AA + +++ + A SS A+ Sbjct: 1069 LPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISAPSSIAAELAALASG 1128 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD V +++ +G L + +R L R+THI+REG Sbjct: 1129 LRFVIQRQFTRVWIELDAEVAVRLLSHTDQGHWSLHSSLTAIRNSLSTLEYRITHIYREG 1188 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N+ AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1189 NKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1232 >ref|XP_012828530.1| PREDICTED: uncharacterized protein LOC105949758 [Erythranthe guttatus] Length = 1245 Score = 165 bits (418), Expect = 9e-38 Identities = 97/284 (34%), Positives = 137/284 (48%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H H HI ++PCLI W+ W RND KH Sbjct: 957 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKH 1016 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II++V +H+ L + L W + + V P L V W Sbjct: 1017 KDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVW 1076 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN DG+ S Q AA + +++ + A SS A+ Sbjct: 1077 LPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISAPSSIAAELAALASG 1136 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD VV +++ +G L+ + +R L ++THI+REG Sbjct: 1137 LRFVIQRQFTRVWIELDAEVVVRLLSHTDQGHWSLQSSLTAIRNSLSTLEYKITHIYREG 1196 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N+ AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1197 NKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1240 >ref|XP_012850054.1| PREDICTED: uncharacterized protein LOC105969824 [Erythranthe guttatus] Length = 1805 Score = 165 bits (417), Expect = 1e-37 Identities = 98/284 (34%), Positives = 135/284 (47%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H H HI ++PCLI W+ W RND KH Sbjct: 1517 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKH 1576 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II++V +H+ L + L W + + V P L V W Sbjct: 1577 KDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVW 1636 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN DG+ S Q AA + ++ + A SS A+ Sbjct: 1637 LPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFQERISAPSSIAAELAALASG 1696 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD VV +++ G L+ + +R L R+THI+REG Sbjct: 1697 LRFVIQRQFTRVWIELDAEVVVRLLSHTDEGHWSLQSSLTAIRNSLSTLEYRITHIYREG 1756 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N+ AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1757 NKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1800 >ref|XP_012847850.1| PREDICTED: uncharacterized protein LOC105967783 [Erythranthe guttatus] Length = 1298 Score = 165 bits (417), Expect = 1e-37 Identities = 97/284 (34%), Positives = 136/284 (47%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H H HI ++PCLI W+ W RND KH Sbjct: 1010 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKH 1069 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II++V +H+ L + L W + + V P L V W Sbjct: 1070 KDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVW 1129 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN DG+ S Q AA + +++ + A SS A+ Sbjct: 1130 LPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISAPSSIAAELAALASG 1189 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD V +++ +G L+ + +R L R+THI+REG Sbjct: 1190 LRFVIQRQFTRVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRNSLSTLEYRITHIYREG 1249 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N+ AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1250 NKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1293 >ref|XP_012846702.1| PREDICTED: uncharacterized protein LOC105966658 [Erythranthe guttatus] Length = 1233 Score = 165 bits (417), Expect = 1e-37 Identities = 97/284 (34%), Positives = 136/284 (47%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H H HI ++PCLI W+ W RND KH Sbjct: 945 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKH 1004 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II++V +H+ L + L W + + V P L V W Sbjct: 1005 KDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVW 1064 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN DG+ S Q AA + +++ + A SS A+ Sbjct: 1065 LPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISAPSSIAAELAALASG 1124 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD V +++ +G L+ + +R L R+THI+REG Sbjct: 1125 LRFVIQRQFTRVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRNSLSTLEYRITHIYREG 1184 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N+ AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1185 NKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1228 >ref|XP_012844821.1| PREDICTED: uncharacterized protein LOC105964855 [Erythranthe guttatus] Length = 1237 Score = 165 bits (417), Expect = 1e-37 Identities = 97/284 (34%), Positives = 136/284 (47%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H H HI ++PCLI W+ W RND KH Sbjct: 949 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKH 1008 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II++V +H+ L + L W + + V P L V W Sbjct: 1009 KDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVW 1068 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN DG+ S Q AA + +++ + A SS A+ Sbjct: 1069 LPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISAPSSIAAELAALASG 1128 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD V +++ +G L+ + +R L R+THI+REG Sbjct: 1129 LRFVIQRQFTRVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRNSLSTLEYRITHIYREG 1188 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N+ AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1189 NKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1232 >ref|XP_012828505.1| PREDICTED: uncharacterized protein LOC105949732 [Erythranthe guttatus] Length = 1237 Score = 164 bits (415), Expect = 2e-37 Identities = 96/284 (33%), Positives = 136/284 (47%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H H HI ++PCLI W+ W RND KH Sbjct: 949 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKH 1008 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II++V +H+ L + L W + + V P L V W Sbjct: 1009 KDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVW 1068 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN DG+ S Q AA + +++ + A SS A+ Sbjct: 1069 LPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISAPSSIAAELAALASG 1128 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD + +++ +G L+ + +R L R+THI+REG Sbjct: 1129 LRFVIQRQFTRVWIELDAEVAIRLLSHMDQGHWSLQSSLTAIRNSLSTLEYRITHIYREG 1188 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N+ AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1189 NKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1232 >ref|XP_012844111.1| PREDICTED: uncharacterized protein LOC105964144 [Erythranthe guttatus] Length = 1237 Score = 163 bits (413), Expect = 3e-37 Identities = 97/284 (34%), Positives = 137/284 (48%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H A H HI ++PCLI W+ W RND KH Sbjct: 949 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHALTHHTHITTIVPCLILWYLWIARNDSKH 1008 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II +V +H+ L + L W + + V P L V W Sbjct: 1009 KDITVRASSIINRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVW 1068 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP W+KLN DG+ S Q AA + +++ + A SS A+ Sbjct: 1069 LPPDPGWMKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISAPSSIAAELAALASG 1128 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD V +++ +G L+ + +R L + R+THI+REG Sbjct: 1129 LRFVIQRQFTRVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRNSLSSLEYRITHIYREG 1188 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N+ AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1189 NKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1232 >ref|XP_012850055.1| PREDICTED: uncharacterized protein LOC105969825 [Erythranthe guttatus] Length = 1331 Score = 163 bits (412), Expect = 4e-37 Identities = 96/284 (33%), Positives = 136/284 (47%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H H HI ++PCLI W+ W RND KH Sbjct: 1043 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKH 1102 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II++V +H+ L + L W + + V P L V W Sbjct: 1103 KDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHMAESLGLYYRVGTPTLTPHRVVW 1162 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN +G+ S Q AA + +++ + A SS A+ Sbjct: 1163 LPPDPGWVKLNTNGARRASTQIAAIGGIIRGSDAEAIVAFHERISAPSSIAAELAALASG 1222 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD V +++ +G L+ + +R L R+THI+REG Sbjct: 1223 LRFVIQRQFTRVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRNSLSTLEYRITHIYREG 1282 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N+ AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1283 NKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1326 >ref|XP_012843177.1| PREDICTED: uncharacterized protein LOC105963331 [Erythranthe guttatus] Length = 1172 Score = 163 bits (412), Expect = 4e-37 Identities = 98/284 (34%), Positives = 135/284 (47%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRH--LIPRATHRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H L H HI ++PCLI WF W RNDR H Sbjct: 783 WHHFFYLFGYTPAHTTHIPQILLYWQHFTLHTLTHHTHITTIVPCLILWFLWIARNDRNH 842 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + + R+S II++V +H+ L + L W + + V P L V W Sbjct: 843 KDIMVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVW 902 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN DG+ S Q AA + +++ + SSS A+ Sbjct: 903 LPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDADAILAFHERISVSSSIAAELAALASG 962 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD V ++ +G L+ + +R L R+ HI+REG Sbjct: 963 LRFVIQRQFTRVWIELDAEVTVRLLLHTDKGHWSLQSFLTAIRNSLSTLEYRIIHIYREG 1022 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1023 NTVADVLANLGCQTELALTFTTAEIPRPIQQMIRMDQLGYPSFR 1066 >ref|XP_012855480.1| PREDICTED: uncharacterized protein LOC105974867 [Erythranthe guttatus] Length = 1393 Score = 162 bits (411), Expect = 6e-37 Identities = 97/284 (34%), Positives = 135/284 (47%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H H HI ++PCLI W+ W RND KH Sbjct: 1105 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKH 1164 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II++V +H+ L + L W + + V P L V W Sbjct: 1165 KDITVRASSIIYRVIQHIRILHQTKLLSADSWTGIPHVAESLGLYYRVRTPTLTPHRVVW 1224 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN DG+ S Q AA + +++ + A SS A+ Sbjct: 1225 LPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAILAFHERISAPSSIAAELAALASG 1284 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD V +++ +G L+ + +R L R+THI+REG Sbjct: 1285 LRFVIQRQFTRVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRNSLSTLEYRITHIYREG 1344 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1345 NTVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1388 >ref|XP_012853187.1| PREDICTED: uncharacterized protein LOC105972756 [Erythranthe guttatus] Length = 1285 Score = 162 bits (409), Expect = 1e-36 Identities = 97/284 (34%), Positives = 135/284 (47%), Gaps = 4/284 (1%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRAT--HRHICFLIPCLIFWFTWTERNDRKH 674 W HF F TP H T I L +W+H H HI ++PCLI W+ W RND KH Sbjct: 997 WHHFFYLFGYTPAHTTHIPQILLYWQHFTSHTLTHHTHITTIVPCLILWYLWIARNDSKH 1056 Query: 673 RGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LRSRAVTW 497 + R+S II++V +H+ L + L W + + V P L V W Sbjct: 1057 KDITVRASSIIYRVIQHIRILHQTKLLSADSWTGIPHVAESLGLYYRVRTPTLTPYRVVW 1116 Query: 496 QPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXX 317 PP WVKLN DG+ S Q AA + +++ + A SS A+ Sbjct: 1117 LPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAILAFHERISAPSSIAAELAALASG 1176 Query: 316 XXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREG 140 + Q + VWIELD V +++ +G L+ + +R L R+THI+REG Sbjct: 1177 LRFVIQRQFTRVWIELDAEVAVRLLSHTDQGHWSLQSSLTAIRNSLSTLEYRITHIYREG 1236 Query: 139 NRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 N AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1237 NTVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1280 >ref|XP_007017129.1| Uncharacterized protein TCM_042328 [Theobroma cacao] gi|508787492|gb|EOY34748.1| Uncharacterized protein TCM_042328 [Theobroma cacao] Length = 910 Score = 117 bits (294), Expect = 2e-23 Identities = 81/283 (28%), Positives = 124/283 (43%), Gaps = 1/283 (0%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRATHRHICFLIPCLIFWFTWTERNDRKHRG 668 W +F+ F I + I+ + W H HI L+P I WF W ERND KHR Sbjct: 631 WNYFAKLFQICIINPCTINQIIGAWFHSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRN 690 Query: 667 RLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RPLRSRAVTWQPP 488 + ++++V + + QL L +LL QW ++ + +W P Sbjct: 691 LGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIILQAESLAPPKVFSWHKP 750 Query: 487 SEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXXXXX 308 + KLN DGS S AA + + L +S +A+ Sbjct: 751 TTGEFKLNVDGSAKHSHN-AAGGGILRDHAGVMVFGFSENLGIQNSLQAELLALYRGLIL 809 Query: 307 LPQ-TASHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREGNRS 131 +WIE+D +V+ ++ RG +R + LR +L + R +HI REGN++ Sbjct: 810 CRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQA 869 Query: 130 ADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFRFR 2 ADFLA GHE L +F V A L ++R+DQ +P RF+ Sbjct: 870 ADFLANRGHEHQNLQVFTV--AQGKLRGMLRLDQTSFPYVRFK 910 >ref|XP_012850129.1| PREDICTED: uncharacterized protein LOC105969901 [Erythranthe guttatus] Length = 1153 Score = 115 bits (289), Expect = 8e-23 Identities = 74/230 (32%), Positives = 109/230 (47%), Gaps = 2/230 (0%) Frame = -2 Query: 691 RNDRKHRGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RP-LR 515 +ND KH+ R+S II++V +H+ L + L W + + V P L Sbjct: 919 KNDSKHKDITVRASSIIYRVIQHIRILHQTNLLSADSWTGIPHVAESLGLYYRVRTPTLT 978 Query: 514 SRAVTWQPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKX 335 V W PP WVKLN DG+ S Q AA + +++ + A SS A+ Sbjct: 979 PHRVVWLPPDPGWVKLNTDGARRASTQIAAIGGIIRGSDAEAIVAFHERISAPSSIAAEL 1038 Query: 334 XXXXXXXXXLPQTA-SHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVT 158 + Q + VWIELD V +++ + L+ + +R L R+T Sbjct: 1039 AALASGLRFVIQRQFTRVWIELDAEVAVRLLSHTDQDHWSLQSSLTAIRNSLSTLEYRIT 1098 Query: 157 HIHREGNRSADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFR 8 HI+REGN+ AD LA LG + + F PRP+ ++RMDQLGYP+FR Sbjct: 1099 HIYREGNKVADALANLGCQTELARTFTTAELPRPIQQMIRMDQLGYPSFR 1148 >ref|XP_007010390.1| Retrotransposon, unclassified-like protein [Theobroma cacao] gi|508727303|gb|EOY19200.1| Retrotransposon, unclassified-like protein [Theobroma cacao] Length = 1368 Score = 115 bits (288), Expect = 1e-22 Identities = 80/258 (31%), Positives = 120/258 (46%), Gaps = 2/258 (0%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRATHRHICFLIPCLIFWFTWTERNDRKHRG 668 W +FS +F I + I L+ W + HI LI IFWF W ERND KHR Sbjct: 1056 WNYFSKFFQIYVHNPQNILQILNSWYYSGDFTKPGHIRTLILLFIFWFVWVERNDAKHRD 1115 Query: 667 RLFRSSHIIFQVTRHLHQLVLSGKLLPPQW-ADCFL*VHFMSVVEAV*RPLRSRAVTWQP 491 II+++ + L +L G L QW D + +H+ A R R + + W Sbjct: 1116 LGMYPDRIIWRIMKILRKLFQGGLLCKWQWKGDLDIAIHW-GFNFAQERQARPKIINWIK 1174 Query: 490 PSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXXXX 311 P +KLN DGS Q AA + + +S +A+ Sbjct: 1175 PLIGELKLNVDGSSKDEFQNAAGGGVLRDHTGNLIFGFSENFGYQNSLQAELLALHRGLC 1234 Query: 310 XLPQ-TASHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREGNR 134 + S VWIE+D V+ +I + +GS +++ + +R L+ VR++HIHREGN+ Sbjct: 1235 LCMEYNVSRVWIEVDAQVVIQMIQNHHKGSYKIQYLLESIRKCLQVISVRISHIHREGNQ 1294 Query: 133 SADFLARLGHEVDGLHIF 80 +ADFL++ GH LH+F Sbjct: 1295 AADFLSKHGHTHQNLHVF 1312 >ref|XP_007008704.1| Uncharacterized protein TCM_042330 [Theobroma cacao] gi|508725617|gb|EOY17514.1| Uncharacterized protein TCM_042330 [Theobroma cacao] Length = 2249 Score = 115 bits (287), Expect = 1e-22 Identities = 85/284 (29%), Positives = 128/284 (45%), Gaps = 2/284 (0%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRATHRHICFLIPCLIFWFTWTERNDRKHRG 668 W +FS +F I + I+ L W + HI L+P WF W ERND KHR Sbjct: 1970 WNYFSKFFQILVINPCTINQILGAWFYSGDYCKPGHIRTLVPIFTLWFLWVERNDAKHRN 2029 Query: 667 RLFRSSHIIFQVTRHLHQLVLSGKLLPPQW-ADCFL*VHFMSVVEAV*RPLRSRAVTWQP 491 + I++++ + + QL L +LL QW D + + +A P + W Sbjct: 2030 LGMYPNRIVWRILKLIQQLSLGQQLLKWQWKGDKQIAQEWGITFQAESLP-PPKVFPWHK 2088 Query: 490 PSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXXXX 311 PS KLN DGS SQ AA + + L +S +A+ Sbjct: 2089 PSIGEFKLNVDGSAKLSQN-AAGGGVLRDHAGVMVFGFSENLGIQNSLQAELLALYRGLI 2147 Query: 310 XLPQ-TASHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREGNR 134 +WIE+D A+V+ ++ RG +R + +R +L + R++HI REGN+ Sbjct: 2148 LCRDYNIRRLWIEMDAASVIRLLQGNQRGPHAIRYLLVSIRQLLSHFSFRLSHIFREGNQ 2207 Query: 133 SADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFRFR 2 +ADFLA GHE L + V A L ++R+DQ P RF+ Sbjct: 2208 AADFLANRGHEHQSLQV--VTVAQGKLRGMLRLDQTSLPYVRFK 2249 >ref|XP_007017131.1| Uncharacterized protein TCM_033752 [Theobroma cacao] gi|508722459|gb|EOY14356.1| Uncharacterized protein TCM_033752 [Theobroma cacao] Length = 2251 Score = 113 bits (283), Expect = 4e-22 Identities = 80/283 (28%), Positives = 124/283 (43%), Gaps = 1/283 (0%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRATHRHICFLIPCLIFWFTWTERNDRKHRG 668 W +F+ F I + I+ + W + HI L+P I WF W ERND KHR Sbjct: 1972 WNYFAKLFQILIINPCTINQIIGAWFYSGDYCKPGHIRTLVPLFILWFLWVERNDAKHRN 2031 Query: 667 RLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RPLRSRAVTWQPP 488 + ++++V + + QL L +LL QW ++ + +W P Sbjct: 2032 LGMYPNRVVWRVLKLIQQLSLGQQLLKWQWKGDKQIAQEWGIIFQAESLAPPKVFSWHKP 2091 Query: 487 SEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXXXXX 308 S KLN DGS +S AA + + L +S +A+ Sbjct: 2092 SLGEFKLNVDGSAKQSHN-AAGGGILRDHAGEMVFGFSENLGTQNSLQAELLALYRGLIL 2150 Query: 307 LPQ-TASHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREGNRS 131 +WIE+D +V+ ++ RG +R + LR +L + R +HI REGN++ Sbjct: 2151 CRDYNIRRLWIEMDAISVIRLLQGNHRGPHAIRYLMVSLRQLLSHFSFRFSHIFREGNQA 2210 Query: 130 ADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFRFR 2 ADFLA GHE L +F V A L ++ +DQ +P RF+ Sbjct: 2211 ADFLANRGHEHQNLQVFTV--AQGKLRGMLCLDQTSFPYVRFK 2251 >ref|XP_007026458.1| Uncharacterized protein TCM_021522 [Theobroma cacao] gi|508715063|gb|EOY06960.1| Uncharacterized protein TCM_021522 [Theobroma cacao] Length = 3503 Score = 113 bits (283), Expect = 4e-22 Identities = 78/284 (27%), Positives = 131/284 (46%), Gaps = 2/284 (0%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRATHRHICFLIPCLIFWFTWTERNDRKHRG 668 W +F+ F I + I+H +S W + + HI L+P I WF W ERND KHR Sbjct: 3223 WSYFAKVFQIHIINPCTINHIISAWFYSGDYSKPGHIRTLVPLFILWFLWVERNDAKHRN 3282 Query: 667 RLFRSSHIIFQVTRHLHQLVLSGKLLPPQW-ADCFL*VHFMSVVEAV*RPLRSRAVTWQP 491 + I++++ + +HQL +L QW D + + +++AV P + + W Sbjct: 3283 LGMYPNRIVWKILKLIHQLFQGKQLQKWQWQGDKQIAQEWGIILKAV-APSPPKLLFWNK 3341 Query: 490 PSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAK-XXXXXXXX 314 PS KLN DGS + QTAA + + + S +A+ Sbjct: 3342 PSIGEFKLNVDGSSKYNLQTAAGGGLLRDHTGSMIFGFSENFGSQDSLQAELMALHRGLL 3401 Query: 313 XXLPQTASHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREGNR 134 + + +WIE+D V +I G +GS + R + + + L R++HI REGN+ Sbjct: 3402 LCIDHNVTRLWIEMDAKVAVQMINEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQ 3461 Query: 133 SADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFRFR 2 +AD L+ G+ L + + A L ++R+D++ RF+ Sbjct: 3462 AADHLSNQGYTHQNLQV--ISQAEGQLRGILRLDKINLAYVRFK 3503 Score = 94.4 bits (233), Expect = 2e-16 Identities = 75/254 (29%), Positives = 112/254 (44%), Gaps = 7/254 (2%) Frame = -2 Query: 847 WMHFSSWFLI---TPPHFTQISHALSFWRHLIPRATHRHICFLIPCLIFWFTWTERNDRK 677 W F+ F I P H +QI A W + HI LIP I WF W ERND K Sbjct: 1429 WNFFAKSFQIYVSKPKHISQIIWA---WFFSGDYTRNGHIRILIPLFICWFLWLERNDAK 1485 Query: 676 HRGRLFRSSHIIFQVTRHLHQLVLSGKLLPPQW---ADCFL*VHFMSVVEAV*RPLRSRA 506 HR + +I+++ + L+QL L QW D F + P + Sbjct: 1486 HRHMGMYPNRVIWRIMKLLNQLHAGSLLKQWQWKGDTDIATMWGFKYPPKYCQSP---QI 1542 Query: 505 VTWQPPSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXX 326 ++W P KLN DGS +S Q AA ++ L S +A+ Sbjct: 1543 ISWIKPFIGEYKLNVDGS-SKSSQNAAGGGVLRDHTGKLAFAFSENLGPLPSLQAELHAL 1601 Query: 325 XXXXXXLPQ-TASHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIH 149 + +++WIE+D V ++ +GS +R + +RL LR+ R++HI+ Sbjct: 1602 LRGLLLCKERNITNLWIEMDALVAVQMVQQSQKGSHDIRYLLESIRLCLRSFSYRISHIY 1661 Query: 148 REGNRSADFLARLG 107 REGN++ADFL+ G Sbjct: 1662 REGNQAADFLSNKG 1675 >ref|XP_007031312.1| Uncharacterized protein TCM_016762 [Theobroma cacao] gi|508710341|gb|EOY02238.1| Uncharacterized protein TCM_016762 [Theobroma cacao] Length = 2214 Score = 111 bits (277), Expect = 2e-21 Identities = 81/283 (28%), Positives = 126/283 (44%), Gaps = 2/283 (0%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRATHRHICFLIPCLIFWFTWTERNDRKHRG 668 W F+ +F I + +SH L W + HI L+P I WF W ERND K+R Sbjct: 1936 WAFFAKFFQIYVLNPKHVSHILWAWFYSGDYVKRGHIRTLLPIFICWFLWLERNDAKYRH 1995 Query: 667 RLFRSSHIIFQVTRHLHQLVLSGKLLPPQW-ADCFL*VHFMSVVEAV*RPLRSRAVTWQP 491 + I++++ + L QL L QW D + + + R + V W+ Sbjct: 1996 SGLNTDRIVWRIMKLLRQLKDGSLLQQWQWKGDTDIAAMWQYNFQLKLRA-PPQIVYWRK 2054 Query: 490 PSEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAKXXXXXXXXX 311 PS KLN DGS R Q AA + + + +S +A+ Sbjct: 2055 PSTGEYKLNVDGS-SRHGQHAASGGVLRDHTGKLIFGFSENIGTCNSLQAELRALLRGLL 2113 Query: 310 XLPQT-ASHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREGNR 134 + +WIE+D A + ++ +GS +R + +R L + R++HIHREGN+ Sbjct: 2114 LCKERHIEKLWIEMDALAAIQLLPHSQKGSHDIRYLLESIRKCLNSISYRISHIHREGNQ 2173 Query: 133 SADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFRF 5 ADFL+ GH LH+F A L ++++D+L P RF Sbjct: 2174 VADFLSNEGHNHQNLHVF--TEAQGKLHGMLKLDRLNLPYVRF 2214 >ref|XP_007020288.1| Uncharacterized protein TCM_036737 [Theobroma cacao] gi|508725616|gb|EOY17513.1| Uncharacterized protein TCM_036737 [Theobroma cacao] Length = 2215 Score = 107 bits (266), Expect = 4e-20 Identities = 72/283 (25%), Positives = 122/283 (43%), Gaps = 1/283 (0%) Frame = -2 Query: 847 WMHFSSWFLITPPHFTQISHALSFWRHLIPRATHRHICFLIPCLIFWFTWTERNDRKHRG 668 W +F+ F I + I+ + W + + HI L+P WF W ERND KHR Sbjct: 1935 WSYFAKVFQIQIINPCTINQIICAWFYSGDYSKPGHIRTLVPLFTLWFLWVERNDAKHRN 1994 Query: 667 RLFRSSHIIFQVTRHLHQLVLSGKLLPPQWADCFL*VHFMSVVEAV*RPLRSRAVTWQPP 488 + +++++ + LHQL +L QW ++ P + + W P Sbjct: 1995 LGMYPNRVVWKILKLLHQLFQGKQLQKWQWQGDKQIAQEWGIILKADAPSPPKLLFWLKP 2054 Query: 487 SEPWVKLN*DGSFDRSQQTAAXXXXXXXXXXXXLTSYYLPLQASSSFEAK-XXXXXXXXX 311 S +KLN DGS + Q+AA + + S +A+ Sbjct: 2055 SIGELKLNVDGSCKHNPQSAAGGGLLRDHTGSMIFGFSENFGPQDSLQAELMALHRGLLL 2114 Query: 310 XLPQTASHVWIELDVAAVVTVITSGARGSGQLREVFSRLRLILRNRHVRVTHIHREGNRS 131 + S +WIE+D V +I G +GS + R + + + L R++HI REGN++ Sbjct: 2115 CIEHNISRLWIEMDAKVAVQMIKEGHQGSSRTRYLLASIHRCLSGISFRISHIFREGNQA 2174 Query: 130 ADFLARLGHEVDGLHIFDVQTAPRPLSSLVRMDQLGYPNFRFR 2 AD L+ GH L + + A L ++R++++ RF+ Sbjct: 2175 ADHLSNQGHTHQNLQV--ISQAEGQLRGILRLEKINLAYVRFK 2215