BLASTX nr result
ID: Perilla23_contig00029047
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Perilla23_contig00029047 (732 letters) Database: ./nr 77,306,371 sequences; 28,104,191,420 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011095893.1| PREDICTED: uncharacterized protein LOC105175... 260 6e-67 ref|XP_012854864.1| PREDICTED: uncharacterized protein LOC105974... 199 2e-48 ref|XP_004250651.1| PREDICTED: uncharacterized protein LOC101253... 191 4e-46 ref|XP_009794078.1| PREDICTED: uncharacterized protein LOC104240... 191 6e-46 ref|XP_006339352.1| PREDICTED: uncharacterized protein LOC102602... 188 4e-45 ref|XP_010665313.1| PREDICTED: uncharacterized protein LOC100241... 185 2e-44 ref|XP_010312982.1| PREDICTED: uncharacterized protein LOC101253... 185 2e-44 ref|XP_009794077.1| PREDICTED: uncharacterized protein LOC104240... 184 7e-44 ref|XP_006376147.1| hypothetical protein POPTR_0013s10230g [Popu... 182 2e-43 ref|XP_006339351.1| PREDICTED: uncharacterized protein LOC102602... 182 2e-43 gb|EYU22750.1| hypothetical protein MIMGU_mgv1a013618mg [Erythra... 181 6e-43 ref|XP_009588799.1| PREDICTED: uncharacterized protein LOC104086... 180 1e-42 ref|XP_011005439.1| PREDICTED: uncharacterized protein LOC105111... 179 2e-42 ref|XP_011005440.1| PREDICTED: uncharacterized protein LOC105111... 176 1e-41 ref|XP_009588798.1| PREDICTED: uncharacterized protein LOC104086... 173 1e-40 ref|XP_007019798.1| Uncharacterized protein isoform 1 [Theobroma... 160 1e-36 gb|KJB81204.1| hypothetical protein B456_013G136900 [Gossypium r... 152 3e-34 gb|KJB81208.1| hypothetical protein B456_013G136900 [Gossypium r... 147 7e-33 ref|XP_006858608.1| PREDICTED: uncharacterized protein LOC184484... 145 2e-32 ref|XP_007019799.1| Uncharacterized protein isoform 2, partial [... 145 2e-32 >ref|XP_011095893.1| PREDICTED: uncharacterized protein LOC105175222 [Sesamum indicum] Length = 268 Score = 260 bits (665), Expect = 6e-67 Identities = 137/244 (56%), Positives = 166/244 (68%), Gaps = 1/244 (0%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEI 552 K F LL +RWKL IQRI WK + +SKKILVHVVK+GE LTSISKLYGVP+ +IAA N++I Sbjct: 23 KHFTLLAQRWKLHIQRIAWKDRDISKKILVHVVKDGENLTSISKLYGVPIHDIAAVNKDI 82 Query: 551 ADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQIFTMPSF 372 DVDLV EG+ L IPSA A AQ CHFEG K EH P +P F+ RQ NQI T+PS Sbjct: 83 VDVDLVSEGKHLNIPSASAGDAQGCHFEGDKFHEHQLPKATPCSEFNTRQWNQILTIPSS 142 Query: 371 RQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGV-HHHSSSARWKT 195 + +AK S VLVPL+AFCI CI+GA Q +A+N R QA K G+ +S RWKT Sbjct: 143 CRLPLAKRTGSVLVLVPLIAFCIRCIIGACQNRVARNLRDQAVNKSGMPRDRCNSVRWKT 202 Query: 194 ALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYGYW 15 LS+LR E+E++ FE++ H+Y KLE DYQKFLS+CGMSN+GYW Sbjct: 203 VLSELREPDALDAEPETDSDPFSEEQEEVHFEEIFHSYAKLEDDYQKFLSECGMSNWGYW 262 Query: 14 RGGS 3 RGGS Sbjct: 263 RGGS 266 >ref|XP_012854864.1| PREDICTED: uncharacterized protein LOC105974330 [Erythranthe guttatus] Length = 257 Score = 199 bits (505), Expect = 2e-48 Identities = 121/248 (48%), Positives = 156/248 (62%), Gaps = 12/248 (4%) Frame = -2 Query: 710 KRWKLDIQRITWKGQGM-SKKILVHVVKE--GETLTSISKLYGVPVLE-----IAASNEE 555 ++WKL+I+RI+ KGQGM +K+ HVVK+ GETLTSISKLYGV ++E AA+++E Sbjct: 27 QQWKLEIERISRKGQGMMNKESANHVVKDDRGETLTSISKLYGVSIIENCCLSAAANDKE 86 Query: 554 IADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQIFTMPS 375 +ADVDLV +GQ A+VC+ EG KL EHH S + +Q NQIF + S Sbjct: 87 MADVDLVSDGQN-------RNSARVCNLEGTKLHEHHQLSNTTVS----KQPNQIFYLLS 135 Query: 374 FRQFS-IAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHS---SSA 207 Q +AK SF VLVPL+AFCI CI+G F+ + +N RH+A+ K GV HH+ +S Sbjct: 136 SHQLPLVAKAGGSFLVLVPLMAFCISCIIGTFRNRVLRNPRHKASNKYGVQHHNKSHNSP 195 Query: 206 RWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSN 27 RWK L ++EQ+ FED SHA TKLE DYQKFLS+CGMSN Sbjct: 196 RWKNVLD--------AESEPDNSHPLYEDQEQVNFEDASHANTKLEDDYQKFLSECGMSN 247 Query: 26 YGYWRGGS 3 +GYWRGGS Sbjct: 248 WGYWRGGS 255 >ref|XP_004250651.1| PREDICTED: uncharacterized protein LOC101253651 isoform X2 [Solanum lycopersicum] Length = 267 Score = 191 bits (485), Expect = 4e-46 Identities = 117/248 (47%), Positives = 143/248 (57%), Gaps = 5/248 (2%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQ--GMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNE 558 K F LL KR K IQ + G SK+ LVHVVKE +TLTS+SKLYGVP+ EIAA+N+ Sbjct: 26 KHFTLLPKRCKNHIQELFSNGLQWNSSKQFLVHVVKEDDTLTSLSKLYGVPIFEIAAANK 85 Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRL---GFHIRQRNQIF 387 EI DVDLVFEGQ L IPS V +Q E L + S G I Q+ + Sbjct: 86 EIIDVDLVFEGQHLNIPSYVTSYSQTNQREKINLPKIEVSETSRHFKLCGSDINQK--ML 143 Query: 386 TMPSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSA 207 + S R AKT+ F VLVPL+ FCI CIM AF +A+N A H S S Sbjct: 144 YVLSCRHLPYAKTSGHFLVLVPLIGFCIRCIMNAFHHRVARNKLQDA------HQTSGSM 197 Query: 206 RWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSN 27 RWK+AL DL ++E L+ E++SHAY KL+GDYQKFLS+CGMS Sbjct: 198 RWKSALRDLTDPDALYSDSRPETDNVTDDREHLQSEELSHAYAKLDGDYQKFLSECGMSK 257 Query: 26 YGYWRGGS 3 +GYWRGG+ Sbjct: 258 WGYWRGGT 265 >ref|XP_009794078.1| PREDICTED: uncharacterized protein LOC104240878 isoform X2 [Nicotiana sylvestris] Length = 268 Score = 191 bits (484), Expect = 6e-46 Identities = 118/247 (47%), Positives = 144/247 (58%), Gaps = 4/247 (1%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGM--SKKILVHVVKEGETLTSISKLYGVPVLEIAASNE 558 K F++L KR K IQ GQ + SK+ LVHVVKE ETLTSISKLY VP+ EIAA+N+ Sbjct: 26 KHFSILPKRCKYQIQEFFSNGQHLNNSKQFLVHVVKEDETLTSISKLYRVPIYEIAAANK 85 Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFTM 381 EI DVDLVFEGQ L IPS + C+Q C + +L + H + RL + NQ I ++ Sbjct: 86 EIIDVDLVFEGQLLNIPSYITACSQTCQRKMIRLPKIHVSETNRRLKLCGKDFNQKILSV 145 Query: 380 PSFRQFSIAKTACSFPVLVPLVAFCIVCI-MGAFQIILAKNSRHQAAKKLGVHHHSSSAR 204 S R AKT F VLVPL+AFCI CI M AF +A+N K V S S R Sbjct: 146 LSCRHLPYAKTTGYFLVLVPLIAFCIRCIMMNAFHHRVARN------KLQDVRQASGSMR 199 Query: 203 WKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNY 24 WK AL DL +++ ED S AY KL+ DYQKFL++CGMS + Sbjct: 200 WKLALRDLSDPDASYTDSRPEIENVTDDQDNFHSEDHSRAYAKLDHDYQKFLAECGMSKW 259 Query: 23 GYWRGGS 3 GYWRGGS Sbjct: 260 GYWRGGS 266 >ref|XP_006339352.1| PREDICTED: uncharacterized protein LOC102602767 isoform X2 [Solanum tuberosum] Length = 267 Score = 188 bits (477), Expect = 4e-45 Identities = 117/246 (47%), Positives = 142/246 (57%), Gaps = 3/246 (1%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKI--LVHVVKEGETLTSISKLYGVPVLEIAASNE 558 K F+LL KR K IQ + GQ S I LVHVVKE ETLTS+SKLYGVP+ EIAA+N+ Sbjct: 26 KHFSLLPKRCKYHIQELFSNGQQWSSSIQFLVHVVKEDETLTSLSKLYGVPIYEIAAANK 85 Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFTM 381 EI DV+LVFEGQ L IPS V +Q E +L + S R NQ + + Sbjct: 86 EIIDVNLVFEGQHLNIPSYVTPYSQTNQREKIRLPKIDVSETSQRFKLCGNDINQKMLYV 145 Query: 380 PSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSARW 201 S R AKT+ F VLVPL+ FCI CIM AF +A+N K V S S RW Sbjct: 146 LSCRHLPYAKTSGYFLVLVPLIGFCIRCIMNAFHHRVARN------KLQDVRQASGSMRW 199 Query: 200 KTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYG 21 K AL DL ++E L+ E++S AY KL+GDYQKFLS+CGMS +G Sbjct: 200 KLALRDLSDPDALYSDSRPEIENVTDDREHLQSEELSRAYAKLDGDYQKFLSECGMSKWG 259 Query: 20 YWRGGS 3 YWRGG+ Sbjct: 260 YWRGGT 265 >ref|XP_010665313.1| PREDICTED: uncharacterized protein LOC100241456 isoform X1 [Vitis vinifera] Length = 279 Score = 185 bits (470), Expect = 2e-44 Identities = 111/252 (44%), Positives = 147/252 (58%), Gaps = 9/252 (3%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEI 552 K F +L ++W+ IQ I+ KGQ +K VH+VKEGETL+SISK YGV + IAA+N+ I Sbjct: 32 KHFRMLTEKWRFQIQEIS-KGQHSTKHNSVHMVKEGETLSSISKQYGVSIYSIAAANKNI 90 Query: 551 ADVDLVFEGQQLKIPSAV--------AQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRN 396 D+DLVF GQ L IPS+ + +++ F+ K +H LG + Q+ Sbjct: 91 EDIDLVFCGQHLNIPSSAVGETQKFQTEKSKLSSFDTLKRHQHSLEV----LGGRLNQKL 146 Query: 395 QIFTMPSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHH- 219 + SF S AK F VLVPL+AFCI CI+GAFQ + + RHQA + V +H Sbjct: 147 CTVAL-SFHSLSHAKATGYFLVLVPLIAFCIRCIIGAFQNRVVGDLRHQAVNESEVDYHG 205 Query: 218 SSSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQC 39 S S RWK+AL D+R + Q E+VSHAY KLE DYQ+FLS+C Sbjct: 206 SKSVRWKSALDDIREPDTLDTGLQPDSINPSEVQTQGSAEEVSHAYGKLEHDYQQFLSEC 265 Query: 38 GMSNYGYWRGGS 3 G+S +GYWRGGS Sbjct: 266 GISKWGYWRGGS 277 >ref|XP_010312982.1| PREDICTED: uncharacterized protein LOC101253651 isoform X1 [Solanum lycopersicum] Length = 268 Score = 185 bits (470), Expect = 2e-44 Identities = 116/249 (46%), Positives = 143/249 (57%), Gaps = 6/249 (2%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQ--GMSKKILVHVVKE-GETLTSISKLYGVPVLEIAASN 561 K F LL KR K IQ + G SK+ LVHVVK+ +TLTS+SKLYGVP+ EIAA+N Sbjct: 26 KHFTLLPKRCKNHIQELFSNGLQWNSSKQFLVHVVKDRDDTLTSLSKLYGVPIFEIAAAN 85 Query: 560 EEIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRL---GFHIRQRNQI 390 +EI DVDLVFEGQ L IPS V +Q E L + S G I Q+ + Sbjct: 86 KEIIDVDLVFEGQHLNIPSYVTSYSQTNQREKINLPKIEVSETSRHFKLCGSDINQK--M 143 Query: 389 FTMPSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSS 210 + S R AKT+ F VLVPL+ FCI CIM AF +A+N A H S S Sbjct: 144 LYVLSCRHLPYAKTSGHFLVLVPLIGFCIRCIMNAFHHRVARNKLQDA------HQTSGS 197 Query: 209 ARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMS 30 RWK+AL DL ++E L+ E++SHAY KL+GDYQKFLS+CGMS Sbjct: 198 MRWKSALRDLTDPDALYSDSRPETDNVTDDREHLQSEELSHAYAKLDGDYQKFLSECGMS 257 Query: 29 NYGYWRGGS 3 +GYWRGG+ Sbjct: 258 KWGYWRGGT 266 >ref|XP_009794077.1| PREDICTED: uncharacterized protein LOC104240878 isoform X1 [Nicotiana sylvestris] Length = 275 Score = 184 bits (466), Expect = 7e-44 Identities = 118/254 (46%), Positives = 145/254 (57%), Gaps = 11/254 (4%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGM--SKKILVHVVKEGETLTSISKLYGVPVLEIAASNE 558 K F++L KR K IQ GQ + SK+ LVHVVKE ETLTSISKLY VP+ EIAA+N+ Sbjct: 26 KHFSILPKRCKYQIQEFFSNGQHLNNSKQFLVHVVKEDETLTSISKLYRVPIYEIAAANK 85 Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFTM 381 EI DVDLVFEGQ L IPS + C+Q C + +L + H + RL + NQ I ++ Sbjct: 86 EIIDVDLVFEGQLLNIPSYITACSQTCQRKMIRLPKIHVSETNRRLKLCGKDFNQKILSV 145 Query: 380 PSFRQ-------FSIAKTACSFPVLVPLVAFCIVCI-MGAFQIILAKNSRHQAAKKLGVH 225 S R + AKT F VLVPL+AFCI CI M AF +A+N K V Sbjct: 146 LSCRHLPYTCQCYYQAKTTGYFLVLVPLIAFCIRCIMMNAFHHRVARN------KLQDVR 199 Query: 224 HHSSSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLS 45 S S RWK AL DL +++ ED S AY KL+ DYQKFL+ Sbjct: 200 QASGSMRWKLALRDLSDPDASYTDSRPEIENVTDDQDNFHSEDHSRAYAKLDHDYQKFLA 259 Query: 44 QCGMSNYGYWRGGS 3 +CGMS +GYWRGGS Sbjct: 260 ECGMSKWGYWRGGS 273 >ref|XP_006376147.1| hypothetical protein POPTR_0013s10230g [Populus trichocarpa] gi|550325417|gb|ERP53944.1| hypothetical protein POPTR_0013s10230g [Populus trichocarpa] Length = 286 Score = 182 bits (462), Expect = 2e-43 Identities = 112/251 (44%), Positives = 145/251 (57%), Gaps = 8/251 (3%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEI 552 K F +L +RW+ IQ I+ KGQ + L+HVVKEGETLTSISK YGV + +AA+N+ I Sbjct: 40 KHFTVLAERWRFHIQDIS-KGQSSTNPYLLHVVKEGETLTSISKQYGVSIYSVAAANKNI 98 Query: 551 ADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGS--PRLGFHIRQRNQIFTMP 378 DVDLVFEGQ L IP+A QV Y++++ PS RL ++ + + Sbjct: 99 LDVDLVFEGQLLNIPAAAPAGTQV-----YQIKKCESPSFDQLERLQNFMKIMDGVLNQK 153 Query: 377 SF-----RQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHH-S 216 F + AK F VLVP +AFCI CI+GAF +N QA+ + HH Sbjct: 154 PFITVTTLRLPHAKATGYFLVLVPALAFCIRCIIGAFHTRARRNLGCQASNESRRHHDVP 213 Query: 215 SSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCG 36 S RWK ALSD+R +++Q FE+VSHAY KLE +YQKFLS+CG Sbjct: 214 ESKRWKHALSDIREPDNLDGEPILNSTGTSADQDQNSFEEVSHAYDKLEHEYQKFLSECG 273 Query: 35 MSNYGYWRGGS 3 +SN GYWRGGS Sbjct: 274 ISNSGYWRGGS 284 >ref|XP_006339351.1| PREDICTED: uncharacterized protein LOC102602767 isoform X1 [Solanum tuberosum] Length = 268 Score = 182 bits (462), Expect = 2e-43 Identities = 116/247 (46%), Positives = 142/247 (57%), Gaps = 4/247 (1%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKI--LVHVVKE-GETLTSISKLYGVPVLEIAASN 561 K F+LL KR K IQ + GQ S I LVHVVK+ ETLTS+SKLYGVP+ EIAA+N Sbjct: 26 KHFSLLPKRCKYHIQELFSNGQQWSSSIQFLVHVVKDRDETLTSLSKLYGVPIYEIAAAN 85 Query: 560 EEIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFT 384 +EI DV+LVFEGQ L IPS V +Q E +L + S R NQ + Sbjct: 86 KEIIDVNLVFEGQHLNIPSYVTPYSQTNQREKIRLPKIDVSETSQRFKLCGNDINQKMLY 145 Query: 383 MPSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSAR 204 + S R AKT+ F VLVPL+ FCI CIM AF +A+N K V S S R Sbjct: 146 VLSCRHLPYAKTSGYFLVLVPLIGFCIRCIMNAFHHRVARN------KLQDVRQASGSMR 199 Query: 203 WKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNY 24 WK AL DL ++E L+ E++S AY KL+GDYQKFLS+CGMS + Sbjct: 200 WKLALRDLSDPDALYSDSRPEIENVTDDREHLQSEELSRAYAKLDGDYQKFLSECGMSKW 259 Query: 23 GYWRGGS 3 GYWRGG+ Sbjct: 260 GYWRGGT 266 >gb|EYU22750.1| hypothetical protein MIMGU_mgv1a013618mg [Erythranthe guttata] Length = 215 Score = 181 bits (458), Expect = 6e-43 Identities = 111/231 (48%), Positives = 141/231 (61%), Gaps = 11/231 (4%) Frame = -2 Query: 662 MSKKILVHVVKE--GETLTSISKLYGVPVLE-----IAASNEEIADVDLVFEGQQLKIPS 504 M+K+ HVVK+ GETLTSISKLYGV ++E AA+++E+ADVDLV +GQ Sbjct: 2 MNKESANHVVKDDRGETLTSISKLYGVSIIENCCLSAAANDKEMADVDLVSDGQN----- 56 Query: 503 AVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQIFTMPSFRQFS-IAKTACSFPVL 327 A+VC+ EG KL EHH S + +Q NQIF + S Q +AK SF VL Sbjct: 57 --RNSARVCNLEGTKLHEHHQLSNTTVS----KQPNQIFYLLSSHQLPLVAKAGGSFLVL 110 Query: 326 VPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHS---SSARWKTALSDLRXXXXXXX 156 VPL+AFCI CI+G F+ + +N RH+A+ K GV HH+ +S RWK L Sbjct: 111 VPLMAFCISCIIGTFRNRVLRNPRHKASNKYGVQHHNKSHNSPRWKNVLD--------AE 162 Query: 155 XXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYGYWRGGS 3 ++EQ+ FED SHA TKLE DYQKFLS+CGMSN+GYWRGGS Sbjct: 163 SEPDNSHPLYEDQEQVNFEDASHANTKLEDDYQKFLSECGMSNWGYWRGGS 213 >ref|XP_009588799.1| PREDICTED: uncharacterized protein LOC104086270 isoform X2 [Nicotiana tomentosiformis] Length = 267 Score = 180 bits (456), Expect = 1e-42 Identities = 110/246 (44%), Positives = 140/246 (56%), Gaps = 3/246 (1%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGM--SKKILVHVVKEGETLTSISKLYGVPVLEIAASNE 558 K F +L KR K IQ Q + S++ LVHVVKE ETLTSISKLYGVP+ EIAA+N+ Sbjct: 26 KHFIILPKRCKYQIQEFFSNDQHLNNSRQFLVHVVKEDETLTSISKLYGVPIYEIAAANK 85 Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFTM 381 +I DVDLVFEGQ L +PS + C+Q C + +L + + RL + NQ I ++ Sbjct: 86 QIIDVDLVFEGQLLNVPSYITTCSQTCQRKMIRLPKIDVSETNRRLKLCGKDFNQKILSV 145 Query: 380 PSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSARW 201 S R AKT F VLV L+AF I CIM AF + +N K V S S RW Sbjct: 146 LSCRHLPYAKTTGYFLVLVSLIAFGIRCIMNAFHRRVGRN------KLQDVRQASGSMRW 199 Query: 200 KTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYG 21 K AL DL +++ ED+S AY K++ DYQKFL++CGMS +G Sbjct: 200 KLALRDLSDPDASYTDSRTEIDNVTDDQDNFHSEDLSRAYAKVDHDYQKFLAECGMSKWG 259 Query: 20 YWRGGS 3 YWRGGS Sbjct: 260 YWRGGS 265 >ref|XP_011005439.1| PREDICTED: uncharacterized protein LOC105111694 isoform X1 [Populus euphratica] Length = 292 Score = 179 bits (454), Expect = 2e-42 Identities = 110/252 (43%), Positives = 145/252 (57%), Gaps = 9/252 (3%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEI 552 K F +L +RW+ IQ I+ KGQ + L+HVVKEGETLTSISK YGV + +AA+N+ I Sbjct: 40 KHFTVLAERWRFHIQDIS-KGQSSTNPYLLHVVKEGETLTSISKQYGVSIYSVAAANKNI 98 Query: 551 ADVDLVFEGQQLKIPSAVAQCAQVC-HFEGYKLREHHFPSGS--PRLGFHIRQRNQIFTM 381 DVDLVFEGQ L IP++ +VC Y++++ PS RL ++ + + Sbjct: 99 LDVDLVFEGQLLNIPASAPADTKVCLCLNQYQVKKCESPSFDQLERLQNFMKIMDGVLNQ 158 Query: 380 PSFRQFSI-----AKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHH- 219 F + AK F VLVP +AFCI CI+GAF +N QA+ + H Sbjct: 159 KPFITVTTLHLPHAKATGYFLVLVPALAFCIRCIIGAFHTRARRNLGCQASNESRRHQDV 218 Query: 218 SSSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQC 39 S RWK ALSD+R +++Q FE+VSHAY KLE +YQKFLS+C Sbjct: 219 PESKRWKHALSDIREPDNLDGEPILNSTGTSADQDQNSFEEVSHAYDKLEHEYQKFLSEC 278 Query: 38 GMSNYGYWRGGS 3 G+SN GYWRGGS Sbjct: 279 GISNSGYWRGGS 290 >ref|XP_011005440.1| PREDICTED: uncharacterized protein LOC105111694 isoform X2 [Populus euphratica] Length = 286 Score = 176 bits (446), Expect = 1e-41 Identities = 109/251 (43%), Positives = 144/251 (57%), Gaps = 8/251 (3%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEI 552 K F +L +RW+ IQ I+ KGQ + L+HVVKEGETLTSISK YGV + +AA+N+ I Sbjct: 40 KHFTVLAERWRFHIQDIS-KGQSSTNPYLLHVVKEGETLTSISKQYGVSIYSVAAANKNI 98 Query: 551 ADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGS--PRLGFHIRQRNQIFTMP 378 DVDLVFEGQ L IP++ +V Y++++ PS RL ++ + + Sbjct: 99 LDVDLVFEGQLLNIPASAPADTKV-----YQVKKCESPSFDQLERLQNFMKIMDGVLNQK 153 Query: 377 SFRQFSI-----AKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHH-S 216 F + AK F VLVP +AFCI CI+GAF +N QA+ + H Sbjct: 154 PFITVTTLHLPHAKATGYFLVLVPALAFCIRCIIGAFHTRARRNLGCQASNESRRHQDVP 213 Query: 215 SSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCG 36 S RWK ALSD+R +++Q FE+VSHAY KLE +YQKFLS+CG Sbjct: 214 ESKRWKHALSDIREPDNLDGEPILNSTGTSADQDQNSFEEVSHAYDKLEHEYQKFLSECG 273 Query: 35 MSNYGYWRGGS 3 +SN GYWRGGS Sbjct: 274 ISNSGYWRGGS 284 >ref|XP_009588798.1| PREDICTED: uncharacterized protein LOC104086270 isoform X1 [Nicotiana tomentosiformis] Length = 274 Score = 173 bits (438), Expect = 1e-40 Identities = 110/253 (43%), Positives = 141/253 (55%), Gaps = 10/253 (3%) Frame = -2 Query: 731 KPFALLVKRWKLDIQRITWKGQGM--SKKILVHVVKEGETLTSISKLYGVPVLEIAASNE 558 K F +L KR K IQ Q + S++ LVHVVKE ETLTSISKLYGVP+ EIAA+N+ Sbjct: 26 KHFIILPKRCKYQIQEFFSNDQHLNNSRQFLVHVVKEDETLTSISKLYGVPIYEIAAANK 85 Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFTM 381 +I DVDLVFEGQ L +PS + C+Q C + +L + + RL + NQ I ++ Sbjct: 86 QIIDVDLVFEGQLLNVPSYITTCSQTCQRKMIRLPKIDVSETNRRLKLCGKDFNQKILSV 145 Query: 380 PSFRQ-------FSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHH 222 S R + AKT F VLV L+AF I CIM AF + +N K V Sbjct: 146 LSCRHLPYTCQCYYQAKTTGYFLVLVSLIAFGIRCIMNAFHRRVGRN------KLQDVRQ 199 Query: 221 HSSSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQ 42 S S RWK AL DL +++ ED+S AY K++ DYQKFL++ Sbjct: 200 ASGSMRWKLALRDLSDPDASYTDSRTEIDNVTDDQDNFHSEDLSRAYAKVDHDYQKFLAE 259 Query: 41 CGMSNYGYWRGGS 3 CGMS +GYWRGGS Sbjct: 260 CGMSKWGYWRGGS 272 >ref|XP_007019798.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508725126|gb|EOY17023.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 273 Score = 160 bits (404), Expect = 1e-36 Identities = 101/250 (40%), Positives = 142/250 (56%), Gaps = 9/250 (3%) Frame = -2 Query: 725 FALLVKRWKLDIQRITWKGQGMSKK-ILVHVVKEGETLTSISKLYGVPVLEIAASNEEIA 549 F L+K+W+L Q SK I H+VKEGETL+SISK YGV V IAA+N++I Sbjct: 45 FQGLIKKWRL---------QNNSKDYICAHLVKEGETLSSISKKYGVSVYSIAAANKDIV 95 Query: 548 DVDLVFEGQQLKIPSA------VAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQIF 387 D+ LVF+GQ L IP++ +A+ +++ H +R PS I+ Sbjct: 96 DIHLVFKGQLLNIPASSLKETLLAKKSRLWH----SIRAFRTPS-----------HKIIY 140 Query: 386 TMPSFRQFS-IAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHH-HSS 213 +M + S AK F VLVPL+AFCI CI+ F+I +A++ RHQA K HH + Sbjct: 141 SMVTSHGLSNQAKATGYFLVLVPLIAFCIRCIISTFRIRVARDMRHQAVDKSKGHHPGAK 200 Query: 212 SARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGM 33 S RWK+ALSD ++ + +++ SHAY++L+ DY+KFLS+CGM Sbjct: 201 SMRWKSALSDTEESDAFDSESGLDSNSPSEDEAYISYDEASHAYSRLQHDYEKFLSECGM 260 Query: 32 SNYGYWRGGS 3 S +GYWRGGS Sbjct: 261 SKWGYWRGGS 270 >gb|KJB81204.1| hypothetical protein B456_013G136900 [Gossypium raimondii] Length = 273 Score = 152 bits (383), Expect = 3e-34 Identities = 97/246 (39%), Positives = 133/246 (54%), Gaps = 5/246 (2%) Frame = -2 Query: 725 FALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEIAD 546 F LVK+W+L + + HVVKEGETL+SISK+YGV V IAA+N+ I D Sbjct: 44 FQGLVKKWRLQNKTKDYS--------CAHVVKEGETLSSISKMYGVSVHSIAAANKNIVD 95 Query: 545 VDLVFEGQQLKIPSAVAQCAQVCHFEGYKL----REHHFPSGSPRLGFHIRQRNQIFTMP 378 ++LVF GQ L IPS+ Q+ + +L R PSG + FTM Sbjct: 96 INLVFRGQLLNIPSSSLLDTQLDRAKKSRLWQSIRALKAPSG-----------QKFFTMI 144 Query: 377 SFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSA-RW 201 + S AK+ F VLVPL+AFCI CI+ ++++ +HQAA + HH + RW Sbjct: 145 TAHCLSNAKSTGYFLVLVPLIAFCIGCIIVTLHTRVSRSIKHQAADESQAHHPGAKGRRW 204 Query: 200 KTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYG 21 K+ALSD ++ ++ E+ S Y +LE DYQKFLS+CG+S +G Sbjct: 205 KSALSDSVEGDVFDSELGLDSNSTSEDEANIQNEEASKDYGRLEHDYQKFLSECGISKWG 264 Query: 20 YWRGGS 3 YWRGGS Sbjct: 265 YWRGGS 270 >gb|KJB81208.1| hypothetical protein B456_013G136900 [Gossypium raimondii] Length = 274 Score = 147 bits (371), Expect = 7e-33 Identities = 97/247 (39%), Positives = 133/247 (53%), Gaps = 6/247 (2%) Frame = -2 Query: 725 FALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEIAD 546 F LVK+W+L + + HVVKEGETL+SISK+YGV V IAA+N+ I D Sbjct: 44 FQGLVKKWRLQNKTKDYS--------CAHVVKEGETLSSISKMYGVSVHSIAAANKNIVD 95 Query: 545 VDLVFEGQQLKIPSAVAQCAQVCHFEGYKL----REHHFPSGSPRLGFHIRQRNQIFTMP 378 ++LVF GQ L IPS+ Q+ + +L R PSG + FTM Sbjct: 96 INLVFRGQLLNIPSSSLLDTQLDRAKKSRLWQSIRALKAPSG-----------QKFFTMI 144 Query: 377 SFRQFS-IAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSA-R 204 + S AK+ F VLVPL+AFCI CI+ ++++ +HQAA + HH + R Sbjct: 145 TAHCLSNQAKSTGYFLVLVPLIAFCIGCIIVTLHTRVSRSIKHQAADESQAHHPGAKGRR 204 Query: 203 WKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNY 24 WK+ALSD ++ ++ E+ S Y +LE DYQKFLS+CG+S + Sbjct: 205 WKSALSDSVEGDVFDSELGLDSNSTSEDEANIQNEEASKDYGRLEHDYQKFLSECGISKW 264 Query: 23 GYWRGGS 3 GYWRGGS Sbjct: 265 GYWRGGS 271 >ref|XP_006858608.1| PREDICTED: uncharacterized protein LOC18448485 isoform X1 [Amborella trichopoda] gi|548862717|gb|ERN20075.1| hypothetical protein AMTR_s00071p00202040 [Amborella trichopoda] Length = 281 Score = 145 bits (367), Expect = 2e-32 Identities = 91/231 (39%), Positives = 120/231 (51%), Gaps = 9/231 (3%) Frame = -2 Query: 668 QGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEIADVDLVFEGQQLKIPSAVAQC 489 Q ++KK+LVHVVKEGETLTSIS+ Y V + IAA+N +I +VD V EG+ L +P + Sbjct: 54 QDIAKKLLVHVVKEGETLTSISRKYRVSIELIAAANTDITNVDFVLEGRSLNVPIVSKE- 112 Query: 488 AQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQIFTMPSF--------RQFSIAKTACSFP 333 +G RE+H G + F N + ++ +AK F Sbjct: 113 -----IQGVSPRENHAIQGDAKEIFQYSHVNTLVAQANYNLSRMLSPHYLQLAKGTGYFL 167 Query: 332 VLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGV-HHHSSSARWKTALSDLRXXXXXXX 156 ++ LVAFC I F A +HQA L V H S S RWK ALS++R Sbjct: 168 LVATLVAFCFRYIFSEFHHRFANKLKHQAQNDLKVPHDGSGSMRWKFALSEIREMGIVDA 227 Query: 155 XXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYGYWRGGS 3 ++E E+V+ AYTKLE YQKFLS+CGMS +GYWRGGS Sbjct: 228 ESRENPDGDSQDQELDSLEEVAEAYTKLEPAYQKFLSECGMSKWGYWRGGS 278 >ref|XP_007019799.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] gi|508725127|gb|EOY17024.1| Uncharacterized protein isoform 2, partial [Theobroma cacao] Length = 220 Score = 145 bits (367), Expect = 2e-32 Identities = 88/216 (40%), Positives = 125/216 (57%), Gaps = 8/216 (3%) Frame = -2 Query: 626 GETLTSISKLYGVPVLEIAASNEEIADVDLVFEGQQLKIPSA------VAQCAQVCHFEG 465 GETL+SISK YGV V IAA+N++I D+ LVF+GQ L IP++ +A+ +++ H Sbjct: 17 GETLSSISKKYGVSVYSIAAANKDIVDIHLVFKGQLLNIPASSLKETLLAKKSRLWH--- 73 Query: 464 YKLREHHFPSGSPRLGFHIRQRNQIFTMPSFRQFSI-AKTACSFPVLVPLVAFCIVCIMG 288 +R PS I++M + S AK F VLVPL+AFCI CI+ Sbjct: 74 -SIRAFRTPS-----------HKIIYSMVTSHGLSNQAKATGYFLVLVPLIAFCIRCIIS 121 Query: 287 AFQIILAKNSRHQAAKKLGVHHHSS-SARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQ 111 F+I +A++ RHQA K HH + S RWK+ALSD ++ Sbjct: 122 TFRIRVARDMRHQAVDKSKGHHPGAKSMRWKSALSDTEESDAFDSESGLDSNSPSEDEAY 181 Query: 110 LRFEDVSHAYTKLEGDYQKFLSQCGMSNYGYWRGGS 3 + +++ SHAY++L+ DY+KFLS+CGMS +GYWRGGS Sbjct: 182 ISYDEASHAYSRLQHDYEKFLSECGMSKWGYWRGGS 217