BLASTX nr result

ID: Perilla23_contig00029047 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Perilla23_contig00029047
         (732 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011095893.1| PREDICTED: uncharacterized protein LOC105175...   260   6e-67
ref|XP_012854864.1| PREDICTED: uncharacterized protein LOC105974...   199   2e-48
ref|XP_004250651.1| PREDICTED: uncharacterized protein LOC101253...   191   4e-46
ref|XP_009794078.1| PREDICTED: uncharacterized protein LOC104240...   191   6e-46
ref|XP_006339352.1| PREDICTED: uncharacterized protein LOC102602...   188   4e-45
ref|XP_010665313.1| PREDICTED: uncharacterized protein LOC100241...   185   2e-44
ref|XP_010312982.1| PREDICTED: uncharacterized protein LOC101253...   185   2e-44
ref|XP_009794077.1| PREDICTED: uncharacterized protein LOC104240...   184   7e-44
ref|XP_006376147.1| hypothetical protein POPTR_0013s10230g [Popu...   182   2e-43
ref|XP_006339351.1| PREDICTED: uncharacterized protein LOC102602...   182   2e-43
gb|EYU22750.1| hypothetical protein MIMGU_mgv1a013618mg [Erythra...   181   6e-43
ref|XP_009588799.1| PREDICTED: uncharacterized protein LOC104086...   180   1e-42
ref|XP_011005439.1| PREDICTED: uncharacterized protein LOC105111...   179   2e-42
ref|XP_011005440.1| PREDICTED: uncharacterized protein LOC105111...   176   1e-41
ref|XP_009588798.1| PREDICTED: uncharacterized protein LOC104086...   173   1e-40
ref|XP_007019798.1| Uncharacterized protein isoform 1 [Theobroma...   160   1e-36
gb|KJB81204.1| hypothetical protein B456_013G136900 [Gossypium r...   152   3e-34
gb|KJB81208.1| hypothetical protein B456_013G136900 [Gossypium r...   147   7e-33
ref|XP_006858608.1| PREDICTED: uncharacterized protein LOC184484...   145   2e-32
ref|XP_007019799.1| Uncharacterized protein isoform 2, partial [...   145   2e-32

>ref|XP_011095893.1| PREDICTED: uncharacterized protein LOC105175222 [Sesamum indicum]
          Length = 268

 Score =  260 bits (665), Expect = 6e-67
 Identities = 137/244 (56%), Positives = 166/244 (68%), Gaps = 1/244 (0%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEI 552
           K F LL +RWKL IQRI WK + +SKKILVHVVK+GE LTSISKLYGVP+ +IAA N++I
Sbjct: 23  KHFTLLAQRWKLHIQRIAWKDRDISKKILVHVVKDGENLTSISKLYGVPIHDIAAVNKDI 82

Query: 551 ADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQIFTMPSF 372
            DVDLV EG+ L IPSA A  AQ CHFEG K  EH  P  +P   F+ RQ NQI T+PS 
Sbjct: 83  VDVDLVSEGKHLNIPSASAGDAQGCHFEGDKFHEHQLPKATPCSEFNTRQWNQILTIPSS 142

Query: 371 RQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGV-HHHSSSARWKT 195
            +  +AK   S  VLVPL+AFCI CI+GA Q  +A+N R QA  K G+     +S RWKT
Sbjct: 143 CRLPLAKRTGSVLVLVPLIAFCIRCIIGACQNRVARNLRDQAVNKSGMPRDRCNSVRWKT 202

Query: 194 ALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYGYW 15
            LS+LR                  E+E++ FE++ H+Y KLE DYQKFLS+CGMSN+GYW
Sbjct: 203 VLSELREPDALDAEPETDSDPFSEEQEEVHFEEIFHSYAKLEDDYQKFLSECGMSNWGYW 262

Query: 14  RGGS 3
           RGGS
Sbjct: 263 RGGS 266


>ref|XP_012854864.1| PREDICTED: uncharacterized protein LOC105974330 [Erythranthe
           guttatus]
          Length = 257

 Score =  199 bits (505), Expect = 2e-48
 Identities = 121/248 (48%), Positives = 156/248 (62%), Gaps = 12/248 (4%)
 Frame = -2

Query: 710 KRWKLDIQRITWKGQGM-SKKILVHVVKE--GETLTSISKLYGVPVLE-----IAASNEE 555
           ++WKL+I+RI+ KGQGM +K+   HVVK+  GETLTSISKLYGV ++E      AA+++E
Sbjct: 27  QQWKLEIERISRKGQGMMNKESANHVVKDDRGETLTSISKLYGVSIIENCCLSAAANDKE 86

Query: 554 IADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQIFTMPS 375
           +ADVDLV +GQ           A+VC+ EG KL EHH  S +       +Q NQIF + S
Sbjct: 87  MADVDLVSDGQN-------RNSARVCNLEGTKLHEHHQLSNTTVS----KQPNQIFYLLS 135

Query: 374 FRQFS-IAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHS---SSA 207
             Q   +AK   SF VLVPL+AFCI CI+G F+  + +N RH+A+ K GV HH+   +S 
Sbjct: 136 SHQLPLVAKAGGSFLVLVPLMAFCISCIIGTFRNRVLRNPRHKASNKYGVQHHNKSHNSP 195

Query: 206 RWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSN 27
           RWK  L                      ++EQ+ FED SHA TKLE DYQKFLS+CGMSN
Sbjct: 196 RWKNVLD--------AESEPDNSHPLYEDQEQVNFEDASHANTKLEDDYQKFLSECGMSN 247

Query: 26  YGYWRGGS 3
           +GYWRGGS
Sbjct: 248 WGYWRGGS 255


>ref|XP_004250651.1| PREDICTED: uncharacterized protein LOC101253651 isoform X2 [Solanum
           lycopersicum]
          Length = 267

 Score =  191 bits (485), Expect = 4e-46
 Identities = 117/248 (47%), Positives = 143/248 (57%), Gaps = 5/248 (2%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQ--GMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNE 558
           K F LL KR K  IQ +   G     SK+ LVHVVKE +TLTS+SKLYGVP+ EIAA+N+
Sbjct: 26  KHFTLLPKRCKNHIQELFSNGLQWNSSKQFLVHVVKEDDTLTSLSKLYGVPIFEIAAANK 85

Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRL---GFHIRQRNQIF 387
           EI DVDLVFEGQ L IPS V   +Q    E   L +      S      G  I Q+  + 
Sbjct: 86  EIIDVDLVFEGQHLNIPSYVTSYSQTNQREKINLPKIEVSETSRHFKLCGSDINQK--ML 143

Query: 386 TMPSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSA 207
            + S R    AKT+  F VLVPL+ FCI CIM AF   +A+N    A      H  S S 
Sbjct: 144 YVLSCRHLPYAKTSGHFLVLVPLIGFCIRCIMNAFHHRVARNKLQDA------HQTSGSM 197

Query: 206 RWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSN 27
           RWK+AL DL                   ++E L+ E++SHAY KL+GDYQKFLS+CGMS 
Sbjct: 198 RWKSALRDLTDPDALYSDSRPETDNVTDDREHLQSEELSHAYAKLDGDYQKFLSECGMSK 257

Query: 26  YGYWRGGS 3
           +GYWRGG+
Sbjct: 258 WGYWRGGT 265


>ref|XP_009794078.1| PREDICTED: uncharacterized protein LOC104240878 isoform X2
           [Nicotiana sylvestris]
          Length = 268

 Score =  191 bits (484), Expect = 6e-46
 Identities = 118/247 (47%), Positives = 144/247 (58%), Gaps = 4/247 (1%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGM--SKKILVHVVKEGETLTSISKLYGVPVLEIAASNE 558
           K F++L KR K  IQ     GQ +  SK+ LVHVVKE ETLTSISKLY VP+ EIAA+N+
Sbjct: 26  KHFSILPKRCKYQIQEFFSNGQHLNNSKQFLVHVVKEDETLTSISKLYRVPIYEIAAANK 85

Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFTM 381
           EI DVDLVFEGQ L IPS +  C+Q C  +  +L + H    + RL    +  NQ I ++
Sbjct: 86  EIIDVDLVFEGQLLNIPSYITACSQTCQRKMIRLPKIHVSETNRRLKLCGKDFNQKILSV 145

Query: 380 PSFRQFSIAKTACSFPVLVPLVAFCIVCI-MGAFQIILAKNSRHQAAKKLGVHHHSSSAR 204
            S R    AKT   F VLVPL+AFCI CI M AF   +A+N      K   V   S S R
Sbjct: 146 LSCRHLPYAKTTGYFLVLVPLIAFCIRCIMMNAFHHRVARN------KLQDVRQASGSMR 199

Query: 203 WKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNY 24
           WK AL DL                   +++    ED S AY KL+ DYQKFL++CGMS +
Sbjct: 200 WKLALRDLSDPDASYTDSRPEIENVTDDQDNFHSEDHSRAYAKLDHDYQKFLAECGMSKW 259

Query: 23  GYWRGGS 3
           GYWRGGS
Sbjct: 260 GYWRGGS 266


>ref|XP_006339352.1| PREDICTED: uncharacterized protein LOC102602767 isoform X2 [Solanum
           tuberosum]
          Length = 267

 Score =  188 bits (477), Expect = 4e-45
 Identities = 117/246 (47%), Positives = 142/246 (57%), Gaps = 3/246 (1%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKI--LVHVVKEGETLTSISKLYGVPVLEIAASNE 558
           K F+LL KR K  IQ +   GQ  S  I  LVHVVKE ETLTS+SKLYGVP+ EIAA+N+
Sbjct: 26  KHFSLLPKRCKYHIQELFSNGQQWSSSIQFLVHVVKEDETLTSLSKLYGVPIYEIAAANK 85

Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFTM 381
           EI DV+LVFEGQ L IPS V   +Q    E  +L +      S R        NQ +  +
Sbjct: 86  EIIDVNLVFEGQHLNIPSYVTPYSQTNQREKIRLPKIDVSETSQRFKLCGNDINQKMLYV 145

Query: 380 PSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSARW 201
            S R    AKT+  F VLVPL+ FCI CIM AF   +A+N      K   V   S S RW
Sbjct: 146 LSCRHLPYAKTSGYFLVLVPLIGFCIRCIMNAFHHRVARN------KLQDVRQASGSMRW 199

Query: 200 KTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYG 21
           K AL DL                   ++E L+ E++S AY KL+GDYQKFLS+CGMS +G
Sbjct: 200 KLALRDLSDPDALYSDSRPEIENVTDDREHLQSEELSRAYAKLDGDYQKFLSECGMSKWG 259

Query: 20  YWRGGS 3
           YWRGG+
Sbjct: 260 YWRGGT 265


>ref|XP_010665313.1| PREDICTED: uncharacterized protein LOC100241456 isoform X1 [Vitis
           vinifera]
          Length = 279

 Score =  185 bits (470), Expect = 2e-44
 Identities = 111/252 (44%), Positives = 147/252 (58%), Gaps = 9/252 (3%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEI 552
           K F +L ++W+  IQ I+ KGQ  +K   VH+VKEGETL+SISK YGV +  IAA+N+ I
Sbjct: 32  KHFRMLTEKWRFQIQEIS-KGQHSTKHNSVHMVKEGETLSSISKQYGVSIYSIAAANKNI 90

Query: 551 ADVDLVFEGQQLKIPSAV--------AQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRN 396
            D+DLVF GQ L IPS+          + +++  F+  K  +H        LG  + Q+ 
Sbjct: 91  EDIDLVFCGQHLNIPSSAVGETQKFQTEKSKLSSFDTLKRHQHSLEV----LGGRLNQKL 146

Query: 395 QIFTMPSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHH- 219
               + SF   S AK    F VLVPL+AFCI CI+GAFQ  +  + RHQA  +  V +H 
Sbjct: 147 CTVAL-SFHSLSHAKATGYFLVLVPLIAFCIRCIIGAFQNRVVGDLRHQAVNESEVDYHG 205

Query: 218 SSSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQC 39
           S S RWK+AL D+R                   + Q   E+VSHAY KLE DYQ+FLS+C
Sbjct: 206 SKSVRWKSALDDIREPDTLDTGLQPDSINPSEVQTQGSAEEVSHAYGKLEHDYQQFLSEC 265

Query: 38  GMSNYGYWRGGS 3
           G+S +GYWRGGS
Sbjct: 266 GISKWGYWRGGS 277


>ref|XP_010312982.1| PREDICTED: uncharacterized protein LOC101253651 isoform X1 [Solanum
           lycopersicum]
          Length = 268

 Score =  185 bits (470), Expect = 2e-44
 Identities = 116/249 (46%), Positives = 143/249 (57%), Gaps = 6/249 (2%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQ--GMSKKILVHVVKE-GETLTSISKLYGVPVLEIAASN 561
           K F LL KR K  IQ +   G     SK+ LVHVVK+  +TLTS+SKLYGVP+ EIAA+N
Sbjct: 26  KHFTLLPKRCKNHIQELFSNGLQWNSSKQFLVHVVKDRDDTLTSLSKLYGVPIFEIAAAN 85

Query: 560 EEIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRL---GFHIRQRNQI 390
           +EI DVDLVFEGQ L IPS V   +Q    E   L +      S      G  I Q+  +
Sbjct: 86  KEIIDVDLVFEGQHLNIPSYVTSYSQTNQREKINLPKIEVSETSRHFKLCGSDINQK--M 143

Query: 389 FTMPSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSS 210
             + S R    AKT+  F VLVPL+ FCI CIM AF   +A+N    A      H  S S
Sbjct: 144 LYVLSCRHLPYAKTSGHFLVLVPLIGFCIRCIMNAFHHRVARNKLQDA------HQTSGS 197

Query: 209 ARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMS 30
            RWK+AL DL                   ++E L+ E++SHAY KL+GDYQKFLS+CGMS
Sbjct: 198 MRWKSALRDLTDPDALYSDSRPETDNVTDDREHLQSEELSHAYAKLDGDYQKFLSECGMS 257

Query: 29  NYGYWRGGS 3
            +GYWRGG+
Sbjct: 258 KWGYWRGGT 266


>ref|XP_009794077.1| PREDICTED: uncharacterized protein LOC104240878 isoform X1
           [Nicotiana sylvestris]
          Length = 275

 Score =  184 bits (466), Expect = 7e-44
 Identities = 118/254 (46%), Positives = 145/254 (57%), Gaps = 11/254 (4%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGM--SKKILVHVVKEGETLTSISKLYGVPVLEIAASNE 558
           K F++L KR K  IQ     GQ +  SK+ LVHVVKE ETLTSISKLY VP+ EIAA+N+
Sbjct: 26  KHFSILPKRCKYQIQEFFSNGQHLNNSKQFLVHVVKEDETLTSISKLYRVPIYEIAAANK 85

Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFTM 381
           EI DVDLVFEGQ L IPS +  C+Q C  +  +L + H    + RL    +  NQ I ++
Sbjct: 86  EIIDVDLVFEGQLLNIPSYITACSQTCQRKMIRLPKIHVSETNRRLKLCGKDFNQKILSV 145

Query: 380 PSFRQ-------FSIAKTACSFPVLVPLVAFCIVCI-MGAFQIILAKNSRHQAAKKLGVH 225
            S R        +  AKT   F VLVPL+AFCI CI M AF   +A+N      K   V 
Sbjct: 146 LSCRHLPYTCQCYYQAKTTGYFLVLVPLIAFCIRCIMMNAFHHRVARN------KLQDVR 199

Query: 224 HHSSSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLS 45
             S S RWK AL DL                   +++    ED S AY KL+ DYQKFL+
Sbjct: 200 QASGSMRWKLALRDLSDPDASYTDSRPEIENVTDDQDNFHSEDHSRAYAKLDHDYQKFLA 259

Query: 44  QCGMSNYGYWRGGS 3
           +CGMS +GYWRGGS
Sbjct: 260 ECGMSKWGYWRGGS 273


>ref|XP_006376147.1| hypothetical protein POPTR_0013s10230g [Populus trichocarpa]
           gi|550325417|gb|ERP53944.1| hypothetical protein
           POPTR_0013s10230g [Populus trichocarpa]
          Length = 286

 Score =  182 bits (462), Expect = 2e-43
 Identities = 112/251 (44%), Positives = 145/251 (57%), Gaps = 8/251 (3%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEI 552
           K F +L +RW+  IQ I+ KGQ  +   L+HVVKEGETLTSISK YGV +  +AA+N+ I
Sbjct: 40  KHFTVLAERWRFHIQDIS-KGQSSTNPYLLHVVKEGETLTSISKQYGVSIYSVAAANKNI 98

Query: 551 ADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGS--PRLGFHIRQRNQIFTMP 378
            DVDLVFEGQ L IP+A     QV     Y++++   PS     RL   ++  + +    
Sbjct: 99  LDVDLVFEGQLLNIPAAAPAGTQV-----YQIKKCESPSFDQLERLQNFMKIMDGVLNQK 153

Query: 377 SF-----RQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHH-S 216
            F      +   AK    F VLVP +AFCI CI+GAF     +N   QA+ +   HH   
Sbjct: 154 PFITVTTLRLPHAKATGYFLVLVPALAFCIRCIIGAFHTRARRNLGCQASNESRRHHDVP 213

Query: 215 SSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCG 36
            S RWK ALSD+R                  +++Q  FE+VSHAY KLE +YQKFLS+CG
Sbjct: 214 ESKRWKHALSDIREPDNLDGEPILNSTGTSADQDQNSFEEVSHAYDKLEHEYQKFLSECG 273

Query: 35  MSNYGYWRGGS 3
           +SN GYWRGGS
Sbjct: 274 ISNSGYWRGGS 284


>ref|XP_006339351.1| PREDICTED: uncharacterized protein LOC102602767 isoform X1 [Solanum
           tuberosum]
          Length = 268

 Score =  182 bits (462), Expect = 2e-43
 Identities = 116/247 (46%), Positives = 142/247 (57%), Gaps = 4/247 (1%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKI--LVHVVKE-GETLTSISKLYGVPVLEIAASN 561
           K F+LL KR K  IQ +   GQ  S  I  LVHVVK+  ETLTS+SKLYGVP+ EIAA+N
Sbjct: 26  KHFSLLPKRCKYHIQELFSNGQQWSSSIQFLVHVVKDRDETLTSLSKLYGVPIYEIAAAN 85

Query: 560 EEIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFT 384
           +EI DV+LVFEGQ L IPS V   +Q    E  +L +      S R        NQ +  
Sbjct: 86  KEIIDVNLVFEGQHLNIPSYVTPYSQTNQREKIRLPKIDVSETSQRFKLCGNDINQKMLY 145

Query: 383 MPSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSAR 204
           + S R    AKT+  F VLVPL+ FCI CIM AF   +A+N      K   V   S S R
Sbjct: 146 VLSCRHLPYAKTSGYFLVLVPLIGFCIRCIMNAFHHRVARN------KLQDVRQASGSMR 199

Query: 203 WKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNY 24
           WK AL DL                   ++E L+ E++S AY KL+GDYQKFLS+CGMS +
Sbjct: 200 WKLALRDLSDPDALYSDSRPEIENVTDDREHLQSEELSRAYAKLDGDYQKFLSECGMSKW 259

Query: 23  GYWRGGS 3
           GYWRGG+
Sbjct: 260 GYWRGGT 266


>gb|EYU22750.1| hypothetical protein MIMGU_mgv1a013618mg [Erythranthe guttata]
          Length = 215

 Score =  181 bits (458), Expect = 6e-43
 Identities = 111/231 (48%), Positives = 141/231 (61%), Gaps = 11/231 (4%)
 Frame = -2

Query: 662 MSKKILVHVVKE--GETLTSISKLYGVPVLE-----IAASNEEIADVDLVFEGQQLKIPS 504
           M+K+   HVVK+  GETLTSISKLYGV ++E      AA+++E+ADVDLV +GQ      
Sbjct: 2   MNKESANHVVKDDRGETLTSISKLYGVSIIENCCLSAAANDKEMADVDLVSDGQN----- 56

Query: 503 AVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQIFTMPSFRQFS-IAKTACSFPVL 327
                A+VC+ EG KL EHH  S +       +Q NQIF + S  Q   +AK   SF VL
Sbjct: 57  --RNSARVCNLEGTKLHEHHQLSNTTVS----KQPNQIFYLLSSHQLPLVAKAGGSFLVL 110

Query: 326 VPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHS---SSARWKTALSDLRXXXXXXX 156
           VPL+AFCI CI+G F+  + +N RH+A+ K GV HH+   +S RWK  L           
Sbjct: 111 VPLMAFCISCIIGTFRNRVLRNPRHKASNKYGVQHHNKSHNSPRWKNVLD--------AE 162

Query: 155 XXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYGYWRGGS 3
                      ++EQ+ FED SHA TKLE DYQKFLS+CGMSN+GYWRGGS
Sbjct: 163 SEPDNSHPLYEDQEQVNFEDASHANTKLEDDYQKFLSECGMSNWGYWRGGS 213


>ref|XP_009588799.1| PREDICTED: uncharacterized protein LOC104086270 isoform X2
           [Nicotiana tomentosiformis]
          Length = 267

 Score =  180 bits (456), Expect = 1e-42
 Identities = 110/246 (44%), Positives = 140/246 (56%), Gaps = 3/246 (1%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGM--SKKILVHVVKEGETLTSISKLYGVPVLEIAASNE 558
           K F +L KR K  IQ      Q +  S++ LVHVVKE ETLTSISKLYGVP+ EIAA+N+
Sbjct: 26  KHFIILPKRCKYQIQEFFSNDQHLNNSRQFLVHVVKEDETLTSISKLYGVPIYEIAAANK 85

Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFTM 381
           +I DVDLVFEGQ L +PS +  C+Q C  +  +L +      + RL    +  NQ I ++
Sbjct: 86  QIIDVDLVFEGQLLNVPSYITTCSQTCQRKMIRLPKIDVSETNRRLKLCGKDFNQKILSV 145

Query: 380 PSFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSARW 201
            S R    AKT   F VLV L+AF I CIM AF   + +N      K   V   S S RW
Sbjct: 146 LSCRHLPYAKTTGYFLVLVSLIAFGIRCIMNAFHRRVGRN------KLQDVRQASGSMRW 199

Query: 200 KTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYG 21
           K AL DL                   +++    ED+S AY K++ DYQKFL++CGMS +G
Sbjct: 200 KLALRDLSDPDASYTDSRTEIDNVTDDQDNFHSEDLSRAYAKVDHDYQKFLAECGMSKWG 259

Query: 20  YWRGGS 3
           YWRGGS
Sbjct: 260 YWRGGS 265


>ref|XP_011005439.1| PREDICTED: uncharacterized protein LOC105111694 isoform X1 [Populus
           euphratica]
          Length = 292

 Score =  179 bits (454), Expect = 2e-42
 Identities = 110/252 (43%), Positives = 145/252 (57%), Gaps = 9/252 (3%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEI 552
           K F +L +RW+  IQ I+ KGQ  +   L+HVVKEGETLTSISK YGV +  +AA+N+ I
Sbjct: 40  KHFTVLAERWRFHIQDIS-KGQSSTNPYLLHVVKEGETLTSISKQYGVSIYSVAAANKNI 98

Query: 551 ADVDLVFEGQQLKIPSAVAQCAQVC-HFEGYKLREHHFPSGS--PRLGFHIRQRNQIFTM 381
            DVDLVFEGQ L IP++     +VC     Y++++   PS     RL   ++  + +   
Sbjct: 99  LDVDLVFEGQLLNIPASAPADTKVCLCLNQYQVKKCESPSFDQLERLQNFMKIMDGVLNQ 158

Query: 380 PSFRQFSI-----AKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHH- 219
             F   +      AK    F VLVP +AFCI CI+GAF     +N   QA+ +   H   
Sbjct: 159 KPFITVTTLHLPHAKATGYFLVLVPALAFCIRCIIGAFHTRARRNLGCQASNESRRHQDV 218

Query: 218 SSSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQC 39
             S RWK ALSD+R                  +++Q  FE+VSHAY KLE +YQKFLS+C
Sbjct: 219 PESKRWKHALSDIREPDNLDGEPILNSTGTSADQDQNSFEEVSHAYDKLEHEYQKFLSEC 278

Query: 38  GMSNYGYWRGGS 3
           G+SN GYWRGGS
Sbjct: 279 GISNSGYWRGGS 290


>ref|XP_011005440.1| PREDICTED: uncharacterized protein LOC105111694 isoform X2 [Populus
           euphratica]
          Length = 286

 Score =  176 bits (446), Expect = 1e-41
 Identities = 109/251 (43%), Positives = 144/251 (57%), Gaps = 8/251 (3%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEI 552
           K F +L +RW+  IQ I+ KGQ  +   L+HVVKEGETLTSISK YGV +  +AA+N+ I
Sbjct: 40  KHFTVLAERWRFHIQDIS-KGQSSTNPYLLHVVKEGETLTSISKQYGVSIYSVAAANKNI 98

Query: 551 ADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGS--PRLGFHIRQRNQIFTMP 378
            DVDLVFEGQ L IP++     +V     Y++++   PS     RL   ++  + +    
Sbjct: 99  LDVDLVFEGQLLNIPASAPADTKV-----YQVKKCESPSFDQLERLQNFMKIMDGVLNQK 153

Query: 377 SFRQFSI-----AKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHH-S 216
            F   +      AK    F VLVP +AFCI CI+GAF     +N   QA+ +   H    
Sbjct: 154 PFITVTTLHLPHAKATGYFLVLVPALAFCIRCIIGAFHTRARRNLGCQASNESRRHQDVP 213

Query: 215 SSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCG 36
            S RWK ALSD+R                  +++Q  FE+VSHAY KLE +YQKFLS+CG
Sbjct: 214 ESKRWKHALSDIREPDNLDGEPILNSTGTSADQDQNSFEEVSHAYDKLEHEYQKFLSECG 273

Query: 35  MSNYGYWRGGS 3
           +SN GYWRGGS
Sbjct: 274 ISNSGYWRGGS 284


>ref|XP_009588798.1| PREDICTED: uncharacterized protein LOC104086270 isoform X1
           [Nicotiana tomentosiformis]
          Length = 274

 Score =  173 bits (438), Expect = 1e-40
 Identities = 110/253 (43%), Positives = 141/253 (55%), Gaps = 10/253 (3%)
 Frame = -2

Query: 731 KPFALLVKRWKLDIQRITWKGQGM--SKKILVHVVKEGETLTSISKLYGVPVLEIAASNE 558
           K F +L KR K  IQ      Q +  S++ LVHVVKE ETLTSISKLYGVP+ EIAA+N+
Sbjct: 26  KHFIILPKRCKYQIQEFFSNDQHLNNSRQFLVHVVKEDETLTSISKLYGVPIYEIAAANK 85

Query: 557 EIADVDLVFEGQQLKIPSAVAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQ-IFTM 381
           +I DVDLVFEGQ L +PS +  C+Q C  +  +L +      + RL    +  NQ I ++
Sbjct: 86  QIIDVDLVFEGQLLNVPSYITTCSQTCQRKMIRLPKIDVSETNRRLKLCGKDFNQKILSV 145

Query: 380 PSFRQ-------FSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHH 222
            S R        +  AKT   F VLV L+AF I CIM AF   + +N      K   V  
Sbjct: 146 LSCRHLPYTCQCYYQAKTTGYFLVLVSLIAFGIRCIMNAFHRRVGRN------KLQDVRQ 199

Query: 221 HSSSARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQ 42
            S S RWK AL DL                   +++    ED+S AY K++ DYQKFL++
Sbjct: 200 ASGSMRWKLALRDLSDPDASYTDSRTEIDNVTDDQDNFHSEDLSRAYAKVDHDYQKFLAE 259

Query: 41  CGMSNYGYWRGGS 3
           CGMS +GYWRGGS
Sbjct: 260 CGMSKWGYWRGGS 272


>ref|XP_007019798.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508725126|gb|EOY17023.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 273

 Score =  160 bits (404), Expect = 1e-36
 Identities = 101/250 (40%), Positives = 142/250 (56%), Gaps = 9/250 (3%)
 Frame = -2

Query: 725 FALLVKRWKLDIQRITWKGQGMSKK-ILVHVVKEGETLTSISKLYGVPVLEIAASNEEIA 549
           F  L+K+W+L         Q  SK  I  H+VKEGETL+SISK YGV V  IAA+N++I 
Sbjct: 45  FQGLIKKWRL---------QNNSKDYICAHLVKEGETLSSISKKYGVSVYSIAAANKDIV 95

Query: 548 DVDLVFEGQQLKIPSA------VAQCAQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQIF 387
           D+ LVF+GQ L IP++      +A+ +++ H     +R    PS              I+
Sbjct: 96  DIHLVFKGQLLNIPASSLKETLLAKKSRLWH----SIRAFRTPS-----------HKIIY 140

Query: 386 TMPSFRQFS-IAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHH-HSS 213
           +M +    S  AK    F VLVPL+AFCI CI+  F+I +A++ RHQA  K   HH  + 
Sbjct: 141 SMVTSHGLSNQAKATGYFLVLVPLIAFCIRCIISTFRIRVARDMRHQAVDKSKGHHPGAK 200

Query: 212 SARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGM 33
           S RWK+ALSD                    ++  + +++ SHAY++L+ DY+KFLS+CGM
Sbjct: 201 SMRWKSALSDTEESDAFDSESGLDSNSPSEDEAYISYDEASHAYSRLQHDYEKFLSECGM 260

Query: 32  SNYGYWRGGS 3
           S +GYWRGGS
Sbjct: 261 SKWGYWRGGS 270


>gb|KJB81204.1| hypothetical protein B456_013G136900 [Gossypium raimondii]
          Length = 273

 Score =  152 bits (383), Expect = 3e-34
 Identities = 97/246 (39%), Positives = 133/246 (54%), Gaps = 5/246 (2%)
 Frame = -2

Query: 725 FALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEIAD 546
           F  LVK+W+L  +   +           HVVKEGETL+SISK+YGV V  IAA+N+ I D
Sbjct: 44  FQGLVKKWRLQNKTKDYS--------CAHVVKEGETLSSISKMYGVSVHSIAAANKNIVD 95

Query: 545 VDLVFEGQQLKIPSAVAQCAQVCHFEGYKL----REHHFPSGSPRLGFHIRQRNQIFTMP 378
           ++LVF GQ L IPS+     Q+   +  +L    R    PSG            + FTM 
Sbjct: 96  INLVFRGQLLNIPSSSLLDTQLDRAKKSRLWQSIRALKAPSG-----------QKFFTMI 144

Query: 377 SFRQFSIAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSA-RW 201
           +    S AK+   F VLVPL+AFCI CI+      ++++ +HQAA +   HH  +   RW
Sbjct: 145 TAHCLSNAKSTGYFLVLVPLIAFCIGCIIVTLHTRVSRSIKHQAADESQAHHPGAKGRRW 204

Query: 200 KTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYG 21
           K+ALSD                    ++  ++ E+ S  Y +LE DYQKFLS+CG+S +G
Sbjct: 205 KSALSDSVEGDVFDSELGLDSNSTSEDEANIQNEEASKDYGRLEHDYQKFLSECGISKWG 264

Query: 20  YWRGGS 3
           YWRGGS
Sbjct: 265 YWRGGS 270


>gb|KJB81208.1| hypothetical protein B456_013G136900 [Gossypium raimondii]
          Length = 274

 Score =  147 bits (371), Expect = 7e-33
 Identities = 97/247 (39%), Positives = 133/247 (53%), Gaps = 6/247 (2%)
 Frame = -2

Query: 725 FALLVKRWKLDIQRITWKGQGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEIAD 546
           F  LVK+W+L  +   +           HVVKEGETL+SISK+YGV V  IAA+N+ I D
Sbjct: 44  FQGLVKKWRLQNKTKDYS--------CAHVVKEGETLSSISKMYGVSVHSIAAANKNIVD 95

Query: 545 VDLVFEGQQLKIPSAVAQCAQVCHFEGYKL----REHHFPSGSPRLGFHIRQRNQIFTMP 378
           ++LVF GQ L IPS+     Q+   +  +L    R    PSG            + FTM 
Sbjct: 96  INLVFRGQLLNIPSSSLLDTQLDRAKKSRLWQSIRALKAPSG-----------QKFFTMI 144

Query: 377 SFRQFS-IAKTACSFPVLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGVHHHSSSA-R 204
           +    S  AK+   F VLVPL+AFCI CI+      ++++ +HQAA +   HH  +   R
Sbjct: 145 TAHCLSNQAKSTGYFLVLVPLIAFCIGCIIVTLHTRVSRSIKHQAADESQAHHPGAKGRR 204

Query: 203 WKTALSDLRXXXXXXXXXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNY 24
           WK+ALSD                    ++  ++ E+ S  Y +LE DYQKFLS+CG+S +
Sbjct: 205 WKSALSDSVEGDVFDSELGLDSNSTSEDEANIQNEEASKDYGRLEHDYQKFLSECGISKW 264

Query: 23  GYWRGGS 3
           GYWRGGS
Sbjct: 265 GYWRGGS 271


>ref|XP_006858608.1| PREDICTED: uncharacterized protein LOC18448485 isoform X1
           [Amborella trichopoda] gi|548862717|gb|ERN20075.1|
           hypothetical protein AMTR_s00071p00202040 [Amborella
           trichopoda]
          Length = 281

 Score =  145 bits (367), Expect = 2e-32
 Identities = 91/231 (39%), Positives = 120/231 (51%), Gaps = 9/231 (3%)
 Frame = -2

Query: 668 QGMSKKILVHVVKEGETLTSISKLYGVPVLEIAASNEEIADVDLVFEGQQLKIPSAVAQC 489
           Q ++KK+LVHVVKEGETLTSIS+ Y V +  IAA+N +I +VD V EG+ L +P    + 
Sbjct: 54  QDIAKKLLVHVVKEGETLTSISRKYRVSIELIAAANTDITNVDFVLEGRSLNVPIVSKE- 112

Query: 488 AQVCHFEGYKLREHHFPSGSPRLGFHIRQRNQIFTMPSF--------RQFSIAKTACSFP 333
                 +G   RE+H   G  +  F     N +    ++            +AK    F 
Sbjct: 113 -----IQGVSPRENHAIQGDAKEIFQYSHVNTLVAQANYNLSRMLSPHYLQLAKGTGYFL 167

Query: 332 VLVPLVAFCIVCIMGAFQIILAKNSRHQAAKKLGV-HHHSSSARWKTALSDLRXXXXXXX 156
           ++  LVAFC   I   F    A   +HQA   L V H  S S RWK ALS++R       
Sbjct: 168 LVATLVAFCFRYIFSEFHHRFANKLKHQAQNDLKVPHDGSGSMRWKFALSEIREMGIVDA 227

Query: 155 XXXXXXXXXXXEKEQLRFEDVSHAYTKLEGDYQKFLSQCGMSNYGYWRGGS 3
                      ++E    E+V+ AYTKLE  YQKFLS+CGMS +GYWRGGS
Sbjct: 228 ESRENPDGDSQDQELDSLEEVAEAYTKLEPAYQKFLSECGMSKWGYWRGGS 278


>ref|XP_007019799.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
           gi|508725127|gb|EOY17024.1| Uncharacterized protein
           isoform 2, partial [Theobroma cacao]
          Length = 220

 Score =  145 bits (367), Expect = 2e-32
 Identities = 88/216 (40%), Positives = 125/216 (57%), Gaps = 8/216 (3%)
 Frame = -2

Query: 626 GETLTSISKLYGVPVLEIAASNEEIADVDLVFEGQQLKIPSA------VAQCAQVCHFEG 465
           GETL+SISK YGV V  IAA+N++I D+ LVF+GQ L IP++      +A+ +++ H   
Sbjct: 17  GETLSSISKKYGVSVYSIAAANKDIVDIHLVFKGQLLNIPASSLKETLLAKKSRLWH--- 73

Query: 464 YKLREHHFPSGSPRLGFHIRQRNQIFTMPSFRQFSI-AKTACSFPVLVPLVAFCIVCIMG 288
             +R    PS              I++M +    S  AK    F VLVPL+AFCI CI+ 
Sbjct: 74  -SIRAFRTPS-----------HKIIYSMVTSHGLSNQAKATGYFLVLVPLIAFCIRCIIS 121

Query: 287 AFQIILAKNSRHQAAKKLGVHHHSS-SARWKTALSDLRXXXXXXXXXXXXXXXXXXEKEQ 111
            F+I +A++ RHQA  K   HH  + S RWK+ALSD                    ++  
Sbjct: 122 TFRIRVARDMRHQAVDKSKGHHPGAKSMRWKSALSDTEESDAFDSESGLDSNSPSEDEAY 181

Query: 110 LRFEDVSHAYTKLEGDYQKFLSQCGMSNYGYWRGGS 3
           + +++ SHAY++L+ DY+KFLS+CGMS +GYWRGGS
Sbjct: 182 ISYDEASHAYSRLQHDYEKFLSECGMSKWGYWRGGS 217


Top