BLASTX nr result

ID: Magnolia22_contig00012842 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Magnolia22_contig00012842
         (1355 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010665313.1 PREDICTED: uncharacterized protein LOC100241456 i...   219   6e-65
XP_018849482.1 PREDICTED: uncharacterized protein LOC109012352 i...   209   3e-61
XP_006858608.1 PREDICTED: uncharacterized protein LOC18448485 is...   197   1e-56
XP_011005440.1 PREDICTED: uncharacterized protein LOC105111694 i...   195   1e-55
XP_006376147.1 hypothetical protein POPTR_0013s10230g [Populus t...   194   5e-55
XP_011005439.1 PREDICTED: uncharacterized protein LOC105111694 i...   192   1e-54
XP_007019798.1 PREDICTED: uncharacterized protein LOC18592837 is...   185   8e-52
KJB81204.1 hypothetical protein B456_013G136900 [Gossypium raimo...   184   1e-51
XP_016703596.1 PREDICTED: uncharacterized protein LOC107918531 i...   184   2e-51
XP_017981653.1 PREDICTED: uncharacterized protein LOC18592837 is...   182   9e-51
KCW69519.1 hypothetical protein EUGRSUZ_F02959 [Eucalyptus grandis]   182   1e-50
XP_016703595.1 PREDICTED: uncharacterized protein LOC107918531 i...   181   2e-50
KJB81208.1 hypothetical protein B456_013G136900 [Gossypium raimo...   180   5e-50
XP_004250651.1 PREDICTED: uncharacterized protein LOC101253651 i...   180   5e-50
XP_015056897.1 PREDICTED: uncharacterized protein LOC107003144 i...   179   1e-49
XP_017617430.1 PREDICTED: uncharacterized protein LOC108461944 i...   178   4e-49
XP_017981652.1 PREDICTED: uncharacterized protein LOC18592837 is...   178   4e-49
XP_011095893.1 PREDICTED: uncharacterized protein LOC105175222 [...   177   6e-49
XP_016703594.1 PREDICTED: uncharacterized protein LOC107918531 i...   177   8e-49
XP_010062380.1 PREDICTED: uncharacterized protein LOC104449808 i...   178   8e-49

>XP_010665313.1 PREDICTED: uncharacterized protein LOC100241456 isoform X1 [Vitis
            vinifera]
          Length = 279

 Score =  219 bits (558), Expect = 6e-65
 Identities = 125/255 (49%), Positives = 158/255 (61%), Gaps = 4/255 (1%)
 Frame = -1

Query: 1061 LKNKKLLTNRFRALSQRWNFIQIQGLSKSWNPAKQFSVHVVKEGETLTSISMQYGVSMQX 882
            +  ++     FR L+++W F QIQ +SK  +  K  SVH+VKEGETL+SIS QYGVS+  
Sbjct: 24   ISKQRSFKKHFRMLTEKWRF-QIQEISKGQHSTKHNSVHMVKEGETLSSISKQYGVSIYS 82

Query: 881  XXXXXXXXXXXXXXLEGQRLNIPPSAK-EAQMDSASTNQLDHCNIHERCQSSSSILGAHE 705
                            GQ LNIP SA  E Q      ++L   +  +R Q S  +LG   
Sbjct: 83   IAAANKNIEDIDLVFCGQHLNIPSSAVGETQKFQTEKSKLSSFDTLKRHQHSLEVLGGRL 142

Query: 704  NQRIFTLLSSQH-IQLAKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDH 528
            NQ++ T+  S H +  AK TGYFLVLVPLIA CIRCIIG F +R+   L+HQAVNES+  
Sbjct: 143  NQKLCTVALSFHSLSHAKATGYFLVLVPLIAFCIRCIIGAFQNRVVGDLRHQAVNESEVD 202

Query: 527  HQRSRSERWKSVL--IRESDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFL 354
            +  S+S RWKS L  IRE D  D   + D +NPSE Q+Q   E++SHAY KLE  Y +FL
Sbjct: 203  YHGSKSVRWKSALDDIREPDTLDTGLQPDSINPSEVQTQGSAEEVSHAYGKLEHDYQQFL 262

Query: 353  SECGMSKWGYWRGGS 309
            SECG+SKWGYWRGGS
Sbjct: 263  SECGISKWGYWRGGS 277


>XP_018849482.1 PREDICTED: uncharacterized protein LOC109012352 isoform X1 [Juglans
            regia]
          Length = 280

 Score =  209 bits (533), Expect = 3e-61
 Identities = 126/291 (43%), Positives = 168/291 (57%), Gaps = 2/291 (0%)
 Frame = -1

Query: 1169 MEPKLNQKKEIGMMRSTVRRGTASNITSSPILSINPLKNKKLLTNRFRALSQRWNFIQIQ 990
            ME KL++++++      + + T  + +SS +LS +         N FR  +Q+W F +IQ
Sbjct: 1    MEVKLSERRDLLPFIKLLPKPTFPSQSSS-VLSFH---------NNFRGFAQKWRF-RIQ 49

Query: 989  GLSKSWNPAKQFSVHVVKEGETLTSISMQYGVSMQXXXXXXXXXXXXXXXLEGQRLNIPP 810
             +SK+  P K + VHV+KEGETLTSIS QYGVS+                  GQ LNIP 
Sbjct: 50   EISKAEQPTKHYLVHVIKEGETLTSISKQYGVSIHAIAAANKSIKDVDILFGGQHLNIPS 109

Query: 809  SAKEAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTLLSSQHIQLAKPTGYFLVL 630
            + ++  +       L    + E    S +IL     QR F +LSS ++  AK TGYFLVL
Sbjct: 110  ATRDTPVVRTVIIWLRGLKLRENHLGSLNILDGLLEQRSFNVLSSHYLPHAKTTGYFLVL 169

Query: 629  VPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVL--IRESDVADADS 456
            VPL+A CIR I+  FH R+  +LK +  +ES+  H R RS RWKSVL  I E D  DA+ 
Sbjct: 170  VPLVAFCIRLILDAFHTRVAGELKDEVASESECQHHRCRSMRWKSVLSDIEEIDSVDAEL 229

Query: 455  RQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGSLE 303
                 N SE QSQV FE++SHAY KLE  Y KFLS+CGM + G+WRGGS E
Sbjct: 230  SPHSNNHSEAQSQVSFEEMSHAYNKLEQDYQKFLSDCGMKESGHWRGGSPE 280


>XP_006858608.1 PREDICTED: uncharacterized protein LOC18448485 isoform X1 [Amborella
            trichopoda] ERN20075.1 hypothetical protein
            AMTR_s00071p00202040 [Amborella trichopoda]
          Length = 281

 Score =  197 bits (502), Expect = 1e-56
 Identities = 118/260 (45%), Positives = 153/260 (58%), Gaps = 3/260 (1%)
 Frame = -1

Query: 1079 ILSINPLKNKKLLTNRFRALSQRWNFIQIQGLSKSWNPAKQFSVHVVKEGETLTSISMQY 900
            ++S   +KN   L+     +S+R + IQ  G S + + AK+  VHVVKEGETLTSIS +Y
Sbjct: 23   VVSSRFIKNLSQLSR----VSERCSSIQSHGQSTAQDIAKKLLVHVVKEGETLTSISRKY 78

Query: 899  GVSMQXXXXXXXXXXXXXXXLEGQRLNIPPSAKEAQMDSASTNQLDHCNIHERCQSSS-S 723
             VS++               LEG+ LN+P  +KE Q  S   N     +  E  Q S  +
Sbjct: 79   RVSIELIAAANTDITNVDFVLEGRSLNVPIVSKEIQGVSPRENHAIQGDAKEIFQYSHVN 138

Query: 722  ILGAHENQRIFTLLSSQHIQLAKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVN 543
             L A  N  +  +LS  ++QLAK TGYFL++  L+A C R I   FHHR   +LKHQA N
Sbjct: 139  TLVAQANYNLSRMLSPHYLQLAKGTGYFLLVATLVAFCFRYIFSEFHHRFANKLKHQAQN 198

Query: 542  ESKDHHQRSRSERWKSVL--IRESDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPA 369
            + K  H  S S RWK  L  IRE  + DA+SR++    S+DQ     E+++ AYTKLEPA
Sbjct: 199  DLKVPHDGSGSMRWKFALSEIREMGIVDAESRENPDGDSQDQELDSLEEVAEAYTKLEPA 258

Query: 368  YLKFLSECGMSKWGYWRGGS 309
            Y KFLSECGMSKWGYWRGGS
Sbjct: 259  YQKFLSECGMSKWGYWRGGS 278


>XP_011005440.1 PREDICTED: uncharacterized protein LOC105111694 isoform X2 [Populus
            euphratica]
          Length = 286

 Score =  195 bits (496), Expect = 1e-55
 Identities = 115/249 (46%), Positives = 141/249 (56%), Gaps = 6/249 (2%)
 Frame = -1

Query: 1031 FRALSQRWNFIQIQGLSKSWNPAKQFSVHVVKEGETLTSISMQYGVSMQXXXXXXXXXXX 852
            F  L++RW F  IQ +SK  +    + +HVVKEGETLTSIS QYGVS+            
Sbjct: 42   FTVLAERWRF-HIQDISKGQSSTNPYLLHVVKEGETLTSISKQYGVSIYSVAAANKNILD 100

Query: 851  XXXXLEGQRLNIPPSA----KEAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTL 684
                 EGQ LNIP SA    K  Q+    +   D     ER Q+   I+    NQ+ F  
Sbjct: 101  VDLVFEGQLLNIPASAPADTKVYQVKKCESPSFDQL---ERLQNFMKIMDGVLNQKPFIT 157

Query: 683  LSSQHIQLAKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSER 504
            +++ H+  AK TGYFLVLVP +A CIRCIIG FH R  + L  QA NES+ H     S+R
Sbjct: 158  VTTLHLPHAKATGYFLVLVPALAFCIRCIIGAFHTRARRNLGCQASNESRRHQDVPESKR 217

Query: 503  WKSVL--IRESDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKW 330
            WK  L  IRE D  D +   +    S DQ Q  FE++SHAY KLE  Y KFLSECG+S  
Sbjct: 218  WKHALSDIREPDNLDGEPILNSTGTSADQDQNSFEEVSHAYDKLEHEYQKFLSECGISNS 277

Query: 329  GYWRGGSLE 303
            GYWRGGS +
Sbjct: 278  GYWRGGSTD 286


>XP_006376147.1 hypothetical protein POPTR_0013s10230g [Populus trichocarpa]
            ERP53944.1 hypothetical protein POPTR_0013s10230g
            [Populus trichocarpa]
          Length = 286

 Score =  194 bits (492), Expect = 5e-55
 Identities = 120/270 (44%), Positives = 149/270 (55%), Gaps = 9/270 (3%)
 Frame = -1

Query: 1091 TSSPILSINPLKNKKLLTNRFRALSQRWNFIQIQGLSKSWNPAKQFSVHVVKEGETLTSI 912
            T+ P  S+    N K     F  L++RW F  IQ +SK  +    + +HVVKEGETLTSI
Sbjct: 25   TTFPSPSLKWWVNSK---KHFTVLAERWRF-HIQDISKGQSSTNPYLLHVVKEGETLTSI 80

Query: 911  SMQYGVSMQXXXXXXXXXXXXXXXLEGQRLNIPPSAKEA-------QMDSASTNQLDHCN 753
            S QYGVS+                 EGQ LNIP +A          + +S S +QL    
Sbjct: 81   SKQYGVSIYSVAAANKNILDVDLVFEGQLLNIPAAAPAGTQVYQIKKCESPSFDQL---- 136

Query: 752  IHERCQSSSSILGAHENQRIFTLLSSQHIQLAKPTGYFLVLVPLIAICIRCIIGTFHHRL 573
              ER Q+   I+    NQ+ F  +++  +  AK TGYFLVLVP +A CIRCIIG FH R 
Sbjct: 137  --ERLQNFMKIMDGVLNQKPFITVTTLRLPHAKATGYFLVLVPALAFCIRCIIGAFHTRA 194

Query: 572  PKQLKHQAVNESKDHHQRSRSERWKSVL--IRESDVADADSRQDLVNPSEDQSQVPFEDI 399
             + L  QA NES+ HH    S+RWK  L  IRE D  D +   +    S DQ Q  FE++
Sbjct: 195  RRNLGCQASNESRRHHDVPESKRWKHALSDIREPDNLDGEPILNSTGTSADQDQNSFEEV 254

Query: 398  SHAYTKLEPAYLKFLSECGMSKWGYWRGGS 309
            SHAY KLE  Y KFLSECG+S  GYWRGGS
Sbjct: 255  SHAYDKLEHEYQKFLSECGISNSGYWRGGS 284


>XP_011005439.1 PREDICTED: uncharacterized protein LOC105111694 isoform X1 [Populus
            euphratica]
          Length = 292

 Score =  192 bits (489), Expect = 1e-54
 Identities = 116/258 (44%), Positives = 144/258 (55%), Gaps = 15/258 (5%)
 Frame = -1

Query: 1031 FRALSQRWNFIQIQGLSKSWNPAKQFSVHVVKEGETLTSISMQYGVSMQXXXXXXXXXXX 852
            F  L++RW F  IQ +SK  +    + +HVVKEGETLTSIS QYGVS+            
Sbjct: 42   FTVLAERWRF-HIQDISKGQSSTNPYLLHVVKEGETLTSISKQYGVSIYSVAAANKNILD 100

Query: 851  XXXXLEGQRLNIPPSA-------------KEAQMDSASTNQLDHCNIHERCQSSSSILGA 711
                 EGQ LNIP SA             +  + +S S +QL      ER Q+   I+  
Sbjct: 101  VDLVFEGQLLNIPASAPADTKVCLCLNQYQVKKCESPSFDQL------ERLQNFMKIMDG 154

Query: 710  HENQRIFTLLSSQHIQLAKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKD 531
              NQ+ F  +++ H+  AK TGYFLVLVP +A CIRCIIG FH R  + L  QA NES+ 
Sbjct: 155  VLNQKPFITVTTLHLPHAKATGYFLVLVPALAFCIRCIIGAFHTRARRNLGCQASNESRR 214

Query: 530  HHQRSRSERWKSVL--IRESDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKF 357
            H     S+RWK  L  IRE D  D +   +    S DQ Q  FE++SHAY KLE  Y KF
Sbjct: 215  HQDVPESKRWKHALSDIREPDNLDGEPILNSTGTSADQDQNSFEEVSHAYDKLEHEYQKF 274

Query: 356  LSECGMSKWGYWRGGSLE 303
            LSECG+S  GYWRGGS +
Sbjct: 275  LSECGISNSGYWRGGSTD 292


>XP_007019798.1 PREDICTED: uncharacterized protein LOC18592837 isoform X3
           [Theobroma cacao] EOY17023.1 Uncharacterized protein
           TCM_036188 isoform 1 [Theobroma cacao]
          Length = 273

 Score =  185 bits (469), Expect = 8e-52
 Identities = 105/237 (44%), Positives = 144/237 (60%), Gaps = 7/237 (2%)
 Frame = -1

Query: 998 QIQGLSKSW----NPAKQFSVHVVKEGETLTSISMQYGVSMQXXXXXXXXXXXXXXXLEG 831
           + QGL K W    N       H+VKEGETL+SIS +YGVS+                 +G
Sbjct: 44  RFQGLIKKWRLQNNSKDYICAHLVKEGETLSSISKKYGVSVYSIAAANKDIVDIHLVFKG 103

Query: 830 QRLNIPPSAKEAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTLLSSQHIQ-LAK 654
           Q LNIP S+ +  +  A  ++L H         S        ++ I+++++S  +   AK
Sbjct: 104 QLLNIPASSLKETL-LAKKSRLWH---------SIRAFRTPSHKIIYSMVTSHGLSNQAK 153

Query: 653 PTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVL--IRE 480
            TGYFLVLVPLIA CIRCII TF  R+ + ++HQAV++SK HH  ++S RWKS L    E
Sbjct: 154 ATGYFLVLVPLIAFCIRCIISTFRIRVARDMRHQAVDKSKGHHPGAKSMRWKSALSDTEE 213

Query: 479 SDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGS 309
           SD  D++S  D  +PSED++ + +++ SHAY++L+  Y KFLSECGMSKWGYWRGGS
Sbjct: 214 SDAFDSESGLDSNSPSEDEAYISYDEASHAYSRLQHDYEKFLSECGMSKWGYWRGGS 270


>KJB81204.1 hypothetical protein B456_013G136900 [Gossypium raimondii]
          Length = 273

 Score =  184 bits (468), Expect = 1e-51
 Identities = 118/294 (40%), Positives = 162/294 (55%), Gaps = 7/294 (2%)
 Frame = -1

Query: 1169 MEPKLNQKKEIGMMRSTVRRGTASNITSSPI----LSINPLKNKKLLTNRFRALSQRWNF 1002
            ME KL  KK I   +S +     SN+T +      L + P    ++    F+ L ++W  
Sbjct: 1    MEVKLTHKKHISPSKSLL-----SNLTKTTFPPHTLRLKPWAAAEI--QHFQGLVKKW-- 51

Query: 1001 IQIQGLSKSWNPAKQFSVHVVKEGETLTSISMQYGVSMQXXXXXXXXXXXXXXXLEGQRL 822
             ++Q  +K ++ A     HVVKEGETL+SIS  YGVS+                  GQ L
Sbjct: 52   -RLQNKTKDYSCA-----HVVKEGETLSSISKMYGVSVHSIAAANKNIVDINLVFRGQLL 105

Query: 821  NIPPSAK-EAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTLLSSQHIQLAKPTG 645
            NIP S+  + Q+D A  ++L           S   L A   Q+ FT++++  +  AK TG
Sbjct: 106  NIPSSSLLDTQLDRAKKSRL---------WQSIRALKAPSGQKFFTMITAHCLSNAKSTG 156

Query: 644  YFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVLIR--ESDV 471
            YFLVLVPLIA CI CII T H R+ + +KHQA +ES+ HH  ++  RWKS L    E DV
Sbjct: 157  YFLVLVPLIAFCIGCIIVTLHTRVSRSIKHQAADESQAHHPGAKGRRWKSALSDSVEGDV 216

Query: 470  ADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGS 309
             D++   D  + SED++ +  E+ S  Y +LE  Y KFLSECG+SKWGYWRGGS
Sbjct: 217  FDSELGLDSNSTSEDEANIQNEEASKDYGRLEHDYQKFLSECGISKWGYWRGGS 270


>XP_016703596.1 PREDICTED: uncharacterized protein LOC107918531 isoform X3
           [Gossypium hirsutum]
          Length = 274

 Score =  184 bits (467), Expect = 2e-51
 Identities = 109/236 (46%), Positives = 139/236 (58%), Gaps = 8/236 (3%)
 Frame = -1

Query: 992 QGLSKSW---NPAKQFS-VHVVKEGETLTSISMQYGVSMQXXXXXXXXXXXXXXXLEGQR 825
           QGL K W   N  K +S  HVVKEGETL+SIS  YGVS+                 +GQ 
Sbjct: 45  QGLVKKWRLQNKTKDYSCAHVVKEGETLSSISKMYGVSVHSIAAANKNIVDINLVFQGQL 104

Query: 824 LNIPPSAK-EAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTLLSSQHIQ-LAKP 651
           LNIP S+  + Q+D A  ++L           S   L A   Q+ FT+++S  +   AK 
Sbjct: 105 LNIPSSSLLDTQLDRAKKSRL---------WQSIRALKAPSGQKFFTMITSHCLSNQAKS 155

Query: 650 TGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVLIR--ES 477
           TGYFLVLVPLIA CI CIIGT H R+ + +KHQA +ES+ HH  ++  RWKS L    E 
Sbjct: 156 TGYFLVLVPLIAFCIGCIIGTLHTRVSRSIKHQAADESQAHHPGAKGRRWKSALSDSVEG 215

Query: 476 DVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGS 309
           DV D++   D  + SED++ +  E+ S  Y +LE  Y KFLSECG+SKWGYWRGGS
Sbjct: 216 DVFDSELGLDSNSTSEDEANIQNEEASEDYGRLEHDYQKFLSECGISKWGYWRGGS 271


>XP_017981653.1 PREDICTED: uncharacterized protein LOC18592837 isoform X2
           [Theobroma cacao]
          Length = 276

 Score =  182 bits (462), Expect = 9e-51
 Identities = 104/240 (43%), Positives = 144/240 (60%), Gaps = 10/240 (4%)
 Frame = -1

Query: 998 QIQGLSKSW----NPAKQFSVHVVKE----GETLTSISMQYGVSMQXXXXXXXXXXXXXX 843
           + QGL K W    N       H+VK+    GETL+SIS +YGVS+               
Sbjct: 44  RFQGLIKKWRLQNNSKDYICAHLVKDFKCRGETLSSISKKYGVSVYSIAAANKDIVDIHL 103

Query: 842 XLEGQRLNIPPSAKEAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTLLSSQHIQ 663
             +GQ LNIP S+ +  +  A  ++L H         S        ++ I+++++S  + 
Sbjct: 104 VFKGQLLNIPASSLKETL-LAKKSRLWH---------SIRAFRTPSHKIIYSMVTSHGLS 153

Query: 662 LAKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVL-- 489
            AK TGYFLVLVPLIA CIRCII TF  R+ + ++HQAV++SK HH  ++S RWKS L  
Sbjct: 154 NAKATGYFLVLVPLIAFCIRCIISTFRIRVARDMRHQAVDKSKGHHPGAKSMRWKSALSD 213

Query: 488 IRESDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGS 309
             ESD  D++S  D  +PSED++ + +++ SHAY++L+  Y KFLSECGMSKWGYWRGGS
Sbjct: 214 TEESDAFDSESGLDSNSPSEDEAYISYDEASHAYSRLQHDYEKFLSECGMSKWGYWRGGS 273


>KCW69519.1 hypothetical protein EUGRSUZ_F02959 [Eucalyptus grandis]
          Length = 300

 Score =  182 bits (463), Expect = 1e-50
 Identities = 109/261 (41%), Positives = 150/261 (57%), Gaps = 3/261 (1%)
 Frame = -1

Query: 1085 SPILSINPLKNKKL-LTNRFRALSQRWNFIQIQGLSKSWNPAKQFSVHVVKEGETLTSIS 909
            +P  S+N   + +L     FR L+  W+F  +  + KS    ++ SVHVV+EGETL+SIS
Sbjct: 33   TPRFSLNSWASYQLSFRKHFRGLAGTWSF-HVSKILKSQEFTEECSVHVVREGETLSSIS 91

Query: 908  MQYGVSMQXXXXXXXXXXXXXXXLEGQRLNIPPSAKEAQMDSASTNQLDHCNIHERCQSS 729
             +YG+ +                 EGQ L IP S + A+  +     L   +I ++ +++
Sbjct: 92   KRYGIPIHPIVTLNKSIVDHELVYEGQVLKIPLSKRYAEK-TEKKKLLKKFDIPQKHRNA 150

Query: 728  SSILGAHENQRIFTLLSSQHIQLAKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQA 549
            S +L A   Q+ FT+L S  +  AK TGYFLVLVPLIA CIRCI      R+   L+ + 
Sbjct: 151  SKMLDAFSQQKSFTVLISHQLPYAKTTGYFLVLVPLIAFCIRCITSVLCTRVAGYLRRED 210

Query: 548  VNESKDHHQRSRSERWKSVLI--RESDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLE 375
              E K HHQ  +S+RWKS L+  +E DV+D +S+ D+  PSEDQ+Q    D   AY KLE
Sbjct: 211  NIEPKQHHQVPKSKRWKSALLDAKELDVSDPESKSDIGCPSEDQAQSSSGDTFQAYGKLE 270

Query: 374  PAYLKFLSECGMSKWGYWRGG 312
              Y KFLSECG+S  GYWRGG
Sbjct: 271  EDYQKFLSECGISNSGYWRGG 291


>XP_016703595.1 PREDICTED: uncharacterized protein LOC107918531 isoform X2
           [Gossypium hirsutum]
          Length = 277

 Score =  181 bits (460), Expect = 2e-50
 Identities = 108/239 (45%), Positives = 139/239 (58%), Gaps = 11/239 (4%)
 Frame = -1

Query: 992 QGLSKSW---NPAKQFS-VHVVKE----GETLTSISMQYGVSMQXXXXXXXXXXXXXXXL 837
           QGL K W   N  K +S  HVVK+    GETL+SIS  YGVS+                 
Sbjct: 45  QGLVKKWRLQNKTKDYSCAHVVKDFHCRGETLSSISKMYGVSVHSIAAANKNIVDINLVF 104

Query: 836 EGQRLNIPPSAK-EAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTLLSSQHIQL 660
           +GQ LNIP S+  + Q+D A  ++L           S   L A   Q+ FT+++S  +  
Sbjct: 105 QGQLLNIPSSSLLDTQLDRAKKSRL---------WQSIRALKAPSGQKFFTMITSHCLSN 155

Query: 659 AKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVLIR- 483
           AK TGYFLVLVPLIA CI CIIGT H R+ + +KHQA +ES+ HH  ++  RWKS L   
Sbjct: 156 AKSTGYFLVLVPLIAFCIGCIIGTLHTRVSRSIKHQAADESQAHHPGAKGRRWKSALSDS 215

Query: 482 -ESDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGS 309
            E DV D++   D  + SED++ +  E+ S  Y +LE  Y KFLSECG+SKWGYWRGGS
Sbjct: 216 VEGDVFDSELGLDSNSTSEDEANIQNEEASEDYGRLEHDYQKFLSECGISKWGYWRGGS 274


>KJB81208.1 hypothetical protein B456_013G136900 [Gossypium raimondii]
          Length = 274

 Score =  180 bits (457), Expect = 5e-50
 Identities = 118/295 (40%), Positives = 162/295 (54%), Gaps = 8/295 (2%)
 Frame = -1

Query: 1169 MEPKLNQKKEIGMMRSTVRRGTASNITSSPI----LSINPLKNKKLLTNRFRALSQRWNF 1002
            ME KL  KK I   +S +     SN+T +      L + P    ++    F+ L ++W  
Sbjct: 1    MEVKLTHKKHISPSKSLL-----SNLTKTTFPPHTLRLKPWAAAEI--QHFQGLVKKW-- 51

Query: 1001 IQIQGLSKSWNPAKQFSVHVVKEGETLTSISMQYGVSMQXXXXXXXXXXXXXXXLEGQRL 822
             ++Q  +K ++ A     HVVKEGETL+SIS  YGVS+                  GQ L
Sbjct: 52   -RLQNKTKDYSCA-----HVVKEGETLSSISKMYGVSVHSIAAANKNIVDINLVFRGQLL 105

Query: 821  NIPPSAK-EAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTLLSSQHIQ-LAKPT 648
            NIP S+  + Q+D A  ++L           S   L A   Q+ FT++++  +   AK T
Sbjct: 106  NIPSSSLLDTQLDRAKKSRL---------WQSIRALKAPSGQKFFTMITAHCLSNQAKST 156

Query: 647  GYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVLIR--ESD 474
            GYFLVLVPLIA CI CII T H R+ + +KHQA +ES+ HH  ++  RWKS L    E D
Sbjct: 157  GYFLVLVPLIAFCIGCIIVTLHTRVSRSIKHQAADESQAHHPGAKGRRWKSALSDSVEGD 216

Query: 473  VADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGS 309
            V D++   D  + SED++ +  E+ S  Y +LE  Y KFLSECG+SKWGYWRGGS
Sbjct: 217  VFDSELGLDSNSTSEDEANIQNEEASKDYGRLEHDYQKFLSECGISKWGYWRGGS 271


>XP_004250651.1 PREDICTED: uncharacterized protein LOC101253651 isoform X2 [Solanum
           lycopersicum]
          Length = 267

 Score =  180 bits (456), Expect = 5e-50
 Identities = 101/230 (43%), Positives = 136/230 (59%), Gaps = 6/230 (2%)
 Frame = -1

Query: 974 WNPAKQFSVHVVKEGETLTSISMQYGVSMQXXXXXXXXXXXXXXXLEGQRLNIPPSAKEA 795
           WN +KQF VHVVKE +TLTS+S  YGV +                 EGQ LNIP      
Sbjct: 49  WNSSKQFLVHVVKEDDTLTSLSKLYGVPIFEIAAANKEIIDVDLVFEGQHLNIPSYVTSY 108

Query: 794 QMDSASTNQLDHCNIHERCQSSSS----ILGAHENQRIFTLLSSQHIQLAKPTGYFLVLV 627
               + TNQ +  N+ +   S +S    + G+  NQ++  +LS +H+  AK +G+FLVLV
Sbjct: 109 ----SQTNQREKINLPKIEVSETSRHFKLCGSDINQKMLYVLSCRHLPYAKTSGHFLVLV 164

Query: 626 PLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVL--IRESDVADADSR 453
           PLI  CIRCI+  FHHR+ +       N+ +D HQ S S RWKS L  + + D   +DSR
Sbjct: 165 PLIGFCIRCIMNAFHHRVAR-------NKLQDAHQTSGSMRWKSALRDLTDPDALYSDSR 217

Query: 452 QDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGSLE 303
            +  N ++D+  +  E++SHAY KL+  Y KFLSECGMSKWGYWRGG+ E
Sbjct: 218 PETDNVTDDREHLQSEELSHAYAKLDGDYQKFLSECGMSKWGYWRGGTDE 267


>XP_015056897.1 PREDICTED: uncharacterized protein LOC107003144 isoform X2 [Solanum
           pennellii]
          Length = 267

 Score =  179 bits (453), Expect = 1e-49
 Identities = 100/230 (43%), Positives = 133/230 (57%), Gaps = 6/230 (2%)
 Frame = -1

Query: 974 WNPAKQFSVHVVKEGETLTSISMQYGVSMQXXXXXXXXXXXXXXXLEGQRLNIPPSAKEA 795
           WN +KQF VHVVKE +TLTS+S  YGV +                 EGQ LNIP      
Sbjct: 49  WNSSKQFLVHVVKEDDTLTSLSKLYGVPIFEIAAANKEIIDVDLVFEGQHLNIPSYVTSY 108

Query: 794 QMDSASTNQLDHCNIHE----RCQSSSSILGAHENQRIFTLLSSQHIQLAKPTGYFLVLV 627
               + TNQ +  N+ +         S + G+  NQ++  +LS +H+  AK +GYFLVLV
Sbjct: 109 ----SQTNQREKINLPKIEVSETSRHSKLCGSDINQKMLYVLSCRHLTYAKTSGYFLVLV 164

Query: 626 PLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVL--IRESDVADADSR 453
           PLI  CIRCI+  FHHR+ +       N+ +D HQ S S RWKS L  + + D   +DSR
Sbjct: 165 PLIGFCIRCIMNAFHHRVAR-------NKLQDVHQTSGSMRWKSALRDLTDPDALYSDSR 217

Query: 452 QDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGSLE 303
            +  N ++D+  +  E++S AY KL+  Y KFLSECGMSKWGYWRGG+ E
Sbjct: 218 PETDNVTDDREDLQSEELSRAYAKLDGDYQKFLSECGMSKWGYWRGGTDE 267


>XP_017617430.1 PREDICTED: uncharacterized protein LOC108461944 isoform X3 [Gossypium
            arboreum]
          Length = 274

 Score =  178 bits (451), Expect = 4e-49
 Identities = 117/292 (40%), Positives = 160/292 (54%), Gaps = 5/292 (1%)
 Frame = -1

Query: 1169 MEPKLNQKKEIGMMRSTVRRGTASNITSSP-ILSINPLKNKKLLTNRFRALSQRWNFIQI 993
            ME KL  KK I   +S  R       T  P  L + P    ++    F+ L ++W   ++
Sbjct: 1    MEVKLTHKKHISPSKS--RLSNLIKTTFPPHTLRLKPWAAAEI--QHFQGLVKKW---RL 53

Query: 992  QGLSKSWNPAKQFSVHVVKEGETLTSISMQYGVSMQXXXXXXXXXXXXXXXLEGQRLNIP 813
            Q  +K ++ A     HVVKEGETL+SIS  Y VS+                 +GQ LNIP
Sbjct: 54   QNKTKDYSCA-----HVVKEGETLSSISKVYEVSVHSIAAANKNIVDINLVFQGQLLNIP 108

Query: 812  PSAK-EAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTLLSSQHIQ-LAKPTGYF 639
             S+  + Q+D A  ++L           S   L A   Q+ FT+++S  +   AK TGYF
Sbjct: 109  SSSLLDTQLDRAKKSRL---------WQSIRALKAPSGQKFFTMITSHCLSNQAKSTGYF 159

Query: 638  LVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVL--IRESDVAD 465
            LVLVPLIA CI CII T H R+ + +KHQA ++S+ HH  ++  RWKS L  + E DV D
Sbjct: 160  LVLVPLIAFCIGCIISTLHTRVSRSIKHQAADKSQAHHPGAKGRRWKSALSDLVEGDVFD 219

Query: 464  ADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGS 309
            ++   D  + SED++ +  E+ S  Y +LE  Y KFLSECG+SKWGYWRGGS
Sbjct: 220  SELGLDSNSTSEDEANIQNEEASEDYGRLEHDYQKFLSECGISKWGYWRGGS 271


>XP_017981652.1 PREDICTED: uncharacterized protein LOC18592837 isoform X1
           [Theobroma cacao]
          Length = 277

 Score =  178 bits (451), Expect = 4e-49
 Identities = 104/241 (43%), Positives = 144/241 (59%), Gaps = 11/241 (4%)
 Frame = -1

Query: 998 QIQGLSKSW----NPAKQFSVHVVKE----GETLTSISMQYGVSMQXXXXXXXXXXXXXX 843
           + QGL K W    N       H+VK+    GETL+SIS +YGVS+               
Sbjct: 44  RFQGLIKKWRLQNNSKDYICAHLVKDFKCRGETLSSISKKYGVSVYSIAAANKDIVDIHL 103

Query: 842 XLEGQRLNIPPSAKEAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTLLSSQHIQ 663
             +GQ LNIP S+ +  +  A  ++L H         S        ++ I+++++S  + 
Sbjct: 104 VFKGQLLNIPASSLKETL-LAKKSRLWH---------SIRAFRTPSHKIIYSMVTSHGLS 153

Query: 662 -LAKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVL- 489
             AK TGYFLVLVPLIA CIRCII TF  R+ + ++HQAV++SK HH  ++S RWKS L 
Sbjct: 154 NQAKATGYFLVLVPLIAFCIRCIISTFRIRVARDMRHQAVDKSKGHHPGAKSMRWKSALS 213

Query: 488 -IRESDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGG 312
              ESD  D++S  D  +PSED++ + +++ SHAY++L+  Y KFLSECGMSKWGYWRGG
Sbjct: 214 DTEESDAFDSESGLDSNSPSEDEAYISYDEASHAYSRLQHDYEKFLSECGMSKWGYWRGG 273

Query: 311 S 309
           S
Sbjct: 274 S 274


>XP_011095893.1 PREDICTED: uncharacterized protein LOC105175222 [Sesamum indicum]
          Length = 268

 Score =  177 bits (449), Expect = 6e-49
 Identities = 110/253 (43%), Positives = 142/253 (56%), Gaps = 4/253 (1%)
 Frame = -1

Query: 1055 NKKLLTNRFRALSQRWNFIQIQGLS-KSWNPAKQFSVHVVKEGETLTSISMQYGVSMQXX 879
            +    T  F  L+QRW  + IQ ++ K  + +K+  VHVVK+GE LTSIS  YGV +   
Sbjct: 17   SSSFFTKHFTLLAQRWK-LHIQRIAWKDRDISKKILVHVVKDGENLTSISKLYGVPIHDI 75

Query: 878  XXXXXXXXXXXXXLEGQRLNIPP-SAKEAQMDSASTNQLDHCNIHERCQSSSSILGAHEN 702
                          EG+ LNIP  SA +AQ      ++     + +    S       + 
Sbjct: 76   AAVNKDIVDVDLVSEGKHLNIPSASAGDAQGCHFEGDKFHEHQLPKATPCSE--FNTRQW 133

Query: 701  QRIFTLLSSQHIQLAKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQ 522
             +I T+ SS  + LAK TG  LVLVPLIA CIRCIIG   +R+ + L+ QAVN+S     
Sbjct: 134  NQILTIPSSCRLPLAKRTGSVLVLVPLIAFCIRCIIGACQNRVARNLRDQAVNKSGMPRD 193

Query: 521  RSRSERWKSVL--IRESDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSE 348
            R  S RWK+VL  +RE D  DA+   D    SE+Q +V FE+I H+Y KLE  Y KFLSE
Sbjct: 194  RCNSVRWKTVLSELREPDALDAEPETDSDPFSEEQEEVHFEEIFHSYAKLEDDYQKFLSE 253

Query: 347  CGMSKWGYWRGGS 309
            CGMS WGYWRGGS
Sbjct: 254  CGMSNWGYWRGGS 266


>XP_016703594.1 PREDICTED: uncharacterized protein LOC107918531 isoform X1
           [Gossypium hirsutum]
          Length = 278

 Score =  177 bits (449), Expect = 8e-49
 Identities = 108/240 (45%), Positives = 139/240 (57%), Gaps = 12/240 (5%)
 Frame = -1

Query: 992 QGLSKSW---NPAKQFS-VHVVKE----GETLTSISMQYGVSMQXXXXXXXXXXXXXXXL 837
           QGL K W   N  K +S  HVVK+    GETL+SIS  YGVS+                 
Sbjct: 45  QGLVKKWRLQNKTKDYSCAHVVKDFHCRGETLSSISKMYGVSVHSIAAANKNIVDINLVF 104

Query: 836 EGQRLNIPPSAK-EAQMDSASTNQLDHCNIHERCQSSSSILGAHENQRIFTLLSSQHIQ- 663
           +GQ LNIP S+  + Q+D A  ++L           S   L A   Q+ FT+++S  +  
Sbjct: 105 QGQLLNIPSSSLLDTQLDRAKKSRL---------WQSIRALKAPSGQKFFTMITSHCLSN 155

Query: 662 LAKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQAVNESKDHHQRSRSERWKSVLIR 483
            AK TGYFLVLVPLIA CI CIIGT H R+ + +KHQA +ES+ HH  ++  RWKS L  
Sbjct: 156 QAKSTGYFLVLVPLIAFCIGCIIGTLHTRVSRSIKHQAADESQAHHPGAKGRRWKSALSD 215

Query: 482 --ESDVADADSRQDLVNPSEDQSQVPFEDISHAYTKLEPAYLKFLSECGMSKWGYWRGGS 309
             E DV D++   D  + SED++ +  E+ S  Y +LE  Y KFLSECG+SKWGYWRGGS
Sbjct: 216 SVEGDVFDSELGLDSNSTSEDEANIQNEEASEDYGRLEHDYQKFLSECGISKWGYWRGGS 275


>XP_010062380.1 PREDICTED: uncharacterized protein LOC104449808 isoform X3
            [Eucalyptus grandis] KCW69518.1 hypothetical protein
            EUGRSUZ_F02959 [Eucalyptus grandis]
          Length = 304

 Score =  178 bits (451), Expect = 8e-49
 Identities = 109/265 (41%), Positives = 150/265 (56%), Gaps = 7/265 (2%)
 Frame = -1

Query: 1085 SPILSINPLKNKKL-LTNRFRALSQRWNFIQIQGLSKSWNPAKQFSVHVVKEGETLTSIS 909
            +P  S+N   + +L     FR L+  W+F  +  + KS    ++ SVHVV+EGETL+SIS
Sbjct: 33   TPRFSLNSWASYQLSFRKHFRGLAGTWSF-HVSKILKSQEFTEECSVHVVREGETLSSIS 91

Query: 908  MQYGVSMQXXXXXXXXXXXXXXXLEGQRLNIPPSAKEAQMDSASTNQLDHCNIHERCQSS 729
             +YG+ +                 EGQ L IP S + A+  +     L   +I ++ +++
Sbjct: 92   KRYGIPIHPIVTLNKSIVDHELVYEGQVLKIPLSKRYAEK-TEKKKLLKKFDIPQKHRNA 150

Query: 728  SSILGAHENQRIFTLLSSQHIQLAKPTGYFLVLVPLIAICIRCIIGTFHHRLPKQLKHQA 549
            S +L A   Q+ FT+L S  +  AK TGYFLVLVPLIA CIRCI      R+   L+ + 
Sbjct: 151  SKMLDAFSQQKSFTVLISHQLPYAKTTGYFLVLVPLIAFCIRCITSVLCTRVAGYLRRED 210

Query: 548  VNESKDHHQRSRSERWKSVLI--RESDVADADSRQDLVN----PSEDQSQVPFEDISHAY 387
              E K HHQ  +S+RWKS L+  +E DV+D +S+ D+      PSEDQ+Q    D   AY
Sbjct: 211  NIEPKQHHQVPKSKRWKSALLDAKELDVSDPESKSDIGTFVQCPSEDQAQSSSGDTFQAY 270

Query: 386  TKLEPAYLKFLSECGMSKWGYWRGG 312
             KLE  Y KFLSECG+S  GYWRGG
Sbjct: 271  GKLEEDYQKFLSECGISNSGYWRGG 295


Top