BLASTX nr result

ID: Magnolia22_contig00005192 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Magnolia22_contig00005192
         (2276 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_010662514.2 PREDICTED: uncharacterized protein LOC104882152 [...   159   9e-38
CAN77034.1 hypothetical protein VITISV_009309 [Vitis vinifera]        159   2e-37
OAY60030.1 hypothetical protein MANES_01G080700 [Manihot esculenta]   154   5e-36
GAV88203.1 zf-met domain-containing protein [Cephalotus follicul...   152   3e-35
XP_016743421.1 PREDICTED: uncharacterized protein LOC107952765 i...   149   2e-34
XP_017650075.1 PREDICTED: uncharacterized protein LOC108489869 i...   149   2e-34
XP_016743420.1 PREDICTED: uncharacterized protein LOC107952765 i...   149   4e-34
XP_016745652.1 PREDICTED: uncharacterized protein LOC107954565 [...   149   6e-34
XP_017650066.1 PREDICTED: uncharacterized protein LOC108489869 i...   149   6e-34
KJB09922.1 hypothetical protein B456_001G175100 [Gossypium raimo...   145   1e-33
XP_018820146.1 PREDICTED: uncharacterized protein LOC108990593 [...   147   2e-33
XP_015580707.1 PREDICTED: general transcriptional corepressor CY...   146   2e-33
KHG02771.1 cdsA [Gossypium arboreum]                                  149   3e-33
XP_012483055.1 PREDICTED: uncharacterized protein LOC105797655 i...   145   3e-33
XP_015889413.1 PREDICTED: mediator of RNA polymerase II transcri...   147   3e-33
XP_012483048.1 PREDICTED: uncharacterized protein LOC105797655 i...   145   6e-33
EEF33447.1 conserved hypothetical protein [Ricinus communis]          146   7e-33
OMP00682.1 Zinc finger, U1-type [Corchorus olitorius]                 142   2e-32
XP_010258936.1 PREDICTED: glutenin, high molecular weight subuni...   145   3e-32
EOX96328.1 Uncharacterized protein TCM_005599 [Theobroma cacao]       144   4e-32

>XP_010662514.2 PREDICTED: uncharacterized protein LOC104882152 [Vitis vinifera]
          Length = 537

 Score =  159 bits (401), Expect = 9e-38
 Identities = 97/220 (44%), Positives = 126/220 (57%), Gaps = 15/220 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLL-----LESQAKSGE 1386
            P QVAWC+LC+VDC + EILEQHKNGKRHKKN+Q+IEE  K  NL       E   +S  
Sbjct: 169  PSQVAWCELCRVDCTSLEILEQHKNGKRHKKNLQRIEEL-KSANLTGTEIPNEPVGESKF 227

Query: 1387 QPLNFEGGEEKNEASAE-----GLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAG 1551
            QP   + GEE+++   E      LPS A  NEN+M  +Q+     Q E P  E   +  G
Sbjct: 228  QPEIAQEGEEESDEEGEENPEKNLPSEAIANENEMVGEQKNDIVEQPEKPMEERPDSQVG 287

Query: 1552 NSRVDGGFENXXXXXXXXXXSKFGRGGKRLR--QQPEPVAEAPKEQ---PKYCALCHVTC 1716
              R++  F+N           + GRGGKR++  + P    E PK +   P  C LC+V C
Sbjct: 288  KPRME-HFDN--WRHGMKRRMRGGRGGKRMKMFEAPRRSIEPPKPKVVIPLICDLCNVKC 344

Query: 1717 DTQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            DTQ VF+ HLSGKKH +++KR +G Q  Y  +GL ALY P
Sbjct: 345  DTQEVFDRHLSGKKHIAKLKRFEGHQAMYGPMGLQALYPP 384


>CAN77034.1 hypothetical protein VITISV_009309 [Vitis vinifera]
          Length = 618

 Score =  159 bits (401), Expect = 2e-37
 Identities = 97/220 (44%), Positives = 126/220 (57%), Gaps = 15/220 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLL-----LESQAKSGE 1386
            P QVAWC+LC+VDC + EILEQHKNGKRHKKN+Q+IEE  K  NL       E   +S  
Sbjct: 116  PSQVAWCELCRVDCTSLEILEQHKNGKRHKKNLQRIEEL-KSANLTGTEIPNEPVGESKF 174

Query: 1387 QPLNFEGGEEKNEASAE-----GLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAG 1551
            QP   + GEE+++   E      LPS A  NEN+M  +Q+     Q E P  E   +  G
Sbjct: 175  QPEIAQEGEEESDEEGEENPEKNLPSEAIANENEMVGEQKNDIVEQPEKPMEERPDSQVG 234

Query: 1552 NSRVDGGFENXXXXXXXXXXSKFGRGGKRLR--QQPEPVAEAPKEQ---PKYCALCHVTC 1716
              R++  F+N           + GRGGKR++  + P    E PK +   P  C LC+V C
Sbjct: 235  KPRME-HFDN--WRHGMKRRMRGGRGGKRMKMFEAPRRSIEPPKPKVVIPLICDLCNVKC 291

Query: 1717 DTQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            DTQ VF+ HLSGKKH +++KR +G Q  Y  +GL ALY P
Sbjct: 292  DTQEVFDRHLSGKKHIAKLKRFEGHQAMYGPMGLQALYPP 331


>OAY60030.1 hypothetical protein MANES_01G080700 [Manihot esculenta]
          Length = 563

 Score =  154 bits (389), Expect = 5e-36
 Identities = 99/264 (37%), Positives = 133/264 (50%), Gaps = 15/264 (5%)
 Frame = +1

Query: 1090 FPHHAPAAQPQYFEPAPYQEPSSXXXXXXXXXXXXXXXXXXXXXPPQVAWCDLCKVDCNT 1269
            F  H   +    FEP+  +E S+                     P Q+AWC+LC+VDC +
Sbjct: 234  FQPHGVISTSSNFEPSAAKEHSAHAAKKVVESASAPEKVAPNRRPVQIAWCELCRVDCTS 293

Query: 1270 QEILEQHKNGKRHKKNMQKIEE----------FQKHQNLLLESQAKSGEQPLNFEGGEEK 1419
             EILEQHKNGKRHKKNM +IEE           Q HQ  + + + +  +QP   E GEE 
Sbjct: 294  LEILEQHKNGKRHKKNMLRIEELKNGTKPADCIQNHQEPINDLKPEEPQQPPIVEDGEE- 352

Query: 1420 NEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSRVDGGFENXXXXXX 1599
             + SAE LPS A ++E  ME +  + T  + ++P  E +    G       F+N      
Sbjct: 353  -QKSAENLPSEARSDEYGMENNLHSNTGEKPKVPVVELS-GKQGRKPRKILFDN--RRRG 408

Query: 1600 XXXXSKFGRGGKRLR--QQPEPVAEAPKEQ---PKYCALCHVTCDTQAVFECHLSGKKHT 1764
                 K G GGKR++  +     AE PK +   P  C LC+V CDT+ V + HLSGKKH 
Sbjct: 409  IKRKMKGGHGGKRIKTHETQRTAAEPPKPKVVTPLLCDLCNVKCDTREVLDRHLSGKKHI 468

Query: 1765 SRVKRSQGPQGPYRALGLHALYMP 1836
            +++KR QG Q  Y   GL ALY P
Sbjct: 469  AKLKRFQGHQAIYGPTGLQALYPP 492


>GAV88203.1 zf-met domain-containing protein [Cephalotus follicularis]
          Length = 577

 Score =  152 bits (383), Expect = 3e-35
 Identities = 90/213 (42%), Positives = 128/213 (60%), Gaps = 8/213 (3%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQAKSGEQPLNF 1401
            PP+ AWC+LC+VDC + EILEQHKNGKRHKKNMQ+IEE +     ++E+Q  + ++P+  
Sbjct: 302  PPKSAWCELCRVDCTSLEILEQHKNGKRHKKNMQRIEELKIVVKPVVETQ--NQQKPVAI 359

Query: 1402 EGGE---EKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSRVDGG 1572
               E   +  + +AE + +    +E+K++ +QQ  T  QSE+ TAE +   AG +R D  
Sbjct: 360  SEPEVYHQTEKKAAESVFAEVVNDESKLDSEQQINTDKQSEV-TAESSKMQAGKARSD-R 417

Query: 1573 FENXXXXXXXXXXSKFGRGGKRLR-----QQPEPVAEAPKEQPKYCALCHVTCDTQAVFE 1737
            F+N           K G GGKR +     ++P    ++    P  C LC+V CDTQ VF+
Sbjct: 418  FDN--QRRGVKRKMKGGLGGKRKKTFEVPRRPNEPLKSKVVIPLICDLCNVKCDTQEVFD 475

Query: 1738 CHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
             HLSGKKH +++KR +G Q  Y  LGL ALY P
Sbjct: 476  RHLSGKKHMAKLKRFEGHQAMYGPLGLQALYPP 508


>XP_016743421.1 PREDICTED: uncharacterized protein LOC107952765 isoform X2 [Gossypium
            hirsutum] XP_016743422.1 PREDICTED: uncharacterized
            protein LOC107952765 isoform X2 [Gossypium hirsutum]
          Length = 556

 Score =  149 bits (376), Expect = 2e-34
 Identities = 87/219 (39%), Positives = 116/219 (52%), Gaps = 14/219 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA-------KS 1380
            PP++AWC+LC+VDCN  EILEQHKNGKRHKKN+Q  EE QK   ++   Q+         
Sbjct: 143  PPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVREELQKRNGVITGQQSVQVPNLGSE 202

Query: 1381 GEQPLNFEGGEEKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSR 1560
              Q +  EG EEK     + +PS AAT +NK E++QQ            +P  +T G + 
Sbjct: 203  IVQLVKVEGSEEKQH--QQMVPSLAATTDNKKEIEQQQDIVN-------KPEASTTGPAE 253

Query: 1561 VDGGFENXXXXXXXXXXSKF--GRGGKRLRQQ--PEPVAEAPKEQ---PKYCALCHVTCD 1719
                  N           K   GRGGK +++       +E PK +   P  C LC+V C+
Sbjct: 254  AKRNLRNPSEARGRGLKRKMRGGRGGKYVKRNEGSRRPSEPPKPKGGIPFMCELCNVKCE 313

Query: 1720 TQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            TQ VF CHL+GKKH + +KR  G +  Y   G+ ALY P
Sbjct: 314  TQVVFNCHLAGKKHIANMKRFHGHRALYGEAGVQALYPP 352


>XP_017650075.1 PREDICTED: uncharacterized protein LOC108489869 isoform X2 [Gossypium
            arboreum] XP_017650084.1 PREDICTED: uncharacterized
            protein LOC108489869 isoform X2 [Gossypium arboreum]
          Length = 544

 Score =  149 bits (375), Expect = 2e-34
 Identities = 87/219 (39%), Positives = 116/219 (52%), Gaps = 14/219 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA-------KS 1380
            PP++AWC+LC+VDCN  EILEQHKNGKRHKKN+Q  EE QK   ++   Q+         
Sbjct: 131  PPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVREELQKRNGVITGQQSVQVPNLGSE 190

Query: 1381 GEQPLNFEGGEEKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSR 1560
              Q +  EG EEK     + +PS AAT +NK E++QQ            +P  +T G + 
Sbjct: 191  IVQLVKVEGSEEKQH--QQMVPSLAATTDNKKEIEQQQDIVN-------KPEASTTGPAE 241

Query: 1561 VDGGFENXXXXXXXXXXSKF--GRGGKRLRQQ--PEPVAEAPKEQ---PKYCALCHVTCD 1719
                  N           K   GRGGK +++       +E PK +   P  C LC+V C+
Sbjct: 242  AKMNLRNPSEARGRGLKRKMRGGRGGKYVKRNEGSRRPSEPPKPKGGIPFMCELCNVKCE 301

Query: 1720 TQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            TQ VF CHL+GKKH + +KR  G +  Y   G+ ALY P
Sbjct: 302  TQVVFNCHLAGKKHIANMKRFHGHRALYGEAGVQALYPP 340


>XP_016743420.1 PREDICTED: uncharacterized protein LOC107952765 isoform X1 [Gossypium
            hirsutum]
          Length = 627

 Score =  149 bits (376), Expect = 4e-34
 Identities = 87/219 (39%), Positives = 116/219 (52%), Gaps = 14/219 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA-------KS 1380
            PP++AWC+LC+VDCN  EILEQHKNGKRHKKN+Q  EE QK   ++   Q+         
Sbjct: 214  PPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVREELQKRNGVITGQQSVQVPNLGSE 273

Query: 1381 GEQPLNFEGGEEKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSR 1560
              Q +  EG EEK     + +PS AAT +NK E++QQ            +P  +T G + 
Sbjct: 274  IVQLVKVEGSEEKQH--QQMVPSLAATTDNKKEIEQQQDIVN-------KPEASTTGPAE 324

Query: 1561 VDGGFENXXXXXXXXXXSKF--GRGGKRLRQQ--PEPVAEAPKEQ---PKYCALCHVTCD 1719
                  N           K   GRGGK +++       +E PK +   P  C LC+V C+
Sbjct: 325  AKRNLRNPSEARGRGLKRKMRGGRGGKYVKRNEGSRRPSEPPKPKGGIPFMCELCNVKCE 384

Query: 1720 TQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            TQ VF CHL+GKKH + +KR  G +  Y   G+ ALY P
Sbjct: 385  TQVVFNCHLAGKKHIANMKRFHGHRALYGEAGVQALYPP 423


>XP_016745652.1 PREDICTED: uncharacterized protein LOC107954565 [Gossypium hirsutum]
          Length = 621

 Score =  149 bits (375), Expect = 6e-34
 Identities = 88/219 (40%), Positives = 117/219 (53%), Gaps = 14/219 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA----KSGEQ 1389
            PP++AWC+LC+VDCN  EILEQHKNGKRHKKN+Q  EE QK   ++   Q+      G +
Sbjct: 214  PPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVREELQKRNGVITGQQSVQVPNLGSE 273

Query: 1390 PLNF---EGGEEKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSR 1560
             +     EG EEK     + +PS AAT +NK E +QQ            +P  +T G + 
Sbjct: 274  IVQLEKVEGSEEKQH--QQMVPSLAATTDNKKENEQQQDIVN-------KPEASTTGPAE 324

Query: 1561 VDGGFENXXXXXXXXXXSKF--GRGGKRLRQQ--PEPVAEAPKEQ---PKYCALCHVTCD 1719
              G   N           K   GRGGK +++       +E PK +   P  C LC+V C+
Sbjct: 325  AKGNLRNPSEARGRGLKRKMRGGRGGKYVKRNEGSRRPSEPPKPKGGIPFMCELCNVKCE 384

Query: 1720 TQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            TQ VF CHL+GKKH + +KR  G +  Y   G+ ALY P
Sbjct: 385  TQVVFNCHLAGKKHIANMKRFHGHRALYGEAGVQALYPP 423


>XP_017650066.1 PREDICTED: uncharacterized protein LOC108489869 isoform X1 [Gossypium
            arboreum]
          Length = 627

 Score =  149 bits (375), Expect = 6e-34
 Identities = 87/219 (39%), Positives = 116/219 (52%), Gaps = 14/219 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA-------KS 1380
            PP++AWC+LC+VDCN  EILEQHKNGKRHKKN+Q  EE QK   ++   Q+         
Sbjct: 214  PPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVREELQKRNGVITGQQSVQVPNLGSE 273

Query: 1381 GEQPLNFEGGEEKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSR 1560
              Q +  EG EEK     + +PS AAT +NK E++QQ            +P  +T G + 
Sbjct: 274  IVQLVKVEGSEEKQH--QQMVPSLAATTDNKKEIEQQQDIVN-------KPEASTTGPAE 324

Query: 1561 VDGGFENXXXXXXXXXXSKF--GRGGKRLRQQ--PEPVAEAPKEQ---PKYCALCHVTCD 1719
                  N           K   GRGGK +++       +E PK +   P  C LC+V C+
Sbjct: 325  AKMNLRNPSEARGRGLKRKMRGGRGGKYVKRNEGSRRPSEPPKPKGGIPFMCELCNVKCE 384

Query: 1720 TQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            TQ VF CHL+GKKH + +KR  G +  Y   G+ ALY P
Sbjct: 385  TQVVFNCHLAGKKHIANMKRFHGHRALYGEAGVQALYPP 423


>KJB09922.1 hypothetical protein B456_001G175100 [Gossypium raimondii]
          Length = 487

 Score =  145 bits (367), Expect = 1e-33
 Identities = 87/219 (39%), Positives = 116/219 (52%), Gaps = 14/219 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA----KSGEQ 1389
            PP++AWC+LC+VDCN  EILEQHKNGKRHKKN+Q  EE QK   ++   Q+      G +
Sbjct: 80   PPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVREELQKRNGVITGQQSVQVPNLGSE 139

Query: 1390 PLNF---EGGEEKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSR 1560
             +     EG EEK     + +PS AAT +NK E +QQ            +P  +T G + 
Sbjct: 140  IVQLEKVEGSEEKQH--QQMVPSLAATTDNKKENEQQQDIVN-------KPEASTTGPAE 190

Query: 1561 VDGGFENXXXXXXXXXXSKF--GRGGKRLRQQ--PEPVAEAPKEQ---PKYCALCHVTCD 1719
                  N           K   GRGGK +++       +E PK +   P  C LC+V C+
Sbjct: 191  AKRNLRNPSEARGRGLKRKMRGGRGGKYVKRNEGSRRPSEPPKPKGGIPFMCELCNVKCE 250

Query: 1720 TQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            TQ VF CHL+GKKH + +KR  G +  Y   G+ ALY P
Sbjct: 251  TQVVFNCHLAGKKHIANMKRFHGHRALYGEAGVQALYPP 289


>XP_018820146.1 PREDICTED: uncharacterized protein LOC108990593 [Juglans regia]
          Length = 586

 Score =  147 bits (370), Expect = 2e-33
 Identities = 88/219 (40%), Positives = 118/219 (53%), Gaps = 14/219 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQK-------HQNLLLESQAKS 1380
            PP++AWC+LC+VDCNT EILEQHKNGKRHKKN+QK EE Q+        QN+ + +    
Sbjct: 245  PPRMAWCELCRVDCNTLEILEQHKNGKRHKKNLQKHEELQRLNKVITGQQNVQMPNSELK 304

Query: 1381 GEQPLNFEGGEEKNEASA--EGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGN 1554
             E  +  E  EE  E     E L S A T +N+ E +QQ +  G SE+  AEP       
Sbjct: 305  PEVVVQSEKVEEYEEKQTPPENLTSEAVTGDNRNETEQQ-KDVGNSEV-LAEPEKKPKDQ 362

Query: 1555 SRVDGGFENXXXXXXXXXXSKFGRGGKRLR-----QQPEPVAEAPKEQPKYCALCHVTCD 1719
                G               + GRGGK +R     ++P   ++  +  P  C LC+V CD
Sbjct: 363  FAAQG--------RGLKRNMRGGRGGKYMRTYEGARRPVKTSKPKQAIPLMCELCNVKCD 414

Query: 1720 TQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            +Q VF+ HL+GKKH + +KR  G +  Y  +GL ALY P
Sbjct: 415  SQVVFDSHLTGKKHLANLKRFHGHRALYGEIGLQALYPP 453


>XP_015580707.1 PREDICTED: general transcriptional corepressor CYC8 [Ricinus
            communis]
          Length = 587

 Score =  146 bits (369), Expect = 2e-33
 Identities = 94/265 (35%), Positives = 134/265 (50%), Gaps = 16/265 (6%)
 Frame = +1

Query: 1090 FPHHAPAAQPQYFEPAPYQEPSSXXXXXXXXXXXXXXXXXXXXXPPQVAWCDLCKVDCNT 1269
            F  H  A+   Y EP+   E  +                     P Q+AWC+LC+VDC +
Sbjct: 261  FKPHGAASTSSYPEPSAANEDPAAVAVEAAQSAPAPEKLAPKHQPAQIAWCELCRVDCTS 320

Query: 1270 QEILEQHKNGKRHKKNMQKIEEFQK----------HQNLLLESQAKSGEQPLNFEGGEEK 1419
             E+LEQHKNGKRHKKN+ +IEE +K           Q  ++ ++ +  ++P   + GEE 
Sbjct: 321  VEVLEQHKNGKRHKKNLLRIEELKKAVKFGEEIKNDQETIINTKPEDYQEPQVAQDGEE- 379

Query: 1420 NEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSRVDGGFENXXXXXX 1599
             + +AE L   A  +E+ M  D Q  T  +SE+P  E +       +++  F+N      
Sbjct: 380  -QKTAENLTVEATNDEHIMVRDLQDNTGEKSEVPVEELSDQQGMKPKMN-LFDNRRRGMK 437

Query: 1600 XXXXSKFGRGGKRLRQQP---EPVAEAPKEQ---PKYCALCHVTCDTQAVFECHLSGKKH 1761
                 K GRGGKR++       PV E PK +   P  C LC+V CDT+ V + HLSGKKH
Sbjct: 438  RKI--KGGRGGKRMKTSETHRRPV-EPPKPKVVVPLICDLCNVKCDTREVLDRHLSGKKH 494

Query: 1762 TSRVKRSQGPQGPYRALGLHALYMP 1836
             +++KR +G Q  Y   GL ALY P
Sbjct: 495  IAKLKRFEGHQAIYGPTGLQALYPP 519


>KHG02771.1 cdsA [Gossypium arboreum]
          Length = 1001

 Score =  149 bits (375), Expect = 3e-33
 Identities = 87/219 (39%), Positives = 116/219 (52%), Gaps = 14/219 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA-------KS 1380
            PP++AWC+LC+VDCN  EILEQHKNGKRHKKN+Q  EE QK   ++   Q+         
Sbjct: 216  PPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVREELQKRNGVITGQQSVQVPNLGSE 275

Query: 1381 GEQPLNFEGGEEKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSR 1560
              Q +  EG EEK     + +PS AAT +NK E++QQ            +P  +T G + 
Sbjct: 276  IVQLVKVEGSEEKQH--QQMVPSLAATTDNKKEIEQQQDIVN-------KPEASTTGPAE 326

Query: 1561 VDGGFENXXXXXXXXXXSKF--GRGGKRLRQQ--PEPVAEAPKEQ---PKYCALCHVTCD 1719
                  N           K   GRGGK +++       +E PK +   P  C LC+V C+
Sbjct: 327  AKMNLRNPSEARGRGLKRKMRGGRGGKYVKRNEGSRRPSEPPKPKGGIPFMCELCNVKCE 386

Query: 1720 TQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            TQ VF CHL+GKKH + +KR  G +  Y   G+ ALY P
Sbjct: 387  TQVVFNCHLAGKKHIANMKRFHGHRALYGEAGVQALYPP 425


>XP_012483055.1 PREDICTED: uncharacterized protein LOC105797655 isoform X2 [Gossypium
            raimondii] XP_012483061.1 PREDICTED: uncharacterized
            protein LOC105797655 isoform X2 [Gossypium raimondii]
          Length = 550

 Score =  145 bits (367), Expect = 3e-33
 Identities = 87/219 (39%), Positives = 116/219 (52%), Gaps = 14/219 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA----KSGEQ 1389
            PP++AWC+LC+VDCN  EILEQHKNGKRHKKN+Q  EE QK   ++   Q+      G +
Sbjct: 143  PPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVREELQKRNGVITGQQSVQVPNLGSE 202

Query: 1390 PLNF---EGGEEKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSR 1560
             +     EG EEK     + +PS AAT +NK E +QQ            +P  +T G + 
Sbjct: 203  IVQLEKVEGSEEKQH--QQMVPSLAATTDNKKENEQQQDIVN-------KPEASTTGPAE 253

Query: 1561 VDGGFENXXXXXXXXXXSKF--GRGGKRLRQQ--PEPVAEAPKEQ---PKYCALCHVTCD 1719
                  N           K   GRGGK +++       +E PK +   P  C LC+V C+
Sbjct: 254  AKRNLRNPSEARGRGLKRKMRGGRGGKYVKRNEGSRRPSEPPKPKGGIPFMCELCNVKCE 313

Query: 1720 TQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            TQ VF CHL+GKKH + +KR  G +  Y   G+ ALY P
Sbjct: 314  TQVVFNCHLAGKKHIANMKRFHGHRALYGEAGVQALYPP 352


>XP_015889413.1 PREDICTED: mediator of RNA polymerase II transcription subunit
            12-like [Ziziphus jujuba]
          Length = 655

 Score =  147 bits (370), Expect = 3e-33
 Identities = 135/462 (29%), Positives = 174/462 (37%), Gaps = 57/462 (12%)
 Frame = +1

Query: 622  DPAAQHS-GHAAPPDASQVAGG-----QTQQNAYYP----QDPQSSYP------VPQGLN 753
            +P + H  G   PP +SQ A       Q QQNAYYP    +  Q S P         GLN
Sbjct: 83   EPTSIHPPGVPIPPASSQSAEPAQTHLQRQQNAYYPHGAVESRQQSVPGSDVAGTTGGLN 142

Query: 754  PXXXXXXXXLSQLQQFAGRMDAAERAM-VGIQEPPW---------PANNGGYGQMPGQPP 903
            P        +SQ   FAG MDA++ +M   I + P+         P   GG G    + P
Sbjct: 143  PAAAVAA--ISQFTHFAGSMDASQSSMHPPIGQTPYRGGGRRGSRPFRGGGRGHFGYRGP 200

Query: 904  AQ----------------------HGAVGVHPNVXXXXXXXXXXXXXXXXXXXXXXXXXX 1017
                                    HGAV  +PN                           
Sbjct: 201  RPDGSAASFHGRGRGQGGSRHFTVHGAVSANPN--------------------------- 233

Query: 1018 XXXXXXXXXXXXXXXXXXXXXXXXFPHHAPAAQPQYFEPAPYQEPSSXXXXXXXXXXXXX 1197
                                     P  AP   P    PAP+ +P               
Sbjct: 234  -----SASAPAEGIAALMQPPSASVPGQAPLPVPAQVPPAPFWQP--------------- 273

Query: 1198 XXXXXXXXPPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQAK 1377
                     P++AWC+LC+VDCNT EILEQHKNGKRHKKN++  EE QK   ++   Q +
Sbjct: 274  ---------PRMAWCELCRVDCNTLEILEQHKNGKRHKKNLKVHEELQKLNKVITGQQNE 324

Query: 1378 SGEQPLNFE--GGEEKNEASAEGLP------SNAATNENKMEVDQQTQTAGQSEIPTAEP 1533
                 +     G  EK E   E  P      S   TN+N  E +QQ  T   SE     P
Sbjct: 325  QMPNAVTKPDVGQSEKVEGLVENQPLPQKLASEIVTNDNVNETEQQNDTVDNSEASVEPP 384

Query: 1534 TVATAGNSRVDGGFENXXXXXXXXXXSKFGRGGKRLRQQPEPVAEAPKEQ-PKYCALCHV 1710
                    ++                 K+ R  + LR+  EP    PK+  P  C LC+V
Sbjct: 385  --EQKSRDQLSARRRGSKRKMRGRQGGKYMRANEGLRRPVEP--PKPKQVIPLICGLCNV 440

Query: 1711 TCDTQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
             C++Q VF+ HL+GKKH S +KR  G +  Y   GL ALY P
Sbjct: 441  KCESQVVFDSHLTGKKHLSNLKRFHGHRALYGEAGLQALYPP 482


>XP_012483048.1 PREDICTED: uncharacterized protein LOC105797655 isoform X1 [Gossypium
            raimondii] KJB09923.1 hypothetical protein
            B456_001G175100 [Gossypium raimondii]
          Length = 621

 Score =  145 bits (367), Expect = 6e-33
 Identities = 87/219 (39%), Positives = 116/219 (52%), Gaps = 14/219 (6%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA----KSGEQ 1389
            PP++AWC+LC+VDCN  EILEQHKNGKRHKKN+Q  EE QK   ++   Q+      G +
Sbjct: 214  PPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVREELQKRNGVITGQQSVQVPNLGSE 273

Query: 1390 PLNF---EGGEEKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSR 1560
             +     EG EEK     + +PS AAT +NK E +QQ            +P  +T G + 
Sbjct: 274  IVQLEKVEGSEEKQH--QQMVPSLAATTDNKKENEQQQDIVN-------KPEASTTGPAE 324

Query: 1561 VDGGFENXXXXXXXXXXSKF--GRGGKRLRQQ--PEPVAEAPKEQ---PKYCALCHVTCD 1719
                  N           K   GRGGK +++       +E PK +   P  C LC+V C+
Sbjct: 325  AKRNLRNPSEARGRGLKRKMRGGRGGKYVKRNEGSRRPSEPPKPKGGIPFMCELCNVKCE 384

Query: 1720 TQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            TQ VF CHL+GKKH + +KR  G +  Y   G+ ALY P
Sbjct: 385  TQVVFNCHLAGKKHIANMKRFHGHRALYGEAGVQALYPP 423


>EEF33447.1 conserved hypothetical protein [Ricinus communis]
          Length = 725

 Score =  146 bits (369), Expect = 7e-33
 Identities = 94/265 (35%), Positives = 134/265 (50%), Gaps = 16/265 (6%)
 Frame = +1

Query: 1090 FPHHAPAAQPQYFEPAPYQEPSSXXXXXXXXXXXXXXXXXXXXXPPQVAWCDLCKVDCNT 1269
            F  H  A+   Y EP+   E  +                     P Q+AWC+LC+VDC +
Sbjct: 399  FKPHGAASTSSYPEPSAANEDPAAVAVEAAQSAPAPEKLAPKHQPAQIAWCELCRVDCTS 458

Query: 1270 QEILEQHKNGKRHKKNMQKIEEFQK----------HQNLLLESQAKSGEQPLNFEGGEEK 1419
             E+LEQHKNGKRHKKN+ +IEE +K           Q  ++ ++ +  ++P   + GEE 
Sbjct: 459  VEVLEQHKNGKRHKKNLLRIEELKKAVKFGEEIKNDQETIINTKPEDYQEPQVAQDGEE- 517

Query: 1420 NEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSRVDGGFENXXXXXX 1599
             + +AE L   A  +E+ M  D Q  T  +SE+P  E +       +++  F+N      
Sbjct: 518  -QKTAENLTVEATNDEHIMVRDLQDNTGEKSEVPVEELSDQQGMKPKMN-LFDNRRRGMK 575

Query: 1600 XXXXSKFGRGGKRLRQQP---EPVAEAPKEQ---PKYCALCHVTCDTQAVFECHLSGKKH 1761
                 K GRGGKR++       PV E PK +   P  C LC+V CDT+ V + HLSGKKH
Sbjct: 576  RKI--KGGRGGKRMKTSETHRRPV-EPPKPKVVVPLICDLCNVKCDTREVLDRHLSGKKH 632

Query: 1762 TSRVKRSQGPQGPYRALGLHALYMP 1836
             +++KR +G Q  Y   GL ALY P
Sbjct: 633  IAKLKRFEGHQAIYGPTGLQALYPP 657


>OMP00682.1 Zinc finger, U1-type [Corchorus olitorius]
          Length = 514

 Score =  142 bits (359), Expect = 2e-32
 Identities = 85/217 (39%), Positives = 117/217 (53%), Gaps = 12/217 (5%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA----KSGEQ 1389
            PP++ WC+LC+VDCN  EILEQHK GKRH+KN+Q +E+ QK  N+  E Q+     SG +
Sbjct: 95   PPRMGWCELCRVDCNRPEILEQHKKGKRHQKNLQALEQQQKLNNVKTEQQSLQVPSSGSE 154

Query: 1390 PL---NFEGGEEKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSR 1560
             +     EG EEK +   E LP  + TNE+K E + Q      SE  T+    +  G  +
Sbjct: 155  VVQLKKLEGSEEKQQ-QQETLPPFSVTNESKNETEHQKDLVNTSEASTSH---SVEGKRK 210

Query: 1561 VDGGFENXXXXXXXXXXSKFGRGGKRLR--QQPEPVAEAPKEQ---PKYCALCHVTCDTQ 1725
            +    E            + GRGGK ++  + P    E PK +   P  C LC+V C++Q
Sbjct: 211  LKDPSE--AHGRGLKRKMRGGRGGKYMKSNEGPRRPVEPPKPKGGIPFMCELCNVKCESQ 268

Query: 1726 AVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
             VF  HL+GKKH + +KR  G +  Y   GL ALY P
Sbjct: 269  VVFNSHLAGKKHIANMKRFHGHRAVYGEAGLRALYPP 305


>XP_010258936.1 PREDICTED: glutenin, high molecular weight subunit DX5-like [Nelumbo
            nucifera]
          Length = 790

 Score =  145 bits (365), Expect = 3e-32
 Identities = 98/284 (34%), Positives = 129/284 (45%), Gaps = 79/284 (27%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLL-------------- 1359
            PPQ+AWC+LC+VDC + +ILEQHKNGKRHKKN+Q+IEE +  +  L              
Sbjct: 356  PPQIAWCELCRVDCTSLDILEQHKNGKRHKKNLQRIEELKNAKKPLPAIQVEQKPLPALK 415

Query: 1360 --------------------LESQAKSGEQPLNFEG------------------------ 1407
                                ++ +   G QP N EG                        
Sbjct: 416  SESMVQPASVQEIQPDNVQGMQPENIQGAQPENVEGLQADNMQGIQAEHVEGVQSENVLE 475

Query: 1408 --------GEEKNEASA--------EGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTV 1539
                    G+ +N   A        E LP+ A +NENK+E++QQ  +  Q    TA+   
Sbjct: 476  AGPDNVQEGQPENVQGAGESKPPIAENLPTEAPSNENKIEIEQQNSSVEQP--GTADVER 533

Query: 1540 ATAGNSRVDGGFENXXXXXXXXXXSKFGRGGKRLR--QQPEPVAEAPKEQ---PKYCALC 1704
            A A   R   G  +             GRGGKR R  + P    E PK +   P  C LC
Sbjct: 534  AEAPGKRQRFGRFDSRRRGIKRKMRGGGRGGKRTRTSEGPRKPVEPPKPKEVVPLICDLC 593

Query: 1705 HVTCDTQAVFECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            +V CDTQ VF+CHL+GKKH S++KR QG Q  +  +GL ALY P
Sbjct: 594  NVKCDTQTVFDCHLAGKKHLSKLKRFQGHQAMFGPVGLQALYPP 637



 Score = 90.5 bits (223), Expect = 5e-15
 Identities = 55/113 (48%), Positives = 57/113 (50%), Gaps = 11/113 (9%)
 Frame = +1

Query: 634 QHSGHAAPPDASQV-AGGQTQQNAYYPQD-------PQSSYPVPQGLNPXXXXXXXXLSQ 789
           Q  G        QV A  Q QQNAYYPQ          SSYPVP GLNP        LSQ
Sbjct: 135 QQQGQVHGQSQGQVQAHVQQQQNAYYPQGVGEQQQAANSSYPVPPGLNPAAAAAVAALSQ 194

Query: 790 LQQFAGRMDAAERAMVGIQEPPWPANNGGYGQMP---GQPPAQHGAVGVHPNV 939
           L  FAG MDAAERAM G+QE  W   NGGYG MP     P   HG   + P V
Sbjct: 195 LTHFAGTMDAAERAMAGLQERQWHTKNGGYGHMPPPGSMPYGPHGPFPMRPPV 247


>EOX96328.1 Uncharacterized protein TCM_005599 [Theobroma cacao]
          Length = 731

 Score =  144 bits (363), Expect = 4e-32
 Identities = 83/215 (38%), Positives = 116/215 (53%), Gaps = 10/215 (4%)
 Frame = +1

Query: 1222 PPQVAWCDLCKVDCNTQEILEQHKNGKRHKKNMQKIEEFQKHQNLLLESQA----KSGEQ 1389
            PP++AWC+LC+VDCN  EILEQHKNGKRHKKN+Q  EE QK   ++   Q+     SG +
Sbjct: 297  PPRMAWCELCRVDCNRPEILEQHKNGKRHKKNLQVHEELQKLNKVITGQQSVQVPNSGSE 356

Query: 1390 PLNFEGGE-EKNEASAEGLPSNAATNENKMEVDQQTQTAGQSEIPTAEPTVATAGNSRVD 1566
             +  E  E  + +   E  PS A TN++K E +QQ      SE  T +     +  ++  
Sbjct: 357  AVQLEKVEGSEGQHQQETSPSLAVTNDSKKETEQQKDIVNNSEASTTD-----SAKAKRK 411

Query: 1567 GGFENXXXXXXXXXXSKFGRGGKRLR--QQPEPVAEAPKEQ---PKYCALCHVTCDTQAV 1731
             G  +           + GRGGK ++  ++P    E PK +   P  C LC+V C++  V
Sbjct: 412  LGDASEARGRGFKRKMRGGRGGKYMKGNERPRRPVEPPKPKGGIPFMCELCNVKCESHVV 471

Query: 1732 FECHLSGKKHTSRVKRSQGPQGPYRALGLHALYMP 1836
            F  HL+GKKH + +KR  G +  Y   GL ALY P
Sbjct: 472  FNSHLAGKKHIANLKRFHGHRALYGEAGLQALYPP 506

