BLASTX nr result

ID: Forsythia21_contig00030688 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00030688
         (1570 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011076116.1| PREDICTED: uncharacterized protein LOC105160...   582   e-163
ref|XP_012852159.1| PREDICTED: uncharacterized protein LOC105971...   561   e-157
ref|XP_002273113.2| PREDICTED: uncharacterized protein LOC100264...   546   e-152
ref|XP_010662689.1| PREDICTED: uncharacterized protein LOC100264...   539   e-150
ref|XP_006422011.1| hypothetical protein CICLE_v10004922mg [Citr...   528   e-147
ref|XP_004242762.1| PREDICTED: uncharacterized protein LOC101267...   526   e-146
ref|XP_006358300.1| PREDICTED: uncharacterized protein LOC102584...   521   e-145
ref|XP_009628873.1| PREDICTED: uncharacterized protein LOC104119...   520   e-144
ref|XP_007038793.1| Keratin-associated protein 5-4 [Theobroma ca...   518   e-144
emb|CDP05152.1| unnamed protein product [Coffea canephora]            516   e-143
ref|XP_010255250.1| PREDICTED: uncharacterized protein LOC104595...   513   e-142
ref|XP_008234847.1| PREDICTED: uncharacterized protein LOC103333...   513   e-142
ref|XP_011043859.1| PREDICTED: uncharacterized protein LOC105139...   512   e-142
ref|XP_010255249.1| PREDICTED: uncharacterized protein LOC104595...   512   e-142
ref|XP_002305244.2| hypothetical protein POPTR_0004s07750g [Popu...   511   e-142
ref|XP_012090197.1| PREDICTED: uncharacterized protein LOC105648...   509   e-141
ref|XP_002513660.1| conserved hypothetical protein [Ricinus comm...   509   e-141
gb|KHG26673.1| putative membrane yjcL [Gossypium arboreum]            506   e-140
ref|XP_012484138.1| PREDICTED: uncharacterized protein LOC105798...   506   e-140
ref|XP_007217974.1| hypothetical protein PRUPE_ppa005389mg [Prun...   505   e-140

>ref|XP_011076116.1| PREDICTED: uncharacterized protein LOC105160444 [Sesamum indicum]
          Length = 455

 Score =  582 bits (1501), Expect = e-163
 Identities = 326/455 (71%), Positives = 350/455 (76%), Gaps = 9/455 (1%)
 Frame = -1

Query: 1468 MASKLLSVLPTHYLPPPEYRPFIHSGRN--------IPLFQDSSRLTSYEYKGKVL-ISP 1316
            MA KLL   P +  PPP  R    S +          PL QD S LTS   K + L +SP
Sbjct: 1    MALKLLFSQPINCHPPPLQRSRFASHQKPSQIPTARSPLIQDFSLLTSSSNKDRSLNLSP 60

Query: 1315 FNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVST 1136
               PK  +RSV+A SQLNFPIISPQD WGTWTALFATGAFGIWSEKT+IGSALSGALVS 
Sbjct: 61   NTNPKNVARSVVAKSQLNFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSI 120

Query: 1135 LVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVA 956
            LVGLAASNLGIIASEAPAY VV+ F         LYRADMRR+IRSTGTLLLAFLLGSVA
Sbjct: 121  LVGLAASNLGIIASEAPAYKVVLEFLLPLAVPLLLYRADMRRIIRSTGTLLLAFLLGSVA 180

Query: 955  TTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAAD 776
            TT GT VAFL+VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEAL+V+PSVLAAGLAAD
Sbjct: 181  TTAGTAVAFLLVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALEVTPSVLAAGLAAD 240

Query: 775  NVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTA 596
            NVICA+YFTTLFALASKIPAE +TST+D  LNEES S NKLPVLQTATALA SF ICK+A
Sbjct: 241  NVICAIYFTTLFALASKIPAESATSTTDGGLNEESESSNKLPVLQTATALAVSFIICKSA 300

Query: 595  SFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSI 416
            SFLT Y GIQG +LP +TAIVVILAT  P QFAYLAPSGEAMA+ILMQVFF V+GASGSI
Sbjct: 301  SFLTNYLGIQGATLPTITAIVVILATMLPNQFAYLAPSGEAMALILMQVFFAVIGASGSI 360

Query: 415  RNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXX 236
            R+VI+TAPSIFLF LVQI VHLAIILG GKL RFDLKLLL+ASNANV             
Sbjct: 361  RSVISTAPSIFLFALVQIGVHLAIILGLGKLLRFDLKLLLLASNANVGGPTTACGMATAK 420

Query: 235  GWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131
            GWSSLVVP                   GQAVLKFM
Sbjct: 421  GWSSLVVPGILAGIFGIAIATFLGIAFGQAVLKFM 455


>ref|XP_012852159.1| PREDICTED: uncharacterized protein LOC105971807 [Erythranthe
            guttatus] gi|604306080|gb|EYU25137.1| hypothetical
            protein MIMGU_mgv1a006291mg [Erythranthe guttata]
          Length = 449

 Score =  561 bits (1447), Expect = e-157
 Identities = 311/455 (68%), Positives = 350/455 (76%), Gaps = 9/455 (1%)
 Frame = -1

Query: 1468 MASKLLSVLPTHYLPPPEYRPFIHSGRNI---------PLFQDSSRLTSYEYKGKVLISP 1316
            MA K+L   PT Y+PPP  R  I + RN          P FQ+S   T    K + L + 
Sbjct: 1    MAGKILLFHPT-YIPPPPARRSIVASRNAASQIPDTHTPSFQNSPLSTFSSDKFRTLKT- 58

Query: 1315 FNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVST 1136
              + +  +RSV+A SQLNFPIISP D WGTWTALFA GAFGIWSEKT+IGSALSGALVST
Sbjct: 59   --ISRNPARSVVARSQLNFPIISPHDQWGTWTALFAAGAFGIWSEKTKIGSALSGALVST 116

Query: 1135 LVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVA 956
            LVGLAASNLGIIASE  AYNVV+ F         LYRADMRRVI+STGTLLLAFLLGSVA
Sbjct: 117  LVGLAASNLGIIASETAAYNVVLEFLLPLAVPLLLYRADMRRVIKSTGTLLLAFLLGSVA 176

Query: 955  TTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAAD 776
            TT+GT+VA+ +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEAL VSPSVLAAGLAAD
Sbjct: 177  TTVGTLVAYFLVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALGVSPSVLAAGLAAD 236

Query: 775  NVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTA 596
            NVICA+YFTTLFALASKIP+E S+ T  +  NEES S NKLPVLQTATA+A SF ICK A
Sbjct: 237  NVICAIYFTTLFALASKIPSESSSPTPGI--NEESESDNKLPVLQTATAVAVSFIICKIA 294

Query: 595  SFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSI 416
            + LT++FGIQGG+LPA+TAIVV+LAT+FP QFAYLAPSGEAMA+ILMQVFF V+GASGSI
Sbjct: 295  TVLTKHFGIQGGTLPAITAIVVVLATSFPNQFAYLAPSGEAMALILMQVFFAVIGASGSI 354

Query: 415  RNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXX 236
            RNVITTAPSIFLF L+QI VHLA+ILG GKLFRFDL+LLL+ASNANV             
Sbjct: 355  RNVITTAPSIFLFALIQIGVHLAVILGLGKLFRFDLRLLLLASNANVGGPTTACGMATAK 414

Query: 235  GWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131
            GW+SL+VP                   GQAVL+FM
Sbjct: 415  GWTSLIVPGILAGIFGIAIATFLGIAFGQAVLRFM 449


>ref|XP_002273113.2| PREDICTED: uncharacterized protein LOC100264478 isoform X1 [Vitis
            vinifera] gi|302143806|emb|CBI22667.3| unnamed protein
            product [Vitis vinifera]
          Length = 449

 Score =  546 bits (1407), Expect = e-152
 Identities = 309/453 (68%), Positives = 338/453 (74%), Gaps = 7/453 (1%)
 Frame = -1

Query: 1468 MASKLLSVLPTHYLPPPEYRPFIHSGRNIPLFQDSSRLTS-----YEYKGKVLISPFNMP 1304
            MASK L++     +P    +P   S +N P    SS  T      +  K +  +SP   P
Sbjct: 1    MASKFLTLRAPLSIP----QPVTCSRQNFPTVSTSSCSTVEDPSLWRSKKQTQLSPLIFP 56

Query: 1303 KKSS--RSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLV 1130
            K S   RSV   S L FPIISPQD WGTWTALFATGAFGIWSEKT+IGSALSGALVSTLV
Sbjct: 57   KSSDSIRSVTVRSSLTFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLV 116

Query: 1129 GLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATT 950
            GLAASNLGII+ EAPAY+VV+ F         L+RAD+RRVI+STG LL+AFL+GSVATT
Sbjct: 117  GLAASNLGIISCEAPAYSVVLNFLLPLAVPLLLFRADLRRVIQSTGALLMAFLIGSVATT 176

Query: 949  IGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNV 770
            IGTVVAFLMVPM+SLGQD WKIAAALMGRHIGGAVNYVAISEAL VS SVLAAGLAADNV
Sbjct: 177  IGTVVAFLMVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVSRSVLAAGLAADNV 236

Query: 769  ICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASF 590
            ICAVYFTTLFALASKIP E STS +D  +NE+   GNK PVL TATALA SFAICK   F
Sbjct: 237  ICAVYFTTLFALASKIPPEDSTSANDTGMNEQPEPGNKPPVLLTATALAVSFAICKAGIF 296

Query: 589  LTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRN 410
            LT+YFGIQGGSLPA+TAIVVILAT FPKQF+ LAP+GE MAMILMQVFFTVVGASG+I N
Sbjct: 297  LTKYFGIQGGSLPAITAIVVILATAFPKQFSLLAPAGETMAMILMQVFFTVVGASGNIGN 356

Query: 409  VITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGW 230
            V+ TAPSIF+F LVQIAVHLA+ILG GKLFRFDLKLLLIASNANV             GW
Sbjct: 357  VMNTAPSIFMFALVQIAVHLAVILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKGW 416

Query: 229  SSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131
            SSLVVP                   G  VLKFM
Sbjct: 417  SSLVVPGILAGIFGIAIATFLGIVFGVTVLKFM 449


>ref|XP_010662689.1| PREDICTED: uncharacterized protein LOC100264478 isoform X2 [Vitis
            vinifera]
          Length = 447

 Score =  539 bits (1389), Expect = e-150
 Identities = 308/453 (67%), Positives = 337/453 (74%), Gaps = 7/453 (1%)
 Frame = -1

Query: 1468 MASKLLSVLPTHYLPPPEYRPFIHSGRNIPLFQDSSRLTS-----YEYKGKVLISPFNMP 1304
            MASK L++     +P    +P   S +N P    SS  T      +  K +  +SP   P
Sbjct: 1    MASKFLTLRAPLSIP----QPVTCSRQNFPTVSTSSCSTVEDPSLWRSKKQTQLSPLIFP 56

Query: 1303 KKSS--RSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLV 1130
            K S   RSV   S L FPIISPQD WGTWTALFATGAFGIWSEKT+IGSALSGALVSTLV
Sbjct: 57   KSSDSIRSVTVRSSLTFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLV 116

Query: 1129 GLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATT 950
            GLAASNLGII+ EAPAY+VV+ F         L+RAD+RRVI+STG LL+AFL+GSVATT
Sbjct: 117  GLAASNLGIISCEAPAYSVVLNFLLPLAVPLLLFRADLRRVIQSTGALLMAFLIGSVATT 176

Query: 949  IGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNV 770
            IGTVVAFLMVPM+SLGQD WKIAAALMGRHIGGAVNYVAISEAL VS SVLAAGLAADNV
Sbjct: 177  IGTVVAFLMVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVSRSVLAAGLAADNV 236

Query: 769  ICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASF 590
            ICAVYFTTLFALASKIP E STS +   +NE+   GNK PVL TATALA SFAICK   F
Sbjct: 237  ICAVYFTTLFALASKIPPEDSTSANG--MNEQPEPGNKPPVLLTATALAVSFAICKAGIF 294

Query: 589  LTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRN 410
            LT+YFGIQGGSLPA+TAIVVILAT FPKQF+ LAP+GE MAMILMQVFFTVVGASG+I N
Sbjct: 295  LTKYFGIQGGSLPAITAIVVILATAFPKQFSLLAPAGETMAMILMQVFFTVVGASGNIGN 354

Query: 409  VITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGW 230
            V+ TAPSIF+F LVQIAVHLA+ILG GKLFRFDLKLLLIASNANV             GW
Sbjct: 355  VMNTAPSIFMFALVQIAVHLAVILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKGW 414

Query: 229  SSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131
            SSLVVP                   G  VLKFM
Sbjct: 415  SSLVVPGILAGIFGIAIATFLGIVFGVTVLKFM 447


>ref|XP_006422011.1| hypothetical protein CICLE_v10004922mg [Citrus clementina]
            gi|568875109|ref|XP_006490652.1| PREDICTED:
            uncharacterized protein LOC102608862 [Citrus sinensis]
            gi|557523884|gb|ESR35251.1| hypothetical protein
            CICLE_v10004922mg [Citrus clementina]
          Length = 466

 Score =  528 bits (1359), Expect = e-147
 Identities = 279/393 (70%), Positives = 318/393 (80%), Gaps = 1/393 (0%)
 Frame = -1

Query: 1387 NIPLFQDSSRLTSYEYKGKVLISPFNMPKKSSRSVIASSQL-NFPIISPQDHWGTWTALF 1211
            +IP  Q S+   S+      L   F  P   +RSV A SQL NFP+ISP D WGTWTALF
Sbjct: 47   SIPQHQSSASYLSHSRTNTFLSPQFPHPSNRTRSVTARSQLPNFPLISPHDKWGTWTALF 106

Query: 1210 ATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXL 1031
            ATGAFGIWSE+T+IGSALSGALVSTL+GLAASNLG+++ E+PAY++V+ F         L
Sbjct: 107  ATGAFGIWSERTKIGSALSGALVSTLIGLAASNLGVVSCESPAYSIVLEFLLPLAVPLLL 166

Query: 1030 YRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGG 851
            +RAD+RRVI+STGTLLLAFL+GSVATT+GT +A+L+VPM+SLGQD WKIAAALMGRHIGG
Sbjct: 167  FRADLRRVIKSTGTLLLAFLIGSVATTVGTALAYLLVPMRSLGQDSWKIAAALMGRHIGG 226

Query: 850  AVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEES 671
            AVNYVAIS+AL VS SVLAAGLAADNVICAVYFTTLFALAS IPAE STS  DV +NE S
Sbjct: 227  AVNYVAISDALGVSSSVLAAGLAADNVICAVYFTTLFALASNIPAESSTSVDDVSMNEGS 286

Query: 670  GSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYL 491
              G+K PVLQ ATALA +FAICK  +FLT+YFGIQGGSLPA+TAIVV LATTFP QF  L
Sbjct: 287  VRGDKPPVLQFATALAVAFAICKAGTFLTKYFGIQGGSLPAITAIVVTLATTFPTQFNKL 346

Query: 490  APSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFD 311
            AP+GEAMA+ILMQVFFTVVGASG+I +VI TAPSIF+F LVQIA+HLA+ILG GKLFRFD
Sbjct: 347  APAGEAMALILMQVFFTVVGASGNIWSVINTAPSIFMFALVQIAIHLAVILGLGKLFRFD 406

Query: 310  LKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 212
             KLLLIASNANV             GWSSL+VP
Sbjct: 407  QKLLLIASNANVGGPTTACGMATAKGWSSLIVP 439


>ref|XP_004242762.1| PREDICTED: uncharacterized protein LOC101267717 [Solanum
            lycopersicum]
          Length = 462

 Score =  526 bits (1356), Expect = e-146
 Identities = 300/452 (66%), Positives = 334/452 (73%), Gaps = 6/452 (1%)
 Frame = -1

Query: 1468 MASKLLSVLPTHYLPPPEY----RPFIHSGRNIPLFQDSSRLTSYEYKGKVLISPFNMPK 1301
            MA K L  L   Y+P P      R    +  +  + Q    L+    K K L  P N  +
Sbjct: 13   MALKQLLFLHNPYIPSPASYSCRRKNASAATSSTVLQHPMLLSMNIDKFKPLDFPKNSTR 72

Query: 1300 KSSRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGL 1124
            K +RSV    SQLNFPIISPQD WGTWT LFATGAFGIWSEKT+IG+ALSG+LVS LVGL
Sbjct: 73   KLNRSVTTIRSQLNFPIISPQDQWGTWTVLFATGAFGIWSEKTKIGAALSGSLVSVLVGL 132

Query: 1123 AASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIG 944
            AASNLGIIASEAPAY +V  F         L+RADMRRV++STGTLL+AFLLGSVATTIG
Sbjct: 133  AASNLGIIASEAPAYKIVTGFLLPLAVPLLLFRADMRRVLKSTGTLLMAFLLGSVATTIG 192

Query: 943  TVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVIC 764
            TVVAF +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEALQ SPSV+A+GLAADN+IC
Sbjct: 193  TVVAFFIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQTSPSVVASGLAADNLIC 252

Query: 763  AVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLT 584
            AVYFTTLFALASKIPAE + S SD ++  ES SGNKLPVLQTATALA SFAICK    LT
Sbjct: 253  AVYFTTLFALASKIPAEAAQSVSDDKV--ESESGNKLPVLQTATALAVSFAICKAGELLT 310

Query: 583  RYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS-GSIRNV 407
            ++FGIQGG LP +TAIVVILAT+FP QFAYLAPSGEAMA+ILMQVFFT +GAS GSI NV
Sbjct: 311  KHFGIQGGLLPIITAIVVILATSFPSQFAYLAPSGEAMALILMQVFFTFIGASGGSISNV 370

Query: 406  ITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWS 227
            + TAPSIFLF L+QIAVHLA+ILG GKL R +LK LLIASNANV             GW 
Sbjct: 371  LNTAPSIFLFALIQIAVHLAVILGVGKLLRLELKELLIASNANVGGPTTACGMATAKGWI 430

Query: 226  SLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131
            SLVVP                   GQ VLKF+
Sbjct: 431  SLVVPGILAGIFGIAIATFLGIAFGQTVLKFI 462


>ref|XP_006358300.1| PREDICTED: uncharacterized protein LOC102584987 [Solanum tuberosum]
          Length = 453

 Score =  521 bits (1343), Expect = e-145
 Identities = 296/455 (65%), Positives = 333/455 (73%), Gaps = 9/455 (1%)
 Frame = -1

Query: 1468 MASKLLSVLPTHYLPPP-------EYRPFIHSGRNIPLFQDSSRLTSYEYKGKVLISPFN 1310
            MA K L  L   Y+P P       +      S  +  + Q    L+    K K L  P N
Sbjct: 1    MALKQLLFLHNPYIPSPASCSSRRKNASAATSSTSNSILQHPMLLSKDIDKFKPLDFPKN 60

Query: 1309 MPKKSSRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTL 1133
              +K +RSV    SQLNFPIISPQD WGTWT LFATGAFGIWSEKT++G+ALSG+LVS L
Sbjct: 61   STRKLNRSVTTIRSQLNFPIISPQDQWGTWTVLFATGAFGIWSEKTKVGAALSGSLVSVL 120

Query: 1132 VGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVAT 953
            VGLAASNLGIIASEAPAY +V  F         L+RADMRRV++STGTLLLAFLLGSVAT
Sbjct: 121  VGLAASNLGIIASEAPAYKIVTGFLLPLAVPLLLFRADMRRVLKSTGTLLLAFLLGSVAT 180

Query: 952  TIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADN 773
            TIGTVVAF +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEALQ SPSV+A+GLAADN
Sbjct: 181  TIGTVVAFCIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQTSPSVVASGLAADN 240

Query: 772  VICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTAS 593
            +ICAVYFTTLFAL SKIPAE + S +D +++ E  SGNKLPVLQTATALA SFAICK   
Sbjct: 241  LICAVYFTTLFALTSKIPAEATQSATDDKVDSE--SGNKLPVLQTATALAVSFAICKAGE 298

Query: 592  FLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS-GSI 416
             LT++FGIQGG LP +TAIVVILAT+FP QFAYLAPSGEAMA+ILMQVFFT +GAS GSI
Sbjct: 299  LLTKHFGIQGGLLPTITAIVVILATSFPSQFAYLAPSGEAMALILMQVFFTFIGASGGSI 358

Query: 415  RNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXX 236
             NV+ TAPSIFLF  +QIAVHLA+ILG GKL + +LK LLIASNANV             
Sbjct: 359  SNVLNTAPSIFLFAFIQIAVHLAVILGVGKLLQLELKELLIASNANVGGPTTACGMATAK 418

Query: 235  GWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131
            GW S+VVP                   GQAVLKFM
Sbjct: 419  GWISMVVPGILAGIFGIAIATFLGIAFGQAVLKFM 453


>ref|XP_009628873.1| PREDICTED: uncharacterized protein LOC104119150 [Nicotiana
            tomentosiformis]
          Length = 452

 Score =  520 bits (1339), Expect = e-144
 Identities = 296/458 (64%), Positives = 339/458 (74%), Gaps = 12/458 (2%)
 Frame = -1

Query: 1468 MASKLLSVLPTHYLPPPEYRPFIHSGRNIPLFQDSSRLTSYE--YKGKVLIS------PF 1313
            MASKL  +   +  PP  Y P     +N+P    +S +TS     +  +L+S      P 
Sbjct: 1    MASKLWFLHNLYIPPPASYSP---RRQNVPA---ASAITSANTILQHPMLLSNIDKYTPL 54

Query: 1312 NMPKKS---SRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGAL 1145
            + PK S   +RSV    SQLNFPIISPQD WGTWTALFATGAFGIWSEKT++G ALSGAL
Sbjct: 55   DFPKSSKKLNRSVTTIRSQLNFPIISPQDQWGTWTALFATGAFGIWSEKTKVGGALSGAL 114

Query: 1144 VSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLG 965
            VSTLVGLAASNLGIIA EAPAY +V  F         L+RADMRRV++STGTLLLAFLLG
Sbjct: 115  VSTLVGLAASNLGIIACEAPAYKIVTGFLLPLAVPLLLFRADMRRVLQSTGTLLLAFLLG 174

Query: 964  SVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGL 785
            SVATTIGTVVAF +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEAL+ SPSV+ AGL
Sbjct: 175  SVATTIGTVVAFWIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALETSPSVVTAGL 234

Query: 784  AADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAIC 605
            AADN+ICAVYFTTLFALASKIPAE + S ++ +++ ES SGN LPVLQ+ATALA SFAIC
Sbjct: 235  AADNLICAVYFTTLFALASKIPAEATPSAAEDKIDGESESGNTLPVLQSATALAVSFAIC 294

Query: 604  KTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS 425
            K   FLT++F IQGG+LP +TAIVVILAT+FP QFA LAPSGEAMA+ILMQVFF  +GA+
Sbjct: 295  KAGDFLTKHFVIQGGTLPIITAIVVILATSFPTQFADLAPSGEAMALILMQVFFAFIGAN 354

Query: 424  GSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXX 245
            GSI NV+ TAPSIF+F LVQI VHLA+ILG GKL RF+L+ LLIASNANV          
Sbjct: 355  GSILNVMNTAPSIFVFVLVQIGVHLAVILGVGKLLRFELEQLLIASNANVGGPTTACGMA 414

Query: 244  XXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131
               GW SLVVP                   GQ +LKFM
Sbjct: 415  TAKGWISLVVPGILAGIFGITIATFLGIAFGQVILKFM 452


>ref|XP_007038793.1| Keratin-associated protein 5-4 [Theobroma cacao]
            gi|508776038|gb|EOY23294.1| Keratin-associated protein
            5-4 [Theobroma cacao]
          Length = 466

 Score =  518 bits (1333), Expect = e-144
 Identities = 272/362 (75%), Positives = 303/362 (83%)
 Frame = -1

Query: 1297 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAA 1118
            ++R V   SQLNFP+ISP D WGTWTALFA GAFGIWSEKT+IGSALSGALVSTL+GLAA
Sbjct: 78   ANRPVTVKSQLNFPLISPNDQWGTWTALFAIGAFGIWSEKTKIGSALSGALVSTLIGLAA 137

Query: 1117 SNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTV 938
            SNLGII+ EA AY+ V+ F         L+RAD+RRVI+STG LLLAFLLGSVATT+GT 
Sbjct: 138  SNLGIISCEAKAYSTVLEFLLPLAVPLLLFRADLRRVIKSTGKLLLAFLLGSVATTVGTA 197

Query: 937  VAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 758
            +A+L+VPM++LGQD WKIAAALMGRHIGGAVNYVAIS AL VSPSVLAAGLAADNVICAV
Sbjct: 198  LAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAISNALGVSPSVLAAGLAADNVICAV 257

Query: 757  YFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRY 578
            YFTTLFALASK+P E STS  DV + E S SG+KLPVLQ ATALA SF+ICK  ++LT+Y
Sbjct: 258  YFTTLFALASKVPPETSTSPEDVAMVEGSESGSKLPVLQIATALAVSFSICKLGAYLTKY 317

Query: 577  FGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITT 398
            FGI GGSLPAVTAIVVILAT FP QF  LAP+GEAMA+ILMQVFFTVVGASG+I NVI T
Sbjct: 318  FGIPGGSLPAVTAIVVILATVFPTQFGRLAPAGEAMALILMQVFFTVVGASGNIWNVINT 377

Query: 397  APSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 218
            APSIF+F LVQIA+HLA+ILG GKLFRFDLKLLLIASNANV             GWSS+V
Sbjct: 378  APSIFMFALVQIAIHLALILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKGWSSMV 437

Query: 217  VP 212
            VP
Sbjct: 438  VP 439


>emb|CDP05152.1| unnamed protein product [Coffea canephora]
          Length = 459

 Score =  516 bits (1330), Expect = e-143
 Identities = 273/355 (76%), Positives = 300/355 (84%), Gaps = 1/355 (0%)
 Frame = -1

Query: 1273 SQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIAS 1094
            SQL++PIISPQDHWGTWTALFATGAFGIWSE+T+IGS LSGALVS LVGLAASNLGII  
Sbjct: 78   SQLSYPIISPQDHWGTWTALFATGAFGIWSERTKIGSTLSGALVSILVGLAASNLGIIPC 137

Query: 1093 EAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPM 914
            +APAY +V+           L+RAD+RRVI+STGTLLLAFLLGSVATT+GT VAFL+VPM
Sbjct: 138  DAPAYKIVLQILLPMAVPLLLFRADLRRVIQSTGTLLLAFLLGSVATTLGTAVAFLLVPM 197

Query: 913  QSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFAL 734
            +SLGQDGWKIAAALMGRHIGGAVNYVAISEAL V+PSVLAAGLAADNVICA+YFTTLFAL
Sbjct: 198  RSLGQDGWKIAAALMGRHIGGAVNYVAISEALGVTPSVLAAGLAADNVICAIYFTTLFAL 257

Query: 733  ASKIPAEPSTSTSDVELNEE-SGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGS 557
            AS IP E ST+T+D +   + S SGNKLPVL TATALA SFAICK  S   +YFGI GGS
Sbjct: 258  ASGIPPEASTATTDADAGYDISESGNKLPVLPTATALAVSFAICKAGSSFAKYFGISGGS 317

Query: 556  LPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLF 377
            LPA+TAIVVILAT FP+ FA+LAPSGEAMA+ILMQVFFTVVGASGS+ NVI TAPSI LF
Sbjct: 318  LPAITAIVVILATVFPRLFAHLAPSGEAMALILMQVFFTVVGASGSMWNVINTAPSILLF 377

Query: 376  CLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 212
             LVQIAVHLA+ILG GKLFRFDLKLLL+ASNANV             GWSSLVVP
Sbjct: 378  ALVQIAVHLAVILGLGKLFRFDLKLLLLASNANVGGPTTACGMATAKGWSSLVVP 432


>ref|XP_010255250.1| PREDICTED: uncharacterized protein LOC104595990 [Nelumbo nucifera]
          Length = 457

 Score =  513 bits (1321), Expect = e-142
 Identities = 267/378 (70%), Positives = 309/378 (81%), Gaps = 2/378 (0%)
 Frame = -1

Query: 1339 KGKVLISPFNMPKKSSR--SVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIG 1166
            + K LISP  +PK      S+   +QLNFP+ISP+DHWGTWTALFAT AFGIWSEKT+IG
Sbjct: 53   RSKTLISPLTIPKNHGPVPSLKTRAQLNFPLISPKDHWGTWTALFATSAFGIWSEKTKIG 112

Query: 1165 SALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTL 986
            SALSG+LVS LVGLAASN+GII+ EAPAY+VVM +         L+RAD+RRVI STGTL
Sbjct: 113  SALSGSLVSILVGLAASNIGIISCEAPAYSVVMEYLLPMAVPLLLFRADLRRVIMSTGTL 172

Query: 985  LLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSP 806
            LLAFLLGSVATTIGT+VA+L+VPM+SLGQD WKIAAALMGRHIGGAVNYVAISEAL V+P
Sbjct: 173  LLAFLLGSVATTIGTLVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVTP 232

Query: 805  SVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATAL 626
            SVLAAGLAADNVICA+YFT+LFALAS IP E S ST D  ++ +S  GNKLPVLQTA A+
Sbjct: 233  SVLAAGLAADNVICAIYFTSLFALASNIPPEASKSTEDGVIDAKSEPGNKLPVLQTAIAI 292

Query: 625  AASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVF 446
            A SF+ICKTA++LT+  GIQGGSLP +TA+VVILAT FP QF YLAP+GEA+A+ILMQVF
Sbjct: 293  AVSFSICKTATYLTKLLGIQGGSLPCITALVVILATIFPAQFGYLAPAGEAVALILMQVF 352

Query: 445  FTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXX 266
            F VVGA+GSI NVI TAPS+F+F L+QI +HLA+ILG GKL RFD KLLL+ASNANV   
Sbjct: 353  FAVVGANGSIWNVINTAPSVFMFALLQITIHLAVILGVGKLLRFDQKLLLLASNANVGGP 412

Query: 265  XXXXXXXXXXGWSSLVVP 212
                      GW SLV+P
Sbjct: 413  TTACGMATAKGWGSLVIP 430


>ref|XP_008234847.1| PREDICTED: uncharacterized protein LOC103333733 isoform X1 [Prunus
            mume]
          Length = 463

 Score =  513 bits (1320), Expect = e-142
 Identities = 272/399 (68%), Positives = 312/399 (78%), Gaps = 1/399 (0%)
 Frame = -1

Query: 1324 ISPFNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEK-TEIGSALSGA 1148
            +SP   P    RSV    QLN P+IS  D WGTWTALFATGAFGIWSEK T++G+ALSGA
Sbjct: 65   LSPPAPPDLGDRSVAVRFQLNAPLISSHDQWGTWTALFATGAFGIWSEKNTKVGAALSGA 124

Query: 1147 LVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLL 968
            LVSTL+GLAASNLGII+S APA+++V+ F         LYRAD+RRVI+STG LLLAFLL
Sbjct: 125  LVSTLIGLAASNLGIISSNAPAFSIVLEFLLPLAVPLLLYRADLRRVIKSTGALLLAFLL 184

Query: 967  GSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAG 788
            GSVATT+GTVVA+L+VPM+SLGQD WKIAAALMGRHIGGAVNYVAI++AL VSPS+LAAG
Sbjct: 185  GSVATTVGTVVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIADALGVSPSILAAG 244

Query: 787  LAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAI 608
            LAADNVICAVYF+TLFALASK+P EPSTS   +E +  S  GNKLP++QTATAL+ S AI
Sbjct: 245  LAADNVICAVYFSTLFALASKVPPEPSTSDDGIEKDASSEPGNKLPLIQTATALSVSLAI 304

Query: 607  CKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGA 428
            CK+  +LT+YFGIQGG LPAVTAIVV LAT FPKQFAYLAP+GEAMA+ILMQVFF+VVGA
Sbjct: 305  CKSGHYLTKYFGIQGGILPAVTAIVVTLATVFPKQFAYLAPTGEAMAVILMQVFFSVVGA 364

Query: 427  SGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXX 248
            SG+I NVI TAPSIF F L+QIAVHLA+ILG GKL  FDLKLLLIASNANV         
Sbjct: 365  SGNIWNVINTAPSIFFFALIQIAVHLAVILGLGKLMGFDLKLLLIASNANVGGPTTACGM 424

Query: 247  XXXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131
                 W+S++VP                   G AVLK+M
Sbjct: 425  ATAKEWNSMIVPGILAGIFGIAIATFIGIAFGLAVLKYM 463


>ref|XP_011043859.1| PREDICTED: uncharacterized protein LOC105139195 isoform X1 [Populus
            euphratica] gi|743901093|ref|XP_011043860.1| PREDICTED:
            uncharacterized protein LOC105139195 isoform X1 [Populus
            euphratica]
          Length = 452

 Score =  512 bits (1319), Expect = e-142
 Identities = 284/422 (67%), Positives = 326/422 (77%), Gaps = 7/422 (1%)
 Frame = -1

Query: 1456 LLSVLPTHYLPP-PEYRPFIHSGRNIPLFQDSSR---LTSYEYKGKV-LISPFNMPK--K 1298
            + S LP  + P  P  RP   S +N P    +     L S  Y  +   +SP   P   +
Sbjct: 1    MASRLPLLHSPVVPFRRPCFVSRQNSPTTTANPTRRTLLSANYGNQTSFLSPQKNPNLIR 60

Query: 1297 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAA 1118
            SS +V ++  LNFP+ISP D WG WTALFATGAFGIWSE+T+IGSALSGALVSTLVGLAA
Sbjct: 61   SSVTVRSNMILNFPLISPTDPWGMWTALFATGAFGIWSERTKIGSALSGALVSTLVGLAA 120

Query: 1117 SNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTV 938
            SNLGII+ E+PAY+ V+ F         L+RAD+RRVI+STGTLLLAFLLGSVATT+GTV
Sbjct: 121  SNLGIISCESPAYSTVLKFLLPLAVPLLLFRADLRRVIQSTGTLLLAFLLGSVATTVGTV 180

Query: 937  VAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 758
            +A++MVPM++LGQD WKIAAALMGRHIGGAVNYVAIS+AL VSPSVLAAGLAADNVICAV
Sbjct: 181  LAYMMVPMRALGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAV 240

Query: 757  YFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRY 578
            YFT+LFALASKIPAE S S     ++  S SGNKLPVLQTATALA SFAICK   ++T++
Sbjct: 241  YFTSLFALASKIPAESSASIDGSGMDSGSESGNKLPVLQTATALAVSFAICKAGEYITKF 300

Query: 577  FGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITT 398
            F I GG LPAVTAIVVILAT FP QF +LAPSGEA+A+ILMQVFF VVGASG++RNVI T
Sbjct: 301  FAIPGGILPAVTAIVVILATAFPTQFNHLAPSGEALALILMQVFFAVVGASGNVRNVINT 360

Query: 397  APSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 218
            APSIF+F LVQIA+HLA+ILG GKLFRFD KLLLIASNANV             GWSSLV
Sbjct: 361  APSIFMFALVQIAIHLAVILGLGKLFRFDQKLLLIASNANVGGPTTACGMATAKGWSSLV 420

Query: 217  VP 212
            VP
Sbjct: 421  VP 422


>ref|XP_010255249.1| PREDICTED: uncharacterized protein LOC104595989 [Nelumbo nucifera]
          Length = 458

 Score =  512 bits (1318), Expect = e-142
 Identities = 269/377 (71%), Positives = 309/377 (81%), Gaps = 3/377 (0%)
 Frame = -1

Query: 1333 KVLISPFNMPKKS---SRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGS 1163
            K  +SP   PK +   +RSV   +QL+FP+ISP+DHWGTWTALF + AFGIWSEKT++GS
Sbjct: 55   KTFLSPSTFPKGNPDLNRSVKTKAQLSFPLISPKDHWGTWTALFVSSAFGIWSEKTKVGS 114

Query: 1162 ALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLL 983
            ALSGALVSTLVGL ASNLGII+ EAPAY++VM +         L+RAD+RRVI STGTLL
Sbjct: 115  ALSGALVSTLVGLGASNLGIISCEAPAYSLVMEYLLPMAVPLLLFRADLRRVILSTGTLL 174

Query: 982  LAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPS 803
             AFLLGSVATTIGT+VA+LMVPM+SLG D WKIAAALMGRHIGGAVNYVAISEAL VSPS
Sbjct: 175  SAFLLGSVATTIGTIVAYLMVPMRSLGHDNWKIAAALMGRHIGGAVNYVAISEALAVSPS 234

Query: 802  VLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALA 623
            VLAAGLAADNVICA+YFT+LFALAS+IP E +T T+D  ++ ES  GNKLPVLQTATALA
Sbjct: 235  VLAAGLAADNVICAIYFTSLFALASQIPPESTTPTNDDVIDTESQIGNKLPVLQTATALA 294

Query: 622  ASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFF 443
             SFAICKT ++L++  GIQGG+LP +TAIVVILAT FP QF YLAP+GEA+A+ILMQVFF
Sbjct: 295  VSFAICKTGTYLSKLLGIQGGNLPCITAIVVILATIFPAQFGYLAPAGEAVALILMQVFF 354

Query: 442  TVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXX 263
             VVGA+GSI NVI TAPSIF+F LVQIAVHLA+ILG GKL +FD KLLL+ASNANV    
Sbjct: 355  AVVGANGSIWNVINTAPSIFMFSLVQIAVHLAVILGVGKLMQFDQKLLLLASNANVGGPA 414

Query: 262  XXXXXXXXXGWSSLVVP 212
                     GW SLVVP
Sbjct: 415  TACGMASTKGWGSLVVP 431


>ref|XP_002305244.2| hypothetical protein POPTR_0004s07750g [Populus trichocarpa]
            gi|550340557|gb|EEE85755.2| hypothetical protein
            POPTR_0004s07750g [Populus trichocarpa]
          Length = 452

 Score =  511 bits (1315), Expect = e-142
 Identities = 271/373 (72%), Positives = 309/373 (82%), Gaps = 2/373 (0%)
 Frame = -1

Query: 1324 ISPFNMPK--KSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSG 1151
            +SP   P   +SS +V ++  LNFP+ISP D WG WTALFATGAFGIWSE+T+IGSALSG
Sbjct: 50   LSPQKNPNLIRSSVTVRSNLILNFPLISPTDPWGMWTALFATGAFGIWSERTKIGSALSG 109

Query: 1150 ALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFL 971
            ALVSTLVGLAASNLGII+ E+PAY++V+ F         L+RAD+RRVI+STGTLLLAFL
Sbjct: 110  ALVSTLVGLAASNLGIISCESPAYSIVLKFLLPLAVPLLLFRADLRRVIQSTGTLLLAFL 169

Query: 970  LGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAA 791
            LGSVATT+GTV+A++MVPM++LGQD WKIAAALMGRHIGGAVNYVAIS+AL+VSPSVLAA
Sbjct: 170  LGSVATTVGTVLAYMMVPMRALGQDSWKIAAALMGRHIGGAVNYVAISDALRVSPSVLAA 229

Query: 790  GLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFA 611
            GLAADNVICAVYFT+LFALASKIPAE S S     ++  S SGNKLPVLQTATALA SFA
Sbjct: 230  GLAADNVICAVYFTSLFALASKIPAESSASIDGSGMDSGSESGNKLPVLQTATALAVSFA 289

Query: 610  ICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVG 431
            ICK   ++T++F I GG LPAVTAIVVILAT FP QF +LAPSGEA+A+ILMQVFF VVG
Sbjct: 290  ICKAGEYITKFFAIPGGILPAVTAIVVILATAFPTQFNHLAPSGEALALILMQVFFAVVG 349

Query: 430  ASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXX 251
            ASG++ NVI TAPSIFLF LVQIA+HLA+ILG GKLFRFD KLLLIASNANV        
Sbjct: 350  ASGNVWNVINTAPSIFLFALVQIAIHLAVILGLGKLFRFDQKLLLIASNANVGGPTTACG 409

Query: 250  XXXXXGWSSLVVP 212
                 GWSSLVVP
Sbjct: 410  MATAKGWSSLVVP 422


>ref|XP_012090197.1| PREDICTED: uncharacterized protein LOC105648428 [Jatropha curcas]
            gi|643706105|gb|KDP22237.1| hypothetical protein
            JCGZ_26068 [Jatropha curcas]
          Length = 459

 Score =  509 bits (1310), Expect = e-141
 Identities = 275/392 (70%), Positives = 311/392 (79%), Gaps = 2/392 (0%)
 Frame = -1

Query: 1381 PLFQDSSRLTSYEYKGKVLISP--FNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFA 1208
            P  Q SS   S   +    +SP  +     S RSV   S LNFP+ISP D WGTWTALFA
Sbjct: 43   PALQSSS--ISLGNRSHTFLSPELYTEDSSSLRSVAVRSNLNFPLISPGDRWGTWTALFA 100

Query: 1207 TGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLY 1028
            TGAFGIWSEKT+IGSALSGALVSTLVGLAASNLGII+ E+PAY +V+ F         L+
Sbjct: 101  TGAFGIWSEKTKIGSALSGALVSTLVGLAASNLGIISCESPAYPIVLEFLLPLAVPLLLF 160

Query: 1027 RADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGA 848
            RAD+RRVI+STGTLLLAFL+GSVATT+GT+VA+ +VPM+SLGQD WKIAAALMGRHIGGA
Sbjct: 161  RADLRRVIQSTGTLLLAFLIGSVATTVGTLVAYWIVPMRSLGQDSWKIAAALMGRHIGGA 220

Query: 847  VNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESG 668
            VNYVAIS+AL VS SVLA+GLAADNVICAVYFTTLFALASKIP E S ST+D  +  E+ 
Sbjct: 221  VNYVAISDALGVSSSVLASGLAADNVICAVYFTTLFALASKIPPESSVSTNDGAIESETE 280

Query: 667  SGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLA 488
              +KLPVL+ ATA+A SFAICK  SF+T+ FGIQGG LPAVTAIVVILAT FP QF  LA
Sbjct: 281  PSDKLPVLKIATAIAVSFAICKAGSFVTKLFGIQGGILPAVTAIVVILATAFPTQFNQLA 340

Query: 487  PSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDL 308
            PSGEA+A+ILMQVFFTVVGASG+I +VI TAPSIF+F LVQI VHLA+ILG GKLFRFDL
Sbjct: 341  PSGEAIALILMQVFFTVVGASGNIWSVINTAPSIFMFALVQITVHLAVILGLGKLFRFDL 400

Query: 307  KLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 212
            KLLL+ASNANV             GW+SLVVP
Sbjct: 401  KLLLLASNANVGGPTTACGMATAKGWNSLVVP 432


>ref|XP_002513660.1| conserved hypothetical protein [Ricinus communis]
            gi|223547568|gb|EEF49063.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 965

 Score =  509 bits (1310), Expect = e-141
 Identities = 281/409 (68%), Positives = 316/409 (77%), Gaps = 3/409 (0%)
 Frame = -1

Query: 1429 LPPPEYRPFIHSGRNIPL-FQDSSRLTSYEYKGKVLISPFNMP--KKSSRSVIASSQLNF 1259
            + P  Y+ F    +  PL F  +    S   + +  +SP   P    S RS+   S LNF
Sbjct: 31   MSPQSYQSF----KIYPLHFHSNDNDNSNNNRNQTFLSPQLYPGDPSSRRSLAVRSNLNF 86

Query: 1258 PIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAY 1079
            P+IS  D WGTWTALFATGAFGIWSEKT+IGSALSGALVSTLVGLA SNLGII+ E+PAY
Sbjct: 87   PLISSNDRWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGLAGSNLGIISCESPAY 146

Query: 1078 NVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQ 899
             VV+ F         L+RAD+RRVIRSTGTLLLAFLLGSVATT+GTVVA+ +VPM+SLGQ
Sbjct: 147  AVVLEFLLPLAVPLLLFRADLRRVIRSTGTLLLAFLLGSVATTVGTVVAYWIVPMRSLGQ 206

Query: 898  DGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIP 719
            D WKIAAALMGRHIGGAVNYVAI++AL VS SVLA+GLAADNVICAVYFTTLFALASKIP
Sbjct: 207  DSWKIAAALMGRHIGGAVNYVAIADALGVSSSVLASGLAADNVICAVYFTTLFALASKIP 266

Query: 718  AEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTA 539
            AE STS+++  +   S SG KLPVLQ AT+LA S AICK  S++T+ FGIQGG LPAVTA
Sbjct: 267  AETSTSSNEDGMESGSVSGEKLPVLQLATSLAVSLAICKAGSYVTKLFGIQGGILPAVTA 326

Query: 538  IVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIA 359
            IVVILAT FP QF  LAPSGEAMA+ILMQVFFTVVGASG+I NV+ TAPSIF+F LVQIA
Sbjct: 327  IVVILATAFPTQFNGLAPSGEAMALILMQVFFTVVGASGNIWNVVKTAPSIFMFALVQIA 386

Query: 358  VHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 212
            VHL IILG GKLFRFD KLLL+ASNANV             GWSSLVVP
Sbjct: 387  VHLVIILGLGKLFRFDQKLLLLASNANVGGPTTACGMATAKGWSSLVVP 435



 Score =  470 bits (1209), Expect = e-129
 Identities = 254/410 (61%), Positives = 302/410 (73%), Gaps = 11/410 (2%)
 Frame = -1

Query: 1408 PFIHSGRNIPLFQDSSRLTSYEYKGKV--------LISPFNMPKKSS---RSVIASSQLN 1262
            P +HS  +  L   S  L+ +  + K+          SP  +   ++   R +   SQL 
Sbjct: 529  PLLHSSCSPSLRISSRHLSPFSSRHKLSHPNINEAAFSPSTISLNNTSLIRQIKLRSQLR 588

Query: 1261 FPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPA 1082
            FP+ISP DHWGTWTALFATGAFGIWSE T++GS +S ALVSTLVGLAASN+GII  E  A
Sbjct: 589  FPLISPDDHWGTWTALFATGAFGIWSEGTKVGSMVSAALVSTLVGLAASNIGIIPYETAA 648

Query: 1081 YNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLG 902
            Y++V+ F         L+RAD+R VIRSTG L LAFLLGSVAT IGT VAFLMVPM+SLG
Sbjct: 649  YSLVLEFLLPLTVPLLLFRADLRNVIRSTGKLFLAFLLGSVATIIGTTVAFLMVPMRSLG 708

Query: 901  QDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKI 722
             D WKIAAALMG +IGG+VNYVAISEAL  SPSV+AAG+AADNVICA YF  LFALASKI
Sbjct: 709  PDNWKIAAALMGSYIGGSVNYVAISEALGTSPSVVAAGIAADNVICATYFMALFALASKI 768

Query: 721  PAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVT 542
            PAE S ST+ VE++ ES S  K+PVLQ A ALA SF IC+TA++LT+   +QGG+LPA+T
Sbjct: 769  PAENSASTNGVEMDVESSSTGKIPVLQMAAALAISFMICRTATYLTQLCKVQGGNLPAIT 828

Query: 541  AIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQI 362
            AIVV LAT+FP QF  LAP+G+ +A++LMQVFF VVGASGSI NVI TAPSIFLF LVQ+
Sbjct: 829  AIVVFLATSFPVQFGRLAPAGDTIALVLMQVFFAVVGASGSIWNVIKTAPSIFLFALVQL 888

Query: 361  AVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 212
             VHLA++LG G+LF FDLKLLL+ASNAN+             GW SLVVP
Sbjct: 889  TVHLAVVLGLGRLFDFDLKLLLLASNANIGGPTTACGMATAKGWKSLVVP 938


>gb|KHG26673.1| putative membrane yjcL [Gossypium arboreum]
          Length = 464

 Score =  506 bits (1304), Expect = e-140
 Identities = 263/362 (72%), Positives = 302/362 (83%)
 Frame = -1

Query: 1297 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAA 1118
            ++R++I  SQLN P+ISP D WGTWTALFATGAFG+WSE T+ GSALSGALVSTL+GLAA
Sbjct: 76   ATRTLIVKSQLNSPLISPNDQWGTWTALFATGAFGLWSENTKAGSALSGALVSTLIGLAA 135

Query: 1117 SNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTV 938
            SNLGII+SEA AY++V  F         L+RAD+RRVI+STG LLLAFLLGSVATT+GT 
Sbjct: 136  SNLGIISSEAKAYSIVKEFLLPLAVPLLLFRADLRRVIKSTGKLLLAFLLGSVATTVGTA 195

Query: 937  VAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 758
            +A+L+VPM++LGQD WKIAAALMGRHIGGAVNYVAIS AL+ S SVLAAGLAADNVICAV
Sbjct: 196  LAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAISNALETSESVLAAGLAADNVICAV 255

Query: 757  YFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRY 578
            YFTTLFALASK+PAE STS  DV + E S S  KLPVL+ ATALA SFAICK  ++LT+Y
Sbjct: 256  YFTTLFALASKVPAETSTSPEDVAMGEGSISDGKLPVLKIATALAVSFAICKLGAYLTKY 315

Query: 577  FGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITT 398
            FGI GG LPAVTAIVVILAT FP QF +LAPSGEAMA+ILMQVFFTVVGASG+I +VI T
Sbjct: 316  FGIPGGILPAVTAIVVILATVFPAQFGHLAPSGEAMALILMQVFFTVVGASGNIWSVIRT 375

Query: 397  APSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 218
            APSIF+F LVQI++HLA+ILG GKLF+FDLKLLLIASNANV             GWSS++
Sbjct: 376  APSIFMFALVQISIHLALILGLGKLFKFDLKLLLIASNANVGGPTTASGMATAKGWSSMI 435

Query: 217  VP 212
            +P
Sbjct: 436  IP 437


>ref|XP_012484138.1| PREDICTED: uncharacterized protein LOC105798567 [Gossypium raimondii]
            gi|763766946|gb|KJB34161.1| hypothetical protein
            B456_006G051100 [Gossypium raimondii]
          Length = 464

 Score =  506 bits (1302), Expect = e-140
 Identities = 262/362 (72%), Positives = 301/362 (83%)
 Frame = -1

Query: 1297 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAA 1118
            ++R++I  SQLN P+ISP D WGTWTALFATGAFG+WSE T+ GSALSGALVSTL+GLAA
Sbjct: 76   ATRTLIVKSQLNCPLISPNDQWGTWTALFATGAFGLWSENTKAGSALSGALVSTLIGLAA 135

Query: 1117 SNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTV 938
            SNLGII+SEA  Y++V  F         L+RAD+RRVI+STG LLLAFLLGSVATT+GT 
Sbjct: 136  SNLGIISSEAKVYSIVKEFLLPLAVPLLLFRADLRRVIKSTGKLLLAFLLGSVATTVGTA 195

Query: 937  VAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 758
            +A+L+VPM++LGQD WKIAAALMGRHIGGAVNYVAIS AL+ S SVLAAGLAADNVICAV
Sbjct: 196  LAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAISNALETSESVLAAGLAADNVICAV 255

Query: 757  YFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRY 578
            YFTTLFALASK+PAE STS  DV + E S S  KLPVL+ ATALA SFAICK  ++LT+Y
Sbjct: 256  YFTTLFALASKVPAETSTSPEDVAMGEGSKSDGKLPVLKIATALAVSFAICKLGAYLTKY 315

Query: 577  FGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITT 398
            FGI GG LPAVTAIVVILAT FP QF +LAPSGEAMA+ILMQVFFTVVGASG+I +VI T
Sbjct: 316  FGIPGGILPAVTAIVVILATVFPTQFGHLAPSGEAMALILMQVFFTVVGASGNIWSVIRT 375

Query: 397  APSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 218
            APSIF+F LVQI++HLA+ILG GKLF+FDLKLLLIASNANV             GWSS++
Sbjct: 376  APSIFMFALVQISIHLALILGLGKLFKFDLKLLLIASNANVGGPTTASGMATAKGWSSMI 435

Query: 217  VP 212
            +P
Sbjct: 436  IP 437


>ref|XP_007217974.1| hypothetical protein PRUPE_ppa005389mg [Prunus persica]
            gi|462414436|gb|EMJ19173.1| hypothetical protein
            PRUPE_ppa005389mg [Prunus persica]
          Length = 463

 Score =  505 bits (1301), Expect = e-140
 Identities = 268/399 (67%), Positives = 308/399 (77%), Gaps = 1/399 (0%)
 Frame = -1

Query: 1324 ISPFNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEK-TEIGSALSGA 1148
            +SP   P    RSV    QLN P+IS  D WGTWTALFATGAFGIWSEK T++G+ALSGA
Sbjct: 65   LSPPAPPNLGDRSVAVRFQLNAPLISSHDQWGTWTALFATGAFGIWSEKNTKVGAALSGA 124

Query: 1147 LVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLL 968
            LVSTL+GLAASNLGII+S APA+++V+ F         LYRAD+RRVI+STG LLLAFLL
Sbjct: 125  LVSTLIGLAASNLGIISSNAPAFSIVLEFLLPLAVPLLLYRADLRRVIKSTGALLLAFLL 184

Query: 967  GSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAG 788
            GSVATT+GTVVA+L+VPM+SLGQD WKIAAALMGRHIGGAVNYVAI++AL VSPS+LAAG
Sbjct: 185  GSVATTVGTVVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIADALGVSPSILAAG 244

Query: 787  LAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAI 608
            LAADNVICAVYF+TLFALASK+P EPSTS   +  +  S  GNKLP++QTA AL+ S AI
Sbjct: 245  LAADNVICAVYFSTLFALASKVPPEPSTSDDGIRKDASSEPGNKLPLIQTAAALSVSLAI 304

Query: 607  CKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGA 428
            CK+  +LT+YFGIQGG LPAVTAIVV LAT FPKQFAYLAP+GEAMA+ILMQVFF VVGA
Sbjct: 305  CKSGHYLTKYFGIQGGILPAVTAIVVTLATVFPKQFAYLAPTGEAMAVILMQVFFAVVGA 364

Query: 427  SGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXX 248
            SG+I +VI TAPSIF F L+QIAVHL +ILG GKL  FDLKLLLIASNANV         
Sbjct: 365  SGNIWSVINTAPSIFFFALIQIAVHLVVILGLGKLLGFDLKLLLIASNANVGGPTTACGM 424

Query: 247  XXXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 131
                 W+S++VP                   G AVLK+M
Sbjct: 425  ATAKEWNSMIVPGILAGIFGIAIATFIGIAFGLAVLKYM 463

