BLASTX nr result

ID: Forsythia23_contig00034041 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00034041
         (1589 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011076116.1| PREDICTED: uncharacterized protein LOC105160...   582   e-163
ref|XP_012852159.1| PREDICTED: uncharacterized protein LOC105971...   562   e-157
ref|XP_002273113.2| PREDICTED: uncharacterized protein LOC100264...   547   e-152
ref|XP_010662689.1| PREDICTED: uncharacterized protein LOC100264...   540   e-150
ref|XP_006422011.1| hypothetical protein CICLE_v10004922mg [Citr...   528   e-147
ref|XP_004242762.1| PREDICTED: uncharacterized protein LOC101267...   526   e-146
ref|XP_006358300.1| PREDICTED: uncharacterized protein LOC102584...   521   e-145
ref|XP_009628873.1| PREDICTED: uncharacterized protein LOC104119...   520   e-144
ref|XP_007038793.1| Keratin-associated protein 5-4 [Theobroma ca...   518   e-144
emb|CDP05152.1| unnamed protein product [Coffea canephora]            516   e-143
ref|XP_010255250.1| PREDICTED: uncharacterized protein LOC104595...   513   e-142
ref|XP_008234847.1| PREDICTED: uncharacterized protein LOC103333...   513   e-142
ref|XP_011043859.1| PREDICTED: uncharacterized protein LOC105139...   512   e-142
ref|XP_010255249.1| PREDICTED: uncharacterized protein LOC104595...   512   e-142
ref|XP_002305244.2| hypothetical protein POPTR_0004s07750g [Popu...   511   e-142
ref|XP_012090197.1| PREDICTED: uncharacterized protein LOC105648...   509   e-141
ref|XP_002513660.1| conserved hypothetical protein [Ricinus comm...   509   e-141
gb|KHG26673.1| putative membrane yjcL [Gossypium arboreum]            508   e-141
ref|XP_012484138.1| PREDICTED: uncharacterized protein LOC105798...   507   e-140
ref|XP_007217974.1| hypothetical protein PRUPE_ppa005389mg [Prun...   505   e-140

>ref|XP_011076116.1| PREDICTED: uncharacterized protein LOC105160444 [Sesamum indicum]
          Length = 455

 Score =  582 bits (1501), Expect = e-163
 Identities = 326/455 (71%), Positives = 350/455 (76%), Gaps = 9/455 (1%)
 Frame = -1

Query: 1472 MASKLLSVLPTHYLPPPEYRPFIHSGRN--------IPLFQDSSRLTSYKYKGKVL-ISP 1320
            MA KLL   P +  PPP  R    S +          PL QD S LTS   K + L +SP
Sbjct: 1    MALKLLFSQPINCHPPPLQRSRFASHQKPSQIPTARSPLIQDFSLLTSSSNKDRSLNLSP 60

Query: 1319 FNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVST 1140
               PK  +RSV+A SQLNFPIISPQD WGTWTALFATGAFGIWSEKT+IGSALSGALVS 
Sbjct: 61   NTNPKNVARSVVAKSQLNFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSI 120

Query: 1139 LVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVA 960
            LVGLAASNLGIIASEAPAY VV+ F         LYRADMRR+IRSTGTLLLAFLLGSVA
Sbjct: 121  LVGLAASNLGIIASEAPAYKVVLEFLLPLAVPLLLYRADMRRIIRSTGTLLLAFLLGSVA 180

Query: 959  TTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAAD 780
            TT GT VAFL+VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEAL+V+PSVLAAGLAAD
Sbjct: 181  TTAGTAVAFLLVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALEVTPSVLAAGLAAD 240

Query: 779  NVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTA 600
            NVICA+YFTTLFALASKIPAE +TST+D  LNEES S NKLPVLQTATALA SF ICK+A
Sbjct: 241  NVICAIYFTTLFALASKIPAESATSTTDGGLNEESESSNKLPVLQTATALAVSFIICKSA 300

Query: 599  SFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSI 420
            SFLT Y GIQG +LP +TAIVVILAT  P QFAYLAPSGEAMA+ILMQVFF V+GASGSI
Sbjct: 301  SFLTNYLGIQGATLPTITAIVVILATMLPNQFAYLAPSGEAMALILMQVFFAVIGASGSI 360

Query: 419  RNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXX 240
            R+VI+TAPSIFLF LVQI VHLAIILG GKL RFDLKLLL+ASNANV             
Sbjct: 361  RSVISTAPSIFLFALVQIGVHLAIILGLGKLLRFDLKLLLLASNANVGGPTTACGMATAK 420

Query: 239  GWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 135
            GWSSLVVP                   GQAVLKFM
Sbjct: 421  GWSSLVVPGILAGIFGIAIATFLGIAFGQAVLKFM 455


>ref|XP_012852159.1| PREDICTED: uncharacterized protein LOC105971807 [Erythranthe
            guttatus] gi|604306080|gb|EYU25137.1| hypothetical
            protein MIMGU_mgv1a006291mg [Erythranthe guttata]
          Length = 449

 Score =  562 bits (1449), Expect = e-157
 Identities = 313/457 (68%), Positives = 352/457 (77%), Gaps = 11/457 (2%)
 Frame = -1

Query: 1472 MASKLLSVLPTHYLPPPEYRPFIHSGRNI---------PLFQDS--SRLTSYKYKGKVLI 1326
            MA K+L   PT Y+PPP  R  I + RN          P FQ+S  S  +S K++    I
Sbjct: 1    MAGKILLFHPT-YIPPPPARRSIVASRNAASQIPDTHTPSFQNSPLSTFSSDKFRTLKTI 59

Query: 1325 SPFNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALV 1146
            S     +  +RSV+A SQLNFPIISP D WGTWTALFA GAFGIWSEKT+IGSALSGALV
Sbjct: 60   S-----RNPARSVVARSQLNFPIISPHDQWGTWTALFAAGAFGIWSEKTKIGSALSGALV 114

Query: 1145 STLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGS 966
            STLVGLAASNLGIIASE  AYNVV+ F         LYRADMRRVI+STGTLLLAFLLGS
Sbjct: 115  STLVGLAASNLGIIASETAAYNVVLEFLLPLAVPLLLYRADMRRVIKSTGTLLLAFLLGS 174

Query: 965  VATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLA 786
            VATT+GT+VA+ +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEAL VSPSVLAAGLA
Sbjct: 175  VATTVGTLVAYFLVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALGVSPSVLAAGLA 234

Query: 785  ADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICK 606
            ADNVICA+YFTTLFALASKIP+E S+ T  +  NEES S NKLPVLQTATA+A SF ICK
Sbjct: 235  ADNVICAIYFTTLFALASKIPSESSSPTPGI--NEESESDNKLPVLQTATAVAVSFIICK 292

Query: 605  TASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASG 426
             A+ LT++FGIQGG+LPA+TAIVV+LAT+FP QFAYLAPSGEAMA+ILMQVFF V+GASG
Sbjct: 293  IATVLTKHFGIQGGTLPAITAIVVVLATSFPNQFAYLAPSGEAMALILMQVFFAVIGASG 352

Query: 425  SIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXX 246
            SIRNVITTAPSIFLF L+QI VHLA+ILG GKLFRFDL+LLL+ASNANV           
Sbjct: 353  SIRNVITTAPSIFLFALIQIGVHLAVILGLGKLFRFDLRLLLLASNANVGGPTTACGMAT 412

Query: 245  XXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 135
              GW+SL+VP                   GQAVL+FM
Sbjct: 413  AKGWTSLIVPGILAGIFGIAIATFLGIAFGQAVLRFM 449


>ref|XP_002273113.2| PREDICTED: uncharacterized protein LOC100264478 isoform X1 [Vitis
            vinifera] gi|302143806|emb|CBI22667.3| unnamed protein
            product [Vitis vinifera]
          Length = 449

 Score =  547 bits (1409), Expect = e-152
 Identities = 309/453 (68%), Positives = 339/453 (74%), Gaps = 7/453 (1%)
 Frame = -1

Query: 1472 MASKLLSVLPTHYLPPPEYRPFIHSGRNIPLFQDSSRLTS-----YKYKGKVLISPFNMP 1308
            MASK L++     +P    +P   S +N P    SS  T      ++ K +  +SP   P
Sbjct: 1    MASKFLTLRAPLSIP----QPVTCSRQNFPTVSTSSCSTVEDPSLWRSKKQTQLSPLIFP 56

Query: 1307 KKSS--RSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLV 1134
            K S   RSV   S L FPIISPQD WGTWTALFATGAFGIWSEKT+IGSALSGALVSTLV
Sbjct: 57   KSSDSIRSVTVRSSLTFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLV 116

Query: 1133 GLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATT 954
            GLAASNLGII+ EAPAY+VV+ F         L+RAD+RRVI+STG LL+AFL+GSVATT
Sbjct: 117  GLAASNLGIISCEAPAYSVVLNFLLPLAVPLLLFRADLRRVIQSTGALLMAFLIGSVATT 176

Query: 953  IGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNV 774
            IGTVVAFLMVPM+SLGQD WKIAAALMGRHIGGAVNYVAISEAL VS SVLAAGLAADNV
Sbjct: 177  IGTVVAFLMVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVSRSVLAAGLAADNV 236

Query: 773  ICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASF 594
            ICAVYFTTLFALASKIP E STS +D  +NE+   GNK PVL TATALA SFAICK   F
Sbjct: 237  ICAVYFTTLFALASKIPPEDSTSANDTGMNEQPEPGNKPPVLLTATALAVSFAICKAGIF 296

Query: 593  LTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRN 414
            LT+YFGIQGGSLPA+TAIVVILAT FPKQF+ LAP+GE MAMILMQVFFTVVGASG+I N
Sbjct: 297  LTKYFGIQGGSLPAITAIVVILATAFPKQFSLLAPAGETMAMILMQVFFTVVGASGNIGN 356

Query: 413  VITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGW 234
            V+ TAPSIF+F LVQIAVHLA+ILG GKLFRFDLKLLLIASNANV             GW
Sbjct: 357  VMNTAPSIFMFALVQIAVHLAVILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKGW 416

Query: 233  SSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 135
            SSLVVP                   G  VLKFM
Sbjct: 417  SSLVVPGILAGIFGIAIATFLGIVFGVTVLKFM 449


>ref|XP_010662689.1| PREDICTED: uncharacterized protein LOC100264478 isoform X2 [Vitis
            vinifera]
          Length = 447

 Score =  540 bits (1391), Expect = e-150
 Identities = 308/453 (67%), Positives = 338/453 (74%), Gaps = 7/453 (1%)
 Frame = -1

Query: 1472 MASKLLSVLPTHYLPPPEYRPFIHSGRNIPLFQDSSRLTS-----YKYKGKVLISPFNMP 1308
            MASK L++     +P    +P   S +N P    SS  T      ++ K +  +SP   P
Sbjct: 1    MASKFLTLRAPLSIP----QPVTCSRQNFPTVSTSSCSTVEDPSLWRSKKQTQLSPLIFP 56

Query: 1307 KKSS--RSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLV 1134
            K S   RSV   S L FPIISPQD WGTWTALFATGAFGIWSEKT+IGSALSGALVSTLV
Sbjct: 57   KSSDSIRSVTVRSSLTFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLV 116

Query: 1133 GLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATT 954
            GLAASNLGII+ EAPAY+VV+ F         L+RAD+RRVI+STG LL+AFL+GSVATT
Sbjct: 117  GLAASNLGIISCEAPAYSVVLNFLLPLAVPLLLFRADLRRVIQSTGALLMAFLIGSVATT 176

Query: 953  IGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNV 774
            IGTVVAFLMVPM+SLGQD WKIAAALMGRHIGGAVNYVAISEAL VS SVLAAGLAADNV
Sbjct: 177  IGTVVAFLMVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVSRSVLAAGLAADNV 236

Query: 773  ICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASF 594
            ICAVYFTTLFALASKIP E STS +   +NE+   GNK PVL TATALA SFAICK   F
Sbjct: 237  ICAVYFTTLFALASKIPPEDSTSANG--MNEQPEPGNKPPVLLTATALAVSFAICKAGIF 294

Query: 593  LTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRN 414
            LT+YFGIQGGSLPA+TAIVVILAT FPKQF+ LAP+GE MAMILMQVFFTVVGASG+I N
Sbjct: 295  LTKYFGIQGGSLPAITAIVVILATAFPKQFSLLAPAGETMAMILMQVFFTVVGASGNIGN 354

Query: 413  VITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGW 234
            V+ TAPSIF+F LVQIAVHLA+ILG GKLFRFDLKLLLIASNANV             GW
Sbjct: 355  VMNTAPSIFMFALVQIAVHLAVILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKGW 414

Query: 233  SSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 135
            SSLVVP                   G  VLKFM
Sbjct: 415  SSLVVPGILAGIFGIAIATFLGIVFGVTVLKFM 447


>ref|XP_006422011.1| hypothetical protein CICLE_v10004922mg [Citrus clementina]
            gi|568875109|ref|XP_006490652.1| PREDICTED:
            uncharacterized protein LOC102608862 [Citrus sinensis]
            gi|557523884|gb|ESR35251.1| hypothetical protein
            CICLE_v10004922mg [Citrus clementina]
          Length = 466

 Score =  528 bits (1359), Expect = e-147
 Identities = 279/393 (70%), Positives = 318/393 (80%), Gaps = 1/393 (0%)
 Frame = -1

Query: 1391 NIPLFQDSSRLTSYKYKGKVLISPFNMPKKSSRSVIASSQL-NFPIISPQDHWGTWTALF 1215
            +IP  Q S+   S+      L   F  P   +RSV A SQL NFP+ISP D WGTWTALF
Sbjct: 47   SIPQHQSSASYLSHSRTNTFLSPQFPHPSNRTRSVTARSQLPNFPLISPHDKWGTWTALF 106

Query: 1214 ATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXL 1035
            ATGAFGIWSE+T+IGSALSGALVSTL+GLAASNLG+++ E+PAY++V+ F         L
Sbjct: 107  ATGAFGIWSERTKIGSALSGALVSTLIGLAASNLGVVSCESPAYSIVLEFLLPLAVPLLL 166

Query: 1034 YRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGG 855
            +RAD+RRVI+STGTLLLAFL+GSVATT+GT +A+L+VPM+SLGQD WKIAAALMGRHIGG
Sbjct: 167  FRADLRRVIKSTGTLLLAFLIGSVATTVGTALAYLLVPMRSLGQDSWKIAAALMGRHIGG 226

Query: 854  AVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEES 675
            AVNYVAIS+AL VS SVLAAGLAADNVICAVYFTTLFALAS IPAE STS  DV +NE S
Sbjct: 227  AVNYVAISDALGVSSSVLAAGLAADNVICAVYFTTLFALASNIPAESSTSVDDVSMNEGS 286

Query: 674  GSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYL 495
              G+K PVLQ ATALA +FAICK  +FLT+YFGIQGGSLPA+TAIVV LATTFP QF  L
Sbjct: 287  VRGDKPPVLQFATALAVAFAICKAGTFLTKYFGIQGGSLPAITAIVVTLATTFPTQFNKL 346

Query: 494  APSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFD 315
            AP+GEAMA+ILMQVFFTVVGASG+I +VI TAPSIF+F LVQIA+HLA+ILG GKLFRFD
Sbjct: 347  APAGEAMALILMQVFFTVVGASGNIWSVINTAPSIFMFALVQIAIHLAVILGLGKLFRFD 406

Query: 314  LKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 216
             KLLLIASNANV             GWSSL+VP
Sbjct: 407  QKLLLIASNANVGGPTTACGMATAKGWSSLIVP 439


>ref|XP_004242762.1| PREDICTED: uncharacterized protein LOC101267717 [Solanum
            lycopersicum]
          Length = 462

 Score =  526 bits (1356), Expect = e-146
 Identities = 300/452 (66%), Positives = 334/452 (73%), Gaps = 6/452 (1%)
 Frame = -1

Query: 1472 MASKLLSVLPTHYLPPPEY----RPFIHSGRNIPLFQDSSRLTSYKYKGKVLISPFNMPK 1305
            MA K L  L   Y+P P      R    +  +  + Q    L+    K K L  P N  +
Sbjct: 13   MALKQLLFLHNPYIPSPASYSCRRKNASAATSSTVLQHPMLLSMNIDKFKPLDFPKNSTR 72

Query: 1304 KSSRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGL 1128
            K +RSV    SQLNFPIISPQD WGTWT LFATGAFGIWSEKT+IG+ALSG+LVS LVGL
Sbjct: 73   KLNRSVTTIRSQLNFPIISPQDQWGTWTVLFATGAFGIWSEKTKIGAALSGSLVSVLVGL 132

Query: 1127 AASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIG 948
            AASNLGIIASEAPAY +V  F         L+RADMRRV++STGTLL+AFLLGSVATTIG
Sbjct: 133  AASNLGIIASEAPAYKIVTGFLLPLAVPLLLFRADMRRVLKSTGTLLMAFLLGSVATTIG 192

Query: 947  TVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVIC 768
            TVVAF +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEALQ SPSV+A+GLAADN+IC
Sbjct: 193  TVVAFFIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQTSPSVVASGLAADNLIC 252

Query: 767  AVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLT 588
            AVYFTTLFALASKIPAE + S SD ++  ES SGNKLPVLQTATALA SFAICK    LT
Sbjct: 253  AVYFTTLFALASKIPAEAAQSVSDDKV--ESESGNKLPVLQTATALAVSFAICKAGELLT 310

Query: 587  RYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS-GSIRNV 411
            ++FGIQGG LP +TAIVVILAT+FP QFAYLAPSGEAMA+ILMQVFFT +GAS GSI NV
Sbjct: 311  KHFGIQGGLLPIITAIVVILATSFPSQFAYLAPSGEAMALILMQVFFTFIGASGGSISNV 370

Query: 410  ITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWS 231
            + TAPSIFLF L+QIAVHLA+ILG GKL R +LK LLIASNANV             GW 
Sbjct: 371  LNTAPSIFLFALIQIAVHLAVILGVGKLLRLELKELLIASNANVGGPTTACGMATAKGWI 430

Query: 230  SLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 135
            SLVVP                   GQ VLKF+
Sbjct: 431  SLVVPGILAGIFGIAIATFLGIAFGQTVLKFI 462


>ref|XP_006358300.1| PREDICTED: uncharacterized protein LOC102584987 [Solanum tuberosum]
          Length = 453

 Score =  521 bits (1343), Expect = e-145
 Identities = 296/455 (65%), Positives = 333/455 (73%), Gaps = 9/455 (1%)
 Frame = -1

Query: 1472 MASKLLSVLPTHYLPPP-------EYRPFIHSGRNIPLFQDSSRLTSYKYKGKVLISPFN 1314
            MA K L  L   Y+P P       +      S  +  + Q    L+    K K L  P N
Sbjct: 1    MALKQLLFLHNPYIPSPASCSSRRKNASAATSSTSNSILQHPMLLSKDIDKFKPLDFPKN 60

Query: 1313 MPKKSSRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTL 1137
              +K +RSV    SQLNFPIISPQD WGTWT LFATGAFGIWSEKT++G+ALSG+LVS L
Sbjct: 61   STRKLNRSVTTIRSQLNFPIISPQDQWGTWTVLFATGAFGIWSEKTKVGAALSGSLVSVL 120

Query: 1136 VGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVAT 957
            VGLAASNLGIIASEAPAY +V  F         L+RADMRRV++STGTLLLAFLLGSVAT
Sbjct: 121  VGLAASNLGIIASEAPAYKIVTGFLLPLAVPLLLFRADMRRVLKSTGTLLLAFLLGSVAT 180

Query: 956  TIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADN 777
            TIGTVVAF +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEALQ SPSV+A+GLAADN
Sbjct: 181  TIGTVVAFCIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQTSPSVVASGLAADN 240

Query: 776  VICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTAS 597
            +ICAVYFTTLFAL SKIPAE + S +D +++ E  SGNKLPVLQTATALA SFAICK   
Sbjct: 241  LICAVYFTTLFALTSKIPAEATQSATDDKVDSE--SGNKLPVLQTATALAVSFAICKAGE 298

Query: 596  FLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS-GSI 420
             LT++FGIQGG LP +TAIVVILAT+FP QFAYLAPSGEAMA+ILMQVFFT +GAS GSI
Sbjct: 299  LLTKHFGIQGGLLPTITAIVVILATSFPSQFAYLAPSGEAMALILMQVFFTFIGASGGSI 358

Query: 419  RNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXX 240
             NV+ TAPSIFLF  +QIAVHLA+ILG GKL + +LK LLIASNANV             
Sbjct: 359  SNVLNTAPSIFLFAFIQIAVHLAVILGVGKLLQLELKELLIASNANVGGPTTACGMATAK 418

Query: 239  GWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 135
            GW S+VVP                   GQAVLKFM
Sbjct: 419  GWISMVVPGILAGIFGIAIATFLGIAFGQAVLKFM 453


>ref|XP_009628873.1| PREDICTED: uncharacterized protein LOC104119150 [Nicotiana
            tomentosiformis]
          Length = 452

 Score =  520 bits (1339), Expect = e-144
 Identities = 296/458 (64%), Positives = 339/458 (74%), Gaps = 12/458 (2%)
 Frame = -1

Query: 1472 MASKLLSVLPTHYLPPPEYRPFIHSGRNIPLFQDSSRLTSYK--YKGKVLIS------PF 1317
            MASKL  +   +  PP  Y P     +N+P    +S +TS     +  +L+S      P 
Sbjct: 1    MASKLWFLHNLYIPPPASYSP---RRQNVPA---ASAITSANTILQHPMLLSNIDKYTPL 54

Query: 1316 NMPKKS---SRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGAL 1149
            + PK S   +RSV    SQLNFPIISPQD WGTWTALFATGAFGIWSEKT++G ALSGAL
Sbjct: 55   DFPKSSKKLNRSVTTIRSQLNFPIISPQDQWGTWTALFATGAFGIWSEKTKVGGALSGAL 114

Query: 1148 VSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLG 969
            VSTLVGLAASNLGIIA EAPAY +V  F         L+RADMRRV++STGTLLLAFLLG
Sbjct: 115  VSTLVGLAASNLGIIACEAPAYKIVTGFLLPLAVPLLLFRADMRRVLQSTGTLLLAFLLG 174

Query: 968  SVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGL 789
            SVATTIGTVVAF +VPM+SLGQDGWKIAAALMGRHIGGAVNYVAISEAL+ SPSV+ AGL
Sbjct: 175  SVATTIGTVVAFWIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALETSPSVVTAGL 234

Query: 788  AADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAIC 609
            AADN+ICAVYFTTLFALASKIPAE + S ++ +++ ES SGN LPVLQ+ATALA SFAIC
Sbjct: 235  AADNLICAVYFTTLFALASKIPAEATPSAAEDKIDGESESGNTLPVLQSATALAVSFAIC 294

Query: 608  KTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS 429
            K   FLT++F IQGG+LP +TAIVVILAT+FP QFA LAPSGEAMA+ILMQVFF  +GA+
Sbjct: 295  KAGDFLTKHFVIQGGTLPIITAIVVILATSFPTQFADLAPSGEAMALILMQVFFAFIGAN 354

Query: 428  GSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXX 249
            GSI NV+ TAPSIF+F LVQI VHLA+ILG GKL RF+L+ LLIASNANV          
Sbjct: 355  GSILNVMNTAPSIFVFVLVQIGVHLAVILGVGKLLRFELEQLLIASNANVGGPTTACGMA 414

Query: 248  XXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 135
               GW SLVVP                   GQ +LKFM
Sbjct: 415  TAKGWISLVVPGILAGIFGITIATFLGIAFGQVILKFM 452


>ref|XP_007038793.1| Keratin-associated protein 5-4 [Theobroma cacao]
            gi|508776038|gb|EOY23294.1| Keratin-associated protein
            5-4 [Theobroma cacao]
          Length = 466

 Score =  518 bits (1335), Expect = e-144
 Identities = 276/385 (71%), Positives = 312/385 (81%)
 Frame = -1

Query: 1370 SSRLTSYKYKGKVLISPFNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIW 1191
            SS L+ Y+ +  +     +    ++R V   SQLNFP+ISP D WGTWTALFA GAFGIW
Sbjct: 55   SSSLSLYRSQTFLSSHWLHQNPTANRPVTVKSQLNFPLISPNDQWGTWTALFAIGAFGIW 114

Query: 1190 SEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRV 1011
            SEKT+IGSALSGALVSTL+GLAASNLGII+ EA AY+ V+ F         L+RAD+RRV
Sbjct: 115  SEKTKIGSALSGALVSTLIGLAASNLGIISCEAKAYSTVLEFLLPLAVPLLLFRADLRRV 174

Query: 1010 IRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAIS 831
            I+STG LLLAFLLGSVATT+GT +A+L+VPM++LGQD WKIAAALMGRHIGGAVNYVAIS
Sbjct: 175  IKSTGKLLLAFLLGSVATTVGTALAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAIS 234

Query: 830  EALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPV 651
             AL VSPSVLAAGLAADNVICAVYFTTLFALASK+P E STS  DV + E S SG+KLPV
Sbjct: 235  NALGVSPSVLAAGLAADNVICAVYFTTLFALASKVPPETSTSPEDVAMVEGSESGSKLPV 294

Query: 650  LQTATALAASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMA 471
            LQ ATALA SF+ICK  ++LT+YFGI GGSLPAVTAIVVILAT FP QF  LAP+GEAMA
Sbjct: 295  LQIATALAVSFSICKLGAYLTKYFGIPGGSLPAVTAIVVILATVFPTQFGRLAPAGEAMA 354

Query: 470  MILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIAS 291
            +ILMQVFFTVVGASG+I NVI TAPSIF+F LVQIA+HLA+ILG GKLFRFDLKLLLIAS
Sbjct: 355  LILMQVFFTVVGASGNIWNVINTAPSIFMFALVQIAIHLALILGLGKLFRFDLKLLLIAS 414

Query: 290  NANVXXXXXXXXXXXXXGWSSLVVP 216
            NANV             GWSS+VVP
Sbjct: 415  NANVGGPTTACGMATAKGWSSMVVP 439


>emb|CDP05152.1| unnamed protein product [Coffea canephora]
          Length = 459

 Score =  516 bits (1330), Expect = e-143
 Identities = 273/355 (76%), Positives = 300/355 (84%), Gaps = 1/355 (0%)
 Frame = -1

Query: 1277 SQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIAS 1098
            SQL++PIISPQDHWGTWTALFATGAFGIWSE+T+IGS LSGALVS LVGLAASNLGII  
Sbjct: 78   SQLSYPIISPQDHWGTWTALFATGAFGIWSERTKIGSTLSGALVSILVGLAASNLGIIPC 137

Query: 1097 EAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPM 918
            +APAY +V+           L+RAD+RRVI+STGTLLLAFLLGSVATT+GT VAFL+VPM
Sbjct: 138  DAPAYKIVLQILLPMAVPLLLFRADLRRVIQSTGTLLLAFLLGSVATTLGTAVAFLLVPM 197

Query: 917  QSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFAL 738
            +SLGQDGWKIAAALMGRHIGGAVNYVAISEAL V+PSVLAAGLAADNVICA+YFTTLFAL
Sbjct: 198  RSLGQDGWKIAAALMGRHIGGAVNYVAISEALGVTPSVLAAGLAADNVICAIYFTTLFAL 257

Query: 737  ASKIPAEPSTSTSDVELNEE-SGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGS 561
            AS IP E ST+T+D +   + S SGNKLPVL TATALA SFAICK  S   +YFGI GGS
Sbjct: 258  ASGIPPEASTATTDADAGYDISESGNKLPVLPTATALAVSFAICKAGSSFAKYFGISGGS 317

Query: 560  LPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLF 381
            LPA+TAIVVILAT FP+ FA+LAPSGEAMA+ILMQVFFTVVGASGS+ NVI TAPSI LF
Sbjct: 318  LPAITAIVVILATVFPRLFAHLAPSGEAMALILMQVFFTVVGASGSMWNVINTAPSILLF 377

Query: 380  CLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 216
             LVQIAVHLA+ILG GKLFRFDLKLLL+ASNANV             GWSSLVVP
Sbjct: 378  ALVQIAVHLAVILGLGKLFRFDLKLLLLASNANVGGPTTACGMATAKGWSSLVVP 432


>ref|XP_010255250.1| PREDICTED: uncharacterized protein LOC104595990 [Nelumbo nucifera]
          Length = 457

 Score =  513 bits (1321), Expect = e-142
 Identities = 267/378 (70%), Positives = 309/378 (81%), Gaps = 2/378 (0%)
 Frame = -1

Query: 1343 KGKVLISPFNMPKKSSR--SVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIG 1170
            + K LISP  +PK      S+   +QLNFP+ISP+DHWGTWTALFAT AFGIWSEKT+IG
Sbjct: 53   RSKTLISPLTIPKNHGPVPSLKTRAQLNFPLISPKDHWGTWTALFATSAFGIWSEKTKIG 112

Query: 1169 SALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTL 990
            SALSG+LVS LVGLAASN+GII+ EAPAY+VVM +         L+RAD+RRVI STGTL
Sbjct: 113  SALSGSLVSILVGLAASNIGIISCEAPAYSVVMEYLLPMAVPLLLFRADLRRVIMSTGTL 172

Query: 989  LLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSP 810
            LLAFLLGSVATTIGT+VA+L+VPM+SLGQD WKIAAALMGRHIGGAVNYVAISEAL V+P
Sbjct: 173  LLAFLLGSVATTIGTLVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVTP 232

Query: 809  SVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATAL 630
            SVLAAGLAADNVICA+YFT+LFALAS IP E S ST D  ++ +S  GNKLPVLQTA A+
Sbjct: 233  SVLAAGLAADNVICAIYFTSLFALASNIPPEASKSTEDGVIDAKSEPGNKLPVLQTAIAI 292

Query: 629  AASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVF 450
            A SF+ICKTA++LT+  GIQGGSLP +TA+VVILAT FP QF YLAP+GEA+A+ILMQVF
Sbjct: 293  AVSFSICKTATYLTKLLGIQGGSLPCITALVVILATIFPAQFGYLAPAGEAVALILMQVF 352

Query: 449  FTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXX 270
            F VVGA+GSI NVI TAPS+F+F L+QI +HLA+ILG GKL RFD KLLL+ASNANV   
Sbjct: 353  FAVVGANGSIWNVINTAPSVFMFALLQITIHLAVILGVGKLLRFDQKLLLLASNANVGGP 412

Query: 269  XXXXXXXXXXGWSSLVVP 216
                      GW SLV+P
Sbjct: 413  TTACGMATAKGWGSLVIP 430


>ref|XP_008234847.1| PREDICTED: uncharacterized protein LOC103333733 isoform X1 [Prunus
            mume]
          Length = 463

 Score =  513 bits (1320), Expect = e-142
 Identities = 272/399 (68%), Positives = 312/399 (78%), Gaps = 1/399 (0%)
 Frame = -1

Query: 1328 ISPFNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEK-TEIGSALSGA 1152
            +SP   P    RSV    QLN P+IS  D WGTWTALFATGAFGIWSEK T++G+ALSGA
Sbjct: 65   LSPPAPPDLGDRSVAVRFQLNAPLISSHDQWGTWTALFATGAFGIWSEKNTKVGAALSGA 124

Query: 1151 LVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLL 972
            LVSTL+GLAASNLGII+S APA+++V+ F         LYRAD+RRVI+STG LLLAFLL
Sbjct: 125  LVSTLIGLAASNLGIISSNAPAFSIVLEFLLPLAVPLLLYRADLRRVIKSTGALLLAFLL 184

Query: 971  GSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAG 792
            GSVATT+GTVVA+L+VPM+SLGQD WKIAAALMGRHIGGAVNYVAI++AL VSPS+LAAG
Sbjct: 185  GSVATTVGTVVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIADALGVSPSILAAG 244

Query: 791  LAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAI 612
            LAADNVICAVYF+TLFALASK+P EPSTS   +E +  S  GNKLP++QTATAL+ S AI
Sbjct: 245  LAADNVICAVYFSTLFALASKVPPEPSTSDDGIEKDASSEPGNKLPLIQTATALSVSLAI 304

Query: 611  CKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGA 432
            CK+  +LT+YFGIQGG LPAVTAIVV LAT FPKQFAYLAP+GEAMA+ILMQVFF+VVGA
Sbjct: 305  CKSGHYLTKYFGIQGGILPAVTAIVVTLATVFPKQFAYLAPTGEAMAVILMQVFFSVVGA 364

Query: 431  SGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXX 252
            SG+I NVI TAPSIF F L+QIAVHLA+ILG GKL  FDLKLLLIASNANV         
Sbjct: 365  SGNIWNVINTAPSIFFFALIQIAVHLAVILGLGKLMGFDLKLLLIASNANVGGPTTACGM 424

Query: 251  XXXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 135
                 W+S++VP                   G AVLK+M
Sbjct: 425  ATAKEWNSMIVPGILAGIFGIAIATFIGIAFGLAVLKYM 463


>ref|XP_011043859.1| PREDICTED: uncharacterized protein LOC105139195 isoform X1 [Populus
            euphratica] gi|743901093|ref|XP_011043860.1| PREDICTED:
            uncharacterized protein LOC105139195 isoform X1 [Populus
            euphratica]
          Length = 452

 Score =  512 bits (1319), Expect = e-142
 Identities = 284/422 (67%), Positives = 326/422 (77%), Gaps = 7/422 (1%)
 Frame = -1

Query: 1460 LLSVLPTHYLPP-PEYRPFIHSGRNIPLFQDSSR---LTSYKYKGKV-LISPFNMPK--K 1302
            + S LP  + P  P  RP   S +N P    +     L S  Y  +   +SP   P   +
Sbjct: 1    MASRLPLLHSPVVPFRRPCFVSRQNSPTTTANPTRRTLLSANYGNQTSFLSPQKNPNLIR 60

Query: 1301 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAA 1122
            SS +V ++  LNFP+ISP D WG WTALFATGAFGIWSE+T+IGSALSGALVSTLVGLAA
Sbjct: 61   SSVTVRSNMILNFPLISPTDPWGMWTALFATGAFGIWSERTKIGSALSGALVSTLVGLAA 120

Query: 1121 SNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTV 942
            SNLGII+ E+PAY+ V+ F         L+RAD+RRVI+STGTLLLAFLLGSVATT+GTV
Sbjct: 121  SNLGIISCESPAYSTVLKFLLPLAVPLLLFRADLRRVIQSTGTLLLAFLLGSVATTVGTV 180

Query: 941  VAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 762
            +A++MVPM++LGQD WKIAAALMGRHIGGAVNYVAIS+AL VSPSVLAAGLAADNVICAV
Sbjct: 181  LAYMMVPMRALGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVICAV 240

Query: 761  YFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRY 582
            YFT+LFALASKIPAE S S     ++  S SGNKLPVLQTATALA SFAICK   ++T++
Sbjct: 241  YFTSLFALASKIPAESSASIDGSGMDSGSESGNKLPVLQTATALAVSFAICKAGEYITKF 300

Query: 581  FGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITT 402
            F I GG LPAVTAIVVILAT FP QF +LAPSGEA+A+ILMQVFF VVGASG++RNVI T
Sbjct: 301  FAIPGGILPAVTAIVVILATAFPTQFNHLAPSGEALALILMQVFFAVVGASGNVRNVINT 360

Query: 401  APSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 222
            APSIF+F LVQIA+HLA+ILG GKLFRFD KLLLIASNANV             GWSSLV
Sbjct: 361  APSIFMFALVQIAIHLAVILGLGKLFRFDQKLLLIASNANVGGPTTACGMATAKGWSSLV 420

Query: 221  VP 216
            VP
Sbjct: 421  VP 422


>ref|XP_010255249.1| PREDICTED: uncharacterized protein LOC104595989 [Nelumbo nucifera]
          Length = 458

 Score =  512 bits (1318), Expect = e-142
 Identities = 269/377 (71%), Positives = 309/377 (81%), Gaps = 3/377 (0%)
 Frame = -1

Query: 1337 KVLISPFNMPKKS---SRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGS 1167
            K  +SP   PK +   +RSV   +QL+FP+ISP+DHWGTWTALF + AFGIWSEKT++GS
Sbjct: 55   KTFLSPSTFPKGNPDLNRSVKTKAQLSFPLISPKDHWGTWTALFVSSAFGIWSEKTKVGS 114

Query: 1166 ALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLL 987
            ALSGALVSTLVGL ASNLGII+ EAPAY++VM +         L+RAD+RRVI STGTLL
Sbjct: 115  ALSGALVSTLVGLGASNLGIISCEAPAYSLVMEYLLPMAVPLLLFRADLRRVILSTGTLL 174

Query: 986  LAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPS 807
             AFLLGSVATTIGT+VA+LMVPM+SLG D WKIAAALMGRHIGGAVNYVAISEAL VSPS
Sbjct: 175  SAFLLGSVATTIGTIVAYLMVPMRSLGHDNWKIAAALMGRHIGGAVNYVAISEALAVSPS 234

Query: 806  VLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALA 627
            VLAAGLAADNVICA+YFT+LFALAS+IP E +T T+D  ++ ES  GNKLPVLQTATALA
Sbjct: 235  VLAAGLAADNVICAIYFTSLFALASQIPPESTTPTNDDVIDTESQIGNKLPVLQTATALA 294

Query: 626  ASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFF 447
             SFAICKT ++L++  GIQGG+LP +TAIVVILAT FP QF YLAP+GEA+A+ILMQVFF
Sbjct: 295  VSFAICKTGTYLSKLLGIQGGNLPCITAIVVILATIFPAQFGYLAPAGEAVALILMQVFF 354

Query: 446  TVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXX 267
             VVGA+GSI NVI TAPSIF+F LVQIAVHLA+ILG GKL +FD KLLL+ASNANV    
Sbjct: 355  AVVGANGSIWNVINTAPSIFMFSLVQIAVHLAVILGVGKLMQFDQKLLLLASNANVGGPA 414

Query: 266  XXXXXXXXXGWSSLVVP 216
                     GW SLVVP
Sbjct: 415  TACGMASTKGWGSLVVP 431


>ref|XP_002305244.2| hypothetical protein POPTR_0004s07750g [Populus trichocarpa]
            gi|550340557|gb|EEE85755.2| hypothetical protein
            POPTR_0004s07750g [Populus trichocarpa]
          Length = 452

 Score =  511 bits (1315), Expect = e-142
 Identities = 271/373 (72%), Positives = 309/373 (82%), Gaps = 2/373 (0%)
 Frame = -1

Query: 1328 ISPFNMPK--KSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSG 1155
            +SP   P   +SS +V ++  LNFP+ISP D WG WTALFATGAFGIWSE+T+IGSALSG
Sbjct: 50   LSPQKNPNLIRSSVTVRSNLILNFPLISPTDPWGMWTALFATGAFGIWSERTKIGSALSG 109

Query: 1154 ALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFL 975
            ALVSTLVGLAASNLGII+ E+PAY++V+ F         L+RAD+RRVI+STGTLLLAFL
Sbjct: 110  ALVSTLVGLAASNLGIISCESPAYSIVLKFLLPLAVPLLLFRADLRRVIQSTGTLLLAFL 169

Query: 974  LGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAA 795
            LGSVATT+GTV+A++MVPM++LGQD WKIAAALMGRHIGGAVNYVAIS+AL+VSPSVLAA
Sbjct: 170  LGSVATTVGTVLAYMMVPMRALGQDSWKIAAALMGRHIGGAVNYVAISDALRVSPSVLAA 229

Query: 794  GLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFA 615
            GLAADNVICAVYFT+LFALASKIPAE S S     ++  S SGNKLPVLQTATALA SFA
Sbjct: 230  GLAADNVICAVYFTSLFALASKIPAESSASIDGSGMDSGSESGNKLPVLQTATALAVSFA 289

Query: 614  ICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVG 435
            ICK   ++T++F I GG LPAVTAIVVILAT FP QF +LAPSGEA+A+ILMQVFF VVG
Sbjct: 290  ICKAGEYITKFFAIPGGILPAVTAIVVILATAFPTQFNHLAPSGEALALILMQVFFAVVG 349

Query: 434  ASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXX 255
            ASG++ NVI TAPSIFLF LVQIA+HLA+ILG GKLFRFD KLLLIASNANV        
Sbjct: 350  ASGNVWNVINTAPSIFLFALVQIAIHLAVILGLGKLFRFDQKLLLIASNANVGGPTTACG 409

Query: 254  XXXXXGWSSLVVP 216
                 GWSSLVVP
Sbjct: 410  MATAKGWSSLVVP 422


>ref|XP_012090197.1| PREDICTED: uncharacterized protein LOC105648428 [Jatropha curcas]
            gi|643706105|gb|KDP22237.1| hypothetical protein
            JCGZ_26068 [Jatropha curcas]
          Length = 459

 Score =  509 bits (1310), Expect = e-141
 Identities = 275/392 (70%), Positives = 311/392 (79%), Gaps = 2/392 (0%)
 Frame = -1

Query: 1385 PLFQDSSRLTSYKYKGKVLISP--FNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFA 1212
            P  Q SS   S   +    +SP  +     S RSV   S LNFP+ISP D WGTWTALFA
Sbjct: 43   PALQSSS--ISLGNRSHTFLSPELYTEDSSSLRSVAVRSNLNFPLISPGDRWGTWTALFA 100

Query: 1211 TGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLY 1032
            TGAFGIWSEKT+IGSALSGALVSTLVGLAASNLGII+ E+PAY +V+ F         L+
Sbjct: 101  TGAFGIWSEKTKIGSALSGALVSTLVGLAASNLGIISCESPAYPIVLEFLLPLAVPLLLF 160

Query: 1031 RADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGA 852
            RAD+RRVI+STGTLLLAFL+GSVATT+GT+VA+ +VPM+SLGQD WKIAAALMGRHIGGA
Sbjct: 161  RADLRRVIQSTGTLLLAFLIGSVATTVGTLVAYWIVPMRSLGQDSWKIAAALMGRHIGGA 220

Query: 851  VNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESG 672
            VNYVAIS+AL VS SVLA+GLAADNVICAVYFTTLFALASKIP E S ST+D  +  E+ 
Sbjct: 221  VNYVAISDALGVSSSVLASGLAADNVICAVYFTTLFALASKIPPESSVSTNDGAIESETE 280

Query: 671  SGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLA 492
              +KLPVL+ ATA+A SFAICK  SF+T+ FGIQGG LPAVTAIVVILAT FP QF  LA
Sbjct: 281  PSDKLPVLKIATAIAVSFAICKAGSFVTKLFGIQGGILPAVTAIVVILATAFPTQFNQLA 340

Query: 491  PSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDL 312
            PSGEA+A+ILMQVFFTVVGASG+I +VI TAPSIF+F LVQI VHLA+ILG GKLFRFDL
Sbjct: 341  PSGEAIALILMQVFFTVVGASGNIWSVINTAPSIFMFALVQITVHLAVILGLGKLFRFDL 400

Query: 311  KLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 216
            KLLL+ASNANV             GW+SLVVP
Sbjct: 401  KLLLLASNANVGGPTTACGMATAKGWNSLVVP 432


>ref|XP_002513660.1| conserved hypothetical protein [Ricinus communis]
            gi|223547568|gb|EEF49063.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 965

 Score =  509 bits (1310), Expect = e-141
 Identities = 281/409 (68%), Positives = 316/409 (77%), Gaps = 3/409 (0%)
 Frame = -1

Query: 1433 LPPPEYRPFIHSGRNIPL-FQDSSRLTSYKYKGKVLISPFNMP--KKSSRSVIASSQLNF 1263
            + P  Y+ F    +  PL F  +    S   + +  +SP   P    S RS+   S LNF
Sbjct: 31   MSPQSYQSF----KIYPLHFHSNDNDNSNNNRNQTFLSPQLYPGDPSSRRSLAVRSNLNF 86

Query: 1262 PIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAY 1083
            P+IS  D WGTWTALFATGAFGIWSEKT+IGSALSGALVSTLVGLA SNLGII+ E+PAY
Sbjct: 87   PLISSNDRWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGLAGSNLGIISCESPAY 146

Query: 1082 NVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQ 903
             VV+ F         L+RAD+RRVIRSTGTLLLAFLLGSVATT+GTVVA+ +VPM+SLGQ
Sbjct: 147  AVVLEFLLPLAVPLLLFRADLRRVIRSTGTLLLAFLLGSVATTVGTVVAYWIVPMRSLGQ 206

Query: 902  DGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIP 723
            D WKIAAALMGRHIGGAVNYVAI++AL VS SVLA+GLAADNVICAVYFTTLFALASKIP
Sbjct: 207  DSWKIAAALMGRHIGGAVNYVAIADALGVSSSVLASGLAADNVICAVYFTTLFALASKIP 266

Query: 722  AEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTA 543
            AE STS+++  +   S SG KLPVLQ AT+LA S AICK  S++T+ FGIQGG LPAVTA
Sbjct: 267  AETSTSSNEDGMESGSVSGEKLPVLQLATSLAVSLAICKAGSYVTKLFGIQGGILPAVTA 326

Query: 542  IVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIA 363
            IVVILAT FP QF  LAPSGEAMA+ILMQVFFTVVGASG+I NV+ TAPSIF+F LVQIA
Sbjct: 327  IVVILATAFPTQFNGLAPSGEAMALILMQVFFTVVGASGNIWNVVKTAPSIFMFALVQIA 386

Query: 362  VHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 216
            VHL IILG GKLFRFD KLLL+ASNANV             GWSSLVVP
Sbjct: 387  VHLVIILGLGKLFRFDQKLLLLASNANVGGPTTACGMATAKGWSSLVVP 435



 Score =  470 bits (1209), Expect = e-129
 Identities = 254/410 (61%), Positives = 302/410 (73%), Gaps = 11/410 (2%)
 Frame = -1

Query: 1412 PFIHSGRNIPLFQDSSRLTSYKYKGKV--------LISPFNMPKKSS---RSVIASSQLN 1266
            P +HS  +  L   S  L+ +  + K+          SP  +   ++   R +   SQL 
Sbjct: 529  PLLHSSCSPSLRISSRHLSPFSSRHKLSHPNINEAAFSPSTISLNNTSLIRQIKLRSQLR 588

Query: 1265 FPIISPQDHWGTWTALFATGAFGIWSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPA 1086
            FP+ISP DHWGTWTALFATGAFGIWSE T++GS +S ALVSTLVGLAASN+GII  E  A
Sbjct: 589  FPLISPDDHWGTWTALFATGAFGIWSEGTKVGSMVSAALVSTLVGLAASNIGIIPYETAA 648

Query: 1085 YNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLG 906
            Y++V+ F         L+RAD+R VIRSTG L LAFLLGSVAT IGT VAFLMVPM+SLG
Sbjct: 649  YSLVLEFLLPLTVPLLLFRADLRNVIRSTGKLFLAFLLGSVATIIGTTVAFLMVPMRSLG 708

Query: 905  QDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKI 726
             D WKIAAALMG +IGG+VNYVAISEAL  SPSV+AAG+AADNVICA YF  LFALASKI
Sbjct: 709  PDNWKIAAALMGSYIGGSVNYVAISEALGTSPSVVAAGIAADNVICATYFMALFALASKI 768

Query: 725  PAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAVT 546
            PAE S ST+ VE++ ES S  K+PVLQ A ALA SF IC+TA++LT+   +QGG+LPA+T
Sbjct: 769  PAENSASTNGVEMDVESSSTGKIPVLQMAAALAISFMICRTATYLTQLCKVQGGNLPAIT 828

Query: 545  AIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQI 366
            AIVV LAT+FP QF  LAP+G+ +A++LMQVFF VVGASGSI NVI TAPSIFLF LVQ+
Sbjct: 829  AIVVFLATSFPVQFGRLAPAGDTIALVLMQVFFAVVGASGSIWNVIKTAPSIFLFALVQL 888

Query: 365  AVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 216
             VHLA++LG G+LF FDLKLLL+ASNAN+             GW SLVVP
Sbjct: 889  TVHLAVVLGLGRLFDFDLKLLLLASNANIGGPTTACGMATAKGWKSLVVP 938


>gb|KHG26673.1| putative membrane yjcL [Gossypium arboreum]
          Length = 464

 Score =  508 bits (1307), Expect = e-141
 Identities = 270/386 (69%), Positives = 313/386 (81%), Gaps = 2/386 (0%)
 Frame = -1

Query: 1367 SRLTSYKYKGKVLISPFNMPKK--SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGI 1194
            SRL   K + +  +SP  + K   ++R++I  SQLN P+ISP D WGTWTALFATGAFG+
Sbjct: 53   SRLLPLK-RTQTFLSPKWLDKNPDATRTLIVKSQLNSPLISPNDQWGTWTALFATGAFGL 111

Query: 1193 WSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRR 1014
            WSE T+ GSALSGALVSTL+GLAASNLGII+SEA AY++V  F         L+RAD+RR
Sbjct: 112  WSENTKAGSALSGALVSTLIGLAASNLGIISSEAKAYSIVKEFLLPLAVPLLLFRADLRR 171

Query: 1013 VIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAI 834
            VI+STG LLLAFLLGSVATT+GT +A+L+VPM++LGQD WKIAAALMGRHIGGAVNYVAI
Sbjct: 172  VIKSTGKLLLAFLLGSVATTVGTALAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAI 231

Query: 833  SEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLP 654
            S AL+ S SVLAAGLAADNVICAVYFTTLFALASK+PAE STS  DV + E S S  KLP
Sbjct: 232  SNALETSESVLAAGLAADNVICAVYFTTLFALASKVPAETSTSPEDVAMGEGSISDGKLP 291

Query: 653  VLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAM 474
            VL+ ATALA SFAICK  ++LT+YFGI GG LPAVTAIVVILAT FP QF +LAPSGEAM
Sbjct: 292  VLKIATALAVSFAICKLGAYLTKYFGIPGGILPAVTAIVVILATVFPAQFGHLAPSGEAM 351

Query: 473  AMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIA 294
            A+ILMQVFFTVVGASG+I +VI TAPSIF+F LVQI++HLA+ILG GKLF+FDLKLLLIA
Sbjct: 352  ALILMQVFFTVVGASGNIWSVIRTAPSIFMFALVQISIHLALILGLGKLFKFDLKLLLIA 411

Query: 293  SNANVXXXXXXXXXXXXXGWSSLVVP 216
            SNANV             GWSS+++P
Sbjct: 412  SNANVGGPTTASGMATAKGWSSMIIP 437


>ref|XP_012484138.1| PREDICTED: uncharacterized protein LOC105798567 [Gossypium raimondii]
            gi|763766946|gb|KJB34161.1| hypothetical protein
            B456_006G051100 [Gossypium raimondii]
          Length = 464

 Score =  507 bits (1305), Expect = e-140
 Identities = 269/386 (69%), Positives = 312/386 (80%), Gaps = 2/386 (0%)
 Frame = -1

Query: 1367 SRLTSYKYKGKVLISPFNMPKK--SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGI 1194
            SRL   K + +  +SP  + K   ++R++I  SQLN P+ISP D WGTWTALFATGAFG+
Sbjct: 53   SRLLPLK-RTQTFLSPKWLDKNPDATRTLIVKSQLNCPLISPNDQWGTWTALFATGAFGL 111

Query: 1193 WSEKTEIGSALSGALVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRR 1014
            WSE T+ GSALSGALVSTL+GLAASNLGII+SEA  Y++V  F         L+RAD+RR
Sbjct: 112  WSENTKAGSALSGALVSTLIGLAASNLGIISSEAKVYSIVKEFLLPLAVPLLLFRADLRR 171

Query: 1013 VIRSTGTLLLAFLLGSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAI 834
            VI+STG LLLAFLLGSVATT+GT +A+L+VPM++LGQD WKIAAALMGRHIGGAVNYVAI
Sbjct: 172  VIKSTGKLLLAFLLGSVATTVGTALAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAI 231

Query: 833  SEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLP 654
            S AL+ S SVLAAGLAADNVICAVYFTTLFALASK+PAE STS  DV + E S S  KLP
Sbjct: 232  SNALETSESVLAAGLAADNVICAVYFTTLFALASKVPAETSTSPEDVAMGEGSKSDGKLP 291

Query: 653  VLQTATALAASFAICKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAM 474
            VL+ ATALA SFAICK  ++LT+YFGI GG LPAVTAIVVILAT FP QF +LAPSGEAM
Sbjct: 292  VLKIATALAVSFAICKLGAYLTKYFGIPGGILPAVTAIVVILATVFPTQFGHLAPSGEAM 351

Query: 473  AMILMQVFFTVVGASGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIA 294
            A+ILMQVFFTVVGASG+I +VI TAPSIF+F LVQI++HLA+ILG GKLF+FDLKLLLIA
Sbjct: 352  ALILMQVFFTVVGASGNIWSVIRTAPSIFMFALVQISIHLALILGLGKLFKFDLKLLLIA 411

Query: 293  SNANVXXXXXXXXXXXXXGWSSLVVP 216
            SNANV             GWSS+++P
Sbjct: 412  SNANVGGPTTASGMATAKGWSSMIIP 437


>ref|XP_007217974.1| hypothetical protein PRUPE_ppa005389mg [Prunus persica]
            gi|462414436|gb|EMJ19173.1| hypothetical protein
            PRUPE_ppa005389mg [Prunus persica]
          Length = 463

 Score =  505 bits (1301), Expect = e-140
 Identities = 268/399 (67%), Positives = 308/399 (77%), Gaps = 1/399 (0%)
 Frame = -1

Query: 1328 ISPFNMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEK-TEIGSALSGA 1152
            +SP   P    RSV    QLN P+IS  D WGTWTALFATGAFGIWSEK T++G+ALSGA
Sbjct: 65   LSPPAPPNLGDRSVAVRFQLNAPLISSHDQWGTWTALFATGAFGIWSEKNTKVGAALSGA 124

Query: 1151 LVSTLVGLAASNLGIIASEAPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLL 972
            LVSTL+GLAASNLGII+S APA+++V+ F         LYRAD+RRVI+STG LLLAFLL
Sbjct: 125  LVSTLIGLAASNLGIISSNAPAFSIVLEFLLPLAVPLLLYRADLRRVIKSTGALLLAFLL 184

Query: 971  GSVATTIGTVVAFLMVPMQSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAG 792
            GSVATT+GTVVA+L+VPM+SLGQD WKIAAALMGRHIGGAVNYVAI++AL VSPS+LAAG
Sbjct: 185  GSVATTVGTVVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIADALGVSPSILAAG 244

Query: 791  LAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNEESGSGNKLPVLQTATALAASFAI 612
            LAADNVICAVYF+TLFALASK+P EPSTS   +  +  S  GNKLP++QTA AL+ S AI
Sbjct: 245  LAADNVICAVYFSTLFALASKVPPEPSTSDDGIRKDASSEPGNKLPLIQTAAALSVSLAI 304

Query: 611  CKTASFLTRYFGIQGGSLPAVTAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGA 432
            CK+  +LT+YFGIQGG LPAVTAIVV LAT FPKQFAYLAP+GEAMA+ILMQVFF VVGA
Sbjct: 305  CKSGHYLTKYFGIQGGILPAVTAIVVTLATVFPKQFAYLAPTGEAMAVILMQVFFAVVGA 364

Query: 431  SGSIRNVITTAPSIFLFCLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXX 252
            SG+I +VI TAPSIF F L+QIAVHL +ILG GKL  FDLKLLLIASNANV         
Sbjct: 365  SGNIWSVINTAPSIFFFALIQIAVHLVVILGLGKLLGFDLKLLLIASNANVGGPTTACGM 424

Query: 251  XXXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 135
                 W+S++VP                   G AVLK+M
Sbjct: 425  ATAKEWNSMIVPGILAGIFGIAIATFIGIAFGLAVLKYM 463

