BLASTX nr result

ID: Forsythia22_contig00030702 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00030702
         (2069 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011076116.1| PREDICTED: uncharacterized protein LOC105160...   582   e-163
ref|XP_012852159.1| PREDICTED: uncharacterized protein LOC105971...   563   e-157
ref|XP_002273113.2| PREDICTED: uncharacterized protein LOC100264...   545   e-152
ref|XP_010662689.1| PREDICTED: uncharacterized protein LOC100264...   538   e-150
ref|XP_006422011.1| hypothetical protein CICLE_v10004922mg [Citr...   532   e-148
ref|XP_004242762.1| PREDICTED: uncharacterized protein LOC101267...   523   e-145
emb|CDP05152.1| unnamed protein product [Coffea canephora]            521   e-144
ref|XP_011043859.1| PREDICTED: uncharacterized protein LOC105139...   520   e-144
ref|XP_009628873.1| PREDICTED: uncharacterized protein LOC104119...   520   e-144
ref|XP_007038793.1| Keratin-associated protein 5-4 [Theobroma ca...   520   e-144
ref|XP_006358300.1| PREDICTED: uncharacterized protein LOC102584...   519   e-144
ref|XP_002305244.2| hypothetical protein POPTR_0004s07750g [Popu...   516   e-143
ref|XP_010255250.1| PREDICTED: uncharacterized protein LOC104595...   514   e-143
ref|XP_010255249.1| PREDICTED: uncharacterized protein LOC104595...   514   e-142
ref|XP_008234847.1| PREDICTED: uncharacterized protein LOC103333...   513   e-142
ref|XP_012090197.1| PREDICTED: uncharacterized protein LOC105648...   511   e-142
ref|XP_002513660.1| conserved hypothetical protein [Ricinus comm...   511   e-141
gb|KHG26673.1| putative membrane yjcL [Gossypium arboreum]            509   e-141
ref|XP_012484138.1| PREDICTED: uncharacterized protein LOC105798...   509   e-141
ref|XP_007217974.1| hypothetical protein PRUPE_ppa005389mg [Prun...   506   e-140

>ref|XP_011076116.1| PREDICTED: uncharacterized protein LOC105160444 [Sesamum indicum]
          Length = 455

 Score =  582 bits (1500), Expect = e-163
 Identities = 328/455 (72%), Positives = 350/455 (76%), Gaps = 9/455 (1%)
 Frame = -3

Query: 1971 MASKLLSVLPTHYPLPPEYRPFIHSGRN--------IPLFQDSSRLTSYKYKGKVL-ISP 1819
            MA KLL   P +   PP  R    S +          PL QD S LTS   K + L +SP
Sbjct: 1    MALKLLFSQPINCHPPPLQRSRFASHQKPSQIPTARSPLIQDFSLLTSSSNKDRSLNLSP 60

Query: 1818 FKMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVST 1639
               PK  +RSV+A SQLNFPIISPQD WGTWTALFATGAFGIWSEKTKIGSALSGALVS 
Sbjct: 61   NTNPKNVARSVVAKSQLNFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSI 120

Query: 1638 LVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVA 1459
            LVGLAASNLGIIASE PAY VV+ F         LYRADMRR+IRSTGTLLLAFLLGSVA
Sbjct: 121  LVGLAASNLGIIASEAPAYKVVLEFLLPLAVPLLLYRADMRRIIRSTGTLLLAFLLGSVA 180

Query: 1458 TTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAAD 1279
            TT GTAVAFL+VPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEAL+V+PSVLAAGLAAD
Sbjct: 181  TTAGTAVAFLLVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALEVTPSVLAAGLAAD 240

Query: 1278 NVICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAICKTA 1099
            NVICA+YFTTLFALASKIPAE +TST+D  LN+ES S NKLPVLQTATALA SF ICK+A
Sbjct: 241  NVICAIYFTTLFALASKIPAESATSTTDGGLNEESESSNKLPVLQTATALAVSFIICKSA 300

Query: 1098 SFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSI 919
            SFLT Y GIQG +LP ITAIVVILAT  P QFAYLAPSGEAMA+ILMQVFF V+GASGSI
Sbjct: 301  SFLTNYLGIQGATLPTITAIVVILATMLPNQFAYLAPSGEAMALILMQVFFAVIGASGSI 360

Query: 918  RNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXX 739
            R+VISTAPSIFLF+LVQI VHLAIILG GKL RFDLKLLL+ASNANV             
Sbjct: 361  RSVISTAPSIFLFALVQIGVHLAIILGLGKLLRFDLKLLLLASNANVGGPTTACGMATAK 420

Query: 738  GWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 634
            GWSSLVVP                   GQAVLKFM
Sbjct: 421  GWSSLVVPGILAGIFGIAIATFLGIAFGQAVLKFM 455


>ref|XP_012852159.1| PREDICTED: uncharacterized protein LOC105971807 [Erythranthe
            guttatus] gi|604306080|gb|EYU25137.1| hypothetical
            protein MIMGU_mgv1a006291mg [Erythranthe guttata]
          Length = 449

 Score =  563 bits (1450), Expect = e-157
 Identities = 312/456 (68%), Positives = 351/456 (76%), Gaps = 10/456 (2%)
 Frame = -3

Query: 1971 MASKLLSVLPTHYPLPPEYRPFIHSGR--------NIPLFQDS--SRLTSYKYKGKVLIS 1822
            MA K+L   PT+ P PP  R  + S          + P FQ+S  S  +S K++    IS
Sbjct: 1    MAGKILLFHPTYIPPPPARRSIVASRNAASQIPDTHTPSFQNSPLSTFSSDKFRTLKTIS 60

Query: 1821 PFKMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVS 1642
                 +  +RSV+A SQLNFPIISP D WGTWTALFA GAFGIWSEKTKIGSALSGALVS
Sbjct: 61   -----RNPARSVVARSQLNFPIISPHDQWGTWTALFAAGAFGIWSEKTKIGSALSGALVS 115

Query: 1641 TLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSV 1462
            TLVGLAASNLGIIASET AYNVV+ F         LYRADMRRVI+STGTLLLAFLLGSV
Sbjct: 116  TLVGLAASNLGIIASETAAYNVVLEFLLPLAVPLLLYRADMRRVIKSTGTLLLAFLLGSV 175

Query: 1461 ATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAA 1282
            ATT+GT VA+ +VPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEAL VSPSVLAAGLAA
Sbjct: 176  ATTVGTLVAYFLVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALGVSPSVLAAGLAA 235

Query: 1281 DNVICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAICKT 1102
            DNVICA+YFTTLFALASKIP+E S+ T  +  N+ES S NKLPVLQTATA+A SF ICK 
Sbjct: 236  DNVICAIYFTTLFALASKIPSESSSPTPGI--NEESESDNKLPVLQTATAVAVSFIICKI 293

Query: 1101 ASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGS 922
            A+ LT++FGIQGG+LPAITAIVV+LAT+FP QFAYLAPSGEAMA+ILMQVFF V+GASGS
Sbjct: 294  ATVLTKHFGIQGGTLPAITAIVVVLATSFPNQFAYLAPSGEAMALILMQVFFAVIGASGS 353

Query: 921  IRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXX 742
            IRNVI+TAPSIFLF+L+QI VHLA+ILG GKLFRFDL+LLL+ASNANV            
Sbjct: 354  IRNVITTAPSIFLFALIQIGVHLAVILGLGKLFRFDLRLLLLASNANVGGPTTACGMATA 413

Query: 741  XGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 634
             GW+SL+VP                   GQAVL+FM
Sbjct: 414  KGWTSLIVPGILAGIFGIAIATFLGIAFGQAVLRFM 449


>ref|XP_002273113.2| PREDICTED: uncharacterized protein LOC100264478 isoform X1 [Vitis
            vinifera] gi|302143806|emb|CBI22667.3| unnamed protein
            product [Vitis vinifera]
          Length = 449

 Score =  545 bits (1405), Expect = e-152
 Identities = 310/454 (68%), Positives = 339/454 (74%), Gaps = 8/454 (1%)
 Frame = -3

Query: 1971 MASKLLSV-LPTHYPLPPEYRPFIHSGRNIPLFQDSSRLTS-----YKYKGKVLISPFKM 1810
            MASK L++  P   P     +P   S +N P    SS  T      ++ K +  +SP   
Sbjct: 1    MASKFLTLRAPLSIP-----QPVTCSRQNFPTVSTSSCSTVEDPSLWRSKKQTQLSPLIF 55

Query: 1809 PKKSS--RSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVSTL 1636
            PK S   RSV   S L FPIISPQD WGTWTALFATGAFGIWSEKTKIGSALSGALVSTL
Sbjct: 56   PKSSDSIRSVTVRSSLTFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSTL 115

Query: 1635 VGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVAT 1456
            VGLAASNLGII+ E PAY+VV+ F         L+RAD+RRVI+STG LL+AFL+GSVAT
Sbjct: 116  VGLAASNLGIISCEAPAYSVVLNFLLPLAVPLLLFRADLRRVIQSTGALLMAFLIGSVAT 175

Query: 1455 TIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADN 1276
            TIGT VAFLMVPMRSLGQD WKIAAALMGRHIGGAVNYVAISEAL VS SVLAAGLAADN
Sbjct: 176  TIGTVVAFLMVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVSRSVLAAGLAADN 235

Query: 1275 VICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAICKTAS 1096
            VICAVYFTTLFALASKIP E STS +D  +N++   GNK PVL TATALA SFAICK   
Sbjct: 236  VICAVYFTTLFALASKIPPEDSTSANDTGMNEQPEPGNKPPVLLTATALAVSFAICKAGI 295

Query: 1095 FLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIR 916
            FLT+YFGIQGGSLPAITAIVVILAT FPKQF+ LAP+GE MAMILMQVFFTVVGASG+I 
Sbjct: 296  FLTKYFGIQGGSLPAITAIVVILATAFPKQFSLLAPAGETMAMILMQVFFTVVGASGNIG 355

Query: 915  NVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXG 736
            NV++TAPSIF+F+LVQIAVHLA+ILG GKLFRFDLKLLLIASNANV             G
Sbjct: 356  NVMNTAPSIFMFALVQIAVHLAVILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKG 415

Query: 735  WSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 634
            WSSLVVP                   G  VLKFM
Sbjct: 416  WSSLVVPGILAGIFGIAIATFLGIVFGVTVLKFM 449


>ref|XP_010662689.1| PREDICTED: uncharacterized protein LOC100264478 isoform X2 [Vitis
            vinifera]
          Length = 447

 Score =  538 bits (1387), Expect = e-150
 Identities = 309/454 (68%), Positives = 338/454 (74%), Gaps = 8/454 (1%)
 Frame = -3

Query: 1971 MASKLLSV-LPTHYPLPPEYRPFIHSGRNIPLFQDSSRLTS-----YKYKGKVLISPFKM 1810
            MASK L++  P   P     +P   S +N P    SS  T      ++ K +  +SP   
Sbjct: 1    MASKFLTLRAPLSIP-----QPVTCSRQNFPTVSTSSCSTVEDPSLWRSKKQTQLSPLIF 55

Query: 1809 PKKSS--RSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVSTL 1636
            PK S   RSV   S L FPIISPQD WGTWTALFATGAFGIWSEKTKIGSALSGALVSTL
Sbjct: 56   PKSSDSIRSVTVRSSLTFPIISPQDQWGTWTALFATGAFGIWSEKTKIGSALSGALVSTL 115

Query: 1635 VGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVAT 1456
            VGLAASNLGII+ E PAY+VV+ F         L+RAD+RRVI+STG LL+AFL+GSVAT
Sbjct: 116  VGLAASNLGIISCEAPAYSVVLNFLLPLAVPLLLFRADLRRVIQSTGALLMAFLIGSVAT 175

Query: 1455 TIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADN 1276
            TIGT VAFLMVPMRSLGQD WKIAAALMGRHIGGAVNYVAISEAL VS SVLAAGLAADN
Sbjct: 176  TIGTVVAFLMVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVSRSVLAAGLAADN 235

Query: 1275 VICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAICKTAS 1096
            VICAVYFTTLFALASKIP E STS +   +N++   GNK PVL TATALA SFAICK   
Sbjct: 236  VICAVYFTTLFALASKIPPEDSTSANG--MNEQPEPGNKPPVLLTATALAVSFAICKAGI 293

Query: 1095 FLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIR 916
            FLT+YFGIQGGSLPAITAIVVILAT FPKQF+ LAP+GE MAMILMQVFFTVVGASG+I 
Sbjct: 294  FLTKYFGIQGGSLPAITAIVVILATAFPKQFSLLAPAGETMAMILMQVFFTVVGASGNIG 353

Query: 915  NVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXG 736
            NV++TAPSIF+F+LVQIAVHLA+ILG GKLFRFDLKLLLIASNANV             G
Sbjct: 354  NVMNTAPSIFMFALVQIAVHLAVILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKG 413

Query: 735  WSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 634
            WSSLVVP                   G  VLKFM
Sbjct: 414  WSSLVVPGILAGIFGIAIATFLGIVFGVTVLKFM 447


>ref|XP_006422011.1| hypothetical protein CICLE_v10004922mg [Citrus clementina]
            gi|568875109|ref|XP_006490652.1| PREDICTED:
            uncharacterized protein LOC102608862 [Citrus sinensis]
            gi|557523884|gb|ESR35251.1| hypothetical protein
            CICLE_v10004922mg [Citrus clementina]
          Length = 466

 Score =  532 bits (1371), Expect = e-148
 Identities = 282/393 (71%), Positives = 321/393 (81%), Gaps = 1/393 (0%)
 Frame = -3

Query: 1890 NIPLFQDSSRLTSYKYKGKVLISPFKMPKKSSRSVIASSQL-NFPIISPQDHWGTWTALF 1714
            +IP  Q S+   S+      L   F  P   +RSV A SQL NFP+ISP D WGTWTALF
Sbjct: 47   SIPQHQSSASYLSHSRTNTFLSPQFPHPSNRTRSVTARSQLPNFPLISPHDKWGTWTALF 106

Query: 1713 ATGAFGIWSEKTKIGSALSGALVSTLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXL 1534
            ATGAFGIWSE+TKIGSALSGALVSTL+GLAASNLG+++ E+PAY++V+ F         L
Sbjct: 107  ATGAFGIWSERTKIGSALSGALVSTLIGLAASNLGVVSCESPAYSIVLEFLLPLAVPLLL 166

Query: 1533 YRADMRRVIRSTGTLLLAFLLGSVATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGG 1354
            +RAD+RRVI+STGTLLLAFL+GSVATT+GTA+A+L+VPMRSLGQD WKIAAALMGRHIGG
Sbjct: 167  FRADLRRVIKSTGTLLLAFLIGSVATTVGTALAYLLVPMRSLGQDSWKIAAALMGRHIGG 226

Query: 1353 AVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNKES 1174
            AVNYVAIS+AL VS SVLAAGLAADNVICAVYFTTLFALAS IPAE STS  DV +N+ S
Sbjct: 227  AVNYVAISDALGVSSSVLAAGLAADNVICAVYFTTLFALASNIPAESSTSVDDVSMNEGS 286

Query: 1173 GSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYL 994
              G+K PVLQ ATALA +FAICK  +FLT+YFGIQGGSLPAITAIVV LATTFP QF  L
Sbjct: 287  VRGDKPPVLQFATALAVAFAICKAGTFLTKYFGIQGGSLPAITAIVVTLATTFPTQFNKL 346

Query: 993  APSGEAMAMILMQVFFTVVGASGSIRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFD 814
            AP+GEAMA+ILMQVFFTVVGASG+I +VI+TAPSIF+F+LVQIA+HLA+ILG GKLFRFD
Sbjct: 347  APAGEAMALILMQVFFTVVGASGNIWSVINTAPSIFMFALVQIAIHLAVILGLGKLFRFD 406

Query: 813  LKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 715
             KLLLIASNANV             GWSSL+VP
Sbjct: 407  QKLLLIASNANVGGPTTACGMATAKGWSSLIVP 439


>ref|XP_004242762.1| PREDICTED: uncharacterized protein LOC101267717 [Solanum
            lycopersicum]
          Length = 462

 Score =  523 bits (1348), Expect = e-145
 Identities = 301/452 (66%), Positives = 333/452 (73%), Gaps = 6/452 (1%)
 Frame = -3

Query: 1971 MASKLLSVLPTHY-PLPPEY---RPFIHSGRNIPLFQDSSRLTSYKYKGKVLISPFKMPK 1804
            MA K L  L   Y P P  Y   R    +  +  + Q    L+    K K L  P    +
Sbjct: 13   MALKQLLFLHNPYIPSPASYSCRRKNASAATSSTVLQHPMLLSMNIDKFKPLDFPKNSTR 72

Query: 1803 KSSRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGL 1627
            K +RSV    SQLNFPIISPQD WGTWT LFATGAFGIWSEKTKIG+ALSG+LVS LVGL
Sbjct: 73   KLNRSVTTIRSQLNFPIISPQDQWGTWTVLFATGAFGIWSEKTKIGAALSGSLVSVLVGL 132

Query: 1626 AASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIG 1447
            AASNLGIIASE PAY +V  F         L+RADMRRV++STGTLL+AFLLGSVATTIG
Sbjct: 133  AASNLGIIASEAPAYKIVTGFLLPLAVPLLLFRADMRRVLKSTGTLLMAFLLGSVATTIG 192

Query: 1446 TAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVIC 1267
            T VAF +VPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQ SPSV+A+GLAADN+IC
Sbjct: 193  TVVAFFIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQTSPSVVASGLAADNLIC 252

Query: 1266 AVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAICKTASFLT 1087
            AVYFTTLFALASKIPAE + S SD ++  ES SGNKLPVLQTATALA SFAICK    LT
Sbjct: 253  AVYFTTLFALASKIPAEAAQSVSDDKV--ESESGNKLPVLQTATALAVSFAICKAGELLT 310

Query: 1086 RYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS-GSIRNV 910
            ++FGIQGG LP ITAIVVILAT+FP QFAYLAPSGEAMA+ILMQVFFT +GAS GSI NV
Sbjct: 311  KHFGIQGGLLPIITAIVVILATSFPSQFAYLAPSGEAMALILMQVFFTFIGASGGSISNV 370

Query: 909  ISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWS 730
            ++TAPSIFLF+L+QIAVHLA+ILG GKL R +LK LLIASNANV             GW 
Sbjct: 371  LNTAPSIFLFALIQIAVHLAVILGVGKLLRLELKELLIASNANVGGPTTACGMATAKGWI 430

Query: 729  SLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 634
            SLVVP                   GQ VLKF+
Sbjct: 431  SLVVPGILAGIFGIAIATFLGIAFGQTVLKFI 462


>emb|CDP05152.1| unnamed protein product [Coffea canephora]
          Length = 459

 Score =  521 bits (1341), Expect = e-144
 Identities = 276/355 (77%), Positives = 302/355 (85%), Gaps = 1/355 (0%)
 Frame = -3

Query: 1776 SQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGLAASNLGIIAS 1597
            SQL++PIISPQDHWGTWTALFATGAFGIWSE+TKIGS LSGALVS LVGLAASNLGII  
Sbjct: 78   SQLSYPIISPQDHWGTWTALFATGAFGIWSERTKIGSTLSGALVSILVGLAASNLGIIPC 137

Query: 1596 ETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTAVAFLMVPM 1417
            + PAY +V+           L+RAD+RRVI+STGTLLLAFLLGSVATT+GTAVAFL+VPM
Sbjct: 138  DAPAYKIVLQILLPMAVPLLLFRADLRRVIQSTGTLLLAFLLGSVATTLGTAVAFLLVPM 197

Query: 1416 RSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFAL 1237
            RSLGQDGWKIAAALMGRHIGGAVNYVAISEAL V+PSVLAAGLAADNVICA+YFTTLFAL
Sbjct: 198  RSLGQDGWKIAAALMGRHIGGAVNYVAISEALGVTPSVLAAGLAADNVICAIYFTTLFAL 257

Query: 1236 ASKIPAEPSTSTSDVELNKE-SGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGS 1060
            AS IP E ST+T+D +   + S SGNKLPVL TATALA SFAICK  S   +YFGI GGS
Sbjct: 258  ASGIPPEASTATTDADAGYDISESGNKLPVLPTATALAVSFAICKAGSSFAKYFGISGGS 317

Query: 1059 LPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVISTAPSIFLF 880
            LPAITAIVVILAT FP+ FA+LAPSGEAMA+ILMQVFFTVVGASGS+ NVI+TAPSI LF
Sbjct: 318  LPAITAIVVILATVFPRLFAHLAPSGEAMALILMQVFFTVVGASGSMWNVINTAPSILLF 377

Query: 879  SLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 715
            +LVQIAVHLA+ILG GKLFRFDLKLLL+ASNANV             GWSSLVVP
Sbjct: 378  ALVQIAVHLAVILGLGKLFRFDLKLLLLASNANVGGPTTACGMATAKGWSSLVVP 432


>ref|XP_011043859.1| PREDICTED: uncharacterized protein LOC105139195 isoform X1 [Populus
            euphratica] gi|743901093|ref|XP_011043860.1| PREDICTED:
            uncharacterized protein LOC105139195 isoform X1 [Populus
            euphratica]
          Length = 452

 Score =  520 bits (1340), Expect = e-144
 Identities = 287/425 (67%), Positives = 331/425 (77%), Gaps = 6/425 (1%)
 Frame = -3

Query: 1971 MASKLLSVLPTHYPLPPEYRPFIHSGRNIPLFQDSSR---LTSYKYKGKV-LISPFKMPK 1804
            MAS+L  +   H P+ P  RP   S +N P    +     L S  Y  +   +SP K P 
Sbjct: 1    MASRLPLL---HSPVVPFRRPCFVSRQNSPTTTANPTRRTLLSANYGNQTSFLSPQKNPN 57

Query: 1803 --KSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVG 1630
              +SS +V ++  LNFP+ISP D WG WTALFATGAFGIWSE+TKIGSALSGALVSTLVG
Sbjct: 58   LIRSSVTVRSNMILNFPLISPTDPWGMWTALFATGAFGIWSERTKIGSALSGALVSTLVG 117

Query: 1629 LAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTI 1450
            LAASNLGII+ E+PAY+ V+ F         L+RAD+RRVI+STGTLLLAFLLGSVATT+
Sbjct: 118  LAASNLGIISCESPAYSTVLKFLLPLAVPLLLFRADLRRVIQSTGTLLLAFLLGSVATTV 177

Query: 1449 GTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVI 1270
            GT +A++MVPMR+LGQD WKIAAALMGRHIGGAVNYVAIS+AL VSPSVLAAGLAADNVI
Sbjct: 178  GTVLAYMMVPMRALGQDSWKIAAALMGRHIGGAVNYVAISDALGVSPSVLAAGLAADNVI 237

Query: 1269 CAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAICKTASFL 1090
            CAVYFT+LFALASKIPAE S S     ++  S SGNKLPVLQTATALA SFAICK   ++
Sbjct: 238  CAVYFTSLFALASKIPAESSASIDGSGMDSGSESGNKLPVLQTATALAVSFAICKAGEYI 297

Query: 1089 TRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNV 910
            T++F I GG LPA+TAIVVILAT FP QF +LAPSGEA+A+ILMQVFF VVGASG++RNV
Sbjct: 298  TKFFAIPGGILPAVTAIVVILATAFPTQFNHLAPSGEALALILMQVFFAVVGASGNVRNV 357

Query: 909  ISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWS 730
            I+TAPSIF+F+LVQIA+HLA+ILG GKLFRFD KLLLIASNANV             GWS
Sbjct: 358  INTAPSIFMFALVQIAIHLAVILGLGKLFRFDQKLLLIASNANVGGPTTACGMATAKGWS 417

Query: 729  SLVVP 715
            SLVVP
Sbjct: 418  SLVVP 422


>ref|XP_009628873.1| PREDICTED: uncharacterized protein LOC104119150 [Nicotiana
            tomentosiformis]
          Length = 452

 Score =  520 bits (1338), Expect = e-144
 Identities = 297/458 (64%), Positives = 337/458 (73%), Gaps = 12/458 (2%)
 Frame = -3

Query: 1971 MASKLLSVLPTHYPLPPEYRPFIHSGRNIPLFQDSSRLTSYK--YKGKVLIS------PF 1816
            MASKL  +   + P P  Y P     +N+P    +S +TS     +  +L+S      P 
Sbjct: 1    MASKLWFLHNLYIPPPASYSP---RRQNVPA---ASAITSANTILQHPMLLSNIDKYTPL 54

Query: 1815 KMPKKS---SRSVIA-SSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGAL 1648
              PK S   +RSV    SQLNFPIISPQD WGTWTALFATGAFGIWSEKTK+G ALSGAL
Sbjct: 55   DFPKSSKKLNRSVTTIRSQLNFPIISPQDQWGTWTALFATGAFGIWSEKTKVGGALSGAL 114

Query: 1647 VSTLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLG 1468
            VSTLVGLAASNLGIIA E PAY +V  F         L+RADMRRV++STGTLLLAFLLG
Sbjct: 115  VSTLVGLAASNLGIIACEAPAYKIVTGFLLPLAVPLLLFRADMRRVLQSTGTLLLAFLLG 174

Query: 1467 SVATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGL 1288
            SVATTIGT VAF +VPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEAL+ SPSV+ AGL
Sbjct: 175  SVATTIGTVVAFWIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALETSPSVVTAGL 234

Query: 1287 AADNVICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAIC 1108
            AADN+ICAVYFTTLFALASKIPAE + S ++ +++ ES SGN LPVLQ+ATALA SFAIC
Sbjct: 235  AADNLICAVYFTTLFALASKIPAEATPSAAEDKIDGESESGNTLPVLQSATALAVSFAIC 294

Query: 1107 KTASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGAS 928
            K   FLT++F IQGG+LP ITAIVVILAT+FP QFA LAPSGEAMA+ILMQVFF  +GA+
Sbjct: 295  KAGDFLTKHFVIQGGTLPIITAIVVILATSFPTQFADLAPSGEAMALILMQVFFAFIGAN 354

Query: 927  GSIRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXX 748
            GSI NV++TAPSIF+F LVQI VHLA+ILG GKL RF+L+ LLIASNANV          
Sbjct: 355  GSILNVMNTAPSIFVFVLVQIGVHLAVILGVGKLLRFELEQLLIASNANVGGPTTACGMA 414

Query: 747  XXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 634
               GW SLVVP                   GQ +LKFM
Sbjct: 415  TAKGWISLVVPGILAGIFGITIATFLGIAFGQVILKFM 452


>ref|XP_007038793.1| Keratin-associated protein 5-4 [Theobroma cacao]
            gi|508776038|gb|EOY23294.1| Keratin-associated protein
            5-4 [Theobroma cacao]
          Length = 466

 Score =  520 bits (1338), Expect = e-144
 Identities = 272/362 (75%), Positives = 305/362 (84%)
 Frame = -3

Query: 1800 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGLAA 1621
            ++R V   SQLNFP+ISP D WGTWTALFA GAFGIWSEKTKIGSALSGALVSTL+GLAA
Sbjct: 78   ANRPVTVKSQLNFPLISPNDQWGTWTALFAIGAFGIWSEKTKIGSALSGALVSTLIGLAA 137

Query: 1620 SNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTA 1441
            SNLGII+ E  AY+ V+ F         L+RAD+RRVI+STG LLLAFLLGSVATT+GTA
Sbjct: 138  SNLGIISCEAKAYSTVLEFLLPLAVPLLLFRADLRRVIKSTGKLLLAFLLGSVATTVGTA 197

Query: 1440 VAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 1261
            +A+L+VPMR+LGQD WKIAAALMGRHIGGAVNYVAIS AL VSPSVLAAGLAADNVICAV
Sbjct: 198  LAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAISNALGVSPSVLAAGLAADNVICAV 257

Query: 1260 YFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAICKTASFLTRY 1081
            YFTTLFALASK+P E STS  DV + + S SG+KLPVLQ ATALA SF+ICK  ++LT+Y
Sbjct: 258  YFTTLFALASKVPPETSTSPEDVAMVEGSESGSKLPVLQIATALAVSFSICKLGAYLTKY 317

Query: 1080 FGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVIST 901
            FGI GGSLPA+TAIVVILAT FP QF  LAP+GEAMA+ILMQVFFTVVGASG+I NVI+T
Sbjct: 318  FGIPGGSLPAVTAIVVILATVFPTQFGRLAPAGEAMALILMQVFFTVVGASGNIWNVINT 377

Query: 900  APSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 721
            APSIF+F+LVQIA+HLA+ILG GKLFRFDLKLLLIASNANV             GWSS+V
Sbjct: 378  APSIFMFALVQIAIHLALILGLGKLFRFDLKLLLIASNANVGGPTTACGMATAKGWSSMV 437

Query: 720  VP 715
            VP
Sbjct: 438  VP 439


>ref|XP_006358300.1| PREDICTED: uncharacterized protein LOC102584987 [Solanum tuberosum]
          Length = 453

 Score =  519 bits (1337), Expect = e-144
 Identities = 283/402 (70%), Positives = 314/402 (78%), Gaps = 6/402 (1%)
 Frame = -3

Query: 1821 PFKMPKKSSRSVIAS-----SQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALS 1657
            P   PK S+R +  S     SQLNFPIISPQD WGTWT LFATGAFGIWSEKTK+G+ALS
Sbjct: 54   PLDFPKNSTRKLNRSVTTIRSQLNFPIISPQDQWGTWTVLFATGAFGIWSEKTKVGAALS 113

Query: 1656 GALVSTLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAF 1477
            G+LVS LVGLAASNLGIIASE PAY +V  F         L+RADMRRV++STGTLLLAF
Sbjct: 114  GSLVSVLVGLAASNLGIIASEAPAYKIVTGFLLPLAVPLLLFRADMRRVLKSTGTLLLAF 173

Query: 1476 LLGSVATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLA 1297
            LLGSVATTIGT VAF +VPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQ SPSV+A
Sbjct: 174  LLGSVATTIGTVVAFCIVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQTSPSVVA 233

Query: 1296 AGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASF 1117
            +GLAADN+ICAVYFTTLFAL SKIPAE + S +D +++ E  SGNKLPVLQTATALA SF
Sbjct: 234  SGLAADNLICAVYFTTLFALTSKIPAEATQSATDDKVDSE--SGNKLPVLQTATALAVSF 291

Query: 1116 AICKTASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVV 937
            AICK    LT++FGIQGG LP ITAIVVILAT+FP QFAYLAPSGEAMA+ILMQVFFT +
Sbjct: 292  AICKAGELLTKHFGIQGGLLPTITAIVVILATSFPSQFAYLAPSGEAMALILMQVFFTFI 351

Query: 936  GAS-GSIRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXX 760
            GAS GSI NV++TAPSIFLF+ +QIAVHLA+ILG GKL + +LK LLIASNANV      
Sbjct: 352  GASGGSISNVLNTAPSIFLFAFIQIAVHLAVILGVGKLLQLELKELLIASNANVGGPTTA 411

Query: 759  XXXXXXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 634
                   GW S+VVP                   GQAVLKFM
Sbjct: 412  CGMATAKGWISMVVPGILAGIFGIAIATFLGIAFGQAVLKFM 453


>ref|XP_002305244.2| hypothetical protein POPTR_0004s07750g [Populus trichocarpa]
            gi|550340557|gb|EEE85755.2| hypothetical protein
            POPTR_0004s07750g [Populus trichocarpa]
          Length = 452

 Score =  516 bits (1328), Expect = e-143
 Identities = 282/422 (66%), Positives = 329/422 (77%), Gaps = 7/422 (1%)
 Frame = -3

Query: 1959 LLSVLP-THYPLPPEYRPFIHSGRNIPLFQDS----SRLTSYKYKGKVLISPFKMPK--K 1801
            + S+LP  H P+ P  R    S +N  +   +    + L +        +SP K P   +
Sbjct: 1    MASILPFLHSPVVPSRRSCFISRQNTLITTANPTRRTLLPANNGNQTSFLSPQKNPNLIR 60

Query: 1800 SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGLAA 1621
            SS +V ++  LNFP+ISP D WG WTALFATGAFGIWSE+TKIGSALSGALVSTLVGLAA
Sbjct: 61   SSVTVRSNLILNFPLISPTDPWGMWTALFATGAFGIWSERTKIGSALSGALVSTLVGLAA 120

Query: 1620 SNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTA 1441
            SNLGII+ E+PAY++V+ F         L+RAD+RRVI+STGTLLLAFLLGSVATT+GT 
Sbjct: 121  SNLGIISCESPAYSIVLKFLLPLAVPLLLFRADLRRVIQSTGTLLLAFLLGSVATTVGTV 180

Query: 1440 VAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAV 1261
            +A++MVPMR+LGQD WKIAAALMGRHIGGAVNYVAIS+AL+VSPSVLAAGLAADNVICAV
Sbjct: 181  LAYMMVPMRALGQDSWKIAAALMGRHIGGAVNYVAISDALRVSPSVLAAGLAADNVICAV 240

Query: 1260 YFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAICKTASFLTRY 1081
            YFT+LFALASKIPAE S S     ++  S SGNKLPVLQTATALA SFAICK   ++T++
Sbjct: 241  YFTSLFALASKIPAESSASIDGSGMDSGSESGNKLPVLQTATALAVSFAICKAGEYITKF 300

Query: 1080 FGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVIST 901
            F I GG LPA+TAIVVILAT FP QF +LAPSGEA+A+ILMQVFF VVGASG++ NVI+T
Sbjct: 301  FAIPGGILPAVTAIVVILATAFPTQFNHLAPSGEALALILMQVFFAVVGASGNVWNVINT 360

Query: 900  APSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLV 721
            APSIFLF+LVQIA+HLA+ILG GKLFRFD KLLLIASNANV             GWSSLV
Sbjct: 361  APSIFLFALVQIAIHLAVILGLGKLFRFDQKLLLIASNANVGGPTTACGMATAKGWSSLV 420

Query: 720  VP 715
            VP
Sbjct: 421  VP 422


>ref|XP_010255250.1| PREDICTED: uncharacterized protein LOC104595990 [Nelumbo nucifera]
          Length = 457

 Score =  514 bits (1325), Expect = e-143
 Identities = 269/378 (71%), Positives = 309/378 (81%), Gaps = 2/378 (0%)
 Frame = -3

Query: 1842 KGKVLISPFKMPKKSSR--SVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIG 1669
            + K LISP  +PK      S+   +QLNFP+ISP+DHWGTWTALFAT AFGIWSEKTKIG
Sbjct: 53   RSKTLISPLTIPKNHGPVPSLKTRAQLNFPLISPKDHWGTWTALFATSAFGIWSEKTKIG 112

Query: 1668 SALSGALVSTLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTL 1489
            SALSG+LVS LVGLAASN+GII+ E PAY+VVM +         L+RAD+RRVI STGTL
Sbjct: 113  SALSGSLVSILVGLAASNIGIISCEAPAYSVVMEYLLPMAVPLLLFRADLRRVIMSTGTL 172

Query: 1488 LLAFLLGSVATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSP 1309
            LLAFLLGSVATTIGT VA+L+VPMRSLGQD WKIAAALMGRHIGGAVNYVAISEAL V+P
Sbjct: 173  LLAFLLGSVATTIGTLVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAISEALGVTP 232

Query: 1308 SVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATAL 1129
            SVLAAGLAADNVICA+YFT+LFALAS IP E S ST D  ++ +S  GNKLPVLQTA A+
Sbjct: 233  SVLAAGLAADNVICAIYFTSLFALASNIPPEASKSTEDGVIDAKSEPGNKLPVLQTAIAI 292

Query: 1128 AASFAICKTASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVF 949
            A SF+ICKTA++LT+  GIQGGSLP ITA+VVILAT FP QF YLAP+GEA+A+ILMQVF
Sbjct: 293  AVSFSICKTATYLTKLLGIQGGSLPCITALVVILATIFPAQFGYLAPAGEAVALILMQVF 352

Query: 948  FTVVGASGSIRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXX 769
            F VVGA+GSI NVI+TAPS+F+F+L+QI +HLA+ILG GKL RFD KLLL+ASNANV   
Sbjct: 353  FAVVGANGSIWNVINTAPSVFMFALLQITIHLAVILGVGKLLRFDQKLLLLASNANVGGP 412

Query: 768  XXXXXXXXXXGWSSLVVP 715
                      GW SLV+P
Sbjct: 413  TTACGMATAKGWGSLVIP 430


>ref|XP_010255249.1| PREDICTED: uncharacterized protein LOC104595989 [Nelumbo nucifera]
          Length = 458

 Score =  514 bits (1324), Expect = e-142
 Identities = 272/377 (72%), Positives = 309/377 (81%), Gaps = 3/377 (0%)
 Frame = -3

Query: 1836 KVLISPFKMPKKS---SRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEKTKIGS 1666
            K  +SP   PK +   +RSV   +QL+FP+ISP+DHWGTWTALF + AFGIWSEKTK+GS
Sbjct: 55   KTFLSPSTFPKGNPDLNRSVKTKAQLSFPLISPKDHWGTWTALFVSSAFGIWSEKTKVGS 114

Query: 1665 ALSGALVSTLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLL 1486
            ALSGALVSTLVGL ASNLGII+ E PAY++VM +         L+RAD+RRVI STGTLL
Sbjct: 115  ALSGALVSTLVGLGASNLGIISCEAPAYSLVMEYLLPMAVPLLLFRADLRRVILSTGTLL 174

Query: 1485 LAFLLGSVATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPS 1306
             AFLLGSVATTIGT VA+LMVPMRSLG D WKIAAALMGRHIGGAVNYVAISEAL VSPS
Sbjct: 175  SAFLLGSVATTIGTIVAYLMVPMRSLGHDNWKIAAALMGRHIGGAVNYVAISEALAVSPS 234

Query: 1305 VLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALA 1126
            VLAAGLAADNVICA+YFT+LFALAS+IP E +T T+D  ++ ES  GNKLPVLQTATALA
Sbjct: 235  VLAAGLAADNVICAIYFTSLFALASQIPPESTTPTNDDVIDTESQIGNKLPVLQTATALA 294

Query: 1125 ASFAICKTASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFF 946
             SFAICKT ++L++  GIQGG+LP ITAIVVILAT FP QF YLAP+GEA+A+ILMQVFF
Sbjct: 295  VSFAICKTGTYLSKLLGIQGGNLPCITAIVVILATIFPAQFGYLAPAGEAVALILMQVFF 354

Query: 945  TVVGASGSIRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXX 766
             VVGA+GSI NVI+TAPSIF+FSLVQIAVHLA+ILG GKL +FD KLLL+ASNANV    
Sbjct: 355  AVVGANGSIWNVINTAPSIFMFSLVQIAVHLAVILGVGKLMQFDQKLLLLASNANVGGPA 414

Query: 765  XXXXXXXXXGWSSLVVP 715
                     GW SLVVP
Sbjct: 415  TACGMASTKGWGSLVVP 431


>ref|XP_008234847.1| PREDICTED: uncharacterized protein LOC103333733 isoform X1 [Prunus
            mume]
          Length = 463

 Score =  513 bits (1322), Expect = e-142
 Identities = 271/399 (67%), Positives = 312/399 (78%), Gaps = 1/399 (0%)
 Frame = -3

Query: 1827 ISPFKMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEK-TKIGSALSGA 1651
            +SP   P    RSV    QLN P+IS  D WGTWTALFATGAFGIWSEK TK+G+ALSGA
Sbjct: 65   LSPPAPPDLGDRSVAVRFQLNAPLISSHDQWGTWTALFATGAFGIWSEKNTKVGAALSGA 124

Query: 1650 LVSTLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLL 1471
            LVSTL+GLAASNLGII+S  PA+++V+ F         LYRAD+RRVI+STG LLLAFLL
Sbjct: 125  LVSTLIGLAASNLGIISSNAPAFSIVLEFLLPLAVPLLLYRADLRRVIKSTGALLLAFLL 184

Query: 1470 GSVATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAG 1291
            GSVATT+GT VA+L+VPMRSLGQD WKIAAALMGRHIGGAVNYVAI++AL VSPS+LAAG
Sbjct: 185  GSVATTVGTVVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIADALGVSPSILAAG 244

Query: 1290 LAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAI 1111
            LAADNVICAVYF+TLFALASK+P EPSTS   +E +  S  GNKLP++QTATAL+ S AI
Sbjct: 245  LAADNVICAVYFSTLFALASKVPPEPSTSDDGIEKDASSEPGNKLPLIQTATALSVSLAI 304

Query: 1110 CKTASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGA 931
            CK+  +LT+YFGIQGG LPA+TAIVV LAT FPKQFAYLAP+GEAMA+ILMQVFF+VVGA
Sbjct: 305  CKSGHYLTKYFGIQGGILPAVTAIVVTLATVFPKQFAYLAPTGEAMAVILMQVFFSVVGA 364

Query: 930  SGSIRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXX 751
            SG+I NVI+TAPSIF F+L+QIAVHLA+ILG GKL  FDLKLLLIASNANV         
Sbjct: 365  SGNIWNVINTAPSIFFFALIQIAVHLAVILGLGKLMGFDLKLLLIASNANVGGPTTACGM 424

Query: 750  XXXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 634
                 W+S++VP                   G AVLK+M
Sbjct: 425  ATAKEWNSMIVPGILAGIFGIAIATFIGIAFGLAVLKYM 463


>ref|XP_012090197.1| PREDICTED: uncharacterized protein LOC105648428 [Jatropha curcas]
            gi|643706105|gb|KDP22237.1| hypothetical protein
            JCGZ_26068 [Jatropha curcas]
          Length = 459

 Score =  511 bits (1316), Expect = e-142
 Identities = 277/392 (70%), Positives = 313/392 (79%), Gaps = 2/392 (0%)
 Frame = -3

Query: 1884 PLFQDSSRLTSYKYKGKVLISPFKMPKKSS--RSVIASSQLNFPIISPQDHWGTWTALFA 1711
            P  Q SS   S   +    +SP    + SS  RSV   S LNFP+ISP D WGTWTALFA
Sbjct: 43   PALQSSS--ISLGNRSHTFLSPELYTEDSSSLRSVAVRSNLNFPLISPGDRWGTWTALFA 100

Query: 1710 TGAFGIWSEKTKIGSALSGALVSTLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLY 1531
            TGAFGIWSEKTKIGSALSGALVSTLVGLAASNLGII+ E+PAY +V+ F         L+
Sbjct: 101  TGAFGIWSEKTKIGSALSGALVSTLVGLAASNLGIISCESPAYPIVLEFLLPLAVPLLLF 160

Query: 1530 RADMRRVIRSTGTLLLAFLLGSVATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGA 1351
            RAD+RRVI+STGTLLLAFL+GSVATT+GT VA+ +VPMRSLGQD WKIAAALMGRHIGGA
Sbjct: 161  RADLRRVIQSTGTLLLAFLIGSVATTVGTLVAYWIVPMRSLGQDSWKIAAALMGRHIGGA 220

Query: 1350 VNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNKESG 1171
            VNYVAIS+AL VS SVLA+GLAADNVICAVYFTTLFALASKIP E S ST+D  +  E+ 
Sbjct: 221  VNYVAISDALGVSSSVLASGLAADNVICAVYFTTLFALASKIPPESSVSTNDGAIESETE 280

Query: 1170 SGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLA 991
              +KLPVL+ ATA+A SFAICK  SF+T+ FGIQGG LPA+TAIVVILAT FP QF  LA
Sbjct: 281  PSDKLPVLKIATAIAVSFAICKAGSFVTKLFGIQGGILPAVTAIVVILATAFPTQFNQLA 340

Query: 990  PSGEAMAMILMQVFFTVVGASGSIRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDL 811
            PSGEA+A+ILMQVFFTVVGASG+I +VI+TAPSIF+F+LVQI VHLA+ILG GKLFRFDL
Sbjct: 341  PSGEAIALILMQVFFTVVGASGNIWSVINTAPSIFMFALVQITVHLAVILGLGKLFRFDL 400

Query: 810  KLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 715
            KLLL+ASNANV             GW+SLVVP
Sbjct: 401  KLLLLASNANVGGPTTACGMATAKGWNSLVVP 432


>ref|XP_002513660.1| conserved hypothetical protein [Ricinus communis]
            gi|223547568|gb|EEF49063.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 965

 Score =  511 bits (1315), Expect = e-141
 Identities = 281/407 (69%), Positives = 315/407 (77%), Gaps = 3/407 (0%)
 Frame = -3

Query: 1926 PPEYRPFIHSGRNIPL-FQDSSRLTSYKYKGKVLISPFKMP--KKSSRSVIASSQLNFPI 1756
            P  Y+ F    +  PL F  +    S   + +  +SP   P    S RS+   S LNFP+
Sbjct: 33   PQSYQSF----KIYPLHFHSNDNDNSNNNRNQTFLSPQLYPGDPSSRRSLAVRSNLNFPL 88

Query: 1755 ISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGLAASNLGIIASETPAYNV 1576
            IS  D WGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGLA SNLGII+ E+PAY V
Sbjct: 89   ISSNDRWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGLAGSNLGIISCESPAYAV 148

Query: 1575 VMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTAVAFLMVPMRSLGQDG 1396
            V+ F         L+RAD+RRVIRSTGTLLLAFLLGSVATT+GT VA+ +VPMRSLGQD 
Sbjct: 149  VLEFLLPLAVPLLLFRADLRRVIRSTGTLLLAFLLGSVATTVGTVVAYWIVPMRSLGQDS 208

Query: 1395 WKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAE 1216
            WKIAAALMGRHIGGAVNYVAI++AL VS SVLA+GLAADNVICAVYFTTLFALASKIPAE
Sbjct: 209  WKIAAALMGRHIGGAVNYVAIADALGVSSSVLASGLAADNVICAVYFTTLFALASKIPAE 268

Query: 1215 PSTSTSDVELNKESGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAITAIV 1036
             STS+++  +   S SG KLPVLQ AT+LA S AICK  S++T+ FGIQGG LPA+TAIV
Sbjct: 269  TSTSSNEDGMESGSVSGEKLPVLQLATSLAVSLAICKAGSYVTKLFGIQGGILPAVTAIV 328

Query: 1035 VILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVISTAPSIFLFSLVQIAVH 856
            VILAT FP QF  LAPSGEAMA+ILMQVFFTVVGASG+I NV+ TAPSIF+F+LVQIAVH
Sbjct: 329  VILATAFPTQFNGLAPSGEAMALILMQVFFTVVGASGNIWNVVKTAPSIFMFALVQIAVH 388

Query: 855  LAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 715
            L IILG GKLFRFD KLLL+ASNANV             GWSSLVVP
Sbjct: 389  LVIILGLGKLFRFDQKLLLLASNANVGGPTTACGMATAKGWSSLVVP 435



 Score =  476 bits (1224), Expect = e-131
 Identities = 258/410 (62%), Positives = 304/410 (74%), Gaps = 11/410 (2%)
 Frame = -3

Query: 1911 PFIHSGRNIPLFQDSSRLTSYKYKGKV--------LISPFKMPKKSS---RSVIASSQLN 1765
            P +HS  +  L   S  L+ +  + K+          SP  +   ++   R +   SQL 
Sbjct: 529  PLLHSSCSPSLRISSRHLSPFSSRHKLSHPNINEAAFSPSTISLNNTSLIRQIKLRSQLR 588

Query: 1764 FPIISPQDHWGTWTALFATGAFGIWSEKTKIGSALSGALVSTLVGLAASNLGIIASETPA 1585
            FP+ISP DHWGTWTALFATGAFGIWSE TK+GS +S ALVSTLVGLAASN+GII  ET A
Sbjct: 589  FPLISPDDHWGTWTALFATGAFGIWSEGTKVGSMVSAALVSTLVGLAASNIGIIPYETAA 648

Query: 1584 YNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLLGSVATTIGTAVAFLMVPMRSLG 1405
            Y++V+ F         L+RAD+R VIRSTG L LAFLLGSVAT IGT VAFLMVPMRSLG
Sbjct: 649  YSLVLEFLLPLTVPLLLFRADLRNVIRSTGKLFLAFLLGSVATIIGTTVAFLMVPMRSLG 708

Query: 1404 QDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAGLAADNVICAVYFTTLFALASKI 1225
             D WKIAAALMG +IGG+VNYVAISEAL  SPSV+AAG+AADNVICA YF  LFALASKI
Sbjct: 709  PDNWKIAAALMGSYIGGSVNYVAISEALGTSPSVVAAGIAADNVICATYFMALFALASKI 768

Query: 1224 PAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAICKTASFLTRYFGIQGGSLPAIT 1045
            PAE S ST+ VE++ ES S  K+PVLQ A ALA SF IC+TA++LT+   +QGG+LPAIT
Sbjct: 769  PAENSASTNGVEMDVESSSTGKIPVLQMAAALAISFMICRTATYLTQLCKVQGGNLPAIT 828

Query: 1044 AIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGASGSIRNVISTAPSIFLFSLVQI 865
            AIVV LAT+FP QF  LAP+G+ +A++LMQVFF VVGASGSI NVI TAPSIFLF+LVQ+
Sbjct: 829  AIVVFLATSFPVQFGRLAPAGDTIALVLMQVFFAVVGASGSIWNVIKTAPSIFLFALVQL 888

Query: 864  AVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXXXXXXGWSSLVVP 715
             VHLA++LG G+LF FDLKLLL+ASNAN+             GW SLVVP
Sbjct: 889  TVHLAVVLGLGRLFDFDLKLLLLASNANIGGPTTACGMATAKGWKSLVVP 938


>gb|KHG26673.1| putative membrane yjcL [Gossypium arboreum]
          Length = 464

 Score =  509 bits (1312), Expect = e-141
 Identities = 270/386 (69%), Positives = 314/386 (81%), Gaps = 2/386 (0%)
 Frame = -3

Query: 1866 SRLTSYKYKGKVLISPFKMPKK--SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGI 1693
            SRL   K + +  +SP  + K   ++R++I  SQLN P+ISP D WGTWTALFATGAFG+
Sbjct: 53   SRLLPLK-RTQTFLSPKWLDKNPDATRTLIVKSQLNSPLISPNDQWGTWTALFATGAFGL 111

Query: 1692 WSEKTKIGSALSGALVSTLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRR 1513
            WSE TK GSALSGALVSTL+GLAASNLGII+SE  AY++V  F         L+RAD+RR
Sbjct: 112  WSENTKAGSALSGALVSTLIGLAASNLGIISSEAKAYSIVKEFLLPLAVPLLLFRADLRR 171

Query: 1512 VIRSTGTLLLAFLLGSVATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAI 1333
            VI+STG LLLAFLLGSVATT+GTA+A+L+VPMR+LGQD WKIAAALMGRHIGGAVNYVAI
Sbjct: 172  VIKSTGKLLLAFLLGSVATTVGTALAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAI 231

Query: 1332 SEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLP 1153
            S AL+ S SVLAAGLAADNVICAVYFTTLFALASK+PAE STS  DV + + S S  KLP
Sbjct: 232  SNALETSESVLAAGLAADNVICAVYFTTLFALASKVPAETSTSPEDVAMGEGSISDGKLP 291

Query: 1152 VLQTATALAASFAICKTASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAM 973
            VL+ ATALA SFAICK  ++LT+YFGI GG LPA+TAIVVILAT FP QF +LAPSGEAM
Sbjct: 292  VLKIATALAVSFAICKLGAYLTKYFGIPGGILPAVTAIVVILATVFPAQFGHLAPSGEAM 351

Query: 972  AMILMQVFFTVVGASGSIRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIA 793
            A+ILMQVFFTVVGASG+I +VI TAPSIF+F+LVQI++HLA+ILG GKLF+FDLKLLLIA
Sbjct: 352  ALILMQVFFTVVGASGNIWSVIRTAPSIFMFALVQISIHLALILGLGKLFKFDLKLLLIA 411

Query: 792  SNANVXXXXXXXXXXXXXGWSSLVVP 715
            SNANV             GWSS+++P
Sbjct: 412  SNANVGGPTTASGMATAKGWSSMIIP 437


>ref|XP_012484138.1| PREDICTED: uncharacterized protein LOC105798567 [Gossypium raimondii]
            gi|763766946|gb|KJB34161.1| hypothetical protein
            B456_006G051100 [Gossypium raimondii]
          Length = 464

 Score =  509 bits (1310), Expect = e-141
 Identities = 269/386 (69%), Positives = 313/386 (81%), Gaps = 2/386 (0%)
 Frame = -3

Query: 1866 SRLTSYKYKGKVLISPFKMPKK--SSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGI 1693
            SRL   K + +  +SP  + K   ++R++I  SQLN P+ISP D WGTWTALFATGAFG+
Sbjct: 53   SRLLPLK-RTQTFLSPKWLDKNPDATRTLIVKSQLNCPLISPNDQWGTWTALFATGAFGL 111

Query: 1692 WSEKTKIGSALSGALVSTLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRR 1513
            WSE TK GSALSGALVSTL+GLAASNLGII+SE   Y++V  F         L+RAD+RR
Sbjct: 112  WSENTKAGSALSGALVSTLIGLAASNLGIISSEAKVYSIVKEFLLPLAVPLLLFRADLRR 171

Query: 1512 VIRSTGTLLLAFLLGSVATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAI 1333
            VI+STG LLLAFLLGSVATT+GTA+A+L+VPMR+LGQD WKIAAALMGRHIGGAVNYVAI
Sbjct: 172  VIKSTGKLLLAFLLGSVATTVGTALAYLIVPMRALGQDSWKIAAALMGRHIGGAVNYVAI 231

Query: 1332 SEALQVSPSVLAAGLAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLP 1153
            S AL+ S SVLAAGLAADNVICAVYFTTLFALASK+PAE STS  DV + + S S  KLP
Sbjct: 232  SNALETSESVLAAGLAADNVICAVYFTTLFALASKVPAETSTSPEDVAMGEGSKSDGKLP 291

Query: 1152 VLQTATALAASFAICKTASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAM 973
            VL+ ATALA SFAICK  ++LT+YFGI GG LPA+TAIVVILAT FP QF +LAPSGEAM
Sbjct: 292  VLKIATALAVSFAICKLGAYLTKYFGIPGGILPAVTAIVVILATVFPTQFGHLAPSGEAM 351

Query: 972  AMILMQVFFTVVGASGSIRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIA 793
            A+ILMQVFFTVVGASG+I +VI TAPSIF+F+LVQI++HLA+ILG GKLF+FDLKLLLIA
Sbjct: 352  ALILMQVFFTVVGASGNIWSVIRTAPSIFMFALVQISIHLALILGLGKLFKFDLKLLLIA 411

Query: 792  SNANVXXXXXXXXXXXXXGWSSLVVP 715
            SNANV             GWSS+++P
Sbjct: 412  SNANVGGPTTASGMATAKGWSSMIIP 437


>ref|XP_007217974.1| hypothetical protein PRUPE_ppa005389mg [Prunus persica]
            gi|462414436|gb|EMJ19173.1| hypothetical protein
            PRUPE_ppa005389mg [Prunus persica]
          Length = 463

 Score =  506 bits (1303), Expect = e-140
 Identities = 267/399 (66%), Positives = 308/399 (77%), Gaps = 1/399 (0%)
 Frame = -3

Query: 1827 ISPFKMPKKSSRSVIASSQLNFPIISPQDHWGTWTALFATGAFGIWSEK-TKIGSALSGA 1651
            +SP   P    RSV    QLN P+IS  D WGTWTALFATGAFGIWSEK TK+G+ALSGA
Sbjct: 65   LSPPAPPNLGDRSVAVRFQLNAPLISSHDQWGTWTALFATGAFGIWSEKNTKVGAALSGA 124

Query: 1650 LVSTLVGLAASNLGIIASETPAYNVVMAFXXXXXXXXXLYRADMRRVIRSTGTLLLAFLL 1471
            LVSTL+GLAASNLGII+S  PA+++V+ F         LYRAD+RRVI+STG LLLAFLL
Sbjct: 125  LVSTLIGLAASNLGIISSNAPAFSIVLEFLLPLAVPLLLYRADLRRVIKSTGALLLAFLL 184

Query: 1470 GSVATTIGTAVAFLMVPMRSLGQDGWKIAAALMGRHIGGAVNYVAISEALQVSPSVLAAG 1291
            GSVATT+GT VA+L+VPMRSLGQD WKIAAALMGRHIGGAVNYVAI++AL VSPS+LAAG
Sbjct: 185  GSVATTVGTVVAYLLVPMRSLGQDSWKIAAALMGRHIGGAVNYVAIADALGVSPSILAAG 244

Query: 1290 LAADNVICAVYFTTLFALASKIPAEPSTSTSDVELNKESGSGNKLPVLQTATALAASFAI 1111
            LAADNVICAVYF+TLFALASK+P EPSTS   +  +  S  GNKLP++QTA AL+ S AI
Sbjct: 245  LAADNVICAVYFSTLFALASKVPPEPSTSDDGIRKDASSEPGNKLPLIQTAAALSVSLAI 304

Query: 1110 CKTASFLTRYFGIQGGSLPAITAIVVILATTFPKQFAYLAPSGEAMAMILMQVFFTVVGA 931
            CK+  +LT+YFGIQGG LPA+TAIVV LAT FPKQFAYLAP+GEAMA+ILMQVFF VVGA
Sbjct: 305  CKSGHYLTKYFGIQGGILPAVTAIVVTLATVFPKQFAYLAPTGEAMAVILMQVFFAVVGA 364

Query: 930  SGSIRNVISTAPSIFLFSLVQIAVHLAIILGFGKLFRFDLKLLLIASNANVXXXXXXXXX 751
            SG+I +VI+TAPSIF F+L+QIAVHL +ILG GKL  FDLKLLLIASNANV         
Sbjct: 365  SGNIWSVINTAPSIFFFALIQIAVHLVVILGLGKLLGFDLKLLLIASNANVGGPTTACGM 424

Query: 750  XXXXGWSSLVVPXXXXXXXXXXXXXXXXXXXGQAVLKFM 634
                 W+S++VP                   G AVLK+M
Sbjct: 425  ATAKEWNSMIVPGILAGIFGIAIATFIGIAFGLAVLKYM 463

