BLASTX nr result

ID: Forsythia22_contig00008759 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00008759
         (1661 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011097551.1| PREDICTED: uncharacterized protein LOC105176...   258   1e-65
ref|XP_011100044.1| PREDICTED: uncharacterized protein LOC105178...   233   2e-61
ref|XP_012845331.1| PREDICTED: uncharacterized protein LOC105965...   228   1e-56
ref|XP_012853562.1| PREDICTED: uncharacterized protein LOC105973...   197   2e-47
gb|EYU23966.1| hypothetical protein MIMGU_mgv1a017990mg, partial...   197   2e-47
emb|CDP01491.1| unnamed protein product [Coffea canephora]            192   7e-46
ref|XP_006353221.1| PREDICTED: uncharacterized protein LOC102592...   181   6e-45
ref|XP_009604212.1| PREDICTED: uncharacterized protein LOC104099...   171   9e-43
ref|XP_007047892.1| Uncharacterized protein isoform 1 [Theobroma...   181   1e-42
ref|XP_008234961.1| PREDICTED: uncharacterized protein LOC103333...   181   2e-42
ref|XP_007205638.1| hypothetical protein PRUPE_ppa009241mg [Prun...   180   3e-42
ref|XP_007047893.1| Uncharacterized protein isoform 2 [Theobroma...   177   2e-41
ref|XP_012081607.1| PREDICTED: uncharacterized protein LOC105641...   177   2e-41
ref|XP_011022815.1| PREDICTED: uncharacterized protein LOC105124...   176   6e-41
ref|XP_009802512.1| PREDICTED: uncharacterized protein LOC104248...   172   5e-40
ref|XP_010312631.1| PREDICTED: uncharacterized protein LOC101259...   172   5e-40
ref|XP_010245835.1| PREDICTED: uncharacterized protein LOC104589...   172   7e-40
ref|XP_010027912.1| PREDICTED: uncharacterized protein LOC104418...   172   7e-40
ref|XP_010245837.1| PREDICTED: uncharacterized protein LOC104589...   171   2e-39
ref|XP_002309969.1| hypothetical protein POPTR_0007s05170g [Popu...   170   3e-39

>ref|XP_011097551.1| PREDICTED: uncharacterized protein LOC105176447 [Sesamum indicum]
          Length = 318

 Score =  258 bits (658), Expect = 1e-65
 Identities = 150/286 (52%), Positives = 182/286 (63%), Gaps = 2/286 (0%)
 Frame = +1

Query: 226  MPIAVQCLFTNMNPTSFHINLSRTHFPESPKLPFLLQNSETRFPNSKNFSTHYHLLPMTK 405
            M   ++ +FTN NPT     LS T   +S  L F    S   FPN +  ++   +  +T+
Sbjct: 1    MSFTLRAIFTNKNPTFSPATLSPTRLLKSVNLSFSATKSRIHFPNHRFLTSQCQVFSITR 60

Query: 406  IRASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCLAASAAISVYIVLNS-VTQKS 582
            I+ S  E YG ++ Q + +GST   F+ D FLS +EFL LA+SAAISVY+ L+S V Q  
Sbjct: 61   IKVSLNESYGTSESQVNGSGSTLNHFSFDAFLSTLEFLSLASSAAISVYVALSSGVQQGG 120

Query: 583  VIGWLGNKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASV-VNLMDRVDKLEE 759
            V+G +G+KILVWQ             IRRRQWRRICG GF   SAS  VNL+ RV+KLEE
Sbjct: 121  VLGRVGSKILVWQCVVLVSSVVVGAVIRRRQWRRICGAGFSRSSASYGVNLLGRVEKLEE 180

Query: 760  GLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDDFLEK 939
             LRSSATIIRVLS QLEKLGIR RV RKAL+EPIAETAALAQKNSEAT+ALAVQ+D LEK
Sbjct: 181  DLRSSATIIRVLSRQLEKLGIRVRVTRKALQEPIAETAALAQKNSEATRALAVQEDILEK 240

Query: 940  ELSEIQKXXXXXXXXXXXXXXXXXXXGKTGK*W*SKRIQHEDLNKS 1077
            EL EIQK                   GK GK W ++R+Q +D N S
Sbjct: 241  ELGEIQKVLLAMQEQQQKQLELILAIGKAGKLWETQRLQTKDQNVS 286


>ref|XP_011100044.1| PREDICTED: uncharacterized protein LOC105178294 [Sesamum indicum]
          Length = 314

 Score =  233 bits (593), Expect(2) = 2e-61
 Identities = 136/282 (48%), Positives = 177/282 (62%), Gaps = 2/282 (0%)
 Frame = +1

Query: 226  MPIAVQCLFTNMNPTSFHINLSRTHFPESPKLPFLLQNSETRFPNSKNFSTHYHLLPMTK 405
            M + +Q +  N NP++ ++ +S++ F +S K  F   NS T F N +  ++  +L   TK
Sbjct: 1    MALMLQTISGNKNPSTSNLLVSQSRFLKSLKFYFPPSNSRTHFLNCRFLTSQSYLFAATK 60

Query: 406  IRASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCLAASAAISVYIVLNSVTQKS- 582
            + ASF+EPYG ++ Q   N S+S     D FLS +EFL LA+SAA+SVYI +    QK  
Sbjct: 61   VNASFEEPYGASQNQVGGNSSSS----FDAFLSAVEFLSLASSAAVSVYIAVRCGIQKGG 116

Query: 583  VIGWLGNKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASV-VNLMDRVDKLEE 759
             +G LG+KILVWQ             IR+RQWRR+CGVG     AS   NL++RV+KLE+
Sbjct: 117  ALGLLGSKILVWQCVVLVGGLVAGAVIRQRQWRRVCGVGLSRAPASSGANLLERVEKLED 176

Query: 760  GLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDDFLEK 939
             LRSSATII+ LS +LEKLGIRFR+ RKALKEPIAET+AL QKNSEAT+ALA Q+D LEK
Sbjct: 177  DLRSSATIIQALSRRLEKLGIRFRLTRKALKEPIAETSALVQKNSEATRALAAQEDILEK 236

Query: 940  ELSEIQKXXXXXXXXXXXXXXXXXXXGKTGK*W*SKRIQHED 1065
            EL EIQK                   GK GK   +KR++  D
Sbjct: 237  ELGEIQKVLLAMQEQQQKQLELILAIGKAGKLLETKRVKSRD 278



 Score = 33.5 bits (75), Expect(2) = 2e-61
 Identities = 17/29 (58%), Positives = 21/29 (72%)
 Frame = +3

Query: 1080 SSNLVVDGVPH*ETNHIQTLARQKEANID 1166
            +S   VDGV + E N I+TLAR+KEAN D
Sbjct: 284  TSESSVDGVANLEINQIETLARKKEANND 312


>ref|XP_012845331.1| PREDICTED: uncharacterized protein LOC105965332 [Erythranthe
            guttatus] gi|604319828|gb|EYU30992.1| hypothetical
            protein MIMGU_mgv1a010697mg [Erythranthe guttata]
          Length = 304

 Score =  228 bits (580), Expect = 1e-56
 Identities = 135/270 (50%), Positives = 165/270 (61%), Gaps = 5/270 (1%)
 Frame = +1

Query: 271  SFH-INLSRTHFPESPKLPFLLQNSETRFPNSKNFSTHYHLLPMTKIRASFQEPYGNAKK 447
            +FH +  + T F +SPK PF   NS T+F   + F+TH       +I ASFQEPYG +  
Sbjct: 4    TFHTVFTNPTRFLKSPKFPFPPPNSRTQFETPRFFTTHPRRFAAARINASFQEPYGASGN 63

Query: 448  QNSSNGSTSADFNLDEFLSIIEFLCLAASAAISVYIVLNSVTQKS---VIGWLGNKILVW 618
              +S G        D FLS +EFL LA+SA  SVYI +    QK     +G +G+K LVW
Sbjct: 64   ITTSGGGGGG---YDAFLSTLEFLSLASSAGFSVYIAVKCGVQKGGGGALGVVGSKFLVW 120

Query: 619  QFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASV-VNLMDRVDKLEEGLRSSATIIRVL 795
            Q             IRRRQWRRICGVGF  G  S   +L+DRV+KLEE LRS +TII+ L
Sbjct: 121  QCVVLVIGLAAGAVIRRRQWRRICGVGFSRGPPSYGASLLDRVEKLEEDLRSVSTIIQAL 180

Query: 796  SMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDDFLEKELSEIQKXXXXX 975
            S +LEKLGIRFR+ RKALKEPIAETAAL +KNSEATQALA Q+D LEKEL EIQK     
Sbjct: 181  SRRLEKLGIRFRLTRKALKEPIAETAALVRKNSEATQALAAQEDNLEKELGEIQKVLLAM 240

Query: 976  XXXXXXXXXXXXXXGKTGK*W*SKRIQHED 1065
                          GK GK W +KR++++D
Sbjct: 241  QEQQQKQLELILAIGKAGKLWDTKRVENQD 270


>ref|XP_012853562.1| PREDICTED: uncharacterized protein LOC105973097 [Erythranthe
            guttatus]
          Length = 312

 Score =  197 bits (501), Expect = 2e-47
 Identities = 133/293 (45%), Positives = 161/293 (54%), Gaps = 7/293 (2%)
 Frame = +1

Query: 226  MPIAVQCLFTNMNPTSFHINLSRTHFPESPK----LPFLLQNSETRFPNSKNFSTHYHLL 393
            M + +Q +F N NPT+   +LS    P   K     PF   NS T+ P S       HLL
Sbjct: 1    MSLNLQSIFPNKNPTTSSASLSSPPPPHFLKPHLNFPFPPPNSPTQSPISTFSKPQPHLL 60

Query: 394  PMT---KIRASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCLAASAAISVYIVLN 564
                  KI AS  E Y       ++ G      +LD FLS +EFL LA+SAA+SVY+ + 
Sbjct: 61   LRRVGKKINASSDEAY-------AAIGVAPNPSSLDAFLSAVEFLSLASSAAVSVYVAVG 113

Query: 565  SVTQKSVIGWLGNKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASVVNLMDRV 744
                K     LG++ILVWQ             IRRRQWRRICG       + V NL  RV
Sbjct: 114  GGVLKGGGLVLGSRILVWQCVVLVGGVLVGAAIRRRQWRRICGAA--AAPSGVNNLSARV 171

Query: 745  DKLEEGLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQD 924
            +K+EE LRSSATIIRVLS QL+KLG RFRV RKALKEP++ETAALAQKNSEAT+ALA Q+
Sbjct: 172  EKVEEDLRSSATIIRVLSRQLDKLGSRFRVTRKALKEPVSETAALAQKNSEATRALAAQE 231

Query: 925  DFLEKELSEIQKXXXXXXXXXXXXXXXXXXXGKTGK*W*SKRIQHEDLNKSVR 1083
            D LE EL EIQ                    GK GK W ++ +  +D N S R
Sbjct: 232  DILENELGEIQNVLLAMQEQQQKQLELIIALGKAGKLWETRSVPTKDHNASDR 284


>gb|EYU23966.1| hypothetical protein MIMGU_mgv1a017990mg, partial [Erythranthe
            guttata]
          Length = 289

 Score =  197 bits (501), Expect = 2e-47
 Identities = 133/293 (45%), Positives = 161/293 (54%), Gaps = 7/293 (2%)
 Frame = +1

Query: 226  MPIAVQCLFTNMNPTSFHINLSRTHFPESPK----LPFLLQNSETRFPNSKNFSTHYHLL 393
            M + +Q +F N NPT+   +LS    P   K     PF   NS T+ P S       HLL
Sbjct: 1    MSLNLQSIFPNKNPTTSSASLSSPPPPHFLKPHLNFPFPPPNSPTQSPISTFSKPQPHLL 60

Query: 394  PMT---KIRASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCLAASAAISVYIVLN 564
                  KI AS  E Y       ++ G      +LD FLS +EFL LA+SAA+SVY+ + 
Sbjct: 61   LRRVGKKINASSDEAY-------AAIGVAPNPSSLDAFLSAVEFLSLASSAAVSVYVAVG 113

Query: 565  SVTQKSVIGWLGNKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASVVNLMDRV 744
                K     LG++ILVWQ             IRRRQWRRICG       + V NL  RV
Sbjct: 114  GGVLKGGGLVLGSRILVWQCVVLVGGVLVGAAIRRRQWRRICGAA--AAPSGVNNLSARV 171

Query: 745  DKLEEGLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQD 924
            +K+EE LRSSATIIRVLS QL+KLG RFRV RKALKEP++ETAALAQKNSEAT+ALA Q+
Sbjct: 172  EKVEEDLRSSATIIRVLSRQLDKLGSRFRVTRKALKEPVSETAALAQKNSEATRALAAQE 231

Query: 925  DFLEKELSEIQKXXXXXXXXXXXXXXXXXXXGKTGK*W*SKRIQHEDLNKSVR 1083
            D LE EL EIQ                    GK GK W ++ +  +D N S R
Sbjct: 232  DILENELGEIQNVLLAMQEQQQKQLELIIALGKAGKLWETRSVPTKDHNASDR 284


>emb|CDP01491.1| unnamed protein product [Coffea canephora]
          Length = 326

 Score =  192 bits (488), Expect = 7e-46
 Identities = 127/296 (42%), Positives = 168/296 (56%), Gaps = 14/296 (4%)
 Frame = +1

Query: 226  MPIAVQCLFTN-MNPTSFHINLSRTHFPESPKLPFLLQN----SETRFPNSK-NFSTHYH 387
            M +A + L T+ ++  +  IN S+TH   +P+LP L       + +RF   K +F     
Sbjct: 1    MSLASRNLLTSPISQINTTINTSKTHL-RNPRLPLLSYPKPFLNPSRFHAQKPHFLKFAT 59

Query: 388  LLPMTK-----IRASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCLAASAAISVY 552
             +P  +     I+    +  G   +++ S    + D N D FLSI+EF CL +S AIS  
Sbjct: 60   FIPPIQNHTWSIQVKSLDLDGTVGEESRSENPANWDVNFDAFLSILEFFCLVSSIAISGI 119

Query: 553  IVLNSVT---QKSVIGWLGNKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASV 723
            + +NS     Q+ V  WLG K +VWQ             IRRRQWRRIC   +    +  
Sbjct: 120  LAVNSGFLGGQRMVFRWLGEKGMVWQCVVLVAGVLVGAVIRRRQWRRICQAKYF---SRP 176

Query: 724  VNLMDRVDKLEEGLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEAT 903
            VNL++R++KLEE  +SSAT+IR LS QLEKLGIRFRV RKALKEPIAETAALAQKNSEAT
Sbjct: 177  VNLVERIEKLEENFKSSATVIRALSRQLEKLGIRFRVFRKALKEPIAETAALAQKNSEAT 236

Query: 904  QALAVQDDFLEKELSEIQKXXXXXXXXXXXXXXXXXXXGKTGK*W*SKRIQHEDLN 1071
            +ALA+Q+D LEKEL EIQK                    K+GK W +KR ++   N
Sbjct: 237  RALAIQEDILEKELGEIQKVLLAMQEQQQKQLELILAIAKSGKLWDTKREENHGNN 292


>ref|XP_006353221.1| PREDICTED: uncharacterized protein LOC102592816 [Solanum tuberosum]
          Length = 313

 Score =  181 bits (458), Expect(2) = 6e-45
 Identities = 109/213 (51%), Positives = 134/213 (62%), Gaps = 4/213 (1%)
 Frame = +1

Query: 448  QNSSNGSTSAD--FNLDEFLSIIEFLCLAASAAISVYIVLNSVTQKSVIGWLGNKILVWQ 621
            + + NG  SA+  FN D FLSI+EFLCL +SA +++   +NS    S   WLGN++L  Q
Sbjct: 67   EGTVNGQVSAEYEFNFDGFLSILEFLCLLSSAVVAIGFAVNSWVLGSQ-KWLGNRVLAAQ 125

Query: 622  FXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASV--VNLMDRVDKLEEGLRSSATIIRVL 795
                         IRRRQWRRIC   F    + +  VNL++R++K+EE LRSSATIIRVL
Sbjct: 126  CVVLVGGVIIGSVIRRRQWRRICMNKFSRSGSDLKGVNLLERIEKVEEDLRSSATIIRVL 185

Query: 796  SMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDDFLEKELSEIQKXXXXX 975
            S QLEKLGIRFRV RK LK+PI E A LAQKNSEAT+ALA+QD+ LEKEL EIQK     
Sbjct: 186  SRQLEKLGIRFRVTRKTLKDPITEAAMLAQKNSEATRALALQDERLEKELGEIQKVLLAM 245

Query: 976  XXXXXXXXXXXXXXGKTGK*W*SKRIQHEDLNK 1074
                          GKTGK + +KR   +D NK
Sbjct: 246  QDQQHKQLELILAIGKTGKLFENKRGLSQDPNK 278



 Score = 30.0 bits (66), Expect(2) = 6e-45
 Identities = 14/30 (46%), Positives = 16/30 (53%)
 Frame = +3

Query: 1083 SNLVVDGVPH*ETNHIQTLARQKEANIDSL 1172
            SN   DG P    N IQ L RQ+E N D +
Sbjct: 284  SNTAADGFPQLGVNQIQALKRQRETNNDRI 313


>ref|XP_009604212.1| PREDICTED: uncharacterized protein LOC104099043 [Nicotiana
            tomentosiformis]
          Length = 362

 Score =  171 bits (434), Expect(2) = 9e-43
 Identities = 113/249 (45%), Positives = 141/249 (56%), Gaps = 2/249 (0%)
 Frame = +1

Query: 310  SPKLPFLLQNSETRFPNSKNFSTHYHLLPMTKIRASFQEPYGNAKKQNSSNGSTSADFNL 489
            S  L F    +   FP  K        L   + +    E  G  K+Q+ +      +FN+
Sbjct: 76   STPLDFKPLKNRLCFPTQKLHLLTIESLQCHQWKVKAFESEGAVKEQSLAE----FEFNI 131

Query: 490  DEFLSIIEFLCLAASAAISVYIVLNSVTQKSVIGWLGNKILVWQFXXXXXXXXXXXXIRR 669
            D FLSI+EFLCL +SA +S+   +NS    S   WLGN++L  Q             IRR
Sbjct: 132  DAFLSILEFLCLFSSAVVSIGYAVNSWFLGSQ-KWLGNRVLAAQCVVLVGGVVIGSVIRR 190

Query: 670  RQWRRICGVGFLM-GSASV-VNLMDRVDKLEEGLRSSATIIRVLSMQLEKLGIRFRVIRK 843
            RQW RIC V F   GS S  VNL++R++KLEE LRSS T+IRVLS QLEKLGIRFR+ RK
Sbjct: 191  RQWSRICMVEFSRSGSGSRGVNLVERIEKLEEDLRSSTTLIRVLSRQLEKLGIRFRITRK 250

Query: 844  ALKEPIAETAALAQKNSEATQALAVQDDFLEKELSEIQKXXXXXXXXXXXXXXXXXXXGK 1023
             LK+P+ E A LAQKNSEAT+ALA+Q + LEKEL EIQK                   GK
Sbjct: 251  TLKDPVTEAATLAQKNSEATRALALQGEHLEKELGEIQKVLLAMQEQQHKQLELILAIGK 310

Query: 1024 TGK*W*SKR 1050
            TGK + +KR
Sbjct: 311  TGKLFENKR 319



 Score = 32.0 bits (71), Expect(2) = 9e-43
 Identities = 14/28 (50%), Positives = 18/28 (64%)
 Frame = +3

Query: 1083 SNLVVDGVPH*ETNHIQTLARQKEANID 1166
            SN  +DGVP  E N +Q+L  Q+E N D
Sbjct: 333  SNTAIDGVPQLEVNRLQSLKGQREINND 360


>ref|XP_007047892.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508700153|gb|EOX92049.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 316

 Score =  181 bits (460), Expect = 1e-42
 Identities = 125/288 (43%), Positives = 158/288 (54%), Gaps = 6/288 (2%)
 Frame = +1

Query: 226  MPIAVQCLFTNMNPTSFHINLSRTH---FPE--SPKLPFLLQNSETRFPNSKNFSTHYHL 390
            M IA Q LFT   P+S H+N +  +   FP   +  L F L NS+     ++NF      
Sbjct: 1    MSIAFQNLFT---PSSPHLNPNLKNPNSFPPITTRHLSFTLSNSQILHFRTRNFLNFKSP 57

Query: 391  LPMTKIRASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCLAASAAISVYIVLNSV 570
             P +       E   +       N   + DFNLD FLSI EFLC+ +SA +SV   ++  
Sbjct: 58   HPSSHSLLKAYESDSSIAASQEQNPIFN-DFNLDSFLSIAEFLCILSSAVVSVVGAVSG- 115

Query: 571  TQKSVIGWLGNKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASV-VNLMDRVD 747
             +  ++G +  +++VW              IRRRQWRRIC      G     +NL+ R++
Sbjct: 116  WKGVILGGIWRRVMVWGIVGLVSGVAIGAWIRRRQWRRICAETVKGGGGGKNLNLIGRIE 175

Query: 748  KLEEGLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDD 927
            KLEE LRS ATI R LS QLEKLGIRFRV RKALKEPIAETAALAQKNSEAT+ALAVQ+D
Sbjct: 176  KLEEDLRSYATITRALSRQLEKLGIRFRVTRKALKEPIAETAALAQKNSEATRALAVQED 235

Query: 928  FLEKELSEIQKXXXXXXXXXXXXXXXXXXXGKTGK*W*SKRIQHEDLN 1071
             LEKEL EIQK                   GK+GK +  KR   ++ N
Sbjct: 236  ILEKELGEIQKVLLAMQEQQGKQLELILAIGKSGKLFEDKREPSQEKN 283


>ref|XP_008234961.1| PREDICTED: uncharacterized protein LOC103333834 [Prunus mume]
          Length = 300

 Score =  181 bits (459), Expect = 2e-42
 Identities = 122/246 (49%), Positives = 144/246 (58%), Gaps = 8/246 (3%)
 Frame = +1

Query: 247 LFTNMNPTSFHINLSRTHFPESPKLPFLLQNSETRFPNSKNFSTHYHLLPMTKIRASFQE 426
           LF N +P  F + ++ T  P++P L  L   S   FP S       H LP      S   
Sbjct: 8   LFIN-SPPRFTLYIT-TSLPKTPALLSLPITSRPHFPISN------HSLPNPHNSTSLSS 59

Query: 427 PYGNAKKQNS-----SNGSTSADFNLDEFLSIIEFLCLAASAAISVYIVLNSVT---QKS 582
            +   +   S     SN   +  FNLD FLS+ EFLCLA+SA +SV   LN      +K+
Sbjct: 60  HHSRLRVYESDGTLQSNDVVNGAFNLDYFLSVAEFLCLASSALVSVGFALNCAVLSLKKT 119

Query: 583 VIGWLGNKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASVVNLMDRVDKLEEG 762
            +  +GN +L                IR RQWRRIC      G    VNL +R++KLEE 
Sbjct: 120 ALVAMGNNVLASGAVALVMAVGIGAWIRMRQWRRICRESVKGGLE--VNLFERIEKLEED 177

Query: 763 LRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDDFLEKE 942
           LRSSATIIRVLS QLEKLGIRFRV RKALKEPIAETAALAQKNSEAT+ALAVQ+D LEKE
Sbjct: 178 LRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIAETAALAQKNSEATRALAVQEDNLEKE 237

Query: 943 LSEIQK 960
           L EIQK
Sbjct: 238 LGEIQK 243


>ref|XP_007205638.1| hypothetical protein PRUPE_ppa009241mg [Prunus persica]
           gi|462401280|gb|EMJ06837.1| hypothetical protein
           PRUPE_ppa009241mg [Prunus persica]
          Length = 300

 Score =  180 bits (456), Expect = 3e-42
 Identities = 121/246 (49%), Positives = 144/246 (58%), Gaps = 8/246 (3%)
 Frame = +1

Query: 247 LFTNMNPTSFHINLSRTHFPESPKLPFLLQNSETRFPNSKNFSTHYHLLPMTKIRASFQE 426
           LF N +P  F + ++ T  P++P L  L   S   FP S       H LP      S   
Sbjct: 8   LFIN-SPPRFTLYIT-TSLPKTPALLSLPITSRPHFPISN------HSLPNPHNSTSLSS 59

Query: 427 PYGNAKKQNS-----SNGSTSADFNLDEFLSIIEFLCLAASAAISVYIVLNSVT---QKS 582
            +   +   S     SN   +  FNLD FL++ EFLCLA+SA +SV   LN      +K+
Sbjct: 60  HHSRLRVYESDGTLQSNDVVNGAFNLDYFLTVAEFLCLASSAIVSVGFALNCAVLSLKKT 119

Query: 583 VIGWLGNKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASVVNLMDRVDKLEEG 762
            +  +GN +L                IR RQWRRIC      G    VNL +R++KLEE 
Sbjct: 120 ALVAMGNSVLASGAVALVMAVGIGAWIRMRQWRRICRESVKGGLE--VNLFERIEKLEED 177

Query: 763 LRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDDFLEKE 942
           LRSSATIIRVLS QLEKLGIRFRV RKALKEPIAETAALAQKNSEAT+ALAVQ+D LEKE
Sbjct: 178 LRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIAETAALAQKNSEATRALAVQEDNLEKE 237

Query: 943 LSEIQK 960
           L EIQK
Sbjct: 238 LGEIQK 243


>ref|XP_007047893.1| Uncharacterized protein isoform 2 [Theobroma cacao]
           gi|508700154|gb|EOX92050.1| Uncharacterized protein
           isoform 2 [Theobroma cacao]
          Length = 313

 Score =  177 bits (450), Expect = 2e-41
 Identities = 118/251 (47%), Positives = 147/251 (58%), Gaps = 6/251 (2%)
 Frame = +1

Query: 226 MPIAVQCLFTNMNPTSFHINLSRTH---FPE--SPKLPFLLQNSETRFPNSKNFSTHYHL 390
           M IA Q LFT   P+S H+N +  +   FP   +  L F L NS+     ++NF      
Sbjct: 1   MSIAFQNLFT---PSSPHLNPNLKNPNSFPPITTRHLSFTLSNSQILHFRTRNFLNFKSP 57

Query: 391 LPMTKIRASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCLAASAAISVYIVLNSV 570
            P +       E   +       N   + DFNLD FLSI EFLC+ +SA +SV   ++  
Sbjct: 58  HPSSHSLLKAYESDSSIAASQEQNPIFN-DFNLDSFLSIAEFLCILSSAVVSVVGAVSG- 115

Query: 571 TQKSVIGWLGNKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASV-VNLMDRVD 747
            +  ++G +  +++VW              IRRRQWRRIC      G     +NL+ R++
Sbjct: 116 WKGVILGGIWRRVMVWGIVGLVSGVAIGAWIRRRQWRRICAETVKGGGGGKNLNLIGRIE 175

Query: 748 KLEEGLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDD 927
           KLEE LRS ATI R LS QLEKLGIRFRV RKALKEPIAETAALAQKNSEAT+ALAVQ+D
Sbjct: 176 KLEEDLRSYATITRALSRQLEKLGIRFRVTRKALKEPIAETAALAQKNSEATRALAVQED 235

Query: 928 FLEKELSEIQK 960
            LEKEL EIQK
Sbjct: 236 ILEKELGEIQK 246


>ref|XP_012081607.1| PREDICTED: uncharacterized protein LOC105641632 [Jatropha curcas]
            gi|802673470|ref|XP_012081608.1| PREDICTED:
            uncharacterized protein LOC105641632 [Jatropha curcas]
            gi|643718519|gb|KDP29713.1| hypothetical protein
            JCGZ_18648 [Jatropha curcas]
          Length = 319

 Score =  177 bits (449), Expect = 2e-41
 Identities = 122/291 (41%), Positives = 162/291 (55%), Gaps = 6/291 (2%)
 Frame = +1

Query: 223  TMPIAVQCLFTNMNPTSFHINLSRTHFPESPKLPFLLQNSETRFPNSKNFSTHYHLLPMT 402
            T PI++     N++P     +L+ +         F  Q    +  +S NF+   +  P+ 
Sbjct: 21   TTPISLSLQNPNISPRILSRHLATS---------FHCQTFRYKPKSSLNFTLKSNSFPLK 71

Query: 403  KIRASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCLAASAAISVYIVLNSV---T 573
              +      Y       SS+G     FNLD FLSI E LC+ +SA ++V   +NS    +
Sbjct: 72   AYQ------YDGVVPTPSSDG-----FNLDAFLSIAEILCIISSAVVTVCYAVNSTFLSS 120

Query: 574  QKSVIGWLG-NKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASVVNLMDRVDK 750
            +++V   +G N+ L W              IR+RQW R C V    G  SV NL++R++K
Sbjct: 121  KRTVFAVIGSNRALAWGLVVMMGGVLIGALIRKRQWLRFCRVTVREGRESV-NLVERIEK 179

Query: 751  LEEGLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDDF 930
            LEE LRSSATIIRVLS QLEKLGIRFRV RKALKEPIAETAALA+KNSEAT+ALA+Q+D 
Sbjct: 180  LEEDLRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIAETAALAKKNSEATRALAMQEDI 239

Query: 931  LEKELSEIQKXXXXXXXXXXXXXXXXXXXGKTGK*W*SKR--IQHEDLNKS 1077
            LEKEL EIQK                   GK+GK W S++   Q + LN++
Sbjct: 240  LEKELGEIQKVLLAMQEQQEKQLELILAIGKSGKLWESRQEPSQQQGLNET 290


>ref|XP_011022815.1| PREDICTED: uncharacterized protein LOC105124481 isoform X1 [Populus
            euphratica]
          Length = 317

 Score =  176 bits (445), Expect = 6e-41
 Identities = 113/252 (44%), Positives = 148/252 (58%), Gaps = 8/252 (3%)
 Frame = +1

Query: 346  TRFPNSKNFSTHYHLLPMTKIRASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCL 525
            +R  N+   S ++H  P T  ++SF       +   +     S  FNLD+FLS+ E LC+
Sbjct: 40   SRHLNTSLHSHNFHFKPQTP-KSSFNFTLKAYQSDPTIPTQDSKQFNLDQFLSVAELLCI 98

Query: 526  AASAAISV-----YIVLNSVTQKSVIGWLG-NKILVWQFXXXXXXXXXXXXIRRRQWRRI 687
             +S+ I++     Y VLNS  ++ V+G +G N    W              IRRRQW ++
Sbjct: 99   FSSSIITISYALNYTVLNS--KRGVLGVIGSNTGFAWGMVVMVSGVVIGAWIRRRQWWQV 156

Query: 688  CGVGFLMGSASVVNLMDRVDKLEEGLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAE 867
                   GS   +NL+ R++KLEE +RSSATIIRVLS QLEKLGIRFRV RKALKEPIAE
Sbjct: 157  SRETGREGSRESLNLVGRIEKLEEDVRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIAE 216

Query: 868  TAALAQKNSEATQALAVQDDFLEKELSEIQKXXXXXXXXXXXXXXXXXXXGKTGK*W*SK 1047
            TAALAQKNS+AT+ALAVQ+D LEKEL EIQK                   GK+GK W  +
Sbjct: 217  TAALAQKNSDATRALAVQEDILEKELGEIQKVLLAMQEQQQKQLELILAIGKSGKLWDKR 276

Query: 1048 R--IQHEDLNKS 1077
            R  +Q ++L K+
Sbjct: 277  REPVQEQELIKT 288


>ref|XP_009802512.1| PREDICTED: uncharacterized protein LOC104248030 [Nicotiana
            sylvestris]
          Length = 363

 Score =  172 bits (437), Expect = 5e-40
 Identities = 110/236 (46%), Positives = 138/236 (58%), Gaps = 3/236 (1%)
 Frame = +1

Query: 352  FPNSKNFSTHYHLLPMTKIRASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCLAA 531
            FP  K        L   + +    E  G+ K+Q+ +      +FN+D FLSI+EFLCL +
Sbjct: 90   FPTQKPHLLKIESLQCHQWKVKAFESEGSVKEQSLAE----FEFNIDAFLSILEFLCLFS 145

Query: 532  SAAISVYIVLNSVTQKSVIGWLGNKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMG 711
            SA +++   +NS    S   WLGN++L  Q             IRRRQW RIC   F   
Sbjct: 146  SAVVAIGYAVNSWFWGSQ-KWLGNRVLGAQCVVLVGGVIIGSVIRRRQWSRICTFEFSSR 204

Query: 712  SASV---VNLMDRVDKLEEGLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALA 882
            S S    VNL++R++KLEE LRSSAT+IRVLS QLEKLGIRFRV RK LK+P+ E AALA
Sbjct: 205  SGSGSRGVNLVERIEKLEEDLRSSATLIRVLSRQLEKLGIRFRVTRKTLKDPVTEAAALA 264

Query: 883  QKNSEATQALAVQDDFLEKELSEIQKXXXXXXXXXXXXXXXXXXXGKTGK*W*SKR 1050
            QKNSEAT+ALA+Q + LEKEL EIQK                   GKTGK + +KR
Sbjct: 265  QKNSEATRALALQGERLEKELGEIQKVLLAMQEQQHKQLELILAIGKTGKLFENKR 320


>ref|XP_010312631.1| PREDICTED: uncharacterized protein LOC101259600 [Solanum
            lycopersicum]
          Length = 310

 Score =  172 bits (437), Expect = 5e-40
 Identities = 104/213 (48%), Positives = 132/213 (61%), Gaps = 4/213 (1%)
 Frame = +1

Query: 448  QNSSNGSTSAD--FNLDEFLSIIEFLCLAASAAISVYIVLNSVTQKSVIGWLGNKILVWQ 621
            + + NG  SA+  FN D FLSI+EFLCL +SA +++   +N     S   WLGN++L  Q
Sbjct: 67   EGTVNGQVSAEYEFNFDGFLSILEFLCLLSSAVVAIGFAVNCWFLGSH-KWLGNRVLAAQ 125

Query: 622  FXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASV--VNLMDRVDKLEEGLRSSATIIRVL 795
                         IRRRQWRRIC   F    + +  VN+++R++K+EE LRSSATIIRVL
Sbjct: 126  CVVLVGGVIIGSVIRRRQWRRICMNNFSRPGSDLKGVNMLERIEKVEEDLRSSATIIRVL 185

Query: 796  SMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDDFLEKELSEIQKXXXXX 975
            S QLEKLGIRFRV RK LK+PI E A LAQKNSEAT+ALA+Q + LEKEL E+QK     
Sbjct: 186  SRQLEKLGIRFRVTRKTLKDPITEAAMLAQKNSEATRALALQGERLEKELGEVQKVLLAM 245

Query: 976  XXXXXXXXXXXXXXGKTGK*W*SKRIQHEDLNK 1074
                          GKTGK + +KR   +D N+
Sbjct: 246  QDQQHKQLELILAIGKTGKLFENKRGPSQDPNQ 278


>ref|XP_010245835.1| PREDICTED: uncharacterized protein LOC104589273 isoform X1 [Nelumbo
            nucifera]
          Length = 334

 Score =  172 bits (436), Expect = 7e-40
 Identities = 103/189 (54%), Positives = 121/189 (64%), Gaps = 6/189 (3%)
 Frame = +1

Query: 484  NLDEFLSIIEFLCLAASAAISVYIVLN-----SVTQKSVIGWLGNKILVWQFXXXXXXXX 648
            NL+ FLSI+E LC+  SA +SV   +N     S  QKS+   L N+I VWQF        
Sbjct: 99   NLEAFLSIVEVLCIVPSAVLSVGYAVNWAFFSSPLQKSLQVSLVNRIFVWQFVLLVGAVA 158

Query: 649  XXXXIRRRQWRRICGVGFLMGSA-SVVNLMDRVDKLEEGLRSSATIIRVLSMQLEKLGIR 825
                +RRRQWRRIC      G+  S VNL++R++K+EE LRSSATIIRVLS QLEKLG R
Sbjct: 159  AGALVRRRQWRRICRDTIKTGAGGSSVNLIERIEKIEEDLRSSATIIRVLSRQLEKLGTR 218

Query: 826  FRVIRKALKEPIAETAALAQKNSEATQALAVQDDFLEKELSEIQKXXXXXXXXXXXXXXX 1005
            FRV RKALKEPI +TAALAQKNSEAT++LAVQ+D LEKEL EIQK               
Sbjct: 219  FRVTRKALKEPITQTAALAQKNSEATRSLAVQEDNLEKELVEIQKVLLAMQDQQQKQLKL 278

Query: 1006 XXXXGKTGK 1032
                GK GK
Sbjct: 279  ILAIGKVGK 287


>ref|XP_010027912.1| PREDICTED: uncharacterized protein LOC104418308 [Eucalyptus grandis]
            gi|629088299|gb|KCW54552.1| hypothetical protein
            EUGRSUZ_I00513 [Eucalyptus grandis]
          Length = 306

 Score =  172 bits (436), Expect = 7e-40
 Identities = 110/230 (47%), Positives = 138/230 (60%), Gaps = 3/230 (1%)
 Frame = +1

Query: 463  GSTSADFNLDEFLSIIEFLCLAASAAISVYIVLNSVTQKSVIGWLGNKILVWQFXXXXXX 642
            G   + F+ D FLSI E LCL +SA +SV   +    +++  G  G+++L W        
Sbjct: 87   GEGVSHFDFDSFLSIAELLCLVSSAVVSVVFAV----KRAAFGAAGDRVLGWLVVALVGG 142

Query: 643  XXXXXXIRRRQWRRICGVGFLMGSASV--VNLMDRVDKLEEGLRSSATIIRVLSMQLEKL 816
                  +RRRQWRR+       G A+V  VNL++RV+KLEE LRSS T+IRVLS QLEKL
Sbjct: 143  VASGAWVRRRQWRRVFEQP---GKAAVPNVNLVERVEKLEEDLRSSTTMIRVLSRQLEKL 199

Query: 817  GIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDDFLEKELSEIQKXXXXXXXXXXXX 996
            GIRFRV RK LKEPIAETAALAQKNSEAT+ALA+Q+D LEKEL EIQK            
Sbjct: 200  GIRFRVTRKTLKEPIAETAALAQKNSEATRALAMQEDILEKELGEIQKVLLAMQDQQQKQ 259

Query: 997  XXXXXXXGKTGK*W*SKRIQH-EDLNKSVRPI*WWMEYHIKKQTTSKLWP 1143
                   GKTGK W S+R  + E  N+S        E  +K+  ++KL P
Sbjct: 260  LELILAIGKTGKLWESRRAPNPEHTNESAS----LAEEDLKQLESTKLKP 305


>ref|XP_010245837.1| PREDICTED: uncharacterized protein LOC104589273 isoform X2 [Nelumbo
           nucifera]
          Length = 277

 Score =  171 bits (433), Expect = 2e-39
 Identities = 99/165 (60%), Positives = 117/165 (70%), Gaps = 6/165 (3%)
 Frame = +1

Query: 484 NLDEFLSIIEFLCLAASAAISVYIVLN-----SVTQKSVIGWLGNKILVWQFXXXXXXXX 648
           NL+ FLSI+E LC+  SA +SV   +N     S  QKS+   L N+I VWQF        
Sbjct: 99  NLEAFLSIVEVLCIVPSAVLSVGYAVNWAFFSSPLQKSLQVSLVNRIFVWQFVLLVGAVA 158

Query: 649 XXXXIRRRQWRRICGVGFLMGSA-SVVNLMDRVDKLEEGLRSSATIIRVLSMQLEKLGIR 825
               +RRRQWRRIC      G+  S VNL++R++K+EE LRSSATIIRVLS QLEKLG R
Sbjct: 159 AGALVRRRQWRRICRDTIKTGAGGSSVNLIERIEKIEEDLRSSATIIRVLSRQLEKLGTR 218

Query: 826 FRVIRKALKEPIAETAALAQKNSEATQALAVQDDFLEKELSEIQK 960
           FRV RKALKEPI +TAALAQKNSEAT++LAVQ+D LEKEL EIQK
Sbjct: 219 FRVTRKALKEPITQTAALAQKNSEATRSLAVQEDNLEKELVEIQK 263


>ref|XP_002309969.1| hypothetical protein POPTR_0007s05170g [Populus trichocarpa]
           gi|222852872|gb|EEE90419.1| hypothetical protein
           POPTR_0007s05170g [Populus trichocarpa]
          Length = 267

 Score =  170 bits (431), Expect = 3e-39
 Identities = 114/249 (45%), Positives = 144/249 (57%), Gaps = 15/249 (6%)
 Frame = +1

Query: 259 MNPTSFHINLSRTHFPESPKLPFLLQNSET---------RFPNSKNFSTHYHLLPMTKIR 411
           M+ T++H +     F  SP    LL  S +         R  N+   S ++H  P T  +
Sbjct: 1   MSLTTYHHHHHHLLFNNSPHRITLLFTSTSLSLRNLTLSRHVNTSLHSHNFHFKPQTP-K 59

Query: 412 ASFQEPYGNAKKQNSSNGSTSADFNLDEFLSIIEFLCLAASAAISV-----YIVLNSVTQ 576
           +SF       +   +     S  FNLD FLS+ E LC+ +S+ I++     Y VLNS  +
Sbjct: 60  SSFNLTLKAYQSDPTIPTQDSKQFNLDHFLSVAELLCIFSSSIITISYALNYTVLNS--K 117

Query: 577 KSVIGWLG-NKILVWQFXXXXXXXXXXXXIRRRQWRRICGVGFLMGSASVVNLMDRVDKL 753
           + V+G +G N    W              IRRR W R+       GS   +NL+ R++KL
Sbjct: 118 RGVLGVIGSNTGFAWGMVVMVSGVVIGAWIRRRMWWRVSRETGREGSRESLNLVGRIEKL 177

Query: 754 EEGLRSSATIIRVLSMQLEKLGIRFRVIRKALKEPIAETAALAQKNSEATQALAVQDDFL 933
           EE LRSSATIIRVLS QLEKLGIRFRV RKALKEPIAETAALAQKNS+AT+ALAVQ+D L
Sbjct: 178 EEDLRSSATIIRVLSRQLEKLGIRFRVTRKALKEPIAETAALAQKNSDATRALAVQEDIL 237

Query: 934 EKELSEIQK 960
           EKEL EIQK
Sbjct: 238 EKELGEIQK 246

