BLASTX nr result

ID: Akebia24_contig00011740 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00011740
         (2521 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citr...   485   e-134
ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255...   464   e-128
ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204...   457   e-126
ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608...   451   e-124
ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma...   451   e-124
ref|XP_002517932.1| conserved hypothetical protein [Ricinus comm...   443   e-121
ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310...   435   e-119
emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]   427   e-116
ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Popu...   426   e-116
gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]     416   e-113
ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600...   403   e-109
ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arab...   392   e-106
ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prun...   390   e-105
ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261...   388   e-105
ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma...   387   e-104
ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [...   387   e-104
ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] ...   387   e-104
ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma...   385   e-104
ref|XP_007149858.1| hypothetical protein PHAVU_005G104500g [Phas...   381   e-102
ref|XP_006412978.1| hypothetical protein EUTSA_v10024944mg [Eutr...   374   e-100

>ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citrus clementina]
            gi|567910083|ref|XP_006447355.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910085|ref|XP_006447356.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910087|ref|XP_006447357.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|568831767|ref|XP_006470130.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X1 [Citrus
            sinensis] gi|568831769|ref|XP_006470131.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X2 [Citrus
            sinensis] gi|568831771|ref|XP_006470132.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X3 [Citrus
            sinensis] gi|568831773|ref|XP_006470133.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X4 [Citrus
            sinensis] gi|557549965|gb|ESR60594.1| hypothetical
            protein CICLE_v10014904mg [Citrus clementina]
            gi|557549966|gb|ESR60595.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549967|gb|ESR60596.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549968|gb|ESR60597.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
          Length = 523

 Score =  485 bits (1249), Expect = e-134
 Identities = 265/526 (50%), Positives = 333/526 (63%), Gaps = 36/526 (6%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MA RRELGFPK   ++LR+Q AR TL  VR +GH  VE+REDG  FIFFC  C APCYSD
Sbjct: 1    MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1025
              LFDHLKGNLH ER +AAK+TLLG NPWPFNDGVLF  N  E++KQ + S+  +  S  
Sbjct: 61   LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120

Query: 1026 NSNGNDNVALERVGDNENLDSHK----------CATVTNDKSLNGENCNMVIPGVLCKDV 1175
              N + N+A+ + G++  ++ ++          C   T  + +  E+C+ VIPGV  KD 
Sbjct: 121  YHNNDSNLAIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDE 180

Query: 1176 ISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVT 1355
            I  L VRFIG G+IAAR+ +  E   +I+RIWC WLGK       ED  ++  HDF IVT
Sbjct: 181  IVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPED--EDIVEIPDHDFAIVT 238

Query: 1356 FSYNYTLGRRTL-DDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKS-------- 1508
            F YNY LGR+ L DD+          + +NGEG  RK RKKSFSDPED+S+S        
Sbjct: 239  FVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRK-RKKSFSDPEDVSESLSKQYDSC 297

Query: 1509 -------------LVLGQYGDESRHA----ISXXXXXXXXXXXVASERICDICKHKILPE 1637
                         L+L +YGD+  HA                 +A+ER+CDIC+ KILP+
Sbjct: 298  GEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPD 357

Query: 1638 KDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKR 1817
            KDV+ LLN+KTG LACSSRN+NG FH+FH SCLIHWILLC+FE+ TNQ    K    S+R
Sbjct: 358  KDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKRRSRR 417

Query: 1818 KNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKAN 1997
            KN SKR +    D + I   Q SS+ CPECQG+G+N+E  +LE+PTI   ++F Y IK +
Sbjct: 418  KNGSKRVQ-ARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKIKVS 476

Query: 1998 EACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 2135
            +A  AWMK+PE LQN STG  FP  SEE  QEKV PLKLLHFY A+
Sbjct: 477  DARKAWMKNPEALQNCSTGFYFPSRSEEKFQEKVSPLKLLHFYSAE 522


>ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255678 [Vitis vinifera]
          Length = 520

 Score =  464 bits (1195), Expect = e-128
 Identities = 270/534 (50%), Positives = 336/534 (62%), Gaps = 43/534 (8%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MA R ELGF K    +LR+Q AR TLR VR++GH  VE+REDG  FIFFC  C APCYS+
Sbjct: 1    MARRTELGFLKTSASSLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSE 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLS---GSSLIVVN 1016
            S L+DHLKGNLH ERYAAAK+TLL S+PWPFNDGVLF  N  E DK LS   G+   ++ 
Sbjct: 61   SVLYDHLKGNLHSERYAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLG 120

Query: 1017 SGKNSNGNDNVALERVGD------NENLDSHK-----CATVTNDKSLN--GENCNMVIPG 1157
            + KN N   N+A+   GD      N +++ H      C     ++SLN  G NC+M+IPG
Sbjct: 121  THKNDN---NLAIVCHGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPG 177

Query: 1158 VLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKY--GDSSFHEDASKLT 1331
            V+ KD ++ LEVRF+GFG+IAAR  E   + K I++IWC W GK   GD     +   + 
Sbjct: 178  VMIKDEVTELEVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGKEEPGDG----ETVMVP 233

Query: 1332 GHDFGIVTFSYNYTLGRRTLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSL 1511
             HDF +VTF+Y+Y LGR+ L D                EG  RK RKKSFSDPEDIS+SL
Sbjct: 234  DHDFAVVTFNYHYNLGRKGLFD-----DVISMLSSSPTEGSGRK-RKKSFSDPEDISESL 287

Query: 1512 ---------------------VLGQYGDE---SRHAISXXXXXXXXXXX-VASERICDIC 1616
                                 +L +Y D+   +R   S            VA+ER+CDIC
Sbjct: 288  SNQYDSSGEDSLISNSPSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDIC 347

Query: 1617 KHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSK 1796
            +HK+LP KDV+TL+NMKTG+L CSSRNV GAFH+FHTSCLIHWILLC+FEI+TNQL   K
Sbjct: 348  QHKMLPGKDVATLMNMKTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPK 407

Query: 1797 GTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIF 1976
                S+RK+ SK +    +   + T  Q  SV CPECQG+GI +ED +LE P IP  E+F
Sbjct: 408  LRRSSRRKSGSKCNGKGKDGVIKPTTLQICSVFCPECQGTGIMIED-ELEIPNIPLSEMF 466

Query: 1977 NYNIKANEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 2138
             Y IK ++A  AWMK+PE L++ STG  FP  S ET+QEKV  LKLLHFY ADE
Sbjct: 467  KYKIKVSDAHRAWMKNPEELKHCSTGFNFPSQSGETVQEKVSSLKLLHFYSADE 520


>ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204451 [Cucumis sativus]
            gi|449475785|ref|XP_004154550.1| PREDICTED:
            uncharacterized LOC101204451 [Cucumis sativus]
          Length = 525

 Score =  457 bits (1177), Expect = e-126
 Identities = 253/530 (47%), Positives = 318/530 (60%), Gaps = 41/530 (7%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MA R ELGFPK   Y+LR+Q AR  LR VR +GH  VE+RE+G  FIFFC  C APCYSD
Sbjct: 1    MARRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELRENGKKFIFFCTLCLAPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQL------------ 989
            S LF HLKG LH ER +AAKLTLLG NPWPF+DGVLF +   E D Q+            
Sbjct: 61   SVLFSHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPIEGDNQVGISNDNHERLLE 120

Query: 990  ---SGSSLIVVNSGKNSNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIPGV 1160
               + ++L +V    NS GN N   E  G+  N++      + +     GE+C +VIPGV
Sbjct: 121  YNNNDNNLAIVKYVGNSKGNGNRQEEFNGNMRNVEDCSFENLND----GGESCPLVIPGV 176

Query: 1161 LCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHD 1340
            L K+ IS ++VR +G+G+IAAR  E   I   ++RIWC WLGK  D    E+  K+  H+
Sbjct: 177  LIKEEISDIKVRELGYGQIAARFTEKDGIFSGVSRIWCEWLGKVNDGI--ENMVKVPEHN 234

Query: 1341 FGIVTFSYNYTLGRR-TLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPED------- 1496
            + I+TF+YN  LGR+  LDD+          E  N E  R+ +RKKSFSDPED       
Sbjct: 235  YAIITFTYNVDLGRKGLLDDVKLLLSSSPGAESQNDE-NRQVKRKKSFSDPEDGSLSMSP 293

Query: 1497 --------------ISKSLVLGQYGDESRHAI----SXXXXXXXXXXXVASERICDICKH 1622
                          +  SL L  Y D+                     +A+ER+CDIC+ 
Sbjct: 294  QYDSSGEDSSASNCVMSSLSLDGYDDQILSTTVMLNKAVRRELRRQQRLAAERMCDICQQ 353

Query: 1623 KILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGT 1802
            KIL  KDV+TLLNMKTGRLACSSRNVNG FH+FHTSCLIHWILLC++EI    L  SK  
Sbjct: 354  KILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEISVKDLGGSKVR 413

Query: 1803 HGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNY 1982
               +RK  +K ++ + + + R  K Q  SV CP CQG+GI ++   LE+PT+P  EIF Y
Sbjct: 414  RRYRRKKKTKGNKHIKDGETRQIKTQIDSVFCPACQGTGITIDGDDLEKPTVPLSEIFKY 473

Query: 1983 NIKANEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRA 2132
             IK ++A  AWMK PE+LQN STG +FPY  +ETIQE V PLKLLHFY A
Sbjct: 474  KIKVSDARRAWMKSPEVLQNCSTGFQFPYQPDETIQENVKPLKLLHFYGA 523


>ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608093 isoform X5 [Citrus
            sinensis]
          Length = 508

 Score =  451 bits (1160), Expect(2) = e-124
 Identities = 247/499 (49%), Positives = 314/499 (62%), Gaps = 36/499 (7%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MA RRELGFPK   ++LR+Q AR TL  VR +GH  VE+REDG  FIFFC  C APCYSD
Sbjct: 1    MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1025
              LFDHLKGNLH ER +AAK+TLLG NPWPFNDGVLF  N  E++KQ + S+  +  S  
Sbjct: 61   LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120

Query: 1026 NSNGNDNVALERVGDNENLDSHK----------CATVTNDKSLNGENCNMVIPGVLCKDV 1175
              N + N+A+ + G++  ++ ++          C   T  + +  E+C+ VIPGV  KD 
Sbjct: 121  YHNNDSNLAIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDE 180

Query: 1176 ISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVT 1355
            I  L VRFIG G+IAAR+ +  E   +I+RIWC WLGK       ED  ++  HDF IVT
Sbjct: 181  IVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPED--EDIVEIPDHDFAIVT 238

Query: 1356 FSYNYTLGRRTL-DDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKS-------- 1508
            F YNY LGR+ L DD+          + +NGEG  RK RKKSFSDPED+S+S        
Sbjct: 239  FVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRK-RKKSFSDPEDVSESLSKQYDSC 297

Query: 1509 -------------LVLGQYGDESRHA----ISXXXXXXXXXXXVASERICDICKHKILPE 1637
                         L+L +YGD+  HA                 +A+ER+CDIC+ KILP+
Sbjct: 298  GEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPD 357

Query: 1638 KDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKR 1817
            KDV+ LLN+KTG LACSSRN+NG FH+FH SCLIHWILLC+FE+ TNQ    K    S+R
Sbjct: 358  KDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKRRSRR 417

Query: 1818 KNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKAN 1997
            KN SKR +    D + I   Q SS+ CPECQG+G+N+E  +LE+PTI   ++F Y IK +
Sbjct: 418  KNGSKRVQ-ARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKIKVS 476

Query: 1998 EACLAWMKDPEILQNRSTG 2054
            +A  AWMK+PE LQN STG
Sbjct: 477  DARKAWMKNPEALQNCSTG 495



 Score = 25.8 bits (55), Expect(2) = e-124
 Identities = 10/13 (76%), Positives = 13/13 (100%)
 Frame = +2

Query: 2087 SGKGIASKVASFL 2125
            +GKG+ASK+ASFL
Sbjct: 494  TGKGVASKIASFL 506


>ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508707510|gb|EOX99406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 517

 Score =  451 bits (1161), Expect = e-124
 Identities = 250/528 (47%), Positives = 329/528 (62%), Gaps = 34/528 (6%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1025
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1026 NSNGNDNVALERVGDNENLD-----SHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLE 1190
              +GN N  LE   +++NL        + ++   + +    + +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1191 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNY 1370
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 1371 TLGRR-TLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1511
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 1512 ---------VLGQYGDE-------SRHAISXXXXXXXXXXXVASERICDICKHKILPEKD 1643
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 1644 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 1823
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 1824 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 2003
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +   ++F Y IK ++A
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQMFRYKIKVSDA 463

Query: 2004 CLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE*KS 2147
              AWMK PE+L+N STG  F   S E +QEK+LPLKLLHFY AD+ +S
Sbjct: 464  RRAWMKSPEMLENCSTGFHFRSQSGEMVQEKILPLKLLHFYSADKYES 511


>ref|XP_002517932.1| conserved hypothetical protein [Ricinus communis]
            gi|223542914|gb|EEF44450.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 509

 Score =  443 bits (1139), Expect = e-121
 Identities = 247/519 (47%), Positives = 316/519 (60%), Gaps = 28/519 (5%)
 Frame = +3

Query: 666  MAERRELGFPK-GGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYS 842
            MA R ELGF K GG  +L++Q AR TL  VR +GH  VE+REDG  FIFFC  C APCYS
Sbjct: 1    MAGRWELGFTKTGGANSLKEQLARTTLNNVRSKGHPYVELREDGKRFIFFCTLCLAPCYS 60

Query: 843  DSALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSG 1022
            D+ LFDHLKGNLH ER + A LTLL  NPWPF+DGV F     E +KQL    +I  ++ 
Sbjct: 61   DAVLFDHLKGNLHTERLSTATLTLLKENPWPFSDGVHFFDTSSENEKQL----VIKNDNE 116

Query: 1023 KNSNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLEVRFI 1202
               NGN ++A+ + G +      +      D + NG   +++I GVL KD IS L+ RF+
Sbjct: 117  SRGNGNSSLAIVKYGGSLKPTGDEDTGCNKDANDNGRISDLLIQGVLVKDDISDLQARFM 176

Query: 1203 GFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNYTLGR 1382
            G+G I AR+ E       I+RIWC WLGK  ++    D +K+  H+F +VTF+YNY LGR
Sbjct: 177  GYGRIGARLIEKDGNSNDISRIWCEWLGK--NTPCDLDKAKVLDHEFAVVTFAYNYDLGR 234

Query: 1383 R-TLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKS----------------- 1508
            +  LDD+          E DN  G  RK RKKSFSDPED+S+S                 
Sbjct: 235  KGLLDDVKLLLSSSPVQESDNQGGTNRK-RKKSFSDPEDVSESFSNQYDSSGEESLTSIG 293

Query: 1509 -----LVLGQYGDESRHA----ISXXXXXXXXXXXVASERICDICKHKILPEKDVSTLLN 1661
                 L+L ++ D+  H+                 +A+ER+CDIC+ KILPEKDV+TL+N
Sbjct: 294  GPPTRLLLDRHDDQFLHSKVISSKTLRRELRRQHHIAAERMCDICQQKILPEKDVATLVN 353

Query: 1662 MKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKNVSKRSE 1841
            M TG+LACSSRN  G +H+FHTSCLIHWILL ++E+  NQ  + KG   S+RKN +K S 
Sbjct: 354  MNTGKLACSSRNTYGQYHVFHTSCLIHWILLSEYEMARNQSVSPKGRRKSRRKNGTKSSH 413

Query: 1842 ILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMK 2021
            +   +K +    Q SSV CPECQG+G  +E  + E PTIP  E+F Y IK  +   AWMK
Sbjct: 414  V---EKVKALNNQISSVFCPECQGTGAILEKDERELPTIPLSEMFKYKIKVGDGRRAWMK 470

Query: 2022 DPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 2138
             PE+L+N S G  FP  SE  +Q KVLPLKLLHFYRADE
Sbjct: 471  SPEVLENCSIGFHFPSQSEGAVQAKVLPLKLLHFYRADE 509


>ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310040 [Fragaria vesca
            subsp. vesca]
          Length = 525

 Score =  435 bits (1118), Expect = e-119
 Identities = 251/534 (47%), Positives = 317/534 (59%), Gaps = 43/534 (8%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MA R ++G PK    +LR+Q  R  LR VR +GH  VEVREDG  FIFFC  C APCYSD
Sbjct: 1    MAGRWDVGVPKTNACSLREQATRTILRNVRSQGHSYVEVREDGKKFIFFCTLCLAPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQL------------ 989
              LFDHLKGNLH ER AAAK+TLL  NPWPFNDGV+F  N +E DK +            
Sbjct: 61   KVLFDHLKGNLHNERLAAAKVTLLRPNPWPFNDGVVFFNNSYETDKGVVTPDDNKCRMLE 120

Query: 990  ---SGSSLIVVNSGKN--SNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIP 1154
               + ++L +V  G N  +NG D+  ++ +  NE +D     +   D + +G   ++VIP
Sbjct: 121  SHDNENNLAIVKYGGNLKTNGYDHCGVDGLECNEYIDLQGLQSNVGDSTADGAKSSVVIP 180

Query: 1155 GVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTG 1334
            G++ +D I+ LEVR +G GEIAAR          I RIWC WLG     S  ED   +  
Sbjct: 181  GIVVRDEITDLEVREVGLGEIAARFLGK----DGIGRIWCEWLGVKSIDS--EDLCNVPE 234

Query: 1335 HDFGIVTFSYNYTLGRR-TLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSL 1511
            HDF +VTFSYN  LGR+  LDD+          E  NGEG   K RKKSFSDPEDIS SL
Sbjct: 235  HDFAVVTFSYNIDLGRKGLLDDVRMLLSSSPTIESGNGEGTGCK-RKKSFSDPEDISDSL 293

Query: 1512 ---------------------VLGQYGDE---SRHAISXXXXXXXXXXX-VASERICDIC 1616
                                 +L  Y D+   +R  ++            +AS R+CDIC
Sbjct: 294  SNQYESFGEDSSASSGTASRLLLDHYDDQLLNTRFILNKSIRRELRRQQRLASGRMCDIC 353

Query: 1617 KHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSK 1796
            + ++LP KDV+TL+N+KTG+LACSSRNVNGAFH+FHTSCLIHWILLC+ E+ TNQ   SK
Sbjct: 354  QQRMLPGKDVATLMNLKTGKLACSSRNVNGAFHVFHTSCLIHWILLCEVEVITNQNTGSK 413

Query: 1797 GTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIF 1976
                S+RK  +K +    + + +   PQ  SV CPECQG+GI V+   LE+P +P  ++F
Sbjct: 414  ARRRSRRKTAAKCNG--KDAQLKSLSPQIYSVFCPECQGTGIVVDGDDLEKPNLPLSQMF 471

Query: 1977 NYNIKANEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 2138
             Y IK ++A  AWMK PE+LQN STG  FP  +   IQEKV  LKLL FYRA E
Sbjct: 472  RYKIKVSDARRAWMKSPEMLQNCSTGFHFPSLNAAGIQEKVKTLKLLRFYRAHE 525


>emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]
          Length = 896

 Score =  427 bits (1097), Expect = e-116
 Identities = 249/503 (49%), Positives = 314/503 (62%), Gaps = 43/503 (8%)
 Frame = +3

Query: 711  NLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALFDHLKGNLHKER 890
            +LR+Q AR TLR VR++GH  VE+REDG  FIFFC  C APCYS+S L+DHLKGNLH ER
Sbjct: 352  SLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSESVLYDHLKGNLHSER 411

Query: 891  YAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLS---GSSLIVVNSGKNSNGNDNVALER 1061
            YAAAK+TLL S+PWPFNDGVLF  N  E DK LS   G+   ++ + KN N   N+A+  
Sbjct: 412  YAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLGTHKNDN---NLAIVC 468

Query: 1062 VGD------NENLDSHK-----CATVTNDKSLN--GENCNMVIPGVLCKDVISSLEVRFI 1202
             GD      N +++ H      C     ++SLN  G NC+M+IPGV+ KD ++ LEVRF+
Sbjct: 469  HGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVMIKDEVTELEVRFL 528

Query: 1203 GFGEIAARIHESMEIPKKINRIWCAWLGKY--GDSSFHEDASKLTGHDFGIVTFSYNYTL 1376
            GFG+IAAR  E   + K I++IWC W GK   GD     +   +  HDF +VTF+Y+Y L
Sbjct: 529  GFGQIAARFFEKDGVSKGISKIWCEWFGKEEPGDG----ETVMVPDHDFAVVTFNYHYNL 584

Query: 1377 GRRTLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSL--------------- 1511
            GR+ L D                EG  RK RKKSFSDPEDIS+SL               
Sbjct: 585  GRKGLFD-----DVISMLSSSPTEGSGRK-RKKSFSDPEDISESLSNQYDSSGEDSLISN 638

Query: 1512 ------VLGQYGDE---SRHAISXXXXXXXXXXX-VASERICDICKHKILPEKDVSTLLN 1661
                  +L +Y D+   +R   S            VA+ER+CDIC+HK+LP KDV+TL N
Sbjct: 639  SPSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLPGKDVATLXN 698

Query: 1662 MKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKNVSKRSE 1841
            MKTG+L CSSRNV GAFH+FHTSCLIHWILLC+FEI+TNQL   K    S+RK+ SK + 
Sbjct: 699  MKTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSRRKSGSKCNG 758

Query: 1842 ILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMK 2021
               +   + T  Q  SV CPECQG+GI +ED +LE P IP  E+F Y IK ++A  AWMK
Sbjct: 759  KGKDGVIKPTTLQICSVFCPECQGTGIMIED-ELEIPNIPLSEMFKYKIKVSDAHRAWMK 817

Query: 2022 DPEILQNRSTGLRFPYNSEETIQ 2090
            +PE L++ STG  FP  S ET+Q
Sbjct: 818  NPEELKHCSTGFNFPSQSGETVQ 840


>ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa]
            gi|550325787|gb|EEE95821.2| hypothetical protein
            POPTR_0013s13670g [Populus trichocarpa]
          Length = 513

 Score =  426 bits (1096), Expect = e-116
 Identities = 234/525 (44%), Positives = 317/525 (60%), Gaps = 34/525 (6%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MA  RE+GFPK    +LR+Q AR TL +VR  GH  +E+REDG  FIFFC  C +PCYSD
Sbjct: 1    MAGNREVGFPKTTASSLREQLARTTLSRVRARGHPYLELREDGKRFIFFCTLCLSPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1025
            + L DHL+GNLH ER +AAK TLL  NPWPF+DG+ F       ++QL+      +  GK
Sbjct: 61   TILLDHLRGNLHTERLSAAKATLLKPNPWPFSDGIHFFDASSGNEEQLA------IKDGK 114

Query: 1026 NSN-------GNDNVALERVGDNENLDSHKCATVTNDKSLNG--ENCNMVIPGVLCKDVI 1178
             S+        +DN+A+ +  +N       C TV  D++L+G  E  ++VIP V  K+ +
Sbjct: 115  ESSRFLKFEENSDNLAIVKYVENLKPG---CDTVV-DENLSGSDEGSDLVIPSVRLKEEV 170

Query: 1179 SSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTF 1358
            S L+   +G G+IAAR++E  +   +I+RIWC WLGK   SS  ED  K+  HDFG+VTF
Sbjct: 171  SDLKATLVGSGQIAARMYEKKDGSNEISRIWCEWLGK--KSSNDEDKVKVLDHDFGVVTF 228

Query: 1359 SYNYTLGRRTLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSL--------- 1511
            +Y+Y LG+  L D            +   + +   +RK+S S+PED+S+SL         
Sbjct: 229  AYDYELGKSGLFDDVKLLLSSSAPALTENDERGNWKRKRSVSEPEDVSRSLTNQYGLCEE 288

Query: 1512 ------------VLGQYGDESRH----AISXXXXXXXXXXXVASERICDICKHKILPEKD 1643
                        VL +Y D+  H    +             +A+E++CDIC+ K+LPEKD
Sbjct: 289  ESSKTTCASSNLVLDRYDDQLMHTRFISNKTVRREVRKQQRIAAEKMCDICQQKMLPEKD 348

Query: 1644 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 1823
            V+TL N KTG+LACSSRNV GAFH+FHTSCLIHWIL C+FEI  NQ  ++KG   S++KN
Sbjct: 349  VATLWNRKTGKLACSSRNVYGAFHVFHTSCLIHWILYCEFEIVRNQTVSTKGGRRSRKKN 408

Query: 1824 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 2003
             +K +    +    +      SV CP+CQG+G+N+E  + E+P  P  E+F Y IK +E 
Sbjct: 409  GTKSNTTGKDGTVNVLPNPIVSVFCPDCQGTGVNIEGDEFEKPLTPLSEMFKYKIKVSEG 468

Query: 2004 CLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 2138
               WMK+PEIL+N STG  FP  S E +QEKVLPLKLLHFYR +E
Sbjct: 469  HRGWMKNPEILENCSTGFHFPSQSGEPVQEKVLPLKLLHFYRPEE 513


>gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]
          Length = 638

 Score =  416 bits (1070), Expect = e-113
 Identities = 245/539 (45%), Positives = 320/539 (59%), Gaps = 54/539 (10%)
 Frame = +3

Query: 666  MAERRELGFPKGGVY--------NLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIW 821
            MA R  LGFPK            +L+ Q  R  LR VR +GH  VE+REDG   IFFC  
Sbjct: 1    MAGRGILGFPKSNELAVSKTTSCSLKDQAKRTILRNVRSQGHTYVELREDGKKSIFFCTL 60

Query: 822  CRAPCYSDSALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEED------- 980
            C APCYSD  LFDHLKGNLH +R + AK+TLLG NPWPFNDGV+F  N  E D       
Sbjct: 61   CLAPCYSDCVLFDHLKGNLHNQRLSTAKVTLLGPNPWPFNDGVVFFNNPTENDDDTVISN 120

Query: 981  --------KQLSGSSLIVVNSGKN--SNGNDNVALERVG-DNENLDSHKCATVTNDKSLN 1127
                     Q S ++L +V  G+N  S  N ++ ++ +G  NEN DS        + + +
Sbjct: 121  GNQSRLLESQDSENNLAIVTYGENLESCANGHIMVDELGHQNENPDS------AGNLAGS 174

Query: 1128 GENCNMVIPGVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSF 1307
            GENC ++IPGV   D I+++EVR +G+G I+ R  E   +   I+RIWC WLGK   +  
Sbjct: 175  GENCAVLIPGVRAGDEIANVEVREVGYGLISVRFREKDGVSNDISRIWCEWLGK--KTIE 232

Query: 1308 HEDASKLTGHDFGIVTFSYN-YTLGRRTL-DDLNPXXXXXXXXEIDNGEGKRRKQRKKSF 1481
             ED  K+  HDF IVTFSYN ++LGR  L DD+          E+ NG+   RK R+KSF
Sbjct: 233  DEDFLKVPEHDFAIVTFSYNNFSLGRMGLHDDVKALLCSSPAAEMQNGDVSSRK-RRKSF 291

Query: 1482 SDPEDISK-------------------SLVLGQYGDE-------SRHAISXXXXXXXXXX 1583
            SDPED S+                   SL+L QY D+       S  AI           
Sbjct: 292  SDPEDSSENLSNQYDSCGEDSSASAVTSLMLDQYDDQLLQTRFISNKAIRRELRRQQR-- 349

Query: 1584 XVASERICDICKHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDF 1763
             +A+ER+CDIC+HK+LP KDV+TL+N+KTGRLACSSRN NGAFHLFHTSCLIHW+LLC+ 
Sbjct: 350  -IAAERMCDICQHKMLPGKDVATLMNVKTGRLACSSRNTNGAFHLFHTSCLIHWVLLCEV 408

Query: 1764 EIWTNQLDNSKGTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQL 1943
            E  TNQ +  K    S+RK  SK +E+L + + +  +   + VICPECQG+G  + DG+ 
Sbjct: 409  EKCTNQSEAPKVKRRSRRKAASKCNEVLNDSEVKAFRTPINRVICPECQGTGTMI-DGED 467

Query: 1944 EEPTIPPFEIFNYNIKANEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLH 2120
            E+PT+P  ++F Y IK ++A  AWMK PE+L N STG  FP  +EETIQ  ++ +  +H
Sbjct: 468  EKPTVPLSKMFKYKIKVSDARRAWMKSPEVLGNCSTGFHFPSPAEETIQVHLVYIAEIH 526


>ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600129 [Solanum tuberosum]
          Length = 521

 Score =  403 bits (1036), Expect = e-109
 Identities = 241/527 (45%), Positives = 310/527 (58%), Gaps = 41/527 (7%)
 Frame = +3

Query: 678  RELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALF 857
            R+L FP+    NL++Q  R TL+ VR +GHI VE+REDG   +FFC  C +PCYSDS LF
Sbjct: 4    RQLDFPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLVFFCTLCHSPCYSDSVLF 63

Query: 858  DHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGKNSNG 1037
            +HLKGNLH E  AAAK TLL  NPWPFNDGVLF +N  E+DK         VN GK+   
Sbjct: 64   NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLF-FNDPEQDKHSPN-----VNVGKSRLV 117

Query: 1038 N----DNVALERVGDNENLDSHKCATVTN------DKSL--NGENCNMVIPGVLCKDVIS 1181
            +    D  +L  V  ++NL  +    VT       D  L  NGE+  +VIPGVLCKD +S
Sbjct: 118  DTCLEDESSLAIVECDDNLRHNGDTYVTEYEYCLLDSELTGNGESEYLVIPGVLCKDELS 177

Query: 1182 SLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFS 1361
             LEV+ IG G+IAARI       KKI RIWC WL K        D S +  HDF +VTF 
Sbjct: 178  DLEVKHIGIGKIAARISVRGIDSKKIRRIWCEWLVKKDSDDM--DTSVVPDHDFAVVTFP 235

Query: 1362 YNYTLGRRTLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSL---------- 1511
            YNY LGR+ L D           E +   G R+++RK SFSDPED S+SL          
Sbjct: 236  YNYNLGRKPLLDDRFLLPSSPYSESEETSGTRKRKRK-SFSDPEDFSESLSNHCDSSGEE 294

Query: 1512 -----------VLGQYGDE--SRHAISXXXXXXXXXXX--VASERICDICKHKILPEKDV 1646
                       +LG   D+  S   IS             VASER+CDIC+ K+LP KDV
Sbjct: 295  SQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDV 354

Query: 1647 STLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLD----NSKGTHGSK 1814
            +TLL+ K+G+L CSSRN+ GAFHLFH SCLIHWIL C+ + +   +D     +K    SK
Sbjct: 355  ATLLSWKSGKLMCSSRNMTGAFHLFHVSCLIHWILQCELQTYVKPVDEPKMETKAKRRSK 414

Query: 1815 RKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKA 1994
            RK  +K +     D+ +  + + +SV CPECQG+GI +E  +LE+P +   E++ + IK 
Sbjct: 415  RKTGTKHNAKEKEDEIKSAR-RINSVFCPECQGTGIIIEGDELEKPPVSLSEVYRHKIKL 473

Query: 1995 NEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 2135
            ++A  AWMK+PE+LQN STG   P   ++ +QE V PLKLLHFYRA+
Sbjct: 474  SDARKAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRAN 520


>ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arabidopsis lyrata subsp.
            lyrata] gi|297315349|gb|EFH45772.1| hypothetical protein
            ARALYDRAFT_491947 [Arabidopsis lyrata subsp. lyrata]
          Length = 517

 Score =  392 bits (1008), Expect = e-106
 Identities = 225/525 (42%), Positives = 311/525 (59%), Gaps = 35/525 (6%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MAE++ELG PK  + NL++Q AR TL+ +RL+GH  +E+REDG  F+FFC  C APCYSD
Sbjct: 1    MAEKKELGLPKSSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQ---LSGSSLIVVN 1016
            + L  HL GNLHKER A A+LTLLG+NPWPF+DGVLF  +   E+++   +SG + +   
Sbjct: 60   TILLGHLNGNLHKERLACARLTLLGTNPWPFSDGVLFFDSSTGEEEEKTPVSGGASVPGT 119

Query: 1017 SGKNSNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLEVR 1196
             G  S+ +D  A+ +  +N+    ++ A VT+D+  +  + +++I GVL K+    +E +
Sbjct: 120  LGHCSD-DDRFAIVKYDNNKANGGNQPAAVTDDEPSHSTD-DLLISGVLIKERTLDVEAK 177

Query: 1197 FIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNYTL 1376
            FIGFG IAAR+ E+      I+++WC WLG  G S   E+ + +  HDF IVTFSY Y L
Sbjct: 178  FIGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPSD--EEKATIPEHDFAIVTFSYFYNL 235

Query: 1377 GRRTLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSLV----------LGQY 1526
            GR  L D           E  NGE   RK RKKSFSDPED S+SL            G  
Sbjct: 236  GRLGLLDDPSRLLTTSQSESGNGEDSGRK-RKKSFSDPEDTSESLCNQYDSSEEVSSGHN 294

Query: 1527 GDESRHAISXXXXXXXXXXXVA---------------SERICDICKHKILPEKDVSTLLN 1661
             + SR  I+           V                SERIC++CK K+LP KD + +LN
Sbjct: 295  SNSSRALIADYDDSLMSKRVVKNKTVRRELRRQQRIFSERICEVCKQKMLPGKDAAAILN 354

Query: 1662 MKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKR--KNVSKR 1835
            MKTG LAC SRN+ GAFHLFH SC++HW L C+ EI  N++ + K   G KR  K+ S +
Sbjct: 355  MKTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVSGK---GKKRCTKHSSGQ 411

Query: 1836 SEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAW 2015
            + +  N+       Q  SV CPECQG+GIN+E G +E  T P  + + + +K +E   AW
Sbjct: 412  TGVKWNELANDVSWQIFSVFCPECQGTGINIEGGVIERDTFPLSQTWRFQVKVSEGRKAW 471

Query: 2016 MKDPEILQNRSTGLRFPYNSEETIQ-----EKVLPLKLLHFYRAD 2135
            +K+PE L+N STG  FP  ++E+ Q     E+V  +KL+ FYR +
Sbjct: 472  VKNPEKLKNCSTGFHFPQQADESGQIPVQEERVQMMKLVRFYRVE 516


>ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prunus persica]
            gi|462394196|gb|EMJ00100.1| hypothetical protein
            PRUPE_ppa004741mg [Prunus persica]
          Length = 493

 Score =  390 bits (1001), Expect = e-105
 Identities = 235/539 (43%), Positives = 297/539 (55%), Gaps = 49/539 (9%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MA R ELGFPK    +LR+Q  R  LR VR +GH  VE+REDG  FIFFC  C APCYSD
Sbjct: 1    MAGRWELGFPKTSASSLREQATRTILRNVRSQGHTYVELREDGKKFIFFCTLCLAPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQL------------ 989
              LFDHLKGNLHK+R AAAK+TLL  NPWPFNDGV F +N  E DK L            
Sbjct: 61   KVLFDHLKGNLHKDRLAAAKVTLLRPNPWPFNDGVAFFHNPDETDKHLVITDGNKFRMLE 120

Query: 990  ---SGSSLIVVNSGKN--SNGNDNVALERVGDNENLD------SHKCATVTNDKSLNGEN 1136
                 ++L +V  G+N  SNGN++V  + +  N +LD      + K +    + + N  N
Sbjct: 121  SPDDENNLAIVKYGENLISNGNEHVGTDGLECNGSLDFPRVRSNFKFSCSNENSTANEVN 180

Query: 1137 CNMVIPGVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHED 1316
             ++VIP VL +D ++ +E + +G G+IAAR  E  ++ K I RIWC WLGK   +  +E 
Sbjct: 181  SSVVIPSVLVRDDVTDIEAKKVGLGQIAARFLEKDKVSKGIGRIWCEWLGK--KAIGNEY 238

Query: 1317 ASKLTGHDFGIVTFSYNYTLGRR-TLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPE 1493
              K+  HDF +VTFSYN  LGRR  LDD+          E +NGEG   K RKKSFSDPE
Sbjct: 239  HLKVPEHDFAVVTFSYNIDLGRRGLLDDVKMLLSSSPSVETENGEGSGSK-RKKSFSDPE 297

Query: 1494 DISKS---------------------LVLGQYGDESRHA----ISXXXXXXXXXXXVASE 1598
            DIS+S                     L+L +Y D+  H                  +A  
Sbjct: 298  DISESLSNQYDSCGEDSSASSGASSKLLLDRYDDQLLHTRFILNKSIRRELRRQQRLALG 357

Query: 1599 RICDICKHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTN 1778
            R+CDIC+ +++P KDVS L+N+KTGRLACSSRNVNGAFH+FHTSCLIHWILLC+ EI  N
Sbjct: 358  RMCDICQQRMIPGKDVSALINLKTGRLACSSRNVNGAFHVFHTSCLIHWILLCEVEI-AN 416

Query: 1779 QLDNSKGTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTI 1958
            Q  NSK    S+RKN +K +    + +      Q  SV CPECQG+G  ++   LE+P +
Sbjct: 417  QSTNSKVRRRSRRKNAAKCNG--QDGQMTALSTQIHSVFCPECQGTGAIIDGDDLEKPNL 474

Query: 1959 PPFEIFNYNIKANEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 2135
            P                                          QEKV PLKL+HFYRAD
Sbjct: 475  P----------------------------------------LSQEKVKPLKLMHFYRAD 493


>ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261554 [Solanum
            lycopersicum]
          Length = 526

 Score =  388 bits (997), Expect = e-105
 Identities = 234/527 (44%), Positives = 304/527 (57%), Gaps = 41/527 (7%)
 Frame = +3

Query: 678  RELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALF 857
            ++L  P+    NL++Q  R TL+ VR +GHI VE+REDG   IFFC  C +PCYSDS LF
Sbjct: 4    KQLDVPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLIFFCTLCHSPCYSDSVLF 63

Query: 858  DHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGKNS-- 1031
            +HLKGNLH E  AAAK TLL  NPWPFNDGVLF +N  E+DKQ   S    VN GK+   
Sbjct: 64   NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLF-FNDPEQDKQDKQSPN--VNVGKSRLV 120

Query: 1032 ----NGNDNVALERVGDN--ENLDSH----KCATVTNDKSLNGENCNMVIPGVLCKDVIS 1181
                    +VA+    DN   N D++    +   + ++   N E+  +VIPGVLCKD +S
Sbjct: 121  DTCLEDESSVAIVEYDDNLRHNEDTYVSEYEYGLLDSELIGNEESDYLVIPGVLCKDELS 180

Query: 1182 SLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFS 1361
             LEV+ IG G+IAARI       K I RIWC WL K        D S +  HDF +VTF 
Sbjct: 181  DLEVKHIGIGKIAARISVRGIDSKSIRRIWCEWLAKKDSDDM--DTSVVPDHDFAVVTFP 238

Query: 1362 YNYTLGRRTLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSL---------- 1511
            YNY LGR  L D           E +       K+++KSFSDPED S+SL          
Sbjct: 239  YNYNLGRSPLLDDRFLLPSSPYSESEE-TSVTGKRKRKSFSDPEDFSESLSNHCDSSGEE 297

Query: 1512 -----------VLGQYGDE--SRHAISXXXXXXXXXXX--VASERICDICKHKILPEKDV 1646
                       +LG   D+  S   IS             VASER+CDIC+ K+LP KDV
Sbjct: 298  SQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDV 357

Query: 1647 STLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLD----NSKGTHGSK 1814
            +TLL+ K+G+L CSSRN++GAFHLFH SCLIHWIL C+ +     +D      K    SK
Sbjct: 358  ATLLSWKSGKLMCSSRNMSGAFHLFHVSCLIHWILQCELQTSVKPVDEPKMEPKAKRRSK 417

Query: 1815 RKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKA 1994
            +K  +K +     D+ +  + + +SV CPECQG+GI +E  +LE+P +   E++   IK 
Sbjct: 418  KKTGTKHNAKEKEDETKSAR-RINSVFCPECQGTGICIEGDELEKPPVSLSEVYRLKIKL 476

Query: 1995 NEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 2135
            ++A  AWMK+PE+LQN STG   P   ++ +QE V PLKLLHFYRA+
Sbjct: 477  SDARKAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRAN 523


>ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508707513|gb|EOX99409.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 478

 Score =  387 bits (995), Expect = e-104
 Identities = 218/481 (45%), Positives = 291/481 (60%), Gaps = 34/481 (7%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1025
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1026 NSNGNDNVALERVGDNENLD-----SHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLE 1190
              +GN N  LE   +++NL        + ++   + +    + +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1191 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNY 1370
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 1371 TLGRR-TLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1511
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 1512 ---------VLGQYGDE-------SRHAISXXXXXXXXXXXVASERICDICKHKILPEKD 1643
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 1644 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 1823
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 1824 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 2003
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +   ++   ++K    
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRC 463

Query: 2004 C 2006
            C
Sbjct: 464  C 464


>ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508707511|gb|EOX99407.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 481

 Score =  387 bits (995), Expect = e-104
 Identities = 218/481 (45%), Positives = 291/481 (60%), Gaps = 34/481 (7%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1025
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1026 NSNGNDNVALERVGDNENLD-----SHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLE 1190
              +GN N  LE   +++NL        + ++   + +    + +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1191 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNY 1370
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 1371 TLGRR-TLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1511
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 1512 ---------VLGQYGDE-------SRHAISXXXXXXXXXXXVASERICDICKHKILPEKD 1643
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 1644 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 1823
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 1824 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 2003
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +   ++   ++K    
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRC 463

Query: 2004 C 2006
            C
Sbjct: 464  C 464


>ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana]
            gi|145334149|ref|NP_001078455.1| uncharacterized protein
            [Arabidopsis thaliana] gi|7269680|emb|CAB79628.1|
            putative protein [Arabidopsis thaliana]
            gi|110742700|dbj|BAE99261.1| hypothetical protein
            [Arabidopsis thaliana] gi|332660060|gb|AEE85460.1|
            uncharacterized protein AT4G28260 [Arabidopsis thaliana]
            gi|332660061|gb|AEE85461.1| uncharacterized protein
            AT4G28260 [Arabidopsis thaliana]
          Length = 516

 Score =  387 bits (994), Expect = e-104
 Identities = 220/522 (42%), Positives = 304/522 (58%), Gaps = 32/522 (6%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MAE++ELG PK  + NL++Q AR TL+ +RL+GH  +E+REDG  F+FFC  C APCYSD
Sbjct: 1    MAEKKELGLPKPSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCF--EEDKQLSGSSLIVVNS 1019
            + L  HL GNLHKER A A++TLLG+NPWPF+DGVLF  +    EE+K        V ++
Sbjct: 60   TILLGHLNGNLHKERLACARITLLGTNPWPFSDGVLFFDSSTGEEEEKSPVSGGEGVPDT 119

Query: 1020 GKNSNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLEVRF 1199
             ++ + ++  A+ +  +N+    +  A VT+D+  +  + +++I GVL K+    +E +F
Sbjct: 120  LEHCSDDERFAIVKYDNNKTNGDNVPAAVTDDEPSHAAD-DLLISGVLIKERTLDVEAKF 178

Query: 1200 IGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNYTLG 1379
            IGFG IAAR+ E+      I+++WC WLG  G S   E+ + +  HDF IVTFSY Y LG
Sbjct: 179  IGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPSD--EEKATIPEHDFAIVTFSYFYNLG 236

Query: 1380 RRTLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSLV----------LGQYG 1529
            R  L D           E  NGE   RK RKKSFSDPED S+SL            G   
Sbjct: 237  RLGLLDDPGRLLTSSQSESGNGEDSGRK-RKKSFSDPEDTSESLCNQYDSSEEVSSGHNS 295

Query: 1530 DESRHAISXXXXXXXXXXXVA---------------SERICDICKHKILPEKDVSTLLNM 1664
            + SR  I+           V                SERIC++CK K+LP KD + +LNM
Sbjct: 296  NSSRDLIADYDDSLMSKRVVKNRTVRRELRRQQRIFSERICEVCKQKMLPGKDAAAILNM 355

Query: 1665 KTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKNVSKRSEI 1844
            KTG LAC SRN+ GAFHLFH SC++HW L C+ EI  N++ + KG     +   S ++ +
Sbjct: 356  KTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVSGKGKKRCTKH--SGQTGV 413

Query: 1845 LMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMKD 2024
              N+       Q  SV CPECQG+GIN+E   +E  T P  + + + +K +E   AW+K+
Sbjct: 414  KWNELANDVSWQIFSVFCPECQGTGINIEGAVIERDTFPLSQTWRFQVKVSEGRKAWVKN 473

Query: 2025 PEILQNRSTGLRFPYNSEETIQ-----EKVLPLKLLHFYRAD 2135
            PE L+N STG  FP  +EET Q     E+V  +KL+ FYR +
Sbjct: 474  PERLKNCSTGFHFPQQAEETEQIPVQEERVQMMKLVRFYRVE 515


>ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508707512|gb|EOX99408.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 470

 Score =  385 bits (989), Expect = e-104
 Identities = 216/465 (46%), Positives = 285/465 (61%), Gaps = 34/465 (7%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1025
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1026 NSNGNDNVALERVGDNENLD-----SHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLE 1190
              +GN N  LE   +++NL        + ++   + +    + +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1191 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNY 1370
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 1371 TLGRR-TLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1511
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 1512 ---------VLGQYGDE-------SRHAISXXXXXXXXXXXVASERICDICKHKILPEKD 1643
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 1644 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 1823
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 1824 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTI 1958
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDV 448


>ref|XP_007149858.1| hypothetical protein PHAVU_005G104500g [Phaseolus vulgaris]
            gi|561023122|gb|ESW21852.1| hypothetical protein
            PHAVU_005G104500g [Phaseolus vulgaris]
          Length = 498

 Score =  381 bits (978), Expect = e-102
 Identities = 216/517 (41%), Positives = 293/517 (56%), Gaps = 27/517 (5%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MA + ELG  K  V N ++Q AR  L+ VR +GH  VE+RE+G  FI+FC  C APCYSD
Sbjct: 1    MAGKLELGPLKSDVSNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSD 60

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1025
              LFDHLKGNLHKER +AAK+TLLG  PWPFNDG++F     E D+ L  +        K
Sbjct: 61   DVLFDHLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSIESDRDLEVADSYRNRLLK 120

Query: 1026 NSNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLEVRFIG 1205
             +N ++++A+ +  +    ++  C+T      +  + C +VIP +L +D I  ++V  +G
Sbjct: 121  FNNNDNSLAIVKFDEGVQSNAEPCST----DGMPNDECGLVIPHLLIRDEIFDVKVSEVG 176

Query: 1206 FGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNYTLGRR 1385
             G+IAAR  E       I RIWC WLGK G+    +D  ++  HDF IV F+YNY LGR 
Sbjct: 177  LGKIAARFLEKCSALSGIKRIWCEWLGKKGNDQ--QDGVEILEHDFAIVNFAYNYDLGRS 234

Query: 1386 -TLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSLVLGQYGDESRHAISXXX 1562
              LDD+           + +  G R+   K+S SD +DIS SL   QY   +  +     
Sbjct: 235  GLLDDVKSL--------LPSASGGRKG--KRSLSDSDDISDSLC-NQYDSSAEESSDSNN 283

Query: 1563 XXXXXXXX--------------------------VASERICDICKHKILPEKDVSTLLNM 1664
                                              +A+E++C+IC+ K+LP KDV+ LLN+
Sbjct: 284  SSAPLTLDQFNNHHVCTRFISSKAVRKELRRKQRLAAEKVCNICQQKMLPGKDVAALLNL 343

Query: 1665 KTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKNVSKRSEI 1844
             T R+ACSSRN  GAFH+FHTSCLIHWI+LC+FEI TN L         KRK  S   +I
Sbjct: 344  NTRRVACSSRNKTGAFHVFHTSCLIHWIILCEFEIITNHLVRPNVRRIVKRKIASDGEKI 403

Query: 1845 LMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMKD 2024
                K++  +    +V CPECQG+G+ ++   +E+P     ++F + IKA +A   WMK 
Sbjct: 404  ---GKEKDIEKHIRTVFCPECQGTGMVIDGDGVEQPEFSLSQMFKFKIKACDARREWMKS 460

Query: 2025 PEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 2135
            PEILQN STG  FP  SEE  +EKV P+ LLHFYRAD
Sbjct: 461  PEILQNCSTGFHFPSQSEEIFEEKVEPINLLHFYRAD 497


>ref|XP_006412978.1| hypothetical protein EUTSA_v10024944mg [Eutrema salsugineum]
            gi|557114148|gb|ESQ54431.1| hypothetical protein
            EUTSA_v10024944mg [Eutrema salsugineum]
          Length = 514

 Score =  374 bits (960), Expect = e-100
 Identities = 217/527 (41%), Positives = 300/527 (56%), Gaps = 37/527 (7%)
 Frame = +3

Query: 666  MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 845
            MAE +ELG PK  + +L++Q AR TLR +R +GH  +E+REDG  F+FFC  C APCYSD
Sbjct: 1    MAESKELGLPKTAI-SLKEQLARTTLRNLRSQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 846  SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCF-EEDKQLSGSSLIVVNSG 1022
            + L  HL GNLHKER + A++TLLG NPWPFNDGVLF  +   EE+K L      V    
Sbjct: 60   AILLGHLNGNLHKERLSCARITLLGENPWPFNDGVLFFDSSTGEEEKTLISDGEGVTGPL 119

Query: 1023 KNSNGNDNVALERVGDNENL----DSHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLE 1190
             + + N+  A+    +N       D+   A + ++ +   EN  +VI  +L K+    +E
Sbjct: 120  HHCSDNERFAIVTYDENRTCESQGDNQPAAGIDDEPNHCAEN--LVISNLLIKEKTLDVE 177

Query: 1191 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNY 1370
             +FIGFG IAAR+ E+      I+++WC WLG+  +S   E+ + +  HDF IVTFSY Y
Sbjct: 178  AKFIGFGRIAARLFETKGRTTWIDKLWCEWLGE--ESPPDEEKATVPEHDFAIVTFSYFY 235

Query: 1371 TLGRR-TLDDLNPXXXXXXXXEIDNGEGKRRKQRKKSFSDPEDISKSLVLGQYGDESRHA 1547
             LGR   L D +         E  NGE   RK RKKSFSDPED S+SL   QY  +S   
Sbjct: 236  NLGRLGLLADPSRLLTLSQSAESGNGEDNGRK-RKKSFSDPEDTSESLC-NQY--DSSEE 291

Query: 1548 ISXXXXXXXXXXXVA----------------------------SERICDICKHKILPEKD 1643
            +S           +A                            S+RIC++CK K+LP KD
Sbjct: 292  VSSARNSNSSRALIADYDDHLVNKRVIKNKSVRRELRKQQRIFSDRICEVCKQKMLPGKD 351

Query: 1644 VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 1823
             + +LNMKTG+LACSSRN  GAFHLFH SC++HW L C+ EI  +++ + KG     +K 
Sbjct: 352  AAAILNMKTGKLACSSRNRLGAFHLFHVSCVVHWFLFCETEILGSKMVSGKG-----KKR 406

Query: 1824 VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 2003
             +K+S +  N+       Q  SV CPECQG+GIN+E   +E  T P  + + + +K +E 
Sbjct: 407  CTKQSGVKWNELVGDVSWQIFSVFCPECQGTGINIEGDVIERDTFPLSQTWRFGVKVSEG 466

Query: 2004 CLAWMKDPEILQNRSTGLRFPYNSEETI---QEKVLPLKLLHFYRAD 2135
              AW+K+PE L+N STG  FP   EE +   +++V  +KL+ FYR +
Sbjct: 467  RKAWVKNPEKLENCSTGFHFPQQDEELVKGQEDRVQSMKLVRFYRVE 513

