BLASTX nr result

ID: Akebia22_contig00012194 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00012194
         (1930 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citr...   485   e-134
ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255...   464   e-128
ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608...   451   e-125
ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204...   453   e-124
ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma...   453   e-124
ref|XP_002517932.1| conserved hypothetical protein [Ricinus comm...   438   e-120
ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310...   427   e-117
emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]   426   e-116
ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Popu...   423   e-115
gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]     412   e-112
ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600...   402   e-109
ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma...   389   e-105
ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [...   389   e-105
ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arab...   387   e-105
ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261...   387   e-104
ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma...   386   e-104
ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prun...   385   e-104
ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] ...   382   e-103
ref|XP_007149858.1| hypothetical protein PHAVU_005G104500g [Phas...   379   e-102
ref|XP_006412978.1| hypothetical protein EUTSA_v10024944mg [Eutr...   374   e-101

>ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citrus clementina]
            gi|567910083|ref|XP_006447355.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910085|ref|XP_006447356.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910087|ref|XP_006447357.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|568831767|ref|XP_006470130.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X1 [Citrus
            sinensis] gi|568831769|ref|XP_006470131.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X2 [Citrus
            sinensis] gi|568831771|ref|XP_006470132.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X3 [Citrus
            sinensis] gi|568831773|ref|XP_006470133.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X4 [Citrus
            sinensis] gi|557549965|gb|ESR60594.1| hypothetical
            protein CICLE_v10014904mg [Citrus clementina]
            gi|557549966|gb|ESR60595.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549967|gb|ESR60596.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549968|gb|ESR60597.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
          Length = 523

 Score =  485 bits (1248), Expect = e-134
 Identities = 268/526 (50%), Positives = 336/526 (63%), Gaps = 36/526 (6%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MA RRELGFPK   ++LR+Q AR TL  VR +GH  VE+REDG  FIFFC  C APCYSD
Sbjct: 1    MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487
              LFDHLKGNLH ER +AAK+TLLG NPWPFNDGVLF  N  E++KQ + S+  +  S  
Sbjct: 61   LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120

Query: 1486 NSNGNDNVALERVGDNENLDSHK----------CATVTIEKSLNGENCNMVIPGVLCKDV 1337
              N + N+A+ + G++  ++ ++          C   T  + +  E+C+ VIPGV  KD 
Sbjct: 121  YHNNDSNLAIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDE 180

Query: 1336 ISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVT 1157
            I  L VRFIG G+IAAR+ +  E   +I+RIWC WLGK  DP   ED  ++  HDF IVT
Sbjct: 181  IVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGK-KDPE-DEDIVEIPDHDFAIVT 238

Query: 1156 FSYNYTLGRRTL-DDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKS-------- 1004
            F YNY LGR+ L DD+          + +NGEG  RK RKKSFSDPED+S+S        
Sbjct: 239  FVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRK-RKKSFSDPEDVSESLSKQYDSC 297

Query: 1003 -------------LVLGQYGDESRHA----ISXXXXXXXXXXRVASERICDICKHKILPE 875
                         L+L +YGD+  HA                R+A+ER+CDIC+ KILP+
Sbjct: 298  GEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPD 357

Query: 874  KDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKR 695
            KDV+ LLN+KTG LACSSRN+NG FH+FH SCLIHWILLC+FE+ TNQ    K    S+R
Sbjct: 358  KDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKRRSRR 417

Query: 694  KNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKAN 515
            KN SKR +    D + I   Q SS+ CPECQG+G+N+E  +LE+PTI   ++F Y IK +
Sbjct: 418  KNGSKRVQ-ARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKIKVS 476

Query: 514  EACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 377
            +A  AWMK+PE LQN STG  FP  SEE  QEKV PLKLLHFY A+
Sbjct: 477  DARKAWMKNPEALQNCSTGFYFPSRSEEKFQEKVSPLKLLHFYSAE 522


>ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255678 [Vitis vinifera]
          Length = 520

 Score =  464 bits (1194), Expect = e-128
 Identities = 271/532 (50%), Positives = 337/532 (63%), Gaps = 41/532 (7%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MA R ELGF K    +LR+Q AR TLR VR++GH  VE+REDG  FIFFC  C APCYS+
Sbjct: 1    MARRTELGFLKTSASSLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSE 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLS---GSSLIVVN 1496
            S L+DHLKGNLH ERYAAAK+TLL S+PWPFNDGVLF  N  E DK LS   G+   ++ 
Sbjct: 61   SVLYDHLKGNLHSERYAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLG 120

Query: 1495 SGKNSNGNDNVALERVGD------NENLDSHK-----CATVTIEKSLN--GENCNMVIPG 1355
            + KN N   N+A+   GD      N +++ H      C      +SLN  G NC+M+IPG
Sbjct: 121  THKNDN---NLAIVCHGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPG 177

Query: 1354 VLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGH 1175
            V+ KD ++ LEVRF+GFG+IAAR  E   + K I++IWC W GK  +P   E    +  H
Sbjct: 178  VMIKDEVTELEVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGKE-EPGDGETVM-VPDH 235

Query: 1174 DFGIVTFSYNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL-- 1001
            DF +VTF+Y+Y LGR+ L D          L     EG  RK RKKSFSDPEDIS+SL  
Sbjct: 236  DFAVVTFNYHYNLGRKGLFD-----DVISMLSSSPTEGSGRK-RKKSFSDPEDISESLSN 289

Query: 1000 -------------------VLGQYGDE---SRHAISXXXXXXXXXXR-VASERICDICKH 890
                               +L +Y D+   +R   S          + VA+ER+CDIC+H
Sbjct: 290  QYDSSGEDSLISNSPSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQH 349

Query: 889  KILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGT 710
            K+LP KDV+TL+NMKTG+L CSSRNV GAFH+FHTSCLIHWILLC+FEI+TNQL   K  
Sbjct: 350  KMLPGKDVATLMNMKTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLR 409

Query: 709  HGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNY 530
              S+RK+ SK +    +   + T  Q  SV CPECQG+GI +ED +LE P IP  E+F Y
Sbjct: 410  RSSRRKSGSKCNGKGKDGVIKPTTLQICSVFCPECQGTGIMIED-ELEIPNIPLSEMFKY 468

Query: 529  NIKANEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 374
             IK ++A  AWMK+PE L++ STG  FP  S ET+QEKV  LKLLHFY ADE
Sbjct: 469  KIKVSDAHRAWMKNPEELKHCSTGFNFPSQSGETVQEKVSSLKLLHFYSADE 520


>ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608093 isoform X5 [Citrus
            sinensis]
          Length = 508

 Score =  451 bits (1159), Expect(2) = e-125
 Identities = 250/499 (50%), Positives = 317/499 (63%), Gaps = 36/499 (7%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MA RRELGFPK   ++LR+Q AR TL  VR +GH  VE+REDG  FIFFC  C APCYSD
Sbjct: 1    MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487
              LFDHLKGNLH ER +AAK+TLLG NPWPFNDGVLF  N  E++KQ + S+  +  S  
Sbjct: 61   LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120

Query: 1486 NSNGNDNVALERVGDNENLDSHK----------CATVTIEKSLNGENCNMVIPGVLCKDV 1337
              N + N+A+ + G++  ++ ++          C   T  + +  E+C+ VIPGV  KD 
Sbjct: 121  YHNNDSNLAIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDE 180

Query: 1336 ISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVT 1157
            I  L VRFIG G+IAAR+ +  E   +I+RIWC WLGK  DP   ED  ++  HDF IVT
Sbjct: 181  IVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGK-KDPE-DEDIVEIPDHDFAIVT 238

Query: 1156 FSYNYTLGRRTL-DDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKS-------- 1004
            F YNY LGR+ L DD+          + +NGEG  RK RKKSFSDPED+S+S        
Sbjct: 239  FVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRK-RKKSFSDPEDVSESLSKQYDSC 297

Query: 1003 -------------LVLGQYGDESRHA----ISXXXXXXXXXXRVASERICDICKHKILPE 875
                         L+L +YGD+  HA                R+A+ER+CDIC+ KILP+
Sbjct: 298  GEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPD 357

Query: 874  KDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKR 695
            KDV+ LLN+KTG LACSSRN+NG FH+FH SCLIHWILLC+FE+ TNQ    K    S+R
Sbjct: 358  KDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKRRSRR 417

Query: 694  KNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKAN 515
            KN SKR +    D + I   Q SS+ CPECQG+G+N+E  +LE+PTI   ++F Y IK +
Sbjct: 418  KNGSKRVQ-ARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKIKVS 476

Query: 514  EACLAWMKDPELLQNRSTG 458
            +A  AWMK+PE LQN STG
Sbjct: 477  DARKAWMKNPEALQNCSTG 495



 Score = 25.8 bits (55), Expect(2) = e-125
 Identities = 10/13 (76%), Positives = 13/13 (100%)
 Frame = -3

Query: 425 SGKGIASKVASFL 387
           +GKG+ASK+ASFL
Sbjct: 494 TGKGVASKIASFL 506


>ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204451 [Cucumis sativus]
            gi|449475785|ref|XP_004154550.1| PREDICTED:
            uncharacterized LOC101204451 [Cucumis sativus]
          Length = 525

 Score =  453 bits (1166), Expect = e-124
 Identities = 254/530 (47%), Positives = 319/530 (60%), Gaps = 41/530 (7%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MA R ELGFPK   Y+LR+Q AR  LR VR +GH  VE+RE+G  FIFFC  C APCYSD
Sbjct: 1    MARRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELRENGKKFIFFCTLCLAPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQL------------ 1523
            S LF HLKG LH ER +AAKLTLLG NPWPF+DGVLF +   E D Q+            
Sbjct: 61   SVLFSHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPIEGDNQVGISNDNHERLLE 120

Query: 1522 ---SGSSLIVVNSGKNSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIPGV 1352
               + ++L +V    NS GN N   E  G+  N++   C+   +     GE+C +VIPGV
Sbjct: 121  YNNNDNNLAIVKYVGNSKGNGNRQEEFNGNMRNVED--CSFENLND--GGESCPLVIPGV 176

Query: 1351 LCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHD 1172
            L K+ IS ++VR +G+G+IAAR  E   I   ++RIWC WLGK  D    E+  K+  H+
Sbjct: 177  LIKEEISDIKVRELGYGQIAARFTEKDGIFSGVSRIWCEWLGKVNDGI--ENMVKVPEHN 234

Query: 1171 FGIVTFSYNYTLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPED------- 1016
            + I+TF+YN  LGR+  LDD+          E  N E  R+ +RKKSFSDPED       
Sbjct: 235  YAIITFTYNVDLGRKGLLDDVKLLLSSSPGAESQNDE-NRQVKRKKSFSDPEDGSLSMSP 293

Query: 1015 --------------ISKSLVLGQYGDESRHAI----SXXXXXXXXXXRVASERICDICKH 890
                          +  SL L  Y D+                    R+A+ER+CDIC+ 
Sbjct: 294  QYDSSGEDSSASNCVMSSLSLDGYDDQILSTTVMLNKAVRRELRRQQRLAAERMCDICQQ 353

Query: 889  KILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGT 710
            KIL  KDV+TLLNMKTGRLACSSRNVNG FH+FHTSCLIHWILLC++EI    L   K  
Sbjct: 354  KILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEISVKDLGGSKVR 413

Query: 709  HGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNY 530
               +RK  +K ++ + + + R  K Q  SV CP CQG+GI ++   LE+PT+P  EIF Y
Sbjct: 414  RRYRRKKKTKGNKHIKDGETRQIKTQIDSVFCPACQGTGITIDGDDLEKPTVPLSEIFKY 473

Query: 529  NIKANEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRA 380
             IK ++A  AWMK PE+LQN STG +FPY  +ETIQE V PLKLLHFY A
Sbjct: 474  KIKVSDARRAWMKSPEVLQNCSTGFQFPYQPDETIQENVKPLKLLHFYGA 523


>ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508707510|gb|EOX99406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 517

 Score =  453 bits (1165), Expect = e-124
 Identities = 252/528 (47%), Positives = 327/528 (61%), Gaps = 34/528 (6%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1486 NSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENC-----NMVIPGVLCKDVISSLE 1322
              +GN N  LE   +++NL   +     +       NC     +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1321 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNY 1142
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 1141 TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1001
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 1000 ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 869
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 868  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKN 689
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 688  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 509
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +   ++F Y IK ++A
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQMFRYKIKVSDA 463

Query: 508  CLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE*KS 365
              AWMK PE+L+N STG  F   S E +QEK+LPLKLLHFY AD+ +S
Sbjct: 464  RRAWMKSPEMLENCSTGFHFRSQSGEMVQEKILPLKLLHFYSADKYES 511


>ref|XP_002517932.1| conserved hypothetical protein [Ricinus communis]
            gi|223542914|gb|EEF44450.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 509

 Score =  438 bits (1127), Expect = e-120
 Identities = 246/519 (47%), Positives = 315/519 (60%), Gaps = 28/519 (5%)
 Frame = -1

Query: 1846 MAERRELGFPK-GGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYS 1670
            MA R ELGF K GG  +L++Q AR TL  VR +GH  VE+REDG  FIFFC  C APCYS
Sbjct: 1    MAGRWELGFTKTGGANSLKEQLARTTLNNVRSKGHPYVELREDGKRFIFFCTLCLAPCYS 60

Query: 1669 DSALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSG 1490
            D+ LFDHLKGNLH ER + A LTLL  NPWPF+DGV F     E +KQL    +I  ++ 
Sbjct: 61   DAVLFDHLKGNLHTERLSTATLTLLKENPWPFSDGVHFFDTSSENEKQL----VIKNDNE 116

Query: 1489 KNSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIPGVLCKDVISSLEVRFI 1310
               NGN ++A+ + G +      +      + + NG   +++I GVL KD IS L+ RF+
Sbjct: 117  SRGNGNSSLAIVKYGGSLKPTGDEDTGCNKDANDNGRISDLLIQGVLVKDDISDLQARFM 176

Query: 1309 GFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTLGR 1130
            G+G I AR+ E       I+RIWC WLGK  +     D +K+  H+F +VTF+YNY LGR
Sbjct: 177  GYGRIGARLIEKDGNSNDISRIWCEWLGK--NTPCDLDKAKVLDHEFAVVTFAYNYDLGR 234

Query: 1129 R-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKS----------------- 1004
            +  LDD+          E DN  G  RK RKKSFSDPED+S+S                 
Sbjct: 235  KGLLDDVKLLLSSSPVQESDNQGGTNRK-RKKSFSDPEDVSESFSNQYDSSGEESLTSIG 293

Query: 1003 -----LVLGQYGDESRHA----ISXXXXXXXXXXRVASERICDICKHKILPEKDVSTLLN 851
                 L+L ++ D+  H+                 +A+ER+CDIC+ KILPEKDV+TL+N
Sbjct: 294  GPPTRLLLDRHDDQFLHSKVISSKTLRRELRRQHHIAAERMCDICQQKILPEKDVATLVN 353

Query: 850  MKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKNVSKRSE 671
            M TG+LACSSRN  G +H+FHTSCLIHWILL ++E+  NQ  + KG   S+RKN +K S 
Sbjct: 354  MNTGKLACSSRNTYGQYHVFHTSCLIHWILLSEYEMARNQSVSPKGRRKSRRKNGTKSSH 413

Query: 670  ILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMK 491
            +   +K +    Q SSV CPECQG+G  +E  + E PTIP  E+F Y IK  +   AWMK
Sbjct: 414  V---EKVKALNNQISSVFCPECQGTGAILEKDERELPTIPLSEMFKYKIKVGDGRRAWMK 470

Query: 490  DPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 374
             PE+L+N S G  FP  SE  +Q KVLPLKLLHFYRADE
Sbjct: 471  SPEVLENCSIGFHFPSQSEGAVQAKVLPLKLLHFYRADE 509


>ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310040 [Fragaria vesca
            subsp. vesca]
          Length = 525

 Score =  427 bits (1099), Expect = e-117
 Identities = 249/534 (46%), Positives = 317/534 (59%), Gaps = 43/534 (8%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MA R ++G PK    +LR+Q  R  LR VR +GH  VEVREDG  FIFFC  C APCYSD
Sbjct: 1    MAGRWDVGVPKTNACSLREQATRTILRNVRSQGHSYVEVREDGKKFIFFCTLCLAPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQL------------ 1523
              LFDHLKGNLH ER AAAK+TLL  NPWPFNDGV+F  N  E DK +            
Sbjct: 61   KVLFDHLKGNLHNERLAAAKVTLLRPNPWPFNDGVVFFNNSYETDKGVVTPDDNKCRMLE 120

Query: 1522 ---SGSSLIVVNSGKN--SNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIP 1358
               + ++L +V  G N  +NG D+  ++ +  NE +D     +   + + +G   ++VIP
Sbjct: 121  SHDNENNLAIVKYGGNLKTNGYDHCGVDGLECNEYIDLQGLQSNVGDSTADGAKSSVVIP 180

Query: 1357 GVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTG 1178
            G++ +D I+ LEVR +G GEIAAR          I RIWC WLG     S  ED   +  
Sbjct: 181  GIVVRDEITDLEVREVGLGEIAARFLGK----DGIGRIWCEWLGVKSIDS--EDLCNVPE 234

Query: 1177 HDFGIVTFSYNYTLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL 1001
            HDF +VTFSYN  LGR+  LDD+         +E  NGEG   K RKKSFSDPEDIS SL
Sbjct: 235  HDFAVVTFSYNIDLGRKGLLDDVRMLLSSSPTIESGNGEGTGCK-RKKSFSDPEDISDSL 293

Query: 1000 ---------------------VLGQYGDE---SRHAISXXXXXXXXXXR-VASERICDIC 896
                                 +L  Y D+   +R  ++          + +AS R+CDIC
Sbjct: 294  SNQYESFGEDSSASSGTASRLLLDHYDDQLLNTRFILNKSIRRELRRQQRLASGRMCDIC 353

Query: 895  KHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLK 716
            + ++LP KDV+TL+N+KTG+LACSSRNVNGAFH+FHTSCLIHWILLC+ E+ TNQ    K
Sbjct: 354  QQRMLPGKDVATLMNLKTGKLACSSRNVNGAFHVFHTSCLIHWILLCEVEVITNQNTGSK 413

Query: 715  GTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIF 536
                S+RK  +K +    + + +   PQ  SV CPECQG+GI V+   LE+P +P  ++F
Sbjct: 414  ARRRSRRKTAAKCNG--KDAQLKSLSPQIYSVFCPECQGTGIVVDGDDLEKPNLPLSQMF 471

Query: 535  NYNIKANEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 374
             Y IK ++A  AWMK PE+LQN STG  FP  +   IQEKV  LKLL FYRA E
Sbjct: 472  RYKIKVSDARRAWMKSPEMLQNCSTGFHFPSLNAAGIQEKVKTLKLLRFYRAHE 525


>emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]
          Length = 896

 Score =  426 bits (1096), Expect = e-116
 Identities = 250/501 (49%), Positives = 315/501 (62%), Gaps = 41/501 (8%)
 Frame = -1

Query: 1801 NLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALFDHLKGNLHKER 1622
            +LR+Q AR TLR VR++GH  VE+REDG  FIFFC  C APCYS+S L+DHLKGNLH ER
Sbjct: 352  SLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSESVLYDHLKGNLHSER 411

Query: 1621 YAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLS---GSSLIVVNSGKNSNGNDNVALER 1451
            YAAAK+TLL S+PWPFNDGVLF  N  E DK LS   G+   ++ + KN N   N+A+  
Sbjct: 412  YAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLGTHKNDN---NLAIVC 468

Query: 1450 VGD------NENLDSHK-----CATVTIEKSLN--GENCNMVIPGVLCKDVISSLEVRFI 1310
             GD      N +++ H      C      +SLN  G NC+M+IPGV+ KD ++ LEVRF+
Sbjct: 469  HGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVMIKDEVTELEVRFL 528

Query: 1309 GFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTLGR 1130
            GFG+IAAR  E   + K I++IWC W GK  +P   E    +  HDF +VTF+Y+Y LGR
Sbjct: 529  GFGQIAARFFEKDGVSKGISKIWCEWFGKE-EPGDGETVM-VPDHDFAVVTFNYHYNLGR 586

Query: 1129 RTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL----------------- 1001
            + L D          L     EG  RK RKKSFSDPEDIS+SL                 
Sbjct: 587  KGLFD-----DVISMLSSSPTEGSGRK-RKKSFSDPEDISESLSNQYDSSGEDSLISNSP 640

Query: 1000 ----VLGQYGDE---SRHAISXXXXXXXXXXR-VASERICDICKHKILPEKDVSTLLNMK 845
                +L +Y D+   +R   S          + VA+ER+CDIC+HK+LP KDV+TL NMK
Sbjct: 641  SPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLPGKDVATLXNMK 700

Query: 844  TGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKNVSKRSEIL 665
            TG+L CSSRNV GAFH+FHTSCLIHWILLC+FEI+TNQL   K    S+RK+ SK +   
Sbjct: 701  TGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSRRKSGSKCNGKG 760

Query: 664  MNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMKDP 485
             +   + T  Q  SV CPECQG+GI +ED +LE P IP  E+F Y IK ++A  AWMK+P
Sbjct: 761  KDGVIKPTTLQICSVFCPECQGTGIMIED-ELEIPNIPLSEMFKYKIKVSDAHRAWMKNP 819

Query: 484  ELLQNRSTGLRFPYNSEETIQ 422
            E L++ STG  FP  S ET+Q
Sbjct: 820  EELKHCSTGFNFPSQSGETVQ 840


>ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa]
            gi|550325787|gb|EEE95821.2| hypothetical protein
            POPTR_0013s13670g [Populus trichocarpa]
          Length = 513

 Score =  423 bits (1087), Expect = e-115
 Identities = 232/525 (44%), Positives = 317/525 (60%), Gaps = 34/525 (6%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MA  RE+GFPK    +LR+Q AR TL +VR  GH  +E+REDG  FIFFC  C +PCYSD
Sbjct: 1    MAGNREVGFPKTTASSLREQLARTTLSRVRARGHPYLELREDGKRFIFFCTLCLSPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487
            + L DHL+GNLH ER +AAK TLL  NPWPF+DG+ F       ++QL+      +  GK
Sbjct: 61   TILLDHLRGNLHTERLSAAKATLLKPNPWPFSDGIHFFDASSGNEEQLA------IKDGK 114

Query: 1486 NSN-------GNDNVALERVGDNENLDSHKCATVTIEKSLNG--ENCNMVIPGVLCKDVI 1334
             S+        +DN+A+ +  +N       C TV ++++L+G  E  ++VIP V  K+ +
Sbjct: 115  ESSRFLKFEENSDNLAIVKYVENLKPG---CDTV-VDENLSGSDEGSDLVIPSVRLKEEV 170

Query: 1333 SSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTF 1154
            S L+   +G G+IAAR++E  +   +I+RIWC WLGK    S  ED  K+  HDFG+VTF
Sbjct: 171  SDLKATLVGSGQIAARMYEKKDGSNEISRIWCEWLGKKS--SNDEDKVKVLDHDFGVVTF 228

Query: 1153 SYNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL--------- 1001
            +Y+Y LG+  L D            +   + +   +RK+S S+PED+S+SL         
Sbjct: 229  AYDYELGKSGLFDDVKLLLSSSAPALTENDERGNWKRKRSVSEPEDVSRSLTNQYGLCEE 288

Query: 1000 ------------VLGQYGDESRH----AISXXXXXXXXXXRVASERICDICKHKILPEKD 869
                        VL +Y D+  H    +            R+A+E++CDIC+ K+LPEKD
Sbjct: 289  ESSKTTCASSNLVLDRYDDQLMHTRFISNKTVRREVRKQQRIAAEKMCDICQQKMLPEKD 348

Query: 868  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKN 689
            V+TL N KTG+LACSSRNV GAFH+FHTSCLIHWIL C+FEI  NQ  + KG   S++KN
Sbjct: 349  VATLWNRKTGKLACSSRNVYGAFHVFHTSCLIHWILYCEFEIVRNQTVSTKGGRRSRKKN 408

Query: 688  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 509
             +K +    +    +      SV CP+CQG+G+N+E  + E+P  P  E+F Y IK +E 
Sbjct: 409  GTKSNTTGKDGTVNVLPNPIVSVFCPDCQGTGVNIEGDEFEKPLTPLSEMFKYKIKVSEG 468

Query: 508  CLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 374
               WMK+PE+L+N STG  FP  S E +QEKVLPLKLLHFYR +E
Sbjct: 469  HRGWMKNPEILENCSTGFHFPSQSGEPVQEKVLPLKLLHFYRPEE 513


>gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]
          Length = 638

 Score =  412 bits (1060), Expect = e-112
 Identities = 245/539 (45%), Positives = 318/539 (58%), Gaps = 54/539 (10%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVY--------NLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIW 1691
            MA R  LGFPK            +L+ Q  R  LR VR +GH  VE+REDG   IFFC  
Sbjct: 1    MAGRGILGFPKSNELAVSKTTSCSLKDQAKRTILRNVRSQGHTYVELREDGKKSIFFCTL 60

Query: 1690 CRAPCYSDSALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEED------- 1532
            C APCYSD  LFDHLKGNLH +R + AK+TLLG NPWPFNDGV+F  N  E D       
Sbjct: 61   CLAPCYSDCVLFDHLKGNLHNQRLSTAKVTLLGPNPWPFNDGVVFFNNPTENDDDTVISN 120

Query: 1531 --------KQLSGSSLIVVNSGKN--SNGNDNVALERVG-DNENLDSHKCATVTIEKSLN 1385
                     Q S ++L +V  G+N  S  N ++ ++ +G  NEN DS          + +
Sbjct: 121  GNQSRLLESQDSENNLAIVTYGENLESCANGHIMVDELGHQNENPDSAG------NLAGS 174

Query: 1384 GENCNMVIPGVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSF 1205
            GENC ++IPGV   D I+++EVR +G+G I+ R  E   +   I+RIWC WLGK      
Sbjct: 175  GENCAVLIPGVRAGDEIANVEVREVGYGLISVRFREKDGVSNDISRIWCEWLGKKTIED- 233

Query: 1204 HEDASKLTGHDFGIVTFSYN-YTLGRRTL-DDLNPXXXXXXXLEIDNGEGKRRKQRKKSF 1031
             ED  K+  HDF IVTFSYN ++LGR  L DD+          E+ NG+   RK R+KSF
Sbjct: 234  -EDFLKVPEHDFAIVTFSYNNFSLGRMGLHDDVKALLCSSPAAEMQNGDVSSRK-RRKSF 291

Query: 1030 SDPEDISK-------------------SLVLGQYGDE-------SRHAISXXXXXXXXXX 929
            SDPED S+                   SL+L QY D+       S  AI           
Sbjct: 292  SDPEDSSENLSNQYDSCGEDSSASAVTSLMLDQYDDQLLQTRFISNKAIRRELRRQQR-- 349

Query: 928  RVASERICDICKHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDF 749
             +A+ER+CDIC+HK+LP KDV+TL+N+KTGRLACSSRN NGAFHLFHTSCLIHW+LLC+ 
Sbjct: 350  -IAAERMCDICQHKMLPGKDVATLMNVKTGRLACSSRNTNGAFHLFHTSCLIHWVLLCEV 408

Query: 748  EIWTNQLDNLKGTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQL 569
            E  TNQ +  K    S+RK  SK +E+L + + +  +   + VICPECQG+G  + DG+ 
Sbjct: 409  EKCTNQSEAPKVKRRSRRKAASKCNEVLNDSEVKAFRTPINRVICPECQGTGTMI-DGED 467

Query: 568  EEPTIPPFEIFNYNIKANEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLH 392
            E+PT+P  ++F Y IK ++A  AWMK PE+L N STG  FP  +EETIQ  ++ +  +H
Sbjct: 468  EKPTVPLSKMFKYKIKVSDARRAWMKSPEVLGNCSTGFHFPSPAEETIQVHLVYIAEIH 526


>ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600129 [Solanum tuberosum]
          Length = 521

 Score =  402 bits (1032), Expect = e-109
 Identities = 238/527 (45%), Positives = 310/527 (58%), Gaps = 41/527 (7%)
 Frame = -1

Query: 1834 RELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALF 1655
            R+L FP+    NL++Q  R TL+ VR +GHI VE+REDG   +FFC  C +PCYSDS LF
Sbjct: 4    RQLDFPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLVFFCTLCHSPCYSDSVLF 63

Query: 1654 DHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGKNS-- 1481
            +HLKGNLH E  AAAK TLL  NPWPFNDGVLF +N  E+DK         VN GK+   
Sbjct: 64   NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLF-FNDPEQDKHSPN-----VNVGKSRLV 117

Query: 1480 ----NGNDNVALERVGDN--ENLDSH----KCATVTIEKSLNGENCNMVIPGVLCKDVIS 1331
                    ++A+    DN   N D++    +   +  E + NGE+  +VIPGVLCKD +S
Sbjct: 118  DTCLEDESSLAIVECDDNLRHNGDTYVTEYEYCLLDSELTGNGESEYLVIPGVLCKDELS 177

Query: 1330 SLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFS 1151
             LEV+ IG G+IAARI       KKI RIWC WL K        D S +  HDF +VTF 
Sbjct: 178  DLEVKHIGIGKIAARISVRGIDSKKIRRIWCEWLVKKDSDDM--DTSVVPDHDFAVVTFP 235

Query: 1150 YNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL---------- 1001
            YNY LGR+ L D           E +   G R+++RK SFSDPED S+SL          
Sbjct: 236  YNYNLGRKPLLDDRFLLPSSPYSESEETSGTRKRKRK-SFSDPEDFSESLSNHCDSSGEE 294

Query: 1000 -----------VLGQYGDE--SRHAISXXXXXXXXXXR--VASERICDICKHKILPEKDV 866
                       +LG   D+  S   IS          +  VASER+CDIC+ K+LP KDV
Sbjct: 295  SQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDV 354

Query: 865  STLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLD----NLKGTHGSK 698
            +TLL+ K+G+L CSSRN+ GAFHLFH SCLIHWIL C+ + +   +D      K    SK
Sbjct: 355  ATLLSWKSGKLMCSSRNMTGAFHLFHVSCLIHWILQCELQTYVKPVDEPKMETKAKRRSK 414

Query: 697  RKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKA 518
            RK  +K +     D+ +  + + +SV CPECQG+GI +E  +LE+P +   E++ + IK 
Sbjct: 415  RKTGTKHNAKEKEDEIKSAR-RINSVFCPECQGTGIIIEGDELEKPPVSLSEVYRHKIKL 473

Query: 517  NEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 377
            ++A  AWMK+PE+LQN STG   P   ++ +QE V PLKLLHFYRA+
Sbjct: 474  SDARKAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRAN 520


>ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508707513|gb|EOX99409.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 478

 Score =  389 bits (998), Expect = e-105
 Identities = 220/481 (45%), Positives = 289/481 (60%), Gaps = 34/481 (7%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1486 NSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENC-----NMVIPGVLCKDVISSLE 1322
              +GN N  LE   +++NL   +     +       NC     +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1321 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNY 1142
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 1141 TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1001
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 1000 ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 869
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 868  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKN 689
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 688  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 509
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +   ++   ++K    
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRC 463

Query: 508  C 506
            C
Sbjct: 464  C 464


>ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508707511|gb|EOX99407.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 481

 Score =  389 bits (998), Expect = e-105
 Identities = 220/481 (45%), Positives = 289/481 (60%), Gaps = 34/481 (7%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1486 NSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENC-----NMVIPGVLCKDVISSLE 1322
              +GN N  LE   +++NL   +     +       NC     +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1321 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNY 1142
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 1141 TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1001
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 1000 ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 869
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 868  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKN 689
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 688  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 509
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +   ++   ++K    
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRC 463

Query: 508  C 506
            C
Sbjct: 464  C 464


>ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arabidopsis lyrata subsp.
            lyrata] gi|297315349|gb|EFH45772.1| hypothetical protein
            ARALYDRAFT_491947 [Arabidopsis lyrata subsp. lyrata]
          Length = 517

 Score =  387 bits (995), Expect = e-105
 Identities = 226/525 (43%), Positives = 311/525 (59%), Gaps = 35/525 (6%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MAE++ELG PK  + NL++Q AR TL+ +RL+GH  +E+REDG  F+FFC  C APCYSD
Sbjct: 1    MAEKKELGLPKSSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQ---LSGSSLIVVN 1496
            + L  HL GNLHKER A A+LTLLG+NPWPF+DGVLF  +   E+++   +SG + +   
Sbjct: 60   TILLGHLNGNLHKERLACARLTLLGTNPWPFSDGVLFFDSSTGEEEEKTPVSGGASVPGT 119

Query: 1495 SGKNSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIPGVLCKDVISSLEVR 1316
             G  S+ +D  A+ +  +N+    ++ A VT ++  +  + +++I GVL K+    +E +
Sbjct: 120  LGHCSD-DDRFAIVKYDNNKANGGNQPAAVTDDEPSHSTD-DLLISGVLIKERTLDVEAK 177

Query: 1315 FIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTL 1136
            FIGFG IAAR+ E+      I+++WC WLG  G PS  E A+ +  HDF IVTFSY Y L
Sbjct: 178  FIGFGRIAARLFETKGRTTWIDKLWCEWLGDEG-PSDEEKAT-IPEHDFAIVTFSYFYNL 235

Query: 1135 GRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLV----------LGQY 986
            GR  L D           E  NGE   RK RKKSFSDPED S+SL            G  
Sbjct: 236  GRLGLLDDPSRLLTTSQSESGNGEDSGRK-RKKSFSDPEDTSESLCNQYDSSEEVSSGHN 294

Query: 985  GDESRHAISXXXXXXXXXXRVA---------------SERICDICKHKILPEKDVSTLLN 851
             + SR  I+           V                SERIC++CK K+LP KD + +LN
Sbjct: 295  SNSSRALIADYDDSLMSKRVVKNKTVRRELRRQQRIFSERICEVCKQKMLPGKDAAAILN 354

Query: 850  MKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKR--KNVSKR 677
            MKTG LAC SRN+ GAFHLFH SC++HW L C+ EI  N++ + K   G KR  K+ S +
Sbjct: 355  MKTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVSGK---GKKRCTKHSSGQ 411

Query: 676  SEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAW 497
            + +  N+       Q  SV CPECQG+GIN+E G +E  T P  + + + +K +E   AW
Sbjct: 412  TGVKWNELANDVSWQIFSVFCPECQGTGINIEGGVIERDTFPLSQTWRFQVKVSEGRKAW 471

Query: 496  MKDPELLQNRSTGLRFPYNSEETIQ-----EKVLPLKLLHFYRAD 377
            +K+PE L+N STG  FP  ++E+ Q     E+V  +KL+ FYR +
Sbjct: 472  VKNPEKLKNCSTGFHFPQQADESGQIPVQEERVQMMKLVRFYRVE 516


>ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261554 [Solanum
            lycopersicum]
          Length = 526

 Score =  387 bits (993), Expect = e-104
 Identities = 235/527 (44%), Positives = 304/527 (57%), Gaps = 41/527 (7%)
 Frame = -1

Query: 1834 RELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALF 1655
            ++L  P+    NL++Q  R TL+ VR +GHI VE+REDG   IFFC  C +PCYSDS LF
Sbjct: 4    KQLDVPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLIFFCTLCHSPCYSDSVLF 63

Query: 1654 DHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGKNS-- 1481
            +HLKGNLH E  AAAK TLL  NPWPFNDGVLF +N  E+DKQ   S    VN GK+   
Sbjct: 64   NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLF-FNDPEQDKQDKQSPN--VNVGKSRLV 120

Query: 1480 ----NGNDNVALERVGDN--ENLDSH----KCATVTIEKSLNGENCNMVIPGVLCKDVIS 1331
                    +VA+    DN   N D++    +   +  E   N E+  +VIPGVLCKD +S
Sbjct: 121  DTCLEDESSVAIVEYDDNLRHNEDTYVSEYEYGLLDSELIGNEESDYLVIPGVLCKDELS 180

Query: 1330 SLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFS 1151
             LEV+ IG G+IAARI       K I RIWC WL K        D S +  HDF +VTF 
Sbjct: 181  DLEVKHIGIGKIAARISVRGIDSKSIRRIWCEWLAKKDSDDM--DTSVVPDHDFAVVTFP 238

Query: 1150 YNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL---------- 1001
            YNY LGR  L D           E +       K+++KSFSDPED S+SL          
Sbjct: 239  YNYNLGRSPLLDDRFLLPSSPYSESEE-TSVTGKRKRKSFSDPEDFSESLSNHCDSSGEE 297

Query: 1000 -----------VLGQYGDE--SRHAISXXXXXXXXXXR--VASERICDICKHKILPEKDV 866
                       +LG   D+  S   IS          +  VASER+CDIC+ K+LP KDV
Sbjct: 298  SQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDV 357

Query: 865  STLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLD----NLKGTHGSK 698
            +TLL+ K+G+L CSSRN++GAFHLFH SCLIHWIL C+ +     +D      K    SK
Sbjct: 358  ATLLSWKSGKLMCSSRNMSGAFHLFHVSCLIHWILQCELQTSVKPVDEPKMEPKAKRRSK 417

Query: 697  RKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKA 518
            +K  +K +     D+ +  + + +SV CPECQG+GI +E  +LE+P +   E++   IK 
Sbjct: 418  KKTGTKHNAKEKEDETKSAR-RINSVFCPECQGTGICIEGDELEKPPVSLSEVYRLKIKL 476

Query: 517  NEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 377
            ++A  AWMK+PE+LQN STG   P   ++ +QE V PLKLLHFYRA+
Sbjct: 477  SDARKAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRAN 523


>ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508707512|gb|EOX99408.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 470

 Score =  386 bits (992), Expect = e-104
 Identities = 218/465 (46%), Positives = 283/465 (60%), Gaps = 34/465 (7%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1486 NSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENC-----NMVIPGVLCKDVISSLE 1322
              +GN N  LE   +++NL   +     +       NC     +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1321 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNY 1142
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 1141 TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 1001
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 1000 ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 869
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 868  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKN 689
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 688  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTI 554
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDV 448


>ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prunus persica]
            gi|462394196|gb|EMJ00100.1| hypothetical protein
            PRUPE_ppa004741mg [Prunus persica]
          Length = 493

 Score =  385 bits (989), Expect = e-104
 Identities = 235/539 (43%), Positives = 296/539 (54%), Gaps = 49/539 (9%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MA R ELGFPK    +LR+Q  R  LR VR +GH  VE+REDG  FIFFC  C APCYSD
Sbjct: 1    MAGRWELGFPKTSASSLREQATRTILRNVRSQGHTYVELREDGKKFIFFCTLCLAPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQL------------ 1523
              LFDHLKGNLHK+R AAAK+TLL  NPWPFNDGV F +N  E DK L            
Sbjct: 61   KVLFDHLKGNLHKDRLAAAKVTLLRPNPWPFNDGVAFFHNPDETDKHLVITDGNKFRMLE 120

Query: 1522 ---SGSSLIVVNSGKN--SNGNDNVALERVGDNENLD------SHKCATVTIEKSLNGEN 1376
                 ++L +V  G+N  SNGN++V  + +  N +LD      + K +      + N  N
Sbjct: 121  SPDDENNLAIVKYGENLISNGNEHVGTDGLECNGSLDFPRVRSNFKFSCSNENSTANEVN 180

Query: 1375 CNMVIPGVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHED 1196
             ++VIP VL +D ++ +E + +G G+IAAR  E  ++ K I RIWC WLGK      +E 
Sbjct: 181  SSVVIPSVLVRDDVTDIEAKKVGLGQIAARFLEKDKVSKGIGRIWCEWLGK--KAIGNEY 238

Query: 1195 ASKLTGHDFGIVTFSYNYTLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPE 1019
              K+  HDF +VTFSYN  LGRR  LDD+         +E +NGEG   K RKKSFSDPE
Sbjct: 239  HLKVPEHDFAVVTFSYNIDLGRRGLLDDVKMLLSSSPSVETENGEGSGSK-RKKSFSDPE 297

Query: 1018 DISKS---------------------LVLGQYGDESRHA----ISXXXXXXXXXXRVASE 914
            DIS+S                     L+L +Y D+  H                 R+A  
Sbjct: 298  DISESLSNQYDSCGEDSSASSGASSKLLLDRYDDQLLHTRFILNKSIRRELRRQQRLALG 357

Query: 913  RICDICKHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTN 734
            R+CDIC+ +++P KDVS L+N+KTGRLACSSRNVNGAFH+FHTSCLIHWILLC+ EI  N
Sbjct: 358  RMCDICQQRMIPGKDVSALINLKTGRLACSSRNVNGAFHVFHTSCLIHWILLCEVEI-AN 416

Query: 733  QLDNLKGTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTI 554
            Q  N K    S+RKN +K +    + +      Q  SV CPECQG+G  ++   LE+P +
Sbjct: 417  QSTNSKVRRRSRRKNAAKCNG--QDGQMTALSTQIHSVFCPECQGTGAIIDGDDLEKPNL 474

Query: 553  PPFEIFNYNIKANEACLAWMKDPELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 377
            P                                          QEKV PLKL+HFYRAD
Sbjct: 475  P----------------------------------------LSQEKVKPLKLMHFYRAD 493


>ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana]
            gi|145334149|ref|NP_001078455.1| uncharacterized protein
            [Arabidopsis thaliana] gi|7269680|emb|CAB79628.1|
            putative protein [Arabidopsis thaliana]
            gi|110742700|dbj|BAE99261.1| hypothetical protein
            [Arabidopsis thaliana] gi|332660060|gb|AEE85460.1|
            uncharacterized protein AT4G28260 [Arabidopsis thaliana]
            gi|332660061|gb|AEE85461.1| uncharacterized protein
            AT4G28260 [Arabidopsis thaliana]
          Length = 516

 Score =  382 bits (981), Expect = e-103
 Identities = 221/522 (42%), Positives = 304/522 (58%), Gaps = 32/522 (6%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MAE++ELG PK  + NL++Q AR TL+ +RL+GH  +E+REDG  F+FFC  C APCYSD
Sbjct: 1    MAEKKELGLPKPSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCC--EEDKQLSGSSLIVVNS 1493
            + L  HL GNLHKER A A++TLLG+NPWPF+DGVLF  +    EE+K        V ++
Sbjct: 60   TILLGHLNGNLHKERLACARITLLGTNPWPFSDGVLFFDSSTGEEEEKSPVSGGEGVPDT 119

Query: 1492 GKNSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIPGVLCKDVISSLEVRF 1313
             ++ + ++  A+ +  +N+    +  A VT ++  +  + +++I GVL K+    +E +F
Sbjct: 120  LEHCSDDERFAIVKYDNNKTNGDNVPAAVTDDEPSHAAD-DLLISGVLIKERTLDVEAKF 178

Query: 1312 IGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTLG 1133
            IGFG IAAR+ E+      I+++WC WLG  G PS  E A+ +  HDF IVTFSY Y LG
Sbjct: 179  IGFGRIAARLFETKGRTTWIDKLWCEWLGDEG-PSDEEKAT-IPEHDFAIVTFSYFYNLG 236

Query: 1132 RRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLV----------LGQYG 983
            R  L D           E  NGE   RK RKKSFSDPED S+SL            G   
Sbjct: 237  RLGLLDDPGRLLTSSQSESGNGEDSGRK-RKKSFSDPEDTSESLCNQYDSSEEVSSGHNS 295

Query: 982  DESRHAISXXXXXXXXXXRVA---------------SERICDICKHKILPEKDVSTLLNM 848
            + SR  I+           V                SERIC++CK K+LP KD + +LNM
Sbjct: 296  NSSRDLIADYDDSLMSKRVVKNRTVRRELRRQQRIFSERICEVCKQKMLPGKDAAAILNM 355

Query: 847  KTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKNVSKRSEI 668
            KTG LAC SRN+ GAFHLFH SC++HW L C+ EI  N++ + KG     +   S ++ +
Sbjct: 356  KTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVSGKGKKRCTKH--SGQTGV 413

Query: 667  LMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMKD 488
              N+       Q  SV CPECQG+GIN+E   +E  T P  + + + +K +E   AW+K+
Sbjct: 414  KWNELANDVSWQIFSVFCPECQGTGINIEGAVIERDTFPLSQTWRFQVKVSEGRKAWVKN 473

Query: 487  PELLQNRSTGLRFPYNSEETIQ-----EKVLPLKLLHFYRAD 377
            PE L+N STG  FP  +EET Q     E+V  +KL+ FYR +
Sbjct: 474  PERLKNCSTGFHFPQQAEETEQIPVQEERVQMMKLVRFYRVE 515


>ref|XP_007149858.1| hypothetical protein PHAVU_005G104500g [Phaseolus vulgaris]
            gi|561023122|gb|ESW21852.1| hypothetical protein
            PHAVU_005G104500g [Phaseolus vulgaris]
          Length = 498

 Score =  379 bits (972), Expect = e-102
 Identities = 215/517 (41%), Positives = 293/517 (56%), Gaps = 27/517 (5%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MA + ELG  K  V N ++Q AR  L+ VR +GH  VE+RE+G  FI+FC  C APCYSD
Sbjct: 1    MAGKLELGPLKSDVSNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSD 60

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCCEEDKQLSGSSLIVVNSGK 1487
              LFDHLKGNLHKER +AAK+TLLG  PWPFNDG++F     E D+ L  +        K
Sbjct: 61   DVLFDHLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSIESDRDLEVADSYRNRLLK 120

Query: 1486 NSNGNDNVALERVGDNENLDSHKCATVTIEKSLNGENCNMVIPGVLCKDVISSLEVRFIG 1307
             +N ++++A+ +  +    ++  C+T      +  + C +VIP +L +D I  ++V  +G
Sbjct: 121  FNNNDNSLAIVKFDEGVQSNAEPCST----DGMPNDECGLVIPHLLIRDEIFDVKVSEVG 176

Query: 1306 FGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTLGRR 1127
             G+IAAR  E       I RIWC WLGK G+    +D  ++  HDF IV F+YNY LGR 
Sbjct: 177  LGKIAARFLEKCSALSGIKRIWCEWLGKKGNDQ--QDGVEILEHDFAIVNFAYNYDLGRS 234

Query: 1126 -TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLVLGQYGDESRHAISXXX 950
              LDD+           + +  G R+   K+S SD +DIS SL   QY   +  +     
Sbjct: 235  GLLDDVKSL--------LPSASGGRKG--KRSLSDSDDISDSLC-NQYDSSAEESSDSNN 283

Query: 949  XXXXXXXR--------------------------VASERICDICKHKILPEKDVSTLLNM 848
                                              +A+E++C+IC+ K+LP KDV+ LLN+
Sbjct: 284  SSAPLTLDQFNNHHVCTRFISSKAVRKELRRKQRLAAEKVCNICQQKMLPGKDVAALLNL 343

Query: 847  KTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKNVSKRSEI 668
             T R+ACSSRN  GAFH+FHTSCLIHWI+LC+FEI TN L         KRK  S   +I
Sbjct: 344  NTRRVACSSRNKTGAFHVFHTSCLIHWIILCEFEIITNHLVRPNVRRIVKRKIASDGEKI 403

Query: 667  LMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMKD 488
                K++  +    +V CPECQG+G+ ++   +E+P     ++F + IKA +A   WMK 
Sbjct: 404  ---GKEKDIEKHIRTVFCPECQGTGMVIDGDGVEQPEFSLSQMFKFKIKACDARREWMKS 460

Query: 487  PELLQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 377
            PE+LQN STG  FP  SEE  +EKV P+ LLHFYRAD
Sbjct: 461  PEILQNCSTGFHFPSQSEEIFEEKVEPINLLHFYRAD 497


>ref|XP_006412978.1| hypothetical protein EUTSA_v10024944mg [Eutrema salsugineum]
            gi|557114148|gb|ESQ54431.1| hypothetical protein
            EUTSA_v10024944mg [Eutrema salsugineum]
          Length = 514

 Score =  374 bits (961), Expect = e-101
 Identities = 217/525 (41%), Positives = 296/525 (56%), Gaps = 35/525 (6%)
 Frame = -1

Query: 1846 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1667
            MAE +ELG PK  + +L++Q AR TLR +R +GH  +E+REDG  F+FFC  C APCYSD
Sbjct: 1    MAESKELGLPKTAI-SLKEQLARTTLRNLRSQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 1666 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCC-EEDKQLSGSSLIVVNSG 1490
            + L  HL GNLHKER + A++TLLG NPWPFNDGVLF  +   EE+K L      V    
Sbjct: 60   AILLGHLNGNLHKERLSCARITLLGENPWPFNDGVLFFDSSTGEEEKTLISDGEGVTGPL 119

Query: 1489 KNSNGNDNVALERVGDNENLDSH--KCATVTIEKSLNGENCNMVIPGVLCKDVISSLEVR 1316
             + + N+  A+    +N   +S         I+   N    N+VI  +L K+    +E +
Sbjct: 120  HHCSDNERFAIVTYDENRTCESQGDNQPAAGIDDEPNHCAENLVISNLLIKEKTLDVEAK 179

Query: 1315 FIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDPSFHEDASKLTGHDFGIVTFSYNYTL 1136
            FIGFG IAAR+ E+      I+++WC WLG+   P   E+ + +  HDF IVTFSY Y L
Sbjct: 180  FIGFGRIAARLFETKGRTTWIDKLWCEWLGEESPPD--EEKATVPEHDFAIVTFSYFYNL 237

Query: 1135 GRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLVLGQYGDESRHAIS 959
            GR   L D +         E  NGE   RK RKKSFSDPED S+SL   QY  +S   +S
Sbjct: 238  GRLGLLADPSRLLTLSQSAESGNGEDNGRK-RKKSFSDPEDTSESLC-NQY--DSSEEVS 293

Query: 958  XXXXXXXXXXRVA----------------------------SERICDICKHKILPEKDVS 863
                       +A                            S+RIC++CK K+LP KD +
Sbjct: 294  SARNSNSSRALIADYDDHLVNKRVIKNKSVRRELRKQQRIFSDRICEVCKQKMLPGKDAA 353

Query: 862  TLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNLKGTHGSKRKNVS 683
             +LNMKTG+LACSSRN  GAFHLFH SC++HW L C+ EI  +++ + KG     +K  +
Sbjct: 354  AILNMKTGKLACSSRNRLGAFHLFHVSCVVHWFLFCETEILGSKMVSGKG-----KKRCT 408

Query: 682  KRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACL 503
            K+S +  N+       Q  SV CPECQG+GIN+E   +E  T P  + + + +K +E   
Sbjct: 409  KQSGVKWNELVGDVSWQIFSVFCPECQGTGINIEGDVIERDTFPLSQTWRFGVKVSEGRK 468

Query: 502  AWMKDPELLQNRSTGLRFPYNSEETI---QEKVLPLKLLHFYRAD 377
            AW+K+PE L+N STG  FP   EE +   +++V  +KL+ FYR +
Sbjct: 469  AWVKNPEKLENCSTGFHFPQQDEELVKGQEDRVQSMKLVRFYRVE 513


Top