BLASTX nr result

ID: Akebia25_contig00014327 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00014327
         (1874 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citr...   485   e-134
ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255...   464   e-128
ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204...   457   e-126
ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608...   451   e-125
ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma...   451   e-124
ref|XP_002517932.1| conserved hypothetical protein [Ricinus comm...   443   e-121
ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310...   435   e-119
emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]   427   e-116
ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Popu...   426   e-116
gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]     416   e-113
ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600...   403   e-109
ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arab...   392   e-106
ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prun...   390   e-105
ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261...   388   e-105
ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma...   387   e-105
ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [...   387   e-105
ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana] ...   387   e-105
ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma...   385   e-104
ref|XP_007149858.1| hypothetical protein PHAVU_005G104500g [Phas...   381   e-103
ref|XP_006412978.1| hypothetical protein EUTSA_v10024944mg [Eutr...   374   e-101

>ref|XP_006447354.1| hypothetical protein CICLE_v10014904mg [Citrus clementina]
            gi|567910083|ref|XP_006447355.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910085|ref|XP_006447356.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|567910087|ref|XP_006447357.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|568831767|ref|XP_006470130.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X1 [Citrus
            sinensis] gi|568831769|ref|XP_006470131.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X2 [Citrus
            sinensis] gi|568831771|ref|XP_006470132.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X3 [Citrus
            sinensis] gi|568831773|ref|XP_006470133.1| PREDICTED:
            uncharacterized protein LOC102608093 isoform X4 [Citrus
            sinensis] gi|557549965|gb|ESR60594.1| hypothetical
            protein CICLE_v10014904mg [Citrus clementina]
            gi|557549966|gb|ESR60595.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549967|gb|ESR60596.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
            gi|557549968|gb|ESR60597.1| hypothetical protein
            CICLE_v10014904mg [Citrus clementina]
          Length = 523

 Score =  485 bits (1249), Expect = e-134
 Identities = 266/526 (50%), Positives = 334/526 (63%), Gaps = 36/526 (6%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MA RRELGFPK   ++LR+Q AR TL  VR +GH  VE+REDG  FIFFC  C APCYSD
Sbjct: 1    MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1287
              LFDHLKGNLH ER +AAK+TLLG NPWPFNDGVLF  N  E++KQ + S+  +  S  
Sbjct: 61   LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120

Query: 1286 NSNGNDNVALERVGDNENLDSHK----------CATVTNDKSLNGENCNMVIPGVLCKDV 1137
              N + N+A+ + G++  ++ ++          C   T  + +  E+C+ VIPGV  KD 
Sbjct: 121  YHNNDSNLAIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDE 180

Query: 1136 ISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVT 957
            I  L VRFIG G+IAAR+ +  E   +I+RIWC WLGK       ED  ++  HDF IVT
Sbjct: 181  IVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPED--EDIVEIPDHDFAIVT 238

Query: 956  FSYNYTLGRRTL-DDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKS-------- 804
            F YNY LGR+ L DD+          + +NGEG  RK RKKSFSDPED+S+S        
Sbjct: 239  FVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRK-RKKSFSDPEDVSESLSKQYDSC 297

Query: 803  -------------LVLGQYGDESRHA----ISXXXXXXXXXXRVASERICDICKHKILPE 675
                         L+L +YGD+  HA                R+A+ER+CDIC+ KILP+
Sbjct: 298  GEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPD 357

Query: 674  KDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKR 495
            KDV+ LLN+KTG LACSSRN+NG FH+FH SCLIHWILLC+FE+ TNQ    K    S+R
Sbjct: 358  KDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKRRSRR 417

Query: 494  KNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKAN 315
            KN SKR +    D + I   Q SS+ CPECQG+G+N+E  +LE+PTI   ++F Y IK +
Sbjct: 418  KNGSKRVQ-ARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKIKVS 476

Query: 314  EACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 177
            +A  AWMK+PE LQN STG  FP  SEE  QEKV PLKLLHFY A+
Sbjct: 477  DARKAWMKNPEALQNCSTGFYFPSRSEEKFQEKVSPLKLLHFYSAE 522


>ref|XP_002273873.2| PREDICTED: uncharacterized protein LOC100255678 [Vitis vinifera]
          Length = 520

 Score =  464 bits (1195), Expect = e-128
 Identities = 271/534 (50%), Positives = 338/534 (63%), Gaps = 43/534 (8%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MA R ELGF K    +LR+Q AR TLR VR++GH  VE+REDG  FIFFC  C APCYS+
Sbjct: 1    MARRTELGFLKTSASSLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSE 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLS---GSSLIVVN 1296
            S L+DHLKGNLH ERYAAAK+TLL S+PWPFNDGVLF  N  E DK LS   G+   ++ 
Sbjct: 61   SVLYDHLKGNLHSERYAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLG 120

Query: 1295 SGKNSNGNDNVALERVGD------NENLDSHK-----CATVTNDKSLN--GENCNMVIPG 1155
            + KN N   N+A+   GD      N +++ H      C     ++SLN  G NC+M+IPG
Sbjct: 121  THKNDN---NLAIVCHGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPG 177

Query: 1154 VLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKY--GDSSFHEDASKLT 981
            V+ KD ++ LEVRF+GFG+IAAR  E   + K I++IWC W GK   GD     +   + 
Sbjct: 178  VMIKDEVTELEVRFLGFGQIAARFFEKDGVSKGISKIWCEWFGKEEPGDG----ETVMVP 233

Query: 980  GHDFGIVTFSYNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL 801
             HDF +VTF+Y+Y LGR+ L D          L     EG  RK RKKSFSDPEDIS+SL
Sbjct: 234  DHDFAVVTFNYHYNLGRKGLFD-----DVISMLSSSPTEGSGRK-RKKSFSDPEDISESL 287

Query: 800  ---------------------VLGQYGDE---SRHAISXXXXXXXXXXR-VASERICDIC 696
                                 +L +Y D+   +R   S          + VA+ER+CDIC
Sbjct: 288  SNQYDSSGEDSLISNSPSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDIC 347

Query: 695  KHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSK 516
            +HK+LP KDV+TL+NMKTG+L CSSRNV GAFH+FHTSCLIHWILLC+FEI+TNQL   K
Sbjct: 348  QHKMLPGKDVATLMNMKTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPK 407

Query: 515  GTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIF 336
                S+RK+ SK +    +   + T  Q  SV CPECQG+GI +ED +LE P IP  E+F
Sbjct: 408  LRRSSRRKSGSKCNGKGKDGVIKPTTLQICSVFCPECQGTGIMIED-ELEIPNIPLSEMF 466

Query: 335  NYNIKANEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 174
             Y IK ++A  AWMK+PE L++ STG  FP  S ET+QEKV  LKLLHFY ADE
Sbjct: 467  KYKIKVSDAHRAWMKNPEELKHCSTGFNFPSQSGETVQEKVSSLKLLHFYSADE 520


>ref|XP_004139943.1| PREDICTED: uncharacterized protein LOC101204451 [Cucumis sativus]
            gi|449475785|ref|XP_004154550.1| PREDICTED:
            uncharacterized LOC101204451 [Cucumis sativus]
          Length = 525

 Score =  457 bits (1177), Expect = e-126
 Identities = 254/530 (47%), Positives = 319/530 (60%), Gaps = 41/530 (7%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MA R ELGFPK   Y+LR+Q AR  LR VR +GH  VE+RE+G  FIFFC  C APCYSD
Sbjct: 1    MARRMELGFPKSASYSLREQAARTILRNVRSQGHTYVELRENGKKFIFFCTLCLAPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQL------------ 1323
            S LF HLKG LH ER +AAKLTLLG NPWPF+DGVLF +   E D Q+            
Sbjct: 61   SVLFSHLKGTLHTERLSAAKLTLLGPNPWPFDDGVLFFHKPIEGDNQVGISNDNHERLLE 120

Query: 1322 ---SGSSLIVVNSGKNSNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIPGV 1152
               + ++L +V    NS GN N   E  G+  N++      + +     GE+C +VIPGV
Sbjct: 121  YNNNDNNLAIVKYVGNSKGNGNRQEEFNGNMRNVEDCSFENLND----GGESCPLVIPGV 176

Query: 1151 LCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHD 972
            L K+ IS ++VR +G+G+IAAR  E   I   ++RIWC WLGK  D    E+  K+  H+
Sbjct: 177  LIKEEISDIKVRELGYGQIAARFTEKDGIFSGVSRIWCEWLGKVNDGI--ENMVKVPEHN 234

Query: 971  FGIVTFSYNYTLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPED------- 816
            + I+TF+YN  LGR+  LDD+          E  N E  R+ +RKKSFSDPED       
Sbjct: 235  YAIITFTYNVDLGRKGLLDDVKLLLSSSPGAESQNDE-NRQVKRKKSFSDPEDGSLSMSP 293

Query: 815  --------------ISKSLVLGQYGDESRHAI----SXXXXXXXXXXRVASERICDICKH 690
                          +  SL L  Y D+                    R+A+ER+CDIC+ 
Sbjct: 294  QYDSSGEDSSASNCVMSSLSLDGYDDQILSTTVMLNKAVRRELRRQQRLAAERMCDICQQ 353

Query: 689  KILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGT 510
            KIL  KDV+TLLNMKTGRLACSSRNVNG FH+FHTSCLIHWILLC++EI    L  SK  
Sbjct: 354  KILTHKDVATLLNMKTGRLACSSRNVNGVFHVFHTSCLIHWILLCEYEISVKDLGGSKVR 413

Query: 509  HGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNY 330
               +RK  +K ++ + + + R  K Q  SV CP CQG+GI ++   LE+PT+P  EIF Y
Sbjct: 414  RRYRRKKKTKGNKHIKDGETRQIKTQIDSVFCPACQGTGITIDGDDLEKPTVPLSEIFKY 473

Query: 329  NIKANEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRA 180
             IK ++A  AWMK PE+LQN STG +FPY  +ETIQE V PLKLLHFY A
Sbjct: 474  KIKVSDARRAWMKSPEVLQNCSTGFQFPYQPDETIQENVKPLKLLHFYGA 523


>ref|XP_006470134.1| PREDICTED: uncharacterized protein LOC102608093 isoform X5 [Citrus
            sinensis]
          Length = 508

 Score =  451 bits (1160), Expect(2) = e-125
 Identities = 248/499 (49%), Positives = 315/499 (63%), Gaps = 36/499 (7%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MA RRELGFPK   ++LR+Q AR TL  VR +GH  VE+REDG  FIFFC  C APCYSD
Sbjct: 1    MAGRRELGFPKTSAFSLREQLARTTLSNVRAQGHTYVELREDGKRFIFFCTLCLAPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1287
              LFDHLKGNLH ER +AAK+TLLG NPWPFNDGVLF  N  E++KQ + S+  +  S  
Sbjct: 61   LVLFDHLKGNLHTERLSAAKVTLLGPNPWPFNDGVLFFDNSNEKEKQTTVSNDKLGRSLD 120

Query: 1286 NSNGNDNVALERVGDNENLDSHK----------CATVTNDKSLNGENCNMVIPGVLCKDV 1137
              N + N+A+ + G++  ++ ++          C   T  + +  E+C+ VIPGV  KD 
Sbjct: 121  YHNNDSNLAIVKYGEDMKVNGNEHSGLDEVHFDCENGTQVRDIYSESCDKVIPGVFLKDE 180

Query: 1136 ISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVT 957
            I  L VRFIG G+IAAR+ +  E   +I+RIWC WLGK       ED  ++  HDF IVT
Sbjct: 181  IVDLRVRFIGLGQIAARMIQKDEGSIEISRIWCEWLGKKDPED--EDIVEIPDHDFAIVT 238

Query: 956  FSYNYTLGRRTL-DDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKS-------- 804
            F YNY LGR+ L DD+          + +NGEG  RK RKKSFSDPED+S+S        
Sbjct: 239  FVYNYDLGRKGLFDDVKLLLSSSPAEDSENGEGTGRK-RKKSFSDPEDVSESLSKQYDSC 297

Query: 803  -------------LVLGQYGDESRHA----ISXXXXXXXXXXRVASERICDICKHKILPE 675
                         L+L +YGD+  HA                R+A+ER+CDIC+ KILP+
Sbjct: 298  GEDSSASNSSTSRLLLDRYGDQLLHARFISSKAARREMRRQQRIAAERMCDICQQKILPD 357

Query: 674  KDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKR 495
            KDV+ LLN+KTG LACSSRN+NG FH+FH SCLIHWILLC+FE+ TNQ    K    S+R
Sbjct: 358  KDVAALLNLKTGNLACSSRNLNGVFHVFHISCLIHWILLCEFELKTNQPVTPKVKRRSRR 417

Query: 494  KNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKAN 315
            KN SKR +    D + I   Q SS+ CPECQG+G+N+E  +LE+PTI   ++F Y IK +
Sbjct: 418  KNGSKRVQ-ARKDGEYIFTNQISSLFCPECQGTGVNIEGDELEKPTISLSQMFKYKIKVS 476

Query: 314  EACLAWMKDPEILQNRSTG 258
            +A  AWMK+PE LQN STG
Sbjct: 477  DARKAWMKNPEALQNCSTG 495



 Score = 25.8 bits (55), Expect(2) = e-125
 Identities = 10/13 (76%), Positives = 13/13 (100%)
 Frame = -3

Query: 225 SGKGIASKVASFL 187
           +GKG+ASK+ASFL
Sbjct: 494 TGKGVASKIASFL 506


>ref|XP_007043575.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508707510|gb|EOX99406.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 517

 Score =  451 bits (1161), Expect = e-124
 Identities = 250/528 (47%), Positives = 329/528 (62%), Gaps = 34/528 (6%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1287
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1286 NSNGNDNVALERVGDNENLD-----SHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLE 1122
              +GN N  LE   +++NL        + ++   + +    + +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1121 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNY 942
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 941  TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 801
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 800  ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 669
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 668  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 489
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 488  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 309
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +   ++F Y IK ++A
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQMFRYKIKVSDA 463

Query: 308  CLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE*KS 165
              AWMK PE+L+N STG  F   S E +QEK+LPLKLLHFY AD+ +S
Sbjct: 464  RRAWMKSPEMLENCSTGFHFRSQSGEMVQEKILPLKLLHFYSADKYES 511


>ref|XP_002517932.1| conserved hypothetical protein [Ricinus communis]
            gi|223542914|gb|EEF44450.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 509

 Score =  443 bits (1139), Expect = e-121
 Identities = 247/519 (47%), Positives = 316/519 (60%), Gaps = 28/519 (5%)
 Frame = -1

Query: 1646 MAERRELGFPK-GGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYS 1470
            MA R ELGF K GG  +L++Q AR TL  VR +GH  VE+REDG  FIFFC  C APCYS
Sbjct: 1    MAGRWELGFTKTGGANSLKEQLARTTLNNVRSKGHPYVELREDGKRFIFFCTLCLAPCYS 60

Query: 1469 DSALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSG 1290
            D+ LFDHLKGNLH ER + A LTLL  NPWPF+DGV F     E +KQL    +I  ++ 
Sbjct: 61   DAVLFDHLKGNLHTERLSTATLTLLKENPWPFSDGVHFFDTSSENEKQL----VIKNDNE 116

Query: 1289 KNSNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLEVRFI 1110
               NGN ++A+ + G +      +      D + NG   +++I GVL KD IS L+ RF+
Sbjct: 117  SRGNGNSSLAIVKYGGSLKPTGDEDTGCNKDANDNGRISDLLIQGVLVKDDISDLQARFM 176

Query: 1109 GFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNYTLGR 930
            G+G I AR+ E       I+RIWC WLGK  ++    D +K+  H+F +VTF+YNY LGR
Sbjct: 177  GYGRIGARLIEKDGNSNDISRIWCEWLGK--NTPCDLDKAKVLDHEFAVVTFAYNYDLGR 234

Query: 929  R-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKS----------------- 804
            +  LDD+          E DN  G  RK RKKSFSDPED+S+S                 
Sbjct: 235  KGLLDDVKLLLSSSPVQESDNQGGTNRK-RKKSFSDPEDVSESFSNQYDSSGEESLTSIG 293

Query: 803  -----LVLGQYGDESRHA----ISXXXXXXXXXXRVASERICDICKHKILPEKDVSTLLN 651
                 L+L ++ D+  H+                 +A+ER+CDIC+ KILPEKDV+TL+N
Sbjct: 294  GPPTRLLLDRHDDQFLHSKVISSKTLRRELRRQHHIAAERMCDICQQKILPEKDVATLVN 353

Query: 650  MKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKNVSKRSE 471
            M TG+LACSSRN  G +H+FHTSCLIHWILL ++E+  NQ  + KG   S+RKN +K S 
Sbjct: 354  MNTGKLACSSRNTYGQYHVFHTSCLIHWILLSEYEMARNQSVSPKGRRKSRRKNGTKSSH 413

Query: 470  ILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMK 291
            +   +K +    Q SSV CPECQG+G  +E  + E PTIP  E+F Y IK  +   AWMK
Sbjct: 414  V---EKVKALNNQISSVFCPECQGTGAILEKDERELPTIPLSEMFKYKIKVGDGRRAWMK 470

Query: 290  DPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 174
             PE+L+N S G  FP  SE  +Q KVLPLKLLHFYRADE
Sbjct: 471  SPEVLENCSIGFHFPSQSEGAVQAKVLPLKLLHFYRADE 509


>ref|XP_004303120.1| PREDICTED: uncharacterized protein LOC101310040 [Fragaria vesca
            subsp. vesca]
          Length = 525

 Score =  435 bits (1118), Expect = e-119
 Identities = 251/534 (47%), Positives = 319/534 (59%), Gaps = 43/534 (8%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MA R ++G PK    +LR+Q  R  LR VR +GH  VEVREDG  FIFFC  C APCYSD
Sbjct: 1    MAGRWDVGVPKTNACSLREQATRTILRNVRSQGHSYVEVREDGKKFIFFCTLCLAPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQL------------ 1323
              LFDHLKGNLH ER AAAK+TLL  NPWPFNDGV+F  N +E DK +            
Sbjct: 61   KVLFDHLKGNLHNERLAAAKVTLLRPNPWPFNDGVVFFNNSYETDKGVVTPDDNKCRMLE 120

Query: 1322 ---SGSSLIVVNSGKN--SNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIP 1158
               + ++L +V  G N  +NG D+  ++ +  NE +D     +   D + +G   ++VIP
Sbjct: 121  SHDNENNLAIVKYGGNLKTNGYDHCGVDGLECNEYIDLQGLQSNVGDSTADGAKSSVVIP 180

Query: 1157 GVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTG 978
            G++ +D I+ LEVR +G GEIAAR          I RIWC WLG     S  ED   +  
Sbjct: 181  GIVVRDEITDLEVREVGLGEIAARFLGK----DGIGRIWCEWLGVKSIDS--EDLCNVPE 234

Query: 977  HDFGIVTFSYNYTLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL 801
            HDF +VTFSYN  LGR+  LDD+         +E  NGEG   K RKKSFSDPEDIS SL
Sbjct: 235  HDFAVVTFSYNIDLGRKGLLDDVRMLLSSSPTIESGNGEGTGCK-RKKSFSDPEDISDSL 293

Query: 800  ---------------------VLGQYGDE---SRHAISXXXXXXXXXXR-VASERICDIC 696
                                 +L  Y D+   +R  ++          + +AS R+CDIC
Sbjct: 294  SNQYESFGEDSSASSGTASRLLLDHYDDQLLNTRFILNKSIRRELRRQQRLASGRMCDIC 353

Query: 695  KHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSK 516
            + ++LP KDV+TL+N+KTG+LACSSRNVNGAFH+FHTSCLIHWILLC+ E+ TNQ   SK
Sbjct: 354  QQRMLPGKDVATLMNLKTGKLACSSRNVNGAFHVFHTSCLIHWILLCEVEVITNQNTGSK 413

Query: 515  GTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIF 336
                S+RK  +K +    + + +   PQ  SV CPECQG+GI V+   LE+P +P  ++F
Sbjct: 414  ARRRSRRKTAAKCNG--KDAQLKSLSPQIYSVFCPECQGTGIVVDGDDLEKPNLPLSQMF 471

Query: 335  NYNIKANEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 174
             Y IK ++A  AWMK PE+LQN STG  FP  +   IQEKV  LKLL FYRA E
Sbjct: 472  RYKIKVSDARRAWMKSPEMLQNCSTGFHFPSLNAAGIQEKVKTLKLLRFYRAHE 525


>emb|CAN73945.1| hypothetical protein VITISV_032245 [Vitis vinifera]
          Length = 896

 Score =  427 bits (1097), Expect = e-116
 Identities = 250/503 (49%), Positives = 316/503 (62%), Gaps = 43/503 (8%)
 Frame = -1

Query: 1601 NLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALFDHLKGNLHKER 1422
            +LR+Q AR TLR VR++GH  VE+REDG  FIFFC  C APCYS+S L+DHLKGNLH ER
Sbjct: 352  SLREQAARTTLRNVRMQGHPYVELREDGKRFIFFCTLCLAPCYSESVLYDHLKGNLHSER 411

Query: 1421 YAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLS---GSSLIVVNSGKNSNGNDNVALER 1251
            YAAAK+TLL S+PWPFNDGVLF  N  E DK LS   G+   ++ + KN N   N+A+  
Sbjct: 412  YAAAKVTLLKSHPWPFNDGVLFFDNSSENDKHLSIANGNPTRLLGTHKNDN---NLAIVC 468

Query: 1250 VGD------NENLDSHK-----CATVTNDKSLN--GENCNMVIPGVLCKDVISSLEVRFI 1110
             GD      N +++ H      C     ++SLN  G NC+M+IPGV+ KD ++ LEVRF+
Sbjct: 469  HGDDLSQSNNRHVEQHSNKNSDCDVSFYNESLNNGGRNCDMMIPGVMIKDEVTELEVRFL 528

Query: 1109 GFGEIAARIHESMEIPKKINRIWCAWLGKY--GDSSFHEDASKLTGHDFGIVTFSYNYTL 936
            GFG+IAAR  E   + K I++IWC W GK   GD     +   +  HDF +VTF+Y+Y L
Sbjct: 529  GFGQIAARFFEKDGVSKGISKIWCEWFGKEEPGDG----ETVMVPDHDFAVVTFNYHYNL 584

Query: 935  GRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL--------------- 801
            GR+ L D          L     EG  RK RKKSFSDPEDIS+SL               
Sbjct: 585  GRKGLFD-----DVISMLSSSPTEGSGRK-RKKSFSDPEDISESLSNQYDSSGEDSLISN 638

Query: 800  ------VLGQYGDE---SRHAISXXXXXXXXXXR-VASERICDICKHKILPEKDVSTLLN 651
                  +L +Y D+   +R   S          + VA+ER+CDIC+HK+LP KDV+TL N
Sbjct: 639  SPSPRLLLDRYDDQLLDTRFISSKTIRRELRRQQRVAAERMCDICQHKMLPGKDVATLXN 698

Query: 650  MKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKNVSKRSE 471
            MKTG+L CSSRNV GAFH+FHTSCLIHWILLC+FEI+TNQL   K    S+RK+ SK + 
Sbjct: 699  MKTGKLVCSSRNVYGAFHVFHTSCLIHWILLCEFEIFTNQLVCPKLRRSSRRKSGSKCNG 758

Query: 470  ILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMK 291
               +   + T  Q  SV CPECQG+GI +ED +LE P IP  E+F Y IK ++A  AWMK
Sbjct: 759  KGKDGVIKPTTLQICSVFCPECQGTGIMIED-ELEIPNIPLSEMFKYKIKVSDAHRAWMK 817

Query: 290  DPEILQNRSTGLRFPYNSEETIQ 222
            +PE L++ STG  FP  S ET+Q
Sbjct: 818  NPEELKHCSTGFNFPSQSGETVQ 840


>ref|XP_002319898.2| hypothetical protein POPTR_0013s13670g [Populus trichocarpa]
            gi|550325787|gb|EEE95821.2| hypothetical protein
            POPTR_0013s13670g [Populus trichocarpa]
          Length = 513

 Score =  426 bits (1096), Expect = e-116
 Identities = 235/525 (44%), Positives = 318/525 (60%), Gaps = 34/525 (6%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MA  RE+GFPK    +LR+Q AR TL +VR  GH  +E+REDG  FIFFC  C +PCYSD
Sbjct: 1    MAGNREVGFPKTTASSLREQLARTTLSRVRARGHPYLELREDGKRFIFFCTLCLSPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1287
            + L DHL+GNLH ER +AAK TLL  NPWPF+DG+ F       ++QL+      +  GK
Sbjct: 61   TILLDHLRGNLHTERLSAAKATLLKPNPWPFSDGIHFFDASSGNEEQLA------IKDGK 114

Query: 1286 NSN-------GNDNVALERVGDNENLDSHKCATVTNDKSLNG--ENCNMVIPGVLCKDVI 1134
             S+        +DN+A+ +  +N       C TV  D++L+G  E  ++VIP V  K+ +
Sbjct: 115  ESSRFLKFEENSDNLAIVKYVENLKPG---CDTVV-DENLSGSDEGSDLVIPSVRLKEEV 170

Query: 1133 SSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTF 954
            S L+   +G G+IAAR++E  +   +I+RIWC WLGK   SS  ED  K+  HDFG+VTF
Sbjct: 171  SDLKATLVGSGQIAARMYEKKDGSNEISRIWCEWLGK--KSSNDEDKVKVLDHDFGVVTF 228

Query: 953  SYNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL--------- 801
            +Y+Y LG+  L D            +   + +   +RK+S S+PED+S+SL         
Sbjct: 229  AYDYELGKSGLFDDVKLLLSSSAPALTENDERGNWKRKRSVSEPEDVSRSLTNQYGLCEE 288

Query: 800  ------------VLGQYGDESRH----AISXXXXXXXXXXRVASERICDICKHKILPEKD 669
                        VL +Y D+  H    +            R+A+E++CDIC+ K+LPEKD
Sbjct: 289  ESSKTTCASSNLVLDRYDDQLMHTRFISNKTVRREVRKQQRIAAEKMCDICQQKMLPEKD 348

Query: 668  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 489
            V+TL N KTG+LACSSRNV GAFH+FHTSCLIHWIL C+FEI  NQ  ++KG   S++KN
Sbjct: 349  VATLWNRKTGKLACSSRNVYGAFHVFHTSCLIHWILYCEFEIVRNQTVSTKGGRRSRKKN 408

Query: 488  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 309
             +K +    +    +      SV CP+CQG+G+N+E  + E+P  P  E+F Y IK +E 
Sbjct: 409  GTKSNTTGKDGTVNVLPNPIVSVFCPDCQGTGVNIEGDEFEKPLTPLSEMFKYKIKVSEG 468

Query: 308  CLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRADE 174
               WMK+PEIL+N STG  FP  S E +QEKVLPLKLLHFYR +E
Sbjct: 469  HRGWMKNPEILENCSTGFHFPSQSGEPVQEKVLPLKLLHFYRPEE 513


>gb|EXB79637.1| hypothetical protein L484_011577 [Morus notabilis]
          Length = 638

 Score =  416 bits (1070), Expect = e-113
 Identities = 245/539 (45%), Positives = 320/539 (59%), Gaps = 54/539 (10%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVY--------NLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIW 1491
            MA R  LGFPK            +L+ Q  R  LR VR +GH  VE+REDG   IFFC  
Sbjct: 1    MAGRGILGFPKSNELAVSKTTSCSLKDQAKRTILRNVRSQGHTYVELREDGKKSIFFCTL 60

Query: 1490 CRAPCYSDSALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEED------- 1332
            C APCYSD  LFDHLKGNLH +R + AK+TLLG NPWPFNDGV+F  N  E D       
Sbjct: 61   CLAPCYSDCVLFDHLKGNLHNQRLSTAKVTLLGPNPWPFNDGVVFFNNPTENDDDTVISN 120

Query: 1331 --------KQLSGSSLIVVNSGKN--SNGNDNVALERVG-DNENLDSHKCATVTNDKSLN 1185
                     Q S ++L +V  G+N  S  N ++ ++ +G  NEN DS        + + +
Sbjct: 121  GNQSRLLESQDSENNLAIVTYGENLESCANGHIMVDELGHQNENPDS------AGNLAGS 174

Query: 1184 GENCNMVIPGVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSF 1005
            GENC ++IPGV   D I+++EVR +G+G I+ R  E   +   I+RIWC WLGK   +  
Sbjct: 175  GENCAVLIPGVRAGDEIANVEVREVGYGLISVRFREKDGVSNDISRIWCEWLGK--KTIE 232

Query: 1004 HEDASKLTGHDFGIVTFSYN-YTLGRRTL-DDLNPXXXXXXXLEIDNGEGKRRKQRKKSF 831
             ED  K+  HDF IVTFSYN ++LGR  L DD+          E+ NG+   RK R+KSF
Sbjct: 233  DEDFLKVPEHDFAIVTFSYNNFSLGRMGLHDDVKALLCSSPAAEMQNGDVSSRK-RRKSF 291

Query: 830  SDPEDISK-------------------SLVLGQYGDE-------SRHAISXXXXXXXXXX 729
            SDPED S+                   SL+L QY D+       S  AI           
Sbjct: 292  SDPEDSSENLSNQYDSCGEDSSASAVTSLMLDQYDDQLLQTRFISNKAIRRELRRQQR-- 349

Query: 728  RVASERICDICKHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDF 549
             +A+ER+CDIC+HK+LP KDV+TL+N+KTGRLACSSRN NGAFHLFHTSCLIHW+LLC+ 
Sbjct: 350  -IAAERMCDICQHKMLPGKDVATLMNVKTGRLACSSRNTNGAFHLFHTSCLIHWVLLCEV 408

Query: 548  EIWTNQLDNSKGTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQL 369
            E  TNQ +  K    S+RK  SK +E+L + + +  +   + VICPECQG+G  + DG+ 
Sbjct: 409  EKCTNQSEAPKVKRRSRRKAASKCNEVLNDSEVKAFRTPINRVICPECQGTGTMI-DGED 467

Query: 368  EEPTIPPFEIFNYNIKANEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLH 192
            E+PT+P  ++F Y IK ++A  AWMK PE+L N STG  FP  +EETIQ  ++ +  +H
Sbjct: 468  EKPTVPLSKMFKYKIKVSDARRAWMKSPEVLGNCSTGFHFPSPAEETIQVHLVYIAEIH 526


>ref|XP_006366024.1| PREDICTED: uncharacterized protein LOC102600129 [Solanum tuberosum]
          Length = 521

 Score =  403 bits (1036), Expect = e-109
 Identities = 241/527 (45%), Positives = 311/527 (59%), Gaps = 41/527 (7%)
 Frame = -1

Query: 1634 RELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALF 1455
            R+L FP+    NL++Q  R TL+ VR +GHI VE+REDG   +FFC  C +PCYSDS LF
Sbjct: 4    RQLDFPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLVFFCTLCHSPCYSDSVLF 63

Query: 1454 DHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGKNSNG 1275
            +HLKGNLH E  AAAK TLL  NPWPFNDGVLF +N  E+DK         VN GK+   
Sbjct: 64   NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLF-FNDPEQDKHSPN-----VNVGKSRLV 117

Query: 1274 N----DNVALERVGDNENLDSHKCATVTN------DKSL--NGENCNMVIPGVLCKDVIS 1131
            +    D  +L  V  ++NL  +    VT       D  L  NGE+  +VIPGVLCKD +S
Sbjct: 118  DTCLEDESSLAIVECDDNLRHNGDTYVTEYEYCLLDSELTGNGESEYLVIPGVLCKDELS 177

Query: 1130 SLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFS 951
             LEV+ IG G+IAARI       KKI RIWC WL K        D S +  HDF +VTF 
Sbjct: 178  DLEVKHIGIGKIAARISVRGIDSKKIRRIWCEWLVKKDSDDM--DTSVVPDHDFAVVTFP 235

Query: 950  YNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL---------- 801
            YNY LGR+ L D           E +   G R+++RK SFSDPED S+SL          
Sbjct: 236  YNYNLGRKPLLDDRFLLPSSPYSESEETSGTRKRKRK-SFSDPEDFSESLSNHCDSSGEE 294

Query: 800  -----------VLGQYGDE--SRHAISXXXXXXXXXXR--VASERICDICKHKILPEKDV 666
                       +LG   D+  S   IS          +  VASER+CDIC+ K+LP KDV
Sbjct: 295  SQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDV 354

Query: 665  STLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLD----NSKGTHGSK 498
            +TLL+ K+G+L CSSRN+ GAFHLFH SCLIHWIL C+ + +   +D     +K    SK
Sbjct: 355  ATLLSWKSGKLMCSSRNMTGAFHLFHVSCLIHWILQCELQTYVKPVDEPKMETKAKRRSK 414

Query: 497  RKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKA 318
            RK  +K +     D+ +  + + +SV CPECQG+GI +E  +LE+P +   E++ + IK 
Sbjct: 415  RKTGTKHNAKEKEDEIKSAR-RINSVFCPECQGTGIIIEGDELEKPPVSLSEVYRHKIKL 473

Query: 317  NEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 177
            ++A  AWMK+PE+LQN STG   P   ++ +QE V PLKLLHFYRA+
Sbjct: 474  SDARKAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRAN 520


>ref|XP_002869513.1| hypothetical protein ARALYDRAFT_491947 [Arabidopsis lyrata subsp.
            lyrata] gi|297315349|gb|EFH45772.1| hypothetical protein
            ARALYDRAFT_491947 [Arabidopsis lyrata subsp. lyrata]
          Length = 517

 Score =  392 bits (1008), Expect = e-106
 Identities = 225/525 (42%), Positives = 311/525 (59%), Gaps = 35/525 (6%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MAE++ELG PK  + NL++Q AR TL+ +RL+GH  +E+REDG  F+FFC  C APCYSD
Sbjct: 1    MAEKKELGLPKSSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQ---LSGSSLIVVN 1296
            + L  HL GNLHKER A A+LTLLG+NPWPF+DGVLF  +   E+++   +SG + +   
Sbjct: 60   TILLGHLNGNLHKERLACARLTLLGTNPWPFSDGVLFFDSSTGEEEEKTPVSGGASVPGT 119

Query: 1295 SGKNSNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLEVR 1116
             G  S+ +D  A+ +  +N+    ++ A VT+D+  +  + +++I GVL K+    +E +
Sbjct: 120  LGHCSD-DDRFAIVKYDNNKANGGNQPAAVTDDEPSHSTD-DLLISGVLIKERTLDVEAK 177

Query: 1115 FIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNYTL 936
            FIGFG IAAR+ E+      I+++WC WLG  G S   E+ + +  HDF IVTFSY Y L
Sbjct: 178  FIGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPSD--EEKATIPEHDFAIVTFSYFYNL 235

Query: 935  GRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLV----------LGQY 786
            GR  L D           E  NGE   RK RKKSFSDPED S+SL            G  
Sbjct: 236  GRLGLLDDPSRLLTTSQSESGNGEDSGRK-RKKSFSDPEDTSESLCNQYDSSEEVSSGHN 294

Query: 785  GDESRHAISXXXXXXXXXXRVA---------------SERICDICKHKILPEKDVSTLLN 651
             + SR  I+           V                SERIC++CK K+LP KD + +LN
Sbjct: 295  SNSSRALIADYDDSLMSKRVVKNKTVRRELRRQQRIFSERICEVCKQKMLPGKDAAAILN 354

Query: 650  MKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKR--KNVSKR 477
            MKTG LAC SRN+ GAFHLFH SC++HW L C+ EI  N++ + K   G KR  K+ S +
Sbjct: 355  MKTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVSGK---GKKRCTKHSSGQ 411

Query: 476  SEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAW 297
            + +  N+       Q  SV CPECQG+GIN+E G +E  T P  + + + +K +E   AW
Sbjct: 412  TGVKWNELANDVSWQIFSVFCPECQGTGINIEGGVIERDTFPLSQTWRFQVKVSEGRKAW 471

Query: 296  MKDPEILQNRSTGLRFPYNSEETIQ-----EKVLPLKLLHFYRAD 177
            +K+PE L+N STG  FP  ++E+ Q     E+V  +KL+ FYR +
Sbjct: 472  VKNPEKLKNCSTGFHFPQQADESGQIPVQEERVQMMKLVRFYRVE 516


>ref|XP_007198901.1| hypothetical protein PRUPE_ppa004741mg [Prunus persica]
            gi|462394196|gb|EMJ00100.1| hypothetical protein
            PRUPE_ppa004741mg [Prunus persica]
          Length = 493

 Score =  390 bits (1001), Expect = e-105
 Identities = 236/539 (43%), Positives = 299/539 (55%), Gaps = 49/539 (9%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MA R ELGFPK    +LR+Q  R  LR VR +GH  VE+REDG  FIFFC  C APCYSD
Sbjct: 1    MAGRWELGFPKTSASSLREQATRTILRNVRSQGHTYVELREDGKKFIFFCTLCLAPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQL------------ 1323
              LFDHLKGNLHK+R AAAK+TLL  NPWPFNDGV F +N  E DK L            
Sbjct: 61   KVLFDHLKGNLHKDRLAAAKVTLLRPNPWPFNDGVAFFHNPDETDKHLVITDGNKFRMLE 120

Query: 1322 ---SGSSLIVVNSGKN--SNGNDNVALERVGDNENLD------SHKCATVTNDKSLNGEN 1176
                 ++L +V  G+N  SNGN++V  + +  N +LD      + K +    + + N  N
Sbjct: 121  SPDDENNLAIVKYGENLISNGNEHVGTDGLECNGSLDFPRVRSNFKFSCSNENSTANEVN 180

Query: 1175 CNMVIPGVLCKDVISSLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHED 996
             ++VIP VL +D ++ +E + +G G+IAAR  E  ++ K I RIWC WLGK   +  +E 
Sbjct: 181  SSVVIPSVLVRDDVTDIEAKKVGLGQIAARFLEKDKVSKGIGRIWCEWLGK--KAIGNEY 238

Query: 995  ASKLTGHDFGIVTFSYNYTLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPE 819
              K+  HDF +VTFSYN  LGRR  LDD+         +E +NGEG   K RKKSFSDPE
Sbjct: 239  HLKVPEHDFAVVTFSYNIDLGRRGLLDDVKMLLSSSPSVETENGEGSGSK-RKKSFSDPE 297

Query: 818  DISKS---------------------LVLGQYGDESRHA----ISXXXXXXXXXXRVASE 714
            DIS+S                     L+L +Y D+  H                 R+A  
Sbjct: 298  DISESLSNQYDSCGEDSSASSGASSKLLLDRYDDQLLHTRFILNKSIRRELRRQQRLALG 357

Query: 713  RICDICKHKILPEKDVSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTN 534
            R+CDIC+ +++P KDVS L+N+KTGRLACSSRNVNGAFH+FHTSCLIHWILLC+ EI  N
Sbjct: 358  RMCDICQQRMIPGKDVSALINLKTGRLACSSRNVNGAFHVFHTSCLIHWILLCEVEI-AN 416

Query: 533  QLDNSKGTHGSKRKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTI 354
            Q  NSK    S+RKN +K +    + +      Q  SV CPECQG+G  ++   LE+P +
Sbjct: 417  QSTNSKVRRRSRRKNAAKCNG--QDGQMTALSTQIHSVFCPECQGTGAIIDGDDLEKPNL 474

Query: 353  PPFEIFNYNIKANEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 177
            P                                          QEKV PLKL+HFYRAD
Sbjct: 475  P----------------------------------------LSQEKVKPLKLMHFYRAD 493


>ref|XP_004248159.1| PREDICTED: uncharacterized protein LOC101261554 [Solanum
            lycopersicum]
          Length = 526

 Score =  388 bits (997), Expect = e-105
 Identities = 234/527 (44%), Positives = 305/527 (57%), Gaps = 41/527 (7%)
 Frame = -1

Query: 1634 RELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSDSALF 1455
            ++L  P+    NL++Q  R TL+ VR +GHI VE+REDG   IFFC  C +PCYSDS LF
Sbjct: 4    KQLDVPRTSGGNLKEQLVRRTLQNVRSQGHIYVELREDGKRLIFFCTLCHSPCYSDSVLF 63

Query: 1454 DHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGKNS-- 1281
            +HLKGNLH E  AAAK TLL  NPWPFNDGVLF +N  E+DKQ   S    VN GK+   
Sbjct: 64   NHLKGNLHTEMLAAAKATLLKPNPWPFNDGVLF-FNDPEQDKQDKQSPN--VNVGKSRLV 120

Query: 1280 ----NGNDNVALERVGDN--ENLDSH----KCATVTNDKSLNGENCNMVIPGVLCKDVIS 1131
                    +VA+    DN   N D++    +   + ++   N E+  +VIPGVLCKD +S
Sbjct: 121  DTCLEDESSVAIVEYDDNLRHNEDTYVSEYEYGLLDSELIGNEESDYLVIPGVLCKDELS 180

Query: 1130 SLEVRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFS 951
             LEV+ IG G+IAARI       K I RIWC WL K        D S +  HDF +VTF 
Sbjct: 181  DLEVKHIGIGKIAARISVRGIDSKSIRRIWCEWLAKKDSDDM--DTSVVPDHDFAVVTFP 238

Query: 950  YNYTLGRRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL---------- 801
            YNY LGR  L D           E +       K+++KSFSDPED S+SL          
Sbjct: 239  YNYNLGRSPLLDDRFLLPSSPYSESEE-TSVTGKRKRKSFSDPEDFSESLSNHCDSSGEE 297

Query: 800  -----------VLGQYGDE--SRHAISXXXXXXXXXXR--VASERICDICKHKILPEKDV 666
                       +LG   D+  S   IS          +  VASER+CDIC+ K+LP KDV
Sbjct: 298  SQSTNNSNMKLILGTCDDQLVSSRIISSKTMRRELRKQQRVASERMCDICQQKMLPGKDV 357

Query: 665  STLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLD----NSKGTHGSK 498
            +TLL+ K+G+L CSSRN++GAFHLFH SCLIHWIL C+ +     +D      K    SK
Sbjct: 358  ATLLSWKSGKLMCSSRNMSGAFHLFHVSCLIHWILQCELQTSVKPVDEPKMEPKAKRRSK 417

Query: 497  RKNVSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKA 318
            +K  +K +     D+ +  + + +SV CPECQG+GI +E  +LE+P +   E++   IK 
Sbjct: 418  KKTGTKHNAKEKEDETKSAR-RINSVFCPECQGTGICIEGDELEKPPVSLSEVYRLKIKL 476

Query: 317  NEACLAWMKDPEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 177
            ++A  AWMK+PE+LQN STG   P   ++ +QE V PLKLLHFYRA+
Sbjct: 477  SDARKAWMKNPEVLQNCSTGFDLPPEHDDLLQEYVSPLKLLHFYRAN 523


>ref|XP_007043578.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508707513|gb|EOX99409.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 478

 Score =  387 bits (995), Expect = e-105
 Identities = 218/481 (45%), Positives = 291/481 (60%), Gaps = 34/481 (7%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1287
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1286 NSNGNDNVALERVGDNENLD-----SHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLE 1122
              +GN N  LE   +++NL        + ++   + +    + +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1121 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNY 942
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 941  TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 801
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 800  ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 669
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 668  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 489
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 488  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 309
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +   ++   ++K    
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRC 463

Query: 308  C 306
            C
Sbjct: 464  C 464


>ref|XP_007043576.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508707511|gb|EOX99407.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 481

 Score =  387 bits (995), Expect = e-105
 Identities = 218/481 (45%), Positives = 291/481 (60%), Gaps = 34/481 (7%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1287
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1286 NSNGNDNVALERVGDNENLD-----SHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLE 1122
              +GN N  LE   +++NL        + ++   + +    + +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1121 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNY 942
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 941  TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 801
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 800  ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 669
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 668  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 489
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 488  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 309
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +   ++   ++K    
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDVSLSQVCISDLKTIRC 463

Query: 308  C 306
            C
Sbjct: 464  C 464


>ref|NP_194555.1| uncharacterized protein [Arabidopsis thaliana]
            gi|145334149|ref|NP_001078455.1| uncharacterized protein
            [Arabidopsis thaliana] gi|7269680|emb|CAB79628.1|
            putative protein [Arabidopsis thaliana]
            gi|110742700|dbj|BAE99261.1| hypothetical protein
            [Arabidopsis thaliana] gi|332660060|gb|AEE85460.1|
            uncharacterized protein AT4G28260 [Arabidopsis thaliana]
            gi|332660061|gb|AEE85461.1| uncharacterized protein
            AT4G28260 [Arabidopsis thaliana]
          Length = 516

 Score =  387 bits (994), Expect = e-105
 Identities = 220/522 (42%), Positives = 304/522 (58%), Gaps = 32/522 (6%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MAE++ELG PK  + NL++Q AR TL+ +RL+GH  +E+REDG  F+FFC  C APCYSD
Sbjct: 1    MAEKKELGLPKPSI-NLKEQLARTTLKNLRLQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCF--EEDKQLSGSSLIVVNS 1293
            + L  HL GNLHKER A A++TLLG+NPWPF+DGVLF  +    EE+K        V ++
Sbjct: 60   TILLGHLNGNLHKERLACARITLLGTNPWPFSDGVLFFDSSTGEEEEKSPVSGGEGVPDT 119

Query: 1292 GKNSNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLEVRF 1113
             ++ + ++  A+ +  +N+    +  A VT+D+  +  + +++I GVL K+    +E +F
Sbjct: 120  LEHCSDDERFAIVKYDNNKTNGDNVPAAVTDDEPSHAAD-DLLISGVLIKERTLDVEAKF 178

Query: 1112 IGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNYTLG 933
            IGFG IAAR+ E+      I+++WC WLG  G S   E+ + +  HDF IVTFSY Y LG
Sbjct: 179  IGFGRIAARLFETKGRTTWIDKLWCEWLGDEGPSD--EEKATIPEHDFAIVTFSYFYNLG 236

Query: 932  RRTLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLV----------LGQYG 783
            R  L D           E  NGE   RK RKKSFSDPED S+SL            G   
Sbjct: 237  RLGLLDDPGRLLTSSQSESGNGEDSGRK-RKKSFSDPEDTSESLCNQYDSSEEVSSGHNS 295

Query: 782  DESRHAISXXXXXXXXXXRVA---------------SERICDICKHKILPEKDVSTLLNM 648
            + SR  I+           V                SERIC++CK K+LP KD + +LNM
Sbjct: 296  NSSRDLIADYDDSLMSKRVVKNRTVRRELRRQQRIFSERICEVCKQKMLPGKDAAAILNM 355

Query: 647  KTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKNVSKRSEI 468
            KTG LAC SRN+ GAFHLFH SC++HW L C+ EI  N++ + KG     +   S ++ +
Sbjct: 356  KTGNLACGSRNLLGAFHLFHVSCVVHWFLFCESEILGNKMVSGKGKKRCTKH--SGQTGV 413

Query: 467  LMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMKD 288
              N+       Q  SV CPECQG+GIN+E   +E  T P  + + + +K +E   AW+K+
Sbjct: 414  KWNELANDVSWQIFSVFCPECQGTGINIEGAVIERDTFPLSQTWRFQVKVSEGRKAWVKN 473

Query: 287  PEILQNRSTGLRFPYNSEETIQ-----EKVLPLKLLHFYRAD 177
            PE L+N STG  FP  +EET Q     E+V  +KL+ FYR +
Sbjct: 474  PERLKNCSTGFHFPQQAEETEQIPVQEERVQMMKLVRFYRVE 515


>ref|XP_007043577.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508707512|gb|EOX99408.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 470

 Score =  385 bits (989), Expect = e-104
 Identities = 216/465 (46%), Positives = 285/465 (61%), Gaps = 34/465 (7%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MAERRELG P+    +L++Q AR TL  VR +GH  +E+REDG  FIFFC  C APCYSD
Sbjct: 1    MAERRELGLPRTSACSLKEQLARTTLNNVRSQGHTYIELREDGKRFIFFCTLCLAPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1287
            S L DHLKG+LH  R AAAK+TLLG+NPWPFNDGVLF     E++K+L+G          
Sbjct: 61   SVLLDHLKGSLHSGRLAAAKVTLLGTNPWPFNDGVLFFGKLNEKEKRLAGL--------- 111

Query: 1286 NSNGNDNVALERVGDNENLD-----SHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLE 1122
              +GN N  LE   +++NL        + ++   + +    + +++IPGVL KD IS L+
Sbjct: 112  --HGNQNRLLEFHNNDDNLAIVEYVGSEVSSYRKNVNCRAGDSDLLIPGVLIKDEISDLK 169

Query: 1121 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNY 942
            VRFIGFG+IAAR  E   +  +I+RIWC WLGK  +   ++D  K   H F +VTF YN 
Sbjct: 170  VRFIGFGKIAARFCEKDGVLNEISRIWCEWLGK--EVPRNDDKLKAPKHGFAVVTFVYNC 227

Query: 941  TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSL------------ 801
             LGR+  LDD+           ++NG+   RK RKKSFSDPEDIS+SL            
Sbjct: 228  DLGRKGLLDDVKSLLTSGSPTGLENGDSASRK-RKKSFSDPEDISESLSNQYDSSGEDSS 286

Query: 800  ---------VLGQYGDE-------SRHAISXXXXXXXXXXRVASERICDICKHKILPEKD 669
                      L +Y D+       S  AI            +A+ER+CDIC+ K+LPEKD
Sbjct: 287  ASNITSSRLALDRYDDQLLLTRFISSKAIRRELRRQQR---IAAERMCDICQQKMLPEKD 343

Query: 668  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 489
            V+TL+N+ TG+L CSSRNVNGAFH+FHTSCLIHWILLC+ E   N   N K    S+RKN
Sbjct: 344  VATLMNLNTGKLVCSSRNVNGAFHVFHTSCLIHWILLCEVERIENHSVNPKARRRSRRKN 403

Query: 488  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTI 354
             +K +++  + + + T    SSV+CPECQG+GI+VE  +LE+P +
Sbjct: 404  GAKSNDMGKDGETKATGTLISSVLCPECQGTGIDVEGDELEKPDV 448


>ref|XP_007149858.1| hypothetical protein PHAVU_005G104500g [Phaseolus vulgaris]
            gi|561023122|gb|ESW21852.1| hypothetical protein
            PHAVU_005G104500g [Phaseolus vulgaris]
          Length = 498

 Score =  381 bits (978), Expect = e-103
 Identities = 216/517 (41%), Positives = 293/517 (56%), Gaps = 27/517 (5%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MA + ELG  K  V N ++Q AR  L+ VR +GH  VE+RE+G  FI+FC  C APCYSD
Sbjct: 1    MAGKLELGPLKSDVSNPKEQAARKILKIVRSQGHPYVELRENGKKFIYFCTLCLAPCYSD 60

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCFEEDKQLSGSSLIVVNSGK 1287
              LFDHLKGNLHKER +AAK+TLLG  PWPFNDG++F     E D+ L  +        K
Sbjct: 61   DVLFDHLKGNLHKERLSAAKVTLLGPKPWPFNDGLVFFDTSIESDRDLEVADSYRNRLLK 120

Query: 1286 NSNGNDNVALERVGDNENLDSHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLEVRFIG 1107
             +N ++++A+ +  +    ++  C+T      +  + C +VIP +L +D I  ++V  +G
Sbjct: 121  FNNNDNSLAIVKFDEGVQSNAEPCST----DGMPNDECGLVIPHLLIRDEIFDVKVSEVG 176

Query: 1106 FGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNYTLGRR 927
             G+IAAR  E       I RIWC WLGK G+    +D  ++  HDF IV F+YNY LGR 
Sbjct: 177  LGKIAARFLEKCSALSGIKRIWCEWLGKKGNDQ--QDGVEILEHDFAIVNFAYNYDLGRS 234

Query: 926  -TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLVLGQYGDESRHAISXXX 750
              LDD+           + +  G R+   K+S SD +DIS SL   QY   +  +     
Sbjct: 235  GLLDDVKSL--------LPSASGGRKG--KRSLSDSDDISDSLC-NQYDSSAEESSDSNN 283

Query: 749  XXXXXXXR--------------------------VASERICDICKHKILPEKDVSTLLNM 648
                                              +A+E++C+IC+ K+LP KDV+ LLN+
Sbjct: 284  SSAPLTLDQFNNHHVCTRFISSKAVRKELRRKQRLAAEKVCNICQQKMLPGKDVAALLNL 343

Query: 647  KTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKNVSKRSEI 468
             T R+ACSSRN  GAFH+FHTSCLIHWI+LC+FEI TN L         KRK  S   +I
Sbjct: 344  NTRRVACSSRNKTGAFHVFHTSCLIHWIILCEFEIITNHLVRPNVRRIVKRKIASDGEKI 403

Query: 467  LMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEACLAWMKD 288
                K++  +    +V CPECQG+G+ ++   +E+P     ++F + IKA +A   WMK 
Sbjct: 404  ---GKEKDIEKHIRTVFCPECQGTGMVIDGDGVEQPEFSLSQMFKFKIKACDARREWMKS 460

Query: 287  PEILQNRSTGLRFPYNSEETIQEKVLPLKLLHFYRAD 177
            PEILQN STG  FP  SEE  +EKV P+ LLHFYRAD
Sbjct: 461  PEILQNCSTGFHFPSQSEEIFEEKVEPINLLHFYRAD 497


>ref|XP_006412978.1| hypothetical protein EUTSA_v10024944mg [Eutrema salsugineum]
            gi|557114148|gb|ESQ54431.1| hypothetical protein
            EUTSA_v10024944mg [Eutrema salsugineum]
          Length = 514

 Score =  374 bits (960), Expect = e-101
 Identities = 217/527 (41%), Positives = 300/527 (56%), Gaps = 37/527 (7%)
 Frame = -1

Query: 1646 MAERRELGFPKGGVYNLRQQEARITLRKVRLEGHISVEVREDGNNFIFFCIWCRAPCYSD 1467
            MAE +ELG PK  + +L++Q AR TLR +R +GH  +E+REDG  F+FFC  C APCYSD
Sbjct: 1    MAESKELGLPKTAI-SLKEQLARTTLRNLRSQGHTYIELREDGKRFVFFCTLCLAPCYSD 59

Query: 1466 SALFDHLKGNLHKERYAAAKLTLLGSNPWPFNDGVLFIYNCF-EEDKQLSGSSLIVVNSG 1290
            + L  HL GNLHKER + A++TLLG NPWPFNDGVLF  +   EE+K L      V    
Sbjct: 60   AILLGHLNGNLHKERLSCARITLLGENPWPFNDGVLFFDSSTGEEEKTLISDGEGVTGPL 119

Query: 1289 KNSNGNDNVALERVGDNENL----DSHKCATVTNDKSLNGENCNMVIPGVLCKDVISSLE 1122
             + + N+  A+    +N       D+   A + ++ +   EN  +VI  +L K+    +E
Sbjct: 120  HHCSDNERFAIVTYDENRTCESQGDNQPAAGIDDEPNHCAEN--LVISNLLIKEKTLDVE 177

Query: 1121 VRFIGFGEIAARIHESMEIPKKINRIWCAWLGKYGDSSFHEDASKLTGHDFGIVTFSYNY 942
             +FIGFG IAAR+ E+      I+++WC WLG+  +S   E+ + +  HDF IVTFSY Y
Sbjct: 178  AKFIGFGRIAARLFETKGRTTWIDKLWCEWLGE--ESPPDEEKATVPEHDFAIVTFSYFY 235

Query: 941  TLGRR-TLDDLNPXXXXXXXLEIDNGEGKRRKQRKKSFSDPEDISKSLVLGQYGDESRHA 765
             LGR   L D +         E  NGE   RK RKKSFSDPED S+SL   QY  +S   
Sbjct: 236  NLGRLGLLADPSRLLTLSQSAESGNGEDNGRK-RKKSFSDPEDTSESLC-NQY--DSSEE 291

Query: 764  ISXXXXXXXXXXRVA----------------------------SERICDICKHKILPEKD 669
            +S           +A                            S+RIC++CK K+LP KD
Sbjct: 292  VSSARNSNSSRALIADYDDHLVNKRVIKNKSVRRELRKQQRIFSDRICEVCKQKMLPGKD 351

Query: 668  VSTLLNMKTGRLACSSRNVNGAFHLFHTSCLIHWILLCDFEIWTNQLDNSKGTHGSKRKN 489
             + +LNMKTG+LACSSRN  GAFHLFH SC++HW L C+ EI  +++ + KG     +K 
Sbjct: 352  AAAILNMKTGKLACSSRNRLGAFHLFHVSCVVHWFLFCETEILGSKMVSGKG-----KKR 406

Query: 488  VSKRSEILMNDKKRITKPQFSSVICPECQGSGINVEDGQLEEPTIPPFEIFNYNIKANEA 309
             +K+S +  N+       Q  SV CPECQG+GIN+E   +E  T P  + + + +K +E 
Sbjct: 407  CTKQSGVKWNELVGDVSWQIFSVFCPECQGTGINIEGDVIERDTFPLSQTWRFGVKVSEG 466

Query: 308  CLAWMKDPEILQNRSTGLRFPYNSEETI---QEKVLPLKLLHFYRAD 177
              AW+K+PE L+N STG  FP   EE +   +++V  +KL+ FYR +
Sbjct: 467  RKAWVKNPEKLENCSTGFHFPQQDEELVKGQEDRVQSMKLVRFYRVE 513


Top