BLASTX nr result

ID: Catharanthus22_contig00004268 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00004268
         (3021 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006351149.1| PREDICTED: uncharacterized protein LOC102587...   756   0.0  
ref|XP_004250383.1| PREDICTED: uncharacterized protein LOC101266...   752   0.0  
ref|XP_002267682.1| PREDICTED: uncharacterized protein LOC100248...   734   0.0  
emb|CAN72342.1| hypothetical protein VITISV_029506 [Vitis vinifera]   728   0.0  
ref|XP_006434617.1| hypothetical protein CICLE_v10000424mg [Citr...   704   0.0  
ref|XP_006473199.1| PREDICTED: uncharacterized protein LOC102626...   701   0.0  
gb|EMJ00916.1| hypothetical protein PRUPE_ppa002175mg [Prunus pe...   701   0.0  
ref|XP_004138816.1| PREDICTED: uncharacterized protein LOC101218...   694   0.0  
ref|XP_002526825.1| conserved hypothetical protein [Ricinus comm...   691   0.0  
gb|ESW15647.1| hypothetical protein PHAVU_007G090000g [Phaseolus...   687   0.0  
ref|XP_003556200.1| PREDICTED: uncharacterized protein LOC100797...   686   0.0  
gb|EOY17186.1| Uncharacterized protein isoform 1 [Theobroma cacao]    683   0.0  
gb|EOY17187.1| Uncharacterized protein isoform 2 [Theobroma caca...   681   0.0  
ref|XP_006589455.1| PREDICTED: uncharacterized protein LOC100810...   679   0.0  
ref|XP_004158544.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   677   0.0  
ref|XP_002327334.1| predicted protein [Populus trichocarpa] gi|5...   671   0.0  
ref|XP_004496362.1| PREDICTED: uncharacterized protein LOC101489...   663   0.0  
ref|XP_006396046.1| hypothetical protein EUTSA_v10006981mg [Eutr...   652   0.0  
gb|EOY17189.1| Uncharacterized protein isoform 4 [Theobroma cacao]    651   0.0  
ref|XP_006851237.1| hypothetical protein AMTR_s00180p00023300 [A...   624   e-175

>ref|XP_006351149.1| PREDICTED: uncharacterized protein LOC102587291 [Solanum tuberosum]
          Length = 713

 Score =  756 bits (1951), Expect = 0.0
 Identities = 405/701 (57%), Positives = 480/701 (68%), Gaps = 8/701 (1%)
 Frame = -3

Query: 2647 VVPDHNHHQVSIGIRSGAXXXXXXXXXXXXXXXRLVKGRKISIIGALIVFLVVAFIASVA 2468
            V  DH    VS+GIRS                      RKISI GA+IV L VAF+ SV+
Sbjct: 25   VASDH----VSVGIRSNKQQQQQQQQLRNHHRRLKSTTRKISI-GAIIVILFVAFVVSVS 79

Query: 2467 AFLYLSSKDKDINSN-YRKXXXXXXXXDFLRNVTRTEKNKVVKFGHGSVKHGXXXXXXXX 2291
            AF Y +S++K++++N ++         DFL NVTRT+  KV++FGHGSV HG        
Sbjct: 80   AFFYFTSQNKELDNNHFQDDGDVENDSDFLTNVTRTQ-GKVLQFGHGSVNHGRDSRYWDK 138

Query: 2290 XXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKFSDKKSLTAKSRKDLDHRGNGLYNEAGR 2111
                                 +DK+    + K  DKKS +    K LDH GNGLYNEAGR
Sbjct: 139  DDRRRDDDYNEEDLERNPVSDLDKEQSPPDGKKGDKKSFS----KGLDHGGNGLYNEAGR 194

Query: 2110 NELKMYEAEYEASLKTIGQSK-GHGVVEQQSHDAGTGNENKMVDSDDEYDDGIDLQXXXX 1934
            +EL+ YEA+Y+ASL+    ++ GH +  QQS DA  G + ++VD+DD YDDGIDL+    
Sbjct: 195  DELRKYEAKYQASLENAEHAQDGHHLPNQQSSDADKGKKRELVDADDGYDDGIDLEDAHT 254

Query: 1933 XXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQKSPEEIDEYSDDLPXXXXXXXX 1754
                       +H++A    D+       +H +  ++Q   +E  + S D          
Sbjct: 255  DGYDDGGHEDWNHTVAAESQDIHDSHFFDTHVAGNNYQNHAKEASKTSKDFSNKESSSS- 313

Query: 1753 XXXXXDLKHANSINSQS------TXXXXXXXXXXXXXXXXXXXXXXSCEMKLLNSSALLV 1592
                    H    N+ S                             +CEMK+LN+SALLV
Sbjct: 314  -------SHHQKSNTNSGRVSFIDGHPSKKSSSEKRPVSRRKPRKHACEMKILNASALLV 366

Query: 1591 EPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESFIARDQKINCGFVKGP 1412
            EPLESRKF+RFSLQY            W PRF+GHQ+L+EREESF+AR+QKINCGFV+GP
Sbjct: 367  EPLESRKFSRFSLQYTETEDKPFDDANWEPRFSGHQSLEEREESFLARNQKINCGFVRGP 426

Query: 1411 KGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVSRLSRKNVCFVMFVDE 1232
            +G  STGFDLAEDDAKYISSCHIAV SCIFGNSDRLRIPVGKMVSR+S+KNVCFVMFVDE
Sbjct: 427  EGTPSTGFDLAEDDAKYISSCHIAVASCIFGNSDRLRIPVGKMVSRISKKNVCFVMFVDE 486

Query: 1231 VTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLLSHRLFSSARYSIWLD 1052
             TL+TL++EG M D MGF+GLWKIVVVKNLP  DMRRVGKIPKLLSHRLF+SARYSIWLD
Sbjct: 487  ATLKTLTAEGKMPDSMGFVGLWKIVVVKNLPFSDMRRVGKIPKLLSHRLFTSARYSIWLD 546

Query: 1051 SKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRLNKYNHTVIDEQFEFY 872
            SKLRLQLDP LILEYFLWRKG+EYAISNHYDRHCVWEEVAQNK+LNKYNHTVIDEQF FY
Sbjct: 547  SKLRLQLDPLLILEYFLWRKGYEYAISNHYDRHCVWEEVAQNKKLNKYNHTVIDEQFAFY 606

Query: 871  QADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLSFAYT 692
            QADGL++FNASD +KLL SNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLSFAYT
Sbjct: 607  QADGLQRFNASDPNKLLHSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLSFAYT 666

Query: 691  YYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKKNIL 569
            YYKL++ NP+KPFYLNMFKDCERRK+AKLFRHRS+E+++IL
Sbjct: 667  YYKLKKMNPDKPFYLNMFKDCERRKIAKLFRHRSDEQRDIL 707


>ref|XP_004250383.1| PREDICTED: uncharacterized protein LOC101266589 [Solanum
            lycopersicum]
          Length = 713

 Score =  752 bits (1941), Expect = 0.0
 Identities = 406/701 (57%), Positives = 481/701 (68%), Gaps = 8/701 (1%)
 Frame = -3

Query: 2647 VVPDHNHHQVSIGIRSGAXXXXXXXXXXXXXXXRLVKGRKISIIGALIVFLVVAFIASVA 2468
            V  DH    VS+GIRS                      RKISI GA+IV L VAF+ SV+
Sbjct: 27   VASDH----VSVGIRSSKQQQQQQSRNYHRRLKSTT--RKISI-GAIIVILFVAFVVSVS 79

Query: 2467 AFLYLSSKDKDINSN-YRKXXXXXXXXDFLRNVTRTEKNKVVKFGHGSVKHGXXXXXXXX 2291
            AF Y +S++K++++N ++         DFL NVTRT+  KV++FGHGSV HG        
Sbjct: 80   AFFYFTSQNKELDNNHFQDDGDVENDSDFLTNVTRTQ-GKVLQFGHGSVNHGRDSRYWDK 138

Query: 2290 XXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKFSDKKSLTAKSRKDLDHRGNGLYNEAGR 2111
                              D  ++K+    + K  DKKS +    K LDH GNGLYNEAGR
Sbjct: 139  DDRRRDDDYNEEDLERNQDSDLNKEQSPPDGKKGDKKSFS----KGLDHGGNGLYNEAGR 194

Query: 2110 NELKMYEAEYEASLKTIGQSK-GHGVVEQQSHDAGTGNENKMVDSDDEYDDGIDLQXXXX 1934
            +EL+ YEA Y+ASL+  G S+ GH +  QQ  DA  G + ++VD+DD YDDGIDL+    
Sbjct: 195  DELRKYEARYQASLENAGHSQDGHHLPNQQLSDADKGKKTELVDADDGYDDGIDLEDAHT 254

Query: 1933 XXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQKSPEEIDEYSDDLPXXXXXXXX 1754
                       +H++A    D++ +    +H +  ++Q   +E  +   D          
Sbjct: 255  DGYDDGDHEDWNHTVAAESQDINDRHFLDTHVAGNNYQNHAKEAGKTYRDFSNKESSSS- 313

Query: 1753 XXXXXDLKHANSINSQS------TXXXXXXXXXXXXXXXXXXXXXXSCEMKLLNSSALLV 1592
                    H    N+ S                             +CEMK+LN+SALLV
Sbjct: 314  -------SHHQKSNTNSGRVSFIDGHPSKKSSSEKRPVPRRKSRKHACEMKILNASALLV 366

Query: 1591 EPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESFIARDQKINCGFVKGP 1412
            EPLESRKF+RFSLQY            W PRF+GHQ+++EREESF+AR+QKINCGFV+GP
Sbjct: 367  EPLESRKFSRFSLQYAETEDKPFDDANWEPRFSGHQSMEEREESFLARNQKINCGFVRGP 426

Query: 1411 KGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVSRLSRKNVCFVMFVDE 1232
            +   STGFDLAEDDAKYISSCHIAV SCIFGNSDRLRIPVGKMVSR+S+KNVCFVMFVDE
Sbjct: 427  EETPSTGFDLAEDDAKYISSCHIAVASCIFGNSDRLRIPVGKMVSRISKKNVCFVMFVDE 486

Query: 1231 VTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLLSHRLFSSARYSIWLD 1052
            VTL+TL++EG M D MGF+GLWKIVVVKNLP  DMRRVGKIPKLLSHRLF+SARYSIWLD
Sbjct: 487  VTLKTLTAEGKMPDSMGFVGLWKIVVVKNLPFSDMRRVGKIPKLLSHRLFTSARYSIWLD 546

Query: 1051 SKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRLNKYNHTVIDEQFEFY 872
            SKLRLQLDP LILEYFLWRKG+EYAISNHYDRHCVWEEVAQNK+LNKYNHTVIDEQF FY
Sbjct: 547  SKLRLQLDPLLILEYFLWRKGYEYAISNHYDRHCVWEEVAQNKKLNKYNHTVIDEQFAFY 606

Query: 871  QADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLSFAYT 692
            QADGL++FNASD +KLL SNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLSFAYT
Sbjct: 607  QADGLQRFNASDPNKLLHSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLSFAYT 666

Query: 691  YYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKKNIL 569
            YYKLR+ NP+KPFYLNMFKDCERRK+AKLFRHRS+E++NIL
Sbjct: 667  YYKLRKMNPDKPFYLNMFKDCERRKIAKLFRHRSDEQRNIL 707


>ref|XP_002267682.1| PREDICTED: uncharacterized protein LOC100248770 [Vitis vinifera]
          Length = 698

 Score =  734 bits (1896), Expect = 0.0
 Identities = 398/667 (59%), Positives = 457/667 (68%), Gaps = 5/667 (0%)
 Frame = -3

Query: 2539 KGRKISIIGALIVFLVVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTE 2360
            KG +IS+ GA+++ L +    +V A+ Y+S  D +IN+ + +        DFL NVTR +
Sbjct: 47   KGSRISV-GAVVLILSLVLTVTVFAYNYISG-DSEINTYHAQDDDSKDELDFLTNVTRID 104

Query: 2359 KNKVVKFGHGSVKHG----XXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKF 2192
            K+KV++FG GS  HG                              DGS+DK  + V  K 
Sbjct: 105  KSKVLEFGQGSGVHGGDSRYWERDDRRRDEDYNEEALEHSTMSTRDGSIDKSRVVVKGKN 164

Query: 2191 SDKKSLTAKSRKDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQS-KGHGVVEQQSHD 2015
             ++K     S K    RG+GLYNEAGR+ELK+YEAEYEASLK +GQS   HG   +   D
Sbjct: 165  DNEKIFFDNSIKGSGGRGSGLYNEAGRDELKIYEAEYEASLKNVGQSINEHGDRNKLFDD 224

Query: 2014 AGTGNENKMVDSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKS 1835
            AG G  N+ +D+DDEYDDGID                 D S     H     DSS S  +
Sbjct: 225  AGFGMHNEEMDADDEYDDGIDSHDARMVEDDDNGHENGDISNVAKSH-----DSSDSISA 279

Query: 1834 ITSHQKSPEEIDEYSDDLPXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXXXXXX 1655
             T      EE+DE S                 + +H + ++ +ST               
Sbjct: 280  GTKDGNIVEEVDESSS--------VSSSLNSQNSRHVSVVDGRSTRKFSSEKRPESKRKR 331

Query: 1654 XXXXXXXSCEMKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLK 1475
                   SCEMKLLNS+A LVEPLESRKFARFSLQY            W PRF+GHQ+L+
Sbjct: 332  RHKFSGSSCEMKLLNSTAQLVEPLESRKFARFSLQYTAVEEKPNGQEHWEPRFSGHQSLQ 391

Query: 1474 EREESFIARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIP 1295
            EREESF+A DQKINC FVK PKGY STGFDLAEDD +YISSCHIAV+SCIFGNSDRLR P
Sbjct: 392  EREESFLAHDQKINCAFVKSPKGYPSTGFDLAEDDVRYISSCHIAVISCIFGNSDRLRSP 451

Query: 1294 VGKMVSRLSRKNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVG 1115
             GK +SRLSRKNVCFVMF+DE+TL+TLSSE  M DRMGFIGLWK VVVKNLP  DMRRVG
Sbjct: 452  AGKTISRLSRKNVCFVMFMDEITLQTLSSERQMPDRMGFIGLWKTVVVKNLPYTDMRRVG 511

Query: 1114 KIPKLLSHRLFSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEV 935
            KIPKLL+HRLF SARYSIWLDSKLRLQLDP LILEYFLWRKGHEYAISNHYDRHCVWEEV
Sbjct: 512  KIPKLLAHRLFPSARYSIWLDSKLRLQLDPLLILEYFLWRKGHEYAISNHYDRHCVWEEV 571

Query: 934  AQNKRLNKYNHTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFS 755
            AQNK+LNKYNH++ID+QF FYQADGLK+FNASD +KLLPSNVPEGSFIVRAHTPMSNLFS
Sbjct: 572  AQNKKLNKYNHSIIDQQFAFYQADGLKRFNASDPNKLLPSNVPEGSFIVRAHTPMSNLFS 631

Query: 754  CLWFNEVDRFTPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKKN 575
            CLWFNEVDRFTPRDQLSFAYTY KLRR NP KPF+LNMFKDCERR +AKLFRHRSEEK+N
Sbjct: 632  CLWFNEVDRFTPRDQLSFAYTYQKLRRVNPGKPFHLNMFKDCERRAIAKLFRHRSEEKRN 691

Query: 574  ILRREIE 554
            IL+   E
Sbjct: 692  ILQAAAE 698


>emb|CAN72342.1| hypothetical protein VITISV_029506 [Vitis vinifera]
          Length = 692

 Score =  728 bits (1879), Expect = 0.0
 Identities = 397/672 (59%), Positives = 456/672 (67%), Gaps = 10/672 (1%)
 Frame = -3

Query: 2539 KGRKISIIGALIVFLVVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTE 2360
            KG +IS+ GA+++ L +    +V A+ Y+S  D +IN+ + +        DFL NVTR +
Sbjct: 47   KGSRISV-GAVVLILSLVLTVTVFAYNYISG-DSEINTYHAQDDDSKDELDFLTNVTRID 104

Query: 2359 KNKVVKFGHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXD----GSVDKDSIAVNPKF 2192
            K+KV++FG GS  HG                               GS+DK  + V  K 
Sbjct: 105  KSKVLEFGQGSGVHGGDSRYWERDDRRRDEDYNEEALEHSTMSTRDGSIDKSRVVVKGKN 164

Query: 2191 SDKKSLTAKSRKDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQS-KGHGVVEQQSHD 2015
             ++K     S K    RG+GLYNEAGR+ELK+YEAEYEASLK +GQS   HG   +   D
Sbjct: 165  DNEKIFFDNSIKGSGGRGSGLYNEAGRDELKIYEAEYEASLKNVGQSINEHGDRNKLFDD 224

Query: 2014 AGTGNENKMVDSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKS 1835
            AG G  N+ +D+DDEYDDGID                 D S     HD     SS S  +
Sbjct: 225  AGFGMHNEEMDADDEYDDGIDSHDARMVEDDDNGHENGDISNVAKSHD-----SSDSISA 279

Query: 1834 ITSHQKSPEEIDEYSDDLPXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXXXXXX 1655
             T      EE+DE S                     ++S+NSQ++               
Sbjct: 280  GTKDGNIVEEVDESSSV-------------------SSSLNSQNSRHVLREEPQLDECRK 320

Query: 1654 XXXXXXXS-----CEMKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAG 1490
                   +     CEMKLLNS+A LVEPLESRKFARFSLQY            W PRF+G
Sbjct: 321  SSVKTKAAIRCSSCEMKLLNSTAQLVEPLESRKFARFSLQYTAVEEKPNGQEHWEPRFSG 380

Query: 1489 HQTLKEREESFIARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSD 1310
            HQ+L+EREESF+A DQKINC FVK PKGY STGFDLAEDD +YISSCHIAV+SCIFGNSD
Sbjct: 381  HQSLQEREESFLAHDQKINCAFVKSPKGYPSTGFDLAEDDVRYISSCHIAVISCIFGNSD 440

Query: 1309 RLRIPVGKMVSRLSRKNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDD 1130
            RLR P GK +SRLSRKNVCFVMF+DE+TL+TLSSE  M DRMGFIGLWK VVVKNLP  D
Sbjct: 441  RLRSPAGKTISRLSRKNVCFVMFMDEITLQTLSSERQMPDRMGFIGLWKTVVVKNLPYTD 500

Query: 1129 MRRVGKIPKLLSHRLFSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHC 950
            MRRVGKIPKLL+HRLF SARYSIWLDSKLRLQLDP LILEYFLWRKGHEYAISNHYDRHC
Sbjct: 501  MRRVGKIPKLLAHRLFPSARYSIWLDSKLRLQLDPLLILEYFLWRKGHEYAISNHYDRHC 560

Query: 949  VWEEVAQNKRLNKYNHTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPM 770
            VWEEVAQNK+LNKYNH++ID+QF FYQADGLK+FNASD +KLLPSNVPEGSFIVRAHTPM
Sbjct: 561  VWEEVAQNKKLNKYNHSIIDQQFAFYQADGLKRFNASDPNKLLPSNVPEGSFIVRAHTPM 620

Query: 769  SNLFSCLWFNEVDRFTPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRS 590
            SNLFSCLWFNEVDRFTPRDQLSFAYTY KLRR NP KPF+LNMFKDCERR +AKLFRHRS
Sbjct: 621  SNLFSCLWFNEVDRFTPRDQLSFAYTYQKLRRVNPGKPFHLNMFKDCERRAIAKLFRHRS 680

Query: 589  EEKKNILRREIE 554
            EEK+NIL+   E
Sbjct: 681  EEKRNILQAAAE 692


>ref|XP_006434617.1| hypothetical protein CICLE_v10000424mg [Citrus clementina]
            gi|557536739|gb|ESR47857.1| hypothetical protein
            CICLE_v10000424mg [Citrus clementina]
          Length = 722

 Score =  704 bits (1817), Expect = 0.0
 Identities = 394/710 (55%), Positives = 453/710 (63%), Gaps = 15/710 (2%)
 Frame = -3

Query: 2656 NGVVVPDHNHHQVSIGIRSGAXXXXXXXXXXXXXXXRLVKGRKISIIGALIVFLVVAFIA 2477
            NG    DH    VSIGIRS                     GR++SI G++I  L++  +A
Sbjct: 25   NGTSFNDH----VSIGIRSAPYNKPARARRSARSDK---NGRRLSI-GSVIFVLLLVLLA 76

Query: 2476 SVAAFLYLSS--------KDKDINSNYRKXXXXXXXXDFLRNVTRTEKNKVVKFGHGSVK 2321
            +V A+LY+S         +DK+I S+           DFL NVTRT   KVV FG GS+ 
Sbjct: 77   TVLAYLYISGYSNHNDDDQDKEIISHSAVDDELKNDIDFLMNVTRTNTLKVVGFGKGSIS 136

Query: 2320 HGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKDS----IAVNPKFSDKKSLTAKSRKD 2153
            HG                            + DK +     +V     ++K       K 
Sbjct: 137  HGRDSRYWDKDDRRRDDDYSEDILEHASVAATDKSTGTGHASVKVDSGNEKISVDDPHKG 196

Query: 2152 LDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKG-HGVVEQQSHDAGTGNENKMVDSD 1976
             D +G GLYNEAGRNELKMYEAEYEASLK  G S   +G   QQS D   G  ++ +D D
Sbjct: 197  SDRKGVGLYNEAGRNELKMYEAEYEASLKNAGLSGNLNGNENQQSGDKIIGVNSEPIDVD 256

Query: 1975 DEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQKSPEEIDE 1796
            DEYDD ++                 DHS    + +   ++SS  H +   HQ    +++E
Sbjct: 257  DEYDDNVEFHDTRVGEYDDSRHDKGDHSDVAKIQNQYQRESSDLHDAKILHQNIVRKVEE 316

Query: 1795 YSDDLPXXXXXXXXXXXXXDL--KHANSINSQSTXXXXXXXXXXXXXXXXXXXXXXSCEM 1622
             S +L                  +  + +  QST                       CE+
Sbjct: 317  VSSNLSVDSSLKSQNLDKFYATQRQVSLVGGQSTKASPKKKSKRRSS----------CEV 366

Query: 1621 KLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESFIARDQ 1442
            K+LNS+  LVEPLESRKFARF LQY           EW PRFAGHQ+L+EREESF+ARDQ
Sbjct: 367  KILNSTTQLVEPLESRKFARFFLQYTEVEEKPDGEAEWEPRFAGHQSLQEREESFLARDQ 426

Query: 1441 KINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVSRLSRK 1262
            KINCGFVK P+GY STGFDLAEDDA Y S CHIAV+SCIFGNSDRLRIPVGK V+RLSRK
Sbjct: 427  KINCGFVKAPEGYPSTGFDLAEDDANYNSRCHIAVISCIFGNSDRLRIPVGKTVTRLSRK 486

Query: 1261 NVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLLSHRLF 1082
            NVCFVMF DE+TL+TLSSEG + DR GFIGLWK+VVVKNLP DDMRRVGKIPKLL HRLF
Sbjct: 487  NVCFVMFTDELTLQTLSSEGQIPDRTGFIGLWKMVVVKNLPYDDMRRVGKIPKLLPHRLF 546

Query: 1081 SSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRLNKYNH 902
             SARYSIWLDSKLRLQ DP LILEYFLWRKG+EYAISNHYDRHCVWEEVAQNK+LNKYNH
Sbjct: 547  PSARYSIWLDSKLRLQRDPLLILEYFLWRKGYEYAISNHYDRHCVWEEVAQNKKLNKYNH 606

Query: 901  TVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFT 722
            TVID+QF FYQADGLK+F+ SD  +LLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFT
Sbjct: 607  TVIDQQFAFYQADGLKRFDPSDPDRLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFT 666

Query: 721  PRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKKNI 572
             RDQLSFAYTY KLRR NP K FYLNMFKDCERR MAKLFRHRS EK+ +
Sbjct: 667  SRDQLSFAYTYQKLRRMNPSKMFYLNMFKDCERRSMAKLFRHRSAEKRGV 716


>ref|XP_006473199.1| PREDICTED: uncharacterized protein LOC102626086 [Citrus sinensis]
          Length = 722

 Score =  701 bits (1810), Expect = 0.0
 Identities = 394/710 (55%), Positives = 451/710 (63%), Gaps = 15/710 (2%)
 Frame = -3

Query: 2656 NGVVVPDHNHHQVSIGIRSGAXXXXXXXXXXXXXXXRLVKGRKISIIGALIVFLVVAFIA 2477
            NG    DH    VSIGIRS                     GR++SI G++I  L++  +A
Sbjct: 25   NGTSFNDH----VSIGIRSAPYNKPARARRSARSDK---NGRRLSI-GSVIFVLLLVLLA 76

Query: 2476 SVAAFLYLSS--------KDKDINSNYRKXXXXXXXXDFLRNVTRTEKNKVVKFGHGSVK 2321
            +V A+LY+S         +DK+I S+           DFL NVTRT   KVV FG GS+ 
Sbjct: 77   TVLAYLYISGYSNHNDDDQDKEIISHSAVDDELKNDIDFLTNVTRTNTLKVVGFGKGSIG 136

Query: 2320 HGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKD----SIAVNPKFSDKKSLTAKSRKD 2153
            HG                            + DK       +V     ++K       K 
Sbjct: 137  HGRDSRYWDKDDRRRDDDYSEDILEHASMAATDKSIGTGHASVKVDSGNEKLSVDDPHKG 196

Query: 2152 LDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKG-HGVVEQQSHDAGTGNENKMVDSD 1976
             D +G GLYNEAGRNELKMYEAEYEASLK  G S   +G   QQS D   G  ++ +D D
Sbjct: 197  SDRKGVGLYNEAGRNELKMYEAEYEASLKNAGLSGNLNGNENQQSGDKIIGVNSEPIDVD 256

Query: 1975 DEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQKSPEEIDE 1796
            DEYDD ++                 DHS    +     ++SS  H +   HQ    +++E
Sbjct: 257  DEYDDNVEFHDTRIGEYDDSGHDKGDHSDVAKIQSQYQRESSDLHDAKILHQNIVRKVEE 316

Query: 1795 YSDDLPXXXXXXXXXXXXXDL--KHANSINSQSTXXXXXXXXXXXXXXXXXXXXXXSCEM 1622
             S +L                  +  + +  QST                       CE+
Sbjct: 317  VSSNLSVDSSLKSQNLDKFYATQRQVSLVGGQSTKASPKKKSKRRSS----------CEV 366

Query: 1621 KLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESFIARDQ 1442
            K+LNS+  LVEPLESRKFARF LQY           EW PRFAGHQ+L+EREESF+ARDQ
Sbjct: 367  KILNSTTQLVEPLESRKFARFFLQYTEVEEKPDGEAEWEPRFAGHQSLQEREESFLARDQ 426

Query: 1441 KINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVSRLSRK 1262
            KINCGFVK P+GY STGFDLAEDDA Y S CHIAV+SCIFGNSDRLRIPVGK V+RLSRK
Sbjct: 427  KINCGFVKAPEGYPSTGFDLAEDDANYNSRCHIAVISCIFGNSDRLRIPVGKTVTRLSRK 486

Query: 1261 NVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLLSHRLF 1082
            NVCFVMF DE+TL+TLSSEG + DR GFIGLWK+VVVKNLP DDMRRVGKIPKLL HRLF
Sbjct: 487  NVCFVMFTDELTLQTLSSEGQIPDRTGFIGLWKMVVVKNLPYDDMRRVGKIPKLLPHRLF 546

Query: 1081 SSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRLNKYNH 902
             SARYSIWLDSKLRLQ DP LILEYFLWRKG+EYAISNHYDRHCVWEEVAQNK+LNKYNH
Sbjct: 547  PSARYSIWLDSKLRLQRDPLLILEYFLWRKGYEYAISNHYDRHCVWEEVAQNKKLNKYNH 606

Query: 901  TVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFT 722
            TVID+QF FYQADGLK+F+ SD  +LLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFT
Sbjct: 607  TVIDQQFAFYQADGLKRFDPSDPDRLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFT 666

Query: 721  PRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKKNI 572
             RDQLSFAYTY KLRR NP K FYLNMFKDCERR MAKLFRHRS EK+ +
Sbjct: 667  SRDQLSFAYTYQKLRRMNPGKMFYLNMFKDCERRSMAKLFRHRSAEKRGV 716


>gb|EMJ00916.1| hypothetical protein PRUPE_ppa002175mg [Prunus persica]
          Length = 705

 Score =  701 bits (1810), Expect = 0.0
 Identities = 382/669 (57%), Positives = 440/669 (65%), Gaps = 14/669 (2%)
 Frame = -3

Query: 2518 IGALIVFLVVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTEKNKVVKF 2339
            IGA+++ L + F+ ++ AF YLS   +++N+ + +        DFL NVTRTE +KV++F
Sbjct: 63   IGAVVLVLALVFVFTLLAFYYLSRNSRELNTYHAEEDDIKNDPDFLTNVTRTETSKVLRF 122

Query: 2338 GHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKFSDKKSLTAKSR 2159
            G GSV HG                          D + DK  + V  K SDKKSL     
Sbjct: 123  GKGSVVHGRDSRYWDKDDRRRDGDYNEDGSAGVSDEATDKGDVHVRVKNSDKKSLNDDFP 182

Query: 2158 KDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQSHDAGTGNENKMVDS 1979
            K    +G GLYNEAGRNELK+YEAEYEASLK   +SK          D     + +++D 
Sbjct: 183  KSSSRKG-GLYNEAGRNELKIYEAEYEASLKNSRESK----------DEDLDKQKEVIDV 231

Query: 1978 DDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATS----------LHDLDGKDSSVSHK--- 1838
            DDEYDDGID                       S          L D+   D +V++K   
Sbjct: 232  DDEYDDGIDFHETHMDEYEDMGHQNDHFDEEKSRDEDSGESIDLPDVGTNDQNVANKVEK 291

Query: 1837 -SITSHQKSPEEIDEYSDDLPXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXXXX 1661
             S  S +  P +     D++                +H +  + QS+             
Sbjct: 292  VSTNSFEDDPVQHSRNLDEVNTKP------------RHVSIHSGQSSKKSRSTSKRKPKR 339

Query: 1660 XXXXXXXXXSCEMKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQT 1481
                      CEMK LNS+A L+EPLESRKFARFS+QY            W PRFAGHQT
Sbjct: 340  RKYSGSS---CEMKFLNSTAQLIEPLESRKFARFSMQYTQAEDKPEGEEHWEPRFAGHQT 396

Query: 1480 LKEREESFIARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLR 1301
            L+ERE SF+A DQKI CGFVKGPK   STGFDLAEDD  YIS CHIAV+SCIFGNSDRLR
Sbjct: 397  LQERENSFLANDQKIKCGFVKGPKESPSTGFDLAEDDTNYISRCHIAVMSCIFGNSDRLR 456

Query: 1300 IPVGKMVSRLSRKNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRR 1121
            +P GK VSRLSRK VCFVMFVDEVTL+T+SSEG + DRMGFIGLWKIVVVKNLP  DMRR
Sbjct: 457  MPYGKTVSRLSRKYVCFVMFVDEVTLQTISSEGQIPDRMGFIGLWKIVVVKNLPYTDMRR 516

Query: 1120 VGKIPKLLSHRLFSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWE 941
            VGKIPKLL HRLF SARYSIWLDSKLRLQLDP LILEYFLWRKG+EYAISNHYDRHCVWE
Sbjct: 517  VGKIPKLLPHRLFPSARYSIWLDSKLRLQLDPLLILEYFLWRKGYEYAISNHYDRHCVWE 576

Query: 940  EVAQNKRLNKYNHTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNL 761
            EVAQNKRLNKYNHT+ID+QF FYQADGL +FNA D +KLLPSNVPEGSFIVRAHTPMSNL
Sbjct: 577  EVAQNKRLNKYNHTIIDQQFAFYQADGLTRFNALDPNKLLPSNVPEGSFIVRAHTPMSNL 636

Query: 760  FSCLWFNEVDRFTPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEK 581
            FSCLWFNEV+RFTPRDQLSFAYTY KLRR NP KPF LNMFKDCERR +AKLFRHRS+EK
Sbjct: 637  FSCLWFNEVERFTPRDQLSFAYTYQKLRRMNPGKPFQLNMFKDCERRAIAKLFRHRSDEK 696

Query: 580  KNILRREIE 554
            +NI ++  E
Sbjct: 697  QNIRQKATE 705


>ref|XP_004138816.1| PREDICTED: uncharacterized protein LOC101218369 [Cucumis sativus]
          Length = 731

 Score =  694 bits (1792), Expect = 0.0
 Identities = 375/685 (54%), Positives = 453/685 (66%), Gaps = 23/685 (3%)
 Frame = -3

Query: 2539 KGRKISIIGALIVFLVVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTE 2360
            KGR IS+ GA++  L +  + +V A+ YL    K+I+++  +        DFL NVTRTE
Sbjct: 53   KGRGISV-GAIVFVLSLVLVVTVLAYYYLLRDTKEISNSNVEDDALKNDPDFLANVTRTE 111

Query: 2359 KNKVVKFGHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXD--GSVDKDSIAVNPKFSD 2186
              KV +FG+G VKHG                              +  K  + V      
Sbjct: 112  TTKV-RFGNGLVKHGRDSRYWDGDDRRRDQDYNEDVVDHMATINKATGKGDVPVKVSEDQ 170

Query: 2185 KKSLTAKSRKDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQ-SHDAG 2009
            ++S   +S+  LD +  GLYNEAGR EL+ YEAEYEAS+KT GQ +  G  + Q S +  
Sbjct: 171  RESSLEQSQNSLDRKDTGLYNEAGRKELRKYEAEYEASVKTSGQLEKEGNEDNQVSDEDD 230

Query: 2008 TGNENKMVDSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSL---------------- 1877
            + N N  +D+DDEY++G D +               DHS +TSL                
Sbjct: 231  SENWNDTIDTDDEYENGSDSKNHAMEEDDDTEREKGDHSDSTSLTEEDSGKSVNFVENEN 290

Query: 1876 --HDLDGKDSSVSHKSITSHQKSPEEID--EYSDDLPXXXXXXXXXXXXXDLKHANSINS 1709
              +D +GK  +V     T +Q+  E ++   +S D               + KH +  NS
Sbjct: 291  PHNDDNGKSLNVDDGE-TKYQQEDENVETSNHSLDEDYTSSSQHVDKANQNSKHVSVTNS 349

Query: 1708 QSTXXXXXXXXXXXXXXXXXXXXXXSCEMKLLNSSALLVEPLESRKFARFSLQYXXXXXX 1529
            Q T                       CEMK LNS+A ++EP+E++KF RF+LQY      
Sbjct: 350  QHTKRSKLDPRKKPKHRKFSGSS---CEMKFLNSTAQILEPIENKKFVRFTLQYTDTEQD 406

Query: 1528 XXXXXEWVPRFAGHQTLKEREESFIARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISSC 1349
                 +W+PRFAGHQTL+ERE SF A+DQKINCGFVKGPK ++STGFDL EDD+ Y+S C
Sbjct: 407  PSNQEKWMPRFAGHQTLQERETSFYAQDQKINCGFVKGPKTFSSTGFDLTEDDSNYVSRC 466

Query: 1348 HIAVVSCIFGNSDRLRIPVGKMVSRLSRKNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGL 1169
            HIAVVSCIFGNSD LR P GK V+R SRKNVCFVMF+DEVTL TLSSEG  +DRMGFIGL
Sbjct: 467  HIAVVSCIFGNSDHLRSPTGKTVTRFSRKNVCFVMFMDEVTLETLSSEGQTVDRMGFIGL 526

Query: 1168 WKIVVVKNLPSDDMRRVGKIPKLLSHRLFSSARYSIWLDSKLRLQLDPYLILEYFLWRKG 989
            WKIVVVKNLP  DMRRVGKIPKLL HR+F SARYSIWLDSKLRLQ DP LILEYFLWRKG
Sbjct: 527  WKIVVVKNLPYTDMRRVGKIPKLLPHRIFPSARYSIWLDSKLRLQYDPLLILEYFLWRKG 586

Query: 988  HEYAISNHYDRHCVWEEVAQNKRLNKYNHTVIDEQFEFYQADGLKKFNASDLHKLLPSNV 809
            +E+AISNHYDRHCVWEEVAQNKRLNKYNHT+ID+QF FYQADGLK+FNASD++KLLPSNV
Sbjct: 587  YEFAISNHYDRHCVWEEVAQNKRLNKYNHTIIDQQFSFYQADGLKRFNASDVNKLLPSNV 646

Query: 808  PEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDC 629
            PEGSFI+RAHTPMSNLFSCLWFNEVD+FTPRDQLSFAYTY KL+R NP KPFYLNMFKDC
Sbjct: 647  PEGSFIIRAHTPMSNLFSCLWFNEVDKFTPRDQLSFAYTYQKLKRMNPGKPFYLNMFKDC 706

Query: 628  ERRKMAKLFRHRSEEKKNILRREIE 554
            ERRK+AKLFRHRS+EK+ + +  +E
Sbjct: 707  ERRKIAKLFRHRSDEKRIVHKNAME 731


>ref|XP_002526825.1| conserved hypothetical protein [Ricinus communis]
            gi|223533829|gb|EEF35560.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 722

 Score =  691 bits (1784), Expect = 0.0
 Identities = 373/659 (56%), Positives = 438/659 (66%), Gaps = 6/659 (0%)
 Frame = -3

Query: 2518 IGALIVFLVVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXD-FLRNVTRTEKNKVVK 2342
            IGALIV L +  + +V A+ Y+S+ +K+IN ++            FL NVTRT+  KV++
Sbjct: 70   IGALIVVLSLVLVVTVLAYYYISADNKEINDHHGDGDDEVKNDLDFLTNVTRTDTVKVLE 129

Query: 2341 FGHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXDG----SVDKDSIAVNPKFSDKKSL 2174
            FG GS  HG                                S +K    V  K   +K+ 
Sbjct: 130  FGQGSA-HGRDSRYWDKDDRRRDGDYNEDDVDHDSKEAEDESTEKGHNLVKMKNGKEKTS 188

Query: 2173 TAKSRKDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQSHDAGTGNEN 1994
                 K LD RG GLYNE GR ELKMYEAEYEASLK  GQS+    ++ ++ D    NE 
Sbjct: 189  QNDPSKGLDQRGTGLYNEDGRKELKMYEAEYEASLKNAGQSRNKNEIKPRALDDEEQNEG 248

Query: 1993 KMVDSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQKS 1814
              VD+D+EYDDGID                   +      D DG++ S  + + T  Q  
Sbjct: 249  --VDTDNEYDDGIDSHDPHVEDYGDSDHDNGHQASVRISSDEDGREFSNFNDADTKDQNV 306

Query: 1813 PEEIDEYSDDLPXXXXXXXXXXXXXDLKHAN-SINSQSTXXXXXXXXXXXXXXXXXXXXX 1637
             ++  E S+++              +      S + QST                     
Sbjct: 307  AKDNHEVSENISDKSLNIRTLDNVDNNSQVGRSSSGQSTTKSRSYSKKKSRHRKGS---- 362

Query: 1636 XSCEMKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESF 1457
              CEMK LNS+  LVEP ESRKFARFSLQY           +W P+FAGHQ+L+E EESF
Sbjct: 363  --CEMKFLNSTTQLVEPFESRKFARFSLQYSEKEEKPNGDLQWEPKFAGHQSLQEWEESF 420

Query: 1456 IARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVS 1277
            +  DQKINCGFVKGP+G  STGFDL+EDDA YIS CHIAV+SCIFGNSDRLR P  KMV+
Sbjct: 421  LVHDQKINCGFVKGPEGSPSTGFDLSEDDASYISRCHIAVISCIFGNSDRLRSPPTKMVT 480

Query: 1276 RLSRKNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLL 1097
            RLSRKNVCFV+FVD++TL+TLSSEGHM D  GFIG WK+VVVKNLP  DMRRVGKIPK+L
Sbjct: 481  RLSRKNVCFVIFVDKITLQTLSSEGHMPDIAGFIGFWKVVVVKNLPYTDMRRVGKIPKML 540

Query: 1096 SHRLFSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRL 917
             HRLF SARYSIWLDSKLRLQ+DP L+LEYFLWRKG+EYAISNHYDRHCVWEEVAQNKRL
Sbjct: 541  PHRLFPSARYSIWLDSKLRLQIDPLLVLEYFLWRKGYEYAISNHYDRHCVWEEVAQNKRL 600

Query: 916  NKYNHTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNE 737
            NKYNHT+ID+QF FYQADGLKKFNASD +KLLPSNVPEGS IVRAHTPMSNLFSCLWFNE
Sbjct: 601  NKYNHTIIDQQFTFYQADGLKKFNASDPNKLLPSNVPEGSLIVRAHTPMSNLFSCLWFNE 660

Query: 736  VDRFTPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKKNILRRE 560
            V+RFTPRDQLSFAYTY KLRR NP+KPF+L+MFKDCERR +AKLFRHRSEEK+N LR++
Sbjct: 661  VERFTPRDQLSFAYTYQKLRRMNPDKPFHLHMFKDCERRAVAKLFRHRSEEKRNSLRQQ 719


>gb|ESW15647.1| hypothetical protein PHAVU_007G090000g [Phaseolus vulgaris]
          Length = 698

 Score =  687 bits (1772), Expect = 0.0
 Identities = 382/702 (54%), Positives = 448/702 (63%), Gaps = 10/702 (1%)
 Frame = -3

Query: 2653 GVVVPDHNHHQ-------VSIGIRSGAXXXXXXXXXXXXXXXRLVKGRKISIIGALIVFL 2495
            GV V  H+ H        +++GIR GA                  +G ++S++ A++VFL
Sbjct: 13   GVRVGPHDLHHTNGAGDHLAVGIRGGAAHKQQRSRRSARSE----RGTQLSVV-AILVFL 67

Query: 2494 VVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTEKNKVVKFGHGSVKHG 2315
             +  + +V  F Y+S  +  I++N           DFL NV R +K KV+ FGHGS  HG
Sbjct: 68   FLVLVVTVLVFSYISRDE--ISNNGDDSVDLKSESDFLTNVPRIQKKKVLDFGHGSGGHG 125

Query: 2314 XXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKFSDKKSLTAKSRKDLDHRGN 2135
                                       G    D IA       K   T  S   L  RG+
Sbjct: 126  RDSRYWDKDDRRRDGDYDEDMMEQT--GKDPGDEIAEEDDAVKKDQDTKSSHDGLKRRGD 183

Query: 2134 GLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQSHDAGTGNENKMVDSDDEYDDGI 1955
            GLYNEAGR+ELK YEAEYEASLK +  S       +  H+     +N   D DDEYDD  
Sbjct: 184  GLYNEAGRHELKRYEAEYEASLKNLRHSTEDD--GKLLHETDLEKKNASDDIDDEYDDFF 241

Query: 1954 DLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQKSPEEIDEY-SDDLP 1778
            D                 +HS A  L    G D+ V  +  ++   + E  D+  S+D+ 
Sbjct: 242  DFNDVQLENTSYSKNMRGEHSNANVL----GLDNEVQKQKESNDSLAEENNDDVTSEDIE 297

Query: 1777 XXXXXXXXXXXXXDL--KHANSINSQSTXXXXXXXXXXXXXXXXXXXXXXSCEMKLLNSS 1604
                             KHA++ N QST                       CEMKLLNS+
Sbjct: 298  GASSLNKKILLEGKTNSKHASNFNGQSTRKSHPETKKKVRRRKFSGS----CEMKLLNST 353

Query: 1603 ALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESFIARDQKINCGF 1424
            + LVEPLESRKFARF+L Y           +WVPRFAGHQ+L+ERE SF+ARDQKINCGF
Sbjct: 354  SQLVEPLESRKFARFNLHYTEMEEKPLGEEQWVPRFAGHQSLEERESSFLARDQKINCGF 413

Query: 1423 VKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVSRLSRKNVCFVM 1244
            VKGP+G  STGFDL EDD  YIS CHIAV+SCIFGNSDRLRIP  K V+RLSRKNVCFVM
Sbjct: 414  VKGPEGSQSTGFDLTEDDTSYISRCHIAVISCIFGNSDRLRIPATKTVTRLSRKNVCFVM 473

Query: 1243 FVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLLSHRLFSSARYS 1064
            F DE+T+RTLSSEGH+ DRMGFIG WK+VVVKNLP DDMRRVGKIPKLL HRLF  ARYS
Sbjct: 474  FTDEITIRTLSSEGHVPDRMGFIGFWKLVVVKNLPYDDMRRVGKIPKLLPHRLFPFARYS 533

Query: 1063 IWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRLNKYNHTVIDEQ 884
            IWLDSKLRLQLDP LILEYFLWRKG+E+AISNHYDRHCVWEEVAQNK+LNKYNHTVID+Q
Sbjct: 534  IWLDSKLRLQLDPLLILEYFLWRKGYEFAISNHYDRHCVWEEVAQNKKLNKYNHTVIDQQ 593

Query: 883  FEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLS 704
            F FY+ADG+++F+ASD +KLLPSNVPEGSFI+RAHTPMSNLFSCLWFNEVDRFTPRDQLS
Sbjct: 594  FSFYRADGMERFDASDPNKLLPSNVPEGSFIIRAHTPMSNLFSCLWFNEVDRFTPRDQLS 653

Query: 703  FAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKK 578
            FAYTY KLRR N +KPF+LNMFKDCERR +AKLFRHR +EKK
Sbjct: 654  FAYTYQKLRRMNADKPFHLNMFKDCERRHIAKLFRHRLDEKK 695


>ref|XP_003556200.1| PREDICTED: uncharacterized protein LOC100797815 [Glycine max]
          Length = 699

 Score =  686 bits (1771), Expect = 0.0
 Identities = 369/649 (56%), Positives = 435/649 (67%), Gaps = 2/649 (0%)
 Frame = -3

Query: 2518 IGALIVFLVVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTEKNKVVKF 2339
            + A++VFL +  + +V  F Y+S  +  I++N           DFL NV R ++ KV+ F
Sbjct: 56   VAAILVFLFLVLVVTVLVFSYISRDE--ISNNGGDSDDLKSDSDFLTNVPRIQRKKVLDF 113

Query: 2338 GHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKFSDKKSLTAKSR 2159
            GHGS  HG                              D+++       S K    +KS 
Sbjct: 114  GHGSGGHGRDSRYWDRDDRRRDGDYGEDMMEQTSKDHGDENA---EDDASVKTDHDSKSS 170

Query: 2158 KD-LDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQS-KGHGVVEQQSHDAGTGNENKMV 1985
            +D L  RG+GLYNEAGR+ELK YEAEYEASLK +G S +  G V   SHD     +N   
Sbjct: 171  QDGLQRRGDGLYNEAGRHELKRYEAEYEASLKNLGHSTEDDGKV---SHDTDLEKKNAAD 227

Query: 1984 DSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQKSPEE 1805
            D DDEYDD  D                  HS ++ L  LD +       + +  +++ ++
Sbjct: 228  DIDDEYDDFFDFHDAQMEDSGDSKNMKVKHSNSSVL-SLDNEVQKQKEPNDSFDEENNDD 286

Query: 1804 IDEYSDDLPXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXXXXXXXXXXXXXSCE 1625
            +     +               + KHAN  N QST                       CE
Sbjct: 287  VTSEDVEGTSSFNKKKSHDGKTNAKHANPSNGQSTRKSHPETKKKAKRRKFSGS----CE 342

Query: 1624 MKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESFIARD 1445
            MKLLNS++ LVEPLESRKF+RF+LQY           +WVPRFAGHQ+L+ERE SF+ARD
Sbjct: 343  MKLLNSTSQLVEPLESRKFSRFNLQYTETEEKPLGDEQWVPRFAGHQSLEERESSFLARD 402

Query: 1444 QKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVSRLSR 1265
            Q+INCGFVKGP+G  STGFDL EDDA YIS CHIAV+SCIFGNSDRLR P  K V+RLSR
Sbjct: 403  QQINCGFVKGPEGSQSTGFDLTEDDANYISRCHIAVISCIFGNSDRLRTPATKTVTRLSR 462

Query: 1264 KNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLLSHRL 1085
            KNVCFVMF DE+T+RTLSSEGH+ DRMGFIG WK+VVVKNLP DDMRRVGKIPKLL HRL
Sbjct: 463  KNVCFVMFTDEITIRTLSSEGHVPDRMGFIGFWKLVVVKNLPYDDMRRVGKIPKLLPHRL 522

Query: 1084 FSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRLNKYN 905
            F  ARYSIWLDSKLRLQLDP LILEYFLWRKG+E+AISNHYDRHCVWEEVA+NK+LNKYN
Sbjct: 523  FPFARYSIWLDSKLRLQLDPLLILEYFLWRKGYEFAISNHYDRHCVWEEVARNKKLNKYN 582

Query: 904  HTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRF 725
            HTVIDEQF FY+ADGL+KF+ASD +KLLPSNVPEGSFI+RAHTPMSNLFSCLWFNEVDRF
Sbjct: 583  HTVIDEQFAFYRADGLEKFDASDPNKLLPSNVPEGSFIIRAHTPMSNLFSCLWFNEVDRF 642

Query: 724  TPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKK 578
            TPRDQLSFAYTY KLRR NP+KPF+LNMFKDCERR +AKLFRHR +EK+
Sbjct: 643  TPRDQLSFAYTYQKLRRMNPDKPFHLNMFKDCERRHIAKLFRHRLDEKR 691


>gb|EOY17186.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 781

 Score =  683 bits (1762), Expect = 0.0
 Identities = 377/671 (56%), Positives = 438/671 (65%), Gaps = 9/671 (1%)
 Frame = -3

Query: 2539 KGRKISIIGALIVFLVVAFIASVAAFLYLSSKDKD----INSNYRKXXXXXXXXDFLRNV 2372
            K R+ISI G LIV L +  + +V  + Y+S+ + D    +NS + K        DFL NV
Sbjct: 47   KNRRISI-GFLIVVLSLVLVVTVLVYYYISADNNDNSEELNSYHPKDVDSKVDSDFLTNV 105

Query: 2371 TRTEKNKVVKFGHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKD----SIAV 2204
            TR + +KV+ FG  S+ HG                            S D+      + V
Sbjct: 106  TRMDSSKVLSFGRSSIAHGRDSRYWDRDDRRRDDDYNEDVVEHNIMDSSDESLDGGHVPV 165

Query: 2203 NPKFSDKKSLTAKSRKDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQ 2024
              K ++KK  +    KDLD R  GLYNEAGRNELK YE EYE SLK  G+ +      ++
Sbjct: 166  KVK-NEKKEASLDPNKDLDRRAVGLYNEAGRNELKRYEKEYELSLKDGGKLQKELENSRR 224

Query: 2023 SHDAGTGNENKMVDSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVS 1844
              D+     +  VD+DD Y+DG D                  H    ++ +    D  V 
Sbjct: 225  LSDSKDFGLHDEVDADDHYNDGFDSSDSQTEDYDDFG-----HDKEDNVDEAKSHDEHVK 279

Query: 1843 HKSITSHQKSPEEIDEYSDD-LPXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXX 1667
              S  S  K    + E  ++ +              + +H  S+  +             
Sbjct: 280  EFSTFSKTKERHVVKEGKEESMLSREASGDFGDVDANSQHVGSLGRKGAKSSRADSKRKP 339

Query: 1666 XXXXXXXXXXXSCEMKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGH 1487
                        CEMKLLNS+ L VEPLESRKFARFSLQY           +WVP FAGH
Sbjct: 340  RRRKFSGS----CEMKLLNSTHL-VEPLESRKFARFSLQYKQMEENSEGEEQWVPTFAGH 394

Query: 1486 QTLKEREESFIARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDR 1307
            Q+L+EREESF+A DQKINCGFVKGP+GY STGFDLAEDD  YIS CHIAV+SCIFGNSDR
Sbjct: 395  QSLQEREESFLAHDQKINCGFVKGPQGYPSTGFDLAEDDVNYISRCHIAVISCIFGNSDR 454

Query: 1306 LRIPVGKMVSRLSRKNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDM 1127
            LR P GKMV+RLSRKNVCFVMFVDEVT++TL SEG   D  GFIGLWKIVVVKNLP  DM
Sbjct: 455  LRTPAGKMVTRLSRKNVCFVMFVDEVTMQTLFSEGQSPDG-GFIGLWKIVVVKNLPYADM 513

Query: 1126 RRVGKIPKLLSHRLFSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCV 947
            RRVGKIPKLL HRLF SARYSIWLDSKLRLQ DP  +L+YFLWRKGHEYAISNHYDRHCV
Sbjct: 514  RRVGKIPKLLPHRLFPSARYSIWLDSKLRLQRDPLQLLDYFLWRKGHEYAISNHYDRHCV 573

Query: 946  WEEVAQNKRLNKYNHTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMS 767
            WEEVAQNK+LNKYNHTVIDEQFEFYQADGLKKFN+SD +KLLPSNVPEGSFIVRAHTPMS
Sbjct: 574  WEEVAQNKKLNKYNHTVIDEQFEFYQADGLKKFNSSDPNKLLPSNVPEGSFIVRAHTPMS 633

Query: 766  NLFSCLWFNEVDRFTPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSE 587
            NLFSCLWFNEV+RFTPRDQLSFAYTY KLRR NP+KPFYLNMFKDCERR +AKLFRHRSE
Sbjct: 634  NLFSCLWFNEVERFTPRDQLSFAYTYQKLRRMNPDKPFYLNMFKDCERRAIAKLFRHRSE 693

Query: 586  EKKNILRREIE 554
            EK+N+ ++ I+
Sbjct: 694  EKRNVQQQAIK 704


>gb|EOY17187.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508725291|gb|EOY17188.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 704

 Score =  681 bits (1758), Expect = 0.0
 Identities = 376/665 (56%), Positives = 434/665 (65%), Gaps = 9/665 (1%)
 Frame = -3

Query: 2539 KGRKISIIGALIVFLVVAFIASVAAFLYLSSKDKD----INSNYRKXXXXXXXXDFLRNV 2372
            K R+ISI G LIV L +  + +V  + Y+S+ + D    +NS + K        DFL NV
Sbjct: 47   KNRRISI-GFLIVVLSLVLVVTVLVYYYISADNNDNSEELNSYHPKDVDSKVDSDFLTNV 105

Query: 2371 TRTEKNKVVKFGHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKD----SIAV 2204
            TR + +KV+ FG  S+ HG                            S D+      + V
Sbjct: 106  TRMDSSKVLSFGRSSIAHGRDSRYWDRDDRRRDDDYNEDVVEHNIMDSSDESLDGGHVPV 165

Query: 2203 NPKFSDKKSLTAKSRKDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQ 2024
              K ++KK  +    KDLD R  GLYNEAGRNELK YE EYE SLK  G+ +      ++
Sbjct: 166  KVK-NEKKEASLDPNKDLDRRAVGLYNEAGRNELKRYEKEYELSLKDGGKLQKELENSRR 224

Query: 2023 SHDAGTGNENKMVDSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVS 1844
              D+     +  VD+DD Y+DG D                  H    ++ +    D  V 
Sbjct: 225  LSDSKDFGLHDEVDADDHYNDGFDSSDSQTEDYDDFG-----HDKEDNVDEAKSHDEHVK 279

Query: 1843 HKSITSHQKSPEEIDEYSDD-LPXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXX 1667
              S  S  K    + E  ++ +              + +H  S+  +             
Sbjct: 280  EFSTFSKTKERHVVKEGKEESMLSREASGDFGDVDANSQHVGSLGRKGAKSSRADSKRKP 339

Query: 1666 XXXXXXXXXXXSCEMKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGH 1487
                        CEMKLLNS+ L VEPLESRKFARFSLQY           +WVP FAGH
Sbjct: 340  RRRKFSGS----CEMKLLNSTHL-VEPLESRKFARFSLQYKQMEENSEGEEQWVPTFAGH 394

Query: 1486 QTLKEREESFIARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDR 1307
            Q+L+EREESF+A DQKINCGFVKGP+GY STGFDLAEDD  YIS CHIAV+SCIFGNSDR
Sbjct: 395  QSLQEREESFLAHDQKINCGFVKGPQGYPSTGFDLAEDDVNYISRCHIAVISCIFGNSDR 454

Query: 1306 LRIPVGKMVSRLSRKNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDM 1127
            LR P GKMV+RLSRKNVCFVMFVDEVT++TL SEG   D  GFIGLWKIVVVKNLP  DM
Sbjct: 455  LRTPAGKMVTRLSRKNVCFVMFVDEVTMQTLFSEGQSPDG-GFIGLWKIVVVKNLPYADM 513

Query: 1126 RRVGKIPKLLSHRLFSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCV 947
            RRVGKIPKLL HRLF SARYSIWLDSKLRLQ DP  +L+YFLWRKGHEYAISNHYDRHCV
Sbjct: 514  RRVGKIPKLLPHRLFPSARYSIWLDSKLRLQRDPLQLLDYFLWRKGHEYAISNHYDRHCV 573

Query: 946  WEEVAQNKRLNKYNHTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMS 767
            WEEVAQNK+LNKYNHTVIDEQFEFYQADGLKKFN+SD +KLLPSNVPEGSFIVRAHTPMS
Sbjct: 574  WEEVAQNKKLNKYNHTVIDEQFEFYQADGLKKFNSSDPNKLLPSNVPEGSFIVRAHTPMS 633

Query: 766  NLFSCLWFNEVDRFTPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSE 587
            NLFSCLWFNEV+RFTPRDQLSFAYTY KLRR NP+KPFYLNMFKDCERR +AKLFRHRSE
Sbjct: 634  NLFSCLWFNEVERFTPRDQLSFAYTYQKLRRMNPDKPFYLNMFKDCERRAIAKLFRHRSE 693

Query: 586  EKKNI 572
            EK+N+
Sbjct: 694  EKRNV 698


>ref|XP_006589455.1| PREDICTED: uncharacterized protein LOC100810524 [Glycine max]
          Length = 711

 Score =  679 bits (1753), Expect = 0.0
 Identities = 366/649 (56%), Positives = 433/649 (66%), Gaps = 2/649 (0%)
 Frame = -3

Query: 2518 IGALIVFLVVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTEKNKVVKF 2339
            + A++VFL++  + ++  F Y+S  +  I++N           DFL NV R ++ KV+ F
Sbjct: 68   VAAILVFLLLVLVVTLLVFSYISRDE--ISNNGDDSDDLKSDSDFLTNVPRIQRKKVLDF 125

Query: 2338 GHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKFSDKKSLTAKSR 2159
            GHGS  HG                            S D +        S K     KS 
Sbjct: 126  GHGSGGHGRDSRYWDRDDRRRDGDYDEDMMEQT---SKDPEDENAEDDASVKTDHDTKSS 182

Query: 2158 KD-LDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQS-KGHGVVEQQSHDAGTGNENKMV 1985
            +D L  RG+GLYNEAGR+ELK YEAEYEASLK +G S +  G V    HD     +N   
Sbjct: 183  QDGLKRRGDGLYNEAGRHELKRYEAEYEASLKNLGHSTEDDGKV---LHDTDLEKKNAAD 239

Query: 1984 DSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQKSPEE 1805
            D DDEYDD  D                  HS  +S+  LD +       + +  +++ ++
Sbjct: 240  DIDDEYDDFFDFHDAQMEDSGDSKNMRAKHS-NSSVLSLDNEVQKQKSSNDSFDEENDDD 298

Query: 1804 IDEYSDDLPXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXXXXXXXXXXXXXSCE 1625
            +     +               + KHAN  N QS                       SC+
Sbjct: 299  VTSEDVEEASSLNKKNSHDGKTNSKHANHSNGQS----IRKSHPETKKKAKRHKFSGSCD 354

Query: 1624 MKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESFIARD 1445
            MKLLNS++ LVEPLESRKF+RF+LQY           +WVPRFAGHQ+L+ERE SF+ARD
Sbjct: 355  MKLLNSTSQLVEPLESRKFSRFNLQYTETEEKPQGDEQWVPRFAGHQSLEERESSFLARD 414

Query: 1444 QKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVSRLSR 1265
            Q+INCGFVKGP+G+ STGFDL EDDA YIS CHIAV+SCIFGNSDRLR P  K V+RLSR
Sbjct: 415  QQINCGFVKGPEGFQSTGFDLTEDDANYISRCHIAVISCIFGNSDRLRTPTTKTVTRLSR 474

Query: 1264 KNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLLSHRL 1085
            KNVCFVMF DEVT+RTLSSEGH+ DRMGFIG WK+VVVKNLP DDMRRVGKIPKLL HRL
Sbjct: 475  KNVCFVMFTDEVTIRTLSSEGHVPDRMGFIGFWKLVVVKNLPYDDMRRVGKIPKLLPHRL 534

Query: 1084 FSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRLNKYN 905
            F  ARYSIWLDSKLRLQLDP LILEYFLWRKG+E+AISNHYDRHCVWEEVAQNK+LNKYN
Sbjct: 535  FPFARYSIWLDSKLRLQLDPLLILEYFLWRKGYEFAISNHYDRHCVWEEVAQNKKLNKYN 594

Query: 904  HTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRF 725
            HTVIDEQF FY+ADGL++F+ASD +KLLPSNVPEGSFI+RAHTPMSNLFSCLWFNEVDRF
Sbjct: 595  HTVIDEQFAFYRADGLERFDASDPNKLLPSNVPEGSFIIRAHTPMSNLFSCLWFNEVDRF 654

Query: 724  TPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKK 578
            TPRDQLSFA+TY KLRR NP+KPF+LNMFKDCERR +AKLF HR +EK+
Sbjct: 655  TPRDQLSFAHTYQKLRRMNPDKPFHLNMFKDCERRHIAKLFHHRLDEKR 703


>ref|XP_004158544.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101218369 [Cucumis
            sativus]
          Length = 713

 Score =  677 bits (1746), Expect = 0.0
 Identities = 371/691 (53%), Positives = 453/691 (65%), Gaps = 29/691 (4%)
 Frame = -3

Query: 2539 KGRKISIIGALIVFLVVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTE 2360
            KGR IS+ GA++  L +  + +V A+ YL    K+I+++  +        DFL NVTRTE
Sbjct: 53   KGRGISV-GAIVFVLSLVLVVTVLAYYYLLRDTKEISNSNVEDDALKNDPDFLANVTRTE 111

Query: 2359 KNKVVKFGHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKFSD-- 2186
              KV +FG+G VKHG                              D D    +  +++  
Sbjct: 112  TTKV-RFGNGLVKHGRDSRYW------------------------DGDDRRRDQDYNEDD 146

Query: 2185 -KKSLTAKSRKDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQ-SHDA 2012
             ++S   +S+  LD +  GLYNEAGR EL+ YEAEYEAS+KT GQ +  G  + Q S + 
Sbjct: 147  QRESSLEQSQNSLDRKDTGLYNEAGRKELRKYEAEYEASVKTSGQLEKEGNEDNQVSDED 206

Query: 2011 GTGNENKMVDSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSL--------------- 1877
             + N N  +D+DDEY++G D +               DHS +TSL               
Sbjct: 207  DSENWNDTIDTDDEYENGSDSKNHAMEEDDDTEREKGDHSDSTSLTEEDSGKSVNFVENE 266

Query: 1876 ---HDLDGKDSSVSHKSITSHQKSPEEID--EYSDDLPXXXXXXXXXXXXXDLKHANSIN 1712
               +D +GK  +V     T +Q+  E ++   +S D               + KH +  N
Sbjct: 267  NPHNDDNGKSLNVDDGE-TKYQQEDENVETSNHSLDEDYTSSSQHVDKANQNSKHVSVTN 325

Query: 1711 SQSTXXXXXXXXXXXXXXXXXXXXXXSCEMKLLNSSALLVEPLESRKFARFSLQYXXXXX 1532
            SQ T                       CEMK LNS+A ++EP+E+ KF RF+LQY     
Sbjct: 326  SQHTKRSKLDPRKKPKHRKFSGSS---CEMKFLNSTAQILEPIENXKFVRFTLQYTDTEQ 382

Query: 1531 XXXXXXEWVPRFAGHQTLKEREESFIARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISS 1352
                  +W+PRFAGHQTL+ERE SF A+DQKINCGFVKGPK ++STGFDL EDD+ Y+S 
Sbjct: 383  DPSNQEKWMPRFAGHQTLQERETSFYAQDQKINCGFVKGPKTFSSTGFDLTEDDSNYVSR 442

Query: 1351 CHIAVVSCIFGNSDRLRIPVGKMVSRLS-----RKNVCFVMFVDEVTLRTLSSEGHMLDR 1187
            CHIAVVSCIFGNSD LR P GK  + +S     +KNVCFVMF+DEVTL TLSSEG  +DR
Sbjct: 443  CHIAVVSCIFGNSDHLRSPTGKTFAFVSGYSFLKKNVCFVMFMDEVTLETLSSEGQTVDR 502

Query: 1186 MGFIGLWKIVVVKNLPSDDMRRVGKIPKLLSHRLFSSARYSIWLDSKLRLQLDPYLILEY 1007
            MGFIGLWKIVVVKNLP  DMRRVGKIPKLL HR+F SARYSIWLDSKLRLQ DP LILEY
Sbjct: 503  MGFIGLWKIVVVKNLPYTDMRRVGKIPKLLPHRIFPSARYSIWLDSKLRLQYDPLLILEY 562

Query: 1006 FLWRKGHEYAISNHYDRHCVWEEVAQNKRLNKYNHTVIDEQFEFYQADGLKKFNASDLHK 827
            FLWRKG+E+AISNHYDRHCVWEEVAQNKRLNKYNHT+ID+QF FYQADGLK+FNASD++K
Sbjct: 563  FLWRKGYEFAISNHYDRHCVWEEVAQNKRLNKYNHTIIDQQFSFYQADGLKRFNASDVNK 622

Query: 826  LLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLSFAYTYYKLRRTNPEKPFYL 647
            LLPSNVPEGSFI+RAHTPMSNLFSCLWFNEVD+FTPRDQLSFAYTY K++R NP KPFYL
Sbjct: 623  LLPSNVPEGSFIIRAHTPMSNLFSCLWFNEVDKFTPRDQLSFAYTYXKIKRMNPGKPFYL 682

Query: 646  NMFKDCERRKMAKLFRHRSEEKKNILRREIE 554
            NMFKDCERRK+AKLFRHRS+EK+ + +  +E
Sbjct: 683  NMFKDCERRKIAKLFRHRSDEKRIVHKNAME 713


>ref|XP_002327334.1| predicted protein [Populus trichocarpa]
            gi|566200769|ref|XP_006376300.1| hypothetical protein
            POPTR_0013s11800g [Populus trichocarpa]
            gi|550325576|gb|ERP54097.1| hypothetical protein
            POPTR_0013s11800g [Populus trichocarpa]
          Length = 678

 Score =  671 bits (1732), Expect = 0.0
 Identities = 367/659 (55%), Positives = 435/659 (66%), Gaps = 3/659 (0%)
 Frame = -3

Query: 2527 ISIIGALIVFLVVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTEKNKV 2348
            +S++GA+IVFL +  I +V A+ +LS+ ++++N N R         DFL NVTRT+  KV
Sbjct: 51   LSLVGAVIVFLCLVLIVTVLAYNFLSTDNRNVNDN-RVEDDEIKNNDFLANVTRTDSIKV 109

Query: 2347 VKFGHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKFSDKKSLTA 2168
            + FG GSV HG                              ++D +  + K S     + 
Sbjct: 110  LGFGQGSVGHGRDSRYWDRDDRRRDE-------------DYNEDDVDNDSKLSGDDESSE 156

Query: 2167 KSRKDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQSHDAGTGNENKM 1988
            K    +  +G GLYNE GR ELK+YE EYEASLK  G+                  EN++
Sbjct: 157  KGHNSV--KGAGLYNEDGRKELKIYEKEYEASLKNTGKLT---------------KENEI 199

Query: 1987 VDSDDEYDDGIDLQXXXXXXXXXXXXXXXD-HSIATSLHDLDGKDSSVSHKSITSHQKSP 1811
             + ++EYDDGID                 +  S  T++H  D + SS    + T  Q   
Sbjct: 200  KNLENEYDDGIDSHDRHMEEYGGDSEPNKEDRSSETTVHIEDNRASSNFLDAETKDQNIA 259

Query: 1810 EEIDEYSDDL--PXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXXXXXXXXXXXX 1637
            ++  E S  L                D ++ ++I   ST                     
Sbjct: 260  KDNLEDSMSLLEKGSLNSQNLDDGDTDSRNVHNIGGHSTSKSRSDSKKKSKRRKFSGSS- 318

Query: 1636 XSCEMKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESF 1457
              C MKLLNS+  LVEP ESRKFARFSLQY           +W PRFAGHQ+L EREESF
Sbjct: 319  --CGMKLLNSTTRLVEPFESRKFARFSLQYTEIEEKPDGQEQWEPRFAGHQSLHEREESF 376

Query: 1456 IARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVS 1277
            +A DQKINCGFVKG +G +STGFDLAEDDA YIS CHIAV+SCIFGNSDRLR P  KMV+
Sbjct: 377  LAHDQKINCGFVKGSEGSSSTGFDLAEDDASYISRCHIAVISCIFGNSDRLRSPADKMVT 436

Query: 1276 RLSRKNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLL 1097
            RLSRKNVCFVMF+DEV+ +TL+SEGH+ DR GF+GLWKIVVVKNLP +DMRRVGK+PKLL
Sbjct: 437  RLSRKNVCFVMFMDEVSFQTLTSEGHIPDRAGFVGLWKIVVVKNLPYNDMRRVGKVPKLL 496

Query: 1096 SHRLFSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRL 917
             HRLF SARYSIWLDSKLRLQ+DP L+LEYFLWRKGHEYAISNHYDRHCVWEEV QNK+L
Sbjct: 497  PHRLFPSARYSIWLDSKLRLQVDPLLVLEYFLWRKGHEYAISNHYDRHCVWEEVVQNKKL 556

Query: 916  NKYNHTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNE 737
            NKYNHTVID+QF FYQADGLK+FN SD +KLLPSNVPEGS IVRAHTPMSNLFSCLWFNE
Sbjct: 557  NKYNHTVIDQQFAFYQADGLKRFNVSDPNKLLPSNVPEGSLIVRAHTPMSNLFSCLWFNE 616

Query: 736  VDRFTPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKKNILRRE 560
            VDRFTPRDQLSFA+TY KLRR NP KPFYLNMFKDCERR +AKLFRHRS+EK++ L +E
Sbjct: 617  VDRFTPRDQLSFAFTYQKLRRMNPGKPFYLNMFKDCERRAIAKLFRHRSDEKRSTLHQE 675


>ref|XP_004496362.1| PREDICTED: uncharacterized protein LOC101489831 [Cicer arietinum]
          Length = 704

 Score =  663 bits (1711), Expect = 0.0
 Identities = 377/707 (53%), Positives = 444/707 (62%), Gaps = 9/707 (1%)
 Frame = -3

Query: 2653 GVVVPDHNHH-------QVSIGIRSGAXXXXXXXXXXXXXXXRLVKGRKISIIGALIVFL 2495
            GV V  H+ H        V++GIR G                   +G   S++ A++VFL
Sbjct: 13   GVRVGSHDLHLGNGSGDHVAVGIRGGVAHKQQRLRRSGRSD----RGAHFSVV-AILVFL 67

Query: 2494 VVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTEKNKVVKFGHGSVKHG 2315
             +  + +  AF Y+S  +  I++N           DFL NV R EK KV+ FGH S  HG
Sbjct: 68   FLVLVVTFLAFSYISRDE--ISNNGDDTDDIKNDSDFLTNVPRIEK-KVLDFGHASGGHG 124

Query: 2314 XXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDK--DSIAVNPKFSDKKSLTAKSRKDLDHR 2141
                                      D    K  D   V    + K S    S+  L  +
Sbjct: 125  RDSRYWDKDDRRRDDGYDEDKEEISRDNGDAKLKDDAPVKTNHNVKSSQEG-SQTGLKRK 183

Query: 2140 GNGLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQSHDAGTGNENKMVDSDDEYDD 1961
            G GLYNEAGR+ELK YEAEYEASLK +G S    V  + SH+     +N + D DDEYDD
Sbjct: 184  GVGLYNEAGRHELKRYEAEYEASLKNVGHSAE--VDGKLSHETVMEKKNVVDDIDDEYDD 241

Query: 1960 GIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQKSPEEIDEYSDDL 1781
             ID                  HS   S   LD +    SH   +S     ++I     D 
Sbjct: 242  FIDSHDAQMEDSADSGNMRGKHSNFNS-QKLDNEVHKESHDD-SSDVGIDDDIASEDTDG 299

Query: 1780 PXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXXXXXXXXXXXXXSCEMKLLNSSA 1601
                          + K AN I+ Q++                       CEMKLLNS++
Sbjct: 300  ESSVSQKSSRGGKANSKPANVISEQTSRKSHPETKRKGRRHKYSGS----CEMKLLNSTS 355

Query: 1600 LLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESFIARDQKINCGFV 1421
             LVEPLESRKFARF+LQY           +WVPRFAGHQ+L+ERE SF+ARDQ + CGFV
Sbjct: 356  QLVEPLESRKFARFNLQYVETEEKPSGVEQWVPRFAGHQSLEERENSFLARDQNLKCGFV 415

Query: 1420 KGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVSRLSRKNVCFVMF 1241
            KGP+G  STGFD++EDD  YIS CHIAV+SCIFGNSDRLR P  K ++RLSRKNVCFVMF
Sbjct: 416  KGPEGSPSTGFDISEDDESYISRCHIAVISCIFGNSDRLRTPATKTITRLSRKNVCFVMF 475

Query: 1240 VDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLLSHRLFSSARYSI 1061
             DEVT++TL+SEGH  DRMGFIG WK+VVVKNLP DDMRRVGKIPKLL+HRLF  ARYSI
Sbjct: 476  TDEVTVQTLTSEGHAPDRMGFIGFWKLVVVKNLPYDDMRRVGKIPKLLAHRLFPFARYSI 535

Query: 1060 WLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRLNKYNHTVIDEQF 881
            WLDSKLRLQLDP LILEYFLWRKG+E+AISNHYDRHCVWEEV QNK+LNKYNHTVID+QF
Sbjct: 536  WLDSKLRLQLDPLLILEYFLWRKGYEFAISNHYDRHCVWEEVVQNKKLNKYNHTVIDQQF 595

Query: 880  EFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLSF 701
             FY+ADGL++FNASD +KLL SNVPEGSFI+RAHTPMSNLF+CLWFNEVDRFTPRDQLSF
Sbjct: 596  AFYRADGLERFNASDPNKLLSSNVPEGSFIIRAHTPMSNLFNCLWFNEVDRFTPRDQLSF 655

Query: 700  AYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKKNILRRE 560
            AYTY KLRR NPEKPF+LNMFKDCERR MAKLF HR +EK+   R++
Sbjct: 656  AYTYQKLRRMNPEKPFHLNMFKDCERRHMAKLFHHRMDEKRTATRQK 702


>ref|XP_006396046.1| hypothetical protein EUTSA_v10006981mg [Eutrema salsugineum]
            gi|557092750|gb|ESQ33332.1| hypothetical protein
            EUTSA_v10006981mg [Eutrema salsugineum]
          Length = 679

 Score =  652 bits (1683), Expect = 0.0
 Identities = 365/706 (51%), Positives = 436/706 (61%), Gaps = 5/706 (0%)
 Frame = -3

Query: 2656 NGVVVPDHNHHQVSIGIRSGAXXXXXXXXXXXXXXXRLVKGRKISI-IGALIVFLVVAFI 2480
            NG      +H  ++IGIR+G                 +   R   + IG+++  L +  +
Sbjct: 25   NGAASSSSDH--IAIGIRNGVGGGAPQQGKANRWRRSVRPDRIRRLGIGSVVFVLCLVLV 82

Query: 2479 ASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTEKNKVVKFGHGSVKHGXXXXX 2300
             +V A+ Y+S      N+ Y          DFL NVTR +  KV++FGHGSV HG     
Sbjct: 83   VTVLAYYYISGF---ANNGYDDKGFDSYEGDFLTNVTRIDPAKVLEFGHGSVVHGRDSIY 139

Query: 2299 XXXXXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKFSDKKSLTAKSRKD----LDHRGNG 2132
                                     D +   V  K+ D     A+ +KD    LD +G G
Sbjct: 140  WDKDDRRRDD---------------DYNEDEVEHKYVDVDRSVAEVKKDPVKGLDLKGIG 184

Query: 2131 LYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQSHDAGTGNENKMVDSDDEYDDGID 1952
            LYNE GRNELK YEAEY+ASL   G+S              +G +++ VD D + DD ID
Sbjct: 185  LYNEDGRNELKKYEAEYQASLVKGGESLKK-----------SGGDHEAVDMDPDEDDAID 233

Query: 1951 LQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQKSPEEIDEYSDDLPXX 1772
                                   S HD D  +     K         ++ +   DD+   
Sbjct: 234  SHDSQGD------------EYVDSGHDEDENEEPHKEKDTEVLPSMTKQQNSEKDDVAAS 281

Query: 1771 XXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXXXXXXXXXXXXXSCEMKLLNSSALLV 1592
                             S+ S+                        SCEMKL+NSS  +V
Sbjct: 282  KRSLGDI----------SVVSKGGKSGKTSRSDTKRRGRGRRSSGASCEMKLMNSSHQIV 331

Query: 1591 EPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKEREESFIARDQKINCGFVKGP 1412
            EPL +RK ARFSLQY           +W PRFAGHQ+L+ERE+SF+ +D+KI+CGFVK  
Sbjct: 332  EPLNTRKSARFSLQYIETEDKPDGEEQWEPRFAGHQSLQEREDSFLVQDKKIHCGFVKAL 391

Query: 1411 KGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGKMVSRLSRKNVCFVMFVDE 1232
            KG  STGFDL EDD  YIS CHIAV+SCIFGNSDRLR P  KM+SRLSRKNVCF++FVDE
Sbjct: 392  KGSPSTGFDLTEDDTNYISRCHIAVISCIFGNSDRLRPPANKMISRLSRKNVCFIVFVDE 451

Query: 1231 VTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIPKLLSHRLFSSARYSIWLD 1052
            +T++TLS+EGH  DR GFIGLWK+VVVKNLP  DMRRVGKIPKLL HRLF SARYSIWLD
Sbjct: 452  ITMQTLSAEGHAPDRAGFIGLWKLVVVKNLPYADMRRVGKIPKLLPHRLFPSARYSIWLD 511

Query: 1051 SKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQNKRLNKYNHTVIDEQFEFY 872
            SKLRLQLDP LILEYFLWRKGHEYAISNHYDRHC+WEEVAQNK+LNKYNHTVID+QFEFY
Sbjct: 512  SKLRLQLDPLLILEYFLWRKGHEYAISNHYDRHCLWEEVAQNKKLNKYNHTVIDQQFEFY 571

Query: 871  QADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVDRFTPRDQLSFAYT 692
            +ADGL +FNASD  KLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEV+RFTPRDQLSFAYT
Sbjct: 572  KADGLTRFNASDPFKLLPSNVPEGSFIVRAHTPMSNLFSCLWFNEVERFTPRDQLSFAYT 631

Query: 691  YYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKKNILRREIE 554
            Y KLRR NPEKPF L+MFKDCERRK+AKLFRHRSEEK+N+++  ++
Sbjct: 632  YQKLRRMNPEKPFNLHMFKDCERRKIAKLFRHRSEEKRNLIQAALQ 677


>gb|EOY17189.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 597

 Score =  651 bits (1680), Expect = 0.0
 Identities = 352/602 (58%), Positives = 400/602 (66%), Gaps = 5/602 (0%)
 Frame = -3

Query: 2362 EKNKVVKFGHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKD----SIAVNPK 2195
            + +KV+ FG  S+ HG                            S D+      + V  K
Sbjct: 2    DSSKVLSFGRSSIAHGRDSRYWDRDDRRRDDDYNEDVVEHNIMDSSDESLDGGHVPVKVK 61

Query: 2194 FSDKKSLTAKSRKDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQSHD 2015
             ++KK  +    KDLD R  GLYNEAGRNELK YE EYE SLK  G+ +      ++  D
Sbjct: 62   -NEKKEASLDPNKDLDRRAVGLYNEAGRNELKRYEKEYELSLKDGGKLQKELENSRRLSD 120

Query: 2014 AGTGNENKMVDSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKS 1835
            +     +  VD+DD Y+DG D                  H    ++ +    D  V   S
Sbjct: 121  SKDFGLHDEVDADDHYNDGFDSSDSQTEDYDDFG-----HDKEDNVDEAKSHDEHVKEFS 175

Query: 1834 ITSHQKSPEEIDEYSDD-LPXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXXXXX 1658
              S  K    + E  ++ +              + +H  S+  +                
Sbjct: 176  TFSKTKERHVVKEGKEESMLSREASGDFGDVDANSQHVGSLGRKGAKSSRADSKRKPRRR 235

Query: 1657 XXXXXXXXSCEMKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTL 1478
                     CEMKLLNS+ L VEPLESRKFARFSLQY           +WVP FAGHQ+L
Sbjct: 236  KFSGS----CEMKLLNSTHL-VEPLESRKFARFSLQYKQMEENSEGEEQWVPTFAGHQSL 290

Query: 1477 KEREESFIARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRI 1298
            +EREESF+A DQKINCGFVKGP+GY STGFDLAEDD  YIS CHIAV+SCIFGNSDRLR 
Sbjct: 291  QEREESFLAHDQKINCGFVKGPQGYPSTGFDLAEDDVNYISRCHIAVISCIFGNSDRLRT 350

Query: 1297 PVGKMVSRLSRKNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRV 1118
            P GKMV+RLSRKNVCFVMFVDEVT++TL SEG   D  GFIGLWKIVVVKNLP  DMRRV
Sbjct: 351  PAGKMVTRLSRKNVCFVMFVDEVTMQTLFSEGQSPDG-GFIGLWKIVVVKNLPYADMRRV 409

Query: 1117 GKIPKLLSHRLFSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEE 938
            GKIPKLL HRLF SARYSIWLDSKLRLQ DP  +L+YFLWRKGHEYAISNHYDRHCVWEE
Sbjct: 410  GKIPKLLPHRLFPSARYSIWLDSKLRLQRDPLQLLDYFLWRKGHEYAISNHYDRHCVWEE 469

Query: 937  VAQNKRLNKYNHTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLF 758
            VAQNK+LNKYNHTVIDEQFEFYQADGLKKFN+SD +KLLPSNVPEGSFIVRAHTPMSNLF
Sbjct: 470  VAQNKKLNKYNHTVIDEQFEFYQADGLKKFNSSDPNKLLPSNVPEGSFIVRAHTPMSNLF 529

Query: 757  SCLWFNEVDRFTPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKK 578
            SCLWFNEV+RFTPRDQLSFAYTY KLRR NP+KPFYLNMFKDCERR +AKLFRHRSEEK+
Sbjct: 530  SCLWFNEVERFTPRDQLSFAYTYQKLRRMNPDKPFYLNMFKDCERRAIAKLFRHRSEEKR 589

Query: 577  NI 572
            N+
Sbjct: 590  NV 591


>ref|XP_006851237.1| hypothetical protein AMTR_s00180p00023300 [Amborella trichopoda]
            gi|548854920|gb|ERN12818.1| hypothetical protein
            AMTR_s00180p00023300 [Amborella trichopoda]
          Length = 659

 Score =  624 bits (1608), Expect = e-175
 Identities = 347/658 (52%), Positives = 409/658 (62%), Gaps = 2/658 (0%)
 Frame = -3

Query: 2539 KGRKISIIGALIVFLVVAFIASVAAFLYLSSKDKDINSNYRKXXXXXXXXDFLRNVTRTE 2360
            KG+K+SI G + +F VV  +  +A F Y+S    +   +Y++            +   T 
Sbjct: 29   KGKKLSIGGVIAIFTVVLLLTFLA-FGYVSRGKNEKMKDYKED-----------SSNGTS 76

Query: 2359 KNKVVKFGHGSVKHGXXXXXXXXXXXXXXXXXXXXXXXXXXDGSVDKDSIAVNPKFSDKK 2180
             +KV+KFGHG V                               + D ++     K S   
Sbjct: 77   GSKVLKFGHGFVSS--LERDSRDWDRDDRRRDDAYNEDTADLSNADANNNGEGKKSSKDM 134

Query: 2179 SLTAKSRKDLDHRGNGLYNEAGRNELKMYEAEYEASLKTIGQSKGHGVVEQQSHDAGTGN 2000
               +  ++ +  +  GLYNEAGR+EL  Y AEYEASL        HG         G GN
Sbjct: 135  HENSIEKQQMKPKSMGLYNEAGRHELDQYRAEYEASL--------HG-----GESNGVGN 181

Query: 1999 ENKMVDSDDEYDDGIDLQXXXXXXXXXXXXXXXDHSIATSLHDLDGKDSSVSHKSITSHQ 1820
             ++  D  DEYDDG D Q               DH  +++    D + S      +   +
Sbjct: 182  LSEEEDLIDEYDDGFDAQDTHHEDTDVKDRREDDHPSSSNTISGDRQGSD----GMNGGE 237

Query: 1819 KSPEEIDEY--SDDLPXXXXXXXXXXXXXDLKHANSINSQSTXXXXXXXXXXXXXXXXXX 1646
             S  E+D++  +D                 +      +SQ T                  
Sbjct: 238  GSNPELDKHLENDSSIVGEGFSAGKMEGDHVGFVEGGSSQMTGSVKKSFSKKKGKRHKFA 297

Query: 1645 XXXXSCEMKLLNSSALLVEPLESRKFARFSLQYXXXXXXXXXXXEWVPRFAGHQTLKERE 1466
                 C+MKLLNS+A LVEPL+S+KF+RFSLQY           +W PRFAGHQTL+ERE
Sbjct: 298  GAS--CDMKLLNSTAQLVEPLQSKKFSRFSLQYTEVEERPSGVEQWEPRFAGHQTLRERE 355

Query: 1465 ESFIARDQKINCGFVKGPKGYTSTGFDLAEDDAKYISSCHIAVVSCIFGNSDRLRIPVGK 1286
            E+F ARDQ+INCGFVK PKGY STGFDLAEDD KY+  CHIAV SCIFGNSD LR P GK
Sbjct: 356  ETFYARDQRINCGFVKAPKGYPSTGFDLAEDDMKYMQRCHIAVSSCIFGNSDNLRTPTGK 415

Query: 1285 MVSRLSRKNVCFVMFVDEVTLRTLSSEGHMLDRMGFIGLWKIVVVKNLPSDDMRRVGKIP 1106
            MVSRLSRKNVCFVMFVDE TL+TLSSEG   D MG+IGLWKIVVV+NLP  DMRRVGKIP
Sbjct: 416  MVSRLSRKNVCFVMFVDENTLQTLSSEGQKPDTMGYIGLWKIVVVRNLPYTDMRRVGKIP 475

Query: 1105 KLLSHRLFSSARYSIWLDSKLRLQLDPYLILEYFLWRKGHEYAISNHYDRHCVWEEVAQN 926
            K L+HRLF +ARYSIWLDSKLRLQ DP LILEYFLWR  +EYAISNHYDRHCVWEEV QN
Sbjct: 476  KFLTHRLFPAARYSIWLDSKLRLQRDPLLILEYFLWRHNYEYAISNHYDRHCVWEEVLQN 535

Query: 925  KRLNKYNHTVIDEQFEFYQADGLKKFNASDLHKLLPSNVPEGSFIVRAHTPMSNLFSCLW 746
            K+LNK+NHT+ID+QF FYQ DGLK+FNASD  K LPS VPEGSFI+RAHTPMSNLFSCLW
Sbjct: 536  KKLNKFNHTIIDQQFAFYQHDGLKRFNASDPDKFLPSYVPEGSFIIRAHTPMSNLFSCLW 595

Query: 745  FNEVDRFTPRDQLSFAYTYYKLRRTNPEKPFYLNMFKDCERRKMAKLFRHRSEEKKNI 572
            FNEVDRFTPRDQLSFAYT  KLRR NP KPFY NMFKDCERR +AKLF HRSEEK+N+
Sbjct: 596  FNEVDRFTPRDQLSFAYTSLKLRRMNPGKPFYFNMFKDCERRSIAKLFHHRSEEKRNL 653

