BLASTX nr result

ID: Cocculus23_contig00009652 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00009652
         (1804 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI39364.3| unnamed protein product [Vitis vinifera]              692   0.0  
ref|XP_007049004.1| S-adenosyl-L-methionine-dependent methyltran...   676   0.0  
ref|XP_006429750.1| hypothetical protein CICLE_v10011552mg [Citr...   674   0.0  
ref|XP_004149525.1| PREDICTED: UPF0586 protein C9orf41 homolog [...   665   0.0  
ref|XP_002307920.2| hypothetical protein POPTR_0006s02420g [Popu...   658   0.0  
ref|XP_006429749.1| hypothetical protein CICLE_v10011552mg [Citr...   650   0.0  
ref|XP_007145965.1| hypothetical protein PHAVU_006G001800g [Phas...   649   0.0  
ref|XP_006605686.1| PREDICTED: UPF0586 protein C9orf41 homolog i...   643   0.0  
ref|XP_007215348.1| hypothetical protein PRUPE_ppa005412mg [Prun...   640   0.0  
ref|XP_006605685.1| PREDICTED: UPF0586 protein C9orf41 homolog i...   635   e-179
ref|XP_006345754.1| PREDICTED: UPF0586 protein C9orf41-like [Sol...   635   e-179
ref|XP_006593845.1| PREDICTED: UPF0586 protein C9orf41 homolog i...   632   e-178
ref|XP_004305888.1| PREDICTED: UPF0586 protein C9orf41 homolog [...   630   e-178
ref|XP_007145966.1| hypothetical protein PHAVU_006G001800g [Phas...   626   e-176
ref|XP_004502075.1| PREDICTED: LOW QUALITY PROTEIN: UPF0586 prot...   625   e-176
ref|NP_850185.1| S-adenosyl-L-methionine-dependent methyltransfe...   624   e-176
ref|XP_002879368.1| hypothetical protein ARALYDRAFT_902264 [Arab...   622   e-175
ref|XP_004239631.1| PREDICTED: UPF0586 protein C9orf41-like [Sol...   617   e-174
ref|XP_007049003.1| S-adenosyl-L-methionine-dependent methyltran...   611   e-172
ref|XP_006853312.1| hypothetical protein AMTR_s00032p00047690 [A...   610   e-172

>emb|CBI39364.3| unnamed protein product [Vitis vinifera]
          Length = 498

 Score =  692 bits (1787), Expect = 0.0
 Identities = 345/477 (72%), Positives = 387/477 (81%), Gaps = 3/477 (0%)
 Frame = -3

Query: 1763 KLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKFQRLRRC 1584
            ++E ALEVKSLRRIISAYLNY DAAE+DV+RYE SF +LPPAHK LLSH PSKFQRLRRC
Sbjct: 19   EIEEALEVKSLRRIISAYLNYPDAAEDDVRRYERSFRRLPPAHKALLSHYPSKFQRLRRC 78

Query: 1583 ITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHI-SGERNASSAQSASTSGRA 1407
            I+VNSFFIF+MLQAFEPPLDMSQDTD+C           H+ SGERN    ++ASTSGR 
Sbjct: 79   ISVNSFFIFNMLQAFEPPLDMSQDTDMCENPHLENALDDHLDSGERNICPCEAASTSGRI 138

Query: 1406 CCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCAKQSESPNISDV 1227
               ++D   YG  D +   SPE     +       +   G  +      ++++    SDV
Sbjct: 139  SFPQSDQASYGKSD-ITCKSPEGVNNKELGTESCCESGPGICNAYPGNNRETDQAGSSDV 197

Query: 1226 QVVENHASTIS--DSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAQEGQR 1053
            ++  + A+  S  DSNGNV+ S H+WLDPS QLNVPLVDVDKVRCIIRNIVRDWA EGQ+
Sbjct: 198  KINNDEATPYSFADSNGNVSSSTHEWLDPSFQLNVPLVDVDKVRCIIRNIVRDWAAEGQK 257

Query: 1052 EREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQGNEFSYYMMI 873
            ER+QCYKPILEELD +FP+RS DRPP CLVPGAGLGRLALEISCLGF+SQGNEFSYYMMI
Sbjct: 258  ERDQCYKPILEELDGLFPNRSKDRPPSCLVPGAGLGRLALEISCLGFISQGNEFSYYMMI 317

Query: 872  CSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGITEGFSMCGGDF 693
            CSSFILN+AQ   EWTIYPWIHSNCNSLS+ DQLRPVSIPD+HPASAGITEGFSMCGGDF
Sbjct: 318  CSSFILNNAQTAEEWTIYPWIHSNCNSLSENDQLRPVSIPDMHPASAGITEGFSMCGGDF 377

Query: 692  VEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLGPLLYHFADAYG 513
            VEVY DPSQ G WDAVVTCFF+DTAHNIVEYIEIIS IL+DGGVWIN GPLLYHFAD YG
Sbjct: 378  VEVYSDPSQIGVWDAVVTCFFIDTAHNIVEYIEIISRILKDGGVWINFGPLLYHFADMYG 437

Query: 512  TEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYAAFWTMTKK 342
             EDEMSIELSLEDVK+VA HYGF +E+E+TIETTYTTNPRSMMQNRY+AAFWTM KK
Sbjct: 438  QEDEMSIELSLEDVKKVALHYGFQMEKERTIETTYTTNPRSMMQNRYFAAFWTMRKK 494


>ref|XP_007049004.1| S-adenosyl-L-methionine-dependent methyltransferases superfamily
            protein isoform 3 [Theobroma cacao]
            gi|508701265|gb|EOX93161.1|
            S-adenosyl-L-methionine-dependent methyltransferases
            superfamily protein isoform 3 [Theobroma cacao]
          Length = 485

 Score =  676 bits (1745), Expect = 0.0
 Identities = 338/481 (70%), Positives = 374/481 (77%), Gaps = 1/481 (0%)
 Frame = -3

Query: 1781 EHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKF 1602
            E     KLE  LEVKSLRRIISAYLNY +AAEEDV+R+E SF KL PAHK LLSH P KF
Sbjct: 10   ERRRQRKLEEVLEVKSLRRIISAYLNYPEAAEEDVRRFERSFKKLSPAHKALLSHYPLKF 69

Query: 1601 QRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHI-SGERNASSAQSA 1425
            QRLRRCI+VNS+FIF+MLQ+FEPPLDMSQD DIC           H  S ERNA   QSA
Sbjct: 70   QRLRRCISVNSYFIFNMLQSFEPPLDMSQDVDICEDPHLENFQHEHCHSEERNACFCQSA 129

Query: 1424 STSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCAKQSES 1245
            STSGR CCS           N+      E    +       + ++GS      CA +  +
Sbjct: 130  STSGRMCCSNLAQACSQERSNIISNPTAETTHEEVQSGHQHETISGS------CAGEVGN 183

Query: 1244 PNISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAQ 1065
                D ++ E   + ++DSNGNV  S HDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWA 
Sbjct: 184  ----DKEIAECCGNDVTDSNGNVFSSPHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAA 239

Query: 1064 EGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQGNEFSY 885
            EG++ER+QCYKPILEELD +FP+RS + PP CLVPGAGLGRLALEISCLGF+SQGNEFSY
Sbjct: 240  EGEKERDQCYKPILEELDALFPNRSKESPPACLVPGAGLGRLALEISCLGFISQGNEFSY 299

Query: 884  YMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGITEGFSMC 705
            YMM+CSSFILNH Q  GEWTIYPWIHSNCNSLSD DQLRPVSIPDIHPASAGITEGFSMC
Sbjct: 300  YMMLCSSFILNHTQTTGEWTIYPWIHSNCNSLSDNDQLRPVSIPDIHPASAGITEGFSMC 359

Query: 704  GGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLGPLLYHFA 525
            GGDFVEVY D SQ G WDAVVTCFF+DTAHNI+EYIEIIS IL++GGVWINLGPLLYHFA
Sbjct: 360  GGDFVEVYNDSSQIGVWDAVVTCFFIDTAHNIIEYIEIISKILKEGGVWINLGPLLYHFA 419

Query: 524  DAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYAAFWTMTK 345
            D YG EDEMSIELSLEDVK+VA  YGF  E+E+TIETTYTTNPRSMMQN Y+A FWT+ K
Sbjct: 420  DVYGQEDEMSIELSLEDVKKVALRYGFQFEKEQTIETTYTTNPRSMMQNHYFAVFWTLRK 479

Query: 344  K 342
            K
Sbjct: 480  K 480


>ref|XP_006429750.1| hypothetical protein CICLE_v10011552mg [Citrus clementina]
            gi|568855494|ref|XP_006481339.1| PREDICTED: UPF0586
            protein C9orf41 homolog [Citrus sinensis]
            gi|557531807|gb|ESR42990.1| hypothetical protein
            CICLE_v10011552mg [Citrus clementina]
          Length = 496

 Score =  674 bits (1739), Expect = 0.0
 Identities = 337/483 (69%), Positives = 373/483 (77%)
 Frame = -3

Query: 1781 EHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKF 1602
            E     KLE ALEVKSLRRIISAYLNY +AAEEDVKRYE SF KLPP+HK LLSH P KF
Sbjct: 17   ERRRQRKLEEALEVKSLRRIISAYLNYPEAAEEDVKRYEQSFRKLPPSHKALLSHYPLKF 76

Query: 1601 QRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGERNASSAQSAS 1422
            ++LRRCI++NS+FIF+MLQAF+PPLDMSQD DIC           + S   N  S  S S
Sbjct: 77   KKLRRCISMNSYFIFAMLQAFDPPLDMSQDMDICVDSHVSHTQYDNQSDGMNVCSGHSTS 136

Query: 1421 TSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCAKQSESP 1242
            +SGR CCS+ DH       N N  S     AN+       +        +  C  + E+ 
Sbjct: 137  SSGRMCCSKADHA------NCNEQSKVVETANEMTTNEEEEAEGPIEYKTASCPGKLENR 190

Query: 1241 NISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAQE 1062
              ++    ++ ++  +DSNGN +    DWLDPS+QLNVPL DVDKVRCIIRNIVRDWA E
Sbjct: 191  EETN----QSCSNDFTDSNGNASSPACDWLDPSIQLNVPLADVDKVRCIIRNIVRDWAAE 246

Query: 1061 GQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQGNEFSYY 882
            G+ ER+QCYKPILEELD +FP+RS + PP CLVPGAGLGRLALEIS LGF+SQGNEFSYY
Sbjct: 247  GKTERDQCYKPILEELDALFPNRSKESPPACLVPGAGLGRLALEISRLGFISQGNEFSYY 306

Query: 881  MMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGITEGFSMCG 702
            MMICSSFILNH Q  GEW IYPWIHSNCNSLSD DQLRPVSIPDIHPASAGITEGFSMCG
Sbjct: 307  MMICSSFILNHTQTAGEWNIYPWIHSNCNSLSDSDQLRPVSIPDIHPASAGITEGFSMCG 366

Query: 701  GDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLGPLLYHFAD 522
            GDFVEVY DPSQ GAWDAVVTCFF+DTAHNIVEYIEIIS IL+DGGVWINLGPLLYHFAD
Sbjct: 367  GDFVEVYSDPSQVGAWDAVVTCFFIDTAHNIVEYIEIISRILKDGGVWINLGPLLYHFAD 426

Query: 521  AYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYAAFWTMTKK 342
             YG EDEMSIELSLEDVKRVA HYGF  E+EKTIETTYTTNPRSMMQNRY+ AFWTM KK
Sbjct: 427  LYGQEDEMSIELSLEDVKRVALHYGFEFEKEKTIETTYTTNPRSMMQNRYFTAFWTMRKK 486

Query: 341  GRT 333
              T
Sbjct: 487  SVT 489


>ref|XP_004149525.1| PREDICTED: UPF0586 protein C9orf41 homolog [Cucumis sativus]
          Length = 492

 Score =  665 bits (1717), Expect = 0.0
 Identities = 341/505 (67%), Positives = 374/505 (74%), Gaps = 16/505 (3%)
 Frame = -3

Query: 1799 REMSSGEHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLS 1620
            RE    E     KLE ALEVKSLRRI+SAYLNY +A+EEDVKRYE SFSKLPPAHK LLS
Sbjct: 5    REDEDEEQTRQRKLEEALEVKSLRRIVSAYLNYPEASEEDVKRYERSFSKLPPAHKALLS 64

Query: 1619 HLPSKFQRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHIS--GERN 1446
            H P KF+RLRRCI+ NS+FIF+MLQAFEPPLDMSQDTD C                GERN
Sbjct: 65   HFPLKFERLRRCISTNSYFIFNMLQAFEPPLDMSQDTDCCDGSYPDHAHDDQFCCRGERN 124

Query: 1445 AS-----------SAQSASTSGRACCSETDHE--RYGAHDNLNYISPEERFANQXXXXXX 1305
            A+           S +  STSGR C  E+       GA D     SP+    NQ      
Sbjct: 125  ANGNLCSRESNVCSGEPTSTSGRMCSLESKQICCPEGASD-----SPKASTINQEVENGV 179

Query: 1304 XDCVTGSTSISLDCAKQSESPNISDVQVVENHAS-TISDSNGNVNLSQHDWLDPSLQLNV 1128
                             +   ++ + +V + H+    SD NGN   S H+WLDPSLQLNV
Sbjct: 180  -----------------NHDQHLEEKEVTDKHSGHCASDCNGNDCSSSHEWLDPSLQLNV 222

Query: 1127 PLVDVDKVRCIIRNIVRDWAQEGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGL 948
            PLVDVDKVRCIIRNIVRDWA+EGQ+EREQCYKPILEEL  +FP R  + PP CLVPGAGL
Sbjct: 223  PLVDVDKVRCIIRNIVRDWAEEGQKEREQCYKPILEELHSLFPDRKKESPPACLVPGAGL 282

Query: 947  GRLALEISCLGFVSQGNEFSYYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLR 768
            GRLALEISCLGF+SQGNEFSYYMMICSSFILNH Q VGEWTIYPWIHSN NSLSD DQLR
Sbjct: 283  GRLALEISCLGFISQGNEFSYYMMICSSFILNHTQKVGEWTIYPWIHSNSNSLSDSDQLR 342

Query: 767  PVSIPDIHPASAGITEGFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEII 588
            PVSIPDIHPASAGITEGFSMCGGDFVEVY DPSQ G WDAVVTCFF+DTAHNI+EYIE+I
Sbjct: 343  PVSIPDIHPASAGITEGFSMCGGDFVEVYSDPSQVGLWDAVVTCFFIDTAHNIIEYIEVI 402

Query: 587  SNILRDGGVWINLGPLLYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTY 408
            S IL+DGGVWINLGPLLYHFAD YG EDEMSIE SLEDVK++  HYGF  E+E+T+ETTY
Sbjct: 403  SKILKDGGVWINLGPLLYHFADMYGQEDEMSIEPSLEDVKKIILHYGFVFEKERTVETTY 462

Query: 407  TTNPRSMMQNRYYAAFWTMTKKGRT 333
            TTNPRSMMQNRYYAAFWTM KK  T
Sbjct: 463  TTNPRSMMQNRYYAAFWTMRKKSAT 487


>ref|XP_002307920.2| hypothetical protein POPTR_0006s02420g [Populus trichocarpa]
            gi|550335301|gb|EEE91443.2| hypothetical protein
            POPTR_0006s02420g [Populus trichocarpa]
          Length = 484

 Score =  658 bits (1697), Expect = 0.0
 Identities = 336/490 (68%), Positives = 371/490 (75%), Gaps = 5/490 (1%)
 Frame = -3

Query: 1796 EMSSGEHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSH 1617
            E    E     KLE ALEVKSLRRIISAYLNY +AAEEDVKRYE SF KL  +HK LLSH
Sbjct: 8    EEEDEERLRQRKLEEALEVKSLRRIISAYLNYPEAAEEDVKRYERSFRKLSSSHKALLSH 67

Query: 1616 LPSKFQRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGERNASS 1437
             P KFQ LRRCI++NSFFI +MLQAFEPPLDMS D D C               + N  S
Sbjct: 68   YPLKFQSLRRCISINSFFIINMLQAFEPPLDMSHDVDDCGCSHFEQPP-----NDMNVCS 122

Query: 1436 AQSASTSGRACCSETDHERYGAHDNL-----NYISPEERFANQXXXXXXXDCVTGSTSIS 1272
             +SA+ SG +CCS+ D    G   N+     + ++P E    +        C+   T   
Sbjct: 123  HESAAASG-SCCSKPDEACCGEPSNMMSKPADCLAPNEEVDTEG-------CLGSDTG-- 172

Query: 1271 LDCAKQSESPNISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRCII 1092
              C    E+  ++  +   NH   +SDSNGNV  S HDWLDPSLQL VP+VDVDKVRCII
Sbjct: 173  -SCLAGRENYKMTS-ECCSNH---VSDSNGNVPSSHHDWLDPSLQLRVPMVDVDKVRCII 227

Query: 1091 RNIVRDWAQEGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGF 912
            RNIVRDWA EGQ+ER+QCYKPILEEL+ +FP RS + PP CLVPGAGLGRLALEISCLGF
Sbjct: 228  RNIVRDWAAEGQKERDQCYKPILEELNSLFPDRSNESPPTCLVPGAGLGRLALEISCLGF 287

Query: 911  VSQGNEFSYYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASA 732
            VSQGNEFSYYMMICSSFILN  +  GEWTIYPWIHSNCNSLSD DQLRPVSIPDIHPASA
Sbjct: 288  VSQGNEFSYYMMICSSFILNQTETAGEWTIYPWIHSNCNSLSDSDQLRPVSIPDIHPASA 347

Query: 731  GITEGFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWIN 552
            GITEGFSMCGGDFVEVY DPSQ G WDAVVTCFF+DTAHNIVEYIEIIS IL+DGGVWIN
Sbjct: 348  GITEGFSMCGGDFVEVYSDPSQVGVWDAVVTCFFIDTAHNIVEYIEIISRILKDGGVWIN 407

Query: 551  LGPLLYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRY 372
            LGPLLYHFAD YG EDEMSIELSLEDVKRVA +YGF +E+E TIETTYTTNPR+MMQNRY
Sbjct: 408  LGPLLYHFADVYGQEDEMSIELSLEDVKRVALNYGFEVEKESTIETTYTTNPRAMMQNRY 467

Query: 371  YAAFWTMTKK 342
            + AFWTM KK
Sbjct: 468  FPAFWTMRKK 477


>ref|XP_006429749.1| hypothetical protein CICLE_v10011552mg [Citrus clementina]
            gi|557531806|gb|ESR42989.1| hypothetical protein
            CICLE_v10011552mg [Citrus clementina]
          Length = 481

 Score =  650 bits (1676), Expect = 0.0
 Identities = 326/467 (69%), Positives = 361/467 (77%)
 Frame = -3

Query: 1781 EHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKF 1602
            E     KLE ALEVKSLRRIISAYLNY +AAEEDVKRYE SF KLPP+HK LLSH P KF
Sbjct: 17   ERRRQRKLEEALEVKSLRRIISAYLNYPEAAEEDVKRYEQSFRKLPPSHKALLSHYPLKF 76

Query: 1601 QRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGERNASSAQSAS 1422
            ++LRRCI++NS+FIF+MLQAF+PPLDMSQD DIC           + S   N  S  S S
Sbjct: 77   KKLRRCISMNSYFIFAMLQAFDPPLDMSQDMDICVDSHVSHTQYDNQSDGMNVCSGHSTS 136

Query: 1421 TSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCAKQSESP 1242
            +SGR CCS+ DH       N N  S     AN+       +        +  C  + E+ 
Sbjct: 137  SSGRMCCSKADHA------NCNEQSKVVETANEMTTNEEEEAEGPIEYKTASCPGKLENR 190

Query: 1241 NISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAQE 1062
              ++    ++ ++  +DSNGN +    DWLDPS+QLNVPL DVDKVRCIIRNIVRDWA E
Sbjct: 191  EETN----QSCSNDFTDSNGNASSPACDWLDPSIQLNVPLADVDKVRCIIRNIVRDWAAE 246

Query: 1061 GQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQGNEFSYY 882
            G+ ER+QCYKPILEELD +FP+RS + PP CLVPGAGLGRLALEIS LGF+SQGNEFSYY
Sbjct: 247  GKTERDQCYKPILEELDALFPNRSKESPPACLVPGAGLGRLALEISRLGFISQGNEFSYY 306

Query: 881  MMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGITEGFSMCG 702
            MMICSSFILNH Q  GEW IYPWIHSNCNSLSD DQLRPVSIPDIHPASAGITEGFSMCG
Sbjct: 307  MMICSSFILNHTQTAGEWNIYPWIHSNCNSLSDSDQLRPVSIPDIHPASAGITEGFSMCG 366

Query: 701  GDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLGPLLYHFAD 522
            GDFVEVY DPSQ GAWDAVVTCFF+DTAHNIVEYIEIIS IL+DGGVWINLGPLLYHFAD
Sbjct: 367  GDFVEVYSDPSQVGAWDAVVTCFFIDTAHNIVEYIEIISRILKDGGVWINLGPLLYHFAD 426

Query: 521  AYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQ 381
             YG EDEMSIELSLEDVKRVA HYGF  E+EKTIETTYTTNPRSMMQ
Sbjct: 427  LYGQEDEMSIELSLEDVKRVALHYGFEFEKEKTIETTYTTNPRSMMQ 473


>ref|XP_007145965.1| hypothetical protein PHAVU_006G001800g [Phaseolus vulgaris]
            gi|561019188|gb|ESW17959.1| hypothetical protein
            PHAVU_006G001800g [Phaseolus vulgaris]
          Length = 481

 Score =  649 bits (1673), Expect = 0.0
 Identities = 336/489 (68%), Positives = 372/489 (76%), Gaps = 9/489 (1%)
 Frame = -3

Query: 1781 EHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKF 1602
            E     KLE ALE++SLRRIISAYLNY DAAEEDV+RYE S+ KLPPAHK LL H P KF
Sbjct: 6    EQRRRLKLEEALEIQSLRRIISAYLNYPDAAEEDVRRYERSYRKLPPAHKALLPHYPRKF 65

Query: 1601 QRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGER-NASSAQS- 1428
            QRLRRCI++NS FIF MLQAFEPPLDMSQD +             H++ E  NA S +S 
Sbjct: 66   QRLRRCISMNSHFIFGMLQAFEPPLDMSQDLEFSEDPHPESAEKDHLASEGINACSCESD 125

Query: 1427 -----ASTSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDC 1263
                  S S + CC E D+        L + + E    +Q          TGS S SL  
Sbjct: 126  PSRITCSVSDQDCCVE-DNNHTCRSQGLTHSNEEVDIESQHQSN------TGSLSPSL-- 176

Query: 1262 AKQSESPNISDVQVVENHASTISDSNGNVNL--SQHDWLDPSLQLNVPLVDVDKVRCIIR 1089
                    I+  +  E    +I+DSNGNV++  SQ  WL+PSL+LNVPLVDVDKVRCIIR
Sbjct: 177  --------INTKETTEYCGHSINDSNGNVSVTSSQQQWLEPSLRLNVPLVDVDKVRCIIR 228

Query: 1088 NIVRDWAQEGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFV 909
            NIVRDWA EG++ER+QCY PILEEL+ +FP+RS   PP CLVPGAGLGRLALEISCLGF+
Sbjct: 229  NIVRDWAAEGKKERDQCYNPILEELNMLFPNRSKKSPPACLVPGAGLGRLALEISCLGFI 288

Query: 908  SQGNEFSYYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAG 729
            SQGNEFSYYMMICSSFILNH+Q  GEWTIYPWIHSNCNSLSD DQLRPVSIPDIHPASAG
Sbjct: 289  SQGNEFSYYMMICSSFILNHSQTAGEWTIYPWIHSNCNSLSDSDQLRPVSIPDIHPASAG 348

Query: 728  ITEGFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINL 549
            ITEGFSMCGGDFVEVY D SQ GAWDAVVTCFF+DTAHNIVEYIEIIS IL+DGGVWINL
Sbjct: 349  ITEGFSMCGGDFVEVYSDSSQVGAWDAVVTCFFIDTAHNIVEYIEIISKILKDGGVWINL 408

Query: 548  GPLLYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYY 369
            GPLLYHFAD YG EDEMSIELSLEDVK VA +YGF  E+E TIETTYTTNPRSMMQNRY+
Sbjct: 409  GPLLYHFADVYGQEDEMSIELSLEDVKSVALNYGFEFEKESTIETTYTTNPRSMMQNRYF 468

Query: 368  AAFWTMTKK 342
            AAFWTM KK
Sbjct: 469  AAFWTMRKK 477


>ref|XP_006605686.1| PREDICTED: UPF0586 protein C9orf41 homolog isoform X2 [Glycine max]
          Length = 488

 Score =  643 bits (1658), Expect = 0.0
 Identities = 329/477 (68%), Positives = 366/477 (76%), Gaps = 3/477 (0%)
 Frame = -3

Query: 1763 KLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKFQRLRRC 1584
            KLE ALE++SLRRIISAYLNY DAAEEDV+RYE S+ KLPP+HK LLSH   KFQRLR C
Sbjct: 14   KLEEALEIQSLRRIISAYLNYPDAAEEDVRRYERSYRKLPPSHKALLSHYSRKFQRLRWC 73

Query: 1583 ITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGERNASSAQSASTSGRAC 1404
            I++N+ FIF MLQAFEPPLDMSQD D             H+  E   S+    S   R  
Sbjct: 74   ISMNTHFIFGMLQAFEPPLDMSQDVDFSEDPHPESTQKDHLVSE-GISACSCESVPVRIT 132

Query: 1403 CSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCAKQSESPN-ISDV 1227
            CS +D  R     N   IS  +  +N+        C   +T         S SP+ I   
Sbjct: 133  CSVSDQHRCVEGGNHTCISQAQMHSNEEVDIE--SCHQSNTG--------SHSPSMIHPK 182

Query: 1226 QVVENHASTISDSNGNVNL--SQHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAQEGQR 1053
            +  E   S I+DSNGNV +  SQ  WLDPSL+LNVPLVDVDKVRCIIRNIVRDWA EG+ 
Sbjct: 183  ETSEYCGSPIADSNGNVPVTSSQQQWLDPSLKLNVPLVDVDKVRCIIRNIVRDWAAEGKN 242

Query: 1052 EREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQGNEFSYYMMI 873
            ER+QCY PIL+EL+ +FP+RS D PP CLVPGAGLGRLALEISCLGF+SQGNEFSYYMMI
Sbjct: 243  ERDQCYSPILDELNMLFPNRSKDSPPACLVPGAGLGRLALEISCLGFISQGNEFSYYMMI 302

Query: 872  CSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGITEGFSMCGGDF 693
            CSSFILNH+Q  GEWTIYPWIHSNCNSLSD DQLRPVSIPD+HPASAGITEGFSMCGGDF
Sbjct: 303  CSSFILNHSQTAGEWTIYPWIHSNCNSLSDSDQLRPVSIPDMHPASAGITEGFSMCGGDF 362

Query: 692  VEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLGPLLYHFADAYG 513
            VEVY D SQ GAWDAVVTCFF+DTAHNIVEYIEIIS IL++GGVWINLGPLLYHFAD YG
Sbjct: 363  VEVYSDSSQVGAWDAVVTCFFIDTAHNIVEYIEIISKILKEGGVWINLGPLLYHFADMYG 422

Query: 512  TEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYAAFWTMTKK 342
             +DEMSIELSLEDVKRVA HYGF LE+E+TIETTYT N RSMMQNRY++AFWTM KK
Sbjct: 423  QDDEMSIELSLEDVKRVALHYGFELEKERTIETTYTANSRSMMQNRYFSAFWTMRKK 479


>ref|XP_007215348.1| hypothetical protein PRUPE_ppa005412mg [Prunus persica]
            gi|462411498|gb|EMJ16547.1| hypothetical protein
            PRUPE_ppa005412mg [Prunus persica]
          Length = 462

 Score =  640 bits (1650), Expect = 0.0
 Identities = 327/493 (66%), Positives = 367/493 (74%), Gaps = 6/493 (1%)
 Frame = -3

Query: 1802 EREMSSGEHHEPS-----KLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPA 1638
            EREM  GE  +       KLE ALEVKSLRRIISAYLNY +AAEEDV+RYE SF  LPP+
Sbjct: 8    EREMHRGEDEDDERRRQRKLEEALEVKSLRRIISAYLNYPEAAEEDVRRYERSFKILPPS 67

Query: 1637 HKVLLSHLPSKFQRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXH-I 1461
            HK LLSH P KFQRLRRCI+VNS+FIFSMLQAFEPPLD+SQD D+            H +
Sbjct: 68   HKALLSHYPLKFQRLRRCISVNSYFIFSMLQAFEPPLDLSQDMDVRDGPHLERVSYNHDV 127

Query: 1460 SGERNASSAQSASTSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGST 1281
            SG ++ SS+QS STS R   S +D                              C  GS+
Sbjct: 128  SGVKSVSSSQSNSTSERMHISNSDQAC---------------------------CGEGSS 160

Query: 1280 SISLDCAKQSESPNISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVR 1101
            ++                      ++ I  +   V+     WLDPSLQL+VPLVDVDKVR
Sbjct: 161  AVC---------------------STPIGVTTKKVSSPTRTWLDPSLQLHVPLVDVDKVR 199

Query: 1100 CIIRNIVRDWAQEGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISC 921
            CI+RNIVRDWA EGQ+ER+QCYKPILEELD +F  RS + PP CLVPGAGLGRLALEISC
Sbjct: 200  CIVRNIVRDWAAEGQKERDQCYKPILEELDSLFADRSKESPPACLVPGAGLGRLALEISC 259

Query: 920  LGFVSQGNEFSYYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHP 741
            LGF+SQGNEFSYYMMICSSFILNH++  GEWTIYPWIHSNCNSLSD DQLRPVS+PDIHP
Sbjct: 260  LGFISQGNEFSYYMMICSSFILNHSRTAGEWTIYPWIHSNCNSLSDSDQLRPVSVPDIHP 319

Query: 740  ASAGITEGFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGV 561
            ASAGITEGFSMCGGDFVEVY DP+Q G WDAVVTCFF+DTAHNIVEYIEIIS IL+DGGV
Sbjct: 320  ASAGITEGFSMCGGDFVEVYNDPNQVGVWDAVVTCFFIDTAHNIVEYIEIISRILKDGGV 379

Query: 560  WINLGPLLYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQ 381
            WIN+GPLLYHFA+ YG +DEMSIELSLEDVKRVA HYGFH E+E+TIETTYTTNP+SMMQ
Sbjct: 380  WINMGPLLYHFAEMYGQDDEMSIELSLEDVKRVALHYGFHFEKERTIETTYTTNPKSMMQ 439

Query: 380  NRYYAAFWTMTKK 342
            NRY AAFWTM K+
Sbjct: 440  NRYNAAFWTMRKR 452


>ref|XP_006605685.1| PREDICTED: UPF0586 protein C9orf41 homolog isoform X1 [Glycine max]
          Length = 499

 Score =  635 bits (1639), Expect = e-179
 Identities = 329/488 (67%), Positives = 367/488 (75%), Gaps = 14/488 (2%)
 Frame = -3

Query: 1763 KLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKFQRLRRC 1584
            KLE ALE++SLRRIISAYLNY DAAEEDV+RYE S+ KLPP+HK LLSH   KFQRLR C
Sbjct: 14   KLEEALEIQSLRRIISAYLNYPDAAEEDVRRYERSYRKLPPSHKALLSHYSRKFQRLRWC 73

Query: 1583 ITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGERNASSAQSASTSGRAC 1404
            I++N+ FIF MLQAFEPPLDMSQD D             H+  E   S+    S   R  
Sbjct: 74   ISMNTHFIFGMLQAFEPPLDMSQDVDFSEDPHPESTQKDHLVSE-GISACSCESVPVRIT 132

Query: 1403 CSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCAKQSESPN-ISDV 1227
            CS +D  R     N   IS  +  +N+        C   +T         S SP+ I   
Sbjct: 133  CSVSDQHRCVEGGNHTCISQAQMHSNEEVDIE--SCHQSNTG--------SHSPSMIHPK 182

Query: 1226 QVVENHASTISDSNGNVNL--SQHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAQEGQR 1053
            +  E   S I+DSNGNV +  SQ  WLDPSL+LNVPLVDVDKVRCIIRNIVRDWA EG+ 
Sbjct: 183  ETSEYCGSPIADSNGNVPVTSSQQQWLDPSLKLNVPLVDVDKVRCIIRNIVRDWAAEGKN 242

Query: 1052 EREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQGNEFSYYMMI 873
            ER+QCY PIL+EL+ +FP+RS D PP CLVPGAGLGRLALEISCLGF+SQGNEFSYYMMI
Sbjct: 243  ERDQCYSPILDELNMLFPNRSKDSPPACLVPGAGLGRLALEISCLGFISQGNEFSYYMMI 302

Query: 872  CSSFILNHAQMVG-----------EWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGI 726
            CSSFILNH+Q +G           EWTIYPWIHSNCNSLSD DQLRPVSIPD+HPASAGI
Sbjct: 303  CSSFILNHSQSIGLMEHLSSQTAGEWTIYPWIHSNCNSLSDSDQLRPVSIPDMHPASAGI 362

Query: 725  TEGFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLG 546
            TEGFSMCGGDFVEVY D SQ GAWDAVVTCFF+DTAHNIVEYIEIIS IL++GGVWINLG
Sbjct: 363  TEGFSMCGGDFVEVYSDSSQVGAWDAVVTCFFIDTAHNIVEYIEIISKILKEGGVWINLG 422

Query: 545  PLLYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYA 366
            PLLYHFAD YG +DEMSIELSLEDVKRVA HYGF LE+E+TIETTYT N RSMMQNRY++
Sbjct: 423  PLLYHFADMYGQDDEMSIELSLEDVKRVALHYGFELEKERTIETTYTANSRSMMQNRYFS 482

Query: 365  AFWTMTKK 342
            AFWTM KK
Sbjct: 483  AFWTMRKK 490


>ref|XP_006345754.1| PREDICTED: UPF0586 protein C9orf41-like [Solanum tuberosum]
          Length = 468

 Score =  635 bits (1637), Expect = e-179
 Identities = 324/488 (66%), Positives = 361/488 (73%), Gaps = 3/488 (0%)
 Frame = -3

Query: 1796 EMSSGEHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSH 1617
            E+ + +  +  + E ALEVKSLRRIISAYLNY +AAEEDVKR+E S +KLPP HK LLSH
Sbjct: 3    EIETADELQRREFEEALEVKSLRRIISAYLNYPEAAEEDVKRWERSLTKLPPHHKALLSH 62

Query: 1616 LPSKFQRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXH-ISGERNAS 1440
            LPSKFQ+LR CIT NS+FIF ML+ FEPPLDMSQD DI            H  S  RN S
Sbjct: 63   LPSKFQKLRWCITENSYFIFEMLKMFEPPLDMSQDVDIRENQHLDDVSGSHHFSRSRNLS 122

Query: 1439 SAQSASTSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCA 1260
              +S STSG   C            N  Y +P  +                         
Sbjct: 123  LCESTSTSGGVDCHCLAEPSSKETCNGKYPAPFNK------------------------- 157

Query: 1259 KQSESPNISDVQVVENHASTISDS--NGNVNLSQHDWLDPSLQLNVPLVDVDKVRCIIRN 1086
                   + D + + N  +  + +  NG V+ S  +WLDPSLQL+VPLVDVDKVRCIIRN
Sbjct: 158  ----EQEVDDCKSLPNQDTLYASACCNGKVSSSPPEWLDPSLQLHVPLVDVDKVRCIIRN 213

Query: 1085 IVRDWAQEGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVS 906
            IVRDWA EGQ+ER+QCY+PILEEL+R+FP+RS + PP CLVPGAGLGRLALEISCLGF S
Sbjct: 214  IVRDWANEGQKERDQCYRPILEELERLFPNRSNENPPACLVPGAGLGRLALEISCLGFAS 273

Query: 905  QGNEFSYYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGI 726
            QGNEFSYYMMICSSFILNH Q  GEWTI+PWIHSNCNS+SD DQLRPVS+PDIHPASAGI
Sbjct: 274  QGNEFSYYMMICSSFILNHTQAAGEWTIFPWIHSNCNSVSDNDQLRPVSVPDIHPASAGI 333

Query: 725  TEGFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLG 546
            TEGFSMCGGDFVEVY DPSQ G WDAVVTCFFLDTAHNIVEYIEIIS +L+DGGVWINLG
Sbjct: 334  TEGFSMCGGDFVEVYSDPSQAGVWDAVVTCFFLDTAHNIVEYIEIISKVLKDGGVWINLG 393

Query: 545  PLLYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYA 366
            PLLYHFAD Y  EDEMSI+LSLEDVKRVA HYGF  E+E TIETTYTTN RSMMQNRYYA
Sbjct: 394  PLLYHFADMYSPEDEMSIDLSLEDVKRVALHYGFIFEKESTIETTYTTNLRSMMQNRYYA 453

Query: 365  AFWTMTKK 342
            AFWTM KK
Sbjct: 454  AFWTMRKK 461


>ref|XP_006593845.1| PREDICTED: UPF0586 protein C9orf41 homolog isoform X1 [Glycine max]
          Length = 487

 Score =  632 bits (1629), Expect = e-178
 Identities = 327/482 (67%), Positives = 363/482 (75%), Gaps = 8/482 (1%)
 Frame = -3

Query: 1763 KLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKFQRLRRC 1584
            KLE ALE++SLRRIISAYLNY DAAEEDV+R E S+ KLPP+HK LLS  P KFQRLR C
Sbjct: 13   KLEEALEIQSLRRIISAYLNYPDAAEEDVRRNERSYRKLPPSHKALLSQYPQKFQRLRWC 72

Query: 1583 ITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGER-NASSAQSASTSGRA 1407
            I++N+ FIFSMLQAFEPPLDMSQD D             H+  E  +A S +SA    R 
Sbjct: 73   ISMNTHFIFSMLQAFEPPLDMSQDADFSEDPHPESAQKDHLVSEGISACSCESAPV--RI 130

Query: 1406 CCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCAKQSESPN---- 1239
             CS +D        N    S  +  +N+               + ++   QS + N    
Sbjct: 131  TCSVSDQHCCVEGSNHTCRSQAQMHSNE--------------EVGIESRHQSNTGNHSPR 176

Query: 1238 -ISDVQVVENHASTISDSNGNV--NLSQHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWA 1068
             I   +  E   S I+DS GNV    SQ  WL PSL+LNVPLVD DKVRCIIRNIVRDWA
Sbjct: 177  LIHTKETREYCGSPIADSKGNVPDTSSQQQWLAPSLKLNVPLVDADKVRCIIRNIVRDWA 236

Query: 1067 QEGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQGNEFS 888
             EG++ER+QCY PILEEL+ +FP+RS + PP CLVPGAGLGRLALEISCLGF+SQGNEFS
Sbjct: 237  AEGKKERDQCYNPILEELNMLFPNRSKESPPACLVPGAGLGRLALEISCLGFISQGNEFS 296

Query: 887  YYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGITEGFSM 708
            YYMMICSSFILNH+Q  GEWTIYPWIHSNCNSLSD DQLRPVSIPDIHPASAGITEGFSM
Sbjct: 297  YYMMICSSFILNHSQTAGEWTIYPWIHSNCNSLSDSDQLRPVSIPDIHPASAGITEGFSM 356

Query: 707  CGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLGPLLYHF 528
            CGGDFVEVY D SQ GAWDAVVTCFF+DTAHNIVEYIEIIS IL+DGGVWINLGPLLYHF
Sbjct: 357  CGGDFVEVYSDSSQIGAWDAVVTCFFIDTAHNIVEYIEIISKILKDGGVWINLGPLLYHF 416

Query: 527  ADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYAAFWTMT 348
            AD YG +DEMSIELSLEDVKRVAFHYGF  E E+TIETTYT N RSMMQNRY+AAFWTM 
Sbjct: 417  ADMYGQDDEMSIELSLEDVKRVAFHYGFEFENERTIETTYTANSRSMMQNRYFAAFWTMR 476

Query: 347  KK 342
            KK
Sbjct: 477  KK 478


>ref|XP_004305888.1| PREDICTED: UPF0586 protein C9orf41 homolog [Fragaria vesca subsp.
            vesca]
          Length = 493

 Score =  630 bits (1625), Expect = e-178
 Identities = 320/483 (66%), Positives = 360/483 (74%)
 Frame = -3

Query: 1781 EHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKF 1602
            E     + E ALE+KSLRRIISAYLNY +AAEEDV+RYE SF  LPP HK LLSH  SKF
Sbjct: 18   EERRRPRHEEALEIKSLRRIISAYLNYPEAAEEDVRRYERSFKMLPPPHKALLSHYHSKF 77

Query: 1601 QRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGERNASSAQSAS 1422
            ++LRRCI+ NS+FIF MLQAF+PPLD+SQD D              +S  R + S+Q  S
Sbjct: 78   EKLRRCISANSYFIFDMLQAFQPPLDLSQDVDDYDGLPENISTNHDVS--RVSKSSQLTS 135

Query: 1421 TSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCAKQSESP 1242
            T+     S    E Y   +  N +        +         + GS + SL+  K     
Sbjct: 136  TNTHVSKSVQAAEAYVV-ERSNTVCNCNLPIGEDKHEGHGGSINGSHTSSLEYTK----- 189

Query: 1241 NISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAQE 1062
               D+ +   H +   DSNGNV+ S   WLDPS+QL+VPLVDVDKVRCIIRNIVRDWA E
Sbjct: 190  ---DIHIC--HGNNAIDSNGNVSSSTRTWLDPSIQLHVPLVDVDKVRCIIRNIVRDWAAE 244

Query: 1061 GQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQGNEFSYY 882
            GQ+ER+QCY PILEELD +F +RS + PP CLVPGAGLGRLALEIS  GF+ QGNEFSYY
Sbjct: 245  GQKERDQCYTPILEELDSLFVNRSKESPPACLVPGAGLGRLALEISSRGFICQGNEFSYY 304

Query: 881  MMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGITEGFSMCG 702
            MMICSSFILN  Q  GEWTIYPWIHSNCNSLSD DQLRP+ IPDIHPASAGITEGFSMCG
Sbjct: 305  MMICSSFILNDCQTAGEWTIYPWIHSNCNSLSDDDQLRPIPIPDIHPASAGITEGFSMCG 364

Query: 701  GDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLGPLLYHFAD 522
            GDFVEVY DPSQ GAWDAVVTCFF+DTAHNIVEYIEIIS IL++GGVWINLGPLLYHFAD
Sbjct: 365  GDFVEVYNDPSQVGAWDAVVTCFFIDTAHNIVEYIEIISRILKEGGVWINLGPLLYHFAD 424

Query: 521  AYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYAAFWTMTKK 342
             YG  D+MSIELSLEDVKRVA HYGFH+E+E TIETTYTTNP+SMMQNRYYAAFWTM KK
Sbjct: 425  VYGQGDDMSIELSLEDVKRVALHYGFHIEKETTIETTYTTNPKSMMQNRYYAAFWTMRKK 484

Query: 341  GRT 333
              T
Sbjct: 485  STT 487


>ref|XP_007145966.1| hypothetical protein PHAVU_006G001800g [Phaseolus vulgaris]
            gi|561019189|gb|ESW17960.1| hypothetical protein
            PHAVU_006G001800g [Phaseolus vulgaris]
          Length = 442

 Score =  626 bits (1614), Expect = e-176
 Identities = 322/480 (67%), Positives = 353/480 (73%)
 Frame = -3

Query: 1781 EHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKF 1602
            E     KLE ALE++SLRRIISAYLNY DAAEEDV+RYE S+ KLPPAHK LL H P KF
Sbjct: 6    EQRRRLKLEEALEIQSLRRIISAYLNYPDAAEEDVRRYERSYRKLPPAHKALLPHYPRKF 65

Query: 1601 QRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGERNASSAQSAS 1422
            QRLRRCI++NS FIF MLQAFEPPLDMSQD +                   +A     AS
Sbjct: 66   QRLRRCISMNSHFIFGMLQAFEPPLDMSQDLEFSEDPHP-----------ESAEKDHLAS 114

Query: 1421 TSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCAKQSESP 1242
                AC  E+D  R                            +T S S   DC  +  + 
Sbjct: 115  EGINACSCESDPSR----------------------------ITCSVS-DQDCCVEDNNH 145

Query: 1241 NISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAQE 1062
                  +   H++ +S     V  SQ  WL+PSL+LNVPLVDVDKVRCIIRNIVRDWA E
Sbjct: 146  TCRSQGLT--HSNEVS-----VTSSQQQWLEPSLRLNVPLVDVDKVRCIIRNIVRDWAAE 198

Query: 1061 GQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQGNEFSYY 882
            G++ER+QCY PILEEL+ +FP+RS   PP CLVPGAGLGRLALEISCLGF+SQGNEFSYY
Sbjct: 199  GKKERDQCYNPILEELNMLFPNRSKKSPPACLVPGAGLGRLALEISCLGFISQGNEFSYY 258

Query: 881  MMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGITEGFSMCG 702
            MMICSSFILNH+Q  GEWTIYPWIHSNCNSLSD DQLRPVSIPDIHPASAGITEGFSMCG
Sbjct: 259  MMICSSFILNHSQTAGEWTIYPWIHSNCNSLSDSDQLRPVSIPDIHPASAGITEGFSMCG 318

Query: 701  GDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLGPLLYHFAD 522
            GDFVEVY D SQ GAWDAVVTCFF+DTAHNIVEYIEIIS IL+DGGVWINLGPLLYHFAD
Sbjct: 319  GDFVEVYSDSSQVGAWDAVVTCFFIDTAHNIVEYIEIISKILKDGGVWINLGPLLYHFAD 378

Query: 521  AYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYAAFWTMTKK 342
             YG EDEMSIELSLEDVK VA +YGF  E+E TIETTYTTNPRSMMQNRY+AAFWTM KK
Sbjct: 379  VYGQEDEMSIELSLEDVKSVALNYGFEFEKESTIETTYTTNPRSMMQNRYFAAFWTMRKK 438


>ref|XP_004502075.1| PREDICTED: LOW QUALITY PROTEIN: UPF0586 protein C9orf41 homolog
            [Cicer arietinum]
          Length = 492

 Score =  625 bits (1611), Expect = e-176
 Identities = 326/495 (65%), Positives = 366/495 (73%), Gaps = 10/495 (2%)
 Frame = -3

Query: 1796 EMSSGEHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSH 1617
            E +  E     KLE ALE++SLRRI+SAYLNY DAA+EDV+RYE SF KLPPAHK LLSH
Sbjct: 2    EDAEEEKRRRLKLEEALEIQSLRRIVSAYLNYPDAADEDVRRYERSFKKLPPAHKDLLSH 61

Query: 1616 LPSKFQRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGER-NAS 1440
             P KF+RLR CI++NS FIF+MLQAFEPPLDMSQD D+            ++  E  N  
Sbjct: 62   YPLKFKRLRWCISMNSHFIFNMLQAFEPPLDMSQDIDLSEDAHPEYCQKDNLVCEGINCC 121

Query: 1439 SAQSASTSGRACCSETDHERY--GAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLD 1266
            S +SA    R  CS ++      G +D+     PE                TGS   ++ 
Sbjct: 122  SCESAPL--RITCSVSNQHGCVEGNNDSCRSSVPEHP-NEDVNIESHHQSNTGSHPSNMI 178

Query: 1265 CAKQSESPNISDVQVVENHASTISDSNGNV--NLSQHDWLDPSLQLNVPLVDVDKVRCII 1092
              K +           E   S I+DSNGNV     Q  WLDPS Q NVPLVDVDKVRCII
Sbjct: 179  HTKDNS----------EYGGSAIADSNGNVMDTSPQQQWLDPSFQFNVPLVDVDKVRCII 228

Query: 1091 RNIVRDWAQEGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGF 912
            RN+VRDWA EGQ+ER+QCYKPILEEL+ +FP RS + PP CLVPGAGLGRLAL+IS LGF
Sbjct: 229  RNVVRDWAVEGQKERDQCYKPILEELNILFPDRSKESPPACLVPGAGLGRLALDISSLGF 288

Query: 911  VSQGNEFSYYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASA 732
            + QGNEFSYYMMICSSFILN+ Q  GEWTIYPWIHSNCNSLSD DQLRPVSIPDIHPASA
Sbjct: 289  ICQGNEFSYYMMICSSFILNNCQTAGEWTIYPWIHSNCNSLSDSDQLRPVSIPDIHPASA 348

Query: 731  GITEGFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWIN 552
            GITEGFSMCGGDFVEVY DPSQ GAWDAVVTCFF+DTAHNIVEYIEIIS IL+DGGVWIN
Sbjct: 349  GITEGFSMCGGDFVEVYSDPSQIGAWDAVVTCFFIDTAHNIVEYIEIISQILKDGGVWIN 408

Query: 551  LGPLLYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETT-----YTTNPRSM 387
            LGPLLYHFAD YG +DEMS+ELSLEDVKR+A HYGF  E+E+TIETT     YTTNP+SM
Sbjct: 409  LGPLLYHFADTYGQDDEMSVELSLEDVKRIALHYGFEFEKERTIETTYTXXXYTTNPKSM 468

Query: 386  MQNRYYAAFWTMTKK 342
            MQNRY+AAFWTM KK
Sbjct: 469  MQNRYFAAFWTMRKK 483


>ref|NP_850185.1| S-adenosyl-L-methionine-dependent methyltransferases superfamily
            protein [Arabidopsis thaliana] gi|20259498|gb|AAM13869.1|
            unknown protein [Arabidopsis thaliana]
            gi|22136766|gb|AAM91702.1| unknown protein [Arabidopsis
            thaliana] gi|330253550|gb|AEC08644.1|
            S-adenosyl-L-methionine-dependent methyltransferases
            superfamily protein [Arabidopsis thaliana]
          Length = 504

 Score =  624 bits (1609), Expect = e-176
 Identities = 317/492 (64%), Positives = 366/492 (74%), Gaps = 6/492 (1%)
 Frame = -3

Query: 1799 REMSSGEHHEP----SKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHK 1632
            RE+   E  E      KLE ALE KSLRRIISAYLNY +A+EED+KR+E S+ KL PAHK
Sbjct: 26   RELVDNEEEEKIRRQKKLEEALEAKSLRRIISAYLNYPEASEEDLKRWERSYRKLSPAHK 85

Query: 1631 VLLSHLPSKFQRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHIS-G 1455
             L+ H P KFQRLRRCI+ NS+FIF+MLQAFEPP+D+SQ+ D C             +  
Sbjct: 86   ALVPHYPMKFQRLRRCISANSYFIFNMLQAFEPPIDLSQELDGCEDSNLDCAPHERYTLD 145

Query: 1454 ERNASSAQSASTSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSI 1275
            ER+ SS Q A T+      E+ H R    D +  +S EE    +       D        
Sbjct: 146  ERHDSSCQPALTNSCTYKEESKHIR----DPITGVSIEELQRKEAHDHSPKD-------- 193

Query: 1274 SLDCAKQSESPNISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRCI 1095
                   S    I+D +  + H   ++  +G+V+ S HDWLD SLQ +VPLVDVDKVRCI
Sbjct: 194  ------DSADTRIND-KTCDCHEGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDVDKVRCI 246

Query: 1094 IRNIVRDWAQEGQREREQCYKPILEELDRIFPSRSTDR-PPCCLVPGAGLGRLALEISCL 918
            IRNIVRDWA EGQRER+QCYKPILEELD +FP R  +  PP CLVPGAGLGRLALEISCL
Sbjct: 247  IRNIVRDWAAEGQRERDQCYKPILEELDSLFPDRLKESTPPACLVPGAGLGRLALEISCL 306

Query: 917  GFVSQGNEFSYYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPA 738
            GF+SQGNEFSYYMMICSSFILN+ Q+ GEWTIYPWIHSNCNSLSD DQLRP++IPDIHPA
Sbjct: 307  GFISQGNEFSYYMMICSSFILNYTQVPGEWTIYPWIHSNCNSLSDNDQLRPIAIPDIHPA 366

Query: 737  SAGITEGFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVW 558
            SAGITEGFSMCGGDFVEVY + S  G WDAVVTCFF+DTAHN++EYI+ IS IL+DGGVW
Sbjct: 367  SAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYIQTISKILKDGGVW 426

Query: 557  INLGPLLYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQN 378
            INLGPLLYHFAD YG E+EMSIELSLEDVKRVA H+GF +E+E+TIETTYTTNPR+MMQN
Sbjct: 427  INLGPLLYHFADTYGHENEMSIELSLEDVKRVASHFGFVIEKERTIETTYTTNPRAMMQN 486

Query: 377  RYYAAFWTMTKK 342
            RYY AFWTM KK
Sbjct: 487  RYYTAFWTMRKK 498


>ref|XP_002879368.1| hypothetical protein ARALYDRAFT_902264 [Arabidopsis lyrata subsp.
            lyrata] gi|297325207|gb|EFH55627.1| hypothetical protein
            ARALYDRAFT_902264 [Arabidopsis lyrata subsp. lyrata]
          Length = 508

 Score =  622 bits (1605), Expect = e-175
 Identities = 316/492 (64%), Positives = 366/492 (74%), Gaps = 7/492 (1%)
 Frame = -3

Query: 1799 REMSSGEHHEPS-----KLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAH 1635
            RE+ + E  E       KLE ALE KSLRRIISAYLNY +A+EED+KR+E S+ KL P+H
Sbjct: 30   REVDNKEEEEEKIRRQKKLEEALEAKSLRRIISAYLNYPEASEEDLKRWERSYRKLSPSH 89

Query: 1634 KVLLSHLPSKFQRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHIS- 1458
            K L+SH P KFQRLRRCI+ NS+FIF+MLQAFEPP+D+SQ+ D C             + 
Sbjct: 90   KALVSHYPIKFQRLRRCISANSYFIFNMLQAFEPPIDLSQELDGCEDSNLECAPHERYTL 149

Query: 1457 GERNASSAQSASTSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTS 1278
             ER+ SS Q A T+      E+ H R    + +  +S EE    +       D    +  
Sbjct: 150  DERHDSSCQPALTNSCTYKEESKHIR----EPITGVSIEELQRKEAHDHSSKDDSADARI 205

Query: 1277 ISLDCAKQSESPNISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRC 1098
             +  C               E     ++  +G+V+ S HDWLD SLQ +VPLVDVDKVRC
Sbjct: 206  TNKTC---------------ECDGGQLNHDHGSVSFSSHDWLDSSLQTHVPLVDVDKVRC 250

Query: 1097 IIRNIVRDWAQEGQREREQCYKPILEELDRIFPSRSTDR-PPCCLVPGAGLGRLALEISC 921
            IIRNIVRDWA EGQRER+QCYKPILEELD +FP RS +  PP CLVPGAGLGRLALEISC
Sbjct: 251  IIRNIVRDWAAEGQRERDQCYKPILEELDSLFPDRSKESTPPACLVPGAGLGRLALEISC 310

Query: 920  LGFVSQGNEFSYYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHP 741
            LGF+SQGNEFSYYMMICSSFILN++Q+ GEWTIYPWIHSNCNSLSD DQLRP++IPDIHP
Sbjct: 311  LGFISQGNEFSYYMMICSSFILNYSQVPGEWTIYPWIHSNCNSLSDNDQLRPIAIPDIHP 370

Query: 740  ASAGITEGFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGV 561
            ASAGITEGFSMCGGDFVEVY + S  G WDAVVTCFF+DTAHN++EYIE IS IL+DGGV
Sbjct: 371  ASAGITEGFSMCGGDFVEVYNESSHAGMWDAVVTCFFIDTAHNVIEYIETISKILKDGGV 430

Query: 560  WINLGPLLYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQ 381
            WINLGPLLYHFAD YG E+EMSIELSLEDVKRVA HYGF +E+E+TIETTYTTNPR+MMQ
Sbjct: 431  WINLGPLLYHFADTYGHENEMSIELSLEDVKRVASHYGFVIEKERTIETTYTTNPRAMMQ 490

Query: 380  NRYYAAFWTMTK 345
            NRYY AFWTM K
Sbjct: 491  NRYYTAFWTMRK 502


>ref|XP_004239631.1| PREDICTED: UPF0586 protein C9orf41-like [Solanum lycopersicum]
          Length = 461

 Score =  617 bits (1591), Expect = e-174
 Identities = 325/486 (66%), Positives = 358/486 (73%), Gaps = 1/486 (0%)
 Frame = -3

Query: 1796 EMSSGEHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSH 1617
            E+ + +  +  + E ALEVKSLRRIISAYLNY +AAEEDVKR+E S +KLPP HK LLSH
Sbjct: 3    EIETADELQRREFEEALEVKSLRRIISAYLNYPEAAEEDVKRWERSLTKLPPHHKDLLSH 62

Query: 1616 LPSKFQRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXH-ISGERNAS 1440
            LP+KFQ+LR CIT NS+FIF ML+ FEPPLDMSQD DI            H  S  RN  
Sbjct: 63   LPAKFQKLRWCITENSYFIFEMLKMFEPPLDMSQDVDIREDQHLDDVSGSHHFSRSRNLC 122

Query: 1439 SAQSASTSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCA 1260
              +S STSG   C            N  Y SP  +           DC            
Sbjct: 123  LCESTSTSGGVDCHCLAEPSSKETCNGKYPSPFNK------EQEVDDC------------ 164

Query: 1259 KQSESPNISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRCIIRNIV 1080
               +SP   D      +AS     NG V+ S  +WLDPSLQL+VPLVDVDKVRCIIRNIV
Sbjct: 165  ---KSPPDQDTL----YASACC--NGKVSSSPPEWLDPSLQLHVPLVDVDKVRCIIRNIV 215

Query: 1079 RDWAQEGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQG 900
            RDWA EGQ+ER+QCY+PILEEL+R+FP+RS + PP CLVPGAGLGRLALEISCLGF SQG
Sbjct: 216  RDWANEGQKERDQCYRPILEELERLFPNRSNENPPACLVPGAGLGRLALEISCLGFASQG 275

Query: 899  NEFSYYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGITE 720
            NEFSYYMMICSSFILNH Q  GEWTI+PWIHSNCNS+SD DQLRPVS+PDIHPASAGITE
Sbjct: 276  NEFSYYMMICSSFILNHTQAAGEWTIFPWIHSNCNSVSDNDQLRPVSVPDIHPASAGITE 335

Query: 719  GFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLGPL 540
            GFSMCGGDFVEVY DPSQ     AVVTCFFLDTAHNIVEYIEIIS +L+DGGVWINLGPL
Sbjct: 336  GFSMCGGDFVEVYSDPSQ-----AVVTCFFLDTAHNIVEYIEIISKVLKDGGVWINLGPL 390

Query: 539  LYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYAAF 360
            LYHFAD Y  EDEMSI+LSLEDVKRVA HYGF  E+E TIETTYTTN RSMMQNRYYAAF
Sbjct: 391  LYHFADMYSPEDEMSIDLSLEDVKRVALHYGFIFEKESTIETTYTTNLRSMMQNRYYAAF 450

Query: 359  WTMTKK 342
            WTM KK
Sbjct: 451  WTMRKK 456


>ref|XP_007049003.1| S-adenosyl-L-methionine-dependent methyltransferases superfamily
            protein isoform 2 [Theobroma cacao]
            gi|508701264|gb|EOX93160.1|
            S-adenosyl-L-methionine-dependent methyltransferases
            superfamily protein isoform 2 [Theobroma cacao]
          Length = 435

 Score =  611 bits (1575), Expect = e-172
 Identities = 303/432 (70%), Positives = 336/432 (77%), Gaps = 1/432 (0%)
 Frame = -3

Query: 1634 KVLLSHLPSKFQRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHI-S 1458
            K LLSH P KFQRLRRCI+VNS+FIF+MLQ+FEPPLDMSQD DIC           H  S
Sbjct: 9    KALLSHYPLKFQRLRRCISVNSYFIFNMLQSFEPPLDMSQDVDICEDPHLENFQHEHCHS 68

Query: 1457 GERNASSAQSASTSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTS 1278
             ERNA   QSASTSGR CCS           N+      E    +       + ++GS  
Sbjct: 69   EERNACFCQSASTSGRMCCSNLAQACSQERSNIISNPTAETTHEEVQSGHQHETISGS-- 126

Query: 1277 ISLDCAKQSESPNISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRC 1098
                CA +  +    D ++ E   + ++DSNGNV  S HDWLDPSLQLNVPLVDVDKVRC
Sbjct: 127  ----CAGEVGN----DKEIAECCGNDVTDSNGNVFSSPHDWLDPSLQLNVPLVDVDKVRC 178

Query: 1097 IIRNIVRDWAQEGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCL 918
            IIRNIVRDWA EG++ER+QCYKPILEELD +FP+RS + PP CLVPGAGLGRLALEISCL
Sbjct: 179  IIRNIVRDWAAEGEKERDQCYKPILEELDALFPNRSKESPPACLVPGAGLGRLALEISCL 238

Query: 917  GFVSQGNEFSYYMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPA 738
            GF+SQGNEFSYYMM+CSSFILNH Q  GEWTIYPWIHSNCNSLSD DQLRPVSIPDIHPA
Sbjct: 239  GFISQGNEFSYYMMLCSSFILNHTQTTGEWTIYPWIHSNCNSLSDNDQLRPVSIPDIHPA 298

Query: 737  SAGITEGFSMCGGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVW 558
            SAGITEGFSMCGGDFVEVY D SQ G WDAVVTCFF+DTAHNI+EYIEIIS IL++GGVW
Sbjct: 299  SAGITEGFSMCGGDFVEVYNDSSQIGVWDAVVTCFFIDTAHNIIEYIEIISKILKEGGVW 358

Query: 557  INLGPLLYHFADAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQN 378
            INLGPLLYHFAD YG EDEMSIELSLEDVK+VA  YGF  E+E+TIETTYTTNPRSMMQN
Sbjct: 359  INLGPLLYHFADVYGQEDEMSIELSLEDVKKVALRYGFQFEKEQTIETTYTTNPRSMMQN 418

Query: 377  RYYAAFWTMTKK 342
             Y+A FWT+ KK
Sbjct: 419  HYFAVFWTLRKK 430


>ref|XP_006853312.1| hypothetical protein AMTR_s00032p00047690 [Amborella trichopoda]
            gi|548856965|gb|ERN14779.1| hypothetical protein
            AMTR_s00032p00047690 [Amborella trichopoda]
          Length = 471

 Score =  610 bits (1573), Expect = e-172
 Identities = 311/481 (64%), Positives = 358/481 (74%), Gaps = 1/481 (0%)
 Frame = -3

Query: 1781 EHHEPSKLEVALEVKSLRRIISAYLNYSDAAEEDVKRYEGSFSKLPPAHKVLLSHLPSKF 1602
            E    S LE ALE++SLRRIISAYLNY+ AAEEDV RYE SF KLPP+HK LL H   K 
Sbjct: 3    ERTNNSGLEEALEIQSLRRIISAYLNYAAAAEEDVHRYERSFHKLPPSHKALLPHYLYKC 62

Query: 1601 QRLRRCITVNSFFIFSMLQAFEPPLDMSQDTDICXXXXXXXXXXXHISGERNASSAQSAS 1422
            +RLR CI+ N + I +MLQAFEPP+DM+++ +               +GE N       +
Sbjct: 63   RRLRWCISENGYVILNMLQAFEPPIDMTRNLE---------------TGE-NEPPVSICT 106

Query: 1421 TSGRACCSETDHERYGAHDNLNYISPEERFANQXXXXXXXDCVTGSTSISLDCAKQSES- 1245
            T     CS TD    G    LN  S ++            D  T   + +  C+    + 
Sbjct: 107  TLNEGSCS-TD----GESTRLNQTSLKKANIEASSMGNFRDSDTRIHTATCSCSSACGAC 161

Query: 1244 PNISDVQVVENHASTISDSNGNVNLSQHDWLDPSLQLNVPLVDVDKVRCIIRNIVRDWAQ 1065
             + + V   + +A ++S++NG  +    DW DPS QLNVPLVDVDKVRCIIRNIVRDWA 
Sbjct: 162  DDKTKVFGCQTYAYSVSEANGKGDCPTFDWWDPSFQLNVPLVDVDKVRCIIRNIVRDWAP 221

Query: 1064 EGQREREQCYKPILEELDRIFPSRSTDRPPCCLVPGAGLGRLALEISCLGFVSQGNEFSY 885
            EGQRER+QCY+PILEELDR+FP+R  DRPP CLVPGAGLGRLALEISCLGF+SQGNEFSY
Sbjct: 222  EGQRERDQCYRPILEELDRLFPTRRKDRPPSCLVPGAGLGRLALEISCLGFISQGNEFSY 281

Query: 884  YMMICSSFILNHAQMVGEWTIYPWIHSNCNSLSDKDQLRPVSIPDIHPASAGITEGFSMC 705
            YMMICSSFILNH Q   EWTIYPWIHSNCNSLSD+DQLRPV  PDIHPASAGIT+GFSMC
Sbjct: 282  YMMICSSFILNHTQRQREWTIYPWIHSNCNSLSDRDQLRPVEFPDIHPASAGITDGFSMC 341

Query: 704  GGDFVEVYRDPSQKGAWDAVVTCFFLDTAHNIVEYIEIISNILRDGGVWINLGPLLYHFA 525
            GGDFVEVY D SQ+G+WD VVTCFF+DTAHN+VEYIEIIS ILRDGGVWINLGPLLYHFA
Sbjct: 342  GGDFVEVYGDQSQEGSWDTVVTCFFIDTAHNVVEYIEIISRILRDGGVWINLGPLLYHFA 401

Query: 524  DAYGTEDEMSIELSLEDVKRVAFHYGFHLEREKTIETTYTTNPRSMMQNRYYAAFWTMTK 345
            D+YG+EDEMS+ELSLEDVKR+AF YGF LE E+TIETTYT NPRSMMQN Y+AAFWTM K
Sbjct: 402  DSYGSEDEMSVELSLEDVKRIAFQYGFELEIERTIETTYTANPRSMMQNHYFAAFWTMRK 461

Query: 344  K 342
            K
Sbjct: 462  K 462

