BLASTX nr result

ID: Zanthoxylum22_contig00011713 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00011713
         (1309 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006469004.1| PREDICTED: uncharacterized protein LOC102629...   552   e-154
ref|XP_006469003.1| PREDICTED: uncharacterized protein LOC102629...   552   e-154
gb|KDO37998.1| hypothetical protein CISIN_1g006533mg [Citrus sin...   531   e-148
ref|XP_006446800.1| hypothetical protein CICLE_v10014575mg [Citr...   525   e-146
ref|XP_010104398.1| hypothetical protein L484_010350 [Morus nota...   248   1e-62
ref|XP_011033749.1| PREDICTED: uncharacterized protein LOC105132...   228   1e-56
ref|XP_011033746.1| PREDICTED: uncharacterized protein LOC105132...   227   2e-56
ref|XP_002320413.2| hypothetical protein POPTR_0014s13950g [Popu...   226   4e-56
ref|XP_002528543.1| conserved hypothetical protein [Ricinus comm...   209   6e-51
ref|XP_006582277.1| PREDICTED: dentin sialophosphoprotein-like i...   198   8e-48
gb|KRH25783.1| hypothetical protein GLYMA_12G128800 [Glycine max...   196   4e-47
gb|KHN34238.1| hypothetical protein glysoja_035873 [Glycine soja]     196   4e-47
ref|XP_006593116.1| PREDICTED: uncharacterized protein LOC100808...   196   4e-47
ref|XP_007048702.1| Uncharacterized protein isoform 2 [Theobroma...   194   2e-46
ref|XP_007048701.1| Uncharacterized protein isoform 1 [Theobroma...   194   2e-46
gb|KHG24123.1| Protein arginine N-methyltransferase 7 [Gossypium...   180   3e-42
ref|XP_012437373.1| PREDICTED: uncharacterized protein LOC105763...   179   6e-42
gb|KJB49044.1| hypothetical protein B456_008G099200 [Gossypium r...   179   6e-42
ref|XP_006348878.1| PREDICTED: uncharacterized protein LOC102602...   178   8e-42
gb|KHF99143.1| hypothetical protein F383_20211 [Gossypium arboreum]   178   1e-41

>ref|XP_006469004.1| PREDICTED: uncharacterized protein LOC102629060 isoform X2 [Citrus
            sinensis]
          Length = 649

 Score =  552 bits (1423), Expect = e-154
 Identities = 290/422 (68%), Positives = 319/422 (75%), Gaps = 4/422 (0%)
 Frame = +2

Query: 56   EKNTTKCXXXXXXXXXXXV----DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVK 223
            +KNT KC                DG+E YADALDTLSRT SFFFNCSVSGVSGLD+ E+K
Sbjct: 141  DKNTAKCESSAEVMDETRSSRSEDGNEAYADALDTLSRTASFFFNCSVSGVSGLDDEEMK 200

Query: 224  PSGTFSTDQWTRDFMMGRFLPAAKAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLP 403
            PSGTFS+DQWTRDFMM RFLPAAKAIAS APQH NRKQ + +E PRNIQR+VNMDRRP P
Sbjct: 201  PSGTFSSDQWTRDFMMTRFLPAAKAIASGAPQHKNRKQPVTQELPRNIQRVVNMDRRPPP 260

Query: 404  KQYSPNSLQFHAQDKKWXXXXXXXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRL 583
            KQYSPNSLQFHAQDKKW        + GPGNSSAT+CG LP FC+K+S CLLNPVPGMRL
Sbjct: 261  KQYSPNSLQFHAQDKKWEESDDEDDYDGPGNSSATVCGFLPPFCLKTSFCLLNPVPGMRL 320

Query: 584  QAQEAIELVHRAPAIGSYASSYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNICET 763
            QAQ+A++L HRAPA GSYASSYCEIPEK+KNFKV +SDPERKGSK FREL+VD+S  CET
Sbjct: 321  QAQQAVDLAHRAPARGSYASSYCEIPEKAKNFKVNKSDPERKGSKIFRELLVDESTNCET 380

Query: 764  SLASPVEKTLYIDFXXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASI 943
             LA PVEKTLYID                       +GD  FDA IKS E   P  I + 
Sbjct: 381  GLAIPVEKTLYIDSVHKMNSPKSNSSSLDAKRLSDIQGD-DFDALIKSEETVAPQPIDAS 439

Query: 944  LGDIKVSNVTGEKAISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKT 1123
            L DIKVSNVTGEKAIS  K L  V S FLSSPDR  HGLQ DAKSSSR+DQD+ +NSIK 
Sbjct: 440  LQDIKVSNVTGEKAISHTKCLKPVYSDFLSSPDRCSHGLQADAKSSSREDQDLGKNSIKL 499

Query: 1124 ASPKMDGSEKIDLESHLHKKLSNQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSS 1303
            ASPKMDGSEKIDLESHL KKLSNQ K+HGRILKS+SS  +K+AD  KTD K QPQ+A SS
Sbjct: 500  ASPKMDGSEKIDLESHLRKKLSNQAKAHGRILKSVSSATTKIADGIKTDFKTQPQVAFSS 559

Query: 1304 QE 1309
            QE
Sbjct: 560  QE 561


>ref|XP_006469003.1| PREDICTED: uncharacterized protein LOC102629060 isoform X1 [Citrus
            sinensis]
          Length = 672

 Score =  552 bits (1423), Expect = e-154
 Identities = 290/422 (68%), Positives = 319/422 (75%), Gaps = 4/422 (0%)
 Frame = +2

Query: 56   EKNTTKCXXXXXXXXXXXV----DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVK 223
            +KNT KC                DG+E YADALDTLSRT SFFFNCSVSGVSGLD+ E+K
Sbjct: 141  DKNTAKCESSAEVMDETRSSRSEDGNEAYADALDTLSRTASFFFNCSVSGVSGLDDEEMK 200

Query: 224  PSGTFSTDQWTRDFMMGRFLPAAKAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLP 403
            PSGTFS+DQWTRDFMM RFLPAAKAIAS APQH NRKQ + +E PRNIQR+VNMDRRP P
Sbjct: 201  PSGTFSSDQWTRDFMMTRFLPAAKAIASGAPQHKNRKQPVTQELPRNIQRVVNMDRRPPP 260

Query: 404  KQYSPNSLQFHAQDKKWXXXXXXXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRL 583
            KQYSPNSLQFHAQDKKW        + GPGNSSAT+CG LP FC+K+S CLLNPVPGMRL
Sbjct: 261  KQYSPNSLQFHAQDKKWEESDDEDDYDGPGNSSATVCGFLPPFCLKTSFCLLNPVPGMRL 320

Query: 584  QAQEAIELVHRAPAIGSYASSYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNICET 763
            QAQ+A++L HRAPA GSYASSYCEIPEK+KNFKV +SDPERKGSK FREL+VD+S  CET
Sbjct: 321  QAQQAVDLAHRAPARGSYASSYCEIPEKAKNFKVNKSDPERKGSKIFRELLVDESTNCET 380

Query: 764  SLASPVEKTLYIDFXXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASI 943
             LA PVEKTLYID                       +GD  FDA IKS E   P  I + 
Sbjct: 381  GLAIPVEKTLYIDSVHKMNSPKSNSSSLDAKRLSDIQGD-DFDALIKSEETVAPQPIDAS 439

Query: 944  LGDIKVSNVTGEKAISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKT 1123
            L DIKVSNVTGEKAIS  K L  V S FLSSPDR  HGLQ DAKSSSR+DQD+ +NSIK 
Sbjct: 440  LQDIKVSNVTGEKAISHTKCLKPVYSDFLSSPDRCSHGLQADAKSSSREDQDLGKNSIKL 499

Query: 1124 ASPKMDGSEKIDLESHLHKKLSNQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSS 1303
            ASPKMDGSEKIDLESHL KKLSNQ K+HGRILKS+SS  +K+AD  KTD K QPQ+A SS
Sbjct: 500  ASPKMDGSEKIDLESHLRKKLSNQAKAHGRILKSVSSATTKIADGIKTDFKTQPQVAFSS 559

Query: 1304 QE 1309
            QE
Sbjct: 560  QE 561


>gb|KDO37998.1| hypothetical protein CISIN_1g006533mg [Citrus sinensis]
          Length = 641

 Score =  531 bits (1369), Expect = e-148
 Identities = 283/422 (67%), Positives = 310/422 (73%), Gaps = 4/422 (0%)
 Frame = +2

Query: 56   EKNTTKCXXXXXXXXXXXV----DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVK 223
            +KNT KC                DG+E YADALDTLSRTESFFFNCSVSGVSGLD+ E+K
Sbjct: 141  DKNTAKCESSAEGMDETRSSRSEDGNEAYADALDTLSRTESFFFNCSVSGVSGLDDEEMK 200

Query: 224  PSGTFSTDQWTRDFMMGRFLPAAKAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLP 403
            PSGTFS+DQWTRDFMM RFLPAAKAIAS APQH NRKQ + +E PRNIQR+VNMDRRP P
Sbjct: 201  PSGTFSSDQWTRDFMMTRFLPAAKAIASGAPQHKNRKQPVTQELPRNIQRVVNMDRRPPP 260

Query: 404  KQYSPNSLQFHAQDKKWXXXXXXXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRL 583
            KQYSPNSLQFHAQDKKW          GPGNSSAT+CG LP FC+K+S CLLNPVPGMRL
Sbjct: 261  KQYSPNSLQFHAQDKKWEESDHEDDDDGPGNSSATVCGFLPPFCLKTSFCLLNPVPGMRL 320

Query: 584  QAQEAIELVHRAPAIGSYASSYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNICET 763
            QAQ+A++L HR P  GSYASSYCEIPEK        SDPERKG K FREL+VD+S  CET
Sbjct: 321  QAQQAVDLAHRPPVRGSYASSYCEIPEK--------SDPERKGGKIFRELLVDESTNCET 372

Query: 764  SLASPVEKTLYIDFXXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASI 943
             LASP+EKTLYID                       RGD  FDA IKS E   P +I + 
Sbjct: 373  GLASPIEKTLYIDSVHKMNSPKSNSSSSDAKRLSDIRGD-DFDALIKSEETVAPQSIDAS 431

Query: 944  LGDIKVSNVTGEKAISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKT 1123
            L DIKVSNVTGEKAIS  K L  V S FLSSPDR  HGLQ DAKSSSR+DQD+ +NSIK 
Sbjct: 432  LQDIKVSNVTGEKAISHTKCLEPVYSDFLSSPDRCSHGLQADAKSSSREDQDLGKNSIKL 491

Query: 1124 ASPKMDGSEKIDLESHLHKKLSNQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSS 1303
            ASPKMDGSEKIDLESHL KKLSNQ K+HG ILKS+SS  +K+AD  KTD K QPQ+A SS
Sbjct: 492  ASPKMDGSEKIDLESHLRKKLSNQAKAHGCILKSVSSASTKIADGIKTDFKTQPQVAFSS 551

Query: 1304 QE 1309
            QE
Sbjct: 552  QE 553


>ref|XP_006446800.1| hypothetical protein CICLE_v10014575mg [Citrus clementina]
            gi|557549411|gb|ESR60040.1| hypothetical protein
            CICLE_v10014575mg [Citrus clementina]
          Length = 641

 Score =  525 bits (1351), Expect = e-146
 Identities = 279/422 (66%), Positives = 309/422 (73%), Gaps = 4/422 (0%)
 Frame = +2

Query: 56   EKNTTKCXXXXXXXXXXXV----DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVK 223
            +KNT KC                DG+E YADALDTLSRTESFFFNCSVSGVSGLD+ E+K
Sbjct: 141  DKNTAKCESSAEGMDETRSSRSEDGNEAYADALDTLSRTESFFFNCSVSGVSGLDDEEMK 200

Query: 224  PSGTFSTDQWTRDFMMGRFLPAAKAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLP 403
            PSGTFS+DQWTRDFMM RFLPAAKAIAS APQH NRKQ + +E PRNIQR+VNMDRRP P
Sbjct: 201  PSGTFSSDQWTRDFMMTRFLPAAKAIASGAPQHKNRKQPVTQELPRNIQRVVNMDRRPPP 260

Query: 404  KQYSPNSLQFHAQDKKWXXXXXXXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRL 583
            KQYSPNS+QFHAQDKKW          GPGNSSAT+CG LP FC+K+S CLLNPVPGMRL
Sbjct: 261  KQYSPNSVQFHAQDKKWEESDHEDDDDGPGNSSATVCGFLPPFCLKTSFCLLNPVPGMRL 320

Query: 584  QAQEAIELVHRAPAIGSYASSYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNICET 763
            QAQ+A++L HR P  GSYA+SYCEIPEK        SDPERKG K FREL+VD+S  CET
Sbjct: 321  QAQQAVDLAHRPPVRGSYATSYCEIPEK--------SDPERKGGKIFRELLVDESTNCET 372

Query: 764  SLASPVEKTLYIDFXXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASI 943
             LASPVEKTL ID                       +GD  FDA IKS E   P +  + 
Sbjct: 373  GLASPVEKTLCIDSVHKMNSPKSNSSSSDAKRLSDIQGD-DFDALIKSEETVAPQSTDAS 431

Query: 944  LGDIKVSNVTGEKAISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKT 1123
            L DIKVSNVTGEKAIS  K L  V S FLSSPDR  HGLQ DAK+SSR+DQD+ +NSIK 
Sbjct: 432  LQDIKVSNVTGEKAISHTKCLEPVYSDFLSSPDRCSHGLQADAKNSSREDQDLGKNSIKL 491

Query: 1124 ASPKMDGSEKIDLESHLHKKLSNQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSS 1303
            ASPKMDGSEKIDLESHL KKLSNQ K+HGRILKS+SS  +K+AD  KTD K QPQ+A SS
Sbjct: 492  ASPKMDGSEKIDLESHLRKKLSNQAKAHGRILKSVSSASTKIADGIKTDFKTQPQVAFSS 551

Query: 1304 QE 1309
            QE
Sbjct: 552  QE 553


>ref|XP_010104398.1| hypothetical protein L484_010350 [Morus notabilis]
            gi|587912410|gb|EXC00243.1| hypothetical protein
            L484_010350 [Morus notabilis]
          Length = 775

 Score =  248 bits (632), Expect = 1e-62
 Identities = 183/483 (37%), Positives = 237/483 (49%), Gaps = 84/483 (17%)
 Frame = +2

Query: 113  DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
            DGDETY DALDTLSR+ESFF NCS+SGVSGLD+ +VKPSGTFSTDQ TRDFMMGRFLPAA
Sbjct: 157  DGDETYLDALDTLSRSESFFLNCSISGVSGLDDPDVKPSGTFSTDQQTRDFMMGRFLPAA 216

Query: 293  KAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQDKKW-XXXXX 469
            K +AS+  Q+  RK  + +EQPR I ++V+ D+R       PN L  +AQ+         
Sbjct: 217  KVMASDTHQYALRKPQVVREQPRQINKVVSGDKRRPLNLNKPNRLPPYAQELGGEESEDE 276

Query: 470  XXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVHRAPAIGSYASSY 649
               + G    S  +CGL PRFC+K+S CLLNPVPGM++Q+Q  I  V R PA  S AS+ 
Sbjct: 277  SVTYEGSDILSDKVCGLFPRFCLKNSFCLLNPVPGMKMQSQFPISSVRRVPANSSSASTC 336

Query: 650  CE-------------------------------------IPEKSKNFKVKESD------- 697
             E                                     I +KS + KV +S        
Sbjct: 337  RETKVEHAEHLVYEQKSMVREQTAELNKGKIKLKYKSNGIEDKSDSQKVDQSSLYRHQQG 396

Query: 698  ---------------PERKGSKTFRE----------------------LIVDQSNICETS 766
                           PE+KG    RE                      L+ +++   E  
Sbjct: 397  NGLSLYHSGHSQLKLPEQKGFLGIREKKRNSRERGFDIHKSRRSNFRELLNNENTKLEVG 456

Query: 767  LASP-VEKTLYIDFXXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASI 943
              SP VEKTLYID                       RG+   + P KS + E  H++ S 
Sbjct: 457  SGSPVVEKTLYIDSVHTVKPPSSNSSASDMKSFTDCRGN-DVEIPEKSSDMEDTHSVDSS 515

Query: 944  LGDIKVSNVTGEKAISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKT 1123
            L DIK  +V  EKA + PKSL SVDS F S  ++S    Q+   + S +D+ +  +S   
Sbjct: 516  LQDIKCLSVVDEKATTTPKSLQSVDSCFQSCSNKSTLEKQMHMTNGSIQDEYLIPDSFTL 575

Query: 1124 ASPKMDGSEKIDLESHLHKKLS-NQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALS 1300
             S K+   E  DLES   ++LS  QEK H     SI+ T SK+A + K DLK +  + L 
Sbjct: 576  MSSKVAAQESYDLES---QRLSCEQEKCHDLTKDSITFTSSKIA-ERKIDLKSRQYLGLD 631

Query: 1301 SQE 1309
             QE
Sbjct: 632  YQE 634


>ref|XP_011033749.1| PREDICTED: uncharacterized protein LOC105132125 isoform X2 [Populus
            euphratica]
          Length = 696

 Score =  228 bits (580), Expect = 1e-56
 Identities = 169/459 (36%), Positives = 223/459 (48%), Gaps = 61/459 (13%)
 Frame = +2

Query: 116  GDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAAK 295
            G+E YADALD LSR+ESFF NCS+SGVSGLD  ++KPSG F TDQ  +DFMM RFLPAAK
Sbjct: 156  GEEAYADALDILSRSESFFLNCSISGVSGLDGPDLKPSGAFFTDQHGQDFMMARFLPAAK 215

Query: 296  AIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQ-DKKWXXXXXX 472
            A+ASE PQ   RKQ + +E PR I +  + +R PL  +YSP ++  +AQ D         
Sbjct: 216  AMASETPQCFTRKQPVVRELPRQIAKATSAERHPL-NRYSPYNIPNYAQADAVEDSEDED 274

Query: 473  XXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVHRAPAIGSYASS-- 646
                 P + S  +CGLLP+ C ++SLC +NPV GMR Q Q  I  V    +  S A+S  
Sbjct: 275  HDDDRPDDPSLKLCGLLPQLCSQNSLCFMNPVLGMRKQVQVPISAVCTTKSGSSNAASRN 334

Query: 647  --------------YCEIPEKSKNFKVKESD----------------------------- 697
                            +I  K++N K+ ES                              
Sbjct: 335  VTAHERNDKYEKRESIKIACKTENKKLDESSACKGWHSKVASPTDSQFPQPVHEEQRCTE 394

Query: 698  -PER-------------KGSKTFRELIVDQSNICET-SLASPVEKTLYIDFXXXXXXXXX 832
             P++             KGS  FREL+ ++S   E+ S  S  EKTLYID          
Sbjct: 395  IPDKCKDSVASDFIRCAKGSTIFRELLANESREWESVSAVSVAEKTLYIDSMHMVKPQNS 454

Query: 833  XXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASILGDIKVSNVTGEKAISQPKSLVS 1012
                           D   +  +K+ E E    + S L   K  +  GEK   +P SL S
Sbjct: 455  NSSSSDARGLSECSMD-DVEILVKNREIEENDDVDSSLLGSKCLSTVGEKKKLRPDSLES 513

Query: 1013 VDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKTASPKMDGSEKIDLESHLHKKLSN 1192
            VDS FLS  D+S H + +     SR+D+D  + S    SPK+D   KIDLE    KKL N
Sbjct: 514  VDSCFLSLSDKSIHDVHMAVMGGSRQDEDNMQVSNTLTSPKVDKDGKIDLEIRSDKKLGN 573

Query: 1193 QEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSSQE 1309
             E SH      I  +  ++A DG+ DL+ Q    LS++E
Sbjct: 574  LESSH----VFIQDSNGEVAGDGRIDLESQQCRKLSNKE 608


>ref|XP_011033746.1| PREDICTED: uncharacterized protein LOC105132125 isoform X1 [Populus
            euphratica] gi|743871023|ref|XP_011033747.1| PREDICTED:
            uncharacterized protein LOC105132125 isoform X1 [Populus
            euphratica] gi|743871025|ref|XP_011033748.1| PREDICTED:
            uncharacterized protein LOC105132125 isoform X1 [Populus
            euphratica]
          Length = 698

 Score =  227 bits (578), Expect = 2e-56
 Identities = 169/461 (36%), Positives = 223/461 (48%), Gaps = 63/461 (13%)
 Frame = +2

Query: 116  GDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAAK 295
            G+E YADALD LSR+ESFF NCS+SGVSGLD  ++KPSG F TDQ  +DFMM RFLPAAK
Sbjct: 156  GEEAYADALDILSRSESFFLNCSISGVSGLDGPDLKPSGAFFTDQHGQDFMMARFLPAAK 215

Query: 296  AIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQ-DKKWXXXXXX 472
            A+ASE PQ   RKQ + +E PR I +  + +R PL  +YSP ++  +AQ D         
Sbjct: 216  AMASETPQCFTRKQPVVRELPRQIAKATSAERHPL-NRYSPYNIPNYAQADAVEDSEDED 274

Query: 473  XXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVHRAPAIGSYASS-- 646
                 P + S  +CGLLP+ C ++SLC +NPV GMR Q Q  I  V    +  S A+S  
Sbjct: 275  HDDDRPDDPSLKLCGLLPQLCSQNSLCFMNPVLGMRKQVQVPISAVCTTKSGSSNAASRN 334

Query: 647  ----------------YCEIPEKSKNFKVKESD--------------------------- 697
                              +I  K++N K+ ES                            
Sbjct: 335  VTAHEHQRNDKYEKRESIKIACKTENKKLDESSACKGWHSKVASPTDSQFPQPVHEEQRC 394

Query: 698  ---PER-------------KGSKTFRELIVDQSNICET-SLASPVEKTLYIDFXXXXXXX 826
               P++             KGS  FREL+ ++S   E+ S  S  EKTLYID        
Sbjct: 395  TEIPDKCKDSVASDFIRCAKGSTIFRELLANESREWESVSAVSVAEKTLYIDSMHMVKPQ 454

Query: 827  XXXXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASILGDIKVSNVTGEKAISQPKSL 1006
                             D   +  +K+ E E    + S L   K  +  GEK   +P SL
Sbjct: 455  NSNSSSSDARGLSECSMD-DVEILVKNREIEENDDVDSSLLGSKCLSTVGEKKKLRPDSL 513

Query: 1007 VSVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKTASPKMDGSEKIDLESHLHKKL 1186
             SVDS FLS  D+S H + +     SR+D+D  + S    SPK+D   KIDLE    KKL
Sbjct: 514  ESVDSCFLSLSDKSIHDVHMAVMGGSRQDEDNMQVSNTLTSPKVDKDGKIDLEIRSDKKL 573

Query: 1187 SNQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSSQE 1309
             N E SH      I  +  ++A DG+ DL+ Q    LS++E
Sbjct: 574  GNLESSH----VFIQDSNGEVAGDGRIDLESQQCRKLSNKE 610


>ref|XP_002320413.2| hypothetical protein POPTR_0014s13950g [Populus trichocarpa]
            gi|550324153|gb|EEE98728.2| hypothetical protein
            POPTR_0014s13950g [Populus trichocarpa]
          Length = 698

 Score =  226 bits (575), Expect = 4e-56
 Identities = 168/459 (36%), Positives = 222/459 (48%), Gaps = 61/459 (13%)
 Frame = +2

Query: 116  GDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAAK 295
            G+E YADALD LSR+ESFF NCS+SGVSGLD  ++KPSG F TDQ  +DFMM RFLPAAK
Sbjct: 157  GEEAYADALDILSRSESFFLNCSISGVSGLDGPDLKPSGAFFTDQHGQDFMMARFLPAAK 216

Query: 296  AIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQ-DKKWXXXXXX 472
            A+ASE PQ   RKQ + +E PR I +   ++R PL  +YSPN++  +AQ D         
Sbjct: 217  AMASETPQCFTRKQPVVRELPRQIAKATGVERHPL-NRYSPNNIPNYAQADAVEDSEDED 275

Query: 473  XXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVHRAPAIGSYASS-- 646
                 P + S  +CGLLP+ C ++SLC +NPV GMR Q    I  V    +  S A+S  
Sbjct: 276  CDDDRPDDPSLKLCGLLPQLCSQNSLCFMNPVLGMRKQVPVPISSVCTTKSGSSNAASRN 335

Query: 647  --------------YCEIPEKSKNFKVKESD----------------------------- 697
                            +I  K++N ++ ES                              
Sbjct: 336  VTAHERNAMYEKRESIKIACKTENKRLDESSACKGWHSKVASPTDSQFPQPVHEERRCTE 395

Query: 698  -PER-------------KGSKTFRELIVDQSNICET-SLASPVEKTLYIDFXXXXXXXXX 832
             P++             KGS  FREL+  +S   E+ S  S  EKTLYID          
Sbjct: 396  IPDKCRNSAASDFIQCAKGSTIFRELLATESREWESVSAVSVAEKTLYIDSMHMVKPQNS 455

Query: 833  XXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASILGDIKVSNVTGEKAISQPKSLVS 1012
                           D   +  +K+ E E    + S L D K  +   EK   +P SL S
Sbjct: 456  NSSSSDARGLSECSKD-DVEILVKNREIEETDDVNSSLLDSKHLSTVDEKKKLRPDSLES 514

Query: 1013 VDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKTASPKMDGSEKIDLESHLHKKLSN 1192
            VDS FLS  D+S H + +     SR+D+D  + S    SPK+D   KIDLES   KKL N
Sbjct: 515  VDSCFLSLSDKSIHDVHMAVMDGSRQDEDNMQVSNTLTSPKVDKDGKIDLESRSDKKLGN 574

Query: 1193 QEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSSQE 1309
             E SH      I  +   +A +G+ DL+ Q    LS++E
Sbjct: 575  LESSH----VFIQDSNGVVAGNGRIDLESQQCRKLSNKE 609


>ref|XP_002528543.1| conserved hypothetical protein [Ricinus communis]
           gi|223532045|gb|EEF33855.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 607

 Score =  209 bits (531), Expect = 6e-51
 Identities = 121/263 (46%), Positives = 153/263 (58%), Gaps = 33/263 (12%)
 Frame = +2

Query: 113 DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
           + DETY DALDTLSR+ESFF NCS+SGVSGLD  ++KPSGTFSTD  TRDFMMGRFLPAA
Sbjct: 161 EDDETYVDALDTLSRSESFFLNCSISGVSGLDGPDMKPSGTFSTDPQTRDFMMGRFLPAA 220

Query: 293 KAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQD----KKWXX 460
           KA+ASE PQH+ +KQ  A+EQPR I++ + +++     +    S   H       K+   
Sbjct: 221 KAMASETPQHSTKKQPAAQEQPRQIKKTLGVEKYHPFNECRRQSDMPHCSQCSGVKEIEQ 280

Query: 461 XXXXXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVHRAPAIGSYA 640
                 + GP NSS  +CGL PR C+++S CLL+PVPGMR Q Q  I L H      SYA
Sbjct: 281 EDDDYNYEGPDNSSPKVCGLFPRLCLQNSFCLLSPVPGMRKQVQLPISLSHMTKVKPSYA 340

Query: 641 SSYCE----------------------------IPEKSKNFKVKESDPERKGSKTFRELI 736
           +   E                            +PEK KN   +  +   KG K FREL+
Sbjct: 341 ACCTETMNEGNGTSPYHDKFSQSAVSEEKGFLGVPEKPKNSGARGFNAHAKGGKNFRELL 400

Query: 737 VDQSNICETSLASP-VEKTLYID 802
            ++ N  E++ AS  VEKTLYID
Sbjct: 401 ANERNEWESAPASSLVEKTLYID 423


>ref|XP_006582277.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max]
            gi|571462416|ref|XP_006582278.1| PREDICTED: dentin
            sialophosphoprotein-like isoform X2 [Glycine max]
            gi|947107353|gb|KRH55736.1| hypothetical protein
            GLYMA_06G276800 [Glycine max] gi|947107354|gb|KRH55737.1|
            hypothetical protein GLYMA_06G276800 [Glycine max]
            gi|947107355|gb|KRH55738.1| hypothetical protein
            GLYMA_06G276800 [Glycine max]
          Length = 687

 Score =  198 bits (504), Expect = 8e-48
 Identities = 157/446 (35%), Positives = 218/446 (48%), Gaps = 47/446 (10%)
 Frame = +2

Query: 113  DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
            D DE Y DALDTLSRTESFF +CSVSG+S  D+ EV+ SG FSTDQ  RDFM+GRFLPAA
Sbjct: 161  DEDENYLDALDTLSRTESFFMSCSVSGLSEWDDQEVQLSGNFSTDQQARDFMIGRFLPAA 220

Query: 293  KAIASEAP--QHTNRKQIIAKEQPRNIQRIVN-MDRRPLPKQYSPNSLQFHAQD-KKWXX 460
            KA+ASE P  QH +RK ++ +EQP+   ++V+  + RPL  ++    L  +AQD  +   
Sbjct: 221  KAMASETPQIQHNSRKPLVTQEQPKQAWKVVSGANSRPLNPKWQ-KVLPHYAQDIGREES 279

Query: 461  XXXXXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVH--RAPAIGS 634
                  + G  N++  +CGL PRF      CLLNPVPG+R++ +      H  ++ +I S
Sbjct: 280  EDESDDNDGYENNAPKVCGLFPRF------CLLNPVPGLRMEGRIRSSTFHGVQSKSITS 333

Query: 635  Y--------------------ASSYCE------IPEKSKNFKVKESDPERKGSKTFRELI 736
            +                     S Y E        EKSK+    + DP R+      +  
Sbjct: 334  HRRTAKEHGRTATNGKKSVNSQSGYTEERDFLSTAEKSKH----DIDPHRRACS---KSS 386

Query: 737  VDQSNICETSLASP-VEKTLYIDFXXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGE 913
              +S   E+S  SP VEKTLY+D                       RGD  FD   K  +
Sbjct: 387  ASESTEFESSCDSPVVEKTLYVD-----------SVHKVKSSDTNIRGD-DFDTLRKDTD 434

Query: 914  AEGPHAIASILGDIKVSNVTGEKAISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRK- 1090
             +   +I S + D K   +  EK +S+PKS  S+DS      D S   +Q++ K+ S K 
Sbjct: 435  LDKSLSIDSSIEDSKPLGIVDEKEVSEPKSSASLDSSLPVCSDNSNDNMQMEMKNHSNKI 494

Query: 1091 -------------DQDIDENSIKTASPKMDGSEKIDLESHLHKKLSNQEKSHGRILKSIS 1231
                           ++D + +  +SP M   +KI+ ES   K  S +E S+G I   +S
Sbjct: 495  CPEKQELTKPDYQGSNLDRDLVAISSPDMVACKKIESES---KDFSTKESSNGLIKNPVS 551

Query: 1232 STRSKLADDGKTDLKCQPQMALSSQE 1309
                K A D K D KCQ    +  QE
Sbjct: 552  MRNRKFASDVKFDSKCQQATKVVDQE 577


>gb|KRH25783.1| hypothetical protein GLYMA_12G128800 [Glycine max]
            gi|947076944|gb|KRH25784.1| hypothetical protein
            GLYMA_12G128800 [Glycine max] gi|947076945|gb|KRH25785.1|
            hypothetical protein GLYMA_12G128800 [Glycine max]
          Length = 707

 Score =  196 bits (498), Expect = 4e-47
 Identities = 156/446 (34%), Positives = 219/446 (49%), Gaps = 47/446 (10%)
 Frame = +2

Query: 113  DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
            D DE Y DALDTLSRTESFF +CSVSG+S  D  +V+PSG FSTDQ TRDFM+GRFLPAA
Sbjct: 160  DEDENYLDALDTLSRTESFFMSCSVSGLSEWDGPDVQPSGNFSTDQQTRDFMIGRFLPAA 219

Query: 293  KAIASEAP--QHTNRKQIIAKEQPRNIQRIVN-MDRRPLPKQYSPNSLQFHAQD-KKWXX 460
            KA+ASE P  QH +RK ++ +EQ +  +++ +  + RPL  ++    L  +AQD  +   
Sbjct: 220  KAMASETPQIQHNSRKSLVTQEQLKQARKVESGANSRPLNPKWQ-KVLPHYAQDIGREES 278

Query: 461  XXXXXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVH--RAPAIGS 634
                  H G  N +  +CGL PRF      CLLNPVPG+R++ +     VH  +  +I S
Sbjct: 279  EDESDDHDGYENYAPKVCGLFPRF------CLLNPVPGLRMEGRIPSSTVHGVQGKSITS 332

Query: 635  Y--------------------ASSYCE------IPEKSKNFKVKESDPERKG-SKTFREL 733
            +                     S Y E        EKSK+    + DP R+  SK+    
Sbjct: 333  HRRTAKEHGRTATYGKKSVNSQSGYTEERDFLSTAEKSKH----DIDPHRRACSKSLASE 388

Query: 734  IVDQSNICETSLASPVEKTLYIDFXXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGE 913
              +  + CE+ +   +EKTLY+D                       RGD  FD   K  +
Sbjct: 389  RTEFESSCESPV---IEKTLYVD------SVHKVKTSISCSSDTNLRGD-DFDTLRKDTD 438

Query: 914  AEGPHAIASILGDIKVSNVTGEKAISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRK- 1090
             +   +I   + D K   +  EKA+S+P+   S+DS  L   D S   +Q++ K+ S K 
Sbjct: 439  LDKNLSIDFSIEDSKHLGIVDEKAVSEPEISASLDSSLLVCSDNSNDNMQMEMKNHSNKI 498

Query: 1091 -------------DQDIDENSIKTASPKMDGSEKIDLESHLHKKLSNQEKSHGRILKSIS 1231
                           ++D + +  +SP+M   EKI+ ES   K  S++E S+G I   I+
Sbjct: 499  CPEKQELTKPDYQGSNLDHDLVAISSPEMVAWEKIESES---KGFSSKESSNGLIKNPIA 555

Query: 1232 STRSKLADDGKTDLKCQPQMALSSQE 1309
                K A D K D  CQ    L  QE
Sbjct: 556  WRNRKFASDLKFDSMCQQATKLVDQE 581


>gb|KHN34238.1| hypothetical protein glysoja_035873 [Glycine soja]
          Length = 707

 Score =  196 bits (498), Expect = 4e-47
 Identities = 156/446 (34%), Positives = 219/446 (49%), Gaps = 47/446 (10%)
 Frame = +2

Query: 113  DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
            D DE Y DALDTLSRTESFF +CSVSG+S  D  +V+PSG FSTDQ TRDFM+GRFLPAA
Sbjct: 160  DEDENYLDALDTLSRTESFFMSCSVSGLSEWDGPDVQPSGNFSTDQQTRDFMIGRFLPAA 219

Query: 293  KAIASEAP--QHTNRKQIIAKEQPRNIQRIVN-MDRRPLPKQYSPNSLQFHAQD-KKWXX 460
            KA+ASE P  QH +RK ++ +EQ +  +++ +  + RPL  ++    L  +AQD  +   
Sbjct: 220  KAMASETPQIQHNSRKSLVTQEQLKQARKVESGANSRPLNPKWQ-KVLPHYAQDIGREES 278

Query: 461  XXXXXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVH--RAPAIGS 634
                  H G  N +  +CGL PRF      CLLNPVPG+R++ +     VH  +  +I S
Sbjct: 279  EDESDDHDGYENYAPKVCGLFPRF------CLLNPVPGLRMEGRIPSSTVHGVQGKSITS 332

Query: 635  Y--------------------ASSYCE------IPEKSKNFKVKESDPERKG-SKTFREL 733
            +                     S Y E        EKSK+    + DP R+  SK+    
Sbjct: 333  HRRTAKEHGRTATYGKKSVNSQSGYTEERDFLSTAEKSKH----DIDPHRRACSKSLASE 388

Query: 734  IVDQSNICETSLASPVEKTLYIDFXXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGE 913
              +  + CE+ +   +EKTLY+D                       RGD  FD   K  +
Sbjct: 389  RTEFESSCESPV---IEKTLYVD------SVHKVKTSISCSSDTNLRGD-DFDTLRKDTD 438

Query: 914  AEGPHAIASILGDIKVSNVTGEKAISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRK- 1090
             +   +I   + D K   +  EKA+S+P+   S+DS  L   D S   +Q++ K+ S K 
Sbjct: 439  LDKNLSIDFSIEDSKHLGIVDEKAVSEPEISASLDSSLLVCSDNSNDNMQMEMKNHSNKI 498

Query: 1091 -------------DQDIDENSIKTASPKMDGSEKIDLESHLHKKLSNQEKSHGRILKSIS 1231
                           ++D + +  +SP+M   EKI+ ES   K  S++E S+G I   I+
Sbjct: 499  CPEKQELTKPDYQGSNLDHDLVAISSPEMVAWEKIESES---KGFSSKESSNGLIKNPIA 555

Query: 1232 STRSKLADDGKTDLKCQPQMALSSQE 1309
                K A D K D  CQ    L  QE
Sbjct: 556  WRNRKFASDLKFDSMCQQATKLVDQE 581


>ref|XP_006593116.1| PREDICTED: uncharacterized protein LOC100808447, partial [Glycine
            max]
          Length = 700

 Score =  196 bits (498), Expect = 4e-47
 Identities = 156/446 (34%), Positives = 219/446 (49%), Gaps = 47/446 (10%)
 Frame = +2

Query: 113  DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
            D DE Y DALDTLSRTESFF +CSVSG+S  D  +V+PSG FSTDQ TRDFM+GRFLPAA
Sbjct: 153  DEDENYLDALDTLSRTESFFMSCSVSGLSEWDGPDVQPSGNFSTDQQTRDFMIGRFLPAA 212

Query: 293  KAIASEAP--QHTNRKQIIAKEQPRNIQRIVN-MDRRPLPKQYSPNSLQFHAQD-KKWXX 460
            KA+ASE P  QH +RK ++ +EQ +  +++ +  + RPL  ++    L  +AQD  +   
Sbjct: 213  KAMASETPQIQHNSRKSLVTQEQLKQARKVESGANSRPLNPKWQ-KVLPHYAQDIGREES 271

Query: 461  XXXXXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVH--RAPAIGS 634
                  H G  N +  +CGL PRF      CLLNPVPG+R++ +     VH  +  +I S
Sbjct: 272  EDESDDHDGYENYAPKVCGLFPRF------CLLNPVPGLRMEGRIPSSTVHGVQGKSITS 325

Query: 635  Y--------------------ASSYCE------IPEKSKNFKVKESDPERKG-SKTFREL 733
            +                     S Y E        EKSK+    + DP R+  SK+    
Sbjct: 326  HRRTAKEHGRTATYGKKSVNSQSGYTEERDFLSTAEKSKH----DIDPHRRACSKSLASE 381

Query: 734  IVDQSNICETSLASPVEKTLYIDFXXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGE 913
              +  + CE+ +   +EKTLY+D                       RGD  FD   K  +
Sbjct: 382  RTEFESSCESPV---IEKTLYVD------SVHKVKTSISCSSDTNLRGD-DFDTLRKDTD 431

Query: 914  AEGPHAIASILGDIKVSNVTGEKAISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRK- 1090
             +   +I   + D K   +  EKA+S+P+   S+DS  L   D S   +Q++ K+ S K 
Sbjct: 432  LDKNLSIDFSIEDSKHLGIVDEKAVSEPEISASLDSSLLVCSDNSNDNMQMEMKNHSNKI 491

Query: 1091 -------------DQDIDENSIKTASPKMDGSEKIDLESHLHKKLSNQEKSHGRILKSIS 1231
                           ++D + +  +SP+M   EKI+ ES   K  S++E S+G I   I+
Sbjct: 492  CPEKQELTKPDYQGSNLDHDLVAISSPEMVAWEKIESES---KGFSSKESSNGLIKNPIA 548

Query: 1232 STRSKLADDGKTDLKCQPQMALSSQE 1309
                K A D K D  CQ    L  QE
Sbjct: 549  WRNRKFASDLKFDSMCQQATKLVDQE 574


>ref|XP_007048702.1| Uncharacterized protein isoform 2 [Theobroma cacao]
           gi|508700963|gb|EOX92859.1| Uncharacterized protein
           isoform 2 [Theobroma cacao]
          Length = 723

 Score =  194 bits (492), Expect = 2e-46
 Identities = 108/217 (49%), Positives = 138/217 (63%), Gaps = 3/217 (1%)
 Frame = +2

Query: 113 DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
           D DE Y DALDT SRTESFF NCS+SGVSG D  E+KPSG F+TD  TRDFMMGRFLPAA
Sbjct: 156 DSDEAYVDALDTFSRTESFFLNCSISGVSGFDGPEIKPSGIFTTDPQTRDFMMGRFLPAA 215

Query: 293 KAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQDKKWXXXXXX 472
           KA+ASE P + +RKQ +A+E  R ++++V +D++      SPN    HAQD         
Sbjct: 216 KAVASEIPPYASRKQPVAREPQRQVKKVVIVDKQQPLYVSSPNKFPNHAQDDWLEESEGE 275

Query: 473 XXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVH---RAPAIGSYAS 643
             + G  NSSA +CGL P+F +KSS CLLNPVPGM++QAQ+  +  H   R  A  SY  
Sbjct: 276 DDYSGSQNSSAKVCGLFPQFLLKSSFCLLNPVPGMKIQAQKPAKPAHSVRRRQAKSSYLR 335

Query: 644 SYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNI 754
           S  E   +S+  K        + S+T  ELI D++N+
Sbjct: 336 SGNE--TESEYAKAATEKGLTRISRT-EELIEDKNNL 369



 Score =  122 bits (307), Expect = 5e-25
 Identities = 83/220 (37%), Positives = 118/220 (53%), Gaps = 2/220 (0%)
 Frame = +2

Query: 656  IPEKSKNFKVKESDPERKGSKTFRELIVDQSNICETSLASPV-EKTLYIDFXXXXXXXXX 832
            IPEK+KN+ V   DP +KGS  F+EL+  QS   E+ L SPV EKTLY+D          
Sbjct: 419  IPEKAKNYGVSSIDPLKKGSNNFQELLALQSKYQESGLDSPVVEKTLYVDSVHKVISTNP 478

Query: 833  XXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASILGDIKVSN-VTGEKAISQPKSLV 1009
                           +      +K G+ E   ++ S+L DIK  N V  +K I Q KSL 
Sbjct: 479  YFSATKTAQGMEDDSEIV----VKPGKVEETPSVDSLLQDIKHLNCVVDDKVIVQRKSLE 534

Query: 1010 SVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKTASPKMDGSEKIDLESHLHKKLS 1189
            SVDS  L   ++    +++DA + SR+DQD+ ++S K  S  +  ++K D+ES LH KLS
Sbjct: 535  SVDSYSLFPSEKYAPEMELDATNGSRRDQDLIKDSCKLTSLNVTDNKKDDMESQLHVKLS 594

Query: 1190 NQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSSQE 1309
             +E SHG +  SI+ T+SK+    K  L+   Q   S+QE
Sbjct: 595  YRETSHGLVQDSITLTKSKVGGRRKIGLESHLQKKSSNQE 634


>ref|XP_007048701.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508700962|gb|EOX92858.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 759

 Score =  194 bits (492), Expect = 2e-46
 Identities = 108/217 (49%), Positives = 138/217 (63%), Gaps = 3/217 (1%)
 Frame = +2

Query: 113 DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
           D DE Y DALDT SRTESFF NCS+SGVSG D  E+KPSG F+TD  TRDFMMGRFLPAA
Sbjct: 192 DSDEAYVDALDTFSRTESFFLNCSISGVSGFDGPEIKPSGIFTTDPQTRDFMMGRFLPAA 251

Query: 293 KAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQDKKWXXXXXX 472
           KA+ASE P + +RKQ +A+E  R ++++V +D++      SPN    HAQD         
Sbjct: 252 KAVASEIPPYASRKQPVAREPQRQVKKVVIVDKQQPLYVSSPNKFPNHAQDDWLEESEGE 311

Query: 473 XXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVH---RAPAIGSYAS 643
             + G  NSSA +CGL P+F +KSS CLLNPVPGM++QAQ+  +  H   R  A  SY  
Sbjct: 312 DDYSGSQNSSAKVCGLFPQFLLKSSFCLLNPVPGMKIQAQKPAKPAHSVRRRQAKSSYLR 371

Query: 644 SYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNI 754
           S  E   +S+  K        + S+T  ELI D++N+
Sbjct: 372 SGNE--TESEYAKAATEKGLTRISRT-EELIEDKNNL 405



 Score =  122 bits (307), Expect = 5e-25
 Identities = 83/220 (37%), Positives = 118/220 (53%), Gaps = 2/220 (0%)
 Frame = +2

Query: 656  IPEKSKNFKVKESDPERKGSKTFRELIVDQSNICETSLASPV-EKTLYIDFXXXXXXXXX 832
            IPEK+KN+ V   DP +KGS  F+EL+  QS   E+ L SPV EKTLY+D          
Sbjct: 455  IPEKAKNYGVSSIDPLKKGSNNFQELLALQSKYQESGLDSPVVEKTLYVDSVHKVISTNP 514

Query: 833  XXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASILGDIKVSN-VTGEKAISQPKSLV 1009
                           +      +K G+ E   ++ S+L DIK  N V  +K I Q KSL 
Sbjct: 515  YFSATKTAQGMEDDSEIV----VKPGKVEETPSVDSLLQDIKHLNCVVDDKVIVQRKSLE 570

Query: 1010 SVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKTASPKMDGSEKIDLESHLHKKLS 1189
            SVDS  L   ++    +++DA + SR+DQD+ ++S K  S  +  ++K D+ES LH KLS
Sbjct: 571  SVDSYSLFPSEKYAPEMELDATNGSRRDQDLIKDSCKLTSLNVTDNKKDDMESQLHVKLS 630

Query: 1190 NQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSSQE 1309
             +E SHG +  SI+ T+SK+    K  L+   Q   S+QE
Sbjct: 631  YRETSHGLVQDSITLTKSKVGGRRKIGLESHLQKKSSNQE 670


>gb|KHG24123.1| Protein arginine N-methyltransferase 7 [Gossypium arboreum]
          Length = 708

 Score =  180 bits (456), Expect = 3e-42
 Identities = 104/216 (48%), Positives = 132/216 (61%), Gaps = 2/216 (0%)
 Frame = +2

Query: 113 DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
           D  E Y DALDTLSR+ESFF NCS+SGVSGLD +++KPSGTFS+D  TRDFMMGRFLPAA
Sbjct: 162 DSGEAYVDALDTLSRSESFFLNCSISGVSGLDGSDIKPSGTFSSDPQTRDFMMGRFLPAA 221

Query: 293 KAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQDKKWXXXXXX 472
           KA+ASE P +  +KQ IA+E PR I+++V  D++      SPN    HAQD  W      
Sbjct: 222 KAVASETPPYATKKQPIAREPPRQIKKLVIADKQQPLYASSPNKFT-HAQD-DWSEESED 279

Query: 473 XXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQ--AQEAIELVHRAPAIGSYASS 646
             +    N S  +CGL P+F +K+SLCLLNP+PG++ Q  AQ A    HR  A  SY  S
Sbjct: 280 DCYSDSQNFSVNVCGLFPQFLLKNSLCLLNPIPGVKAQKSAQTAYS-DHRREAKSSYLRS 338

Query: 647 YCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNI 754
             E   +      K+      G     E I D++N+
Sbjct: 339 CNETETEHSEAAGKK---RLTGIAQTEEAIEDKNNL 371



 Score = 79.3 bits (194), Expect = 7e-12
 Identities = 67/227 (29%), Positives = 100/227 (44%), Gaps = 1/227 (0%)
 Frame = +2

Query: 629  GSYASSYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNICETSLASPVEKTLYIDFX 808
            G     +  IP+K+KN++V   DP + GSK  +E +  +    E+  ASPVEKTLY+D  
Sbjct: 412  GHQEKRFLGIPDKAKNYRVSSFDPHKPGSKNLQECLASECISQESGSASPVEKTLYVDSV 471

Query: 809  XXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASILGDIK-VSNVTGEKA 985
                                       +  +  GE E   ++ S L   K +++V  EK 
Sbjct: 472  QRGISSNSCFPDETASCMKDG-----LEILVNPGEMEENPSVDSSLKHTKHLNHVVDEKT 526

Query: 986  ISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKTASPKMDGSEKIDLE 1165
                K + SVD   L  P++    LQ+DA    R+DQD+ ++S K     ++        
Sbjct: 527  PVLHKCMESVDPYSLLLPEKYAPYLQMDATDGVRRDQDLIQDSSKLTFLNVN-------- 578

Query: 1166 SHLHKKLSNQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSSQ 1306
                 KLS Q+     I  S   T SK A+  K DL+ QPQ+  S+Q
Sbjct: 579  ---EFKLSQQD----LIQHSNKFTNSKAAECRKADLESQPQIKSSNQ 618


>ref|XP_012437373.1| PREDICTED: uncharacterized protein LOC105763636 [Gossypium
           raimondii] gi|823207534|ref|XP_012437375.1| PREDICTED:
           uncharacterized protein LOC105763636 [Gossypium
           raimondii] gi|823207537|ref|XP_012437376.1| PREDICTED:
           uncharacterized protein LOC105763636 [Gossypium
           raimondii] gi|823207540|ref|XP_012437377.1| PREDICTED:
           uncharacterized protein LOC105763636 [Gossypium
           raimondii] gi|763781974|gb|KJB49045.1| hypothetical
           protein B456_008G099200 [Gossypium raimondii]
          Length = 708

 Score =  179 bits (453), Expect = 6e-42
 Identities = 102/217 (47%), Positives = 134/217 (61%), Gaps = 3/217 (1%)
 Frame = +2

Query: 113 DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
           D  E Y DALDTLSR+ESFF NCS+SGVSGLD +++KPSGTFS+D  TRDFMMGRFLPAA
Sbjct: 162 DSGEAYVDALDTLSRSESFFLNCSISGVSGLDGSDIKPSGTFSSDPQTRDFMMGRFLPAA 221

Query: 293 KAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQDKKWXXXXXX 472
           KA+ASE P +  +KQ IA+E PR I+++V  D++      SPN    HAQD  W      
Sbjct: 222 KAVASETPPYATKKQPIAREPPRQIKKLVIADKQQPLYASSPNKFP-HAQD-DWSEESED 279

Query: 473 XXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELV---HRAPAIGSYAS 643
             +    N S  +CGL P+F +K+SLCLLNP+P  R++AQ++++     HR  A  SY  
Sbjct: 280 DCYSDSQNYSVNVCGLFPQFLLKNSLCLLNPIP--RVKAQKSVKTAYSDHRREAKSSYLR 337

Query: 644 SYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNI 754
           S  E   +      K+      G     E I D++N+
Sbjct: 338 SCNETETEHTEAAGKK---RLTGIAQTEEAIEDKNNL 371



 Score = 84.7 bits (208), Expect = 2e-13
 Identities = 69/228 (30%), Positives = 103/228 (45%), Gaps = 1/228 (0%)
 Frame = +2

Query: 626  IGSYASSYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNICETSLASPVEKTLYIDF 805
            +G     +  IP+K+KN++V   DP ++GSK  +E +  +S   E+  ASPVEKTLY+D 
Sbjct: 411  LGHQEKRFLGIPDKAKNYRVSSIDPHKQGSKNLQECLASESISQESGSASPVEKTLYVDS 470

Query: 806  XXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASILGDIK-VSNVTGEK 982
                                        +  +  GE E   ++ S L   K +++V  EK
Sbjct: 471  VQRGISSNSSFPDETASCMKDG-----LEILVNPGEMEENPSVDSSLKHTKHLNHVVDEK 525

Query: 983  AISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKTASPKMDGSEKIDL 1162
               Q K + SVD   L SP++     Q+DA    R+DQD+  +S K     ++       
Sbjct: 526  TPVQHKCMESVDPYSLLSPEKYAPYWQMDATDGFRRDQDLIRDSSKLTFLNVN------- 578

Query: 1163 ESHLHKKLSNQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSSQ 1306
                  KLS Q+     I  S   T SK A+  K DL+ QPQ+  S+Q
Sbjct: 579  ----EFKLSQQD----LIQHSNKFTNSKAAECRKADLESQPQIKSSNQ 618


>gb|KJB49044.1| hypothetical protein B456_008G099200 [Gossypium raimondii]
          Length = 717

 Score =  179 bits (453), Expect = 6e-42
 Identities = 102/217 (47%), Positives = 134/217 (61%), Gaps = 3/217 (1%)
 Frame = +2

Query: 113 DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
           D  E Y DALDTLSR+ESFF NCS+SGVSGLD +++KPSGTFS+D  TRDFMMGRFLPAA
Sbjct: 171 DSGEAYVDALDTLSRSESFFLNCSISGVSGLDGSDIKPSGTFSSDPQTRDFMMGRFLPAA 230

Query: 293 KAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQDKKWXXXXXX 472
           KA+ASE P +  +KQ IA+E PR I+++V  D++      SPN    HAQD  W      
Sbjct: 231 KAVASETPPYATKKQPIAREPPRQIKKLVIADKQQPLYASSPNKFP-HAQD-DWSEESED 288

Query: 473 XXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELV---HRAPAIGSYAS 643
             +    N S  +CGL P+F +K+SLCLLNP+P  R++AQ++++     HR  A  SY  
Sbjct: 289 DCYSDSQNYSVNVCGLFPQFLLKNSLCLLNPIP--RVKAQKSVKTAYSDHRREAKSSYLR 346

Query: 644 SYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNI 754
           S  E   +      K+      G     E I D++N+
Sbjct: 347 SCNETETEHTEAAGKK---RLTGIAQTEEAIEDKNNL 380



 Score = 84.7 bits (208), Expect = 2e-13
 Identities = 69/228 (30%), Positives = 103/228 (45%), Gaps = 1/228 (0%)
 Frame = +2

Query: 626  IGSYASSYCEIPEKSKNFKVKESDPERKGSKTFRELIVDQSNICETSLASPVEKTLYIDF 805
            +G     +  IP+K+KN++V   DP ++GSK  +E +  +S   E+  ASPVEKTLY+D 
Sbjct: 420  LGHQEKRFLGIPDKAKNYRVSSIDPHKQGSKNLQECLASESISQESGSASPVEKTLYVDS 479

Query: 806  XXXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSGEAEGPHAIASILGDIK-VSNVTGEK 982
                                        +  +  GE E   ++ S L   K +++V  EK
Sbjct: 480  VQRGISSNSSFPDETASCMKDG-----LEILVNPGEMEENPSVDSSLKHTKHLNHVVDEK 534

Query: 983  AISQPKSLVSVDSIFLSSPDRSPHGLQIDAKSSSRKDQDIDENSIKTASPKMDGSEKIDL 1162
               Q K + SVD   L SP++     Q+DA    R+DQD+  +S K     ++       
Sbjct: 535  TPVQHKCMESVDPYSLLSPEKYAPYWQMDATDGFRRDQDLIRDSSKLTFLNVN------- 587

Query: 1163 ESHLHKKLSNQEKSHGRILKSISSTRSKLADDGKTDLKCQPQMALSSQ 1306
                  KLS Q+     I  S   T SK A+  K DL+ QPQ+  S+Q
Sbjct: 588  ----EFKLSQQD----LIQHSNKFTNSKAAECRKADLESQPQIKSSNQ 627


>ref|XP_006348878.1| PREDICTED: uncharacterized protein LOC102602497 isoform X3 [Solanum
            tuberosum]
          Length = 593

 Score =  178 bits (452), Expect = 8e-42
 Identities = 131/379 (34%), Positives = 187/379 (49%), Gaps = 11/379 (2%)
 Frame = +2

Query: 113  DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
            D DE Y DA +TLSRTESFF NCSVSG+SGLDE E KPSGT   D   RDFM+ RFLPAA
Sbjct: 163  DADEVYMDAPNTLSRTESFFVNCSVSGLSGLDEPEAKPSGTSLRDPQARDFMIDRFLPAA 222

Query: 293  KAIAS----EAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQDKKWXX 460
            KA+AS    E P +  RKQ   +EQPR  +++VN D+RP   +Y P+    ++Q      
Sbjct: 223  KAMASEKSLEMPHYAPRKQPAVQEQPRQPKKVVNGDKRP-QLRYGPSFALRYSQFHDNYE 281

Query: 461  XXXXXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVHRAPAIGSYA 640
                   C  GN    +CGLLPRFC+KSS CL+NPVPGM  + +  +    R     S  
Sbjct: 282  EESDDDSCYDGNLPTKVCGLLPRFCLKSSFCLMNPVPGMSARTRVPMSPASRTQTGSSST 341

Query: 641  SSYCEIPEKSKNFKVK---ESDPERKGSKTFRELIVDQSNICETSLASPV-EKTLYIDFX 808
            +S      +SK+  V            +  F+EL   Q+   E  L +P+ EKTL++D  
Sbjct: 342  ASCSGSENESKSESVAGFVTVHAHEDTNDCFQELFEYQNIAGEADLTAPLAEKTLHVDIV 401

Query: 809  XXXXXXXXXXXXXXXXXXXXXRGDYFFDAPIKSG--EAEGPHAIASILGDIKVSNVTGEK 982
                                    +  ++PI     +AE P  +     DI       +K
Sbjct: 402  ------------------------HKVESPIMKSPPKAERPFNLQDENQDIL------KK 431

Query: 983  AISQPKSLVSVDSIF-LSSPDRSPHGLQIDAKSSSRKDQDIDENSIKTASPKMDGSEKID 1159
               Q  S+ S  S F  SS ++  +G ++ A ++S + QD  ++++ +  PK D      
Sbjct: 432  MAEQKPSVDSSLSYFPPSSAEKLNNGGEMMALTASEQAQDHYQDAVASVKPKDDVKRSTR 491

Query: 1160 LESHLHKKLSNQEKSHGRI 1216
             ++   +KL N   +H ++
Sbjct: 492  KQTVRKEKLQNSRVAHSKL 510


>gb|KHF99143.1| hypothetical protein F383_20211 [Gossypium arboreum]
          Length = 740

 Score =  178 bits (451), Expect = 1e-41
 Identities = 96/208 (46%), Positives = 129/208 (62%), Gaps = 3/208 (1%)
 Frame = +2

Query: 113 DGDETYADALDTLSRTESFFFNCSVSGVSGLDEAEVKPSGTFSTDQWTRDFMMGRFLPAA 292
           D D+ Y+DALDTLS T+SF  NCS+SG+SGLD    KPSGTFSTD  T+DFM+ RFLPAA
Sbjct: 152 DDDDVYSDALDTLSPTDSFSMNCSISGLSGLDGLVAKPSGTFSTDPQTQDFMLRRFLPAA 211

Query: 293 KAIASEAPQHTNRKQIIAKEQPRNIQRIVNMDRRPLPKQYSPNSLQFHAQD-KKWXXXXX 469
           KA+A E PQ++ RKQ  A EQPR ++++V  DR+PL  QY    + +H QD  +      
Sbjct: 212 KAMALETPQYSLRKQSTAPEQPREVKKLVVADRKPLVNQYETAIVPYHNQDVDEEETDDE 271

Query: 470 XXXHCGPGNSSATICGLLPRFCMKSSLCLLNPVPGMRLQAQEAIELVH--RAPAIGSYAS 643
              +   GN S   CGLLPR C K+SLCLLNPVPG++++   ++        P   +Y  
Sbjct: 272 SIDYQDSGNLSRKACGLLPRLCFKNSLCLLNPVPGLKVRTHSSMSSTRDVAKPCKATYLK 331

Query: 644 SYCEIPEKSKNFKVKESDPERKGSKTFR 727
           S+ +I EK+    V   D   +G ++ R
Sbjct: 332 SHSQIAEKNAR-DVVHKDKSARGVRSPR 358

