BLASTX nr result

ID: Sinomenium22_contig00010512 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00010512
         (1454 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274593.2| PREDICTED: uncharacterized protein LOC100248...   260   9e-67
ref|XP_007207901.1| hypothetical protein PRUPE_ppa020794mg [Prun...   247   1e-62
emb|CBI32667.3| unnamed protein product [Vitis vinifera]              227   9e-57
ref|XP_006480729.1| PREDICTED: uncharacterized protein LOC102617...   219   3e-54
ref|XP_004305226.1| PREDICTED: uncharacterized protein LOC101298...   218   4e-54
ref|XP_006429000.1| hypothetical protein CICLE_v10011022mg [Citr...   217   1e-53
ref|XP_007027126.1| Uncharacterized protein isoform 2 [Theobroma...   216   2e-53
ref|XP_007027125.1| Uncharacterized protein isoform 1 [Theobroma...   216   2e-53
ref|XP_002308481.2| hypothetical protein POPTR_0006s23020g [Popu...   190   1e-45
ref|XP_007144479.1| hypothetical protein PHAVU_007G159500g [Phas...   189   3e-45
ref|XP_006594085.1| PREDICTED: uncharacterized protein LOC100794...   187   1e-44
ref|XP_006594084.1| PREDICTED: uncharacterized protein LOC100794...   187   1e-44
ref|XP_003541395.1| PREDICTED: uncharacterized protein LOC100794...   187   1e-44
gb|EXB67881.1| hypothetical protein L484_008898 [Morus notabilis]     186   2e-44
ref|XP_007162683.1| hypothetical protein PHAVU_001G171300g [Phas...   176   2e-41
ref|XP_006576988.1| PREDICTED: uncharacterized protein LOC100801...   173   2e-40
ref|XP_006576987.1| PREDICTED: uncharacterized protein LOC100801...   173   2e-40
ref|XP_006588732.1| PREDICTED: uncharacterized protein LOC100797...   168   5e-39
ref|XP_006588731.1| PREDICTED: uncharacterized protein LOC100797...   168   5e-39
ref|XP_003536963.1| PREDICTED: uncharacterized protein LOC100797...   168   5e-39

>ref|XP_002274593.2| PREDICTED: uncharacterized protein LOC100248303 [Vitis vinifera]
          Length = 984

 Score =  260 bits (665), Expect = 9e-67
 Identities = 180/485 (37%), Positives = 245/485 (50%), Gaps = 2/485 (0%)
 Frame = +3

Query: 6    PPSPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAH 185
            PP P    ITV +S N+  Y       K  R    +N   S  ++ +D  SHS+ KH AH
Sbjct: 248  PPQPHCRRITVSKSSNSPKYENNATGWKSKRGTSRKNDISSPQKHHDDHFSHSYGKHDAH 307

Query: 186  VSSKVSKSQPYERDDTHLLQVTKSHPYE-RDDTRLLPTRIVVLKPNLRKALNPARPVXXX 362
             S   S+ Q                 +E RD+T +LPTRIVVLKPNL K L+ ++ +   
Sbjct: 308  KSLHPSRIQ-----------------FEGRDETSVLPTRIVVLKPNLGKVLSSSKSISSP 350

Query: 363  XXXXXXXXXYRKQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRC 542
                       K     + ++ + ++  E + S  E    +H+ R SREIAKE+TR+MR 
Sbjct: 351  RSSYDFLSDCGKH----TGSMSIRNKEAELQGSN-EMGFSRHKSRESREIAKEVTRRMRN 405

Query: 543  PANYGSMKFSSSRLRGYAGDESSCTMSGDDSENE-FDTMTPSRHFSDWKGRYXXXXXXXX 719
                GSM FSS+  RGYAGDESSC MSG+DS +E  +T+  SR+  D   RY        
Sbjct: 406  SITNGSMNFSSAGFRGYAGDESSC-MSGNDSLSEPEETVLISRNSFDRSSRYRASSSHST 464

Query: 720  XXXXXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFR 899
                 REA+KRLSERWK+T R Q+   V RGSTLAEMLA+ D+E++S   D ++GQ    
Sbjct: 465  ESSVSREARKRLSERWKMTRRFQEVGAVNRGSTLAEMLAISDKEVRSENLDSMIGQGGCS 524

Query: 900  DRLVRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNN 1079
            +   R DG + WA+PLGISS DGWKD C                FG P  S+ HE+  + 
Sbjct: 525  NSFSRNDGTSEWASPLGISSMDGWKDGCGRHLSRSRSLPASSDVFGSPKASMHHETQVDG 584

Query: 1080 KRSMLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSVTYPGEECKHTGQENHHL 1259
               M KE +  G N+  +G    +E  L   N+K   KKS S      E   T QE  + 
Sbjct: 585  WYLMSKEVMNRGRNRTIRGSIGPKES-LSSRNLKCSSKKSQSSRDKSREHNDTLQE-IYF 642

Query: 1260 NPDEPLPSLEEKNLSEQKPLILEVFGDGMEEKMRVTDETKVSRYKDVEMSLEIREEQQLE 1439
            N +E   +L+EK  SE+KP+I E       +   V D T V   +++ MS E  +E   E
Sbjct: 643  NHNEMKCNLDEKGPSEEKPMISETSAYNATDTNLVVD-TIVDEQENMAMSSESPDESLRE 701

Query: 1440 AAACV 1454
             + C+
Sbjct: 702  LSTCI 706


>ref|XP_007207901.1| hypothetical protein PRUPE_ppa020794mg [Prunus persica]
            gi|462403543|gb|EMJ09100.1| hypothetical protein
            PRUPE_ppa020794mg [Prunus persica]
          Length = 910

 Score =  247 bits (630), Expect = 1e-62
 Identities = 162/443 (36%), Positives = 233/443 (52%), Gaps = 2/443 (0%)
 Frame = +3

Query: 6    PPSPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAH 185
            PPS + GHI  ++S  A  Y   D+     R+   +N   S  ++ +   SHS  +H  H
Sbjct: 166  PPS-RCGHIASMKSSEAQRYENIDLGWTAVRETPRKNNCKSPQEHRDSFSSHSDSRHAGH 224

Query: 186  VSSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXX 365
             S K S          +L +V       ++++ + PTRIVVLKPNL K LN  + +    
Sbjct: 225  SSLKSS---------INLSEV-------KNESSIPPTRIVVLKPNLGKMLNGTKTISSPC 268

Query: 366  XXXXXXXXYRKQ-EHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRC 542
                     RK  E  S  N E  +E R R++S  +   ++H+ R SRE+AKEITRQMR 
Sbjct: 269  SSHASMLDGRKHAEFPSIRNRE--TESRGRKNSQDKDGHLRHKSRESREVAKEITRQMRN 326

Query: 543  PANYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMT-PSRHFSDWKGRYXXXXXXXX 719
              + GS++FSSS L+GYAGDESSC+MS ++S NE + M+  SRH                
Sbjct: 327  NFSTGSVRFSSSGLKGYAGDESSCSMSENESANESEVMSVASRHSFHLNNHSRPSSSCST 386

Query: 720  XXXXXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFR 899
                 REAKKRLSERWK+TH+ Q+   V RG+TLAEMLA+PD+E+++   + ++G+  FR
Sbjct: 387  ESTVSREAKKRLSERWKMTHKSQEMGVVSRGNTLAEMLAIPDKEMRAEKLNAMIGEARFR 446

Query: 900  DRLVRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNN 1079
            D+   +D  AR   PLGISS+DGWKD C              + FG    S+R E++ ++
Sbjct: 447  DKFSTEDAPARCGGPLGISSRDGWKDGCINSLSRSKSLPSSSSAFGSYKTSMRRETIRDD 506

Query: 1080 KRSMLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSVTYPGEECKHTGQENHHL 1259
            +  + KE V H  N+L KG  + REG   + + ++ +K+SYS    G E      E  H 
Sbjct: 507  RYLIPKETVQHERNQLVKGNLDLREG--ARKHSRSSNKRSYSSRSLGREAIDISPET-HT 563

Query: 1260 NPDEPLPSLEEKNLSEQKPLILE 1328
               +     E  N S+Q   + E
Sbjct: 564  TQSKDKTDFEANNQSQQNISVFE 586


>emb|CBI32667.3| unnamed protein product [Vitis vinifera]
          Length = 867

 Score =  227 bits (579), Expect = 9e-57
 Identities = 168/484 (34%), Positives = 229/484 (47%), Gaps = 1/484 (0%)
 Frame = +3

Query: 6    PPSPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAH 185
            PP P    ITV +S N+  Y E +      ++  SR         +ND+           
Sbjct: 177  PPQPHCRRITVSKSSNSPKY-ENNATGWKSKRGTSR---------KNDI----------- 215

Query: 186  VSSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXX 365
                   S P +  D H         + RD+T +LPTRIVVLKPNL K L+ ++ +    
Sbjct: 216  -------SSPQKHHDDH---------FRRDETSVLPTRIVVLKPNLGKVLSSSKSISSPR 259

Query: 366  XXXXXXXXYRKQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCP 545
                      K     + ++ + ++  E + S  E    +H+ R SREIAKE+TR+MR  
Sbjct: 260  SSYDFLSDCGKH----TGSMSIRNKEAELQGSN-EMGFSRHKSRESREIAKEVTRRMRNS 314

Query: 546  ANYGSMKFSSSRLRGYAGDESSCTMSGDDSENE-FDTMTPSRHFSDWKGRYXXXXXXXXX 722
               GSM FSS+  RGYAGDESSC MSG+DS +E  +T+  SR+  D   RY         
Sbjct: 315  ITNGSMNFSSAGFRGYAGDESSC-MSGNDSLSEPEETVLISRNSFDRSSRYRASSSHSTE 373

Query: 723  XXXXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRD 902
                REA+KRLSERWK+T R Q+   V RGSTLAEMLA+ D+E++S   D ++GQ    +
Sbjct: 374  SSVSREARKRLSERWKMTRRFQEVGAVNRGSTLAEMLAISDKEVRSENLDSMIGQGGCSN 433

Query: 903  RLVRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNK 1082
               R DG + WA+PLGISS DGWKD C                FG P  S+ HE+     
Sbjct: 434  SFSRNDGTSEWASPLGISSMDGWKDGCGRHLSRSRSLPASSDVFGSPKASMHHET----- 488

Query: 1083 RSMLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSVTYPGEECKHTGQENHHLN 1262
                                 Q +G L   N+K   KKS S      E   T QE  + N
Sbjct: 489  ---------------------QVDGCLSSRNLKCSSKKSQSSRDKSREHNDTLQE-IYFN 526

Query: 1263 PDEPLPSLEEKNLSEQKPLILEVFGDGMEEKMRVTDETKVSRYKDVEMSLEIREEQQLEA 1442
             +E   +L+EK  SE+KP+I E       +   V D T V   +++ MS E  +E   E 
Sbjct: 527  HNEMKCNLDEKGPSEEKPMISETSAYNATDTNLVVD-TIVDEQENMAMSSESPDESLREL 585

Query: 1443 AACV 1454
            + C+
Sbjct: 586  STCI 589


>ref|XP_006480729.1| PREDICTED: uncharacterized protein LOC102617097 [Citrus sinensis]
          Length = 989

 Score =  219 bits (557), Expect = 3e-54
 Identities = 157/439 (35%), Positives = 219/439 (49%), Gaps = 4/439 (0%)
 Frame = +3

Query: 24   GHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAHVSSKVS 203
            GHI+ +    A      DV  K +R  + +N   S  ++ + L SHS   H A   +K +
Sbjct: 249  GHISAMTPSLARQCESSDVGWKAERGTQCKNQRKSSQEHPDGLSSHSSSGHAAQSLNKPA 308

Query: 204  KSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXXXXXXX 383
                       ++Q+       ++D  +LPTRIVVLKPN+ +    AR V          
Sbjct: 309  -----------IVQLEG-----KEDHSVLPTRIVVLKPNVGRVQAAARTVSSPRSSHGYP 352

Query: 384  XXYRKQEHRSSENLELFS-EVRERRSSPIESELIKHRDRGSREIAKEITRQMRCPANYGS 560
               RK        +E    E  E++  P +    +H+ R SRE+AKEITRQMR   +  S
Sbjct: 353  SDSRKHTELPGPGMENREPETWEKKKFPDDVGFSRHKSRESRELAKEITRQMRDNLSSVS 412

Query: 561  MKFSSSRLRGYAGDESSCTMSGDDSENEFD--TMTPSRHFSDWKGRYXXXXXXXXXXXXX 734
            MKFSS+  +GYAGDESS   SG++S NE +  TMT    F   + R              
Sbjct: 413  MKFSSTGFKGYAGDESSSNFSGNESANELEIKTMTSKDGFIRHR-RSRSSSSHSSESSVS 471

Query: 735  REAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRDRLVR 914
            REAKKRLSERWK++H+ Q+   + RG+TL EMLAM DRE++    D L+GQ+ F DR   
Sbjct: 472  REAKKRLSERWKMSHKSQELGVINRGNTLGEMLAMSDREVRPANVDTLIGQEGFCDRRDG 531

Query: 915  KDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNKRSML 1094
             +G  RW  PLGISS+DGWKD                T    P  S+R+ESL +++  + 
Sbjct: 532  NNGPTRWVEPLGISSRDGWKDGRISTLTRSRSLPTSST-LASPKTSMRYESLRDDRYIIP 590

Query: 1095 KEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSVTYPGEECKHTGQENHH-LNPDE 1271
            KE +     K  KG  NQREG   +S+ KA  +K  S      E   T  + H  LN  E
Sbjct: 591  KETIKRERGKAVKGNFNQREGSSSRSS-KASRRKYLSSQCTSRESNITSPDTHFTLNQVE 649

Query: 1272 PLPSLEEKNLSEQKPLILE 1328
               +++E + SE+  ++LE
Sbjct: 650  --SNIKEYDPSEESFMVLE 666


>ref|XP_004305226.1| PREDICTED: uncharacterized protein LOC101298051 [Fragaria vesca
            subsp. vesca]
          Length = 988

 Score =  218 bits (556), Expect = 4e-54
 Identities = 138/424 (32%), Positives = 216/424 (50%), Gaps = 2/424 (0%)
 Frame = +3

Query: 9    PSPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAHV 188
            P    G +  ++S  A  Y + D+     R+   RN+  S  ++ +   S+S  +H    
Sbjct: 252  PQSHCGRVASMKSSEAQKYEKIDLGWTSARESPLRNYCKSPQRHRDSFSSYSDSRHATRY 311

Query: 189  SSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXX 368
            S K                 ++  P  + +T + PTRIVVLKPNL K LN  + +     
Sbjct: 312  SLK-----------------SQYRPEAKHETAITPTRIVVLKPNLGKILNATKTISSPCS 354

Query: 369  XXXXXXXYR-KQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCP 545
                    R + +  +  N E+  +   +++ P      +H+ R SRE+AKEITRQMR  
Sbjct: 355  SQASMSVCRNRSDFPNIGNREV--DAWGKKNFPDNEGQSRHKSRESREVAKEITRQMRKN 412

Query: 546  ANYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMT-PSRHFSDWKGRYXXXXXXXXX 722
             + GS++ SSS  +GYAGD+SSC+MS ++S NE + ++  S+ FSD              
Sbjct: 413  ISMGSVQISSSGFKGYAGDDSSCSMSENESGNESEVISVASKQFSDRHNHSRRSSTCSAE 472

Query: 723  XXXXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRD 902
                REAKKRLSERWK+TH+ Q+     RG+TLAEMLA+PD+E+++   D + G+  FRD
Sbjct: 473  SSVSREAKKRLSERWKMTHKSQEIGVASRGNTLAEMLAIPDKEMQAAKLDAMKGEAGFRD 532

Query: 903  RLVRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNK 1082
            +  R+DG   W  PLGISS+DGWKDEC                FG    ++R E++ +N+
Sbjct: 533  KFAREDGPVGWGGPLGISSRDGWKDECIKSLSRSKSLPASSGAFG-SYKTMRRETIRDNR 591

Query: 1083 RSMLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSVTYPGEECKHTGQENHHLN 1262
              +  E + H  N+  +   + RE    + N ++ +K+SYS        +   +E+  ++
Sbjct: 592  YLIPSEVLKHKRNQSVEVDFDHRES--GRINYRSRNKRSYS-------SRSLSRESMDIS 642

Query: 1263 PDEP 1274
            P+ P
Sbjct: 643  PETP 646


>ref|XP_006429000.1| hypothetical protein CICLE_v10011022mg [Citrus clementina]
            gi|557531057|gb|ESR42240.1| hypothetical protein
            CICLE_v10011022mg [Citrus clementina]
          Length = 909

 Score =  217 bits (552), Expect = 1e-53
 Identities = 156/439 (35%), Positives = 218/439 (49%), Gaps = 4/439 (0%)
 Frame = +3

Query: 24   GHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAHVSSKVS 203
            GHI+ +    A      DV  K +R  + +N   S  ++ + L  HS   H A   +K +
Sbjct: 169  GHISAMTPSLARQCESSDVGWKAERGTQCKNQRKSSQEHPDGLSRHSSSGHAAQSLNKPA 228

Query: 204  KSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXXXXXXX 383
                       ++Q+       ++D  +LPTRIVVLKPN+ +    AR V          
Sbjct: 229  -----------IVQLEG-----KEDHSVLPTRIVVLKPNVGRVQAAARTVSSPRSSHGYP 272

Query: 384  XXYRKQEHRSSENLELFS-EVRERRSSPIESELIKHRDRGSREIAKEITRQMRCPANYGS 560
               RK        +E    E  E++  P +    +H+ R SRE+AKEITRQMR   +  S
Sbjct: 273  SDSRKHTELPGPGMENREPETWEKKKFPDDVGFSRHKSRESRELAKEITRQMRDNLSSVS 332

Query: 561  MKFSSSRLRGYAGDESSCTMSGDDSENEFD--TMTPSRHFSDWKGRYXXXXXXXXXXXXX 734
            MKFSS+  +GYAGDESS   SG++S NE +  TMT    F   + R              
Sbjct: 333  MKFSSTGFKGYAGDESSSNFSGNESANELEIKTMTSKDGFIRHR-RSRSSSSHSSESSVS 391

Query: 735  REAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRDRLVR 914
            REAKKRLSERWK++H+ Q+   + RG+TL EMLAM DRE++    D L+GQ+ F DR   
Sbjct: 392  REAKKRLSERWKMSHKSQELGVINRGNTLGEMLAMSDREVRPANVDTLIGQEGFCDRRDG 451

Query: 915  KDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNKRSML 1094
             +G  RW  PLGISS+DGWKD                T    P  S+R+ESL +++  + 
Sbjct: 452  NNGPTRWVEPLGISSRDGWKDGRISTLTRSRSLPTSST-LASPKTSMRYESLRDDRYIIP 510

Query: 1095 KEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSVTYPGEECKHTGQENHH-LNPDE 1271
            KE +     K  KG  NQREG   +S+ KA  +K  S      E   T  + H  LN  E
Sbjct: 511  KETIKRERGKAVKGNFNQREGSSSRSS-KASRRKYLSSQCTSRESNITSPDTHFTLNQVE 569

Query: 1272 PLPSLEEKNLSEQKPLILE 1328
               +++E + SE+  ++LE
Sbjct: 570  --SNIKEYDPSEESFMVLE 586


>ref|XP_007027126.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508715731|gb|EOY07628.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 991

 Score =  216 bits (550), Expect = 2e-53
 Identities = 167/482 (34%), Positives = 233/482 (48%), Gaps = 14/482 (2%)
 Frame = +3

Query: 9    PSPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAHV 188
            P  + G I+ ++S +        +  +  R+ + ++ + S   +  DLLSHS  ++ AH 
Sbjct: 249  PQSRCGRISAMKSSHTLTNENGHLGRRAGRETQCKHCSKSPQGHREDLLSHSCGRYAAH- 307

Query: 189  SSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXX 368
                           +LL+  K    E+ +  + PTRIVVLKPNL K+LN  R       
Sbjct: 308  ---------------NLLKSPKVQLEEKQEPAVAPTRIVVLKPNLGKSLNSMRTASSPCS 352

Query: 369  XXXXXXXYRKQ-EHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCP 545
                      Q E    EN E  +E+  ++    +    +H  R SRE+AKEITR+M+  
Sbjct: 353  SHHFPSDCTGQSEILGIENRE--AEIWGKKKVHQDVGFSRHNSRESREMAKEITRRMKNS 410

Query: 546  ANYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFSDWKGR---YXXXXXXX 716
             + GSMKFS+SR RGYAGDESSC +SG +S N+ D  T S  + D  GR   +       
Sbjct: 411  FSNGSMKFSTSRFRGYAGDESSCDVSGSESANDSDVTTVS--YRDNIGRNKKHRRSSSRS 468

Query: 717  XXXXXXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNF 896
                  REAKKRLSERWKLTH  Q+   V RGSTL EMLA+ DRE++   S  +VG++  
Sbjct: 469  SESSVSREAKKRLSERWKLTHGSQELLMVSRGSTLGEMLAISDREVRPANSSGIVGEEGC 528

Query: 897  RD--RLVRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESL 1070
             +    VR+   A W  PLGISS+DGWK+EC              T FG P ++ RHESL
Sbjct: 529  SEFGNDVRR---AVWKEPLGISSRDGWKNECLGNLSRSRSVPASSTDFGSPRINTRHESL 585

Query: 1071 SNNKRSMLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKS--YSVTYPGEECKHTGQ 1244
              +K  + KE      NK  KG  +      L SN ++  KKS   S      E   T  
Sbjct: 586  RRDKYVIPKEGFKWDRNKAVKGNFSPWVA-PLPSNQRSCTKKSQFLSTCSSNNENSDTSP 644

Query: 1245 ENHHLNPDEPLPSLEEKNLSEQKPLILEVFGDG------MEEKMRVTDETKVSRYKDVEM 1406
            E  H+ P +   +LE  +  EQ P++             +E  + V D+ KV   +  +M
Sbjct: 645  E-FHITPYQVKQTLEGHDQPEQSPMVSGASSTSVDASSVLENAVDVNDQNKVVLSEPSQM 703

Query: 1407 SL 1412
             L
Sbjct: 704  EL 705


>ref|XP_007027125.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508715730|gb|EOY07627.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1023

 Score =  216 bits (550), Expect = 2e-53
 Identities = 167/482 (34%), Positives = 233/482 (48%), Gaps = 14/482 (2%)
 Frame = +3

Query: 9    PSPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAHV 188
            P  + G I+ ++S +        +  +  R+ + ++ + S   +  DLLSHS  ++ AH 
Sbjct: 281  PQSRCGRISAMKSSHTLTNENGHLGRRAGRETQCKHCSKSPQGHREDLLSHSCGRYAAH- 339

Query: 189  SSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXX 368
                           +LL+  K    E+ +  + PTRIVVLKPNL K+LN  R       
Sbjct: 340  ---------------NLLKSPKVQLEEKQEPAVAPTRIVVLKPNLGKSLNSMRTASSPCS 384

Query: 369  XXXXXXXYRKQ-EHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCP 545
                      Q E    EN E  +E+  ++    +    +H  R SRE+AKEITR+M+  
Sbjct: 385  SHHFPSDCTGQSEILGIENRE--AEIWGKKKVHQDVGFSRHNSRESREMAKEITRRMKNS 442

Query: 546  ANYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFSDWKGR---YXXXXXXX 716
             + GSMKFS+SR RGYAGDESSC +SG +S N+ D  T S  + D  GR   +       
Sbjct: 443  FSNGSMKFSTSRFRGYAGDESSCDVSGSESANDSDVTTVS--YRDNIGRNKKHRRSSSRS 500

Query: 717  XXXXXXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNF 896
                  REAKKRLSERWKLTH  Q+   V RGSTL EMLA+ DRE++   S  +VG++  
Sbjct: 501  SESSVSREAKKRLSERWKLTHGSQELLMVSRGSTLGEMLAISDREVRPANSSGIVGEEGC 560

Query: 897  RD--RLVRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESL 1070
             +    VR+   A W  PLGISS+DGWK+EC              T FG P ++ RHESL
Sbjct: 561  SEFGNDVRR---AVWKEPLGISSRDGWKNECLGNLSRSRSVPASSTDFGSPRINTRHESL 617

Query: 1071 SNNKRSMLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKS--YSVTYPGEECKHTGQ 1244
              +K  + KE      NK  KG  +      L SN ++  KKS   S      E   T  
Sbjct: 618  RRDKYVIPKEGFKWDRNKAVKGNFSPWVA-PLPSNQRSCTKKSQFLSTCSSNNENSDTSP 676

Query: 1245 ENHHLNPDEPLPSLEEKNLSEQKPLILEVFGDG------MEEKMRVTDETKVSRYKDVEM 1406
            E  H+ P +   +LE  +  EQ P++             +E  + V D+ KV   +  +M
Sbjct: 677  E-FHITPYQVKQTLEGHDQPEQSPMVSGASSTSVDASSVLENAVDVNDQNKVVLSEPSQM 735

Query: 1407 SL 1412
             L
Sbjct: 736  EL 737


>ref|XP_002308481.2| hypothetical protein POPTR_0006s23020g [Populus trichocarpa]
            gi|550336905|gb|EEE92004.2| hypothetical protein
            POPTR_0006s23020g [Populus trichocarpa]
          Length = 907

 Score =  190 bits (483), Expect = 1e-45
 Identities = 126/338 (37%), Positives = 176/338 (52%), Gaps = 2/338 (0%)
 Frame = +3

Query: 147  DLLSHSHKKHGAHVSSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLR 326
            D  SHSH KHGA    ++SK Q  ++D++                 +LPTRIVVLKPNL 
Sbjct: 211  DPASHSHGKHGAQNPVELSKIQLDQKDES----------------AILPTRIVVLKPNLG 254

Query: 327  KALNPARPVXXXXXXXXXXXXYRKQ-EHRSSENLELFSEVRERRSSPIESELIKHRDRGS 503
            +  N  +               R+  E    +N E+ S  +++   P ++   +++ R S
Sbjct: 255  RTQNSTKNTSSPQYSRASPLDCRQHTEPPGIKNREVVSYGKKK--FPDDAGPSRYKSRES 312

Query: 504  REIAKEITRQMRCPANYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFD-TMTPSRHFSD 680
            REIAKEITRQMR     GSM FS+    GYA DESS  MS ++S NE + T   SR+  D
Sbjct: 313  REIAKEITRQMRESFGNGSMSFSTPAFIGYARDESSPDMSENESANESEETTVTSRNSVD 372

Query: 681  WKGRYXXXXXXXXXXXXXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKS 860
            W  RY             REA+KRLSERWK+TH+  D   V R +TL EMLA+PD E +S
Sbjct: 373  WSNRYRPSSSCSTESSVSREARKRLSERWKMTHKSVDMGIVSRSNTLGEMLAIPDLETRS 432

Query: 861  MTSDPLVGQDNFRDRLVRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGI 1040
              SD ++ +  F D+  RK GA R   PLGISS++GWKD                T    
Sbjct: 433  GNSDAMICKKVFSDKGDRKHGAVRRDEPLGISSREGWKDVGTGNLSRSRSVPATSTVISS 492

Query: 1041 PSMSIRHESLSNNKRSMLKEAVIHGSNKLKKGGGNQRE 1154
            P + +RHE++ +++  + K+ +    N+  KG  ++RE
Sbjct: 493  PRLGMRHENVCHDRYIIPKQLIQQERNRTIKGNFSKRE 530


>ref|XP_007144479.1| hypothetical protein PHAVU_007G159500g [Phaseolus vulgaris]
            gi|561017669|gb|ESW16473.1| hypothetical protein
            PHAVU_007G159500g [Phaseolus vulgaris]
          Length = 947

 Score =  189 bits (479), Expect = 3e-45
 Identities = 133/399 (33%), Positives = 190/399 (47%), Gaps = 1/399 (0%)
 Frame = +3

Query: 15   PQTGHITVLRSLNAHNYV-EKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAHVS 191
            P   H   + +++   Y  E D+  + DR+K   N+  S   + +    H  K+H  H S
Sbjct: 236  PVKSHYGDVETMDIEKYEHEHDLSWRSDREKTGLNYNRSHENHLDGYPCHFDKRHVMHSS 295

Query: 192  SKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXXX 371
             + SK Q   R             +E+D    +PT+IV+LKPNL K  N  R V      
Sbjct: 296  PRSSKLQFQGR-------------HEQD---AVPTKIVLLKPNLGKVQNGTRIVSSPC-- 337

Query: 372  XXXXXXYRKQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCPAN 551
                       H      E  +E+ +  + P  +   +     SREIAKEITRQMR   N
Sbjct: 338  ----------SHNFLSGREKDTELCQVTNMPESARSWRQDSFESREIAKEITRQMRNSLN 387

Query: 552  YGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFSDWKGRYXXXXXXXXXXXX 731
               M  S+SR+ GYAGD+SSC+ SG++S +    +T     S                  
Sbjct: 388  NSGMMLSTSRIAGYAGDDSSCSFSGNESPDVSGEITAILGNSFDLNNRTRRSSRSGESSV 447

Query: 732  XREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRDRLV 911
             +EAKKRLSERWK+TH+ Q+ + + R STLAEMLA+PD+E+K+     +   + FRD+  
Sbjct: 448  SKEAKKRLSERWKMTHKSQELQGISRSSTLAEMLAIPDKELKAANFAGMATGEGFRDKFT 507

Query: 912  RKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNKRSM 1091
                 A+W  PLGISS+DGWKD C              T FG P   +R E+L  ++  +
Sbjct: 508  PNSEPAKWVEPLGISSRDGWKDGCIGSLSRSKSLPSSSTAFGSPRRFLRTEALRADRYMV 567

Query: 1092 LKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSV 1208
             KEA  H   +      + R G     N ++GHKKS+S+
Sbjct: 568  PKEA--HKRERRAAKNFDHRHG--NNRNSRSGHKKSWSL 602


>ref|XP_006594085.1| PREDICTED: uncharacterized protein LOC100794819 isoform X3 [Glycine
            max]
          Length = 862

 Score =  187 bits (475), Expect = 1e-44
 Identities = 132/400 (33%), Positives = 193/400 (48%), Gaps = 1/400 (0%)
 Frame = +3

Query: 12   SPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYEND-LLSHSHKKHGAHV 188
            +P   H   ++ ++   Y E D   + D +K   N+  S ++  +D    H  K+H  H 
Sbjct: 164  APVQSHYGYVKPMDIEKY-EHDFNLRSDWEKTRSNYNRSSHEKHHDGYPCHFDKRHVMHS 222

Query: 189  SSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXX 368
            S K SK Q   +             YE+   + + ++IV+LKPNL K  N  R V     
Sbjct: 223  SPKSSKLQFKAK-------------YEQ---KAVTSQIVLLKPNLGKVQNGTRIVSSPC- 265

Query: 369  XXXXXXXYRKQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCPA 548
                        H      E  +E+ +  + P  +   +     SREIAKE+TRQM+   
Sbjct: 266  ----------SSHNFLAGCENDTELCQATNLPESARSWRQDSFESREIAKEVTRQMKISL 315

Query: 549  NYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFSDWKGRYXXXXXXXXXXX 728
            N GSMK S+SR+RGYAGD+SSC++SG++S  E +  T +   S                 
Sbjct: 316  NNGSMKLSTSRIRGYAGDDSSCSVSGNESPEESEETTATLGNSIDLNNRSRRSSRSSESS 375

Query: 729  XXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRDRL 908
              REAKKRLSERWK+TH+ Q+ + + R STLAEMLA+PD ++K+  SD +   + F D+ 
Sbjct: 376  VSREAKKRLSERWKMTHKSQELQGISRSSTLAEMLAIPDMKLKASNSDSMASGEGFHDKC 435

Query: 909  VRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNKRS 1088
                  A+W  PLGISS+DGWKD C              T FG P   +R E+L + +  
Sbjct: 436  TPNSQPAKWVEPLGISSRDGWKDGCIGSLSRSKSLPSSSTAFGSPRRFLRTEALLDERFM 495

Query: 1089 MLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSV 1208
            + K+A             ++RE        ++GHKKS S+
Sbjct: 496  VPKDA-------------HRRE------RRRSGHKKSRSL 516


>ref|XP_006594084.1| PREDICTED: uncharacterized protein LOC100794819 isoform X2 [Glycine
            max]
          Length = 941

 Score =  187 bits (475), Expect = 1e-44
 Identities = 132/400 (33%), Positives = 193/400 (48%), Gaps = 1/400 (0%)
 Frame = +3

Query: 12   SPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYEND-LLSHSHKKHGAHV 188
            +P   H   ++ ++   Y E D   + D +K   N+  S ++  +D    H  K+H  H 
Sbjct: 243  APVQSHYGYVKPMDIEKY-EHDFNLRSDWEKTRSNYNRSSHEKHHDGYPCHFDKRHVMHS 301

Query: 189  SSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXX 368
            S K SK Q   +             YE+   + + ++IV+LKPNL K  N  R V     
Sbjct: 302  SPKSSKLQFKAK-------------YEQ---KAVTSQIVLLKPNLGKVQNGTRIVSSPC- 344

Query: 369  XXXXXXXYRKQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCPA 548
                        H      E  +E+ +  + P  +   +     SREIAKE+TRQM+   
Sbjct: 345  ----------SSHNFLAGCENDTELCQATNLPESARSWRQDSFESREIAKEVTRQMKISL 394

Query: 549  NYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFSDWKGRYXXXXXXXXXXX 728
            N GSMK S+SR+RGYAGD+SSC++SG++S  E +  T +   S                 
Sbjct: 395  NNGSMKLSTSRIRGYAGDDSSCSVSGNESPEESEETTATLGNSIDLNNRSRRSSRSSESS 454

Query: 729  XXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRDRL 908
              REAKKRLSERWK+TH+ Q+ + + R STLAEMLA+PD ++K+  SD +   + F D+ 
Sbjct: 455  VSREAKKRLSERWKMTHKSQELQGISRSSTLAEMLAIPDMKLKASNSDSMASGEGFHDKC 514

Query: 909  VRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNKRS 1088
                  A+W  PLGISS+DGWKD C              T FG P   +R E+L + +  
Sbjct: 515  TPNSQPAKWVEPLGISSRDGWKDGCIGSLSRSKSLPSSSTAFGSPRRFLRTEALLDERFM 574

Query: 1089 MLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSV 1208
            + K+A             ++RE        ++GHKKS S+
Sbjct: 575  VPKDA-------------HRRE------RRRSGHKKSRSL 595


>ref|XP_003541395.1| PREDICTED: uncharacterized protein LOC100794819 isoform X1 [Glycine
            max]
          Length = 942

 Score =  187 bits (475), Expect = 1e-44
 Identities = 132/400 (33%), Positives = 193/400 (48%), Gaps = 1/400 (0%)
 Frame = +3

Query: 12   SPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYEND-LLSHSHKKHGAHV 188
            +P   H   ++ ++   Y E D   + D +K   N+  S ++  +D    H  K+H  H 
Sbjct: 244  APVQSHYGYVKPMDIEKY-EHDFNLRSDWEKTRSNYNRSSHEKHHDGYPCHFDKRHVMHS 302

Query: 189  SSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXX 368
            S K SK Q   +             YE+   + + ++IV+LKPNL K  N  R V     
Sbjct: 303  SPKSSKLQFKAK-------------YEQ---KAVTSQIVLLKPNLGKVQNGTRIVSSPC- 345

Query: 369  XXXXXXXYRKQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCPA 548
                        H      E  +E+ +  + P  +   +     SREIAKE+TRQM+   
Sbjct: 346  ----------SSHNFLAGCENDTELCQATNLPESARSWRQDSFESREIAKEVTRQMKISL 395

Query: 549  NYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFSDWKGRYXXXXXXXXXXX 728
            N GSMK S+SR+RGYAGD+SSC++SG++S  E +  T +   S                 
Sbjct: 396  NNGSMKLSTSRIRGYAGDDSSCSVSGNESPEESEETTATLGNSIDLNNRSRRSSRSSESS 455

Query: 729  XXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRDRL 908
              REAKKRLSERWK+TH+ Q+ + + R STLAEMLA+PD ++K+  SD +   + F D+ 
Sbjct: 456  VSREAKKRLSERWKMTHKSQELQGISRSSTLAEMLAIPDMKLKASNSDSMASGEGFHDKC 515

Query: 909  VRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNKRS 1088
                  A+W  PLGISS+DGWKD C              T FG P   +R E+L + +  
Sbjct: 516  TPNSQPAKWVEPLGISSRDGWKDGCIGSLSRSKSLPSSSTAFGSPRRFLRTEALLDERFM 575

Query: 1089 MLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSV 1208
            + K+A             ++RE        ++GHKKS S+
Sbjct: 576  VPKDA-------------HRRE------RRRSGHKKSRSL 596


>gb|EXB67881.1| hypothetical protein L484_008898 [Morus notabilis]
          Length = 997

 Score =  186 bits (473), Expect = 2e-44
 Identities = 150/473 (31%), Positives = 217/473 (45%), Gaps = 4/473 (0%)
 Frame = +3

Query: 9    PSPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAHV 188
            P    G I  +++ +A  Y    +  K  R+         V++  N      H +H  H 
Sbjct: 255  PQLLCGRIEAMKASDAQMYESTHLDIKSARQ---------VHKNRNVSSQKHHDRHSGHS 305

Query: 189  SSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXX 368
            +  ++ S          L+   +    ++++ +LPTRIVVLKPNL K L+ A  V     
Sbjct: 306  NCYMAPSS---------LKAPNNQLEGKEESAILPTRIVVLKPNLGKVLHAANDVSSPCS 356

Query: 369  XXXXXXXYRKQEH---RSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMR 539
                    RK        + N+EL      RRS   +  L  H+ R SRE+AKEI RQMR
Sbjct: 357  SRPSISDCRKDMEIPILKNSNVELLG----RRSFHGDGGLSGHKARESRELAKEIARQMR 412

Query: 540  CPANYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFS-DWKGRYXXXXXXX 716
               +  SM+FSS   +GYAGDESSC+MSG++S NE + M+ S  +S DW  +        
Sbjct: 413  ASFSNSSMRFSSFAYKGYAGDESSCSMSGNESANESEVMSMSSKYSFDWNNQSRPSSSRS 472

Query: 717  XXXXXXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNF 896
                  REAKKRLSERW+L HR  D  +V RG+TL EMLA+PD E   +  + +  +  F
Sbjct: 473  TESSVTREAKKRLSERWRLNHRSLDMGSVSRGTTLGEMLAIPDNERIPVHFNTITDEKGF 532

Query: 897  RDRLVRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSN 1076
            R++        R   PLGISS+DGWKD C              T FG     +  E + +
Sbjct: 533  RNKFASDRPTGR-VEPLGISSRDGWKDGCVGKLPRSRSLPSSSTVFGSAKSIMCREPIRD 591

Query: 1077 NKRSMLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSVTYPGEECKHTGQENHH 1256
            ++  + +EA +   NK  K   + R  I    N ++   +SY   Y   E      +  H
Sbjct: 592  DRYVVPREAFMRERNKSPKNNLDDRSII---RNTRSRSTRSYLSHYIIRESCDMSPDT-H 647

Query: 1257 LNPDEPLPSLEEKNLSEQKPLILEVFGDGMEEKMRVTDETKVSRYKDVEMSLE 1415
             + ++    LE  +   QK   LE     +++   V  ET V    DVE  +E
Sbjct: 648  TSQNQVKIKLEVNSPPVQKLEELESLASNVKDTTPV-PETLV----DVECEVE 695


>ref|XP_007162683.1| hypothetical protein PHAVU_001G171300g [Phaseolus vulgaris]
            gi|561036147|gb|ESW34677.1| hypothetical protein
            PHAVU_001G171300g [Phaseolus vulgaris]
          Length = 945

 Score =  176 bits (447), Expect = 2e-41
 Identities = 143/478 (29%), Positives = 213/478 (44%), Gaps = 5/478 (1%)
 Frame = +3

Query: 24   GHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAHVSSKVS 203
            GH    +S    NY + D+  KPDR+ +  N+            + SH+KH    S  V 
Sbjct: 232  GHSEGTKSSAMENYEQGDLSKKPDREMKRLNY------------NRSHQKHHGGYSCNVV 279

Query: 204  KSQPYERDDTHLLQVTKSHPYER-DDTRLLPTRIVVLKPNLRKALNPARPVXXXXXXXXX 380
            +     R D H    +    ++  ++   +PTRIV+LKPNL K +  A  +         
Sbjct: 280  R-----RQDIHSSPKSSKLQFKGGNEPDAVPTRIVILKPNLGK-VQKATKIGSPPCSSHT 333

Query: 381  XXXYRKQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCPANYGS 560
                R +    S+     +E+ +R++    +   +     SREIAKEIT QM+   N  S
Sbjct: 334  FLLERGKCPEFSDRRFRDTELNQRKNLHDNAWHSRQNSLESREIAKEITSQMKNNLNNDS 393

Query: 561  MKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFSDWKGRYXXXXXXXXXXXXXRE 740
            M  SSSR RG  GD SSC+ SG++S  E +  + +   S +                 +E
Sbjct: 394  MLLSSSRFRGNTGDNSSCSFSGNESLGESEVTSATLGRSFYISNTISPSSCFSESFVSKE 453

Query: 741  AKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRDRLVRKD 920
            AKKRLSERWK++ + Q   +V    TLAEMLA+PD+E+K+   D +      RD+L  K 
Sbjct: 454  AKKRLSERWKMSLKSQQGHSVSMSGTLAEMLAIPDKEMKTANFDSIPSGKGLRDKLSSKG 513

Query: 921  GAARWAAPLGISSKDGWKDEC-XXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNKRSMLK 1097
              A W  PLGISS+DGWKD C               T FG P   +RHE+L +++  M K
Sbjct: 514  KPAGWVEPLGISSRDGWKDGCIGSLPRSKSLPASSTTSFGSPRTILRHEALHDDRFMMPK 573

Query: 1098 EAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSYSVTYPGEECK---HTGQENHHLNPD 1268
             A      K+ K   +QR+ +  ++      + S+     G E     +T Q    +N +
Sbjct: 574  VACKRERKKVVK-CLDQRQCMNTRNLKNKNSRCSHPSNLEGNESSPDLNTIQNKVRINLE 632

Query: 1269 EPLPSLEEKNLSEQKPLILEVFGDGMEEKMRVTDETKVSRYKDVEMSLEIREEQQLEA 1442
            E LP  E         +I E      E  + V DE  V   +     L +   +++ A
Sbjct: 633  EDLPKQEMLAAESLAEIIRETIA-VTEAVVDVGDENAVGSSESYIKELSVGSSRKISA 689


>ref|XP_006576988.1| PREDICTED: uncharacterized protein LOC100801297 isoform X2 [Glycine
            max]
          Length = 875

 Score =  173 bits (439), Expect = 2e-40
 Identities = 147/480 (30%), Positives = 219/480 (45%), Gaps = 21/480 (4%)
 Frame = +3

Query: 6    PPSPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAH 185
            P     GH+   +  N  N    D   KPD++ +  N+  S  ++++    H  ++H  H
Sbjct: 174  PSQSHFGHVEGTKLSNIVNCEHNDFSRKPDKEMKWLNYNRSNQKHDDGYSCHFVRRHAIH 233

Query: 186  VSSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXX 365
             S K S++Q   ++  + +                PTRIVVLKPNL K  +P +      
Sbjct: 234  SSPKSSRNQFKGKNVPNAV----------------PTRIVVLKPNLEKVQSPTK----IG 273

Query: 366  XXXXXXXXYRKQEHRSSENLELFSEVR-------ERRSSPIESELIKHRDRGSREIAKEI 524
                    +  Q  + +E    FS++R       +R++    +   K     SREIAKEI
Sbjct: 274  SSPCSPYAFLSQCGKHAE----FSDIRFRETGLNQRKNLTANAWHSKQNSLESREIAKEI 329

Query: 525  TRQMRCPANYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFD--------------TMTP 662
            T QM+   N GSM FSSSR RGY  D+SSC++SG+ S +E +              T++P
Sbjct: 330  TSQMKNNLNIGSMIFSSSRFRGYTWDDSSCSLSGNQSPDESEVTPATLEKSFEICNTISP 389

Query: 663  SRHFSDWKGRYXXXXXXXXXXXXXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMP 842
            S  FS+                  REAKKRLSERWK++ + Q   ++ R  TLAEMLA+P
Sbjct: 390  SSCFSE--------------SFVSREAKKRLSERWKMSLKFQQGNSISRSGTLAEMLAIP 435

Query: 843  DREIKSMTSDPLVGQDNFRDRLVRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXX 1022
            ++E+K+   D +   +   D++      A W  PLG+SSKDG+                 
Sbjct: 436  NKEMKASKFDSISCGEGSHDKISSNGKPAGWVEPLGVSSKDGY----IGSLPRSKSLPAS 491

Query: 1023 XTGFGIPSMSIRHESLSNNKRSMLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSY 1202
             T FG P   + HE+L +++  M KEA      K+ K   +QR     K + K+GHKK  
Sbjct: 492  STTFGSPRTILHHEALCDDRFMMPKEASKQEKKKVVK-LLDQRPCTNTKRS-KSGHKKC- 548

Query: 1203 SVTYPGEECKHTGQENHHLNPDEPLPSLEEKNLSEQKPLILEVFGDGMEEKMRVTDETKV 1382
                      +T Q    +N +E LP  +E  ++E    IL       EE + VT+E  V
Sbjct: 549  ------SPYLNTIQNKVKINLEENLPK-QEVLIAESLAEILRDTSAVTEEVVGVTNENAV 601


>ref|XP_006576987.1| PREDICTED: uncharacterized protein LOC100801297 isoform X1 [Glycine
            max]
          Length = 926

 Score =  173 bits (439), Expect = 2e-40
 Identities = 147/480 (30%), Positives = 219/480 (45%), Gaps = 21/480 (4%)
 Frame = +3

Query: 6    PPSPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYENDLLSHSHKKHGAH 185
            P     GH+   +  N  N    D   KPD++ +  N+  S  ++++    H  ++H  H
Sbjct: 225  PSQSHFGHVEGTKLSNIVNCEHNDFSRKPDKEMKWLNYNRSNQKHDDGYSCHFVRRHAIH 284

Query: 186  VSSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXX 365
             S K S++Q   ++  + +                PTRIVVLKPNL K  +P +      
Sbjct: 285  SSPKSSRNQFKGKNVPNAV----------------PTRIVVLKPNLEKVQSPTK----IG 324

Query: 366  XXXXXXXXYRKQEHRSSENLELFSEVR-------ERRSSPIESELIKHRDRGSREIAKEI 524
                    +  Q  + +E    FS++R       +R++    +   K     SREIAKEI
Sbjct: 325  SSPCSPYAFLSQCGKHAE----FSDIRFRETGLNQRKNLTANAWHSKQNSLESREIAKEI 380

Query: 525  TRQMRCPANYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFD--------------TMTP 662
            T QM+   N GSM FSSSR RGY  D+SSC++SG+ S +E +              T++P
Sbjct: 381  TSQMKNNLNIGSMIFSSSRFRGYTWDDSSCSLSGNQSPDESEVTPATLEKSFEICNTISP 440

Query: 663  SRHFSDWKGRYXXXXXXXXXXXXXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMP 842
            S  FS+                  REAKKRLSERWK++ + Q   ++ R  TLAEMLA+P
Sbjct: 441  SSCFSE--------------SFVSREAKKRLSERWKMSLKFQQGNSISRSGTLAEMLAIP 486

Query: 843  DREIKSMTSDPLVGQDNFRDRLVRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXX 1022
            ++E+K+   D +   +   D++      A W  PLG+SSKDG+                 
Sbjct: 487  NKEMKASKFDSISCGEGSHDKISSNGKPAGWVEPLGVSSKDGY----IGSLPRSKSLPAS 542

Query: 1023 XTGFGIPSMSIRHESLSNNKRSMLKEAVIHGSNKLKKGGGNQREGILLKSNIKAGHKKSY 1202
             T FG P   + HE+L +++  M KEA      K+ K   +QR     K + K+GHKK  
Sbjct: 543  STTFGSPRTILHHEALCDDRFMMPKEASKQEKKKVVK-LLDQRPCTNTKRS-KSGHKKC- 599

Query: 1203 SVTYPGEECKHTGQENHHLNPDEPLPSLEEKNLSEQKPLILEVFGDGMEEKMRVTDETKV 1382
                      +T Q    +N +E LP  +E  ++E    IL       EE + VT+E  V
Sbjct: 600  ------SPYLNTIQNKVKINLEENLPK-QEVLIAESLAEILRDTSAVTEEVVGVTNENAV 652


>ref|XP_006588732.1| PREDICTED: uncharacterized protein LOC100797413 isoform X3 [Glycine
            max]
          Length = 860

 Score =  168 bits (426), Expect = 5e-39
 Identities = 121/381 (31%), Positives = 182/381 (47%), Gaps = 1/381 (0%)
 Frame = +3

Query: 12   SPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYEND-LLSHSHKKHGAHV 188
            +P   H   + +++   Y + D     D +K   N+  S ++  +D       K+H  H+
Sbjct: 165  APIQSHYGHVEAMDIEKY-DHDFNLMLDGEKTRLNYNRSSHEKHHDGYPCDLDKRHVMHI 223

Query: 189  SSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXX 368
            S K SK          L + T    YE+   + + ++IV+LKPNL K  N  R V     
Sbjct: 224  SPKSSKL---------LFKGT----YEQ---KAVTSQIVLLKPNLGKVQNGTRIVSSPC- 266

Query: 369  XXXXXXXYRKQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCPA 548
                        H      E  +E+ +  + P  +   +     SREIAKE+TRQM+   
Sbjct: 267  ----------SSHNFLSGRENDTELCQPTNLPESAMSWRQDSFESREIAKEVTRQMKISL 316

Query: 549  NYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFSDWKGRYXXXXXXXXXXX 728
            + G MK S+SR+RGYAGD+SSC++SG++S  E +  T +   S                 
Sbjct: 317  HSGGMKLSTSRIRGYAGDDSSCSVSGNESPEESEETTATLGNSIDLNNRSRRSSRSSESS 376

Query: 729  XXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRDRL 908
              REAKKRLSERWK+TH+ Q+ + + R +TLAEMLA+PD+ +K+  S  +   + F D+ 
Sbjct: 377  VSREAKKRLSERWKMTHKSQELQGISRSNTLAEMLAVPDKVLKAANSYSMASGEGFHDKF 436

Query: 909  VRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNKRS 1088
                  ++W  PLGISS+DGWKD C                FG P   +R E+L + +  
Sbjct: 437  TPNSQPSKWVEPLGISSRDGWKDGCIGSLSRSKSLPSSSAAFGSPRRFMRTEALLDERFM 496

Query: 1089 MLKEAVIHGSNKLKKGGGNQR 1151
            + KEA  H   + + G    R
Sbjct: 497  VPKEA--HRCERRRSGHKKSR 515


>ref|XP_006588731.1| PREDICTED: uncharacterized protein LOC100797413 isoform X2 [Glycine
            max]
          Length = 941

 Score =  168 bits (426), Expect = 5e-39
 Identities = 121/381 (31%), Positives = 182/381 (47%), Gaps = 1/381 (0%)
 Frame = +3

Query: 12   SPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYEND-LLSHSHKKHGAHV 188
            +P   H   + +++   Y + D     D +K   N+  S ++  +D       K+H  H+
Sbjct: 246  APIQSHYGHVEAMDIEKY-DHDFNLMLDGEKTRLNYNRSSHEKHHDGYPCDLDKRHVMHI 304

Query: 189  SSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXX 368
            S K SK          L + T    YE+   + + ++IV+LKPNL K  N  R V     
Sbjct: 305  SPKSSKL---------LFKGT----YEQ---KAVTSQIVLLKPNLGKVQNGTRIVSSPC- 347

Query: 369  XXXXXXXYRKQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCPA 548
                        H      E  +E+ +  + P  +   +     SREIAKE+TRQM+   
Sbjct: 348  ----------SSHNFLSGRENDTELCQPTNLPESAMSWRQDSFESREIAKEVTRQMKISL 397

Query: 549  NYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFSDWKGRYXXXXXXXXXXX 728
            + G MK S+SR+RGYAGD+SSC++SG++S  E +  T +   S                 
Sbjct: 398  HSGGMKLSTSRIRGYAGDDSSCSVSGNESPEESEETTATLGNSIDLNNRSRRSSRSSESS 457

Query: 729  XXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRDRL 908
              REAKKRLSERWK+TH+ Q+ + + R +TLAEMLA+PD+ +K+  S  +   + F D+ 
Sbjct: 458  VSREAKKRLSERWKMTHKSQELQGISRSNTLAEMLAVPDKVLKAANSYSMASGEGFHDKF 517

Query: 909  VRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNKRS 1088
                  ++W  PLGISS+DGWKD C                FG P   +R E+L + +  
Sbjct: 518  TPNSQPSKWVEPLGISSRDGWKDGCIGSLSRSKSLPSSSAAFGSPRRFMRTEALLDERFM 577

Query: 1089 MLKEAVIHGSNKLKKGGGNQR 1151
            + KEA  H   + + G    R
Sbjct: 578  VPKEA--HRCERRRSGHKKSR 596


>ref|XP_003536963.1| PREDICTED: uncharacterized protein LOC100797413 isoform X1 [Glycine
            max]
          Length = 943

 Score =  168 bits (426), Expect = 5e-39
 Identities = 121/381 (31%), Positives = 182/381 (47%), Gaps = 1/381 (0%)
 Frame = +3

Query: 12   SPQTGHITVLRSLNAHNYVEKDVCSKPDRKKESRNFAGSVNQYEND-LLSHSHKKHGAHV 188
            +P   H   + +++   Y + D     D +K   N+  S ++  +D       K+H  H+
Sbjct: 248  APIQSHYGHVEAMDIEKY-DHDFNLMLDGEKTRLNYNRSSHEKHHDGYPCDLDKRHVMHI 306

Query: 189  SSKVSKSQPYERDDTHLLQVTKSHPYERDDTRLLPTRIVVLKPNLRKALNPARPVXXXXX 368
            S K SK          L + T    YE+   + + ++IV+LKPNL K  N  R V     
Sbjct: 307  SPKSSKL---------LFKGT----YEQ---KAVTSQIVLLKPNLGKVQNGTRIVSSPC- 349

Query: 369  XXXXXXXYRKQEHRSSENLELFSEVRERRSSPIESELIKHRDRGSREIAKEITRQMRCPA 548
                        H      E  +E+ +  + P  +   +     SREIAKE+TRQM+   
Sbjct: 350  ----------SSHNFLSGRENDTELCQPTNLPESAMSWRQDSFESREIAKEVTRQMKISL 399

Query: 549  NYGSMKFSSSRLRGYAGDESSCTMSGDDSENEFDTMTPSRHFSDWKGRYXXXXXXXXXXX 728
            + G MK S+SR+RGYAGD+SSC++SG++S  E +  T +   S                 
Sbjct: 400  HSGGMKLSTSRIRGYAGDDSSCSVSGNESPEESEETTATLGNSIDLNNRSRRSSRSSESS 459

Query: 729  XXREAKKRLSERWKLTHRLQDSRTVGRGSTLAEMLAMPDREIKSMTSDPLVGQDNFRDRL 908
              REAKKRLSERWK+TH+ Q+ + + R +TLAEMLA+PD+ +K+  S  +   + F D+ 
Sbjct: 460  VSREAKKRLSERWKMTHKSQELQGISRSNTLAEMLAVPDKVLKAANSYSMASGEGFHDKF 519

Query: 909  VRKDGAARWAAPLGISSKDGWKDECXXXXXXXXXXXXXXTGFGIPSMSIRHESLSNNKRS 1088
                  ++W  PLGISS+DGWKD C                FG P   +R E+L + +  
Sbjct: 520  TPNSQPSKWVEPLGISSRDGWKDGCIGSLSRSKSLPSSSAAFGSPRRFMRTEALLDERFM 579

Query: 1089 MLKEAVIHGSNKLKKGGGNQR 1151
            + KEA  H   + + G    R
Sbjct: 580  VPKEA--HRCERRRSGHKKSR 598


Top