BLASTX nr result

ID: Akebia25_contig00017828 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00017828
         (1670 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274593.2| PREDICTED: uncharacterized protein LOC100248...   312   3e-82
ref|XP_006480729.1| PREDICTED: uncharacterized protein LOC102617...   303   2e-79
ref|XP_007027125.1| Uncharacterized protein isoform 1 [Theobroma...   302   3e-79
gb|EXB67881.1| hypothetical protein L484_008898 [Morus notabilis]     301   6e-79
ref|XP_007207901.1| hypothetical protein PRUPE_ppa020794mg [Prun...   299   2e-78
ref|XP_004305226.1| PREDICTED: uncharacterized protein LOC101298...   299   3e-78
ref|XP_007027126.1| Uncharacterized protein isoform 2 [Theobroma...   295   4e-77
emb|CBI32667.3| unnamed protein product [Vitis vinifera]              277   9e-72
ref|XP_002308481.2| hypothetical protein POPTR_0006s23020g [Popu...   275   4e-71
ref|XP_006429000.1| hypothetical protein CICLE_v10011022mg [Citr...   271   8e-70
gb|EYU41671.1| hypothetical protein MIMGU_mgv1a001075mg [Mimulus...   257   1e-65
ref|XP_002322831.2| hypothetical protein POPTR_0016s08100g [Popu...   251   7e-64
ref|XP_002534051.1| conserved hypothetical protein [Ricinus comm...   250   2e-63
ref|XP_006594084.1| PREDICTED: uncharacterized protein LOC100794...   239   4e-60
ref|XP_003541395.1| PREDICTED: uncharacterized protein LOC100794...   239   4e-60
ref|XP_006345668.1| PREDICTED: uncharacterized protein LOC102591...   237   1e-59
ref|XP_006850705.1| hypothetical protein AMTR_s00034p00242520 [A...   237   1e-59
ref|XP_004494988.1| PREDICTED: uncharacterized protein LOC101494...   233   2e-58
ref|XP_006403676.1| hypothetical protein EUTSA_v10010116mg [Eutr...   232   4e-58
ref|XP_007144479.1| hypothetical protein PHAVU_007G159500g [Phas...   230   1e-57

>ref|XP_002274593.2| PREDICTED: uncharacterized protein LOC100248303 [Vitis vinifera]
          Length = 984

 Score =  312 bits (800), Expect = 3e-82
 Identities = 207/486 (42%), Positives = 258/486 (53%), Gaps = 31/486 (6%)
 Frame = +3

Query: 306  GNSQVRKWVHTSKVLXXXXXXXXXXTEEDLLKCELNRDFSKQSSGTPMKKLLAEELSKET 485
            GN Q+R   +  K+           TEED    EL    SKQ+ GTPMKKLLA+E+SKE 
Sbjct: 25   GNRQIRNQRNFPKLASDLSSCTSGSTEEDSFTIELGPSSSKQAIGTPMKKLLAKEMSKEA 84

Query: 486  ESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGFLEKSTPYEGRSF- 662
            E K+R PSVIARLMGLD L              +++QQRT ++     E    + G    
Sbjct: 85   EPKKRSPSVIARLMGLDGLPPQQPIHKQQKKLMENHQQRTETVERA--EGGGTFYGPQLH 142

Query: 663  -KMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVKRFST 839
             K  +KE++EFKDVFEVL   K +     +  +G  N+KL+E +  FIRQKFMD KR ST
Sbjct: 143  RKKNSKEQEEFKDVFEVLVAPKGESDCYQVEGQGTTNSKLTEAEKAFIRQKFMDAKRLST 202

Query: 840  DEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPP----------KSS 989
            DEK   S+EFHDALEVLDSN ++LLKFLQEPDSLFTKHL DL+G+PP          KSS
Sbjct: 203  DEKLQDSQEFHDALEVLDSNKDLLLKFLQEPDSLFTKHLQDLQGVPPQPHCRRITVSKSS 262

Query: 990  NDQSLENRDV----ERKIEWKDATDYPRKH-----SHSHNEPGVQISNKVSKSQLNEKDE 1142
            N    EN       +R    K+    P+KH     SHS+ +     S   S+ Q   +DE
Sbjct: 263  NSPKYENNATGWKSKRGTSRKNDISSPQKHHDDHFSHSYGKHDAHKSLHPSRIQFEGRDE 322

Query: 1143 SCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSENLELFSKVRERR 1322
            + +LPTRIV+LK NL K                  S   KH    S  N E         
Sbjct: 323  TSVLPTRIVVLKPNLGKVLSSSKSISSPRSSYDFLSDCGKHTGSMSIRNKE------AEL 376

Query: 1323 NSSIELELMKHKVRGSREVAKEITRQMRRSVR----------LRGYSGDESSYTMSENDS 1472
              S E+   +HK R SRE+AKE+TR+MR S+            RGY+GDESS  MS NDS
Sbjct: 377  QGSNEMGFSRHKSRESREIAKEVTRRMRNSITNGSMNFSSAGFRGYAGDESS-CMSGNDS 435

Query: 1473 ANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQEVGMVSRS 1652
             +E E     SR+ +DR                    +KRLSERW+MT +FQEVG V+R 
Sbjct: 436  LSEPEETVLISRNSFDRSSRYRASSSHSTESSVSREARKRLSERWKMTRRFQEVGAVNRG 495

Query: 1653 STLGEM 1670
            STL EM
Sbjct: 496  STLAEM 501


>ref|XP_006480729.1| PREDICTED: uncharacterized protein LOC102617097 [Citrus sinensis]
          Length = 989

 Score =  303 bits (775), Expect = 2e-79
 Identities = 205/485 (42%), Positives = 267/485 (55%), Gaps = 30/485 (6%)
 Frame = +3

Query: 306  GNSQVRKWVHTSKVLXXXXXXXXXXTEEDLLKCELNRDFSKQSSGTPMKKLLAEELSKET 485
            GN QV    +  K+           T++D L  +  R  SKQ+  TPMKKLLA+E+S+ET
Sbjct: 26   GNRQVLNKRNFPKLASDSSSCSSDTTDDDSLMFDFGRRSSKQAVRTPMKKLLAKEMSRET 85

Query: 486  ESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGFLEKSTPYEGR-SF 662
            ESKRR PSVIARLMG D L             +++ Q  TAS      ++ST   GR SF
Sbjct: 86   ESKRRSPSVIARLMGFDGLPATQAAHKQHKRSAENNQPWTASAEKA--QRSTTSSGRRSF 143

Query: 663  KMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVKRFSTD 842
            +  +KE QEFKDVFEVL+ SK +    +  ++   N+KLSE +M FIRQKFM+ KR STD
Sbjct: 144  RKSSKEEQEFKDVFEVLDASKME----TCSKQESTNSKLSEAEMVFIRQKFMEAKRLSTD 199

Query: 843  EKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPPKS----------SN 992
            E+F  SKEF DALEVLDSN ++LLKFLQ+PDSLFTKHLHDL G   +S          S 
Sbjct: 200  ERFQDSKEFQDALEVLDSNKDLLLKFLQQPDSLFTKHLHDL-GASSQSHCGHISAMTPSL 258

Query: 993  DQSLENRDV----ERKIEWKDATDYPRKH-----SHSHNEPGVQISNKVSKSQLNEKDES 1145
             +  E+ DV    ER  + K+     ++H     SHS +    Q  NK +  QL  K++ 
Sbjct: 259  ARQCESSDVGWKAERGTQCKNQRKSSQEHPDGLSSHSSSGHAAQSLNKPAIVQLEGKEDH 318

Query: 1146 CLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSENLELFSKVRERRN 1325
             +LPTRIV+LK N+ +               G  S  RKH E           +  E++ 
Sbjct: 319  SVLPTRIVVLKPNVGRVQAAARTVSSPRSSHGYPSDSRKHTELPGPGMENREPETWEKKK 378

Query: 1326 SSIELELMKHKVRGSREVAKEITRQMR----------RSVRLRGYSGDESSYTMSENDSA 1475
               ++   +HK R SRE+AKEITRQMR           S   +GY+GDESS   S N+SA
Sbjct: 379  FPDDVGFSRHKSRESRELAKEITRQMRDNLSSVSMKFSSTGFKGYAGDESSSNFSGNESA 438

Query: 1476 NESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQEVGMVSRSS 1655
            NE EI T TS+  + R                    KKRLSERW+M+HK QE+G+++R +
Sbjct: 439  NELEIKTMTSKDGFIRHRRSRSSSSHSSESSVSREAKKRLSERWKMSHKSQELGVINRGN 498

Query: 1656 TLGEM 1670
            TLGEM
Sbjct: 499  TLGEM 503


>ref|XP_007027125.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508715730|gb|EOY07627.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1023

 Score =  302 bits (774), Expect = 3e-79
 Identities = 206/495 (41%), Positives = 267/495 (53%), Gaps = 36/495 (7%)
 Frame = +3

Query: 294  DLFPGNSQVRKWVHTSKVLXXXXXXXXXXTEEDLLKCELNRDFSKQSSGTPMKKLLAEEL 473
            +LFPGN Q++K    SK+           T+ED L  EL+   SKQS+GTPMKKLLA+E+
Sbjct: 58   ELFPGNRQLQKQRKFSKLASDSSSCGTDSTDEDQLTFELSWRSSKQSTGTPMKKLLAQEM 117

Query: 474  SKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXF-SKDYQQRTASIGIGFLEKSTPYE 650
            SKE ES+RR PSVIARLMGLD L              SK+  Q+  S           Y 
Sbjct: 118  SKENESRRRQPSVIARLMGLDGLPPQQPGHKQQKRTESKEKVQKGGSF----------YS 167

Query: 651  GRSFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVKR 830
             RS +  +KE QEFKDVFEVL+ SK +    S   +G  N+KLS+ ++ F++QKFM+ KR
Sbjct: 168  RRSSRKSSKEEQEFKDVFEVLDASKVET--GSYSSQGTANSKLSDAEVAFVQQKFMEAKR 225

Query: 831  FSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGI------------ 974
             STDEK   S+EF+DALEVLDSNT++LLKFLQ+PDSLFTKHLHDL+G             
Sbjct: 226  LSTDEKLQDSEEFNDALEVLDSNTDLLLKFLQQPDSLFTKHLHDLQGAHDLQGAQPQSRC 285

Query: 975  ----PPKSSNDQSLEN----RDVERKIEWKDATDYPRKH-----SHSHNEPGVQISNKVS 1115
                  KSS+  + EN    R   R+ + K  +  P+ H     SHS          K  
Sbjct: 286  GRISAMKSSHTLTNENGHLGRRAGRETQCKHCSKSPQGHREDLLSHSCGRYAAHNLLKSP 345

Query: 1116 KSQLNEKDESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSENLE 1295
            K QL EK E  + PTRIV+LK NL K+                 S      E    EN E
Sbjct: 346  KVQLEEKQEPAVAPTRIVVLKPNLGKSLNSMRTASSPCSSHHFPSDCTGQSEILGIENRE 405

Query: 1296 LFSKVRERRNSSIELELMKHKVRGSREVAKEITRQMRRSV----------RLRGYSGDES 1445
              +++  ++    ++   +H  R SRE+AKEITR+M+ S           R RGY+GDES
Sbjct: 406  --AEIWGKKKVHQDVGFSRHNSRESREMAKEITRRMKNSFSNGSMKFSTSRFRGYAGDES 463

Query: 1446 SYTMSENDSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKF 1625
            S  +S ++SAN+S++ T + R    R                    KKRLSERW++TH  
Sbjct: 464  SCDVSGSESANDSDVTTVSYRDNIGRNKKHRRSSSRSSESSVSREAKKRLSERWKLTHGS 523

Query: 1626 QEVGMVSRSSTLGEM 1670
            QE+ MVSR STLGEM
Sbjct: 524  QELLMVSRGSTLGEM 538


>gb|EXB67881.1| hypothetical protein L484_008898 [Morus notabilis]
          Length = 997

 Score =  301 bits (771), Expect = 6e-79
 Identities = 204/488 (41%), Positives = 266/488 (54%), Gaps = 33/488 (6%)
 Frame = +3

Query: 306  GNSQVRKWVHTSKVLXXXXXXXXXXTEEDLLKCELNRDFSKQSSGTPMKKLLAEELSKET 485
            GN QV+   +  K+            ++D    EL    SK+  GTPMKKLLA+E+SKET
Sbjct: 33   GNRQVQNQRNLPKLASDSSSCSSDTADDDSFTFELGLRSSKRGIGTPMKKLLAKEMSKET 92

Query: 486  ESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGFLEKSTPYEGRSF- 662
            ESKRR PSVIA+LMGLD L             S++Y Q + S   G    S  Y+ RS  
Sbjct: 93   ESKRRSPSVIAKLMGLDGLPTQLPAYKEEKGMSENYLQTSGSAEKG-QRSSRHYDYRSSS 151

Query: 663  KMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVKRFSTD 842
            +  +K+ QEFKDVFEVLETSK      S   +G+ N+ L++ ++ FI+QKFMD KR STD
Sbjct: 152  RKSSKDEQEFKDVFEVLETSKVASC--SYPSQGVVNSNLTDAEIAFIKQKFMDAKRLSTD 209

Query: 843  EKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPP----------KSSN 992
            EK   SKEFHDALE+LDSN ++LLKFLQ+PD LFTKHLHDL+G  P          K+S+
Sbjct: 210  EKLQSSKEFHDALEILDSNKDLLLKFLQQPDLLFTKHLHDLQGSAPQLLCGRIEAMKASD 269

Query: 993  DQSLENRDVE----------RKIEWKDATDYPRKHSHSHNEPGVQISNKVSKSQLNEKDE 1142
             Q  E+  ++          R +  +   D    HS+ +  P    S K   +QL  K+E
Sbjct: 270  AQMYESTHLDIKSARQVHKNRNVSSQKHHDRHSGHSNCYMAPS---SLKAPNNQLEGKEE 326

Query: 1143 SCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDE--YRSSENLELFSKVRE 1316
            S +LPTRIV+LK NL K                  S  RK  E     + N+EL      
Sbjct: 327  SAILPTRIVVLKPNLGKVLHAANDVSSPCSSRPSISDCRKDMEIPILKNSNVELLG---- 382

Query: 1317 RRNSSIELELMKHKVRGSREVAKEITRQMR----------RSVRLRGYSGDESSYTMSEN 1466
            RR+   +  L  HK R SRE+AKEI RQMR           S   +GY+GDESS +MS N
Sbjct: 383  RRSFHGDGGLSGHKARESRELAKEIARQMRASFSNSSMRFSSFAYKGYAGDESSCSMSGN 442

Query: 1467 DSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQEVGMVS 1646
            +SANESE+M+ +S++ +D                     KKRLSERWR+ H+  ++G VS
Sbjct: 443  ESANESEVMSMSSKYSFDWNNQSRPSSSRSTESSVTREAKKRLSERWRLNHRSLDMGSVS 502

Query: 1647 RSSTLGEM 1670
            R +TLGEM
Sbjct: 503  RGTTLGEM 510


>ref|XP_007207901.1| hypothetical protein PRUPE_ppa020794mg [Prunus persica]
            gi|462403543|gb|EMJ09100.1| hypothetical protein
            PRUPE_ppa020794mg [Prunus persica]
          Length = 910

 Score =  299 bits (766), Expect = 2e-78
 Identities = 199/434 (45%), Positives = 246/434 (56%), Gaps = 36/434 (8%)
 Frame = +3

Query: 477  KETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGFLEK----STP 644
            +ETE +RR PSVIA+LMGLD L             S++  QRT  +     EK    S  
Sbjct: 3    RETEPRRRSPSVIAKLMGLDGLPPQQPAHRQQKSISENCLQRTRLV-----EKEERSSMC 57

Query: 645  YEGRSFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDV 824
            Y+ RS +  +KE+QEFKDVFEV E SK +  G S   +G  N+KLS+ +M F+RQKFMD 
Sbjct: 58   YDRRSSRKNSKEQQEFKDVFEVFEASKVE--GRSCSSRGNANSKLSDAEMAFVRQKFMDA 115

Query: 825  KRFSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPP-------- 980
            KR STDE+   SKEFHDALEVLDSN ++LLKFLQ+PDSLF KHLHDL+G PP        
Sbjct: 116  KRLSTDERLQDSKEFHDALEVLDSNKDLLLKFLQQPDSLFAKHLHDLQGGPPSRCGHIAS 175

Query: 981  -KSSNDQSLENRDVERKIEWKDATDYPRKH-------------SHSHNEPGVQISNKVSK 1118
             KSS  Q  EN D    + W    + PRK+             SHS +      S K S 
Sbjct: 176  MKSSEAQRYENID----LGWTAVRETPRKNNCKSPQEHRDSFSSHSDSRHAGHSSLKSSI 231

Query: 1119 SQLNEKDESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSENLEL 1298
            +    K+ES + PTRIV+LK NL K                     RKH E+ S  N E 
Sbjct: 232  NLSEVKNESSIPPTRIVVLKPNLGKMLNGTKTISSPCSSHASMLDGRKHAEFPSIRNRE- 290

Query: 1299 FSKVRERRNSSIELELMKHKVRGSREVAKEITRQMRR-----SVR-----LRGYSGDESS 1448
             ++ R R+NS  +   ++HK R SREVAKEITRQMR      SVR     L+GY+GDESS
Sbjct: 291  -TESRGRKNSQDKDGHLRHKSRESREVAKEITRQMRNNFSTGSVRFSSSGLKGYAGDESS 349

Query: 1449 YTMSENDSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQ 1628
             +MSEN+SANESE+M+  SRH +                      KKRLSERW+MTHK Q
Sbjct: 350  CSMSENESANESEVMSVASRHSFHLNNHSRPSSSCSTESTVSREAKKRLSERWKMTHKSQ 409

Query: 1629 EVGMVSRSSTLGEM 1670
            E+G+VSR +TL EM
Sbjct: 410  EMGVVSRGNTLAEM 423


>ref|XP_004305226.1| PREDICTED: uncharacterized protein LOC101298051 [Fragaria vesca
            subsp. vesca]
          Length = 988

 Score =  299 bits (765), Expect = 3e-78
 Identities = 198/459 (43%), Positives = 257/459 (55%), Gaps = 29/459 (6%)
 Frame = +3

Query: 381  TEEDLLKCELNRDFSKQSSGTPMKKLLAEELSKETESKRRPPSVIARLMGLDTLXXXXXX 560
            T ED L  EL    SKQ+ G P+KKLLAEE+ +ETES+RR PSVIA+LMGLD +      
Sbjct: 56   TGEDPLTFELGWRSSKQAGGAPIKKLLAEEMLRETESRRRSPSVIAKLMGLDGMPPQQPI 115

Query: 561  XXXXXX-FSKDYQQRTASIGIGFLEKSTPYEGRSFKMCNKERQEFKDVFEVLETSKDKKH 737
                     ++  QRT S           Y+ RS +  +KE+QEFKDVFEVLETSK +  
Sbjct: 116  AHKQQKGIPENRHQRTRSAEKEH-RSGVCYDHRSSRKNSKEQQEFKDVFEVLETSKVESC 174

Query: 738  GNSLVQKGMENTKLSETKMDFIRQKFMDVKRFSTDEKFLQSKEFHDALEVLDSNTEVLLK 917
              S   +   NTKLS+ +M F+RQKFMD KR STDEK   SKEFHDALEVLDSN ++LLK
Sbjct: 175  SYS--SRAAANTKLSDAEMAFVRQKFMDAKRLSTDEKLQDSKEFHDALEVLDSNKDLLLK 232

Query: 918  FLQEPDSLFTKHLHDLKGIPP---------KSSNDQSLENRDV----ERKIEWKDATDYP 1058
            FLQ+PDSLFTKHLHDL   P          KSS  Q  E  D+     R+   ++    P
Sbjct: 233  FLQQPDSLFTKHLHDLHSGPQSHCGRVASMKSSEAQKYEKIDLGWTSARESPLRNYCKSP 292

Query: 1059 RKH-----SHSHNEPGVQISNKVSKSQLNEKDESCLLPTRIVILKSNLRKAXXXXXXXXX 1223
            ++H     S+S +    + S K S+ +   K E+ + PTRIV+LK NL K          
Sbjct: 293  QRHRDSFSSYSDSRHATRYSLK-SQYRPEAKHETAITPTRIVVLKPNLGKILNATKTISS 351

Query: 1224 XXXXGGLHSGYRKHDEYRSSENLELFSKVRERRNSSIELELMKHKVRGSREVAKEITRQM 1403
                    S  R   ++ +  N E+      ++N        +HK R SREVAKEITRQM
Sbjct: 352  PCSSQASMSVCRNRSDFPNIGNREV--DAWGKKNFPDNEGQSRHKSRESREVAKEITRQM 409

Query: 1404 RRSVRL----------RGYSGDESSYTMSENDSANESEIMTPTSRHFYDRKXXXXXXXXX 1553
            R+++ +          +GY+GD+SS +MSEN+S NESE+++  S+ F DR          
Sbjct: 410  RKNISMGSVQISSSGFKGYAGDDSSCSMSENESGNESEVISVASKQFSDRHNHSRRSSTC 469

Query: 1554 XXXXXXXXXXKKRLSERWRMTHKFQEVGMVSRSSTLGEM 1670
                      KKRLSERW+MTHK QE+G+ SR +TL EM
Sbjct: 470  SAESSVSREAKKRLSERWKMTHKSQEIGVASRGNTLAEM 508


>ref|XP_007027126.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508715731|gb|EOY07628.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 991

 Score =  295 bits (755), Expect = 4e-77
 Identities = 203/491 (41%), Positives = 263/491 (53%), Gaps = 36/491 (7%)
 Frame = +3

Query: 306  GNSQVRKWVHTSKVLXXXXXXXXXXTEEDLLKCELNRDFSKQSSGTPMKKLLAEELSKET 485
            GN Q++K    SK+           T+ED L  EL+   SKQS+GTPMKKLLA+E+SKE 
Sbjct: 30   GNRQLQKQRKFSKLASDSSSCGTDSTDEDQLTFELSWRSSKQSTGTPMKKLLAQEMSKEN 89

Query: 486  ESKRRPPSVIARLMGLDTLXXXXXXXXXXXXF-SKDYQQRTASIGIGFLEKSTPYEGRSF 662
            ES+RR PSVIARLMGLD L              SK+  Q+  S           Y  RS 
Sbjct: 90   ESRRRQPSVIARLMGLDGLPPQQPGHKQQKRTESKEKVQKGGSF----------YSRRSS 139

Query: 663  KMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVKRFSTD 842
            +  +KE QEFKDVFEVL+ SK +    S   +G  N+KLS+ ++ F++QKFM+ KR STD
Sbjct: 140  RKSSKEEQEFKDVFEVLDASKVET--GSYSSQGTANSKLSDAEVAFVQQKFMEAKRLSTD 197

Query: 843  EKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGI---------------- 974
            EK   S+EF+DALEVLDSNT++LLKFLQ+PDSLFTKHLHDL+G                 
Sbjct: 198  EKLQDSEEFNDALEVLDSNTDLLLKFLQQPDSLFTKHLHDLQGAHDLQGAQPQSRCGRIS 257

Query: 975  PPKSSNDQSLEN----RDVERKIEWKDATDYPRKH-----SHSHNEPGVQISNKVSKSQL 1127
              KSS+  + EN    R   R+ + K  +  P+ H     SHS          K  K QL
Sbjct: 258  AMKSSHTLTNENGHLGRRAGRETQCKHCSKSPQGHREDLLSHSCGRYAAHNLLKSPKVQL 317

Query: 1128 NEKDESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSENLELFSK 1307
             EK E  + PTRIV+LK NL K+                 S      E    EN E  ++
Sbjct: 318  EEKQEPAVAPTRIVVLKPNLGKSLNSMRTASSPCSSHHFPSDCTGQSEILGIENRE--AE 375

Query: 1308 VRERRNSSIELELMKHKVRGSREVAKEITRQMRRSV----------RLRGYSGDESSYTM 1457
            +  ++    ++   +H  R SRE+AKEITR+M+ S           R RGY+GDESS  +
Sbjct: 376  IWGKKKVHQDVGFSRHNSRESREMAKEITRRMKNSFSNGSMKFSTSRFRGYAGDESSCDV 435

Query: 1458 SENDSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQEVG 1637
            S ++SAN+S++ T + R    R                    KKRLSERW++TH  QE+ 
Sbjct: 436  SGSESANDSDVTTVSYRDNIGRNKKHRRSSSRSSESSVSREAKKRLSERWKLTHGSQELL 495

Query: 1638 MVSRSSTLGEM 1670
            MVSR STLGEM
Sbjct: 496  MVSRGSTLGEM 506


>emb|CBI32667.3| unnamed protein product [Vitis vinifera]
          Length = 867

 Score =  277 bits (709), Expect = 9e-72
 Identities = 185/434 (42%), Positives = 229/434 (52%), Gaps = 26/434 (5%)
 Frame = +3

Query: 447  MKKLLAEELSKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGF 626
            MKKLLA+E+SKE E K+R PSVIARLMGLD L              +++QQRT ++    
Sbjct: 1    MKKLLAKEMSKEAEPKKRSPSVIARLMGLDGLPPQQPIHKQQKKLMENHQQRTETVERA- 59

Query: 627  LEKSTPYEGRSF--KMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDF 800
             E    + G     K  +KE++EFKDVFEVL   K +     +  +G  N+KL+E +  F
Sbjct: 60   -EGGGTFYGPQLHRKKNSKEQEEFKDVFEVLVAPKGESDCYQVEGQGTTNSKLTEAEKAF 118

Query: 801  IRQKFMDVKRFSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPP 980
            IRQKFMD KR STDEK   S+EFHDALEVLDSN ++LLKFLQEPDSLFTKHL DL+G+PP
Sbjct: 119  IRQKFMDAKRLSTDEKLQDSQEFHDALEVLDSNKDLLLKFLQEPDSLFTKHLQDLQGVPP 178

Query: 981  ----------KSSNDQSLENRDV----ERKIEWKDATDYPRKHSHSHNEPGVQISNKVSK 1118
                      KSSN    EN       +R    K+    P+KH   H             
Sbjct: 179  QPHCRRITVSKSSNSPKYENNATGWKSKRGTSRKNDISSPQKHHDDH------------- 225

Query: 1119 SQLNEKDESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSENLEL 1298
                 +DE+ +LPTRIV+LK NL K                  S   KH    S  N E 
Sbjct: 226  ---FRRDETSVLPTRIVVLKPNLGKVLSSSKSISSPRSSYDFLSDCGKHTGSMSIRNKE- 281

Query: 1299 FSKVRERRNSSIELELMKHKVRGSREVAKEITRQMRRSVR----------LRGYSGDESS 1448
                      S E+   +HK R SRE+AKE+TR+MR S+            RGY+GDESS
Sbjct: 282  -----AELQGSNEMGFSRHKSRESREIAKEVTRRMRNSITNGSMNFSSAGFRGYAGDESS 336

Query: 1449 YTMSENDSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQ 1628
              MS NDS +E E     SR+ +DR                    +KRLSERW+MT +FQ
Sbjct: 337  -CMSGNDSLSEPEETVLISRNSFDRSSRYRASSSHSTESSVSREARKRLSERWKMTRRFQ 395

Query: 1629 EVGMVSRSSTLGEM 1670
            EVG V+R STL EM
Sbjct: 396  EVGAVNRGSTLAEM 409


>ref|XP_002308481.2| hypothetical protein POPTR_0006s23020g [Populus trichocarpa]
            gi|550336905|gb|EEE92004.2| hypothetical protein
            POPTR_0006s23020g [Populus trichocarpa]
          Length = 907

 Score =  275 bits (703), Expect = 4e-71
 Identities = 179/427 (41%), Positives = 240/427 (56%), Gaps = 27/427 (6%)
 Frame = +3

Query: 471  LSKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGFLEKSTPYE 650
            +S++++SKRR PSVIARLMGLD L              ++Y QR   +       +  Y 
Sbjct: 1    MSRKSDSKRRSPSVIARLMGLDGLPPQQSSHKQQKKSLENYTQRMV-LTEKAQRNNASYG 59

Query: 651  GRSFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVKR 830
             RS +  +K+ QEFKDVFEVL+ SK     +S   +G  ++KL+  +M FI+QKFMD KR
Sbjct: 60   RRSSRKSSKDEQEFKDVFEVLDPSK--MDSSSYSSRGTAHSKLTAAEMAFIQQKFMDAKR 117

Query: 831  FSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPPKS-------- 986
             STDEK   S+EFHDA+E LDSN ++LLK+LQ+PDSLFTKHLHDL+G+P +S        
Sbjct: 118  LSTDEKLQNSREFHDAIEDLDSNKDLLLKYLQQPDSLFTKHLHDLQGVPSQSHCGQTRIS 177

Query: 987  ----SNDQSLENRDVERKIEWKDATDYPRKH-----SHSHNEPGVQISNKVSKSQLNEKD 1139
                S+     +  +   IE + A    RK+     SHSH + G Q   ++SK QL++KD
Sbjct: 178  DMKPSHPPHCGSSGLGSNIERQTALKNRRKNHVDPASHSHGKHGAQNPVELSKIQLDQKD 237

Query: 1140 ESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSENLELFSKVRER 1319
            ES +LPTRIV+LK NL +                     R+H E    +N E+ S  +++
Sbjct: 238  ESAILPTRIVVLKPNLGRTQNSTKNTSSPQYSRASPLDCRQHTEPPGIKNREVVSYGKKK 297

Query: 1320 RNSSIELELMKHKVRGSREVAKEITRQMRRSV----------RLRGYSGDESSYTMSEND 1469
                 +    ++K R SRE+AKEITRQMR S              GY+ DESS  MSEN+
Sbjct: 298  FPD--DAGPSRYKSRESREIAKEITRQMRESFGNGSMSFSTPAFIGYARDESSPDMSENE 355

Query: 1470 SANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQEVGMVSR 1649
            SANESE  T TSR+  D                     +KRLSERW+MTHK  ++G+VSR
Sbjct: 356  SANESEETTVTSRNSVDWSNRYRPSSSCSTESSVSREARKRLSERWKMTHKSVDMGIVSR 415

Query: 1650 SSTLGEM 1670
            S+TLGEM
Sbjct: 416  SNTLGEM 422


>ref|XP_006429000.1| hypothetical protein CICLE_v10011022mg [Citrus clementina]
            gi|557531057|gb|ESR42240.1| hypothetical protein
            CICLE_v10011022mg [Citrus clementina]
          Length = 909

 Score =  271 bits (692), Expect = 8e-70
 Identities = 183/430 (42%), Positives = 238/430 (55%), Gaps = 30/430 (6%)
 Frame = +3

Query: 471  LSKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGFLEKSTPYE 650
            +S+ETESKRR PSVIARLMG D L             +++ Q  TAS      ++ST   
Sbjct: 1    MSRETESKRRSPSVIARLMGFDGLPATQAAHKQHKRSAENNQPWTASAEKA--QRSTTSS 58

Query: 651  GR-SFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVK 827
            GR SF+  +KE QEFKDVFEVL+ SK +    +  ++   N+KLSE +M FIRQKFM+ K
Sbjct: 59   GRRSFRKSSKEEQEFKDVFEVLDASKME----TCSKQESTNSKLSEAEMVFIRQKFMEAK 114

Query: 828  RFSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPPKS------- 986
            R STDE+F  SKEF DALEVLDSN ++LLKFLQ+PDSLFTKHLHDL G   +S       
Sbjct: 115  RLSTDERFQDSKEFQDALEVLDSNKDLLLKFLQQPDSLFTKHLHDL-GASSQSHCGHISA 173

Query: 987  ---SNDQSLENRDV----ERKIEWKDATDYPRKH-----SHSHNEPGVQISNKVSKSQLN 1130
               S  +  E+ DV    ER  + K+     ++H      HS +    Q  NK +  QL 
Sbjct: 174  MTPSLARQCESSDVGWKAERGTQCKNQRKSSQEHPDGLSRHSSSGHAAQSLNKPAIVQLE 233

Query: 1131 EKDESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSENLELFSKV 1310
             K++  +LPTRIV+LK N+ +               G  S  RKH E           + 
Sbjct: 234  GKEDHSVLPTRIVVLKPNVGRVQAAARTVSSPRSSHGYPSDSRKHTELPGPGMENREPET 293

Query: 1311 RERRNSSIELELMKHKVRGSREVAKEITRQMR----------RSVRLRGYSGDESSYTMS 1460
             E++    ++   +HK R SRE+AKEITRQMR           S   +GY+GDESS   S
Sbjct: 294  WEKKKFPDDVGFSRHKSRESRELAKEITRQMRDNLSSVSMKFSSTGFKGYAGDESSSNFS 353

Query: 1461 ENDSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQEVGM 1640
             N+SANE EI T TS+  + R                    KKRLSERW+M+HK QE+G+
Sbjct: 354  GNESANELEIKTMTSKDGFIRHRRSRSSSSHSSESSVSREAKKRLSERWKMSHKSQELGV 413

Query: 1641 VSRSSTLGEM 1670
            ++R +TLGEM
Sbjct: 414  INRGNTLGEM 423


>gb|EYU41671.1| hypothetical protein MIMGU_mgv1a001075mg [Mimulus guttatus]
          Length = 895

 Score =  257 bits (657), Expect = 1e-65
 Identities = 175/441 (39%), Positives = 238/441 (53%), Gaps = 29/441 (6%)
 Frame = +3

Query: 435  SGTPMKKLLAEELSKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASI 614
            +G PMKKLLAEE+SKE ESKRR PSVIARLMGL+ L            FS+   QR  S 
Sbjct: 15   AGKPMKKLLAEEMSKEVESKRRTPSVIARLMGLEGLPSPRHVHRQPKRFSESLPQRNVSS 74

Query: 615  GIGFLEKSTPYEGRSFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKM 794
             I     S P+EGRS +  + E+QEFKDV+E LE S       S   +   ++ L++ +M
Sbjct: 75   NIQ--RNSQPHEGRSNRRRSTEQQEFKDVYEDLEASHVANRRCS--SRWSASSILTKPEM 130

Query: 795  DFIRQKFMDVKRFSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKG- 971
              I+QKF+D KR STDEK   SKE  D LE+LDSN ++LL+FL +P+SLF KHLHD +  
Sbjct: 131  ALIQQKFLDAKRLSTDEKLQDSKELDDTLEMLDSNKDLLLRFLWQPNSLFMKHLHDGQVD 190

Query: 972  ---------IPPKSSNDQSLENRDVERKIEWKDATDYPRKHSHSH--------NEPGVQI 1100
                        K SN +  EN+    K+   +     + H+ SH         EP  + 
Sbjct: 191  HGNSLGSHIAVLKPSNSEKYENK---AKVFGSEKNTSSKHHATSHVKRQDGLLLEPHSRR 247

Query: 1101 SNKVSK--SQLNEKDESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEY 1274
                S+  +QL+ +    +LPTRIV+LK NL K               G H   +K  E+
Sbjct: 248  RGHTSRNSTQLDAEKGENILPTRIVVLKPNLGKTQKAATSNSSPDFSSGYHPSLKKIKEF 307

Query: 1275 RSSENLELFSKVRERRNSSIELELMKHKVRGSREVAKEITRQMR---------RSVRLRG 1427
            +S    E  ++ R R++SS ++ L K   + +RE+AKEIT +MR         +S   RG
Sbjct: 308  QSVGGNE--TESRRRKDSSHKMGLSKSMSKEAREIAKEITTRMRDGSDETMDAKSSGFRG 365

Query: 1428 YSGDESSYTMSENDSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERW 1607
            Y GDESSY  +E+DS NESE+   + R  +D                     KKRLSERW
Sbjct: 366  YIGDESSYDPNESDSGNESEVF-KSCRKSFDGSNLCRYPSPSLFETSVNREAKKRLSERW 424

Query: 1608 RMTHKFQEVGMVSRSSTLGEM 1670
            +M+HK+Q++ M+S+ STLGEM
Sbjct: 425  KMSHKYQDLEMISKGSTLGEM 445


>ref|XP_002322831.2| hypothetical protein POPTR_0016s08100g [Populus trichocarpa]
            gi|550321088|gb|EEF04592.2| hypothetical protein
            POPTR_0016s08100g [Populus trichocarpa]
          Length = 903

 Score =  251 bits (641), Expect = 7e-64
 Identities = 176/427 (41%), Positives = 236/427 (55%), Gaps = 27/427 (6%)
 Frame = +3

Query: 471  LSKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGFLEKSTPYE 650
            +S+E+ES RR PSVIARLMGLD L              ++Y QR     I    + + Y 
Sbjct: 1    MSRESES-RRSPSVIARLMGLDGLPLQQSSHKHPKKSLENYTQRMVLAEIAQRNRGS-YG 58

Query: 651  GRSFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVKR 830
              S +  +K+ QEFKDVFEVL+TSK     +S    G  +++L+  +M FI+QKF DVK 
Sbjct: 59   RWSSRKSSKDEQEFKDVFEVLDTSK--MGSSSYSSCGNGHSELTAAEMAFIQQKFTDVKW 116

Query: 831  FSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPP---------- 980
             STDEK   SKEFHDA+E LDSN ++LLK+LQ+PDSLFTKHLHDL+GIPP          
Sbjct: 117  LSTDEKLQNSKEFHDAIEDLDSNKDLLLKYLQQPDSLFTKHLHDLQGIPPQSHCGRTHIP 176

Query: 981  --KSSNDQSLENRDVERKIEWKDATDYPRK-----HSHSHNEPGVQISNKVSKSQLNEKD 1139
              KSS      +  +   IE ++     RK      S+S+++   Q   K+SK QL++KD
Sbjct: 177  AKKSSYPAHCGSIGLGCNIERENPLKNRRKPHVDPSSYSYSKLEAQNPVKLSKVQLDQKD 236

Query: 1140 ESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSENLELFSKVRER 1319
            ES +LPTRIV+LK N+ K                  S  RKH E  S +  E+ S    +
Sbjct: 237  ESAILPTRIVVLKPNIGKMQNSKKNTSSSQSSHASPSDCRKHTETPSIKKKEVVS--WGK 294

Query: 1320 RNSSIELELMKHKVRGSREVAKEITRQMRRSV----------RLRGYSGDESSYTMSEND 1469
            ++   +    ++K R SRE+A+EITR+MR++             RGY GDESS   +EN+
Sbjct: 295  KSFPDDAGPSRYKSRESREIAREITRKMRKNFINSSMNFSTSGFRGYVGDESS---TENE 351

Query: 1470 SANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQEVGMVSR 1649
            SANESE     SR+  D                     +KRLSERW++THK   +G+VS+
Sbjct: 352  SANESEETAVNSRNSIDWSNRSIPSSSCSNESSVSREARKRLSERWKLTHKSVNMGIVSQ 411

Query: 1650 SSTLGEM 1670
            SSTLGEM
Sbjct: 412  SSTLGEM 418


>ref|XP_002534051.1| conserved hypothetical protein [Ricinus communis]
            gi|223525922|gb|EEF28329.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 779

 Score =  250 bits (638), Expect = 2e-63
 Identities = 165/394 (41%), Positives = 223/394 (56%), Gaps = 25/394 (6%)
 Frame = +3

Query: 405  ELNRDFSKQSSGTPMKKLLAEELSKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFS 584
            EL    SKQ++GTP+KKLLAEE+S+ETES+++ P VIARLMG D L             S
Sbjct: 4    ELGWRSSKQATGTPIKKLLAEEMSRETESRKKSPGVIARLMGFDGLPPQQLAHKQQKRSS 63

Query: 585  KDYQQRTASIGIGFLEKSTPYEGRSFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGM 764
             +Y QR A +    L+ ST    RS +  +KE QEFKDVFEVL+  K + + +SL  +G 
Sbjct: 64   DNYLQRVA-LSERSLKSSTSCSRRSPRRSSKEEQEFKDVFEVLDMGKMETNTSSL--QGT 120

Query: 765  ENTKLSETKMDFIRQKFMDVKRFSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLF 944
             N+KL+E +M F++QKF+DV R STDEKF  SKEFHDA++ LDSN ++LLKFL++PDSLF
Sbjct: 121  ANSKLTEAEMAFVKQKFLDVTRLSTDEKFHDSKEFHDAVDDLDSNKDLLLKFLEQPDSLF 180

Query: 945  TKHLHDLKGIPP----------KSSNDQSLENRDVERKIEWKDATDYPRKH-----SHSH 1079
             +HLHDL+  P           K S     E R +    E +    Y RK+     SHS 
Sbjct: 181  KRHLHDLRAAPSSPHCSHVSGMKLSRASEYEGRGLGCNREIETTWKYGRKNHSDPLSHSC 240

Query: 1080 NEPGVQISNKVSKSQLNEKDESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYR 1259
            ++       K SK Q+  K +S ++PTRIV+LK N  K                  S   
Sbjct: 241  SKRAAHDPPKSSKIQVESKVDSSVIPTRIVVLKPNFGKVQNASRTVSSPRTSYDFLSDCD 300

Query: 1260 KHDEYRSSENLELFSKVRERRNSSIELELMKHKVRGSREVAKEITRQMRRSVR------- 1418
            +  +  S+      S++   R    ++ L ++K R SRE+AKEITRQMR S+        
Sbjct: 301  RQMDLPSTNG---ESELCGSRRFPNDVGLPRYKSRESREIAKEITRQMRNSIESGSLNPS 357

Query: 1419 ---LRGYSGDESSYTMSENDSANESEIMTPTSRH 1511
                RGY+GDESS   S+N+SANES+  T  SR+
Sbjct: 358  TSGFRGYAGDESSSNRSDNESANESDGPTVISRN 391


>ref|XP_006594084.1| PREDICTED: uncharacterized protein LOC100794819 isoform X2 [Glycine
            max]
          Length = 941

 Score =  239 bits (609), Expect = 4e-60
 Identities = 168/442 (38%), Positives = 223/442 (50%), Gaps = 26/442 (5%)
 Frame = +3

Query: 423  SKQSSGTPMKKLLAEELSKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQR 602
            SKQ  GTP+KKLLAEE+S + ESKRR P VIARLMGLD L             S++ QQ+
Sbjct: 64   SKQLFGTPIKKLLAEEMSPKAESKRRSPGVIARLMGLDGLPFQQPINKQHKALSEN-QQK 122

Query: 603  TASIGIGFLEKSTPYEGRSFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLS 782
            TA +      K  PY+G+S +  +K+ QEFKDVFEV E  K + H      +G  +   +
Sbjct: 123  TAQLE-RTRGKGVPYDGQSSRRSSKDHQEFKDVFEVSEIPKVESH--RYPSQGCADLMTT 179

Query: 783  ETKMDFIRQKFMDVKRFSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHD 962
            + ++ FI QKFMD KR +T +    SK+F D LEVLDSN ++LLK+ + PDSLF KHL+D
Sbjct: 180  DAEISFIEQKFMDAKRLATHQDLQSSKDFCDTLEVLDSNKDLLLKYFKRPDSLFKKHLND 239

Query: 963  LKGIPPKS------SNDQSLENRDVERKIEWKDATDYPRKHSHSHNEPG----------V 1094
            L+  P +S        D      D   + +W+       + SH  +  G          +
Sbjct: 240  LQAAPVQSHYGYVKPMDIEKYEHDFNLRSDWEKTRSNYNRSSHEKHHDGYPCHFDKRHVM 299

Query: 1095 QISNKVSKSQLNEKDESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEY 1274
              S K SK Q   K E   + ++IV+LK NL K                  +G     E 
Sbjct: 300  HSSPKSSKLQFKAKYEQKAVTSQIVLLKPNLGKVQNGTRIVSSPCSSHNFLAGCENDTEL 359

Query: 1275 RSSENLELFSKVRERRNSSIELELMKHKVRGSREVAKEITRQMRRSV----------RLR 1424
              + NL      R  R  S E          SRE+AKE+TRQM+ S+          R+R
Sbjct: 360  CQATNLP--ESARSWRQDSFE----------SREIAKEVTRQMKISLNNGSMKLSTSRIR 407

Query: 1425 GYSGDESSYTMSENDSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSER 1604
            GY+GD+SS ++S N+S  ESE  T T  +  D                     KKRLSER
Sbjct: 408  GYAGDDSSCSVSGNESPEESEETTATLGNSIDLN-NRSRRSSRSSESSVSREAKKRLSER 466

Query: 1605 WRMTHKFQEVGMVSRSSTLGEM 1670
            W+MTHK QE+  +SRSSTL EM
Sbjct: 467  WKMTHKSQELQGISRSSTLAEM 488


>ref|XP_003541395.1| PREDICTED: uncharacterized protein LOC100794819 isoform X1 [Glycine
            max]
          Length = 942

 Score =  239 bits (609), Expect = 4e-60
 Identities = 168/442 (38%), Positives = 223/442 (50%), Gaps = 26/442 (5%)
 Frame = +3

Query: 423  SKQSSGTPMKKLLAEELSKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQR 602
            SKQ  GTP+KKLLAEE+S + ESKRR P VIARLMGLD L             S++ QQ+
Sbjct: 65   SKQLFGTPIKKLLAEEMSPKAESKRRSPGVIARLMGLDGLPFQQPINKQHKALSEN-QQK 123

Query: 603  TASIGIGFLEKSTPYEGRSFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLS 782
            TA +      K  PY+G+S +  +K+ QEFKDVFEV E  K + H      +G  +   +
Sbjct: 124  TAQLE-RTRGKGVPYDGQSSRRSSKDHQEFKDVFEVSEIPKVESH--RYPSQGCADLMTT 180

Query: 783  ETKMDFIRQKFMDVKRFSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHD 962
            + ++ FI QKFMD KR +T +    SK+F D LEVLDSN ++LLK+ + PDSLF KHL+D
Sbjct: 181  DAEISFIEQKFMDAKRLATHQDLQSSKDFCDTLEVLDSNKDLLLKYFKRPDSLFKKHLND 240

Query: 963  LKGIPPKS------SNDQSLENRDVERKIEWKDATDYPRKHSHSHNEPG----------V 1094
            L+  P +S        D      D   + +W+       + SH  +  G          +
Sbjct: 241  LQAAPVQSHYGYVKPMDIEKYEHDFNLRSDWEKTRSNYNRSSHEKHHDGYPCHFDKRHVM 300

Query: 1095 QISNKVSKSQLNEKDESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEY 1274
              S K SK Q   K E   + ++IV+LK NL K                  +G     E 
Sbjct: 301  HSSPKSSKLQFKAKYEQKAVTSQIVLLKPNLGKVQNGTRIVSSPCSSHNFLAGCENDTEL 360

Query: 1275 RSSENLELFSKVRERRNSSIELELMKHKVRGSREVAKEITRQMRRSV----------RLR 1424
              + NL      R  R  S E          SRE+AKE+TRQM+ S+          R+R
Sbjct: 361  CQATNLP--ESARSWRQDSFE----------SREIAKEVTRQMKISLNNGSMKLSTSRIR 408

Query: 1425 GYSGDESSYTMSENDSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSER 1604
            GY+GD+SS ++S N+S  ESE  T T  +  D                     KKRLSER
Sbjct: 409  GYAGDDSSCSVSGNESPEESEETTATLGNSIDLN-NRSRRSSRSSESSVSREAKKRLSER 467

Query: 1605 WRMTHKFQEVGMVSRSSTLGEM 1670
            W+MTHK QE+  +SRSSTL EM
Sbjct: 468  WKMTHKSQELQGISRSSTLAEM 489


>ref|XP_006345668.1| PREDICTED: uncharacterized protein LOC102591321 isoform X1 [Solanum
            tuberosum]
          Length = 991

 Score =  237 bits (605), Expect = 1e-59
 Identities = 177/499 (35%), Positives = 250/499 (50%), Gaps = 41/499 (8%)
 Frame = +3

Query: 297  LFPGNSQVRKWVHTSKVLXXXXXXXXXXTEEDLLKCELNRDFSKQSSGTPMKKLLAEELS 476
            +  G+   +K V T KV           T  D++  +L +  SK+ +GTP+K LLAEE++
Sbjct: 15   IMEGSKLGKKQVATPKVTLNSRSYCDETTRGDMIMHDLGKISSKRVTGTPIKNLLAEEMA 74

Query: 477  KETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGFLEKSTPYEGR 656
            KE ESK+RP S++ARLMGL+ +            FS   Q R   I      +   ++ +
Sbjct: 75   KEGESKKRPTSIVARLMGLEGMPSPQHIGRQQRRFSDSCQHRNEQIDSR--RRKQLFDEQ 132

Query: 657  SFKMCNKERQEFKDVFEVLETSK--DKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVKR 830
            S K  + E QEFKDV+E LE S   +++H +   + G    + +   M  I+QKFMD KR
Sbjct: 133  SSKRSSMEHQEFKDVYEDLEASHVGNRRHSSRWNETG----RFATPDMALIQQKFMDAKR 188

Query: 831  FSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKG------------I 974
             STDE+F  SKEF+D LE LDSN E+LLK+LQEPDSLF KHL DL+             +
Sbjct: 189  LSTDERFQNSKEFNDTLEALDSNKELLLKYLQEPDSLFVKHLQDLQVESASSTCSRIAVL 248

Query: 975  PPKSS--------NDQSLENRDVERKI----EWKDATDYPRKHSHS-HNEPGVQISNKVS 1115
             P +S        + +S+     ++ I    E  D      +H HS HN       ++ S
Sbjct: 249  KPSNSVKYEGSAKSSKSVRGGSCKQSISLQKERLDGLLLQSQHRHSGHN-------SQKS 301

Query: 1116 KSQLNEKDESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEY-RSSENL 1292
               L+E  E  +LPTRIV+LK NL                   H   RKH +Y R+S   
Sbjct: 302  SPVLSEGKEENILPTRIVVLKPNLGITQSNIASVPH-------HPDVRKHAQYHRASPG- 353

Query: 1293 ELFSKVRERRNSSIELELMKHKVRGSREVAKEITRQMR-------------RSVRLRGYS 1433
               +   E +NSS  + + + K   +R++AKEITR+MR             R   ++GY+
Sbjct: 354  --GAGEEEEKNSSKNMGISRPKSNEARDIAKEITRRMRDSFGPFDGRDAYFRGSGVKGYA 411

Query: 1434 GDESSYTMSENDSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRM 1613
            GDESS  + E+DS  +S+I T + R    R                    KKRLSERW+M
Sbjct: 412  GDESSCDVYESDSTGDSDITTLSCRKSSGR-GNLKKSSSLGSESSVGREAKKRLSERWKM 470

Query: 1614 THKFQEVGMVSRSSTLGEM 1670
            T  +Q++ M  +SSTLGEM
Sbjct: 471  TQYYQDIEMAGKSSTLGEM 489


>ref|XP_006850705.1| hypothetical protein AMTR_s00034p00242520 [Amborella trichopoda]
            gi|548854374|gb|ERN12286.1| hypothetical protein
            AMTR_s00034p00242520 [Amborella trichopoda]
          Length = 1048

 Score =  237 bits (605), Expect = 1e-59
 Identities = 189/493 (38%), Positives = 245/493 (49%), Gaps = 64/493 (12%)
 Frame = +3

Query: 384  EEDLLKCELNRDFSKQSSGTPMKKLLAEELSKETESKRRPPSVIARLMGLDTLXXXXXXX 563
            EE   + EL R  S + SG PMK LLAEE+SK+TESKRRPPSVIARLMGLD L       
Sbjct: 50   EESSFEHELRRSSSARGSGIPMKMLLAEEMSKDTESKRRPPSVIARLMGLDALPTQQSIS 109

Query: 564  XXXXXFS-------------------KDYQQRTASI-GIGFLEKSTPYEGRSFKMCNKER 683
                  S                   + Y  R  S+    F+ K T     SF+    E+
Sbjct: 110  KHYKKNSEKQALDMPLMKPQKKPFCHQGYHHRQESVCPDEFMSKET----HSFRNHATEQ 165

Query: 684  QEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVKRFSTDEKFLQSK 863
            QEFKDVFEV ETS   K     V K   +TK SE KM+ IRQKFMD KR ST+EK  QSK
Sbjct: 166  QEFKDVFEVWETSDAGK-----VSKQSIHTKHSEKKMELIRQKFMDAKRLSTNEKLRQSK 220

Query: 864  EFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPPK---------SSNDQSLENRD 1016
            EFHDALEVLDSN ++ LKFLQEPDSLFTKHLHDL+  P            S+  +  N D
Sbjct: 221  EFHDALEVLDSNKDLFLKFLQEPDSLFTKHLHDLQSAPEPHWPSHITLLKSSKAAFANND 280

Query: 1017 VE--RKIEW-------------KDATDYPRKHSHSHNEPGVQISNK-VSKSQLNEKDESC 1148
             E   +I W             K +     K    + E     S+K V++S++ EK + C
Sbjct: 281  HEGVSEIYWRHQRQIIVEGKKHKSSRSILEKREVGYKEQWFPSSHKSVTRSRMKEKIDPC 340

Query: 1149 LLPTRIVILKSNL--RKAXXXXXXXXXXXXXGGLHSGYRKH-DEYRSSENLELFSKVRE- 1316
            L+PTRIV+LK +L   ++                +S  RKH    +   + + FS+V   
Sbjct: 341  LVPTRIVVLKPSLVMEESTRGEVLSSPSSSLDPYNSICRKHMSSSQGYSDKDEFSEVCSW 400

Query: 1317 ---RRNSSIELELMKHKVRGSREVAKEITRQMRRS-----VRLRGYSGDESSYTMSENDS 1472
                RN     E+++ + +GSRE+AKEITRQMR+S     + L   + +++SY       
Sbjct: 401  EGVIRNGGRGREIVRDRPKGSREIAKEITRQMRKSMTKDTLNLSSSASNKASYYRPNEKG 460

Query: 1473 ANESE---IMTPTSRHFYD--RKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTH--KFQE 1631
            +   E   + TP S   +D  +K                   KKRLSERW++T     +E
Sbjct: 461  SVVVEPKVLNTPPSETLWDWNKKRLSPNPSNSSTESTVSREAKKRLSERWKITRGGYQEE 520

Query: 1632 VGMVSRSSTLGEM 1670
              +   SSTL EM
Sbjct: 521  EKLRKNSSTLAEM 533


>ref|XP_004494988.1| PREDICTED: uncharacterized protein LOC101494666 [Cicer arietinum]
          Length = 959

 Score =  233 bits (594), Expect = 2e-58
 Identities = 173/490 (35%), Positives = 238/490 (48%), Gaps = 30/490 (6%)
 Frame = +3

Query: 291  LDLFPGNSQVRKWVHTSKVLXXXXXXXXXXTEEDLLKCELNRDFSKQSSGTPMKKLLAEE 470
            L L  GN Q+ +      +            E+D    +     SKQS GTP+KKLLAEE
Sbjct: 13   LHLPQGNEQIHRQRQFPDLSPDSSSSSGGVAEKDSFSFKFGWKSSKQSVGTPIKKLLAEE 72

Query: 471  LSKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDY-QQRTASIGIGFLEKSTPY 647
            +S   ESKRR P VIARLMGLD L              K    ++T S G+         
Sbjct: 73   MSPTAESKRRSPGVIARLMGLDGLPSQQPTNKQHKDPQKAMLSEKTRSRGMA-------N 125

Query: 648  EGRSFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVK 827
            +GRS +  ++++QEFKDVFEV E  K +    S       + K++E +M FI QKFMD K
Sbjct: 126  DGRSSRRSSRDQQEFKDVFEVSEIPKAESGRYSSA-----DLKVNEAEMSFIEQKFMDAK 180

Query: 828  RFSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPPKSSNDQ--- 998
            R +T + F  SK+FHD LEVLDSN ++LLK+ + PDSLF KHL+DL+  P +S +     
Sbjct: 181  RLATYQDFQSSKDFHDTLEVLDSNKDLLLKYFKRPDSLFKKHLNDLQATPLQSHSGHIEP 240

Query: 999  -SLENRDVERKIEWKDATD--------YPRKHSHSH-----NEPGVQISNKVSKSQLNEK 1136
             ++EN   E    W+   +        + +KH + H         +  S + SK      
Sbjct: 241  TNIEN--FEHDFTWRSDRETAQLNYKRFHQKHPNGHPCQFDKRRVMHNSPRSSKHHFKGS 298

Query: 1137 DESCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRS--SENLELFSKV 1310
             E   + T+IV+LK N+ K                  S +  H E+      + EL+ K+
Sbjct: 299  HEQGAVATKIVVLKPNMGKLQTGTRIESSPCSPHNFLSEHGSHAEFSDVRFRDTELYKKI 358

Query: 1311 RERRNSSIELELMKHKVRGSREVAKEITRQMRRSV----------RLRGYSGDESSYTMS 1460
                N        +H    S E+AKE+TRQMR S+          R +GYS ++SS ++S
Sbjct: 359  ----NLPDSARSFRHNSLESMEIAKEVTRQMRNSLNNGCTMSSSSRFKGYSRNDSSSSVS 414

Query: 1461 ENDSANESEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQEVGM 1640
             N+S  ESE +T T    +D                     KKRLSERW+MTHK QEV +
Sbjct: 415  GNESPEESEEITATLGDPFDLN-KRNRRSPRSSGSSVSKEAKKRLSERWKMTHKSQEVQV 473

Query: 1641 VSRSSTLGEM 1670
            VSRSSTL +M
Sbjct: 474  VSRSSTLADM 483


>ref|XP_006403676.1| hypothetical protein EUTSA_v10010116mg [Eutrema salsugineum]
            gi|557104795|gb|ESQ45129.1| hypothetical protein
            EUTSA_v10010116mg [Eutrema salsugineum]
          Length = 858

 Score =  232 bits (591), Expect = 4e-58
 Identities = 166/432 (38%), Positives = 229/432 (53%), Gaps = 24/432 (5%)
 Frame = +3

Query: 447  MKKLLAEELSKETESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGF 626
            MK LLA+E+SK+ ESKRR PS+IARLMGLD L             S   Q +      G 
Sbjct: 1    MKSLLAQEMSKQKESKRRSPSIIARLMGLDVLPPQS---------SPHRQHKPVENQQGR 51

Query: 627  LEKSTPYEG-RSFKMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFI 803
                + Y+G +S +  +K  Q+FKDVFEVL+    + + +  +Q  +    L++ +M FI
Sbjct: 52   SGGGSSYDGYKSLQRSSKGEQKFKDVFEVLDAKMAESNRSLCLQGRVNAANLTQAEMAFI 111

Query: 804  RQKFMDVKRFSTDEKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIP-- 977
            RQKFM+ KR STDEK   SKEF+DALE LDSN ++LLKFLQ PDSLFTKHLHDL+  P  
Sbjct: 112  RQKFMEAKRLSTDEKLRHSKEFNDALEALDSNKDLLLKFLQHPDSLFTKHLHDLQSTPHK 171

Query: 978  PKSSNDQSLENRDVERKIE-------WKDATDYPRKHSHSHNEPGV----QISNKVSKSQ 1124
            P S    SL+  + +R ++        +D+     +  H H   G       S  VS   
Sbjct: 172  PHSGQAPSLKYPNSQRHVDILKTQRVERDSLRKSHRSPHQHGGGGACSSRSHSRHVSYDT 231

Query: 1125 LNEKDE-----SCLLPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSEN 1289
            L+  +E     S L PT+IV+LK NL +                  S     DE+R+   
Sbjct: 232  LDLPNEELAKRSELQPTKIVVLKPNLGEPRYAGRT---------FASPSSSSDEFRADRR 282

Query: 1290 LELFSKVRERRNSSIELELMKHKVRGSREVAKEITRQMRRSVR---LRGYSGDESSYTMS 1460
            L   S    R+ S+ ++ L +   R S E++K ++RQ + S      RGY+GDESS   S
Sbjct: 283  LPCTSN-HGRKKSNEDVRLSRQSSRDSGEMSKIMSRQRKTSFETSGFRGYAGDESS---S 338

Query: 1461 ENDSANESEIMTPTS--RHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQEV 1634
             +DSA+ESE++  TS  R  ++RK                   K+RLSERWR+THK+++ 
Sbjct: 339  GSDSASESELVPVTSRTRTAFNRKNYHRSLPSKSATSSVSREAKRRLSERWRLTHKYEQE 398

Query: 1635 GMVSRSSTLGEM 1670
              +SRS TL EM
Sbjct: 399  IEISRSGTLAEM 410


>ref|XP_007144479.1| hypothetical protein PHAVU_007G159500g [Phaseolus vulgaris]
            gi|561017669|gb|ESW16473.1| hypothetical protein
            PHAVU_007G159500g [Phaseolus vulgaris]
          Length = 947

 Score =  230 bits (587), Expect = 1e-57
 Identities = 171/483 (35%), Positives = 230/483 (47%), Gaps = 27/483 (5%)
 Frame = +3

Query: 303  PGNSQVRKWVHTSKVLXXXXXXXXXXTEEDLLKCELNRDFSKQSSGTPMKKLLAEELSKE 482
            PGN QV +      +            ++D    +     SKQ  GTP+KKLL EE+S +
Sbjct: 17   PGNKQVHRQRLPPNLSPDSCSDGGVVADKDSFSFKFGWRSSKQLLGTPIKKLLDEEMSPK 76

Query: 483  TESKRRPPSVIARLMGLDTLXXXXXXXXXXXXFSKDYQQRTASIGIGFLEKSTPYEGRSF 662
            +++KRR P VIARLMGLD L             S++ +        G   K  PY+G S 
Sbjct: 77   SDTKRRSPGVIARLMGLDGLPFQQPISKQHKGLSENQKTPQLQKTRG---KGVPYDGGSS 133

Query: 663  KMCNKERQEFKDVFEVLETSKDKKHGNSLVQKGMENTKLSETKMDFIRQKFMDVKRFSTD 842
            +   +++QEFKDVFEV E  K +   +     G  + K ++ +M FI QKFMD KR +T 
Sbjct: 134  RRGLRDQQEFKDVFEVSEIPKVES--SRYPSPGCVDLKANDAEMSFIEQKFMDAKRLATH 191

Query: 843  EKFLQSKEFHDALEVLDSNTEVLLKFLQEPDSLFTKHLHDLKGIPPKSS----NDQSLEN 1010
            +    SK+F D LEVLDSN ++LLK+ + PDSLF KHL+DL+  P KS         +E 
Sbjct: 192  QDLQSSKDFRDTLEVLDSNKDLLLKYFKRPDSLFKKHLNDLQADPVKSHYGDVETMDIEK 251

Query: 1011 RDVERKIEW-----KDATDYPRKHS--------HSHNEPGVQISNKVSKSQLNEKDESCL 1151
             + E  + W     K   +Y R H         H      +  S + SK Q   + E   
Sbjct: 252  YEHEHDLSWRSDREKTGLNYNRSHENHLDGYPCHFDKRHVMHSSPRSSKLQFQGRHEQDA 311

Query: 1152 LPTRIVILKSNLRKAXXXXXXXXXXXXXGGLHSGYRKHDEYRSSENLELFSKVRERRNSS 1331
            +PT+IV+LK NL K                L SG  K  E     N+      R  R  S
Sbjct: 312  VPTKIVLLKPNLGKVQNGTRIVSSPCSHNFL-SGREKDTELCQVTNMP--ESARSWRQDS 368

Query: 1332 IELELMKHKVRGSREVAKEITRQMRRSV----------RLRGYSGDESSYTMSENDSANE 1481
             E          SRE+AKEITRQMR S+          R+ GY+GD+SS + S N+S + 
Sbjct: 369  FE----------SREIAKEITRQMRNSLNNSGMMLSTSRIAGYAGDDSSCSFSGNESPDV 418

Query: 1482 SEIMTPTSRHFYDRKXXXXXXXXXXXXXXXXXXXKKRLSERWRMTHKFQEVGMVSRSSTL 1661
            S  +T    + +D                     KKRLSERW+MTHK QE+  +SRSSTL
Sbjct: 419  SGEITAILGNSFDLN-NRTRRSSRSGESSVSKEAKKRLSERWKMTHKSQELQGISRSSTL 477

Query: 1662 GEM 1670
             EM
Sbjct: 478  AEM 480

