BLASTX nr result

ID: Akebia25_contig00005560 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00005560
         (1822 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267713.2| PREDICTED: uncharacterized protein LOC100258...   268   5e-69
ref|XP_006483756.1| PREDICTED: muscle M-line assembly protein un...   243   2e-61
ref|XP_006438506.1| hypothetical protein CICLE_v10031371mg [Citr...   243   2e-61
ref|XP_007224227.1| hypothetical protein PRUPE_ppa018071mg, part...   233   2e-58
ref|XP_006378203.1| hypothetical protein POPTR_0010s04760g [Popu...   229   3e-57
ref|XP_007157526.1| hypothetical protein PHAVU_002G077200g [Phas...   222   4e-55
ref|XP_003517869.1| PREDICTED: neurofilament medium polypeptide ...   221   9e-55
ref|XP_007044472.1| Uncharacterized protein isoform 7 [Theobroma...   221   1e-54
ref|XP_007044468.1| Uncharacterized protein isoform 3 [Theobroma...   221   1e-54
ref|XP_007044466.1| Uncharacterized protein isoform 1 [Theobroma...   221   1e-54
ref|XP_004297680.1| PREDICTED: uncharacterized protein LOC101298...   219   3e-54
ref|XP_002520203.1| conserved hypothetical protein [Ricinus comm...   218   7e-54
ref|XP_006574580.1| PREDICTED: neurofilament heavy polypeptide-l...   214   1e-52
ref|XP_006574579.1| PREDICTED: neurofilament heavy polypeptide-l...   214   1e-52
ref|XP_003613430.1| hypothetical protein MTR_5g036560 [Medicago ...   213   2e-52
ref|XP_004155338.1| PREDICTED: uncharacterized LOC101209261 [Cuc...   209   3e-51
ref|XP_006584485.1| PREDICTED: uncharacterized protein LOC100306...   206   2e-50
ref|XP_006584484.1| PREDICTED: uncharacterized protein LOC100306...   206   2e-50
ref|XP_002311790.2| hypothetical protein POPTR_0008s19710g, part...   206   3e-50
ref|XP_007153684.1| hypothetical protein PHAVU_003G056200g [Phas...   203   2e-49

>ref|XP_002267713.2| PREDICTED: uncharacterized protein LOC100258808 [Vitis vinifera]
            gi|296086485|emb|CBI32074.3| unnamed protein product
            [Vitis vinifera]
          Length = 513

 Score =  268 bits (686), Expect = 5e-69
 Identities = 184/449 (40%), Positives = 232/449 (51%), Gaps = 24/449 (5%)
 Frame = +3

Query: 237  MGETMTESLKDEIEVKDSAASKAVFEASVSFGRFENDL-LSWERWSSFSQNKYLEEVEKC 413
            MGE++  +LKDE ++ +SAAS  V EASVSFGRFEND  LSWE+WSSFS NKYLEEVEKC
Sbjct: 1    MGESIVGALKDENKMGESAASDDVLEASVSFGRFENDSSLSWEKWSSFSPNKYLEEVEKC 60

Query: 414  STPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDL 593
            STPGSVA+KKAYFEAHYKKIAARKAELL+ E++M  D L SD P+ G+ I N    N + 
Sbjct: 61   STPGSVAQKKAYFEAHYKKIAARKAELLDLEKQMGTDPLGSDDPNCGDQIRNTDGNNTEF 120

Query: 594  GTSNVDRITKSVEPDTDLIDAVSSTHIDEPNQDEVGI-------DHLIERGKEEVDSRPE 752
              SN     + V+ DT+LI  V++TH+DEP++   G           +E  +EE+DS+  
Sbjct: 121  DVSNGQSSAEGVDQDTNLISVVTTTHVDEPSESNEGAPITIECQSSSVEEAEEELDSKQG 180

Query: 753  SPKSNSCVVSVKEENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRS 932
            +PK                 KD  E   V +KEE    GSQ  +E               
Sbjct: 181  TPKL----------------KDGEE--TVSIKEEASPMGSQNVME--------------- 207

Query: 933  EAQLEEMAILVQENHSTRSHSQLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKN 1112
                                  L N TGN+   K E  KL P KE++KI   TLA+ E+ 
Sbjct: 208  ------------------LPPSLDNGTGNTPRIKKERPKLDPPKETKKI---TLANKERK 246

Query: 1113 LGRTKNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESK 1292
                              S                     +K +  SL +NK PS  E K
Sbjct: 247  TASVMKKAVSPIAKSPQISKPRDSKPTPTSKMISSSQPSIKKANGSSLPKNKNPSAGEIK 306

Query: 1293 RP---------------APTSLHMSLSFGSTNYDSTS-TTARRSFIMEKMGDKEIVKRAF 1424
            +P               APTSLH SLS G  + DS S TT R+S IMEKMGDK+IV+RAF
Sbjct: 307  KPSPRSKIPSAGEWKKVAPTSLHKSLSLGPPHSDSASLTTTRKSLIMEKMGDKDIVRRAF 366

Query: 1425 KAFQNNYSPSYTSIEAKSSVPPKVCFSSS 1511
            K FQN+++    S E +SSVP +V   S+
Sbjct: 367  KTFQNSFNQLKPSSEVRSSVPKQVSAKST 395


>ref|XP_006483756.1| PREDICTED: muscle M-line assembly protein unc-89-like [Citrus
            sinensis]
          Length = 484

 Score =  243 bits (621), Expect = 2e-61
 Identities = 177/436 (40%), Positives = 235/436 (53%), Gaps = 16/436 (3%)
 Frame = +3

Query: 237  MGETMTESLKDEIEVKD----SAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEV 404
            MGE++ ++    + ++D    +  S  V E SVSFGRFEND LSWE+WSSFS NKYLEEV
Sbjct: 1    MGESILDASPSSLNLEDKMGKAVPSNPVLEVSVSFGRFENDSLSWEKWSSFSPNKYLEEV 60

Query: 405  EKCSTPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTN 584
            EKC+TPGSVAKK AYFEAHYKKIAARKAELL+QE++M +DS R D  + G+ +++ C   
Sbjct: 61   EKCATPGSVAKKAAYFEAHYKKIAARKAELLDQEKQMDNDSSRLDNQTCGDLMADNCKNK 120

Query: 585  VDLGTSNVDRITKSVEPDTDLIDAVSSTHIDEPNQD-EVGID---HLIERGKEEVDSRPE 752
             +   S+  R    V P+T L++ V    +D+P  D  + ++     +ER KEE  SR E
Sbjct: 121  SESDISDHQRSDDIVYPETSLVNEVRGMPVDQPGGDAAIKVECQSSPVERVKEE-KSRLE 179

Query: 753  SPKSN----SCVVSVKE--ENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEE 914
            SP SN    + VV+VKE  EN SM         +V+VKE       Q +  E A  VKEE
Sbjct: 180  SPTSNKPEEAVVVTVKEDVENSSM--------RMVIVKE------LQEKEMEPATNVKEE 225

Query: 915  NHLTRSEAQLEEMAILVQENHSTRSHS-QLYNKTGNSKVNKIEDAKLKPRKESQKIVQRT 1091
            N               V+ +H   SH     NK  N  ++KI+     P  +S  I + +
Sbjct: 226  N---------------VKLDHPKNSHKIAPVNKEKN--ISKIKKKPASPAAKSSPITKAS 268

Query: 1092 LASNEKNLGRTKNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQ 1271
              +   +L   K                                  T+  +  SL R+K 
Sbjct: 269  RIAKSPHLSTPK------------------VSKPTPMSTLSSSRSSTKIGNGSSLPRSKN 310

Query: 1272 PSTMESKRPAPTSLHMSLSFGSTNYDSTS-TTARRSFIMEKMGDKEIVKRAFKAFQNNYS 1448
             S  ESK+ AP SLH+SLS G ++ D  S TT R+S IMEKMGDK+IVKRAFK FQNNY+
Sbjct: 311  LSAGESKKVAPKSLHISLSLGPSSSDPVSLTTTRKSLIMEKMGDKDIVKRAFKTFQNNYN 370

Query: 1449 PSYTSIEAKSSVPPKV 1496
               +S E +S  P +V
Sbjct: 371  QLKSSKEERSPAPKQV 386


>ref|XP_006438506.1| hypothetical protein CICLE_v10031371mg [Citrus clementina]
            gi|557540702|gb|ESR51746.1| hypothetical protein
            CICLE_v10031371mg [Citrus clementina]
          Length = 484

 Score =  243 bits (621), Expect = 2e-61
 Identities = 177/436 (40%), Positives = 235/436 (53%), Gaps = 16/436 (3%)
 Frame = +3

Query: 237  MGETMTESLKDEIEVKD----SAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEV 404
            MGE++ ++    + ++D    +  S  V E SVSFGRFEND LSWE+WSSFS NKYLEEV
Sbjct: 1    MGESILDASPSSLNLEDKMGKAVPSNPVLEVSVSFGRFENDSLSWEKWSSFSPNKYLEEV 60

Query: 405  EKCSTPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTN 584
            EKC+TPGSVAKK AYFEAHYKKIAARKAELL+QE++M +DS R D  + G+ +++ C   
Sbjct: 61   EKCATPGSVAKKAAYFEAHYKKIAARKAELLDQEKQMDNDSSRLDNQTCGDLMADNCKNK 120

Query: 585  VDLGTSNVDRITKSVEPDTDLIDAVSSTHIDEPNQD-EVGID---HLIERGKEEVDSRPE 752
             +   S+  R    V P+T L++ V    +D+P  D  + ++     +ER KEE  SR E
Sbjct: 121  SESDISDHQRSDDIVYPETSLVNEVRGMPVDQPGGDAAIKVECQSSPVERVKEE-KSRLE 179

Query: 753  SPKSN----SCVVSVKE--ENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEE 914
            SP SN    + VV+VKE  EN SM         +V+VKE       Q +  E A  VKEE
Sbjct: 180  SPTSNKPEEAVVVTVKEDVENSSM--------RMVIVKE------LQEKEMEPATNVKEE 225

Query: 915  NHLTRSEAQLEEMAILVQENHSTRSHS-QLYNKTGNSKVNKIEDAKLKPRKESQKIVQRT 1091
            N               V+ +H   SH     NK  N  ++KI+     P  +S  I + +
Sbjct: 226  N---------------VKLDHPKNSHKIAPVNKEKN--ISKIKKKPASPAAKSSPITKAS 268

Query: 1092 LASNEKNLGRTKNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQ 1271
              +   +L   K                                  T+  +  SL R+K 
Sbjct: 269  RIAKSPHLSTPK------------------VSKPTPMSTLSSSRSSTKIGNGSSLPRSKN 310

Query: 1272 PSTMESKRPAPTSLHMSLSFGSTNYDSTS-TTARRSFIMEKMGDKEIVKRAFKAFQNNYS 1448
             S  ESK+ AP SLH+SLS G ++ D  S TT R+S IMEKMGDK+IVKRAFK FQNNY+
Sbjct: 311  LSAGESKKVAPKSLHISLSLGPSSSDPVSLTTTRKSLIMEKMGDKDIVKRAFKTFQNNYN 370

Query: 1449 PSYTSIEAKSSVPPKV 1496
               +S E +S  P +V
Sbjct: 371  QLKSSKEERSPAPKQV 386


>ref|XP_007224227.1| hypothetical protein PRUPE_ppa018071mg, partial [Prunus persica]
            gi|462421163|gb|EMJ25426.1| hypothetical protein
            PRUPE_ppa018071mg, partial [Prunus persica]
          Length = 479

 Score =  233 bits (595), Expect = 2e-58
 Identities = 162/420 (38%), Positives = 221/420 (52%), Gaps = 9/420 (2%)
 Frame = +3

Query: 276  EVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCSTPGSVAKKKAYFE 455
            E   S +S    E SVSFG+FEND LSWE+WS+FS NKYLEEVEKC+TPGSVA+K+AYFE
Sbjct: 3    EAAASNSSNPALEVSVSFGKFENDSLSWEKWSTFSPNKYLEEVEKCATPGSVAQKRAYFE 62

Query: 456  AHYKKIAARKAE-LLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDLGTSNVDRITKSVE 632
            AHYKKIAARKAE LLEQE++M+DD  RSD    G+ I  G    +DL  +N    T++  
Sbjct: 63   AHYKKIAARKAEELLEQEKQMQDDPFRSDDQKGGDQIDCGAHFEIDL--TNSQSTTQANY 120

Query: 633  PDTDLIDAVSSTHIDEPNQDEVGI----DHLIERGKEEVDSRPESPKSNS---CVVSVKE 791
             +T+  +   STH+D+  +D+V        L E  KEE DS   SP  N+    V+  + 
Sbjct: 121  QETNFDNDTFSTHVDDLKEDDVITIECQSSLTEGEKEETDSVTASPNLNNPEELVLEKEA 180

Query: 792  ENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQLEEMAILVQE 971
            EN    S+   E    + K  ++  G  PE++E     K   HL +  +Q     +  + 
Sbjct: 181  ENVPAVSQGIQE----IPKSLDNEMGKAPEVKE----EKPRLHLQKG-SQKVTTGVSKER 231

Query: 972  NHSTRSHSQLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKNLGRTKNXXXXXXX 1151
            N +      +   T   +  K      KP   S   V + ++++   + +  +       
Sbjct: 232  NVANVKKKPIPQITKTPQ--KSTPRMSKPISTSTPRVSKPISTSTPRVSKPISTSTPRVS 289

Query: 1152 XXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESKRPAPTSLHMSLSF 1331
                TS                     +K ++ SL R+K PS  ++K+  P SLHMS S 
Sbjct: 290  KPISTSTPRASKSISTSTATPAPRSSVKKGNTSSLPRSKNPSIEDTKKVPPKSLHMSPSL 349

Query: 1332 GSTNYDSTS-TTARRSFIMEKMGDKEIVKRAFKAFQNNYSPSYTSIEAKSSVPPKVCFSS 1508
                 DS S TTAR+SFIME MGDK+IV+RAFK FQNNY+   +S E KSS P +   SS
Sbjct: 350  DPAKSDSASPTTARKSFIMENMGDKDIVRRAFKTFQNNYNQPKSSSEEKSSTPTQAAPSS 409


>ref|XP_006378203.1| hypothetical protein POPTR_0010s04760g [Populus trichocarpa]
            gi|550329075|gb|ERP56000.1| hypothetical protein
            POPTR_0010s04760g [Populus trichocarpa]
          Length = 566

 Score =  229 bits (584), Expect = 3e-57
 Identities = 158/433 (36%), Positives = 221/433 (51%), Gaps = 19/433 (4%)
 Frame = +3

Query: 237  MGETMTESLKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCS 416
            MGE++  +   E ++  + AS    +ASVSFGRFEND LSW++WSSFSQNKYLEEVEKC+
Sbjct: 1    MGESLVAASSYEDKIGGTVASDPALQASVSFGRFENDSLSWDKWSSFSQNKYLEEVEKCA 60

Query: 417  TPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDLG 596
            TPGSVA+K+AYFEAHYKKIAARKAELL+QE+++  D  R++  +SG+ I      + D  
Sbjct: 61   TPGSVAEKRAYFEAHYKKIAARKAELLDQEKQIEHDLSRANNQNSGDLIVKTSQMDSDFD 120

Query: 597  TSNVDRITKSVEPDTDLIDAVSSTHIDEPNQDEVGIDHLIERGKEEVDSRPESPKSNSCV 776
             SN    ++ + P++   +     HID+P +D              +D+  ++  +    
Sbjct: 121  ASNGQTSSEGIRPESKFDNEWDGGHIDKPTEDAA------------IDAHGQASTNKPYE 168

Query: 777  VSVKEENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAIL-----------VKEENHL 923
             +  + +    S DP E+    V    H   S  E  E A +           VKEE   
Sbjct: 169  DTAVDAHGQASSNDPYEDAAFSV----HGQASLNEPYEDAAIDVQGQVPLNGRVKEEQDS 224

Query: 924  ---TRSEAQLEEMAILVQENHSTRSHSQLYNKTGNSKVN----KIEDAKLKPRKESQKIV 1082
               T   A+LEE+A++ +E   ++   +L         +    K E  KL  RKES KI 
Sbjct: 225  ELDTPVSAKLEEVALMKKEETGSQDMRELPKNLEKEMESILMIKEEKVKLDHRKESPKI- 283

Query: 1083 QRTLASNEKNLGRTKNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLR 1262
              +  S  ++L   K             S                     +K +  SL R
Sbjct: 284  --SPMSKVRDLAMAKKKPEPPITKRPQISSLKFSKPASTSSSLSASQSSIKKVNGSSLPR 341

Query: 1263 NKQPSTMESKRPAPTSLHMSLSFGSTNYDSTS-TTARRSFIMEKMGDKEIVKRAFKAFQN 1439
            +K      +K+  P SLHMSLS  S N ++   TT R+SFIMEKMGDK+IVKRAFK FQN
Sbjct: 342  SKNTPVGGNKKVNPKSLHMSLSMDSPNSETVPLTTTRKSFIMEKMGDKDIVKRAFKTFQN 401

Query: 1440 NYSPSYTSIEAKS 1478
            N+S   +S E +S
Sbjct: 402  NFSQLKSSAEERS 414


>ref|XP_007157526.1| hypothetical protein PHAVU_002G077200g [Phaseolus vulgaris]
            gi|561030941|gb|ESW29520.1| hypothetical protein
            PHAVU_002G077200g [Phaseolus vulgaris]
          Length = 487

 Score =  222 bits (566), Expect = 4e-55
 Identities = 157/421 (37%), Positives = 214/421 (50%), Gaps = 1/421 (0%)
 Frame = +3

Query: 237  MGETMTESLKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCS 416
            MGE + ++   E ++ +SAAS    + SVSFGRFEND LSWERWSSFS NKYLEEVEKC+
Sbjct: 1    MGEFLVDATVFEDKMGESAASSPPLQVSVSFGRFENDSLSWERWSSFSPNKYLEEVEKCA 60

Query: 417  TPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDLG 596
            TPGSVA+KKAYFEAHYKKIAARKAELL QE++   DS RS+       +  G +T+ +L 
Sbjct: 61   TPGSVAQKKAYFEAHYKKIAARKAELLAQEKQREKDSFRSE---DQVEVDLGGNTDAELD 117

Query: 597  TSNVDRITKSVEPDTDLIDAVSSTHIDEPNQDEVGIDHLIERGKEEVDSRPESPKSNSCV 776
             S+     + V  +T  +  +  TH D  +++EV +         E++++          
Sbjct: 118  KSDTQDFNEGVTQETSSVGEIHRTH-DNDSEEEVAVSTGYHGSPVEMENK---------- 166

Query: 777  VSVKEENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQLEEMA 956
              ++  +HS    D  E+  V +K E       P +E  A  VKE +H+           
Sbjct: 167  -ELESRSHSSFQMDEPED--VCMKHE-----ESPNIE--AEDVKEISHV----------- 205

Query: 957  ILVQENHSTRSHSQLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKNLGRTKNXX 1136
                          +Y +TG +   +  D KL   KES+     T  +   N  +TK   
Sbjct: 206  --------------VYKETGKASEVEANDVKLVHPKESKV----TSVNKGSNAAKTKKKP 247

Query: 1137 XXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESKRPAPTSLH 1316
                      S                    T+K SSPSL R +  S+ ES++ A   LH
Sbjct: 248  MLSTSKASQISTPRSSKPASTPTKTVTPASSTKKGSSPSLSRRQITSSGESRKFANKPLH 307

Query: 1317 MSLSFGSTNYD-STSTTARRSFIMEKMGDKEIVKRAFKAFQNNYSPSYTSIEAKSSVPPK 1493
            MSLS   +N D +   T RRS IMEKMGDK+IVKRAFK FQN+++   T  E KS +  +
Sbjct: 308  MSLSLAPSNPDPAPQATMRRSLIMEKMGDKDIVKRAFKTFQNSFNQPKTPGEDKSLIKKQ 367

Query: 1494 V 1496
            V
Sbjct: 368  V 368


>ref|XP_003517869.1| PREDICTED: neurofilament medium polypeptide isoform X1 [Glycine max]
            gi|571434004|ref|XP_006573072.1| PREDICTED: neurofilament
            medium polypeptide isoform X2 [Glycine max]
            gi|571434006|ref|XP_006573073.1| PREDICTED: neurofilament
            medium polypeptide isoform X3 [Glycine max]
            gi|571434008|ref|XP_006573074.1| PREDICTED: neurofilament
            medium polypeptide isoform X4 [Glycine max]
          Length = 490

 Score =  221 bits (563), Expect = 9e-55
 Identities = 155/424 (36%), Positives = 219/424 (51%), Gaps = 4/424 (0%)
 Frame = +3

Query: 237  MGETMTES--LKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEK 410
            MGE + ++   +D+   + +AAS    + SVSFGRFEND LSWERWSSFS NKYLEEVEK
Sbjct: 1    MGEFLVDATVFEDKKMGEGAAASNPALQVSVSFGRFENDSLSWERWSSFSPNKYLEEVEK 60

Query: 411  CSTPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVD 590
            C+TPGSVA+KKAYFEAHYKK+AARKAELL QE++   DS  S+   SG  +S   D   D
Sbjct: 61   CATPGSVAQKKAYFEAHYKKVAARKAELLAQEKQREKDSFGSE-EHSGIDLSGNTDAEHD 119

Query: 591  LGTSNVDRITKSVEPDTDLIDAVSSTHIDEPNQDEVGIDHLIERGKEEVDSRPESPKSNS 770
            + ++N    ++ VE +T     +  TH++E +++E  +    +    +V+++    +S+S
Sbjct: 120  I-SNNTQGSSEGVEHETSSAGEIHKTHVNE-SEEEFAVSRDYQSSSVQVENKELESRSHS 177

Query: 771  CVVSVKEENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQLEE 950
                 + EN              + K++     +  E E+    VKE +H+         
Sbjct: 178  SYQIDEPEN--------------VCKKQVESPNNNIEAED----VKEISHV--------- 210

Query: 951  MAILVQENHSTRSHSQLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKNLGRTKN 1130
                            +Y +TG +   +++D KL   KES+        S   N  RTK 
Sbjct: 211  ----------------VYKETGKASEGEVKDVKLNHPKESKV----KSVSKGSNAARTKK 250

Query: 1131 XXXXXXXXXXHTSV-XXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESKRPAPT 1307
                        S                     TRK SSPSL R +  S+ ES++ A  
Sbjct: 251  KSMLPTSKASPISTPKSSKPASTTPTKTVTPASSTRKGSSPSLTRRQITSSGESRKFANK 310

Query: 1308 SLHMSLSFGSTNYD-STSTTARRSFIMEKMGDKEIVKRAFKAFQNNYSPSYTSIEAKSSV 1484
             LHMSLS   +N D +  +T RRS IME MGDK+IVKRAFK FQN+++   TS+E KS +
Sbjct: 311  PLHMSLSLAPSNPDPAPQSTMRRSLIMENMGDKDIVKRAFKTFQNSFNQPKTSVEDKSLI 370

Query: 1485 PPKV 1496
              +V
Sbjct: 371  KKQV 374


>ref|XP_007044472.1| Uncharacterized protein isoform 7 [Theobroma cacao]
            gi|508708407|gb|EOY00304.1| Uncharacterized protein
            isoform 7 [Theobroma cacao]
          Length = 518

 Score =  221 bits (562), Expect = 1e-54
 Identities = 164/429 (38%), Positives = 216/429 (50%), Gaps = 10/429 (2%)
 Frame = +3

Query: 237  MGETMTESLKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCS 416
            MGE++ ++   E+++ + A+S   FE SVSFGRFEND LSWE+WSSFS NKYLEEVEKC+
Sbjct: 1    MGESIVDASNKEVKIGEMASSNPAFEVSVSFGRFENDSLSWEKWSSFSPNKYLEEVEKCA 60

Query: 417  TPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDLG 596
            TPGSVAKKKAYFE HYKKIAARKAEL  QE+ M      SD  + G+ +          G
Sbjct: 61   TPGSVAKKKAYFEEHYKKIAARKAELQAQEKPMESKPFNSDDQNCGDLV----------G 110

Query: 597  TSNVDRITKSVEPDTDLIDAVSSTHIDEPNQD-EVGI---DHLIERGKEEVDSRPESPKS 764
             SN     +  + +T+ +  VS TH DE N++ E+ I   +   E  KE++DSR ES   
Sbjct: 111  KSNGQCSNEGDKQETNWLSEVSDTHFDEHNEEPEIAIKSQNSSAEGVKEKIDSRVES--- 167

Query: 765  NSCVVSVKEENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQL 944
                  V E+  S    +  EE+   V E   +  S+    + A+LVKE        +Q 
Sbjct: 168  -----QVIEKIESRVESEEKEEMDSAV-ESPKLIESEETAPDEAVLVKEAVETLPKGSQD 221

Query: 945  EEMAILVQENHSTRSHSQLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKNLGRT 1124
            E+   L Q +      +  +         K ++ KL    +S KI       NE  + + 
Sbjct: 222  EKE--LPQNSEKDIKDTPKF---------KHKNLKLGHLAKSDKITPANKERNETRI-KK 269

Query: 1125 KNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESKRPAP 1304
            K            T                     T+  SS SL + K PS  ESK+  P
Sbjct: 270  KPASPVTKTPQFSTPKASKPTSTPTTPSASRTPSKTKTTSSYSLPKTKIPSMGESKKVVP 329

Query: 1305 TSLHMSLSFGSTNYDSTSTTA-RRSFIMEKMGDKEIVKRAFKAFQNNYSPSYTSIE---- 1469
             SLHMSLS G +     S  A R+S IMEKMGDK+IVKRAFK FQ+NY     S +    
Sbjct: 330  RSLHMSLSLGPSGSGLASLPATRKSLIMEKMGDKDIVKRAFKTFQSNYHQLKPSSQEQYA 389

Query: 1470 -AKSSVPPK 1493
             +K  VP K
Sbjct: 390  ASKQQVPAK 398


>ref|XP_007044468.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|590693941|ref|XP_007044471.1| Uncharacterized protein
            isoform 3 [Theobroma cacao] gi|508708403|gb|EOY00300.1|
            Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508708406|gb|EOY00303.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 530

 Score =  221 bits (562), Expect = 1e-54
 Identities = 164/428 (38%), Positives = 214/428 (50%), Gaps = 9/428 (2%)
 Frame = +3

Query: 237  MGETMTESLKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCS 416
            MGE++ ++   E+++ + A+S   FE SVSFGRFEND LSWE+WSSFS NKYLEEVEKC+
Sbjct: 1    MGESIVDASNKEVKIGEMASSNPAFEVSVSFGRFENDSLSWEKWSSFSPNKYLEEVEKCA 60

Query: 417  TPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDLG 596
            TPGSVAKKKAYFE HYKKIAARKAEL  QE+ M      SD  + G+ +          G
Sbjct: 61   TPGSVAKKKAYFEEHYKKIAARKAELQAQEKPMESKPFNSDDQNCGDLV----------G 110

Query: 597  TSNVDRITKSVEPDTDLIDAVSSTHIDEPNQD-EVGI---DHLIERGKEEVDSRPESPKS 764
             SN     +  + +T+ +  VS TH DE N++ E+ I   +   E  KE++DSR ES   
Sbjct: 111  KSNGQCSNEGDKQETNWLSEVSDTHFDEHNEEPEIAIKSQNSSAEGVKEKIDSRVES--- 167

Query: 765  NSCVVSVKEENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQL 944
                  V E+  S    +  EE+   V E   +  S+    + A+LVKE        +Q 
Sbjct: 168  -----QVIEKIESRVESEEKEEMDSAV-ESPKLIESEETAPDEAVLVKEAVETLPKGSQD 221

Query: 945  EEMAILVQENHSTRSHSQLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKNLGRT 1124
            E+   L Q +      +  +         K ++ KL    +S KI       NE  + + 
Sbjct: 222  EKE--LPQNSEKDIKDTPKF---------KHKNLKLGHLAKSDKITPANKERNETRI-KK 269

Query: 1125 KNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESKRPAP 1304
            K            T                     T+  SS SL + K PS  ESK+  P
Sbjct: 270  KPASPVTKTPQFSTPKASKPTSTPTTPSASRTPSKTKTTSSYSLPKTKIPSMGESKKVVP 329

Query: 1305 TSLHMSLSFGSTNYDSTSTTA-RRSFIMEKMGDKEIVKRAFKAFQNNY----SPSYTSIE 1469
             SLHMSLS G +     S  A R+S IMEKMGDK+IVKRAFK FQ+NY      S     
Sbjct: 330  RSLHMSLSLGPSGSGLASLPATRKSLIMEKMGDKDIVKRAFKTFQSNYHQLKPSSQEQYA 389

Query: 1470 AKSSVPPK 1493
            A   VP K
Sbjct: 390  ASKQVPAK 397


>ref|XP_007044466.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590693928|ref|XP_007044467.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590693934|ref|XP_007044469.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
            gi|590693938|ref|XP_007044470.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508708401|gb|EOY00298.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508708402|gb|EOY00299.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508708404|gb|EOY00301.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508708405|gb|EOY00302.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 517

 Score =  221 bits (562), Expect = 1e-54
 Identities = 164/428 (38%), Positives = 214/428 (50%), Gaps = 9/428 (2%)
 Frame = +3

Query: 237  MGETMTESLKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCS 416
            MGE++ ++   E+++ + A+S   FE SVSFGRFEND LSWE+WSSFS NKYLEEVEKC+
Sbjct: 1    MGESIVDASNKEVKIGEMASSNPAFEVSVSFGRFENDSLSWEKWSSFSPNKYLEEVEKCA 60

Query: 417  TPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDLG 596
            TPGSVAKKKAYFE HYKKIAARKAEL  QE+ M      SD  + G+ +          G
Sbjct: 61   TPGSVAKKKAYFEEHYKKIAARKAELQAQEKPMESKPFNSDDQNCGDLV----------G 110

Query: 597  TSNVDRITKSVEPDTDLIDAVSSTHIDEPNQD-EVGI---DHLIERGKEEVDSRPESPKS 764
             SN     +  + +T+ +  VS TH DE N++ E+ I   +   E  KE++DSR ES   
Sbjct: 111  KSNGQCSNEGDKQETNWLSEVSDTHFDEHNEEPEIAIKSQNSSAEGVKEKIDSRVES--- 167

Query: 765  NSCVVSVKEENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQL 944
                  V E+  S    +  EE+   V E   +  S+    + A+LVKE        +Q 
Sbjct: 168  -----QVIEKIESRVESEEKEEMDSAV-ESPKLIESEETAPDEAVLVKEAVETLPKGSQD 221

Query: 945  EEMAILVQENHSTRSHSQLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKNLGRT 1124
            E+   L Q +      +  +         K ++ KL    +S KI       NE  + + 
Sbjct: 222  EKE--LPQNSEKDIKDTPKF---------KHKNLKLGHLAKSDKITPANKERNETRI-KK 269

Query: 1125 KNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESKRPAP 1304
            K            T                     T+  SS SL + K PS  ESK+  P
Sbjct: 270  KPASPVTKTPQFSTPKASKPTSTPTTPSASRTPSKTKTTSSYSLPKTKIPSMGESKKVVP 329

Query: 1305 TSLHMSLSFGSTNYDSTSTTA-RRSFIMEKMGDKEIVKRAFKAFQNNY----SPSYTSIE 1469
             SLHMSLS G +     S  A R+S IMEKMGDK+IVKRAFK FQ+NY      S     
Sbjct: 330  RSLHMSLSLGPSGSGLASLPATRKSLIMEKMGDKDIVKRAFKTFQSNYHQLKPSSQEQYA 389

Query: 1470 AKSSVPPK 1493
            A   VP K
Sbjct: 390  ASKQVPAK 397


>ref|XP_004297680.1| PREDICTED: uncharacterized protein LOC101298117 [Fragaria vesca
            subsp. vesca]
          Length = 557

 Score =  219 bits (558), Expect = 3e-54
 Identities = 161/444 (36%), Positives = 227/444 (51%), Gaps = 27/444 (6%)
 Frame = +3

Query: 237  MGETMTESLKDEI---EVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVE 407
            MGE++  S KD     EV  + +S    E SVSFGRFEND LSWE+WS+FS NKYLEEVE
Sbjct: 1    MGESIVGSPKDADKMGEVASTDSSNPSLEVSVSFGRFENDSLSWEKWSAFSPNKYLEEVE 60

Query: 408  KCSTPGSVAKKKAYFEAHYKKIAARKAE-LLEQEQKMRDDS-LRSDYPSSGNPISNGCDT 581
            KC+TPGSVA+KKAYFEAHYK+IAARKAE LLEQE++M DD  L+SD  ++G+ I  G D 
Sbjct: 61   KCATPGSVAQKKAYFEAHYKRIAARKAEELLEQEKQMHDDEPLKSDDQNNGDQICCGTDN 120

Query: 582  --NVDLGTSNVDRITKSVEPDTDLIDAVSSTHIDEPNQDEVGI-------DHLIERGKEE 734
              ++D+ TS  +    S EP+ +  + +S T +++  +D+  +         + ER +EE
Sbjct: 121  GIDIDIATSQTNAQGNSQEPNLE--NGISCTPVEDLKEDDEDVYTIECQTSSIEEREREE 178

Query: 735  VDSRPESPKSNSC-----VVSVKEENHSMGSKDPSEELVVLVKEENHMTGSQPEL-EEIA 896
             DS   SPK+ +      +V VKE      + D  E +  L K  ++  G  PE+ EE A
Sbjct: 179  TDSGVVSPKTPNLNRPEELVLVKEVETI--TADTQETIQELTKTLDNDAGDAPEVKEEKA 236

Query: 897  ILVKEENHLTRSEAQLEEMAILVQENHS----TRSHSQLYNKTGNSKVNKIEDAKLKPRK 1064
             L  ++     +    E M +   +  S    T++      +      N        P+ 
Sbjct: 237  RLDLQKRPQKVTPVSKERMTVAKAKKKSVSPMTKTPQNPTPRVSKLPQNSTPRVSKLPQN 296

Query: 1065 ESQKIVQRTLASNEKNLGRTKNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKES 1244
             + ++ +    S  +     +N          +T+                        +
Sbjct: 297  STSRVSKTPQNSTPRVSKIPQNTTPRVSKILQNTTPRVSKPMSASTGAKSAPRLSVTNAN 356

Query: 1245 SPSLLRNKQPSTMESKRPAPTSLHMSLSFGSTNYDS---TSTTARRSFIMEKMGDKEIVK 1415
              SL R+  PS   +K+  P SLHMSLS      DS   T  TAR+S IME+MGDK+IVK
Sbjct: 357  GSSLSRSSNPSIQRTKKVPPKSLHMSLSLDPKKSDSATETVVTARKSLIMEQMGDKDIVK 416

Query: 1416 RAFKAFQNNYSPSYTSIEAKSSVP 1487
            RAFK FQN+ +   +S E K S P
Sbjct: 417  RAFKTFQNSVNQLKSSNEEKPSTP 440


>ref|XP_002520203.1| conserved hypothetical protein [Ricinus communis]
            gi|223540695|gb|EEF42258.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 556

 Score =  218 bits (555), Expect = 7e-54
 Identities = 165/440 (37%), Positives = 228/440 (51%), Gaps = 20/440 (4%)
 Frame = +3

Query: 237  MGETMTESLKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCS 416
            MGE++  +  DE ++ ++A S    E SVSFGRFEND LSWE+WSSFS NKYLEEVEKC+
Sbjct: 1    MGESIVATSYDEDKMGETATSDRSLEVSVSFGRFENDSLSWEKWSSFSPNKYLEEVEKCA 60

Query: 417  TPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPI--SNGCDTNVD 590
            TPGSVA KKAYFEAHYKKIAA+KAE L QE++M    L S+  + G+PI  +NG D+  D
Sbjct: 61   TPGSVAMKKAYFEAHYKKIAAKKAEQLGQEKQMEHKPLGSNDQNGGDPIGKANGIDSEFD 120

Query: 591  LGTSNVDRITKSVEPDTDLIDAVSSTHIDEPNQDEVGIDHL------IERGKEEVDSRPE 752
              T N    ++    +  L   + S  ++EP +D  G  +L      +E+ +EE+ SR +
Sbjct: 121  --TFNTQTSSEGTRQEIKLDSELDSGLVNEPYED--GAINLEAQGLSVEQAEEELCSRID 176

Query: 753  SPKSNSCVVS--VKEEN----HSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEE 914
             P  N    +  V+E       S   KD  ++L               E E I I VKE 
Sbjct: 177  GPSLNKPEETPFVREAETIPMESQAMKDLPKKL-------------DKEAESIPI-VKER 222

Query: 915  NHLTRSEAQLEEMAILVQENHSTRSHSQLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTL 1094
            N      A++ +     + N+        Y +T  S ++K+ D     +K +  + + T 
Sbjct: 223  N------AKINQRKEPQKVNNFAIEIIDSYKET-TSPMSKVRDMARIKKKPASPVAKSTQ 275

Query: 1095 ASNEKNLGRTKNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQP 1274
             S  K    TK            +S                    T+K +  SL ++K P
Sbjct: 276  LSTPKV---TKTGPTSGVLSTPQSS--------------------TKKATVSSLPKSKSP 312

Query: 1275 STMESKRPAPTSLHMSLSFGSTNYD------STSTTARRSFIMEKMGDKEIVKRAFKAFQ 1436
            S   + + AP SLHMSLS  + N D      + +TTAR+SFIMEKM DKEIVKRAFK FQ
Sbjct: 313  SVAGNNKVAPKSLHMSLSMDTPNSDPAPLAAAPTTTARKSFIMEKMKDKEIVKRAFKTFQ 372

Query: 1437 NNYSPSYTSIEAKSSVPPKV 1496
            NNY+   +S + +S V  +V
Sbjct: 373  NNYNQLKSSADERSLVAKQV 392


>ref|XP_006574580.1| PREDICTED: neurofilament heavy polypeptide-like isoform X2 [Glycine
            max]
          Length = 500

 Score =  214 bits (544), Expect = 1e-52
 Identities = 160/425 (37%), Positives = 219/425 (51%), Gaps = 5/425 (1%)
 Frame = +3

Query: 237  MGETMTES--LKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEK 410
            MGE + ++   +D+   +  AAS    + SVSFGRFEND LSWERWSSFS NKYLEEVEK
Sbjct: 1    MGEFLVDATVFEDKKMGEGGAASNPALQVSVSFGRFENDSLSWERWSSFSPNKYLEEVEK 60

Query: 411  CSTPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVD 590
            C+TPGSVA+KKAYFEAHYKK+AARKAELL QE++   DS  S    SG  +S       D
Sbjct: 61   CATPGSVAQKKAYFEAHYKKVAARKAELLAQEKQREQDSFGSQ-DHSGIDLSGNTGAEHD 119

Query: 591  LGTSNVDRITKSVEPDTDLIDAVSSTHIDEPNQDEVGIDHLIERGKEEVDSRPESPKSNS 770
            + ++N     + VE +   +  +  TH++E + +EV +    +    EV+++    +S+S
Sbjct: 120  V-SNNTQGSNEGVEQEASSVCEIHRTHVNE-SVEEVAVSRDYQSSSVEVENKDY--QSSS 175

Query: 771  CVVSVKE-ENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQLE 947
              V +KE E+ S  S    E   V  K+E       P +E  A  VKE +H+        
Sbjct: 176  FEVEIKELESRSHSSYQIGEAEDVCKKQEE-----SPNIE--AEDVKEISHV-------- 220

Query: 948  EMAILVQENHSTRSHSQLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKNLGRTK 1127
                             +Y +TG +   +++D KL   KES+        S   N  +TK
Sbjct: 221  -----------------VYKETGKALEVEVKDVKLDHPKESKV----KSVSKGSNAAKTK 259

Query: 1128 NXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKE-SSPSLLRNKQPSTMESKRPAP 1304
                         S                    T K  SSPSL R +  S+ ES++ A 
Sbjct: 260  KKSMLLTSKASPISAPSSKPALTTPTKTVSPASSTIKRISSPSLSRRQIISSGESRKFAN 319

Query: 1305 TSLHMSLSFGSTNYD-STSTTARRSFIMEKMGDKEIVKRAFKAFQNNYSPSYTSIEAKSS 1481
              LHMSLS   +N D +  +T RRS IME+MGDK+IVKRAFK F N+++   TS+E KS 
Sbjct: 320  KPLHMSLSLAPSNPDPARQSTMRRSLIMERMGDKDIVKRAFKTFHNSFNQPKTSVEDKSL 379

Query: 1482 VPPKV 1496
               +V
Sbjct: 380  TKKQV 384


>ref|XP_006574579.1| PREDICTED: neurofilament heavy polypeptide-like isoform X1 [Glycine
            max]
          Length = 502

 Score =  214 bits (544), Expect = 1e-52
 Identities = 160/425 (37%), Positives = 219/425 (51%), Gaps = 5/425 (1%)
 Frame = +3

Query: 237  MGETMTES--LKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEK 410
            MGE + ++   +D+   +  AAS    + SVSFGRFEND LSWERWSSFS NKYLEEVEK
Sbjct: 1    MGEFLVDATVFEDKKMGEGGAASNPALQVSVSFGRFENDSLSWERWSSFSPNKYLEEVEK 60

Query: 411  CSTPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVD 590
            C+TPGSVA+KKAYFEAHYKK+AARKAELL QE++   DS  S    SG  +S       D
Sbjct: 61   CATPGSVAQKKAYFEAHYKKVAARKAELLAQEKQREQDSFGSQ-DHSGIDLSGNTGAEHD 119

Query: 591  LGTSNVDRITKSVEPDTDLIDAVSSTHIDEPNQDEVGIDHLIERGKEEVDSRPESPKSNS 770
            + ++N     + VE +   +  +  TH++E + +EV +    +    EV+++    +S+S
Sbjct: 120  V-SNNTQGSNEGVEQEASSVCEIHRTHVNE-SVEEVAVSRDYQSSSVEVENKDY--QSSS 175

Query: 771  CVVSVKE-ENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQLE 947
              V +KE E+ S  S    E   V  K+E       P +E  A  VKE +H+        
Sbjct: 176  FEVEIKELESRSHSSYQIGEAEDVCKKQEE-----SPNIE--AEDVKEISHV-------- 220

Query: 948  EMAILVQENHSTRSHSQLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKNLGRTK 1127
                             +Y +TG +   +++D KL   KES+        S   N  +TK
Sbjct: 221  -----------------VYKETGKALEVEVKDVKLDHPKESKV----KSVSKGSNAAKTK 259

Query: 1128 NXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKE-SSPSLLRNKQPSTMESKRPAP 1304
                         S                    T K  SSPSL R +  S+ ES++ A 
Sbjct: 260  KKSMLLTSKASPISAPSSKPALTTPTKTVSPASSTIKRISSPSLSRRQIISSGESRKFAN 319

Query: 1305 TSLHMSLSFGSTNYD-STSTTARRSFIMEKMGDKEIVKRAFKAFQNNYSPSYTSIEAKSS 1481
              LHMSLS   +N D +  +T RRS IME+MGDK+IVKRAFK F N+++   TS+E KS 
Sbjct: 320  KPLHMSLSLAPSNPDPARQSTMRRSLIMERMGDKDIVKRAFKTFHNSFNQPKTSVEDKSL 379

Query: 1482 VPPKV 1496
               +V
Sbjct: 380  TKKQV 384


>ref|XP_003613430.1| hypothetical protein MTR_5g036560 [Medicago truncatula]
            gi|355514765|gb|AES96388.1| hypothetical protein
            MTR_5g036560 [Medicago truncatula]
          Length = 683

 Score =  213 bits (542), Expect = 2e-52
 Identities = 153/421 (36%), Positives = 223/421 (52%), Gaps = 21/421 (4%)
 Frame = +3

Query: 285  DSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCSTPGSVAKKKAYFEAHY 464
            ++ AS +  + SVSFGRF+ND LSWERWSSFS NKYLEEVEKC+TPGSVA+KKAYFEAHY
Sbjct: 177  ETTASNSALQVSVSFGRFDNDSLSWERWSSFSPNKYLEEVEKCATPGSVAQKKAYFEAHY 236

Query: 465  KKIAARKAELLEQEQKMRDDSLRS------DYPSSGNPISNGCDTNVDLGTSNVDR--IT 620
            KKIAARKAELL QE++M  +S RS      D   +GN  SN C+T+ + G SN     + 
Sbjct: 237  KKIAARKAELLAQEKEMERESFRSEENNGIDLSGNGNGNSNACETDSEFGISNTQGSCVE 296

Query: 621  KSVEPDTDLIDA--VSSTHIDEPNQDEVGIDHLIERGKEEVDSRPESPKSNSCVVSVKEE 794
            +  E + ++I    +  +H+D+  ++EV +    +    EV+++             + E
Sbjct: 297  ERDEQEIEVIPVGEIDRSHVDDLKEEEVAVSVDYQSSSVEVENK-------------EVE 343

Query: 795  NHSMGSKDPSEELV-VLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQLEEMAILVQE 971
            + S GS    E +  V +K E  +     +++EI+ +V +E   T   +Q+EE    V+ 
Sbjct: 344  SGSHGSYKIDEPVKDVCIKLEEILDVEAEDVKEISHVVYKE---TEKASQVEEKD--VKL 398

Query: 972  NHSTRSHSQLYNKTGNSKVNKIED--AKLKPRKESQKIVQRTLASNEKNLGRTKNXXXXX 1145
            +H  +S     N+  N+   K +   AK K  + S  +  ++ AS       +K      
Sbjct: 399  DHPNKSKVIPVNRENNAAKTKKKPVAAKSKASQISTPVAAKSKASQISTPRYSKPTSAPT 458

Query: 1146 XXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESKRPAPTSLHMSL 1325
                   S                    T+K +  SL R +  S++E+K+ A  SLH+S+
Sbjct: 459  KTLASAAS--------------------TKKGNPQSLPRRQVTSSVENKKAATRSLHLSM 498

Query: 1326 SFGSTN-------YDSTSTTARRSFIMEKMGDKEIVKRAFKAFQNNYS-PSYTSIEAKSS 1481
            S G +N        D    T R+S IM+ MGDK+IVKRAFK FQ N++ P  +  E KSS
Sbjct: 499  SLGPSNPEPVPHTTDPVPHTMRKSLIMDSMGDKDIVKRAFKTFQKNFNQPKTSGEEDKSS 558

Query: 1482 V 1484
            V
Sbjct: 559  V 559


>ref|XP_004155338.1| PREDICTED: uncharacterized LOC101209261 [Cucumis sativus]
          Length = 486

 Score =  209 bits (532), Expect = 3e-51
 Identities = 163/430 (37%), Positives = 226/430 (52%), Gaps = 14/430 (3%)
 Frame = +3

Query: 249  MTESLKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCSTPGS 428
            + + L DE  V+++A++K + E SVSFGRFENDLLSWE+WS+FS NKYLEEVEK +TPGS
Sbjct: 6    VVDVLNDEHTVEENASTKPLLEVSVSFGRFENDLLSWEKWSTFSPNKYLEEVEKYATPGS 65

Query: 429  VAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCD-TNVDLGTSN 605
            VA+K+AYFEAHYKKIA RK +LLE+E++M  ++  S+    G+ + +  +  + +  TSN
Sbjct: 66   VAQKRAYFEAHYKKIADRKTKLLEEEREMEFNTTVSN--GGGDLMMDHSERADSESETSN 123

Query: 606  VDRITKSVEPDTDLIDAVSSTHIDEPNQD---EVGIDHLIERGKEEVDSRPESPKSNSCV 776
                 + V+  T L   +SS + +    D    V  + L +  KEE D + +   S+S +
Sbjct: 124  HHVSVEEVDQTTMLTGELSSVYHEVVKNDVESNVDCESLPDGEKEEPDGKFDCVGSDSEI 183

Query: 777  VSVKEENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQLEEMA 956
                            EE+VV   E    T + P                          
Sbjct: 184  -------------SKQEEVVVKEVETPTPTPTPP-------------------------- 204

Query: 957  ILVQENHSTRSHSQ-LYNKTGNSKVNKIEDAKLKPR--KESQKIV----QRTLAS-NEKN 1112
              V+ + +T+   Q L NK   S V+K++   LKP   KES+KI     +R  AS  +K 
Sbjct: 205  --VESSQTTKEPPQKLVNKV--SAVSKVKQQILKPNRPKESKKITPIVKERNSASVKKKP 260

Query: 1113 LGRTKNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESK 1292
            +  T             T+                      K S+ SLLR++ PS++ESK
Sbjct: 261  ISSTAKAPQILTPKLSKTT----PGPTTPAARSSVLRSSVNKGSNSSLLRSRNPSSIESK 316

Query: 1293 RPAPTSLHMSLSFGSTNYDSTSTTA-RRSFIMEKMGDKEIVKRAFKAFQNNYSPSYTS-I 1466
            + AP SLHMSLS G+ N D +S    RRSFIMEKMGDK+IVKRAFK FQN+ +   +S  
Sbjct: 317  KVAPKSLHMSLSLGTPNSDPSSVNGIRRSFIMEKMGDKDIVKRAFKTFQNSLNQMKSSPQ 376

Query: 1467 EAKSSVPPKV 1496
            E KSS P KV
Sbjct: 377  EEKSSAPKKV 386


>ref|XP_006584485.1| PREDICTED: uncharacterized protein LOC100306130 isoform X2 [Glycine
            max] gi|571468881|ref|XP_006584486.1| PREDICTED:
            uncharacterized protein LOC100306130 isoform X3 [Glycine
            max] gi|571468883|ref|XP_006584487.1| PREDICTED:
            uncharacterized protein LOC100306130 isoform X4 [Glycine
            max]
          Length = 481

 Score =  206 bits (525), Expect = 2e-50
 Identities = 155/408 (37%), Positives = 210/408 (51%), Gaps = 5/408 (1%)
 Frame = +3

Query: 288  SAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCSTPGSVAKKKAYFEAHYK 467
            +AAS    + SVSFGRFEND LSWE+WS+FS NKYLEEVEKC+TPGSVA+KKAYFEAHYK
Sbjct: 4    TAASNPALQVSVSFGRFENDSLSWEKWSAFSPNKYLEEVEKCATPGSVAQKKAYFEAHYK 63

Query: 468  KIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDLGTSNVDRITKSVEPDTDL 647
             IAARKAELL Q ++M  DS RS   +  +   N C T+ +   S+    ++ V+ +T+ 
Sbjct: 64   NIAARKAELLAQAKQMEKDSPRSQRQNGEDLSCNTCGTDAECDMSSTQGSSEGVKQETNS 123

Query: 648  IDAVSSTHIDEPNQD-EVGIDHLIERGKEEVDSRPESPKSNSCVVSVKEENHSMGSK--D 818
            I  +  T +    +D  V ID+          S  E  K N      +E    +GS   D
Sbjct: 124  IGEIVRTDVSNLMEDVAVSIDYQ--------GSSVEGEKEN------EELESRLGSSQID 169

Query: 819  PSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSE-AQLEEMAILVQENHSTRSHS 995
              EE+V + +  +       E E+    VKE +H   +E A+  E     +  + T  H 
Sbjct: 170  KHEEVVCVEQGGSKEESPNTEAED----VKEISHNVNNEPAKTSEN----EAKYVTLDHP 221

Query: 996  QLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKNLGRTKNXXXXXXXXXXHTSVX 1175
            ++  K   + VN+  +A  K +K+S     +  AS       +K            +S  
Sbjct: 222  KVSKKV--TPVNRESNAT-KAKKKSMLSTSKPKASQFSTPRSSKPTSTPTKTLASASS-- 276

Query: 1176 XXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESKRPAPTSLHMSLSFGSTNYDST 1355
                              T++  SPS+   K  ST E+++    SLHMSLS   +  D  
Sbjct: 277  ------------------TKRGISPSISGRKINSTSENRKVPNKSLHMSLSLAPSQPDPA 318

Query: 1356 S-TTARRSFIMEKMGDKEIVKRAFKAFQNNYSPSYTSIEAKSSVPPKV 1496
            S TT R+S IMEKMGDK+IVKRAFK FQNN++   TS E KS V  KV
Sbjct: 319  SHTTMRKSLIMEKMGDKDIVKRAFKTFQNNFNQPKTSGENKSLVKEKV 366


>ref|XP_006584484.1| PREDICTED: uncharacterized protein LOC100306130 isoform X1 [Glycine
            max]
          Length = 482

 Score =  206 bits (525), Expect = 2e-50
 Identities = 155/408 (37%), Positives = 210/408 (51%), Gaps = 5/408 (1%)
 Frame = +3

Query: 288  SAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCSTPGSVAKKKAYFEAHYK 467
            +AAS    + SVSFGRFEND LSWE+WS+FS NKYLEEVEKC+TPGSVA+KKAYFEAHYK
Sbjct: 5    TAASNPALQVSVSFGRFENDSLSWEKWSAFSPNKYLEEVEKCATPGSVAQKKAYFEAHYK 64

Query: 468  KIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDLGTSNVDRITKSVEPDTDL 647
             IAARKAELL Q ++M  DS RS   +  +   N C T+ +   S+    ++ V+ +T+ 
Sbjct: 65   NIAARKAELLAQAKQMEKDSPRSQRQNGEDLSCNTCGTDAECDMSSTQGSSEGVKQETNS 124

Query: 648  IDAVSSTHIDEPNQD-EVGIDHLIERGKEEVDSRPESPKSNSCVVSVKEENHSMGSK--D 818
            I  +  T +    +D  V ID+          S  E  K N      +E    +GS   D
Sbjct: 125  IGEIVRTDVSNLMEDVAVSIDYQ--------GSSVEGEKEN------EELESRLGSSQID 170

Query: 819  PSEELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSE-AQLEEMAILVQENHSTRSHS 995
              EE+V + +  +       E E+    VKE +H   +E A+  E     +  + T  H 
Sbjct: 171  KHEEVVCVEQGGSKEESPNTEAED----VKEISHNVNNEPAKTSEN----EAKYVTLDHP 222

Query: 996  QLYNKTGNSKVNKIEDAKLKPRKESQKIVQRTLASNEKNLGRTKNXXXXXXXXXXHTSVX 1175
            ++  K   + VN+  +A  K +K+S     +  AS       +K            +S  
Sbjct: 223  KVSKKV--TPVNRESNAT-KAKKKSMLSTSKPKASQFSTPRSSKPTSTPTKTLASASS-- 277

Query: 1176 XXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESKRPAPTSLHMSLSFGSTNYDST 1355
                              T++  SPS+   K  ST E+++    SLHMSLS   +  D  
Sbjct: 278  ------------------TKRGISPSISGRKINSTSENRKVPNKSLHMSLSLAPSQPDPA 319

Query: 1356 S-TTARRSFIMEKMGDKEIVKRAFKAFQNNYSPSYTSIEAKSSVPPKV 1496
            S TT R+S IMEKMGDK+IVKRAFK FQNN++   TS E KS V  KV
Sbjct: 320  SHTTMRKSLIMEKMGDKDIVKRAFKTFQNNFNQPKTSGENKSLVKEKV 367


>ref|XP_002311790.2| hypothetical protein POPTR_0008s19710g, partial [Populus trichocarpa]
            gi|550333484|gb|EEE89157.2| hypothetical protein
            POPTR_0008s19710g, partial [Populus trichocarpa]
          Length = 421

 Score =  206 bits (524), Expect = 3e-50
 Identities = 148/429 (34%), Positives = 214/429 (49%), Gaps = 15/429 (3%)
 Frame = +3

Query: 237  MGETMTESLKDEIEVKDSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCS 416
            MGE++  +   E ++  +AAS    + SVSFGRFEND LSWE+WSSFSQNKYLEEVEKC+
Sbjct: 1    MGESIVAASSYEDKIGGTAASDPALQVSVSFGRFENDSLSWEKWSSFSQNKYLEEVEKCA 60

Query: 417  TPGSVAKKKAYFEAHYKKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDLG 596
            +PGSVA+KKAYFEAHYKKIAARKAEL +QE++M  +S   +  + G+       T+    
Sbjct: 61   SPGSVAEKKAYFEAHYKKIAARKAELFDQEKQMEHESSMENNHNIGDLTGKNGQTDSSFD 120

Query: 597  TSNVDRITKSVEPDTDLIDAVSSTHIDEPNQD-------EVGIDHLIERGKEEVDSRPES 755
             SN     + +  ++ L +     H+DEP +D       +  +  L E    +V S+  S
Sbjct: 121  VSNGQTSAEGIWHESKLDNERDGGHVDEPYEDAAIDVHGQASLSGLYEDAANDVQSQASS 180

Query: 756  PKSNSCVVSVKEENHSMGSKDPSEELVVLVKEENHMTGSQPELEEIAILVKEE------N 917
                     VKEE   + +K  S E                +LEE+A++ +EE       
Sbjct: 181  NG------RVKEE---LENKLDSPE--------------STKLEELALIKEEEKGYQDTR 217

Query: 918  HLTRSEAQLEEMAILVQENHSTRSHSQLYNK-TGNSKVNKIEDAKLKPRKESQKIVQRTL 1094
             L ++  + +E  ++++E      H +  +K    SKV  I  AK KP     K  Q + 
Sbjct: 218  ELPKNSEKEKESILMIKEEKVKFDHQRGSSKIIPLSKVRDIARAKKKPEPLVTKQPQIST 277

Query: 1095 ASNEKNLGRTKNXXXXXXXXXXHTSVXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQP 1274
                K +  + +                                 T+K +   L R+K P
Sbjct: 278  PKVSKRVPTSSS--------------------------LSASQSSTKKMNGSLLPRSKNP 311

Query: 1275 STMESKRPAPTSLHMSLSFGSTNYD-STSTTARRSFIMEKMGDKEIVKRAFKAFQNNYSP 1451
               E+K+    SLH+SL+   +N +     T R+SFI EKMGDK+IVKRAFK FQNN+S 
Sbjct: 312  PAGENKKVTSKSLHLSLTMDPSNSEPDPLITTRKSFIREKMGDKDIVKRAFKTFQNNFSQ 371

Query: 1452 SYTSIEAKS 1478
              +S E ++
Sbjct: 372  LKSSAEERA 380


>ref|XP_007153684.1| hypothetical protein PHAVU_003G056200g [Phaseolus vulgaris]
            gi|593781289|ref|XP_007153685.1| hypothetical protein
            PHAVU_003G056200g [Phaseolus vulgaris]
            gi|561027038|gb|ESW25678.1| hypothetical protein
            PHAVU_003G056200g [Phaseolus vulgaris]
            gi|561027039|gb|ESW25679.1| hypothetical protein
            PHAVU_003G056200g [Phaseolus vulgaris]
          Length = 482

 Score =  203 bits (517), Expect = 2e-49
 Identities = 149/410 (36%), Positives = 210/410 (51%), Gaps = 6/410 (1%)
 Frame = +3

Query: 285  DSAASKAVFEASVSFGRFENDLLSWERWSSFSQNKYLEEVEKCSTPGSVAKKKAYFEAHY 464
            ++A+S    + SVSFGRFEND LSWE+WS+FS NKYLEEVEKC+TPGSVA+KKAYFEAHY
Sbjct: 3    ETASSNPALQVSVSFGRFENDSLSWEKWSAFSPNKYLEEVEKCATPGSVAQKKAYFEAHY 62

Query: 465  KKIAARKAELLEQEQKMRDDSLRSDYPSSGNPISNGCDTNVDLGTSNVDRITKSVEPDTD 644
            K +AARKAELL QE++M  DS++S Y +  +       T+ +   SN    ++ V+ +T+
Sbjct: 63   KNVAARKAELLAQEKQMEKDSVKSQYQNDEDLSCISSVTDAECDISNAQHSSEGVKQETN 122

Query: 645  LIDAVSSTHIDEPNQDEVGIDHLIERGKEEVDSRPESPKSNSCVVSVKEENHSMGSKDPS 824
             I  +  T           + +L E      D +  S +     V+ + +  S  S+   
Sbjct: 123  SIGEIVRT----------DVSNLGEYAAVSTDYQGSSVEGEK--VNDELDRRSGSSQIDK 170

Query: 825  EELVVLVKEENHMTGSQPELEEIAILVKEENHLTRSEAQ-LEEMAILVQENHSTRSHSQL 1001
            +E VV V++     GS+ E                SEA+ L E++  V       S ++ 
Sbjct: 171  QEEVVCVEQ----GGSKEE-------------CPNSEAEGLNEISHDVNNEPVWASETEA 213

Query: 1002 YNKT-GNSKVNKIEDAKLKPR---KESQKIVQRTLASNEKNLGRTKNXXXXXXXXXXHTS 1169
              KT  N KV+K      + R   K  +K +Q T  S    +   +N            S
Sbjct: 214  QYKTLDNPKVSKKVTPVSRERNAIKGKKKSMQPTSKSKASRISTPRNPKPTSTPTKTLAS 273

Query: 1170 VXXXXXXXXXXXXXXXXXXXTRKESSPSLLRNKQPSTMESKRPAPTSLHMSLSFGSTNYD 1349
                                T++E SPS+   +  ST E+++    SLHMSLS G +  D
Sbjct: 274  A----------------SSSTKREISPSISGRETASTAENRKIPNKSLHMSLSLGPSQLD 317

Query: 1350 -STSTTARRSFIMEKMGDKEIVKRAFKAFQNNYSPSYTSIEAKSSVPPKV 1496
             +  T+ R+S IME+MGDK+IVKRAFK FQNN++   TS E KS V  KV
Sbjct: 318  PAPRTSVRKSLIMERMGDKDIVKRAFKTFQNNFNQPKTSGENKSMVKEKV 367

