BLASTX nr result

ID: Akebia25_contig00002711 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00002711
         (1807 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007035326.1| MOS2, putative isoform 1 [Theobroma cacao] g...   410   e-111
ref|XP_002522998.1| Protein MOS2, putative [Ricinus communis] gi...   397   e-108
ref|XP_006846498.1| hypothetical protein AMTR_s00018p00151280 [A...   391   e-106
ref|XP_006355237.1| PREDICTED: protein MOS2-like [Solanum tubero...   387   e-104
ref|XP_007153069.1| hypothetical protein PHAVU_003G004000g [Phas...   386   e-104
ref|XP_004169661.1| PREDICTED: protein MOS2-like [Cucumis sativus]    386   e-104
ref|XP_004144463.1| PREDICTED: protein MOS2-like [Cucumis sativus]    386   e-104
ref|XP_004246061.1| PREDICTED: protein MOS2-like isoform 1 [Sola...   385   e-104
ref|XP_004498120.1| PREDICTED: protein MOS2-like isoform X1 [Cic...   382   e-103
gb|EXC18489.1| Protein MOS2 [Morus notabilis]                         382   e-103
ref|XP_007153333.1| hypothetical protein PHAVU_003G026500g [Phas...   381   e-103
ref|NP_174617.1| protein MOS2 [Arabidopsis thaliana] gi|75169419...   381   e-103
ref|XP_006307443.1| hypothetical protein CARUB_v10009066mg [Caps...   379   e-102
ref|XP_002893773.1| hypothetical protein ARALYDRAFT_890931 [Arab...   375   e-101
ref|XP_006415079.1| hypothetical protein EUTSA_v10007601mg [Eutr...   373   e-100
dbj|BAJ34354.1| unnamed protein product [Thellungiella halophila]     373   e-100
ref|XP_003556598.1| PREDICTED: protein MOS2-like, partial [Glyci...   367   1e-98
ref|XP_003529331.1| PREDICTED: protein MOS2-like [Glycine max]        363   1e-97
ref|XP_006368274.1| KOW domain-containing family protein [Populu...   362   3e-97
ref|XP_002304388.1| KOW domain-containing family protein [Populu...   359   2e-96

>ref|XP_007035326.1| MOS2, putative isoform 1 [Theobroma cacao]
            gi|590660169|ref|XP_007035327.1| MOS2, putative isoform 1
            [Theobroma cacao] gi|508714355|gb|EOY06252.1| MOS2,
            putative isoform 1 [Theobroma cacao]
            gi|508714356|gb|EOY06253.1| MOS2, putative isoform 1
            [Theobroma cacao]
          Length = 465

 Score =  410 bits (1054), Expect = e-111
 Identities = 243/470 (51%), Positives = 307/470 (65%), Gaps = 16/470 (3%)
 Frame = -3

Query: 1508 EFVTEFDPSKTITD--RQPKHVIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEAP- 1338
            EFVTEFDPSKT  D   +P  VIP  +N W+P+KKM+NL +PLQS     DL+FELE+  
Sbjct: 33   EFVTEFDPSKTPADPNSKPSFVIPPKQNEWRPYKKMKNLHIPLQSD-GSRDLQFELESSS 91

Query: 1337 ----PDTDSAMSYGLNLRSKEDGKKSDDLPEDRSGLPGS---IENLMLQKFKEDMKNLPE 1179
                P++D+ +SYGLNLR  ++  K+D    D+ G+P S   +E ++LQ  KED+K LPE
Sbjct: 92   DLPLPNSDAKISYGLNLR--DNSAKND--AGDQQGIPESAAPVEAVLLQSLKEDLKRLPE 147

Query: 1178 DRGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGFEPQTN 999
            DRGF+EF DVPVEGFG  LL+ YGW EG+GIG+NAKEDVKV QY RR+ +EGLGF  + N
Sbjct: 148  DRGFEEFEDVPVEGFGKALLAGYGWVEGRGIGKNAKEDVKVKQYERRTDKEGLGFSSKEN 207

Query: 998  GTKQKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIHVGKIVRVVGGRHXXXXXX 819
              +  G  +  Q           +H     E++V+ +  G  VGK VRV+ GR       
Sbjct: 208  KERLPGFTNVKQ-----------KHDT---EEIVKEDKDGFFVGKDVRVIEGREMGLKGT 253

Query: 818  XXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRK-----IRESKDDR 654
                  G       +VL+L +S            +LGS EEEK LRK     IRE+KD +
Sbjct: 254  IMEKLGG-----GWIVLRLKKSEEKVKVRLFEIADLGSREEEKCLRKLTELKIREAKDLK 308

Query: 653  QKDGRRDSLSRVDREKRNGGKNDRKGNKEE-RRRGDEGRHKEEERKKVVSWLRSHIRVRI 477
             K   R  +S+  RE     +++ K N E  R  GD G          VSWLRSHIRVRI
Sbjct: 309  TKGDER-KVSKRSRESEK--RSETKVNVERVRTNGDRG----------VSWLRSHIRVRI 355

Query: 476  ISKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLY 297
            ISK+ +GG+LYLKKG+VVDVVGP  CDI+MDES+EL+Q V Q++LETALPRRGG VL+LY
Sbjct: 356  ISKNLEGGRLYLKKGQVVDVVGPYMCDISMDESRELIQGVEQELLETALPRRGGPVLILY 415

Query: 296  GEHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            G HKGV+G+LVERDV++E GVVRDADSH LLNV+LEQIAEY+GDPSY+GY
Sbjct: 416  GRHKGVYGSLVERDVDRETGVVRDADSHELLNVKLEQIAEYMGDPSYLGY 465


>ref|XP_002522998.1| Protein MOS2, putative [Ricinus communis] gi|223537810|gb|EEF39428.1|
            Protein MOS2, putative [Ricinus communis]
          Length = 479

 Score =  397 bits (1021), Expect = e-108
 Identities = 233/485 (48%), Positives = 303/485 (62%), Gaps = 15/485 (3%)
 Frame = -3

Query: 1556 FSLEDKPRDEVRETQHEFVTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQSS 1377
            FS       +   T  +FVTEFDPSKT+T +Q + +IP  EN W+P KKM+NL L     
Sbjct: 21   FSASVDAETQTNGTDKQFVTEFDPSKTLT-KQNRIIIPPKENEWRPHKKMKNLALLPSLQ 79

Query: 1376 TDDPD-LRFELEAPPDT--DSAMSYGLNLRS--KEDGKKSDDLPEDRSGLPGSIENLMLQ 1212
            + DPD LRFE+    D   D +MSYGLN+R+  ++DG KS    +     P S EN+ML+
Sbjct: 80   SSDPDALRFEIATDADDGDDKSMSYGLNVRAAGEDDGGKSQQQKK-----PESTENIMLE 134

Query: 1211 KFKEDMKNLPEDRGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSG 1032
            K + D++ LPEDRGFDEF DVPVEGFGA LL+ YGW EG+GIGRNAKEDVKV QY +R+ 
Sbjct: 135  KLRYDLERLPEDRGFDEFKDVPVEGFGAALLAGYGWREGRGIGRNAKEDVKVKQYTKRTD 194

Query: 1031 REGLGFEPQTNGTKQKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIH------V 870
            +EGLGF      +    N    Q       +      +   +K  +RE  GI+      V
Sbjct: 195  KEGLGFVASVVSSNNVKNRDTVQNDFNSVSNINNVKHIDNGQKERKRERDGINNGDGFFV 254

Query: 869  GKIVRVVGGRHXXXXXXXXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEK 690
            GK VRV+ G               ++++   V+LK++ S            +LGS EE+K
Sbjct: 255  GKDVRVIAGGREIYGLKGRILERLNADW---VILKIAESNDEVKLRVSDIADLGSKEEDK 311

Query: 689  FLRKIR--ESKDDRQKDGRRDSLSRVDREKRNGGKNDRKGNKEERR--RGDEGRHKEEER 522
             LRK++  + +D + KD             R+ GK   + +KE R   R D G+ K+E+ 
Sbjct: 312  CLRKLKALQLEDKKSKD-------------RDNGKGVTELSKERRESVRRDGGQVKDEKM 358

Query: 521  KKVVSWLRSHIRVRIISKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDML 342
            +    WLR HIRVR+ISKD KGG+ YLKKGEVVDVVGP  CDI+MDE+KELVQ V+QD+L
Sbjct: 359  R----WLRDHIRVRVISKDLKGGRFYLKKGEVVDVVGPYVCDISMDETKELVQGVDQDLL 414

Query: 341  ETALPRRGGQVLVLYGEHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDP 162
            ETALPRRGG VLVLYG+HKG +GNLVE+D+++E GVV+D D+   LNV+LEQIAEY+GDP
Sbjct: 415  ETALPRRGGPVLVLYGKHKGAYGNLVEKDLDRETGVVQDFDTREFLNVKLEQIAEYVGDP 474

Query: 161  SYIGY 147
            SYIGY
Sbjct: 475  SYIGY 479


>ref|XP_006846498.1| hypothetical protein AMTR_s00018p00151280 [Amborella trichopoda]
            gi|548849308|gb|ERN08173.1| hypothetical protein
            AMTR_s00018p00151280 [Amborella trichopoda]
          Length = 540

 Score =  391 bits (1005), Expect = e-106
 Identities = 237/522 (45%), Positives = 321/522 (61%), Gaps = 64/522 (12%)
 Frame = -3

Query: 1520 ETQHEFVTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEA 1341
            E + EFVTEFD SKT +++  + VIPR E++W+  K M+N+        ++  L FE+  
Sbjct: 29   EPKAEFVTEFDSSKTPSEKS-RLVIPRQESSWRAEKNMKNI------KPEETHLEFEIIT 81

Query: 1340 PPDT-DSAMSYGLNLRSKEDGKKSDDLPED--RSGLP--GSIENLMLQ-KFKEDMKN--- 1188
               + +S + YGLNLR+K +G  S    ED   SGL     +E   +  K K+DM N   
Sbjct: 82   HETSIESDVGYGLNLRNKSNGGDSKRENEDMGNSGLSCMEPVEATEVDAKRKKDMGNSSF 141

Query: 1187 -----------LPEDRGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVR 1041
                       L ED G DEF D+P+EGFGA +L+ YGW+EG+GIGR AK+D++VVQY+R
Sbjct: 142  PSVKPKNLDSELEEDGGLDEFSDMPIEGFGAAVLAGYGWTEGQGIGRKAKKDIQVVQYIR 201

Query: 1040 RSGREGLGFEPQTNGTKQK--------GNNSRPQLVAPKGPDGRTRHVVGIDEKLVEREL 885
            R+G  GLGF P +   K++           SRP+L+APKG +GR RH VGIDEKLV RE+
Sbjct: 202  RAGMGGLGFTPSSVPEKKQKKYVKPGESRESRPELIAPKGSNGRIRHAVGIDEKLVPREI 261

Query: 884  KGIHVGKIVRVVGGRHXXXXXXXXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGS 705
            KG  VGKI+RV+GG H             D      + LKL +S            ELGS
Sbjct: 262  KGFFVGKILRVIGGPHLGLKGQLIEIFGDDGS-SQKIGLKLLKSEEMVVVDREELAELGS 320

Query: 704  VEEEKFLRKIRESKDDRQKDGRR-DSLSRVDREKRNG---------------GKNDR--- 582
            +EE+K L+++RE K   + DG R   L R +RE  NG                ++DR   
Sbjct: 321  LEEDKCLKRMRELK--LEGDGNRLKHLRRDERESHNGEFGKERKAEPLHGDVSRHDRERE 378

Query: 581  ----KGNKEERRRGDEGRHK-----EEERKKV--------VSWLRSHIRVRIISKDFKGG 453
                K  KE+RR+ ++ RH+     E + K +        +SWLRSHIRV+++SKDF+GG
Sbjct: 379  RSSSKREKEDRRKREKSRHQGRKSGERDGKSIREGVETAPLSWLRSHIRVKVVSKDFRGG 438

Query: 452  KLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLYGEHKGVFG 273
            +LYLKKGEV+DVVGP TCDITMD+SKE++Q VNQ++L+TALP+RGG VLVL G+HK VFG
Sbjct: 439  RLYLKKGEVMDVVGPLTCDITMDDSKEVIQGVNQEILQTALPQRGGYVLVLLGKHKDVFG 498

Query: 272  NLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
             LVE+D++K IG+V+DAD+  +++V L+QIAEY GDP  IGY
Sbjct: 499  KLVEKDLDKGIGIVQDADTFEMVSVELDQIAEYTGDPGCIGY 540


>ref|XP_006355237.1| PREDICTED: protein MOS2-like [Solanum tuberosum]
          Length = 484

 Score =  387 bits (993), Expect = e-104
 Identities = 221/474 (46%), Positives = 300/474 (63%), Gaps = 8/474 (1%)
 Frame = -3

Query: 1544 DKPRDEVRETQHEFVTEFDPSKTI-TDRQPKHVIPRLENTWKPFKKMRNLDLPLQS--ST 1374
            D PR+     + E+VTEFDPSK   +  +   +IP  +N W+P K+M+NL++PLQ+  S 
Sbjct: 27   DDPRNSSNPVEKEYVTEFDPSKAAASSTKDTLIIPPKQNEWRPIKRMKNLEVPLQADASA 86

Query: 1373 DDPDLRFELEAPPDTDSA---MSYGLNLRSKEDGKKSDDLPEDRSGLPGSIENLMLQKFK 1203
             D  L+FEL++    + A   +SYGLN+R  E+     +   + +  P  + + ML KFK
Sbjct: 87   ADQPLQFELDSGAGVEPASDGISYGLNVRQSENPNPDPNPNPNTNSNPKQMIDPMLHKFK 146

Query: 1202 EDMKNLPEDRGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREG 1023
            ED+K LPE  G DE+ D+PVEGFGA LL  YGW EG+GIGRNAKEDVKVV+Y + + +EG
Sbjct: 147  EDLKRLPEHNGIDEYTDMPVEGFGAALLKGYGWVEGRGIGRNAKEDVKVVEYKKWTAKEG 206

Query: 1022 LGFEPQTNGTKQKGNNSRPQLVAPKGPDG-RTRHVVGIDEKLV-ERELKGIHVGKIVRVV 849
            +GF P+      KG  +   +   K  DG +  H  G  EK+  E+   G++VGK VRVV
Sbjct: 207  IGFIPEVPKPSSKGEGAVKSI--KKSEDGVKVDHSDGNIEKIDREKAGNGLYVGKKVRVV 264

Query: 848  GGRHXXXXXXXXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRKIRE 669
             G+                    +V+LKL+               LGSVEEE+ L+K+ E
Sbjct: 265  RGKEMGMKGEILEVNSSGD----LVILKLADKEVKLQARDLAE--LGSVEEERCLKKLLE 318

Query: 668  SKDDRQKDGRRDSLSRVDREKRNGGKNDRKGNKEERRRGDEGRHKEEERKKVVSWLRSHI 489
             K   +K     +L  V R++ +GG++  +   E ++   E R   +ER   VSWL SHI
Sbjct: 319  LKIREEKS----NLDGV-RKQSSGGRSRDEATTESKK---ESRRSRDERSDKVSWLASHI 370

Query: 488  RVRIISKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQV 309
            RVRIISKD K G+LYLKKGE++DVVGP +CDI MDE++EL+Q V+Q++LETALP+RGG V
Sbjct: 371  RVRIISKDLKKGRLYLKKGEIMDVVGPTSCDICMDETRELIQGVDQELLETALPKRGGPV 430

Query: 308  LVLYGEHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            LVLYG +KGV+G+LVE+D EKE G++RD D+  LL VRLEQIAEY+GDPSYIGY
Sbjct: 431  LVLYGRNKGVYGHLVEKDSEKETGIIRDGDTKELLKVRLEQIAEYLGDPSYIGY 484


>ref|XP_007153069.1| hypothetical protein PHAVU_003G004000g [Phaseolus vulgaris]
            gi|561026423|gb|ESW25063.1| hypothetical protein
            PHAVU_003G004000g [Phaseolus vulgaris]
          Length = 472

 Score =  386 bits (992), Expect = e-104
 Identities = 228/468 (48%), Positives = 296/468 (63%), Gaps = 16/468 (3%)
 Frame = -3

Query: 1502 VTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQSSTDDPD---LRFELEAPPD 1332
            +TEFDPSK      PK +IP ++N WKPFKKM+NL LP    T DP+   L FEL A  D
Sbjct: 39   ITEFDPSKPAPSLAPKTLIPPIQNQWKPFKKMKNLHLP----TADPESEALTFELHAADD 94

Query: 1331 T-DSAMSYGLNLRSKEDGKKSDDL---PEDRSGLPGSIENLMLQKFKEDMKNLPEDRGFD 1164
              DS +SYGLNLR+ +  ++++     P     +P   E+ MLQK K+D+  LPED GFD
Sbjct: 95   QPDSDVSYGLNLRADKKSEQNNGTALPPPPPRRVPA--ESTMLQKLKDDLLRLPEDNGFD 152

Query: 1163 EFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGFEPQTNGTKQK 984
            EF DVPVEGFGA LL+ YGW EG GIG+NAKEDVKVV+  RR+ +EGLGF         +
Sbjct: 153  EFKDVPVEGFGAALLAGYGWKEGMGIGKNAKEDVKVVEIKRRTAKEGLGFVGDAPAALVR 212

Query: 983  GNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIHVGKIVRVVGGRHXXXXXXXXXXX 804
             NN +         D + +      EK  ++E       K+VR+VGGR            
Sbjct: 213  SNNDK---------DNKDK------EKNEKKE-------KVVRIVGGRDAGLKGSVVSRI 250

Query: 803  XGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRKIRESKDDRQKDGRRDSLS 624
              D      +VL+LSRS            ELGS EEE+ LRK++ESK  R+  G +    
Sbjct: 251  GDD-----YLVLELSRSGEKVKVKVGDVAELGSKEEERCLRKLKESKTQREDRGPKRKHE 305

Query: 623  RVDREKRNG----GKNDRKG--NKEERRRGDEGRHKEEER---KKVVSWLRSHIRVRIIS 471
            R D  + NG     + +RKG   ++   +   G  +EE R    + VSWL SHIRVR+IS
Sbjct: 306  R-DEVEENGVDVSRREERKGVGRRDVVEKRTNGGRREERRVVDHRKVSWLTSHIRVRVIS 364

Query: 470  KDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLYGE 291
            +D KGG LYLKKGEV+DVVGP TCD++MDES+E+VQ V+QD LETA+P+RGG VLVL G+
Sbjct: 365  RDLKGGLLYLKKGEVLDVVGPTTCDVSMDESREIVQGVSQDFLETAIPKRGGPVLVLAGK 424

Query: 290  HKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            +KGVFG+LVERD+++E+ +VRDAD+H LLNV+LEQIAEY+GDPS +G+
Sbjct: 425  YKGVFGSLVERDLDREMAIVRDADTHELLNVKLEQIAEYMGDPSLLGH 472


>ref|XP_004169661.1| PREDICTED: protein MOS2-like [Cucumis sativus]
          Length = 478

 Score =  386 bits (992), Expect = e-104
 Identities = 228/470 (48%), Positives = 300/470 (63%), Gaps = 16/470 (3%)
 Frame = -3

Query: 1508 EFVTEFDPSKTITDRQPKH---VIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEA- 1341
            ++V EFD SK +++   K    VIP L+N W+P K+M+NL++PL  S D+  L+FE  + 
Sbjct: 42   QYVNEFDASKPLSETTGKSRNLVIPSLQNEWRPLKRMKNLEVPLDQS-DESHLKFESASG 100

Query: 1340 -PPDTDSAMSYGLNLRSKEDGKKSDDLPEDRSG----LPGSIENLMLQKFKEDMKNLPED 1176
              P  DS MSYGLN+R   DG K  D  E +SG     P  +E +ML+KFK D++ LPED
Sbjct: 101  LDPLDDSKMSYGLNVRQSVDGMKISD--ESKSGEEPPRPAPLEVIMLEKFKADLERLPED 158

Query: 1175 RGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGF--EPQT 1002
            RGF++F +VPVE F A L++ YGW +GKGIGRNAKEDVKV +Y RR+ ++GLGF  +   
Sbjct: 159  RGFEDFEEVPVESFAAALMNGYGWRQGKGIGRNAKEDVKVREYSRRTDKQGLGFVSDVPV 218

Query: 1001 NGTKQKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGI-HVGKIVRVVGGRHXXXX 825
              +K++      +    K  +GR +       +  +RE  G+  +GK VR+V GR     
Sbjct: 219  GISKKEEEKDGGRERERKRDEGRVK-------ENRDRESDGLASIGKHVRIVRGRDAGLK 271

Query: 824  XXXXXXXXGDSEYPAMVVLKLSR--SXXXXXXXXXXXXELGSVEEEKFLRKIRESKDDRQ 651
                     D      +VLKLS+               ELGS EEEKFL+K+ E K   +
Sbjct: 272  GRVLEKLDSD-----WLVLKLSKRDEHVKLKVRATDIAELGSKEEEKFLKKLEELKVKNE 326

Query: 650  KDG--RRDSLSRVDREKRNGGKNDRKGNKEERRRGDEGRHKEEERKKVVSWLRSHIRVRI 477
              G  RR  + +V  ++ NG ++                  +E+R   +SWL SHIRVRI
Sbjct: 327  NTGQKRRREVEQVVEKRENGSRD------------------KEKRTGRLSWLTSHIRVRI 368

Query: 476  ISKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLY 297
            ISK+FKGGK YLKKGE+VDVVGP+ CDI++D S+ELVQ V+Q++LETALPRRGG VLVLY
Sbjct: 369  ISKEFKGGKFYLKKGEIVDVVGPSICDISIDGSRELVQGVSQELLETALPRRGGPVLVLY 428

Query: 296  GEHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            G+HKGV+G+LVERD++KE GVVRDADSH LLNVRLEQIAEYIGDPSY+GY
Sbjct: 429  GKHKGVYGSLVERDLDKETGVVRDADSHELLNVRLEQIAEYIGDPSYLGY 478


>ref|XP_004144463.1| PREDICTED: protein MOS2-like [Cucumis sativus]
          Length = 500

 Score =  386 bits (992), Expect = e-104
 Identities = 228/470 (48%), Positives = 300/470 (63%), Gaps = 16/470 (3%)
 Frame = -3

Query: 1508 EFVTEFDPSKTITDRQPKH---VIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEA- 1341
            ++V EFD SK +++   K    VIP L+N W+P K+M+NL++PL  S D+  L+FE  + 
Sbjct: 64   QYVNEFDASKPLSETTGKSRNLVIPSLQNEWRPLKRMKNLEVPLDQS-DESHLKFESASG 122

Query: 1340 -PPDTDSAMSYGLNLRSKEDGKKSDDLPEDRSG----LPGSIENLMLQKFKEDMKNLPED 1176
              P  DS MSYGLN+R   DG K  D  E +SG     P  +E +ML+KFK D++ LPED
Sbjct: 123  LDPLDDSKMSYGLNVRQSVDGMKISD--ESKSGEEPPRPAPLEVIMLEKFKADLERLPED 180

Query: 1175 RGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGF--EPQT 1002
            RGF++F +VPVE F A L++ YGW +GKGIGRNAKEDVKV +Y RR+ ++GLGF  +   
Sbjct: 181  RGFEDFEEVPVESFAAALMNGYGWRQGKGIGRNAKEDVKVREYSRRTDKQGLGFVSDVPV 240

Query: 1001 NGTKQKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGI-HVGKIVRVVGGRHXXXX 825
              +K++      +    K  +GR +       +  +RE  G+  +GK VR+V GR     
Sbjct: 241  GISKKEEEKDGGRERERKRDEGRVK-------ENRDRESDGLASIGKHVRIVRGRDAGLK 293

Query: 824  XXXXXXXXGDSEYPAMVVLKLSR--SXXXXXXXXXXXXELGSVEEEKFLRKIRESKDDRQ 651
                     D      +VLKLS+               ELGS EEEKFL+K+ E K   +
Sbjct: 294  GRVLEKLDSD-----WLVLKLSKRDEHVKLKVRATDIAELGSKEEEKFLKKLEELKVKNE 348

Query: 650  KDG--RRDSLSRVDREKRNGGKNDRKGNKEERRRGDEGRHKEEERKKVVSWLRSHIRVRI 477
              G  RR  + +V  ++ NG ++                  +E+R   +SWL SHIRVRI
Sbjct: 349  NTGQKRRREVEQVVEKRENGSRD------------------KEKRTGRLSWLTSHIRVRI 390

Query: 476  ISKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLY 297
            ISK+FKGGK YLKKGE+VDVVGP+ CDI++D S+ELVQ V+Q++LETALPRRGG VLVLY
Sbjct: 391  ISKEFKGGKFYLKKGEIVDVVGPSICDISIDGSRELVQGVSQELLETALPRRGGPVLVLY 450

Query: 296  GEHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            G+HKGV+G+LVERD++KE GVVRDADSH LLNVRLEQIAEYIGDPSY+GY
Sbjct: 451  GKHKGVYGSLVERDLDKETGVVRDADSHELLNVRLEQIAEYIGDPSYLGY 500


>ref|XP_004246061.1| PREDICTED: protein MOS2-like isoform 1 [Solanum lycopersicum]
            gi|460401091|ref|XP_004246062.1| PREDICTED: protein
            MOS2-like isoform 2 [Solanum lycopersicum]
          Length = 485

 Score =  385 bits (988), Expect = e-104
 Identities = 225/477 (47%), Positives = 298/477 (62%), Gaps = 11/477 (2%)
 Frame = -3

Query: 1544 DKPRDEVRETQHEFVTEFDPSKTI-TDRQPKHVIPRLENTWKPFKKMRNLDLPLQS--ST 1374
            D PR+     + E+VTEFDPSK   +  +   +IP  +N W+P K+M+NL++PLQ+  S 
Sbjct: 27   DDPRNSSNPIEKEYVTEFDPSKAAASSTKDTLIIPPKQNEWRPIKRMKNLEVPLQADASA 86

Query: 1373 DDPDLRFELEAPPDTDSA---MSYGLNLRSKEDGKKSDDLPEDRSGLPGSIENLMLQKFK 1203
             D  L+FEL++    + A   +SYGLN+R  E+   S +   + +  P  + + ML KFK
Sbjct: 87   ADQPLQFELDSGAGVEPASDGISYGLNVRQSENPNPSPNPNPNPTPNPKQVIDPMLHKFK 146

Query: 1202 EDMKNLPEDRGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREG 1023
            ED+K LPE  G DE+ D+PVEGFGA LL  YGW EG+GIGRNAKEDVKVV+Y R + +EG
Sbjct: 147  EDLKRLPEHNGIDEYTDMPVEGFGAALLKGYGWVEGRGIGRNAKEDVKVVEYKRWTAKEG 206

Query: 1022 LGFEPQTNGTKQKGNNSRPQLVAPKGPDG-RTRHVVGIDEKLV-ERELKGIHVGKIVRVV 849
            +GF P+      K      + +  KG +G +  H  G  EK+  E+  KG++VGK VRVV
Sbjct: 207  IGFIPEVPKPSSKAEGG-VKPIKKKGEEGIKVDHSDGYIEKIDREKGGKGLYVGKKVRVV 265

Query: 848  GGRHXXXXXXXXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRKIRE 669
             G+                    +V+LKL+               LGSVEEE+ L+K+ E
Sbjct: 266  RGKEMGMKGEVLEVNSRGE----LVILKLADKEVKLQARDLAE--LGSVEEERCLKKLLE 319

Query: 668  SKDDRQK---DGRRDSLSRVDREKRNGGKNDRKGNKEERRRGDEGRHKEEERKKVVSWLR 498
             K   +K   DG R         K++ G   R     ER++  E R   +ER   VSWL 
Sbjct: 320  LKIREEKSHLDGVR---------KQSSGSRSRDEATTERKK--ESRRSRDERSDKVSWLA 368

Query: 497  SHIRVRIISKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRG 318
            SHIRVRIISKD K G+LYLKKGE++DVVGP +CDI MDE++EL+Q V+Q++LETALP+RG
Sbjct: 369  SHIRVRIISKDLKRGRLYLKKGEIMDVVGPMSCDICMDETRELIQGVDQELLETALPKRG 428

Query: 317  GQVLVLYGEHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            G VLVLYG +KGV+G+LVE+D EKE GV+RD D+  LL VRLEQIAEY+GDPS IGY
Sbjct: 429  GPVLVLYGRNKGVYGHLVEKDSEKETGVIRDGDTKDLLKVRLEQIAEYLGDPSDIGY 485


>ref|XP_004498120.1| PREDICTED: protein MOS2-like isoform X1 [Cicer arietinum]
            gi|502123466|ref|XP_004498121.1| PREDICTED: protein
            MOS2-like isoform X2 [Cicer arietinum]
          Length = 460

 Score =  382 bits (981), Expect = e-103
 Identities = 222/485 (45%), Positives = 294/485 (60%), Gaps = 13/485 (2%)
 Frame = -3

Query: 1562 QNFSLEDKPRDEVRETQHEFVTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQ 1383
            QNF  ++ P    ++     +TEFDPSK  T   PK +IP L N W+P KKM+NLDLP+ 
Sbjct: 27   QNFHDDEDPSSNSKQ----LITEFDPSKPQTLHPPKTLIPPLPNQWRPNKKMKNLDLPIT 82

Query: 1382 SSTDDPDLRFELEAPPDTDSA---MSYGLNLRSKEDG------KKSDDLPEDRSGLPGSI 1230
             S     L FE++    +D      S+GLNLRS          ++  D+P  R     S+
Sbjct: 83   DSHSSHSLAFEIDTTSISDQPDDNTSFGLNLRSTTTDDNNTKQQQQPDVPRPRV----SV 138

Query: 1229 ENLMLQKFKEDMKNLPEDRGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQ 1050
            E  M++KFKED++ LP+D+GFDEF DV V+GFGA LL  YGW EG GIG+NAKE+VKVV+
Sbjct: 139  EVSMMKKFKEDLERLPDDQGFDEFKDVAVDGFGAALLGGYGWKEGMGIGKNAKENVKVVE 198

Query: 1049 YVRRSGREGLGFE---PQTNGTKQKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKG 879
              RR+ +EGLGF    P     K + N  +                    EK  + E   
Sbjct: 199  IKRRTAKEGLGFVADVPPPTSKKSEMNGKKES------------------EKRKKEE--- 237

Query: 878  IHVGKIVRVVGGRHXXXXXXXXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVE 699
                +IVR+V GR              D      ++LK+ RS            ELGS E
Sbjct: 238  ----RIVRIVRGRDVGLKASVVDRFGDD-----FLILKVLRSGEEVKVKIEDVAELGSKE 288

Query: 698  EEKFLRKIRESKDDRQKDGRRDSLSRVDREKRNGGKNDR-KGNKEERRRGDEGRHKEEER 522
            E++ LRK+++SK                RE+ NG ++ R +   EERR    G  +EE+ 
Sbjct: 289  EDRCLRKLQDSKTR-------------GREEENGSRSKRGRDEVEERRVNGNGGGREEKG 335

Query: 521  KKVVSWLRSHIRVRIISKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDML 342
            KK +SWL SHIRVR+IS+ FK G+LYLKKGEV+DV+GP TCDI++DES+E++Q V+QDML
Sbjct: 336  KKQISWLTSHIRVRVISRSFKAGRLYLKKGEVLDVIGPTTCDISLDESREIIQGVSQDML 395

Query: 341  ETALPRRGGQVLVLYGEHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDP 162
            ETA+P+RGG VLVLYG+HKGVFG+LVERD+++EIGVVRDAD+H LLNV+LE +AEYIGDP
Sbjct: 396  ETAIPKRGGPVLVLYGKHKGVFGSLVERDLDREIGVVRDADTHELLNVKLEHMAEYIGDP 455

Query: 161  SYIGY 147
            S +G+
Sbjct: 456  SLLGH 460


>gb|EXC18489.1| Protein MOS2 [Morus notabilis]
          Length = 476

 Score =  382 bits (980), Expect = e-103
 Identities = 218/489 (44%), Positives = 306/489 (62%), Gaps = 17/489 (3%)
 Frame = -3

Query: 1562 QNFSLE-DKPRDEVRETQHEFVTEFDPSKTITDRQPKH--VIPRLENTWKPFKKMRNLDL 1392
            QNF  + D    E      ++V EF+ S+T+T    ++  VIP ++N W+P K+M+NLDL
Sbjct: 22   QNFEDDNDNKSTENDANSRKYVIEFNASETLTGNATQNAVVIPPIQNEWRPHKRMKNLDL 81

Query: 1391 PLQSSTDDPD-LRFELEAPPD-TDSAMSYGLNLRSKEDGKKSDDLPEDRSGLPGS----- 1233
            P+ + +D    L+FE+E+  D T+S+MSYGLNLR    G   D++         +     
Sbjct: 82   PIAAQSDGSGGLQFEVESLSDATNSSMSYGLNLRQTAKGDHDDEINGQDEAKDKNERLRF 141

Query: 1232 --IENLMLQKFKEDMKNLPEDRGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVK 1059
               E+++LQK K D++ LPEDRG  EF DVPVEGFGA LLS YGW EG+GIG+NAKEDVK
Sbjct: 142  TPTEDVLLQKLKFDLQRLPEDRGMAEFEDVPVEGFGAALLSGYGWHEGRGIGKNAKEDVK 201

Query: 1058 VVQYVRRSGREGLGF-----EPQTNGTKQKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVE 894
            VV+Y +R+G++GLGF      P  N  +   NNS P+   PK  +    +    +++ + 
Sbjct: 202  VVEYTKRTGKQGLGFVMTDLPPLPNSNRDSLNNSIPK---PKDNNNNNNNNSSSNKESL- 257

Query: 893  RELKGIHVGKIVRVVGGRHXXXXXXXXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXE 714
                   +GK VR+V GR              D+     +V++LSRS            E
Sbjct: 258  -------IGKEVRIVRGRELGLKGRVLEKLSDDNR----LVVRLSRSQETVKVNIQDVAE 306

Query: 713  LGSVEEEKFLRKIRESKDDRQKDGRRDSLSRVDREKRNGGKNDRKGNKEERRRGDEGRHK 534
            LGS E+E  L++++E +   +++                 K ++K  + E +  D    K
Sbjct: 307  LGSEEDEACLKRLKELRIREEEE-----------------KKEKKSKRRENKSRDSDGEK 349

Query: 533  EEERKKVVSWLRSHIRVRIISKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVN 354
            ++  +K  SWLRSHIRVRIIS++ KGG+LYLKKGEVVDVVGP  CD++MD+ +EL+Q V+
Sbjct: 350  QQPPRK--SWLRSHIRVRIISRELKGGRLYLKKGEVVDVVGPKVCDVSMDDGRELIQGVS 407

Query: 353  QDMLETALPRRGGQVLVLYGEHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEY 174
            QD+LE+ALPRRGG VLVL+G+H+GV+G+LVERD+++E GVVRDAD+H L+NVRLEQIAEY
Sbjct: 408  QDVLESALPRRGGPVLVLFGKHEGVYGSLVERDLDRETGVVRDADTHDLINVRLEQIAEY 467

Query: 173  IGDPSYIGY 147
            IGDPSY+GY
Sbjct: 468  IGDPSYLGY 476


>ref|XP_007153333.1| hypothetical protein PHAVU_003G026500g [Phaseolus vulgaris]
            gi|561026687|gb|ESW25327.1| hypothetical protein
            PHAVU_003G026500g [Phaseolus vulgaris]
          Length = 468

 Score =  381 bits (979), Expect = e-103
 Identities = 226/482 (46%), Positives = 301/482 (62%), Gaps = 20/482 (4%)
 Frame = -3

Query: 1532 DEVRETQHE------FVTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQSSTD 1371
            D+   TQ++       +TEFDPSK      PK  IP ++N WKPFKKM+NL LP    T 
Sbjct: 23   DDTSGTQNDGGGSKHLITEFDPSKPAPSLAPKTQIPPIQNQWKPFKKMKNLHLP----TA 78

Query: 1370 DPD---LRFELEAPPDT-DSAMSYGLNLRSKEDGKKSDD--LPEDRSGLPGSIENLMLQK 1209
            DP+   L FEL A  D  DS +SYGLNLR+ +  ++++   LP     +P   E+ MLQK
Sbjct: 79   DPESEALTFELHAADDQPDSDVSYGLNLRTDKKSEQNNGTALPPPSRRVPA--ESTMLQK 136

Query: 1208 FKEDMKNLPEDRGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGR 1029
             K+D+  LPED+GFDEF DVPVEGFGA LL+ YGW EG GIG+NAKEDVKVV+  RR+ +
Sbjct: 137  LKDDLLRLPEDKGFDEFKDVPVEGFGAALLAGYGWKEGMGIGKNAKEDVKVVEIKRRTAK 196

Query: 1028 EGLGFEPQTNGTKQKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIHVGKIVRVV 849
            EGLGF         + NN + +                  EK  +++       K+VR+V
Sbjct: 197  EGLGFVGDAPAALVRSNNDKDK------------------EKNEKKD-------KVVRIV 231

Query: 848  GGRHXXXXXXXXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRKIRE 669
            GGR                +Y   +VL+LSRS            ELGS EEE+ LRK++E
Sbjct: 232  GGRDAGLKGSVVSRI---EDY--YLVLELSRSGEKVKVKVGDVAELGSKEEERCLRKLKE 286

Query: 668  SKDDRQKDGRRDSLSRVDREKRN---GGKNDRKG--NKEERRRGDEGRHKEEER---KKV 513
             K  R+  G +    R + E+       + +RKG   ++   +  +G  +EE R    + 
Sbjct: 287  LKIQREDRGPKRKQDRNEVEENRVDVSRREERKGVGRRDVIEKRTDGGRREERRVVDHRK 346

Query: 512  VSWLRSHIRVRIISKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETA 333
            VSWL SHIRVR+IS+D KGG LYLKKGEV+DVVGP TCD++MDES+E+VQ V+Q+ LETA
Sbjct: 347  VSWLTSHIRVRVISRDLKGGLLYLKKGEVLDVVGPTTCDVSMDESREIVQGVSQEFLETA 406

Query: 332  LPRRGGQVLVLYGEHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYI 153
            +P+RGG VLVL G++KGVFG+LVERD+++E+ +VRDAD+H LLNV+LEQIAEY+GDPS +
Sbjct: 407  IPKRGGPVLVLAGKYKGVFGSLVERDLDREMAIVRDADTHELLNVKLEQIAEYLGDPSLL 466

Query: 152  GY 147
            G+
Sbjct: 467  GH 468


>ref|NP_174617.1| protein MOS2 [Arabidopsis thaliana]
            gi|75169419|sp|Q9C801.1|MOS2_ARATH RecName: Full=Protein
            MOS2 gi|12322393|gb|AAG51225.1|AC051630_22 unknown
            protein; 82634-81246 [Arabidopsis thaliana]
            gi|20259490|gb|AAM13865.1| unknown protein [Arabidopsis
            thaliana] gi|29824125|gb|AAP04023.1| unknown protein
            [Arabidopsis thaliana] gi|77176696|gb|ABA64466.1|
            putative nucleic-acid binding protein [Arabidopsis
            thaliana] gi|332193481|gb|AEE31602.1| protein MOS2
            [Arabidopsis thaliana]
          Length = 462

 Score =  381 bits (978), Expect = e-103
 Identities = 222/465 (47%), Positives = 298/465 (64%), Gaps = 8/465 (1%)
 Frame = -3

Query: 1517 TQHEFVTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEAP 1338
            T  EFVTEFDPSKT+ +  PK+VIP +ENTW+P KKM+NLDLPLQS      L FE E P
Sbjct: 30   TSKEFVTEFDPSKTLANSIPKYVIPPIENTWRPHKKMKNLDLPLQSGNAGSGLEFEPEVP 89

Query: 1337 -PDTDSA--MSYGLNLRSK-EDGKKSDDLPEDRSGLPGSIENLMLQKFKEDMKNLPEDRG 1170
             P T+    +SYGLNLR K +D     D  E+R    G  E LMLQ  + D+ +L +D  
Sbjct: 90   LPGTEKPDNISYGLNLRQKVKDDSIGGDAVEERKVSMG--EQLMLQSLRRDLMSLADDPT 147

Query: 1169 FDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGFEPQTNGTK 990
             ++F  VPV+GFGA L++ YGW  GKGIG+NAKEDV++ +Y + + +EGLGF+P      
Sbjct: 148  LEDFESVPVDGFGAALMAGYGWKPGKGIGKNAKEDVEIKEYKKWTAKEGLGFDPD----- 202

Query: 989  QKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIH-VGKIVRVVGGRHXXXXXXXX 813
                  R ++V  K    + +  V +D+K V      +  VGK VR++ GR         
Sbjct: 203  ------RSKVVDVKA---KVKESVKLDKKGVGINGGDVFFVGKEVRIIAGRDVGLKGKIV 253

Query: 812  XXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRKIRESK-DDRQKDGRR 636
                 D       V+K+S S            +LGS EEEK L+K+++ + +DR+KD   
Sbjct: 254  EKPGSD-----FFVIKISGSEEEVKVGVNEVADLGSKEEEKCLKKLKDLQLNDREKD--- 305

Query: 635  DSLSRVDREKRNG-GKNDRKGNKEERRRGD-EGRHKEEERKKVVSWLRSHIRVRIISKDF 462
                    +K +G G+   +G++ E R  + + R +  ERK   SWLRSHI+VRI+SKD+
Sbjct: 306  --------KKTSGRGRGAERGSRSEVRASEKQDRGQTRERKVKPSWLRSHIKVRIVSKDW 357

Query: 461  KGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLYGEHKG 282
            KGG+LYLKKG+VVDVVGP TCDITMDE++ELVQ V+Q++LETALPRRGG VLVL G+HKG
Sbjct: 358  KGGRLYLKKGKVVDVVGPTTCDITMDETQELVQGVDQELLETALPRRGGPVLVLSGKHKG 417

Query: 281  VFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            V+GNLVE+D++KE GVVRD D+H +L+VRL+Q+AEY+GD   I Y
Sbjct: 418  VYGNLVEKDLDKETGVVRDLDNHKMLDVRLDQVAEYMGDMDDIEY 462


>ref|XP_006307443.1| hypothetical protein CARUB_v10009066mg [Capsella rubella]
            gi|482576154|gb|EOA40341.1| hypothetical protein
            CARUB_v10009066mg [Capsella rubella]
          Length = 463

 Score =  379 bits (973), Expect = e-102
 Identities = 224/493 (45%), Positives = 302/493 (61%), Gaps = 22/493 (4%)
 Frame = -3

Query: 1559 NFSLEDKPRDEVRET-----------QHEFVTEFDPSKTITDRQPKHVIPRLENTWKPFK 1413
            +FSL  K + +V  T             EFVTEFDPSKT+ D  PK VIP +ENTW+P K
Sbjct: 4    SFSLPSKSKPKVTATADGNNAGDDGASKEFVTEFDPSKTLADSTPKFVIPPIENTWRPHK 63

Query: 1412 KMRNLDLPLQSSTDDPDLRFELEAP----PDTDSAMSYGLNLRSK--EDGKKSDDLPEDR 1251
            KM+NLDLPLQS      L FE E P       D+ ++YGLNLR K  ED     D   D 
Sbjct: 64   KMKNLDLPLQSGNTGSGLEFEPEVPLPGSERPDNNITYGLNLRQKVTEDESVGGDASGDG 123

Query: 1250 SGLPGSIENLMLQKFKEDMKNLPEDRGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAK 1071
                G  E LM+QK ++D++ L +D   ++F  VPVEG+GA L++ YGW  GKGIG+NAK
Sbjct: 124  KLSIG--EQLMVQKLRKDLQTLADDPTLEDFESVPVEGYGAALMAGYGWKPGKGIGKNAK 181

Query: 1070 EDVKVVQYVRRSGREGLGFEPQTNGTKQKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVER 891
            EDV++ +Y + + +EGLGF+P            R ++V  K    + +  V +D+K   R
Sbjct: 182  EDVEIKEYKKWTAKEGLGFDPD-----------RSKVVDVKA---KVKESVKLDKK--PR 225

Query: 890  ELKG---IHVGKIVRVVGGRHXXXXXXXXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXX 720
            ++ G     VGK VR+VGGR              D       V+K+S S           
Sbjct: 226  DMNGGDLFFVGKEVRIVGGRDIGLKGKIVEKLGSD-----FFVMKISGSEDEVKVGVDEV 280

Query: 719  XELGSVEEEKFLRKIRESK-DDRQKDGRRDSLSRVDREKRNGGKNDRKGNKEERRRGDE- 546
             +LGS EEEK L+K+++ + +D++KD +    SR             +G++ E R  ++ 
Sbjct: 281  ADLGSKEEEKCLKKLKDLQLNDKEKDKKVSKRSR----------GTERGSRTEVRVSEKV 330

Query: 545  GRHKEEERKKVVSWLRSHIRVRIISKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELV 366
             R +  E+K   SWLRSHI+VRI+SKD KGG+LYLKKG++VDVVGP  CDITMDE++ELV
Sbjct: 331  DRSETREKKAKPSWLRSHIKVRIVSKDMKGGRLYLKKGKIVDVVGPTICDITMDETQELV 390

Query: 365  QNVNQDMLETALPRRGGQVLVLYGEHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQ 186
            Q V+Q++LETALPRRGG VLVL G+HKGV+GNLVE+D++KE GVVRD D+H +L+VRL+Q
Sbjct: 391  QGVDQELLETALPRRGGPVLVLLGKHKGVYGNLVEKDLDKETGVVRDLDNHKMLDVRLDQ 450

Query: 185  IAEYIGDPSYIGY 147
            +AEY+GD   I Y
Sbjct: 451  VAEYMGDMDDIEY 463


>ref|XP_002893773.1| hypothetical protein ARALYDRAFT_890931 [Arabidopsis lyrata subsp.
            lyrata] gi|297339615|gb|EFH70032.1| hypothetical protein
            ARALYDRAFT_890931 [Arabidopsis lyrata subsp. lyrata]
          Length = 461

 Score =  375 bits (963), Expect = e-101
 Identities = 217/469 (46%), Positives = 295/469 (62%), Gaps = 12/469 (2%)
 Frame = -3

Query: 1517 TQHEFVTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEAP 1338
            T  EFVTEFDPSKT+++  PK+VIP +ENTW+P KKM+NLDLPLQS      L FE E P
Sbjct: 29   TSKEFVTEFDPSKTLSNSIPKYVIPPIENTWRPHKKMKNLDLPLQSGNTGSGLEFEPEVP 88

Query: 1337 ------PDTDSAMSYGLNLRSK-EDGKKSDDLPEDRSGLPGSIENLMLQKFKEDMKNLPE 1179
                  PD    ++YGLNLR K ++     D  EDR    G  E LMLQ  ++D+++L +
Sbjct: 89   LPGHERPDN---ITYGLNLRQKVKEDSIGGDAIEDRKVSMG--EQLMLQSLRKDLQSLAD 143

Query: 1178 DRGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGFEPQTN 999
            D   ++F  VPVEGFGA L++ YGW  GKGIG+NAKEDV++ +Y + + +EGLGF+P  +
Sbjct: 144  DPTLEDFESVPVEGFGAALMAGYGWKPGKGIGKNAKEDVEIKEYKKWTAKEGLGFDPDRS 203

Query: 998  ---GTKQKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIHVGKIVRVVGGRHXXX 828
                 K +G  S        G +G     VG + +++     G+  GKIV  +G      
Sbjct: 204  KVVDVKVRGKESVKLDKMGVGVNGGDVFFVGKEVRIIAGRDVGLK-GKIVEKLGSD---- 258

Query: 827  XXXXXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRKIRESK-DDRQ 651
                              V+K+S S            +LGS EEEK L+K+++ + +D++
Sbjct: 259  ----------------FFVMKISGSEEEVKVGVNEVADLGSKEEEKCLKKLKDLQLNDKE 302

Query: 650  KDGRRDSLSRVDREKRNGGKNDRKGNKEERRRGD-EGRHKEEERKKVVSWLRSHIRVRII 474
            KD          ++   GG+   +G++ E R  + + R +  ERK   SWLRS I+VRI+
Sbjct: 303  KD----------KKASRGGRGTERGSRSEVRVSEKQDRGQTRERKVKPSWLRSQIKVRIV 352

Query: 473  SKDFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLYG 294
            SK+ KGG+LYLKKG+VVDVVGP TCDITMDE++ELVQ V+Q++LETALPRRGG VLVL G
Sbjct: 353  SKELKGGRLYLKKGKVVDVVGPTTCDITMDETQELVQGVDQELLETALPRRGGPVLVLSG 412

Query: 293  EHKGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            +HKGV+GNLVE+D++KE GVVRD D+H +L+VRLEQ+AEY+GD   I Y
Sbjct: 413  KHKGVYGNLVEKDLDKETGVVRDLDNHKMLDVRLEQVAEYMGDMDDIEY 461


>ref|XP_006415079.1| hypothetical protein EUTSA_v10007601mg [Eutrema salsugineum]
            gi|557092850|gb|ESQ33432.1| hypothetical protein
            EUTSA_v10007601mg [Eutrema salsugineum]
          Length = 453

 Score =  373 bits (958), Expect = e-100
 Identities = 219/461 (47%), Positives = 292/461 (63%), Gaps = 7/461 (1%)
 Frame = -3

Query: 1508 EFVTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEAP--- 1338
            EFVTEFDPSKT+ D  PK+VIP +ENTW+P KKM+NLDLPLQS      L FE E P   
Sbjct: 32   EFVTEFDPSKTLADSTPKYVIPPIENTWRPHKKMKNLDLPLQSGNTGSGLEFEPEVPLGD 91

Query: 1337 -PDTDSAMSYGLNLRSK--EDGKKSDDLPEDRSGLPGSIENLMLQKFKEDMKNLPEDRGF 1167
               +DS ++YGLNLR K  ++G  SD+  EDR   P  +E LM Q  ++D+++L +D   
Sbjct: 92   SKGSDSNITYGLNLRQKVVKEGDASDET-EDRKLAP--VEQLMQQNLRKDLESLADDPTM 148

Query: 1166 DEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGFEPQTNGTKQ 987
            ++F  VPVEGFGA L++ YGW  GKGIG+NAK+DV++ +Y + + +EGLGF+P       
Sbjct: 149  EDFESVPVEGFGAALMAGYGWKPGKGIGKNAKDDVEIKEYKKWTAKEGLGFDPD------ 202

Query: 986  KGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIHVGKIVRVVGGRHXXXXXXXXXX 807
                 R ++V  K         V    KL         VGK VR+V GR           
Sbjct: 203  -----RSKVVDTKAK-------VKESGKLDINGGDVFFVGKEVRIVAGRDIGLKGKIVEK 250

Query: 806  XXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRKIRESK-DDRQKDGRRDS 630
               D     + VLKLS S            +LGS EEE+ L+K+++ + +D++KD +   
Sbjct: 251  LGKD-----LFVLKLSGSKDEVTVGVNEVADLGSKEEERCLKKLKDLQLNDKEKDKKASK 305

Query: 629  LSRVDREKRNGGKNDRKGNKEERRRGDEGRHKEEERKKVVSWLRSHIRVRIISKDFKGGK 450
             SR             +G+K E ++ + G+ +E   K   SWLRS I+VRI+SK+ KGG+
Sbjct: 306  RSR----------GTERGSKSEVKQ-ERGQTREWRVKP--SWLRSQIKVRIVSKELKGGR 352

Query: 449  LYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLYGEHKGVFGN 270
            LYLKKG+VVDVVGP TCDITMDE++ELVQ V+Q++LETALPRRGG VLVL G+HKGV+GN
Sbjct: 353  LYLKKGKVVDVVGPTTCDITMDETQELVQGVDQELLETALPRRGGPVLVLLGKHKGVYGN 412

Query: 269  LVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            LVE+D++KE GVVRD D+H +L+VRLEQ+AEY+GD   I Y
Sbjct: 413  LVEKDLDKETGVVRDLDNHKMLDVRLEQVAEYMGDMDDIEY 453


>dbj|BAJ34354.1| unnamed protein product [Thellungiella halophila]
          Length = 453

 Score =  373 bits (958), Expect = e-100
 Identities = 218/467 (46%), Positives = 293/467 (62%), Gaps = 13/467 (2%)
 Frame = -3

Query: 1508 EFVTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEAP--- 1338
            EFVTEFDPSKT+ D  PK+VIP +ENTW+P KKM+NLDLPLQS      L FE E P   
Sbjct: 32   EFVTEFDPSKTLADSTPKYVIPPIENTWRPHKKMKNLDLPLQSGNTGSGLEFEPEVPLGD 91

Query: 1337 -PDTDSAMSYGLNLRSK--EDGKKSDDLPEDRSGLPGSIENLMLQKFKEDMKNLPEDRGF 1167
               +DS ++YGLNLR K  ++G  SD+  EDR   P  +E LM Q  ++D+++L +D   
Sbjct: 92   SKGSDSNITYGLNLRQKVVKEGDASDET-EDRKLAP--VEQLMQQNLRKDLESLADDPTM 148

Query: 1166 DEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGFEPQTNGTKQ 987
            ++F  VPVEGFGA L++ YGW  GKGIG+NAK+DV++ +Y + + +EGLGF+P  +    
Sbjct: 149  EDFESVPVEGFGAALMAGYGWKPGKGIGKNAKDDVEIKEYKKWTAKEGLGFDPDRS---- 204

Query: 986  KGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIH------VGKIVRVVGGRHXXXX 825
                                 VV  + K+ E     I+      VGK VR+V GR     
Sbjct: 205  --------------------KVVDTEAKVKESGKLDINGGDVFFVGKEVRIVAGRDIGLK 244

Query: 824  XXXXXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRKIRESK-DDRQK 648
                     D     + VLKLS S            +LGS EEE+ L+K+++ + +D++K
Sbjct: 245  GKIVEKLGKD-----LFVLKLSGSKDEVTVGVNEVADLGSKEEERCLKKLKDLQLNDKEK 299

Query: 647  DGRRDSLSRVDREKRNGGKNDRKGNKEERRRGDEGRHKEEERKKVVSWLRSHIRVRIISK 468
            D +    SR             +G+K E ++ + G+ +E   K   SWLRS I+VRI+SK
Sbjct: 300  DKKASKRSR----------GTERGSKSEVKQ-ERGQTREWRVKP--SWLRSQIKVRIVSK 346

Query: 467  DFKGGKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLYGEH 288
            + KGG+LYLKKG+VVDVVGP TCDITMDE++ELVQ V+Q++LETALPRRGG VLVL G+H
Sbjct: 347  ELKGGRLYLKKGKVVDVVGPTTCDITMDETQELVQGVDQELLETALPRRGGPVLVLLGKH 406

Query: 287  KGVFGNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            KGV+GNLVE+D++KE GVVRD D+H +L+VRLEQ+AEY+GD   I Y
Sbjct: 407  KGVYGNLVEKDLDKETGVVRDLDNHKMLDVRLEQVAEYMGDMDDIEY 453


>ref|XP_003556598.1| PREDICTED: protein MOS2-like, partial [Glycine max]
          Length = 431

 Score =  367 bits (942), Expect = 1e-98
 Identities = 215/460 (46%), Positives = 281/460 (61%), Gaps = 8/460 (1%)
 Frame = -3

Query: 1502 VTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEAPPDT-D 1326
            +TEFDPSK      PK +IP ++N W+PFKKM+NL LP  ++ D   L FEL    D  +
Sbjct: 10   ITEFDPSKPAPTSAPKTLIPPIQNQWQPFKKMKNLHLP--TAADAESLAFELHTDGDQPE 67

Query: 1325 SAMSYGLNLRSKE--DGKKSDDLPEDRSGLPGSIENLMLQKFKEDMKNLPEDRGFDEFVD 1152
            S +SYGLN+R+ +  +G   DD           +E   LQK K D++ LPED+G +EF D
Sbjct: 68   SDISYGLNVRADKNPEGNNKDDSDGAAPRRRVPLEATALQKLKSDLERLPEDQGMEEFKD 127

Query: 1151 VPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGFEPQTNGTKQKGNNS 972
            V VEG+GA LL+ YGW EG GIGRNAKEDVKVV+  RR+ +EGLGF           NN 
Sbjct: 128  VAVEGYGAALLAGYGWKEGMGIGRNAKEDVKVVEIKRRTAKEGLGFVGDAPAALVLSNNE 187

Query: 971  RPQLVAPKGPDGRTRHVVGIDEKLVERELKGIHVGKIVRVVGGRHXXXXXXXXXXXXGDS 792
            +                   D K  E++       K+VR+VGGR              D 
Sbjct: 188  K-------------------DNKKKEKK------EKVVRIVGGRDAGLKGSVVSRIGDD- 221

Query: 791  EYPAMVVLKLSRSXXXXXXXXXXXXE--LGSVEEEKFLRKIRESKDDRQKDGRRDSLSRV 618
                 +VL+LSRS               LGS EEE+ LRK++E K  R+    +    R 
Sbjct: 222  ----YLVLELSRSGEKVKVKVKVGDVAELGSKEEERCLRKLKELKTQREDKVSKSKRGRD 277

Query: 617  DREKRNGGKNDRKGNKEERRRGDEGRHKEEER---KKVVSWLRSHIRVRIISKDFKGGKL 447
            + E++ G  N RK      +R D GR KEE R    + VSWL SHIRVR+IS+D KGG+L
Sbjct: 278  EVEEKRGDVNRRK-----EKRVDVGR-KEERRVVDHRKVSWLTSHIRVRVISRDLKGGRL 331

Query: 446  YLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLYGEHKGVFGNL 267
            YLKKGEV+DVVGP TCDI+MDE++E+VQ V+QD+LET +P+RGG VLVL G++KGV+G++
Sbjct: 332  YLKKGEVLDVVGPTTCDISMDENREIVQGVSQDVLETVIPKRGGPVLVLAGKYKGVYGSM 391

Query: 266  VERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
             ERD+++E  +VRDAD+H LLNV+LEQIAEYIGDPS +G+
Sbjct: 392  AERDLDQETAIVRDADTHELLNVKLEQIAEYIGDPSLLGH 431


>ref|XP_003529331.1| PREDICTED: protein MOS2-like [Glycine max]
          Length = 477

 Score =  363 bits (933), Expect = 1e-97
 Identities = 217/463 (46%), Positives = 282/463 (60%), Gaps = 11/463 (2%)
 Frame = -3

Query: 1502 VTEFDPSKTITDRQPKHVIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEAPPDT-D 1326
            +TEFDPSK      PK +IP ++N W+PFKKM+NL LP  ++ D   L FEL    D  +
Sbjct: 55   ITEFDPSKPAPTSVPKTLIPPIQNQWQPFKKMKNLHLP--TAADVESLAFELHTDGDQPE 112

Query: 1325 SAMSYGLNLRSKED----GKKSDDLPEDRSGLPGSIENLMLQKFKEDMKNLPEDRGFDEF 1158
            S +SYGLN+R+  +     K   D    R  +P  +E   LQK K D++ LPED+G +EF
Sbjct: 113  SDISYGLNVRADNNPEGNNKDDSDAAAPRRRVP--LEATALQKLKSDLERLPEDQGMEEF 170

Query: 1157 VDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGFEPQTNGTKQKGN 978
             DV VEG+GA LL+ YGW EG GIGRNAKEDVKVV+  RR+ +EGLGF           N
Sbjct: 171  KDVAVEGYGAALLAGYGWKEGMGIGRNAKEDVKVVEIKRRTAKEGLGFVGDAPAALVLSN 230

Query: 977  NSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIHVGKIVRVVGGRHXXXXXXXXXXXXG 798
            N +                   D K  E++       K+VR+VGGR              
Sbjct: 231  NEK-------------------DNKKKEKK------EKVVRIVGGRDSGLKGSVVSRIGD 265

Query: 797  DSEYPAMVVLKLSRSXXXXXXXXXXXXE--LGSVEEEKFLRKIRESKDDRQKDG-RRDSL 627
            D      +VL+LSRS               LGS EEE+ LRK++E K   ++D   +   
Sbjct: 266  D-----YLVLELSRSGEKVKVKVKVGDVAELGSKEEERCLRKLKELKTQSEEDKVSKSKR 320

Query: 626  SRVDREKRNGGKNDRKGNKEERRRGDEGRHKEEER---KKVVSWLRSHIRVRIISKDFKG 456
             R + E++ G  N RK      +R D GR KEE R    + VSWL SHIRVR+IS+D KG
Sbjct: 321  GRDEVEEKRGDLNRRK-----EKRVDVGR-KEERRVVDHRKVSWLTSHIRVRVISRDLKG 374

Query: 455  GKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLYGEHKGVF 276
            G+LYLKKGEV+DVVGP TCDI+MDE++E+VQ V+QD+LET +P+RGG VLVL G++KGV+
Sbjct: 375  GRLYLKKGEVLDVVGPTTCDISMDENREIVQGVSQDVLETVIPKRGGPVLVLAGKYKGVY 434

Query: 275  GNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            G+L ERD ++E  +VRDAD+H LLNV+LEQIAEYIGDPS +G+
Sbjct: 435  GSLAERDFDRETAIVRDADTHELLNVKLEQIAEYIGDPSLLGH 477


>ref|XP_006368274.1| KOW domain-containing family protein [Populus trichocarpa]
            gi|550346178|gb|ERP64843.1| KOW domain-containing family
            protein [Populus trichocarpa]
          Length = 455

 Score =  362 bits (929), Expect = 3e-97
 Identities = 213/456 (46%), Positives = 288/456 (63%), Gaps = 2/456 (0%)
 Frame = -3

Query: 1508 EFVTEFDPSKTI-TDRQPKHVIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELEA-PP 1335
            ++VTEFDP+KT+ + R P  +I  ++N ++P KK++N+DL L       DLRFEL+   P
Sbjct: 34   QYVTEFDPTKTLQSTRTP--IIQPIQNEYQPHKKLKNIDLLLHPDPST-DLRFELQTLSP 90

Query: 1334 DTDSAMSYGLNLRSKEDGKKSDDLPEDRSGLPGSIENLMLQKFKEDMKNLPEDRGFDEFV 1155
            D    MS+GLNLR  +    +  L ++       +E+ ML+K + D+K LPEDRGF+EF 
Sbjct: 91   DPPDPMSFGLNLR--QPTATATSLTKE-----ARVEDEMLEKLRYDLKRLPEDRGFEEFE 143

Query: 1154 DVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGFEPQTNGTKQKGNN 975
            ++PVE F   LL  YGW EG+G+G+NAKEDVK+ QY +R+ +EGLGF   +  +K    N
Sbjct: 144  EMPVEDFAKALLKGYGWHEGRGVGKNAKEDVKIKQYTKRTDKEGLGFFSASLDSKNSNKN 203

Query: 974  SRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIHVGKIVRVVGGRHXXXXXXXXXXXXGD 795
            S          DG       + EK  E+   G  VGK VRV  G+               
Sbjct: 204  S-------SNGDGSG----SVKEKESEKNKDGFSVGKEVRVFFGKKENLGLKGTIVDRLG 252

Query: 794  SEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRKIRESKDDRQKDGRRDSLSRVD 615
            S+    ++L++ +S            ELGS EEE+ L+++++ K   +K       S  D
Sbjct: 253  SD---SIILRVEKSGESVKVRVSDVAELGSGEEERCLKELKDLKIKEEKKS-----SDGD 304

Query: 614  REKRNGGKNDRKGNKEERRRGDEGRHKEEERKKVVSWLRSHIRVRIISKDFKGGKLYLKK 435
            RE+R   K   + ++E    G+ G  KE    + V WLRSHIRVRIISKD KGGKLYLKK
Sbjct: 305  REQRPVNKRSVE-SRESLIIGNGGIVKE----RGVQWLRSHIRVRIISKDLKGGKLYLKK 359

Query: 434  GEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLYGEHKGVFGNLVERD 255
            GEVVDVVGP  CD++MDES+ELVQ+V+QD+LE ALPRRGG VLVLYG+H+G +GNLV+RD
Sbjct: 360  GEVVDVVGPYKCDVSMDESRELVQSVDQDLLENALPRRGGPVLVLYGKHRGAYGNLVQRD 419

Query: 254  VEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            +++E+GVV+D  SH LLNV+LEQIAEY+GDPSYIGY
Sbjct: 420  LDREVGVVQDYGSHELLNVKLEQIAEYVGDPSYIGY 455


>ref|XP_002304388.1| KOW domain-containing family protein [Populus trichocarpa]
            gi|222841820|gb|EEE79367.1| KOW domain-containing family
            protein [Populus trichocarpa]
          Length = 436

 Score =  359 bits (922), Expect = 2e-96
 Identities = 202/463 (43%), Positives = 281/463 (60%), Gaps = 5/463 (1%)
 Frame = -3

Query: 1520 ETQHEFVTEFDPSKTITDRQPKH-VIPRLENTWKPFKKMRNLDLPLQSSTDDPDLRFELE 1344
            +   +++TEFDPSK +  +  +  +I  + N ++P KKM+N+ LPL       DLRFE+E
Sbjct: 26   DNSKQYLTEFDPSKNLLPQNTQTPIILPIPNDYQPHKKMKNIHLPLHQDDSSTDLRFEVE 85

Query: 1343 A----PPDTDSAMSYGLNLRSKEDGKKSDDLPEDRSGLPGSIENLMLQKFKEDMKNLPED 1176
                 P     ++S+GLNLR     +  D   ED          ++L+K + D+K LPED
Sbjct: 86   TLSSDPAAASDSISFGLNLRQSATTQTQDARSED----------VLLEKLRYDLKRLPED 135

Query: 1175 RGFDEFVDVPVEGFGARLLSSYGWSEGKGIGRNAKEDVKVVQYVRRSGREGLGFEPQTNG 996
            RGF+EF ++PVE F   LL  YGW EG+G+G+N+KEDV+V QY +R+ +EGLGF   ++ 
Sbjct: 136  RGFEEFEEMPVEDFAKALLKGYGWHEGRGVGKNSKEDVQVKQYTKRTDKEGLGFLAASHD 195

Query: 995  TKQKGNNSRPQLVAPKGPDGRTRHVVGIDEKLVERELKGIHVGKIVRVVGGRHXXXXXXX 816
            +K K                          K  ER   G+ +GK VRV+ G+        
Sbjct: 196  SKNK--------------------------KQRERSKDGLFLGKEVRVISGKKENLGLKG 229

Query: 815  XXXXXGDSEYPAMVVLKLSRSXXXXXXXXXXXXELGSVEEEKFLRKIRESKDDRQKDGRR 636
                   S+    + L++ +S            ELGS EEE+ L++++  ++ +  DG  
Sbjct: 230  TVVERLGSD---SIALRVEKSGERVKVRVSDVAELGSREEERCLKELKSIEEKKPSDG-- 284

Query: 635  DSLSRVDREKRNGGKNDRKGNKEERRRGDEGRHKEEERKKVVSWLRSHIRVRIISKDFKG 456
                  DRE+R   K + + +++  + G+    KE    + V WLRSHIRVRIISKD KG
Sbjct: 285  ------DREQRRVNKRNVE-SRDSLKMGNGNVGKE----RGVQWLRSHIRVRIISKDLKG 333

Query: 455  GKLYLKKGEVVDVVGPATCDITMDESKELVQNVNQDMLETALPRRGGQVLVLYGEHKGVF 276
            GKLYLKKGEVVDVVGP  CDI+MDES+ELVQ+V+QD LETALPRRGG VLVLYG+HKG +
Sbjct: 334  GKLYLKKGEVVDVVGPYKCDISMDESRELVQSVDQDALETALPRRGGPVLVLYGKHKGAY 393

Query: 275  GNLVERDVEKEIGVVRDADSHALLNVRLEQIAEYIGDPSYIGY 147
            GNLV+RD+++E+GVV+D+ SH LL+V+LEQIAEY+GDP YIGY
Sbjct: 394  GNLVQRDIDREVGVVQDSGSHELLDVKLEQIAEYVGDPGYIGY 436


Top