BLASTX nr result

ID: Rehmannia22_contig00018073 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00018073
         (1288 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS70759.1| hypothetical protein M569_04002, partial [Genlise...   255   3e-65
gb|EOY06252.1| MOS2, putative isoform 1 [Theobroma cacao] gi|508...   244   4e-62
gb|EXC18489.1| Protein MOS2 [Morus notabilis]                         239   1e-60
ref|XP_004246061.1| PREDICTED: protein MOS2-like isoform 1 [Sola...   237   7e-60
ref|XP_006355237.1| PREDICTED: protein MOS2-like [Solanum tubero...   237   9e-60
ref|XP_004169661.1| PREDICTED: protein MOS2-like [Cucumis sativus]    237   9e-60
ref|XP_004144463.1| PREDICTED: protein MOS2-like [Cucumis sativus]    237   9e-60
ref|XP_002326591.1| predicted protein [Populus trichocarpa] gi|5...   233   1e-58
ref|XP_002522998.1| Protein MOS2, putative [Ricinus communis] gi...   226   1e-56
ref|XP_004498120.1| PREDICTED: protein MOS2-like isoform X1 [Cic...   226   2e-56
ref|XP_006307443.1| hypothetical protein CARUB_v10009066mg [Caps...   224   5e-56
ref|XP_002304388.1| KOW domain-containing family protein [Populu...   223   1e-55
gb|ESW25063.1| hypothetical protein PHAVU_003G004000g [Phaseolus...   223   2e-55
ref|XP_003529331.1| PREDICTED: protein MOS2-like [Glycine max]        218   3e-54
ref|XP_003556598.1| PREDICTED: protein MOS2-like, partial [Glyci...   218   6e-54
ref|XP_002893773.1| hypothetical protein ARALYDRAFT_890931 [Arab...   216   1e-53
ref|NP_174617.1| protein MOS2 [Arabidopsis thaliana] gi|75169419...   216   2e-53
gb|ESW25327.1| hypothetical protein PHAVU_003G026500g [Phaseolus...   216   2e-53
ref|XP_006415079.1| hypothetical protein EUTSA_v10007601mg [Eutr...   215   3e-53
dbj|BAJ34354.1| unnamed protein product [Thellungiella halophila]     215   3e-53

>gb|EPS70759.1| hypothetical protein M569_04002, partial [Genlisea aurea]
          Length = 430

 Score =  255 bits (651), Expect = 3e-65
 Identities = 143/263 (54%), Positives = 176/263 (66%)
 Frame = -2

Query: 1287 GRGGLGFTGEFPETRIDSNGNSGENIGNANGRKKVEDRKENGSFGVGKEVRIVNGRDMGM 1108
            GRGGLGFT E  E  + ++   G+ +  A    +  +++E  SF VGK+VRIVNG  MGM
Sbjct: 181  GRGGLGFTEEPLENAVKTDARLGDKL--AAVAVEPVNQEEGKSFSVGKKVRIVNGSKMGM 238

Query: 1107 KGKIVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXXXXXX 928
            KG IVE+R G D+ ++R S SNEKVKV+  DV E+GS +E++C                 
Sbjct: 239  KGTIVEMRKG-DIFVIRTSDSNEKVKVQSIDVAEIGSIKEEQCMKKLKELKIKEEKDD-- 295

Query: 927  XXXXXLSRKTRQEEEKEVSERVNWLRNHIRVKIISEELKGGRLFLKKXXXXXXXXXGMCD 748
                    K   +  K  S RV WLRNHIRV+IIS+ELK GRLFLKK         G+CD
Sbjct: 296  --------KKDDDPNKARSVRVKWLRNHIRVRIISKELKKGRLFLKKGVVVDVVGPGLCD 347

Query: 747  ISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSLLERDSEKEMGVVRDADT 568
            I +DESREL+Q V+QE LETALPKRGGPVLVLYG++K VYGSL+ERD EKE G V+DADT
Sbjct: 348  ILMDESRELIQDVEQEFLETALPKRGGPVLVLYGKYKDVYGSLVERDLEKERGTVQDADT 407

Query: 567  HELLNVSLDQIAEYIGDPSDIGY 499
             ELL+V L+QIAEY GDPS+IGY
Sbjct: 408  RELLSVKLEQIAEYTGDPSEIGY 430


>gb|EOY06252.1| MOS2, putative isoform 1 [Theobroma cacao]
            gi|508714356|gb|EOY06253.1| MOS2, putative isoform 1
            [Theobroma cacao]
          Length = 465

 Score =  244 bits (624), Expect = 4e-62
 Identities = 132/273 (48%), Positives = 178/273 (65%), Gaps = 13/273 (4%)
 Frame = -2

Query: 1278 GLGFTGEFPETRIDSNGNSGENIGNANGRKKVEDRKENGSFGVGKEVRIVNGRDMGMKGK 1099
            GLGF+ +  + R+        N+   +  +++    ++G F VGK+VR++ GR+MG+KG 
Sbjct: 199  GLGFSSKENKERLPGF----TNVKQKHDTEEIVKEDKDGFF-VGKDVRVIEGREMGLKGT 253

Query: 1098 IVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXXXXXXXXX 919
            I+E + GG  ++LR+ +S EKVKVR  ++ ++GS EE+KC                    
Sbjct: 254  IME-KLGGGWIVLRLKKSEEKVKVRLFEIADLGSREEEKCLRKLTELKIREAKDLKTKGD 312

Query: 918  XXLSRKTRQEEEKEVSERVN-------------WLRNHIRVKIISEELKGGRLFLKKXXX 778
                 K  +E EK    +VN             WLR+HIRV+IIS+ L+GGRL+LKK   
Sbjct: 313  ERKVSKRSRESEKRSETKVNVERVRTNGDRGVSWLRSHIRVRIISKNLEGGRLYLKKGQV 372

Query: 777  XXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSLLERDSEK 598
                   MCDIS+DESREL+QGV+QELLETALP+RGGPVL+LYGRHKGVYGSL+ERD ++
Sbjct: 373  VDVVGPYMCDISMDESRELIQGVEQELLETALPRRGGPVLILYGRHKGVYGSLVERDVDR 432

Query: 597  EMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            E GVVRDAD+HELLNV L+QIAEY+GDPS +GY
Sbjct: 433  ETGVVRDADSHELLNVKLEQIAEYMGDPSYLGY 465


>gb|EXC18489.1| Protein MOS2 [Morus notabilis]
          Length = 476

 Score =  239 bits (611), Expect = 1e-60
 Identities = 130/270 (48%), Positives = 173/270 (64%), Gaps = 7/270 (2%)
 Frame = -2

Query: 1287 GRGGLGFT----GEFPETRIDSNGNS--GENIGNANGRKKVEDRKENGSFGVGKEVRIVN 1126
            G+ GLGF        P +  DS  NS       N N        KE+    +GKEVRIV 
Sbjct: 210  GKQGLGFVMTDLPPLPNSNRDSLNNSIPKPKDNNNNNNNNSSSNKESL---IGKEVRIVR 266

Query: 1125 GRDMGMKGKIVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXX 946
            GR++G+KG+++E  S  + L++R+SRS E VKV   DV E+GS E++ C           
Sbjct: 267  GRELGLKGRVLEKLSDDNRLVVRLSRSQETVKVNIQDVAELGSEEDEACLKRLKELRIRE 326

Query: 945  XXXXXXXXXXXLSRKTRQEE-EKEVSERVNWLRNHIRVKIISEELKGGRLFLKKXXXXXX 769
                          K+R  + EK+   R +WLR+HIRV+IIS ELKGGRL+LKK      
Sbjct: 327  EEEKKEKKSKRRENKSRDSDGEKQQPPRKSWLRSHIRVRIISRELKGGRLYLKKGEVVDV 386

Query: 768  XXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSLLERDSEKEMG 589
                +CD+S+D+ REL+QGV Q++LE+ALP+RGGPVLVL+G+H+GVYGSL+ERD ++E G
Sbjct: 387  VGPKVCDVSMDDGRELIQGVSQDVLESALPRRGGPVLVLFGKHEGVYGSLVERDLDRETG 446

Query: 588  VVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            VVRDADTH+L+NV L+QIAEYIGDPS +GY
Sbjct: 447  VVRDADTHDLINVRLEQIAEYIGDPSYLGY 476


>ref|XP_004246061.1| PREDICTED: protein MOS2-like isoform 1 [Solanum lycopersicum]
            gi|460401091|ref|XP_004246062.1| PREDICTED: protein
            MOS2-like isoform 2 [Solanum lycopersicum]
          Length = 485

 Score =  237 bits (605), Expect = 7e-60
 Identities = 130/282 (46%), Positives = 180/282 (63%), Gaps = 22/282 (7%)
 Frame = -2

Query: 1278 GLGFTGEFPETR---------IDSNGNSGENIGNANGRKKVEDRKENGS-FGVGKEVRIV 1129
            G+GF  E P+           I   G  G  + +++G  +  DR++ G    VGK+VR+V
Sbjct: 206  GIGFIPEVPKPSSKAEGGVKPIKKKGEEGIKVDHSDGYIEKIDREKGGKGLYVGKKVRVV 265

Query: 1128 NGRDMGMKGKIVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXX 949
             G++MGMKG+++EV S G+L+IL++  ++++VK++  D+ E+GS EE++C          
Sbjct: 266  RGKEMGMKGEVLEVNSRGELVILKL--ADKEVKLQARDLAELGSVEEERCLKKLLELKIR 323

Query: 948  XXXXXXXXXXXXLS------------RKTRQEEEKEVSERVNWLRNHIRVKIISEELKGG 805
                         S            +K  +    E S++V+WL +HIRV+IIS++LK G
Sbjct: 324  EEKSHLDGVRKQSSGSRSRDEATTERKKESRRSRDERSDKVSWLASHIRVRIISKDLKRG 383

Query: 804  RLFLKKXXXXXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYG 625
            RL+LKK           CDI +DE+REL+QGVDQELLETALPKRGGPVLVLYGR+KGVYG
Sbjct: 384  RLYLKKGEIMDVVGPMSCDICMDETRELIQGVDQELLETALPKRGGPVLVLYGRNKGVYG 443

Query: 624  SLLERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
             L+E+DSEKE GV+RD DT +LL V L+QIAEY+GDPSDIGY
Sbjct: 444  HLVEKDSEKETGVIRDGDTKDLLKVRLEQIAEYLGDPSDIGY 485


>ref|XP_006355237.1| PREDICTED: protein MOS2-like [Solanum tuberosum]
          Length = 484

 Score =  237 bits (604), Expect = 9e-60
 Identities = 129/281 (45%), Positives = 177/281 (62%), Gaps = 21/281 (7%)
 Frame = -2

Query: 1278 GLGFTGEFPETRIDSNG--------NSGENIGNANGR-KKVEDRKENGSFGVGKEVRIVN 1126
            G+GF  E P+      G          G  + +++G  +K++  K      VGK+VR+V 
Sbjct: 206  GIGFIPEVPKPSSKGEGAVKSIKKSEDGVKVDHSDGNIEKIDREKAGNGLYVGKKVRVVR 265

Query: 1125 GRDMGMKGKIVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKC----------- 979
            G++MGMKG+I+EV S GDL+IL++  ++++VK++  D+ E+GS EE++C           
Sbjct: 266  GKEMGMKGEILEVNSSGDLVILKL--ADKEVKLQARDLAELGSVEEERCLKKLLELKIRE 323

Query: 978  -XXXXXXXXXXXXXXXXXXXXXXLSRKTRQEEEKEVSERVNWLRNHIRVKIISEELKGGR 802
                                    S+K  +    E S++V+WL +HIRV+IIS++LK GR
Sbjct: 324  EKSNLDGVRKQSSGGRSRDEATTESKKESRRSRDERSDKVSWLASHIRVRIISKDLKKGR 383

Query: 801  LFLKKXXXXXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGS 622
            L+LKK           CDI +DE+REL+QGVDQELLETALPKRGGPVLVLYGR+KGVYG 
Sbjct: 384  LYLKKGEIMDVVGPTSCDICMDETRELIQGVDQELLETALPKRGGPVLVLYGRNKGVYGH 443

Query: 621  LLERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            L+E+DSEKE G++RD DT ELL V L+QIAEY+GDPS IGY
Sbjct: 444  LVEKDSEKETGIIRDGDTKELLKVRLEQIAEYLGDPSYIGY 484


>ref|XP_004169661.1| PREDICTED: protein MOS2-like [Cucumis sativus]
          Length = 478

 Score =  237 bits (604), Expect = 9e-60
 Identities = 137/280 (48%), Positives = 176/280 (62%), Gaps = 20/280 (7%)
 Frame = -2

Query: 1278 GLGFTGEFP----ETRIDSNGNSGENIGNANGR-KKVEDRKENGSFGVGKEVRIVNGRDM 1114
            GLGF  + P    +   + +G          GR K+  DR+ +G   +GK VRIV GRD 
Sbjct: 209  GLGFVSDVPVGISKKEEEKDGGRERERKRDEGRVKENRDRESDGLASIGKHVRIVRGRDA 268

Query: 1113 GMKGKIVEVRSGGDLLILRISRSNE--KVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXX 940
            G+KG+++E +   D L+L++S+ +E  K+KVR TD+ E+GS EE+K              
Sbjct: 269  GLKGRVLE-KLDSDWLVLKLSKRDEHVKLKVRATDIAELGSKEEEK---------FLKKL 318

Query: 939  XXXXXXXXXLSRKTRQEEEKEVSERVN-------------WLRNHIRVKIISEELKGGRL 799
                       +K R+E E+ V +R N             WL +HIRV+IIS+E KGG+ 
Sbjct: 319  EELKVKNENTGQKRRREVEQVVEKRENGSRDKEKRTGRLSWLTSHIRVRIISKEFKGGKF 378

Query: 798  FLKKXXXXXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSL 619
            +LKK          +CDISID SRELVQGV QELLETALP+RGGPVLVLYG+HKGVYGSL
Sbjct: 379  YLKKGEIVDVVGPSICDISIDGSRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL 438

Query: 618  LERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            +ERD +KE GVVRDAD+HELLNV L+QIAEYIGDPS +GY
Sbjct: 439  VERDLDKETGVVRDADSHELLNVRLEQIAEYIGDPSYLGY 478


>ref|XP_004144463.1| PREDICTED: protein MOS2-like [Cucumis sativus]
          Length = 500

 Score =  237 bits (604), Expect = 9e-60
 Identities = 137/280 (48%), Positives = 176/280 (62%), Gaps = 20/280 (7%)
 Frame = -2

Query: 1278 GLGFTGEFP----ETRIDSNGNSGENIGNANGR-KKVEDRKENGSFGVGKEVRIVNGRDM 1114
            GLGF  + P    +   + +G          GR K+  DR+ +G   +GK VRIV GRD 
Sbjct: 231  GLGFVSDVPVGISKKEEEKDGGRERERKRDEGRVKENRDRESDGLASIGKHVRIVRGRDA 290

Query: 1113 GMKGKIVEVRSGGDLLILRISRSNE--KVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXX 940
            G+KG+++E +   D L+L++S+ +E  K+KVR TD+ E+GS EE+K              
Sbjct: 291  GLKGRVLE-KLDSDWLVLKLSKRDEHVKLKVRATDIAELGSKEEEK---------FLKKL 340

Query: 939  XXXXXXXXXLSRKTRQEEEKEVSERVN-------------WLRNHIRVKIISEELKGGRL 799
                       +K R+E E+ V +R N             WL +HIRV+IIS+E KGG+ 
Sbjct: 341  EELKVKNENTGQKRRREVEQVVEKRENGSRDKEKRTGRLSWLTSHIRVRIISKEFKGGKF 400

Query: 798  FLKKXXXXXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSL 619
            +LKK          +CDISID SRELVQGV QELLETALP+RGGPVLVLYG+HKGVYGSL
Sbjct: 401  YLKKGEIVDVVGPSICDISIDGSRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSL 460

Query: 618  LERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            +ERD +KE GVVRDAD+HELLNV L+QIAEYIGDPS +GY
Sbjct: 461  VERDLDKETGVVRDADSHELLNVRLEQIAEYIGDPSYLGY 500


>ref|XP_002326591.1| predicted protein [Populus trichocarpa]
            gi|566146521|ref|XP_006368274.1| KOW domain-containing
            family protein [Populus trichocarpa]
            gi|550346178|gb|ERP64843.1| KOW domain-containing family
            protein [Populus trichocarpa]
          Length = 455

 Score =  233 bits (595), Expect = 1e-58
 Identities = 130/272 (47%), Positives = 178/272 (65%), Gaps = 12/272 (4%)
 Frame = -2

Query: 1278 GLGFTGEFPETRIDSNGNSGENIGNANGRKKVEDRKENGSFGVGKEVRIVNGR--DMGMK 1105
            GLGF     +++ +SN NS    G+ + ++K  ++ ++G F VGKEVR+  G+  ++G+K
Sbjct: 187  GLGFFSASLDSK-NSNKNSSNGDGSGSVKEKESEKNKDG-FSVGKEVRVFFGKKENLGLK 244

Query: 1104 GKIVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXXXXXXX 925
            G IV+ R G D +ILR+ +S E VKVR +DV E+GS EE++C                  
Sbjct: 245  GTIVD-RLGSDSIILRVEKSGESVKVRVSDVAELGSGEEERCLKELKDLKIKEEKKSSDG 303

Query: 924  XXXXLSRKTRQEEEKE---------VSER-VNWLRNHIRVKIISEELKGGRLFLKKXXXX 775
                     R  E +E         V ER V WLR+HIRV+IIS++LKGG+L+LKK    
Sbjct: 304  DREQRPVNKRSVESRESLIIGNGGIVKERGVQWLRSHIRVRIISKDLKGGKLYLKKGEVV 363

Query: 774  XXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSLLERDSEKE 595
                   CD+S+DESRELVQ VDQ+LLE ALP+RGGPVLVLYG+H+G YG+L++RD ++E
Sbjct: 364  DVVGPYKCDVSMDESRELVQSVDQDLLENALPRRGGPVLVLYGKHRGAYGNLVQRDLDRE 423

Query: 594  MGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            +GVV+D  +HELLNV L+QIAEY+GDPS IGY
Sbjct: 424  VGVVQDYGSHELLNVKLEQIAEYVGDPSYIGY 455


>ref|XP_002522998.1| Protein MOS2, putative [Ricinus communis] gi|223537810|gb|EEF39428.1|
            Protein MOS2, putative [Ricinus communis]
          Length = 479

 Score =  226 bits (577), Expect = 1e-56
 Identities = 129/284 (45%), Positives = 181/284 (63%), Gaps = 24/284 (8%)
 Frame = -2

Query: 1278 GLGFTGEFPETR-------IDSNGNSGENIGNA----NG---RKKVEDRKENGS-FGVGK 1144
            GLGF      +        + ++ NS  NI N     NG   RK+  D   NG  F VGK
Sbjct: 197  GLGFVASVVSSNNVKNRDTVQNDFNSVSNINNVKHIDNGQKERKRERDGINNGDGFFVGK 256

Query: 1143 EVRIV-NGRDM-GMKGKIVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXX 970
            +VR++  GR++ G+KG+I+E R   D +IL+I+ SN++VK+R +D+ ++GS EEDKC   
Sbjct: 257  DVRVIAGGREIYGLKGRILE-RLNADWVILKIAESNDEVKLRVSDIADLGSKEEDKCLRK 315

Query: 969  XXXXXXXXXXXXXXXXXXXLSRKTRQEEE-------KEVSERVNWLRNHIRVKIISEELK 811
                               ++  +++  E       +   E++ WLR+HIRV++IS++LK
Sbjct: 316  LKALQLEDKKSKDRDNGKGVTELSKERRESVRRDGGQVKDEKMRWLRDHIRVRVISKDLK 375

Query: 810  GGRLFLKKXXXXXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGV 631
            GGR +LKK          +CDIS+DE++ELVQGVDQ+LLETALP+RGGPVLVLYG+HKG 
Sbjct: 376  GGRFYLKKGEVVDVVGPYVCDISMDETKELVQGVDQDLLETALPRRGGPVLVLYGKHKGA 435

Query: 630  YGSLLERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            YG+L+E+D ++E GVV+D DT E LNV L+QIAEY+GDPS IGY
Sbjct: 436  YGNLVEKDLDRETGVVQDFDTREFLNVKLEQIAEYVGDPSYIGY 479


>ref|XP_004498120.1| PREDICTED: protein MOS2-like isoform X1 [Cicer arietinum]
            gi|502123466|ref|XP_004498121.1| PREDICTED: protein
            MOS2-like isoform X2 [Cicer arietinum]
          Length = 460

 Score =  226 bits (575), Expect = 2e-56
 Identities = 127/272 (46%), Positives = 170/272 (62%), Gaps = 12/272 (4%)
 Frame = -2

Query: 1278 GLGFTGEFPETRIDSNGNSGENIGNANGRKKVEDRKENGSFGVGKEVRIVNGRDMGMKGK 1099
            GLGF  + P      +          NG+K+ E RK+         VRIV GRD+G+K  
Sbjct: 207  GLGFVADVPPPTSKKS--------EMNGKKESEKRKKEERI-----VRIVRGRDVGLKAS 253

Query: 1098 IVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXXXXXXXXX 919
            +V+ R G D LIL++ RS E+VKV+  DV E+GS EED+C                    
Sbjct: 254  VVD-RFGDDFLILKVLRSGEEVKVKIEDVAELGSKEEDRC----LRKLQDSKTRGREEEN 308

Query: 918  XXLSRKTRQE-EEKEVS-----------ERVNWLRNHIRVKIISEELKGGRLFLKKXXXX 775
               S++ R E EE+ V+           ++++WL +HIRV++IS   K GRL+LKK    
Sbjct: 309  GSRSKRGRDEVEERRVNGNGGGREEKGKKQISWLTSHIRVRVISRSFKAGRLYLKKGEVL 368

Query: 774  XXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSLLERDSEKE 595
                   CDIS+DESRE++QGV Q++LETA+PKRGGPVLVLYG+HKGV+GSL+ERD ++E
Sbjct: 369  DVIGPTTCDISLDESREIIQGVSQDMLETAIPKRGGPVLVLYGKHKGVFGSLVERDLDRE 428

Query: 594  MGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            +GVVRDADTHELLNV L+ +AEYIGDPS +G+
Sbjct: 429  IGVVRDADTHELLNVKLEHMAEYIGDPSLLGH 460


>ref|XP_006307443.1| hypothetical protein CARUB_v10009066mg [Capsella rubella]
            gi|482576154|gb|EOA40341.1| hypothetical protein
            CARUB_v10009066mg [Capsella rubella]
          Length = 463

 Score =  224 bits (572), Expect = 5e-56
 Identities = 119/243 (48%), Positives = 158/243 (65%), Gaps = 12/243 (4%)
 Frame = -2

Query: 1191 KKVEDRKENGSFGVGKEVRIVNGRDMGMKGKIVEVRSGGDLLILRISRSNEKVKVRPTDV 1012
            KK  D      F VGKEVRIV GRD+G+KGKIVE + G D  +++IS S ++VKV   +V
Sbjct: 222  KKPRDMNGGDLFFVGKEVRIVGGRDIGLKGKIVE-KLGSDFFVMKISGSEDEVKVGVDEV 280

Query: 1011 VEVGSAEEDKCXXXXXXXXXXXXXXXXXXXXXXLSRKTRQEEEKEVSERVN--------- 859
             ++GS EE+KC                         +     E  VSE+V+         
Sbjct: 281  ADLGSKEEEKCLKKLKDLQLNDKEKDKKVSKRSRGTERGSRTEVRVSEKVDRSETREKKA 340

Query: 858  ---WLRNHIRVKIISEELKGGRLFLKKXXXXXXXXXGMCDISIDESRELVQGVDQELLET 688
               WLR+HI+V+I+S+++KGGRL+LKK          +CDI++DE++ELVQGVDQELLET
Sbjct: 341  KPSWLRSHIKVRIVSKDMKGGRLYLKKGKIVDVVGPTICDITMDETQELVQGVDQELLET 400

Query: 687  ALPKRGGPVLVLYGRHKGVYGSLLERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSD 508
            ALP+RGGPVLVL G+HKGVYG+L+E+D +KE GVVRD D H++L+V LDQ+AEY+GD  D
Sbjct: 401  ALPRRGGPVLVLLGKHKGVYGNLVEKDLDKETGVVRDLDNHKMLDVRLDQVAEYMGDMDD 460

Query: 507  IGY 499
            I Y
Sbjct: 461  IEY 463


>ref|XP_002304388.1| KOW domain-containing family protein [Populus trichocarpa]
            gi|222841820|gb|EEE79367.1| KOW domain-containing family
            protein [Populus trichocarpa]
          Length = 436

 Score =  223 bits (569), Expect = 1e-55
 Identities = 121/258 (46%), Positives = 168/258 (65%), Gaps = 9/258 (3%)
 Frame = -2

Query: 1245 RIDSNGNSGENIGNANGRKKVEDRKENGSFGVGKEVRIVNGR--DMGMKGKIVEVRSGGD 1072
            R D  G       + +  KK  +R ++G F +GKEVR+++G+  ++G+KG +VE R G D
Sbjct: 181  RTDKEGLGFLAASHDSKNKKQRERSKDGLF-LGKEVRVISGKKENLGLKGTVVE-RLGSD 238

Query: 1071 LLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXXXXXXXXXXXLSRKTRQ 892
             + LR+ +S E+VKVR +DV E+GS EE++C                        R    
Sbjct: 239  SIALRVEKSGERVKVRVSDVAELGSREEERCLKELKSIEEKKPSDGDREQRRVNKRNVES 298

Query: 891  EEEKEVS------ER-VNWLRNHIRVKIISEELKGGRLFLKKXXXXXXXXXGMCDISIDE 733
             +  ++       ER V WLR+HIRV+IIS++LKGG+L+LKK           CDIS+DE
Sbjct: 299  RDSLKMGNGNVGKERGVQWLRSHIRVRIISKDLKGGKLYLKKGEVVDVVGPYKCDISMDE 358

Query: 732  SRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSLLERDSEKEMGVVRDADTHELLN 553
            SRELVQ VDQ+ LETALP+RGGPVLVLYG+HKG YG+L++RD ++E+GVV+D+ +HELL+
Sbjct: 359  SRELVQSVDQDALETALPRRGGPVLVLYGKHKGAYGNLVQRDIDREVGVVQDSGSHELLD 418

Query: 552  VSLDQIAEYIGDPSDIGY 499
            V L+QIAEY+GDP  IGY
Sbjct: 419  VKLEQIAEYVGDPGYIGY 436


>gb|ESW25063.1| hypothetical protein PHAVU_003G004000g [Phaseolus vulgaris]
          Length = 472

 Score =  223 bits (567), Expect = 2e-55
 Identities = 130/288 (45%), Positives = 169/288 (58%), Gaps = 28/288 (9%)
 Frame = -2

Query: 1278 GLGFTGEFPETRIDSNGNSGENIGNANGRKKVEDRKENGSFGVGKEVRIVNGRDMGMKGK 1099
            GLGF G+ P   + SN +        N  K+  ++KE       K VRIV GRD G+KG 
Sbjct: 199  GLGFVGDAPAALVRSNNDKD------NKDKEKNEKKE-------KVVRIVGGRDAGLKGS 245

Query: 1098 IVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXXXXXXXXX 919
            +V  R G D L+L +SRS EKVKV+  DV E+GS EE++C                    
Sbjct: 246  VVS-RIGDDYLVLELSRSGEKVKVKVGDVAELGSKEEERCLRKLKESKTQREDRGPKRKH 304

Query: 918  XXLSRK------TRQEEEKEVSER----------------------VNWLRNHIRVKIIS 823
                 +      +R+EE K V  R                      V+WL +HIRV++IS
Sbjct: 305  ERDEVEENGVDVSRREERKGVGRRDVVEKRTNGGRREERRVVDHRKVSWLTSHIRVRVIS 364

Query: 822  EELKGGRLFLKKXXXXXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGR 643
             +LKGG L+LKK           CD+S+DESRE+VQGV Q+ LETA+PKRGGPVLVL G+
Sbjct: 365  RDLKGGLLYLKKGEVLDVVGPTTCDVSMDESREIVQGVSQDFLETAIPKRGGPVLVLAGK 424

Query: 642  HKGVYGSLLERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            +KGV+GSL+ERD ++EM +VRDADTHELLNV L+QIAEY+GDPS +G+
Sbjct: 425  YKGVFGSLVERDLDREMAIVRDADTHELLNVKLEQIAEYMGDPSLLGH 472


>ref|XP_003529331.1| PREDICTED: protein MOS2-like [Glycine max]
          Length = 477

 Score =  218 bits (556), Expect = 3e-54
 Identities = 132/280 (47%), Positives = 166/280 (59%), Gaps = 20/280 (7%)
 Frame = -2

Query: 1278 GLGFTGEFPETRIDSNGNSGENIGNANGRKKVEDRKENGSFGVGKEVRIVNGRDMGMKGK 1099
            GLGF G+ P   + SN        N    KK E ++        K VRIV GRD G+KG 
Sbjct: 215  GLGFVGDAPAALVLSN--------NEKDNKKKEKKE--------KVVRIVGGRDSGLKGS 258

Query: 1098 IVEVRSGGDLLILRISRSNEKVKVRPT--DVVEVGSAEEDKCXXXXXXXXXXXXXXXXXX 925
            +V  R G D L+L +SRS EKVKV+    DV E+GS EE++C                  
Sbjct: 259  VVS-RIGDDYLVLELSRSGEKVKVKVKVGDVAELGSKEEERCLRKLKELKTQSEEDKVSK 317

Query: 924  XXXXLS-----------RKT------RQEEEKEVSER-VNWLRNHIRVKIISEELKGGRL 799
                             RK       R+EE + V  R V+WL +HIRV++IS +LKGGRL
Sbjct: 318  SKRGRDEVEEKRGDLNRRKEKRVDVGRKEERRVVDHRKVSWLTSHIRVRVISRDLKGGRL 377

Query: 798  FLKKXXXXXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSL 619
            +LKK           CDIS+DE+RE+VQGV Q++LET +PKRGGPVLVL G++KGVYGSL
Sbjct: 378  YLKKGEVLDVVGPTTCDISMDENREIVQGVSQDVLETVIPKRGGPVLVLAGKYKGVYGSL 437

Query: 618  LERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
             ERD ++E  +VRDADTHELLNV L+QIAEYIGDPS +G+
Sbjct: 438  AERDFDRETAIVRDADTHELLNVKLEQIAEYIGDPSLLGH 477


>ref|XP_003556598.1| PREDICTED: protein MOS2-like, partial [Glycine max]
          Length = 431

 Score =  218 bits (554), Expect = 6e-54
 Identities = 131/279 (46%), Positives = 166/279 (59%), Gaps = 19/279 (6%)
 Frame = -2

Query: 1278 GLGFTGEFPETRIDSNGNSGENIGNANGRKKVEDRKENGSFGVGKEVRIVNGRDMGMKGK 1099
            GLGF G+ P   + SN        N    KK E ++        K VRIV GRD G+KG 
Sbjct: 170  GLGFVGDAPAALVLSN--------NEKDNKKKEKKE--------KVVRIVGGRDAGLKGS 213

Query: 1098 IVEVRSGGDLLILRISRSNEKVKVRPT--DVVEVGSAEEDKCXXXXXXXXXXXXXXXXXX 925
            +V  R G D L+L +SRS EKVKV+    DV E+GS EE++C                  
Sbjct: 214  VVS-RIGDDYLVLELSRSGEKVKVKVKVGDVAELGSKEEERCLRKLKELKTQREDKVSKS 272

Query: 924  XXXXLS----------RKT------RQEEEKEVSER-VNWLRNHIRVKIISEELKGGRLF 796
                            RK       R+EE + V  R V+WL +HIRV++IS +LKGGRL+
Sbjct: 273  KRGRDEVEEKRGDVNRRKEKRVDVGRKEERRVVDHRKVSWLTSHIRVRVISRDLKGGRLY 332

Query: 795  LKKXXXXXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSLL 616
            LKK           CDIS+DE+RE+VQGV Q++LET +PKRGGPVLVL G++KGVYGS+ 
Sbjct: 333  LKKGEVLDVVGPTTCDISMDENREIVQGVSQDVLETVIPKRGGPVLVLAGKYKGVYGSMA 392

Query: 615  ERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            ERD ++E  +VRDADTHELLNV L+QIAEYIGDPS +G+
Sbjct: 393  ERDLDQETAIVRDADTHELLNVKLEQIAEYIGDPSLLGH 431


>ref|XP_002893773.1| hypothetical protein ARALYDRAFT_890931 [Arabidopsis lyrata subsp.
            lyrata] gi|297339615|gb|EFH70032.1| hypothetical protein
            ARALYDRAFT_890931 [Arabidopsis lyrata subsp. lyrata]
          Length = 461

 Score =  216 bits (551), Expect = 1e-53
 Identities = 123/269 (45%), Positives = 161/269 (59%), Gaps = 29/269 (10%)
 Frame = -2

Query: 1218 ENIGNANGRKKVEDRKENGS-----------------FGVGKEVRIVNGRDMGMKGKIVE 1090
            E +G    R KV D K  G                  F VGKEVRI+ GRD+G+KGKIVE
Sbjct: 194  EGLGFDPDRSKVVDVKVRGKESVKLDKMGVGVNGGDVFFVGKEVRIIAGRDVGLKGKIVE 253

Query: 1089 VRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXXXXXXXXXXXL 910
             + G D  +++IS S E+VKV   +V ++GS EE+KC                       
Sbjct: 254  -KLGSDFFVMKISGSEEEVKVGVNEVADLGSKEEEKCLKKLKDLQLNDKEKDKKASRGGR 312

Query: 909  SRKTRQEEEKEVSERVN------------WLRNHIRVKIISEELKGGRLFLKKXXXXXXX 766
              +     E  VSE+ +            WLR+ I+V+I+S+ELKGGRL+LKK       
Sbjct: 313  GTERGSRSEVRVSEKQDRGQTRERKVKPSWLRSQIKVRIVSKELKGGRLYLKKGKVVDVV 372

Query: 765  XXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSLLERDSEKEMGV 586
                CDI++DE++ELVQGVDQELLETALP+RGGPVLVL G+HKGVYG+L+E+D +KE GV
Sbjct: 373  GPTTCDITMDETQELVQGVDQELLETALPRRGGPVLVLSGKHKGVYGNLVEKDLDKETGV 432

Query: 585  VRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            VRD D H++L+V L+Q+AEY+GD  DI Y
Sbjct: 433  VRDLDNHKMLDVRLEQVAEYMGDMDDIEY 461


>ref|NP_174617.1| protein MOS2 [Arabidopsis thaliana]
            gi|75169419|sp|Q9C801.1|MOS2_ARATH RecName: Full=Protein
            MOS2 gi|12322393|gb|AAG51225.1|AC051630_22 unknown
            protein; 82634-81246 [Arabidopsis thaliana]
            gi|20259490|gb|AAM13865.1| unknown protein [Arabidopsis
            thaliana] gi|29824125|gb|AAP04023.1| unknown protein
            [Arabidopsis thaliana] gi|77176696|gb|ABA64466.1|
            putative nucleic-acid binding protein [Arabidopsis
            thaliana] gi|332193481|gb|AEE31602.1| protein MOS2
            [Arabidopsis thaliana]
          Length = 462

 Score =  216 bits (550), Expect = 2e-53
 Identities = 114/232 (49%), Positives = 151/232 (65%), Gaps = 12/232 (5%)
 Frame = -2

Query: 1158 FGVGKEVRIVNGRDMGMKGKIVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKC 979
            F VGKEVRI+ GRD+G+KGKIVE + G D  +++IS S E+VKV   +V ++GS EE+KC
Sbjct: 232  FFVGKEVRIIAGRDVGLKGKIVE-KPGSDFFVIKISGSEEEVKVGVNEVADLGSKEEEKC 290

Query: 978  XXXXXXXXXXXXXXXXXXXXXXLSRKTRQEEEKEVSERVN------------WLRNHIRV 835
                                     +     E   SE+ +            WLR+HI+V
Sbjct: 291  LKKLKDLQLNDREKDKKTSGRGRGAERGSRSEVRASEKQDRGQTRERKVKPSWLRSHIKV 350

Query: 834  KIISEELKGGRLFLKKXXXXXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLV 655
            +I+S++ KGGRL+LKK           CDI++DE++ELVQGVDQELLETALP+RGGPVLV
Sbjct: 351  RIVSKDWKGGRLYLKKGKVVDVVGPTTCDITMDETQELVQGVDQELLETALPRRGGPVLV 410

Query: 654  LYGRHKGVYGSLLERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            L G+HKGVYG+L+E+D +KE GVVRD D H++L+V LDQ+AEY+GD  DI Y
Sbjct: 411  LSGKHKGVYGNLVEKDLDKETGVVRDLDNHKMLDVRLDQVAEYMGDMDDIEY 462


>gb|ESW25327.1| hypothetical protein PHAVU_003G026500g [Phaseolus vulgaris]
          Length = 468

 Score =  216 bits (549), Expect = 2e-53
 Identities = 128/288 (44%), Positives = 167/288 (57%), Gaps = 28/288 (9%)
 Frame = -2

Query: 1278 GLGFTGEFPETRIDSNGNSGENIGNANGRKKVEDRKENGSFGVGKEVRIVNGRDMGMKGK 1099
            GLGF G+ P   + SN          N + K ++ K++      K VRIV GRD G+KG 
Sbjct: 198  GLGFVGDAPAALVRSN----------NDKDKEKNEKKD------KVVRIVGGRDAGLKGS 241

Query: 1098 IVEVRSGGDLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXXXXXXXXX 919
            +V  R     L+L +SRS EKVKV+  DV E+GS EE++C                    
Sbjct: 242  VVS-RIEDYYLVLELSRSGEKVKVKVGDVAELGSKEEERCLRKLKELKIQREDRGPKRKQ 300

Query: 918  XXLSRK------TRQEEEKEVSER----------------------VNWLRNHIRVKIIS 823
                 +      +R+EE K V  R                      V+WL +HIRV++IS
Sbjct: 301  DRNEVEENRVDVSRREERKGVGRRDVIEKRTDGGRREERRVVDHRKVSWLTSHIRVRVIS 360

Query: 822  EELKGGRLFLKKXXXXXXXXXGMCDISIDESRELVQGVDQELLETALPKRGGPVLVLYGR 643
             +LKGG L+LKK           CD+S+DESRE+VQGV QE LETA+PKRGGPVLVL G+
Sbjct: 361  RDLKGGLLYLKKGEVLDVVGPTTCDVSMDESREIVQGVSQEFLETAIPKRGGPVLVLAGK 420

Query: 642  HKGVYGSLLERDSEKEMGVVRDADTHELLNVSLDQIAEYIGDPSDIGY 499
            +KGV+GSL+ERD ++EM +VRDADTHELLNV L+QIAEY+GDPS +G+
Sbjct: 421  YKGVFGSLVERDLDREMAIVRDADTHELLNVKLEQIAEYLGDPSLLGH 468


>ref|XP_006415079.1| hypothetical protein EUTSA_v10007601mg [Eutrema salsugineum]
            gi|557092850|gb|ESQ33432.1| hypothetical protein
            EUTSA_v10007601mg [Eutrema salsugineum]
          Length = 453

 Score =  215 bits (548), Expect = 3e-53
 Identities = 124/262 (47%), Positives = 166/262 (63%), Gaps = 22/262 (8%)
 Frame = -2

Query: 1218 ENIGNANGRKKVEDRK----ENGS--------FGVGKEVRIVNGRDMGMKGKIVEVRSGG 1075
            E +G    R KV D K    E+G         F VGKEVRIV GRD+G+KGKIVE + G 
Sbjct: 195  EGLGFDPDRSKVVDTKAKVKESGKLDINGGDVFFVGKEVRIVAGRDIGLKGKIVE-KLGK 253

Query: 1074 DLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXXXXXXXXXXXLSRKTR 895
            DL +L++S S ++V V   +V ++GS EE++C                       SR T 
Sbjct: 254  DLFVLKLSGSKDEVTVGVNEVADLGSKEEERCLKKLKDLQLNDKEKDKKASKR--SRGTE 311

Query: 894  QEEEKEVSE----------RVNWLRNHIRVKIISEELKGGRLFLKKXXXXXXXXXGMCDI 745
            +  + EV +          + +WLR+ I+V+I+S+ELKGGRL+LKK           CDI
Sbjct: 312  RGSKSEVKQERGQTREWRVKPSWLRSQIKVRIVSKELKGGRLYLKKGKVVDVVGPTTCDI 371

Query: 744  SIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSLLERDSEKEMGVVRDADTH 565
            ++DE++ELVQGVDQELLETALP+RGGPVLVL G+HKGVYG+L+E+D +KE GVVRD D H
Sbjct: 372  TMDETQELVQGVDQELLETALPRRGGPVLVLLGKHKGVYGNLVEKDLDKETGVVRDLDNH 431

Query: 564  ELLNVSLDQIAEYIGDPSDIGY 499
            ++L+V L+Q+AEY+GD  DI Y
Sbjct: 432  KMLDVRLEQVAEYMGDMDDIEY 453


>dbj|BAJ34354.1| unnamed protein product [Thellungiella halophila]
          Length = 453

 Score =  215 bits (548), Expect = 3e-53
 Identities = 124/262 (47%), Positives = 166/262 (63%), Gaps = 22/262 (8%)
 Frame = -2

Query: 1218 ENIGNANGRKKVEDR----KENGS--------FGVGKEVRIVNGRDMGMKGKIVEVRSGG 1075
            E +G    R KV D     KE+G         F VGKEVRIV GRD+G+KGKIVE + G 
Sbjct: 195  EGLGFDPDRSKVVDTEAKVKESGKLDINGGDVFFVGKEVRIVAGRDIGLKGKIVE-KLGK 253

Query: 1074 DLLILRISRSNEKVKVRPTDVVEVGSAEEDKCXXXXXXXXXXXXXXXXXXXXXXLSRKTR 895
            DL +L++S S ++V V   +V ++GS EE++C                       SR T 
Sbjct: 254  DLFVLKLSGSKDEVTVGVNEVADLGSKEEERCLKKLKDLQLNDKEKDKKASKR--SRGTE 311

Query: 894  QEEEKEVSE----------RVNWLRNHIRVKIISEELKGGRLFLKKXXXXXXXXXGMCDI 745
            +  + EV +          + +WLR+ I+V+I+S+ELKGGRL+LKK           CDI
Sbjct: 312  RGSKSEVKQERGQTREWRVKPSWLRSQIKVRIVSKELKGGRLYLKKGKVVDVVGPTTCDI 371

Query: 744  SIDESRELVQGVDQELLETALPKRGGPVLVLYGRHKGVYGSLLERDSEKEMGVVRDADTH 565
            ++DE++ELVQGVDQELLETALP+RGGPVLVL G+HKGVYG+L+E+D +KE GVVRD D H
Sbjct: 372  TMDETQELVQGVDQELLETALPRRGGPVLVLLGKHKGVYGNLVEKDLDKETGVVRDLDNH 431

Query: 564  ELLNVSLDQIAEYIGDPSDIGY 499
            ++L+V L+Q+AEY+GD  DI Y
Sbjct: 432  KMLDVRLEQVAEYMGDMDDIEY 453

