BLASTX nr result

ID: Rehmannia26_contig00030892 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00030892
         (570 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS66392.1| hypothetical protein M569_08394 [Genlisea aurea]       144   2e-32
ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...   141   1e-31
ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein...   139   5e-31
ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255...   139   5e-31
ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citr...   134   2e-29
ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX ho...   127   3e-27
gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabi...   126   3e-27
gb|ESW35972.1| hypothetical protein PHAVU_L0004001g [Phaseolus v...   126   4e-27
gb|ESW35973.1| hypothetical protein PHAVU_L0004001g, partial [Ph...   125   1e-26
gb|ESW35982.1| hypothetical protein PHAVU_L0001001g, partial [Ph...   124   2e-26
gb|ESW35981.1| hypothetical protein PHAVU_L0001001g [Phaseolus v...   124   2e-26
gb|EOY03082.1| DNA glycosylase superfamily protein, putative iso...   122   5e-26
ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101...   120   2e-25
ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Caps...   120   2e-25
ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutr...   119   4e-25
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                 119   7e-25
ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsi...   117   2e-24
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...   117   2e-24
ref|XP_004309787.1| PREDICTED: uncharacterized protein LOC101298...   116   3e-24
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...   113   4e-23

>gb|EPS66392.1| hypothetical protein M569_08394 [Genlisea aurea]
          Length = 369

 Score =  144 bits (363), Expect = 2e-32
 Identities = 82/164 (50%), Positives = 98/164 (59%)
 Frame = -3

Query: 493 GAGKEARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAVSL 314
           G  K+  V+ PYFA      E+  R K             VSPYF S ++    +  +S 
Sbjct: 156 GCDKKVVVLDPYFA------EDMSRKK-------------VSPYFQSPRKTSGSDRGIS- 195

Query: 313 GGPTNSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDP 134
                 +V            P+L++ QK+DEAYER+T DN W PPRSPFNLLQEDH FDP
Sbjct: 196 ------EVVEESPERSKRWKPVLSSVQKRDEAYERRTPDNEWTPPRSPFNLLQEDHMFDP 249

Query: 133 WRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           WRVLVICMLLNQTTG+Q  RVLSK F+LCP AK ATEVA + IE
Sbjct: 250 WRVLVICMLLNQTTGRQAFRVLSKLFELCPTAKAATEVARDDIE 293


>ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
           gi|223546492|gb|EEF47991.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 608

 Score =  141 bits (355), Expect = 1e-31
 Identities = 83/163 (50%), Positives = 98/163 (60%), Gaps = 3/163 (1%)
 Frame = -3

Query: 481 EARVVSPYFANADANAEEKVRTKEGK-IESVKLQVRIVSPYFCSTQQDKEDENAVS--LG 311
           + R VSP F N     +E ++ K  K  E V L VR VSPYF    + +E+E A S  + 
Sbjct: 370 QVRKVSPNF-NLSIGQQECMKIKPLKPCERVGLTVRNVSPYFQKVPKQEEEEAADSNMID 428

Query: 310 GPTNSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPW 131
                K               L+AA+K+ EAY RKT DN W+PPRS F LLQEDHA DPW
Sbjct: 429 NKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRRKTPDNTWKPPRSDFGLLQEDHASDPW 488

Query: 130 RVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           RVLVICMLLN TTGKQ   V+S FF LCP+AK ATE  TE+IE
Sbjct: 489 RVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAATEAKTEEIE 531


>ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein 4-like, partial
           [Solanum tuberosum]
          Length = 222

 Score =  139 bits (350), Expect = 5e-31
 Identities = 84/160 (52%), Positives = 93/160 (58%)
 Frame = -3

Query: 481 EARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAVSLGGPT 302
           + RVVSPYFAN     E KV           L  R VSPYF    Q+   EN  S  G  
Sbjct: 4   KVRVVSPYFANLTVGEEIKVGKDRSNPSKNCLNGRKVSPYF----QNAYRENKKSRKGSK 59

Query: 301 NSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVL 122
             K             P L+A QK+DEAY R++ DN W PPRS FNLLQE+HA DPWRVL
Sbjct: 60  RQK-------------PCLSAFQKRDEAYLRRSEDNTWVPPRSHFNLLQENHAHDPWRVL 106

Query: 121 VICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           VICMLLN TTG Q  RV+ +FF LCPNA  ATEVA E IE
Sbjct: 107 VICMLLNCTTGVQVKRVVDEFFTLCPNAVAATEVAVEDIE 146


>ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255935 [Solanum
           lycopersicum]
          Length = 544

 Score =  139 bits (350), Expect = 5e-31
 Identities = 81/160 (50%), Positives = 94/160 (58%)
 Frame = -3

Query: 481 EARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAVSLGGPT 302
           + RVVSPYFAN     E KV           L  R VSPYF +  ++K+           
Sbjct: 324 KVRVVSPYFANLKVGEEIKVGKDSSNASKNCLNGRKVSPYFQNAYREKKKSTI------- 376

Query: 301 NSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVL 122
            SK Q             L+A+QK+DEAY R++ DN W PPRS FNLLQE+HA DPWRVL
Sbjct: 377 GSKRQKPC----------LSASQKRDEAYLRRSEDNMWVPPRSHFNLLQENHAHDPWRVL 426

Query: 121 VICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           VICMLLN TTG Q  RV+ +FF LCPNA  ATEVA E IE
Sbjct: 427 VICMLLNCTTGVQVRRVVDEFFTLCPNAVAATEVAVEDIE 466


>ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citrus clementina]
           gi|568883956|ref|XP_006494704.1| PREDICTED:
           transcriptional regulator ATRX homolog isoform X2
           [Citrus sinensis] gi|557525860|gb|ESR37166.1|
           hypothetical protein CICLE_v10028470mg [Citrus
           clementina]
          Length = 439

 Score =  134 bits (336), Expect = 2e-29
 Identities = 76/157 (48%), Positives = 88/157 (56%), Gaps = 1/157 (0%)
 Frame = -3

Query: 469 VSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAVSLGGPTNS-K 293
           VSPYF    A   E+    +    S   Q R VSPYF +          V +       K
Sbjct: 210 VSPYFQRQKAGNVER----KNHDTSTMAQARKVSPYFQNQNSTTPAAATVQVHNQQQEEK 265

Query: 292 VQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVIC 113
            +             LTAAQK+DEAYERK  DN W PPRSP  LLQ +H  DPWRV+VIC
Sbjct: 266 EKDIAVKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIVLLQHEHVHDPWRVIVIC 325

Query: 112 MLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           MLLN+TTG Q GRV+S  F LCP+AKTATEV  E+IE
Sbjct: 326 MLLNRTTGLQAGRVISDLFTLCPDAKTATEVDAEEIE 362


>ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1
           [Citrus sinensis]
          Length = 446

 Score =  127 bits (318), Expect = 3e-27
 Identities = 76/164 (46%), Positives = 88/164 (53%), Gaps = 8/164 (4%)
 Frame = -3

Query: 469 VSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAVSLGGPTNS-K 293
           VSPYF    A   E+    +    S   Q R VSPYF +          V +       K
Sbjct: 210 VSPYFQRQKAGNVER----KNHDTSTMAQARKVSPYFQNQNSTTPAAATVQVHNQQQEEK 265

Query: 292 VQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVIC 113
            +             LTAAQK+DEAYERK  DN W PPRSP  LLQ +H  DPWRV+VIC
Sbjct: 266 EKDIAVKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIVLLQHEHVHDPWRVIVIC 325

Query: 112 MLLNQTTGKQ-------TGRVLSKFFQLCPNAKTATEVATEKIE 2
           MLLN+TTG Q        GRV+S  F LCP+AKTATEV  E+IE
Sbjct: 326 MLLNRTTGLQEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIE 369


>gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabilis]
          Length = 418

 Score =  126 bits (317), Expect = 3e-27
 Identities = 77/176 (43%), Positives = 99/176 (56%), Gaps = 18/176 (10%)
 Frame = -3

Query: 478 ARVVSPYFANADANAEEK-----------VRTKEGKIESVKLQVRIVSPYFCSTQQDK-- 338
           +RVVSPYF     + +EK           V   E K E +KL V ++S +     ++K  
Sbjct: 167 SRVVSPYFTTNRNDTQEKKKKPEKDGREEVELGEKKEEHLKL-VDVLSRFAYKPMKEKTT 225

Query: 337 ----EDENAVSLGGPTNSKV-QXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRS 173
               E    + L G    K+ +            +L AA+K+DEAY+RKT DN W PP S
Sbjct: 226 VERAEKGRKLGLVGVGEKKMSKIVVRRKKIEKSKVLNAAEKRDEAYKRKTDDNKWNPPPS 285

Query: 172 PFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKI 5
              L+Q+DH  DPWRVLVICMLLN+TTG Q  RV+S FF LCPNAK ATEV+ E+I
Sbjct: 286 EIRLIQQDHLHDPWRVLVICMLLNRTTGAQATRVISDFFSLCPNAKAATEVSPEEI 341


>gb|ESW35972.1| hypothetical protein PHAVU_L0004001g [Phaseolus vulgaris]
          Length = 726

 Score =  126 bits (316), Expect = 4e-27
 Identities = 75/180 (41%), Positives = 101/180 (56%), Gaps = 15/180 (8%)
 Frame = -3

Query: 496  KGAGKEARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAVS 317
            K A    R VSPYF N   + +  V++K    ++V   +R VSPYF +      D   + 
Sbjct: 473  KNAAHGIRYVSPYFHND--SGKMSVKSKPLVQKNVAHAIRYVSPYFHNDSGKNIDVKPLD 530

Query: 316  LGGPTNS---------------KVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRP 182
             G    S               + +             L+A+QK DEAY+RKT D  W+P
Sbjct: 531  EGSKFESIALHATENYVEDKPEENKSSCSEKSIEIKKNLSASQKWDEAYKRKTPDITWKP 590

Query: 181  PRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
            PRS   L+QEDHA DPWRVLVICMLLN+T+G+QT  ++S FF+LCP+AK+ TEV+ E+IE
Sbjct: 591  PRSATVLIQEDHAHDPWRVLVICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIE 650


>gb|ESW35973.1| hypothetical protein PHAVU_L0004001g, partial [Phaseolus vulgaris]
          Length = 715

 Score =  125 bits (313), Expect = 1e-26
 Identities = 73/166 (43%), Positives = 98/166 (59%), Gaps = 1/166 (0%)
 Frame = -3

Query: 496 KGAGKEARVVSPYFAN-ADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAV 320
           K      R VSPYF N +  N + K   +  K ES+ L          +  +DK +EN  
Sbjct: 492 KNVAHAIRYVSPYFHNDSGKNIDVKPLDEGSKFESIALHATE------NYVEDKPEENKS 545

Query: 319 SLGGPTNSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAF 140
           S    +   ++             L+A+QK DEAY+RKT D  W+PPRS   L+QEDHA 
Sbjct: 546 SC---SEKSIEIKKN---------LSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAH 593

Query: 139 DPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           DPWRVLVICMLLN+T+G+QT  ++S FF+LCP+AK+ TEV+ E+IE
Sbjct: 594 DPWRVLVICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIE 639


>gb|ESW35982.1| hypothetical protein PHAVU_L0001001g, partial [Phaseolus vulgaris]
          Length = 205

 Score =  124 bits (310), Expect = 2e-26
 Identities = 72/166 (43%), Positives = 98/166 (59%), Gaps = 1/166 (0%)
 Frame = -3

Query: 496 KGAGKEARVVSPYFAN-ADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAV 320
           K      R VSPYF N +  N + K   +  K ES+ L          +  +DK +EN  
Sbjct: 19  KNVAHAIRYVSPYFHNDSGKNIDVKPLDEGSKFESIALHATE------NFVEDKPEENKS 72

Query: 319 SLGGPTNSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAF 140
           S    +   ++             L+A++K DEAY+RKT D  W+PPRS   L+QEDHA 
Sbjct: 73  SC---SEKSIEIKKN---------LSASEKWDEAYKRKTPDITWKPPRSATVLIQEDHAH 120

Query: 139 DPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           DPWRVLVICMLLN+T+G+QT  ++S FF+LCP+AK+ TEV+ E+IE
Sbjct: 121 DPWRVLVICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIE 166


>gb|ESW35981.1| hypothetical protein PHAVU_L0001001g [Phaseolus vulgaris]
          Length = 197

 Score =  124 bits (310), Expect = 2e-26
 Identities = 72/166 (43%), Positives = 98/166 (59%), Gaps = 1/166 (0%)
 Frame = -3

Query: 496 KGAGKEARVVSPYFAN-ADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAV 320
           K      R VSPYF N +  N + K   +  K ES+ L          +  +DK +EN  
Sbjct: 11  KNVAHAIRYVSPYFHNDSGKNIDVKPLDEGSKFESIALHATE------NFVEDKPEENKS 64

Query: 319 SLGGPTNSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAF 140
           S    +   ++             L+A++K DEAY+RKT D  W+PPRS   L+QEDHA 
Sbjct: 65  SC---SEKSIEIKKN---------LSASEKWDEAYKRKTPDITWKPPRSATVLIQEDHAH 112

Query: 139 DPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           DPWRVLVICMLLN+T+G+QT  ++S FF+LCP+AK+ TEV+ E+IE
Sbjct: 113 DPWRVLVICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIE 158


>gb|EOY03082.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma
           cacao] gi|508711186|gb|EOY03083.1| DNA glycosylase
           superfamily protein, putative isoform 1 [Theobroma
           cacao]
          Length = 382

 Score =  122 bits (307), Expect = 5e-26
 Identities = 79/193 (40%), Positives = 106/193 (54%), Gaps = 6/193 (3%)
 Frame = -3

Query: 562 RIVMINGGIASQRKMRAGANSCKGAGKEARV------VSPYFANADANAEEKVRTKEGKI 401
           ++ +I+  + S +K+    +  K  GK  R       VSPY   +    + +  T + K 
Sbjct: 132 KLNLISQVVHSYKKVLKKGDVNKQNGKRRRADAQVLKVSPYLQRSGEKQDMESGTSKPKH 191

Query: 400 ESVKLQVRIVSPYFCSTQQDKEDENAVSLGGPTNSKVQXXXXXXXXXXXPLLTAAQKKDE 221
           + VK      SPYF   + +        LGG   +              P+L+A+QK+DE
Sbjct: 192 KVVK-----ASPYFLKNKDN-------ILGGMKKAM-------KPAGVKPVLSASQKRDE 232

Query: 220 AYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPN 41
           AY+RKT +N W PPRS   LLQEDH  DPWRVL+ICMLLN+T+G Q   VLS  F LCP+
Sbjct: 233 AYQRKTPNNTWIPPRSNAPLLQEDHTHDPWRVLLICMLLNKTSGNQARNVLSDLFTLCPD 292

Query: 40  AKTATEVATEKIE 2
           AKTATEVAT +IE
Sbjct: 293 AKTATEVATGEIE 305


>ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101(2)-like [Glycine max]
          Length = 1424

 Score =  120 bits (302), Expect = 2e-25
 Identities = 71/163 (43%), Positives = 95/163 (58%), Gaps = 1/163 (0%)
 Frame = -3

Query: 487  GKEARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCST-QQDKEDENAVSLG 311
            G   R VSPYF N   N+ +KV  K     S    + +   + C    +DK +EN  +  
Sbjct: 1203 GHGIRYVSPYFCN---NSGKKVNVKPFDKGSTSESIAL---HTCKNFVEDKLEENKSNC- 1255

Query: 310  GPTNSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPW 131
              +N  ++               A++K DEAY+RKT DN W+PPRS   L+QEDH  DPW
Sbjct: 1256 --SNKSIEIKRFPP---------ASEKWDEAYKRKTPDNTWKPPRSEIVLIQEDHLHDPW 1304

Query: 130  RVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
            RVLVICMLLN+T G QT +V+S FF+LCP+AK+ T+V  E+IE
Sbjct: 1305 RVLVICMLLNRTAGGQTKKVVSNFFKLCPDAKSCTQVTREEIE 1347


>ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Capsella rubella]
           gi|482566361|gb|EOA30550.1| hypothetical protein
           CARUB_v10013672mg [Capsella rubella]
          Length = 456

 Score =  120 bits (301), Expect = 2e-25
 Identities = 78/179 (43%), Positives = 97/179 (54%)
 Frame = -3

Query: 538 IASQRKMRAGANSCKGAGKEARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYF 359
           ++SQ       +S K   K  RV   + A+AD+      R      + VK     VS YF
Sbjct: 214 VSSQSGGSYRRDSSKHQAKVRRVSRYFQASADSEQPNPPRDLRKYFKVVK-----VSRYF 268

Query: 358 CSTQQDKEDENAVSLGGPTNSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPP 179
                   D +A  +    + K +           P L+ +QK DEAY RKT DN W PP
Sbjct: 269 -------HDVSADGIQVADSQKEKSRRVRKTPVVSPSLSPSQKTDEAYLRKTPDNTWVPP 321

Query: 178 RSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           RSP NLLQEDH  DPWRVLVICMLLN+T+G QT  V+S  F LCP+AKTATEV  ++IE
Sbjct: 322 RSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFTLCPDAKTATEVEEKEIE 380


>ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutrema salsugineum]
           gi|557108926|gb|ESQ49233.1| hypothetical protein
           EUTSA_v10020704mg [Eutrema salsugineum]
          Length = 456

 Score =  119 bits (299), Expect = 4e-25
 Identities = 80/205 (39%), Positives = 104/205 (50%), Gaps = 28/205 (13%)
 Frame = -3

Query: 532 SQRKMRAGANSCKGAGKEARVVSPYF-----ANADANAEEKVRTKEGKIESVKLQVRI-- 374
           S +  R     C+    + R VSPYF     +  D+ +      ++ + ES KLQ ++  
Sbjct: 176 SSQNGRNYRKECRKVQAKVRRVSPYFQASTFSQCDSESVASQSGRKYRKESSKLQAKVPR 235

Query: 373 VSPYFCSTQQDKEDENAVSL-------------------GGPTNS--KVQXXXXXXXXXX 257
           VSPYF  +   ++   +  L                   G   N   K +          
Sbjct: 236 VSPYFQGSTVSEQPNPSRDLRQYFKVVKVSRYFHDMPADGTQVNEPQKERSRRMRKTPVV 295

Query: 256 XPLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTG 77
            P L+  QK DEAY RK  DN W PPRSP NLLQEDH  DPWRVLVICMLLN+T+G QT 
Sbjct: 296 SPSLSQCQKTDEAYLRKMPDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTR 355

Query: 76  RVLSKFFQLCPNAKTATEVATEKIE 2
            V+S  F LCP+AK+ATEV  ++IE
Sbjct: 356 GVISDLFVLCPDAKSATEVEEKEIE 380


>gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score =  119 bits (297), Expect = 7e-25
 Identities = 78/175 (44%), Positives = 100/175 (57%), Gaps = 3/175 (1%)
 Frame = -3

Query: 517 RAGANSCKGAGK---EARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQ 347
           ++G N  KG+ K   +AR VSPYF  +  + E+  +  +G     K  V  VS YF    
Sbjct: 170 QSGRNYRKGSSKRQVKARRVSPYFQESTVS-EQPNQAPKGLRNYFK--VVKVSRYF---- 222

Query: 346 QDKEDENAVSLGGPTNSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPF 167
                 +A  +    + K +           P+L+ +QK D+ Y RKT DN W PPRSP 
Sbjct: 223 ------HADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPC 276

Query: 166 NLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           NLLQEDH  DPWRVLVICMLLN+T+G QT  V+S  F LC +AKTATEV  E+IE
Sbjct: 277 NLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIE 331


>ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
           gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis
           thaliana] gi|332641100|gb|AEE74621.1| DNA glycosylase
           superfamily protein [Arabidopsis thaliana]
          Length = 445

 Score =  117 bits (293), Expect = 2e-24
 Identities = 77/175 (44%), Positives = 99/175 (56%), Gaps = 3/175 (1%)
 Frame = -3

Query: 517 RAGANSCKGAGK---EARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQ 347
           ++G N  KG+ K   + R VSPYF  +  + E+  +  +G     K  V  VS YF    
Sbjct: 208 QSGRNYRKGSSKRQVKVRRVSPYFQESTVS-EQPNQAPKGLRNYFK--VVKVSRYF---- 260

Query: 346 QDKEDENAVSLGGPTNSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPF 167
                 +A  +    + K +           P+L+ +QK D+ Y RKT DN W PPRSP 
Sbjct: 261 ------HADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPC 314

Query: 166 NLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           NLLQEDH  DPWRVLVICMLLN+T+G QT  V+S  F LC +AKTATEV  E+IE
Sbjct: 315 NLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIE 369


>gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score =  117 bits (293), Expect = 2e-24
 Identities = 77/175 (44%), Positives = 99/175 (56%), Gaps = 3/175 (1%)
 Frame = -3

Query: 517 RAGANSCKGAGK---EARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQ 347
           ++G N  KG+ K   + R VSPYF  +  + E+  +  +G     K  V  VS YF    
Sbjct: 182 QSGRNYRKGSSKRQVKVRRVSPYFQESTVS-EQPNQAPKGLRNYFK--VVKVSRYF---- 234

Query: 346 QDKEDENAVSLGGPTNSKVQXXXXXXXXXXXPLLTAAQKKDEAYERKTADNPWRPPRSPF 167
                 +A  +    + K +           P+L+ +QK D+ Y RKT DN W PPRSP 
Sbjct: 235 ------HADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPC 288

Query: 166 NLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2
           NLLQEDH  DPWRVLVICMLLN+T+G QT  V+S  F LC +AKTATEV  E+IE
Sbjct: 289 NLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIE 343


>ref|XP_004309787.1| PREDICTED: uncharacterized protein LOC101298191 [Fragaria vesca
           subsp. vesca]
          Length = 410

 Score =  116 bits (291), Expect = 3e-24
 Identities = 55/82 (67%), Positives = 62/82 (75%)
 Frame = -3

Query: 247 LTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVL 68
           L+A+Q++DEAY R+T DN W PPRS   LLQEDH  DPWRVLVICMLLN+T GKQ   V+
Sbjct: 253 LSASQRRDEAYRRRTPDNTWIPPRSEIKLLQEDHYHDPWRVLVICMLLNRTQGKQLKGVI 312

Query: 67  SKFFQLCPNAKTATEVATEKIE 2
           S FF LCP AK ATEVA   IE
Sbjct: 313 SNFFSLCPTAKAATEVALRDIE 334


>ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
           lyrata] gi|297328398|gb|EFH58817.1| hypothetical protein
           ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  113 bits (282), Expect = 4e-23
 Identities = 54/82 (65%), Positives = 62/82 (75%)
 Frame = -3

Query: 247 LTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVL 68
           L+ +QK DEAY+RKT D  W PPRSP NLLQE H  DPWRVLVICMLLN+T+G QT  V+
Sbjct: 278 LSLSQKTDEAYQRKTPDKTWVPPRSPCNLLQEHHWHDPWRVLVICMLLNKTSGAQTRGVI 337

Query: 67  SKFFQLCPNAKTATEVATEKIE 2
              F LCP+AKTATEV   +IE
Sbjct: 338 EDLFALCPDAKTATEVEEREIE 359
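

Note on reading the statistics lines: every alignment above carries a line of the form "Identities = a/n (p%), Positives = b/n (q%)", with an optional "Gaps = g/n (r%)" field. The following is a minimal sketch, not part of the BLAST output, of how those lines could be parsed in Python. The names STATS_RE and parse_stats are hypothetical, and the pattern assumes the legacy text layout shown in this report.

```python
import re

# Matches the legacy BLAST statistics line, for example:
#  Identities = 83/163 (50%), Positives = 98/163 (60%), Gaps = 3/163 (1%)
# The Gaps field is optional because ungapped alignments omit it.
STATS_RE = re.compile(
    r"Identities = (\d+)/(\d+) \((\d+)%\), "
    r"Positives = (\d+)/(\d+) \((\d+)%\)"
    r"(?:, Gaps = (\d+)/(\d+) \((\d+)%\))?"
)


def parse_stats(line):
    """Return alignment statistics as a dict, or None if the line does not match."""
    match = STATS_RE.search(line)
    if match is None:
        return None
    identities = int(match.group(1))
    length = int(match.group(2))
    positives = int(match.group(4))
    # Group 7 is only set when the Gaps field is present.
    gaps = int(match.group(7)) if match.group(7) else 0
    return {
        "identities": identities,
        "positives": positives,
        "gaps": gaps,
        "length": length,
        # Unrounded percentage, recomputed from the raw counts.
        "pct_identity": 100.0 * identities / length,
    }


if __name__ == "__main__":
    sample = " Identities = 83/163 (50%), Positives = 98/163 (60%), Gaps = 3/163 (1%)"
    print(parse_stats(sample))
```

In this report the printed percentages appear to be truncated rather than rounded (84/160 is shown as 52%), so the sketch recomputes the percentage from the raw counts instead of relying on the printed value.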