BLASTX nr result

ID: Rehmannia22_contig00023588 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00023588
         (2802 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS66392.1| hypothetical protein M569_08394 [Genlisea aurea]       273   3e-70
ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein...   248   8e-63
ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citr...   245   7e-62
ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255...   245   7e-62
ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...   244   1e-61
ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX ho...   238   8e-60
gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabi...   230   3e-57
ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutr...   227   2e-56
ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Caps...   227   2e-56
gb|ESW35972.1| hypothetical protein PHAVU_L0004001g [Phaseolus v...   224   1e-55
gb|ESW35973.1| hypothetical protein PHAVU_L0004001g, partial [Ph...   223   4e-55
gb|EOY03082.1| DNA glycosylase superfamily protein, putative iso...   222   6e-55
ref|XP_004309787.1| PREDICTED: uncharacterized protein LOC101298...   221   1e-54
ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101...   219   4e-54
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                 219   4e-54
ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Popu...   218   9e-54
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...   217   3e-53
ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsi...   215   7e-53
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...   215   7e-53
emb|CBI29440.3| unnamed protein product [Vitis vinifera]              215   1e-52

>gb|EPS66392.1| hypothetical protein M569_08394 [Genlisea aurea]
          Length = 369

 Score =  273 bits (698), Expect = 3e-70
 Identities = 163/360 (45%), Positives = 208/360 (57%), Gaps = 7/360 (1%)
 Frame = +3

Query: 1620 DSGGVFKMEKFSLDDFFSRFAYTGGKCYMNSAKF-----GVCQSSSQTTETCGEGQMKTD 1784
            DSG V   EK SLDD  SR++ T  +C   S+       G+   +S+T     E      
Sbjct: 43   DSGCVSDREKLSLDDVISRYSCTISRCPSKSSPRCLEAGGIENPTSETKGLSSEITALAS 102

Query: 1785 TMKIVKDDLAAGNNARLCRADVGSQTAISTPNSCENAKMGERIVMINGGIASQRKMRAGA 1964
            T   V+   A  +  ++ R     + ++S   + +   + +RI         +R+ R   
Sbjct: 103  TPDAVEGFTADCSVVKMKRR----KNSMSKDENGDGKVLPDRI---------KRRSRKKK 149

Query: 1965 NSCK--GAGKEARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKED 2138
            N     G  K+  V+ PYFA      E+  R K             VSPYF S ++    
Sbjct: 150  NIVTEDGCDKKVVVLDPYFA------EDMSRKK-------------VSPYFQSPRKTSGS 190

Query: 2139 ENAVSLGGPTNSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQE 2318
            +  +S       +V             +L++ QK+DEAYER+T DN W PPRSPFNLLQE
Sbjct: 191  DRGIS-------EVVEESPERSKRWKPVLSSVQKRDEAYERRTPDNEWTPPRSPFNLLQE 243

Query: 2319 DHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVIRSLGLYK 2498
            DH FDPWRVLVICMLLNQTTG+Q  RVLSK F+LCP AK ATEVA + IE+ IR LGL +
Sbjct: 244  DHMFDPWRVLVICMLLNQTTGRQAFRVLSKLFELCPTAKAATEVARDDIEDAIRCLGLQR 303

Query: 2499 KRAAGIQRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVKYWEFL 2678
            KRA  IQRFSEEY++E WTHVT+L G+GKYAADAYAIFCTG+W+RVRP DHMLVKYWE+L
Sbjct: 304  KRAEMIQRFSEEYMSEEWTHVTELPGIGKYAADAYAIFCTGRWQRVRPADHMLVKYWEWL 363


>ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein 4-like, partial [Solanum
            tuberosum]
          Length = 222

 Score =  248 bits (634), Expect = 8e-63
 Identities = 133/233 (57%), Positives = 150/233 (64%)
 Frame = +3

Query: 1989 EARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAVSLGGPT 2168
            + RVVSPYFAN     E KV           L  R VSPYF    Q+   EN  S  G  
Sbjct: 4    KVRVVSPYFANLTVGEEIKVGKDRSNPSKNCLNGRKVSPYF----QNAYRENKKSRKGSK 59

Query: 2169 NSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVL 2348
              K               L+A QK+DEAY R++ DN W PPRS FNLLQE+HA DPWRVL
Sbjct: 60   RQK-------------PCLSAFQKRDEAYLRRSEDNTWVPPRSHFNLLQENHAHDPWRVL 106

Query: 2349 VICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVIRSLGLYKKRAAGIQRFS 2528
            VICMLLN TTG Q  RV+ +FF LCPNA  ATEVA E IE+++R LGLY KR+  I R S
Sbjct: 107  VICMLLNCTTGVQVKRVVDEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLAIPRLS 166

Query: 2529 EEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVKYWEFLCGN 2687
            +EYL E WTHVT L G+GKYAADAYAIFCTGKW++V P DHML KYWEFL  N
Sbjct: 167  QEYLGETWTHVTQLHGIGKYAADAYAIFCTGKWDQVHPNDHMLTKYWEFLHAN 219


>ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citrus clementina]
            gi|568883956|ref|XP_006494704.1| PREDICTED:
            transcriptional regulator ATRX homolog isoform X2 [Citrus
            sinensis] gi|557525860|gb|ESR37166.1| hypothetical
            protein CICLE_v10028470mg [Citrus clementina]
          Length = 439

 Score =  245 bits (626), Expect = 7e-62
 Identities = 132/234 (56%), Positives = 150/234 (64%), Gaps = 4/234 (1%)
 Frame = +3

Query: 2001 VSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAVSLGGPTNS-K 2177
            VSPYF    A   E+    +    S   Q R VSPYF +          V +       K
Sbjct: 210  VSPYFQRQKAGNVER----KNHDTSTMAQARKVSPYFQNQNSTTPAAATVQVHNQQQEEK 265

Query: 2178 VQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVIC 2357
             +             LTAAQK+DEAYERK  DN W PPRSP  LLQ +H  DPWRV+VIC
Sbjct: 266  EKDIAVKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIVLLQHEHVHDPWRVIVIC 325

Query: 2358 MLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVIRSLGLYKKRAAGIQRFSEEY 2537
            MLLN+TTG Q GRV+S  F LCP+AKTATEV  E+IE++I +LGL KKRA  I+RFS+EY
Sbjct: 326  MLLNRTTGLQAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEY 385

Query: 2538 LNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVKYWEFLC---GNL 2690
            L E WTHVT L GVGKYAADAYAIFCTGKW+RVRP DHML  YWEFL    GNL
Sbjct: 386  LGESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGNL 439


>ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255935 [Solanum
            lycopersicum]
          Length = 544

 Score =  245 bits (626), Expect = 7e-62
 Identities = 129/233 (55%), Positives = 151/233 (64%)
 Frame = +3

Query: 1989 EARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAVSLGGPT 2168
            + RVVSPYFAN     E KV           L  R VSPYF +  ++K+           
Sbjct: 324  KVRVVSPYFANLKVGEEIKVGKDSSNASKNCLNGRKVSPYFQNAYREKKKSTI------- 376

Query: 2169 NSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVL 2348
             SK Q             L+A+QK+DEAY R++ DN W PPRS FNLLQE+HA DPWRVL
Sbjct: 377  GSKRQKPC----------LSASQKRDEAYLRRSEDNMWVPPRSHFNLLQENHAHDPWRVL 426

Query: 2349 VICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVIRSLGLYKKRAAGIQRFS 2528
            VICMLLN TTG Q  RV+ +FF LCPNA  ATEVA E IE+++R LGLY KR+  I R S
Sbjct: 427  VICMLLNCTTGVQVRRVVDEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLSIPRLS 486

Query: 2529 EEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVKYWEFLCGN 2687
            +EYL + WTHVT L G+GKYAADAYAIFCTG W++V P DHML KYWEFL  N
Sbjct: 487  QEYLGKNWTHVTQLHGIGKYAADAYAIFCTGNWDQVHPNDHMLTKYWEFLHAN 539


>ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
            gi|223546492|gb|EEF47991.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 608

 Score =  244 bits (624), Expect = 1e-61
 Identities = 133/233 (57%), Positives = 155/233 (66%), Gaps = 3/233 (1%)
 Frame = +3

Query: 1989 EARVVSPYFANADANAEEKVRTKEGK-IESVKLQVRIVSPYFCSTQQDKEDENAVS--LG 2159
            + R VSP F N     +E ++ K  K  E V L VR VSPYF    + +E+E A S  + 
Sbjct: 370  QVRKVSPNF-NLSIGQQECMKIKPLKPCERVGLTVRNVSPYFQKVPKQEEEEAADSNMID 428

Query: 2160 GPTNSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPW 2339
                 K               L+AA+K+ EAY RKT DN W+PPRS F LLQEDHA DPW
Sbjct: 429  NKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRRKTPDNTWKPPRSDFGLLQEDHASDPW 488

Query: 2340 RVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVIRSLGLYKKRAAGIQ 2519
            RVLVICMLLN TTGKQ   V+S FF LCP+AK ATE  TE+IE++I  LGL KKRA  IQ
Sbjct: 489  RVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAATEAKTEEIEKIIVPLGLQKKRAVMIQ 548

Query: 2520 RFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVKYWEFL 2678
            R S+EYL + WTHVT L GVGKYAADAYAIFCTGKW++VRP DHML  YW+FL
Sbjct: 549  RLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPKDHMLNYYWDFL 601


>ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Citrus
            sinensis]
          Length = 446

 Score =  238 bits (608), Expect = 8e-60
 Identities = 132/241 (54%), Positives = 150/241 (62%), Gaps = 11/241 (4%)
 Frame = +3

Query: 2001 VSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAVSLGGPTNS-K 2177
            VSPYF    A   E+    +    S   Q R VSPYF +          V +       K
Sbjct: 210  VSPYFQRQKAGNVER----KNHDTSTMAQARKVSPYFQNQNSTTPAAATVQVHNQQQEEK 265

Query: 2178 VQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVIC 2357
             +             LTAAQK+DEAYERK  DN W PPRSP  LLQ +H  DPWRV+VIC
Sbjct: 266  EKDIAVKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIVLLQHEHVHDPWRVIVIC 325

Query: 2358 MLLNQTTGKQ-------TGRVLSKFFQLCPNAKTATEVATEKIEEVIRSLGLYKKRAAGI 2516
            MLLN+TTG Q        GRV+S  F LCP+AKTATEV  E+IE++I +LGL KKRA  I
Sbjct: 326  MLLNRTTGLQEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMI 385

Query: 2517 QRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVKYWEFLC---GN 2687
            +RFS+EYL E WTHVT L GVGKYAADAYAIFCTGKW+RVRP DHML  YWEFL    GN
Sbjct: 386  KRFSQEYLGESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPTDHMLNYYWEFLVSTKGN 445

Query: 2688 L 2690
            L
Sbjct: 446  L 446


>gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabilis]
          Length = 418

 Score =  230 bits (586), Expect = 3e-57
 Identities = 128/247 (51%), Positives = 157/247 (63%), Gaps = 18/247 (7%)
 Frame = +3

Query: 1992 ARVVSPYFANADANAEEK-----------VRTKEGKIESVKLQVRIVSPYFCSTQQDK-- 2132
            +RVVSPYF     + +EK           V   E K E +KL V ++S +     ++K  
Sbjct: 167  SRVVSPYFTTNRNDTQEKKKKPEKDGREEVELGEKKEEHLKL-VDVLSRFAYKPMKEKTT 225

Query: 2133 ----EDENAVSLGGPTNSKV-QXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRS 2297
                E    + L G    K+ +            +L AA+K+DEAY+RKT DN W PP S
Sbjct: 226  VERAEKGRKLGLVGVGEKKMSKIVVRRKKIEKSKVLNAAEKRDEAYKRKTDDNKWNPPPS 285

Query: 2298 PFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVI 2477
               L+Q+DH  DPWRVLVICMLLN+TTG Q  RV+S FF LCPNAK ATEV+ E+I ++I
Sbjct: 286  EIRLIQQDHLHDPWRVLVICMLLNRTTGAQATRVISDFFSLCPNAKAATEVSPEEIVKII 345

Query: 2478 RSLGLYKKRAAGIQRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHML 2657
             +LGL+ KRA  IQRFS EYL E WTHVT L GVGKYAADAYAIFCTGKW+RV+P DHML
Sbjct: 346  HTLGLH-KRAQMIQRFSREYLEESWTHVTQLHGVGKYAADAYAIFCTGKWDRVKPADHML 404

Query: 2658 VKYWEFL 2678
              YW+FL
Sbjct: 405  NYYWKFL 411


>ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutrema salsugineum]
            gi|557108926|gb|ESQ49233.1| hypothetical protein
            EUTSA_v10020704mg [Eutrema salsugineum]
          Length = 456

 Score =  227 bits (578), Expect = 2e-56
 Identities = 149/370 (40%), Positives = 192/370 (51%), Gaps = 28/370 (7%)
 Frame = +3

Query: 1653 SLDDFFSRFAYTGGKCYMNSAKFGVCQSSSQTTETCGEGQMKTDTMKIVKDDLAAGNNAR 1832
            +LDD F+ FAY G +   N   FG    S+   +   + Q   D      D +   ++ R
Sbjct: 89   NLDDLFAGFAYKGVRKTRNV--FGSKPKSTLDDDDTVKEQDFDD------DSVFESHSER 140

Query: 1833 LCRADVGSQTAISTPNSCENAKMGERIVMINGGIASQRKMRAGANSCKGAGKEARVVSPY 2012
               ++  +Q    +P    +    +     +    S +  R     C+    + R VSPY
Sbjct: 141  QVCSEFQTQVRKVSPYFQGSTVSQQPKDGCDSDCVSSQNGRNYRKECRKVQAKVRRVSPY 200

Query: 2013 F-----ANADANAEEKVRTKEGKIESVKLQVRI--VSPYFCSTQQDKEDENAVSL----- 2156
            F     +  D+ +      ++ + ES KLQ ++  VSPYF  +   ++   +  L     
Sbjct: 201  FQASTFSQCDSESVASQSGRKYRKESSKLQAKVPRVSPYFQGSTVSEQPNPSRDLRQYFK 260

Query: 2157 --------------GGPTNS--KVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRP 2288
                          G   N   K +             L+  QK DEAY RK  DN W P
Sbjct: 261  VVKVSRYFHDMPADGTQVNEPQKERSRRMRKTPVVSPSLSQCQKTDEAYLRKMPDNTWVP 320

Query: 2289 PRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIE 2468
            PRSP NLLQEDH  DPWRVLVICMLLN+T+G QT  V+S  F LCP+AK+ATEV  ++IE
Sbjct: 321  PRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFVLCPDAKSATEVEEKEIE 380

Query: 2469 EVIRSLGLYKKRAAGIQRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPID 2648
             +I+ LGL KKRA  IQRFS EYL E WTHVT L GVGKYAADAYAIFC GKW+ VRP D
Sbjct: 381  SLIKPLGLQKKRAKMIQRFSLEYLQESWTHVTQLYGVGKYAADAYAIFCNGKWDCVRPAD 440

Query: 2649 HMLVKYWEFL 2678
            HML  YWEFL
Sbjct: 441  HMLNYYWEFL 450


>ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Capsella rubella]
            gi|482566361|gb|EOA30550.1| hypothetical protein
            CARUB_v10013672mg [Capsella rubella]
          Length = 456

 Score =  227 bits (578), Expect = 2e-56
 Identities = 128/249 (51%), Positives = 152/249 (61%)
 Frame = +3

Query: 1932 IASQRKMRAGANSCKGAGKEARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYF 2111
            ++SQ       +S K   K  RV   + A+AD+      R      + VK     VS YF
Sbjct: 214  VSSQSGGSYRRDSSKHQAKVRRVSRYFQASADSEQPNPPRDLRKYFKVVK-----VSRYF 268

Query: 2112 CSTQQDKEDENAVSLGGPTNSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPP 2291
                    D +A  +    + K +             L+ +QK DEAY RKT DN W PP
Sbjct: 269  -------HDVSADGIQVADSQKEKSRRVRKTPVVSPSLSPSQKTDEAYLRKTPDNTWVPP 321

Query: 2292 RSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEE 2471
            RSP NLLQEDH  DPWRVLVICMLLN+T+G QT  V+S  F LCP+AKTATEV  ++IE 
Sbjct: 322  RSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFTLCPDAKTATEVEEKEIES 381

Query: 2472 VIRSLGLYKKRAAGIQRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDH 2651
            +I+ LGL KKRA  IQRFS EYLNE WTHVT L G+GKYAADAYAIFC G W+RV+P DH
Sbjct: 382  LIKPLGLQKKRAKMIQRFSLEYLNESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPSDH 441

Query: 2652 MLVKYWEFL 2678
            ML  YWEFL
Sbjct: 442  MLNYYWEFL 450


>gb|ESW35972.1| hypothetical protein PHAVU_L0004001g [Phaseolus vulgaris]
          Length = 726

 Score =  224 bits (572), Expect = 1e-55
 Identities = 135/313 (43%), Positives = 180/313 (57%), Gaps = 36/313 (11%)
 Frame = +3

Query: 1848 VGSQTAISTPNSCENAKMGERIVMINGGIASQRKMRAGANSCKGAGKEA----------- 1994
            VG+++  +     E+  +G+  V+ NG I  ++K  +  N  +G GK+            
Sbjct: 412  VGAESCCTGGMLLEHKLLGDGNVIENGLINIKKKTIS--NKLQGNGKDTTSKVKPKKTKP 469

Query: 1995 ----------RVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDEN 2144
                      R VSPYF N   + +  V++K    ++V   +R VSPYF +      D  
Sbjct: 470  LVQKNAAHGIRYVSPYFHND--SGKMSVKSKPLVQKNVAHAIRYVSPYFHNDSGKNIDVK 527

Query: 2145 AVSLGGPTNS---------------KVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNP 2279
             +  G    S               + +             L+A+QK DEAY+RKT D  
Sbjct: 528  PLDEGSKFESIALHATENYVEDKPEENKSSCSEKSIEIKKNLSASQKWDEAYKRKTPDIT 587

Query: 2280 WRPPRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATE 2459
            W+PPRS   L+QEDHA DPWRVLVICMLLN+T+G+QT  ++S FF+LCP+AK+ TEV+ E
Sbjct: 588  WKPPRSATVLIQEDHAHDPWRVLVICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSRE 647

Query: 2460 KIEEVIRSLGLYKKRAAGIQRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVR 2639
            +IEE I++LG   KRA  ++R SEEYL+E WTHVT L GVGKYAADAYAIF TGK +RVR
Sbjct: 648  EIEETIKTLGFQHKRAKMLKRLSEEYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVR 707

Query: 2640 PIDHMLVKYWEFL 2678
            P DHML  YWEFL
Sbjct: 708  PTDHMLNYYWEFL 720


>gb|ESW35973.1| hypothetical protein PHAVU_L0004001g, partial [Phaseolus vulgaris]
          Length = 715

 Score =  223 bits (568), Expect = 4e-55
 Identities = 122/236 (51%), Positives = 153/236 (64%), Gaps = 1/236 (0%)
 Frame = +3

Query: 1974 KGAGKEARVVSPYFAN-ADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKEDENAV 2150
            K      R VSPYF N +  N + K   +  K ES+ L          +  +DK +EN  
Sbjct: 492  KNVAHAIRYVSPYFHNDSGKNIDVKPLDEGSKFESIALHATE------NYVEDKPEENKS 545

Query: 2151 SLGGPTNSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAF 2330
            S    +   ++             L+A+QK DEAY+RKT D  W+PPRS   L+QEDHA 
Sbjct: 546  SC---SEKSIEIKKN---------LSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAH 593

Query: 2331 DPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVIRSLGLYKKRAA 2510
            DPWRVLVICMLLN+T+G+QT  ++S FF+LCP+AK+ TEV+ E+IEE I++LG   KRA 
Sbjct: 594  DPWRVLVICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAK 653

Query: 2511 GIQRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVKYWEFL 2678
             ++R SEEYL+E WTHVT L GVGKYAADAYAIF TGK +RVRP DHML  YWEFL
Sbjct: 654  MLKRLSEEYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFL 709


>gb|EOY03082.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma
            cacao] gi|508711186|gb|EOY03083.1| DNA glycosylase
            superfamily protein, putative isoform 1 [Theobroma cacao]
          Length = 382

 Score =  222 bits (566), Expect = 6e-55
 Identities = 154/390 (39%), Positives = 195/390 (50%), Gaps = 43/390 (11%)
 Frame = +3

Query: 1653 SLDDFFSRFAYTGGKCYMNSAKFGVCQSS---------------SQTTETCGEGQMKTDT 1787
            +LD   S+FAY  G  Y    K     S                S   +T GE Q     
Sbjct: 10   NLDHLLSQFAYKSGHSYEKVLKESEIVSGQNGHRMRADVQVPKVSPYFQTSGEKQEMLSG 69

Query: 1788 MKIVKDDLAA----GNNARLCRADVGSQTAISTPNSCENAKM-------GERIVMINGG- 1931
                K +L +     N   L + DV  Q         +  K+       GE+  M++G  
Sbjct: 70   NCQPKVNLLSQVVHSNKKVLKKGDVNKQNGKRRRADAQVLKVSPYFQTSGEKQEMLSGNC 129

Query: 1932 ----------IASQRKMRAGANSCKGAGKEARV------VSPYFANADANAEEKVRTKEG 2063
                      + S +K+    +  K  GK  R       VSPY   +    + +  T + 
Sbjct: 130  KPKLNLISQVVHSYKKVLKKGDVNKQNGKRRRADAQVLKVSPYLQRSGEKQDMESGTSKP 189

Query: 2064 KIESVKLQVRIVSPYFCSTQQDKEDENAVSLGGPTNSKVQXXXXXXXXXXXXLLTAAQKK 2243
            K + VK      SPYF   + +        LGG   +               +L+A+QK+
Sbjct: 190  KHKVVK-----ASPYFLKNKDN-------ILGGMKKAM-------KPAGVKPVLSASQKR 230

Query: 2244 DEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLC 2423
            DEAY+RKT +N W PPRS   LLQEDH  DPWRVL+ICMLLN+T+G Q   VLS  F LC
Sbjct: 231  DEAYQRKTPNNTWIPPRSNAPLLQEDHTHDPWRVLLICMLLNKTSGNQARNVLSDLFTLC 290

Query: 2424 PNAKTATEVATEKIEEVIRSLGLYKKRAAGIQRFSEEYLNERWTHVTDLTGVGKYAADAY 2603
            P+AKTATEVAT +IE+ I+ LGL +KRA  IQR S+EYL + WTHVT+L GVGKYAADAY
Sbjct: 291  PDAKTATEVATGEIEKAIKPLGLQRKRAEMIQRMSQEYLWKEWTHVTELHGVGKYAADAY 350

Query: 2604 AIFCTGKWERVRPIDHMLVKYWEFLCGNLD 2693
            AIFCTGK +RV P DHML  YW FL G  D
Sbjct: 351  AIFCTGKGDRVTPSDHMLNYYWNFLYGPKD 380


>ref|XP_004309787.1| PREDICTED: uncharacterized protein LOC101298191 [Fragaria vesca
            subsp. vesca]
          Length = 410

 Score =  221 bits (564), Expect = 1e-54
 Identities = 108/152 (71%), Positives = 120/152 (78%)
 Frame = +3

Query: 2223 LTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVL 2402
            L+A+Q++DEAY R+T DN W PPRS   LLQEDH  DPWRVLVICMLLN+T GKQ   V+
Sbjct: 253  LSASQRRDEAYRRRTPDNTWIPPRSEIKLLQEDHYHDPWRVLVICMLLNRTQGKQLKGVI 312

Query: 2403 SKFFQLCPNAKTATEVATEKIEEVIRSLGLYKKRAAGIQRFSEEYLNERWTHVTDLTGVG 2582
            S FF LCP AK ATEVA   IEEVIRSLGL+ KRA  IQR SEEYL E WTHV +L GVG
Sbjct: 313  SNFFSLCPTAKAATEVALRDIEEVIRSLGLH-KRAEMIQRMSEEYLGESWTHVPELYGVG 371

Query: 2583 KYAADAYAIFCTGKWERVRPIDHMLVKYWEFL 2678
            KYAADAYAIFCTG WE+V+P DH L +YWEFL
Sbjct: 372  KYAADAYAIFCTGMWEQVKPTDHKLNEYWEFL 403


>ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101(2)-like [Glycine max]
          Length = 1424

 Score =  219 bits (559), Expect = 4e-54
 Identities = 119/233 (51%), Positives = 149/233 (63%), Gaps = 1/233 (0%)
 Frame = +3

Query: 1983 GKEARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCST-QQDKEDENAVSLG 2159
            G   R VSPYF N   N+ +KV  K     S    + +   + C    +DK +EN  +  
Sbjct: 1203 GHGIRYVSPYFCN---NSGKKVNVKPFDKGSTSESIAL---HTCKNFVEDKLEENKSNC- 1255

Query: 2160 GPTNSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPW 2339
              +N  ++               A++K DEAY+RKT DN W+PPRS   L+QEDH  DPW
Sbjct: 1256 --SNKSIEIKRFPP---------ASEKWDEAYKRKTPDNTWKPPRSEIVLIQEDHLHDPW 1304

Query: 2340 RVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVIRSLGLYKKRAAGIQ 2519
            RVLVICMLLN+T G QT +V+S FF+LCP+AK+ T+V  E+IE+ I++LG   KRA  +Q
Sbjct: 1305 RVLVICMLLNRTAGGQTKKVVSNFFKLCPDAKSCTQVTREEIEKTIKTLGFQHKRAEMLQ 1364

Query: 2520 RFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVKYWEFL 2678
            R SEEYL+E WTHVT L GVGKYAADAYAIF TG W+RV P DHML  YWEFL
Sbjct: 1365 RLSEEYLDESWTHVTQLHGVGKYAADAYAIFVTGMWDRVTPTDHMLNYYWEFL 1417


>gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score =  219 bits (559), Expect = 4e-54
 Identities = 149/361 (41%), Positives = 186/361 (51%), Gaps = 6/361 (1%)
 Frame = +3

Query: 1614 NQDSGGVFKMEKFSLDDFFSRFAYTGGKCYMNSAKFGVCQSSSQTTETCGEGQMKTDTMK 1793
            + D   + K    SLDD FS F Y G +       FG     S TT      Q+  D   
Sbjct: 74   HDDGCSLEKDNSNSLDDLFSGFVYKGVR-RRKRDDFG-----SITTSNLVSPQIADDDDD 127

Query: 1794 IVKDDLAAGNNARLCRAD---VGSQTAISTPNSCENAKMGERIVMINGGIASQRKMRAGA 1964
             V D           +A    V      ST + C++  +                 ++G 
Sbjct: 128  SVSDSHIERQECSKVQAKVPRVSPYFQASTISQCDSDIVS--------------SSQSGR 173

Query: 1965 NSCKGAGK---EARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQQDKE 2135
            N  KG+ K   +AR VSPYF  +   +E+  +  +G     K  V  VS YF        
Sbjct: 174  NYRKGSSKRQVKARRVSPYFQESTV-SEQPNQAPKGLRNYFK--VVKVSRYF-------- 222

Query: 2136 DENAVSLGGPTNSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQ 2315
              +A  +    + K +            +L+ +QK D+ Y RKT DN W PPRSP NLLQ
Sbjct: 223  --HADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCNLLQ 280

Query: 2316 EDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVIRSLGLY 2495
            EDH  DPWRVLVICMLLN+T+G QT  V+S  F LC +AKTATEV  E+IE +I+ LGL 
Sbjct: 281  EDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPLGLQ 340

Query: 2496 KKRAAGIQRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVKYWEF 2675
            KKR   IQR S EYL E WTHVT L GVGKYAADAYAIFC G W+RV+P DHML  YW++
Sbjct: 341  KKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDY 400

Query: 2676 L 2678
            L
Sbjct: 401  L 401


>ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Populus trichocarpa]
            gi|550326306|gb|EEE95947.2| hypothetical protein
            POPTR_0012s03470g [Populus trichocarpa]
          Length = 229

 Score =  218 bits (556), Expect = 9e-54
 Identities = 108/196 (55%), Positives = 134/196 (68%)
 Frame = +3

Query: 2115 STQQDKEDENAVSLGGPTNSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPR 2294
            S Q++ ++++A  +G     K +                  K DEAYERKTA+N W+PP+
Sbjct: 35   SNQEEDKEKDANVIGRSKKKKKKKEGTKTSLHSDTTSPYYNKFDEAYERKTAENTWKPPQ 94

Query: 2295 SPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEV 2474
            S F  L  +HA DPWRVLVICMLLN+T G +  RV++  F LCP+AK AT VATE+IE  
Sbjct: 95   SEFGFLH-NHAHDPWRVLVICMLLNRTAGTRAERVVADLFTLCPDAKAATGVATEEIERA 153

Query: 2475 IRSLGLYKKRAAGIQRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHM 2654
            I+SLGL K+RA  +QR SE+YL E WTHVT L GVGKYAADAYAIFCTGKWE+VRP DHM
Sbjct: 154  IKSLGLQKRRAKMVQRLSEDYLEEDWTHVTQLPGVGKYAADAYAIFCTGKWEQVRPNDHM 213

Query: 2655 LVKYWEFLCGNLDVKS 2702
            L +YWE+LC   +  S
Sbjct: 214  LNRYWEYLCSTKNALS 229


>ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
            lyrata] gi|297328398|gb|EFH58817.1| hypothetical protein
            ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  217 bits (552), Expect = 3e-53
 Identities = 104/152 (68%), Positives = 117/152 (76%)
 Frame = +3

Query: 2223 LTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVL 2402
            L+ +QK DEAY+RKT D  W PPRSP NLLQE H  DPWRVLVICMLLN+T+G QT  V+
Sbjct: 278  LSLSQKTDEAYQRKTPDKTWVPPRSPCNLLQEHHWHDPWRVLVICMLLNKTSGAQTRGVI 337

Query: 2403 SKFFQLCPNAKTATEVATEKIEEVIRSLGLYKKRAAGIQRFSEEYLNERWTHVTDLTGVG 2582
               F LCP+AKTATEV   +IE +I+ LGL KKRA  IQRFS EYL E WTHVT L G+G
Sbjct: 338  EDLFALCPDAKTATEVEEREIESLIKPLGLQKKRARMIQRFSLEYLQESWTHVTQLHGIG 397

Query: 2583 KYAADAYAIFCTGKWERVRPIDHMLVKYWEFL 2678
            KYAADAYAIFC G W+RV+P DHML  YWEFL
Sbjct: 398  KYAADAYAIFCNGNWDRVKPDDHMLNYYWEFL 429


>ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
            gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis
            thaliana] gi|332641100|gb|AEE74621.1| DNA glycosylase
            superfamily protein [Arabidopsis thaliana]
          Length = 445

 Score =  215 bits (548), Expect = 7e-53
 Identities = 123/245 (50%), Positives = 151/245 (61%), Gaps = 3/245 (1%)
 Frame = +3

Query: 1953 RAGANSCKGAGK---EARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQ 2123
            ++G N  KG+ K   + R VSPYF  +  + E+  +  +G     K  V  VS YF    
Sbjct: 208  QSGRNYRKGSSKRQVKVRRVSPYFQESTVS-EQPNQAPKGLRNYFK--VVKVSRYF---- 260

Query: 2124 QDKEDENAVSLGGPTNSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPF 2303
                  +A  +    + K +            +L+ +QK D+ Y RKT DN W PPRSP 
Sbjct: 261  ------HADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPC 314

Query: 2304 NLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVIRS 2483
            NLLQEDH  DPWRVLVICMLLN+T+G QT  V+S  F LC +AKTATEV  E+IE +I+ 
Sbjct: 315  NLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKP 374

Query: 2484 LGLYKKRAAGIQRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVK 2663
            LGL KKR   IQR S EYL E WTHVT L GVGKYAADAYAIFC G W+RV+P DHML  
Sbjct: 375  LGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNY 434

Query: 2664 YWEFL 2678
            YW++L
Sbjct: 435  YWDYL 439


>gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score =  215 bits (548), Expect = 7e-53
 Identities = 123/245 (50%), Positives = 151/245 (61%), Gaps = 3/245 (1%)
 Frame = +3

Query: 1953 RAGANSCKGAGK---EARVVSPYFANADANAEEKVRTKEGKIESVKLQVRIVSPYFCSTQ 2123
            ++G N  KG+ K   + R VSPYF  +  + E+  +  +G     K  V  VS YF    
Sbjct: 182  QSGRNYRKGSSKRQVKVRRVSPYFQESTVS-EQPNQAPKGLRNYFK--VVKVSRYF---- 234

Query: 2124 QDKEDENAVSLGGPTNSKVQXXXXXXXXXXXXLLTAAQKKDEAYERKTADNPWRPPRSPF 2303
                  +A  +    + K +            +L+ +QK D+ Y RKT DN W PPRSP 
Sbjct: 235  ------HADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPC 288

Query: 2304 NLLQEDHAFDPWRVLVICMLLNQTTGKQTGRVLSKFFQLCPNAKTATEVATEKIEEVIRS 2483
            NLLQEDH  DPWRVLVICMLLN+T+G QT  V+S  F LC +AKTATEV  E+IE +I+ 
Sbjct: 289  NLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKP 348

Query: 2484 LGLYKKRAAGIQRFSEEYLNERWTHVTDLTGVGKYAADAYAIFCTGKWERVRPIDHMLVK 2663
            LGL KKR   IQR S EYL E WTHVT L GVGKYAADAYAIFC G W+RV+P DHML  
Sbjct: 349  LGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNY 408

Query: 2664 YWEFL 2678
            YW++L
Sbjct: 409  YWDYL 413


>emb|CBI29440.3| unnamed protein product [Vitis vinifera]
          Length = 599

 Score =  215 bits (547), Expect = 1e-52
 Identities = 170/515 (33%), Positives = 238/515 (46%), Gaps = 10/515 (1%)
 Frame = +3

Query: 1164 VSPYFIKKCMKDENRYSQLDDYLGVDPATVTEDRDEGLKARKDAETNTSHGIKILPHADV 1343
            +SPYF +K +K E RYS+       +      + D   K +K                  
Sbjct: 138  ISPYF-QKAVKQEERYSE-------EHCNFPNETDNKKKKKKK----------------- 172

Query: 1344 KKRKQKKEYEETTNPPYFRKICVKNDNKDSKSDENTGIE-----SEKVAEGRVEXXXXXX 1508
            +KRK    +E        RKI V+N   D   D+  G+E     S   +E +V       
Sbjct: 173  RKRKGNDTFESLKEE---RKINVQNVKMD---DQKMGVELPVFNSNSSSERKVSPFCQKA 226

Query: 1509 XXXXXXXXXXXXENSSDGVVH-FPDGNLKVEPDSISNQDSGGVFKMEKFSLDDFFSRFAY 1685
                         +S   VV  + +   +   DS++N  S       +  +  +F +   
Sbjct: 227  VKEEEEMNLEAQVDSKPTVVSPYFEKKKRAVSDSVANSSSDS---NSQRLVSPYFQKAVK 283

Query: 1686 TGGKCYMNSAKFGVCQSSSQTTETCGEGQMKTDTMKIVKDDLAAGNNARLCRADVGSQTA 1865
               +       F       +T +   +G    ++ K  K  +    N R+    +  Q  
Sbjct: 284  QQERNPEEHCNFPNKIERRKTKKRKKKGNDTVESFKEQKKKINV-QNVRVEDQKMEVQQP 342

Query: 1866 ISTPNSCENAKMG---ERIVMINGGIASQRKMRAGANSCKGAGKEARVVSPYFANADANA 2036
            IS+ NS    K+    +R V       S+   + G  + +   +E +  +    NA    
Sbjct: 343  ISSSNSNSQKKVSPYCQRAVKEEEEGNSEEDTKKGHENEESFKEEGKRKT----NAQNVT 398

Query: 2037 EEKVRTKEGKIESVKLQVRIVSPYFCSTQQD-KEDENAVSLGGPTNSKVQXXXXXXXXXX 2213
             E  + K  K +S    +R+VSPYF   ++D K+   A+                     
Sbjct: 399  MEDEKMKLPKKKSRAPPIRVVSPYFPINEEDAKKPVRAMFFN------------------ 440

Query: 2214 XXLLTAAQKKDEAYERKTADNPWRPPRSPFNLLQEDHAFDPWRVLVICMLLNQTTGKQTG 2393
                    K + AY RK+ DN W+PP S F+LLQEDH  DPWRV+VICMLLN T+G Q  
Sbjct: 441  --------KLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLLNCTSGLQAS 492

Query: 2394 RVLSKFFQLCPNAKTATEVATEKIEEVIRSLGLYKKRAAGIQRFSEEYLNERWTHVTDLT 2573
            RV+S  F LCP+AKTAT+V TE IE+VI +LGL KKRAA IQRFS EYL++ WTHVT L 
Sbjct: 493  RVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQLH 552

Query: 2574 GVGKYAADAYAIFCTGKWERVRPIDHMLVKYWEFL 2678
            G+GKYAADAYAIFC+G W  V P DHMLVKYW++L
Sbjct: 553  GIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYL 587

