BLASTX nr result

ID: Akebia27_contig00026602

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00026602
         (1169 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...   262   2e-67
ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citr...   261   4e-67
gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabi...   257   6e-66
ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX ho...   254   5e-65
emb|CBI29440.3| unnamed protein product [Vitis vinifera]              250   9e-64
ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244...   250   9e-64
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...   247   6e-63
ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutr...   246   1e-62
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                 246   1e-62
ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Caps...   246   2e-62
ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsi...   245   3e-62
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...   245   3e-62
ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101...   244   5e-62
ref|XP_007163979.1| hypothetical protein PHAVU_L0004001g, partia...   238   4e-60
ref|XP_007163978.1| hypothetical protein PHAVU_L0004001g [Phaseo...   238   4e-60
ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Popu...   238   4e-60
ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein...   237   8e-60
ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255...   236   1e-59
ref|XP_007032156.1| DNA glycosylase superfamily protein, putativ...   236   2e-59
emb|CAN67143.1| hypothetical protein VITISV_044254 [Vitis vinifera]   232   3e-58

>ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis]
           gi|223546492|gb|EEF47991.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 608

 Score =  262 bits (670), Expect = 2e-67
 Identities = 125/195 (64%), Positives = 148/195 (75%)
 Frame = -2

Query: 751 PYFREKTLEEGVEPIENYLLERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDN 572
           PYF++   +E  E  ++ +++  +  +K P+K +  A  S +L+ ++K  EAYRRK+PDN
Sbjct: 408 PYFQKVPKQEEEEAADSNMIDNKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRRKTPDN 467

Query: 571 TWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVAT 392
           TWKPP S F LLQE H  DPWRVLVICMLLN T G+Q R VI++ F LCPDAK ATE  T
Sbjct: 468 TWKPPRSDFGLLQEDHASDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAATEAKT 527

Query: 391 EEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQV 212
           EEIEK+I  LGL  KRA MIQR S+EYL D WTHVTQLHGVGKYAADAYAIFCTGKWDQV
Sbjct: 528 EEIEKIIVPLGLQKKRAVMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTGKWDQV 587

Query: 211 RPNDHMLNKYWDYLH 167
           RP DHMLN YWD+LH
Sbjct: 588 RPKDHMLNYYWDFLH 602


>ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citrus clementina]
           gi|568883956|ref|XP_006494704.1| PREDICTED:
           transcriptional regulator ATRX homolog isoform X2
           [Citrus sinensis] gi|557525860|gb|ESR37166.1|
           hypothetical protein CICLE_v10028470mg [Citrus
           clementina]
          Length = 439

 Score =  261 bits (667), Expect = 4e-67
 Identities = 132/226 (58%), Positives = 155/226 (68%), Gaps = 2/226 (0%)
 Frame = -2

Query: 841 ISPYFQTTRAEEVEINEEDKPNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEKQP 662
           +SPYFQ  +A  VE    D          SPYF+    +    P    +   N + E++ 
Sbjct: 210 VSPYFQRQKAGNVERKNHDTSTMAQARKVSPYFQN---QNSTTPAAATVQVHNQQQEEKE 266

Query: 661 KK--VQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICM 488
           K   V+ + S S +L  +QK DEAY RK PDNTW PP S   LLQ +H  DPWRV+VICM
Sbjct: 267 KDIAVKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIVLLQHEHVHDPWRVIVICM 326

Query: 487 LLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYL 308
           LLNRT G QA RVI++LF LCPDAKTATEV  EEIEK+I  LGL  KRA MI+RFS+EYL
Sbjct: 327 LLNRTTGLQAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL 386

Query: 307 EDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYL 170
            + WTHVTQLHGVGKYAADAYAIFCTGKWD+VRP DHMLN YW++L
Sbjct: 387 GESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPTDHMLNYYWEFL 432


>gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabilis]
          Length = 418

 Score =  257 bits (657), Expect = 6e-66
 Identities = 144/281 (51%), Positives = 178/281 (63%), Gaps = 21/281 (7%)
 Frame = -2

Query: 934 STDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTR--------------AEEVEI 797
           S  E E   +K  KK+I    ++  +  SRV+SPYF T R               EEVE+
Sbjct: 142 SRKEVEIAGKKRRKKNI---DRKDDVAGSRVVSPYFTTNRNDTQEKKKKPEKDGREEVEL 198

Query: 796 NEEDKPNXXXXXXXSPY----FREKTLEEGVEPIENYLLERNYKCEKQPKKV---QSRAS 638
            E+ + +       S +     +EKT  E  E  +   L      EK+  K+   + +  
Sbjct: 199 GEKKEEHLKLVDVLSRFAYKPMKEKTTVERAE--KGRKLGLVGVGEKKMSKIVVRRKKIE 256

Query: 637 ASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQA 458
            S  LN ++K DEAY+RK+ DN W PPPS   L+Q+ H  DPWRVLVICMLLNRT G QA
Sbjct: 257 KSKVLNAAEKRDEAYKRKTDDNKWNPPPSEIRLIQQDHLHDPWRVLVICMLLNRTTGAQA 316

Query: 457 RRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQL 278
            RVI++ F LCP+AK ATEV+ EEI K+I  LGL HKRA+MIQRFS+EYLE+ WTHVTQL
Sbjct: 317 TRVISDFFSLCPNAKAATEVSPEEIVKIIHTLGL-HKRAQMIQRFSREYLEESWTHVTQL 375

Query: 277 HGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLHIKRD 155
           HGVGKYAADAYAIFCTGKWD+V+P DHMLN YW +LH  RD
Sbjct: 376 HGVGKYAADAYAIFCTGKWDRVKPADHMLNYYWKFLHSIRD 416


>ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1
           [Citrus sinensis]
          Length = 446

 Score =  254 bits (649), Expect = 5e-65
 Identities = 132/233 (56%), Positives = 155/233 (66%), Gaps = 9/233 (3%)
 Frame = -2

Query: 841 ISPYFQTTRAEEVEINEEDKPNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEKQP 662
           +SPYFQ  +A  VE    D          SPYF+    +    P    +   N + E++ 
Sbjct: 210 VSPYFQRQKAGNVERKNHDTSTMAQARKVSPYFQN---QNSTTPAAATVQVHNQQQEEKE 266

Query: 661 KK--VQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICM 488
           K   V+ + S S +L  +QK DEAY RK PDNTW PP S   LLQ +H  DPWRV+VICM
Sbjct: 267 KDIAVKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIVLLQHEHVHDPWRVIVICM 326

Query: 487 LLNRTAGRQ-------ARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQ 329
           LLNRT G Q       A RVI++LF LCPDAKTATEV  EEIEK+I  LGL  KRA MI+
Sbjct: 327 LLNRTTGLQEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIK 386

Query: 328 RFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYL 170
           RFS+EYL + WTHVTQLHGVGKYAADAYAIFCTGKWD+VRP DHMLN YW++L
Sbjct: 387 RFSQEYLGESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPTDHMLNYYWEFL 439


>emb|CBI29440.3| unnamed protein product [Vitis vinifera]
          Length = 599

 Score =  250 bits (638), Expect = 9e-64
 Identities = 138/278 (49%), Positives = 169/278 (60%), Gaps = 10/278 (3%)
 Frame = -2

Query: 970  KQEKSRAENFNCSTDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINE 791
            K++K +    N   ++ +   Q+    S  NS K+        +SPY Q    EE E N 
Sbjct: 319  KEQKKKINVQNVRVEDQKMEVQQPISSSNSNSQKK--------VSPYCQRAVKEEEEGNS 370

Query: 790  ED--KPNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEKQPKKVQSRASASHSLNV 617
            E+  K             + KT  + V   +  +     K    P +V S     +  + 
Sbjct: 371  EEDTKKGHENEESFKEEGKRKTNAQNVTMEDEKMKLPKKKSRAPPIRVVSPYFPINEEDA 430

Query: 616  SQ--------KLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQ 461
             +        KL+ AYRRKSPDN WKPPPSHF LLQE H+ DPWRV+VICMLLN T+G Q
Sbjct: 431  KKPVRAMFFNKLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLLNCTSGLQ 490

Query: 460  ARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQ 281
            A RVI++LF LCPDAKTAT+V TE IEKVI+ LGL  KRA MIQRFS+EYL+D WTHVTQ
Sbjct: 491  ASRVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQ 550

Query: 280  LHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLH 167
            LHG+GKYAADAYAIFC+G W  V PNDHML KYW YL+
Sbjct: 551  LHGIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYLY 588


>ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244192 [Vitis vinifera]
          Length = 536

 Score =  250 bits (638), Expect = 9e-64
 Identities = 138/278 (49%), Positives = 169/278 (60%), Gaps = 10/278 (3%)
 Frame = -2

Query: 970  KQEKSRAENFNCSTDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINE 791
            K++K +    N   ++ +   Q+    S  NS K+        +SPY Q    EE E N 
Sbjct: 256  KEQKKKINVQNVRVEDQKMEVQQPISSSNSNSQKK--------VSPYCQRAVKEEEEGNS 307

Query: 790  ED--KPNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEKQPKKVQSRASASHSLNV 617
            E+  K             + KT  + V   +  +     K    P +V S     +  + 
Sbjct: 308  EEDTKKGHENEESFKEEGKRKTNAQNVTMEDEKMKLPKKKSRAPPIRVVSPYFPINEEDA 367

Query: 616  SQ--------KLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQ 461
             +        KL+ AYRRKSPDN WKPPPSHF LLQE H+ DPWRV+VICMLLN T+G Q
Sbjct: 368  KKPVRAMFFNKLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLLNCTSGLQ 427

Query: 460  ARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQ 281
            A RVI++LF LCPDAKTAT+V TE IEKVI+ LGL  KRA MIQRFS+EYL+D WTHVTQ
Sbjct: 428  ASRVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQ 487

Query: 280  LHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLH 167
            LHG+GKYAADAYAIFC+G W  V PNDHML KYW YL+
Sbjct: 488  LHGIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYLY 525


>ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp.
            lyrata] gi|297328398|gb|EFH58817.1| hypothetical protein
            ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  247 bits (631), Expect = 6e-63
 Identities = 137/308 (44%), Positives = 181/308 (58%), Gaps = 30/308 (9%)
 Frame = -2

Query: 994  YFQTPTPQKQEKSRAENFNCSTDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTR 815
            YFQ  T  +Q K   ++ +  +    N  +   K  +P            ++SPYFQ++ 
Sbjct: 139  YFQGSTVSQQSKEECDSDSVCSQSGRNCSKVQAK--VP------------IVSPYFQSST 184

Query: 814  AEE-----VEINEEDK-------PNXXXXXXXSPYFREKTLEEG--------------VE 713
              +     V  ++  K                SPYF+E T+ E               V 
Sbjct: 185  ISQCGSDIVSSSQSGKNYRRGSSKRQAKVRRDSPYFQESTVSEQPSQAPPRDLRQYFKVV 244

Query: 712  PIENYL----LERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHF 545
             +  Y     ++ N   +++  +V+     S SL++SQK DEAY+RK+PD TW PP S  
Sbjct: 245  KVSRYFHADGIQVNESQKEKSTRVRKTPVVSPSLSLSQKTDEAYQRKTPDKTWVPPRSPC 304

Query: 544  TLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQI 365
             LLQE H+ DPWRVLVICMLLN+T+G Q R VI +LF LCPDAKTATEV   EIE +I+ 
Sbjct: 305  NLLQEHHWHDPWRVLVICMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIESLIKP 364

Query: 364  LGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNK 185
            LGL  KRA+MIQRFS EYL++ WTHVTQLHG+GKYAADAYAIFC G WD+V+P+DHMLN 
Sbjct: 365  LGLQKKRARMIQRFSLEYLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDHMLNY 424

Query: 184  YWDYLHIK 161
            YW++L I+
Sbjct: 425  YWEFLRIR 432


>ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutrema salsugineum]
            gi|557108926|gb|ESQ49233.1| hypothetical protein
            EUTSA_v10020704mg [Eutrema salsugineum]
          Length = 456

 Score =  246 bits (628), Expect = 1e-62
 Identities = 144/310 (46%), Positives = 179/310 (57%), Gaps = 32/310 (10%)
 Frame = -2

Query: 994  YFQTPTPQKQEKSRAENFNCSTDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTR 815
            YFQ  T  +Q K   ++   S+    N  ++           RK   K R +SPYFQ + 
Sbjct: 156  YFQGSTVSQQPKDGCDSDCVSSQNGRNYRKEC----------RKVQAKVRRVSPYFQAST 205

Query: 814  AEEVE-----------INEEDKPNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEK 668
              + +             +E           SPYF+  T+ E   P  +  L + +K  K
Sbjct: 206  FSQCDSESVASQSGRKYRKESSKLQAKVPRVSPYFQGSTVSEQPNPSRD--LRQYFKVVK 263

Query: 667  ----------------QPKKVQSRAS-----ASHSLNVSQKLDEAYRRKSPDNTWKPPPS 551
                            +P+K +SR        S SL+  QK DEAY RK PDNTW PP S
Sbjct: 264  VSRYFHDMPADGTQVNEPQKERSRRMRKTPVVSPSLSQCQKTDEAYLRKMPDNTWVPPRS 323

Query: 550  HFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVI 371
               LLQE H+ DPWRVLVICMLLN+T+G Q R VI++LF LCPDAK+ATEV  +EIE +I
Sbjct: 324  PCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFVLCPDAKSATEVEEKEIESLI 383

Query: 370  QILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHML 191
            + LGL  KRAKMIQRFS EYL++ WTHVTQL+GVGKYAADAYAIFC GKWD VRP DHML
Sbjct: 384  KPLGLQKKRAKMIQRFSLEYLQESWTHVTQLYGVGKYAADAYAIFCNGKWDCVRPADHML 443

Query: 190  NKYWDYLHIK 161
            N YW++L I+
Sbjct: 444  NYYWEFLRIR 453


>gb|AAO22623.1| unknown protein [Arabidopsis thaliana]
          Length = 407

 Score =  246 bits (628), Expect = 1e-62
 Identities = 134/260 (51%), Positives = 167/260 (64%), Gaps = 4/260 (1%)
 Frame = -2

Query: 928 DEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINEEDKPNXXXXXXXSP 749
           D D     +S +     SSKR+   K+R +SPYFQ +   E       +PN       + 
Sbjct: 162 DSDIVSSSQSGRNYRKGSSKRQV--KARRVSPYFQESTVSE-------QPNQAPKGLRN- 211

Query: 748 YFREKTLEEGVEPIENYL----LERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKS 581
           YF+       V  +  Y     ++ N   +++ + V+     S  L++SQK D+ Y RK+
Sbjct: 212 YFK-------VVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKT 264

Query: 580 PDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATE 401
           PDNTW PP S   LLQE H+ DPWRVLVICMLLN+T+G Q R VI++LF LC DAKTATE
Sbjct: 265 PDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATE 324

Query: 400 VATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKW 221
           V  EEIE +I+ LGL  KR KMIQR S EYL++ WTHVTQLHGVGKYAADAYAIFC G W
Sbjct: 325 VKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNW 384

Query: 220 DQVRPNDHMLNKYWDYLHIK 161
           D+V+PNDHMLN YWDYL I+
Sbjct: 385 DRVKPNDHMLNYYWDYLRIR 404


>ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Capsella rubella]
           gi|482566361|gb|EOA30550.1| hypothetical protein
           CARUB_v10013672mg [Capsella rubella]
          Length = 456

 Score =  246 bits (627), Expect = 2e-62
 Identities = 128/240 (53%), Positives = 163/240 (67%)
 Frame = -2

Query: 880 NSSKRKKIDKSRVISPYFQTTRAEEVEINEEDKPNXXXXXXXSPYFREKTLEEGVEPIEN 701
           +SSK +   K R +S YFQ +   E      D          S YF + + + G++  ++
Sbjct: 225 DSSKHQA--KVRRVSRYFQASADSEQPNPPRDLRKYFKVVKVSRYFHDVSAD-GIQVADS 281

Query: 700 YLLERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHF 521
                    +++ ++V+     S SL+ SQK DEAY RK+PDNTW PP S   LLQE H+
Sbjct: 282 Q--------KEKSRRVRKTPVVSPSLSPSQKTDEAYLRKTPDNTWVPPRSPCNLLQEDHW 333

Query: 520 KDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRA 341
            DPWRVLVICMLLN+T+G Q R VI++LF LCPDAKTATEV  +EIE +I+ LGL  KRA
Sbjct: 334 HDPWRVLVICMLLNKTSGAQTRGVISDLFTLCPDAKTATEVEEKEIESLIKPLGLQKKRA 393

Query: 340 KMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLHIK 161
           KMIQRFS EYL + WTHVTQLHG+GKYAADAYAIFC G WD+V+P+DHMLN YW++L I+
Sbjct: 394 KMIQRFSLEYLNESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPSDHMLNYYWEFLRIR 453


>ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsis thaliana]
           gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis
           thaliana] gi|332641100|gb|AEE74621.1| DNA glycosylase
           superfamily protein [Arabidopsis thaliana]
          Length = 445

 Score =  245 bits (625), Expect = 3e-62
 Identities = 134/260 (51%), Positives = 166/260 (63%), Gaps = 4/260 (1%)
 Frame = -2

Query: 928 DEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINEEDKPNXXXXXXXSP 749
           D D     +S +     SSKR+   K R +SPYFQ +   E       +PN       + 
Sbjct: 200 DSDIVSSSQSGRNYRKGSSKRQV--KVRRVSPYFQESTVSE-------QPNQAPKGLRN- 249

Query: 748 YFREKTLEEGVEPIENYL----LERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKS 581
           YF+       V  +  Y     ++ N   +++ + V+     S  L++SQK D+ Y RK+
Sbjct: 250 YFK-------VVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKT 302

Query: 580 PDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATE 401
           PDNTW PP S   LLQE H+ DPWRVLVICMLLN+T+G Q R VI++LF LC DAKTATE
Sbjct: 303 PDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATE 362

Query: 400 VATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKW 221
           V  EEIE +I+ LGL  KR KMIQR S EYL++ WTHVTQLHGVGKYAADAYAIFC G W
Sbjct: 363 VKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNW 422

Query: 220 DQVRPNDHMLNKYWDYLHIK 161
           D+V+PNDHMLN YWDYL I+
Sbjct: 423 DRVKPNDHMLNYYWDYLRIR 442


>gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana]
          Length = 419

 Score =  245 bits (625), Expect = 3e-62
 Identities = 134/260 (51%), Positives = 166/260 (63%), Gaps = 4/260 (1%)
 Frame = -2

Query: 928 DEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINEEDKPNXXXXXXXSP 749
           D D     +S +     SSKR+   K R +SPYFQ +   E       +PN       + 
Sbjct: 174 DSDIVSSSQSGRNYRKGSSKRQV--KVRRVSPYFQESTVSE-------QPNQAPKGLRN- 223

Query: 748 YFREKTLEEGVEPIENYL----LERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKS 581
           YF+       V  +  Y     ++ N   +++ + V+     S  L++SQK D+ Y RK+
Sbjct: 224 YFK-------VVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKT 276

Query: 580 PDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATE 401
           PDNTW PP S   LLQE H+ DPWRVLVICMLLN+T+G Q R VI++LF LC DAKTATE
Sbjct: 277 PDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATE 336

Query: 400 VATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKW 221
           V  EEIE +I+ LGL  KR KMIQR S EYL++ WTHVTQLHGVGKYAADAYAIFC G W
Sbjct: 337 VKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNW 396

Query: 220 DQVRPNDHMLNKYWDYLHIK 161
           D+V+PNDHMLN YWDYL I+
Sbjct: 397 DRVKPNDHMLNYYWDYLRIR 416


>ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101(2)-like [Glycine max]
          Length = 1424

 Score =  244 bits (623), Expect = 5e-62
 Identities = 119/227 (52%), Positives = 150/227 (66%)
 Frame = -2

Query: 847  RVISPYFQTTRAEEVEINEEDKPNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEK 668
            R +SPYF     ++V +   DK +               L      +E+ L E    C  
Sbjct: 1207 RYVSPYFCNNSGKKVNVKPFDKGSTSESI---------ALHTCKNFVEDKLEENKSNCSN 1257

Query: 667  QPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICM 488
            +  +++    AS      +K DEAY+RK+PDNTWKPP S   L+QE H  DPWRVLVICM
Sbjct: 1258 KSIEIKRFPPAS------EKWDEAYKRKTPDNTWKPPRSEIVLIQEDHLHDPWRVLVICM 1311

Query: 487  LLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYL 308
            LLNRTAG Q ++V++N F+LCPDAK+ T+V  EEIEK I+ LG  HKRA+M+QR S+EYL
Sbjct: 1312 LLNRTAGGQTKKVVSNFFKLCPDAKSCTQVTREEIEKTIKTLGFQHKRAEMLQRLSEEYL 1371

Query: 307  EDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLH 167
            ++ WTHVTQLHGVGKYAADAYAIF TG WD+V P DHMLN YW++LH
Sbjct: 1372 DESWTHVTQLHGVGKYAADAYAIFVTGMWDRVTPTDHMLNYYWEFLH 1418


>ref|XP_007163979.1| hypothetical protein PHAVU_L0004001g, partial [Phaseolus vulgaris]
            gi|561039879|gb|ESW35973.1| hypothetical protein
            PHAVU_L0004001g, partial [Phaseolus vulgaris]
          Length = 715

 Score =  238 bits (607), Expect = 4e-60
 Identities = 116/226 (51%), Positives = 152/226 (67%)
 Frame = -2

Query: 847  RVISPYFQTTRAEEVEINEEDKPNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEK 668
            R +SPYF     + +++   D+ +          F    L      +E+   E    C +
Sbjct: 499  RYVSPYFHNDSGKNIDVKPLDEGSK---------FESIALHATENYVEDKPEENKSSCSE 549

Query: 667  QPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICM 488
            +  +++   SAS      QK DEAY+RK+PD TWKPP S   L+QE H  DPWRVLVICM
Sbjct: 550  KSIEIKKNLSAS------QKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPWRVLVICM 603

Query: 487  LLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYL 308
            LLNRT+GRQ + ++++ F+LCPDAK+ TEV+ EEIE+ I+ LG  HKRAKM++R S+EYL
Sbjct: 604  LLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLKRLSEEYL 663

Query: 307  EDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYL 170
            ++ WTHVTQLHGVGKYAADAYAIF TGK D+VRP DHMLN YW++L
Sbjct: 664  DESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFL 709


>ref|XP_007163978.1| hypothetical protein PHAVU_L0004001g [Phaseolus vulgaris]
            gi|561039878|gb|ESW35972.1| hypothetical protein
            PHAVU_L0004001g [Phaseolus vulgaris]
          Length = 726

 Score =  238 bits (607), Expect = 4e-60
 Identities = 116/226 (51%), Positives = 152/226 (67%)
 Frame = -2

Query: 847  RVISPYFQTTRAEEVEINEEDKPNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEK 668
            R +SPYF     + +++   D+ +          F    L      +E+   E    C +
Sbjct: 510  RYVSPYFHNDSGKNIDVKPLDEGSK---------FESIALHATENYVEDKPEENKSSCSE 560

Query: 667  QPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICM 488
            +  +++   SAS      QK DEAY+RK+PD TWKPP S   L+QE H  DPWRVLVICM
Sbjct: 561  KSIEIKKNLSAS------QKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPWRVLVICM 614

Query: 487  LLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYL 308
            LLNRT+GRQ + ++++ F+LCPDAK+ TEV+ EEIE+ I+ LG  HKRAKM++R S+EYL
Sbjct: 615  LLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLKRLSEEYL 674

Query: 307  EDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYL 170
            ++ WTHVTQLHGVGKYAADAYAIF TGK D+VRP DHMLN YW++L
Sbjct: 675  DESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFL 720


>ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Populus trichocarpa]
           gi|550326306|gb|EEE95947.2| hypothetical protein
           POPTR_0012s03470g [Populus trichocarpa]
          Length = 229

 Score =  238 bits (607), Expect = 4e-60
 Identities = 123/222 (55%), Positives = 152/222 (68%), Gaps = 3/222 (1%)
 Frame = -2

Query: 826 QTTRAEEVEINEEDKPNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEKQPKKVQS 647
           +++R  E+++ E    N        P   +   EE  E   N +     + +K+ KK + 
Sbjct: 8   ESSRVGELDLEECSNSNKAKRRKKKPISNQ---EEDKEKDANVI----GRSKKKKKKKEG 60

Query: 646 RASASHSLNVS---QKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNR 476
             ++ HS   S    K DEAY RK+ +NTWKPP S F  L   H  DPWRVLVICMLLNR
Sbjct: 61  TKTSLHSDTTSPYYNKFDEAYERKTAENTWKPPQSEFGFLHN-HAHDPWRVLVICMLLNR 119

Query: 475 TAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGW 296
           TAG +A RV+A+LF LCPDAK AT VATEEIE+ I+ LGL  +RAKM+QR S++YLE+ W
Sbjct: 120 TAGTRAERVVADLFTLCPDAKAATGVATEEIERAIKSLGLQKRRAKMVQRLSEDYLEEDW 179

Query: 295 THVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYL 170
           THVTQL GVGKYAADAYAIFCTGKW+QVRPNDHMLN+YW+YL
Sbjct: 180 THVTQLPGVGKYAADAYAIFCTGKWEQVRPNDHMLNRYWEYL 221


>ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein 4-like, partial
           [Solanum tuberosum]
          Length = 222

 Score =  237 bits (604), Expect = 8e-60
 Identities = 124/233 (53%), Positives = 150/233 (64%), Gaps = 4/233 (1%)
 Frame = -2

Query: 853 KSRVISPYFQT-TRAEEVEINEE---DKPNXXXXXXXSPYFREKTLEEGVEPIENYLLER 686
           K RV+SPYF   T  EE+++ ++      N       SPYF+                  
Sbjct: 4   KVRVVSPYFANLTVGEEIKVGKDRSNPSKNCLNGRKVSPYFQNA---------------- 47

Query: 685 NYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWR 506
            Y+  K+ +K   R      L+  QK DEAY R+S DNTW PP SHF LLQE H  DPWR
Sbjct: 48  -YRENKKSRKGSKRQKPC--LSAFQKRDEAYLRRSEDNTWVPPRSHFNLLQENHAHDPWR 104

Query: 505 VLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQR 326
           VLVICMLLN T G Q +RV+   F LCP+A  ATEVA E+IEK+++ LGL+ KR+  I R
Sbjct: 105 VLVICMLLNCTTGVQVKRVVDEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLAIPR 164

Query: 325 FSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLH 167
            S+EYL + WTHVTQLHG+GKYAADAYAIFCTGKWDQV PNDHML KYW++LH
Sbjct: 165 LSQEYLGETWTHVTQLHGIGKYAADAYAIFCTGKWDQVHPNDHMLTKYWEFLH 217


>ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255935 [Solanum
            lycopersicum]
          Length = 544

 Score =  236 bits (602), Expect = 1e-59
 Identities = 129/273 (47%), Positives = 165/273 (60%), Gaps = 6/273 (2%)
 Frame = -2

Query: 967  QEKSRA--ENFNCSTDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRA-EEVEI 797
            ++K+RA    F  S + +  + +    + +   + +K   K RV+SPYF   +  EE+++
Sbjct: 284  EQKARAVCPYFLNSRNGETEMKKGRSVECVKKRNDKKLRTKVRVVSPYFANLKVGEEIKV 343

Query: 796  NEEDK---PNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEKQPKKVQSRASASHS 626
             ++      N       SPYF+    E+    I +   +R   C                
Sbjct: 344  GKDSSNASKNCLNGRKVSPYFQNAYREKKKSTIGS---KRQKPC---------------- 384

Query: 625  LNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVI 446
            L+ SQK DEAY R+S DN W PP SHF LLQE H  DPWRVLVICMLLN T G Q RRV+
Sbjct: 385  LSASQKRDEAYLRRSEDNMWVPPRSHFNLLQENHAHDPWRVLVICMLLNCTTGVQVRRVV 444

Query: 445  ANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVG 266
               F LCP+A  ATEVA E+IEK+++ LGL+ KR+  I R S+EYL   WTHVTQLHG+G
Sbjct: 445  DEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLSIPRLSQEYLGKNWTHVTQLHGIG 504

Query: 265  KYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLH 167
            KYAADAYAIFCTG WDQV PNDHML KYW++LH
Sbjct: 505  KYAADAYAIFCTGNWDQVHPNDHMLTKYWEFLH 537


>ref|XP_007032156.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma
           cacao] gi|590648404|ref|XP_007032157.1| DNA glycosylase
           superfamily protein, putative isoform 1 [Theobroma
           cacao] gi|508711185|gb|EOY03082.1| DNA glycosylase
           superfamily protein, putative isoform 1 [Theobroma
           cacao] gi|508711186|gb|EOY03083.1| DNA glycosylase
           superfamily protein, putative isoform 1 [Theobroma
           cacao]
          Length = 382

 Score =  236 bits (601), Expect = 2e-59
 Identities = 125/242 (51%), Positives = 157/242 (64%), Gaps = 1/242 (0%)
 Frame = -2

Query: 877 SSKRKKIDKSRV-ISPYFQTTRAEEVEINEEDKPNXXXXXXXSPYFREKTLEEGVEPIEN 701
           + KR++ D   + +SPY Q +  ++   +   KP                 +  V     
Sbjct: 156 NGKRRRADAQVLKVSPYLQRSGEKQDMESGTSKP-----------------KHKVVKASP 198

Query: 700 YLLERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHF 521
           Y L+         KK    A     L+ SQK DEAY+RK+P+NTW PP S+  LLQE H 
Sbjct: 199 YFLKNKDNILGGMKKAMKPAGVKPVLSASQKRDEAYQRKTPNNTWIPPRSNAPLLQEDHT 258

Query: 520 KDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRA 341
            DPWRVL+ICMLLN+T+G QAR V+++LF LCPDAKTATEVAT EIEK I+ LGL  KRA
Sbjct: 259 HDPWRVLLICMLLNKTSGNQARNVLSDLFTLCPDAKTATEVATGEIEKAIKPLGLQRKRA 318

Query: 340 KMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLHIK 161
           +MIQR S+EYL   WTHVT+LHGVGKYAADAYAIFCTGK D+V P+DHMLN YW++L+  
Sbjct: 319 EMIQRMSQEYLWKEWTHVTELHGVGKYAADAYAIFCTGKGDRVTPSDHMLNYYWNFLYGP 378

Query: 160 RD 155
           +D
Sbjct: 379 KD 380


>emb|CAN67143.1| hypothetical protein VITISV_044254 [Vitis vinifera]
          Length = 635

 Score =  232 bits (591), Expect = 3e-58
 Identities = 138/314 (43%), Positives = 169/314 (53%), Gaps = 46/314 (14%)
 Frame = -2

Query: 970  KQEKSRAENFNCSTDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINE 791
            K++K +    N   ++ +   Q+    S  NS K+        +SPY Q    EE E N 
Sbjct: 319  KEQKKKINVQNVRVEDQKMEVQQPISSSNSNSQKK--------VSPYCQRAVKEEEEGNS 370

Query: 790  ED--KPNXXXXXXXSPYFREKTLEEGVEPIENYLLERNYKCEKQPKKVQSRASASHSLNV 617
            E+  K             + KT  + V   +  +     K    P +V S     +  + 
Sbjct: 371  EEDTKKGHENEESFKEEGKRKTNAQNVTMEDEKMKLPKKKSRAPPIRVVSPYFPINEEDA 430

Query: 616  SQ--------KLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQ 461
             +        KL+ AYRRKSPDN WKPPPSHF LLQE H+ DPWRV+VICMLLN T+G Q
Sbjct: 431  KKPVRAMFFNKLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLLNCTSGLQ 490

Query: 460  ------------------------------------ARRVIANLFELCPDAKTATEVATE 389
                                                A RVI++LF LCPDAKTAT+V TE
Sbjct: 491  GWFGTCVTCMILKWAVEPRSHVVGFIMIELPVGILLASRVISDLFTLCPDAKTATDVPTE 550

Query: 388  EIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVR 209
             IEKVI+ LGL  KRA MIQRFS+EYL+D WTHVTQLHG+GKYAADAYAIFC+G W  V 
Sbjct: 551  MIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQLHGIGKYAADAYAIFCSGDWGLVV 610

Query: 208  PNDHMLNKYWDYLH 167
            PNDHML KYW YL+
Sbjct: 611  PNDHMLVKYWKYLY 624

