BLASTX nr result
ID: Akebia24_contig00009581
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00009581
         (1491 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...  265   3e-68
ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citr...  261   7e-67
gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabi...  258   6e-66
ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX ho...  254   9e-65
emb|CBI29440.3| unnamed protein product [Vitis vinifera]             248   4e-63
ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244...  248   4e-63
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                246   2e-62
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...  246   2e-62
ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutr...  245   3e-62
ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Caps...  244   6e-62
ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsi...  244   6e-62
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...  244   6e-62
ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101...  243   2e-61
ref|XP_007163979.1| hypothetical protein PHAVU_L0004001g, partia...  239   3e-60
ref|XP_007163978.1| hypothetical protein PHAVU_L0004001g [Phaseo...  239   3e-60
ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Popu...  237   1e-59
ref|XP_007032156.1| DNA glycosylase superfamily protein, putativ...  236   3e-59
ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein...  234   7e-59
ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255...
233 1e-58 emb|CAN67143.1| hypothetical protein VITISV_044254 [Vitis vinifera] 230 1e-57 >ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis] gi|223546492|gb|EEF47991.1| conserved hypothetical protein [Ricinus communis] Length = 608 Score = 265 bits (678), Expect = 3e-68 Identities = 126/195 (64%), Positives = 149/195 (76%) Frame = +3 Query: 588 PYFREKPLEEGVEPIENYLLERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDN 767 PYF++ P +E E ++ +++ + +K P+K + A S +L+ ++K EAYRRK+PDN Sbjct: 408 PYFQKVPKQEEEEAADSNMIDNKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRRKTPDN 467 Query: 768 TWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVAT 947 TWKPP S F LLQE H DPWRVLVICMLLN T G+Q R VI++ F LCPDAK ATE T Sbjct: 468 TWKPPRSDFGLLQEDHASDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAATEAKT 527 Query: 948 EEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQV 1127 EEIEK+I LGL KRA MIQR S+EYL D WTHVTQLHGVGKYAADAYAIFCTGKWDQV Sbjct: 528 EEIEKIIVPLGLQKKRAVMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTGKWDQV 587 Query: 1128 RPNDHMLNKYWDYLH 1172 RP DHMLN YWD+LH Sbjct: 588 RPKDHMLNYYWDFLH 602 >ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citrus clementina] gi|568883956|ref|XP_006494704.1| PREDICTED: transcriptional regulator ATRX homolog isoform X2 [Citrus sinensis] gi|557525860|gb|ESR37166.1| hypothetical protein CICLE_v10028470mg [Citrus clementina] Length = 439 Score = 261 bits (666), Expect = 7e-67 Identities = 131/226 (57%), Positives = 154/226 (68%), Gaps = 2/226 (0%) Frame = +3 Query: 498 ISPYFQTTRAEEVEINEEDKPXXXXXXXXXPYFREKPLEEGVEPIENYLLERNYKCEKQP 677 +SPYFQ +A VE D PYF+ + P + N + E++ Sbjct: 210 VSPYFQRQKAGNVERKNHDTSTMAQARKVSPYFQN---QNSTTPAAATVQVHNQQQEEKE 266 Query: 678 KK--VQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICM 851 K V+ + S S +L +QK DEAY RK PDNTW PP S LLQ +H DPWRV+VICM Sbjct: 267 KDIAVKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIVLLQHEHVHDPWRVIVICM 326 Query: 852 LLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYL 1031 LLNRT G QA RVI++LF LCPDAKTATEV EEIEK+I LGL KRA 
MI+RFS+EYL Sbjct: 327 LLNRTTGLQAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIKRFSQEYL 386 Query: 1032 EDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYL 1169 + WTHVTQLHGVGKYAADAYAIFCTGKWD+VRP DHMLN YW++L Sbjct: 387 GESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPTDHMLNYYWEFL 432 >gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabilis] Length = 418 Score = 258 bits (658), Expect = 6e-66 Identities = 145/283 (51%), Positives = 178/283 (62%), Gaps = 23/283 (8%) Frame = +3 Query: 405 STDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTR--------------AEEVEI 542 S E E +K KK+I ++ + SRV+SPYF T R EEVE+ Sbjct: 142 SRKEVEIAGKKRRKKNI---DRKDDVAGSRVVSPYFTTNRNDTQEKKKKPEKDGREEVEL 198 Query: 543 NEEDKPXXXXXXXXXPYFREKPLEEGVEPIENYLLERNYKC------EKQPKKV---QSR 695 E+ K F KP++E +E E+ K EK+ K+ + + Sbjct: 199 GEK-KEEHLKLVDVLSRFAYKPMKEKTT-VER--AEKGRKLGLVGVGEKKMSKIVVRRKK 254 Query: 696 ASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGR 875 S LN ++K DEAY+RK+ DN W PPPS L+Q+ H DPWRVLVICMLLNRT G Sbjct: 255 IEKSKVLNAAEKRDEAYKRKTDDNKWNPPPSEIRLIQQDHLHDPWRVLVICMLLNRTTGA 314 Query: 876 QARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVT 1055 QA RVI++ F LCP+AK ATEV+ EEI K+I LGL HKRA+MIQRFS+EYLE+ WTHVT Sbjct: 315 QATRVISDFFSLCPNAKAATEVSPEEIVKIIHTLGL-HKRAQMIQRFSREYLEESWTHVT 373 Query: 1056 QLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLHIKRD 1184 QLHGVGKYAADAYAIFCTGKWD+V+P DHMLN YW +LH RD Sbjct: 374 QLHGVGKYAADAYAIFCTGKWDRVKPADHMLNYYWKFLHSIRD 416 >ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Citrus sinensis] Length = 446 Score = 254 bits (648), Expect = 9e-65 Identities = 131/233 (56%), Positives = 154/233 (66%), Gaps = 9/233 (3%) Frame = +3 Query: 498 ISPYFQTTRAEEVEINEEDKPXXXXXXXXXPYFREKPLEEGVEPIENYLLERNYKCEKQP 677 +SPYFQ +A VE D PYF+ + P + N + E++ Sbjct: 210 VSPYFQRQKAGNVERKNHDTSTMAQARKVSPYFQN---QNSTTPAAATVQVHNQQQEEKE 266 Query: 678 KK--VQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICM 851 K V+ + S S +L +QK DEAY RK PDNTW PP S LLQ +H DPWRV+VICM Sbjct: 267 
KDIAVKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIVLLQHEHVHDPWRVIVICM 326 Query: 852 LLNRTAGRQ-------ARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQ 1010 LLNRT G Q A RVI++LF LCPDAKTATEV EEIEK+I LGL KRA MI+ Sbjct: 327 LLNRTTGLQEIAILLKAGRVISDLFTLCPDAKTATEVDAEEIEKIISTLGLQKKRAPMIK 386 Query: 1011 RFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYL 1169 RFS+EYL + WTHVTQLHGVGKYAADAYAIFCTGKWD+VRP DHMLN YW++L Sbjct: 387 RFSQEYLGESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPTDHMLNYYWEFL 439 >emb|CBI29440.3| unnamed protein product [Vitis vinifera] Length = 599 Score = 248 bits (634), Expect = 4e-63 Identities = 143/285 (50%), Positives = 170/285 (59%), Gaps = 17/285 (5%) Frame = +3 Query: 369 KQEKSRAENFNCSTDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINE 548 K++K + N ++ + Q+ S NS K+ +SPY Q EE E N Sbjct: 319 KEQKKKINVQNVRVEDQKMEVQQPISSSNSNSQKK--------VSPYCQRAVKEEEEGNS 370 Query: 549 EDKPXXXXXXXXXPYFREKPLEEGVEPIENYLLERNYKCEKQPKKVQSRA------SASH 710 E+ E EEG + + K PKK +SRA S Sbjct: 371 EEDTKKGHEN------EESFKEEGKRKTNAQNVTMEDEKMKLPKK-KSRAPPIRVVSPYF 423 Query: 711 SLNVSQ-----------KLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLL 857 +N KL+ AYRRKSPDN WKPPPSHF LLQE H+ DPWRV+VICMLL Sbjct: 424 PINEEDAKKPVRAMFFNKLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLL 483 Query: 858 NRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYLED 1037 N T+G QA RVI++LF LCPDAKTAT+V TE IEKVI+ LGL KRA MIQRFS+EYL+D Sbjct: 484 NCTSGLQASRVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDD 543 Query: 1038 GWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLH 1172 WTHVTQLHG+GKYAADAYAIFC+G W V PNDHML KYW YL+ Sbjct: 544 SWTHVTQLHGIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYLY 588 >ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244192 [Vitis vinifera] Length = 536 Score = 248 bits (634), Expect = 4e-63 Identities = 143/285 (50%), Positives = 170/285 (59%), Gaps = 17/285 (5%) Frame = +3 Query: 369 KQEKSRAENFNCSTDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINE 548 K++K + N ++ + Q+ S NS K+ +SPY Q EE E N Sbjct: 256 
KEQKKKINVQNVRVEDQKMEVQQPISSSNSNSQKK--------VSPYCQRAVKEEEEGNS 307 Query: 549 EDKPXXXXXXXXXPYFREKPLEEGVEPIENYLLERNYKCEKQPKKVQSRA------SASH 710 E+ E EEG + + K PKK +SRA S Sbjct: 308 EEDTKKGHEN------EESFKEEGKRKTNAQNVTMEDEKMKLPKK-KSRAPPIRVVSPYF 360 Query: 711 SLNVSQ-----------KLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLL 857 +N KL+ AYRRKSPDN WKPPPSHF LLQE H+ DPWRV+VICMLL Sbjct: 361 PINEEDAKKPVRAMFFNKLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLL 420 Query: 858 NRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYLED 1037 N T+G QA RVI++LF LCPDAKTAT+V TE IEKVI+ LGL KRA MIQRFS+EYL+D Sbjct: 421 NCTSGLQASRVISDLFTLCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDD 480 Query: 1038 GWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLH 1172 WTHVTQLHG+GKYAADAYAIFC+G W V PNDHML KYW YL+ Sbjct: 481 SWTHVTQLHGIGKYAADAYAIFCSGDWGLVVPNDHMLVKYWKYLY 525 >gb|AAO22623.1| unknown protein [Arabidopsis thaliana] Length = 407 Score = 246 bits (627), Expect = 2e-62 Identities = 134/260 (51%), Positives = 167/260 (64%), Gaps = 4/260 (1%) Frame = +3 Query: 411 DEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINEEDKPXXXXXXXXXP 590 D D +S + SSKR+ K+R +SPYFQ + E + N+ K Sbjct: 162 DSDIVSSSQSGRNYRKGSSKRQV--KARRVSPYFQESTVSE-QPNQAPKGLRN------- 211 Query: 591 YFREKPLEEGVEPIENYL----LERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKS 758 YF+ V + Y ++ N +++ + V+ S L++SQK D+ Y RK+ Sbjct: 212 YFK-------VVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKT 264 Query: 759 PDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATE 938 PDNTW PP S LLQE H+ DPWRVLVICMLLN+T+G Q R VI++LF LC DAKTATE Sbjct: 265 PDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATE 324 Query: 939 VATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKW 1118 V EEIE +I+ LGL KR KMIQR S EYL++ WTHVTQLHGVGKYAADAYAIFC G W Sbjct: 325 VKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNW 384 Query: 1119 DQVRPNDHMLNKYWDYLHIK 1178 D+V+PNDHMLN YWDYL I+ Sbjct: 385 DRVKPNDHMLNYYWDYLRIR 404 >ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata 
subsp. lyrata] gi|297328398|gb|EFH58817.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata] Length = 435 Score = 246 bits (627), Expect = 2e-62 Identities = 131/252 (51%), Positives = 164/252 (65%), Gaps = 4/252 (1%) Frame = +3 Query: 435 KSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINEEDKPXXXXXXXXXPYFREKPLE 614 +S K SSKR+ K R SPYFQ + E +P YF+ Sbjct: 197 QSGKNYRRGSSKRQA--KVRRDSPYFQESTVSE-------QPSQAPPRDLRQYFK----- 242 Query: 615 EGVEPIENYL----LERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPP 782 V + Y ++ N +++ +V+ S SL++SQK DEAY+RK+PD TW PP Sbjct: 243 --VVKVSRYFHADGIQVNESQKEKSTRVRKTPVVSPSLSLSQKTDEAYQRKTPDKTWVPP 300 Query: 783 PSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEK 962 S LLQE H+ DPWRVLVICMLLN+T+G Q R VI +LF LCPDAKTATEV EIE Sbjct: 301 RSPCNLLQEHHWHDPWRVLVICMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIES 360 Query: 963 VIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDH 1142 +I+ LGL KRA+MIQRFS EYL++ WTHVTQLHG+GKYAADAYAIFC G WD+V+P+DH Sbjct: 361 LIKPLGLQKKRARMIQRFSLEYLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDH 420 Query: 1143 MLNKYWDYLHIK 1178 MLN YW++L I+ Sbjct: 421 MLNYYWEFLRIR 432 >ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutrema salsugineum] gi|557108926|gb|ESQ49233.1| hypothetical protein EUTSA_v10020704mg [Eutrema salsugineum] Length = 456 Score = 245 bits (626), Expect = 3e-62 Identities = 134/255 (52%), Positives = 164/255 (64%) Frame = +3 Query: 414 EDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINEEDKPXXXXXXXXXPY 593 + E++ +S +K SSK + K +SPYFQ + E D Y Sbjct: 210 DSESVASQSGRKYRKESSKLQA--KVPRVSPYFQGSTVSEQPNPSRDLRQYFKVVKVSRY 267 Query: 594 FREKPLEEGVEPIENYLLERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTW 773 F + P + G + + ER+ + K P S SL+ QK DEAY RK PDNTW Sbjct: 268 FHDMPAD-GTQ-VNEPQKERSRRMRKTPV-------VSPSLSQCQKTDEAYLRKMPDNTW 318 Query: 774 KPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEE 953 PP S LLQE H+ DPWRVLVICMLLN+T+G Q R VI++LF LCPDAK+ATEV +E Sbjct: 319 
VPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFVLCPDAKSATEVEEKE 378 Query: 954 IEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRP 1133 IE +I+ LGL KRAKMIQRFS EYL++ WTHVTQL+GVGKYAADAYAIFC GKWD VRP Sbjct: 379 IESLIKPLGLQKKRAKMIQRFSLEYLQESWTHVTQLYGVGKYAADAYAIFCNGKWDCVRP 438 Query: 1134 NDHMLNKYWDYLHIK 1178 DHMLN YW++L I+ Sbjct: 439 ADHMLNYYWEFLRIR 453 >ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Capsella rubella] gi|482566361|gb|EOA30550.1| hypothetical protein CARUB_v10013672mg [Capsella rubella] Length = 456 Score = 244 bits (624), Expect = 6e-62 Identities = 127/240 (52%), Positives = 161/240 (67%) Frame = +3 Query: 459 NSSKRKKIDKSRVISPYFQTTRAEEVEINEEDKPXXXXXXXXXPYFREKPLEEGVEPIEN 638 +SSK + K R +S YFQ + E D YF + + G++ ++ Sbjct: 225 DSSKHQA--KVRRVSRYFQASADSEQPNPPRDLRKYFKVVKVSRYFHDVSAD-GIQVADS 281 Query: 639 YLLERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHF 818 +++ ++V+ S SL+ SQK DEAY RK+PDNTW PP S LLQE H+ Sbjct: 282 Q--------KEKSRRVRKTPVVSPSLSPSQKTDEAYLRKTPDNTWVPPRSPCNLLQEDHW 333 Query: 819 KDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRA 998 DPWRVLVICMLLN+T+G Q R VI++LF LCPDAKTATEV +EIE +I+ LGL KRA Sbjct: 334 HDPWRVLVICMLLNKTSGAQTRGVISDLFTLCPDAKTATEVEEKEIESLIKPLGLQKKRA 393 Query: 999 KMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLHIK 1178 KMIQRFS EYL + WTHVTQLHG+GKYAADAYAIFC G WD+V+P+DHMLN YW++L I+ Sbjct: 394 KMIQRFSLEYLNESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPSDHMLNYYWEFLRIR 453 >ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsis thaliana] gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis thaliana] gi|332641100|gb|AEE74621.1| DNA glycosylase superfamily protein [Arabidopsis thaliana] Length = 445 Score = 244 bits (624), Expect = 6e-62 Identities = 134/260 (51%), Positives = 166/260 (63%), Gaps = 4/260 (1%) Frame = +3 Query: 411 DEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINEEDKPXXXXXXXXXP 590 D D +S + SSKR+ K R +SPYFQ + E + N+ K Sbjct: 200 DSDIVSSSQSGRNYRKGSSKRQV--KVRRVSPYFQESTVSE-QPNQAPKGLRN------- 
249 Query: 591 YFREKPLEEGVEPIENYL----LERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKS 758 YF+ V + Y ++ N +++ + V+ S L++SQK D+ Y RK+ Sbjct: 250 YFK-------VVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKT 302 Query: 759 PDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATE 938 PDNTW PP S LLQE H+ DPWRVLVICMLLN+T+G Q R VI++LF LC DAKTATE Sbjct: 303 PDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATE 362 Query: 939 VATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKW 1118 V EEIE +I+ LGL KR KMIQR S EYL++ WTHVTQLHGVGKYAADAYAIFC G W Sbjct: 363 VKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNW 422 Query: 1119 DQVRPNDHMLNKYWDYLHIK 1178 D+V+PNDHMLN YWDYL I+ Sbjct: 423 DRVKPNDHMLNYYWDYLRIR 442 >gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana] Length = 419 Score = 244 bits (624), Expect = 6e-62 Identities = 134/260 (51%), Positives = 166/260 (63%), Gaps = 4/260 (1%) Frame = +3 Query: 411 DEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINEEDKPXXXXXXXXXP 590 D D +S + SSKR+ K R +SPYFQ + E + N+ K Sbjct: 174 DSDIVSSSQSGRNYRKGSSKRQV--KVRRVSPYFQESTVSE-QPNQAPKGLRN------- 223 Query: 591 YFREKPLEEGVEPIENYL----LERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKS 758 YF+ V + Y ++ N +++ + V+ S L++SQK D+ Y RK+ Sbjct: 224 YFK-------VVKVSRYFHADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKT 276 Query: 759 PDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATE 938 PDNTW PP S LLQE H+ DPWRVLVICMLLN+T+G Q R VI++LF LC DAKTATE Sbjct: 277 PDNTWVPPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATE 336 Query: 939 VATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKW 1118 V EEIE +I+ LGL KR KMIQR S EYL++ WTHVTQLHGVGKYAADAYAIFC G W Sbjct: 337 VKEEEIENLIKPLGLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNW 396 Query: 1119 DQVRPNDHMLNKYWDYLHIK 1178 D+V+PNDHMLN YWDYL I+ Sbjct: 397 DRVKPNDHMLNYYWDYLRIR 416 >ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101(2)-like [Glycine max] Length = 1424 Score = 243 bits (620), Expect = 2e-61 Identities = 119/227 (52%), 
Positives = 149/227 (65%) Frame = +3 Query: 492 RVISPYFQTTRAEEVEINEEDKPXXXXXXXXXPYFREKPLEEGVEPIENYLLERNYKCEK 671 R +SPYF ++V + DK L +E+ L E C Sbjct: 1207 RYVSPYFCNNSGKKVNVKPFDKGSTSESIA---------LHTCKNFVEDKLEENKSNCSN 1257 Query: 672 QPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICM 851 + +++ AS +K DEAY+RK+PDNTWKPP S L+QE H DPWRVLVICM Sbjct: 1258 KSIEIKRFPPAS------EKWDEAYKRKTPDNTWKPPRSEIVLIQEDHLHDPWRVLVICM 1311 Query: 852 LLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYL 1031 LLNRTAG Q ++V++N F+LCPDAK+ T+V EEIEK I+ LG HKRA+M+QR S+EYL Sbjct: 1312 LLNRTAGGQTKKVVSNFFKLCPDAKSCTQVTREEIEKTIKTLGFQHKRAEMLQRLSEEYL 1371 Query: 1032 EDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLH 1172 ++ WTHVTQLHGVGKYAADAYAIF TG WD+V P DHMLN YW++LH Sbjct: 1372 DESWTHVTQLHGVGKYAADAYAIFVTGMWDRVTPTDHMLNYYWEFLH 1418 >ref|XP_007163979.1| hypothetical protein PHAVU_L0004001g, partial [Phaseolus vulgaris] gi|561039879|gb|ESW35973.1| hypothetical protein PHAVU_L0004001g, partial [Phaseolus vulgaris] Length = 715 Score = 239 bits (609), Expect = 3e-60 Identities = 123/233 (52%), Positives = 158/233 (67%), Gaps = 7/233 (3%) Frame = +3 Query: 492 RVISPYFQTTRAEEVEINEEDKPXXXXXXXXXPYFREKPLEEG--VEPIENYLLERNYKC 665 R +SPYF + +++ KPL+EG E I + E NY Sbjct: 499 RYVSPYFHNDSGKNIDV--------------------KPLDEGSKFESIALHATE-NY-V 536 Query: 666 EKQPKKVQSRASASH-----SLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPW 830 E +P++ +S S +L+ SQK DEAY+RK+PD TWKPP S L+QE H DPW Sbjct: 537 EDKPEENKSSCSEKSIEIKKNLSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPW 596 Query: 831 RVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQ 1010 RVLVICMLLNRT+GRQ + ++++ F+LCPDAK+ TEV+ EEIE+ I+ LG HKRAKM++ Sbjct: 597 RVLVICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLK 656 Query: 1011 RFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYL 1169 R S+EYL++ WTHVTQLHGVGKYAADAYAIF TGK D+VRP DHMLN YW++L Sbjct: 657 RLSEEYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFL 709 >ref|XP_007163978.1| hypothetical protein PHAVU_L0004001g [Phaseolus 
vulgaris] gi|561039878|gb|ESW35972.1| hypothetical protein PHAVU_L0004001g [Phaseolus vulgaris] Length = 726 Score = 239 bits (609), Expect = 3e-60 Identities = 123/233 (52%), Positives = 158/233 (67%), Gaps = 7/233 (3%) Frame = +3 Query: 492 RVISPYFQTTRAEEVEINEEDKPXXXXXXXXXPYFREKPLEEG--VEPIENYLLERNYKC 665 R +SPYF + +++ KPL+EG E I + E NY Sbjct: 510 RYVSPYFHNDSGKNIDV--------------------KPLDEGSKFESIALHATE-NY-V 547 Query: 666 EKQPKKVQSRASASH-----SLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPW 830 E +P++ +S S +L+ SQK DEAY+RK+PD TWKPP S L+QE H DPW Sbjct: 548 EDKPEENKSSCSEKSIEIKKNLSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPW 607 Query: 831 RVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQ 1010 RVLVICMLLNRT+GRQ + ++++ F+LCPDAK+ TEV+ EEIE+ I+ LG HKRAKM++ Sbjct: 608 RVLVICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLK 667 Query: 1011 RFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYL 1169 R S+EYL++ WTHVTQLHGVGKYAADAYAIF TGK D+VRP DHMLN YW++L Sbjct: 668 RLSEEYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFL 720 >ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Populus trichocarpa] gi|550326306|gb|EEE95947.2| hypothetical protein POPTR_0012s03470g [Populus trichocarpa] Length = 229 Score = 237 bits (604), Expect = 1e-59 Identities = 114/173 (65%), Positives = 135/173 (78%), Gaps = 3/173 (1%) Frame = +3 Query: 660 KCEKQPKKVQSRASASHSLNVS---QKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPW 830 + +K+ KK + ++ HS S K DEAY RK+ +NTWKPP S F L H DPW Sbjct: 50 RSKKKKKKKEGTKTSLHSDTTSPYYNKFDEAYERKTAENTWKPPQSEFGFLHN-HAHDPW 108 Query: 831 RVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQ 1010 RVLVICMLLNRTAG +A RV+A+LF LCPDAK AT VATEEIE+ I+ LGL +RAKM+Q Sbjct: 109 RVLVICMLLNRTAGTRAERVVADLFTLCPDAKAATGVATEEIERAIKSLGLQKRRAKMVQ 168 Query: 1011 RFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYL 1169 R S++YLE+ WTHVTQL GVGKYAADAYAIFCTGKW+QVRPNDHMLN+YW+YL Sbjct: 169 RLSEDYLEEDWTHVTQLPGVGKYAADAYAIFCTGKWEQVRPNDHMLNRYWEYL 221 >ref|XP_007032156.1| DNA glycosylase superfamily protein, 
putative isoform 1 [Theobroma cacao] gi|590648404|ref|XP_007032157.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma cacao] gi|508711185|gb|EOY03082.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma cacao] gi|508711186|gb|EOY03083.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma cacao] Length = 382 Score = 236 bits (601), Expect = 3e-59 Identities = 125/242 (51%), Positives = 157/242 (64%), Gaps = 1/242 (0%) Frame = +3 Query: 462 SSKRKKIDKSRV-ISPYFQTTRAEEVEINEEDKPXXXXXXXXXPYFREKPLEEGVEPIEN 638 + KR++ D + +SPY Q + ++ + KP + V Sbjct: 156 NGKRRRADAQVLKVSPYLQRSGEKQDMESGTSKP-----------------KHKVVKASP 198 Query: 639 YLLERNYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHF 818 Y L+ KK A L+ SQK DEAY+RK+P+NTW PP S+ LLQE H Sbjct: 199 YFLKNKDNILGGMKKAMKPAGVKPVLSASQKRDEAYQRKTPNNTWIPPRSNAPLLQEDHT 258 Query: 819 KDPWRVLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRA 998 DPWRVL+ICMLLN+T+G QAR V+++LF LCPDAKTATEVAT EIEK I+ LGL KRA Sbjct: 259 HDPWRVLLICMLLNKTSGNQARNVLSDLFTLCPDAKTATEVATGEIEKAIKPLGLQRKRA 318 Query: 999 KMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLHIK 1178 +MIQR S+EYL WTHVT+LHGVGKYAADAYAIFCTGK D+V P+DHMLN YW++L+ Sbjct: 319 EMIQRMSQEYLWKEWTHVTELHGVGKYAADAYAIFCTGKGDRVTPSDHMLNYYWNFLYGP 378 Query: 1179 RD 1184 +D Sbjct: 379 KD 380 >ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein 4-like, partial [Solanum tuberosum] Length = 222 Score = 234 bits (597), Expect = 7e-59 Identities = 122/233 (52%), Positives = 148/233 (63%), Gaps = 4/233 (1%) Frame = +3 Query: 486 KSRVISPYFQT-TRAEEVEINEE---DKPXXXXXXXXXPYFREKPLEEGVEPIENYLLER 653 K RV+SPYF T EE+++ ++ PYF+ Sbjct: 4 KVRVVSPYFANLTVGEEIKVGKDRSNPSKNCLNGRKVSPYFQNA---------------- 47 Query: 654 NYKCEKQPKKVQSRASASHSLNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWR 833 Y+ K+ +K R L+ QK DEAY R+S DNTW PP SHF LLQE H DPWR Sbjct: 48 -YRENKKSRKGSKRQKPC--LSAFQKRDEAYLRRSEDNTWVPPRSHFNLLQENHAHDPWR 104 Query: 834 
VLVICMLLNRTAGRQARRVIANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQR 1013 VLVICMLLN T G Q +RV+ F LCP+A ATEVA E+IEK+++ LGL+ KR+ I R Sbjct: 105 VLVICMLLNCTTGVQVKRVVDEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLAIPR 164 Query: 1014 FSKEYLEDGWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLH 1172 S+EYL + WTHVTQLHG+GKYAADAYAIFCTGKWDQV PNDHML KYW++LH Sbjct: 165 LSQEYLGETWTHVTQLHGIGKYAADAYAIFCTGKWDQVHPNDHMLTKYWEFLH 217 >ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255935 [Solanum lycopersicum] Length = 544 Score = 233 bits (595), Expect = 1e-58 Identities = 127/273 (46%), Positives = 163/273 (59%), Gaps = 6/273 (2%) Frame = +3 Query: 372 QEKSRA--ENFNCSTDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRA-EEVEI 542 ++K+RA F S + + + + + + + +K K RV+SPYF + EE+++ Sbjct: 284 EQKARAVCPYFLNSRNGETEMKKGRSVECVKKRNDKKLRTKVRVVSPYFANLKVGEEIKV 343 Query: 543 NEEDK---PXXXXXXXXXPYFREKPLEEGVEPIENYLLERNYKCEKQPKKVQSRASASHS 713 ++ PYF+ N EK+ + S+ Sbjct: 344 GKDSSNASKNCLNGRKVSPYFQ------------------NAYREKKKSTIGSKRQKP-C 384 Query: 714 LNVSQKLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLLNRTAGRQARRVI 893 L+ SQK DEAY R+S DN W PP SHF LLQE H DPWRVLVICMLLN T G Q RRV+ Sbjct: 385 LSASQKRDEAYLRRSEDNMWVPPRSHFNLLQENHAHDPWRVLVICMLLNCTTGVQVRRVV 444 Query: 894 ANLFELCPDAKTATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVG 1073 F LCP+A ATEVA E+IEK+++ LGL+ KR+ I R S+EYL WTHVTQLHG+G Sbjct: 445 DEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLSIPRLSQEYLGKNWTHVTQLHGIG 504 Query: 1074 KYAADAYAIFCTGKWDQVRPNDHMLNKYWDYLH 1172 KYAADAYAIFCTG WDQV PNDHML KYW++LH Sbjct: 505 KYAADAYAIFCTGNWDQVHPNDHMLTKYWEFLH 537 >emb|CAN67143.1| hypothetical protein VITISV_044254 [Vitis vinifera] Length = 635 Score = 230 bits (587), Expect = 1e-57 Identities = 143/321 (44%), Positives = 170/321 (52%), Gaps = 53/321 (16%) Frame = +3 Query: 369 KQEKSRAENFNCSTDEDENLPQKSEKKSIPNSSKRKKIDKSRVISPYFQTTRAEEVEINE 548 K++K + N ++ + Q+ S NS K+ +SPY Q EE E N Sbjct: 319 KEQKKKINVQNVRVEDQKMEVQQPISSSNSNSQKK--------VSPYCQRAVKEEEEGNS 370 Query: 549 
EDKPXXXXXXXXXPYFREKPLEEGVEPIENYLLERNYKCEKQPKKVQSRA------SASH 710 E+ E EEG + + K PKK +SRA S Sbjct: 371 EEDTKKGHEN------EESFKEEGKRKTNAQNVTMEDEKMKLPKK-KSRAPPIRVVSPYF 423 Query: 711 SLNVSQ-----------KLDEAYRRKSPDNTWKPPPSHFTLLQEQHFKDPWRVLVICMLL 857 +N KL+ AYRRKSPDN WKPPPSHF LLQE H+ DPWRV+VICMLL Sbjct: 424 PINEEDAKKPVRAMFFNKLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLL 483 Query: 858 NRTAGRQ------------------------------------ARRVIANLFELCPDAKT 929 N T+G Q A RVI++LF LCPDAKT Sbjct: 484 NCTSGLQGWFGTCVTCMILKWAVEPRSHVVGFIMIELPVGILLASRVISDLFTLCPDAKT 543 Query: 930 ATEVATEEIEKVIQILGLHHKRAKMIQRFSKEYLEDGWTHVTQLHGVGKYAADAYAIFCT 1109 AT+V TE IEKVI+ LGL KRA MIQRFS+EYL+D WTHVTQLHG+GKYAADAYAIFC+ Sbjct: 544 ATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQLHGIGKYAADAYAIFCS 603 Query: 1110 GKWDQVRPNDHMLNKYWDYLH 1172 G W V PNDHML KYW YL+ Sbjct: 604 GDWGLVVPNDHMLVKYWKYLY 624