BLASTX nr result
ID: Paeonia23_contig00001425
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00001425
         (1950 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citr...  258   9e-66
ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX ho...  251   1e-63
ref|XP_002514395.1| conserved hypothetical protein [Ricinus comm...  248   9e-63
ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein...  240   1e-60
ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutr...  239   3e-60
gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabi...  238   6e-60
gb|AAO22623.1| unknown protein [Arabidopsis thaliana]                238   7e-60
ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Caps...  238   1e-59
ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arab...  237   1e-59
ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255...  236   2e-59
ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsi...  236   2e-59
gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thal...  236   2e-59
ref|XP_007032156.1| DNA glycosylase superfamily protein, putativ...  235   6e-59
ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101...  231   7e-58
ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Popu...  231   9e-58
emb|CBI29440.3| unnamed protein product [Vitis vinifera]             229   3e-57
ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244...  229   3e-57
ref|XP_007163979.1| hypothetical protein PHAVU_L0004001g, partia...  229   4e-57
ref|XP_007163978.1| hypothetical protein PHAVU_L0004001g [Phaseo...
229 4e-57 gb|EPS66392.1| hypothetical protein M569_08394 [Genlisea aurea] 222 5e-55 >ref|XP_006423926.1| hypothetical protein CICLE_v10028470mg [Citrus clementina] gi|568883956|ref|XP_006494704.1| PREDICTED: transcriptional regulator ATRX homolog isoform X2 [Citrus sinensis] gi|557525860|gb|ESR37166.1| hypothetical protein CICLE_v10028470mg [Citrus clementina] Length = 439 Score = 258 bits (658), Expect = 9e-66 Identities = 135/249 (54%), Positives = 159/249 (63%), Gaps = 4/249 (1%) Frame = -1 Query: 978 FSKRGRENNIQYKN----APKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXX 811 + +R + N++ KN + R VSPYF N T Sbjct: 213 YFQRQKAGNVERKNHDTSTMAQARKVSPYFQNQNSTTPAAATVQV--------------- 257 Query: 810 SHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFY 631 HN + + + +K K+ LTA+QK EAYERK DNTW PP SP Sbjct: 258 ----HNQQQEEKEKDIA-----VKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIV 308 Query: 630 LLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSL 451 LLQ +H HDPWRV+VIC+LLNRTTGLQA RVI DLFTLCP+AK ATEV EEIE++I +L Sbjct: 309 LLQHEHVHDPWRVIVICMLLNRTTGLQAGRVISDLFTLCPDAKTATEVDAEEIEKIISTL 368 Query: 450 GLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYY 271 GLQKKRA I+R S+EYL ESWTHVTQLHG+GKYAADAYAIFCTGKW+ V P DHMLNYY Sbjct: 369 GLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPTDHMLNYY 428 Query: 270 WEFLISFYG 244 WEFL+S G Sbjct: 429 WEFLVSTKG 437 >ref|XP_006494703.1| PREDICTED: transcriptional regulator ATRX homolog isoform X1 [Citrus sinensis] Length = 446 Score = 251 bits (640), Expect = 1e-63 Identities = 135/256 (52%), Positives = 159/256 (62%), Gaps = 11/256 (4%) Frame = -1 Query: 978 FSKRGRENNIQYKN----APKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXX 811 + +R + N++ KN + R VSPYF N T Sbjct: 213 YFQRQKAGNVERKNHDTSTMAQARKVSPYFQNQNSTTPAAATVQV--------------- 257 Query: 810 SHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFY 631 HN + + + +K K+ LTA+QK EAYERK DNTW PP SP Sbjct: 258 ----HNQQQEEKEKDIA-----VKKKRSRSVTLTAAQKRDEAYERKRPDNTWNPPRSPIV 308 Query: 630 
LLQEKHSHDPWRVLVICLLLNRTTGLQ-------ARRVIWDLFTLCPNAKIATEVGTEEI 472 LLQ +H HDPWRV+VIC+LLNRTTGLQ A RVI DLFTLCP+AK ATEV EEI Sbjct: 309 LLQHEHVHDPWRVIVICMLLNRTTGLQEIAILLKAGRVISDLFTLCPDAKTATEVDAEEI 368 Query: 471 ERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPE 292 E++I +LGLQKKRA I+R S+EYL ESWTHVTQLHG+GKYAADAYAIFCTGKW+ V P Sbjct: 369 EKIISTLGLQKKRAPMIKRFSQEYLGESWTHVTQLHGVGKYAADAYAIFCTGKWDRVRPT 428 Query: 291 DHMLNYYWEFLISFYG 244 DHMLNYYWEFL+S G Sbjct: 429 DHMLNYYWEFLVSTKG 444 >ref|XP_002514395.1| conserved hypothetical protein [Ricinus communis] gi|223546492|gb|EEF47991.1| conserved hypothetical protein [Ricinus communis] Length = 608 Score = 248 bits (632), Expect = 9e-63 Identities = 200/597 (33%), Positives = 273/597 (45%), Gaps = 33/597 (5%) Frame = -1 Query: 1950 EASKREKRRVISPYFXXXXXXXXXXXXXXXRAFSVDGGYDGDGKMWKREGGTMQSRKKER 1771 + K++K+RV+SPYF VD D + + + RKK++ Sbjct: 50 KTKKKKKKRVVSPYFERVESTIS----------KVDNNLSFDSHDHESKQKKKKKRKKKK 99 Query: 1770 NQSHMDMSLMVSQVIEDESNAMVFKKKETEPSSSINLQINSTIAVEDDDG-RLSKEVYQG 1594 +VS E + M+ K + + NL +S VE+ R+S + Q Sbjct: 100 G--------VVSPYFE-RAECMISKDEPVDN----NLTFDSYDPVEEKKNKRVSPFLAQA 146 Query: 1593 NGMEKRTWDDTYCCNKSIKMNSLHQKAANKAEEHSFSVQAEINSSVLVDKDEQNEVNEVV 1414 + D+ N ++ ++ +K K+ + +++ E + +V + + E Sbjct: 147 ESRISK--DENVDNNLTLHGHAREKKKKKKSGTFTLNLEEEQGGANVVSRGDGKE----- 199 Query: 1413 LLPDNFNDLDEKKSTNLSPYFNKVNEQVGEEVVLVESNTKNNLFLQFIYNASGDGNRSTS 1234 N KK+ + Y NK + V + + + + N + DGN +T Sbjct: 200 ----KANKRKRKKNDG-AIYPNKTRDTVSSDAQMRDIVKLTEI------NVASDGNMATD 248 Query: 1233 DSNYGKNN---------NELEGFSHCFHKAARXXXXXXXXENKKSL---EFGYINKNIIS 1090 D N N F K A +KK L + + KNI Sbjct: 249 DCKTSAKNLLNEQMVAPNAGMSFEDVLSKYAYKSDGRLNFRDKKILGAPHYPMVVKNIEK 308 Query: 1089 SVILEN----EGEENLVSSPSLCGDLLDGKVPRNGVVVDQVFS---KRGRENNIQYKNAP 931 EN E E L + + L +G + +V + R EN Sbjct: 309 YEESENKISKEAEGTLKITENEAAPLPAIPYGNSGSQISEVGNVTPTRNIENEKPNSRVH 368 Query: 930 KKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVNGDDFLGGEI 751 +VR VSP F S E M S YF VPK ++ + 
Sbjct: 369 IQVRKVSPNFNLSIGQQEC--MKIKPLKPCERVGLTVRNVSPYFQKVPKQEEEE--AADS 424 Query: 750 GCIKSKKVNKHV-------------LTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHS 610 I +K K + L+A++K EAY RKT DNTWKPP S F LLQE H+ Sbjct: 425 NMIDNKHGQKKLPEKKKRPARKSITLSAAEKRSEAYRRKTPDNTWKPPRSDFGLLQEDHA 484 Query: 609 HDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRA 430 DPWRVLVIC+LLN TTG Q R VI D FTLCP+AK ATE TEEIE++I LGLQKKRA Sbjct: 485 SDPWRVLVICMLLNCTTGKQVRGVISDFFTLCPDAKAATEAKTEEIEKIIVPLGLQKKRA 544 Query: 429 VAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259 V IQRLS+EYL + WTHVTQLHG+GKYAADAYAIFCTGKW+ V P+DHMLNYYW+FL Sbjct: 545 VMIQRLSQEYLADDWTHVTQLHGVGKYAADAYAIFCTGKWDQVRPKDHMLNYYWDFL 601 >ref|XP_006350381.1| PREDICTED: methyl-CpG-binding domain protein 4-like, partial [Solanum tuberosum] Length = 222 Score = 240 bits (613), Expect = 1e-60 Identities = 128/226 (56%), Positives = 148/226 (65%), Gaps = 3/226 (1%) Frame = -1 Query: 927 KVRVVSPYFLNSTVTNE---GKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVNGDDFLGG 757 KVRVVSPYF N TV E GKD YF N + N Sbjct: 4 KVRVVSPYFANLTVGEEIKVGKDRSNPSKNCLNGRKVSP-----YFQNAYRENKKSR--- 55 Query: 756 EIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVICL 577 K K K L+A QK EAY R++ DNTW PP S F LLQE H+HDPWRVLVIC+ Sbjct: 56 -----KGSKRQKPCLSAFQKRDEAYLRRSEDNTWVPPRSHFNLLQENHAHDPWRVLVICM 110 Query: 576 LLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYL 397 LLN TTG+Q +RV+ + FTLCPNA ATEV E+IE++++ LGL KR++AI RLS+EYL Sbjct: 111 LLNCTTGVQVKRVVDEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLAIPRLSQEYL 170 Query: 396 EESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259 E+WTHVTQLHGIGKYAADAYAIFCTGKW+ V P DHML YWEFL Sbjct: 171 GETWTHVTQLHGIGKYAADAYAIFCTGKWDQVHPNDHMLTKYWEFL 216 >ref|XP_006407780.1| hypothetical protein EUTSA_v10020704mg [Eutrema salsugineum] gi|557108926|gb|ESQ49233.1| hypothetical protein EUTSA_v10020704mg [Eutrema salsugineum] Length = 456 Score = 239 bits (611), Expect = 3e-60 Identities = 133/251 (52%), Positives = 158/251 (62%), Gaps = 8/251 (3%) Frame = -1 Query: 
987 DQVFSKRGRENNIQYKNAPKKVRVVSPYFLNSTVT---NEGKDMXXXXXXXXXXXXXXXX 817 + V S+ GR+ + KV VSPYF STV+ N +D+ Sbjct: 212 ESVASQSGRKYRKESSKLQAKVPRVSPYFQGSTVSEQPNPSRDLRQYFKVVKVS------ 265 Query: 816 XXSHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHV-----LTASQKLHEAYERKTSDNTWK 652 YFH++P D E +S+++ K L+ QK EAY RK DNTW Sbjct: 266 ---RYFHDMP---ADGTQVNEPQKERSRRMRKTPVVSPSLSQCQKTDEAYLRKMPDNTWV 319 Query: 651 PPGSPFYLLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEI 472 PP SP LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLF LCP+AK ATEV +EI Sbjct: 320 PPRSPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFVLCPDAKSATEVEEKEI 379 Query: 471 ERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPE 292 E +I+ LGLQKKRA IQR S EYL+ESWTHVTQL+G+GKYAADAYAIFC GKW+ V P Sbjct: 380 ESLIKPLGLQKKRAKMIQRFSLEYLQESWTHVTQLYGVGKYAADAYAIFCNGKWDCVRPA 439 Query: 291 DHMLNYYWEFL 259 DHMLNYYWEFL Sbjct: 440 DHMLNYYWEFL 450 >gb|EXB50510.1| Methyl-CpG-binding domain protein 4 [Morus notabilis] Length = 418 Score = 238 bits (608), Expect = 6e-60 Identities = 115/165 (69%), Positives = 132/165 (80%), Gaps = 1/165 (0%) Frame = -1 Query: 744 IKSKKVNKH-VLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVICLLLN 568 ++ KK+ K VL A++K EAY+RKT DN W PP S L+Q+ H HDPWRVLVIC+LLN Sbjct: 250 VRRKKIEKSKVLNAAEKRDEAYKRKTDDNKWNPPPSEIRLIQQDHLHDPWRVLVICMLLN 309 Query: 567 RTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEES 388 RTTG QA RVI D F+LCPNAK ATEV EEI ++I +LGL KRA IQR SREYLEES Sbjct: 310 RTTGAQATRVISDFFSLCPNAKAATEVSPEEIVKIIHTLGLH-KRAQMIQRFSREYLEES 368 Query: 387 WTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLIS 253 WTHVTQLHG+GKYAADAYAIFCTGKW+ V+P DHMLNYYW+FL S Sbjct: 369 WTHVTQLHGVGKYAADAYAIFCTGKWDRVKPADHMLNYYWKFLHS 413 >gb|AAO22623.1| unknown protein [Arabidopsis thaliana] Length = 407 Score = 238 bits (607), Expect = 7e-60 Identities = 167/418 (39%), Positives = 211/418 (50%), Gaps = 22/418 (5%) Frame = -1 Query: 1446 KDEQNEVNEVVLLPDNFNDLDEKKSTNLSPYFNKVNEQVGEEVVLVESNTKNNLFLQFIY 1267 +D+ + V PD+ D E N S K +++ ++ LV+ + N + Sbjct: 19 
RDDDSSVMMTRRRPDS--DFIEVSDENRSFALFKEDDEKNRDLGLVDDGSTN-----LVL 71 Query: 1266 NASGDGNRSTSDSNYGKNNNELEG-FSHCFHKAARXXXXXXXXENKKSLEFGYIN-KNII 1093 DG D N+N L+ FS +K R +K +FG I N++ Sbjct: 72 QCHDDGCSLEKD-----NSNSLDDLFSGFVYKGVR---------RRKRDDFGSITTSNLV 117 Query: 1092 SSVILENEGEENLVSSPSL----CGDLLDGKVPR-----NGVVVDQV------FSKRGRE 958 S I +++ + VS + C + KVPR + Q S+ GR Sbjct: 118 SPQIADDDDDS--VSDSHIERQECSKV-QAKVPRVSPYFQASTISQCDSDIVSSSQSGRN 174 Query: 957 NNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVN 778 K R VSPYF STV+ + YFH Sbjct: 175 YRKGSSKRQVKARRVSPYFQESTVSEQPNQAPKGLRNYFKVVKVS-----RYFH------ 223 Query: 777 GDDFLGGEIGCIKSKKVNKH-----VLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKH 613 D E KS+ V K VL+ SQK + Y RKT DNTW PP SP LLQE H Sbjct: 224 ADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCNLLQEDH 283 Query: 612 SHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKR 433 HDPWRVLVIC+LLN+T+G Q R VI DLF LC +AK ATEV EEIE +I+ LGLQKKR Sbjct: 284 WHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPLGLQKKR 343 Query: 432 AVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259 IQRLS EYL+ESWTHVTQLHG+GKYAADAYAIFC G W+ V+P DHMLNYYW++L Sbjct: 344 TKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYYWDYL 401 >ref|XP_006297652.1| hypothetical protein CARUB_v10013672mg [Capsella rubella] gi|482566361|gb|EOA30550.1| hypothetical protein CARUB_v10013672mg [Capsella rubella] Length = 456 Score = 238 bits (606), Expect = 1e-59 Identities = 136/248 (54%), Positives = 152/248 (61%), Gaps = 5/248 (2%) Frame = -1 Query: 987 DQVFSKRGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXS 808 D V S+ G KVR VS YF S D S Sbjct: 212 DIVSSQSGGSYRRDSSKHQAKVRRVSRYFQASA------DSEQPNPPRDLRKYFKVVKVS 265 Query: 807 HYFHNVPKVNGDDFLGGEIGCIKSKKVNKHV-----LTASQKLHEAYERKTSDNTWKPPG 643 YFH+V + D + KS++V K L+ SQK EAY RKT DNTW PP Sbjct: 266 RYFHDV---SADGIQVADSQKEKSRRVRKTPVVSPSLSPSQKTDEAYLRKTPDNTWVPPR 322 Query: 642 SPFYLLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERV 463 SP LLQE H 
HDPWRVLVIC+LLN+T+G Q R VI DLFTLCP+AK ATEV +EIE + Sbjct: 323 SPCNLLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFTLCPDAKTATEVEEKEIESL 382 Query: 462 IQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHM 283 I+ LGLQKKRA IQR S EYL ESWTHVTQLHGIGKYAADAYAIFC G W+ V+P DHM Sbjct: 383 IKPLGLQKKRAKMIQRFSLEYLNESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPSDHM 442 Query: 282 LNYYWEFL 259 LNYYWEFL Sbjct: 443 LNYYWEFL 450 >ref|XP_002882558.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata] gi|297328398|gb|EFH58817.1| hypothetical protein ARALYDRAFT_896965 [Arabidopsis lyrata subsp. lyrata] Length = 435 Score = 237 bits (605), Expect = 1e-59 Identities = 130/228 (57%), Positives = 145/228 (63%), Gaps = 5/228 (2%) Frame = -1 Query: 927 KVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVNGDDFLGGEIG 748 KVR SPYF STV+ + YFH D E Sbjct: 212 KVRRDSPYFQESTVSEQPSQAPPRDLRQYFKVVKVS----RYFH------ADGIQVNESQ 261 Query: 747 CIKSKKVNKHV-----LTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVI 583 KS +V K L+ SQK EAY+RKT D TW PP SP LLQE H HDPWRVLVI Sbjct: 262 KEKSTRVRKTPVVSPSLSLSQKTDEAYQRKTPDKTWVPPRSPCNLLQEHHWHDPWRVLVI 321 Query: 582 CLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSRE 403 C+LLN+T+G Q R VI DLF LCP+AK ATEV EIE +I+ LGLQKKRA IQR S E Sbjct: 322 CMLLNKTSGAQTRGVIEDLFALCPDAKTATEVEEREIESLIKPLGLQKKRARMIQRFSLE 381 Query: 402 YLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259 YL+ESWTHVTQLHGIGKYAADAYAIFC G W+ V+P+DHMLNYYWEFL Sbjct: 382 YLQESWTHVTQLHGIGKYAADAYAIFCNGNWDRVKPDDHMLNYYWEFL 429 >ref|XP_004231530.1| PREDICTED: uncharacterized protein LOC101255935 [Solanum lycopersicum] Length = 544 Score = 236 bits (603), Expect = 2e-59 Identities = 137/291 (47%), Positives = 170/291 (58%), Gaps = 9/291 (3%) Frame = -1 Query: 1104 KNIISSVILENEGEENLVSSPS--LCGDLLDGKVPRNGVVVDQVFSKRGRENNIQYKNAP 931 K + + +N+ E ++ + +C L+ RNG + K+GR K Sbjct: 266 KTVFEPCLSQNQINEKMIEQKARAVCPYFLNS---RNG----ETEMKKGRSVECVKKRND 318 Query: 930 KK----VRVVSPYFLNSTVTNE---GKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVNGD 772 KK VRVVSPYF N 
V E GKD YF N + Sbjct: 319 KKLRTKVRVVSPYFANLKVGEEIKVGKDSSNASKNCLNGRKVSP-----YFQNAYREKKK 373 Query: 771 DFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRV 592 +G K K L+ASQK EAY R++ DN W PP S F LLQE H+HDPWRV Sbjct: 374 STIGS--------KRQKPCLSASQKRDEAYLRRSEDNMWVPPRSHFNLLQENHAHDPWRV 425 Query: 591 LVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRL 412 LVIC+LLN TTG+Q RRV+ + FTLCPNA ATEV E+IE++++ LGL KR+++I RL Sbjct: 426 LVICMLLNCTTGVQVRRVVDEFFTLCPNAVAATEVAVEDIEKLLRPLGLYTKRSLSIPRL 485 Query: 411 SREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259 S+EYL ++WTHVTQLHGIGKYAADAYAIFCTG W+ V P DHML YWEFL Sbjct: 486 SQEYLGKNWTHVTQLHGIGKYAADAYAIFCTGNWDQVHPNDHMLTKYWEFL 536 >ref|NP_974253.1| DNA glycosylase superfamily protein [Arabidopsis thaliana] gi|114050633|gb|ABI49466.1| At3g07930 [Arabidopsis thaliana] gi|332641100|gb|AEE74621.1| DNA glycosylase superfamily protein [Arabidopsis thaliana] Length = 445 Score = 236 bits (603), Expect = 2e-59 Identities = 131/244 (53%), Positives = 149/244 (61%), Gaps = 5/244 (2%) Frame = -1 Query: 975 SKRGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFH 796 S+ GR KVR VSPYF STV+ + YFH Sbjct: 207 SQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQAPKGLRNYFKVVKVS-----RYFH 261 Query: 795 NVPKVNGDDFLGGEIGCIKSKKVNKH-----VLTASQKLHEAYERKTSDNTWKPPGSPFY 631 D E KS+ V K VL+ SQK + Y RKT DNTW PP SP Sbjct: 262 ------ADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCN 315 Query: 630 LLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSL 451 LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLF LC +AK ATEV EEIE +I+ L Sbjct: 316 LLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPL 375 Query: 450 GLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYY 271 GLQKKR IQRLS EYL+ESWTHVTQLHG+GKYAADAYAIFC G W+ V+P DHMLNYY Sbjct: 376 GLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYY 435 Query: 270 WEFL 259 W++L Sbjct: 436 WDYL 439 >gb|AAF21203.1|AC013483_27 hypothetical protein [Arabidopsis thaliana] Length = 419 Score = 236 bits (603), Expect 
= 2e-59 Identities = 131/244 (53%), Positives = 149/244 (61%), Gaps = 5/244 (2%) Frame = -1 Query: 975 SKRGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFH 796 S+ GR KVR VSPYF STV+ + YFH Sbjct: 181 SQSGRNYRKGSSKRQVKVRRVSPYFQESTVSEQPNQAPKGLRNYFKVVKVS-----RYFH 235 Query: 795 NVPKVNGDDFLGGEIGCIKSKKVNKH-----VLTASQKLHEAYERKTSDNTWKPPGSPFY 631 D E KS+ V K VL+ SQK + Y RKT DNTW PP SP Sbjct: 236 ------ADGIQVNESQKEKSRNVRKTPIVSPVLSLSQKTDDVYLRKTPDNTWVPPRSPCN 289 Query: 630 LLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSL 451 LLQE H HDPWRVLVIC+LLN+T+G Q R VI DLF LC +AK ATEV EEIE +I+ L Sbjct: 290 LLQEDHWHDPWRVLVICMLLNKTSGAQTRGVISDLFGLCTDAKTATEVKEEEIENLIKPL 349 Query: 450 GLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYY 271 GLQKKR IQRLS EYL+ESWTHVTQLHG+GKYAADAYAIFC G W+ V+P DHMLNYY Sbjct: 350 GLQKKRTKMIQRLSLEYLQESWTHVTQLHGVGKYAADAYAIFCNGNWDRVKPNDHMLNYY 409 Query: 270 WEFL 259 W++L Sbjct: 410 WDYL 413 >ref|XP_007032156.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma cacao] gi|590648404|ref|XP_007032157.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma cacao] gi|508711185|gb|EOY03082.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma cacao] gi|508711186|gb|EOY03083.1| DNA glycosylase superfamily protein, putative isoform 1 [Theobroma cacao] Length = 382 Score = 235 bits (599), Expect = 6e-59 Identities = 116/174 (66%), Positives = 137/174 (78%) Frame = -1 Query: 780 NGDDFLGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDP 601 N D+ LGG +K V K VL+ASQK EAY+RKT +NTW PP S LLQE H+HDP Sbjct: 203 NKDNILGGMKKAMKPAGV-KPVLSASQKRDEAYQRKTPNNTWIPPRSNAPLLQEDHTHDP 261 Query: 600 WRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAI 421 WRVL+IC+LLN+T+G QAR V+ DLFTLCP+AK ATEV T EIE+ I+ LGLQ+KRA I Sbjct: 262 WRVLLICMLLNKTSGNQARNVLSDLFTLCPDAKTATEVATGEIEKAIKPLGLQRKRAEMI 321 Query: 420 QRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFL 259 QR+S+EYL + WTHVT+LHG+GKYAADAYAIFCTGK + V P DHMLNYYW FL 
Sbjct: 322 QRMSQEYLWKEWTHVTELHGVGKYAADAYAIFCTGKGDRVTPSDHMLNYYWNFL 375 >ref|XP_006593877.1| PREDICTED: axoneme-associated protein mst101(2)-like [Glycine max] Length = 1424 Score = 231 bits (590), Expect = 7e-58 Identities = 132/272 (48%), Positives = 156/272 (57%), Gaps = 16/272 (5%) Frame = -1 Query: 1020 DGKVPRNGV--VVDQVFSKRGRENN---IQYKNAPKK-----------VRVVSPYFLNST 889 DG V NG+ V +V S + +EN K PKK +R VSPYF N Sbjct: 1158 DGNVTENGMINVKRKVISNKLQENGNNATTSKVKPKKKKPLVQKNGHGIRYVSPYFCN-- 1215 Query: 888 VTNEGKDMXXXXXXXXXXXXXXXXXXSHYFHNVPKVNGDDFLGGEIGCIKSKKVNKHVLT 709 N GK + H D + C K Sbjct: 1216 --NSGKKVNVKPFDKGSTSESIA------LHTCKNFVEDKLEENKSNCSNKSIEIKRFPP 1267 Query: 708 ASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWD 529 AS+K EAY+RKT DNTWKPP S L+QE H HDPWRVLVIC+LLNRT G Q ++V+ + Sbjct: 1268 ASEKWDEAYKRKTPDNTWKPPRSEIVLIQEDHLHDPWRVLVICMLLNRTAGGQTKKVVSN 1327 Query: 528 LFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKY 349 F LCP+AK T+V EEIE+ I++LG Q KRA +QRLS EYL+ESWTHVTQLHG+GKY Sbjct: 1328 FFKLCPDAKSCTQVTREEIEKTIKTLGFQHKRAEMLQRLSEEYLDESWTHVTQLHGVGKY 1387 Query: 348 AADAYAIFCTGKWELVEPEDHMLNYYWEFLIS 253 AADAYAIF TG W+ V P DHMLNYYWEFL S Sbjct: 1388 AADAYAIFVTGMWDRVTPTDHMLNYYWEFLHS 1419 >ref|XP_002317727.2| hypothetical protein POPTR_0012s03470g [Populus trichocarpa] gi|550326306|gb|EEE95947.2| hypothetical protein POPTR_0012s03470g [Populus trichocarpa] Length = 229 Score = 231 bits (589), Expect = 9e-58 Identities = 118/177 (66%), Positives = 131/177 (74%), Gaps = 10/177 (5%) Frame = -1 Query: 753 IGCIKSKKVNK-------HVLTAS---QKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHD 604 IG K KK K H T S K EAYERKT++NTWKPP S F L H+HD Sbjct: 48 IGRSKKKKKKKEGTKTSLHSDTTSPYYNKFDEAYERKTAENTWKPPQSEFGFLHN-HAHD 106 Query: 603 PWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVA 424 PWRVLVIC+LLNRT G +A RV+ DLFTLCP+AK AT V TEEIER I+SLGLQK+RA Sbjct: 107 PWRVLVICMLLNRTAGTRAERVVADLFTLCPDAKAATGVATEEIERAIKSLGLQKRRAKM 166 Query: 423 IQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLIS 
253 +QRLS +YLEE WTHVTQL G+GKYAADAYAIFCTGKWE V P DHMLN YWE+L S Sbjct: 167 VQRLSEDYLEEDWTHVTQLPGVGKYAADAYAIFCTGKWEQVRPNDHMLNRYWEYLCS 223 >emb|CBI29440.3| unnamed protein product [Vitis vinifera] Length = 599 Score = 229 bits (584), Expect = 3e-57 Identities = 108/147 (73%), Positives = 124/147 (84%) Frame = -1 Query: 699 KLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFT 520 KL+ AY RK+ DN WKPP S F+LLQE H HDPWRV+VIC+LLN T+GLQA RVI DLFT Sbjct: 441 KLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLLNCTSGLQASRVISDLFT 500 Query: 519 LCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAAD 340 LCP+AK AT+V TE IE+VI++LGLQKKRA IQR SREYL++SWTHVTQLHGIGKYAAD Sbjct: 501 LCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQLHGIGKYAAD 560 Query: 339 AYAIFCTGKWELVEPEDHMLNYYWEFL 259 AYAIFC+G W LV P DHML YW++L Sbjct: 561 AYAIFCSGDWGLVVPNDHMLVKYWKYL 587 >ref|XP_002271845.1| PREDICTED: uncharacterized protein LOC100244192 [Vitis vinifera] Length = 536 Score = 229 bits (584), Expect = 3e-57 Identities = 108/147 (73%), Positives = 124/147 (84%) Frame = -1 Query: 699 KLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLVICLLLNRTTGLQARRVIWDLFT 520 KL+ AY RK+ DN WKPP S F+LLQE H HDPWRV+VIC+LLN T+GLQA RVI DLFT Sbjct: 378 KLNVAYRRKSPDNNWKPPPSHFHLLQEDHYHDPWRVMVICMLLNCTSGLQASRVISDLFT 437 Query: 519 LCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSREYLEESWTHVTQLHGIGKYAAD 340 LCP+AK AT+V TE IE+VI++LGLQKKRA IQR SREYL++SWTHVTQLHGIGKYAAD Sbjct: 438 LCPDAKTATDVPTEMIEKVIETLGLQKKRAAMIQRFSREYLDDSWTHVTQLHGIGKYAAD 497 Query: 339 AYAIFCTGKWELVEPEDHMLNYYWEFL 259 AYAIFC+G W LV P DHML YW++L Sbjct: 498 AYAIFCSGDWGLVVPNDHMLVKYWKYL 524 >ref|XP_007163979.1| hypothetical protein PHAVU_L0004001g, partial [Phaseolus vulgaris] gi|561039879|gb|ESW35973.1| hypothetical protein PHAVU_L0004001g, partial [Phaseolus vulgaris] Length = 715 Score = 229 bits (583), Expect = 4e-57 Identities = 121/233 (51%), Positives = 148/233 (63%), Gaps = 1/233 (0%) Frame = -1 Query: 942 KNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXS-HYFHNVPKVNGDDF 766 KN +R VSPYF N + 
GK++ + +Y + P+ N Sbjct: 492 KNVAHAIRYVSPYFHNDS----GKNIDVKPLDEGSKFESIALHATENYVEDKPEEN---- 543 Query: 765 LGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLV 586 + C + K L+ASQK EAY+RKT D TWKPP S L+QE H+HDPWRVLV Sbjct: 544 ---KSSCSEKSIEIKKNLSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPWRVLV 600 Query: 585 ICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSR 406 IC+LLNRT+G Q + ++ D F LCP+AK TEV EEIE I++LG Q KRA ++RLS Sbjct: 601 ICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLKRLSE 660 Query: 405 EYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLISFY 247 EYL+ESWTHVTQLHG+GKYAADAYAIF TGK + V P DHMLNYYWEFL Y Sbjct: 661 EYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFLRRIY 713 >ref|XP_007163978.1| hypothetical protein PHAVU_L0004001g [Phaseolus vulgaris] gi|561039878|gb|ESW35972.1| hypothetical protein PHAVU_L0004001g [Phaseolus vulgaris] Length = 726 Score = 229 bits (583), Expect = 4e-57 Identities = 121/233 (51%), Positives = 148/233 (63%), Gaps = 1/233 (0%) Frame = -1 Query: 942 KNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXS-HYFHNVPKVNGDDF 766 KN +R VSPYF N + GK++ + +Y + P+ N Sbjct: 503 KNVAHAIRYVSPYFHNDS----GKNIDVKPLDEGSKFESIALHATENYVEDKPEEN---- 554 Query: 765 LGGEIGCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEKHSHDPWRVLV 586 + C + K L+ASQK EAY+RKT D TWKPP S L+QE H+HDPWRVLV Sbjct: 555 ---KSSCSEKSIEIKKNLSASQKWDEAYKRKTPDITWKPPRSATVLIQEDHAHDPWRVLV 611 Query: 585 ICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKKRAVAIQRLSR 406 IC+LLNRT+G Q + ++ D F LCP+AK TEV EEIE I++LG Q KRA ++RLS Sbjct: 612 ICMLLNRTSGRQTKNIVSDFFKLCPDAKSCTEVSREEIEETIKTLGFQHKRAKMLKRLSE 671 Query: 405 EYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLISFY 247 EYL+ESWTHVTQLHG+GKYAADAYAIF TGK + V P DHMLNYYWEFL Y Sbjct: 672 EYLDESWTHVTQLHGVGKYAADAYAIFVTGKSDRVRPTDHMLNYYWEFLRRIY 724 >gb|EPS66392.1| hypothetical protein M569_08394 [Genlisea aurea] Length = 369 Score = 222 bits (565), Expect = 5e-55 Identities = 123/243 (50%), Positives = 148/243 (60%), Gaps = 2/243 (0%) Frame = -1 Query: 969 
RGRENNIQYKNAPKKVRVVSPYFLNSTVTNEGKDMXXXXXXXXXXXXXXXXXXSHYFHNV 790 R ++N + KKV V+ PYF +DM YF + Sbjct: 146 RKKKNIVTEDGCDKKVVVLDPYF--------AEDMSRKKVSP-------------YFQSP 184 Query: 789 PKVNGDDFLGGEI--GCIKSKKVNKHVLTASQKLHEAYERKTSDNTWKPPGSPFYLLQEK 616 K +G D E+ + K K VL++ QK EAYER+T DN W PP SPF LLQE Sbjct: 185 RKTSGSDRGISEVVEESPERSKRWKPVLSSVQKRDEAYERRTPDNEWTPPRSPFNLLQED 244 Query: 615 HSHDPWRVLVICLLLNRTTGLQARRVIWDLFTLCPNAKIATEVGTEEIERVIQSLGLQKK 436 H DPWRVLVIC+LLN+TTG QA RV+ LF LCP AK ATEV ++IE I+ LGLQ+K Sbjct: 245 HMFDPWRVLVICMLLNQTTGRQAFRVLSKLFELCPTAKAATEVARDDIEDAIRCLGLQRK 304 Query: 435 RAVAIQRLSREYLEESWTHVTQLHGIGKYAADAYAIFCTGKWELVEPEDHMLNYYWEFLI 256 RA IQR S EY+ E WTHVT+L GIGKYAADAYAIFCTG+W+ V P DHML YWE+L Sbjct: 305 RAEMIQRFSEEYMSEEWTHVTELPGIGKYAADAYAIFCTGRWQRVRPADHMLVKYWEWLN 364 Query: 255 SFY 247 F+ Sbjct: 365 EFF 367