BLASTX nr result
ID: Sinomenium22_contig00048923
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00048923 (766 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256...  202   1e-49
emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera]  201   2e-49
ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594...  192   8e-47
ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791...  192   1e-46
ref|XP_007022761.1| DNA glycosylase superfamily protein isoform ...  190   4e-46
ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246...  189   1e-45
ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811...  188   2e-45
ref|XP_007133461.1| hypothetical protein PHAVU_011G180500g [Phas...  187   3e-45
ref|XP_002312220.1| methyladenine glycosylase family protein [Po...  187   4e-45
ref|XP_002315089.2| methyladenine glycosylase family protein [Po...  186   6e-45
ref|XP_003545728.1| PREDICTED: uncharacterized protein LOC100793...  185   2e-44
ref|XP_007149154.1| hypothetical protein PHAVU_005G045900g [Phas...  184   2e-44
ref|XP_007211731.1| hypothetical protein PRUPE_ppa006139mg [Prun...  184   3e-44
ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Popu...  182   9e-44
ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Popu...  182   9e-44
ref|XP_007022762.1| DNA glycosylase superfamily protein isoform ...  182   1e-43
gb|AFK37052.1| unknown [Medicago truncatula]                         181   2e-43
ref|XP_003595924.1| DNA-3-methyladenine glycosylase [Medicago tr...  180   6e-43
ref|XP_007212311.1| hypothetical protein PRUPE_ppa006731mg [Prun...
178 2e-42 ref|XP_003531809.1| PREDICTED: uncharacterized protein LOC100793... 178 2e-42 >ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256507 [Vitis vinifera] gi|297738175|emb|CBI27376.3| unnamed protein product [Vitis vinifera] Length = 398 Score = 202 bits (514), Expect = 1e-49 Identities = 122/243 (50%), Positives = 140/243 (57%), Gaps = 5/243 (2%) Frame = +2 Query: 53 VDMAPQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXR-----TRDNVXX 217 +D+ P AQINGRP LQP NR+P SLER + T N Sbjct: 12 IDITPSKAQINGRPALQPTCNRIP-SLERHHSFKKISPKSPTSPLPASPPPPTTIINTTK 70 Query: 218 XXXXXXXXXXXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSG 397 GND NGLNSS +K L TPR T K ++ +K K Sbjct: 71 TKPSLTPPASPNLKSPRQPALKRGNDPNGLNSSLEKVL--TPRGTTKSSSSPKKTKK--- 125 Query: 398 SANGVCGAAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKS 577 C A +AP ++++LNYSSSLIVEAPGSIAAARREQ+ ++Q QRKMRIAHYGRTKS Sbjct: 126 -----CSAGLAPSSDTSSLNYSSSLIVEAPGSIAAARREQMAIMQVQRKMRIAHYGRTKS 180 Query: 578 GKFEEKIVPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFE 757 K+EEKI P+D EKRCSFITPNSDP YV YHDEEWGVPVHDDK LFE Sbjct: 181 AKYEEKIGPVD------PLVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVHDDKRLFE 234 Query: 758 LLV 766 LLV Sbjct: 235 LLV 237 >emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera] Length = 398 Score = 201 bits (511), Expect = 2e-49 Identities = 121/243 (49%), Positives = 139/243 (57%), Gaps = 5/243 (2%) Frame = +2 Query: 53 VDMAPQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXR-----TRDNVXX 217 +D+ P AQINGRP LQP NR+P SLER + T N Sbjct: 12 IDITPSKAQINGRPALQPTCNRIP-SLERHHSFKKISPKSPTSPLPASLPPPTTIINTTK 70 Query: 218 XXXXXXXXXXXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSG 397 GND NGLNSS +K L TPR T K ++ +K K Sbjct: 71 TKPSLTPPASPNLKSPRQPALKRGNDPNGLNSSLEKVL--TPRGTTKSSSSPKKTKK--- 125 Query: 398 SANGVCGAAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKS 577 C A +AP ++++LNYSSS IVEAPGSIAAARREQ+ ++Q QRKMRIAHYGRTKS Sbjct: 126 -----CSAGLAPSSDTSSLNYSSSFIVEAPGSIAAARREQMAIMQVQRKMRIAHYGRTKS 180 Query: 578 
GKFEEKIVPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFE 757 K+EEKI P+D EKRCSFITPNSDP YV YHDEEWGVPVHDDK LFE Sbjct: 181 AKYEEKISPVD------PLVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVHDDKRLFE 234 Query: 758 LLV 766 LLV Sbjct: 235 LLV 237 >ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594852 [Solanum tuberosum] Length = 395 Score = 192 bits (489), Expect = 8e-47 Identities = 121/238 (50%), Positives = 139/238 (58%), Gaps = 3/238 (1%) Frame = +2 Query: 62 APQ-VAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXXXX 238 +PQ ++QINGRPVLQP N VP+ ERRN T+ Sbjct: 11 SPQTLSQINGRPVLQPHSNIVPL-YERRNSLKKTTNTAASVTANGSTKVKTSSSTTPPVS 69 Query: 239 XXXXXXXXXXXXXXXXGN--DHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANGV 412 GN D NGL+SSA+K VTP+ T A ++ KK KKS Sbjct: 70 PKMKSPRLPAIKR---GNNIDPNGLSSSAEKI--VTPKGTANKAPILLKKPKKSSG---- 120 Query: 413 CGAAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEE 592 G A P VE+++L YSSSLIVEAPGSIAAARREQV + Q QRKM+IAHYGRTKS K+E Sbjct: 121 -GLASPPYVENSSLKYSSSLIVEAPGSIAAARREQVAIAQVQRKMKIAHYGRTKSAKYEG 179 Query: 593 KIVPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 K+ LD EKRCSFITPNSDPLY+ YHDEEWGVPVHDD LLFELLV Sbjct: 180 KVSSLD-PSFASAVIPNPREEKRCSFITPNSDPLYIAYHDEEWGVPVHDDNLLFELLV 236 >ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791725 [Glycine max] Length = 400 Score = 192 bits (487), Expect = 1e-46 Identities = 119/234 (50%), Positives = 140/234 (59%) Frame = +2 Query: 65 PQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXXXXXX 244 P VA+INGRPVLQP NRVP +LERRN Sbjct: 21 PSVARINGRPVLQPTCNRVP-NLERRNSIKKVAPPKSLSPPSPPLPSKTSLTPPVSPKLK 79 Query: 245 XXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANGVCGAA 424 GND+NGLNSS +K V PR++ K TL RKKSK G+ Sbjct: 80 SPRLPATKR-----GNDNNGLNSSYEKI--VIPRSSTKTPTLERKKSKSFKE-----GSC 127 Query: 425 VAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEKIVP 604 V+ +E A+L+YSSSLI ++PGSIAA RREQ+ L QAQRKM+IAHYGR+KS KFE ++VP Sbjct: 128 VSASIE-ASLSYSSSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFE-RVVP 185 
Query: 605 LDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 LD EKRCSFITPNSDP+Y+ YHDEEWGVPVHDDK+LFELLV Sbjct: 186 LDPSNTSLASKPTE-EEKRCSFITPNSDPIYIAYHDEEWGVPVHDDKMLFELLV 238 >ref|XP_007022761.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508722389|gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 409 Score = 190 bits (483), Expect = 4e-46 Identities = 123/248 (49%), Positives = 144/248 (58%), Gaps = 10/248 (4%) Frame = +2 Query: 53 VDMAPQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXR--------TRDN 208 V++ P VA+INGRPVLQP NRVP SL+RRN T N Sbjct: 12 VEITPAVARINGRPVLQPTCNRVP-SLDRRNSLKKIPPLSPPTPPSLASTLPATSATVGN 70 Query: 209 VXXXXXXXXXXXXXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSK 388 G+D N LN+S++K + TPR K TL RKKSK Sbjct: 71 GGRAKASLTPPISPKSKSPRPAAIKRGSDPNALNTSSEKVM--TPRNITK--TLERKKSK 126 Query: 389 --KSGSANGVCGAAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHY 562 K G NG+ + + P +L+YSSSLIVEAPGSIAA RREQ+ L QAQRKM+IAHY Sbjct: 127 SFKEGMGNGL-SSWIEP-----SLSYSSSLIVEAPGSIAAVRREQMALQQAQRKMKIAHY 180 Query: 563 GRTKSGKFEEKIVPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDD 742 GR+KS KFE K+VPL+ EKRCSFITPNSDP+YV YHDEEWGVPVHDD Sbjct: 181 GRSKSAKFESKVVPLN---TSSAMTKPDEEEKRCSFITPNSDPVYVAYHDEEWGVPVHDD 237 Query: 743 KLLFELLV 766 +LFELLV Sbjct: 238 SMLFELLV 245 >ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246304 [Solanum lycopersicum] Length = 395 Score = 189 bits (479), Expect = 1e-45 Identities = 118/237 (49%), Positives = 137/237 (57%), Gaps = 2/237 (0%) Frame = +2 Query: 62 APQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXXXXX 241 A ++QINGRPVLQP N VP+ ERRN T+ + Sbjct: 12 AQTLSQINGRPVLQPHSNIVPL-YERRNSLKKTTHTAAPVTANGSTKVKMSSSTTPPVSP 70 Query: 242 XXXXXXXXXXXXXXXGN--DHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANGVC 415 GN D NGL+SSA+K VTP+ T A ++ KK KKS Sbjct: 71 KMKSPRLPAIKR---GNNIDPNGLSSSAEKI--VTPKGTANKAPILLKKPKKSSG----- 120 Query: 416 GAAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEK 595 G A 
VE+++L YSSSLIVEAPGSIAAARREQV + Q QRKM+IAHYGRTKS K+E K Sbjct: 121 GLASPSSVENSSLKYSSSLIVEAPGSIAAARREQVAIAQVQRKMKIAHYGRTKSAKYEGK 180 Query: 596 IVPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 + LD +KRCSFITPNSDPLY+ YHDEEWGVPVHDD LLFELLV Sbjct: 181 VSSLD-PSFASAVIPNPREDKRCSFITPNSDPLYIAYHDEEWGVPVHDDNLLFELLV 236 >ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811352 [Glycine max] Length = 400 Score = 188 bits (477), Expect = 2e-45 Identities = 118/234 (50%), Positives = 139/234 (59%) Frame = +2 Query: 65 PQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXXXXXX 244 P VA+INGRPVLQP NRVP +LERRN Sbjct: 26 PSVARINGRPVLQPTCNRVP-NLERRNSIKKVAPAKSLSPPSPPLPSKTSLTPPVSPKSK 84 Query: 245 XXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANGVCGAA 424 GND+NGLNSS +K V PR++ K TL RKKSK G+ Sbjct: 85 SPRLPATKR-----GNDNNGLNSSYEKI--VIPRSSIKTPTLERKKSKSFKE-----GSC 132 Query: 425 VAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEKIVP 604 V+ +E A+L+YSSSLI ++PGSIAA RREQ+ L QAQRKM+IAHYGR+KS KFE ++VP Sbjct: 133 VSASIE-ASLSYSSSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFE-RVVP 190 Query: 605 LDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 LD EKRCSFIT NSDP+Y+ YHDEEWGVPVHDDK+LFELLV Sbjct: 191 LDPSNTSLASKPTE-EEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLV 243 >ref|XP_007133461.1| hypothetical protein PHAVU_011G180500g [Phaseolus vulgaris] gi|561006461|gb|ESW05455.1| hypothetical protein PHAVU_011G180500g [Phaseolus vulgaris] Length = 392 Score = 187 bits (476), Expect = 3e-45 Identities = 118/231 (51%), Positives = 134/231 (58%) Frame = +2 Query: 74 AQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXXXXXXXXX 253 A+INGRPVLQP NRVP +LERRN Sbjct: 23 ARINGRPVLQPTCNRVP-NLERRNSIKKVSPKSPFPPSPP------LPIKTSLTPPVSPK 75 Query: 254 XXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANGVCGAAVAP 433 GND NGLNSS++K VTPR T K TL RKKSK G C V+ Sbjct: 76 SESPRPPPIKRGNDSNGLNSSSEKI--VTPRHTIKTPTLERKKSKSF--KEGSCCTLVSS 131 Query: 434 LVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEKIVPLDX 
613 A+L+YSS+LI E+PGSIAA RREQ+ L AQRKM+IAHYGR+KS KFE K+VPLD Sbjct: 132 ASIEASLSYSSTLITESPGSIAAVRREQMALQHAQRKMKIAHYGRSKSAKFE-KVVPLDI 190 Query: 614 XXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 EKRCSFIT NSDP+Y+ YHDEEWGVPVHDDK+LFELLV Sbjct: 191 STNLTSKTCE--EEKRCSFITANSDPVYIAYHDEEWGVPVHDDKMLFELLV 239 >ref|XP_002312220.1| methyladenine glycosylase family protein [Populus trichocarpa] gi|118486806|gb|ABK95238.1| unknown [Populus trichocarpa] gi|222852040|gb|EEE89587.1| methyladenine glycosylase family protein [Populus trichocarpa] Length = 403 Score = 187 bits (475), Expect = 4e-45 Identities = 118/238 (49%), Positives = 136/238 (57%) Frame = +2 Query: 53 VDMAPQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXX 232 VD+ P VA+INGRPVLQP N V +LERRN Sbjct: 12 VDITPAVARINGRPVLQPTCNLVS-TLERRNSLKKTAPKSSPPPPPPPP--TFSNKTNKA 68 Query: 233 XXXXXXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANGV 412 G+D N LNSS++K V PR T K TL RKKSK ++ Sbjct: 69 SPPLSPMSKSPRLPAIKRGSDANSLNSSSEKV--VIPRNTTKTPTLERKKSKSFKESS-- 124 Query: 413 CGAAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEE 592 G V A+L+YSSSLIVEAPGSIAA RREQ+ L AQRKMRIAHYGR+KS +FE+ Sbjct: 125 VGRGVHSSFIEASLSYSSSLIVEAPGSIAAVRREQMALQHAQRKMRIAHYGRSKSARFED 184 Query: 593 KIVPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 ++VP D EKRCSFIT NSDP+YV YHDEEWGVPVHDDK+LFELLV Sbjct: 185 QVVPND-SSISMATKTDQEEEKRCSFITANSDPIYVAYHDEEWGVPVHDDKMLFELLV 241 >ref|XP_002315089.2| methyladenine glycosylase family protein [Populus trichocarpa] gi|550330066|gb|EEF01260.2| methyladenine glycosylase family protein [Populus trichocarpa] Length = 411 Score = 186 bits (473), Expect = 6e-45 Identities = 119/245 (48%), Positives = 140/245 (57%), Gaps = 7/245 (2%) Frame = +2 Query: 53 VDMAPQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXX 232 V++ P VA+INGRPVLQP NRVP +LER N Sbjct: 12 VEITPAVARINGRPVLQPTCNRVP-TLERHNSLKKTAPKSPPPPPPPLPPPTSANKTNKA 70 Query: 233 XXXXXXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANGV 412 G+D N 
LNSS+DK V PR+T K L RKKSK S V Sbjct: 71 SPPLSPKSKSPRLPAIKRGSDANSLNSSSDKV--VIPRSTAKTPILERKKSK-SFKETSV 127 Query: 413 CGAAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEE 592 A++ +E A+L+YSSSLIVEAPGSIAA RREQ+ L AQRKMRIAHYGR+KS +FE Sbjct: 128 GSGALSSSIE-ASLSYSSSLIVEAPGSIAAVRREQMALQHAQRKMRIAHYGRSKSSRFEA 186 Query: 593 KIVPLDXXXXXXXXXXXXXREKRCSFITPNS-------DPLYVTYHDEEWGVPVHDDKLL 751 K+VP+D EKRCSFIT NS +P+YV YHD+EWGVPVHDDK+L Sbjct: 187 KVVPVD--SSINVTTKTDEEEKRCSFITANSGKEKYEMNPIYVAYHDKEWGVPVHDDKML 244 Query: 752 FELLV 766 FELLV Sbjct: 245 FELLV 249 >ref|XP_003545728.1| PREDICTED: uncharacterized protein LOC100793449 [Glycine max] Length = 398 Score = 185 bits (469), Expect = 2e-44 Identities = 115/232 (49%), Positives = 134/232 (57%) Frame = +2 Query: 71 VAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXXXXXXXX 250 VA+INGRPVLQP NRVP +LERRN Sbjct: 20 VARINGRPVLQPTCNRVP-NLERRNSIKKLSPKSRSPPSPPLLSKT------SLTPPVSP 72 Query: 251 XXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANGVCGAAVA 430 GN+ NGLNSS++K VTPR T K TL RKKSK G CGA Sbjct: 73 KSKSPRPPPIKRGNESNGLNSSSEKI--VTPRNTIKTPTLERKKSKSF--KEGSCGALGL 128 Query: 431 PLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEKIVPLD 610 A+L+YSS+LI E+PGSIAA RREQ+ L AQRKM+IAHYGR+KS KF +++PL+ Sbjct: 129 SASTEASLSYSSTLITESPGSIAAVRREQMALQHAQRKMKIAHYGRSKSAKFA-RVIPLE 187 Query: 611 XXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 EKRCSFIT NSDP+Y+ YHDEEWGVPVHDDK+LFELLV Sbjct: 188 PSTNLTSKTS---EEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLV 236 >ref|XP_007149154.1| hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris] gi|561022418|gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris] Length = 405 Score = 184 bits (468), Expect = 2e-44 Identities = 115/236 (48%), Positives = 136/236 (57%), Gaps = 2/236 (0%) Frame = +2 Query: 65 PQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXXXXXX 244 P VA+INGRPVLQP NRVP +LERRN Sbjct: 27 PSVARINGRPVLQPTCNRVP-NLERRNSIKKVQPPKSLSPPSPPLSSKTSLTPPVSPKSK 85 Query: 245 
XXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSK--KSGSANGVCG 418 GND+NGLN+S +K P+++ K TL RKKSK K GS Sbjct: 86 SPRLPAVKR-----GNDNNGLNTSYEKI--AIPKSSSKAPTLERKKSKSFKEGSC----- 133 Query: 419 AAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEKI 598 AP A+ +Y+SSLI ++PGSIAA RREQ+ L QAQRKM+IAHYGR+KS KFE ++ Sbjct: 134 ---APASTEASFSYASSLITDSPGSIAAVRREQMALQQAQRKMKIAHYGRSKSAKFE-RV 189 Query: 599 VPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 VPLD EKRCSFIT NSDP+Y+ YHDEEWGVPVHDDK+LFELLV Sbjct: 190 VPLDPSTTTLTSKPTE-EEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLV 244 >ref|XP_007211731.1| hypothetical protein PRUPE_ppa006139mg [Prunus persica] gi|462407596|gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus persica] Length = 426 Score = 184 bits (467), Expect = 3e-44 Identities = 123/260 (47%), Positives = 142/260 (54%), Gaps = 22/260 (8%) Frame = +2 Query: 53 VDMAPQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXX 232 V++ P VA+INGRPVLQP NRVP SL+RRN T Sbjct: 12 VEVTPMVARINGRPVLQPTCNRVP-SLDRRNSIKKISTPRAPPPPPLPTSSASSTSPRIS 70 Query: 233 XXXXXXXXXXXXXXXXXX-------GNDHNGLNSSADKALPVTPRATPKLATLVRKKSKK 391 GND NGLNSS++K VTP T + L RKKSK Sbjct: 71 NKASSLLTPPISPKSKSPRPPAIKRGNDPNGLNSSSEKV--VTPGGTTRAKILERKKSKS 128 Query: 392 SGSAN-GVCGAAV--------------APLVESATLNYSSSLIVEAPGSIAAARREQVTL 526 A+ GV GA+ + L A+L+YSSSLI EAPGSIAA RREQ+ L Sbjct: 129 FKRASVGVDGASADLHHHGDFSAGGFSSSLNIEASLSYSSSLITEAPGSIAAVRREQMAL 188 Query: 527 IQAQRKMRIAHYGRTKSGKFEEKIVPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTY 706 AQRKMRIAHYGR+KS FE ++VP+D EKRCSFIT NSDP+YV Y Sbjct: 189 QHAQRKMRIAHYGRSKSANFE-RVVPVDASGNIEAKGAE--EEKRCSFITANSDPIYVAY 245 Query: 707 HDEEWGVPVHDDKLLFELLV 766 HDEEWGVPVHDDK+LFELLV Sbjct: 246 HDEEWGVPVHDDKMLFELLV 265 >ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] gi|550343248|gb|EEE78698.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] Length = 420 Score = 182 bits (463), Expect = 9e-44 Identities = 99/159 (62%), Positives = 115/159 (72%) Frame = 
+2 Query: 287 GNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANGVCGAAVAPLVESATLNYSS 466 GN+ GLN+SA+K L TPR+T K+ T KKSKKS +A V V++ + YSS Sbjct: 101 GNEPGGLNTSAEKVL--TPRSTTKVTTSTVKKSKKSSTAG------VPHSVDTFAMKYSS 152 Query: 467 SLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEKIVPLDXXXXXXXXXXXX 646 SL+VEAPGSIAAARREQV ++Q QRKMRIAHYGRTKS K++ KIVP + Sbjct: 153 SLLVEAPGSIAAARREQVAVMQEQRKMRIAHYGRTKSAKYQGKIVPAN----SPATSTIT 208 Query: 647 XREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELL 763 EKRCSFITPNSDP+YV YHDEEWGVPVHDDKLLFELL Sbjct: 209 REEKRCSFITPNSDPVYVAYHDEEWGVPVHDDKLLFELL 247 >ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] gi|550343247|gb|EEE78699.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] Length = 417 Score = 182 bits (463), Expect = 9e-44 Identities = 99/159 (62%), Positives = 115/159 (72%) Frame = +2 Query: 287 GNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANGVCGAAVAPLVESATLNYSS 466 GN+ GLN+SA+K L TPR+T K+ T KKSKKS +A V V++ + YSS Sbjct: 101 GNEPGGLNTSAEKVL--TPRSTTKVTTSTVKKSKKSSTAG------VPHSVDTFAMKYSS 152 Query: 467 SLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEKIVPLDXXXXXXXXXXXX 646 SL+VEAPGSIAAARREQV ++Q QRKMRIAHYGRTKS K++ KIVP + Sbjct: 153 SLLVEAPGSIAAARREQVAVMQEQRKMRIAHYGRTKSAKYQGKIVPAN----SPATSTIT 208 Query: 647 XREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELL 763 EKRCSFITPNSDP+YV YHDEEWGVPVHDDKLLFELL Sbjct: 209 REEKRCSFITPNSDPVYVAYHDEEWGVPVHDDKLLFELL 247 >ref|XP_007022762.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao] gi|508722390|gb|EOY14287.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao] Length = 413 Score = 182 bits (461), Expect = 1e-43 Identities = 122/252 (48%), Positives = 143/252 (56%), Gaps = 14/252 (5%) Frame = +2 Query: 53 VDMAPQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXR--------TRDN 208 V++ P VA+INGRPVLQP NRVP SL+RRN T N Sbjct: 12 VEITPAVARINGRPVLQPTCNRVP-SLDRRNSLKKIPPLSPPTPPSLASTLPATSATVGN 70 Query: 209 VXXXXXXXXXXXXXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSK 388 G+D N LN+S++K + TPR K TL RKKSK Sbjct: 
71 GGRAKASLTPPISPKSKSPRPAAIKRGSDPNALNTSSEKVM--TPRNITK--TLERKKSK 126 Query: 389 --KSGSANGVCGAAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHY 562 K G NG+ + + P +L+YSSSLIVEAPGSIAA RREQ+ L QAQRKM+IAHY Sbjct: 127 SFKEGMGNGL-SSWIEP-----SLSYSSSLIVEAPGSIAAVRREQMALQQAQRKMKIAHY 180 Query: 563 GRTKSGKFEEKIVPLDXXXXXXXXXXXXXREKRCSFITPNSD----PLYVTYHDEEWGVP 730 GR+KS KFE K+VPL+ EKRCSFITPNS P+YV YHDEEWGVP Sbjct: 181 GRSKSAKFESKVVPLN---TSSAMTKPDEEEKRCSFITPNSGIAIYPVYVAYHDEEWGVP 237 Query: 731 VHDDKLLFELLV 766 VHDD +LFELLV Sbjct: 238 VHDDSMLFELLV 249 >gb|AFK37052.1| unknown [Medicago truncatula] Length = 390 Score = 181 bits (460), Expect = 2e-43 Identities = 114/237 (48%), Positives = 135/237 (56%), Gaps = 2/237 (0%) Frame = +2 Query: 62 APQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXXXXX 241 AP VA+INGRPVLQP N VP +LERRN + Sbjct: 18 APHVARINGRPVLQPTCNHVP-NLERRNSIKKSTPKSLSPLPLPNKTNT--SSLTPPISP 74 Query: 242 XXXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSK--KSGSANGVC 415 GND+NGLN S +K P+ K TL RKKSK K GS Sbjct: 75 KPKSPTSTRPLAIKRGNDNNGLNLSCEKIS--IPKNIMKTPTLERKKSKSFKEGSFG--- 129 Query: 416 GAAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEK 595 +E+A+L+YSSSLI ++PGSIAA RREQV L QAQRKM+IAHYGR+KS KF E+ Sbjct: 130 -------IEAASLSYSSSLITDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAKF-ER 181 Query: 596 IVPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 + P+D EKRCSFIT NSDP+Y+ YHDEEWGVPVHDDK+LFELL+ Sbjct: 182 VFPID-PSSALDSKITNQEEKRCSFITTNSDPIYIAYHDEEWGVPVHDDKMLFELLI 237 >ref|XP_003595924.1| DNA-3-methyladenine glycosylase [Medicago truncatula] gi|355484972|gb|AES66175.1| DNA-3-methyladenine glycosylase [Medicago truncatula] Length = 390 Score = 180 bits (456), Expect = 6e-43 Identities = 113/236 (47%), Positives = 134/236 (56%), Gaps = 2/236 (0%) Frame = +2 Query: 65 PQVAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXXXXXX 244 P VA+INGRPVLQP N VP +LERRN + Sbjct: 19 PHVARINGRPVLQPTCNHVP-NLERRNSIKKSTPKSLSPLPLPNKTNT--SSLTPPISPK 75 Query: 245 
XXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSK--KSGSANGVCG 418 GND+NGLN S +K P+ K TL RKKSK K GS Sbjct: 76 PKSPTSTRPLAIKRGNDNNGLNLSCEKIS--IPKNIMKTPTLERKKSKSFKEGSFG---- 129 Query: 419 AAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEKI 598 +E+A+L+YSSSLI ++PGSIAA RREQV L QAQRKM+IAHYGR+KS KF E++ Sbjct: 130 ------IEAASLSYSSSLITDSPGSIAAVRREQVALQQAQRKMKIAHYGRSKSAKF-ERV 182 Query: 599 VPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 P+D EKRCSFIT NSDP+Y+ YHDEEWGVPVHDDK+LFELL+ Sbjct: 183 FPID-PSSALDSKTTNQEEKRCSFITTNSDPIYIAYHDEEWGVPVHDDKMLFELLI 237 >ref|XP_007212311.1| hypothetical protein PRUPE_ppa006731mg [Prunus persica] gi|462408176|gb|EMJ13510.1| hypothetical protein PRUPE_ppa006731mg [Prunus persica] Length = 397 Score = 178 bits (452), Expect = 2e-42 Identities = 113/239 (47%), Positives = 132/239 (55%), Gaps = 5/239 (2%) Frame = +2 Query: 65 PQVAQINGRPVLQPGGNRVPM-----SLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXX 229 P ++N RPVLQP GN+ P SL++ + +T+ ++ Sbjct: 16 PSTPKMNRRPVLQPTGNQFPSLEQRKSLKKSSQEPLAPTPLPSPLPSAKTKASLSPPISP 75 Query: 230 XXXXXXXXXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSKKSGSANG 409 G D N LNSSA+K VTPR T K + V+K K SGS Sbjct: 76 KLPSPRPPAFKR-------GKDPNELNSSAEKV--VTPRCTTKFTSSVKKSKKSSGSV-- 124 Query: 410 VCGAAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFE 589 A AP ES N SS LIVEAPGSIAAARREQV +Q QRKMRIAHYGRTKS K E Sbjct: 125 ----AAAPSAESILKNISS-LIVEAPGSIAAARREQVATMQEQRKMRIAHYGRTKSAKNE 179 Query: 590 EKIVPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 K+VPLD ++RC+FITPNSDP+YV YHDEEWGVPVHDD LL ELLV Sbjct: 180 GKVVPLDASPTTDFGRD----QRRCTFITPNSDPIYVAYHDEEWGVPVHDDNLLLELLV 234 >ref|XP_003531809.1| PREDICTED: uncharacterized protein LOC100793991 [Glycine max] Length = 400 Score = 178 bits (451), Expect = 2e-42 Identities = 115/236 (48%), Positives = 135/236 (57%), Gaps = 4/236 (1%) Frame = +2 Query: 71 VAQINGRPVLQPGGNRVPMSLERRNXXXXXXXXXXXXXXXXRTRDNVXXXXXXXXXXXXX 250 VA+INGRPVLQP NR P +LERRN Sbjct: 21 
VARINGRPVLQPTCNRFP-NLERRNSIKKLSPKSPCPPSPPLPSKT------SLAPLVSP 73 Query: 251 XXXXXXXXXXXXGNDHNGLNSSADKALPVTPRATPKLATLVRKKSK----KSGSANGVCG 418 GN+ GLNSS++K VTPR T K TL RKKSK +S A G+ Sbjct: 74 KSKSPRPPPIKRGNESTGLNSSSEKI--VTPRNTIKTPTLERKKSKSFKERSYDALGLSA 131 Query: 419 AAVAPLVESATLNYSSSLIVEAPGSIAAARREQVTLIQAQRKMRIAHYGRTKSGKFEEKI 598 + A+L+YSS+LI E+PGSIAA RREQ+ L AQRKM+IAHYGR+KS KFE ++ Sbjct: 132 ST------EASLSYSSNLITESPGSIAAVRREQMALQHAQRKMKIAHYGRSKSAKFE-RV 184 Query: 599 VPLDXXXXXXXXXXXXXREKRCSFITPNSDPLYVTYHDEEWGVPVHDDKLLFELLV 766 VPLD EKRCSFIT NSDP+Y+ YHDEEWGVPVHDDK+LFELLV Sbjct: 185 VPLDPSSNLTSKTSE--EEKRCSFITANSDPIYIAYHDEEWGVPVHDDKMLFELLV 238