BLASTX nr result
ID: Wisteria21_contig00002716
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Wisteria21_contig00002716 (1760 letters) Database: ./nr 77,306,371 sequences; 28,104,191,420 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 613 e-172 ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glyc... 563 e-157 ref|XP_013462123.1| HhH-GPD base excision DNA repair family prot... 560 e-156 ref|XP_007136597.1| hypothetical protein PHAVU_009G058200g [Phas... 560 e-156 ref|XP_014501651.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 553 e-154 ref|XP_013462124.1| HhH-GPD base excision DNA repair family prot... 545 e-152 gb|KOM51587.1| hypothetical protein LR48_Vigan09g024600 [Vigna a... 543 e-151 ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 529 e-147 gb|ACU22727.1| unknown [Glycine max] 528 e-147 ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobro... 510 e-141 ref|XP_010102087.1| DNA-3-methyladenine glycosylase 1 [Morus not... 499 e-138 ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 499 e-138 ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citr... 490 e-135 ref|XP_012444918.1| PREDICTED: probable DNA-3-methyladenine glyc... 489 e-135 gb|KDO84582.1| hypothetical protein CISIN_1g039604mg [Citrus sin... 489 e-135 ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 489 e-135 ref|XP_012071867.1| PREDICTED: uncharacterized protein LOC105633... 487 e-134 ref|XP_008466558.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 484 e-133 ref|XP_004147864.1| PREDICTED: DNA-3-methyladenine glycosylase 1... 
483 e-133 ref|XP_012446557.1| PREDICTED: probable DNA-3-methyladenine glyc... 483 e-133 >ref|XP_004501010.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cicer arietinum] Length = 384 Score = 613 bits (1580), Expect = e-172 Identities = 317/384 (82%), Positives = 338/384 (88%), Gaps = 9/384 (2%) Frame = -1 Query: 1439 MGEQTRTQAQ--SLTRTQIEPLPQ--PQGASSVAPTTTTLAASIVPVESELSNVPPQTNS 1272 MGE+T+ Q Q +L T+IEP PQ PQ ASS T A +I+PVESELSNVPP NS Sbjct: 1 MGEETQIQPQPQTLIGTEIEPQPQSQPQEASSNNTVAATAAGAIIPVESELSNVPPHINS 60 Query: 1271 PASKIPLRPRKIRKVSPDPTT-SESQTETPKASTSTGGKTCGRN-NKTVQQQRALVVPRM 1098 PA+KIPLRPRKIRKVSPDPTT SESQ+ETPK++TST GK+CGR+ NK+VQQQRAL+VPR+ Sbjct: 61 PATKIPLRPRKIRKVSPDPTTTSESQSETPKSATSTAGKSCGRHSNKSVQQQRALIVPRI 120 Query: 1097 VARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKA 918 VARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLA+KA Sbjct: 121 VARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKA 180 Query: 917 GTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDST 738 GTSIYTRFIALCGGEA VVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDS Sbjct: 181 GTSIYTRFIALCGGEAGVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSA 240 Query: 737 IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPR 558 IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQ+LYNLEDLPR Sbjct: 241 IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQILYNLEDLPR 300 Query: 557 PSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL---XXXXXXXXXXXXX 387 PSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGA L Sbjct: 301 PSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQHQLEQHQQQQQQQQ 360 Query: 386 XXXXQLLDPINSMFNLGAACAWGQ 315 QL+DP+NSMFN+GAACAWGQ Sbjct: 361 HSQQQLMDPMNSMFNIGAACAWGQ 384 >ref|XP_003523827.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2-like [Glycine max] gi|947112855|gb|KRH61157.1| hypothetical protein GLYMA_04G031600 [Glycine max] Length = 374 Score = 563 bits (1450), Expect = e-157 Identities = 302/390 (77%), Positives = 314/390 (80%), Gaps = 15/390 (3%) Frame = -1 Query: 1439 
MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260 MGEQT QAQSL IEP P P +S+ P T V+SEL+NVP T SPA+K Sbjct: 1 MGEQTLGQAQSL----IEPQPLPAPSSTAVPDGAT-------VDSELNNVPRPTTSPATK 49 Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRAL-VVPRMVARSL 1083 IPLRPRKIRKVSPDP+TSESQTETPK + KT GRN RAL VVPR+VARSL Sbjct: 50 IPLRPRKIRKVSPDPSTSESQTETPKPA-----KTGGRNTTKAAPPRALTVVPRIVARSL 104 Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903 SC+GEVEIALRYLRNADP+LSPLIDIHQPP+FDNFHTPFLALTRSILYQQLAYKAGTSIY Sbjct: 105 SCDGEVEIALRYLRNADPVLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIY 164 Query: 902 TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723 TRFIALCGGE VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMD Sbjct: 165 TRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMD 224 Query: 722 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD Sbjct: 225 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 284 Query: 542 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL--------------XXXXXXX 405 QLC+KWRPYRSVASWYMWRFVEAKGTPSSAVAVATGA L Sbjct: 285 QLCDKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQQQHHQHHHQHQQQEQQQQQ 344 Query: 404 XXXXXXXXXXQLLDPINSMFNLGAACAWGQ 315 QLLDPINSMFNLGAACAWGQ Sbjct: 345 QQQQQHPPQPQLLDPINSMFNLGAACAWGQ 374 >ref|XP_013462123.1| HhH-GPD base excision DNA repair family protein [Medicago truncatula] gi|657396011|gb|KEH36158.1| HhH-GPD base excision DNA repair family protein [Medicago truncatula] Length = 377 Score = 560 bits (1444), Expect = e-156 Identities = 287/373 (76%), Positives = 314/373 (84%), Gaps = 12/373 (3%) Frame = -1 Query: 1397 TQIEPLPQPQGASSVAPTTTTLAA----SIVPVESELSNVPPQTNSPASKIPLRPRKIRK 1230 TQ +P PQPQG SS + T A +I+PVESELSNVPP T +PA+K+PLRPRKIRK Sbjct: 5 TQQQPQPQPQGVSSDNTISATTAVDSVQTIIPVESELSNVPPHTKAPATKMPLRPRKIRK 64 Query: 1229 VSPDPTTSESQTETPKASTSTG-GKTCGRNNKTVQ--QQRALVVPRMVARSLSCEGEVEI 1059 VSPDPTTSESQ+ET K ST GK+ GRNNKTVQ QQR L VP++V RSLSCEGEVEI 
Sbjct: 65 VSPDPTTSESQSETLKPPNSTAAGKSNGRNNKTVQPPQQRTLAVPKIVPRSLSCEGEVEI 124 Query: 1058 ALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIYTRFIALCG 879 A+RYLR+ADPLLSPLIDIHQPP+FDNF TPFLALTRSILYQQLA+KAGTSIYTRFIALCG Sbjct: 125 AIRYLRSADPLLSPLIDIHQPPTFDNFQTPFLALTRSILYQQLAFKAGTSIYTRFIALCG 184 Query: 878 GEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMDDKSLFTML 699 GEA VVP+ VLAL QQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMDDKSLFTML Sbjct: 185 GEAGVVPDNVLALTAQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTML 244 Query: 698 TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMDQLCEKWRP 519 TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQ+LYNL+DLPRPSQMDQLCEKW+P Sbjct: 245 TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQILYNLDDLPRPSQMDQLCEKWKP 304 Query: 518 YRSVASWYMWRFVEAKGTPSSAVAVATG-----ASLXXXXXXXXXXXXXXXXXQLLDPIN 354 YRSVASWY+WRFVEAKG+PS+AVAVATG L ++DP+N Sbjct: 305 YRSVASWYLWRFVEAKGSPSTAVAVATGNGLQQHELDHHQQQQQQQQQQHSQQPIMDPMN 364 Query: 353 SMFNLGAACAWGQ 315 +MFN+GAACAWGQ Sbjct: 365 NMFNMGAACAWGQ 377 >ref|XP_007136597.1| hypothetical protein PHAVU_009G058200g [Phaseolus vulgaris] gi|561009684|gb|ESW08591.1| hypothetical protein PHAVU_009G058200g [Phaseolus vulgaris] Length = 366 Score = 560 bits (1444), Expect = e-156 Identities = 300/381 (78%), Positives = 311/381 (81%), Gaps = 6/381 (1%) Frame = -1 Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260 MGE T QAQSL IEP P P +SS A A +SEL+NV P NSPASK Sbjct: 1 MGEHTLGQAQSL----IEPQPHPVPSSSAA------APDGAQADSELNNVLPHANSPASK 50 Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRAL-VVPRMVARSL 1083 IPLRPRKIRKVSPDP+TSESQTE PK GK+ GR+ K V R + V+PR+VARSL Sbjct: 51 IPLRPRKIRKVSPDPSTSESQTEPPKP-----GKSGGRSTKHVPPSRGMSVLPRLVARSL 105 Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903 SCEGEVEIALR+LRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLAYKAGTSIY Sbjct: 106 SCEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIY 165 Query: 902 TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723 TRFIALCGGE VVPETVLAL 
PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMD Sbjct: 166 TRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMD 225 Query: 722 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD Sbjct: 226 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 285 Query: 542 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL-----XXXXXXXXXXXXXXXX 378 LCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGA L Sbjct: 286 HLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQQHHHQHQQHEQQQQQHPPQ 345 Query: 377 XQLLDPINSMFNLGAACAWGQ 315 QLLDPINSMFNLGAACAWGQ Sbjct: 346 PQLLDPINSMFNLGAACAWGQ 366 >ref|XP_014501651.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Vigna radiata var. radiata] Length = 367 Score = 553 bits (1425), Expect = e-154 Identities = 297/382 (77%), Positives = 309/382 (80%), Gaps = 7/382 (1%) Frame = -1 Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260 MGE T QAQSL IEP P P AP+++ +SEL+NV P NSPASK Sbjct: 1 MGEHTLGQAQSL----IEPQPHP------APSSSAAGPDGAQADSELNNVLPHVNSPASK 50 Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRAL-VVPRMVARSL 1083 IPLRPRKIRKVSPDP+TSES TE K GK GR+ K V RA+ VVPR+VARSL Sbjct: 51 IPLRPRKIRKVSPDPSTSESLTEPSKP-----GKNGGRSTKHVPPSRAMTVVPRLVARSL 105 Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903 S EGEVEIALR+LRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLAYKAGTSIY Sbjct: 106 SYEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIY 165 Query: 902 TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723 TRFIALCGGE VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSD+ IVNMD Sbjct: 166 TRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDAAIVNMD 225 Query: 722 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD Sbjct: 226 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 285 Query: 542 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL------XXXXXXXXXXXXXXX 381 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGA L 
Sbjct: 286 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQQQHHHQHQQHEQQQQQQHPP 345 Query: 380 XXQLLDPINSMFNLGAACAWGQ 315 QLLDPINSMFNLGAACAWGQ Sbjct: 346 QPQLLDPINSMFNLGAACAWGQ 367 >ref|XP_013462124.1| HhH-GPD base excision DNA repair family protein [Medicago truncatula] gi|657396012|gb|KEH36159.1| HhH-GPD base excision DNA repair family protein [Medicago truncatula] Length = 375 Score = 545 bits (1405), Expect = e-152 Identities = 281/368 (76%), Positives = 308/368 (83%), Gaps = 12/368 (3%) Frame = -1 Query: 1397 TQIEPLPQPQGASSVAPTTTTLAA----SIVPVESELSNVPPQTNSPASKIPLRPRKIRK 1230 TQ +P PQPQG SS + T A +I+PVESELSNVPP T +PA+K+PLRPRKIRK Sbjct: 5 TQQQPQPQPQGVSSDNTISATTAVDSVQTIIPVESELSNVPPHTKAPATKMPLRPRKIRK 64 Query: 1229 VSPDPTTSESQTETPKASTSTG-GKTCGRNNKTVQ--QQRALVVPRMVARSLSCEGEVEI 1059 VSPDPTTSESQ+ET K ST GK+ GRNNKTVQ QQR L VP++V RSLSCEGEVEI Sbjct: 65 VSPDPTTSESQSETLKPPNSTAAGKSNGRNNKTVQPPQQRTLAVPKIVPRSLSCEGEVEI 124 Query: 1058 ALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIYTRFIALCG 879 A+RYLR+ADPLLSPLIDIHQPP+FDNF TPFLALTRSILYQQLA+KAGTSIYTRFIALCG Sbjct: 125 AIRYLRSADPLLSPLIDIHQPPTFDNFQTPFLALTRSILYQQLAFKAGTSIYTRFIALCG 184 Query: 878 GEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMDDKSLFTML 699 GEA VVP+ VLAL QQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMDDKSLFTML Sbjct: 185 GEAGVVPDNVLALTAQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMDDKSLFTML 244 Query: 698 TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMDQLCEKWRP 519 TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQ+LYNL+DLPRPSQMDQLCEKW+P Sbjct: 245 TMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQILYNLDDLPRPSQMDQLCEKWKP 304 Query: 518 YRSVASWYMWRFVEAKGTPSSAVAVATG-----ASLXXXXXXXXXXXXXXXXXQLLDPIN 354 YRSVASWY+WRFVEAKG+PS+AVAVATG L ++DP+N Sbjct: 305 YRSVASWYLWRFVEAKGSPSTAVAVATGNGLQQHELDHHQQQQQQQQQQHSQQPIMDPMN 364 Query: 353 SMFNLGAA 330 +MFN+G A Sbjct: 365 NMFNMGCA 372 >gb|KOM51587.1| hypothetical protein LR48_Vigan09g024600 [Vigna angularis] Length = 362 Score = 543 bits (1400), Expect = e-151 Identities = 293/377 (77%), Positives = 305/377 (80%), Gaps = 9/377 
(2%) Frame = -1 Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260 MGE T QAQSL IEP P P AP+++ +SEL+NV P NSPASK Sbjct: 1 MGEHTLGQAQSL----IEPQPHP------APSSSAAGPDGAQADSELNNVLPHVNSPASK 50 Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRAL-VVPRMVARSL 1083 IPLRPRKIRKVSPDP+TSES TE K GK+ GR+ K V RA+ VVPR+VARSL Sbjct: 51 IPLRPRKIRKVSPDPSTSESLTEPSKP-----GKSGGRSTKHVPPSRAMAVVPRLVARSL 105 Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903 SCEGEVEIALR+LRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLAYKAGTSIY Sbjct: 106 SCEGEVEIALRFLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKAGTSIY 165 Query: 902 TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723 TRFIALCGGE VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD Sbjct: 166 TRFIALCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 225 Query: 722 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD Sbjct: 226 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 285 Query: 542 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL--------XXXXXXXXXXXXX 387 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGA L Sbjct: 286 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGAGLQHQHHHQHQQHEQQQQQQQQH 345 Query: 386 XXXXQLLDPINSMFNLG 336 QLLDPINSMFNLG Sbjct: 346 PPQPQLLDPINSMFNLG 362 >ref|XP_003528090.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like [Glycine max] gi|947103464|gb|KRH51847.1| hypothetical protein GLYMA_06G031700 [Glycine max] Length = 351 Score = 529 bits (1363), Expect = e-147 Identities = 283/382 (74%), Positives = 298/382 (78%), Gaps = 7/382 (1%) Frame = -1 Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260 MGEQT QAQ PLP P A++ SEL+NVP T SPA+K Sbjct: 1 MGEQTLGQAQ--------PLPAPDAATA---------------HSELNNVPQPTTSPATK 37 Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRN--NKTVQQQRALVVPRMVARS 1086 IPLRPRKIRKVSPDP+TSE+ + K GRN +K + VVPR+VARS Sbjct: 38 
IPLRPRKIRKVSPDPSTSEAPIKP--------AKPVGRNTTSKAAPPRALTVVPRIVARS 89 Query: 1085 LSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSI 906 LSC+GEVEI+LRYLRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLA+KAGTSI Sbjct: 90 LSCDGEVEISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGTSI 149 Query: 905 YTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNM 726 YTRFI LCGGE VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNM Sbjct: 150 YTRFIGLCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNM 209 Query: 725 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 546 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM Sbjct: 210 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 269 Query: 545 DQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL-----XXXXXXXXXXXXXXX 381 DQLC+KWRPYRSVASWYMWRFVEAKGTPSSAV VATGA L Sbjct: 270 DQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVTVATGAGLQQQRHHQHQQQEQQQQQHAP 329 Query: 380 XXQLLDPINSMFNLGAACAWGQ 315 QLLDPINSMFNLGAACAWGQ Sbjct: 330 QPQLLDPINSMFNLGAACAWGQ 351 >gb|ACU22727.1| unknown [Glycine max] Length = 351 Score = 528 bits (1359), Expect = e-147 Identities = 282/382 (73%), Positives = 297/382 (77%), Gaps = 7/382 (1%) Frame = -1 Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260 MGEQT QAQ PLP P A++ SEL+NVP T SPA+K Sbjct: 1 MGEQTLGQAQ--------PLPAPDAATA---------------HSELNNVPQPTTSPATK 37 Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRN--NKTVQQQRALVVPRMVARS 1086 IPLRPRKIRKVSPDP+TSE+ + K GRN +K + VVPR+VARS Sbjct: 38 IPLRPRKIRKVSPDPSTSEAPIKP--------AKPVGRNTTSKAAPPRALTVVPRIVARS 89 Query: 1085 LSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSI 906 LSC+GEVEI+LRYLRNADPLLSPLIDIHQPP+FDNFHTPFLALTRSILYQQLA+KAGTSI Sbjct: 90 LSCDGEVEISLRYLRNADPLLSPLIDIHQPPTFDNFHTPFLALTRSILYQQLAFKAGTSI 149 Query: 905 YTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNM 726 YTRFI LCGGE VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNM Sbjct: 150 YTRFIGLCGGENGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNM 209 Query: 725 
DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 546 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM Sbjct: 210 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 269 Query: 545 DQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL-----XXXXXXXXXXXXXXX 381 DQLC+KWRPYRSVASWYMWRFVEAKGTPSSAV VATGA L Sbjct: 270 DQLCDKWRPYRSVASWYMWRFVEAKGTPSSAVTVATGAGLQQQRHHQHQQQEQQQQQHAP 329 Query: 380 XXQLLDPINSMFNLGAACAWGQ 315 QLLDPINSMFNLGA CAWGQ Sbjct: 330 QPQLLDPINSMFNLGAVCAWGQ 351 >ref|XP_007017553.1| DNA glycosylase superfamily protein [Theobroma cacao] gi|508722881|gb|EOY14778.1| DNA glycosylase superfamily protein [Theobroma cacao] Length = 397 Score = 510 bits (1314), Expect = e-141 Identities = 274/377 (72%), Positives = 301/377 (79%), Gaps = 6/377 (1%) Frame = -1 Query: 1427 TRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASKIPLR 1248 T TQ QS +TQ + P S+ A +TT +A + +EL+NVPPQT+SP SKIP R Sbjct: 25 TPTQEQSQGQTQTQ---NPNNTSNAAVSTTVTSAVVTSAPTELTNVPPQTSSPPSKIPFR 81 Query: 1247 PRKIRKVSPDPT--TSESQTETPKASTSTGG-KTCGRNNKT-VQQQRAL-VVPRMVARSL 1083 PRKIRK+SPDP T+ SQ T A+++T KT + KT + Q RAL VVPR++ARSL Sbjct: 82 PRKIRKLSPDPNSDTNASQQATTSATSATEPPKTVAKTPKTKLTQHRALAVVPRIMARSL 141 Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903 SCEGEVE A+R+LRNADPLL+ LIDIH PP+FD FHTPFLALTRSILYQQLA+KAGTSIY Sbjct: 142 SCEGEVETAIRHLRNADPLLASLIDIHPPPTFDTFHTPFLALTRSILYQQLAFKAGTSIY 201 Query: 902 TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723 RFIALCGGE VVPETVL+L QQLRQIGVSGRKASYLHDLARKYQ GILSDS IVNMD Sbjct: 202 NRFIALCGGENGVVPETVLSLTAQQLRQIGVSGRKASYLHDLARKYQTGILSDSAIVNMD 261 Query: 722 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE+LPRPSQMD Sbjct: 262 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPRPSQMD 321 Query: 542 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL-XXXXXXXXXXXXXXXXXQLL 366 QLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GASL QLL Sbjct: 322 
QLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGASLPPPQQEEQQQHQQHQQQPQLL 381 Query: 365 DPINSMFNLGAACAWGQ 315 DPINS+ NLG ACAWGQ Sbjct: 382 DPINSILNLG-ACAWGQ 397 >ref|XP_010102087.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis] gi|587903719|gb|EXB91937.1| DNA-3-methyladenine glycosylase 1 [Morus notabilis] Length = 451 Score = 499 bits (1285), Expect = e-138 Identities = 257/344 (74%), Positives = 278/344 (80%), Gaps = 6/344 (1%) Frame = -1 Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260 MGEQT+TQ Q+ Q Q +S V +TT A +ELSN P QT+SP SK Sbjct: 1 MGEQTQTQTQTQQPQQHHGQTQESSSSMVTSISTTTIAPSSTAPTELSNAPSQTSSPPSK 60 Query: 1259 IPLRPRKIRKVSPDPTTSESQT-----ETPKASTSTGGKTCGRNNKTVQQQR-ALVVPRM 1098 IPLRPRKIRK+SPD + S+S E PK S + K VQQ+ A+ PR+ Sbjct: 61 IPLRPRKIRKLSPDDSDSKSSQVVAVPENPKPSPTAAAAAKPAKAKIVQQRALAIAAPRI 120 Query: 1097 VARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKA 918 VARSLSCEGEVE+ALR+LR ADPLL+PLIDIHQPP+FDNFHTPFLALTRSILYQQLAYKA Sbjct: 121 VARSLSCEGEVEVALRHLRRADPLLAPLIDIHQPPTFDNFHTPFLALTRSILYQQLAYKA 180 Query: 917 GTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDST 738 GTSIYTRFIALCGGE VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS Sbjct: 181 GTSIYTRFIALCGGETGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSA 240 Query: 737 IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPR 558 IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE+LPR Sbjct: 241 IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEELPR 300 Query: 557 PSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL 426 PSQMDQLCEKWRPYRSVA+WYMWRFVE KG P +A VA GA+L Sbjct: 301 PSQMDQLCEKWRPYRSVAAWYMWRFVEQKGAPPNAATVAVGANL 344 >ref|XP_006473513.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X2 [Citrus sinensis] Length = 371 Score = 499 bits (1285), Expect = e-138 Identities = 265/383 (69%), Positives = 297/383 (77%), Gaps = 8/383 (2%) Frame = -1 Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSN--------VPP 1284 M EQT++Q Q+ Q EP QP +TTTLA ++PV++E +N V P Sbjct: 1 
MVEQTQSQTQNQPEPQPEPETQPPPNQD---STTTLA--VIPVQTETANNATITHANVTP 55 Query: 1283 QTNSPASKIPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVP 1104 QT+SP SKIPLRPRKIRK+SPD ++ + P S+ ++ QQQ+ L VP Sbjct: 56 QTSSPPSKIPLRPRKIRKLSPDNGVDQASSSQPTESSKATSAKSTKSRAIQQQQQTLTVP 115 Query: 1103 RMVARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAY 924 R++AR LS EGEVE A+R+LRNAD L+ LIDIH PP+FD+FHTPFLALTRSILYQQLA+ Sbjct: 116 RIIARPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAF 175 Query: 923 KAGTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSD 744 KAGTSIYTRFIALCGGEA VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSD Sbjct: 176 KAGTSIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSD 235 Query: 743 STIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDL 564 S IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLY+LE+L Sbjct: 236 SAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEEL 295 Query: 563 PRPSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXXXX 384 PRPSQMDQLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GA+L Sbjct: 296 PRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGAAL------PQPQQEEQ 349 Query: 383 XXXQLLDPINSMFNLGAACAWGQ 315 QLLD INS+ N+G ACAWGQ Sbjct: 350 QQPQLLDQINSLINIG-ACAWGQ 371 >ref|XP_006435004.1| hypothetical protein CICLE_v10001539mg [Citrus clementina] gi|557537126|gb|ESR48244.1| hypothetical protein CICLE_v10001539mg [Citrus clementina] Length = 373 Score = 490 bits (1261), Expect = e-135 Identities = 256/369 (69%), Positives = 288/369 (78%), Gaps = 8/369 (2%) Frame = -1 Query: 1418 QAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSN--------VPPQTNSPAS 1263 Q QS T+ Q EP P+P+ +TT A +++PV+SE +N V PQT+SP S Sbjct: 4 QTQSQTQNQPEPQPEPETQPPPNQDSTT-ALAVIPVQSETANNATITHANVTPQTSSPPS 62 Query: 1262 KIPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVPRMVARSL 1083 KIPLRPRKIRK+SPD ++ + P S+ ++ QQQ+ L VPR++AR L Sbjct: 63 KIPLRPRKIRKLSPDNGVDQTSSSQPTESSKATSAKSTKSRAIQQQQQTLTVPRIIARPL 122 Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903 S EGEVE A+R+LRNAD L+ 
LIDIH PP+FD+FHTPFLALTRSILYQQLA+KAGTSIY Sbjct: 123 SSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGTSIY 182 Query: 902 TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723 TRFIALCGGEA VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMD Sbjct: 183 TRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMD 242 Query: 722 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLY+LE+LPRPSQMD Sbjct: 243 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPSQMD 302 Query: 542 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXXXXXXXQLLD 363 QLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GA+L QLLD Sbjct: 303 QLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGAAL------PQPQQEEQQQPQLLD 356 Query: 362 PINSMFNLG 336 INS+ N+G Sbjct: 357 QINSLINIG 365 >ref|XP_012444918.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 isoform X2 [Gossypium raimondii] gi|763791263|gb|KJB58259.1| hypothetical protein B456_009G201500 [Gossypium raimondii] Length = 395 Score = 489 bits (1258), Expect = e-135 Identities = 268/397 (67%), Positives = 298/397 (75%), Gaps = 22/397 (5%) Frame = -1 Query: 1439 MGEQTRTQAQ------------SLTRTQIEPLPQPQGASSVAP--TTTTLAASIVPV-ES 1305 MGEQT +Q Q + T+ Q++ SS AP T TT +IV + Sbjct: 1 MGEQTPSQPQPQVQSQPPNDSSTTTQAQVQTQSGDPNNSSTAPVSTVTTACTAIVACGPT 60 Query: 1304 ELSNVPPQTNSPASKIPLRPRKIRKVSPD----PTTSESQTETPKASTSTGGKTCGRNNK 1137 EL NVP T SP SKIP RPRKIRK+SPD P S+ T + S + KT GR +K Sbjct: 61 ELVNVPLSTLSPPSKIPSRPRKIRKLSPDLSFDPNASQQATTSSSTSLTEQRKTVGRTSK 120 Query: 1136 T-VQQQRALVV--PRMVARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPF 966 T + Q RAL V PR+++RSLSCEGEVE A+ +LR+ADPLL+ LID+H PP+FD FH PF Sbjct: 121 TKLSQHRALAVVAPRIISRSLSCEGEVENAIHHLRDADPLLASLIDLHPPPTFDTFHAPF 180 Query: 965 LALTRSILYQQLAYKAGTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYL 786 LALTRSILYQQLA+KAGTSIYTRFI+LCGGE VVPETVL+L QQLRQIGVSGRKASYL Sbjct: 181 LALTRSILYQQLAFKAGTSIYTRFISLCGGENGVVPETVLSLTSQQLRQIGVSGRKASYL 240 Query: 785 
HDLARKYQNGILSDSTIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLG 606 HDLARKYQ GILSDS IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLG Sbjct: 241 HDLARKYQTGILSDSAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLG 300 Query: 605 VRKGVQLLYNLEDLPRPSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL 426 VRKGVQLLYNLE+LPRPSQMDQLCEKWRPYRSVASWY+WR+VEAKG PSSA AVA GASL Sbjct: 301 VRKGVQLLYNLEELPRPSQMDQLCEKWRPYRSVASWYLWRYVEAKGAPSSAAAVAAGASL 360 Query: 425 XXXXXXXXXXXXXXXXXQLLDPINSMFNLGAACAWGQ 315 QL+DPINS+ NLG ACAWGQ Sbjct: 361 -PPLQQQEEPQQHQQQPQLMDPINSILNLG-ACAWGQ 395 >gb|KDO84582.1| hypothetical protein CISIN_1g039604mg [Citrus sinensis] Length = 373 Score = 489 bits (1258), Expect = e-135 Identities = 255/369 (69%), Positives = 288/369 (78%), Gaps = 8/369 (2%) Frame = -1 Query: 1418 QAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSN--------VPPQTNSPAS 1263 Q QS T+ Q EP P+P+ +TT A +++PV++E +N V PQT+SP S Sbjct: 4 QTQSQTQNQPEPQPEPETQPPPNQDSTT-ALAVIPVQTETANNATITHANVTPQTSSPPS 62 Query: 1262 KIPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVPRMVARSL 1083 KIPLRPRKIRK+SPD ++ + P S+ ++ QQQ+ L VPR++AR L Sbjct: 63 KIPLRPRKIRKLSPDNGVDQASSSQPTESSKATSAKSTKSRAIQQQQQTLTVPRIIARPL 122 Query: 1082 SCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSIY 903 S EGEVE A+R+LRNAD L+ LIDIH PP+FD+FHTPFLALTRSILYQQLA+KAGTSIY Sbjct: 123 SSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAFKAGTSIY 182 Query: 902 TRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNMD 723 TRFIALCGGEA VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSDS IVNMD Sbjct: 183 TRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSDSAIVNMD 242 Query: 722 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQMD 543 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLY+LE+LPRPSQMD Sbjct: 243 DKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEELPRPSQMD 302 Query: 542 QLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXXXXXXXQLLD 363 QLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GA+L QLLD Sbjct: 303 QLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGAAL------PQPQQEEQQQPQLLD 356 Query: 362 
PINSMFNLG 336 INS+ N+G Sbjct: 357 QINSLINIG 365 >ref|XP_006473512.1| PREDICTED: DNA-3-methyladenine glycosylase 1-like isoform X1 [Citrus sinensis] Length = 373 Score = 489 bits (1258), Expect = e-135 Identities = 259/376 (68%), Positives = 291/376 (77%), Gaps = 8/376 (2%) Frame = -1 Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSN--------VPP 1284 M EQT++Q Q+ Q EP QP +TTTLA ++PV++E +N V P Sbjct: 1 MVEQTQSQTQNQPEPQPEPETQPPPNQD---STTTLA--VIPVQTETANNATITHANVTP 55 Query: 1283 QTNSPASKIPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVP 1104 QT+SP SKIPLRPRKIRK+SPD ++ + P S+ ++ QQQ+ L VP Sbjct: 56 QTSSPPSKIPLRPRKIRKLSPDNGVDQASSSQPTESSKATSAKSTKSRAIQQQQQTLTVP 115 Query: 1103 RMVARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAY 924 R++AR LS EGEVE A+R+LRNAD L+ LIDIH PP+FD+FHTPFLALTRSILYQQLA+ Sbjct: 116 RIIARPLSSEGEVEAAIRHLRNADRQLASLIDIHPPPTFDSFHTPFLALTRSILYQQLAF 175 Query: 923 KAGTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSD 744 KAGTSIYTRFIALCGGEA VVPETVLAL PQQLRQIGVSGRKASYLHDLARKYQNGILSD Sbjct: 176 KAGTSIYTRFIALCGGEAGVVPETVLALTPQQLRQIGVSGRKASYLHDLARKYQNGILSD 235 Query: 743 STIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDL 564 S IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLY+LE+L Sbjct: 236 SAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYSLEEL 295 Query: 563 PRPSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXXXX 384 PRPSQMDQLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GA+L Sbjct: 296 PRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGAAL------PQPQQEEQ 349 Query: 383 XXXQLLDPINSMFNLG 336 QLLD INS+ N+G Sbjct: 350 QQPQLLDQINSLINIG 365 >ref|XP_012071867.1| PREDICTED: uncharacterized protein LOC105633802 [Jatropha curcas] gi|643731174|gb|KDP38512.1| hypothetical protein JCGZ_04437 [Jatropha curcas] Length = 406 Score = 487 bits (1253), Expect = e-134 Identities = 260/385 (67%), Positives = 302/385 (78%), Gaps = 7/385 (1%) Frame = -1 Query: 1448 HTHMGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSP 1269 H+H Q+++ ++ +TQ +P P ++ 
TTT +EL+ +P QT SP Sbjct: 40 HSHSQPQSQSPIKAQVQTQTQPQPLHDSTTTSTITTT----------NELTTIPQQTVSP 89 Query: 1268 ASKIP-LRPRKIRKVSPDPT-TSESQTETPKASTSTGG--KTCGRNNKT-VQQQRALVV- 1107 +KIP RPRKIRK+SPD T T+ + + + +T+T KT ++ KT + Q +A+VV Sbjct: 90 PAKIPPSRPRKIRKLSPDDTATTATDPNSSQLTTTTNEPPKTTAKSAKTRIAQTKAIVVA 149 Query: 1106 -PRMVARSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQL 930 PR++ RSLSCEGEVE A+R+LR+ADPLL+ LID+H PP+FD FHTPFLALTRSILYQQL Sbjct: 150 PPRIIPRSLSCEGEVENAIRHLRDADPLLASLIDLHPPPTFDTFHTPFLALTRSILYQQL 209 Query: 929 AYKAGTSIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGIL 750 A+KAGTSIYTRFIALCGGEA V+P TVL+L PQQLRQIGVSGRKASYLHDLARKY NGIL Sbjct: 210 AFKAGTSIYTRFIALCGGEAGVLPGTVLSLTPQQLRQIGVSGRKASYLHDLARKYHNGIL 269 Query: 749 SDSTIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE 570 SD+ IVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE Sbjct: 270 SDTAIVNMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLE 329 Query: 569 DLPRPSQMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXX 390 DLPRPSQMDQLCEKWRPYRSVASWY+WRFVEAKG+PSSAVAVATGA + Sbjct: 330 DLPRPSQMDQLCEKWRPYRSVASWYLWRFVEAKGSPSSAVAVATGAGM-------TQQQQ 382 Query: 389 XXXXXQLLDPINSMFNLGAACAWGQ 315 QLLDPINS+ NLG ACAWGQ Sbjct: 383 EEQQPQLLDPINSILNLG-ACAWGQ 406 >ref|XP_008466558.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis melo] Length = 379 Score = 484 bits (1245), Expect = e-133 Identities = 257/380 (67%), Positives = 283/380 (74%), Gaps = 5/380 (1%) Frame = -1 Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260 MGEQT+ Q Q+ T++Q +P Q Q + +TT A + SE+ N P Q +SP SK Sbjct: 1 MGEQTQVQVQTQTQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSK 60 Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVPRMV--ARS 1086 +PLRPRKIRK+SP+ + S T N QQRA V ARS Sbjct: 61 MPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAQQRAAFASATVPLARS 120 Query: 1085 LSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSI 906 LSCEGEVEIALR+LRNADPLL+ LID+HQ P+FD+F TPFLALTRSILYQQLAYKAGTSI Sbjct: 121 
LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180 Query: 905 YTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNM 726 YTRFIALCGGEA V+PETVL+LNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD IVNM Sbjct: 181 YTRFIALCGGEAGVLPETVLSLNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240 Query: 725 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 546 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL VRKGVQLLYNLE+LPRPSQM Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300 Query: 545 DQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL---XXXXXXXXXXXXXXXXX 375 DQLCEKWRPYRSV SWYMWR EAKG SSA AVA GASL Sbjct: 301 DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQQQDHHQEHQHPQHPQQP 360 Query: 374 QLLDPINSMFNLGAACAWGQ 315 QLLDP+N + NLG ACAWGQ Sbjct: 361 QLLDPLNGILNLG-ACAWGQ 379 >ref|XP_004147864.1| PREDICTED: DNA-3-methyladenine glycosylase 1 [Cucumis sativus] gi|700204833|gb|KGN59966.1| hypothetical protein Csa_3G857070 [Cucumis sativus] Length = 382 Score = 483 bits (1244), Expect = e-133 Identities = 258/383 (67%), Positives = 283/383 (73%), Gaps = 8/383 (2%) Frame = -1 Query: 1439 MGEQTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASK 1260 MGEQT+ Q Q+ T++Q +P Q Q + +TT A + SE+ N P Q +SP SK Sbjct: 1 MGEQTQVQVQTQTQSQPQPQSQAQNTFHESSNSTTPIAQATVMLSEVMNAPSQISSPPSK 60 Query: 1259 IPLRPRKIRKVSPDPTTSESQTETPKASTSTGGKTCGRNNKTVQQQRALVVPRMV--ARS 1086 +PLRPRKIRK+SP+ + S T N QRA V ARS Sbjct: 61 MPLRPRKIRKLSPEESDPNSSHVVAIPDGPKPIATVKSNKSKTAHQRAAFASATVPPARS 120 Query: 1085 LSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGTSI 906 LSCEGEVEIALR+LRNADPLL+ LID+HQ P+FD+F TPFLALTRSILYQQLAYKAGTSI Sbjct: 121 LSCEGEVEIALRHLRNADPLLAQLIDLHQRPTFDSFQTPFLALTRSILYQQLAYKAGTSI 180 Query: 905 YTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIVNM 726 YTRFIALCGGEA V+PETVLALNPQQLRQIG+SGRK+SYLHDLARKYQNGILSD IVNM Sbjct: 181 YTRFIALCGGEAGVLPETVLALNPQQLRQIGISGRKSSYLHDLARKYQNGILSDPAIVNM 240 Query: 725 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPSQM 546 
DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDL VRKGVQLLYNLE+LPRPSQM Sbjct: 241 DDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLNVRKGVQLLYNLEELPRPSQM 300 Query: 545 DQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASL------XXXXXXXXXXXXXX 384 DQLCEKWRPYRSV SWYMWR EAKG SSA AVA GASL Sbjct: 301 DQLCEKWRPYRSVGSWYMWRLAEAKGASSSAAAVAAGASLQLQHQDHHQEHQHPQHPQHP 360 Query: 383 XXXQLLDPINSMFNLGAACAWGQ 315 QLLDP+NS+ NLG ACAWGQ Sbjct: 361 QQPQLLDPLNSILNLG-ACAWGQ 382 >ref|XP_012446557.1| PREDICTED: probable DNA-3-methyladenine glycosylase 2 [Gossypium raimondii] gi|763792804|gb|KJB59800.1| hypothetical protein B456_009G273100 [Gossypium raimondii] Length = 396 Score = 483 bits (1243), Expect = e-133 Identities = 262/379 (69%), Positives = 293/379 (77%), Gaps = 7/379 (1%) Frame = -1 Query: 1430 QTRTQAQSLTRTQIEPLPQPQGASSVAPTTTTLAASIVPVESELSNVPPQTNSPASKIPL 1251 QT+ QAQ+LT+T+ ++ APT TT A +V +EL++ P T+SP SKIP Sbjct: 28 QTQRQAQTLTKTE----NSNDAFAAAAPTVTT--ALVVSASTELTDGSPLTSSPPSKIPS 81 Query: 1250 RPRKIRKVSPDPTTS-----ESQTETPKASTSTGGKTCGRNNKT-VQQQRALVV-PRMVA 1092 RPRKIRK+SPD + ++ T T S + KT R K + Q RALVV P+ A Sbjct: 82 RPRKIRKLSPDSNSEPNASQQATTSTTSTSVAVPLKTVPRAPKAKLSQHRALVVAPQFFA 141 Query: 1091 RSLSCEGEVEIALRYLRNADPLLSPLIDIHQPPSFDNFHTPFLALTRSILYQQLAYKAGT 912 RSLSCEGEVE A+R+LRNADPLL+ LID+H PP+FD F TPFLALTRSILYQQLA+KAGT Sbjct: 142 RSLSCEGEVETAVRHLRNADPLLASLIDLHPPPTFDTFQTPFLALTRSILYQQLAFKAGT 201 Query: 911 SIYTRFIALCGGEAAVVPETVLALNPQQLRQIGVSGRKASYLHDLARKYQNGILSDSTIV 732 SIYTRFIALCGGE VVPETVL+L PQQLRQIGVSGRKASYLHDLARKYQ GILSDS IV Sbjct: 202 SIYTRFIALCGGENGVVPETVLSLTPQQLRQIGVSGRKASYLHDLARKYQTGILSDSAIV 261 Query: 731 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGVRKGVQLLYNLEDLPRPS 552 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLG+RKGVQLLY+LE+LPRPS Sbjct: 262 NMDDKSLFTMLTMVNGIGSWSVHMFMIFSLHRPDVLPINDLGIRKGVQLLYSLEELPRPS 321 Query: 551 QMDQLCEKWRPYRSVASWYMWRFVEAKGTPSSAVAVATGASLXXXXXXXXXXXXXXXXXQ 372 QMDQLCEKWRPYRSVASWY+WRFVEAKG PSSA AVA GASL Q Sbjct: 322 QMDQLCEKWRPYRSVASWYLWRFVEAKGAPSSAAAVAAGASL---QPLPQEEHQHQQQPQ 378 Query: 371 
LLDPINSMFNLGAACAWGQ 315 LLD INS+ +LG AC WGQ Sbjct: 379 LLDSINSILDLG-ACTWGQ 396
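The hit summary at the top of this report uses the legacy BLAST shorthand in which an E-value printed as e-172 stands for 1e-172. Below is a minimal sketch, in Python, of how one line of that summary could be read. It assumes a copy of the report that still has its original line breaks (one hit per line). The names `HIT_LINE`, `parse_evalue`, and `parse_hit_line` are hypothetical helpers written for this illustration and are not part of any BLAST toolkit.

```python
import re

# One line of the "Sequences producing significant alignments" table:
# accession, free-text description, bit score, E-value.
# (Hypothetical pattern for illustration; it expects "db|accession|" identifiers.)
HIT_LINE = re.compile(
    r"^(?P<acc>\S+\|\S+\|)\s*(?P<desc>.*?)\s+"
    r"(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_evalue(text):
    """Convert a legacy BLAST E-value string to a float ("e-172" means 1e-172)."""
    if text.startswith("e"):
        text = "1" + text
    return float(text)


def parse_hit_line(line):
    """Return (accession, description, bit_score, evalue), or None if no match."""
    match = HIT_LINE.match(line.strip())
    if match is None:
        return None
    return (
        match.group("acc"),
        match.group("desc"),
        float(match.group("bits")),
        parse_evalue(match.group("evalue")),
    )


# Example using one summary line from the report above.
hit = parse_hit_line("gb|ACU22727.1| unknown [Glycine max] 528 e-147")
print(hit)
```

Lines that do not look like a hit (for example the `Query=` or `Database:` lines) return `None`, so the helper can be applied to every line of the summary block and the non-matching lines filtered out.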