BLASTX nr result
ID: Catharanthus22_contig00029337
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00029337 (970 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002276173.1| PREDICTED: probable GMP synthase [glutamine-... 342 2e-91 ref|XP_004251752.1| PREDICTED: probable GMP synthase [glutamine-... 337 4e-90 ref|XP_006350099.1| PREDICTED: uncharacterized protein LOC102595... 337 5e-90 ref|XP_003549544.1| PREDICTED: uncharacterized protein LOC100785... 336 7e-90 ref|XP_004507736.1| PREDICTED: probable GMP synthase [glutamine-... 333 6e-89 gb|ESW26879.1| hypothetical protein PHAVU_003G156200g [Phaseolus... 332 2e-88 gb|EXB96612.1| Putative Glutamine amidotransferase [Morus notabi... 331 2e-88 gb|EMJ06013.1| hypothetical protein PRUPE_ppa026720mg [Prunus pe... 330 5e-88 ref|XP_002324538.1| methyladenine glycosylase family protein [Po... 330 6e-88 ref|XP_003610321.1| Methyladenine glycosylase protein-like prote... 330 6e-88 gb|EOY29555.1| DNA glycosylase superfamily protein isoform 1 [Th... 329 1e-87 ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [R... 329 1e-87 ref|XP_002309346.1| methyladenine glycosylase family protein [Po... 328 2e-87 ref|XP_006453620.1| hypothetical protein CICLE_v10008612mg [Citr... 326 9e-87 ref|XP_003519147.1| PREDICTED: uncharacterized protein LOC100783... 325 1e-86 ref|XP_006473998.1| PREDICTED: uncharacterized protein LOC102607... 325 2e-86 ref|XP_004287130.1| PREDICTED: uncharacterized protein LOC101313... 321 3e-85 ref|XP_003527169.1| PREDICTED: uncharacterized protein LOC100801... 318 1e-84 gb|AFW58766.1| hypothetical protein ZEAMMB73_734031 [Zea mays] 298 2e-78 ref|XP_004976128.1| PREDICTED: uncharacterized protein LOC101782... 298 2e-78 >ref|XP_002276173.1| PREDICTED: probable GMP synthase [glutamine-hydrolyzing] [Vitis vinifera] gi|297743642|emb|CBI36525.3| unnamed protein product [Vitis vinifera] Length = 375 Score = 342 bits (876), Expect = 2e-91 Identities = 166/233 (71%), Positives = 190/233 (81%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 GL K+RC+WVTP TD SY FHDEEWGVPVHDDKKLFELLVL GAL+ELTWP+ILSKRH Sbjct: 145 GLKAKRRCAWVTPNTDLSYIAFHDEEWGVPVHDDKKLFELLVLSGALAELTWPTILSKRH 204 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVFADFDP VAKLNEKK++APGS ENARQ+SK+++EFGSFD+ Sbjct: 205 IFREVFADFDPIAVAKLNEKKLMAPGSIASSLISELKLRGIIENARQMSKVIDEFGSFDE 264 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVNHKP+VS FRYPR VPVKTPKADVISKDL+RRGFR VGPT++YSFMQ AGITND Sbjct: 265 YIWSFVNHKPIVSRFRYPRHVPVKTPKADVISKDLVRRGFRSVGPTVIYSFMQVAGITND 324 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RFQ+C A+E KE+ IT + E++ VIESE+ R+ DELS SSE Sbjct: 325 HLISCFRFQDCVTAAEVKEE--EITTGAAEEKKSNVIESELSRAIDELSFSSE 375 >ref|XP_004251752.1| PREDICTED: probable GMP synthase [glutamine-hydrolyzing]-like [Solanum lycopersicum] Length = 372 Score = 337 bits (864), Expect = 4e-90 Identities = 164/233 (70%), Positives = 187/233 (80%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KKRC+WVTP TD SYA FHDEEWGVPVHDDKKLFELLVLCGAL+ELTWPSIL KRH Sbjct: 141 GSQSKKRCAWVTPNTDPSYANFHDEEWGVPVHDDKKLFELLVLCGALAELTWPSILCKRH 200 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVFADFDP +VAKLNEKK LAPG T ENARQ+ K+++EFGSFDK Sbjct: 201 IFREVFADFDPIVVAKLNEKKTLAPGGTACSLLSELKLRGIIENARQMLKVIDEFGSFDK 260 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVNHKP+VS FRYPRQVPVKT KAD+ISKDL+RRGFR VGPT+VYSFMQ AGITND Sbjct: 261 YIWSFVNHKPIVSGFRYPRQVPVKTAKADLISKDLIRRGFRGVGPTVVYSFMQVAGITND 320 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RF +C ++E KE ++ D + + ++ E+EICRS D+LS SSE Sbjct: 321 HLISCFRFPDCVESAEGKEKDSN-NDETESAQANKANETEICRSIDDLSFSSE 372 >ref|XP_006350099.1| PREDICTED: uncharacterized protein LOC102595001 isoform X1 [Solanum tuberosum] Length = 372 Score = 337 bits (863), Expect = 5e-90 Identities = 164/233 (70%), Positives = 187/233 (80%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KKRC+WVTP TD SYA FHDEEWGVPVHDDKKLFELLVLCGAL+ELTWPSIL KRH Sbjct: 141 GSQSKKRCAWVTPNTDPSYANFHDEEWGVPVHDDKKLFELLVLCGALAELTWPSILCKRH 200 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVFADFDP +VAKLNEKK LAPG T ENARQ+ K+++EFGSFDK Sbjct: 201 IFREVFADFDPIVVAKLNEKKTLAPGGTACSLLSELKLRGIIENARQMLKVIDEFGSFDK 260 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVNHKP+VS FRYPRQVPVKT KAD+ISKDL+RRGFR VGPT+VYSFMQ AGITND Sbjct: 261 YIWSFVNHKPIVSGFRYPRQVPVKTAKADLISKDLIRRGFRGVGPTVVYSFMQVAGITND 320 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RF +C ++E KE ++ D + + ++ E+EICRS D+LS SSE Sbjct: 321 HLISCFRFPDCVESAEGKEKDSN-NDETEATQANKANETEICRSIDDLSFSSE 372 >ref|XP_003549544.1| PREDICTED: uncharacterized protein LOC100785912 [Glycine max] Length = 373 Score = 336 bits (862), Expect = 7e-90 Identities = 162/233 (69%), Positives = 189/233 (81%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KKRC+W+TP T+ YATFHDEEWGVPVHDDKKLFELLVL ALSEL+WP+ILSKRH Sbjct: 142 GSQSKKRCAWITPNTEPCYATFHDEEWGVPVHDDKKLFELLVLSSALSELSWPAILSKRH 201 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVF DFDP V+K NEKKI+APGST ENARQISK++EEFGSFDK Sbjct: 202 IFREVFVDFDPVAVSKFNEKKIMAPGSTASSLLSDLKLRAIIENARQISKVIEEFGSFDK 261 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVNHKP++S FRYPRQVPVKTPKADVISKDL+RRGFR VGPT++YSFMQ G+TND Sbjct: 262 YIWSFVNHKPIISRFRYPRQVPVKTPKADVISKDLVRRGFRGVGPTVIYSFMQVVGLTND 321 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RFQ+C A+E KE+ A I D++ +K D V+ES++ + D LS+SSE Sbjct: 322 HLISCFRFQDCMAAAEGKEENA-IKDDAQQKERDHVMESDLSIAIDNLSLSSE 373 >ref|XP_004507736.1| PREDICTED: probable GMP synthase [glutamine-hydrolyzing]-like isoform X1 [Cicer arietinum] Length = 381 Score = 333 bits (854), Expect = 6e-89 Identities = 161/233 (69%), Positives = 187/233 (80%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G +KRC+W+TP T+ YATFHDEEWGVPVHDDKKLFELLVL ALSELTWP+ILSKRH Sbjct: 149 GAQSQKRCAWITPNTEPCYATFHDEEWGVPVHDDKKLFELLVLSSALSELTWPAILSKRH 208 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFRE+FADFDP V+KLNEKK++APG+T ENARQISK++EE GSFD Sbjct: 209 IFREMFADFDPVAVSKLNEKKMMAPGTTGSSLLSDLKLRAIIENARQISKVIEESGSFDN 268 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVNHKP+VS FRYPRQVPVKTPKADVISKDL+RRGFR VGPT++YSFMQ AG+TND Sbjct: 269 YIWSFVNHKPIVSKFRYPRQVPVKTPKADVISKDLVRRGFRGVGPTVIYSFMQVAGLTND 328 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RFQEC A+E KED A + +K D V+ES++ + D LS+SSE Sbjct: 329 HLISCFRFQECVAAAEGKEDKAIKDVDHQQKACDSVMESDLSIAIDNLSLSSE 381 >gb|ESW26879.1| hypothetical protein PHAVU_003G156200g [Phaseolus vulgaris] Length = 364 Score = 332 bits (850), Expect = 2e-88 Identities = 158/233 (67%), Positives = 188/233 (80%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KKRC+W+TP T+ YATFHDEEWGVPVHDDKKLFELLVL ALSELTWP+ILS+RH Sbjct: 133 GSQSKKRCAWITPNTEPCYATFHDEEWGVPVHDDKKLFELLVLSSALSELTWPAILSQRH 192 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVFADFDP V+K NEKKI+APG+ ENARQISK++EEFGSFDK Sbjct: 193 IFREVFADFDPVAVSKFNEKKIMAPGTAASSLLSDLKLRAIIENARQISKVIEEFGSFDK 252 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVNHKP++S FRYPRQVPVKTPKADVISKDL++RGFR VGPT++YSFMQ G+TND Sbjct: 253 YIWSFVNHKPIISRFRYPRQVPVKTPKADVISKDLVKRGFRGVGPTVIYSFMQVVGLTND 312 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RFQ+C +E KE+ A+ D++ +K D V+ES++ + D LS+SSE Sbjct: 313 HLISCFRFQDCMAGAEGKEENAT-KDDAQQKECDHVMESDLSIAIDNLSLSSE 364 >gb|EXB96612.1| Putative Glutamine amidotransferase [Morus notabilis] Length = 383 Score = 331 bits (849), Expect = 2e-88 Identities = 159/229 (69%), Positives = 184/229 (80%) Frame = -3 Query: 956 KKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRHIFRE 777 KKRC+WVTP T+ Y FHDEEWGVPVHDD+KLFELLVL GAL+ELTWP+ILSKRHIFRE Sbjct: 155 KKRCAWVTPNTEPCYVAFHDEEWGVPVHDDRKLFELLVLSGALAELTWPAILSKRHIFRE 214 Query: 776 VFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDKYIWS 597 VFADFDPA V+KLNEKKI+APGST EN RQISK+++EFGSFD YIWS Sbjct: 215 VFADFDPAAVSKLNEKKIMAPGSTASSLLSELKLRAIIENGRQISKVIDEFGSFDNYIWS 274 Query: 596 FVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITNDHLIS 417 FVN+KP+VS FRYPRQVPVKTPKADVISKDL+RRGFR VGPT+VYSFMQ AGITNDHLIS Sbjct: 275 FVNNKPIVSKFRYPRQVPVKTPKADVISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLIS 334 Query: 416 CYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 C+RFQEC NA+E K++ + + + + ESE+C +EL+ SSE Sbjct: 335 CFRFQECLNAAEGKDENGIKNEAGEKNKNNNGAESELCIGIEELNFSSE 383 >gb|EMJ06013.1| hypothetical protein PRUPE_ppa026720mg [Prunus persica] Length = 378 Score = 330 bits (846), Expect = 5e-88 Identities = 159/233 (68%), Positives = 189/233 (81%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KKRC+WVTP TD YA FHDEEWG+PVHDDKKLFELLVL GAL+EL+WP+ILSK+H Sbjct: 147 GSQSKKRCAWVTPNTDPCYAAFHDEEWGLPVHDDKKLFELLVLSGALAELSWPAILSKKH 206 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVFADFDP ++KLNEKK++APGS ENARQ++K++EEFGSFDK Sbjct: 207 IFREVFADFDPVAISKLNEKKLIAPGSNASSLLSELKLRAIIENARQMTKVIEEFGSFDK 266 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVN+KP+VS FRYPRQVP KTPKADVISKDL+RRGFR VGPT++YSFMQ AGITND Sbjct: 267 YIWSFVNNKPIVSRFRYPRQVPAKTPKADVISKDLMRRGFRSVGPTVIYSFMQVAGITND 326 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HL+SC+RFQEC NA+E KE+ I D + EK+ + IES++ + DELS SS+ Sbjct: 327 HLVSCFRFQECLNAAEGKEE-YGIKDEA-EKKTENGIESDLSVAMDELSFSSD 377 >ref|XP_002324538.1| methyladenine glycosylase family protein [Populus trichocarpa] gi|222865972|gb|EEF03103.1| methyladenine glycosylase family protein [Populus trichocarpa] Length = 380 Score = 330 bits (845), Expect = 6e-88 Identities = 161/229 (70%), Positives = 186/229 (81%) Frame = -3 Query: 956 KKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRHIFRE 777 KK C+WVTP TD YATFHDEEWGVP+HDD+KLFELLVL GAL+ELTWP+ILSKRHIFRE Sbjct: 155 KKSCAWVTPNTDPCYATFHDEEWGVPIHDDRKLFELLVLSGALAELTWPAILSKRHIFRE 214 Query: 776 VFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDKYIWS 597 VFADFDP V+K NEKKILAPGST ENARQISK+++EFGSFDKYIWS Sbjct: 215 VFADFDPIAVSKFNEKKILAPGSTATSLLSELKLRAIVENARQISKVIDEFGSFDKYIWS 274 Query: 596 FVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITNDHLIS 417 FVN+KP+VS FRYPRQVPVKTPKAD ISKDL+RRGFR VGPT++YSFMQ AGITNDHLIS Sbjct: 275 FVNYKPIVSRFRYPRQVPVKTPKADAISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLIS 334 Query: 416 CYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 C+RFQEC +A+E K + S + + ++V+ES+I + DELS SSE Sbjct: 335 CFRFQECLDAAEGKVENGI---KSEDIKTNDVMESKISIAIDELSFSSE 380 >ref|XP_003610321.1| Methyladenine glycosylase protein-like protein [Medicago truncatula] gi|355511376|gb|AES92518.1| Methyladenine glycosylase protein-like protein [Medicago truncatula] Length = 375 Score = 330 bits (845), Expect = 6e-88 Identities = 156/233 (66%), Positives = 186/233 (79%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G PKKRC+W+TP T+ YATFHDEEWGVPVHDDKKLFE+LVL ALSELTWP+ILSKRH Sbjct: 143 GAQPKKRCAWITPNTEPYYATFHDEEWGVPVHDDKKLFEVLVLSSALSELTWPAILSKRH 202 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVFADFDP V+KLNEKK++ PG+T ENARQISK++ EFGSFD Sbjct: 203 IFREVFADFDPVAVSKLNEKKVITPGTTASSLLSDQKLRGIIENARQISKVIVEFGSFDN 262 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVNHKP++S FRYPRQVPVKTPKA+VISKDL+RRGFR VGPT++YSFMQ G+TND Sbjct: 263 YIWSFVNHKPILSKFRYPRQVPVKTPKAEVISKDLVRRGFRGVGPTVIYSFMQVVGLTND 322 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RFQEC A+E KE+ + +++ D V+ES++ + D LS+SSE Sbjct: 323 HLISCFRFQECVAAAEGKEENSIKNEDAQPNACDSVMESDLSIAIDNLSLSSE 375 >gb|EOY29555.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782300|gb|EOY29556.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782301|gb|EOY29557.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782302|gb|EOY29558.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 379 Score = 329 bits (843), Expect = 1e-87 Identities = 160/229 (69%), Positives = 183/229 (79%) Frame = -3 Query: 956 KKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRHIFRE 777 KKRC+WVTP TD SY FHDEEWGVPVHDD+KLFELLVL GALSELTWP+ILSKRHI RE Sbjct: 152 KKRCAWVTPNTDPSYVAFHDEEWGVPVHDDRKLFELLVLSGALSELTWPAILSKRHIVRE 211 Query: 776 VFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDKYIWS 597 VF DFD V+KLNEKK++ PGS ENARQISK+++EFGSFD+YIWS Sbjct: 212 VFVDFDAVAVSKLNEKKLVTPGSIASSLLSELKLRAIIENARQISKVIDEFGSFDEYIWS 271 Query: 596 FVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITNDHLIS 417 FVNHKP+VS FRYPRQVPVKTPKADVISKDL+RRGFR VGPT++YSFMQ AGITNDHL S Sbjct: 272 FVNHKPIVSRFRYPRQVPVKTPKADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTS 331 Query: 416 CYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 C+RFQEC A+E KE+ I D EK+ + V+ES++ + DELS SSE Sbjct: 332 CFRFQECITAAEGKEE-NGIKDMPEEKKTENVMESKLSIAIDELSFSSE 379 >ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] gi|223531126|gb|EEF32974.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] Length = 380 Score = 329 bits (843), Expect = 1e-87 Identities = 161/233 (69%), Positives = 184/233 (78%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KK C+WVTP D Y FHDEEWG+PVHDDKKLFELLVL GAL+ELTWP+ILSKRH Sbjct: 151 GSQAKKSCAWVTPNADPCYTAFHDEEWGIPVHDDKKLFELLVLSGALAELTWPAILSKRH 210 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVFA+FDP +V+K NEKKI+APGST ENARQISK+ +E GSFDK Sbjct: 211 IFREVFANFDPVVVSKFNEKKIIAPGSTASSLLSEIKLRAIIENARQISKVTDELGSFDK 270 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVN+KP+VS FRYPRQVPVKTPKADVISKDL+RRGFR VGPT+VYSFMQ AG+TND Sbjct: 271 YIWSFVNYKPIVSRFRYPRQVPVKTPKADVISKDLVRRGFRSVGPTVVYSFMQVAGLTND 330 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RFQEC NA+E KE+ + +K D V+ES+I + DELS SSE Sbjct: 331 HLISCFRFQECINAAEGKEENGVKVE---DKITDGVVESQISIAMDELSFSSE 380 >ref|XP_002309346.1| methyladenine glycosylase family protein [Populus trichocarpa] gi|222855322|gb|EEE92869.1| methyladenine glycosylase family protein [Populus trichocarpa] Length = 381 Score = 328 bits (840), Expect = 2e-87 Identities = 158/233 (67%), Positives = 185/233 (79%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KK C+WVTP TD Y FHDEEWG+PVHDD+KLFELLVL GAL+ELTWP+ILSKRH Sbjct: 152 GSQSKKSCAWVTPNTDPCYTAFHDEEWGLPVHDDRKLFELLVLSGALAELTWPAILSKRH 211 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 +FREVFADFDP V+K NEKKI+APGST ENARQISK+++EFGSFDK Sbjct: 212 MFREVFADFDPIAVSKFNEKKIIAPGSTAASLLSELKLRAIIENARQISKVIDEFGSFDK 271 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVN+KP+VS FRYPRQVP KTPKAD ISKDL+RRGFR VGPT++YSFMQ AG+TND Sbjct: 272 YIWSFVNYKPIVSRFRYPRQVPAKTPKADAISKDLVRRGFRSVGPTVIYSFMQVAGVTND 331 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RFQEC +A+E KE+ S + + D+++ES+I S DELS SSE Sbjct: 332 HLISCFRFQECIDAAEGKEENGI---KSEDVKTDDIMESKISISIDELSFSSE 381 >ref|XP_006453620.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] gi|567923232|ref|XP_006453622.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] gi|557556846|gb|ESR66860.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] gi|557556848|gb|ESR66862.1| hypothetical protein CICLE_v10008612mg [Citrus clementina] Length = 385 Score = 326 bits (835), Expect = 9e-87 Identities = 162/233 (69%), Positives = 183/233 (78%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KKRC+WVTP TD YA FHDEEWGVPVHDDKKLFELLVL GALSELTWP+ILSKRH Sbjct: 155 GSQTKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLVLSGALSELTWPAILSKRH 214 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVF FDP V+KLNEKK+LA GS ENARQISK+++EFGSF+ Sbjct: 215 IFREVFVGFDPIAVSKLNEKKLLAAGSAASSLLSELKLRAIIENARQISKVIDEFGSFNN 274 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFV+HKP+VS FRYPRQVPVKTPKADVISKDL+RRGFR VGPTI+YSFMQ AG+TND Sbjct: 275 YIWSFVSHKPIVSRFRYPRQVPVKTPKADVISKDLVRRGFRSVGPTIIYSFMQVAGVTND 334 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HL SC+RFQEC NA+E KE+ I DN K+ D I S++ + D LS+SSE Sbjct: 335 HLTSCFRFQECINAAEVKEE-NGIPDNDENKKTDGTI-SQLSMAIDALSLSSE 385 >ref|XP_003519147.1| PREDICTED: uncharacterized protein LOC100783263 [Glycine max] Length = 371 Score = 325 bits (834), Expect = 1e-86 Identities = 158/233 (67%), Positives = 186/233 (79%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KKRC+W+TP T+ YATFHD+EWGVPVHDDKKLFELLVL ALSELTWP+ILSKRH Sbjct: 141 GSQSKKRCAWITPNTEPCYATFHDKEWGVPVHDDKKLFELLVLSSALSELTWPAILSKRH 200 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 I EVFADFDP ++K NEKKI+APGST ENARQISK++EEFGSFDK Sbjct: 201 ILGEVFADFDPVAISKFNEKKIMAPGSTASSLLSDLKLRAIIENARQISKVIEEFGSFDK 260 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVNHKP++S FRYPRQVPVKTPKADVISKDL+RRGFR VGPT++YSFMQ G+TND Sbjct: 261 YIWSFVNHKPIISRFRYPRQVPVKTPKADVISKDLVRRGFRGVGPTVIYSFMQVVGLTND 320 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RFQ+C +E KE+ A + D++ +K D V ES++ + D LS+SSE Sbjct: 321 HLISCFRFQDCMAVAEGKEENA-VKDDAQQKEGDHV-ESDLSIAIDNLSLSSE 371 >ref|XP_006473998.1| PREDICTED: uncharacterized protein LOC102607933 [Citrus sinensis] Length = 385 Score = 325 bits (833), Expect = 2e-86 Identities = 161/233 (69%), Positives = 183/233 (78%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KKRC+WVTP TD YA FHDEEWGVPVHDDKKLFELLVL GALSELTWP+I+SKRH Sbjct: 155 GSQTKKRCAWVTPNTDPCYAAFHDEEWGVPVHDDKKLFELLVLSGALSELTWPAIMSKRH 214 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVF FDP V+KLNEKK+LA GS ENARQISK+++EFGSF+ Sbjct: 215 IFREVFVGFDPIAVSKLNEKKLLAAGSAASSLLSELKLRAIIENARQISKVIDEFGSFNN 274 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFV+HKP+VS FRYPRQVPVKTPKADVISKDL+RRGFR VGPTI+YSFMQ AG+TND Sbjct: 275 YIWSFVSHKPIVSRFRYPRQVPVKTPKADVISKDLVRRGFRSVGPTIIYSFMQVAGVTND 334 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HL SC+RFQEC NA+E KE+ I DN K+ D I S++ + D LS+SSE Sbjct: 335 HLTSCFRFQECINAAEVKEE-NGIPDNDENKKTDGTI-SQLSMAIDALSLSSE 385 >ref|XP_004287130.1| PREDICTED: uncharacterized protein LOC101313540 [Fragaria vesca subsp. vesca] Length = 429 Score = 321 bits (822), Expect = 3e-85 Identities = 155/232 (66%), Positives = 183/232 (78%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KKRC+WVTP TD Y FHDEEWG+PVHDDKKLFELLVL GAL+EL+WP ILSKRH Sbjct: 147 GSQSKKRCAWVTPNTDPCYVAFHDEEWGLPVHDDKKLFELLVLSGALAELSWPLILSKRH 206 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVFADFDP V++ NEKKI+APGS ENARQ++K+++EFGSFDK Sbjct: 207 IFREVFADFDPVDVSEFNEKKIMAPGSVASSLLSESKLRAILENARQMTKVIDEFGSFDK 266 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVN+KP+VS FRYPRQVP KTPKADVISKDL+RRGFR VGPT++YSFMQ AGITND Sbjct: 267 YIWSFVNNKPIVSRFRYPRQVPAKTPKADVISKDLVRRGFRSVGPTVIYSFMQVAGITND 326 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISS 273 HL+SC+RFQ+C NA+E KE+ + T K+ + IES++ + DELS SS Sbjct: 327 HLVSCFRFQDCLNAAEGKEE--NRTKEESGKKTENGIESDLSVALDELSFSS 376 >ref|XP_003527169.1| PREDICTED: uncharacterized protein LOC100801026 isoform X1 [Glycine max] gi|571461733|ref|XP_006582090.1| PREDICTED: uncharacterized protein LOC100801026 isoform X2 [Glycine max] gi|571461735|ref|XP_006582091.1| PREDICTED: uncharacterized protein LOC100801026 isoform X3 [Glycine max] Length = 383 Score = 318 bits (816), Expect = 1e-84 Identities = 151/233 (64%), Positives = 184/233 (78%) Frame = -3 Query: 968 GLLPKKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRH 789 G KRC+WVTP T+ YATFHDEEWGVPVHDDKKLFELLVL L+E TWP+ILSKRH Sbjct: 151 GSQSNKRCAWVTPNTEPCYATFHDEEWGVPVHDDKKLFELLVLSSVLAEHTWPAILSKRH 210 Query: 788 IFREVFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDK 609 IFREVF DF+P V+KLNEKKI+ PG+ ENARQISK+++EFGSFDK Sbjct: 211 IFREVFVDFEPVAVSKLNEKKIMTPGTIASSLLSEVKLRAIIENARQISKVIDEFGSFDK 270 Query: 608 YIWSFVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITND 429 YIWSFVNHKP+VS FRYPRQVPVKTPKADVISKDL+RRGFR VGPT+VYSFMQ AG+T D Sbjct: 271 YIWSFVNHKPIVSRFRYPRQVPVKTPKADVISKDLVRRGFRGVGPTVVYSFMQVAGLTID 330 Query: 428 HLISCYRFQECSNASEEKEDAASITDNSPEKRVDEVIESEICRSSDELSISSE 270 HLISC+RF+EC A+E KE+ + +++ +K + ++ES++ + ++LS +SE Sbjct: 331 HLISCFRFEECIAAAEGKEENGIMDNHADQKESENIMESDLSIAMEDLSFASE 383 >gb|AFW58766.1| hypothetical protein ZEAMMB73_734031 [Zea mays] Length = 385 Score = 298 bits (764), Expect = 2e-78 Identities = 151/235 (64%), Positives = 177/235 (75%), Gaps = 8/235 (3%) Frame = -3 Query: 956 KKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRHIFRE 777 K+RC+WVT TD Y+ FHDEEWGVPVHDD+KLFELLVL GAL+ELTWP+IL+KR IFRE Sbjct: 151 KRRCAWVTANTDPCYSAFHDEEWGVPVHDDRKLFELLVLSGALAELTWPAILNKRDIFRE 210 Query: 776 VFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDKYIWS 597 VF DFDP V+KL+EKKI+APGS ENARQI KI+EEFGSFDKY WS Sbjct: 211 VFMDFDPVSVSKLSEKKIIAPGSPSSSLLSEQKLRGVIENARQILKIIEEFGSFDKYCWS 270 Query: 596 FVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITNDHLIS 417 FVNHKP++S FRY RQVPVKT KAD ISKDL+RRGFR VGPT+VY+FMQ +G+TNDHLIS Sbjct: 271 FVNHKPILSRFRYSRQVPVKTSKADAISKDLVRRGFRSVGPTVVYTFMQVSGMTNDHLIS 330 Query: 416 CYRFQEC--SNASEEKEDAASITDNS------PEKRVDEVIESEICRSSDELSIS 276 CYRF EC S+A K S+ D S E++V+ + E+ R+ DELSIS Sbjct: 331 CYRFAECVASSAGAAKLTDGSLADASDSNHATAEQKVNGTNDIELSRAIDELSIS 385 >ref|XP_004976128.1| PREDICTED: uncharacterized protein LOC101782624 [Setaria italica] Length = 389 Score = 298 bits (763), Expect = 2e-78 Identities = 152/242 (62%), Positives = 173/242 (71%), Gaps = 15/242 (6%) Frame = -3 Query: 956 KKRCSWVTPTTDQSYATFHDEEWGVPVHDDKKLFELLVLCGALSELTWPSILSKRHIFRE 777 K+RC+WVT TD YA FHDEEWGVPVHDDKKLFELLVL GAL+ELTWP+IL+KR IFRE Sbjct: 151 KRRCAWVTANTDPCYAAFHDEEWGVPVHDDKKLFELLVLSGALAELTWPAILNKRAIFRE 210 Query: 776 VFADFDPAIVAKLNEKKILAPGSTXXXXXXXXXXXXXXENARQISKIVEEFGSFDKYIWS 597 VF DFDP +V+KL+EKKI+APGS ENARQI KIVEEFGSFDKY WS Sbjct: 211 VFMDFDPVLVSKLSEKKIIAPGSPSSSLLSEQKLRGVIENARQILKIVEEFGSFDKYCWS 270 Query: 596 FVNHKPLVSTFRYPRQVPVKTPKADVISKDLLRRGFRCVGPTIVYSFMQAAGITNDHLIS 417 FVNHKP++S FRYPRQVPVKT KAD ISKDL+RRGFR VGPT+VY+FMQ +G+TNDHLIS Sbjct: 271 FVNHKPILSRFRYPRQVPVKTSKADAISKDLVRRGFRSVGPTVVYTFMQVSGMTNDHLIS 330 Query: 416 CYRFQECSNASEEKEDAASITDNSPEKRVDE---------------VIESEICRSSDELS 282 CYRF EC + A +TD S D + E+ R+ DELS Sbjct: 331 CYRFAEC---AASPASPAKLTDGSEANSSDSNHAPTEQKMNGTNGLAADIELSRTIDELS 387 Query: 281 IS 276 IS Sbjct: 388 IS 389