BLASTX nr result
ID: Catharanthus23_contig00003019
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00003019
         (2239 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Popu...   501   e-139
ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Popu...   501   e-139
ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256...   500   e-138
emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera]   495   e-137
ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246...   495   e-137
ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594...   490   e-136
gb|EXB83232.1| Putative Glutamine amidotransferase [Morus notabi...   478   e-132
gb|EMJ13510.1| hypothetical protein PRUPE_ppa006731mg [Prunus pe...   467   e-128
ref|XP_002530111.1| DNA-3-methyladenine glycosylase, putative [R...   463   e-127
ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614...   462   e-127
gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus pe...   459   e-126
gb|EOX96813.1| DNA glycosylase superfamily protein, putative [Th...   456   e-125
gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Th...   455   e-125
ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791...   454   e-124
gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus...   452   e-124
ref|XP_002299382.2| hypothetical protein POPTR_0001s12320g [Popu...   452   e-124
ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298...   452   e-124
ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811...
452 e-124 ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citr... 450 e-123 gb|EOY14287.1| DNA glycosylase superfamily protein isoform 2 [Th... 446 e-122 >ref|XP_002303719.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] gi|550343248|gb|EEE78698.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] Length = 420 Score = 501 bits (1291), Expect = e-139 Identities = 263/413 (63%), Positives = 318/413 (76%), Gaps = 9/413 (2%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQVP 384 MCSSKS+ ST+ ++ INGRPVLQP N+ P LER NSLKK+S KS P Sbjct: 1 MCSSKSRLNQSTSNIATTIAKINGRPVLQPKSNQVPSLERHNSLKKNSPPKSPTREPAGP 60 Query: 385 ISSTSPVSTNIGKVK---PAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPK 555 + N K P+ +PP SPKLKSPR PA+KRGN+P GLN+S EKV+ TP+ Sbjct: 61 PVPLMQPACNAAGTKTRLPSALSPPISPKLKSPRPPAVKRGNEPGGLNTSAEKVL--TPR 118 Query: 556 CNGNKIVADPVKKSKNSNN-GV--SLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQ 726 K+ VKKSK S+ GV S+D +K SSSL+VEAPGSIAAARREQVA+MQ Q Sbjct: 119 ST-TKVTTSTVKKSKKSSTAGVPHSVDTFAMK-YSSSLLVEAPGSIAAARREQVAVMQEQ 176 Query: 727 RKMRIAHYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGV 906 RKMRIAHYGRTKSAKY+ KIVP +S AT+ I+ +EE+RC FI+ NSDP+Y+AYHDEEWGV Sbjct: 177 RKMRIAHYGRTKSAKYQGKIVPANSPATSTIT-REEKRCSFITPNSDPVYVAYHDEEWGV 235 Query: 907 PVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDY 1086 PVH+DK+LFELL LTGAQVGS+WT+VLKKR+ EIV+K++EKK+ +I +Y Sbjct: 236 PVHDDKLLFELLALTGAQVGSEWTSVLKKREAFREAFSGFDAEIVAKFTEKKIASISAEY 295 Query: 1087 GIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESI 1266 G+++SQVRGVVDN+NRILE+KREFGSFD+YLW +VNHKPI+TQYKSC KIPVKTSKSE+I Sbjct: 296 GLDISQVRGVVDNSNRILEVKREFGSFDEYLWGYVNHKPISTQYKSCQKIPVKTSKSETI 355 Query: 1267 SKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQ---TISP 1416 SKDMV+RGFR VGPTVIHSFM+A GL+NDHLI CPRHLQC+ALASQ T++P Sbjct: 356 SKDMVKRGFRFVGPTVIHSFMQAGGLSNDHLITCPRHLQCIALASQLPRTVAP 408 >ref|XP_002303720.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] 
gi|550343247|gb|EEE78699.2| hypothetical protein POPTR_0003s15520g [Populus trichocarpa] Length = 417 Score = 501 bits (1291), Expect = e-139 Identities = 263/413 (63%), Positives = 318/413 (76%), Gaps = 9/413 (2%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQVP 384 MCSSKS+ ST+ ++ INGRPVLQP N+ P LER NSLKK+S KS P Sbjct: 1 MCSSKSRLNQSTSNIATTIAKINGRPVLQPKSNQVPSLERHNSLKKNSPPKSPTREPAGP 60 Query: 385 ISSTSPVSTNIGKVK---PAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPK 555 + N K P+ +PP SPKLKSPR PA+KRGN+P GLN+S EKV+ TP+ Sbjct: 61 PVPLMQPACNAAGTKTRLPSALSPPISPKLKSPRPPAVKRGNEPGGLNTSAEKVL--TPR 118 Query: 556 CNGNKIVADPVKKSKNSNN-GV--SLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQ 726 K+ VKKSK S+ GV S+D +K SSSL+VEAPGSIAAARREQVA+MQ Q Sbjct: 119 ST-TKVTTSTVKKSKKSSTAGVPHSVDTFAMK-YSSSLLVEAPGSIAAARREQVAVMQEQ 176 Query: 727 RKMRIAHYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGV 906 RKMRIAHYGRTKSAKY+ KIVP +S AT+ I+ +EE+RC FI+ NSDP+Y+AYHDEEWGV Sbjct: 177 RKMRIAHYGRTKSAKYQGKIVPANSPATSTIT-REEKRCSFITPNSDPVYVAYHDEEWGV 235 Query: 907 PVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDY 1086 PVH+DK+LFELL LTGAQVGS+WT+VLKKR+ EIV+K++EKK+ +I +Y Sbjct: 236 PVHDDKLLFELLALTGAQVGSEWTSVLKKREAFREAFSGFDAEIVAKFTEKKIASISAEY 295 Query: 1087 GIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESI 1266 G+++SQVRGVVDN+NRILE+KREFGSFD+YLW +VNHKPI+TQYKSC KIPVKTSKSE+I Sbjct: 296 GLDISQVRGVVDNSNRILEVKREFGSFDEYLWGYVNHKPISTQYKSCQKIPVKTSKSETI 355 Query: 1267 SKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQ---TISP 1416 SKDMV+RGFR VGPTVIHSFM+A GL+NDHLI CPRHLQC+ALASQ T++P Sbjct: 356 SKDMVKRGFRFVGPTVIHSFMQAGGLSNDHLITCPRHLQCIALASQLPRTVAP 408 >ref|XP_002263612.1| PREDICTED: uncharacterized protein LOC100256507 [Vitis vinifera] gi|297738175|emb|CBI27376.3| unnamed protein product [Vitis vinifera] Length = 398 Score = 500 bits (1287), Expect = e-138 Identities = 274/402 (68%), Positives = 310/402 (77%), Gaps = 3/402 (0%) Frame = +1 Query: 205 
MCSSKSK-PQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQV 381 MCSSKSK QG T + INGRP LQP CNR P LER +S KK S + Sbjct: 1 MCSSKSKLHQGIDITPSK--AQINGRPALQPTCNRIPSLERHHSFKKISPKSPTSPLPAS 58 Query: 382 PISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPKCN 561 P T+ ++T K KP++T PPASP LKSPRQPA+KRGNDPNGLNSS+EKV+ TP+ Sbjct: 59 PPPPTTIINTT--KTKPSLT-PPASPNLKSPRQPALKRGNDPNGLNSSLEKVL--TPR-- 111 Query: 562 GNKIVADPVKKSKNSNNGV--SLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKM 735 G + KK+K + G+ S D S L N SSSLIVEAPGSIAAARREQ+AIMQVQRKM Sbjct: 112 GTTKSSSSPKKTKKCSAGLAPSSDTSSL-NYSSSLIVEAPGSIAAARREQMAIMQVQRKM 170 Query: 736 RIAHYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGVPVH 915 RIAHYGRTKSAKYE KI P+D I+ +EE+RC FI+ NSDP Y+ YHDEEWGVPVH Sbjct: 171 RIAHYGRTKSAKYEEKIGPVDP---LVITTREEKRCSFITPNSDPSYVEYHDEEWGVPVH 227 Query: 916 EDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDYGIE 1095 +DK LFELLV+TGAQVGSDWTTVLKKRQ EIV K+SEKK+T+I YGI+ Sbjct: 228 DDKRLFELLVMTGAQVGSDWTTVLKKRQEYRDALSGYDAEIVGKFSEKKITSISAYYGID 287 Query: 1096 LSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESISKD 1275 LSQVRGVVDN+NRILEIKREFGSF KY+W FVNHKPI TQYKSC KIPVKTSKSESISKD Sbjct: 288 LSQVRGVVDNSNRILEIKREFGSFHKYIWGFVNHKPITTQYKSCHKIPVKTSKSESISKD 347 Query: 1276 MVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALAS 1401 MVRRGFRLVGPTVI+SFM+AAGLTNDHLI+CPRHLQC+AL+S Sbjct: 348 MVRRGFRLVGPTVIYSFMQAAGLTNDHLISCPRHLQCIALSS 389 >emb|CAN68394.1| hypothetical protein VITISV_042519 [Vitis vinifera] Length = 398 Score = 495 bits (1275), Expect = e-137 Identities = 273/403 (67%), Positives = 308/403 (76%), Gaps = 4/403 (0%) Frame = +1 Query: 205 MCSSKSK-PQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQV 381 MCSSKSK QG T + INGRP LQP CNR P LER +S KK S + + Sbjct: 1 MCSSKSKLHQGIDITPSK--AQINGRPALQPTCNRIPSLERHHSFKKISPKSPTSP---L 55 Query: 382 PISSTSPVST-NIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPKC 558 P S P + N K KP++T PPASP LKSPRQPA+KRGNDPNGLNSS+EKV+ TP+ Sbjct: 56 PASLPPPTTIINTTKTKPSLT-PPASPNLKSPRQPALKRGNDPNGLNSSLEKVL--TPR- 111 Query: 559 
NGNKIVADPVKKSKNSNNGV--SLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRK 732 G + KK+K + G+ S D S L N SSS IVEAPGSIAAARREQ+AIMQVQRK Sbjct: 112 -GTTKSSSSPKKTKKCSAGLAPSSDTSSL-NYSSSFIVEAPGSIAAARREQMAIMQVQRK 169 Query: 733 MRIAHYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGVPV 912 MRIAHYGRTKSAKYE KI P+D I+ +EE+RC FI+ NSDP Y+ YHDEEWGVPV Sbjct: 170 MRIAHYGRTKSAKYEEKISPVDP---LVITTREEKRCSFITPNSDPSYVEYHDEEWGVPV 226 Query: 913 HEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDYGI 1092 H+DK LFELLV+TGAQVGSDWTTVLKKRQ EIV K+SEKK+T+I YGI Sbjct: 227 HDDKRLFELLVMTGAQVGSDWTTVLKKRQEYRDAFSGYDAEIVGKFSEKKITSISAYYGI 286 Query: 1093 ELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESISK 1272 +LSQVRGVVDN+NRILEIKREFGSF KY+W FVNHKPI TQ KSC KIPVKTSKSESISK Sbjct: 287 DLSQVRGVVDNSNRILEIKREFGSFHKYIWGFVNHKPITTQXKSCHKIPVKTSKSESISK 346 Query: 1273 DMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALAS 1401 DMVRRGFRLVGPTVI+SFM+AAGLTNDHLI+CPRHLQC+AL+S Sbjct: 347 DMVRRGFRLVGPTVIYSFMQAAGLTNDHLISCPRHLQCIALSS 389 >ref|XP_004232605.1| PREDICTED: uncharacterized protein LOC101246304 [Solanum lycopersicum] Length = 395 Score = 495 bits (1274), Expect = e-137 Identities = 268/412 (65%), Positives = 314/412 (76%), Gaps = 8/412 (1%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQVP 384 MC+SK+K Q S T +S INGRPVLQP+ N PL ERRNSLKK+ T+ P Sbjct: 1 MCNSKTKLQSSAQT----LSQINGRPVLQPHSNIVPLYERRNSLKKT-------THTAAP 49 Query: 385 ISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGN--DPNGLNSSVEKVILSTPKC 558 +++ + + TTPP SPK+KSPR PAIKRGN DPNGL+SS EK++ TPK Sbjct: 50 VTANGSTKVKMS----SSTTPPVSPKMKSPRLPAIKRGNNIDPNGLSSSAEKIV--TPKG 103 Query: 559 NGNKIVADPVKKSKNSNNGV----SLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQ 726 NK +KK K S+ G+ S++NS LK SSSLIVEAPGSIAAARREQVAI QVQ Sbjct: 104 TANKAPI-LLKKPKKSSGGLASPSSVENSSLK-YSSSLIVEAPGSIAAARREQVAIAQVQ 161 Query: 727 RKMRIAHYGRTKSAKYERKIVPLDSSATAAI--SVKEERRCHFISSNSDPIYIAYHDEEW 900 RKM+IAHYGRTKSAKYE K+ LD S +A+ + +E++RC FI+ NSDP+YIAYHDEEW Sbjct: 162 
RKMKIAHYGRTKSAKYEGKVSSLDPSFASAVIPNPREDKRCSFITPNSDPLYIAYHDEEW 221 Query: 901 GVPVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICN 1080 GVPVH+D +LFELLVLTGAQVGSDWT+VLKKRQ PEIVSKY+EKK+T+ Sbjct: 222 GVPVHDDNLLFELLVLTGAQVGSDWTSVLKKRQEFRDAFSGFDPEIVSKYNEKKITSTSV 281 Query: 1081 DYGIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSE 1260 +YGIELSQ+RG VDN+ RILEIK+ FGSFDKYLW FVN+KPIATQYK+C KIPVKTSKSE Sbjct: 282 EYGIELSQIRGAVDNSTRILEIKKTFGSFDKYLWGFVNNKPIATQYKACNKIPVKTSKSE 341 Query: 1261 SISKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQTISP 1416 +ISKDMV+RGFR VGPTVIHSFM+AAGLTNDHLI CPRHL CVALA+Q P Sbjct: 342 TISKDMVKRGFRYVGPTVIHSFMQAAGLTNDHLIACPRHLPCVALATQPAPP 393 >ref|XP_006364818.1| PREDICTED: uncharacterized protein LOC102594852 [Solanum tuberosum] Length = 395 Score = 490 bits (1262), Expect = e-136 Identities = 267/412 (64%), Positives = 316/412 (76%), Gaps = 8/412 (1%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQVP 384 MC+SK+K Q S T +S INGRPVLQP+ N PL ERRNSLKK++ T V Sbjct: 1 MCNSKTKLQSSPQT----LSQINGRPVLQPHSNIVPLYERRNSLKKTTN-----TAASVT 51 Query: 385 ISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGN--DPNGLNSSVEKVILSTPKC 558 + ++ V T+ + TTPP SPK+KSPR PAIKRGN DPNGL+SS EK++ TPK Sbjct: 52 ANGSTKVKTS------SSTTPPVSPKMKSPRLPAIKRGNNIDPNGLSSSAEKIV--TPKG 103 Query: 559 NGNKIVADPVKKSKNSNNGVS----LDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQ 726 NK +KK K S+ G++ ++NS LK SSSLIVEAPGSIAAARREQVAI QVQ Sbjct: 104 TANKAPI-LLKKPKKSSGGLASPPYVENSSLK-YSSSLIVEAPGSIAAARREQVAIAQVQ 161 Query: 727 RKMRIAHYGRTKSAKYERKIVPLDSSATAAI--SVKEERRCHFISSNSDPIYIAYHDEEW 900 RKM+IAHYGRTKSAKYE K+ LD S +A+ + +EE+RC FI+ NSDP+YIAYHDEEW Sbjct: 162 RKMKIAHYGRTKSAKYEGKVSSLDPSFASAVIPNPREEKRCSFITPNSDPLYIAYHDEEW 221 Query: 901 GVPVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICN 1080 GVPVH+D +LFELLVLTGAQVGSDWT+VL+KRQ PEIVSKY+EKK+T+ Sbjct: 222 GVPVHDDNLLFELLVLTGAQVGSDWTSVLRKRQEFRDAFSGFDPEIVSKYNEKKITSTSV 281 Query: 1081 DYGIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSE 1260 
+YGIELSQ+RG VDN+ RILEIK+ F SF+KYLW FVN+KPIATQYK+C KIPVKTSKSE Sbjct: 282 EYGIELSQIRGAVDNSTRILEIKKTFDSFNKYLWGFVNNKPIATQYKACNKIPVKTSKSE 341 Query: 1261 SISKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQTISP 1416 +ISKDMV+RGFR VGPTVIHSFM+AAGLTNDHLI CPRHLQC+ALA+Q P Sbjct: 342 TISKDMVKRGFRYVGPTVIHSFMQAAGLTNDHLIACPRHLQCMALATQPAPP 393 >gb|EXB83232.1| Putative Glutamine amidotransferase [Morus notabilis] Length = 394 Score = 478 bits (1230), Expect = e-132 Identities = 265/403 (65%), Positives = 300/403 (74%), Gaps = 3/403 (0%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQVP 384 MCSSK K T T INGRPVLQP CNR LERR SLKK++ + L +P Sbjct: 1 MCSSKPKTLLGTNTITSAEPKINGRPVLQPTCNRVSSLERRMSLKKTTPKSPTSPPLALP 60 Query: 385 ISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPN-GLNSSVEKVILSTPKCN 561 I + + K KP+ +PP SPKL SPR PAIKRG DPN LNSS EKV+ TP+C Sbjct: 61 IQNGAC------KTKPSTLSPPVSPKLPSPRPPAIKRGKDPNYELNSSAEKVL--TPRCI 112 Query: 562 GNKIVADPVKKSKNSNNGVSLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRI 741 +KKSK G + LKN SSSLIVEAPGSIAAARREQVAIMQ QRK+RI Sbjct: 113 IKS--TSSIKKSKKCG-GAGVVAETLKN-SSSLIVEAPGSIAAARREQVAIMQEQRKIRI 168 Query: 742 AHYGRTKSAKYERKIVP--LDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGVPVH 915 AHYGRTKSAK+E K+V LDSS KE++RC +I+ NSDPIY+AYHDEEWGVPVH Sbjct: 169 AHYGRTKSAKFEGKVVAPMLDSSVG-----KEQKRCSYITPNSDPIYVAYHDEEWGVPVH 223 Query: 916 EDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDYGIE 1095 +DK+LFELLVLTGAQVGSDWT+VLKKR+ E VSKY+EKK+T+I DYGIE Sbjct: 224 DDKLLFELLVLTGAQVGSDWTSVLKKREIFRNAFSGFDAEAVSKYNEKKITSIGADYGIE 283 Query: 1096 LSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESISKD 1275 LS +RG VDNANRILEIK+EFGS +KYLW FVN+K I+TQYKSC KIPVKTSKSESISKD Sbjct: 284 LSLIRGAVDNANRILEIKKEFGSLNKYLWGFVNNKLISTQYKSCQKIPVKTSKSESISKD 343 Query: 1276 MVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQ 1404 MVRRGFR VGPTVI+SFM+AAGLTNDHLI CPRHLQC+ALASQ Sbjct: 344 MVRRGFRFVGPTVIYSFMQAAGLTNDHLITCPRHLQCLALASQ 386 >gb|EMJ13510.1| hypothetical protein PRUPE_ppa006731mg [Prunus persica] Length = 397 
Score = 467 bits (1201), Expect = e-128 Identities = 257/399 (64%), Positives = 294/399 (73%), Gaps = 3/399 (0%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQVP 384 MCSSK K Q +T+ +N RPVLQP N+ P LE+R SLKKSS T L P Sbjct: 1 MCSSKPKLQRTTSVP-PSTPKMNRRPVLQPTGNQFPSLEQRKSLKKSSQEPLAPTPLPSP 59 Query: 385 ISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPKCNG 564 + S K K +++ PP SPKL SPR PA KRG DPN LNSS EKV+ TP+C Sbjct: 60 LPSA--------KTKASLS-PPISPKLPSPRPPAFKRGKDPNELNSSAEKVV--TPRCTT 108 Query: 565 NKIVADPVKKSKNSNNGVSLDNSP---LKNISSSLIVEAPGSIAAARREQVAIMQVQRKM 735 VKKSK S+ V+ S LKNISS LIVEAPGSIAAARREQVA MQ QRKM Sbjct: 109 K--FTSSVKKSKKSSGSVAAAPSAESILKNISS-LIVEAPGSIAAARREQVATMQEQRKM 165 Query: 736 RIAHYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGVPVH 915 RIAHYGRTKSAK E K+VPLD+S T +++RRC FI+ NSDPIY+AYHDEEWGVPVH Sbjct: 166 RIAHYGRTKSAKNEGKVVPLDASPTTDFG-RDQRRCTFITPNSDPIYVAYHDEEWGVPVH 224 Query: 916 EDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDYGIE 1095 +D +L ELLVLTGAQVGSDWT+VL+KRQ + V+K+SE+K+T++ +D GI+ Sbjct: 225 DDNLLLELLVLTGAQVGSDWTSVLRKRQALRESFSGFDADGVAKFSERKITSVSSDSGID 284 Query: 1096 LSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESISKD 1275 +S VRG VDNA RIL+IKRE GSFDKYLW FVNHKPI+TQYKSC KIPVK SKSESISKD Sbjct: 285 ISLVRGAVDNAKRILQIKREVGSFDKYLWGFVNHKPISTQYKSCHKIPVKNSKSESISKD 344 Query: 1276 MVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVA 1392 MVRRGFRLVGPTVIHSFM+AAGLTNDHLI CPRHLQC A Sbjct: 345 MVRRGFRLVGPTVIHSFMQAAGLTNDHLITCPRHLQCAA 383 >ref|XP_002530111.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] gi|223530365|gb|EEF32255.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] Length = 403 Score = 463 bits (1192), Expect = e-127 Identities = 251/411 (61%), Positives = 303/411 (73%), Gaps = 12/411 (2%) Frame = +1 Query: 205 MCSSKSK--PQGSTAT-----DIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSF 363 MCSSKSK G+ A ++ INGRPVLQP ++ P LERRNSLKK+S Sbjct: 1 MCSSKSKLHHHGAAAAANHHIPASTIAKINGRPVLQPKSDQVPTLERRNSLKKNSPKSPI 
60 Query: 364 ATNLQVPISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVIL 543 P+ P +T I +P+ +PP SPKLKSPR PA+KRGND N LNSS EK + Sbjct: 61 IQPPAAPLPLL-PTTTTIKPKQPSSLSPPISPKLKSPRPPALKRGNDLNTLNSSAEKFL- 118 Query: 544 STPKCNGNKIVADPVKKSKNSNNG--VSLDNSPLKNISSSLIVEAPGSIAAARREQVAIM 717 TP+ K V+ +KKSK S+ V + + N SSSLIVEAPGSIAAARRE VA M Sbjct: 119 -TPR----KAVSTTLKKSKKSSPATPVVAETCTVLNYSSSLIVEAPGSIAAARREHVATM 173 Query: 718 QVQRKMRIAHYGRTKS---AKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYH 888 Q QRK+R AHYGR S +K + KIVP+DS A A+ +EERRC FI+ +SDPIY+AYH Sbjct: 174 QEQRKLRTAHYGRVNSGSKSKRDAKIVPVDSPAATAVP-QEERRCSFITPSSDPIYVAYH 232 Query: 889 DEEWGVPVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMT 1068 D+EWGVPVH+DK+LFELLVLTGAQ+GSDWT+VLKKR+ EIV+K+SEKK T Sbjct: 233 DQEWGVPVHDDKMLFELLVLTGAQIGSDWTSVLKKREAFREAFSGFDAEIVAKFSEKKTT 292 Query: 1069 TICNDYGIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKT 1248 +I +YG+E+SQVRGVVDN+NRIL++K+EFGSFDKYLW FVNHKPI TQY+S KIPVKT Sbjct: 293 SISAEYGMEISQVRGVVDNSNRILQVKKEFGSFDKYLWGFVNHKPITTQYRSSNKIPVKT 352 Query: 1249 SKSESISKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALAS 1401 SKSE+ISKDMV+RGFR VGPTV+HSFM+AAGL+NDHLI+C RH QC+ALAS Sbjct: 353 SKSETISKDMVKRGFRYVGPTVMHSFMQAAGLSNDHLISCSRHHQCLALAS 403 >ref|XP_006468594.1| PREDICTED: uncharacterized protein LOC102614205 [Citrus sinensis] Length = 375 Score = 462 bits (1190), Expect = e-127 Identities = 249/399 (62%), Positives = 296/399 (74%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQVP 384 MCSSKSK +T INGRPVLQP N+ P LE+RNS+KK+ + KS P Sbjct: 1 MCSSKSKLHSAT--------QINGRPVLQPTSNQVPSLEKRNSIKKTGSPKS-------P 45 Query: 385 ISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPKCNG 564 I++ + S + K ++ +PP SPKLKSPR A+KRGNDPN LN+S EK++ TPK Sbjct: 46 ITTDNVNSKSFTK---SLLSPPVSPKLKSPRPAAVKRGNDPNVLNTSAEKIM--TPKK-- 98 Query: 565 NKIVADPVKKSKNSNNGVSLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRIA 744 +A VKK KN D SSLIVEAPGSIAAARRE VAIMQ QRK+RIA Sbjct: 99 
---LASLVKKPKNVGVAPCYD--------SSLIVEAPGSIAAARREHVAIMQEQRKLRIA 147 Query: 745 HYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGVPVHEDK 924 HYGRTKSAK+E K+ LDS A + +EE+RC FI+ NSDPIY+AYHDEEWGVPVH+DK Sbjct: 148 HYGRTKSAKFEGKVPGLDSFANGDNNDREEKRCSFITPNSDPIYVAYHDEEWGVPVHDDK 207 Query: 925 VLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDYGIELSQ 1104 +LFELLVLT AQVGSDWT+VLKKRQ E+V+K++EKKMT++ +Y I+LSQ Sbjct: 208 LLFELLVLTAAQVGSDWTSVLKKRQAFREAFSGFDAEVVAKFTEKKMTSLSANYAIDLSQ 267 Query: 1105 VRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESISKDMVR 1284 VRG+VDN+ RILE+K++FGSFDKYLW FVNHKPI TQY+S KIPVKTSKSE+ISKDMV+ Sbjct: 268 VRGIVDNSIRILEVKKQFGSFDKYLWGFVNHKPINTQYRSSQKIPVKTSKSEAISKDMVK 327 Query: 1285 RGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALAS 1401 +GFR VGPTVIHSFM+AAGLTNDHLI C RHLQC ALAS Sbjct: 328 KGFRFVGPTVIHSFMQAAGLTNDHLITCTRHLQCTALAS 366 >gb|EMJ12930.1| hypothetical protein PRUPE_ppa006139mg [Prunus persica] Length = 426 Score = 459 bits (1182), Expect = e-126 Identities = 247/422 (58%), Positives = 300/422 (71%), Gaps = 22/422 (5%) Frame = +1 Query: 205 MCSSKSKPQ-GSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQV 381 MCSSK+K G T +V + INGRPVLQP CNR P L+RRNS+KK ST ++ + Sbjct: 1 MCSSKAKVTIGVEVTPMV--ARINGRPVLQPTCNRVPSLDRRNSIKKISTPRA-PPPPPL 57 Query: 382 PISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPKCN 561 P SS S S I ++ TPP SPK KSPR PAIKRGNDPNGLNSS EKV+ Sbjct: 58 PTSSASSTSPRISNKASSLLTPPISPKSKSPRPPAIKRGNDPNGLNSSSEKVVTPGGTTR 117 Query: 562 GNKIVADPVKKSKNSNNGV--------------------SLDNSPLKNISSSLIVEAPGS 681 + K K ++ GV SL+ + SSSLI EAPGS Sbjct: 118 AKILERKKSKSFKRASVGVDGASADLHHHGDFSAGGFSSSLNIEASLSYSSSLITEAPGS 177 Query: 682 IAAARREQVAIMQVQRKMRIAHYGRTKSAKYERKIVPLDSSATA-AISVKEERRCHFISS 858 IAA RREQ+A+ QRKMRIAHYGR+KSA +ER +VP+D+S A +EE+RC FI++ Sbjct: 178 IAAVRREQMALQHAQRKMRIAHYGRSKSANFER-VVPVDASGNIEAKGAEEEKRCSFITA 236 Query: 859 NSDPIYIAYHDEEWGVPVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEI 1038 NSDPIY+AYHDEEWGVPVH+DK+LFELLVL+GAQVGSDWT++LKKRQ EI Sbjct: 237 
NSDPIYVAYHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRNAFSDFDAEI 296 Query: 1039 VSKYSEKKMTTICNDYGIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQY 1218 V+ +++K+M +I ++YGI++S+VRGVVDN+NRILEIK+EFGSFDKY+W FVN KPI+ QY Sbjct: 297 VANFTDKQMVSIGSEYGIDISRVRGVVDNSNRILEIKKEFGSFDKYIWGFVNQKPISPQY 356 Query: 1219 KSCLKIPVKTSKSESISKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALA 1398 K KIPVKTSKSESISKDMVRRGFR VGPTV+HSFM+A+GLTNDHLI C RHLQC LA Sbjct: 357 KLGYKIPVKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLITCHRHLQCTLLA 416 Query: 1399 SQ 1404 ++ Sbjct: 417 AR 418 >gb|EOX96813.1| DNA glycosylase superfamily protein, putative [Theobroma cacao] Length = 398 Score = 456 bits (1174), Expect = e-125 Identities = 246/407 (60%), Positives = 300/407 (73%), Gaps = 3/407 (0%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQVP 384 MC SK K + V+ INGRPVLQP N+ ++RNSLKK S+N + L P Sbjct: 1 MCCSKFKLH-KDSNIASTVAEINGRPVLQPPSNQITSSDKRNSLKKISSN---SPALSAP 56 Query: 385 ISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPKCNG 564 + ++ + + P+++ PP SPK SPR A+KRG D N LNSS EKVI P+CN Sbjct: 57 LQLSNSRARAVKATMPSLS-PPISPK--SPRPTALKRGKDSNELNSSSEKVI--APRCNV 111 Query: 565 NKIVADPVKKSKN-SNNGVSLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRI 741 + VKK KN S GV+L + K SS +++EAPGSIAAARREQVA++Q QRKMRI Sbjct: 112 K--LDSKVKKPKNASGGGVALTSVDAKYSSSFMVLEAPGSIAAARREQVAMIQEQRKMRI 169 Query: 742 AHYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGVPVHED 921 AHYGRTKSAKYERK+V LDSSA + +++RRC FI+ NSDP+Y AYHDEEWGV VH+D Sbjct: 170 AHYGRTKSAKYERKMVGLDSSAARTAARQDQRRCSFITVNSDPVYAAYHDEEWGVAVHDD 229 Query: 922 KVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDYGIELS 1101 K+LFEL+VL GAQVGSDWT+VLKKRQ E+++ +SEK + +I +DYGI++S Sbjct: 230 KLLFELVVLIGAQVGSDWTSVLKKRQDFREAFSGFDAEVIAGFSEKNILSISSDYGIDVS 289 Query: 1102 QVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESISKDMV 1281 QVR VDNANRILE+++EFGSF+ YLW FVNHKPI TQYKSC KIPVKTSKSE+ISKDMV Sbjct: 290 QVRAAVDNANRILEVRKEFGSFNNYLWGFVNHKPIVTQYKSCHKIPVKTSKSEAISKDMV 349 Query: 1282 
RRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQ--TISP 1416 RRGFR VGPTVIHS M+AAGLTNDHL CPRHLQC+ALASQ T++P Sbjct: 350 RRGFRFVGPTVIHSLMQAAGLTNDHLSTCPRHLQCIALASQFPTVAP 396 >gb|EOY14286.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 409 Score = 455 bits (1170), Expect = e-125 Identities = 241/405 (59%), Positives = 298/405 (73%), Gaps = 3/405 (0%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKK-SSTNKSFATNLQV 381 MCSS +K V+ INGRPVLQP CNR P L+RRNSLKK + +L Sbjct: 1 MCSSNAKVTAGVEIT-PAVARINGRPVLQPTCNRVPSLDRRNSLKKIPPLSPPTPPSLAS 59 Query: 382 PISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPKCN 561 + +TS N G+ K ++T PP SPK KSPR AIKRG+DPN LN+S EKV+ TP+ N Sbjct: 60 TLPATSATVGNGGRAKASLT-PPISPKSKSPRPAAIKRGSDPNALNTSSEKVM--TPR-N 115 Query: 562 GNKIVADPVKKS--KNSNNGVSLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKM 735 K + KS + NG+S P + SSSLIVEAPGSIAA RREQ+A+ Q QRKM Sbjct: 116 ITKTLERKKSKSFKEGMGNGLSSWIEPSLSYSSSLIVEAPGSIAAVRREQMALQQAQRKM 175 Query: 736 RIAHYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGVPVH 915 +IAHYGR+KSAK+E K+VPL++S+ +EE+RC FI+ NSDP+Y+AYHDEEWGVPVH Sbjct: 176 KIAHYGRSKSAKFESKVVPLNTSSAMTKPDEEEKRCSFITPNSDPVYVAYHDEEWGVPVH 235 Query: 916 EDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDYGIE 1095 +D +LFELLVL+GAQVGSDW ++LKKRQ E V+K+++K+MTTI ++YGI+ Sbjct: 236 DDSMLFELLVLSGAQVGSDWISILKKRQDFRDAFSGFDAETVAKFTDKEMTTISSEYGID 295 Query: 1096 LSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESISKD 1275 +S+V GVVDN+NRILE+K +FGSFDKY+W FVNHK I+TQYK KIPVKTSKSESISKD Sbjct: 296 ISRVLGVVDNSNRILEVKGQFGSFDKYIWGFVNHKAISTQYKFGHKIPVKTSKSESISKD 355 Query: 1276 MVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQTI 1410 M+RRGFR VGPTV+HSFM+AAGLTNDHLI C RHL C LA+ +I Sbjct: 356 MLRRGFRCVGPTVVHSFMQAAGLTNDHLITCHRHLPCTLLAASSI 400 >ref|XP_003531474.1| PREDICTED: uncharacterized protein LOC100791725 [Glycine max] Length = 400 Score = 454 bits (1167), Expect = e-124 Identities = 245/410 (59%), Positives = 297/410 (72%), Gaps = 8/410 (1%) Frame = +1 Query: 
205 MCSSKSKP----QGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATN 372 MCSSK+K + A V+ INGRPVLQP CNR P LERRNS+KK + KS + Sbjct: 1 MCSSKTKVTVGLEAVVAAAKPSVARINGRPVLQPTCNRVPNLERRNSIKKVAPPKSLS-- 58 Query: 373 LQVPISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTP 552 P S P T++ TPP SPKLKSPR PA KRGND NGLNSS EK+++ Sbjct: 59 ---PPSPPLPSKTSL--------TPPVSPKLKSPRLPATKRGNDNNGLNSSYEKIVIPR- 106 Query: 553 KCNGNKIVADPVKKSKNSNNG--VSLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQ 726 + K KKSK+ G VS + SSSLI ++PGSIAA RREQ+A+ Q Q Sbjct: 107 --SSTKTPTLERKKSKSFKEGSCVSASIEASLSYSSSLITDSPGSIAAVRREQMALQQAQ 164 Query: 727 RKMRIAHYGRTKSAKYERKIVPLDSSATAAIS--VKEERRCHFISSNSDPIYIAYHDEEW 900 RKM+IAHYGR+KSAK+ER +VPLD S T+ S +EE+RC FI+ NSDPIYIAYHDEEW Sbjct: 165 RKMKIAHYGRSKSAKFER-VVPLDPSNTSLASKPTEEEKRCSFITPNSDPIYIAYHDEEW 223 Query: 901 GVPVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICN 1080 GVPVH+DK+LFELLVL+GAQVGSDWT+ LKKR E V+ ++K+M +I + Sbjct: 224 GVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQMMSISS 283 Query: 1081 DYGIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSE 1260 +YGI++S+VRGVVDNAN+ILEIK++FGSFDKY+W FVNHKPI+TQYK KIPVKTSKSE Sbjct: 284 EYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPVKTSKSE 343 Query: 1261 SISKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQTI 1410 SISKDMVRRGFR VGPTV+HSFM+ +GLTNDHLI C RHLQC LA++++ Sbjct: 344 SISKDMVRRGFRFVGPTVVHSFMQTSGLTNDHLITCHRHLQCTLLAARSL 393 >gb|ESW21148.1| hypothetical protein PHAVU_005G045900g [Phaseolus vulgaris] Length = 405 Score = 452 bits (1164), Expect = e-124 Identities = 246/414 (59%), Positives = 300/414 (72%), Gaps = 14/414 (3%) Frame = +1 Query: 205 MCSSKSK----------PQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTN 354 MCSSK+K +T+T + V+ INGRPVLQP CNR P LERRNS+KK Sbjct: 1 MCSSKAKVTVGIEGVVAAATTTSTVMPSVARINGRPVLQPTCNRVPNLERRNSIKKVQPP 60 Query: 355 KSFATNLQVPISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEK 534 KS + P+SS + + TPP SPK KSPR PA+KRGND NGLN+S EK Sbjct: 61 KSLSPP-SPPLSSKTSL------------TPPVSPKSKSPRLPAVKRGNDNNGLNTSYEK 107 
Query: 535 VILSTPKCNGNKIVADPVKKSKNSNNGVSLDNSPLKNIS--SSLIVEAPGSIAAARREQV 708 + + PK + +K KKSK+ G S + S SSLI ++PGSIAA RREQ+ Sbjct: 108 IAI--PK-SSSKAPTLERKKSKSFKEGSCAPASTEASFSYASSLITDSPGSIAAVRREQM 164 Query: 709 AIMQVQRKMRIAHYGRTKSAKYERKIVPLDSSATAAIS--VKEERRCHFISSNSDPIYIA 882 A+ Q QRKM+IAHYGR+KSAK+ER +VPLD S T S +EE+RC FI++NSDPIYIA Sbjct: 165 ALQQAQRKMKIAHYGRSKSAKFER-VVPLDPSTTTLTSKPTEEEKRCSFITANSDPIYIA 223 Query: 883 YHDEEWGVPVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKK 1062 YHDEEWGVPVH+DK+LFELLVL+GAQVGSDWT+ LKKRQ E V+ ++K+ Sbjct: 224 YHDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRQDFRAAFSDFDAETVANLTDKQ 283 Query: 1063 MTTICNDYGIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPV 1242 M +I ++YGI++S+VRGVVDNAN+ILEIK++FGSFDKY+W FVNHKPI+TQYK KIPV Sbjct: 284 MMSISSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPISTQYKFGHKIPV 343 Query: 1243 KTSKSESISKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQ 1404 KTSKSESISKDMVRRG+R VGPTV+HSFM+AAGLTNDHLI C RHLQC LA++ Sbjct: 344 KTSKSESISKDMVRRGYRFVGPTVVHSFMQAAGLTNDHLITCHRHLQCTLLAAR 397 >ref|XP_002299382.2| hypothetical protein POPTR_0001s12320g [Populus trichocarpa] gi|550347083|gb|EEE84187.2| hypothetical protein POPTR_0001s12320g [Populus trichocarpa] Length = 373 Score = 452 bits (1164), Expect = e-124 Identities = 247/411 (60%), Positives = 287/411 (69%), Gaps = 7/411 (1%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQV- 381 MCS K + S ++ INGRPVLQP N+ P LERRNSLKK+S KS Sbjct: 1 MCSFKFRLHRSANNIATPIAKINGRPVLQPKSNQVPSLERRNSLKKNSPAKSPTQEPAAV 60 Query: 382 -PISSTSPVSTNIG-KVK-PAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTP 552 PI P G K K P+ +PP SPKLKSP PA+KRGNDP+GLN+S EKV TP Sbjct: 61 PPIPLMQPAGNAAGTKTKQPSGLSPPISPKLKSPVLPAVKRGNDPDGLNTSAEKVW--TP 118 Query: 553 KCNGNKIVADPVKKSKNSNNGVSLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRK 732 +E+PGSIAAARRE VA+MQ QRK Sbjct: 119 -------------------------------------LESPGSIAAARREHVAVMQEQRK 141 Query: 733 MRIAHYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGVPV 912 MRIAHYGRTKSAKY K+VP DS AT IS 
+EE+RC FI+ NSDPIY+AYHDEEWGVPV Sbjct: 142 MRIAHYGRTKSAKYHGKVVPADSPATNTIS-REEKRCSFITPNSDPIYVAYHDEEWGVPV 200 Query: 913 HEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDYGI 1092 H+DK+LFELLVLTGAQVGSDWT+VLKKR+ E+V+K++EKK+ +I +YGI Sbjct: 201 HDDKMLFELLVLTGAQVGSDWTSVLKKREAFREAFSGFDAEVVAKFTEKKIASISAEYGI 260 Query: 1093 ELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESISK 1272 + SQVRGVVDN+N+I+E+KREFGSFDKYLW +VNHKPI TQYKSC KIPVKTSKSE+ISK Sbjct: 261 DTSQVRGVVDNSNKIMEVKREFGSFDKYLWEYVNHKPIFTQYKSCQKIPVKTSKSETISK 320 Query: 1273 DMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQ---TISP 1416 DMV+RGFR VGPTVIHSFM+A GL NDHLI CPRHLQ ALASQ T++P Sbjct: 321 DMVKRGFRFVGPTVIHSFMQAGGLRNDHLITCPRHLQYTALASQHPSTLAP 371 >ref|XP_004295546.1| PREDICTED: uncharacterized protein LOC101298985 [Fragaria vesca subsp. vesca] Length = 410 Score = 452 bits (1162), Expect = e-124 Identities = 246/417 (58%), Positives = 299/417 (71%), Gaps = 15/417 (3%) Frame = +1 Query: 205 MCSSKSK-PQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQV 381 MCSSK+K G T +V S INGRPVLQP CNR P L+RRNSLKK ST + Sbjct: 1 MCSSKAKVTMGIEITPLV--SRINGRPVLQPTCNRVPSLDRRNSLKKLSTPPP----PPL 54 Query: 382 PISSTSPVSTNIG-KVKPAVTTPPASPKLKSPRQPAIKR-GNDPNGLNSSVEKVILSTPK 555 P+S+ S ST+ K ++TTPP SPK KSPR PAIKR GNDPNGLNSS EKV+ TP Sbjct: 55 PLSNASSTSTSPRISTKASLTTPPVSPKSKSPRPPAIKRSGNDPNGLNSSSEKVV--TPG 112 Query: 556 CNGNKIVADPVKKSKNSNNGVSLDNS------------PLKNISSSLIVEAPGSIAAARR 699 V + KKSK+ GV DN+ + SSSLI EAPG+IAA RR Sbjct: 113 GTTRAKVLER-KKSKSFKLGVGADNAHDHGRLSSASIEASLSYSSSLITEAPGTIAAGRR 171 Query: 700 EQVAIMQVQRKMRIAHYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYI 879 EQ+A+ QRKMRIAHYGR+ SA +ER + P+D+ ++ +RC FI++NSDPIY+ Sbjct: 172 EQMALQHAQRKMRIAHYGRSNSANFER-VAPIDTMEAKG-GEEDHKRCSFITANSDPIYV 229 Query: 880 AYHDEEWGVPVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEK 1059 AYHD+EWGVPVH+DK+LFELLVL+GAQVGSDWT++LKKRQ E V+ ++K Sbjct: 230 AYHDQEWGVPVHDDKMLFELLVLSGAQVGSDWTSILKKRQDFRDAFSGFDAEAVANLTDK 289 Query: 1060 
KMTTICNDYGIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIP 1239 +M +IC++YGI++S+VRGVVDN+NRILE+KREFGSF KY+W FVNHKPI+ QYK KIP Sbjct: 290 QMISICSEYGIDISRVRGVVDNSNRILEVKREFGSFHKYIWGFVNHKPISPQYKQGYKIP 349 Query: 1240 VKTSKSESISKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQTI 1410 VKTSKSESISKDMVRRGFR VGPTV+HSFM+A+GLTNDHL C RHLQC LA+ + Sbjct: 350 VKTSKSESISKDMVRRGFRFVGPTVVHSFMQASGLTNDHLTTCHRHLQCTLLAAHPL 406 >ref|XP_003546838.1| PREDICTED: uncharacterized protein LOC100811352 [Glycine max] Length = 400 Score = 452 bits (1162), Expect = e-124 Identities = 244/417 (58%), Positives = 297/417 (71%), Gaps = 13/417 (3%) Frame = +1 Query: 205 MCSSKSK---------PQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNK 357 MC SK+K +T T V+ INGRPVLQP CNR P LERRNS+KK + K Sbjct: 1 MCGSKTKVTIGLEVIAAAATTTTAKPSVARINGRPVLQPTCNRVPNLERRNSIKKVAPAK 60 Query: 358 SFATNLQVPISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKV 537 S + P S P T++ TPP SPK KSPR PA KRGND NGLNSS EK+ Sbjct: 61 SLS-----PPSPPLPSKTSL--------TPPVSPKSKSPRLPATKRGNDNNGLNSSYEKI 107 Query: 538 ILSTPKCNGNKIVADPVKKSKNSNNG--VSLDNSPLKNISSSLIVEAPGSIAAARREQVA 711 ++ + KKSK+ G VS + SSSLI ++PGSIAA RREQ+A Sbjct: 108 VIPRSSIKTPTLER---KKSKSFKEGSCVSASIEASLSYSSSLITDSPGSIAAVRREQMA 164 Query: 712 IMQVQRKMRIAHYGRTKSAKYERKIVPLDSSATAAIS--VKEERRCHFISSNSDPIYIAY 885 + Q QRKM+IAHYGR+KSAK+ER +VPLD S T+ S +EE+RC FI++NSDPIYIAY Sbjct: 165 LQQAQRKMKIAHYGRSKSAKFER-VVPLDPSNTSLASKPTEEEKRCSFITANSDPIYIAY 223 Query: 886 HDEEWGVPVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKM 1065 HDEEWGVPVH+DK+LFELLVL+GAQVGSDWT+ LKKR E V+ ++K+M Sbjct: 224 HDEEWGVPVHDDKMLFELLVLSGAQVGSDWTSTLKKRLDFRAAFSEFDAETVANLTDKQM 283 Query: 1066 TTICNDYGIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVK 1245 +I ++YGI++S+VRGVVDNAN+ILEIK++FGSFDKY+W FVNHKP++TQYK KIPVK Sbjct: 284 MSISSEYGIDISRVRGVVDNANQILEIKKDFGSFDKYIWGFVNHKPLSTQYKFGHKIPVK 343 Query: 1246 TSKSESISKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQTISP 1416 TSKSESISKDMVRRGFR VGPTV+HSFM+A+GLTNDHLI C RHLQC LA+++ P Sbjct: 344 
TSKSESISKDMVRRGFRYVGPTVVHSFMQASGLTNDHLITCHRHLQCTLLAARSFVP 400 >ref|XP_006448576.1| hypothetical protein CICLE_v10015639mg [Citrus clementina] gi|557551187|gb|ESR61816.1| hypothetical protein CICLE_v10015639mg [Citrus clementina] Length = 375 Score = 450 bits (1157), Expect = e-123 Identities = 242/399 (60%), Positives = 294/399 (73%) Frame = +1 Query: 205 MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKKSSTNKSFATNLQVP 384 MCSSKSK +T INGRPVLQP N+ P LE+R+S+KK+ + KS P Sbjct: 1 MCSSKSKLHSAT--------QINGRPVLQPTSNQVPSLEKRSSIKKTGSPKS-------P 45 Query: 385 ISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPKCNG 564 I++ + S + K ++ +PP SPKLKSPR A+KRGNDPN LN+S EK++ TPK Sbjct: 46 ITTNNVNSKSFTK---SLLSPPVSPKLKSPRPAAVKRGNDPNVLNTSAEKIM--TPKK-- 98 Query: 565 NKIVADPVKKSKNSNNGVSLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKMRIA 744 +A VKK KN+ D SSLIVEAPGSIAAARRE VAIMQ QRK+RIA Sbjct: 99 ---LASFVKKPKNAEVAPCYD--------SSLIVEAPGSIAAARREHVAIMQEQRKLRIA 147 Query: 745 HYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSDPIYIAYHDEEWGVPVHEDK 924 HYGRTKSAK+E K+ LDS A + +EE+RC FI+ NSDP Y+AYHDEEWGVPVH+DK Sbjct: 148 HYGRTKSAKFEGKVPGLDSFANGDNNDREEKRCSFITPNSDPKYVAYHDEEWGVPVHDDK 207 Query: 925 VLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICNDYGIELSQ 1104 +LFELLVLT AQVGSDWT+VLKKR+ E+V+K++EKK+T++ +Y I+LSQ Sbjct: 208 LLFELLVLTAAQVGSDWTSVLKKRRAFREAFSGFDAEVVAKFTEKKITSLSANYAIDLSQ 267 Query: 1105 VRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSESISKDMVR 1284 VRG+VDN+ RILE+K++FGSFDKYLW FVNHK I TQY+S KIP KTSKSE+ISKDMV+ Sbjct: 268 VRGIVDNSIRILEVKKQFGSFDKYLWGFVNHKTINTQYRSSQKIPAKTSKSEAISKDMVK 327 Query: 1285 RGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALAS 1401 +GFR VGPTVIHSFM+AAGL+NDHLI C RHLQC ALAS Sbjct: 328 KGFRFVGPTVIHSFMQAAGLSNDHLITCTRHLQCTALAS 366 >gb|EOY14287.1| DNA glycosylase superfamily protein isoform 2 [Theobroma cacao] Length = 413 Score = 446 bits (1148), Expect = e-122 Identities = 240/409 (58%), Positives = 297/409 (72%), Gaps = 7/409 (1%) Frame = +1 Query: 205 
MCSSKSKPQGSTATDIVVVSHINGRPVLQPNCNRSPLLERRNSLKK-SSTNKSFATNLQV 381 MCSS +K V+ INGRPVLQP CNR P L+RRNSLKK + +L Sbjct: 1 MCSSNAKVTAGVEIT-PAVARINGRPVLQPTCNRVPSLDRRNSLKKIPPLSPPTPPSLAS 59 Query: 382 PISSTSPVSTNIGKVKPAVTTPPASPKLKSPRQPAIKRGNDPNGLNSSVEKVILSTPKCN 561 + +TS N G+ K ++T PP SPK KSPR AIKRG+DPN LN+S EKV+ TP+ N Sbjct: 60 TLPATSATVGNGGRAKASLT-PPISPKSKSPRPAAIKRGSDPNALNTSSEKVM--TPR-N 115 Query: 562 GNKIVADPVKKS--KNSNNGVSLDNSPLKNISSSLIVEAPGSIAAARREQVAIMQVQRKM 735 K + KS + NG+S P + SSSLIVEAPGSIAA RREQ+A+ Q QRKM Sbjct: 116 ITKTLERKKSKSFKEGMGNGLSSWIEPSLSYSSSLIVEAPGSIAAVRREQMALQQAQRKM 175 Query: 736 RIAHYGRTKSAKYERKIVPLDSSATAAISVKEERRCHFISSNSD----PIYIAYHDEEWG 903 +IAHYGR+KSAK+E K+VPL++S+ +EE+RC FI+ NS P+Y+AYHDEEWG Sbjct: 176 KIAHYGRSKSAKFESKVVPLNTSSAMTKPDEEEKRCSFITPNSGIAIYPVYVAYHDEEWG 235 Query: 904 VPVHEDKVLFELLVLTGAQVGSDWTTVLKKRQXXXXXXXXXXPEIVSKYSEKKMTTICND 1083 VPVH+D +LFELLVL+GAQVGSDW ++LKKRQ E V+K+++K+MTTI ++ Sbjct: 236 VPVHDDSMLFELLVLSGAQVGSDWISILKKRQDFRDAFSGFDAETVAKFTDKEMTTISSE 295 Query: 1084 YGIELSQVRGVVDNANRILEIKREFGSFDKYLWAFVNHKPIATQYKSCLKIPVKTSKSES 1263 YGI++S+V GVVDN+NRILE+K +FGSFDKY+W FVNHK I+TQYK KIPVKTSKSES Sbjct: 296 YGIDISRVLGVVDNSNRILEVKGQFGSFDKYIWGFVNHKAISTQYKFGHKIPVKTSKSES 355 Query: 1264 ISKDMVRRGFRLVGPTVIHSFMEAAGLTNDHLINCPRHLQCVALASQTI 1410 ISKDM+RRGFR VGPTV+HSFM+AAGLTNDHLI C RHL C LA+ +I Sbjct: 356 ISKDMLRRGFRCVGPTVVHSFMQAAGLTNDHLITCHRHLPCTLLAASSI 404