BLASTX nr result
ID: Glycyrrhiza23_contig00021772
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00021772
         (1289 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_003540731.1| PREDICTED: uracil-DNA glycosylase-like [Glyc...   479   e-133
ref|XP_002271878.2| PREDICTED: uracil-DNA glycosylase [Vitis vin...   409   e-111
emb|CBI27448.3| unnamed protein product [Vitis vinifera]              405   e-110
ref|XP_004140430.1| PREDICTED: uracil-DNA glycosylase-like [Cucu...   398   e-108
ref|XP_002885267.1| uracil DNA glycosylase family protein [Arabi...   381   e-103

>ref|XP_003540731.1| PREDICTED: uracil-DNA glycosylase-like [Glycine max]
          Length = 303

 Score = 479 bits (1234), Expect = e-133
 Identities = 245/308 (79%), Positives = 261/308 (84%)
 Frame = +2

Query: 125  MASASSKTLIGMFERASKRLKPTLTPTSCKSDDATTNGSTLSVDQKSRIEYNKQLAKSKR 304
            MASA S+TL F+ ASKRLKPTL P SCKSDDA N STLSVDQK R+EYNK LAKSKR
Sbjct: 1    MASAPSRTLTDFFQPASKRLKPTL-PASCKSDDA--NASTLSVDQKLRMEYNKLLAKSKR 57

Query: 305  NLKICLEIVSKHKGAGGDGCCXXXXXXXXXXXXXXXXPGEFQKPYAVTLSKFVETEISSA 484
            NLK+C+E VSK K +G G PGE QKPYA+TLSKFVE+EIS
Sbjct: 58   NLKLCVERVSKSKESGLGGV--KLEELLVEETWLEALPGELQKPYALTLSKFVESEISGG 115

Query: 485  DGAVYPPSHLIFNALNSTPFHAVKAVILGQDPYHGPGQAMGLSFSVPEGVKVPSSLVNIF 664
            DG ++PP+HLIFNALNSTPFH VKAVILGQDPYHGPGQAMGLSFSVPEG+KVPSSLVNIF
Sbjct: 116  DGVIFPPTHLIFNALNSTPFHTVKAVILGQDPYHGPGQAMGLSFSVPEGIKVPSSLVNIF 175

Query: 665  KELKQDLACSIPSHGNLEKWALQGVLLLNAVLTVRKHQANSHAKKGWEQLTDAVIKTISQ 844
            KEL QDL CSIP+HGNL+KWA+QGVLLLNAVLTVRKHQANSHAKKGWEQ TD VIKTISQ
Sbjct: 176  KELHQDLGCSIPTHGNLQKWAVQGVLLLNAVLTVRKHQANSHAKKGWEQFTDVVIKTISQ 235

Query: 845  KKEGVVFLLWGNSAREKSRLIDATKHHILKAAHPSGLSANRGFFGCRHFSRTNQLLEQMG 1024
            KKEGVVFLLWGNSAREKSRLIDA KHH+L AAHPSGLSANRGFFGCRHFSRTNQLLEQMG
Sbjct: 236  KKEGVVFLLWGNSAREKSRLIDARKHHVLTAAHPSGLSANRGFFGCRHFSRTNQLLEQMG 295

Query: 1025 IAPIDWQL 1048
            I PIDWQL
Sbjct: 296  IDPIDWQL 303

>ref|XP_002271878.2| PREDICTED: uracil-DNA glycosylase [Vitis vinifera]
          Length = 328

 Score = 409 bits (1050), Expect = e-111
 Identities = 218/331 (65%), Positives = 243/331 (73%), Gaps = 26/331 (7%)
 Frame = +2

Query: 134  ASSKTLIGMFERASKRLKPTLTPTSCKSDDATTN-------------------------- 235
            A+SKTL+ + SKRLK + TPTS S +++
Sbjct: 2    AASKTLMDYLQ-PSKRLKVS-TPTSSSSSSSSSPKSLLLPVSSLSHSQSQDPHQSPPSSP 59

Query: 236  GSTLSVDQKSRIEYNKQLAKSKRNLKICLEIVSKHKGAGGDGCCXXXXXXXXXXXXXXXX 415
            S L+ QKSRIE+NK LAKSKRNL IC + VSK K G
Sbjct: 60   SSALTAHQKSRIEFNKFLAKSKRNLTICSQKVSKSKAEGVG--FVELEDLLLEETWLDAL 117

Query: 416  PGEFQKPYAVTLSKFVETEISSADGAVYPPSHLIFNALNSTPFHAVKAVILGQDPYHGPG 595
            PGEFQKPYA TL +F+E E+ + +YPP HLIFNALNSTPF VKAVI+GQDPYHGPG
Sbjct: 118  PGEFQKPYAKTLCRFLEREVCGSGVPIYPPQHLIFNALNSTPFDRVKAVIIGQDPYHGPG 177

Query: 596  QAMGLSFSVPEGVKVPSSLVNIFKELKQDLACSIPSHGNLEKWALQGVLLLNAVLTVRKH 775
            QAMGLSFSVPEGVKVPSSLVNIFKEL+QDL CSIPSHGNLEKWA+QGVLLLNAVLTVR H
Sbjct: 178  QAMGLSFSVPEGVKVPSSLVNIFKELQQDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRSH 237

Query: 776  QANSHAKKGWEQLTDAVIKTISQKKEGVVFLLWGNSAREKSRLIDATKHHILKAAHPSGL 955
            QANSHAKKGWEQ TD+VI+TISQK+ GVVFLLWGNSA+EKSRLID TKHHILKAAHPSGL
Sbjct: 238  QANSHAKKGWEQFTDSVIRTISQKQRGVVFLLWGNSAQEKSRLIDDTKHHILKAAHPSGL 297

Query: 956  SANRGFFGCRHFSRTNQLLEQMGIAPIDWQL 1048
            SANRGFFGCRHFSRTN++LEQ G+ PIDWQL
Sbjct: 298  SANRGFFGCRHFSRTNKILEQKGVPPIDWQL 328

>emb|CBI27448.3| unnamed protein product [Vitis vinifera]
          Length = 321

 Score = 405 bits (1041), Expect = e-110
 Identities = 203/270 (75%), Positives = 221/270 (81%)
 Frame = +2

Query: 239  STLSVDQKSRIEYNKQLAKSKRNLKICLEIVSKHKGAGGDGCCXXXXXXXXXXXXXXXXP 418
            S L+ QKSRIE+NK LAKSKRNL IC + VSK K G P
Sbjct: 54   SALTAHQKSRIEFNKFLAKSKRNLTICSQKVSKSKAEGVG--FVELEDLLLEETWLDALP 111

Query: 419  GEFQKPYAVTLSKFVETEISSADGAVYPPSHLIFNALNSTPFHAVKAVILGQDPYHGPGQ 598
            GEFQKPYA TL +F+E E+ + +YPP HLIFNALNSTPF VKAVI+GQDPYHGPGQ
Sbjct: 112  GEFQKPYAKTLCRFLEREVCGSGVPIYPPQHLIFNALNSTPFDRVKAVIIGQDPYHGPGQ 171

Query: 599  AMGLSFSVPEGVKVPSSLVNIFKELKQDLACSIPSHGNLEKWALQGVLLLNAVLTVRKHQ 778
            AMGLSFSVPEGVKVPSSLVNIFKEL+QDL CSIPSHGNLEKWA+QGVLLLNAVLTVR HQ
Sbjct: 172  AMGLSFSVPEGVKVPSSLVNIFKELQQDLGCSIPSHGNLEKWAVQGVLLLNAVLTVRSHQ 231

Query: 779  ANSHAKKGWEQLTDAVIKTISQKKEGVVFLLWGNSAREKSRLIDATKHHILKAAHPSGLS 958
            ANSHAKKGWEQ TD+VI+TISQK+ GVVFLLWGNSA+EKSRLID TKHHILKAAHPSGLS
Sbjct: 232  ANSHAKKGWEQFTDSVIRTISQKQRGVVFLLWGNSAQEKSRLIDDTKHHILKAAHPSGLS 291

Query: 959  ANRGFFGCRHFSRTNQLLEQMGIAPIDWQL 1048
            ANRGFFGCRHFSRTN++LEQ G+ PIDWQL
Sbjct: 292  ANRGFFGCRHFSRTNKILEQKGVPPIDWQL 321

>ref|XP_004140430.1| PREDICTED: uracil-DNA glycosylase-like [Cucumis sativus]
 gi|449518103|ref|XP_004166083.1| PREDICTED: uracil-DNA glycosylase-like [Cucumis sativus]
          Length = 318

 Score = 398 bits (1022), Expect = e-108
 Identities = 209/315 (66%), Positives = 237/315 (75%), Gaps = 9/315 (2%)
 Frame = +2

Query: 131  SASSKTLIGMFERA-SKRLKPTLT-------PTSCKSDDATTNGST-LSVDQKSRIEYNK 283
            S+ ++TLI +F+ A SKRLK + T C SD + S +S Q SR+E NK
Sbjct: 9    SSKTRTLIDIFQPALSKRLKTSQTLKTLATNDDKCDSDLTLASSSADISASQISRMETNK 68

Query: 284  QLAKSKRNLKICLEIVSKHKGAGGDGCCXXXXXXXXXXXXXXXXPGEFQKPYAVTLSKFV 463
            +A+SKRNLK C + VSK + C PGEFQKPYA+ L KFV
Sbjct: 69   WIARSKRNLKTCSDRVSKWENG-----CVKLEELLVEETWFEALPGEFQKPYALNLCKFV 123

Query: 464  ETEISSADGAVYPPSHLIFNALNSTPFHAVKAVILGQDPYHGPGQAMGLSFSVPEGVKVP 643
            +TEI S+ +YPP LIFNALNSTPF VK VILGQDPYHGPGQAMGLSFSVPEGVK+P
Sbjct: 124  QTEICSSGVPIYPPPSLIFNALNSTPFDRVKVVILGQDPYHGPGQAMGLSFSVPEGVKIP 183

Query: 644  SSLVNIFKELKQDLACSIPSHGNLEKWALQGVLLLNAVLTVRKHQANSHAKKGWEQLTDA 823
            SSL+NIFKEL+ DL CSIPSHGNL KWA+QGVLLLNAVL+VRKHQANSHAK+GWEQ TDA
Sbjct: 184  SSLLNIFKELRDDLGCSIPSHGNLGKWAVQGVLLLNAVLSVRKHQANSHAKRGWEQFTDA 243

Query: 824  VIKTISQKKEGVVFLLWGNSAREKSRLIDATKHHILKAAHPSGLSANRGFFGCRHFSRTN 1003
            VIKTISQKKEG++FLLWGNSA+ K RLID KHHILKAAHPSGLSANRGFFGCRHFSRTN
Sbjct: 244  VIKTISQKKEGIIFLLWGNSAQAKLRLIDEKKHHILKAAHPSGLSANRGFFGCRHFSRTN 303

Query: 1004 QLLEQMGIAPIDWQL 1048
            LL++MG A IDWQL
Sbjct: 304  ILLKEMGTASIDWQL 318

>ref|XP_002885267.1| uracil DNA glycosylase family protein [Arabidopsis lyrata subsp. lyrata]
 gi|297331107|gb|EFH61526.1| uracil DNA glycosylase family protein [Arabidopsis lyrata subsp. lyrata]
          Length = 329

 Score = 381 bits (978), Expect = e-103
 Identities = 206/335 (61%), Positives = 236/335 (70%), Gaps = 29/335 (8%)
 Frame = +2

Query: 131  SASSKTLIGMFERASKRLK---------------------------PTLTPTSCKSDDAT 229
            ++SSKTL+ F+ A KRLK P +T T+ +DD+
Sbjct: 2    ASSSKTLMDFFQPA-KRLKASPSSSSFPAVSVAGGSRGLVSAANSPPRVTVTTSVADDS- 59

Query: 230  TNGSTLSVDQKSRIEYNKQLAKSKRNLKICLEIVSKHKGAGGDGCCXXXXXXXXXXXXXX 409
            S L+ +Q +R E+NK +AKSKRNL +C E V+K K GG C
Sbjct: 60   ---SGLTPEQVARAEFNKFVAKSKRNLAVCSEKVTKAKAEGG--CYVPLSELLVEESWLK 114

Query: 410  XXPGEFQKPYAVTLSKFVETEI--SSADGAVYPPSHLIFNALNSTPFHAVKAVILGQDPY 583
            PGE KPYA TLS F+E EI S +YPP HLIFNALN+TPF VK VI+GQDPY
Sbjct: 115  ALPGELHKPYAKTLSDFLEREIIADSKSPPIYPPQHLIFNALNTTPFDRVKTVIIGQDPY 174

Query: 584  HGPGQAMGLSFSVPEGVKVPSSLVNIFKELKQDLACSIPSHGNLEKWALQGVLLLNAVLT 763
            HGPGQAMGLSFSVPEG K+PSSL+NIFKEL +D+ CSIP HGNL+KWA+QGVLLLNAVLT
Sbjct: 175  HGPGQAMGLSFSVPEGEKLPSSLLNIFKELHKDVGCSIPRHGNLQKWAVQGVLLLNAVLT 234

Query: 764  VRKHQANSHAKKGWEQLTDAVIKTISQKKEGVVFLLWGNSAREKSRLIDATKHHILKAAH 943
            VR Q NSHAKKGWEQ TDAVI++ISQ+KEGVVFLLWG A+EKS+LIDATKHHIL AAH
Sbjct: 235  VRSKQPNSHAKKGWEQFTDAVIQSISQQKEGVVFLLWGRYAQEKSKLIDATKHHILTAAH 294

Query: 944  PSGLSANRGFFGCRHFSRTNQLLEQMGIAPIDWQL 1048
            PSGLSANRGFF CRHFSR NQLLEQMGI PIDWQL
Sbjct: 295  PSGLSANRGFFNCRHFSRANQLLEQMGIPPIDWQL 329
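
The per-hit statistics in this report can be sanity-checked with a few lines of Python. The sketch below is illustrative only: the helper names `check_stats` and `frame_from_start` are hypothetical, and it assumes the percentages are truncated rather than rounded, which is consistent with the values in this report (for example 245/308 is shown as 79%). It also assumes that for a forward-strand hit the reading frame follows from the 1-based query start coordinate.

```python
import re

# Matches the "n/m (p%)" statistics printed in each alignment header.
STAT_RE = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")

def check_stats(text):
    """Return (label, count, length, reported_pct, recomputed_pct) tuples."""
    rows = []
    for label, n, m, pct in STAT_RE.findall(text):
        n, m, pct = int(n), int(m), int(pct)
        # Assumption: the report truncates percentages (integer division).
        rows.append((label, n, m, pct, 100 * n // m))
    return rows

def frame_from_start(query_start):
    """Forward-strand reading frame (1 to 3) implied by a 1-based query start."""
    return (query_start - 1) % 3 + 1

# Statistics lines copied from the first two hits of this report.
sample = (
    "Identities = 245/308 (79%), Positives = 261/308 (84%) "
    "Identities = 218/331 (65%), Positives = 243/331 (73%), Gaps = 26/331 (7%)"
)

for label, n, m, reported, recomputed in check_stats(sample):
    status = "ok" if reported == recomputed else "MISMATCH"
    print(f"{label}: {n}/{m} reported {reported}% recomputed {recomputed}% {status}")

# The first alignment starts at query position 125 and is reported as Frame = +2.
print("frame for query start 125:", frame_from_start(125))
```

The same check applies to the other hits: every alignment in this report starts at a query position that is 2 modulo 3 (125, 134, 239, 131), matching the reported Frame = +2.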