BLASTX nr result
ID: Forsythia23_contig00018725
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia23_contig00018725
         (1240 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_002276173.1| PREDICTED: uncharacterized protein LOC100267...   412   e-112
ref|XP_010656606.1| PREDICTED: uncharacterized protein LOC100267...   407   e-111
emb|CDP10232.1| unnamed protein product [Coffea canephora]            405   e-110
ref|XP_007011936.1| DNA glycosylase superfamily protein isoform ...   404   e-110
ref|XP_010103669.1| Putative Glutamine amidotransferase [Morus n...   403   e-109
ref|XP_008242987.1| PREDICTED: uncharacterized protein LOC103341...   399   e-108
ref|XP_011083975.1| PREDICTED: uncharacterized protein LOC105166...   399   e-108
ref|XP_007204814.1| hypothetical protein PRUPE_ppa026720mg [Prun...   399   e-108
ref|XP_009795108.1| PREDICTED: uncharacterized protein LOC104241...   397   e-108
ref|XP_002324538.1| methyladenine glycosylase family protein [Po...   397   e-108
ref|XP_009617886.1| PREDICTED: uncharacterized protein LOC104110...   396   e-107
ref|XP_011018029.1| PREDICTED: uncharacterized protein LOC105121...   392   e-106
ref|XP_012462430.1| PREDICTED: uncharacterized protein LOC105782...   391   e-106
ref|XP_004287130.1| PREDICTED: uncharacterized protein LOC101313...   391   e-106
ref|XP_010275821.1| PREDICTED: uncharacterized protein LOC104610...   390   e-105
ref|XP_012449856.1| PREDICTED: uncharacterized protein LOC105772...   390   e-105
ref|XP_011083973.1| PREDICTED: uncharacterized protein LOC105166...   390   e-105
ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [R...
390 e-105 gb|KHG05578.1| guaA [Gossypium arboreum] 389 e-105 gb|KHG15995.1| putative GMP synthase [glutamine-hydrolyzing] [Go... 388 e-105 >ref|XP_002276173.1| PREDICTED: uncharacterized protein LOC100267363 isoform X2 [Vitis vinifera] gi|297743642|emb|CBI36525.3| unnamed protein product [Vitis vinifera] Length = 375 Score = 412 bits (1058), Expect = e-112 Identities = 223/348 (64%), Positives = 255/348 (73%), Gaps = 6/348 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSG PR RS N+ADS+ R AGNK R + +KP K LRK E++ KD+ E K Sbjct: 1 MSGGPRVRSMNVADSEVRPVLGPAGNKTMRSLSGRKPATKPLRKAEKATKDD--EEIKAL 58 Query: 882 PISVTKLSNP-LHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706 P S S+P HSVSVP VLRR E +L SRASTGR+ R+S Sbjct: 59 PSSNGAASSPPSHSVSVPLVLRRQEQLLHSNLSLNASCSSDASTDSFHSRASTGRITRSS 118 Query: 705 STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526 ST+ R+ SK K++V +GV E PD ++ KRRCAWVT NTD SY+ FHDEEWGVP HD Sbjct: 119 STAR-RRSYASKPKVIVSDGVSESPPDGLKAKRRCAWVTPNTDLSYIAFHDEEWGVPVHD 177 Query: 525 DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346 D KLFELLVL GALAE+TWP ILS+RHIFREVFADFDPI+VAK+NEKK++A Sbjct: 178 DKKLFELLVLSGALAELTWPTILSKRHIFREVFADFDPIAVAKLNEKKLMAPGSIASSLI 237 Query: 345 SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166 SELKLR II+NARQ+SKVIDEFGSFD+YIW FV HKPIV RFRYPR VPV+T KAD+ISK Sbjct: 238 SELKLRGIIENARQMSKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRHVPVKTPKADVISK 297 Query: 165 DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQEE 25 DLVRRGFRSVGPTV+YSFMQVAGITNDHLISCFRF DC AAEVK+EE Sbjct: 298 DLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQDCVTAAEVKEEE 345 >ref|XP_010656606.1| PREDICTED: uncharacterized protein LOC100267363 isoform X1 [Vitis vinifera] gi|731407750|ref|XP_010656607.1| PREDICTED: uncharacterized protein LOC100267363 isoform X1 [Vitis vinifera] gi|731407752|ref|XP_010656608.1| PREDICTED: uncharacterized protein LOC100267363 isoform X1 [Vitis vinifera] gi|731407754|ref|XP_010656609.1| PREDICTED: 
uncharacterized protein LOC100267363 isoform X1 [Vitis vinifera] Length = 376 Score = 407 bits (1046), Expect = e-111 Identities = 223/349 (63%), Positives = 255/349 (73%), Gaps = 7/349 (2%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSG PR RS N+ADS+ R AGNK R + +KP K LRK E++ KD+ E K Sbjct: 1 MSGGPRVRSMNVADSEVRPVLGPAGNKTMRSLSGRKPATKPLRKAEKATKDD--EEIKAL 58 Query: 882 PISVTKLSNP-LHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706 P S S+P HSVSVP VLRR E +L SRASTGR+ R+S Sbjct: 59 PSSNGAASSPPSHSVSVPLVLRRQEQLLHSNLSLNASCSSDASTDSFHSRASTGRITRSS 118 Query: 705 STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANT-DPSYVTFHDEEWGVPAH 529 ST+ R+ SK K++V +GV E PD ++ KRRCAWVT NT D SY+ FHDEEWGVP H Sbjct: 119 STAR-RRSYASKPKVIVSDGVSESPPDGLKAKRRCAWVTPNTADLSYIAFHDEEWGVPVH 177 Query: 528 DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349 DD KLFELLVL GALAE+TWP ILS+RHIFREVFADFDPI+VAK+NEKK++A Sbjct: 178 DDKKLFELLVLSGALAELTWPTILSKRHIFREVFADFDPIAVAKLNEKKLMAPGSIASSL 237 Query: 348 XSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLIS 169 SELKLR II+NARQ+SKVIDEFGSFD+YIW FV HKPIV RFRYPR VPV+T KAD+IS Sbjct: 238 ISELKLRGIIENARQMSKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRHVPVKTPKADVIS 297 Query: 168 KDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQEE 25 KDLVRRGFRSVGPTV+YSFMQVAGITNDHLISCFRF DC AAEVK+EE Sbjct: 298 KDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQDCVTAAEVKEEE 346 >emb|CDP10232.1| unnamed protein product [Coffea canephora] Length = 380 Score = 405 bits (1041), Expect = e-110 Identities = 217/358 (60%), Positives = 249/358 (69%), Gaps = 10/358 (2%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRHA----GNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKN- 886 MSGAPR R ++ DS+ R GNKA R +KPV K E+S + V DKN Sbjct: 1 MSGAPRMRPMSVGDSEVRTVLVPGGNKAQRSLRVKKPVTKAWGNAEKSTDEVEVVEDKNG 60 Query: 885 --KPISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYR 712 P SVT LS PL+S PS+LRR + +L SRASTGR+YR Sbjct: 61 PSSPTSVTDLSPPLNSSRFPSILRRQDSLLHSSLSLSASCSSDASTDSFHSRASTGRIYR 120 Query: 711 
TSSTSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPA 532 T +N +K L SKAKIV PNGV D + KR CAWVT TDP+Y TFHDEEWGVP Sbjct: 121 TRIIANRKKHLASKAKIVGPNGVSGSTSDGLPAKRTCAWVTPTTDPAYATFHDEEWGVPV 180 Query: 531 HDDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXX 352 HDD +LFELLVLCGAL+E+TWP+ILSRR IFREVFADFDP VAK+NEKKIIA Sbjct: 181 HDDKRLFELLVLCGALSELTWPSILSRRQIFREVFADFDPTVVAKLNEKKIIAPGNTASS 240 Query: 351 XXSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLI 172 SEL+LRAII+NARQ+SKVIDEFGSFDKYIW FV HKP+V RFRYPRQ+PV+T KAD+I Sbjct: 241 LLSELRLRAIIENARQISKVIDEFGSFDKYIWSFVNHKPLVSRFRYPRQIPVKTPKADVI 300 Query: 171 SKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQE---EDDIKEK 7 SKDL+RRGFR VGPTVVYSFMQVAG+TNDHL+SCFRF DC E K E ED ++K Sbjct: 301 SKDLMRRGFRCVGPTVVYSFMQVAGLTNDHLVSCFRFQDCMTPEGKAEASVEDIAQQK 358 >ref|XP_007011936.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|590572766|ref|XP_007011937.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|590572769|ref|XP_007011938.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|590572773|ref|XP_007011939.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782299|gb|EOY29555.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782300|gb|EOY29556.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782301|gb|EOY29557.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782302|gb|EOY29558.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 379 Score = 404 bits (1039), Expect = e-110 Identities = 219/353 (62%), Positives = 253/353 (71%), Gaps = 6/353 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSGAPR RS N+ADS+ R AGNKA L +A+KP K LRKVE+S + V +K Sbjct: 1 MSGAPRMRSMNVADSEARPVLGPAGNKAGSL-SARKPASKPLRKVEKSPVEVTVAEEKKA 59 Query: 882 PIS--VTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRT 709 S V LS HSVSVPSVLRRHE +L SRASTGR+ 
R+ Sbjct: 60 LPSSTVNSLSPKTHSVSVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLIRS 119 Query: 708 SSTSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAH 529 +S N RK SK + VV +G + PD K+RCAWVT NTDPSYV FHDEEWGVP H Sbjct: 120 NSVGNRRKPYASKPRSVVSDGGLDSPPDGSHQKKRCAWVTPNTDPSYVAFHDEEWGVPVH 179 Query: 528 DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349 DD KLFELLVL GAL+E+TWPAILS+RHI REVF DFD ++V+K+NEKK++ Sbjct: 180 DDRKLFELLVLSGALSELTWPAILSKRHIVREVFVDFDAVAVSKLNEKKLVTPGSIASSL 239 Query: 348 XSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLIS 169 SELKLRAII+NARQ+SKVIDEFGSFD+YIW FV HKPIV RFRYPRQVPV+T KAD+IS Sbjct: 240 LSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRQVPVKTPKADVIS 299 Query: 168 KDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKE 10 KDLVRRGFRSVGPTV+YSFMQVAGITNDHL SCFRF +C A +EE+ IK+ Sbjct: 300 KDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEGKEENGIKD 352 >ref|XP_010103669.1| Putative Glutamine amidotransferase [Morus notabilis] gi|587908671|gb|EXB96612.1| Putative Glutamine amidotransferase [Morus notabilis] Length = 383 Score = 403 bits (1036), Expect = e-109 Identities = 221/357 (61%), Positives = 259/357 (72%), Gaps = 8/357 (2%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSGAPR RS N+ADS+ R AGNKA + +K KT RKV++S + + +K K Sbjct: 1 MSGAPRVRSMNVADSESRPVLGLAGNKAGTWSSTRKSTSKTPRKVDKSPDEVTLSEEKKK 60 Query: 882 P--ISVTKLSNP-LHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMY- 715 +S T ++P LHS SVPSVLRRHE +L SRASTGR+ Sbjct: 61 TRQVSSTGATSPQLHSSSVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLLT 120 Query: 714 RTSSTSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVP 535 R+ ST + RKQL S+ + VV +G E PDD Q K+RCAWVT NT+P YV FHDEEWGVP Sbjct: 121 RSYSTGSRRKQLVSRTRSVVSDGGLESPPDDSQQKKRCAWVTPNTEPCYVAFHDEEWGVP 180 Query: 534 AHDDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXX 355 HDD KLFELLVL GALAE+TWPAILS+RHIFREVFADFDP +V+K+NEKKI+A Sbjct: 181 VHDDRKLFELLVLSGALAELTWPAILSKRHIFREVFADFDPAAVSKLNEKKIMAPGSTAS 240 Query: 354 
XXXSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADL 175 SELKLRAII+N RQ+SKVIDEFGSFD YIW FV +KPIV +FRYPRQVPV+T KAD+ Sbjct: 241 SLLSELKLRAIIENGRQISKVIDEFGSFDNYIWSFVNNKPIVSKFRYPRQVPVKTPKADV 300 Query: 174 ISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKEKA 4 ISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRF +C A ++E+ IK +A Sbjct: 301 ISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFQECLNAAEGKDENGIKNEA 357 >ref|XP_008242987.1| PREDICTED: uncharacterized protein LOC103341267 isoform X1 [Prunus mume] Length = 378 Score = 399 bits (1025), Expect = e-108 Identities = 215/354 (60%), Positives = 256/354 (72%), Gaps = 5/354 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEE-SIKDEGVELDKN 886 MSGAPR RS N+ADS+ R AGNKA +A+KPV K LRK E+ + K E K Sbjct: 1 MSGAPRVRSINVADSESRPVLGPAGNKAGTF-SARKPVSKPLRKAEKLAEKVASAEEKKT 59 Query: 885 KPISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706 + S+ S LHS SVPSVLRRHE +L SRASTGR+ R++ Sbjct: 60 RQSSMLTTSPQLHSPSVPSVLRRHEQLLHSNFSLNASCSSDASTDSFHSRASTGRLTRSN 119 Query: 705 STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526 S + RKQ SK + VV +G + PD Q K+RCAWVT NTDP Y FHDEEWG+P HD Sbjct: 120 SAGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNTDPCYAAFHDEEWGLPVHD 179 Query: 525 DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346 D KLFELLVL GALAE++WPAILS++HIFREVFADFDP++V+K+NEKK+IA Sbjct: 180 DKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPVAVSKLNEKKLIAPGSTASSLL 239 Query: 345 SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166 SELKLRAII+NARQ++KVI+EFGSFDKYIW FV +KPIV RFRYPRQVP +T KAD+ISK Sbjct: 240 SELKLRAIIENARQMTKVIEEFGSFDKYIWSFVNNKPIVSRFRYPRQVPAKTPKADVISK 299 Query: 165 DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKEKA 4 DLVRRGFRSVGPTV+YSFMQVAGITNDHL+SCFRF +C A +E+ IK++A Sbjct: 300 DLVRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECLNAAEGKEDYGIKDEA 353 >ref|XP_011083975.1| PREDICTED: uncharacterized protein LOC105166349 isoform X2 [Sesamum indicum] Length = 372 Score = 399 bits (1024), Expect = e-108 Identities = 
221/359 (61%), Positives = 257/359 (71%), Gaps = 10/359 (2%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSG + RSTN+ADS+ R GNKA RL +++K V+K L+K ++D+ L Sbjct: 1 MSGTAKIRSTNMADSEVRPILGPGGNKAQRLIDSRKHVVKPLKKEAVPVEDKNGSL---- 56 Query: 882 PISVTKLSNPL-HSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706 P S S+PL H VSVPS L RHE +L SRASTGR+ RT Sbjct: 57 PASTRAESSPLLHYVSVPSTLHRHESLLCSNLSLSASCSSDASTDSFHSRASTGRICRTI 116 Query: 705 STSNWRKQLGSKAKI-VVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAH 529 S S+ RK+L KA+ V NGV E L + VQ KRRCAWVTANTDP YV FHDEEWGVP H Sbjct: 117 SKSS-RKELALKARNGAVSNGVTESLTEGVQAKRRCAWVTANTDPIYVAFHDEEWGVPTH 175 Query: 528 DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349 DD KLFE LVL GALAE+TWPAILS+RHIFREVF DFDP +VAK++EKKIIA Sbjct: 176 DDRKLFEFLVLSGALAELTWPAILSKRHIFREVFVDFDPTAVAKLSEKKIIAPGSPASSL 235 Query: 348 XSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLIS 169 SELKLR+II+NARQVS+VIDEFGSFDKYIW FV +KPIVG FRYPRQVPV+T KAD+IS Sbjct: 236 LSELKLRSIIENARQVSRVIDEFGSFDKYIWSFVNYKPIVGSFRYPRQVPVKTPKADVIS 295 Query: 168 KDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQ----EEDDIKEKA 4 KDLVRRGFRSVGPT++YSFMQ AGITNDHL+SCFRFH+C AA+ K+ D +EKA Sbjct: 296 KDLVRRGFRSVGPTIIYSFMQGAGITNDHLMSCFRFHECGAAKAKEGSPLTNKDEEEKA 354 >ref|XP_007204814.1| hypothetical protein PRUPE_ppa026720mg [Prunus persica] gi|462400345|gb|EMJ06013.1| hypothetical protein PRUPE_ppa026720mg [Prunus persica] Length = 378 Score = 399 bits (1024), Expect = e-108 Identities = 214/354 (60%), Positives = 256/354 (72%), Gaps = 5/354 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEE-SIKDEGVELDKN 886 MSGAPR RS N+ADS+ R AGNKA +A+KPV K LRK E+ + K E K Sbjct: 1 MSGAPRVRSINVADSESRPVLGPAGNKAGTF-SARKPVSKPLRKAEKLAEKVASAEEKKT 59 Query: 885 KPISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706 + S+ S LHS SVPSVLRRHE +L SRASTGR+ R++ Sbjct: 60 RQSSMLTTSPQLHSPSVPSVLRRHEQLLHSNFSLNASCSSDASTDSFHSRASTGRLTRSN 119 Query: 705 
STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526 S + RKQ SK + VV +G + PD Q K+RCAWVT NTDP Y FHDEEWG+P HD Sbjct: 120 SAGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNTDPCYAAFHDEEWGLPVHD 179 Query: 525 DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346 D KLFELLVL GALAE++WPAILS++HIFREVFADFDP++++K+NEKK+IA Sbjct: 180 DKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPVAISKLNEKKLIAPGSNASSLL 239 Query: 345 SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166 SELKLRAII+NARQ++KVI+EFGSFDKYIW FV +KPIV RFRYPRQVP +T KAD+ISK Sbjct: 240 SELKLRAIIENARQMTKVIEEFGSFDKYIWSFVNNKPIVSRFRYPRQVPAKTPKADVISK 299 Query: 165 DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKEKA 4 DL+RRGFRSVGPTV+YSFMQVAGITNDHL+SCFRF +C A +EE IK++A Sbjct: 300 DLMRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECLNAAEGKEEYGIKDEA 353 >ref|XP_009795108.1| PREDICTED: uncharacterized protein LOC104241850 [Nicotiana sylvestris] gi|698498404|ref|XP_009795109.1| PREDICTED: uncharacterized protein LOC104241850 [Nicotiana sylvestris] Length = 368 Score = 397 bits (1020), Expect = e-108 Identities = 216/352 (61%), Positives = 252/352 (71%), Gaps = 5/352 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSGA R RS N ADS+ R AGNKA R ++K V K RK +S K+E D+ Sbjct: 1 MSGASRVRSMNAADSEARPVLGLAGNKALRSPGSRKSVSKPTRKAVKS-KEEDKNGDQPS 59 Query: 882 PISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTSS 703 P LHS VPS+LRR E L S ASTGR+YR SS Sbjct: 60 P--------SLHSFDVPSILRRQES-LYSNLSLSASCSSDASTDSFHSSASTGRIYRMSS 110 Query: 702 TSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHDD 523 TS+ RKQL SK+K +V + + + D +Q K+RC+WVT NTDPSY FHDEEWGVP HDD Sbjct: 111 TSSRRKQLASKSKRIVSDDISDSSIDGLQSKKRCSWVTPNTDPSYADFHDEEWGVPVHDD 170 Query: 522 NKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXXS 343 KLFELLVLCGALAE++WP+IL +RHIFREVFADFDPI VAK+NEKKI+A S Sbjct: 171 KKLFELLVLCGALAELSWPSILCKRHIFREVFADFDPIVVAKLNEKKILAPGSTACSLLS 230 Query: 342 
ELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISKD 163 ELKLRAII+NARQ+SKVIDEFGSFDKYIW FV +KPIV FRYPRQVPV+T+KADLISKD Sbjct: 231 ELKLRAIIENARQMSKVIDEFGSFDKYIWSFVNNKPIVSGFRYPRQVPVKTAKADLISKD 290 Query: 162 LVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQEEDDIKE 10 L+RRGFR VGPTVVYSFMQV+GITNDHLISCFRFHDC ++AE K+++ + E Sbjct: 291 LIRRGFRGVGPTVVYSFMQVSGITNDHLISCFRFHDCVESAEAKEKDSNKDE 342 >ref|XP_002324538.1| methyladenine glycosylase family protein [Populus trichocarpa] gi|222865972|gb|EEF03103.1| methyladenine glycosylase family protein [Populus trichocarpa] Length = 380 Score = 397 bits (1020), Expect = e-108 Identities = 223/361 (61%), Positives = 254/361 (70%), Gaps = 15/361 (4%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRHA----GNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSGAPR RS N+ADS+ R GN +A+KPV K RKVE+S E V+L + K Sbjct: 1 MSGAPRVRSMNVADSEARSVLGPTGNNKAGPLSARKPVSKQSRKVEKS--PEEVKLGEEK 58 Query: 882 PI----SVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMY 715 +V LS HS+++ SVLRRHEL+L SRASTGR+ Sbjct: 59 KTLTVPAVGTLSPKSHSLNISSVLRRHELLLHSNLSLNASCSSDASTDSFHSRASTGRLT 118 Query: 714 RTSSTSNWRKQLGSKAKIVVPNGVPE--PLPDDVQVKRRCAWVTANTDPSYVTFHDEEWG 541 R++S RKQ + + V G E P PDD Q K+ CAWVT NTDP Y TFHDEEWG Sbjct: 119 RSNSAGTRRKQYVLRPRSFVSEGGLESPPSPDDSQSKKSCAWVTPNTDPCYATFHDEEWG 178 Query: 540 VPAHDDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXX 361 VP HDD KLFELLVL GALAE+TWPAILS+RHIFREVFADFDPI+V+K NEKKI+A Sbjct: 179 VPIHDDRKLFELLVLSGALAELTWPAILSKRHIFREVFADFDPIAVSKFNEKKILAPGST 238 Query: 360 XXXXXSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKA 181 SELKLRAI++NARQ+SKVIDEFGSFDKYIW FV +KPIV RFRYPRQVPV+T KA Sbjct: 239 ATSLLSELKLRAIVENARQISKVIDEFGSFDKYIWSFVNYKPIVSRFRYPRQVPVKTPKA 298 Query: 180 DLISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQE----EDDI 16 D ISKDLVRRGFRSVGPTV+YSFMQVAGITNDHLISCFRF +C DAAE K E +DI Sbjct: 299 DAISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQECLDAAEGKVENGIKSEDI 358 Query: 15 K 13 K Sbjct: 359 K 359 >ref|XP_009617886.1| PREDICTED: uncharacterized 
protein LOC104110152 [Nicotiana tomentosiformis] Length = 372 Score = 396 bits (1017), Expect = e-107 Identities = 216/352 (61%), Positives = 253/352 (71%), Gaps = 5/352 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSGA R RS N ADS+ R AGNKA R ++K V K RK +S ++ +E DKN Sbjct: 1 MSGASRVRSMNAADSEARPVLGLAGNKALRSPGSRKSVSKPTRKAVKSKEEVEME-DKNG 59 Query: 882 PISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTSS 703 + S LHS VPS+LRR E L S ASTGR+YR SS Sbjct: 60 H----QPSPSLHSFDVPSILRRQES-LYSNLSLSASCSSDASTDSFHSSASTGRIYRMSS 114 Query: 702 TSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHDD 523 TS+ RKQL SK+K +V + + + D +Q K++C WVT NTDPSY FHDEEWGVP HDD Sbjct: 115 TSSRRKQLASKSKRIVSDDISDSSIDGLQSKKKCGWVTPNTDPSYADFHDEEWGVPVHDD 174 Query: 522 NKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXXS 343 KLFELLVLCGALAE++WP+IL +RHIFREVF DFDPI VAK+NEKKI+A S Sbjct: 175 KKLFELLVLCGALAELSWPSILCKRHIFREVFTDFDPIVVAKLNEKKILAPGSTACSLLS 234 Query: 342 ELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISKD 163 ELKLRAII+NARQ+SKVIDEFGSFDKYIW FV +KPIV FRYPRQVPV+T+KADLISKD Sbjct: 235 ELKLRAIIENARQMSKVIDEFGSFDKYIWSFVNNKPIVSGFRYPRQVPVKTAKADLISKD 294 Query: 162 LVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQEEDDIKE 10 L+RRGFR VGPTVVYSFMQVAGITNDHLISCFRFHDC ++AE K+++ + E Sbjct: 295 LIRRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHDCVESAEAKEKDSNKDE 346 >ref|XP_011018029.1| PREDICTED: uncharacterized protein LOC105121177 [Populus euphratica] Length = 380 Score = 392 bits (1007), Expect = e-106 Identities = 218/359 (60%), Positives = 252/359 (70%), Gaps = 13/359 (3%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRHA----GNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSGAPR +S N+ DS+ R GN +A+KP K LRKVE+S ++ + +K Sbjct: 1 MSGAPRVKSMNVTDSEARSVLGPTGNNKAGPLSARKPASKQLRKVEKSAEEVRLGEEKKT 60 Query: 882 PI--SVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRT 709 I +V LS HS+++ SVL RHEL+L SRASTGR+ R+ Sbjct: 61 LIVPAVGTLSPKSHSLNISSVLLRHELLLHSNLSLNASCSSDASTDSFHSRASTGRLTRS 120 
Query: 708 SSTSNWRKQLGSKAKIVVPNGVPE--PLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVP 535 +S +KQ S+ + V G E P P+D Q K+ CAWVT NTDP Y TFHDEEWGVP Sbjct: 121 NSAGTRKKQYVSRPRSFVSEGGLESPPSPNDSQSKKSCAWVTPNTDPCYATFHDEEWGVP 180 Query: 534 AHDDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXX 355 HDD KLFELLVL GALAE+TWPAILS+RHIFREVFADFDP++V+K NEKKIIA Sbjct: 181 IHDDRKLFELLVLSGALAELTWPAILSKRHIFREVFADFDPVAVSKFNEKKIIAPGSTAT 240 Query: 354 XXXSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADL 175 SELKLRAII+NARQ+SKVIDEFGSFDKYIW FV KPIV RFRYPRQVPV+T KAD Sbjct: 241 SLLSELKLRAIIENARQISKVIDEFGSFDKYIWSFVNFKPIVSRFRYPRQVPVKTPKADA 300 Query: 174 ISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQE----EDDIK 13 ISKDLVRRGFRSVGPTV+YSFMQVAGITNDHLISCFRF +C DAAE K E +DIK Sbjct: 301 ISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQECLDAAEGKGENGIKSEDIK 359 >ref|XP_012462430.1| PREDICTED: uncharacterized protein LOC105782309 [Gossypium raimondii] gi|763815982|gb|KJB82834.1| hypothetical protein B456_013G216100 [Gossypium raimondii] gi|763815983|gb|KJB82835.1| hypothetical protein B456_013G216100 [Gossypium raimondii] Length = 381 Score = 391 bits (1005), Expect = e-106 Identities = 218/356 (61%), Positives = 254/356 (71%), Gaps = 7/356 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEES-IKDEGVELDKN 886 MSGAPR RS N DS+ R AGNKA L +A+KP K LRKVE+S ++ E K+ Sbjct: 1 MSGAPRLRSMNAPDSEARPVLGPAGNKAGSL-SARKPASKPLRKVEKSPVEVTATEEKKS 59 Query: 885 KPIS-VTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRT 709 P S V+ LS HSVSVPSVLRRHE +L SRASTGR+ R+ Sbjct: 60 LPSSIVSSLSPKKHSVSVPSVLRRHEKLLHSNLSLNASCSSDASTDSFHSRASTGRLIRS 119 Query: 708 SSTSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAH 529 +S + RK SK + V + + D K+RCAWVT NTDPSY TFHDEEWGVP H Sbjct: 120 NSVGSRRKPYVSKPRSFVSDSGSDSPSDGSHQKKRCAWVTPNTDPSYATFHDEEWGVPVH 179 Query: 528 DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349 DD KLFELLVL GAL+E+TWPAILS+R +FREVF DFDP +V+K+NEKK+IA Sbjct: 180 
DDKKLFELLVLSGALSELTWPAILSKRQMFREVFMDFDPAAVSKLNEKKLIAPGSVSSSL 239 Query: 348 XSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLIS 169 SELKLRAII+NARQ+SKVIDEFGSFD+YIW FV HKPI+ +FRYPRQVPV+T KAD+IS Sbjct: 240 LSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIISKFRYPRQVPVKTPKADVIS 299 Query: 168 KDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDC-DAAEVKQEEDDIKEKA 4 KDLVRRGFRSVGPTV+YSFMQVAGITNDHL CFRF +C AAE K+ E IKE+A Sbjct: 300 KDLVRRGFRSVGPTVIYSFMQVAGITNDHLTGCFRFQECITAAEGKEVE--IKERA 353 >ref|XP_004287130.1| PREDICTED: uncharacterized protein LOC101313540 [Fragaria vesca subsp. vesca] Length = 429 Score = 391 bits (1004), Expect = e-106 Identities = 209/354 (59%), Positives = 251/354 (70%), Gaps = 5/354 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKD-EGVELDKN 886 MSGAPR +S N+A+S+ R AGNK +A+KP K LRK E+ +++ E K Sbjct: 1 MSGAPRVKSINVANSESRSVLGPAGNKGGAF-SARKPATKPLRKTEKMVEEFTSAEDKKT 59 Query: 885 KPISVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706 + S S LHS+SVPSVLRRHE +L SRASTGR+ R++ Sbjct: 60 QQSSKLSTSPQLHSLSVPSVLRRHEQLLQSNFSLNASCSSDASTDSFHSRASTGRLIRSN 119 Query: 705 STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526 S + RKQ SK + VV +G + P Q K+RCAWVT NTDP YV FHDEEWG+P HD Sbjct: 120 SVGSRRKQYVSKPRSVVSDGGLDSPPGGSQSKKRCAWVTPNTDPCYVAFHDEEWGLPVHD 179 Query: 525 DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346 D KLFELLVL GALAE++WP ILS+RHIFREVFADFDP+ V++ NEKKI+A Sbjct: 180 DKKLFELLVLSGALAELSWPLILSKRHIFREVFADFDPVDVSEFNEKKIMAPGSVASSLL 239 Query: 345 SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166 SE KLRAI++NARQ++KVIDEFGSFDKYIW FV +KPIV RFRYPRQVP +T KAD+ISK Sbjct: 240 SESKLRAILENARQMTKVIDEFGSFDKYIWSFVNNKPIVSRFRYPRQVPAKTPKADVISK 299 Query: 165 DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKEKA 4 DLVRRGFRSVGPTV+YSFMQVAGITNDHL+SCFRF DC A +EE+ KE++ Sbjct: 300 DLVRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQDCLNAAEGKEENRTKEES 353 >ref|XP_010275821.1| PREDICTED: uncharacterized protein LOC104610746 [Nelumbo nucifera] 
Length = 387 Score = 390 bits (1002), Expect = e-105 Identities = 212/349 (60%), Positives = 243/349 (69%), Gaps = 5/349 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSGAPR RS N+ADS+ R AGNK L +KP K LRKVE++ E V+ +K Sbjct: 1 MSGAPRVRSINVADSEARPVLGPAGNKTRSLVT-RKPASKPLRKVEKT--PEAVDEEKKA 57 Query: 882 PISVTKLSNP-LHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706 P S S P L VSVPS+LRRHE L SRASTGR+ RT Sbjct: 58 PSSPVAASPPKLQPVSVPSILRRHEF-LHSNLSLNASCSSDASSDSVYSRASTGRLIRTR 116 Query: 705 STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526 ST + RK S+ + VVP+ + PD ++ K+RCAWVT NTDP Y FHDEEWGVP HD Sbjct: 117 STPSRRKYSISRPEKVVPDSASDSSPDSIETKKRCAWVTPNTDPCYAAFHDEEWGVPVHD 176 Query: 525 DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346 D KLFELLVL GALAE+TWP ILS+RHIFREVF+DFDP++V+K+NEKKI A Sbjct: 177 DKKLFELLVLSGALAELTWPTILSKRHIFREVFSDFDPVAVSKLNEKKITAPGSTASSLL 236 Query: 345 SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166 SELKLRAII+NARQ+ KVIDEFGSFD YIW FV HKPI+ +FRYPRQVPV+ KAD+ISK Sbjct: 237 SELKLRAIIENARQICKVIDEFGSFDNYIWSFVNHKPIISKFRYPRQVPVKIPKADVISK 296 Query: 165 DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDD 19 DLVRRGFRSVGPTVVYSFMQVAGITNDHLI+CFRF C E DD Sbjct: 297 DLVRRGFRSVGPTVVYSFMQVAGITNDHLINCFRFQVCMDTPTVSEGDD 345 >ref|XP_012449856.1| PREDICTED: uncharacterized protein LOC105772910 [Gossypium raimondii] gi|763798832|gb|KJB65787.1| hypothetical protein B456_010G113100 [Gossypium raimondii] gi|763798836|gb|KJB65791.1| hypothetical protein B456_010G113100 [Gossypium raimondii] Length = 374 Score = 390 bits (1001), Expect = e-105 Identities = 212/352 (60%), Positives = 252/352 (71%), Gaps = 5/352 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 M G PR RS N+ADS+ R AGNK L +A+KP K RK+E+ + + +KN Sbjct: 1 MFGPPRLRSMNMADSEARPVLGPAGNKTGSL-SARKPGSKPSRKIEKCSAEATLAEEKNG 59 Query: 882 PISVTKLSNPLHSVSV-PSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706 + 
+K+++ HSVSV PSVLRRHE +L SRASTGR+ ++ Sbjct: 60 -LQSSKVNS--HSVSVVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLIWSN 116 Query: 705 STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526 S RK S + VV +G + LP D ++RCAWVT NTDPSYV FHDEEWGVP HD Sbjct: 117 SVGTRRKPFPSTPRSVVSDGGLDSLPGDSHRRKRCAWVTPNTDPSYVAFHDEEWGVPVHD 176 Query: 525 DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346 D KLFELLVL GAL+E+TWPAILS+RHIFREVFADFDP++V+K+NEKK+IA Sbjct: 177 DKKLFELLVLAGALSELTWPAILSKRHIFREVFADFDPLAVSKLNEKKLIAPGSTASSLL 236 Query: 345 SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166 SELKLRAI++NA Q+SKVIDEFGSFDKYIW FV HKPIV RFRYPRQVPV+T KAD+ISK Sbjct: 237 SELKLRAIVENAHQISKVIDEFGSFDKYIWSFVNHKPIVSRFRYPRQVPVKTPKADVISK 296 Query: 165 DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKE 10 DLVRRGFRSVGPTV+YSFMQV+GITNDHL SCFRF DC A +EE+ IKE Sbjct: 297 DLVRRGFRSVGPTVIYSFMQVSGITNDHLTSCFRFQDCITAAEGKEENGIKE 348 >ref|XP_011083973.1| PREDICTED: uncharacterized protein LOC105166349 isoform X1 [Sesamum indicum] gi|747073987|ref|XP_011083974.1| PREDICTED: uncharacterized protein LOC105166349 isoform X1 [Sesamum indicum] Length = 384 Score = 390 bits (1001), Expect = e-105 Identities = 221/371 (59%), Positives = 257/371 (69%), Gaps = 22/371 (5%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSG + RSTN+ADS+ R GNKA RL +++K V+K L+K ++D+ L Sbjct: 1 MSGTAKIRSTNMADSEVRPILGPGGNKAQRLIDSRKHVVKPLKKEAVPVEDKNGSL---- 56 Query: 882 PISVTKLSNPL-HSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706 P S S+PL H VSVPS L RHE +L SRASTGR+ RT Sbjct: 57 PASTRAESSPLLHYVSVPSTLHRHESLLCSNLSLSASCSSDASTDSFHSRASTGRICRTI 116 Query: 705 STSNWRKQLGSKAKI-VVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAH 529 S S+ RK+L KA+ V NGV E L + VQ KRRCAWVTANTDP YV FHDEEWGVP H Sbjct: 117 SKSS-RKELALKARNGAVSNGVTESLTEGVQAKRRCAWVTANTDPIYVAFHDEEWGVPTH 175 Query: 528 DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349 DD KLFE LVL GALAE+TWPAILS+RHIFREVF DFDP +VAK++EKKIIA Sbjct: 176 
DDRKLFEFLVLSGALAELTWPAILSKRHIFREVFVDFDPTAVAKLSEKKIIAPGSPASSL 235 Query: 348 XSELKLRAIIDNARQVSK------------VIDEFGSFDKYIWGFVKHKPIVGRFRYPRQ 205 SELKLR+II+NARQVS+ VIDEFGSFDKYIW FV +KPIVG FRYPRQ Sbjct: 236 LSELKLRSIIENARQVSRVRLTFSRFCKHQVIDEFGSFDKYIWSFVNYKPIVGSFRYPRQ 295 Query: 204 VPVRTSKADLISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQ-- 31 VPV+T KAD+ISKDLVRRGFRSVGPT++YSFMQ AGITNDHL+SCFRFH+C AA+ K+ Sbjct: 296 VPVKTPKADVISKDLVRRGFRSVGPTIIYSFMQGAGITNDHLMSCFRFHECGAAKAKEGS 355 Query: 30 --EEDDIKEKA 4 D +EKA Sbjct: 356 PLTNKDEEEKA 366 >ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] gi|223531126|gb|EEF32974.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] Length = 380 Score = 390 bits (1001), Expect = e-105 Identities = 214/356 (60%), Positives = 247/356 (69%), Gaps = 10/356 (2%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRHA----GNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 MSGAPR RS N+ADS+ R GN +A+KP K LRKVE S E V+L + K Sbjct: 1 MSGAPRVRSMNVADSETRPVLGPTGNNKAGSLSAKKPASKQLRKVETS--PEAVKLGQEK 58 Query: 882 PI----SVTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMY 715 + + + LS HSVSVPSVLRRHE +L SRASTGR+ Sbjct: 59 KLVTVPTASALSPKSHSVSVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLT 118 Query: 714 RTSSTSNWRKQLGSKAKIVVPNGVPEPLP--DDVQVKRRCAWVTANTDPSYVTFHDEEWG 541 R++S RKQ K + VV +G E P D Q K+ CAWVT N DP Y FHDEEWG Sbjct: 119 RSNSLGTRRKQYALKPRSVVSDGGLESPPPSDGSQAKKSCAWVTPNADPCYTAFHDEEWG 178 Query: 540 VPAHDDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXX 361 +P HDD KLFELLVL GALAE+TWPAILS+RHIFREVFA+FDP+ V+K NEKKIIA Sbjct: 179 IPVHDDKKLFELLVLSGALAELTWPAILSKRHIFREVFANFDPVVVSKFNEKKIIAPGST 238 Query: 360 XXXXXSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKA 181 SE+KLRAII+NARQ+SKV DE GSFDKYIW FV +KPIV RFRYPRQVPV+T KA Sbjct: 239 ASSLLSEIKLRAIIENARQISKVTDELGSFDKYIWSFVNYKPIVSRFRYPRQVPVKTPKA 298 Query: 180 DLISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIK 13 D+ISKDLVRRGFRSVGPTVVYSFMQVAG+TNDHLISCFRF +C A +EE+ +K Sbjct: 299 
DVISKDLVRRGFRSVGPTVVYSFMQVAGLTNDHLISCFRFQECINAAEGKEENGVK 354 >gb|KHG05578.1| guaA [Gossypium arboreum] Length = 381 Score = 389 bits (1000), Expect = e-105 Identities = 214/355 (60%), Positives = 251/355 (70%), Gaps = 6/355 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKD-EGVELDKN 886 M GAPR RS N DS+ R AGNKA L +A+KP K LRKVE+S + E K+ Sbjct: 1 MLGAPRLRSMNAPDSEARPVLGPAGNKAGSL-SARKPASKPLRKVEKSPAEVTATEEKKS 59 Query: 885 KPIS-VTKLSNPLHSVSVPSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRT 709 P S V+ LS HSVSVPSVLRRHE +L SRASTGR+ R+ Sbjct: 60 LPSSIVSSLSPKKHSVSVPSVLRRHEKLLHSNLSLNASCSSDASTDSFHSRASTGRLIRS 119 Query: 708 SSTSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAH 529 +S + RK SK + V + + D K+RCAWVT NTDPSY TFHDEEWGVP H Sbjct: 120 NSVGSRRKPYASKPRSFVSDSGSDSPSDGSHQKKRCAWVTPNTDPSYATFHDEEWGVPVH 179 Query: 528 DDNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXX 349 DD KLFELLVL GAL+E+TWPAILS+R +FREVF DFDP +V+K+NEKK+IA Sbjct: 180 DDKKLFELLVLSGALSELTWPAILSKRQMFREVFMDFDPAAVSKLNEKKLIAPGSVSSSL 239 Query: 348 XSELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLIS 169 SELKLRAII+NARQ+SKVIDEFGSFD+YIW FV HKPI+ +FRYPRQVPV+T KAD+IS Sbjct: 240 LSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIISKFRYPRQVPVKTPKADVIS 299 Query: 168 KDLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKEKA 4 KDLVRRGFRSVGPTV+YSFMQVAGITNDHL CFRF +C A + +E +IKE+A Sbjct: 300 KDLVRRGFRSVGPTVIYSFMQVAGITNDHLTGCFRFQECTTA-AEGKEVEIKERA 353 >gb|KHG15995.1| putative GMP synthase [glutamine-hydrolyzing] [Gossypium arboreum] Length = 374 Score = 388 bits (997), Expect = e-105 Identities = 212/352 (60%), Positives = 252/352 (71%), Gaps = 5/352 (1%) Frame = -2 Query: 1050 MSGAPRERSTNLADSQPRH----AGNKAFRLKNAQKPVLKTLRKVEESIKDEGVELDKNK 883 M G PR RS N ADS+ R AGNK L +A+KP K LRK+E+ + + +KN Sbjct: 1 MFGPPRLRSMNTADSEARPVLGPAGNKTGSL-SARKPGSKPLRKIEKCSAEATLAEEKNG 59 Query: 882 PISVTKLSNPLHSVSV-PSVLRRHELVLXXXXXXXXXXXXXXXXXXXXSRASTGRMYRTS 706 + +K+++ HSVSV PSVLRRHE +L SRASTGR+ ++ Sbjct: 60 
-LPSSKVNS--HSVSVVPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLIWSN 116 Query: 705 STSNWRKQLGSKAKIVVPNGVPEPLPDDVQVKRRCAWVTANTDPSYVTFHDEEWGVPAHD 526 S RK S + VV +G + P D ++RCAWVT NTDPSYV FHDEEWGVP HD Sbjct: 117 SVGTRRKPFPSTPRSVVSDGGLDSPPGDSHRRKRCAWVTLNTDPSYVAFHDEEWGVPVHD 176 Query: 525 DNKLFELLVLCGALAEITWPAILSRRHIFREVFADFDPISVAKMNEKKIIAXXXXXXXXX 346 D KLFELLVL GAL+E+TWPAILS+RHIFREVFADFDP++V+K+NEKK+IA Sbjct: 177 DKKLFELLVLAGALSELTWPAILSKRHIFREVFADFDPLAVSKLNEKKLIAPGSTASSLL 236 Query: 345 SELKLRAIIDNARQVSKVIDEFGSFDKYIWGFVKHKPIVGRFRYPRQVPVRTSKADLISK 166 SELKLRAI++NA Q+SKVI+EFGSFDKYIWGFV HKPIV RFRYPRQVPV+T KAD+ISK Sbjct: 237 SELKLRAIVENAHQISKVINEFGSFDKYIWGFVNHKPIVSRFRYPRQVPVKTPKADVISK 296 Query: 165 DLVRRGFRSVGPTVVYSFMQVAGITNDHLISCFRFHDCDAAEVKQEEDDIKE 10 DLVRRGFRSVGPTV+YSFMQVAGITNDHL SCFRF DC A +EE+ IK+ Sbjct: 297 DLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQDCITAAEGKEENGIKD 348