BLASTX nr result
ID: Forsythia21_contig00002608
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia21_contig00002608
         (1696 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits)   Value

ref|XP_011073325.1| PREDICTED: uncharacterized protein LOC105158...   505   e-140
ref|XP_011082395.1| PREDICTED: uncharacterized protein LOC105165...   497   e-137
ref|XP_012852805.1| PREDICTED: uncharacterized protein LOC105972...   479   e-132
ref|XP_012856196.1| PREDICTED: uncharacterized protein LOC105975...   457   e-125
emb|CDO98228.1| unnamed protein product [Coffea canephora]            450   e-123
ref|XP_009617988.1| PREDICTED: uncharacterized protein LOC104110...   428   e-117
ref|XP_009802477.1| PREDICTED: uncharacterized protein LOC104248...   424   e-115
ref|XP_004235942.1| PREDICTED: uncharacterized protein LOC101247...   423   e-115
ref|XP_006341344.1| PREDICTED: uncharacterized protein LOC102594...   421   e-115
ref|XP_010265584.1| PREDICTED: uncharacterized protein LOC104603...   407   e-110
ref|XP_010275821.1| PREDICTED: uncharacterized protein LOC104610...   393   e-106
ref|XP_007011936.1| DNA glycosylase superfamily protein isoform ...   387   e-104
ref|XP_009358441.1| PREDICTED: uncharacterized protein LOC103949...   381   e-102
ref|XP_008242987.1| PREDICTED: uncharacterized protein LOC103341...   380   e-102
ref|XP_003527169.1| PREDICTED: uncharacterized protein LOC100801...   380   e-102
ref|XP_012442673.1| PREDICTED: uncharacterized protein LOC105767...   378   e-102
ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [R...   378   e-102
gb|KHG02440.1| putative GMP synthase [glutamine-hydrolyzing] [Go...   377   e-101
ref|XP_007204814.1| hypothetical protein PRUPE_ppa026720mg [Prun...
377 e-101 ref|XP_002276173.1| PREDICTED: uncharacterized protein LOC100267... 377 e-101 >ref|XP_011073325.1| PREDICTED: uncharacterized protein LOC105158309 [Sesamum indicum] Length = 397 Score = 505 bits (1301), Expect = e-140 Identities = 262/397 (65%), Positives = 304/397 (76%), Gaps = 17/397 (4%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196 MSGPPRVKSM+FT+ E RPVLGPAGNK+RS ELRKP+ KP SEK Q+ D DE GKKSP Sbjct: 1 MSGPPRVKSMNFTEPEARPVLGPAGNKSRSAELRKPVLKPKSEKTQRPPDIDESKGKKSP 60 Query: 1195 VTV------TDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GR 1037 + ++ + V + AASIL Q++ NLSLN + GR Sbjct: 61 AALESPELASEKIPSPVGFRRSGSSAASILRQRQANLSLNASCSSDASSDSSQSRASTGR 120 Query: 1036 ISRRRVTLTPTMRRKQQCSPKERSAQ------KSFDGESEDINL----AKKRCAWVTSNT 887 ISRR T TP ++RK QCS K + K+ GESE + + KKRCAWVTSNT Sbjct: 121 ISRRSATPTPPLKRKPQCSSKGGKIENKEGYGKNVGGESESLVVDGAAVKKRCAWVTSNT 180 Query: 886 DPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVS 707 DP YAA HDEEWGVPVHDDKKLFELLSFSTALAE+TWPIIL+KRHIFREVFL FDP+AVS Sbjct: 181 DPSYAAFHDEEWGVPVHDDKKLFELLSFSTALAEITWPIILSKRHIFREVFLGFDPVAVS 240 Query: 706 KLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNF 527 KLNEKK+ATPG+PA SLLSELKLRAI+ENARQICKII++LGSF+KYIW FVN KPI+GNF Sbjct: 241 KLNEKKIATPGNPACSLLSELKLRAIVENARQICKIINELGSFDKYIWGFVNYKPIVGNF 300 Query: 526 RYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAG 347 RYPRQVPI+TSKADTISKDLVRRGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ +C+ AG Sbjct: 301 RYPRQVPIRTSKADTISKDLVRRGFRGVGPTVVYSFMQVAGITNDHLISCFRHHDCVIAG 360 Query: 346 DLRDGDEGLKTMNEGKTAEEISELELGRAIDDLGFTT 236 DLRD +E + + +EGK E+I ELEL R IDDL ++ Sbjct: 361 DLRDKNEDVTSKHEGKPPEDIMELELVRDIDDLSLSS 397 >ref|XP_011082395.1| PREDICTED: uncharacterized protein LOC105165174 [Sesamum indicum] Length = 395 Score = 497 bits (1280), Expect = e-137 Identities = 265/396 (66%), Positives = 304/396 (76%), Gaps = 17/396 (4%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196 MSGPP 
V+SM+F + E RPVLGP GNKARSVELRKPI KP SEK ++S + D+ GKK P Sbjct: 1 MSGPPMVQSMNFAEPEDRPVLGPTGNKARSVELRKPILKPKSEKTRQSPEADK--GKKPP 58 Query: 1195 VTV------TDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GR 1037 T+ T+ + V + AASIL Q++PNLSLN + GR Sbjct: 59 ATLHSPEITTEKIPSPVGFRRNASSAASILRQRQPNLSLNASCSSDASTDSSHSRASTGR 118 Query: 1036 ISRRRVTLTPTMRRKQQCSPKERSAQK------SFDGESEDI----NLAKKRCAWVTSNT 887 I RR T TP +++KQQ SPK +K S GESE I +L KKRCAWVTSNT Sbjct: 119 IGRRTGTSTPPLKKKQQFSPKGERIEKMAGNGKSVGGESEGIECDGSLVKKRCAWVTSNT 178 Query: 886 DPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVS 707 DP YAA HDEEWG+PVHDDKKLFELLSFSTALAELTWP+IL+KR IFR+VFLDFDPIAVS Sbjct: 179 DPSYAAFHDEEWGLPVHDDKKLFELLSFSTALAELTWPVILSKRPIFRDVFLDFDPIAVS 238 Query: 706 KLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNF 527 KLN+KK+AT GSPASSLLSELKLRAIIENARQICKIID++GSF+KYIW FVN KPI+GNF Sbjct: 239 KLNDKKIATQGSPASSLLSELKLRAIIENARQICKIIDEVGSFDKYIWGFVNYKPIVGNF 298 Query: 526 RYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAG 347 RYPRQVPIKTSKADTISKDLVRRG RGVGPTVVYSFMQV+GITNDHL++CFRYQ+CIAAG Sbjct: 299 RYPRQVPIKTSKADTISKDLVRRGLRGVGPTVVYSFMQVAGITNDHLINCFRYQDCIAAG 358 Query: 346 DLRDGDEGLKTMNEGKTAEEISELELGRAIDDLGFT 239 DLRD +EG+ + NE E++ ELEL R IDDL + Sbjct: 359 DLRDKNEGITSNNEENPPEDLRELELVRDIDDLNLS 394 >ref|XP_012852805.1| PREDICTED: uncharacterized protein LOC105972398 [Erythranthe guttatus] gi|604305450|gb|EYU24594.1| hypothetical protein MIMGU_mgv1a007518mg [Erythranthe guttata] Length = 404 Score = 479 bits (1232), Expect = e-132 Identities = 258/403 (64%), Positives = 293/403 (72%), Gaps = 24/403 (5%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKK-- 1202 MSGPP VKSM+F + E RPVLGPAGNKARSVELRKPI K SEK QK D DE G Sbjct: 1 MSGPPLVKSMNFAEPEARPVLGPAGNKARSVELRKPILKQKSEKTQKPLDADEAKGNTAP 60 Query: 1201 ------SPVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT- 1043 SP T+ + V K AASIL Q++PNLS+N + Sbjct: 61 SPAAFLSPEMKTEKIPSPVGFKKNASSAASILRQRQPNLSMNASCSSDASTDSSHSRAST 120 
Query: 1042 GRISRRRVTLTPTMRRKQQCSPKERSAQ------KSFDGESEDI----NLAKKRCAWVTS 893 GR+ RR T TP +RRK QCSPK + K+ ES+ + +L KKRCAWVTS Sbjct: 121 GRLLRRSATFTPPLRRKHQCSPKGERIEMIEGNGKNVGSESDGVVLDGSLVKKRCAWVTS 180 Query: 892 NTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIA 713 NTDP YAA HDEEWG+PVHDDKKLFELLS STALAEL+WP+IL+KR IFR+VFLDFDP A Sbjct: 181 NTDPLYAAFHDEEWGLPVHDDKKLFELLSLSTALAELSWPVILSKRSIFRDVFLDFDPAA 240 Query: 712 VSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIG 533 VSKLN+KK+ATPGSPASSLLSE KLRAI+ENARQICKIID+LGSF+KYIW FVN KPI G Sbjct: 241 VSKLNDKKIATPGSPASSLLSEQKLRAIVENARQICKIIDELGSFDKYIWGFVNYKPIAG 300 Query: 532 NFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIA 353 NFRY RQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQV+GITNDHL++CFRYQ+CI Sbjct: 301 NFRYSRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVAGITNDHLINCFRYQDCII 360 Query: 352 AGD--LRDGDE---GLKTMNEGKTAEEISELELGRAIDDLGFT 239 AGD LRD + + + NE AE+ SEL+L IDDL + Sbjct: 361 AGDLILRDNNNNNWSIASKNEVNLAEDFSELDLATEIDDLNLS 403 >ref|XP_012856196.1| PREDICTED: uncharacterized protein LOC105975546 [Erythranthe guttatus] gi|604302147|gb|EYU21733.1| hypothetical protein MIMGU_mgv1a024334mg [Erythranthe guttata] Length = 390 Score = 457 bits (1176), Expect = e-125 Identities = 245/390 (62%), Positives = 292/390 (74%), Gaps = 11/390 (2%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196 MSGPPRVK M + E RPVLGP GNKARSVELRKP+ K SEK Q++QD D+ GKKSP Sbjct: 1 MSGPPRVKLMTSAELEARPVLGPTGNKARSVELRKPMLKSKSEKAQRAQDVDDSKGKKSP 60 Query: 1195 VTVT--DHLALKVDSK---WINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GRI 1034 + + K+ S NGR+A+ Q+ ++SLN + GRI Sbjct: 61 TALQLPETKPEKIPSPVGFMKNGRSAASFFMQR-SMSLNVSCSSDASSDSSHSRASTGRI 119 Query: 1033 SRRRVTLTPTMRRKQQCSPKERSAQKSFDGESEDIN---LAKKRCAWVTSNTDPYYAALH 863 S R T TP ++R QQ S K +K GE E ++ + KKRCAWVT+NTDP YAA H Sbjct: 120 SWRSGTPTPPLKRNQQSSFKRERIEKIVGGEGEVVDGAAVVKKRCAWVTANTDPLYAAFH 179 Query: 862 DEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVA 683 DEEWG+ 
VHDDKKLFELLSFSTALAELTWP+IL+KRH+FREVFLDFDP AVSKLN+KK+A Sbjct: 180 DEEWGLAVHDDKKLFELLSFSTALAELTWPVILSKRHLFREVFLDFDPNAVSKLNDKKIA 239 Query: 682 TPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPI 503 TPGSPASSLLS+L LRAI ENAR+ICKIID+ GSF+KYIW FVN KPI+GNFRYPR VPI Sbjct: 240 TPGSPASSLLSDLNLRAITENARRICKIIDEFGSFDKYIWGFVNHKPIVGNFRYPRLVPI 299 Query: 502 KTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRD-GDE 326 KTSKADTISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+++CI A DL D +E Sbjct: 300 KTSKADTISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRHRDCITACDLSDKSNE 359 Query: 325 GLKT-MNEGKTAEEISELELGRAIDDLGFT 239 G+ T NE K+ + I+E+EL R I+D+ + Sbjct: 360 GITTSKNEVKSLDNITEMELVRDINDVSLS 389 >emb|CDO98228.1| unnamed protein product [Coffea canephora] Length = 399 Score = 450 bits (1158), Expect = e-123 Identities = 240/402 (59%), Positives = 286/402 (71%), Gaps = 22/402 (5%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARS-VELRKPIGKPNSEKVQKSQDFDEFNGKKS 1199 MSGPPRV+SM+ +SEVRPVLGPAGNK RS +ELRKP+ KP V K Q+ ++ KKS Sbjct: 1 MSGPPRVRSMNHAESEVRPVLGPAGNKTRSALELRKPVSKPKISSVNKMQEGED---KKS 57 Query: 1198 PVTVTDHLALKVD-SKWINGRAASILGQQ-----------KPNLSLNXXXXXXXXXXXXX 1055 P TVT L K G +A+I+ QQ + NLS+N Sbjct: 58 PATVTMEKDLSPSPKKKFGGASAAIMSQQQQRQEVKSFLMRSNLSMNASCSSDASTDSSQ 117 Query: 1054 XXXT-GRISRRRVTLTPTMRRKQQCSPKERSAQK--------SFDGESEDINLAKKRCAW 902 + G+ISRR +T TP R++Q C PK +K + G ++D ++A+KRCAW Sbjct: 118 SRASTGKISRRSLTPTPIRRKQQHCGPKVEKLEKVGSEVDSVAVVGLADD-SVARKRCAW 176 Query: 901 VTSNTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFD 722 VT NTDP YAA HDEEWGVP H+DKKLFE LS STALAEL WP ILNKRH FREVF DFD Sbjct: 177 VTPNTDPSYAAFHDEEWGVPAHEDKKLFEFLSLSTALAELPWPTILNKRHTFREVFQDFD 236 Query: 721 PIAVSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKP 542 P+AVSKLNEKK+ATPGSPASSLLSELKLRAI+ENARQ CKII++ GSFEKYIW FVN KP Sbjct: 237 PVAVSKLNEKKIATPGSPASSLLSELKLRAIVENARQACKIIEEFGSFEKYIWGFVNYKP 296 Query: 541 IIGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQE 362 I+G+FRYPRQVPIKTSKAD 
ISKDLVRRGFRG+GPTVVYSFMQV+GITNDHL+SCFR+++ Sbjct: 297 IVGHFRYPRQVPIKTSKADAISKDLVRRGFRGIGPTVVYSFMQVAGITNDHLISCFRFRD 356 Query: 361 CIAAGDLRDGDEGLKTMNEGKTAEEISELELGRAIDDLGFTT 236 C+ GD R+ D+ L EGK AE+ +E +D L +T Sbjct: 357 CVDVGDGRNKDDDLIATIEGKQAEDSAESGFEERLDALSLST 398 >ref|XP_009617988.1| PREDICTED: uncharacterized protein LOC104110244 [Nicotiana tomentosiformis] Length = 398 Score = 428 bits (1101), Expect = e-117 Identities = 236/410 (57%), Positives = 277/410 (67%), Gaps = 30/410 (7%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKP----------NSEKVQKSQD 1226 MSG PRVKSM+ SEVRPVLGPAGNKARSVELRKP KP K +K Q Sbjct: 1 MSGGPRVKSMNHADSEVRPVLGPAGNKARSVELRKPTEKPIKTNNKPAETEESKGKKFQG 60 Query: 1225 FDEFNGKKSPVTVTDHLALKVDSKWINGRAASILGQQ--------KPNLSLNXXXXXXXX 1070 D KSPV + G SIL QQ +PNLSLN Sbjct: 61 ADPLPQSKSPVAASKKC----------GSVPSILRQQQDHRTLLMRPNLSLNASCSSDAS 110 Query: 1069 XXXXXXXXT-GRISRRRVTLTPTMRRKQQCSPKERSAQKSFDGESE-----------DIN 926 + G++SR +LTP R++QCSPK ++KS E D + Sbjct: 111 TDSSHSRASTGKLSRG--SLTPKSGRRKQCSPKVDKSEKSGKSVGEVESLSPSPVSGDAS 168 Query: 925 LAKKRCAWVTSNTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIF 746 + KKRCAWVT TDP YAA HDEEWGVPVHDDKKLFELLS TALAEL+WP IL+KRH F Sbjct: 169 VIKKRCAWVTPTTDPSYAAFHDEEWGVPVHDDKKLFELLSLCTALAELSWPAILSKRHTF 228 Query: 745 REVFLDFDPIAVSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYI 566 REVF +FDP+AVSKLNEKK+A PGSPAS+LLSE+KLRAI+ENARQ CKIID+LGSF+KYI Sbjct: 229 REVFQNFDPVAVSKLNEKKIAPPGSPASTLLSEVKLRAIVENARQTCKIIDELGSFDKYI 288 Query: 565 WSFVNSKPIIGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHL 386 W FVN+KPI+ FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL Sbjct: 289 WGFVNNKPIVSQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHL 348 Query: 385 VSCFRYQECIAAGDLRDGDEGLKTMNEGKTAEEISELELGRAIDDLGFTT 236 +SCFR+ +C+AA D + D+GL E K ++ +E+ L RAIDD +T Sbjct: 349 ISCFRFHDCVAAIDGMENDDGLAAKTEVKQLKDETEMGLIRAIDDFNLST 398 >ref|XP_009802477.1| PREDICTED: uncharacterized protein LOC104248004 [Nicotiana sylvestris] Length = 399 Score = 424 bits 
(1090), Expect = e-115 Identities = 239/402 (59%), Positives = 280/402 (69%), Gaps = 22/402 (5%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196 MSG PRVKSM+ SEVRPVLGPAGNKARSVELRKPI KP K + +E GKK P Sbjct: 1 MSGGPRVKSMNHADSEVRPVLGPAGNKARSVELRKPIEKPVKTN-NKPAETEESKGKKFP 59 Query: 1195 -VTVTDHLALKVDSKWINGRAASILGQQ--------KPNLSLNXXXXXXXXXXXXXXXXT 1043 V + G SIL QQ +PNLSLN + Sbjct: 60 GADPLPQSKSPVAASKKCGSVPSILRQQQDHRTLLMRPNLSLNASCSSDASTDSSHSRAS 119 Query: 1042 -GRISRRRVTLTPTMRRKQQCSPKERSAQKSFD--GESE---------DINLAKKRCAWV 899 G++SR +LTP R++QCSPK ++KS GESE D ++ KKRCAWV Sbjct: 120 TGKLSRG--SLTPKSGRRKQCSPKVDKSEKSGKSVGESESLSPSPVSGDASVIKKRCAWV 177 Query: 898 TSNTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDP 719 T TDP YAA HDEEWGVPVHDDKKLFELLS TALAEL+WP IL+KRH FREVF +FDP Sbjct: 178 TPTTDPSYAAFHDEEWGVPVHDDKKLFELLSLCTALAELSWPAILSKRHTFREVFQNFDP 237 Query: 718 IAVSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPI 539 +AVSKLNEKK+A PGSPAS+LLSE+KLRAIIENARQ CKIID+LGSF+KY+W FVN+KPI Sbjct: 238 VAVSKLNEKKIAPPGSPASTLLSEVKLRAIIENARQTCKIIDELGSFDKYMWGFVNNKPI 297 Query: 538 IGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQEC 359 + FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ +C Sbjct: 298 VSQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHDC 357 Query: 358 IAAGDLRDGDEGLKTMNEGK-TAEEISELELGRAIDDLGFTT 236 +AA D D D+GL E K ++ +E+ L RAI D +T Sbjct: 358 VAAIDGMDKDDGLVAKTEVKQQLKDETEMGLIRAIADFNLST 399 >ref|XP_004235942.1| PREDICTED: uncharacterized protein LOC101247118 [Solanum lycopersicum] Length = 395 Score = 423 bits (1087), Expect = e-115 Identities = 234/402 (58%), Positives = 281/402 (69%), Gaps = 22/402 (5%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196 MSG PRVK M+ SEVR VLGPAGNKARSVELRKP+ KP V+K+ + +E GKK Sbjct: 1 MSGGPRVKLMNNADSEVRSVLGPAGNKARSVELRKPVEKP----VKKAAESEESKGKKFE 56 Query: 1195 VTVTDHLALKVDSKWINGRAASILGQQ--------KPNLSLNXXXXXXXXXXXXXXXXTG 1040 T + + ++ G SIL QQ 
+PNLSLN + Sbjct: 57 GTDS---VPQSRARKCGGAVPSILRQQQDHRSLMMRPNLSLNASCSSDASTDSSHSRAST 113 Query: 1039 RISRRRVTLTPTMRRKQQCS-PK----ERSAQKSFDGES-------EDINLAKKRCAWVT 896 R ++TPT R++QCS PK E+ + +GES +D ++ KKRCAWVT Sbjct: 114 TGKMSRGSVTPTAGRRKQCSSPKVVKSEKIGKTVGEGESLASSPTPDDASVMKKRCAWVT 173 Query: 895 SNTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPI 716 NTDP YAA HDEEWGV VHDDKKLFELLS TALAEL+WP IL+KRH+FREVF +FDP+ Sbjct: 174 PNTDPSYAAFHDEEWGVSVHDDKKLFELLSLCTALAELSWPAILSKRHMFREVFQNFDPV 233 Query: 715 AVSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPII 536 AVSKLNEKK+A PGSPAS+LLSE+KLRA+IENARQ CKIID+LGSF+KYIW FVN+KPI+ Sbjct: 234 AVSKLNEKKIAPPGSPASTLLSEVKLRAVIENARQTCKIIDELGSFDKYIWGFVNNKPIV 293 Query: 535 GNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECI 356 FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ +C+ Sbjct: 294 SQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHDCV 353 Query: 355 AAGDLRDGDEGLKTMNEGKTAEEISELELG--RAIDDLGFTT 236 AA D D D+GL E K + E E+G RAIDD +T Sbjct: 354 AATDGTDKDDGLAAKTEVKQLQLKDETEMGLIRAIDDFNLST 395 >ref|XP_006341344.1| PREDICTED: uncharacterized protein LOC102594169 [Solanum tuberosum] Length = 399 Score = 421 bits (1083), Expect = e-115 Identities = 234/403 (58%), Positives = 281/403 (69%), Gaps = 23/403 (5%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196 MSG PRVK M+ SEVR VLGPAGNKARSVELRKP+ KP ++K+ + +E GKK Sbjct: 1 MSGGPRVKLMNNADSEVRSVLGPAGNKARSVELRKPVEKP----IKKAAESEESKGKKFE 56 Query: 1195 VT--VTDHLALKVDSKWINGRAASILGQQ--------KPNLSLNXXXXXXXXXXXXXXXX 1046 T V A SK G SIL QQ +PNLSLN Sbjct: 57 GTDSVPQSRAPVAASKKCGGAVPSILRQQQDHRSLMMRPNLSLNASCSSDASTDSSHSRA 116 Query: 1045 TGRISRRRVTLTPTMRRKQQCS-PK----ERSAQKSFDGES-------EDINLAKKRCAW 902 + R ++TPT R++QCS PK E+ + +G+S D ++ KKRCAW Sbjct: 117 STTGKLSRGSVTPTAGRRKQCSSPKVVKSEKIGKTVGEGQSLASSPTPGDASVMKKRCAW 176 Query: 901 VTSNTDPYYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFD 722 VT NTDP YAA HDEEWGV +HDDKKLFELLS TALAEL+WP 
IL+KRH+FREVF +FD Sbjct: 177 VTPNTDPSYAAFHDEEWGVSIHDDKKLFELLSLCTALAELSWPAILSKRHMFREVFQNFD 236 Query: 721 PIAVSKLNEKKVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKP 542 P+AVSKLNEKK+A PGSPAS+LLSE+KLRA+IENARQ CKIID+LGSF+KYIW FVN+KP Sbjct: 237 PVAVSKLNEKKIAPPGSPASTLLSEVKLRAVIENARQTCKIIDELGSFDKYIWGFVNNKP 296 Query: 541 IIGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQE 362 I+ FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ + Sbjct: 297 IVSQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHD 356 Query: 361 CIAAGDLRDGDEGLKTMNEGK-TAEEISELELGRAIDDLGFTT 236 C+AA D D D+GL E K ++ +E+ L RAIDD +T Sbjct: 357 CVAATDGTDKDDGLAAKTEVKQQLKDETEMGLIRAIDDFNLST 399 >ref|XP_010265584.1| PREDICTED: uncharacterized protein LOC104603287 [Nelumbo nucifera] Length = 380 Score = 407 bits (1045), Expect = e-110 Identities = 228/381 (59%), Positives = 262/381 (68%), Gaps = 5/381 (1%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNG-KKS 1199 MSG PRV+SM+ S+ RPVLGP GNK S+ RKP+ KP KV+KS + NG KK+ Sbjct: 1 MSGAPRVRSMNVADSDARPVLGPTGNKTGSLVTRKPVSKP-LRKVEKSPEVA--NGEKKT 57 Query: 1198 PVTVTDHLALKVDSKWINGRAASILGQQK---PNLSLNXXXXXXXXXXXXXXXXT-GRIS 1031 P + K+ S + SIL + + NLSLN + GRI Sbjct: 58 PSSPVAPSPPKLQSASV----PSILRRHEFLHSNLSLNASCSSDASSDSVYSRASTGRII 113 Query: 1030 RRRVTLTPTMRRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEW 851 R T + T RRK+ S E+ A S S + K+RCAWVT NTDP YAA HDEEW Sbjct: 114 R---TSSTTSRRKRSISRPEKVAPDSVSDSSSESIQTKRRCAWVTPNTDPCYAAFHDEEW 170 Query: 850 GVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 671 GVPVHDDKKLFE L S ALAEL WP+IL+KRHIFREVF DFDP+AVSKLNEKK+ TPG Sbjct: 171 GVPVHDDKKLFEFLVLSGALAELPWPVILSKRHIFREVFADFDPVAVSKLNEKKITTPGG 230 Query: 670 PASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSK 491 A SLLSELKLRAIIENARQICK+ID+ GSF YIWSFVN KPII FRYPRQVP+KT K Sbjct: 231 TAISLLSELKLRAIIENARQICKVIDEFGSFNNYIWSFVNHKPIISKFRYPRQVPVKTPK 290 Query: 490 ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTM 311 AD ISKDLVRRGFR 
VGPTV+YSFMQV+GITNDHL++CFRYQECI A + DEG K Sbjct: 291 ADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLINCFRYQECIDATAAIE-DEGSKAK 349 Query: 310 NEGKTAEEISELELGRAIDDL 248 E K E+I LELG+AID+L Sbjct: 350 AEEKKTEDIINLELGKAIDEL 370 >ref|XP_010275821.1| PREDICTED: uncharacterized protein LOC104610746 [Nelumbo nucifera] Length = 387 Score = 393 bits (1010), Expect = e-106 Identities = 222/391 (56%), Positives = 256/391 (65%), Gaps = 15/391 (3%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDF--DEFNGKK 1202 MSG PRV+S++ SE RPVLGPAGNK RS+ RKP KP KV+K+ + +E Sbjct: 1 MSGAPRVRSINVADSEARPVLGPAGNKTRSLVTRKPASKP-LRKVEKTPEAVDEEKKAPS 59 Query: 1201 SPVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GRISRR 1025 SPV + V I R + NLSLN + GR+ R Sbjct: 60 SPVAASPPKLQPVSVPSILRRHEFL----HSNLSLNASCSSDASSDSVYSRASTGRLIRT 115 Query: 1024 RVTLTPTMRRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWGV 845 R T + RRK S E+ S S D KKRCAWVT NTDP YAA HDEEWGV Sbjct: 116 RSTPS---RRKYSISRPEKVVPDSASDSSPDSIETKKRCAWVTPNTDPCYAAFHDEEWGV 172 Query: 844 PVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPA 665 PVHDDKKLFELL S ALAELTWP IL+KRHIFREVF DFDP+AVSKLNEKK+ PGS A Sbjct: 173 PVHDDKKLFELLVLSGALAELTWPTILSKRHIFREVFSDFDPVAVSKLNEKKITAPGSTA 232 Query: 664 SSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKAD 485 SSLLSELKLRAIIENARQICK+ID+ GSF+ YIWSFVN KPII FRYPRQVP+K KAD Sbjct: 233 SSLLSELKLRAIIENARQICKVIDEFGSFDNYIWSFVNHKPIISKFRYPRQVPVKIPKAD 292 Query: 484 TISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDE------- 326 ISKDLVRRGFR VGPTVVYSFMQV+GITNDHL++CFR+Q C+ + +GD+ Sbjct: 293 VISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLINCFRFQVCMDTPTVSEGDDKLRIGKA 352 Query: 325 -----GLKTMNEGKTAEEISELELGRAIDDL 248 G K E K E++ + ELG+A+D L Sbjct: 353 EETPTGSKGTAEEKKTEDMIKSELGKAMDKL 383 >ref|XP_007011936.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|590572766|ref|XP_007011937.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|590572769|ref|XP_007011938.1| DNA glycosylase 
superfamily protein isoform 1 [Theobroma cacao] gi|590572773|ref|XP_007011939.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782299|gb|EOY29555.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782300|gb|EOY29556.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782301|gb|EOY29557.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782302|gb|EOY29558.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 379 Score = 387 bits (993), Expect = e-104 Identities = 215/384 (55%), Positives = 263/384 (68%), Gaps = 4/384 (1%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQ-DFDEFNGKKS 1199 MSG PR++SM+ SE RPVLGPAGNKA S+ RKP KP KV+KS + KK+ Sbjct: 1 MSGAPRMRSMNVADSEARPVLGPAGNKAGSLSARKPASKP-LRKVEKSPVEVTVAEEKKA 59 Query: 1198 -PVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRR 1022 P + + L+ K S + S+L + + L N R S R Sbjct: 60 LPSSTVNSLSPKTHSVSV----PSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGR 115 Query: 1021 VTLTPTM-RRKQQCSPKERSAQKSFDGESE-DINLAKKRCAWVTSNTDPYYAALHDEEWG 848 + + ++ R++ + K RS +S D + KKRCAWVT NTDP Y A HDEEWG Sbjct: 116 LIRSNSVGNRRKPYASKPRSVVSDGGLDSPPDGSHQKKRCAWVTPNTDPSYVAFHDEEWG 175 Query: 847 VPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSP 668 VPVHDD+KLFELL S AL+ELTWP IL+KRHI REVF+DFD +AVSKLNEKK+ TPGS Sbjct: 176 VPVHDDRKLFELLVLSGALSELTWPAILSKRHIVREVFVDFDAVAVSKLNEKKLVTPGSI 235 Query: 667 ASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKA 488 ASSLLSELKLRAIIENARQI K+ID+ GSF++YIWSFVN KPI+ FRYPRQVP+KT KA Sbjct: 236 ASSLLSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRQVPVKTPKA 295 Query: 487 DTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMN 308 D ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL SCFR+QECI A + ++ + G+K M Sbjct: 296 DVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEGKE-ENGIKDMP 354 Query: 307 EGKTAEEISELELGRAIDDLGFTT 236 E K E + E +L AID+L F++ Sbjct: 355 EEKKTENVMESKLSIAIDELSFSS 378 >ref|XP_009358441.1| 
PREDICTED: uncharacterized protein LOC103949071 [Pyrus x bretschneideri] Length = 378 Score = 381 bits (978), Expect = e-102 Identities = 210/382 (54%), Positives = 251/382 (65%), Gaps = 2/382 (0%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196 MSG PRV+S++ SE RPVLGPAGNKA + RKP KP ++K++ F E Sbjct: 1 MSGAPRVRSINVADSESRPVLGPAGNKAGTFSARKPASKP----LRKAEKFSEEVSSAEE 56 Query: 1195 VTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRVT 1016 L + + + S+L + + L N R S R+ Sbjct: 57 KKTHQSPMLTTSPQPHSPKVHSVLRRHEQLLHSNFSLNASCSSDASTDSFQSRASTGRLI 116 Query: 1015 LTPTM--RRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWGVP 842 + ++ RRKQ S D + +KKRCAWVT N DP YAA HDEEWG+P Sbjct: 117 RSNSVGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNADPCYAAFHDEEWGLP 176 Query: 841 VHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPAS 662 VHDDKKLFELL S ALAEL+WP IL+K+HIFREVF DFDPIAVSKLNEKK+ +PGS AS Sbjct: 177 VHDDKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPIAVSKLNEKKLISPGSAAS 236 Query: 661 SLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKADT 482 SLLSELKLRAIIENARQ K+I++ GSF+KYIWSFVN+KPI FRYPRQVP+KT KAD Sbjct: 237 SLLSELKLRAIIENARQTTKVIEEFGSFDKYIWSFVNNKPIESRFRYPRQVPVKTPKADV 296 Query: 481 ISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMNEG 302 ISKDLVRRGFR VGPTV+YSFMQV+GITNDHLVSCFR+QEC+ A + DG+ G+K G Sbjct: 297 ISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECVNAAE-GDGENGIKD-EAG 354 Query: 301 KTAEEISELELGRAIDDLGFTT 236 K E E EL AID L F++ Sbjct: 355 KKTENGIESELSVAIDKLSFSS 376 >ref|XP_008242987.1| PREDICTED: uncharacterized protein LOC103341267 isoform X1 [Prunus mume] Length = 378 Score = 380 bits (977), Expect = e-102 Identities = 208/382 (54%), Positives = 253/382 (66%), Gaps = 2/382 (0%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196 MSG PRV+S++ SE RPVLGPAGNKA + RKP+ KP ++K++ E Sbjct: 1 MSGAPRVRSINVADSESRPVLGPAGNKAGTFSARKPVSKP----LRKAEKLAEKVASAEE 56 Query: 1195 
VTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRVT 1016 L + + S+L + + L N R S R+T Sbjct: 57 KKTRQSSMLTTSPQLHSPSVPSVLRRHEQLLHSNFSLNASCSSDASTDSFHSRASTGRLT 116 Query: 1015 LTPTM--RRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWGVP 842 + + RRKQ S D + +KKRCAWVT NTDP YAA HDEEWG+P Sbjct: 117 RSNSAGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNTDPCYAAFHDEEWGLP 176 Query: 841 VHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPAS 662 VHDDKKLFELL S ALAEL+WP IL+K+HIFREVF DFDP+AVSKLNEKK+ PGS AS Sbjct: 177 VHDDKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPVAVSKLNEKKLIAPGSTAS 236 Query: 661 SLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKADT 482 SLLSELKLRAIIENARQ+ K+I++ GSF+KYIWSFVN+KPI+ FRYPRQVP KT KAD Sbjct: 237 SLLSELKLRAIIENARQMTKVIEEFGSFDKYIWSFVNNKPIVSRFRYPRQVPAKTPKADV 296 Query: 481 ISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMNEG 302 ISKDLVRRGFR VGPTV+YSFMQV+GITNDHLVSCFR+QEC+ A + ++ D G+K E Sbjct: 297 ISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECLNAAEGKE-DYGIKDEAEK 355 Query: 301 KTAEEISELELGRAIDDLGFTT 236 KT I E +L A+D+L F++ Sbjct: 356 KTENGI-ESDLSVAMDELSFSS 376 >ref|XP_003527169.1| PREDICTED: uncharacterized protein LOC100801026 isoform X1 [Glycine max] gi|571461733|ref|XP_006582090.1| PREDICTED: uncharacterized protein LOC100801026 isoform X2 [Glycine max] gi|571461735|ref|XP_006582091.1| PREDICTED: uncharacterized protein LOC100801026 isoform X3 [Glycine max] gi|734430051|gb|KHN45352.1| Putative GMP synthase [glutamine-hydrolyzing] [Glycine soja] Length = 383 Score = 380 bits (976), Expect = e-102 Identities = 204/382 (53%), Positives = 252/382 (65%), Gaps = 2/382 (0%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196 MSG PR++SM+ SE RPVLGPAGNK S+ RK KP +KV K D +K P Sbjct: 1 MSGAPRLRSMNVADSEARPVLGPAGNKTGSLSSRKTASKPLRKKVDKLLDEIASVKEKKP 60 Query: 1195 VTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRVT 1016 V + + + +L + + L N R S R+T Sbjct: 61 HQVLLSSVATSSPQSHSASVSLLLPRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLT 
120 Query: 1015 LTPTM--RRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWGVP 842 + ++ RRK S A D + + KRCAWVT NT+P YA HDEEWGVP Sbjct: 121 RSYSLGSRRKPYVSKPRSVASDGVLESPTDGSQSNKRCAWVTPNTEPCYATFHDEEWGVP 180 Query: 841 VHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPAS 662 VHDDKKLFELL S+ LAE TWP IL+KRHIFREVF+DF+P+AVSKLNEKK+ TPG+ AS Sbjct: 181 VHDDKKLFELLVLSSVLAEHTWPAILSKRHIFREVFVDFEPVAVSKLNEKKIMTPGTIAS 240 Query: 661 SLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKADT 482 SLLSE+KLRAIIENARQI K+ID+ GSF+KYIWSFVN KPI+ FRYPRQVP+KT KAD Sbjct: 241 SLLSEVKLRAIIENARQISKVIDEFGSFDKYIWSFVNHKPIVSRFRYPRQVPVKTPKADV 300 Query: 481 ISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMNEG 302 ISKDLVRRGFRGVGPTVVYSFMQV+G+T DHL+SCFR++ECIAA + ++ + + + Sbjct: 301 ISKDLVRRGFRGVGPTVVYSFMQVAGLTIDHLISCFRFEECIAAAEGKEENGIMDNHADQ 360 Query: 301 KTAEEISELELGRAIDDLGFTT 236 K +E I E +L A++DL F + Sbjct: 361 KESENIMESDLSIAMEDLSFAS 382 >ref|XP_012442673.1| PREDICTED: uncharacterized protein LOC105767651 isoform X1 [Gossypium raimondii] gi|763787989|gb|KJB54985.1| hypothetical protein B456_009G057100 [Gossypium raimondii] gi|763787990|gb|KJB54986.1| hypothetical protein B456_009G057100 [Gossypium raimondii] Length = 379 Score = 378 bits (971), Expect = e-102 Identities = 209/384 (54%), Positives = 255/384 (66%), Gaps = 4/384 (1%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKS- 1199 MSG PR++SM+ T SE RPVLGPAGNKA S+ RKP KP+ + + S + KK+ Sbjct: 1 MSGAPRLRSMNVTDSEARPVLGPAGNKAGSLSARKPASKPSKKVEKSSVEVTVVEEKKAL 60 Query: 1198 PVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRV 1019 P + + L+ K S + S+L + + L + R S R+ Sbjct: 61 PSSTVNSLSPKTHSLSV----PSVLRRHERLLHSSLSLNASCSSDASTDSFQSRASTGRL 116 Query: 1018 TLTPTM--RRKQQCS-PKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWG 848 + ++ RRK S PK + S D S + KKRC WVT NTDP YAA HDEEWG Sbjct: 117 SRCGSLGSRRKPYASKPKSLVSDDSLDLSSNSSH-HKKRCTWVTPNTDPSYAAFHDEEWG 175 Query: 847 
VPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSP 668 VPVHDDKKLFELL S +L+ELTW IL+KRHIFREVF+DFDP+AVSKLNEKK+ GS Sbjct: 176 VPVHDDKKLFELLVLSGSLSELTWSAILSKRHIFREVFMDFDPVAVSKLNEKKLIAHGSV 235 Query: 667 ASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKA 488 ASSLLSEL LRAI+ENARQI K+ID+ SF++YIWSFVN KPI+ FRYPRQVP+KT KA Sbjct: 236 ASSLLSELMLRAIVENARQISKVIDEFRSFDQYIWSFVNHKPIVSRFRYPRQVPVKTPKA 295 Query: 487 DTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMN 308 D ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL SCFR+QECI A + ++ + K Sbjct: 296 DVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEAKE-ENVTKDTT 354 Query: 307 EGKTAEEISELELGRAIDDLGFTT 236 E K + EL AID+L F+T Sbjct: 355 EKKETVNVINTELSVAIDELSFST 378 >ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] gi|223531126|gb|EEF32974.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] Length = 380 Score = 378 bits (971), Expect = e-102 Identities = 210/385 (54%), Positives = 259/385 (67%), Gaps = 5/385 (1%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGN-KARSVELRKPIGKPNSEKVQKSQDFDEFNGKKS 1199 MSG PRV+SM+ SE RPVLGP GN KA S+ +KP K KV+ S + + +K Sbjct: 1 MSGAPRVRSMNVADSETRPVLGPTGNNKAGSLSAKKPASK-QLRKVETSPEAVKLGQEKK 59 Query: 1198 PVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRV 1019 VTV AL S ++ S+L + + L N R S R+ Sbjct: 60 LVTVPTASALSPKSHSVS--VPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRL 117 Query: 1018 TLTPTM-RRKQQCSPKERSAQKSFDGES---EDINLAKKRCAWVTSNTDPYYAALHDEEW 851 T + ++ R++Q + K RS ES D + AKK CAWVT N DP Y A HDEEW Sbjct: 118 TRSNSLGTRRKQYALKPRSVVSDGGLESPPPSDGSQAKKSCAWVTPNADPCYTAFHDEEW 177 Query: 850 GVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 671 G+PVHDDKKLFELL S ALAELTWP IL+KRHIFREVF +FDP+ VSK NEKK+ PGS Sbjct: 178 GIPVHDDKKLFELLVLSGALAELTWPAILSKRHIFREVFANFDPVVVSKFNEKKIIAPGS 237 Query: 670 PASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSK 491 ASSLLSE+KLRAIIENARQI K+ D+LGSF+KYIWSFVN KPI+ FRYPRQVP+KT K Sbjct: 238 
TASSLLSEIKLRAIIENARQISKVTDELGSFDKYIWSFVNYKPIVSRFRYPRQVPVKTPK 297 Query: 490 ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTM 311 AD ISKDLVRRGFR VGPTVVYSFMQV+G+TNDHL+SCFR+QECI A + ++ + G+K Sbjct: 298 ADVISKDLVRRGFRSVGPTVVYSFMQVAGLTNDHLISCFRFQECINAAEGKE-ENGVKV- 355 Query: 310 NEGKTAEEISELELGRAIDDLGFTT 236 E K + + E ++ A+D+L F++ Sbjct: 356 -EDKITDGVVESQISIAMDELSFSS 379 >gb|KHG02440.1| putative GMP synthase [glutamine-hydrolyzing] [Gossypium arboreum] Length = 379 Score = 377 bits (969), Expect = e-101 Identities = 211/385 (54%), Positives = 258/385 (67%), Gaps = 5/385 (1%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKS- 1199 MSG PR++SM+ T SE RPVLGPAGNKA S+ RKP K +S+KV+KS G+K Sbjct: 1 MSGAPRLRSMNVTDSEARPVLGPAGNKAGSLSARKPASK-SSKKVEKSSVEVTVVGEKKA 59 Query: 1198 -PVTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRR 1022 P + + L+ K S + S+L + + L + R S R Sbjct: 60 LPSSTVNSLSPKTHSLSV----PSVLRRHERLLHSSLSLNASCSSDASTDSFQSRASTGR 115 Query: 1021 VTLTPTM--RRKQQCS-PKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEW 851 + ++ RRK S PK + S D S + KKRCAWVT +TDP YAA HDEEW Sbjct: 116 LNRCDSLGSRRKPYASKPKSVVSDDSLDLSSNSSH-PKKRCAWVTPSTDPSYAAFHDEEW 174 Query: 850 GVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 671 GVPVHDD+KLFELL S +L+ELTW IL+KRHIFREVF+DFDP+AVSKLNEKK+ GS Sbjct: 175 GVPVHDDRKLFELLVLSGSLSELTWSAILSKRHIFREVFIDFDPVAVSKLNEKKLIAHGS 234 Query: 670 PASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSK 491 ASSLLSELKLR I+ENARQI K+ID+ GSF++YIWSFVN KPI+ FRYPRQVP+KT K Sbjct: 235 VASSLLSELKLRVIVENARQISKVIDEFGSFDQYIWSFVNHKPIVSRFRYPRQVPVKTPK 294 Query: 490 ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTM 311 AD ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL SCFR+QECI A + ++ + K Sbjct: 295 ADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEAKE-ENVTKDP 353 Query: 310 NEGKTAEEISELELGRAIDDLGFTT 236 E K + EL AID+L F++ Sbjct: 354 TEKKETVNVINTELSVAIDELSFSS 378 >ref|XP_007204814.1| hypothetical protein PRUPE_ppa026720mg [Prunus persica] 
gi|462400345|gb|EMJ06013.1| hypothetical protein PRUPE_ppa026720mg [Prunus persica] Length = 378 Score = 377 bits (968), Expect = e-101 Identities = 205/382 (53%), Positives = 253/382 (66%), Gaps = 2/382 (0%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDFDEFNGKKSP 1196 MSG PRV+S++ SE RPVLGPAGNKA + RKP+ KP ++K++ E Sbjct: 1 MSGAPRVRSINVADSESRPVLGPAGNKAGTFSARKPVSKP----LRKAEKLAEKVASAEE 56 Query: 1195 VTVTDHLALKVDSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRVT 1016 L + + S+L + + L N R S R+T Sbjct: 57 KKTRQSSMLTTSPQLHSPSVPSVLRRHEQLLHSNFSLNASCSSDASTDSFHSRASTGRLT 116 Query: 1015 LTPTM--RRKQQCSPKERSAQKSFDGESEDINLAKKRCAWVTSNTDPYYAALHDEEWGVP 842 + + RRKQ S D + +KKRCAWVT NTDP YAA HDEEWG+P Sbjct: 117 RSNSAGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNTDPCYAAFHDEEWGLP 176 Query: 841 VHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPAS 662 VHDDKKLFELL S ALAEL+WP IL+K+HIFREVF DFDP+A+SKLNEKK+ PGS AS Sbjct: 177 VHDDKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPVAISKLNEKKLIAPGSNAS 236 Query: 661 SLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQVPIKTSKADT 482 SLLSELKLRAIIENARQ+ K+I++ GSF+KYIWSFVN+KPI+ FRYPRQVP KT KAD Sbjct: 237 SLLSELKLRAIIENARQMTKVIEEFGSFDKYIWSFVNNKPIVSRFRYPRQVPAKTPKADV 296 Query: 481 ISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDEGLKTMNEG 302 ISKDL+RRGFR VGPTV+YSFMQV+GITNDHLVSCFR+QEC+ A + ++ + G+K E Sbjct: 297 ISKDLMRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECLNAAEGKE-EYGIKDEAEK 355 Query: 301 KTAEEISELELGRAIDDLGFTT 236 KT I E +L A+D+L F++ Sbjct: 356 KTENGI-ESDLSVAMDELSFSS 376 >ref|XP_002276173.1| PREDICTED: uncharacterized protein LOC100267363 isoform X2 [Vitis vinifera] gi|297743642|emb|CBI36525.3| unnamed protein product [Vitis vinifera] Length = 375 Score = 377 bits (967), Expect = e-101 Identities = 208/392 (53%), Positives = 255/392 (65%), Gaps = 12/392 (3%) Frame = -1 Query: 1375 MSGPPRVKSMDFTQSEVRPVLGPAGNKA-RSVELRKPIGKPNSEKVQKSQDFDEFNGKKS 1199 MSG PRV+SM+ SEVRPVLGPAGNK RS+ RKP KP + + ++D +E S Sbjct: 1 
MSGGPRVRSMNVADSEVRPVLGPAGNKTMRSLSGRKPATKPLRKAEKATKDDEEIKALPS 60 Query: 1198 PVTVTDHLALKVDSKWINGRAAS----------ILGQQKPNLSLNXXXXXXXXXXXXXXX 1049 NG A+S +L +Q+ L N Sbjct: 61 S----------------NGAASSPPSHSVSVPLVLRRQEQLLHSNLSLNASCSSDASTDS 104 Query: 1048 XTGRISRRRVTLTPTMRRKQQCSPKERSAQKSFDGESEDINL-AKKRCAWVTSNTDPYYA 872 R S R+T + + R++ + K + ES L AK+RCAWVT NTD Y Sbjct: 105 FHSRASTGRITRSSSTARRRSYASKPKVIVSDGVSESPPDGLKAKRRCAWVTPNTDLSYI 164 Query: 871 ALHDEEWGVPVHDDKKLFELLSFSTALAELTWPIILNKRHIFREVFLDFDPIAVSKLNEK 692 A HDEEWGVPVHDDKKLFELL S ALAELTWP IL+KRHIFREVF DFDPIAV+KLNEK Sbjct: 165 AFHDEEWGVPVHDDKKLFELLVLSGALAELTWPTILSKRHIFREVFADFDPIAVAKLNEK 224 Query: 691 KVATPGSPASSLLSELKLRAIIENARQICKIIDDLGSFEKYIWSFVNSKPIIGNFRYPRQ 512 K+ PGS ASSL+SELKLR IIENARQ+ K+ID+ GSF++YIWSFVN KPI+ FRYPR Sbjct: 225 KLMAPGSIASSLISELKLRGIIENARQMSKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRH 284 Query: 511 VPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDG 332 VP+KT KAD ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL+SCFR+Q+C+ A +++ Sbjct: 285 VPVKTPKADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQDCVTAAEVK-- 342 Query: 331 DEGLKTMNEGKTAEEISELELGRAIDDLGFTT 236 +E + T + + E EL RAID+L F++ Sbjct: 343 EEEITTGAAEEKKSNVIESELSRAIDELSFSS 374