BLASTX nr result
ID: Forsythia22_contig00006812
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00006812
         (1642 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_011073325.1| PREDICTED: uncharacterized protein LOC105158...   509   e-141
ref|XP_011082395.1| PREDICTED: uncharacterized protein LOC105165...   499   e-138
ref|XP_012852805.1| PREDICTED: uncharacterized protein LOC105972...   484   e-134
ref|XP_012856196.1| PREDICTED: uncharacterized protein LOC105975...   460   e-126
emb|CDO98228.1| unnamed protein product [Coffea canephora]            454   e-125
ref|XP_009617988.1| PREDICTED: uncharacterized protein LOC104110...   425   e-116
ref|XP_009802477.1| PREDICTED: uncharacterized protein LOC104248...   421   e-115
ref|XP_004235942.1| PREDICTED: uncharacterized protein LOC101247...   419   e-114
ref|XP_006341344.1| PREDICTED: uncharacterized protein LOC102594...   417   e-113
ref|XP_010265584.1| PREDICTED: uncharacterized protein LOC104603...   405   e-110
ref|XP_010275821.1| PREDICTED: uncharacterized protein LOC104610...   393   e-106
ref|XP_007011936.1| DNA glycosylase superfamily protein isoform ...   387   e-104
ref|XP_012442673.1| PREDICTED: uncharacterized protein LOC105767...   382   e-103
ref|XP_003527169.1| PREDICTED: uncharacterized protein LOC100801...   382   e-103
ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [R...   379   e-102
gb|KHG02440.1| putative GMP synthase [glutamine-hydrolyzing] [Go...   379   e-102
ref|XP_002276173.1| PREDICTED: uncharacterized protein LOC100267...   378   e-102
ref|XP_012462430.1| PREDICTED: uncharacterized protein LOC105782...   378   e-102
ref|XP_002324538.1| methyladenine glycosylase family protein [Po...
378 e-102 ref|XP_009358441.1| PREDICTED: uncharacterized protein LOC103949... 377 e-101 >ref|XP_011073325.1| PREDICTED: uncharacterized protein LOC105158309 [Sesamum indicum] Length = 397 Score = 509 bits (1311), Expect = e-141 Identities = 261/397 (65%), Positives = 306/397 (77%), Gaps = 17/397 (4%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNGKKSP 1133 MSGPPRVKSM+FT+ E RPVLGPAGNK+RS ELRKP+ KP SEK Q+ D+DE GKKSP Sbjct: 1 MSGPPRVKSMNFTEPEARPVLGPAGNKSRSAELRKPVLKPKSEKTQRPPDIDESKGKKSP 60 Query: 1132 VTV------TDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GR 974 + ++ + V + AASIL Q++ NLSLN + GR Sbjct: 61 AALESPELASEKIPSPVGFRRSGSSAASILRQRQANLSLNASCSSDASSDSSQSRASTGR 120 Query: 973 ISRRRVTLTPTMRRKQQCSPKERNAQ------KSFDGESEDIDL----AKKRCAWVTSNT 824 ISRR T TP ++RK QCS K + K+ GESE + + KKRCAWVTSNT Sbjct: 121 ISRRSATPTPPLKRKPQCSSKGGKIENKEGYGKNVGGESESLVVDGAAVKKRCAWVTSNT 180 Query: 823 DPSYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVS 644 DPSYAA HDEEWGVPVHDDKKLFELLSFSTALAE+TWP+IL+KRHIFREVFL FDP+AVS Sbjct: 181 DPSYAAFHDEEWGVPVHDDKKLFELLSFSTALAEITWPIILSKRHIFREVFLGFDPVAVS 240 Query: 643 KLNEKKVATPGSPASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNF 464 KLNEKK+ATPG+PA SLLSEL LRAI+ENARQICKII++LGSF+KYIW FVNYKPI+GNF Sbjct: 241 KLNEKKIATPGNPACSLLSELKLRAIVENARQICKIINELGSFDKYIWGFVNYKPIVGNF 300 Query: 463 RYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAG 284 RYPRQVPI+TSKADTISKDLVRRGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ +C+ AG Sbjct: 301 RYPRQVPIRTSKADTISKDLVRRGFRGVGPTVVYSFMQVAGITNDHLISCFRHHDCVIAG 360 Query: 283 DLRDGDESLKTMNDGKMAEEISELELGRAIDDLGFTT 173 DLRD +E + + ++GK E+I ELEL R IDDL ++ Sbjct: 361 DLRDKNEDVTSKHEGKPPEDIMELELVRDIDDLSLSS 397 >ref|XP_011082395.1| PREDICTED: uncharacterized protein LOC105165174 [Sesamum indicum] Length = 395 Score = 499 bits (1286), Expect = e-138 Identities = 265/396 (66%), Positives = 304/396 (76%), Gaps = 17/396 (4%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNGKKSP 1133 MSGPP 
V+SM+F + E RPVLGP GNKARSVELRKPI KP SEK ++S + D+ GKK P Sbjct: 1 MSGPPMVQSMNFAEPEDRPVLGPTGNKARSVELRKPILKPKSEKTRQSPEADK--GKKPP 58 Query: 1132 VTV------TDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GR 974 T+ T+ + V + AASIL Q++PNLSLN + GR Sbjct: 59 ATLHSPEITTEKIPSPVGFRRNASSAASILRQRQPNLSLNASCSSDASTDSSHSRASTGR 118 Query: 973 ISRRRVTLTPTMRRKQQCSPKERNAQK------SFDGESEDID----LAKKRCAWVTSNT 824 I RR T TP +++KQQ SPK +K S GESE I+ L KKRCAWVTSNT Sbjct: 119 IGRRTGTSTPPLKKKQQFSPKGERIEKMAGNGKSVGGESEGIECDGSLVKKRCAWVTSNT 178 Query: 823 DPSYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVS 644 DPSYAA HDEEWG+PVHDDKKLFELLSFSTALAELTWPVIL+KR IFR+VFLDFDPIAVS Sbjct: 179 DPSYAAFHDEEWGLPVHDDKKLFELLSFSTALAELTWPVILSKRPIFRDVFLDFDPIAVS 238 Query: 643 KLNEKKVATPGSPASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNF 464 KLN+KK+AT GSPASSLLSEL LRAIIENARQICKIID++GSF+KYIW FVNYKPI+GNF Sbjct: 239 KLNDKKIATQGSPASSLLSELKLRAIIENARQICKIIDEVGSFDKYIWGFVNYKPIVGNF 298 Query: 463 RYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAG 284 RYPRQVPIKTSKADTISKDLVRRG RGVGPTVVYSFMQV+GITNDHL++CFRYQ+CIAAG Sbjct: 299 RYPRQVPIKTSKADTISKDLVRRGLRGVGPTVVYSFMQVAGITNDHLINCFRYQDCIAAG 358 Query: 283 DLRDGDESLKTMNDGKMAEEISELELGRAIDDLGFT 176 DLRD +E + + N+ E++ ELEL R IDDL + Sbjct: 359 DLRDKNEGITSNNEENPPEDLRELELVRDIDDLNLS 394 >ref|XP_012852805.1| PREDICTED: uncharacterized protein LOC105972398 [Erythranthe guttatus] gi|604305450|gb|EYU24594.1| hypothetical protein MIMGU_mgv1a007518mg [Erythranthe guttata] Length = 404 Score = 484 bits (1247), Expect = e-134 Identities = 260/403 (64%), Positives = 292/403 (72%), Gaps = 24/403 (5%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNGKK-- 1139 MSGPP VKSM+F + E RPVLGPAGNKARSVELRKPI K SEK QK D DE G Sbjct: 1 MSGPPLVKSMNFAEPEARPVLGPAGNKARSVELRKPILKQKSEKTQKPLDADEAKGNTAP 60 Query: 1138 ------SPVTVTDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT- 980 SP T+ + V K AASIL Q++PNLS+N + Sbjct: 61 SPAAFLSPEMKTEKIPSPVGFKKNASSAASILRQRQPNLSMNASCSSDASTDSSHSRAST 120 
Query: 979 GRISRRRVTLTPTMRRKQQCSPKE----------RNAQKSFDGESEDIDLAKKRCAWVTS 830 GR+ RR T TP +RRK QCSPK +N DG D L KKRCAWVTS Sbjct: 121 GRLLRRSATFTPPLRRKHQCSPKGERIEMIEGNGKNVGSESDGVVLDGSLVKKRCAWVTS 180 Query: 829 NTDPSYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIA 650 NTDP YAA HDEEWG+PVHDDKKLFELLS STALAEL+WPVIL+KR IFR+VFLDFDP A Sbjct: 181 NTDPLYAAFHDEEWGLPVHDDKKLFELLSLSTALAELSWPVILSKRSIFRDVFLDFDPAA 240 Query: 649 VSKLNEKKVATPGSPASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIG 470 VSKLN+KK+ATPGSPASSLLSE LRAI+ENARQICKIID+LGSF+KYIW FVNYKPI G Sbjct: 241 VSKLNDKKIATPGSPASSLLSEQKLRAIVENARQICKIIDELGSFDKYIWGFVNYKPIAG 300 Query: 469 NFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIA 290 NFRY RQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQV+GITNDHL++CFRYQ+CI Sbjct: 301 NFRYSRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVAGITNDHLINCFRYQDCII 360 Query: 289 AGD--LRDGDE---SLKTMNDGKMAEEISELELGRAIDDLGFT 176 AGD LRD + S+ + N+ +AE+ SEL+L IDDL + Sbjct: 361 AGDLILRDNNNNNWSIASKNEVNLAEDFSELDLATEIDDLNLS 403 >ref|XP_012856196.1| PREDICTED: uncharacterized protein LOC105975546 [Erythranthe guttatus] gi|604302147|gb|EYU21733.1| hypothetical protein MIMGU_mgv1a024334mg [Erythranthe guttata] Length = 390 Score = 460 bits (1184), Expect = e-126 Identities = 247/390 (63%), Positives = 293/390 (75%), Gaps = 11/390 (2%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNGKKSP 1133 MSGPPRVK M + E RPVLGP GNKARSVELRKP+ K SEK Q++QDVD+ GKKSP Sbjct: 1 MSGPPRVKLMTSAELEARPVLGPTGNKARSVELRKPMLKSKSEKAQRAQDVDDSKGKKSP 60 Query: 1132 VTVT--DHLALKVNSK---WINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GRI 971 + + K+ S NGR+A+ Q+ ++SLN + GRI Sbjct: 61 TALQLPETKPEKIPSPVGFMKNGRSAASFFMQR-SMSLNVSCSSDASSDSSHSRASTGRI 119 Query: 970 SRRRVTLTPTMRRKQQCSPKERNAQKSFDGESEDID---LAKKRCAWVTSNTDPSYAALH 800 S R T TP ++R QQ S K +K GE E +D + KKRCAWVT+NTDP YAA H Sbjct: 120 SWRSGTPTPPLKRNQQSSFKRERIEKIVGGEGEVVDGAAVVKKRCAWVTANTDPLYAAFH 179 Query: 799 DEEWGVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVA 620 DEEWG+ 
VHDDKKLFELLSFSTALAELTWPVIL+KRH+FREVFLDFDP AVSKLN+KK+A Sbjct: 180 DEEWGLAVHDDKKLFELLSFSTALAELTWPVILSKRHLFREVFLDFDPNAVSKLNDKKIA 239 Query: 619 TPGSPASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPI 440 TPGSPASSLLS+LNLRAI ENAR+ICKIID+ GSF+KYIW FVN+KPI+GNFRYPR VPI Sbjct: 240 TPGSPASSLLSDLNLRAITENARRICKIIDEFGSFDKYIWGFVNHKPIVGNFRYPRLVPI 299 Query: 439 KTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRD-GDE 263 KTSKADTISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+++CI A DL D +E Sbjct: 300 KTSKADTISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRHRDCITACDLSDKSNE 359 Query: 262 SLKT-MNDGKMAEEISELELGRAIDDLGFT 176 + T N+ K + I+E+EL R I+D+ + Sbjct: 360 GITTSKNEVKSLDNITEMELVRDINDVSLS 389 >emb|CDO98228.1| unnamed protein product [Coffea canephora] Length = 399 Score = 454 bits (1168), Expect = e-125 Identities = 240/402 (59%), Positives = 287/402 (71%), Gaps = 22/402 (5%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARS-VELRKPIGKPNSEKVQKSQDVDEFNGKKS 1136 MSGPPRV+SM+ +SEVRPVLGPAGNK RS +ELRKP+ KP V K Q+ ++ KKS Sbjct: 1 MSGPPRVRSMNHAESEVRPVLGPAGNKTRSALELRKPVSKPKISSVNKMQEGED---KKS 57 Query: 1135 PVTVTDHLALKVN-SKWINGRAASILGQQ-----------KPNLSLNXXXXXXXXXXXXX 992 P TVT L + K G +A+I+ QQ + NLS+N Sbjct: 58 PATVTMEKDLSPSPKKKFGGASAAIMSQQQQRQEVKSFLMRSNLSMNASCSSDASTDSSQ 117 Query: 991 XXXT-GRISRRRVTLTPTMRRKQQCSPKERNAQK--------SFDGESEDIDLAKKRCAW 839 + G+ISRR +T TP R++Q C PK +K + G ++D +A+KRCAW Sbjct: 118 SRASTGKISRRSLTPTPIRRKQQHCGPKVEKLEKVGSEVDSVAVVGLADD-SVARKRCAW 176 Query: 838 VTSNTDPSYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFD 659 VT NTDPSYAA HDEEWGVP H+DKKLFE LS STALAEL WP ILNKRH FREVF DFD Sbjct: 177 VTPNTDPSYAAFHDEEWGVPAHEDKKLFEFLSLSTALAELPWPTILNKRHTFREVFQDFD 236 Query: 658 PIAVSKLNEKKVATPGSPASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKP 479 P+AVSKLNEKK+ATPGSPASSLLSEL LRAI+ENARQ CKII++ GSFEKYIW FVNYKP Sbjct: 237 PVAVSKLNEKKIATPGSPASSLLSELKLRAIVENARQACKIIEEFGSFEKYIWGFVNYKP 296 Query: 478 IIGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQE 299 I+G+FRYPRQVPIKTSKAD 
ISKDLVRRGFRG+GPTVVYSFMQV+GITNDHL+SCFR+++ Sbjct: 297 IVGHFRYPRQVPIKTSKADAISKDLVRRGFRGIGPTVVYSFMQVAGITNDHLISCFRFRD 356 Query: 298 CIAAGDLRDGDESLKTMNDGKMAEEISELELGRAIDDLGFTT 173 C+ GD R+ D+ L +GK AE+ +E +D L +T Sbjct: 357 CVDVGDGRNKDDDLIATIEGKQAEDSAESGFEERLDALSLST 398 >ref|XP_009617988.1| PREDICTED: uncharacterized protein LOC104110244 [Nicotiana tomentosiformis] Length = 398 Score = 425 bits (1093), Expect = e-116 Identities = 234/410 (57%), Positives = 274/410 (66%), Gaps = 30/410 (7%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKP----------NSEKVQKSQD 1163 MSG PRVKSM+ SEVRPVLGPAGNKARSVELRKP KP K +K Q Sbjct: 1 MSGGPRVKSMNHADSEVRPVLGPAGNKARSVELRKPTEKPIKTNNKPAETEESKGKKFQG 60 Query: 1162 VDEFNGKKSPVTVTDHLALKVNSKWINGRAASILGQQ--------KPNLSLNXXXXXXXX 1007 D KSPV + G SIL QQ +PNLSLN Sbjct: 61 ADPLPQSKSPVAASKKC----------GSVPSILRQQQDHRTLLMRPNLSLNASCSSDAS 110 Query: 1006 XXXXXXXXT-GRISRRRVTLTPTMRRKQQCSPKERNAQKSFDGESE-----------DID 863 + G++SR +LTP R++QCSPK ++KS E D Sbjct: 111 TDSSHSRASTGKLSRG--SLTPKSGRRKQCSPKVDKSEKSGKSVGEVESLSPSPVSGDAS 168 Query: 862 LAKKRCAWVTSNTDPSYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPVILNKRHIF 683 + KKRCAWVT TDPSYAA HDEEWGVPVHDDKKLFELLS TALAEL+WP IL+KRH F Sbjct: 169 VIKKRCAWVTPTTDPSYAAFHDEEWGVPVHDDKKLFELLSLCTALAELSWPAILSKRHTF 228 Query: 682 REVFLDFDPIAVSKLNEKKVATPGSPASSLLSELNLRAIIENARQICKIIDDLGSFEKYI 503 REVF +FDP+AVSKLNEKK+A PGSPAS+LLSE+ LRAI+ENARQ CKIID+LGSF+KYI Sbjct: 229 REVFQNFDPVAVSKLNEKKIAPPGSPASTLLSEVKLRAIVENARQTCKIIDELGSFDKYI 288 Query: 502 WSFVNYKPIIGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHL 323 W FVN KPI+ FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL Sbjct: 289 WGFVNNKPIVSQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHL 348 Query: 322 VSCFRYQECIAAGDLRDGDESLKTMNDGKMAEEISELELGRAIDDLGFTT 173 +SCFR+ +C+AA D + D+ L + K ++ +E+ L RAIDD +T Sbjct: 349 ISCFRFHDCVAAIDGMENDDGLAAKTEVKQLKDETEMGLIRAIDDFNLST 398 >ref|XP_009802477.1| PREDICTED: uncharacterized protein LOC104248004 [Nicotiana sylvestris] Length = 399 Score = 421 bits 
(1082), Expect = e-115 Identities = 237/402 (58%), Positives = 277/402 (68%), Gaps = 22/402 (5%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNGKKSP 1133 MSG PRVKSM+ SEVRPVLGPAGNKARSVELRKPI KP K + +E GKK P Sbjct: 1 MSGGPRVKSMNHADSEVRPVLGPAGNKARSVELRKPIEKPVKTN-NKPAETEESKGKKFP 59 Query: 1132 -VTVTDHLALKVNSKWINGRAASILGQQ--------KPNLSLNXXXXXXXXXXXXXXXXT 980 V + G SIL QQ +PNLSLN + Sbjct: 60 GADPLPQSKSPVAASKKCGSVPSILRQQQDHRTLLMRPNLSLNASCSSDASTDSSHSRAS 119 Query: 979 -GRISRRRVTLTPTMRRKQQCSPKERNAQKSFD--GESE---------DIDLAKKRCAWV 836 G++SR +LTP R++QCSPK ++KS GESE D + KKRCAWV Sbjct: 120 TGKLSRG--SLTPKSGRRKQCSPKVDKSEKSGKSVGESESLSPSPVSGDASVIKKRCAWV 177 Query: 835 TSNTDPSYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDP 656 T TDPSYAA HDEEWGVPVHDDKKLFELLS TALAEL+WP IL+KRH FREVF +FDP Sbjct: 178 TPTTDPSYAAFHDEEWGVPVHDDKKLFELLSLCTALAELSWPAILSKRHTFREVFQNFDP 237 Query: 655 IAVSKLNEKKVATPGSPASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPI 476 +AVSKLNEKK+A PGSPAS+LLSE+ LRAIIENARQ CKIID+LGSF+KY+W FVN KPI Sbjct: 238 VAVSKLNEKKIAPPGSPASTLLSEVKLRAIIENARQTCKIIDELGSFDKYMWGFVNNKPI 297 Query: 475 IGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQEC 296 + FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ +C Sbjct: 298 VSQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHDC 357 Query: 295 IAAGDLRDGDESLKTMNDGK-MAEEISELELGRAIDDLGFTT 173 +AA D D D+ L + K ++ +E+ L RAI D +T Sbjct: 358 VAAIDGMDKDDGLVAKTEVKQQLKDETEMGLIRAIADFNLST 399 >ref|XP_004235942.1| PREDICTED: uncharacterized protein LOC101247118 [Solanum lycopersicum] Length = 395 Score = 419 bits (1078), Expect = e-114 Identities = 232/402 (57%), Positives = 278/402 (69%), Gaps = 22/402 (5%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNGKKSP 1133 MSG PRVK M+ SEVR VLGPAGNKARSVELRKP+ KP V+K+ + +E GKK Sbjct: 1 MSGGPRVKLMNNADSEVRSVLGPAGNKARSVELRKPVEKP----VKKAAESEESKGKKFE 56 Query: 1132 VTVTDHLALKVNSKWINGRAASILGQQ--------KPNLSLNXXXXXXXXXXXXXXXXTG 977 T + + ++ G SIL QQ 
+PNLSLN + Sbjct: 57 GTDS---VPQSRARKCGGAVPSILRQQQDHRSLMMRPNLSLNASCSSDASTDSSHSRAST 113 Query: 976 RISRRRVTLTPTMRRKQQCS-PK----ERNAQKSFDGES-------EDIDLAKKRCAWVT 833 R ++TPT R++QCS PK E+ + +GES +D + KKRCAWVT Sbjct: 114 TGKMSRGSVTPTAGRRKQCSSPKVVKSEKIGKTVGEGESLASSPTPDDASVMKKRCAWVT 173 Query: 832 SNTDPSYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPI 653 NTDPSYAA HDEEWGV VHDDKKLFELLS TALAEL+WP IL+KRH+FREVF +FDP+ Sbjct: 174 PNTDPSYAAFHDEEWGVSVHDDKKLFELLSLCTALAELSWPAILSKRHMFREVFQNFDPV 233 Query: 652 AVSKLNEKKVATPGSPASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPII 473 AVSKLNEKK+A PGSPAS+LLSE+ LRA+IENARQ CKIID+LGSF+KYIW FVN KPI+ Sbjct: 234 AVSKLNEKKIAPPGSPASTLLSEVKLRAVIENARQTCKIIDELGSFDKYIWGFVNNKPIV 293 Query: 472 GNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECI 293 FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ +C+ Sbjct: 294 SQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHDCV 353 Query: 292 AAGDLRDGDESLKTMNDGKMAEEISELELG--RAIDDLGFTT 173 AA D D D+ L + K + E E+G RAIDD +T Sbjct: 354 AATDGTDKDDGLAAKTEVKQLQLKDETEMGLIRAIDDFNLST 395 >ref|XP_006341344.1| PREDICTED: uncharacterized protein LOC102594169 [Solanum tuberosum] Length = 399 Score = 417 bits (1072), Expect = e-113 Identities = 232/403 (57%), Positives = 277/403 (68%), Gaps = 23/403 (5%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNGKKSP 1133 MSG PRVK M+ SEVR VLGPAGNKARSVELRKP+ KP ++K+ + +E GKK Sbjct: 1 MSGGPRVKLMNNADSEVRSVLGPAGNKARSVELRKPVEKP----IKKAAESEESKGKKFE 56 Query: 1132 VT--VTDHLALKVNSKWINGRAASILGQQ--------KPNLSLNXXXXXXXXXXXXXXXX 983 T V A SK G SIL QQ +PNLSLN Sbjct: 57 GTDSVPQSRAPVAASKKCGGAVPSILRQQQDHRSLMMRPNLSLNASCSSDASTDSSHSRA 116 Query: 982 TGRISRRRVTLTPTMRRKQQCS-PKERNAQK--SFDGESE---------DIDLAKKRCAW 839 + R ++TPT R++QCS PK ++K GE + D + KKRCAW Sbjct: 117 STTGKLSRGSVTPTAGRRKQCSSPKVVKSEKIGKTVGEGQSLASSPTPGDASVMKKRCAW 176 Query: 838 VTSNTDPSYAALHDEEWGVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFD 659 VT NTDPSYAA HDEEWGV +HDDKKLFELLS TALAEL+WP IL+KRH+FREVF +FD 
Sbjct: 177 VTPNTDPSYAAFHDEEWGVSIHDDKKLFELLSLCTALAELSWPAILSKRHMFREVFQNFD 236 Query: 658 PIAVSKLNEKKVATPGSPASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKP 479 P+AVSKLNEKK+A PGSPAS+LLSE+ LRA+IENARQ CKIID+LGSF+KYIW FVN KP Sbjct: 237 PVAVSKLNEKKIAPPGSPASTLLSEVKLRAVIENARQTCKIIDELGSFDKYIWGFVNNKP 296 Query: 478 IIGNFRYPRQVPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQE 299 I+ FRY RQVP+KTSKA+ ISKDLV+RGFRGVGPTVVYSFMQV+GITNDHL+SCFR+ + Sbjct: 297 IVSQFRYARQVPMKTSKAEGISKDLVKRGFRGVGPTVVYSFMQVAGITNDHLISCFRFHD 356 Query: 298 CIAAGDLRDGDESLKTMNDGK-MAEEISELELGRAIDDLGFTT 173 C+AA D D D+ L + K ++ +E+ L RAIDD +T Sbjct: 357 CVAATDGTDKDDGLAAKTEVKQQLKDETEMGLIRAIDDFNLST 399 >ref|XP_010265584.1| PREDICTED: uncharacterized protein LOC104603287 [Nelumbo nucifera] Length = 380 Score = 405 bits (1040), Expect = e-110 Identities = 227/381 (59%), Positives = 262/381 (68%), Gaps = 5/381 (1%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNG-KKS 1136 MSG PRV+SM+ S+ RPVLGP GNK S+ RKP+ KP KV+KS +V NG KK+ Sbjct: 1 MSGAPRVRSMNVADSDARPVLGPTGNKTGSLVTRKPVSKP-LRKVEKSPEVA--NGEKKT 57 Query: 1135 PVTVTDHLALKVNSKWINGRAASILGQQK---PNLSLNXXXXXXXXXXXXXXXXT-GRIS 968 P + K+ S + SIL + + NLSLN + GRI Sbjct: 58 PSSPVAPSPPKLQSASV----PSILRRHEFLHSNLSLNASCSSDASSDSVYSRASTGRII 113 Query: 967 RRRVTLTPTMRRKQQCSPKERNAQKSFDGESEDIDLAKKRCAWVTSNTDPSYAALHDEEW 788 R T + T RRK+ S E+ A S S + K+RCAWVT NTDP YAA HDEEW Sbjct: 114 R---TSSTTSRRKRSISRPEKVAPDSVSDSSSESIQTKRRCAWVTPNTDPCYAAFHDEEW 170 Query: 787 GVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 608 GVPVHDDKKLFE L S ALAEL WPVIL+KRHIFREVF DFDP+AVSKLNEKK+ TPG Sbjct: 171 GVPVHDDKKLFEFLVLSGALAELPWPVILSKRHIFREVFADFDPVAVSKLNEKKITTPGG 230 Query: 607 PASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPIKTSK 428 A SLLSEL LRAIIENARQICK+ID+ GSF YIWSFVN+KPII FRYPRQVP+KT K Sbjct: 231 TAISLLSELKLRAIIENARQICKVIDEFGSFNNYIWSFVNHKPIISKFRYPRQVPVKTPK 290 Query: 427 ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDESLKTM 248 AD ISKDLVRRGFR 
VGPTV+YSFMQV+GITNDHL++CFRYQECI A + DE K Sbjct: 291 ADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLINCFRYQECIDATAAIE-DEGSKAK 349 Query: 247 NDGKMAEEISELELGRAIDDL 185 + K E+I LELG+AID+L Sbjct: 350 AEEKKTEDIINLELGKAIDEL 370 >ref|XP_010275821.1| PREDICTED: uncharacterized protein LOC104610746 [Nelumbo nucifera] Length = 387 Score = 393 bits (1009), Expect = e-106 Identities = 220/391 (56%), Positives = 257/391 (65%), Gaps = 15/391 (3%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDV--DEFNGKK 1139 MSG PRV+S++ SE RPVLGPAGNK RS+ RKP KP KV+K+ + +E Sbjct: 1 MSGAPRVRSINVADSEARPVLGPAGNKTRSLVTRKPASKP-LRKVEKTPEAVDEEKKAPS 59 Query: 1138 SPVTVTDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXT-GRISRR 962 SPV + V+ I R + NLSLN + GR+ R Sbjct: 60 SPVAASPPKLQPVSVPSILRRHEFL----HSNLSLNASCSSDASSDSVYSRASTGRLIRT 115 Query: 961 RVTLTPTMRRKQQCSPKERNAQKSFDGESEDIDLAKKRCAWVTSNTDPSYAALHDEEWGV 782 R T + RRK S E+ S S D KKRCAWVT NTDP YAA HDEEWGV Sbjct: 116 RSTPS---RRKYSISRPEKVVPDSASDSSPDSIETKKRCAWVTPNTDPCYAAFHDEEWGV 172 Query: 781 PVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPA 602 PVHDDKKLFELL S ALAELTWP IL+KRHIFREVF DFDP+AVSKLNEKK+ PGS A Sbjct: 173 PVHDDKKLFELLVLSGALAELTWPTILSKRHIFREVFSDFDPVAVSKLNEKKITAPGSTA 232 Query: 601 SSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPIKTSKAD 422 SSLLSEL LRAIIENARQICK+ID+ GSF+ YIWSFVN+KPII FRYPRQVP+K KAD Sbjct: 233 SSLLSELKLRAIIENARQICKVIDEFGSFDNYIWSFVNHKPIISKFRYPRQVPVKIPKAD 292 Query: 421 TISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDESL----- 257 ISKDLVRRGFR VGPTVVYSFMQV+GITNDHL++CFR+Q C+ + +GD+ L Sbjct: 293 VISKDLVRRGFRSVGPTVVYSFMQVAGITNDHLINCFRFQVCMDTPTVSEGDDKLRIGKA 352 Query: 256 -------KTMNDGKMAEEISELELGRAIDDL 185 K + K E++ + ELG+A+D L Sbjct: 353 EETPTGSKGTAEEKKTEDMIKSELGKAMDKL 383 >ref|XP_007011936.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|590572766|ref|XP_007011937.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|590572769|ref|XP_007011938.1| DNA glycosylase superfamily 
protein isoform 1 [Theobroma cacao] gi|590572773|ref|XP_007011939.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782299|gb|EOY29555.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782300|gb|EOY29556.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782301|gb|EOY29557.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] gi|508782302|gb|EOY29558.1| DNA glycosylase superfamily protein isoform 1 [Theobroma cacao] Length = 379 Score = 387 bits (993), Expect = e-104 Identities = 213/384 (55%), Positives = 259/384 (67%), Gaps = 4/384 (1%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQ-DVDEFNGKKS 1136 MSG PR++SM+ SE RPVLGPAGNKA S+ RKP KP KV+KS +V KK+ Sbjct: 1 MSGAPRMRSMNVADSEARPVLGPAGNKAGSLSARKPASKP-LRKVEKSPVEVTVAEEKKA 59 Query: 1135 -PVTVTDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRR 959 P + + L+ K +S + S+L + + L N R S R Sbjct: 60 LPSSTVNSLSPKTHSVSV----PSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGR 115 Query: 958 VTLTPTM--RRKQQCSPKERNAQKSFDGESEDIDLAKKRCAWVTSNTDPSYAALHDEEWG 785 + + ++ RRK S D KKRCAWVT NTDPSY A HDEEWG Sbjct: 116 LIRSNSVGNRRKPYASKPRSVVSDGGLDSPPDGSHQKKRCAWVTPNTDPSYVAFHDEEWG 175 Query: 784 VPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSP 605 VPVHDD+KLFELL S AL+ELTWP IL+KRHI REVF+DFD +AVSKLNEKK+ TPGS Sbjct: 176 VPVHDDRKLFELLVLSGALSELTWPAILSKRHIVREVFVDFDAVAVSKLNEKKLVTPGSI 235 Query: 604 ASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPIKTSKA 425 ASSLLSEL LRAIIENARQI K+ID+ GSF++YIWSFVN+KPI+ FRYPRQVP+KT KA Sbjct: 236 ASSLLSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRQVPVKTPKA 295 Query: 424 DTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDESLKTMN 245 D ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL SCFR+QECI A + ++ + +K M Sbjct: 296 DVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEGKE-ENGIKDMP 354 Query: 244 DGKMAEEISELELGRAIDDLGFTT 173 + K E + E +L AID+L F++ Sbjct: 355 EEKKTENVMESKLSIAIDELSFSS 378 >ref|XP_012442673.1| PREDICTED: 
uncharacterized protein LOC105767651 isoform X1 [Gossypium raimondii] gi|763787989|gb|KJB54985.1| hypothetical protein B456_009G057100 [Gossypium raimondii] gi|763787990|gb|KJB54986.1| hypothetical protein B456_009G057100 [Gossypium raimondii] Length = 379 Score = 382 bits (981), Expect = e-103 Identities = 210/384 (54%), Positives = 258/384 (67%), Gaps = 4/384 (1%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNGKKS- 1136 MSG PR++SM+ T SE RPVLGPAGNKA S+ RKP KP+ + + S +V KK+ Sbjct: 1 MSGAPRLRSMNVTDSEARPVLGPAGNKAGSLSARKPASKPSKKVEKSSVEVTVVEEKKAL 60 Query: 1135 PVTVTDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRV 956 P + + L+ K +S + S+L + + L + R S R+ Sbjct: 61 PSSTVNSLSPKTHSLSV----PSVLRRHERLLHSSLSLNASCSSDASTDSFQSRASTGRL 116 Query: 955 TLTPTM--RRKQQCS-PKERNAQKSFDGESEDIDLAKKRCAWVTSNTDPSYAALHDEEWG 785 + ++ RRK S PK + S D S KKRC WVT NTDPSYAA HDEEWG Sbjct: 117 SRCGSLGSRRKPYASKPKSLVSDDSLDLSSNSSH-HKKRCTWVTPNTDPSYAAFHDEEWG 175 Query: 784 VPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSP 605 VPVHDDKKLFELL S +L+ELTW IL+KRHIFREVF+DFDP+AVSKLNEKK+ GS Sbjct: 176 VPVHDDKKLFELLVLSGSLSELTWSAILSKRHIFREVFMDFDPVAVSKLNEKKLIAHGSV 235 Query: 604 ASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPIKTSKA 425 ASSLLSEL LRAI+ENARQI K+ID+ SF++YIWSFVN+KPI+ FRYPRQVP+KT KA Sbjct: 236 ASSLLSELMLRAIVENARQISKVIDEFRSFDQYIWSFVNHKPIVSRFRYPRQVPVKTPKA 295 Query: 424 DTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDESLKTMN 245 D ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL SCFR+QECI A + ++ + K Sbjct: 296 DVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEAKE-ENVTKDTT 354 Query: 244 DGKMAEEISELELGRAIDDLGFTT 173 + K + EL AID+L F+T Sbjct: 355 EKKETVNVINTELSVAIDELSFST 378 >ref|XP_003527169.1| PREDICTED: uncharacterized protein LOC100801026 isoform X1 [Glycine max] gi|571461733|ref|XP_006582090.1| PREDICTED: uncharacterized protein LOC100801026 isoform X2 [Glycine max] gi|571461735|ref|XP_006582091.1| PREDICTED: uncharacterized protein LOC100801026 isoform X3 [Glycine 
max] gi|734430051|gb|KHN45352.1| Putative GMP synthase [glutamine-hydrolyzing] [Glycine soja] Length = 383 Score = 382 bits (981), Expect = e-103 Identities = 204/382 (53%), Positives = 252/382 (65%), Gaps = 2/382 (0%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNGKKSP 1133 MSG PR++SM+ SE RPVLGPAGNK S+ RK KP +KV K D +K P Sbjct: 1 MSGAPRLRSMNVADSEARPVLGPAGNKTGSLSSRKTASKPLRKKVDKLLDEIASVKEKKP 60 Query: 1132 VTVTDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRVT 953 V + + + + +L + + L N R S R+T Sbjct: 61 HQVLLSSVATSSPQSHSASVSLLLPRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRLT 120 Query: 952 LTPTM--RRKQQCSPKERNAQKSFDGESEDIDLAKKRCAWVTSNTDPSYAALHDEEWGVP 779 + ++ RRK S A D + KRCAWVT NT+P YA HDEEWGVP Sbjct: 121 RSYSLGSRRKPYVSKPRSVASDGVLESPTDGSQSNKRCAWVTPNTEPCYATFHDEEWGVP 180 Query: 778 VHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSPAS 599 VHDDKKLFELL S+ LAE TWP IL+KRHIFREVF+DF+P+AVSKLNEKK+ TPG+ AS Sbjct: 181 VHDDKKLFELLVLSSVLAEHTWPAILSKRHIFREVFVDFEPVAVSKLNEKKIMTPGTIAS 240 Query: 598 SLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPIKTSKADT 419 SLLSE+ LRAIIENARQI K+ID+ GSF+KYIWSFVN+KPI+ FRYPRQVP+KT KAD Sbjct: 241 SLLSEVKLRAIIENARQISKVIDEFGSFDKYIWSFVNHKPIVSRFRYPRQVPVKTPKADV 300 Query: 418 ISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDESLKTMNDG 239 ISKDLVRRGFRGVGPTVVYSFMQV+G+T DHL+SCFR++ECIAA + ++ + + D Sbjct: 301 ISKDLVRRGFRGVGPTVVYSFMQVAGLTIDHLISCFRFEECIAAAEGKEENGIMDNHADQ 360 Query: 238 KMAEEISELELGRAIDDLGFTT 173 K +E I E +L A++DL F + Sbjct: 361 KESENIMESDLSIAMEDLSFAS 382 >ref|XP_002529378.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] gi|223531126|gb|EEF32974.1| DNA-3-methyladenine glycosylase, putative [Ricinus communis] Length = 380 Score = 379 bits (973), Expect = e-102 Identities = 208/385 (54%), Positives = 255/385 (66%), Gaps = 5/385 (1%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGN-KARSVELRKPIGKPNSEKVQKSQDVDEFNGKKS 1136 MSG PRV+SM+ SE RPVLGP GN KA S+ +KP K KV+ S + + +K Sbjct: 1 
MSGAPRVRSMNVADSETRPVLGPTGNNKAGSLSAKKPASK-QLRKVETSPEAVKLGQEKK 59 Query: 1135 PVTVTDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRV 956 VTV AL S ++ S+L + + L N R S R+ Sbjct: 60 LVTVPTASALSPKSHSVS--VPSVLRRHEQLLHSNLSLNASCSSDASTDSFHSRASTGRL 117 Query: 955 TLTPTM-RRKQQCSPKERNAQKSFDGES---EDIDLAKKRCAWVTSNTDPSYAALHDEEW 788 T + ++ R++Q + K R+ ES D AKK CAWVT N DP Y A HDEEW Sbjct: 118 TRSNSLGTRRKQYALKPRSVVSDGGLESPPPSDGSQAKKSCAWVTPNADPCYTAFHDEEW 177 Query: 787 GVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 608 G+PVHDDKKLFELL S ALAELTWP IL+KRHIFREVF +FDP+ VSK NEKK+ PGS Sbjct: 178 GIPVHDDKKLFELLVLSGALAELTWPAILSKRHIFREVFANFDPVVVSKFNEKKIIAPGS 237 Query: 607 PASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPIKTSK 428 ASSLLSE+ LRAIIENARQI K+ D+LGSF+KYIWSFVNYKPI+ FRYPRQVP+KT K Sbjct: 238 TASSLLSEIKLRAIIENARQISKVTDELGSFDKYIWSFVNYKPIVSRFRYPRQVPVKTPK 297 Query: 427 ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDESLKTM 248 AD ISKDLVRRGFR VGPTVVYSFMQV+G+TNDHL+SCFR+QECI A +G E Sbjct: 298 ADVISKDLVRRGFRSVGPTVVYSFMQVAGLTNDHLISCFRFQECINAA---EGKEENGVK 354 Query: 247 NDGKMAEEISELELGRAIDDLGFTT 173 + K+ + + E ++ A+D+L F++ Sbjct: 355 VEDKITDGVVESQISIAMDELSFSS 379 >gb|KHG02440.1| putative GMP synthase [glutamine-hydrolyzing] [Gossypium arboreum] Length = 379 Score = 379 bits (972), Expect = e-102 Identities = 211/385 (54%), Positives = 261/385 (67%), Gaps = 5/385 (1%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQ-DVDEFNGKKS 1136 MSG PR++SM+ T SE RPVLGPAGNKA S+ RKP K +S+KV+KS +V KK+ Sbjct: 1 MSGAPRLRSMNVTDSEARPVLGPAGNKAGSLSARKPASK-SSKKVEKSSVEVTVVGEKKA 59 Query: 1135 -PVTVTDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRR 959 P + + L+ K +S + S+L + + L + R S R Sbjct: 60 LPSSTVNSLSPKTHSLSV----PSVLRRHERLLHSSLSLNASCSSDASTDSFQSRASTGR 115 Query: 958 VTLTPTM--RRKQQCS-PKERNAQKSFDGESEDIDLAKKRCAWVTSNTDPSYAALHDEEW 788 + ++ RRK S PK + S D S KKRCAWVT +TDPSYAA HDEEW Sbjct: 116 LNRCDSLGSRRKPYASKPKSVVSDDSLDLSSNSSH-PKKRCAWVTPSTDPSYAAFHDEEW 174 Query: 787 
GVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 608 GVPVHDD+KLFELL S +L+ELTW IL+KRHIFREVF+DFDP+AVSKLNEKK+ GS Sbjct: 175 GVPVHDDRKLFELLVLSGSLSELTWSAILSKRHIFREVFIDFDPVAVSKLNEKKLIAHGS 234 Query: 607 PASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPIKTSK 428 ASSLLSEL LR I+ENARQI K+ID+ GSF++YIWSFVN+KPI+ FRYPRQVP+KT K Sbjct: 235 VASSLLSELKLRVIVENARQISKVIDEFGSFDQYIWSFVNHKPIVSRFRYPRQVPVKTPK 294 Query: 427 ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDESLKTM 248 AD ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL SCFR+QECI A + ++ + K Sbjct: 295 ADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTSCFRFQECITAAEAKE-ENVTKDP 353 Query: 247 NDGKMAEEISELELGRAIDDLGFTT 173 + K + EL AID+L F++ Sbjct: 354 TEKKETVNVINTELSVAIDELSFSS 378 >ref|XP_002276173.1| PREDICTED: uncharacterized protein LOC100267363 isoform X2 [Vitis vinifera] gi|297743642|emb|CBI36525.3| unnamed protein product [Vitis vinifera] Length = 375 Score = 378 bits (971), Expect = e-102 Identities = 208/392 (53%), Positives = 256/392 (65%), Gaps = 12/392 (3%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKA-RSVELRKPIGKPNSEKVQKSQDVDEFNGKKS 1136 MSG PRV+SM+ SEVRPVLGPAGNK RS+ RKP KP + + ++D +E S Sbjct: 1 MSGGPRVRSMNVADSEVRPVLGPAGNKTMRSLSGRKPATKPLRKAEKATKDDEEIKALPS 60 Query: 1135 PVTVTDHLALKVNSKWINGRAAS----------ILGQQKPNLSLNXXXXXXXXXXXXXXX 986 NG A+S +L +Q+ L N Sbjct: 61 S----------------NGAASSPPSHSVSVPLVLRRQEQLLHSNLSLNASCSSDASTDS 104 Query: 985 XTGRISRRRVTLTPTMRRKQQCSPKERNAQKSFDGESEDIDL-AKKRCAWVTSNTDPSYA 809 R S R+T + + R++ + K + ES L AK+RCAWVT NTD SY Sbjct: 105 FHSRASTGRITRSSSTARRRSYASKPKVIVSDGVSESPPDGLKAKRRCAWVTPNTDLSYI 164 Query: 808 ALHDEEWGVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEK 629 A HDEEWGVPVHDDKKLFELL S ALAELTWP IL+KRHIFREVF DFDPIAV+KLNEK Sbjct: 165 AFHDEEWGVPVHDDKKLFELLVLSGALAELTWPTILSKRHIFREVFADFDPIAVAKLNEK 224 Query: 628 KVATPGSPASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQ 449 K+ PGS ASSL+SEL LR IIENARQ+ K+ID+ GSF++YIWSFVN+KPI+ FRYPR Sbjct: 225 
KLMAPGSIASSLISELKLRGIIENARQMSKVIDEFGSFDEYIWSFVNHKPIVSRFRYPRH 284 Query: 448 VPIKTSKADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDG 269 VP+KT KAD ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL+SCFR+Q+C+ A +++ Sbjct: 285 VPVKTPKADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQDCVTAAEVK-- 342 Query: 268 DESLKTMNDGKMAEEISELELGRAIDDLGFTT 173 +E + T + + E EL RAID+L F++ Sbjct: 343 EEEITTGAAEEKKSNVIESELSRAIDELSFSS 374 >ref|XP_012462430.1| PREDICTED: uncharacterized protein LOC105782309 [Gossypium raimondii] gi|763815982|gb|KJB82834.1| hypothetical protein B456_013G216100 [Gossypium raimondii] gi|763815983|gb|KJB82835.1| hypothetical protein B456_013G216100 [Gossypium raimondii] Length = 381 Score = 378 bits (970), Expect = e-102 Identities = 214/388 (55%), Positives = 255/388 (65%), Gaps = 9/388 (2%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQ-DVDEFNGKKS 1136 MSG PR++SM+ SE RPVLGPAGNKA S+ RKP KP KV+KS +V KKS Sbjct: 1 MSGAPRLRSMNAPDSEARPVLGPAGNKAGSLSARKPASKP-LRKVEKSPVEVTATEEKKS 59 Query: 1135 -PVTVTDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRR 959 P ++ L+ K +S + S+L + + L N R S R Sbjct: 60 LPSSIVSSLSPKKHSVSV----PSVLRRHEKLLHSNLSLNASCSSDASTDSFHSRASTGR 115 Query: 958 VTLTPTM--RRKQQCSPKERNAQKSFDGESEDIDLAKKRCAWVTSNTDPSYAALHDEEWG 785 + + ++ RRK S S D KKRCAWVT NTDPSYA HDEEWG Sbjct: 116 LIRSNSVGSRRKPYVSKPRSFVSDSGSDSPSDGSHQKKRCAWVTPNTDPSYATFHDEEWG 175 Query: 784 VPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVATPGSP 605 VPVHDDKKLFELL S AL+ELTWP IL+KR +FREVF+DFDP AVSKLNEKK+ PGS Sbjct: 176 VPVHDDKKLFELLVLSGALSELTWPAILSKRQMFREVFMDFDPAAVSKLNEKKLIAPGSV 235 Query: 604 ASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPIKTSKA 425 +SSLLSEL LRAIIENARQI K+ID+ GSF++YIWSFVN+KPII FRYPRQVP+KT KA Sbjct: 236 SSSLLSELKLRAIIENARQISKVIDEFGSFDEYIWSFVNHKPIISKFRYPRQVPVKTPKA 295 Query: 424 DTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAG-----DLRDGDES 260 D ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL CFR+QECI A ++++ E Sbjct: 296 
DVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLTGCFRFQECITAAEGKEVEIKERAEE 355 Query: 259 LKTMNDGKMAEEISELELGRAIDDLGFT 176 K N + + E EL AID+L F+ Sbjct: 356 KKPDN----SVSVIESELSIAIDELSFS 379 >ref|XP_002324538.1| methyladenine glycosylase family protein [Populus trichocarpa] gi|222865972|gb|EEF03103.1| methyladenine glycosylase family protein [Populus trichocarpa] Length = 380 Score = 378 bits (970), Expect = e-102 Identities = 209/389 (53%), Positives = 258/389 (66%), Gaps = 9/389 (2%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGN-KARSVELRKPIGKPNSEKVQKSQDVDEFNGKKS 1136 MSG PRV+SM+ SE R VLGP GN KA + RKP+ K S KV+KS + + +K Sbjct: 1 MSGAPRVRSMNVADSEARSVLGPTGNNKAGPLSARKPVSK-QSRKVEKSPEEVKLGEEKK 59 Query: 1135 PVTVTDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRRRV 956 +TV L S +N +S+L + + L N R S R+ Sbjct: 60 TLTVPAVGTLSPKSHSLN--ISSVLRRHELLLHSNLSLNASCSSDASTDSFHSRASTGRL 117 Query: 955 TLTP---TMRRKQQCSPKERNAQKSFDGE-SEDIDLAKKRCAWVTSNTDPSYAALHDEEW 788 T + T R++ P+ ++ + S D +KK CAWVT NTDP YA HDEEW Sbjct: 118 TRSNSAGTRRKQYVLRPRSFVSEGGLESPPSPDDSQSKKSCAWVTPNTDPCYATFHDEEW 177 Query: 787 GVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 608 GVP+HDD+KLFELL S ALAELTWP IL+KRHIFREVF DFDPIAVSK NEKK+ PGS Sbjct: 178 GVPIHDDRKLFELLVLSGALAELTWPAILSKRHIFREVFADFDPIAVSKFNEKKILAPGS 237 Query: 607 PASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPIKTSK 428 A+SLLSEL LRAI+ENARQI K+ID+ GSF+KYIWSFVNYKPI+ FRYPRQVP+KT K Sbjct: 238 TATSLLSELKLRAIVENARQISKVIDEFGSFDKYIWSFVNYKPIVSRFRYPRQVPVKTPK 297 Query: 427 ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECI--AAGDLRDG--DES 260 AD ISKDLVRRGFR VGPTV+YSFMQV+GITNDHL+SCFR+QEC+ A G + +G E Sbjct: 298 ADAISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLISCFRFQECLDAAEGKVENGIKSED 357 Query: 259 LKTMNDGKMAEEISELELGRAIDDLGFTT 173 +KT ++ E ++ AID+L F++ Sbjct: 358 IKT-------NDVMESKISIAIDELSFSS 379 >ref|XP_009358441.1| PREDICTED: uncharacterized protein LOC103949071 [Pyrus x bretschneideri] Length = 378 Score = 377 bits (968), Expect = e-101 Identities = 211/385 (54%), Positives = 
252/385 (65%), Gaps = 5/385 (1%) Frame = -1 Query: 1312 MSGPPRVKSMDFTQSEVRPVLGPAGNKARSVELRKPIGKPNSEKVQKSQDVDEFNGKK-- 1139 MSG PRV+S++ SE RPVLGPAGNKA + RKP KP + + S++V KK Sbjct: 1 MSGAPRVRSINVADSESRPVLGPAGNKAGTFSARKPASKPLRKAEKFSEEVSSAEEKKTH 60 Query: 1138 -SPVTVTDHLALKVNSKWINGRAASILGQQKPNLSLNXXXXXXXXXXXXXXXXTGRISRR 962 SP+ T + + + + S+L + + L N R S Sbjct: 61 QSPMLTT-------SPQPHSPKVHSVLRRHEQLLHSNFSLNASCSSDASTDSFQSRASTG 113 Query: 961 RVTLTPTM--RRKQQCSPKERNAQKSFDGESEDIDLAKKRCAWVTSNTDPSYAALHDEEW 788 R+ + ++ RRKQ S D +KKRCAWVT N DP YAA HDEEW Sbjct: 114 RLIRSNSVGSRRKQYVSKPRSVVSDGGLDSPPDGSQSKKRCAWVTPNADPCYAAFHDEEW 173 Query: 787 GVPVHDDKKLFELLSFSTALAELTWPVILNKRHIFREVFLDFDPIAVSKLNEKKVATPGS 608 G+PVHDDKKLFELL S ALAEL+WP IL+K+HIFREVF DFDPIAVSKLNEKK+ +PGS Sbjct: 174 GLPVHDDKKLFELLVLSGALAELSWPAILSKKHIFREVFADFDPIAVSKLNEKKLISPGS 233 Query: 607 PASSLLSELNLRAIIENARQICKIIDDLGSFEKYIWSFVNYKPIIGNFRYPRQVPIKTSK 428 ASSLLSEL LRAIIENARQ K+I++ GSF+KYIWSFVN KPI FRYPRQVP+KT K Sbjct: 234 AASSLLSELKLRAIIENARQTTKVIEEFGSFDKYIWSFVNNKPIESRFRYPRQVPVKTPK 293 Query: 427 ADTISKDLVRRGFRGVGPTVVYSFMQVSGITNDHLVSCFRYQECIAAGDLRDGDESLKTM 248 AD ISKDLVRRGFR VGPTV+YSFMQV+GITNDHLVSCFR+QEC+ A + DG+ +K Sbjct: 294 ADVISKDLVRRGFRSVGPTVIYSFMQVAGITNDHLVSCFRFQECVNAAE-GDGENGIKD- 351 Query: 247 NDGKMAEEISELELGRAIDDLGFTT 173 GK E E EL AID L F++ Sbjct: 352 EAGKKTENGIESELSVAIDKLSFSS 376