BLASTX nr result
ID: Atractylodes21_contig00014105
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00014105 (3221 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

ref|XP_002530889.1| conserved hypothetical protein [Ricinus comm...   443   e-121
ref|XP_002267310.1| PREDICTED: transcriptional activator DEMETER...   416   e-113
ref|XP_002316518.1| predicted protein [Populus trichocarpa] gi|2...   379   e-102
emb|CBI40219.3| unnamed protein product [Vitis vinifera]              343   2e-91
gb|AEC12445.1| DNA N-glycosylase/DNA-(apurinic or apyrimidinic s...   342   3e-91

>ref|XP_002530889.1| conserved hypothetical protein [Ricinus communis] gi|223529542|gb|EEF31495.1| conserved hypothetical protein [Ricinus communis] Length = 1876 Score = 443 bits (1140), Expect = e-121 Identities = 363/1061 (34%), Positives = 511/1061 (48%), Gaps = 71/1061 (6%) Frame = -1 Query: 3020 DDPANWNCNLLA-AVVRPKKSSASFPSLIGAHNNSVHALNRTPVPNSSTQVGSNSIGPST 2844 ++ +W+ N LA +V + ++PS N S+ R +PN +T V NS+ Sbjct: 112 NENVSWSSNSLADLLVMNNTAPTAYPSRTLHRNTSI--AERPLIPNLNTPV--NSLREFN 167 Query: 2843 VGSMVGNKKSHTFASNKPMGGYNSKQLPTNGFPVPYRPCYNLNSPPRSELDAASSGITGP 2664 G + ++H +SN P G + Q+P GFP+PY P Y+LNSPP E DAAS+ +T Sbjct: 168 SGELFYTNQAHCSSSNVPSGHNSLFQMPQYGFPIPYNPNYDLNSPPSIEADAAST-VTNS 226 Query: 2663 LPFAPITPDTRRKHTDNQWVPAKDRHEGQRNE-DADNHY-------------NEQLQTIG 2526 FAPI ++ + + +G E D ++Y ++ Q I Sbjct: 227 FQFAPIIEQAKKLENQLSALVNFPQGKGSSEERDKQDNYVVSLGNVPNQHNPDKLFQNIV 286 Query: 2525 DSTSSAVSTTQKEHLVSEEGDELGIDLNKTPQQKTPARRKKHRPKVIREXXXXXXXXXXX 2346 DS S+ +ST +E S +G + IDLNKTPQQKTP RRK HRPKVI E Sbjct: 287 DSASAVISTPFEEPKESCQGSDQVIDLNKTPQQKTPKRRK-HRPKVIVEGKPKKTPKSVT 345 Query: 2345
XXXXPSNGTPV-KRKYVRKKDVNISESPQGNGVEISPNGVPQSSGKRKYVRKKGVDNSDI 2169 N + KRKYVRKK S + + + + N + KRKYVRKK + I Sbjct: 346 PKTVDPNEKAIEKRKYVRKKGQKESTTEHPDSIGETTNSTEKPKQKRKYVRKKSLKEPQI 405 Query: 2168 QQKTRAEEATAPVVETPAKSCRKQLNFELE---------VVKDGSQMRGSQQDINLNAR- 2019 + A E T P T A SCRK LNFE+E +V M ++ NLN Sbjct: 406 RNADYAGETTYPSAGTAA-SCRKALNFEMENTYSEREKNLVAQQEIMNKGKETYNLNTGF 464 Query: 2018 --PQDVEQERINSILERSAMKITENDRYAG--VSTHQESSTNRMQVGTQTMSLPKPNVPT 1851 + +E R S L+ R+ G + Q N + +S N Sbjct: 465 HVSESLETHRTKSDLQMR--------RHNGSLLEFQQSRDVNNLTPFMNQIS----NNHQ 512 Query: 1850 PMAKARDHALNVLARNLTMRNSVSGKGYN-QVGQ----HVRGQSGTV---STNRDGREPS 1695 + R+ A+ AR ++ +G G + VG H G TV TN E + Sbjct: 513 SNSHRREGAVRPTARKDGQMDNSNGSGRDIDVGMLQHIHAEGTGRTVLPEKTNCKSLEKN 572 Query: 1694 GRMVNFE-----------ERRGIKRQSFE-QMHPRNLNAMDSLLMYQKLLLGADLRTDGS 1551 +V E RG KR + ++ +N L++Q+ +L D + Sbjct: 573 EEIVYHSTESVTKIPLLTEGRGYKRDYHQAELTMQNTGNPRGKLIFQEGVLIDDCHLNSH 632 Query: 1550 NDLANILESHKKTKTQSDHQTFVSNTPLGNNFSGEIRRTNGVYGNVSALQLLNSCTGRVD 1371 N A E+ KK K G + NG+ V+A+ + D Sbjct: 633 NSNAACPETCKKQKND-----------------GIQKNKNGMPPPVAAVNQSGGGNSKTD 675 Query: 1370 PSYKVTNAAGGNVNRH---------HFQPPMAATQNLQKHPAPSGMQPIAERSQRCTPGH 1218 S + + H++ +A+ Q+L +G ER+ G Sbjct: 676 SSASTVERNRELLKSYLKSKRDVVEHYKHSVASGQDLSLQHKWAGQNSCIERT-----GE 730 Query: 1217 GVNHVTAMVSWNRPPATPPKDYSRSAVVTYPATL---LDKKRTATPNSSNRGPNGADKML 1047 N V P TPPK +S P K+T S P+ ML Sbjct: 731 NCNIV---------PPTPPKMAPQSRDQLQPQICHIDASTKQTMASTQSLSVPSRKGNML 781 Query: 1046 LQLRKDALEVHQQSYTKAKGGPRKQKVSVSVSVEDLTYMLEGLCIYDENEKRQNALVPYR 867 Q +K+ L+ + + + G P KQK +++E++ Y +E L + +E + Q A+VPY+ Sbjct: 782 -QTQKNILKDQKSTAKRKAGQPAKQK---PITIEEIIYRMEHLNL-NEVKGEQTAIVPYK 836 Query: 866 GNNAIIP---FEPIKKRKPRPKVDLDPETDRLWRLLMGKEGSEATETLNKDKEKWWEDER 696 G+ A+IP FE IKKRKPRPKVDLDPET+R+W+LLM KEG E E +++K++WWE+ER Sbjct: 837 GDGALIPYDGFEIIKKRKPRPKVDLDPETERVWKLLMWKEGGEGLEGTDQEKKQWWEEER 896 Query: 695 RVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAKFT 516 RVF 
GRADSFIARMHLVQGDRRFS+WKGSVVDSVIGVFLTQNVSDHLSSSAFM+LAAKF Sbjct: 897 RVFGGRADSFIARMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMNLAAKF- 955 Query: 515 PKSTSTNKTCCQDMGCILVEEP-IETALPNDSMKCHDKIGRQPVFNQSSFASCESSEHMR 339 P + N+TC +D L++EP I PN ++K H+K+ P +NQSS ES EH R Sbjct: 956 PLKSMRNRTCERDEPRRLIQEPDIYMLNPNPTIKWHEKL-LTPFYNQSSMTPHESIEHRR 1014 Query: 338 HHIS-----TKATGDKQNRTSEEVILSQDSLDSSTIQTVDEIRSSSGSNSEAEDQTTGFE 174 + T EEV+ SQDS DSS +Q+ IRS SGSN EAED G + Sbjct: 1015 DQETSCTERTSIVEAHSYSPEEEVLSSQDSFDSSIVQSNGVIRSYSGSNLEAEDPAKGCK 1074 Query: 173 TSKEPGPANPMQAEKVSMFKELFSHDNRSTPLNDRSQYMHQ 51 ++ +N + E F+E FSH + + ++ S++ H+ Sbjct: 1075 HNENHNTSNAQKLE----FEEFFSHVSGRSLFHEGSRHRHR 1111 >ref|XP_002267310.1| PREDICTED: transcriptional activator DEMETER-like [Vitis vinifera] Length = 2198 Score = 416 bits (1068), Expect = e-113 Identities = 357/1061 (33%), Positives = 499/1061 (47%), Gaps = 84/1061 (7%) Frame = -1 Query: 2933 AHNNSVHALNRTPVPNSSTQVGSNSIGPSTVGSMVGNKKSHTFASNKPMGGYNSKQLPTN 2754 A S+ +R VPNS +Q N +++ ++G K++ S+ Q+P Sbjct: 413 APERSLLNASRPQVPNSHSQFEINWGEDNSIDMLLG-KENQCSGSSMWKNSNGLLQIPEY 471 Query: 2753 GFPVPYRPCYNLNSPPRSELDAASSGITGPLPFAPITPDTRRK----HTDNQWVPAKDRH 2586 GFP+PY+P +NLNSPP E DA SS IT P P+TP+ +K D P K++ Sbjct: 472 GFPIPYQPSFNLNSPPGVEADATSS-ITNSFPCPPVTPERPKKILNFSADEGSSPDKNQE 530 Query: 2585 --EGQRNEDADNHYNEQLQTIGDSTSSAVSTTQK-EHLVSEEGDELGIDLNKTPQQKTPA 2415 N +N +E L I S+S+A + K +++V++EGDE GIDLNKTP+QK P Sbjct: 531 YITSTTNGATENRCDELLHNIVASSSAAPPSPCKGKNIVAKEGDE-GIDLNKTPKQKQPK 589 Query: 2414 RRKKHRPKVIREXXXXXXXXXXXXXXXPSNGTPVKRKYVRKKDVNISESPQGNGVEISPN 2235 +RK HRPKV+ E TP K V + +P+ N Sbjct: 590 KRK-HRPKVVIEGKPKKTPKPKVVIEGKPKKTP-------KPKVPSNSNPKEN------- 634 Query: 2234 GVPQSSGKRKYVRKKG-----VDNSDIQQKTRAEEATAPVVETPAKSCRKQLNFELEVVK 2070 +GKRKYVRK D +D+ R E AKSC++ LNF E Sbjct: 635 ----PTGKRKYVRKNNPKVPVTDPTDV----RKEILDPSFASATAKSCKRVLNFGEEKSG 686 Query: 2069 DGSQMRGSQQDI---------NLNARPQDVEQ-ERINSILERSAMKITENDRYAGVSTHQ 1920 DG SQQ + LN Q E RIN I + V + Q Sbjct: 687 
DGQHDVASQQGVMQQDNEPTFTLNLTSQTKEPCTRINIISGTKVAMQNDQQNELVVKSQQ 746 Query: 1919 ESSTNRMQVGTQTMSLPKPNVPTPMAKARDHAL---NVLAR-----NLTMRNSVSGKGYN 1764 S+ Q+ +++ K P + L NV++R N R S Y Sbjct: 747 MSAVESQQISADYIAMLKRYTPAAQPTTENLQLGNLNVISRTVNKGNTDPRQRNSKNAYV 806 Query: 1763 QVGQHVRG--------QSGTVSTNRDGRE--------PSGRMVNFEERRGIKRQSFEQMH 1632 + QH+ Q T N D + + N + G KR + Sbjct: 807 PIPQHIHADGIGQIVIQPLTTQENLDSSRRQMMQSTSQTNKFANSNQATGSKRDYCHTIE 866 Query: 1631 PRNLNAMDSL--LMYQKLLLGADLRTDGSNDLANIL-ESHKKTKTQ----SDHQTFVSNT 1473 +A + + Q++ + S++L + + KK KT+ ++ T S T Sbjct: 867 QSQAHAAHLIGPSLCQEIF---QVNEYNSSNLCKVFSDMQKKRKTEKAAYTNMSTMASYT 923 Query: 1472 PLGNN--FSGEIRRTNGVYGNVSALQLLNSCTGRVDPSYKVTNAAGGNVNRHHFQPPMAA 1299 G + E + N + ++ +LN C + S + N A Sbjct: 924 TAGEDELHQAEAKSVNQLTSQINH-GILNICFEGNNDSQNLANGVNKTTRDSSMHQTTAG 982 Query: 1298 TQNLQKHPA---PSGMQPIAERSQR-CTPGHGVNHVTAMVSWNRPPATPPKDYSRSA--- 1140 + H + PS + + E+ CT H + +TA P P K S S+ Sbjct: 983 NSMWKHHISNEWPSQTEDMREKQVNGCTQLHRLTVLTAAAKDKLQPPAPIKARSYSSGQH 1042 Query: 1139 --VVTYPATLLDKKRTATPNSSNRGPNGADKMLLQLRKDALEVHQQSYTKAKGGPRKQKV 966 TL +K++ P SN + K LQ KD L + Q K +G P K+K Sbjct: 1043 SIESCRVITLAEKQKE--PLFSNSHSSSTYKPFLQEPKDKLYDYHQPSIKKRGRPAKKKQ 1100 Query: 965 SVSVSVEDLTYMLEGLCIYDENEK----RQNALVPYRGNNAIIPFEPIKKRKPRPKVDLD 798 + + L+ L + D + + +NA++ Y+G+ AIIP+E IKKRKPRPKVDLD Sbjct: 1101 PDPIDA--IIERLKSLELNDTSNETVSQEENAIILYKGDGAIIPYE-IKKRKPRPKVDLD 1157 Query: 797 PETDRLWRLLMGKEGSEATETLNKDKEKWWEDERRVFRGRADSFIARMHLVQGDRRFSRW 618 ET+R+W+LLMG E ++ K KWWE+ER VFRGRADSFIARMHLVQGDRRFS W Sbjct: 1158 LETERVWKLLMGAEQDVGDS--DERKAKWWEEEREVFRGRADSFIARMHLVQGDRRFSPW 1215 Query: 617 KGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAKFTPKSTSTNKTCCQDMGCILVEEPIETA 438 KGSVVDSVIGVFLTQNVSDHLSSSAFMSL ++F P +NKT + ILVEEP Sbjct: 1216 KGSVVDSVIGVFLTQNVSDHLSSSAFMSLVSRF-PLHPESNKTSYSNEASILVEEPEVCI 1274 Query: 437 L-PNDSMKCHDKIGRQPVFNQSSFASCESSEHMRHH-----ISTKATGDKQNRTSEEVIL 276 + P+D++K H+K+ Q V+NQ+ A ESSEH R T G R EEV+ Sbjct: 1275 
MNPDDTIKWHEKVSHQQVYNQAFVAYSESSEHRRDSPDSGTSETSLVGAPNQRAEEEVMS 1334 Query: 275 SQDSLDSSTIQTVDEIRSSSGSNSEAEDQTTGFETSKEPGPA--NPMQAEKVSMFKELFS 102 SQDS++SS +QT +RS SGSNSEAED TTG +T+K A N + EK M +E Sbjct: 1335 SQDSVNSSVVQTT-VLRSCSGSNSEAEDPTTGHKTNKVQASASTNILYMEKTFMSQECQY 1393 Query: 101 HDNRSTPLNDRS-QYMHQLPK-------THVRNMQVPINSG 3 H N+S+ ++ + +Y Q P+ T ++ INSG Sbjct: 1394 HANKSSNFDENTMRYRKQNPRLDRVENHTESSSLTYLINSG 1434 >ref|XP_002316518.1| predicted protein [Populus trichocarpa] gi|222865558|gb|EEF02689.1| predicted protein [Populus trichocarpa] Length = 1312 Score = 379 bits (972), Expect = e-102 Identities = 339/1034 (32%), Positives = 481/1034 (46%), Gaps = 86/1034 (8%) Frame = -1 Query: 2903 RTPVPNSSTQVGSNSIGPSTVGSMVGNKKSHTFASNKPMGGYNSKQLPTNGFPVPYRPCY 2724 R PN QV +N P+ ++GN+ +H Y S Q P +P Y Sbjct: 142 RPSFPNLHPQV-NNYREPNL---LLGNQ-THCSGLRHLGSNYISSQEPNYEPMMPCPHNY 196 Query: 2723 NLNSPPRSELDAASSGITGPLPFAPITPDTRRKHTDNQWVPAKDRHE----GQRNE---- 2568 +LN PPR E DAAS T A + PD ++ A E G++ + Sbjct: 197 DLNFPPRMEADAASY-FTTSFKLATVVPDQCKRLESRLSATASPSQEKNSSGEKEKTDLV 255 Query: 2567 -----DADNHYNEQLQ-TIGDSTSSAVSTTQKEHLVSEEGDELGIDLNKTPQQKTPARRK 2406 +A+ H +++L I D+ S+ +ST +E + GIDLN+TPQQK P +R+ Sbjct: 256 IFKECEANQHNSKELSCNITDAPSAVISTPFEEAKDLATANAQGIDLNRTPQQK-PQKRR 314 Query: 2405 KHRPKVIREXXXXXXXXXXXXXXXPSNGTPV-KRKYVRKKDVNISESPQGNGVEISPNGV 2229 KHRPKVI E P+ KRKYVRK + P E + + Sbjct: 315 KHRPKVIVEGKPKRTPKAATTKITDPKEKPIEKRKYVRKA----LKEPATKPTESTVDTA 370 Query: 2228 PQSSGKRKYVRKKGVDNSDIQQK--------------------------TRAEEATAPVV 2127 P SS KRKYVRKK +D S +Q R ++T + Sbjct: 371 PPSSAKRKYVRKKALDESAVQHTDSIGETINTHAVKRKYVRKKDLNKSANRHADSTVEIT 430 Query: 2126 ETP---AKSCRKQLNFELEVVKDGSQMRGSQQDINLNARPQDVEQERINSILERSAMKIT 1956 ++ AKSCR+ L F+LE D S + Q LN + + +N+ L+ + + T Sbjct: 431 QSSSADAKSCRRALRFDLETATDRSCSNAAAQQDMLNQKRGTFD---LNASLQVADLSTT 487 Query: 1955 ENDRYAGVSTHQESSTNRMQVGTQTMSLPKPNVPTPMAKARDHA----LNVLARNLTMRN 1788 T Q S +R+ V Q P P D+ + V+A LT R Sbjct: 488 
---------TSQMSQQHRLLVENQQSGAPSNQTPFMNQPRGDYISISEIQVVAAELTPRK 538 Query: 1787 ----------------SVSGKGYNQVGQHVRGQSGTVSTNRDGREPSGRMVN--FEERRG 1662 S+ +G QV +G T S + + E RG Sbjct: 539 NMHMEKLNLNAGDVERSIHAQGIGQVVFPEKGPEWTRQITSQNNSQSAQKITPYLIEGRG 598 Query: 1661 IKRQSFEQMHPRNLNAMDSLLMYQKLLLGADLRTDGSNDLANILESHKKTKTQSDHQTFV 1482 KR+ F H + N + Y L +GS + E+ K+ KT+ QT Sbjct: 599 FKREHF---HIKKTNPCTA---YPVGSLTDGYDQNGSIPGSGCSETQKRKKTEDGIQT-- 650 Query: 1481 SNTPLGNNFSGEIRRTNGVYGNVSALQLL-NSCTGRVDPSYKVTNAAGGNVNRHHFQPPM 1305 NT ++F +++ Y + ALQ L C + P + G N Sbjct: 651 -NTHSISSFVSKVKYPGEWYVHSMALQNLPKQC---ISPQPHLCLEMLGETN-------- 698 Query: 1304 AATQNLQKHPAPSGMQPIAERSQRCTPGHGVNHVTAMVSWNR-PPATPPKDYSR---SAV 1137 + +Q P+ ++ SQ T+ S N+ P T + SR + Sbjct: 699 -GSTQVQNSLCPTTIETSHRLSQTSLK-------TSRASDNQLQPKTCNAEMSRIQQMSE 750 Query: 1136 VTYPATLLDKKRTATPNSSNRGPNGADKMLLQLRKDALEVHQQSYTKAKGGPRKQKVSVS 957 T P ++ P+ + P Q KD L+VHQQ Y K +G P KQ + S Sbjct: 751 ATVPISI--------PSEKGKIP--------QEPKDDLKVHQQPYAKRRGRPAKQ--TFS 792 Query: 956 VSVEDLTYMLEGLCIYDENEK----RQNALVPYRGNNAIIP---FEPIKKRKPRPKVDLD 798 ++E + Y +EGL + ++K QNALVPY+G+ ++P FE +KK KPRPKVDLD Sbjct: 793 STIEQIIYQMEGLRLNAGSKKIENKEQNALVPYKGDGKLVPYDGFEVVKKHKPRPKVDLD 852 Query: 797 PETDRLWRLLMGKEGSEATETLNKDKEKWWEDERRVFRGRADSFIARMHLVQGDRRFSRW 618 PE+DR+W+LLMGKEGS+ E +K KE+WW +ER+VF GR DSFIARMHLVQGDRRFS+W Sbjct: 853 PESDRVWKLLMGKEGSQGLEGTDKGKEQWWGEERKVFHGRVDSFIARMHLVQGDRRFSKW 912 Query: 617 KGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAKFTPKSTSTNKTCCQDMGCILVEEPIETA 438 KGSVVDSVIGVFLTQNVSDHLSSSAFMSLA+ F P ++ C ++ I++EEP Sbjct: 913 KGSVVDSVIGVFLTQNVSDHLSSSAFMSLASLF-PLKLRSSGACDRERTSIVIEEPDTCI 971 Query: 437 L-PNDSMKCHDKIGRQPVFNQSSFASCESSEHMRH----HISTKATGDKQNRT-SEEVIL 276 L PND K P++NQSS S+E + I + + Q+ + EE +L Sbjct: 972 LNPNDI-----KWNSNPLYNQSSVTHHGSAEPHKDSETLFIERASMVETQSHSLEEEFVL 1026 Query: 275 SQDSLDSSTIQTVDEIRSSSGSNSEAEDQTTGFETSKEPGPA--NPMQAEKVSMFKELFS 102 SQDS DSST+Q + +RS SGSNSEAED TG + S + + +Q E ++ E + Sbjct: 1027 
SQDSFDSSTVQ-ANGVRSYSGSNSEAEDPATGCKPSMNDDLSFMDLLQMESPTLLGEFYG 1085 Query: 101 HDNRSTPLNDRSQY 60 + S+ + S++ Sbjct: 1086 CEGGSSLFHKESRH 1099 >emb|CBI40219.3| unnamed protein product [Vitis vinifera] Length = 1621 Score = 343 bits (880), Expect = 2e-91 Identities = 300/916 (32%), Positives = 422/916 (46%), Gaps = 69/916 (7%) Frame = -1 Query: 2933 AHNNSVHALNRTPVPNSSTQVGSNSIGPSTVGSMVGNKKSHTFASNKPMGGYNSKQLPTN 2754 A S+ +R VPNS +Q N +++ ++G K++ S+ Q+P Sbjct: 88 APERSLLNASRPQVPNSHSQFEINWGEDNSIDMLLG-KENQCSGSSMWKNSNGLLQIPEY 146 Query: 2753 GFPVPYRPCYNLNSPPRSELDAASSGITGPLPFAPITPDTRRK----HTDNQWVPAKDRH 2586 GFP+PY+P +NLNSPP E DA SS IT P P+TP+ +K D P K++ Sbjct: 147 GFPIPYQPSFNLNSPPGVEADATSS-ITNSFPCPPVTPERPKKILNFSADEGSSPDKNQE 205 Query: 2585 --EGQRNEDADNHYNEQLQTIGDSTSSAVSTTQK-EHLVSEEGDELGIDLNKTPQQKTPA 2415 N +N +E L I S+S+A + K +++V++EGDE GIDLNKTP+QK P Sbjct: 206 YITSTTNGATENRCDELLHNIVASSSAAPPSPCKGKNIVAKEGDE-GIDLNKTPKQKQPK 264 Query: 2414 RRKKHRPKVIREXXXXXXXXXXXXXXXPSNGTPVKRKYVRKKDVNISESPQGNGVEISPN 2235 +RK HRPKV+ E TP K V + +P+ N Sbjct: 265 KRK-HRPKVVIEGKPKKTPKPKVVIEGKPKKTP-------KPKVPSNSNPKEN------- 309 Query: 2234 GVPQSSGKRKYVRKKG-----VDNSDIQQKTRAEEATAPVVETPAKSCRKQLNFELEVVK 2070 +GKRKYVRK D +D+ R E AKSC++ LNF E Sbjct: 310 ----PTGKRKYVRKNNPKVPVTDPTDV----RKEILDPSFASATAKSCKRVLNFGEEKSG 361 Query: 2069 DGSQMRGSQQDI---------NLNARPQDVEQ-ERINSILERSAMKITENDRYAGVSTHQ 1920 DG SQQ + LN Q E RIN I + V + Q Sbjct: 362 DGQHDVASQQGVMQQDNEPTFTLNLTSQTKEPCTRINIISGTKVAMQNDQQNELVVKSQQ 421 Query: 1919 ESSTNRMQVGTQTMSLPKPNVPTPMAKARDHAL---NVLAR-----NLTMRNSVSGKGYN 1764 S+ Q+ +++ K P + L NV++R N R S Y Sbjct: 422 MSAVESQQISADYIAMLKRYTPAAQPTTENLQLGNLNVISRTVNKGNTDPRQRNSKNAYV 481 Query: 1763 QVGQHVRG--------QSGTVSTNRDGRE--------PSGRMVNFEERRGIKRQSFEQMH 1632 + QH+ Q T N D + + N + G KR + Sbjct: 482 PIPQHIHADGIGQIVIQPLTTQENLDSSRRQMMQSTSQTNKFANSNQATGSKRDYCHTIE 541 Query: 1631 PRNLNAMDSL--LMYQKLLLGADLRTDGSNDLANIL-ESHKKTKTQ----SDHQTFVSNT 1473 +A + + Q++ + S++L + + KK KT+ ++ T S T Sbjct: 542 
QSQAHAAHLIGPSLCQEIF---QVNEYNSSNLCKVFSDMQKKRKTEKAAYTNMSTMASYT 598 Query: 1472 PLGNN--FSGEIRRTNGVYGNVSALQLLNSCTGRVDPSYKVTNAAGGNVNRHHFQPPMAA 1299 G + E + N + ++ +LN C + S + N A Sbjct: 599 TAGEDELHQAEAKSVNQLTSQINH-GILNICFEGNNDSQNLANGVNKTTRDSSMHQTTAG 657 Query: 1298 TQNLQKHPA---PSGMQPIAERSQR-CTPGHGVNHVTAMVSWNRPPATPPKDYSRSA--- 1140 + H + PS + + E+ CT H + +TA P P K S S+ Sbjct: 658 NSMWKHHISNEWPSQTEDMREKQVNGCTQLHRLTVLTAAAKDKLQPPAPIKARSYSSGQH 717 Query: 1139 --VVTYPATLLDKKRTATPNSSNRGPNGADKMLLQLRKDALEVHQQSYTKAKGGPRKQKV 966 TL +K++ P SN + K LQ KD L + Q K +G P K+K Sbjct: 718 SIESCRVITLAEKQKE--PLFSNSHSSSTYKPFLQEPKDKLYDYHQPSIKKRGRPAKKKQ 775 Query: 965 SVSVSVEDLTYMLEGLCIYDENEK----RQNALVPYRGNNAIIPFEPIKKRKPRPKVDLD 798 + + L+ L + D + + +NA++ Y+G+ AIIP+E IKKRKPRPKVDLD Sbjct: 776 PDPIDA--IIERLKSLELNDTSNETVSQEENAIILYKGDGAIIPYE-IKKRKPRPKVDLD 832 Query: 797 PETDRLWRLLMGKEGSEATETLNKDKEKWWEDERRVFRGRADSFIARMHLVQGDRRFSRW 618 ET+R+W+LLMG E ++ K KWWE+ER VFRGRADSFIARMHLVQGDRRFS W Sbjct: 833 LETERVWKLLMGAEQDVGDS--DERKAKWWEEEREVFRGRADSFIARMHLVQGDRRFSPW 890 Query: 617 KGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAKFTPKSTSTNKTCCQDMGCILVEEPIETA 438 KGSVVDSVIGVFLTQNVSDHLSSSAFMSL ++F P +NKT + ILVEEP Sbjct: 891 KGSVVDSVIGVFLTQNVSDHLSSSAFMSLVSRF-PLHPESNKTSYSNEASILVEEPEVCI 949 Query: 437 L-PNDSMKCHDKIGRQ 393 + P+D++K H+K+ Q Sbjct: 950 MNPDDTIKWHEKVSHQ 965 >gb|AEC12445.1| DNA N-glycosylase/DNA-(apurinic or apyrimidinic site) lyase [Gossypium hirsutum] Length = 2055 Score = 342 bits (878), Expect = 3e-91 Identities = 322/1000 (32%), Positives = 457/1000 (45%), Gaps = 102/1000 (10%) Frame = -1 Query: 2753 GFPVPYRPCYNLNSPPRSELDAASSGITGPLPFAPITPDTRRKHTDNQWVPAKDRHEGQR 2574 GFP+P P NLNSP R+E+ A S T T++ N PA D + Sbjct: 293 GFPIPSMPVCNLNSPARTEVGAPSHFNTSFQSLLATPDQTQKTRKQN---PAADENSVSE 349 Query: 2573 NEDAD------NHYNEQ----LQTIGDSTSSAVSTTQKEHLVSEEGDELGIDLNKTPQQK 2424 E +++Q LQ I DS+S +S +E SE G GIDLNKTPQQK Sbjct: 350 KEQESLIVCNKKEFSQQNCDLLQNIVDSSSVIISAPMEEK-DSERGSVQGIDLNKTPQQK 408 Query: 2423 
TPARRKKHRPKVIREXXXXXXXXXXXXXXXPSNGTPV-KRKYVRKKDVNISESPQGNGVE 2247 P +R+KHRPKVI E S P KRKYVR+K + + + + Sbjct: 409 -PPKRRKHRPKVIVEGKPKRTPKPTTTANVNSKDNPSGKRKYVRRKGLTEPATQHADPTK 467 Query: 2246 ISPNGVPQSSGKRKYVRKKGVDNSDIQ----------------------QKTRAEEATAP 2133 S + + KRKYVRKKG+ Q +T +E+ +P Sbjct: 468 AS-DSTAGTPAKRKYVRKKGLTELATQHAEVLQTNLLVMLGSTIRGKCMHETNQKESASP 526 Query: 2132 VVE----------TPAKSCRKQLNFELEVVKDGSQMRGSQQDINLNARPQDVEQERINSI 1983 + +SCR+ LNF+LE +GS L+++ + +S+ Sbjct: 527 QGDCIRDSDPSPVCAPRSCRRALNFDLENTGNGSLAGTLNHQEMLSSKSSESRSMGFSSV 586 Query: 1982 LERSAMK---ITENDRYAGVSTH--------QESSTNRMQVGTQTMSLPKPNVPTP---M 1845 S K T++++ +G++ S + + MSLP T Sbjct: 587 -GNSGFKTRFTTQSNQQSGLAVENPQLQAECSHSPFMKKMMPIDYMSLPGITAATASRLQ 645 Query: 1844 AKARDHALNVLARNLTMRN-SVSGKGYNQVGQHVRGQSGTV----STNRDGREPSGR--- 1689 AK +NV+ARN M + ++ Y VG + + T + EP Sbjct: 646 AKELMENVNVMARNANMYDIDLNQNSYRNVGTLPHSKLSNLFHKEETGKILMEPRNSCLK 705 Query: 1688 ---------MVNFEERRGIKRQSF---EQMHPRNLNAMDSLLMYQKLLLGADLRTDGSND 1545 + N E RG KR + EQ M SLL Q + + +G ++ Sbjct: 706 DTLSQSATVLTNSNEGRGSKRDHYHAIEQGQFSTAGTMSSLLS-QAIFQADEGYRNGCSN 764 Query: 1544 LANILESHKKTKTQSDHQTFVSNTPLGNNFSGEIRRTNGVYG-NVSALQLLNSCTGRVDP 1368 A ++ K+ + + + + + + +T G N L C G DP Sbjct: 765 EAAFPQASKRRIIEDEFHAYKYGMKCSVSHAAGLLQTKGTNDVNAGQFTSLRDC-GTSDP 823 Query: 1367 SYKVTN---AAGG---NVNRHHFQPPMAATQNLQKHPAPSGMQPIAERSQRCTPGHGVNH 1206 ++ N GG + + + A K S + E+ G + H Sbjct: 824 HFRSDNIDRRKGGVFSQLTGNRYVNSTAGDLTSSKQNILSQLHSGIEKVGNIN-GLALVH 882 Query: 1205 VTAMVSWNRP---PATPPK-DYSRSAVV--TYPATLLDKKRTATPNSSNRGPNGADKMLL 1044 A + NR P TP K R+ +V T+ + + K+ P P KM+ Sbjct: 883 NLATIE-NRNLLLPTTPEKVSTPRTGLVGQTFHTNVSENKK-REPGLPRNVPFTVGKMVQ 940 Query: 1043 QLRKDALEVHQQSYTKAKGGPRKQKVSVSVSVEDLTYMLEGLCIYDENEK----RQNALV 876 + K + +QQS TKA+ GP + VS++ VE++ +GL + ++N K QNALV Sbjct: 941 E--KKRVSENQQS-TKAR-GPSAKHVSLN-PVEEIINRFKGLTLEEKNNKPKAELQNALV 995 Query: 875 PYRGNNAIIPFEPIK--KRKPRPKVDLDPETDRLWRLLMGKEGSEATETLNKDKEKWWED 702 Y G ++PFE + K+K RP+VDLDPET+R+W LLMGKEG + T 
DKEKWWE+ Sbjct: 996 LYNGAGTVVPFEGFESIKKKVRPRVDLDPETNRVWNLLMGKEGEDTEGT---DKEKWWEE 1052 Query: 701 ERRVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAK 522 ERRVF GR DSFIARMHLVQGDRRFS+WKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAK Sbjct: 1053 ERRVFHGRVDSFIARMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAK 1112 Query: 521 FTPKSTSTNKTCCQDMGCILVEEPIETAL-PNDSMKCHDKIGRQPVFNQSSFASCESSEH 345 F P +S C + IL+EEP L +++K H+K R + +QSS S+++ Sbjct: 1113 F-PLKSSCKGDCNAERTTILIEEPEVCELNSEETIKWHEKPFRHQLDSQSSMTPNRSTDY 1171 Query: 344 MRHH-----ISTKATGDKQNRTSEEVILSQDSLDSSTIQTVDEIRSSSGSNSEAEDQTTG 180 R+ T G EEV+ SQ S DSS IQ IR+ SGS SE ED T Sbjct: 1172 QRNSEYSGIERTSFMGTYSQSLEEEVLSSQGSFDSSVIQANGGIRTYSGSYSETEDPTMS 1231 Query: 179 FETSKEPGPANPMQAEKVSMFKELFSHDNRSTPLNDRSQY 60 + G + Q E + +E + + S+ L++ +Y Sbjct: 1232 CKFLSIHG-STLDQIENSASVEEFYHCASGSSQLHEGIKY 1270