BLASTX nr result

ID: Atractylodes21_contig00014105

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Atractylodes21_contig00014105
         (3221 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002530889.1| conserved hypothetical protein [Ricinus comm...   443   e-121
ref|XP_002267310.1| PREDICTED: transcriptional activator DEMETER...   416   e-113
ref|XP_002316518.1| predicted protein [Populus trichocarpa] gi|2...   379   e-102
emb|CBI40219.3| unnamed protein product [Vitis vinifera]              343   2e-91
gb|AEC12445.1| DNA N-glycosylase/DNA-(apurinic or apyrimidinic s...   342   3e-91

>ref|XP_002530889.1| conserved hypothetical protein [Ricinus communis]
            gi|223529542|gb|EEF31495.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 1876

 Score =  443 bits (1140), Expect = e-121
 Identities = 363/1061 (34%), Positives = 511/1061 (48%), Gaps = 71/1061 (6%)
 Frame = -1

Query: 3020 DDPANWNCNLLA-AVVRPKKSSASFPSLIGAHNNSVHALNRTPVPNSSTQVGSNSIGPST 2844
            ++  +W+ N LA  +V    +  ++PS     N S+    R  +PN +T V  NS+    
Sbjct: 112  NENVSWSSNSLADLLVMNNTAPTAYPSRTLHRNTSI--AERPLIPNLNTPV--NSLREFN 167

Query: 2843 VGSMVGNKKSHTFASNKPMGGYNSKQLPTNGFPVPYRPCYNLNSPPRSELDAASSGITGP 2664
             G +    ++H  +SN P G  +  Q+P  GFP+PY P Y+LNSPP  E DAAS+ +T  
Sbjct: 168  SGELFYTNQAHCSSSNVPSGHNSLFQMPQYGFPIPYNPNYDLNSPPSIEADAAST-VTNS 226

Query: 2663 LPFAPITPDTRRKHTDNQWVPAKDRHEGQRNE-DADNHY-------------NEQLQTIG 2526
              FAPI    ++       +    + +G   E D  ++Y             ++  Q I 
Sbjct: 227  FQFAPIIEQAKKLENQLSALVNFPQGKGSSEERDKQDNYVVSLGNVPNQHNPDKLFQNIV 286

Query: 2525 DSTSSAVSTTQKEHLVSEEGDELGIDLNKTPQQKTPARRKKHRPKVIREXXXXXXXXXXX 2346
            DS S+ +ST  +E   S +G +  IDLNKTPQQKTP RRK HRPKVI E           
Sbjct: 287  DSASAVISTPFEEPKESCQGSDQVIDLNKTPQQKTPKRRK-HRPKVIVEGKPKKTPKSVT 345

Query: 2345 XXXXPSNGTPV-KRKYVRKKDVNISESPQGNGVEISPNGVPQSSGKRKYVRKKGVDNSDI 2169
                  N   + KRKYVRKK    S +   + +  + N   +   KRKYVRKK +    I
Sbjct: 346  PKTVDPNEKAIEKRKYVRKKGQKESTTEHPDSIGETTNSTEKPKQKRKYVRKKSLKEPQI 405

Query: 2168 QQKTRAEEATAPVVETPAKSCRKQLNFELE---------VVKDGSQMRGSQQDINLNAR- 2019
            +    A E T P   T A SCRK LNFE+E         +V     M   ++  NLN   
Sbjct: 406  RNADYAGETTYPSAGTAA-SCRKALNFEMENTYSEREKNLVAQQEIMNKGKETYNLNTGF 464

Query: 2018 --PQDVEQERINSILERSAMKITENDRYAG--VSTHQESSTNRMQVGTQTMSLPKPNVPT 1851
               + +E  R  S L+          R+ G  +   Q    N +      +S    N   
Sbjct: 465  HVSESLETHRTKSDLQMR--------RHNGSLLEFQQSRDVNNLTPFMNQIS----NNHQ 512

Query: 1850 PMAKARDHALNVLARNLTMRNSVSGKGYN-QVGQ----HVRGQSGTV---STNRDGREPS 1695
              +  R+ A+   AR     ++ +G G +  VG     H  G   TV    TN    E +
Sbjct: 513  SNSHRREGAVRPTARKDGQMDNSNGSGRDIDVGMLQHIHAEGTGRTVLPEKTNCKSLEKN 572

Query: 1694 GRMVNFE-----------ERRGIKRQSFE-QMHPRNLNAMDSLLMYQKLLLGADLRTDGS 1551
              +V              E RG KR   + ++  +N       L++Q+ +L  D   +  
Sbjct: 573  EEIVYHSTESVTKIPLLTEGRGYKRDYHQAELTMQNTGNPRGKLIFQEGVLIDDCHLNSH 632

Query: 1550 NDLANILESHKKTKTQSDHQTFVSNTPLGNNFSGEIRRTNGVYGNVSALQLLNSCTGRVD 1371
            N  A   E+ KK K                   G  +  NG+   V+A+        + D
Sbjct: 633  NSNAACPETCKKQKND-----------------GIQKNKNGMPPPVAAVNQSGGGNSKTD 675

Query: 1370 PSYKVTNAAGGNVNRH---------HFQPPMAATQNLQKHPAPSGMQPIAERSQRCTPGH 1218
             S          +  +         H++  +A+ Q+L      +G     ER+     G 
Sbjct: 676  SSASTVERNRELLKSYLKSKRDVVEHYKHSVASGQDLSLQHKWAGQNSCIERT-----GE 730

Query: 1217 GVNHVTAMVSWNRPPATPPKDYSRSAVVTYPATL---LDKKRTATPNSSNRGPNGADKML 1047
              N V         P TPPK   +S     P         K+T     S   P+    ML
Sbjct: 731  NCNIV---------PPTPPKMAPQSRDQLQPQICHIDASTKQTMASTQSLSVPSRKGNML 781

Query: 1046 LQLRKDALEVHQQSYTKAKGGPRKQKVSVSVSVEDLTYMLEGLCIYDENEKRQNALVPYR 867
             Q +K+ L+  + +  +  G P KQK    +++E++ Y +E L + +E +  Q A+VPY+
Sbjct: 782  -QTQKNILKDQKSTAKRKAGQPAKQK---PITIEEIIYRMEHLNL-NEVKGEQTAIVPYK 836

Query: 866  GNNAIIP---FEPIKKRKPRPKVDLDPETDRLWRLLMGKEGSEATETLNKDKEKWWEDER 696
            G+ A+IP   FE IKKRKPRPKVDLDPET+R+W+LLM KEG E  E  +++K++WWE+ER
Sbjct: 837  GDGALIPYDGFEIIKKRKPRPKVDLDPETERVWKLLMWKEGGEGLEGTDQEKKQWWEEER 896

Query: 695  RVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAKFT 516
            RVF GRADSFIARMHLVQGDRRFS+WKGSVVDSVIGVFLTQNVSDHLSSSAFM+LAAKF 
Sbjct: 897  RVFGGRADSFIARMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMNLAAKF- 955

Query: 515  PKSTSTNKTCCQDMGCILVEEP-IETALPNDSMKCHDKIGRQPVFNQSSFASCESSEHMR 339
            P  +  N+TC +D    L++EP I    PN ++K H+K+   P +NQSS    ES EH R
Sbjct: 956  PLKSMRNRTCERDEPRRLIQEPDIYMLNPNPTIKWHEKL-LTPFYNQSSMTPHESIEHRR 1014

Query: 338  HHIS-----TKATGDKQNRTSEEVILSQDSLDSSTIQTVDEIRSSSGSNSEAEDQTTGFE 174
               +     T           EEV+ SQDS DSS +Q+   IRS SGSN EAED   G +
Sbjct: 1015 DQETSCTERTSIVEAHSYSPEEEVLSSQDSFDSSIVQSNGVIRSYSGSNLEAEDPAKGCK 1074

Query: 173  TSKEPGPANPMQAEKVSMFKELFSHDNRSTPLNDRSQYMHQ 51
             ++    +N  + E    F+E FSH +  +  ++ S++ H+
Sbjct: 1075 HNENHNTSNAQKLE----FEEFFSHVSGRSLFHEGSRHRHR 1111


>ref|XP_002267310.1| PREDICTED: transcriptional activator DEMETER-like [Vitis vinifera]
          Length = 2198

 Score =  416 bits (1068), Expect = e-113
 Identities = 357/1061 (33%), Positives = 499/1061 (47%), Gaps = 84/1061 (7%)
 Frame = -1

Query: 2933 AHNNSVHALNRTPVPNSSTQVGSNSIGPSTVGSMVGNKKSHTFASNKPMGGYNSKQLPTN 2754
            A   S+   +R  VPNS +Q   N    +++  ++G K++    S+         Q+P  
Sbjct: 413  APERSLLNASRPQVPNSHSQFEINWGEDNSIDMLLG-KENQCSGSSMWKNSNGLLQIPEY 471

Query: 2753 GFPVPYRPCYNLNSPPRSELDAASSGITGPLPFAPITPDTRRK----HTDNQWVPAKDRH 2586
            GFP+PY+P +NLNSPP  E DA SS IT   P  P+TP+  +K      D    P K++ 
Sbjct: 472  GFPIPYQPSFNLNSPPGVEADATSS-ITNSFPCPPVTPERPKKILNFSADEGSSPDKNQE 530

Query: 2585 --EGQRNEDADNHYNEQLQTIGDSTSSAVSTTQK-EHLVSEEGDELGIDLNKTPQQKTPA 2415
                  N   +N  +E L  I  S+S+A  +  K +++V++EGDE GIDLNKTP+QK P 
Sbjct: 531  YITSTTNGATENRCDELLHNIVASSSAAPPSPCKGKNIVAKEGDE-GIDLNKTPKQKQPK 589

Query: 2414 RRKKHRPKVIREXXXXXXXXXXXXXXXPSNGTPVKRKYVRKKDVNISESPQGNGVEISPN 2235
            +RK HRPKV+ E                   TP       K  V  + +P+ N       
Sbjct: 590  KRK-HRPKVVIEGKPKKTPKPKVVIEGKPKKTP-------KPKVPSNSNPKEN------- 634

Query: 2234 GVPQSSGKRKYVRKKG-----VDNSDIQQKTRAEEATAPVVETPAKSCRKQLNFELEVVK 2070
                 +GKRKYVRK        D +D+    R E          AKSC++ LNF  E   
Sbjct: 635  ----PTGKRKYVRKNNPKVPVTDPTDV----RKEILDPSFASATAKSCKRVLNFGEEKSG 686

Query: 2069 DGSQMRGSQQDI---------NLNARPQDVEQ-ERINSILERSAMKITENDRYAGVSTHQ 1920
            DG     SQQ +          LN   Q  E   RIN I         +      V + Q
Sbjct: 687  DGQHDVASQQGVMQQDNEPTFTLNLTSQTKEPCTRINIISGTKVAMQNDQQNELVVKSQQ 746

Query: 1919 ESSTNRMQVGTQTMSLPKPNVPTPMAKARDHAL---NVLAR-----NLTMRNSVSGKGYN 1764
             S+    Q+    +++ K   P       +  L   NV++R     N   R   S   Y 
Sbjct: 747  MSAVESQQISADYIAMLKRYTPAAQPTTENLQLGNLNVISRTVNKGNTDPRQRNSKNAYV 806

Query: 1763 QVGQHVRG--------QSGTVSTNRDGRE--------PSGRMVNFEERRGIKRQSFEQMH 1632
             + QH+          Q  T   N D            + +  N  +  G KR     + 
Sbjct: 807  PIPQHIHADGIGQIVIQPLTTQENLDSSRRQMMQSTSQTNKFANSNQATGSKRDYCHTIE 866

Query: 1631 PRNLNAMDSL--LMYQKLLLGADLRTDGSNDLANIL-ESHKKTKTQ----SDHQTFVSNT 1473
                +A   +   + Q++     +    S++L  +  +  KK KT+    ++  T  S T
Sbjct: 867  QSQAHAAHLIGPSLCQEIF---QVNEYNSSNLCKVFSDMQKKRKTEKAAYTNMSTMASYT 923

Query: 1472 PLGNN--FSGEIRRTNGVYGNVSALQLLNSCTGRVDPSYKVTNAAGGNVNRHHFQPPMAA 1299
              G +     E +  N +   ++   +LN C    + S  + N               A 
Sbjct: 924  TAGEDELHQAEAKSVNQLTSQINH-GILNICFEGNNDSQNLANGVNKTTRDSSMHQTTAG 982

Query: 1298 TQNLQKHPA---PSGMQPIAERSQR-CTPGHGVNHVTAMVSWNRPPATPPKDYSRSA--- 1140
                + H +   PS  + + E+    CT  H +  +TA       P  P K  S S+   
Sbjct: 983  NSMWKHHISNEWPSQTEDMREKQVNGCTQLHRLTVLTAAAKDKLQPPAPIKARSYSSGQH 1042

Query: 1139 --VVTYPATLLDKKRTATPNSSNRGPNGADKMLLQLRKDALEVHQQSYTKAKGGPRKQKV 966
                    TL +K++   P  SN   +   K  LQ  KD L  + Q   K +G P K+K 
Sbjct: 1043 SIESCRVITLAEKQKE--PLFSNSHSSSTYKPFLQEPKDKLYDYHQPSIKKRGRPAKKKQ 1100

Query: 965  SVSVSVEDLTYMLEGLCIYDENEK----RQNALVPYRGNNAIIPFEPIKKRKPRPKVDLD 798
               +    +   L+ L + D + +     +NA++ Y+G+ AIIP+E IKKRKPRPKVDLD
Sbjct: 1101 PDPIDA--IIERLKSLELNDTSNETVSQEENAIILYKGDGAIIPYE-IKKRKPRPKVDLD 1157

Query: 797  PETDRLWRLLMGKEGSEATETLNKDKEKWWEDERRVFRGRADSFIARMHLVQGDRRFSRW 618
             ET+R+W+LLMG E        ++ K KWWE+ER VFRGRADSFIARMHLVQGDRRFS W
Sbjct: 1158 LETERVWKLLMGAEQDVGDS--DERKAKWWEEEREVFRGRADSFIARMHLVQGDRRFSPW 1215

Query: 617  KGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAKFTPKSTSTNKTCCQDMGCILVEEPIETA 438
            KGSVVDSVIGVFLTQNVSDHLSSSAFMSL ++F P    +NKT   +   ILVEEP    
Sbjct: 1216 KGSVVDSVIGVFLTQNVSDHLSSSAFMSLVSRF-PLHPESNKTSYSNEASILVEEPEVCI 1274

Query: 437  L-PNDSMKCHDKIGRQPVFNQSSFASCESSEHMRHH-----ISTKATGDKQNRTSEEVIL 276
            + P+D++K H+K+  Q V+NQ+  A  ESSEH R         T   G    R  EEV+ 
Sbjct: 1275 MNPDDTIKWHEKVSHQQVYNQAFVAYSESSEHRRDSPDSGTSETSLVGAPNQRAEEEVMS 1334

Query: 275  SQDSLDSSTIQTVDEIRSSSGSNSEAEDQTTGFETSKEPGPA--NPMQAEKVSMFKELFS 102
            SQDS++SS +QT   +RS SGSNSEAED TTG +T+K    A  N +  EK  M +E   
Sbjct: 1335 SQDSVNSSVVQTT-VLRSCSGSNSEAEDPTTGHKTNKVQASASTNILYMEKTFMSQECQY 1393

Query: 101  HDNRSTPLNDRS-QYMHQLPK-------THVRNMQVPINSG 3
            H N+S+  ++ + +Y  Q P+       T   ++   INSG
Sbjct: 1394 HANKSSNFDENTMRYRKQNPRLDRVENHTESSSLTYLINSG 1434


>ref|XP_002316518.1| predicted protein [Populus trichocarpa] gi|222865558|gb|EEF02689.1|
            predicted protein [Populus trichocarpa]
          Length = 1312

 Score =  379 bits (972), Expect = e-102
 Identities = 339/1034 (32%), Positives = 481/1034 (46%), Gaps = 86/1034 (8%)
 Frame = -1

Query: 2903 RTPVPNSSTQVGSNSIGPSTVGSMVGNKKSHTFASNKPMGGYNSKQLPTNGFPVPYRPCY 2724
            R   PN   QV +N   P+    ++GN+ +H          Y S Q P     +P    Y
Sbjct: 142  RPSFPNLHPQV-NNYREPNL---LLGNQ-THCSGLRHLGSNYISSQEPNYEPMMPCPHNY 196

Query: 2723 NLNSPPRSELDAASSGITGPLPFAPITPDTRRKHTDNQWVPAKDRHE----GQRNE---- 2568
            +LN PPR E DAAS   T     A + PD  ++        A    E    G++ +    
Sbjct: 197  DLNFPPRMEADAASY-FTTSFKLATVVPDQCKRLESRLSATASPSQEKNSSGEKEKTDLV 255

Query: 2567 -----DADNHYNEQLQ-TIGDSTSSAVSTTQKEHLVSEEGDELGIDLNKTPQQKTPARRK 2406
                 +A+ H +++L   I D+ S+ +ST  +E       +  GIDLN+TPQQK P +R+
Sbjct: 256  IFKECEANQHNSKELSCNITDAPSAVISTPFEEAKDLATANAQGIDLNRTPQQK-PQKRR 314

Query: 2405 KHRPKVIREXXXXXXXXXXXXXXXPSNGTPV-KRKYVRKKDVNISESPQGNGVEISPNGV 2229
            KHRPKVI E                    P+ KRKYVRK      + P     E + +  
Sbjct: 315  KHRPKVIVEGKPKRTPKAATTKITDPKEKPIEKRKYVRKA----LKEPATKPTESTVDTA 370

Query: 2228 PQSSGKRKYVRKKGVDNSDIQQK--------------------------TRAEEATAPVV 2127
            P SS KRKYVRKK +D S +Q                             R  ++T  + 
Sbjct: 371  PPSSAKRKYVRKKALDESAVQHTDSIGETINTHAVKRKYVRKKDLNKSANRHADSTVEIT 430

Query: 2126 ETP---AKSCRKQLNFELEVVKDGSQMRGSQQDINLNARPQDVEQERINSILERSAMKIT 1956
            ++    AKSCR+ L F+LE   D S    + Q   LN +    +   +N+ L+ + +  T
Sbjct: 431  QSSSADAKSCRRALRFDLETATDRSCSNAAAQQDMLNQKRGTFD---LNASLQVADLSTT 487

Query: 1955 ENDRYAGVSTHQESSTNRMQVGTQTMSLPKPNVPTPMAKARDHA----LNVLARNLTMRN 1788
                     T Q S  +R+ V  Q    P    P       D+     + V+A  LT R 
Sbjct: 488  ---------TSQMSQQHRLLVENQQSGAPSNQTPFMNQPRGDYISISEIQVVAAELTPRK 538

Query: 1787 ----------------SVSGKGYNQVGQHVRGQSGTVSTNRDGREPSGRMVN--FEERRG 1662
                            S+  +G  QV    +G   T          S + +     E RG
Sbjct: 539  NMHMEKLNLNAGDVERSIHAQGIGQVVFPEKGPEWTRQITSQNNSQSAQKITPYLIEGRG 598

Query: 1661 IKRQSFEQMHPRNLNAMDSLLMYQKLLLGADLRTDGSNDLANILESHKKTKTQSDHQTFV 1482
             KR+ F   H +  N   +   Y    L      +GS   +   E+ K+ KT+   QT  
Sbjct: 599  FKREHF---HIKKTNPCTA---YPVGSLTDGYDQNGSIPGSGCSETQKRKKTEDGIQT-- 650

Query: 1481 SNTPLGNNFSGEIRRTNGVYGNVSALQLL-NSCTGRVDPSYKVTNAAGGNVNRHHFQPPM 1305
             NT   ++F  +++     Y +  ALQ L   C   + P   +     G  N        
Sbjct: 651  -NTHSISSFVSKVKYPGEWYVHSMALQNLPKQC---ISPQPHLCLEMLGETN-------- 698

Query: 1304 AATQNLQKHPAPSGMQPIAERSQRCTPGHGVNHVTAMVSWNR-PPATPPKDYSR---SAV 1137
              +  +Q    P+ ++     SQ           T+  S N+  P T   + SR    + 
Sbjct: 699  -GSTQVQNSLCPTTIETSHRLSQTSLK-------TSRASDNQLQPKTCNAEMSRIQQMSE 750

Query: 1136 VTYPATLLDKKRTATPNSSNRGPNGADKMLLQLRKDALEVHQQSYTKAKGGPRKQKVSVS 957
             T P ++        P+   + P        Q  KD L+VHQQ Y K +G P KQ  + S
Sbjct: 751  ATVPISI--------PSEKGKIP--------QEPKDDLKVHQQPYAKRRGRPAKQ--TFS 792

Query: 956  VSVEDLTYMLEGLCIYDENEK----RQNALVPYRGNNAIIP---FEPIKKRKPRPKVDLD 798
             ++E + Y +EGL +   ++K     QNALVPY+G+  ++P   FE +KK KPRPKVDLD
Sbjct: 793  STIEQIIYQMEGLRLNAGSKKIENKEQNALVPYKGDGKLVPYDGFEVVKKHKPRPKVDLD 852

Query: 797  PETDRLWRLLMGKEGSEATETLNKDKEKWWEDERRVFRGRADSFIARMHLVQGDRRFSRW 618
            PE+DR+W+LLMGKEGS+  E  +K KE+WW +ER+VF GR DSFIARMHLVQGDRRFS+W
Sbjct: 853  PESDRVWKLLMGKEGSQGLEGTDKGKEQWWGEERKVFHGRVDSFIARMHLVQGDRRFSKW 912

Query: 617  KGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAKFTPKSTSTNKTCCQDMGCILVEEPIETA 438
            KGSVVDSVIGVFLTQNVSDHLSSSAFMSLA+ F P    ++  C ++   I++EEP    
Sbjct: 913  KGSVVDSVIGVFLTQNVSDHLSSSAFMSLASLF-PLKLRSSGACDRERTSIVIEEPDTCI 971

Query: 437  L-PNDSMKCHDKIGRQPVFNQSSFASCESSEHMRH----HISTKATGDKQNRT-SEEVIL 276
            L PND      K    P++NQSS     S+E  +      I   +  + Q+ +  EE +L
Sbjct: 972  LNPNDI-----KWNSNPLYNQSSVTHHGSAEPHKDSETLFIERASMVETQSHSLEEEFVL 1026

Query: 275  SQDSLDSSTIQTVDEIRSSSGSNSEAEDQTTGFETSKEPGPA--NPMQAEKVSMFKELFS 102
            SQDS DSST+Q  + +RS SGSNSEAED  TG + S     +  + +Q E  ++  E + 
Sbjct: 1027 SQDSFDSSTVQ-ANGVRSYSGSNSEAEDPATGCKPSMNDDLSFMDLLQMESPTLLGEFYG 1085

Query: 101  HDNRSTPLNDRSQY 60
             +  S+  +  S++
Sbjct: 1086 CEGGSSLFHKESRH 1099


>emb|CBI40219.3| unnamed protein product [Vitis vinifera]
          Length = 1621

 Score =  343 bits (880), Expect = 2e-91
 Identities = 300/916 (32%), Positives = 422/916 (46%), Gaps = 69/916 (7%)
 Frame = -1

Query: 2933 AHNNSVHALNRTPVPNSSTQVGSNSIGPSTVGSMVGNKKSHTFASNKPMGGYNSKQLPTN 2754
            A   S+   +R  VPNS +Q   N    +++  ++G K++    S+         Q+P  
Sbjct: 88   APERSLLNASRPQVPNSHSQFEINWGEDNSIDMLLG-KENQCSGSSMWKNSNGLLQIPEY 146

Query: 2753 GFPVPYRPCYNLNSPPRSELDAASSGITGPLPFAPITPDTRRK----HTDNQWVPAKDRH 2586
            GFP+PY+P +NLNSPP  E DA SS IT   P  P+TP+  +K      D    P K++ 
Sbjct: 147  GFPIPYQPSFNLNSPPGVEADATSS-ITNSFPCPPVTPERPKKILNFSADEGSSPDKNQE 205

Query: 2585 --EGQRNEDADNHYNEQLQTIGDSTSSAVSTTQK-EHLVSEEGDELGIDLNKTPQQKTPA 2415
                  N   +N  +E L  I  S+S+A  +  K +++V++EGDE GIDLNKTP+QK P 
Sbjct: 206  YITSTTNGATENRCDELLHNIVASSSAAPPSPCKGKNIVAKEGDE-GIDLNKTPKQKQPK 264

Query: 2414 RRKKHRPKVIREXXXXXXXXXXXXXXXPSNGTPVKRKYVRKKDVNISESPQGNGVEISPN 2235
            +RK HRPKV+ E                   TP       K  V  + +P+ N       
Sbjct: 265  KRK-HRPKVVIEGKPKKTPKPKVVIEGKPKKTP-------KPKVPSNSNPKEN------- 309

Query: 2234 GVPQSSGKRKYVRKKG-----VDNSDIQQKTRAEEATAPVVETPAKSCRKQLNFELEVVK 2070
                 +GKRKYVRK        D +D+    R E          AKSC++ LNF  E   
Sbjct: 310  ----PTGKRKYVRKNNPKVPVTDPTDV----RKEILDPSFASATAKSCKRVLNFGEEKSG 361

Query: 2069 DGSQMRGSQQDI---------NLNARPQDVEQ-ERINSILERSAMKITENDRYAGVSTHQ 1920
            DG     SQQ +          LN   Q  E   RIN I         +      V + Q
Sbjct: 362  DGQHDVASQQGVMQQDNEPTFTLNLTSQTKEPCTRINIISGTKVAMQNDQQNELVVKSQQ 421

Query: 1919 ESSTNRMQVGTQTMSLPKPNVPTPMAKARDHAL---NVLAR-----NLTMRNSVSGKGYN 1764
             S+    Q+    +++ K   P       +  L   NV++R     N   R   S   Y 
Sbjct: 422  MSAVESQQISADYIAMLKRYTPAAQPTTENLQLGNLNVISRTVNKGNTDPRQRNSKNAYV 481

Query: 1763 QVGQHVRG--------QSGTVSTNRDGRE--------PSGRMVNFEERRGIKRQSFEQMH 1632
             + QH+          Q  T   N D            + +  N  +  G KR     + 
Sbjct: 482  PIPQHIHADGIGQIVIQPLTTQENLDSSRRQMMQSTSQTNKFANSNQATGSKRDYCHTIE 541

Query: 1631 PRNLNAMDSL--LMYQKLLLGADLRTDGSNDLANIL-ESHKKTKTQ----SDHQTFVSNT 1473
                +A   +   + Q++     +    S++L  +  +  KK KT+    ++  T  S T
Sbjct: 542  QSQAHAAHLIGPSLCQEIF---QVNEYNSSNLCKVFSDMQKKRKTEKAAYTNMSTMASYT 598

Query: 1472 PLGNN--FSGEIRRTNGVYGNVSALQLLNSCTGRVDPSYKVTNAAGGNVNRHHFQPPMAA 1299
              G +     E +  N +   ++   +LN C    + S  + N               A 
Sbjct: 599  TAGEDELHQAEAKSVNQLTSQINH-GILNICFEGNNDSQNLANGVNKTTRDSSMHQTTAG 657

Query: 1298 TQNLQKHPA---PSGMQPIAERSQR-CTPGHGVNHVTAMVSWNRPPATPPKDYSRSA--- 1140
                + H +   PS  + + E+    CT  H +  +TA       P  P K  S S+   
Sbjct: 658  NSMWKHHISNEWPSQTEDMREKQVNGCTQLHRLTVLTAAAKDKLQPPAPIKARSYSSGQH 717

Query: 1139 --VVTYPATLLDKKRTATPNSSNRGPNGADKMLLQLRKDALEVHQQSYTKAKGGPRKQKV 966
                    TL +K++   P  SN   +   K  LQ  KD L  + Q   K +G P K+K 
Sbjct: 718  SIESCRVITLAEKQKE--PLFSNSHSSSTYKPFLQEPKDKLYDYHQPSIKKRGRPAKKKQ 775

Query: 965  SVSVSVEDLTYMLEGLCIYDENEK----RQNALVPYRGNNAIIPFEPIKKRKPRPKVDLD 798
               +    +   L+ L + D + +     +NA++ Y+G+ AIIP+E IKKRKPRPKVDLD
Sbjct: 776  PDPIDA--IIERLKSLELNDTSNETVSQEENAIILYKGDGAIIPYE-IKKRKPRPKVDLD 832

Query: 797  PETDRLWRLLMGKEGSEATETLNKDKEKWWEDERRVFRGRADSFIARMHLVQGDRRFSRW 618
             ET+R+W+LLMG E        ++ K KWWE+ER VFRGRADSFIARMHLVQGDRRFS W
Sbjct: 833  LETERVWKLLMGAEQDVGDS--DERKAKWWEEEREVFRGRADSFIARMHLVQGDRRFSPW 890

Query: 617  KGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAKFTPKSTSTNKTCCQDMGCILVEEPIETA 438
            KGSVVDSVIGVFLTQNVSDHLSSSAFMSL ++F P    +NKT   +   ILVEEP    
Sbjct: 891  KGSVVDSVIGVFLTQNVSDHLSSSAFMSLVSRF-PLHPESNKTSYSNEASILVEEPEVCI 949

Query: 437  L-PNDSMKCHDKIGRQ 393
            + P+D++K H+K+  Q
Sbjct: 950  MNPDDTIKWHEKVSHQ 965


>gb|AEC12445.1| DNA N-glycosylase/DNA-(apurinic or apyrimidinic site) lyase
            [Gossypium hirsutum]
          Length = 2055

 Score =  342 bits (878), Expect = 3e-91
 Identities = 322/1000 (32%), Positives = 457/1000 (45%), Gaps = 102/1000 (10%)
 Frame = -1

Query: 2753 GFPVPYRPCYNLNSPPRSELDAASSGITGPLPFAPITPDTRRKHTDNQWVPAKDRHEGQR 2574
            GFP+P  P  NLNSP R+E+ A S   T           T++    N   PA D +    
Sbjct: 293  GFPIPSMPVCNLNSPARTEVGAPSHFNTSFQSLLATPDQTQKTRKQN---PAADENSVSE 349

Query: 2573 NEDAD------NHYNEQ----LQTIGDSTSSAVSTTQKEHLVSEEGDELGIDLNKTPQQK 2424
             E           +++Q    LQ I DS+S  +S   +E   SE G   GIDLNKTPQQK
Sbjct: 350  KEQESLIVCNKKEFSQQNCDLLQNIVDSSSVIISAPMEEK-DSERGSVQGIDLNKTPQQK 408

Query: 2423 TPARRKKHRPKVIREXXXXXXXXXXXXXXXPSNGTPV-KRKYVRKKDVNISESPQGNGVE 2247
             P +R+KHRPKVI E                S   P  KRKYVR+K +    +   +  +
Sbjct: 409  -PPKRRKHRPKVIVEGKPKRTPKPTTTANVNSKDNPSGKRKYVRRKGLTEPATQHADPTK 467

Query: 2246 ISPNGVPQSSGKRKYVRKKGVDNSDIQ----------------------QKTRAEEATAP 2133
             S +    +  KRKYVRKKG+     Q                       +T  +E+ +P
Sbjct: 468  AS-DSTAGTPAKRKYVRKKGLTELATQHAEVLQTNLLVMLGSTIRGKCMHETNQKESASP 526

Query: 2132 VVE----------TPAKSCRKQLNFELEVVKDGSQMRGSQQDINLNARPQDVEQERINSI 1983
              +             +SCR+ LNF+LE   +GS          L+++  +      +S+
Sbjct: 527  QGDCIRDSDPSPVCAPRSCRRALNFDLENTGNGSLAGTLNHQEMLSSKSSESRSMGFSSV 586

Query: 1982 LERSAMK---ITENDRYAGVSTH--------QESSTNRMQVGTQTMSLPKPNVPTP---M 1845
               S  K    T++++ +G++            S   +  +    MSLP     T     
Sbjct: 587  -GNSGFKTRFTTQSNQQSGLAVENPQLQAECSHSPFMKKMMPIDYMSLPGITAATASRLQ 645

Query: 1844 AKARDHALNVLARNLTMRN-SVSGKGYNQVGQHVRGQSGTV----STNRDGREPSGR--- 1689
            AK     +NV+ARN  M +  ++   Y  VG     +   +     T +   EP      
Sbjct: 646  AKELMENVNVMARNANMYDIDLNQNSYRNVGTLPHSKLSNLFHKEETGKILMEPRNSCLK 705

Query: 1688 ---------MVNFEERRGIKRQSF---EQMHPRNLNAMDSLLMYQKLLLGADLRTDGSND 1545
                     + N  E RG KR  +   EQ        M SLL  Q +    +   +G ++
Sbjct: 706  DTLSQSATVLTNSNEGRGSKRDHYHAIEQGQFSTAGTMSSLLS-QAIFQADEGYRNGCSN 764

Query: 1544 LANILESHKKTKTQSDHQTFVSNTPLGNNFSGEIRRTNGVYG-NVSALQLLNSCTGRVDP 1368
             A   ++ K+   + +   +        + +  + +T G    N      L  C G  DP
Sbjct: 765  EAAFPQASKRRIIEDEFHAYKYGMKCSVSHAAGLLQTKGTNDVNAGQFTSLRDC-GTSDP 823

Query: 1367 SYKVTN---AAGG---NVNRHHFQPPMAATQNLQKHPAPSGMQPIAERSQRCTPGHGVNH 1206
             ++  N     GG    +  + +    A      K    S +    E+      G  + H
Sbjct: 824  HFRSDNIDRRKGGVFSQLTGNRYVNSTAGDLTSSKQNILSQLHSGIEKVGNIN-GLALVH 882

Query: 1205 VTAMVSWNRP---PATPPK-DYSRSAVV--TYPATLLDKKRTATPNSSNRGPNGADKMLL 1044
              A +  NR    P TP K    R+ +V  T+   + + K+   P      P    KM+ 
Sbjct: 883  NLATIE-NRNLLLPTTPEKVSTPRTGLVGQTFHTNVSENKK-REPGLPRNVPFTVGKMVQ 940

Query: 1043 QLRKDALEVHQQSYTKAKGGPRKQKVSVSVSVEDLTYMLEGLCIYDENEK----RQNALV 876
            +  K  +  +QQS TKA+ GP  + VS++  VE++    +GL + ++N K     QNALV
Sbjct: 941  E--KKRVSENQQS-TKAR-GPSAKHVSLN-PVEEIINRFKGLTLEEKNNKPKAELQNALV 995

Query: 875  PYRGNNAIIPFEPIK--KRKPRPKVDLDPETDRLWRLLMGKEGSEATETLNKDKEKWWED 702
             Y G   ++PFE  +  K+K RP+VDLDPET+R+W LLMGKEG +   T   DKEKWWE+
Sbjct: 996  LYNGAGTVVPFEGFESIKKKVRPRVDLDPETNRVWNLLMGKEGEDTEGT---DKEKWWEE 1052

Query: 701  ERRVFRGRADSFIARMHLVQGDRRFSRWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAK 522
            ERRVF GR DSFIARMHLVQGDRRFS+WKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAK
Sbjct: 1053 ERRVFHGRVDSFIARMHLVQGDRRFSKWKGSVVDSVIGVFLTQNVSDHLSSSAFMSLAAK 1112

Query: 521  FTPKSTSTNKTCCQDMGCILVEEPIETAL-PNDSMKCHDKIGRQPVFNQSSFASCESSEH 345
            F P  +S    C  +   IL+EEP    L   +++K H+K  R  + +QSS     S+++
Sbjct: 1113 F-PLKSSCKGDCNAERTTILIEEPEVCELNSEETIKWHEKPFRHQLDSQSSMTPNRSTDY 1171

Query: 344  MRHH-----ISTKATGDKQNRTSEEVILSQDSLDSSTIQTVDEIRSSSGSNSEAEDQTTG 180
             R+        T   G       EEV+ SQ S DSS IQ    IR+ SGS SE ED T  
Sbjct: 1172 QRNSEYSGIERTSFMGTYSQSLEEEVLSSQGSFDSSVIQANGGIRTYSGSYSETEDPTMS 1231

Query: 179  FETSKEPGPANPMQAEKVSMFKELFSHDNRSTPLNDRSQY 60
             +     G +   Q E  +  +E +   + S+ L++  +Y
Sbjct: 1232 CKFLSIHG-STLDQIENSASVEEFYHCASGSSQLHEGIKY 1270

