BLASTX nr result

ID: Coptis21_contig00006691 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00006691
         (1504 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002263286.2| PREDICTED: DNA mismatch repair protein Msh3-...   441   e-121
ref|XP_002510803.1| DNA mismatch repair protein MSH3, putative [...   410   e-112
ref|XP_003556101.1| PREDICTED: DNA mismatch repair protein Msh3-...   399   e-108
ref|XP_002867605.1| hypothetical protein ARALYDRAFT_492273 [Arab...   389   e-106
ref|NP_194284.2| DNA mismatch repair protein Msh3 [Arabidopsis t...   387   e-105
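
The one-line summaries above can be read programmatically. The following is a minimal sketch, not part of the BLAST output, assuming the legacy layout shown here (identifier, truncated description, bit score, E-value) and that an E-value printed as `e-121` stands for 1e-121. The names `HIT_LINE`, `parse_evalue` and `parse_hits` are hypothetical helpers introduced only for illustration.

```python
import re

# One line of the legacy BLAST summary table:
# identifier, truncated description, bit score, E-value.
HIT_LINE = re.compile(r"^(\S+)\s+(.*?)\s+(\d+)\s+(\S+)$")


def parse_evalue(text):
    """Convert an E-value string to float; legacy BLAST omits the leading 1."""
    return float("1" + text if text.startswith("e") else text)


def parse_hits(lines):
    """Return (identifier, description, bit_score, evalue) for each hit line."""
    hits = []
    for line in lines:
        match = HIT_LINE.match(line.strip())
        if match:
            ident, desc, bits, evalue = match.groups()
            hits.append((ident, desc, int(bits), parse_evalue(evalue)))
    return hits


# Two lines copied from the summary table of this report.
summary = [
    "ref|XP_002263286.2| PREDICTED: DNA mismatch repair protein Msh3-...   441   e-121",
    "ref|NP_194284.2| DNA mismatch repair protein Msh3 [Arabidopsis t...   387   e-105",
]
hits = parse_hits(summary)
for ident, desc, bits, evalue in hits:
    print(ident, bits, evalue)
```

For anything beyond a quick look, an established BLAST parser is likely a safer choice than a hand-written regular expression, since descriptions are truncated in this table and the layout differs between BLAST versions.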

>ref|XP_002263286.2| PREDICTED: DNA mismatch repair protein Msh3-like [Vitis vinifera]
          Length = 1137

 Score =  441 bits (1135), Expect = e-121
 Identities = 235/405 (58%), Positives = 291/405 (71%), Gaps = 8/405 (1%)
 Frame = +1

Query: 1    ELALACRSAWDRFLEGFSKYYAEFQASVQXXXXXXXXXXXXXXSRNKNYVRPVFVDDTES 180
            EL +ACR AWD FL  F KY++EFQA+VQ              SRNKNYVRPVFV D+E 
Sbjct: 725  ELMIACRGAWDSFLRAFDKYFSEFQAAVQALATLDCLHSLAILSRNKNYVRPVFVGDSEP 784

Query: 181  VQIHIHSGRHPVLDLILQDSFVPNDTNLNAEGEYCQIVTGPNMGGKSCYIRQVALLSIMA 360
            VQ+HI SGRHPVL+ +LQD+FVPNDTNL+A+GEYC+IVTGPNMGGKSCYIRQVAL++IMA
Sbjct: 785  VQMHICSGRHPVLETVLQDNFVPNDTNLHADGEYCEIVTGPNMGGKSCYIRQVALIAIMA 844

Query: 361  QVGSFVPASSAKLHVLDGIYTRMGASDNIQQGRSTFLDELSEASNILRNSTSRSLVIIDE 540
            QVGSFVPASSAKL VLDGI+TRMG+SD+IQQGRSTFL+ELSEAS+I+ N TSRSLVIIDE
Sbjct: 845  QVGSFVPASSAKLCVLDGIHTRMGSSDSIQQGRSTFLEELSEASHIIHNCTSRSLVIIDE 904

Query: 541  LGRATSTHDGVAIAYATLHHLL-GKKCMILFVTHYPKIVDIKNDFPKSVGDYHVSYLTSE 717
            LGR TSTHDGVAIAYATLH+LL  K+CM+LFVTHYPKIVD+KN+FP SVG YHVSY+ S+
Sbjct: 905  LGRGTSTHDGVAIAYATLHYLLEHKRCMVLFVTHYPKIVDVKNEFPGSVGAYHVSYMMSQ 964

Query: 718  NAQEV------TDSELDCN-ENMARGDITFLYKVVPGTSDKSFGLNVARLAQLPSXXXXX 876
             A ++      TDS+ D N + M   D+T+LYK+VPG S++SFG  VA+LAQLPS     
Sbjct: 965  RAMDMDTDTDKTDSKSDKNAQTMDHEDVTYLYKLVPGVSERSFGFKVAQLAQLPSSCIRR 1024

Query: 877  XXXXXXKLEELVARRMAIQQGKKLLLDTESTFEGNVQVXXXXXXXXXIGEQLYNYSSING 1056
                  +LE ++  R+     +K       T +G+ Q              +   S  + 
Sbjct: 1025 ANVMAAELEAMIVSRVKNSSAQK-------TLQGSQQ-------------SISIQSGCSR 1064

Query: 1057 TGELAELTDACHEVFANLKCALGHSDPMKSFQSLEHASNLAVELM 1191
              ++    DAC E F +LK ALG++DP +S Q L+HA ++A EL+
Sbjct: 1065 AEQIGLEEDACREFFLDLKSALGNADPERSLQFLKHARSIAKELI 1109


>ref|XP_002510803.1| DNA mismatch repair protein MSH3, putative [Ricinus communis]
            gi|223549918|gb|EEF51405.1| DNA mismatch repair protein
            MSH3, putative [Ricinus communis]
          Length = 1100

 Score =  410 bits (1054), Expect = e-112
 Identities = 228/401 (56%), Positives = 275/401 (68%), Gaps = 2/401 (0%)
 Frame = +1

Query: 1    ELALACRSAWDRFLEGFSKYYAEFQASVQXXXXXXXXXXXXXXSRNKNYVRPVFVDDTES 180
            EL + CR+AWD FL  F+K+YAEFQA +Q              S+NKNYVRPVFVDD E 
Sbjct: 716  ELMVVCRAAWDSFLRSFAKHYAEFQAVIQALAALDCLHSLAILSKNKNYVRPVFVDDNEP 775

Query: 181  VQIHIHSGRHPVLDLILQDSFVPNDTNLNAEGEYCQIVTGPNMGGKSCYIRQVALLSIMA 360
            VQIHI SGRHPVL+ IL D+FVPNDT L+ +GE+CQ+VTGPNMGGKSCYIRQVAL+ +MA
Sbjct: 776  VQIHISSGRHPVLETILLDNFVPNDTCLHVDGEHCQVVTGPNMGGKSCYIRQVALIVMMA 835

Query: 361  QVGSFVPASSAKLHVLDGIYTRMGASDNIQQGRSTFLDELSEASNILRNSTSRSLVIIDE 540
            QVGSFVPASSAKLHVLDGIYTRMGASD+IQQGRSTFL+ELSE S+ILR  T  SLVIIDE
Sbjct: 836  QVGSFVPASSAKLHVLDGIYTRMGASDSIQQGRSTFLEELSETSHILRKCTGYSLVIIDE 895

Query: 541  LGRATSTHDGVAIAYATLHHLL-GKKCMILFVTHYPKIVDIKNDFPKSVGDYHVSYLTSE 717
            LGR TSTHDG AIAYATL HLL  K+CM+LFVTHYPKI +I+  F  SVG YHVSYL +E
Sbjct: 896  LGRGTSTHDGEAIAYATLCHLLEQKRCMVLFVTHYPKIANIRTGFLNSVGAYHVSYLMAE 955

Query: 718  NAQEVTDSELDCNENMARGDITFLYKVVPGTSDKSFGLNVARLAQLPSXXXXXXXXXXXK 897
               + TDS+ D NE     D+T+LYK+VPG S++SFG  VA+LAQLP+           +
Sbjct: 956  KNNDATDSKFD-NE-----DVTYLYKLVPGVSERSFGFKVAQLAQLPTSCIERATVMAAR 1009

Query: 898  LEELVARRMAIQQGKKLLLDTESTFE-GNVQVXXXXXXXXXIGEQLYNYSSINGTGELAE 1074
            LEE ++ R+  +  K  LL      +   +Q            +++ NY  +N T E   
Sbjct: 1010 LEEAISCRIRNRLDKSQLLKALQIDQLQEIQEKIPESPGNFHDKRIENYEELNNTYE--- 1066

Query: 1075 LTDACHEVFANLKCALGHSDPMKSFQSLEHASNLAVELMKR 1197
                  + F N K AL   D  KSFQ LE+A ++A  L+KR
Sbjct: 1067 ------KFFLNFKSAL-LGDDAKSFQYLENARSIARALIKR 1100


>ref|XP_003556101.1| PREDICTED: DNA mismatch repair protein Msh3-like [Glycine max]
          Length = 1070

 Score =  399 bits (1024), Expect = e-108
 Identities = 226/405 (55%), Positives = 272/405 (67%), Gaps = 1/405 (0%)
 Frame = +1

Query: 1    ELALACRSAWDRFLEGFSKYYAEFQASVQXXXXXXXXXXXXXXSRNKNYVRPVFVDDTES 180
            EL +ACR+AW+ FL  FSK+YAEFQA+VQ              SRNK YV PVFVDD E 
Sbjct: 692  ELTVACRAAWNNFLTDFSKHYAEFQAAVQALAALDCLHSLAILSRNKGYVCPVFVDDHEP 751

Query: 181  VQIHIHSGRHPVLDLILQDSFVPNDTNLNAEGEYCQIVTGPNMGGKSCYIRQVALLSIMA 360
            VQI I SGRHPVL+  LQD+FVPNDTN++A+GEYCQIVTGPNMGGKSCYIRQVAL+ IMA
Sbjct: 752  VQIQISSGRHPVLETTLQDNFVPNDTNMHADGEYCQIVTGPNMGGKSCYIRQVALIVIMA 811

Query: 361  QVGSFVPASSAKLHVLDGIYTRMGASDNIQQGRSTFLDELSEASNILRNSTSRSLVIIDE 540
            QVGSFVPASSAKLHVLD IYTRMGASD+IQ GRSTFL+ELSE S+IL + T  SLVIIDE
Sbjct: 812  QVGSFVPASSAKLHVLDRIYTRMGASDSIQLGRSTFLEELSETSHILNSCTEHSLVIIDE 871

Query: 541  LGRATSTHDGVAIAYATLHHLL-GKKCMILFVTHYPKIVDIKNDFPKSVGDYHVSYLTSE 717
            LGR TSTHDG+AIA+ATLH+LL  K+ M+LFVTHYPKI  +  +FP SV  YHVS+L S 
Sbjct: 872  LGRGTSTHDGMAIAHATLHYLLKQKRSMVLFVTHYPKIASLATEFPGSVAAYHVSHLISH 931

Query: 718  NAQEVTDSELDCNENMARGDITFLYKVVPGTSDKSFGLNVARLAQLPSXXXXXXXXXXXK 897
            +A +  +S LD        D+T+LYK+VPG S++SFG  VA+LAQLPS           K
Sbjct: 932  DASK--NSNLD-------HDVTYLYKLVPGVSERSFGFKVAQLAQLPSHCISRAIVMASK 982

Query: 898  LEELVARRMAIQQGKKLLLDTESTFEGNVQVXXXXXXXXXIGEQLYNYSSINGTGELAEL 1077
            LE LV  R+  +  K+LLLDT                   IG++     + +      E 
Sbjct: 983  LEALVNSRIHGRSTKELLLDT-----------------LVIGQEKEQLMAQSLDRPHKEF 1025

Query: 1078 TDACHEVFANLKCALGHSDPMKSFQSLEHASNLAVELMKRLAVHV 1212
              A  + + NLK A    D  KSF  LEHA ++A +L+ R   +V
Sbjct: 1026 DMAYKDFYLNLKAATEDDDWAKSFHLLEHARSIAKKLIGRSMQYV 1070


>ref|XP_002867605.1| hypothetical protein ARALYDRAFT_492273 [Arabidopsis lyrata subsp.
            lyrata] gi|297313441|gb|EFH43864.1| hypothetical protein
            ARALYDRAFT_492273 [Arabidopsis lyrata subsp. lyrata]
          Length = 1078

 Score =  389 bits (1000), Expect = e-106
 Identities = 216/396 (54%), Positives = 267/396 (67%), Gaps = 2/396 (0%)
 Frame = +1

Query: 4    LALACRSAWDRFLEGFSKYYAEFQASVQXXXXXXXXXXXXXXSRNKNYVRPVFVDDTESV 183
            LA+  R++WD FLE FS+YY +FQA+VQ              S+NK YV PVFVDD E V
Sbjct: 717  LAIVNRASWDSFLESFSRYYTDFQAAVQALAALDCLHSLATLSKNKKYVCPVFVDDCEPV 776

Query: 184  QIHIHSGRHPVLDLILQDSFVPNDTNLNAEGEYCQIVTGPNMGGKSCYIRQVALLSIMAQ 363
            +I+I SGRHPVL+ +LQD+FVPNDT+L+AEGEYCQI+TGPNMGGKSCYIRQVAL+SIMAQ
Sbjct: 777  EINIQSGRHPVLETLLQDNFVPNDTSLHAEGEYCQIITGPNMGGKSCYIRQVALISIMAQ 836

Query: 364  VGSFVPASSAKLHVLDGIYTRMGASDNIQQGRSTFLDELSEASNILRNSTSRSLVIIDEL 543
            VGSFVPASS KLHVLDG++TRMGASD+IQ GRSTFL+ELSEAS+I+R  +SRSLVI+DEL
Sbjct: 837  VGSFVPASSVKLHVLDGVFTRMGASDSIQHGRSTFLEELSEASHIIRTCSSRSLVILDEL 896

Query: 544  GRATSTHDGVAIAYATLHH-LLGKKCMILFVTHYPKIVDIKNDFPKSVGDYHVSYLTSEN 720
            GR TSTHDGVAIAYATL H LL K+C++LFVTHYP+I +I N F  SVG YHVSYLTS+ 
Sbjct: 897  GRGTSTHDGVAIAYATLQHLLLEKRCLVLFVTHYPEIAEISNGFRGSVGTYHVSYLTSQK 956

Query: 721  AQEVTDSELDCNENMARGDITFLYKVVPGTSDKSFGLNVARLAQLPSXXXXXXXXXXXKL 900
             +   D +          D+T+LYK+V G   +SFG  VA+LAQ+PS           KL
Sbjct: 957  KKSGFDHD----------DVTYLYKLVRGLCSRSFGFKVAQLAQIPSSCIRRAISMGAKL 1006

Query: 901  E-ELVARRMAIQQGKKLLLDTESTFEGNVQVXXXXXXXXXIGEQLYNYSSINGTGELAEL 1077
            E E+ AR    + G+          EG+ +           G+      SI+  G+L   
Sbjct: 1007 EAEVGARERNTRMGEA---------EGHEE-------HGAPGDWTGAEESISALGDL--- 1047

Query: 1078 TDACHEVFANLKCALGHSDPMKSFQSLEHASNLAVE 1185
                   FA+LK AL   DP K+F+ L HA  +A +
Sbjct: 1048 -------FADLKFALSEEDPWKAFEFLNHAWKIAAK 1076


>ref|NP_194284.2| DNA mismatch repair protein Msh3 [Arabidopsis thaliana]
            gi|12644077|sp|O65607.2|MSH3_ARATH RecName: Full=DNA
            mismatch repair protein MSH3; Short=AtMSH3; AltName:
            Full=MutS protein homolog 3 gi|3319876|emb|CAA07684.1|
            Msh3 protein [Arabidopsis thaliana]
            gi|332659675|gb|AEE85075.1| DNA mismatch repair protein
            Msh3 [Arabidopsis thaliana]
          Length = 1081

 Score =  387 bits (994), Expect = e-105
 Identities = 214/393 (54%), Positives = 261/393 (66%), Gaps = 1/393 (0%)
 Frame = +1

Query: 4    LALACRSAWDRFLEGFSKYYAEFQASVQXXXXXXXXXXXXXXSRNKNYVRPVFVDDTESV 183
            LA+  R++WD FL+ FS+YY +F+A+VQ              SRNKNYVRP FVDD E V
Sbjct: 719  LAIVNRASWDSFLKSFSRYYTDFKAAVQALAALDCLHSLSTLSRNKNYVRPEFVDDCEPV 778

Query: 184  QIHIHSGRHPVLDLILQDSFVPNDTNLNAEGEYCQIVTGPNMGGKSCYIRQVALLSIMAQ 363
            +I+I SGRHPVL+ ILQD+FVPNDT L+AEGEYCQI+TGPNMGGKSCYIRQVAL+SIMAQ
Sbjct: 779  EINIQSGRHPVLETILQDNFVPNDTILHAEGEYCQIITGPNMGGKSCYIRQVALISIMAQ 838

Query: 364  VGSFVPASSAKLHVLDGIYTRMGASDNIQQGRSTFLDELSEASNILRNSTSRSLVIIDEL 543
            VGSFVPAS AKLHVLDG++TRMGASD+IQ GRSTFL+ELSEAS+I+R  +SRSLVI+DEL
Sbjct: 839  VGSFVPASFAKLHVLDGVFTRMGASDSIQHGRSTFLEELSEASHIIRTCSSRSLVILDEL 898

Query: 544  GRATSTHDGVAIAYATLHHLLG-KKCMILFVTHYPKIVDIKNDFPKSVGDYHVSYLTSEN 720
            GR TSTHDGVAIAYATL HLL  K+C++LFVTHYP+I +I N FP SVG YHVSYLT + 
Sbjct: 899  GRGTSTHDGVAIAYATLQHLLAEKRCLVLFVTHYPEIAEISNGFPGSVGTYHVSYLTLQK 958

Query: 721  AQEVTDSELDCNENMARGDITFLYKVVPGTSDKSFGLNVARLAQLPSXXXXXXXXXXXKL 900
             +   D +          D+T+LYK+V G   +SFG  VA+LAQ+P            KL
Sbjct: 959  DKGSYDHD----------DVTYLYKLVRGLCSRSFGFKVAQLAQIPPSCIRRAISMAAKL 1008

Query: 901  EELVARRMAIQQGKKLLLDTESTFEGNVQVXXXXXXXXXIGEQLYNYSSINGTGELAELT 1080
            E  V  R                 E N ++          G +     SI+  G+L    
Sbjct: 1009 EAEVRAR-----------------ERNTRMGEPEGHEEPRGAE----ESISALGDL---- 1043

Query: 1081 DACHEVFANLKCALGHSDPMKSFQSLEHASNLA 1179
                  FA+LK AL   DP K+F+ L+HA  +A
Sbjct: 1044 ------FADLKFALSEEDPWKAFEFLKHAWKIA 1070

