BLASTX nr result

ID: Angelica22_contig00024623 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00024623
         (1750 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|NP_193469.2| DNA mismatch repair protein MSH4 [Arabidopsis t...   775   0.0  
gb|AAT70180.1| MSH4 [Arabidopsis thaliana]                            773   0.0  
ref|XP_002870107.1| hypothetical protein ARALYDRAFT_493139 [Arab...   773   0.0  
ref|XP_004169721.1| PREDICTED: DNA mismatch repair protein MSH4-...   759   0.0  
ref|XP_003538746.1| PREDICTED: mutS protein homolog 4-like [Glyc...   759   0.0  

>ref|NP_193469.2| DNA mismatch repair protein MSH4 [Arabidopsis thaliana]
            gi|395406788|sp|F4JP48.1|MSH4_ARATH RecName: Full=DNA
            mismatch repair protein MSH4; Short=AtMSH4; AltName:
            Full=MutS protein homolog 4 gi|332658482|gb|AEE83882.1|
            DNA mismatch repair protein MSH4 [Arabidopsis thaliana]
          Length = 792

 Score =  775 bits (2001), Expect = 0.0
 Identities = 392/563 (69%), Positives = 437/563 (77%)
 Frame = +2

Query: 62   MEDDASEKSTFVVALIENRAKEVGVAAFDLRSASLHLSQYIETSRSYQNTKTLMHFYDPM 241
            MEDD  E+S+FV  LIENRAKEVG+AAFDLRSASLHLSQYIETS SYQNTKTL+ FYDP 
Sbjct: 1    MEDDGGERSSFVAGLIENRAKEVGMAAFDLRSASLHLSQYIETSSSYQNTKTLLRFYDPS 60

Query: 242  VIIVPPNKLAPDSMVGVSEFVDRFYSTTRKVVMMRGCFDDTTGAVMVKNLAAKEPSALGL 421
            VIIVPPNKLA D MVGVSE VDR YST RKVV  RGCFDDT GAV+++NLAA+EP ALGL
Sbjct: 61   VIIVPPNKLAADGMVGVSELVDRCYSTVRKVVFARGCFDDTKGAVLIQNLAAEEPLALGL 120

Query: 422  DSYYKQYYLCLXXXXXTIKWIEAEKGIIITNHSLLVTFNGSYDHMNIDATSVHNLEIIEP 601
            D+YYKQ+YL L     TIKWIEAEKG+I+TNHSL VTFNGS+DHMNIDATSV NLE+I+P
Sbjct: 121  DTYYKQHYLSLAAAAATIKWIEAEKGVIVTNHSLTVTFNGSFDHMNIDATSVENLELIDP 180

Query: 602  LQSTLWGTTNKKRSLFQXXXXXXXXXXXXXXXANLLQPLKDIETIKARLDCLDELMTNEQ 781
              + L GT+NKKRSLFQ               ANLLQPLKDIETI  RLDCLDELM+NEQ
Sbjct: 181  FHNALLGTSNKKRSLFQMFKTTKTAGGTRLLRANLLQPLKDIETINTRLDCLDELMSNEQ 240

Query: 782  LFFGLSQALRKFPKETDRVLCHFCFKPKKVTKEVLGIDNARXXXXXXXXXXXXXXXXXXX 961
            LFFGLSQ LRKFPKETDRVLCHFCFKPKKVT+ V+G +N R                   
Sbjct: 241  LFFGLSQVLRKFPKETDRVLCHFCFKPKKVTEAVIGFENTRKSQNMISSIILLKTALDAL 300

Query: 962  XXXSSVLKDAKSFLLGNVYNSVCENEKYASIRKRIGEEIDEDVLHARVPFVARTQQCFAV 1141
               + VLKDAK FLL NVY SVCEN++YASIRK+IGE ID+DVLHARVPFVARTQQCFA+
Sbjct: 301  PILAKVLKDAKCFLLANVYKSVCENDRYASIRKKIGEVIDDDVLHARVPFVARTQQCFAL 360

Query: 1142 KAGIDGMLDIARRTFCDTSEGVLYDCXXXXXXXXXXXXXXXXXTKYTAIHSLANKYREEY 1321
            KAGIDG LDIARRTFCDTSE                           AIH+LA+KYREE+
Sbjct: 361  KAGIDGFLDIARRTFCDTSE---------------------------AIHNLASKYREEF 393

Query: 1322 KLPNLKIPFNNRQGFYFSIPHKDVQGKLPSKFIQVVKHGNNIHCSTPELASLNVRNKSAA 1501
             LPNLK+PFNNRQGF+F IP K+VQGKLP+KF QVVKHG NIHCS+ ELASLNVRNKSAA
Sbjct: 394  NLPNLKLPFNNRQGFFFRIPQKEVQGKLPNKFTQVVKHGKNIHCSSLELASLNVRNKSAA 453

Query: 1502 GECYIRTEFCLEELMDAIRKEVSAXXXXXXXXXXXDMIVNSFANMISTKPVDRYTRPQFT 1681
            GEC+IRTE CLE LMDAIR+++SA           DMIVNSFA+ ISTKPVDRY+RP+ T
Sbjct: 454  GECFIRTETCLEALMDAIREDISALTLLAEVLCLLDMIVNSFAHTISTKPVDRYSRPELT 513

Query: 1682 DDGPLAIDAGRHPILESVHNEFI 1750
            D GPLAIDAGRHPILES+HN+F+
Sbjct: 514  DSGPLAIDAGRHPILESIHNDFV 536


>gb|AAT70180.1| MSH4 [Arabidopsis thaliana]
          Length = 792

 Score =  773 bits (1997), Expect = 0.0
 Identities = 391/563 (69%), Positives = 437/563 (77%)
 Frame = +2

Query: 62   MEDDASEKSTFVVALIENRAKEVGVAAFDLRSASLHLSQYIETSRSYQNTKTLMHFYDPM 241
            MEDD  E+S+FV  LIENRAKEVG+AAFDLRSASLHLSQYIETS SYQNTKTL+ FYDP 
Sbjct: 1    MEDDGGERSSFVAGLIENRAKEVGMAAFDLRSASLHLSQYIETSSSYQNTKTLLRFYDPS 60

Query: 242  VIIVPPNKLAPDSMVGVSEFVDRFYSTTRKVVMMRGCFDDTTGAVMVKNLAAKEPSALGL 421
            VIIVPPNKLA D MVGVSE VDR YST RKVV  RGCFDDT GAV+++NLAA+EP ALGL
Sbjct: 61   VIIVPPNKLAADGMVGVSELVDRCYSTVRKVVFARGCFDDTKGAVLIQNLAAEEPLALGL 120

Query: 422  DSYYKQYYLCLXXXXXTIKWIEAEKGIIITNHSLLVTFNGSYDHMNIDATSVHNLEIIEP 601
            D+YYKQ+YL L     TIKWIEAEKG+I+TNHSL VTFNGS+DHMNIDATSV NLE+I+P
Sbjct: 121  DTYYKQHYLSLAAAAATIKWIEAEKGVIVTNHSLTVTFNGSFDHMNIDATSVENLELIDP 180

Query: 602  LQSTLWGTTNKKRSLFQXXXXXXXXXXXXXXXANLLQPLKDIETIKARLDCLDELMTNEQ 781
              + L GT+NKKRSLFQ               ANLLQPLKDIETI  RLDCLDELM+NEQ
Sbjct: 181  FHNALLGTSNKKRSLFQMFKTTKTAGGTRLLRANLLQPLKDIETINTRLDCLDELMSNEQ 240

Query: 782  LFFGLSQALRKFPKETDRVLCHFCFKPKKVTKEVLGIDNARXXXXXXXXXXXXXXXXXXX 961
            LFFGLSQ LRKFP+ETDRVLCHFCFKPKKVT+ V+G +N R                   
Sbjct: 241  LFFGLSQVLRKFPEETDRVLCHFCFKPKKVTEAVIGFENTRKSQNMISSIILLKTALDAL 300

Query: 962  XXXSSVLKDAKSFLLGNVYNSVCENEKYASIRKRIGEEIDEDVLHARVPFVARTQQCFAV 1141
               + VLKDAK FLL NVY SVCEN++YASIRK+IGE ID+DVLHARVPFVARTQQCFA+
Sbjct: 301  PILAKVLKDAKCFLLANVYKSVCENDRYASIRKKIGEVIDDDVLHARVPFVARTQQCFAL 360

Query: 1142 KAGIDGMLDIARRTFCDTSEGVLYDCXXXXXXXXXXXXXXXXXTKYTAIHSLANKYREEY 1321
            KAGIDG LDIARRTFCDTSE                           AIH+LA+KYREE+
Sbjct: 361  KAGIDGFLDIARRTFCDTSE---------------------------AIHNLASKYREEF 393

Query: 1322 KLPNLKIPFNNRQGFYFSIPHKDVQGKLPSKFIQVVKHGNNIHCSTPELASLNVRNKSAA 1501
             LPNLK+PFNNRQGF+F IP K+VQGKLP+KF QVVKHG NIHCS+ ELASLNVRNKSAA
Sbjct: 394  NLPNLKLPFNNRQGFFFRIPQKEVQGKLPNKFTQVVKHGKNIHCSSLELASLNVRNKSAA 453

Query: 1502 GECYIRTEFCLEELMDAIRKEVSAXXXXXXXXXXXDMIVNSFANMISTKPVDRYTRPQFT 1681
            GEC+IRTE CLE LMDAIR+++SA           DMIVNSFA+ ISTKPVDRY+RP+ T
Sbjct: 454  GECFIRTETCLEALMDAIREDISALTLLAEVLCLLDMIVNSFAHTISTKPVDRYSRPELT 513

Query: 1682 DDGPLAIDAGRHPILESVHNEFI 1750
            D GPLAIDAGRHPILES+HN+F+
Sbjct: 514  DSGPLAIDAGRHPILESIHNDFV 536


>ref|XP_002870107.1| hypothetical protein ARALYDRAFT_493139 [Arabidopsis lyrata subsp.
            lyrata] gi|297315943|gb|EFH46366.1| hypothetical protein
            ARALYDRAFT_493139 [Arabidopsis lyrata subsp. lyrata]
          Length = 792

 Score =  773 bits (1996), Expect = 0.0
 Identities = 390/563 (69%), Positives = 437/563 (77%)
 Frame = +2

Query: 62   MEDDASEKSTFVVALIENRAKEVGVAAFDLRSASLHLSQYIETSRSYQNTKTLMHFYDPM 241
            MEDD  E+S+FV  LIENRAKEVG+AAFDLRSASLHLSQYIETS SYQNTKTL+ FYDP 
Sbjct: 1    MEDDGGERSSFVAGLIENRAKEVGMAAFDLRSASLHLSQYIETSSSYQNTKTLLRFYDPS 60

Query: 242  VIIVPPNKLAPDSMVGVSEFVDRFYSTTRKVVMMRGCFDDTTGAVMVKNLAAKEPSALGL 421
            VIIVPPNKLA D MVGVSE VDR YST RKVV  RGCFDDT GAV+++NLAA+EP ALGL
Sbjct: 61   VIIVPPNKLAADGMVGVSELVDRCYSTVRKVVFARGCFDDTKGAVLIQNLAAEEPLALGL 120

Query: 422  DSYYKQYYLCLXXXXXTIKWIEAEKGIIITNHSLLVTFNGSYDHMNIDATSVHNLEIIEP 601
            D+YYKQ+YL L     TIKWIEAEKG+I+TNHSL +TFNGS+DHMNIDATSV NLEII+P
Sbjct: 121  DTYYKQHYLSLAAAAATIKWIEAEKGVIVTNHSLTITFNGSFDHMNIDATSVENLEIIDP 180

Query: 602  LQSTLWGTTNKKRSLFQXXXXXXXXXXXXXXXANLLQPLKDIETIKARLDCLDELMTNEQ 781
              + L GT+NKKRSLFQ               ANLLQPLKDIETI  RLDCLDELM+NEQ
Sbjct: 181  FHNALLGTSNKKRSLFQMFKTTKTVGGTRLLRANLLQPLKDIETINTRLDCLDELMSNEQ 240

Query: 782  LFFGLSQALRKFPKETDRVLCHFCFKPKKVTKEVLGIDNARXXXXXXXXXXXXXXXXXXX 961
            LFFGLSQ LRKFP+ETDRVLCHFCFKPKKVT+ V+G +N R                   
Sbjct: 241  LFFGLSQVLRKFPQETDRVLCHFCFKPKKVTEAVIGFENTRRSQNMISSIILLKTALDAL 300

Query: 962  XXXSSVLKDAKSFLLGNVYNSVCENEKYASIRKRIGEEIDEDVLHARVPFVARTQQCFAV 1141
               + VLKDAK FLL NVY SVCEN++YASIRK+IGE ID+DVLHARVPFVARTQQCFA+
Sbjct: 301  PLLAKVLKDAKCFLLANVYKSVCENDRYASIRKKIGEVIDDDVLHARVPFVARTQQCFAL 360

Query: 1142 KAGIDGMLDIARRTFCDTSEGVLYDCXXXXXXXXXXXXXXXXXTKYTAIHSLANKYREEY 1321
            KAGIDG LDIARRTFCDTSE                           AIH+LA+KYREE+
Sbjct: 361  KAGIDGFLDIARRTFCDTSE---------------------------AIHNLASKYREEF 393

Query: 1322 KLPNLKIPFNNRQGFYFSIPHKDVQGKLPSKFIQVVKHGNNIHCSTPELASLNVRNKSAA 1501
             LPNLK+PFNNRQGF+F IP K+VQGKLP+KF QVVKHG NIHCS+ ELASLNVRNKSAA
Sbjct: 394  NLPNLKLPFNNRQGFFFRIPQKEVQGKLPNKFTQVVKHGKNIHCSSLELASLNVRNKSAA 453

Query: 1502 GECYIRTEFCLEELMDAIRKEVSAXXXXXXXXXXXDMIVNSFANMISTKPVDRYTRPQFT 1681
            GEC+IRTE CLE LMDAIR+++SA           DMIVNSFA+ ISTKPVDRY+RP+ T
Sbjct: 454  GECFIRTETCLEALMDAIREDISALTLLAEVLCLLDMIVNSFAHTISTKPVDRYSRPELT 513

Query: 1682 DDGPLAIDAGRHPILESVHNEFI 1750
            D GPLAIDAGRHP+LES+HN+F+
Sbjct: 514  DSGPLAIDAGRHPLLESIHNDFV 536


>ref|XP_004169721.1| PREDICTED: DNA mismatch repair protein MSH4-like, partial [Cucumis
            sativus]
          Length = 770

 Score =  759 bits (1961), Expect = 0.0
 Identities = 384/541 (70%), Positives = 425/541 (78%)
 Frame = +2

Query: 128  VGVAAFDLRSASLHLSQYIETSRSYQNTKTLMHFYDPMVIIVPPNKLAPDSMVGVSEFVD 307
            VGVAAFDLRSASLHLSQYIETS SYQNTKTL+HFY+PMVI+V PNKLAPD MVGVS   D
Sbjct: 1    VGVAAFDLRSASLHLSQYIETSSSYQNTKTLLHFYEPMVILVSPNKLAPDGMVGVSVLAD 60

Query: 308  RFYSTTRKVVMMRGCFDDTTGAVMVKNLAAKEPSALGLDSYYKQYYLCLXXXXXTIKWIE 487
            RF++T +KVVM R CFDDT GAV++KNLAAKEPSALGL++YYKQYYLCL     +IKWIE
Sbjct: 61   RFFATVKKVVMARSCFDDTKGAVLIKNLAAKEPSALGLETYYKQYYLCLAAAAASIKWIE 120

Query: 488  AEKGIIITNHSLLVTFNGSYDHMNIDATSVHNLEIIEPLQSTLWGTTNKKRSLFQXXXXX 667
            AEKG+I+TNHSLLVTFNGS DH++IDATSV NLEIIEPL S LWGT+NKKRSL+      
Sbjct: 121  AEKGVIVTNHSLLVTFNGSSDHVSIDATSVQNLEIIEPLHSNLWGTSNKKRSLYNMLKTT 180

Query: 668  XXXXXXXXXXANLLQPLKDIETIKARLDCLDELMTNEQLFFGLSQALRKFPKETDRVLCH 847
                      ANLLQPLKDIETI ARLDCLDELM+NEQLFFGLSQALRKFPKETDRVLCH
Sbjct: 181  KTIGGSRLLRANLLQPLKDIETINARLDCLDELMSNEQLFFGLSQALRKFPKETDRVLCH 240

Query: 848  FCFKPKKVTKEVLGIDNARXXXXXXXXXXXXXXXXXXXXXXSSVLKDAKSFLLGNVYNSV 1027
            FCFK KKVT EVL   +A+                      S VLK+AKSFLL N+Y S+
Sbjct: 241  FCFKQKKVTNEVLHPSDAKKSQNLISSIILLKTALEALPLLSKVLKEAKSFLLANIYKSI 300

Query: 1028 CENEKYASIRKRIGEEIDEDVLHARVPFVARTQQCFAVKAGIDGMLDIARRTFCDTSEGV 1207
            CENEKY +IRKRIGE IDEDVLHARVPF+ARTQQCFAVKAGIDG+LDIARRTFCDTSE  
Sbjct: 301  CENEKYTNIRKRIGEVIDEDVLHARVPFIARTQQCFAVKAGIDGLLDIARRTFCDTSE-- 358

Query: 1208 LYDCXXXXXXXXXXXXXXXXXTKYTAIHSLANKYREEYKLPNLKIPFNNRQGFYFSIPHK 1387
                                     AIH+LANKYREEYKL NLK+PFNNRQGFY SIPHK
Sbjct: 359  -------------------------AIHNLANKYREEYKLSNLKLPFNNRQGFYLSIPHK 393

Query: 1388 DVQGKLPSKFIQVVKHGNNIHCSTPELASLNVRNKSAAGECYIRTEFCLEELMDAIRKEV 1567
            DVQGKLP+KFIQV+KHGNNI CST ELASLNVRNKSAAGECYIRTE CLE L+DAIR++V
Sbjct: 394  DVQGKLPNKFIQVLKHGNNIRCSTLELASLNVRNKSAAGECYIRTEICLEGLVDAIREDV 453

Query: 1568 SAXXXXXXXXXXXDMIVNSFANMISTKPVDRYTRPQFTDDGPLAIDAGRHPILESVHNEF 1747
            S            DMIVNSFA+ ISTKPVDRYTRP FT++GP+AI+A RHPILES+HN+F
Sbjct: 454  SMLTLLAEVLCLLDMIVNSFAHTISTKPVDRYTRPNFTENGPMAIEAARHPILESIHNDF 513

Query: 1748 I 1750
            +
Sbjct: 514  V 514


>ref|XP_003538746.1| PREDICTED: mutS protein homolog 4-like [Glycine max]
          Length = 794

 Score =  759 bits (1961), Expect = 0.0
 Identities = 381/563 (67%), Positives = 431/563 (76%)
 Frame = +2

Query: 62   MEDDASEKSTFVVALIENRAKEVGVAAFDLRSASLHLSQYIETSRSYQNTKTLMHFYDPM 241
            MEDD  E S+FVV +IENRAKEVG+AAFDLRSASLHLSQYIETS SYQNTKTL+HFYDP+
Sbjct: 1    MEDDGGESSSFVVGIIENRAKEVGLAAFDLRSASLHLSQYIETSSSYQNTKTLLHFYDPI 60

Query: 242  VIIVPPNKLAPDSMVGVSEFVDRFYSTTRKVVMMRGCFDDTTGAVMVKNLAAKEPSALGL 421
            VII+PPNKLA +S  GV+E VDRFY + ++V++ RGCFDDT GA+++KNLAAKEPSALGL
Sbjct: 61   VIIIPPNKLASNSTAGVTELVDRFYGSVKQVMLARGCFDDTKGAILIKNLAAKEPSALGL 120

Query: 422  DSYYKQYYLCLXXXXXTIKWIEAEKGIIITNHSLLVTFNGSYDHMNIDATSVHNLEIIEP 601
            D+YYKQYYLCL     T+KW EAEKG+++TNHSL VTFNGS+DHMNID+TS+ NLEIIEP
Sbjct: 121  DTYYKQYYLCLAAAAATLKWTEAEKGVVVTNHSLSVTFNGSFDHMNIDSTSIQNLEIIEP 180

Query: 602  LQSTLWGTTNKKRSLFQXXXXXXXXXXXXXXXANLLQPLKDIETIKARLDCLDELMTNEQ 781
              STL GT+NKKRSLF                ANLLQPLKDIETI ARLDCLDELM+NEQ
Sbjct: 181  FHSTLLGTSNKKRSLFHMLKTTKTIGGSRLLRANLLQPLKDIETINARLDCLDELMSNEQ 240

Query: 782  LFFGLSQALRKFPKETDRVLCHFCFKPKKVTKEVLGIDNARXXXXXXXXXXXXXXXXXXX 961
            LFFGL Q LRKFPKETDRVLCHFCFK KKVT E L +D A+                   
Sbjct: 241  LFFGLCQILRKFPKETDRVLCHFCFKAKKVTAEALAVDRAKKSQVLVSSVILLKTALDAL 300

Query: 962  XXXSSVLKDAKSFLLGNVYNSVCENEKYASIRKRIGEEIDEDVLHARVPFVARTQQCFAV 1141
               S VLKD KS LL N+Y SVCENEKY  IRKRIGE IDEDVLHARVPFVA TQQCFAV
Sbjct: 301  PLLSKVLKDVKSSLLSNIYKSVCENEKYDLIRKRIGEVIDEDVLHARVPFVACTQQCFAV 360

Query: 1142 KAGIDGMLDIARRTFCDTSEGVLYDCXXXXXXXXXXXXXXXXXTKYTAIHSLANKYREEY 1321
            KAGIDG+LDI+RR FC+TSE                           AIH+LAN YRE++
Sbjct: 361  KAGIDGLLDISRRAFCETSE---------------------------AIHNLANNYREDF 393

Query: 1322 KLPNLKIPFNNRQGFYFSIPHKDVQGKLPSKFIQVVKHGNNIHCSTPELASLNVRNKSAA 1501
            KLPNLK+ + NRQGF+F IP K++QGKLPSKFIQVVKHGNNI CS+ ELASLN RNKSAA
Sbjct: 394  KLPNLKLTYKNRQGFHFVIPQKNIQGKLPSKFIQVVKHGNNIRCSSLELASLNARNKSAA 453

Query: 1502 GECYIRTEFCLEELMDAIRKEVSAXXXXXXXXXXXDMIVNSFANMISTKPVDRYTRPQFT 1681
             ECY RTE CLEELMD IR+ VS            DMIVNSFA+MISTKPVDRYTRP+FT
Sbjct: 454  AECYTRTEVCLEELMDDIRENVSVLTLLAEVLCLLDMIVNSFAHMISTKPVDRYTRPEFT 513

Query: 1682 DDGPLAIDAGRHPILESVHNEFI 1750
            ++GPLAID+GRHPILES+HN+F+
Sbjct: 514  ENGPLAIDSGRHPILESIHNDFV 536

