BLASTX nr result
ID: Angelica22_contig00024623
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00024623
         (1750 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|NP_193469.2| DNA mismatch repair protein MSH4 [Arabidopsis t...   775   0.0
gb|AAT70180.1| MSH4 [Arabidopsis thaliana]                            773   0.0
ref|XP_002870107.1| hypothetical protein ARALYDRAFT_493139 [Arab...   773   0.0
ref|XP_004169721.1| PREDICTED: DNA mismatch repair protein MSH4-...   759   0.0
ref|XP_003538746.1| PREDICTED: mutS protein homolog 4-like [Glyc...   759   0.0

>ref|NP_193469.2| DNA mismatch repair protein MSH4 [Arabidopsis thaliana]
 gi|395406788|sp|F4JP48.1|MSH4_ARATH RecName: Full=DNA mismatch repair protein MSH4;
 Short=AtMSH4; AltName: Full=MutS protein homolog 4
 gi|332658482|gb|AEE83882.1| DNA mismatch repair protein MSH4 [Arabidopsis thaliana]
          Length = 792

 Score = 775 bits (2001), Expect = 0.0
 Identities = 392/563 (69%), Positives = 437/563 (77%)
 Frame = +2

Query: 62   MEDDASEKSTFVVALIENRAKEVGVAAFDLRSASLHLSQYIETSRSYQNTKTLMHFYDPM 241
            MEDD  E+S+FV  LIENRAKEVG+AAFDLRSASLHLSQYIETS SYQNTKTL+ FYDP
Sbjct: 1    MEDDGGERSSFVAGLIENRAKEVGMAAFDLRSASLHLSQYIETSSSYQNTKTLLRFYDPS 60

Query: 242  VIIVPPNKLAPDSMVGVSEFVDRFYSTTRKVVMMRGCFDDTTGAVMVKNLAAKEPSALGL 421
            VIIVPPNKLA D MVGVSE VDR YST RKVV  RGCFDDT GAV+++NLAA+EP ALGL
Sbjct: 61   VIIVPPNKLAADGMVGVSELVDRCYSTVRKVVFARGCFDDTKGAVLIQNLAAEEPLALGL 120

Query: 422  DSYYKQYYLCLXXXXXTIKWIEAEKGIIITNHSLLVTFNGSYDHMNIDATSVHNLEIIEP 601
            D+YYKQ+YL L     TIKWIEAEKG+I+TNHSL VTFNGS+DHMNIDATSV NLE+I+P
Sbjct: 121  DTYYKQHYLSLAAAAATIKWIEAEKGVIVTNHSLTVTFNGSFDHMNIDATSVENLELIDP 180

Query: 602  LQSTLWGTTNKKRSLFQXXXXXXXXXXXXXXXANLLQPLKDIETIKARLDCLDELMTNEQ 781
              + L GT+NKKRSLFQ               ANLLQPLKDIETI  RLDCLDELM+NEQ
Sbjct: 181  FHNALLGTSNKKRSLFQMFKTTKTAGGTRLLRANLLQPLKDIETINTRLDCLDELMSNEQ 240

Query: 782  LFFGLSQALRKFPKETDRVLCHFCFKPKKVTKEVLGIDNARXXXXXXXXXXXXXXXXXXX 961
            LFFGLSQ LRKFPKETDRVLCHFCFKPKKVT+ V+G +N R
Sbjct: 241  LFFGLSQVLRKFPKETDRVLCHFCFKPKKVTEAVIGFENTRKSQNMISSIILLKTALDAL 300

Query: 962  XXXSSVLKDAKSFLLGNVYNSVCENEKYASIRKRIGEEIDEDVLHARVPFVARTQQCFAV 1141
               + VLKDAK FLL NVY SVCEN++YASIRK+IGE ID+DVLHARVPFVARTQQCFA+
Sbjct: 301  PILAKVLKDAKCFLLANVYKSVCENDRYASIRKKIGEVIDDDVLHARVPFVARTQQCFAL 360

Query: 1142 KAGIDGMLDIARRTFCDTSEGVLYDCXXXXXXXXXXXXXXXXXTKYTAIHSLANKYREEY 1321
            KAGIDG LDIARRTFCDTSE                           AIH+LA+KYREE+
Sbjct: 361  KAGIDGFLDIARRTFCDTSE---------------------------AIHNLASKYREEF 393

Query: 1322 KLPNLKIPFNNRQGFYFSIPHKDVQGKLPSKFIQVVKHGNNIHCSTPELASLNVRNKSAA 1501
             LPNLK+PFNNRQGF+F IP K+VQGKLP+KF QVVKHG NIHCS+ ELASLNVRNKSAA
Sbjct: 394  NLPNLKLPFNNRQGFFFRIPQKEVQGKLPNKFTQVVKHGKNIHCSSLELASLNVRNKSAA 453

Query: 1502 GECYIRTEFCLEELMDAIRKEVSAXXXXXXXXXXXDMIVNSFANMISTKPVDRYTRPQFT 1681
            GEC+IRTE CLE LMDAIR+++SA           DMIVNSFA+ ISTKPVDRY+RP+ T
Sbjct: 454  GECFIRTETCLEALMDAIREDISALTLLAEVLCLLDMIVNSFAHTISTKPVDRYSRPELT 513

Query: 1682 DDGPLAIDAGRHPILESVHNEFI 1750
            D GPLAIDAGRHPILES+HN+F+
Sbjct: 514  DSGPLAIDAGRHPILESIHNDFV 536

>gb|AAT70180.1| MSH4 [Arabidopsis thaliana]
          Length = 792

 Score = 773 bits (1997), Expect = 0.0
 Identities = 391/563 (69%), Positives = 437/563 (77%)
 Frame = +2

Query: 62   MEDDASEKSTFVVALIENRAKEVGVAAFDLRSASLHLSQYIETSRSYQNTKTLMHFYDPM 241
            MEDD  E+S+FV  LIENRAKEVG+AAFDLRSASLHLSQYIETS SYQNTKTL+ FYDP
Sbjct: 1    MEDDGGERSSFVAGLIENRAKEVGMAAFDLRSASLHLSQYIETSSSYQNTKTLLRFYDPS 60

Query: 242  VIIVPPNKLAPDSMVGVSEFVDRFYSTTRKVVMMRGCFDDTTGAVMVKNLAAKEPSALGL 421
            VIIVPPNKLA D MVGVSE VDR YST RKVV  RGCFDDT GAV+++NLAA+EP ALGL
Sbjct: 61   VIIVPPNKLAADGMVGVSELVDRCYSTVRKVVFARGCFDDTKGAVLIQNLAAEEPLALGL 120

Query: 422  DSYYKQYYLCLXXXXXTIKWIEAEKGIIITNHSLLVTFNGSYDHMNIDATSVHNLEIIEP 601
            D+YYKQ+YL L     TIKWIEAEKG+I+TNHSL VTFNGS+DHMNIDATSV NLE+I+P
Sbjct: 121  DTYYKQHYLSLAAAAATIKWIEAEKGVIVTNHSLTVTFNGSFDHMNIDATSVENLELIDP 180

Query: 602  LQSTLWGTTNKKRSLFQXXXXXXXXXXXXXXXANLLQPLKDIETIKARLDCLDELMTNEQ 781
              + L GT+NKKRSLFQ               ANLLQPLKDIETI  RLDCLDELM+NEQ
Sbjct: 181  FHNALLGTSNKKRSLFQMFKTTKTAGGTRLLRANLLQPLKDIETINTRLDCLDELMSNEQ 240

Query: 782  LFFGLSQALRKFPKETDRVLCHFCFKPKKVTKEVLGIDNARXXXXXXXXXXXXXXXXXXX 961
            LFFGLSQ LRKFP+ETDRVLCHFCFKPKKVT+ V+G +N R
Sbjct: 241  LFFGLSQVLRKFPEETDRVLCHFCFKPKKVTEAVIGFENTRKSQNMISSIILLKTALDAL 300

Query: 962  XXXSSVLKDAKSFLLGNVYNSVCENEKYASIRKRIGEEIDEDVLHARVPFVARTQQCFAV 1141
               + VLKDAK FLL NVY SVCEN++YASIRK+IGE ID+DVLHARVPFVARTQQCFA+
Sbjct: 301  PILAKVLKDAKCFLLANVYKSVCENDRYASIRKKIGEVIDDDVLHARVPFVARTQQCFAL 360

Query: 1142 KAGIDGMLDIARRTFCDTSEGVLYDCXXXXXXXXXXXXXXXXXTKYTAIHSLANKYREEY 1321
            KAGIDG LDIARRTFCDTSE                           AIH+LA+KYREE+
Sbjct: 361  KAGIDGFLDIARRTFCDTSE---------------------------AIHNLASKYREEF 393

Query: 1322 KLPNLKIPFNNRQGFYFSIPHKDVQGKLPSKFIQVVKHGNNIHCSTPELASLNVRNKSAA 1501
             LPNLK+PFNNRQGF+F IP K+VQGKLP+KF QVVKHG NIHCS+ ELASLNVRNKSAA
Sbjct: 394  NLPNLKLPFNNRQGFFFRIPQKEVQGKLPNKFTQVVKHGKNIHCSSLELASLNVRNKSAA 453

Query: 1502 GECYIRTEFCLEELMDAIRKEVSAXXXXXXXXXXXDMIVNSFANMISTKPVDRYTRPQFT 1681
            GEC+IRTE CLE LMDAIR+++SA           DMIVNSFA+ ISTKPVDRY+RP+ T
Sbjct: 454  GECFIRTETCLEALMDAIREDISALTLLAEVLCLLDMIVNSFAHTISTKPVDRYSRPELT 513

Query: 1682 DDGPLAIDAGRHPILESVHNEFI 1750
            D GPLAIDAGRHPILES+HN+F+
Sbjct: 514  DSGPLAIDAGRHPILESIHNDFV 536

>ref|XP_002870107.1| hypothetical protein ARALYDRAFT_493139 [Arabidopsis lyrata subsp. lyrata]
 gi|297315943|gb|EFH46366.1| hypothetical protein ARALYDRAFT_493139 [Arabidopsis lyrata subsp. lyrata]
          Length = 792

 Score = 773 bits (1996), Expect = 0.0
 Identities = 390/563 (69%), Positives = 437/563 (77%)
 Frame = +2

Query: 62   MEDDASEKSTFVVALIENRAKEVGVAAFDLRSASLHLSQYIETSRSYQNTKTLMHFYDPM 241
            MEDD  E+S+FV  LIENRAKEVG+AAFDLRSASLHLSQYIETS SYQNTKTL+ FYDP
Sbjct: 1    MEDDGGERSSFVAGLIENRAKEVGMAAFDLRSASLHLSQYIETSSSYQNTKTLLRFYDPS 60

Query: 242  VIIVPPNKLAPDSMVGVSEFVDRFYSTTRKVVMMRGCFDDTTGAVMVKNLAAKEPSALGL 421
            VIIVPPNKLA D MVGVSE VDR YST RKVV  RGCFDDT GAV+++NLAA+EP ALGL
Sbjct: 61   VIIVPPNKLAADGMVGVSELVDRCYSTVRKVVFARGCFDDTKGAVLIQNLAAEEPLALGL 120

Query: 422  DSYYKQYYLCLXXXXXTIKWIEAEKGIIITNHSLLVTFNGSYDHMNIDATSVHNLEIIEP 601
            D+YYKQ+YL L     TIKWIEAEKG+I+TNHSL +TFNGS+DHMNIDATSV NLEII+P
Sbjct: 121  DTYYKQHYLSLAAAAATIKWIEAEKGVIVTNHSLTITFNGSFDHMNIDATSVENLEIIDP 180

Query: 602  LQSTLWGTTNKKRSLFQXXXXXXXXXXXXXXXANLLQPLKDIETIKARLDCLDELMTNEQ 781
              + L GT+NKKRSLFQ               ANLLQPLKDIETI  RLDCLDELM+NEQ
Sbjct: 181  FHNALLGTSNKKRSLFQMFKTTKTVGGTRLLRANLLQPLKDIETINTRLDCLDELMSNEQ 240

Query: 782  LFFGLSQALRKFPKETDRVLCHFCFKPKKVTKEVLGIDNARXXXXXXXXXXXXXXXXXXX 961
            LFFGLSQ LRKFP+ETDRVLCHFCFKPKKVT+ V+G +N R
Sbjct: 241  LFFGLSQVLRKFPQETDRVLCHFCFKPKKVTEAVIGFENTRRSQNMISSIILLKTALDAL 300

Query: 962  XXXSSVLKDAKSFLLGNVYNSVCENEKYASIRKRIGEEIDEDVLHARVPFVARTQQCFAV 1141
               + VLKDAK FLL NVY SVCEN++YASIRK+IGE ID+DVLHARVPFVARTQQCFA+
Sbjct: 301  PLLAKVLKDAKCFLLANVYKSVCENDRYASIRKKIGEVIDDDVLHARVPFVARTQQCFAL 360

Query: 1142 KAGIDGMLDIARRTFCDTSEGVLYDCXXXXXXXXXXXXXXXXXTKYTAIHSLANKYREEY 1321
            KAGIDG LDIARRTFCDTSE                           AIH+LA+KYREE+
Sbjct: 361  KAGIDGFLDIARRTFCDTSE---------------------------AIHNLASKYREEF 393

Query: 1322 KLPNLKIPFNNRQGFYFSIPHKDVQGKLPSKFIQVVKHGNNIHCSTPELASLNVRNKSAA 1501
             LPNLK+PFNNRQGF+F IP K+VQGKLP+KF QVVKHG NIHCS+ ELASLNVRNKSAA
Sbjct: 394  NLPNLKLPFNNRQGFFFRIPQKEVQGKLPNKFTQVVKHGKNIHCSSLELASLNVRNKSAA 453

Query: 1502 GECYIRTEFCLEELMDAIRKEVSAXXXXXXXXXXXDMIVNSFANMISTKPVDRYTRPQFT 1681
            GEC+IRTE CLE LMDAIR+++SA           DMIVNSFA+ ISTKPVDRY+RP+ T
Sbjct: 454  GECFIRTETCLEALMDAIREDISALTLLAEVLCLLDMIVNSFAHTISTKPVDRYSRPELT 513

Query: 1682 DDGPLAIDAGRHPILESVHNEFI 1750
            D GPLAIDAGRHP+LES+HN+F+
Sbjct: 514  DSGPLAIDAGRHPLLESIHNDFV 536

>ref|XP_004169721.1| PREDICTED: DNA mismatch repair protein MSH4-like, partial [Cucumis sativus]
          Length = 770

 Score = 759 bits (1961), Expect = 0.0
 Identities = 384/541 (70%), Positives = 425/541 (78%)
 Frame = +2

Query: 128  VGVAAFDLRSASLHLSQYIETSRSYQNTKTLMHFYDPMVIIVPPNKLAPDSMVGVSEFVD 307
            VGVAAFDLRSASLHLSQYIETS SYQNTKTL+HFY+PMVI+V PNKLAPD MVGVS   D
Sbjct: 1    VGVAAFDLRSASLHLSQYIETSSSYQNTKTLLHFYEPMVILVSPNKLAPDGMVGVSVLAD 60

Query: 308  RFYSTTRKVVMMRGCFDDTTGAVMVKNLAAKEPSALGLDSYYKQYYLCLXXXXXTIKWIE 487
            RF++T +KVVM R CFDDT GAV++KNLAAKEPSALGL++YYKQYYLCL     +IKWIE
Sbjct: 61   RFFATVKKVVMARSCFDDTKGAVLIKNLAAKEPSALGLETYYKQYYLCLAAAAASIKWIE 120

Query: 488  AEKGIIITNHSLLVTFNGSYDHMNIDATSVHNLEIIEPLQSTLWGTTNKKRSLFQXXXXX 667
            AEKG+I+TNHSLLVTFNGS DH++IDATSV NLEIIEPL S LWGT+NKKRSL+
Sbjct: 121  AEKGVIVTNHSLLVTFNGSSDHVSIDATSVQNLEIIEPLHSNLWGTSNKKRSLYNMLKTT 180

Query: 668  XXXXXXXXXXANLLQPLKDIETIKARLDCLDELMTNEQLFFGLSQALRKFPKETDRVLCH 847
                      ANLLQPLKDIETI ARLDCLDELM+NEQLFFGLSQALRKFPKETDRVLCH
Sbjct: 181  KTIGGSRLLRANLLQPLKDIETINARLDCLDELMSNEQLFFGLSQALRKFPKETDRVLCH 240

Query: 848  FCFKPKKVTKEVLGIDNARXXXXXXXXXXXXXXXXXXXXXXSSVLKDAKSFLLGNVYNSV 1027
            FCFK KKVT EVL   +A+                      S VLK+AKSFLL N+Y S+
Sbjct: 241  FCFKQKKVTNEVLHPSDAKKSQNLISSIILLKTALEALPLLSKVLKEAKSFLLANIYKSI 300

Query: 1028 CENEKYASIRKRIGEEIDEDVLHARVPFVARTQQCFAVKAGIDGMLDIARRTFCDTSEGV 1207
            CENEKY +IRKRIGE IDEDVLHARVPF+ARTQQCFAVKAGIDG+LDIARRTFCDTSE
Sbjct: 301  CENEKYTNIRKRIGEVIDEDVLHARVPFIARTQQCFAVKAGIDGLLDIARRTFCDTSE-- 358

Query: 1208 LYDCXXXXXXXXXXXXXXXXXTKYTAIHSLANKYREEYKLPNLKIPFNNRQGFYFSIPHK 1387
                                     AIH+LANKYREEYKL NLK+PFNNRQGFY SIPHK
Sbjct: 359  -------------------------AIHNLANKYREEYKLSNLKLPFNNRQGFYLSIPHK 393

Query: 1388 DVQGKLPSKFIQVVKHGNNIHCSTPELASLNVRNKSAAGECYIRTEFCLEELMDAIRKEV 1567
            DVQGKLP+KFIQV+KHGNNI CST ELASLNVRNKSAAGECYIRTE CLE L+DAIR+ V
Sbjct: 394  DVQGKLPNKFIQVLKHGNNIRCSTLELASLNVRNKSAAGECYIRTEICLEGLVDAIREDV 453

Query: 1568 SAXXXXXXXXXXXDMIVNSFANMISTKPVDRYTRPQFTDDGPLAIDAGRHPILESVHNEF 1747
            S            DMIVNSFA+ ISTKPVDRYTRP FT++GP+AI+A RHPILES+HN+F
Sbjct: 454  SMLTLLAEVLCLLDMIVNSFAHTISTKPVDRYTRPNFTENGPMAIEAARHPILESIHNDF 513

Query: 1748 I 1750
            +
Sbjct: 514  V 514

>ref|XP_003538746.1| PREDICTED: mutS protein homolog 4-like [Glycine max]
          Length = 794

 Score = 759 bits (1961), Expect = 0.0
 Identities = 381/563 (67%), Positives = 431/563 (76%)
 Frame = +2

Query: 62   MEDDASEKSTFVVALIENRAKEVGVAAFDLRSASLHLSQYIETSRSYQNTKTLMHFYDPM 241
            MEDD  E S+FVV +IENRAKEVG+AAFDLRSASLHLSQYIETS SYQNTKTL+HFYDP+
Sbjct: 1    MEDDGGESSSFVVGIIENRAKEVGLAAFDLRSASLHLSQYIETSSSYQNTKTLLHFYDPI 60

Query: 242  VIIVPPNKLAPDSMVGVSEFVDRFYSTTRKVVMMRGCFDDTTGAVMVKNLAAKEPSALGL 421
            VII+PPNKLA +S  GV+E VDRFY + ++V++ RGCFDDT GA+++KNLAAKEPSALGL
Sbjct: 61   VIIIPPNKLASNSTAGVTELVDRFYGSVKQVMLARGCFDDTKGAILIKNLAAKEPSALGL 120

Query: 422  DSYYKQYYLCLXXXXXTIKWIEAEKGIIITNHSLLVTFNGSYDHMNIDATSVHNLEIIEP 601
            D+YYKQYYLCL     T+KW EAEKG+++TNHSL VTFNGS+DHMNID+TS+ NLEIIEP
Sbjct: 121  DTYYKQYYLCLAAAAATLKWTEAEKGVVVTNHSLSVTFNGSFDHMNIDSTSIQNLEIIEP 180

Query: 602  LQSTLWGTTNKKRSLFQXXXXXXXXXXXXXXXANLLQPLKDIETIKARLDCLDELMTNEQ 781
              STL GT+NKKRSLF                ANLLQPLKDIETI ARLDCLDELM+NEQ
Sbjct: 181  FHSTLLGTSNKKRSLFHMLKTTKTIGGSRLLRANLLQPLKDIETINARLDCLDELMSNEQ 240

Query: 782  LFFGLSQALRKFPKETDRVLCHFCFKPKKVTKEVLGIDNARXXXXXXXXXXXXXXXXXXX 961
            LFFGL Q LRKFPKETDRVLCHFCFK KKVT E L +D A+
Sbjct: 241  LFFGLCQILRKFPKETDRVLCHFCFKAKKVTAEALAVDRAKKSQVLVSSVILLKTALDAL 300

Query: 962  XXXSSVLKDAKSFLLGNVYNSVCENEKYASIRKRIGEEIDEDVLHARVPFVARTQQCFAV 1141
               S VLKD KS LL N+Y SVCENEKY  IRKRIGE IDEDVLHARVPFVA TQQCFAV
Sbjct: 301  PLLSKVLKDVKSSLLSNIYKSVCENEKYDLIRKRIGEVIDEDVLHARVPFVACTQQCFAV 360

Query: 1142 KAGIDGMLDIARRTFCDTSEGVLYDCXXXXXXXXXXXXXXXXXTKYTAIHSLANKYREEY 1321
            KAGIDG+LDI+RR FC+TSE                           AIH+LAN YRE++
Sbjct: 361  KAGIDGLLDISRRAFCETSE---------------------------AIHNLANNYREDF 393

Query: 1322 KLPNLKIPFNNRQGFYFSIPHKDVQGKLPSKFIQVVKHGNNIHCSTPELASLNVRNKSAA 1501
            KLPNLK+ + NRQGF+F IP K++QGKLPSKFIQVVKHGNNI CS+ ELASLN RNKSAA
Sbjct: 394  KLPNLKLTYKNRQGFHFVIPQKNIQGKLPSKFIQVVKHGNNIRCSSLELASLNARNKSAA 453

Query: 1502 GECYIRTEFCLEELMDAIRKEVSAXXXXXXXXXXXDMIVNSFANMISTKPVDRYTRPQFT 1681
             ECY RTE CLEELMD IR+ VS            DMIVNSFA+MISTKPVDRYTRP+FT
Sbjct: 454  AECYTRTEVCLEELMDDIRENVSVLTLLAEVLCLLDMIVNSFAHMISTKPVDRYTRPEFT 513

Query: 1682 DDGPLAIDAGRHPILESVHNEFI 1750
            ++GPLAID+GRHPILES+HN+F+
Sbjct: 514  ENGPLAIDSGRHPILESIHNDFV 536