BLASTX nr result
ID: Coptis25_contig00014275
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00014275
         (948 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004169721.1| PREDICTED: DNA mismatch repair protein MSH4-...   504   e-140
ref|XP_003538746.1| PREDICTED: mutS protein homolog 4-like [Glyc...   481   e-133
ref|XP_002262998.2| PREDICTED: mutS protein homolog 4-like, part...   480   e-133
ref|NP_193469.2| DNA mismatch repair protein MSH4 [Arabidopsis t...   475   e-132
ref|XP_002870107.1| hypothetical protein ARALYDRAFT_493139 [Arab...   475   e-132

>ref|XP_004169721.1| PREDICTED: DNA mismatch repair protein MSH4-like, partial
           [Cucumis sativus]
          Length = 770

 Score = 504 bits (1298), Expect = e-140
 Identities = 250/316 (79%), Positives = 280/316 (88%)
 Frame = -1

Query: 948 LCLAAVAATIKWIEAEKGVIVTNHSLLVTFNGSFDHMNIDATSVQNLEIIEPLHSNLWGS 769
           LCLAA AA+IKWIEAEKGVIVTNHSLLVTFNGS DH++IDATSVQNLEIIEPLHSNLWG+
Sbjct: 107 LCLAAAAASIKWIEAEKGVIVTNHSLLVTFNGSSDHVSIDATSVQNLEIIEPLHSNLWGT 166

Query: 768 SNKKRSLFHMLKXXXXXXXXXXXRANLLQPLKDIETINARLDCLDELMSNEELFFGLTQV 589
           SNKKRSL++MLK           RANLLQPLKDIETINARLDCLDELMSNE+LFFGL+Q
Sbjct: 167 SNKKRSLYNMLKTTKTIGGSRLLRANLLQPLKDIETINARLDCLDELMSNEQLFFGLSQA 226

Query: 588 LRRFPKETDRVLCHFCFKPKKVTNEVLRVGDAKRSQVLISSIIXXXXXXXXXXXLSKVLK 409
           LR+FPKETDRVLCHFCFK KKVTNEVL   DAK+SQ LISSII           LSKVLK
Sbjct: 227 LRKFPKETDRVLCHFCFKQKKVTNEVLHPSDAKKSQNLISSIILLKTALEALPLLSKVLK 286

Query: 408 DAKSFLLGNIYKSICENEKYASIRKRIGEVIDDDVLHARIPFVARTQQCFAVKAGIDGLL 229
           +AKSFLL NIYKSICENEKY +IRKRIGEVID+DVLHAR+PF+ARTQQCFAVKAGIDGLL
Sbjct: 287 EAKSFLLANIYKSICENEKYTNIRKRIGEVIDEDVLHARVPFIARTQQCFAVKAGIDGLL 346

Query: 228 DVARRSFCDSSEAIHSLANTYREDFQLPNLKIPFNNKQGFYLSIPQKDIKGRLPSKFIQV 49
           D+ARR+FCD+SEAIH+LAN YRE+++L NLK+PFNN+QGFYLSIP KD++G+LP+KFIQV
Sbjct: 347 DIARRTFCDTSEAIHNLANKYREEYKLSNLKLPFNNRQGFYLSIPHKDVQGKLPNKFIQV 406

Query: 48  VKHGNNVHCSTLELAS 1
           +KHGNN+ CSTLELAS
Sbjct: 407 LKHGNNIRCSTLELAS 422

>ref|XP_003538746.1| PREDICTED: mutS protein homolog 4-like [Glycine max]
          Length = 794

 Score = 481 bits (1237), Expect = e-133
 Identities = 239/316 (75%), Positives = 269/316 (85%)
 Frame = -1

Query: 948 LCLAAVAATIKWIEAEKGVIVTNHSLLVTFNGSFDHMNIDATSVQNLEIIEPLHSNLWGS 769
           LCLAA AAT+KW EAEKGV+VTNHSL VTFNGSFDHMNID+TS+QNLEIIEP HS L G+
Sbjct: 129 LCLAAAAATLKWTEAEKGVVVTNHSLSVTFNGSFDHMNIDSTSIQNLEIIEPFHSTLLGT 188

Query: 768 SNKKRSLFHMLKXXXXXXXXXXXRANLLQPLKDIETINARLDCLDELMSNEELFFGLTQV 589
           SNKKRSLFHMLK           RANLLQPLKDIETINARLDCLDELMSNE+LFFGL Q+
Sbjct: 189 SNKKRSLFHMLKTTKTIGGSRLLRANLLQPLKDIETINARLDCLDELMSNEQLFFGLCQI 248

Query: 588 LRRFPKETDRVLCHFCFKPKKVTNEVLRVGDAKRSQVLISSIIXXXXXXXXXXXLSKVLK 409
           LR+FPKETDRVLCHFCFK KKVT E L V  AK+SQVL+SS+I           LSKVLK
Sbjct: 249 LRKFPKETDRVLCHFCFKAKKVTAEALAVDRAKKSQVLVSSVILLKTALDALPLLSKVLK 308

Query: 408 DAKSFLLGNIYKSICENEKYASIRKRIGEVIDDDVLHARIPFVARTQQCFAVKAGIDGLL 229
           D KS LL NIYKS+CENEKY  IRKRIGEVID+DVLHAR+PFVA TQQCFAVKAGIDGLL
Sbjct: 309 DVKSSLLSNIYKSVCENEKYDLIRKRIGEVIDEDVLHARVPFVACTQQCFAVKAGIDGLL 368

Query: 228 DVARRSFCDSSEAIHSLANTYREDFQLPNLKIPFNNKQGFYLSIPQKDIKGRLPSKFIQV 49
           D++RR+FC++SEAIH+LAN YREDF+LPNLK+ + N+QGF+  IPQK+I+G+LPSKFIQV
Sbjct: 369 DISRRAFCETSEAIHNLANNYREDFKLPNLKLTYKNRQGFHFVIPQKNIQGKLPSKFIQV 428

Query: 48  VKHGNNVHCSTLELAS 1
           VKHGNN+ CS+LELAS
Sbjct: 429 VKHGNNIRCSSLELAS 444

>ref|XP_002262998.2| PREDICTED: mutS protein homolog 4-like, partial [Vitis
           vinifera]
          Length = 456

 Score = 480 bits (1235), Expect = e-133
 Identities = 245/328 (74%), Positives = 276/328 (84%), Gaps = 12/328 (3%)
 Frame = -1

Query: 948 LCLAAVAATIKWI--------EAEKGVIVTNHS----LLVTFNGSFDHMNIDATSVQNLE 805
           LCLAA AATIK I        + +  ++    S    ++VTFNGSFDHMNIDATSVQNLE
Sbjct: 129 LCLAAAAATIKCIAFFFPKDVKLQYAILEDLSSCDKLMMVTFNGSFDHMNIDATSVQNLE 188

Query: 804 IIEPLHSNLWGSSNKKRSLFHMLKXXXXXXXXXXXRANLLQPLKDIETINARLDCLDELM 625
           IIEPLHS+LWG+SNKKRSLFHMLK           RANLLQPLKDIETINARLDCLDELM
Sbjct: 189 IIEPLHSSLWGTSNKKRSLFHMLKTTKTTGGTRLLRANLLQPLKDIETINARLDCLDELM 248

Query: 624 SNEELFFGLTQVLRRFPKETDRVLCHFCFKPKKVTNEVLRVGDAKRSQVLISSIIXXXXX 445
           SNE+LFFGL+QVLR+FPKETDRVLCHFCFKPKKVT EVL V  A+++Q+LISSII
Sbjct: 249 SNEQLFFGLSQVLRKFPKETDRVLCHFCFKPKKVTKEVLGVEYARKNQMLISSIILLKTA 308

Query: 444 XXXXXXLSKVLKDAKSFLLGNIYKSICENEKYASIRKRIGEVIDDDVLHARIPFVARTQQ 265
                 LSKVLKDAKSFLL N+YKS+C NE YASIRKRIGEVID+DVLHAR+PFVARTQQ
Sbjct: 309 LDALPLLSKVLKDAKSFLLANVYKSVCANETYASIRKRIGEVIDEDVLHARVPFVARTQQ 368

Query: 264 CFAVKAGIDGLLDVARRSFCDSSEAIHSLANTYREDFQLPNLKIPFNNKQGFYLSIPQKD 85
           CFAVKAGIDGLLD+ARRSFCD+SEAIH+LAN YREDF+LPNLK+PFNN+QGFY +IPQKD
Sbjct: 369 CFAVKAGIDGLLDIARRSFCDTSEAIHNLANKYREDFKLPNLKLPFNNRQGFYFTIPQKD 428

Query: 84  IKGRLPSKFIQVVKHGNNVHCSTLELAS 1
           I+G+LPSKFIQV++HGNN+HCSTLELAS
Sbjct: 429 IQGKLPSKFIQVLRHGNNIHCSTLELAS 456

>ref|NP_193469.2| DNA mismatch repair protein MSH4 [Arabidopsis thaliana]
 gi|395406788|sp|F4JP48.1|MSH4_ARATH RecName: Full=DNA mismatch repair
           protein MSH4; Short=AtMSH4; AltName: Full=MutS protein homolog 4
 gi|332658482|gb|AEE83882.1| DNA mismatch repair protein MSH4 [Arabidopsis
           thaliana]
          Length = 792

 Score = 475 bits (1222), Expect = e-132
 Identities = 232/316 (73%), Positives = 269/316 (85%)
 Frame = -1

Query: 948 LCLAAVAATIKWIEAEKGVIVTNHSLLVTFNGSFDHMNIDATSVQNLEIIEPLHSNLWGS 769
           L LAA AATIKWIEAEKGVIVTNHSL VTFNGSFDHMNIDATSV+NLE+I+P H+ L G+
Sbjct: 129 LSLAAAAATIKWIEAEKGVIVTNHSLTVTFNGSFDHMNIDATSVENLELIDPFHNALLGT 188

Query: 768 SNKKRSLFHMLKXXXXXXXXXXXRANLLQPLKDIETINARLDCLDELMSNEELFFGLTQV 589
           SNKKRSLF M K           RANLLQPLKDIETIN RLDCLDELMSNE+LFFGL+QV
Sbjct: 189 SNKKRSLFQMFKTTKTAGGTRLLRANLLQPLKDIETINTRLDCLDELMSNEQLFFGLSQV 248

Query: 588 LRRFPKETDRVLCHFCFKPKKVTNEVLRVGDAKRSQVLISSIIXXXXXXXXXXXLSKVLK 409
           LR+FPKETDRVLCHFCFKPKKVT  V+   + ++SQ +ISSII           L+KVLK
Sbjct: 249 LRKFPKETDRVLCHFCFKPKKVTEAVIGFENTRKSQNMISSIILLKTALDALPILAKVLK 308

Query: 408 DAKSFLLGNIYKSICENEKYASIRKRIGEVIDDDVLHARIPFVARTQQCFAVKAGIDGLL 229
           DAK FLL N+YKS+CEN++YASIRK+IGEVIDDDVLHAR+PFVARTQQCFA+KAGIDG L
Sbjct: 309 DAKCFLLANVYKSVCENDRYASIRKKIGEVIDDDVLHARVPFVARTQQCFALKAGIDGFL 368

Query: 228 DVARRSFCDSSEAIHSLANTYREDFQLPNLKIPFNNKQGFYLSIPQKDIKGRLPSKFIQV 49
           D+ARR+FCD+SEAIH+LA+ YRE+F LPNLK+PFNN+QGF+  IPQK+++G+LP+KF QV
Sbjct: 369 DIARRTFCDTSEAIHNLASKYREEFNLPNLKLPFNNRQGFFFRIPQKEVQGKLPNKFTQV 428

Query: 48  VKHGNNVHCSTLELAS 1
           VKHG N+HCS+LELAS
Sbjct: 429 VKHGKNIHCSSLELAS 444

>ref|XP_002870107.1| hypothetical protein ARALYDRAFT_493139 [Arabidopsis lyrata
           subsp. lyrata]
 gi|297315943|gb|EFH46366.1| hypothetical protein ARALYDRAFT_493139
           [Arabidopsis lyrata subsp. lyrata]
          Length = 792

 Score = 475 bits (1222), Expect = e-132
 Identities = 232/316 (73%), Positives = 269/316 (85%)
 Frame = -1

Query: 948 LCLAAVAATIKWIEAEKGVIVTNHSLLVTFNGSFDHMNIDATSVQNLEIIEPLHSNLWGS 769
           L LAA AATIKWIEAEKGVIVTNHSL +TFNGSFDHMNIDATSV+NLEII+P H+ L G+
Sbjct: 129 LSLAAAAATIKWIEAEKGVIVTNHSLTITFNGSFDHMNIDATSVENLEIIDPFHNALLGT 188

Query: 768 SNKKRSLFHMLKXXXXXXXXXXXRANLLQPLKDIETINARLDCLDELMSNEELFFGLTQV 589
           SNKKRSLF M K           RANLLQPLKDIETIN RLDCLDELMSNE+LFFGL+QV
Sbjct: 189 SNKKRSLFQMFKTTKTVGGTRLLRANLLQPLKDIETINTRLDCLDELMSNEQLFFGLSQV 248

Query: 588 LRRFPKETDRVLCHFCFKPKKVTNEVLRVGDAKRSQVLISSIIXXXXXXXXXXXLSKVLK 409
           LR+FP+ETDRVLCHFCFKPKKVT  V+   + +RSQ +ISSII           L+KVLK
Sbjct: 249 LRKFPQETDRVLCHFCFKPKKVTEAVIGFENTRRSQNMISSIILLKTALDALPLLAKVLK 308

Query: 408 DAKSFLLGNIYKSICENEKYASIRKRIGEVIDDDVLHARIPFVARTQQCFAVKAGIDGLL 229
           DAK FLL N+YKS+CEN++YASIRK+IGEVIDDDVLHAR+PFVARTQQCFA+KAGIDG L
Sbjct: 309 DAKCFLLANVYKSVCENDRYASIRKKIGEVIDDDVLHARVPFVARTQQCFALKAGIDGFL 368

Query: 228 DVARRSFCDSSEAIHSLANTYREDFQLPNLKIPFNNKQGFYLSIPQKDIKGRLPSKFIQV 49
           D+ARR+FCD+SEAIH+LA+ YRE+F LPNLK+PFNN+QGF+  IPQK+++G+LP+KF QV
Sbjct: 369 DIARRTFCDTSEAIHNLASKYREEFNLPNLKLPFNNRQGFFFRIPQKEVQGKLPNKFTQV 428

Query: 48  VKHGNNVHCSTLELAS 1
           VKHG N+HCS+LELAS
Sbjct: 429 VKHGKNIHCSSLELAS 444