BLASTX nr result
ID: Coptis24_contig00013865
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00013865
         (1993 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_004144463.1| PREDICTED: protein MOS2-like [Cucumis sativus]    370   e-99
ref|XP_004169661.1| PREDICTED: protein MOS2-like [Cucumis sativus]    369   2e-99
ref|XP_002522998.1| Protein MOS2, putative [Ricinus communis] gi...   366   1e-98
ref|NP_174617.1| protein MOS2 [Arabidopsis thaliana] gi|75169419...   361   5e-97
ref|XP_002304388.1| predicted protein [Populus trichocarpa] gi|2...   357   9e-96

>ref|XP_004144463.1| PREDICTED: protein MOS2-like [Cucumis sativus]
          Length = 500

 Score = 370 bits (949), Expect = e-99
 Identities = 216/504 (42%), Positives = 302/504 (59%), Gaps = 22/504 (4%)
 Frame = +2

Query: 104 TTMLSFSLSSKPNPNNKR----RPSSSTFSLQETD--PSSQQQQFVIEFDPSKPSILINN 265
T LSFSL SK + ++ + +PS F + D P + +Q+V EFD SKP
Sbjct: 22 TMKLSFSLPSKSSSSSSKPNLVKPSKE-FDDKTLDHGPLNDSKQYVNEFDASKPLSETTG 80

Query: 266 NTH--VIPRLENTWNPYKKMKNINTPLQNIQDPNLTFELETPALNMDTDPSMSYGINIRD 439
+ VIP L+N W P K+MKN+ PL + +L FE + +D D MSYG+N+R
Sbjct: 81 KSRNLVIPSLQNEWRPLKRMKNLEVPLDQSDESHLKFESASGLDPLD-DSKMSYGLNVRQ 139

Query: 440 K----NIEEEKRVGYNP-----VENVLLKKFKEDMKNLPDDRGIDEFEGVPVEGFGVAIL 592
I +E + G P +E ++L+KFK D++ LP+DRG ++FE VPVE F A++
Sbjct: 140 SVDGMKISDESKSGEEPPRPAPLEVIMLEKFKADLERLPEDRGFEDFEEVPVESFAAALM 199

Query: 593 KGYGWKEGQGIGKNAKEDVKMVQYVKRGNKEGLGFQPDKPTFDKKGRDLAPKGENGKTRH 772
GYGW++G+GIG+NAKEDVK+ +Y +R +K+GLGF D P K + G + +
Sbjct: 200 NGYGWRQGKGIGRNAKEDVKVREYSRRTDKQGLGFVSDVPVGISKKEEEKDGGRERERKR 259

Query: 773 VIGIDGKLVVRELKGI-HVGKVVRVVSGRHVGLKGXXXXXXXXXXXXXXXXNEE----VT 937
G + RE G+ +GK VR+V GR GLKG + +
Sbjct: 260 DEGRVKENRDRESDGLASIGKHVRIVRGRDAGLKGRVLEKLDSDWLVLKLSKRDEHVKLK 319

Query: 938 VGVQEVAELGTVDDELCVKGLQEEKMRRGGSRDDRRKNERSSYXXXXXXXXXXXXXXXXX 1117
V ++AELG+ ++E +K L+E K++ + RR+
Sbjct: 320 VRATDIAELGSKEEEKFLKKLEELKVKNENTGQKRRREVEQVVEKRE------------- 366

Query: 1118 XXXXVNGAGYGPSSKEQERSVAVSWLTSHIRVRIISKDYRRGKLYLKKGEVLDVVGPNTC 1297
NG S +++R+ +SWLTSHIRVRIISK+++ GK YLKKGE++DVVGP+ C
Sbjct: 367 -----NG-----SRDKEKRTGRLSWLTSHIRVRIISKEFKGGKFYLKKGEIVDVVGPSIC 416

Query: 1298 DISLDESKEIIQSVHQDLLETAVPKRGGPVLVLYGKHKGVFGYLVERNMEKETGVVQDAD 1477
DIS+D S+E++Q V Q+LLETA+P+RGGPVLVLYGKHKGV+G LVER+++KETGVV+DAD
Sbjct: 417 DISIDGSRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDAD 476

Query: 1478 SHELLNVHLEQIAEYLGDPDCIGY 1549
SHELLNV LEQIAEY+GDP +GY
Sbjct: 477 SHELLNVRLEQIAEYIGDPSYLGY 500

>ref|XP_004169661.1| PREDICTED: protein MOS2-like [Cucumis sativus]
          Length = 478

 Score = 369 bits (946), Expect = 2e-99
 Identities = 215/501 (42%), Positives = 301/501 (60%), Gaps = 22/501 (4%)
 Frame = +2

Query: 113 LSFSLSSKPNPNNKR----RPSSSTFSLQETD--PSSQQQQFVIEFDPSKPSILINNNTH 274
LSFSL SK + ++ + +PS F + D P + +Q+V EFD SKP +
Sbjct: 3 LSFSLPSKSSSSSSKPNLVKPSKE-FDDKTLDHGPLNDSKQYVNEFDASKPLSETTGKSR 61

Query: 275 --VIPRLENTWNPYKKMKNINTPLQNIQDPNLTFELETPALNMDTDPSMSYGINIRDK-- 442
VIP L+N W P K+MKN+ PL + +L FE + +D D MSYG+N+R
Sbjct: 62 NLVIPSLQNEWRPLKRMKNLEVPLDQSDESHLKFESASGLDPLD-DSKMSYGLNVRQSVD 120

Query: 443 --NIEEEKRVGYNP-----VENVLLKKFKEDMKNLPDDRGIDEFEGVPVEGFGVAILKGY 601
I +E + G P +E ++L+KFK D++ LP+DRG ++FE VPVE F A++ GY
Sbjct: 121 GMKISDESKSGEEPPRPAPLEVIMLEKFKADLERLPEDRGFEDFEEVPVESFAAALMNGY 180

Query: 602 GWKEGQGIGKNAKEDVKMVQYVKRGNKEGLGFQPDKPTFDKKGRDLAPKGENGKTRHVIG 781
GW++G+GIG+NAKEDVK+ +Y +R +K+GLGF D P K + G + + G
Sbjct: 181 GWRQGKGIGRNAKEDVKVREYSRRTDKQGLGFVSDVPVGISKKEEEKDGGRERERKRDEG 240

Query: 782 IDGKLVVRELKGI-HVGKVVRVVSGRHVGLKGXXXXXXXXXXXXXXXXNEE----VTVGV 946
+ RE G+ +GK VR+V GR GLKG + + V
Sbjct: 241 RVKENRDRESDGLASIGKHVRIVRGRDAGLKGRVLEKLDSDWLVLKLSKRDEHVKLKVRA 300

Query: 947 QEVAELGTVDDELCVKGLQEEKMRRGGSRDDRRKNERSSYXXXXXXXXXXXXXXXXXXXX 1126
++AELG+ ++E +K L+E K++ + RR+
Sbjct: 301 TDIAELGSKEEEKFLKKLEELKVKNENTGQKRRREVEQVVEKRE---------------- 344

Query: 1127 XVNGAGYGPSSKEQERSVAVSWLTSHIRVRIISKDYRRGKLYLKKGEVLDVVGPNTCDIS 1306
NG S +++R+ +SWLTSHIRVRIISK+++ GK YLKKGE++DVVGP+ CDIS
Sbjct: 345 --NG-----SRDKEKRTGRLSWLTSHIRVRIISKEFKGGKFYLKKGEIVDVVGPSICDIS 397

Query: 1307 LDESKEIIQSVHQDLLETAVPKRGGPVLVLYGKHKGVFGYLVERNMEKETGVVQDADSHE 1486
+D S+E++Q V Q+LLETA+P+RGGPVLVLYGKHKGV+G LVER+++KETGVV+DADSHE
Sbjct: 398 IDGSRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHE 457

Query: 1487 LLNVHLEQIAEYLGDPDCIGY 1549
LLNV LEQIAEY+GDP +GY
Sbjct: 458 LLNVRLEQIAEYIGDPSYLGY 478

>ref|XP_002522998.1| Protein MOS2, putative [Ricinus communis]
 gi|223537810|gb|EEF39428.1| Protein MOS2, putative [Ricinus communis]
          Length = 479

 Score = 366 bits (940), Expect = 1e-98
 Identities = 224/510 (43%), Positives = 302/510 (59%), Gaps = 31/510 (6%)
 Frame = +2

Query: 113 LSFSLSSKPNPNNKRRPSSSTFSLQETDPSSQQQQFVIEFDPSKPSILINNNTHVIPRLE 292
LSFS+ +K + + +P S ET + +QFV EFDPSK L N +IP E
Sbjct: 3 LSFSIPAKSSSKSTSKPKFSASVDAETQTNGTDKQFVTEFDPSKT--LTKQNRIIIPPKE 60

Query: 293 NTWNPYKKMKNINT-PLQNIQDPN-LTFELETPALNMDTDPSMSYGINIR----DKNIEE 454
N W P+KKMKN+ P DP+ L FE+ T A + D D SMSYG+N+R D +
Sbjct: 61 NEWRPHKKMKNLALLPSLQSSDPDALRFEIATDADDGD-DKSMSYGLNVRAAGEDDGGKS 119

Query: 455 EKRVGYNPVENVLLKKFKEDMKNLPDDRGIDEFEGVPVEGFGVAILKGYGWKEGQGIGKN 634
+++ EN++L+K + D++ LP+DRG DEF+ VPVEGFG A+L GYGW+EG+GIG+N
Sbjct: 120 QQQKKPESTENIMLEKLRYDLERLPEDRGFDEFKDVPVEGFGAALLAGYGWREGRGIGRN 179

Query: 635 AKEDVKMVQYVKRGNKEGLGFQPDKPTFDK-KGRDLAPKGENGKTR--HVIGIDG--KLV 799
AKEDVK+ QY KR +KEGLGF + + K RD N + +V ID K
Sbjct: 180 AKEDVKVKQYTKRTDKEGLGFVASVVSSNNVKNRDTVQNDFNSVSNINNVKHIDNGQKER 239

Query: 800 VRELKGIH------VGKVVRVVSGRH--VGLKGXXXXXXXXXXXXXXXX--NEEVTVGVQ 949
RE GI+ VGK VRV++G GLKG N+EV + V
Sbjct: 240 KRERDGINNGDGFFVGKDVRVIAGGREIYGLKGRILERLNADWVILKIAESNDEVKLRVS 299

Query: 950 EVAELGTVDDELCVKGLQEEKMRRGGSRDDRRKNERSSYXXXXXXXXXXXXXXXXXXXXX 1129
++A+LG+ +++ C++ L+ ++ +D++ +R
Sbjct: 300 DIADLGSKEEDKCLRKLKALQL------EDKKSKDRD----------------------- 330

Query: 1130 VNGAGYGPSSKEQERSVA----------VSWLTSHIRVRIISKDYRRGKLYLKKGEVLDV 1279
NG G SKE+ SV + WL HIRVR+ISKD + G+ YLKKGEV+DV
Sbjct: 331 -NGKGVTELSKERRESVRRDGGQVKDEKMRWLRDHIRVRVISKDLKGGRFYLKKGEVVDV 389

Query: 1280 VGPNTCDISLDESKEIIQSVHQDLLETAVPKRGGPVLVLYGKHKGVFGYLVERNMEKETG 1459
VGP CDIS+DE+KE++Q V QDLLETA+P+RGGPVLVLYGKHKG +G LVE+++++ETG
Sbjct: 390 VGPYVCDISMDETKELVQGVDQDLLETALPRRGGPVLVLYGKHKGAYGNLVEKDLDRETG 449

Query: 1460 VVQDADSHELLNVHLEQIAEYLGDPDCIGY 1549
VVQD D+ E LNV LEQIAEY+GDP IGY
Sbjct: 450 VVQDFDTREFLNVKLEQIAEYVGDPSYIGY 479

>ref|NP_174617.1| protein MOS2 [Arabidopsis thaliana]
 gi|75169419|sp|Q9C801.1|MOS2_ARATH RecName: Full=Protein MOS2
 gi|12322393|gb|AAG51225.1|AC051630_22 unknown protein; 82634-81246 [Arabidopsis thaliana]
 gi|20259490|gb|AAM13865.1| unknown protein [Arabidopsis thaliana]
 gi|29824125|gb|AAP04023.1| unknown protein [Arabidopsis thaliana]
 gi|77176696|gb|ABA64466.1| putative nucleic-acid binding protein [Arabidopsis thaliana]
 gi|332193481|gb|AEE31602.1| protein MOS2 [Arabidopsis thaliana]
          Length = 462

 Score = 361 bits (926), Expect = 5e-97
 Identities = 210/491 (42%), Positives = 303/491 (61%), Gaps = 12/491 (2%)
 Frame = +2

Query: 113 LSFSLSSKPNPNNKRRPSSSTFSLQETDPSSQQQQFVIEFDPSKPSILINNNTHVIPRLE 292
LSFSL SK P + +++T ++FV EFDPSK ++ + +VIP +E
Sbjct: 3 LSFSLPSKSKP----KVTATTADGNNAVDDGTSKEFVTEFDPSK-TLANSIPKYVIPPIE 57

Query: 293 NTWNPYKKMKNINTPLQNIQ-DPNLTFELETPALNMDTDPSMSYGINIRDK--------N 445
NTW P+KKMKN++ PLQ+ L FE E P + ++SYG+N+R K +
Sbjct: 58 NTWRPHKKMKNLDLPLQSGNAGSGLEFEPEVPLPGTEKPDNISYGLNLRQKVKDDSIGGD 117

Query: 446 IEEEKRVGYNPVENVLLKKFKEDMKNLPDDRGIDEFEGVPVEGFGVAILKGYGWKEGQGI 625
EE++V E ++L+ + D+ +L DD +++FE VPV+GFG A++ GYGWK G+GI
Sbjct: 118 AVEERKVSMG--EQLMLQSLRRDLMSLADDPTLEDFESVPVDGFGAALMAGYGWKPGKGI 175

Query: 626 GKNAKEDVKMVQYVKRGNKEGLGFQPDKP-TFDKKGRDLAPKGENGKTRHVIGIDGKLVV 802
GKNAKEDV++ +Y K KEGLGF PD+ D K + K + +GI+G V
Sbjct: 176 GKNAKEDVEIKEYKKWTAKEGLGFDPDRSKVVDVKAK---VKESVKLDKKGVGINGGDV- 231

Query: 803 RELKGIHVGKVVRVVSGRHVGLKGXXXXXXXXXXXXXXXXN--EEVTVGVQEVAELGTVD 976
VGK VR+++GR VGLKG EEV VGV EVA+LG+ +
Sbjct: 232 -----FFVGKEVRIIAGRDVGLKGKIVEKPGSDFFVIKISGSEEEVKVGVNEVADLGSKE 286

Query: 977 DELCVKGLQEEKMRRGGSRDDRRKNERSSYXXXXXXXXXXXXXXXXXXXXXVNGAGYGPS 1156
+E C+K L++ ++ +DR K++++S V +
Sbjct: 287 EEKCLKKLKDLQL------NDREKDKKTS---------GRGRGAERGSRSEVRASEKQDR 331

Query: 1157 SKEQERSVAVSWLTSHIRVRIISKDYRRGKLYLKKGEVLDVVGPNTCDISLDESKEIIQS 1336
+ +ER V SWL SHI+VRI+SKD++ G+LYLKKG+V+DVVGP TCDI++DE++E++Q
Sbjct: 332 GQTRERKVKPSWLRSHIKVRIVSKDWKGGRLYLKKGKVVDVVGPTTCDITMDETQELVQG 391

Query: 1337 VHQDLLETAVPKRGGPVLVLYGKHKGVFGYLVERNMEKETGVVQDADSHELLNVHLEQIA 1516
V Q+LLETA+P+RGGPVLVL GKHKGV+G LVE++++KETGVV+D D+H++L+V L+Q+A
Sbjct: 392 VDQELLETALPRRGGPVLVLSGKHKGVYGNLVEKDLDKETGVVRDLDNHKMLDVRLDQVA 451

Query: 1517 EYLGDPDCIGY 1549
EY+GD D I Y
Sbjct: 452 EYMGDMDDIEY 462

>ref|XP_002304388.1| predicted protein [Populus trichocarpa]
 gi|222841820|gb|EEE79367.1| predicted protein [Populus trichocarpa]
          Length = 436

 Score = 357 bits (915), Expect = 9e-96
 Identities = 222/491 (45%), Positives = 295/491 (60%), Gaps = 12/491 (2%)
 Frame = +2

Query: 113 LSFSLSSKPNPNNKRRPSSSTFSLQETDPSSQQQQFVIEFDPSKPSILINNNTHVIPRLE 292
LSFS+ SK +K +P S + + +Q++ EFDPSK + N T +I +
Sbjct: 4 LSFSIPSKSK--SKPKPVS------DQPDNDNSKQYLTEFDPSKNLLPQNTQTPIILPIP 55

Query: 293 NTWNPYKKMKNINTPL-QNIQDPNLTFELETPALNMDTDP-----SMSYGINIRDKNIEE 454
N + P+KKMKNI+ PL Q+ +L FE+ET + +DP S+S+G+N+R +
Sbjct: 56 NDYQPHKKMKNIHLPLHQDDSSTDLRFEVET----LSSDPAAASDSISFGLNLRQSATTQ 111

Query: 455 EKRVGYNPVENVLLKKFKEDMKNLPDDRGIDEFEGVPVEGFGVAILKGYGWKEGQGIGKN 634
+ E+VLL+K + D+K LP+DRG +EFE +PVE F A+LKGYGW EG+G+GKN
Sbjct: 112 TQDARS---EDVLLEKLRYDLKRLPEDRGFEEFEEMPVEDFAKALLKGYGWHEGRGVGKN 168

Query: 635 AKEDVKMVQYVKRGNKEGLGFQPDKPTFDKKGRDLAPKGENGKTRHVIGIDGKLVVRELK 814
+KEDV++ QY KR +KEGLGF + D K N K R R
Sbjct: 169 SKEDVQVKQYTKRTDKEGLGFLA--ASHDSK---------NKKQRE----------RSKD 207

Query: 815 GIHVGKVVRVVSGR--HVGLKGXXXXXXXXXXXXXXXXN--EEVTVGVQEVAELGTVDDE 982
G+ +GK VRV+SG+ ++GLKG E V V V +VAELG+ ++E
Sbjct: 208 GLFLGKEVRVISGKKENLGLKGTVVERLGSDSIALRVEKSGERVKVRVSDVAELGSREEE 267

Query: 983 LCVKGLQ--EEKMRRGGSRDDRRKNERSSYXXXXXXXXXXXXXXXXXXXXXVNGAGYGPS 1156
C+K L+ EEK G R+ RR N+R+ G G
Sbjct: 268 RCLKELKSIEEKKPSDGDREQRRVNKRNVESRDSLKM------------------GNGNV 309

Query: 1157 SKEQERSVAVSWLTSHIRVRIISKDYRRGKLYLKKGEVLDVVGPNTCDISLDESKEIIQS 1336
KE+ V WL SHIRVRIISKD + GKLYLKKGEV+DVVGP CDIS+DES+E++QS
Sbjct: 310 GKER----GVQWLRSHIRVRIISKDLKGGKLYLKKGEVVDVVGPYKCDISMDESRELVQS 365

Query: 1337 VHQDLLETAVPKRGGPVLVLYGKHKGVFGYLVERNMEKETGVVQDADSHELLNVHLEQIA 1516
V QD LETA+P+RGGPVLVLYGKHKG +G LV+R++++E GVVQD+ SHELL+V LEQIA
Sbjct: 366 VDQDALETALPRRGGPVLVLYGKHKGAYGNLVQRDIDREVGVVQDSGSHELLDVKLEQIA 425

Query: 1517 EYLGDPDCIGY 1549
EY+GDP IGY
Sbjct: 426 EYVGDPGYIGY 436