BLASTX nr result

ID: Coptis24_contig00013865 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00013865
         (1993 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004144463.1| PREDICTED: protein MOS2-like [Cucumis sativus]    370   e-99 
ref|XP_004169661.1| PREDICTED: protein MOS2-like [Cucumis sativus]    369   2e-99
ref|XP_002522998.1| Protein MOS2, putative [Ricinus communis] gi...   366   1e-98
ref|NP_174617.1| protein MOS2 [Arabidopsis thaliana] gi|75169419...   361   5e-97
ref|XP_002304388.1| predicted protein [Populus trichocarpa] gi|2...   357   9e-96

>ref|XP_004144463.1| PREDICTED: protein MOS2-like [Cucumis sativus]
          Length = 500

 Score =  370 bits (949), Expect = e-99
 Identities = 216/504 (42%), Positives = 302/504 (59%), Gaps = 22/504 (4%)
 Frame = +2

Query: 104  TTMLSFSLSSKPNPNNKR----RPSSSTFSLQETD--PSSQQQQFVIEFDPSKPSILINN 265
            T  LSFSL SK + ++ +    +PS   F  +  D  P +  +Q+V EFD SKP      
Sbjct: 22   TMKLSFSLPSKSSSSSSKPNLVKPSKE-FDDKTLDHGPLNDSKQYVNEFDASKPLSETTG 80

Query: 266  NTH--VIPRLENTWNPYKKMKNINTPLQNIQDPNLTFELETPALNMDTDPSMSYGINIRD 439
             +   VIP L+N W P K+MKN+  PL    + +L FE  +    +D D  MSYG+N+R 
Sbjct: 81   KSRNLVIPSLQNEWRPLKRMKNLEVPLDQSDESHLKFESASGLDPLD-DSKMSYGLNVRQ 139

Query: 440  K----NIEEEKRVGYNP-----VENVLLKKFKEDMKNLPDDRGIDEFEGVPVEGFGVAIL 592
                  I +E + G  P     +E ++L+KFK D++ LP+DRG ++FE VPVE F  A++
Sbjct: 140  SVDGMKISDESKSGEEPPRPAPLEVIMLEKFKADLERLPEDRGFEDFEEVPVESFAAALM 199

Query: 593  KGYGWKEGQGIGKNAKEDVKMVQYVKRGNKEGLGFQPDKPTFDKKGRDLAPKGENGKTRH 772
             GYGW++G+GIG+NAKEDVK+ +Y +R +K+GLGF  D P    K  +    G   + + 
Sbjct: 200  NGYGWRQGKGIGRNAKEDVKVREYSRRTDKQGLGFVSDVPVGISKKEEEKDGGRERERKR 259

Query: 773  VIGIDGKLVVRELKGI-HVGKVVRVVSGRHVGLKGXXXXXXXXXXXXXXXXNEE----VT 937
              G   +   RE  G+  +GK VR+V GR  GLKG                  +    + 
Sbjct: 260  DEGRVKENRDRESDGLASIGKHVRIVRGRDAGLKGRVLEKLDSDWLVLKLSKRDEHVKLK 319

Query: 938  VGVQEVAELGTVDDELCVKGLQEEKMRRGGSRDDRRKNERSSYXXXXXXXXXXXXXXXXX 1117
            V   ++AELG+ ++E  +K L+E K++   +   RR+                       
Sbjct: 320  VRATDIAELGSKEEEKFLKKLEELKVKNENTGQKRRREVEQVVEKRE------------- 366

Query: 1118 XXXXVNGAGYGPSSKEQERSVAVSWLTSHIRVRIISKDYRRGKLYLKKGEVLDVVGPNTC 1297
                 NG     S  +++R+  +SWLTSHIRVRIISK+++ GK YLKKGE++DVVGP+ C
Sbjct: 367  -----NG-----SRDKEKRTGRLSWLTSHIRVRIISKEFKGGKFYLKKGEIVDVVGPSIC 416

Query: 1298 DISLDESKEIIQSVHQDLLETAVPKRGGPVLVLYGKHKGVFGYLVERNMEKETGVVQDAD 1477
            DIS+D S+E++Q V Q+LLETA+P+RGGPVLVLYGKHKGV+G LVER+++KETGVV+DAD
Sbjct: 417  DISIDGSRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDAD 476

Query: 1478 SHELLNVHLEQIAEYLGDPDCIGY 1549
            SHELLNV LEQIAEY+GDP  +GY
Sbjct: 477  SHELLNVRLEQIAEYIGDPSYLGY 500


>ref|XP_004169661.1| PREDICTED: protein MOS2-like [Cucumis sativus]
          Length = 478

 Score =  369 bits (946), Expect = 2e-99
 Identities = 215/501 (42%), Positives = 301/501 (60%), Gaps = 22/501 (4%)
 Frame = +2

Query: 113  LSFSLSSKPNPNNKR----RPSSSTFSLQETD--PSSQQQQFVIEFDPSKPSILINNNTH 274
            LSFSL SK + ++ +    +PS   F  +  D  P +  +Q+V EFD SKP       + 
Sbjct: 3    LSFSLPSKSSSSSSKPNLVKPSKE-FDDKTLDHGPLNDSKQYVNEFDASKPLSETTGKSR 61

Query: 275  --VIPRLENTWNPYKKMKNINTPLQNIQDPNLTFELETPALNMDTDPSMSYGINIRDK-- 442
              VIP L+N W P K+MKN+  PL    + +L FE  +    +D D  MSYG+N+R    
Sbjct: 62   NLVIPSLQNEWRPLKRMKNLEVPLDQSDESHLKFESASGLDPLD-DSKMSYGLNVRQSVD 120

Query: 443  --NIEEEKRVGYNP-----VENVLLKKFKEDMKNLPDDRGIDEFEGVPVEGFGVAILKGY 601
               I +E + G  P     +E ++L+KFK D++ LP+DRG ++FE VPVE F  A++ GY
Sbjct: 121  GMKISDESKSGEEPPRPAPLEVIMLEKFKADLERLPEDRGFEDFEEVPVESFAAALMNGY 180

Query: 602  GWKEGQGIGKNAKEDVKMVQYVKRGNKEGLGFQPDKPTFDKKGRDLAPKGENGKTRHVIG 781
            GW++G+GIG+NAKEDVK+ +Y +R +K+GLGF  D P    K  +    G   + +   G
Sbjct: 181  GWRQGKGIGRNAKEDVKVREYSRRTDKQGLGFVSDVPVGISKKEEEKDGGRERERKRDEG 240

Query: 782  IDGKLVVRELKGI-HVGKVVRVVSGRHVGLKGXXXXXXXXXXXXXXXXNEE----VTVGV 946
               +   RE  G+  +GK VR+V GR  GLKG                  +    + V  
Sbjct: 241  RVKENRDRESDGLASIGKHVRIVRGRDAGLKGRVLEKLDSDWLVLKLSKRDEHVKLKVRA 300

Query: 947  QEVAELGTVDDELCVKGLQEEKMRRGGSRDDRRKNERSSYXXXXXXXXXXXXXXXXXXXX 1126
             ++AELG+ ++E  +K L+E K++   +   RR+                          
Sbjct: 301  TDIAELGSKEEEKFLKKLEELKVKNENTGQKRRREVEQVVEKRE---------------- 344

Query: 1127 XVNGAGYGPSSKEQERSVAVSWLTSHIRVRIISKDYRRGKLYLKKGEVLDVVGPNTCDIS 1306
              NG     S  +++R+  +SWLTSHIRVRIISK+++ GK YLKKGE++DVVGP+ CDIS
Sbjct: 345  --NG-----SRDKEKRTGRLSWLTSHIRVRIISKEFKGGKFYLKKGEIVDVVGPSICDIS 397

Query: 1307 LDESKEIIQSVHQDLLETAVPKRGGPVLVLYGKHKGVFGYLVERNMEKETGVVQDADSHE 1486
            +D S+E++Q V Q+LLETA+P+RGGPVLVLYGKHKGV+G LVER+++KETGVV+DADSHE
Sbjct: 398  IDGSRELVQGVSQELLETALPRRGGPVLVLYGKHKGVYGSLVERDLDKETGVVRDADSHE 457

Query: 1487 LLNVHLEQIAEYLGDPDCIGY 1549
            LLNV LEQIAEY+GDP  +GY
Sbjct: 458  LLNVRLEQIAEYIGDPSYLGY 478


>ref|XP_002522998.1| Protein MOS2, putative [Ricinus communis] gi|223537810|gb|EEF39428.1|
            Protein MOS2, putative [Ricinus communis]
          Length = 479

 Score =  366 bits (940), Expect = 1e-98
 Identities = 224/510 (43%), Positives = 302/510 (59%), Gaps = 31/510 (6%)
 Frame = +2

Query: 113  LSFSLSSKPNPNNKRRPSSSTFSLQETDPSSQQQQFVIEFDPSKPSILINNNTHVIPRLE 292
            LSFS+ +K +  +  +P  S     ET  +   +QFV EFDPSK   L   N  +IP  E
Sbjct: 3    LSFSIPAKSSSKSTSKPKFSASVDAETQTNGTDKQFVTEFDPSKT--LTKQNRIIIPPKE 60

Query: 293  NTWNPYKKMKNINT-PLQNIQDPN-LTFELETPALNMDTDPSMSYGINIR----DKNIEE 454
            N W P+KKMKN+   P     DP+ L FE+ T A + D D SMSYG+N+R    D   + 
Sbjct: 61   NEWRPHKKMKNLALLPSLQSSDPDALRFEIATDADDGD-DKSMSYGLNVRAAGEDDGGKS 119

Query: 455  EKRVGYNPVENVLLKKFKEDMKNLPDDRGIDEFEGVPVEGFGVAILKGYGWKEGQGIGKN 634
            +++      EN++L+K + D++ LP+DRG DEF+ VPVEGFG A+L GYGW+EG+GIG+N
Sbjct: 120  QQQKKPESTENIMLEKLRYDLERLPEDRGFDEFKDVPVEGFGAALLAGYGWREGRGIGRN 179

Query: 635  AKEDVKMVQYVKRGNKEGLGFQPDKPTFDK-KGRDLAPKGENGKTR--HVIGIDG--KLV 799
            AKEDVK+ QY KR +KEGLGF     + +  K RD      N  +   +V  ID   K  
Sbjct: 180  AKEDVKVKQYTKRTDKEGLGFVASVVSSNNVKNRDTVQNDFNSVSNINNVKHIDNGQKER 239

Query: 800  VRELKGIH------VGKVVRVVSGRH--VGLKGXXXXXXXXXXXXXXXX--NEEVTVGVQ 949
             RE  GI+      VGK VRV++G     GLKG                  N+EV + V 
Sbjct: 240  KRERDGINNGDGFFVGKDVRVIAGGREIYGLKGRILERLNADWVILKIAESNDEVKLRVS 299

Query: 950  EVAELGTVDDELCVKGLQEEKMRRGGSRDDRRKNERSSYXXXXXXXXXXXXXXXXXXXXX 1129
            ++A+LG+ +++ C++ L+  ++      +D++  +R                        
Sbjct: 300  DIADLGSKEEDKCLRKLKALQL------EDKKSKDRD----------------------- 330

Query: 1130 VNGAGYGPSSKEQERSVA----------VSWLTSHIRVRIISKDYRRGKLYLKKGEVLDV 1279
             NG G    SKE+  SV           + WL  HIRVR+ISKD + G+ YLKKGEV+DV
Sbjct: 331  -NGKGVTELSKERRESVRRDGGQVKDEKMRWLRDHIRVRVISKDLKGGRFYLKKGEVVDV 389

Query: 1280 VGPNTCDISLDESKEIIQSVHQDLLETAVPKRGGPVLVLYGKHKGVFGYLVERNMEKETG 1459
            VGP  CDIS+DE+KE++Q V QDLLETA+P+RGGPVLVLYGKHKG +G LVE+++++ETG
Sbjct: 390  VGPYVCDISMDETKELVQGVDQDLLETALPRRGGPVLVLYGKHKGAYGNLVEKDLDRETG 449

Query: 1460 VVQDADSHELLNVHLEQIAEYLGDPDCIGY 1549
            VVQD D+ E LNV LEQIAEY+GDP  IGY
Sbjct: 450  VVQDFDTREFLNVKLEQIAEYVGDPSYIGY 479


>ref|NP_174617.1| protein MOS2 [Arabidopsis thaliana]
            gi|75169419|sp|Q9C801.1|MOS2_ARATH RecName: Full=Protein
            MOS2 gi|12322393|gb|AAG51225.1|AC051630_22 unknown
            protein; 82634-81246 [Arabidopsis thaliana]
            gi|20259490|gb|AAM13865.1| unknown protein [Arabidopsis
            thaliana] gi|29824125|gb|AAP04023.1| unknown protein
            [Arabidopsis thaliana] gi|77176696|gb|ABA64466.1|
            putative nucleic-acid binding protein [Arabidopsis
            thaliana] gi|332193481|gb|AEE31602.1| protein MOS2
            [Arabidopsis thaliana]
          Length = 462

 Score =  361 bits (926), Expect = 5e-97
 Identities = 210/491 (42%), Positives = 303/491 (61%), Gaps = 12/491 (2%)
 Frame = +2

Query: 113  LSFSLSSKPNPNNKRRPSSSTFSLQETDPSSQQQQFVIEFDPSKPSILINNNTHVIPRLE 292
            LSFSL SK  P    + +++T            ++FV EFDPSK ++  +   +VIP +E
Sbjct: 3    LSFSLPSKSKP----KVTATTADGNNAVDDGTSKEFVTEFDPSK-TLANSIPKYVIPPIE 57

Query: 293  NTWNPYKKMKNINTPLQNIQ-DPNLTFELETPALNMDTDPSMSYGINIRDK--------N 445
            NTW P+KKMKN++ PLQ+      L FE E P    +   ++SYG+N+R K        +
Sbjct: 58   NTWRPHKKMKNLDLPLQSGNAGSGLEFEPEVPLPGTEKPDNISYGLNLRQKVKDDSIGGD 117

Query: 446  IEEEKRVGYNPVENVLLKKFKEDMKNLPDDRGIDEFEGVPVEGFGVAILKGYGWKEGQGI 625
              EE++V     E ++L+  + D+ +L DD  +++FE VPV+GFG A++ GYGWK G+GI
Sbjct: 118  AVEERKVSMG--EQLMLQSLRRDLMSLADDPTLEDFESVPVDGFGAALMAGYGWKPGKGI 175

Query: 626  GKNAKEDVKMVQYVKRGNKEGLGFQPDKP-TFDKKGRDLAPKGENGKTRHVIGIDGKLVV 802
            GKNAKEDV++ +Y K   KEGLGF PD+    D K +    K      +  +GI+G  V 
Sbjct: 176  GKNAKEDVEIKEYKKWTAKEGLGFDPDRSKVVDVKAK---VKESVKLDKKGVGINGGDV- 231

Query: 803  RELKGIHVGKVVRVVSGRHVGLKGXXXXXXXXXXXXXXXXN--EEVTVGVQEVAELGTVD 976
                   VGK VR+++GR VGLKG                   EEV VGV EVA+LG+ +
Sbjct: 232  -----FFVGKEVRIIAGRDVGLKGKIVEKPGSDFFVIKISGSEEEVKVGVNEVADLGSKE 286

Query: 977  DELCVKGLQEEKMRRGGSRDDRRKNERSSYXXXXXXXXXXXXXXXXXXXXXVNGAGYGPS 1156
            +E C+K L++ ++      +DR K++++S                      V  +     
Sbjct: 287  EEKCLKKLKDLQL------NDREKDKKTS---------GRGRGAERGSRSEVRASEKQDR 331

Query: 1157 SKEQERSVAVSWLTSHIRVRIISKDYRRGKLYLKKGEVLDVVGPNTCDISLDESKEIIQS 1336
             + +ER V  SWL SHI+VRI+SKD++ G+LYLKKG+V+DVVGP TCDI++DE++E++Q 
Sbjct: 332  GQTRERKVKPSWLRSHIKVRIVSKDWKGGRLYLKKGKVVDVVGPTTCDITMDETQELVQG 391

Query: 1337 VHQDLLETAVPKRGGPVLVLYGKHKGVFGYLVERNMEKETGVVQDADSHELLNVHLEQIA 1516
            V Q+LLETA+P+RGGPVLVL GKHKGV+G LVE++++KETGVV+D D+H++L+V L+Q+A
Sbjct: 392  VDQELLETALPRRGGPVLVLSGKHKGVYGNLVEKDLDKETGVVRDLDNHKMLDVRLDQVA 451

Query: 1517 EYLGDPDCIGY 1549
            EY+GD D I Y
Sbjct: 452  EYMGDMDDIEY 462


>ref|XP_002304388.1| predicted protein [Populus trichocarpa] gi|222841820|gb|EEE79367.1|
            predicted protein [Populus trichocarpa]
          Length = 436

 Score =  357 bits (915), Expect = 9e-96
 Identities = 222/491 (45%), Positives = 295/491 (60%), Gaps = 12/491 (2%)
 Frame = +2

Query: 113  LSFSLSSKPNPNNKRRPSSSTFSLQETDPSSQQQQFVIEFDPSKPSILINNNTHVIPRLE 292
            LSFS+ SK    +K +P S      +   +   +Q++ EFDPSK  +  N  T +I  + 
Sbjct: 4    LSFSIPSKSK--SKPKPVS------DQPDNDNSKQYLTEFDPSKNLLPQNTQTPIILPIP 55

Query: 293  NTWNPYKKMKNINTPL-QNIQDPNLTFELETPALNMDTDP-----SMSYGINIRDKNIEE 454
            N + P+KKMKNI+ PL Q+    +L FE+ET    + +DP     S+S+G+N+R     +
Sbjct: 56   NDYQPHKKMKNIHLPLHQDDSSTDLRFEVET----LSSDPAAASDSISFGLNLRQSATTQ 111

Query: 455  EKRVGYNPVENVLLKKFKEDMKNLPDDRGIDEFEGVPVEGFGVAILKGYGWKEGQGIGKN 634
             +       E+VLL+K + D+K LP+DRG +EFE +PVE F  A+LKGYGW EG+G+GKN
Sbjct: 112  TQDARS---EDVLLEKLRYDLKRLPEDRGFEEFEEMPVEDFAKALLKGYGWHEGRGVGKN 168

Query: 635  AKEDVKMVQYVKRGNKEGLGFQPDKPTFDKKGRDLAPKGENGKTRHVIGIDGKLVVRELK 814
            +KEDV++ QY KR +KEGLGF     + D K         N K R           R   
Sbjct: 169  SKEDVQVKQYTKRTDKEGLGFLA--ASHDSK---------NKKQRE----------RSKD 207

Query: 815  GIHVGKVVRVVSGR--HVGLKGXXXXXXXXXXXXXXXXN--EEVTVGVQEVAELGTVDDE 982
            G+ +GK VRV+SG+  ++GLKG                   E V V V +VAELG+ ++E
Sbjct: 208  GLFLGKEVRVISGKKENLGLKGTVVERLGSDSIALRVEKSGERVKVRVSDVAELGSREEE 267

Query: 983  LCVKGLQ--EEKMRRGGSRDDRRKNERSSYXXXXXXXXXXXXXXXXXXXXXVNGAGYGPS 1156
             C+K L+  EEK    G R+ RR N+R+                           G G  
Sbjct: 268  RCLKELKSIEEKKPSDGDREQRRVNKRNVESRDSLKM------------------GNGNV 309

Query: 1157 SKEQERSVAVSWLTSHIRVRIISKDYRRGKLYLKKGEVLDVVGPNTCDISLDESKEIIQS 1336
             KE+     V WL SHIRVRIISKD + GKLYLKKGEV+DVVGP  CDIS+DES+E++QS
Sbjct: 310  GKER----GVQWLRSHIRVRIISKDLKGGKLYLKKGEVVDVVGPYKCDISMDESRELVQS 365

Query: 1337 VHQDLLETAVPKRGGPVLVLYGKHKGVFGYLVERNMEKETGVVQDADSHELLNVHLEQIA 1516
            V QD LETA+P+RGGPVLVLYGKHKG +G LV+R++++E GVVQD+ SHELL+V LEQIA
Sbjct: 366  VDQDALETALPRRGGPVLVLYGKHKGAYGNLVQRDIDREVGVVQDSGSHELLDVKLEQIA 425

Query: 1517 EYLGDPDCIGY 1549
            EY+GDP  IGY
Sbjct: 426  EYVGDPGYIGY 436


Top