BLASTX nr result

ID: Coptis21_contig00019205 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00019205
         (1821 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002331338.1| predicted protein [Populus trichocarpa] gi|2...   224   6e-56
gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]       213   2e-52
emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulga...   209   2e-51
ref|XP_002331746.1| predicted protein [Populus trichocarpa] gi|2...   202   3e-49
emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|72694...   196   2e-47

>ref|XP_002331338.1| predicted protein [Populus trichocarpa] gi|222873371|gb|EEF10502.1|
            predicted protein [Populus trichocarpa]
          Length = 819

 Score =  224 bits (571), Expect = 6e-56
 Identities = 132/407 (32%), Positives = 209/407 (51%), Gaps = 4/407 (0%)
 Frame = -1

Query: 1278 RGLNKAFKQVEVKDMIRRKELSVFGLIETKVKQGKMQEVLNGCCNGWMVEHNYEYSELGR 1099
            RGLN   K  E++ +I ++ +++FGL+ET+VK      V       W   +NY++S  GR
Sbjct: 386  RGLNDPIKHSELRRLIHQERIALFGLVETRVKDKNKDNVSQLLLRSWSFLYNYDFSCRGR 445

Query: 1098 IWCCWKPQNCNMNNIEKDEQSITMEAELI-NGKSFVITMVYGSNDRDKQRELWRRLIQVS 922
            IW CW      ++     +Q+I +   ++    SF  +++YG N+   +  LW  ++  S
Sbjct: 446  IWVCWNADTVKVDVFGMSDQAIHVSVTILATNISFNTSIIYGDNNASLREALWSDIVSRS 505

Query: 921  CTVQKP-WLVIGDFNSVLNGQERVGGTETRPHHFQDLLHCVTMAGLVDLKYSGNFLTWSN 745
               +   W++IGDFN++ N  +R+GG+ T       L  C+  A + DL+YSG   TWSN
Sbjct: 506  DGWESTLWILIGDFNAIRNQSDRLGGSTTWAGTMDRLDTCIREAKVDDLRYSGMHYTWSN 565

Query: 744  KQTIRL-WRKLDRVMVNAEWINEFGDTEATFKNPGATSDHSSMAVETLRTKGQCKKSFRL 568
            +    L  RKLDRV+VN +W  +F  +EA F  P   SDHS M V+ +      KK FR 
Sbjct: 566  QCPENLIMRKLDRVLVNEKWNLKFPLSEARFL-PSGMSDHSPMVVKVIGNDQNKKKPFRF 624

Query: 567  FNYWIDEEGFFEVVQDAWGVAVVGNPMYTFTKKLKHVKNALISWNREKGQACTRRV-EAE 391
            F+ W+D + F  +V+  W     G PMY    KL+ +K  L  +N       + RV +A+
Sbjct: 625  FDMWMDHDEFMPLVKKVWDQNSRGCPMYQLCCKLRKLKQELKLFNMAHFSNISDRVRDAK 684

Query: 390  RELEGVXXXXXXXXXXXXXXXQERVITRKYVKAAIEEENECKQKSRDQVVTLGDSNTKYF 211
             +++                 +ER +  KY      EE+  KQK+R Q ++LGD NT YF
Sbjct: 685  NKMDKAQQALHTAHENPILCMRERDVVHKYASTVRAEESFFKQKARIQWLSLGDQNTSYF 744

Query: 210  FNSVKARRMTNKITCLRDGNGKILEEIESIGEECVKFYADLYSPDQM 70
              SV  R+  NK+  L   +G+++E  E++  E + ++  +   DQM
Sbjct: 745  HKSVNGRQNRNKLLSLTREDGEVVERQEAVKSEVISYFHRVLGVDQM 791


>gb|AAG50806.1|AC079281_8 unknown protein [Arabidopsis thaliana]
          Length = 1213

 Score =  213 bits (541), Expect = 2e-52
 Identities = 138/429 (32%), Positives = 208/429 (48%), Gaps = 11/429 (2%)
 Frame = -1

Query: 1299 NVGCWNVRGLNKAFKQVEVKDMIRRKELSVFGLIETKVKQGKMQEVLNGCCNGWMVEHNY 1120
            N+ CWN+RG N    +   K  ++  +    G+IET VKQ K ++ +N    GW    NY
Sbjct: 4    NLFCWNIRGFNNVSHRSGFKKWVKANKPIFGGVIETHVKQPKDRKFINALLPGWSFVENY 63

Query: 1119 EYSELGRIWCCWKPQNCNMNNIEKDEQSITMEAELINGKSFVI-TMVYGSNDRDKQRELW 943
             +S+LG+IW  W P +  +  + K  Q IT E  L    S++I ++VY +N+   ++ELW
Sbjct: 64   AFSDLGKIWVMWDP-SVQVVVVAKSLQMITCEVLLPGSPSWIIVSVVYAANEVASRKELW 122

Query: 942  RRLIQV---SCTVQKPWLVIGDFNSVLNGQERVGGTETRPH-HFQDLLHCVTMAGLVDLK 775
              ++ +        +PWLV+GDFN VLN QE           + +D   C+  A L DL+
Sbjct: 123  IEIVNMVVSGIIGDRPWLVLGDFNQVLNPQEHSNPVSLNVDINMRDFRDCLLAAELSDLR 182

Query: 774  YSGNFLTWSNK-QTIRLWRKLDRVMVNAEWINEFGDTEATFKNPGATSDHSSMAVETLRT 598
            Y GN  TW NK  T  + +K+DR++VN  W   F  +   F +    SDH S  V    T
Sbjct: 183  YKGNTFTWWNKSHTTPVAKKIDRILVNDSWNALFPSSLGIFGSLD-FSDHVSCGVVLEET 241

Query: 597  KGQCKKSFRLFNYWIDEEGFFEVVQDAW-GVAVVGNPMYTFTKKLKHVKNALISWNREKG 421
              + K+ F+ FNY +    F  +V+D W  + VVG+ M+  +KKLK +K  +  ++R   
Sbjct: 242  SIKAKRPFKFFNYLLKNLDFLNLVRDNWFTLNVVGSSMFRVSKKLKALKKPIKDFSRLNY 301

Query: 420  QACTRRVEAERELEGVXXXXXXXXXXXXXXXQERVITRKYVKAAIEEENECKQKSRDQVV 241
                +R +   +                    E    RK+      EE+  +QKSR    
Sbjct: 302  SELEKRTKEAHDFLIGCQDRTLADPTPINASFELEAERKWHILTAAEESFFRQKSRISWF 361

Query: 240  TLGDSNTKYFFNSVKARRMTNKITCLRDGNGKILEEIESIGEECVKFYADL----YSPDQ 73
              GD NTKYF     AR  +N I+ L DGNGK+++  E I + C  ++  L      P  
Sbjct: 362  AEGDGNTKYFHRMADARNSSNSISALYDGNGKLVDSQEGILDLCASYFGSLLGDEVDPYL 421

Query: 72   MEDMDFSML 46
            ME  D ++L
Sbjct: 422  MEQNDMNLL 430


>emb|CCA65981.1| hypothetical protein [Beta vulgaris subsp. vulgaris]
          Length = 1114

 Score =  209 bits (532), Expect = 2e-51
 Identities = 133/424 (31%), Positives = 209/424 (49%), Gaps = 5/424 (1%)
 Frame = -1

Query: 1302 INVGCWNVRGLNKAFKQVEVKDMIRRKELSVFGLIETKVKQGKMQEVLNGCCNGWMVEHN 1123
            + +  WNVRGLN   K  EVK  +  +++S+  L ET+V+Q    ++     N W   +N
Sbjct: 1    MKITTWNVRGLNDPIKVKEVKHFLHSQKISLCSLFETRVRQQNSGKIQKKFGNRWSWINN 60

Query: 1122 YEYSELGRIWCCWKPQNCNMNNIEKDEQSITMEAELING-KSFVITMVYGSNDRDKQREL 946
            Y  S  GRIW  W   + N+N +   EQ ITME +   G   F +  VYG +    ++ L
Sbjct: 61   YACSPRGRIWVGWLNNDVNINVLSVTEQVITMEVKNSYGLNMFKMAAVYGLHTIADRKVL 120

Query: 945  WRRLIQVSCTVQKPWLVIGDFNSVLNGQERVGGTETRPHHFQDLLHCVTMAGLVDLKYSG 766
            W  L        +P ++IGD+N+V + Q+R+ G +       DL   V  A L++   +G
Sbjct: 121  WEELYNFVSVCHEPCILIGDYNAVYSAQDRLNGNDVSEAETSDLRSFVLKAQLLEAPTTG 180

Query: 765  NFLTWSNKQ--TIRLWRKLDRVMVNAEWINEFGDTEATFKNPGATSDHSSMAVETLRTKG 592
             F +W+NK     R+  ++D+  VN  WIN++ D    ++  G  SDHS +         
Sbjct: 181  LFYSWNNKSIGADRISSRIDKSFVNVAWINQYPDVVVEYREAG-ISDHSPLIFNLATQHD 239

Query: 591  QCKKSFRLFNYWIDEEGFFEVVQDAWGVAVVGNPMYTFTKKLKHVKNALISWNREKGQAC 412
            +  + F+  N+  D+ GF EVV++AWG A     M     +L+ VK AL S++ +K    
Sbjct: 240  EGGRPFKFLNFLADQNGFVEVVKEAWGSANHRFKMKNIWVRLQAVKRALKSFHSKKFSKA 299

Query: 411  TRRVEAERELEGVXXXXXXXXXXXXXXXQERVITRKYVKAAIEEENECKQKSRDQVVTLG 232
              +VE  R                    +E+ +  +  K +  +E+  KQKSR Q ++LG
Sbjct: 300  HCQVEELRRKLAAVQALPEVSQVSELQEEEKDLIAQLRKWSTIDESILKQKSRIQWLSLG 359

Query: 231  DSNTKYFFNSVKARRMTNKITCLRDGNGKILEEIESIGEECVKFYADLY--SPDQMEDMD 58
            DSN+K+FF ++K R+  NKI  L++  G  L E   I  E   FY  L   S  Q+E +D
Sbjct: 360  DSNSKFFFTAIKVRKARNKIVLLQNDRGDQLTENTEIQNEICNFYRRLLGTSSSQLEAID 419

Query: 57   FSML 46
              ++
Sbjct: 420  LHVV 423


>ref|XP_002331746.1| predicted protein [Populus trichocarpa] gi|222874272|gb|EEF11403.1|
            predicted protein [Populus trichocarpa]
          Length = 503

 Score =  202 bits (513), Expect = 3e-49
 Identities = 125/397 (31%), Positives = 194/397 (48%), Gaps = 4/397 (1%)
 Frame = -1

Query: 1248 EVKDMIRRKELSVFGLIETKVKQGKMQEVLNGCCNGWMVEHNYEYSELGRIWCCWKPQNC 1069
            E  ++I ++ +++FGL+ET+VK      V       W   +NY++S  GRIW CW     
Sbjct: 3    EFVNLIHQERIALFGLVETRVKDKNKDNVSQLLLRSWSFLYNYDFSCRGRIWVCWNADTV 62

Query: 1068 NMNNIEKDEQSITMEAELI-NGKSFVITMVYGSNDRDKQRELWRRLIQVSCTVQK-PWLV 895
             +N     +Q+I +   ++    SF  +++YG N+   +  LW  ++  S   +  PW++
Sbjct: 63   KVNVFGMSDQAIHVSVTILATNISFNTSIIYGDNNASLREALWSDIVSRSDGWESTPWIL 122

Query: 894  IGDFNSVLNGQERVGGTETRPHHFQDLLHCVTMAGLVDLKYSGNFLTWSNKQTIRL-WRK 718
            +GDFN++ N   R+GG+ T       L  C+  A + DL+YSG   TWSN+    L  RK
Sbjct: 123  MGDFNAIRNQSHRLGGSTTWAGTMDRLDTCIREAKVDDLRYSGMHYTWSNQCPENLIMRK 182

Query: 717  LDRVMVNAEWINEFGDTEATFKNPGATSDHSSMAVETLRTKGQCKKSFRLFNYWIDEEGF 538
            LDRV+VN +W   F  +E  F  P   SDHS M V+ +      KK FR F+ W+D+   
Sbjct: 183  LDRVLVNEKWNLNFPLSEVRFL-PSGISDHSPMVVKVIGNDQNIKKPFRFFDMWMDQNSG 241

Query: 537  FEVVQDAWGVAVVGNPMYTFTKKLKHVKNALISWNREKGQACTRRV-EAERELEGVXXXX 361
                         G PMY     LK +K  L  +N       + RV +A+ E++      
Sbjct: 242  -------------GCPMYQLCCNLKKLKQELKLFNMAHFSNISDRVKDAKNEMDKAQQAL 288

Query: 360  XXXXXXXXXXXQERVITRKYVKAAIEEENECKQKSRDQVVTLGDSNTKYFFNSVKARRMT 181
                       +ER +  KY      EE+  KQK+R Q ++LGD NT YF  SV  R   
Sbjct: 289  HTAHENPILCMRERDVVHKYASTVRAEESFFKQKARIQWLSLGDQNTSYFHKSVNGRHNR 348

Query: 180  NKITCLRDGNGKILEEIESIGEECVKFYADLYSPDQM 70
            NK+  L   +G+++E  E++  E + ++  +   DQM
Sbjct: 349  NKLLSLTREDGEVVEGHEAVKSEVIAYFHRVLGVDQM 385


>emb|CAA18234.1| putative protein [Arabidopsis thaliana] gi|7269488|emb|CAB79491.1|
            putative protein [Arabidopsis thaliana]
          Length = 1141

 Score =  196 bits (497), Expect = 2e-47
 Identities = 126/398 (31%), Positives = 194/398 (48%), Gaps = 11/398 (2%)
 Frame = -1

Query: 1206 GLIETKVKQGKMQEVLNGCCNGWMVEHNYEYSELGRIWCCWKPQNCNMNNIEKDEQSITM 1027
            G+IE  VKQ K ++ +N    GW  + NY +S+LG+IW  W P +  +  + K  Q IT 
Sbjct: 27   GVIEKHVKQPKDKKFINALLPGWFFDENYGFSDLGKIWVLWDP-SVEVVIVAKSLQMITC 85

Query: 1026 EAELINGKSF-VITMVYGSNDRDKQRELWRR---LIQVSCTVQKPWLVIGDFNSVLNGQE 859
            E    N +++ VI++VY +N+ DK++ELWR    L+    T  +PW+++GDFN VL+  E
Sbjct: 86   EVLFPNSRTWIVISVVYAANEDDKRKELWREITALVASPVTFNRPWILLGDFNQVLHPHE 145

Query: 858  RVGGTETR-PHHFQDLLHCVTMAGLVDLKYSGNFLTWSNKQTIR-LWRKLDRVMVNAEWI 685
                         +D   C+  A L DL Y G+  TW NK   R + +K+DR++VN  W 
Sbjct: 146  HSRHVSLNVDRRIRDFRECLLDAELSDLVYKGSSFTWWNKSKTRPVAKKIDRILVNESWS 205

Query: 684  NEFGDTEATFKNPGATSDHSSMAVETLRTKGQCKKSFRLFNYWIDEEGFFEVVQDAW-GV 508
            N F  +   F  P   SDH+S  V       + K+ F+ FN+ +    F  +V D W   
Sbjct: 206  NLFPSSFGLF-GPPDFSDHASCGVVLELDPIKAKRPFKFFNFLLKNPEFLNLVWDVWYST 264

Query: 507  AVVGNPMYTFTKKLKHVKNALISWNREKGQACTRRVEAERELEGVXXXXXXXXXXXXXXX 328
             VVG+ M+  +KKLK +K  +  ++R       +R E   E                   
Sbjct: 265  NVVGSSMFRVSKKLKALKKPIKDFSRLNYSNLEKRTEEAHETLLSFQNLTLDNPSLENAA 324

Query: 327  QERVITRKYVKAAIEEENECKQKSRDQVVTLGDSNTKYFFNSVKARRMTNKITCLRDGNG 148
             E    RK+   A  EE+  +Q+SR      GD NT+YF     +R+  N IT L D +G
Sbjct: 325  HELEAQRKWQILATAEESFFRQRSRVTWFAEGDGNTRYFHRMADSRKSVNTITTLVDDSG 384

Query: 147  KILEEIESIGEECVKFYADLYSPD----QMEDMDFSML 46
              ++  + I + C  ++ +L S D     +E  D ++L
Sbjct: 385  TQIDSQQGIADHCALYFENLLSDDNDPYSLEQDDMNLL 422


Top