BLASTX nr result

ID: Coptis24_contig00006501 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00006501
         (1470 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI21002.3| unnamed protein product [Vitis vinifera]              208   3e-51
ref|XP_003525657.1| PREDICTED: uncharacterized protein LOC100809...   206   1e-50
ref|NP_194970.1| tudor-like RNA-binding protein [Arabidopsis tha...   191   3e-46
ref|XP_002869273.1| agenet domain-containing protein [Arabidopsi...   185   2e-44
ref|NP_974660.1| tudor-like RNA-binding protein [Arabidopsis tha...   185   3e-44

>emb|CBI21002.3| unnamed protein product [Vitis vinifera]
          Length = 386

 Score =  208 bits (529), Expect = 3e-51
 Identities = 145/386 (37%), Positives = 195/386 (50%), Gaps = 83/386 (21%)
 Frame = +1

Query: 193  MRFFKGDRVEVLTKRDVPSGSWRYGIILSGNGHTYYIRSESATGSSDTTGVPIVERVPRK 372
            M+F KG +VEVL K++VPSG+W    I+SGNGH Y +R +S  G ++   V   +RV RK
Sbjct: 1    MKFRKGSKVEVLNKKEVPSGAWHCAEIISGNGHNYSVRYDSYLGMTNKANV---DRVSRK 57

Query: 373  AIRPCP-PLKSIESWAAGDFVEVFDNSSWKFATVLKVTEFNLFLVKLLGTSQKVKANKSN 549
            AIRPCP P+K +ESW  GD VEVF++ SWK A VLKVT    +LV+LLG+  + + +KSN
Sbjct: 58   AIRPCPLPVKGMESWVTGDVVEVFNDGSWKCAMVLKVTCAVYYLVRLLGSCHEFEVHKSN 117

Query: 550  LRLPQSWQDGEWIVIQKVSADQPSTRRYLNKSL-------------DKDLKMRHVRDDCS 690
            +R+ Q+W D +W+VI + S      R  +NK L               D+K++    D  
Sbjct: 118  IRVRQAWIDDKWVVIGQGSITFEDVR--VNKLLTSNCHQKMSFQVPQADMKLKLQAGD-D 174

Query: 691  PVESESGLQIPRMVSSKILKRGSPGNLS-----RGTNQKLRAMEKEGR------------ 819
              +  +  Q   MVSSK LKR SP   S      G  QK+RA+E  GR            
Sbjct: 175  FFQKHADFQDSPMVSSKTLKRASPYCSSGLEAYSGNIQKMRAIENGGRQRVITGYSSPLR 234

Query: 820  ---------------------LQRRDTM-------------SFYAKDLEANHAERSACSV 897
                                    R T               F A+  E+N ++  +CSV
Sbjct: 235  AKVDAVAYPRLNLGEKYMHASFNDRSTKYCEMERGKPNGVDCFLARSSESNDSDSMSCSV 294

Query: 898  GSSGTSN--------------KEEMVD----GTHKLALDAYYRTMEALRVSGFISWEQEE 1023
            GS   +N              +++  D      H L L AY RT+ AL  SG +SWEQ  
Sbjct: 295  GSCSINNNGPNKFSSCILAHPRQDTDDLCTASIHSLELHAYRRTLAALHASGPLSWEQST 354

Query: 1024 MITNLRLKLNISNDEHLMELRKLVSA 1101
            M+TNLR  L+ISNDEHLMEL+ L+SA
Sbjct: 355  MLTNLRCSLHISNDEHLMELKNLISA 380


>ref|XP_003525657.1| PREDICTED: uncharacterized protein LOC100809539 [Glycine max]
          Length = 397

 Score =  206 bits (524), Expect = 1e-50
 Identities = 144/401 (35%), Positives = 196/401 (48%), Gaps = 97/401 (24%)
 Frame = +1

Query: 193  MRFFKGDRVEVLTKRDVPSGSWRYGIILSGNGHTYYIRSESATGSSDTTGVPIVERVPRK 372
            MRF KG++VEVL+K +VP GSW Y  I+ GNGH Y ++ +   G     G  IVE+V RK
Sbjct: 1    MRFKKGNKVEVLSKVEVPCGSWLYAEIICGNGHHYTVKYD---GYESDAGEAIVEQVSRK 57

Query: 373  AIRPCPP-LKSIESWAAGDFVEVFDNSSWKFATVLKVTEFNLFLVKLLGTSQKVKANKSN 549
             IRPCPP L+  ++W +GD VEVF N SWK ATVLKV   N  LV+LLG+S + + +K +
Sbjct: 58   DIRPCPPALELTDNWNSGDVVEVFQNFSWKMATVLKVFGKNHILVRLLGSSLEFQVSKFD 117

Query: 550  LRLPQSWQDGEWIVIQKVSADQPSTRR--------YLNKSLDKDLKMRHVRDDCSPVESE 705
            +R+ QSWQD +WI++ K S+   + +R        Y    L      R  +   S +ES 
Sbjct: 118  IRVRQSWQDDKWIIVGKGSSSHENRKRSSAQLQKMYTKTKLSGSAYYRPEKKKLSILES- 176

Query: 706  SGLQIPRMVSSKILKRGSPGNLSRGTN--QKLRAMEKEGR------------LQRRDTMS 843
                  ++VS K LKRGS   +        K RA E EGR            L++   +S
Sbjct: 177  ------KLVSFKTLKRGSNSQVDAYAKPPPKFRARENEGRCHRARLRNPPTPLKQVQGVS 230

Query: 844  F--------------------------------YAKDLEANHAERSACSVGSSGTSNK-- 921
            F                                  ++LE+NHA    CSVGS   +++  
Sbjct: 231  FPREVIAEECIPASVNNRKTGISNMVDMERRKQTGENLESNHAYSVTCSVGSCSITSRNS 290

Query: 922  ----------------------------------------EEMVDGTHKLALDAYYRTME 981
                                                    EE+    H+L L AY+ T+E
Sbjct: 291  YKSQFPVYAGPFDDVDSSFSDAESVCQRSDEEGNCSPPTQEELAAEIHRLELHAYHCTIE 350

Query: 982  ALRVSGFISWEQEEMITNLRLKLNISNDEHLMELRKLVSAQ 1104
            AL  SG +SWEQE ++TNLRL L+ISNDEHLMELR L+S++
Sbjct: 351  ALHASGPLSWEQEALMTNLRLSLHISNDEHLMELRNLISSE 391


>ref|NP_194970.1| tudor-like RNA-binding protein [Arabidopsis thaliana]
            gi|4049346|emb|CAA22571.1| putative protein [Arabidopsis
            thaliana] gi|7270148|emb|CAB79961.1| putative protein
            [Arabidopsis thaliana] gi|27765060|gb|AAO23651.1|
            At4g32440 [Arabidopsis thaliana]
            gi|110742940|dbj|BAE99365.1| hypothetical protein
            [Arabidopsis thaliana] gi|332660659|gb|AEE86059.1|
            tudor-like RNA-binding protein [Arabidopsis thaliana]
          Length = 377

 Score =  191 bits (486), Expect = 3e-46
 Identities = 129/382 (33%), Positives = 187/382 (48%), Gaps = 80/382 (20%)
 Frame = +1

Query: 193  MRFFKGDRVEVLTKRDVPSGSWRYGIILSGNGHTYYIRSESATGSSDTTGVPIVERVPRK 372
            MR  KG RVEV + ++ P G+WR   I+SGNGHTY +R  S     +     ++E+VPRK
Sbjct: 1    MRIRKGSRVEVFSNKEAPYGAWRCAEIVSGNGHTYNVRFYSFQIEHEEA---VMEKVPRK 57

Query: 373  AIRPCPPLKSIESWAAGDFVEVFDNSSWKFATVLKVTEFNLFLVKLLGTSQKVKANKSNL 552
             IRPCPPL  +E W  G+ VEV DN SWK ATV +    + ++V+LLGT +++  +K NL
Sbjct: 58   IIRPCPPLVDVERWDTGELVEVLDNFSWKAATVREELSGHYYVVRLLGTPEELTFHKVNL 117

Query: 553  RLPQSWQDGEWIVIQKVSADQPSTRRYLNKSLDKDLKMRHVRDDCSPVESESGLQIPRMV 732
            R  +SWQD  W+ I K+S    S+           L    V     P  +   L  P +V
Sbjct: 118  RARKSWQDERWVAIGKISGSLKSS----------TLTGSDVHQKLQPHRNSMPLHEPSVV 167

Query: 733  SSKILKRGSPGNLSR------GTNQKLRAMEKEGRLQRRDTMSF---------------- 846
            S+++LKR SP N S       G  +K+R++EKEG+ Q+ D +S                 
Sbjct: 168  SARLLKRPSPYNWSECAESCTGNPKKMRSLEKEGQQQKVDAISCRPENRGGKSHVQASLN 227

Query: 847  ----------------YAKDLEANH-AERSACSVGSSGTSNKEE------MVDGTHKLA- 954
                            +++ + A+  ++   CSVGS   ++ +E      M+DG+ + A 
Sbjct: 228  NHKTGYCQIVRVRSKGFSESVRADDCSDSDVCSVGSCSATSYDESNMPPCMLDGSTQQAD 287

Query: 955  ----------------------------------LDAYYRTMEALRVSGFISWEQEEMIT 1032
                                              L +Y  T+  L  SG +SWEQE  +T
Sbjct: 288  SCSSDAESSCGLGEEPRWKHSSVGDGARNSCRSELYSYRSTLGELFSSGPLSWEQEASLT 347

Query: 1033 NLRLKLNISNDEHLMELRKLVS 1098
            +LRL LNIS+DEHLME+R L+S
Sbjct: 348  DLRLSLNISDDEHLMEVRNLIS 369


>ref|XP_002869273.1| agenet domain-containing protein [Arabidopsis lyrata subsp. lyrata]
            gi|297315109|gb|EFH45532.1| agenet domain-containing
            protein [Arabidopsis lyrata subsp. lyrata]
          Length = 394

 Score =  185 bits (470), Expect = 2e-44
 Identities = 131/377 (34%), Positives = 181/377 (48%), Gaps = 81/377 (21%)
 Frame = +1

Query: 193  MRFFKGDRVEVLTKRDVPSGSWRYGIILSGNGHTYYIRSESATGSSDTTGVPIVERVPRK 372
            MRF KG RVEV + ++ P G+WR   I+SGNGHTY +R  S     +     ++ERVPRK
Sbjct: 1    MRFRKGSRVEVFSNKEAPYGAWRCAEIVSGNGHTYNVRFYSFQLEHEEA---VMERVPRK 57

Query: 373  AIRPCPPLKSIESWAAGDFVEVFDNSSWKFATVLKVTEFNLFLVKLLGTSQKVKANKSNL 552
             IRPCPPL  +E W  G+ VEV DN SWK ATV +    N ++V+LLGT  +   +K NL
Sbjct: 58   IIRPCPPLLDVEKWETGELVEVLDNFSWKAATVREELSGNYYVVRLLGTPAERTFHKVNL 117

Query: 553  RLPQSWQDGEWIVIQKVSADQPSTRRYLNKSLDKDLKMRHVRDDCSPVESESGLQIPRMV 732
            R  +SWQD +W+ I K+S    S+           L    V     P  +   L  P  V
Sbjct: 118  RARKSWQDEKWVAIGKISGSVKSS----------TLTGSDVYQKLQPHRNNIPLHEPSDV 167

Query: 733  SSKILKRGSPGNLSR------GTN-QKLRAMEKEGRLQRRDTMS---------FYAKDLE 864
            S+++LKR SP N S       G N +K+R++EKEG+ Q+ D ++          + +   
Sbjct: 168  SARMLKRPSPYNWSEFAESCTGNNPKKIRSLEKEGQQQKVDAIACRPEKRGGKSHVQASS 227

Query: 865  ANH------------------------AERSACSVGSSGTSNKEE------MVDGTHKLA 954
             NH                        ++  ACSVGS   ++ +E      M+DG+ + A
Sbjct: 228  NNHKTDYCQIVRVRSKGFSESVRADDSSDSDACSVGSCSATSYDESNMPPCMLDGSSQQA 287

Query: 955  -----------------------------------LDAYYRTMEALRVSGFISWEQEEMI 1029
                                               L +Y  T+  L  SG +SWEQE  +
Sbjct: 288  DSCSSDAESSCGLGEEPRRKHSSAGDGARRSCRSELYSYRSTLGELFSSGPLSWEQEASL 347

Query: 1030 TNLRLKLNISNDEHLME 1080
            T+LRL LNIS+DEHLME
Sbjct: 348  TDLRLSLNISDDEHLME 364


>ref|NP_974660.1| tudor-like RNA-binding protein [Arabidopsis thaliana]
            gi|110738311|dbj|BAF01084.1| hypothetical protein
            [Arabidopsis thaliana] gi|332660660|gb|AEE86060.1|
            tudor-like RNA-binding protein [Arabidopsis thaliana]
          Length = 393

 Score =  185 bits (469), Expect = 3e-44
 Identities = 126/376 (33%), Positives = 182/376 (48%), Gaps = 80/376 (21%)
 Frame = +1

Query: 193  MRFFKGDRVEVLTKRDVPSGSWRYGIILSGNGHTYYIRSESATGSSDTTGVPIVERVPRK 372
            MR  KG RVEV + ++ P G+WR   I+SGNGHTY +R  S     +     ++E+VPRK
Sbjct: 1    MRIRKGSRVEVFSNKEAPYGAWRCAEIVSGNGHTYNVRFYSFQIEHEEA---VMEKVPRK 57

Query: 373  AIRPCPPLKSIESWAAGDFVEVFDNSSWKFATVLKVTEFNLFLVKLLGTSQKVKANKSNL 552
             IRPCPPL  +E W  G+ VEV DN SWK ATV +    + ++V+LLGT +++  +K NL
Sbjct: 58   IIRPCPPLVDVERWDTGELVEVLDNFSWKAATVREELSGHYYVVRLLGTPEELTFHKVNL 117

Query: 553  RLPQSWQDGEWIVIQKVSADQPSTRRYLNKSLDKDLKMRHVRDDCSPVESESGLQIPRMV 732
            R  +SWQD  W+ I K+S    S+           L    V     P  +   L  P +V
Sbjct: 118  RARKSWQDERWVAIGKISGSLKSS----------TLTGSDVHQKLQPHRNSMPLHEPSVV 167

Query: 733  SSKILKRGSPGNLSR------GTNQKLRAMEKEGRLQRRDTMSF---------------- 846
            S+++LKR SP N S       G  +K+R++EKEG+ Q+ D +S                 
Sbjct: 168  SARLLKRPSPYNWSECAESCTGNPKKMRSLEKEGQQQKVDAISCRPENRGGKSHVQASLN 227

Query: 847  ----------------YAKDLEANH-AERSACSVGSSGTSNKEE------MVDGTHKLA- 954
                            +++ + A+  ++   CSVGS   ++ +E      M+DG+ + A 
Sbjct: 228  NHKTGYCQIVRVRSKGFSESVRADDCSDSDVCSVGSCSATSYDESNMPPCMLDGSTQQAD 287

Query: 955  ----------------------------------LDAYYRTMEALRVSGFISWEQEEMIT 1032
                                              L +Y  T+  L  SG +SWEQE  +T
Sbjct: 288  SCSSDAESSCGLGEEPRWKHSSVGDGARNSCRSELYSYRSTLGELFSSGPLSWEQEASLT 347

Query: 1033 NLRLKLNISNDEHLME 1080
            +LRL LNIS+DEHLME
Sbjct: 348  DLRLSLNISDDEHLME 363

