BLASTX nr result

ID: Coptis23_contig00020468 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00020468
         (2061 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus c...   168   6e-39
ref|XP_003525991.1| PREDICTED: uncharacterized protein LOC100803...   130   1e-27
ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] ...   128   5e-27
ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arab...   124   7e-26
ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205...   102   4e-19

>ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus communis]
            gi|223535579|gb|EEF37247.1| hypothetical protein
            RCOM_0553590 [Ricinus communis]
          Length = 490

 Score =  168 bits (425), Expect = 6e-39
 Identities = 152/509 (29%), Positives = 239/509 (46%), Gaps = 44/509 (8%)
 Frame = +3

Query: 327  LKVKGISWVGNVYQKFEAMCLEVEDAVCQETAKYVESQVQTVGSSVKKFYAEVMQDLLPP 506
            + +KGISWVGN+YQKFEAMCLEVE+ + Q+T KYVE+QVQTVGSSVK+FY++VMQDLLPP
Sbjct: 1    MDLKGISWVGNIYQKFEAMCLEVEEVMYQDTVKYVENQVQTVGSSVKRFYSDVMQDLLPP 60

Query: 507  SSLDPKEIEGT----EQNAAFGTCEKQKLSTEGLEKDNSPYNDGSEIHVPAVKGLCEKQK 674
            SS+D  +  G     E  A  G   K K+      K+     D  E      K   +K+ 
Sbjct: 61   SSVDAAKGAGVDVPLELYADLGIYMKPKVGV----KEKQGKVDDRERLTEDPKITTDKKS 116

Query: 675  EEADVCKRPNAGTKKYPINERLLPVDMSEVIISGKSSSQASLRSRVHSTSQLLPSLSVDP 854
             +     R      ++P+++       S        S++++  +R +S  +   ++SVD 
Sbjct: 117  MDPLTFHRLGLVENRFPLSQGNSAGGASRQHGKRSLSNKSNPYTRKNSNRE---NMSVDK 173

Query: 855  VVAAGSRL--------FLE---QNCNDE------------VCRNSTVPIDKGPLKANLSL 965
             + A S L        F E   +N  D             + +++++  +    + N+ L
Sbjct: 174  KLEAISCLDKGLIRASFSERSNENLGDSGGGAPKQYGDSCLPKDTSLGTNGNSERQNIFL 233

Query: 966  TEVSEIVDPAGEGTCQVSSFGCVRKEDNAKPC-DKLMKMTS--SIDFTICNSPEKIRPLC 1136
             E + +V P      + SS  C    +N K C D+  K+T+  S++ T  +S ++ +   
Sbjct: 234  HEKARVVIPLYNDLTRASSI-CELSNENHKDCVDQQAKITTPGSVEMTGHDSVDESKYEI 292

Query: 1137 SNRMVESGHXXXXXXXXXXXXXVLPVASRERKIVESGLTTFSGIPTEANGLD-------- 1292
             N   +                V    S   K ++   ++   +  EA+  D        
Sbjct: 293  ENASEQ---------IPDIPDMVNSTESGASKGMDMTCSSHGSLSAEAHAADDCMSHGAD 343

Query: 1293 -PLATFDTCSRMGSSWNGHEHFCEEV-TDDAHSESDNGDDIVEQELKTTEEFQKAKLEES 1466
             P  +F   +  G S +  E F     +DD +++    D  +  E++  ++  KAKLEES
Sbjct: 344  FPADSFVNGNGKGQSSDSDEDFVSNSGSDDCNTDVYKIDFSISHEMEIIQQVDKAKLEES 403

Query: 1467 CIVVDVKELPFVPHHTGRQRSYKKKFRDALASRMRLAKKQENEQLATWQGDTSNP-RTEC 1643
            CI+V+  E  ++P    + +SYKKK RD  + R R  +K  +EQL+   G  SNP + EC
Sbjct: 404  CILVNRDECHYLPQSERKSKSYKKKIRDVFSPRKRSMRK--HEQLSICPGSDSNPNQEEC 461

Query: 1644 ---SSPSVLTGDLKKSSTHDTSESEWELL 1721
               S P     D  + ST D  +SEWE L
Sbjct: 462  AKNSMPRHTIKDADRYSTPDCCDSEWEFL 490


>ref|XP_003525991.1| PREDICTED: uncharacterized protein LOC100803672 [Glycine max]
          Length = 533

 Score =  130 bits (327), Expect = 1e-27
 Identities = 149/536 (27%), Positives = 227/536 (42%), Gaps = 69/536 (12%)
 Frame = +3

Query: 321  MDLKVKGISWVGNVYQKFEAMCLEVEDAVCQETAKYVESQVQTVGSSVKKFYAEVMQDLL 500
            MDLK++ I WVGN+YQKFEA+C EV+D V Q+  KY+E+QVQ VG SVKKFY+ V+ +LL
Sbjct: 1    MDLKIQHIKWVGNIYQKFEAVCQEVDDIVGQDAVKYLENQVQNVGDSVKKFYSGVVHELL 60

Query: 501  P-PSSLDPK---EIEGTEQNAAF--GTCEKQKLSTEGLEKDNSPYN-------------- 620
            P P+S D K          N  F   +    K + +  +++N   N              
Sbjct: 61   PFPTSADSKYESHSVALTNNIGFPVESVVGHKDNNKKRDEENPTNNVIKSLQESSAIDIA 120

Query: 621  DGSEIHVPAVKGLCEK-------------QKEEADVCKRPNAGTKKYPINERL------- 740
            +  ++ VP    L ++              +EE     R  +G KK  +N  +       
Sbjct: 121  NNQQVGVPIKHKLIDETCSDSLEVEDSYITQEEVGDDSRETSGAKKEKLNTSIEEVSVES 180

Query: 741  LPVDMSEVIISGKSSSQASLRSRVHSTS-------QLLPSLSVDPVVAAGSRLFLEQNC- 896
            +P  M+ + +  K S +  + S  +S S        +    ++D  V   S L +E+N  
Sbjct: 181  VPKSMNLMSLREKESLEFPIHSESYSDSSDSGCEDSIAKKDNIDVTVEQNSCLVVEKNAM 240

Query: 897  ---NDEVCRNSTVPIDKGPLKANLSLTEVSEIVD-PAGEGTCQVSSFGCVRKEDNAKPCD 1064
                 EV  + ++  ++  +K +L  +E S+ VD    +   +VS    V  E   +P  
Sbjct: 241  NSSTSEVLSSQSLDGEES-IKVSL-FSESSDAVDEDTHDILAEVSPDASVSSE---RPII 295

Query: 1065 KLMKMTSSIDFTICNS---------PEKIRPLCSNRMVESGHXXXXXXXXXXXXXVLPVA 1217
             + +   S +F   +S         P +I   C N   ++                 P  
Sbjct: 296  TMTEPLCSRNFITSDSLYSKSLGSYPLEIES-CKNNSGDATLCISDSSMMHICCESSPHV 354

Query: 1218 SRERKIVESGLTTFSGI--PTEANGLDPLATFDTCSRMGSSWNGHEHFCEEVTDDAHSES 1391
            +R+    + GL  FSG     E+NG         C +  +       F   + +   S  
Sbjct: 355  ARQIMESQDGL-AFSGYCQSLESNGCHSYLCCINCVKFAA-------FASLMLNTGESNK 406

Query: 1392 DNGDDIVEQELKTTEEFQKAKLEESCIVVDVKELPFVPHHTGRQRSYKKKFRDALASRMR 1571
                  VE  L+  +     KLEE+C+ VD  EL  V     + RSYKK+  DA +S+ R
Sbjct: 407  SLFSS-VESSLEDIDLNDDPKLEENCVFVDDSELYAVSCRAQKLRSYKKRILDAFSSKKR 465

Query: 1572 LAKKQENEQLATWQGDTS-NPRTECSSPSV-----LTGDLKKSSTHDTSESEWELL 1721
            L+K  E EQLA W GDT   P+   S  S+        D K       SE+EWELL
Sbjct: 466  LSK--EYEQLAIWYGDTDIEPKQGFSQTSLPFISRTYMDSKNVQVQRASETEWELL 519


>ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana]
            gi|16612317|gb|AAL27517.1|AF439849_1 At2g31130/T16B12.6
            [Arabidopsis thaliana] gi|20197328|gb|AAC63838.2|
            expressed protein [Arabidopsis thaliana]
            gi|23506163|gb|AAN31093.1| At2g31130/T16B12.6
            [Arabidopsis thaliana] gi|330253402|gb|AEC08496.1|
            uncharacterized protein [Arabidopsis thaliana]
          Length = 419

 Score =  128 bits (322), Expect = 5e-27
 Identities = 131/475 (27%), Positives = 201/475 (42%), Gaps = 13/475 (2%)
 Frame = +3

Query: 336  KGISWVGNVYQKFEAMCLEVEDAVCQETAKYVESQVQTVGSSVKKFYAEVMQDLLPPSSL 515
            KGI WVGNVYQKFEAMCLEVE+ + Q+TAKYVE+QVQTVG+SVKKF ++V+ DLLP  S+
Sbjct: 4    KGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVHDLLPDESV 63

Query: 516  D---PKEIEGTEQNAAFGTCEKQKLSTEGLEKDNSPYNDGSEIHVPA-VKGLCEKQKEEA 683
            D   P  +    + A   + +K+K S     KD +   + +E       K L     ++ 
Sbjct: 64   DSGKPLPVSMLHEYAPVYSFKKKKDSMNRKTKDVTQEQEVTEGKKDGFAKKLRGLDADDY 123

Query: 684  DVCKRPNAGTKKYPINERLLPVDMSEVIISGKSSSQASLRSRVHSTSQLLPSLSVDPVVA 863
            D+C  P    ++Y          +    I  K      +R  +      L SLS+     
Sbjct: 124  DICTSP----RQYSYGGPYRRTRIGRKQIFKKEELSQVIRPYIQKD---LTSLSMVHSAR 176

Query: 864  AGSRLFLEQNCNDEVCRNSTVPIDKGPL-KANLSLTEVSEIVDPAGEGTCQVSSFGCVRK 1040
                L    + +  +  ++ V  D G +  ++LS+   + + D  G      S  G V K
Sbjct: 177  VKDDLGTVNSSSLSMVHSARVNDDVGTVNSSSLSMVHHASMKDDVGTVKSSDSPPGEVEK 236

Query: 1041 EDNAKPCDKLMKMTSSIDFTICNSPEKIRPLCSNRMVESGHXXXXXXXXXXXXXVLPVAS 1220
              + K C K  K  +    T+ NS                                 V S
Sbjct: 237  LISKKKCQKDDKAKNQQSLTVVNS---------------------------------VKS 263

Query: 1221 RERKIV---ESGLTTFSGIPTEANGLDPLATFDTCSRMGSSWNGHEHFCEEVTDDAHSES 1391
             + +++   E GL+    + ++   + P         + +S       C + T+   S S
Sbjct: 264  NDSEVIVDNEHGLSADKSVRSQDLEIQP--------SLATSLPAESDDCRKETNVETSSS 315

Query: 1392 DNGDDIVEQELKTTEEFQKAKLEESCIVVDVKELPFVPHHTGRQRSYK--KKFRDALASR 1565
                 + E + +  +      +EESCI+VD  E   V         +K  KK RDA++SR
Sbjct: 316  ----SVSEPKSEILQHLSGRSVEESCILVDRDEFHSVFPDKMENDKHKPYKKIRDAISSR 371

Query: 1566 MRLAKKQENEQLA-TWQGDTSNPRTECSSPSVLTGDLKK--SSTHDTSESEWELL 1721
            M+  +++E ++LA  W  +      EC       GD  K       + ESEWELL
Sbjct: 372  MKQNREKEYKRLARQWYAEDVENGREC-------GDNPKPIEENQSSEESEWELL 419


>ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp.
            lyrata] gi|297326997|gb|EFH57417.1| hypothetical protein
            ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata]
          Length = 418

 Score =  124 bits (312), Expect = 7e-26
 Identities = 130/475 (27%), Positives = 205/475 (43%), Gaps = 13/475 (2%)
 Frame = +3

Query: 336  KGISWVGNVYQKFEAMCLEVEDAVCQETAKYVESQVQTVGSSVKKFYAEVMQDLLPPSSL 515
            KGI WVGNVYQKFEAMCLEVE+ + Q+TAKYVE+QVQTVG+SVKKF ++V+QDLLP  S+
Sbjct: 4    KGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVQDLLPDDSV 63

Query: 516  D---PKEIEGTEQNAAFGTCEKQKLSTEGLEKDNSPYNDGSEIHVPAVKGLCEK----QK 674
            D   P  +    + A   + +K++   + + +         E+      G  +K      
Sbjct: 64   DSGKPLPVSMLHEYAPVCSFKKKR---DSMNRKTRDVKQEQEVTEGKKDGCAQKFRGLDA 120

Query: 675  EEADVCKRPNAGTKKYPINERLLPVDMSEVIISGKSSSQASLRSRVHSTSQLLPSLSVDP 854
            ++ D+C  P    ++Y          +    I  K       R  +   S    SLS+  
Sbjct: 121  DDYDICTSP----RQYSYGGPYRRTRVGRKQIFKKEELSQVTRPYMQKDSS---SLSMVH 173

Query: 855  VVAAGSRLFLEQNCNDEVCRNSTVPIDKGPL-KANLSLTEVSEIVDPAGEGTCQVSSFGC 1031
                   +    + +  +  ++ V  D G +  ++L++   + I D  G      S  G 
Sbjct: 174  SARVKDDVGTVNSSSLSMVHSARVKDDVGTVNSSSLTMVHSARIKDDVGTVKSSDSPPGE 233

Query: 1032 VRKEDNAKPCDKLMKMTSSIDFTICNSPEKIRPLCSNRMVESGHXXXXXXXXXXXXXVLP 1211
            V K    K C K  K  +    T+ NS   ++   S   +++ H             ++ 
Sbjct: 234  VEKLIYKKECQKDDKTKNQQSLTVVNS---VKRNDSEIRIDNEH------------GLMG 278

Query: 1212 VASRERKIVESGLTTFSGIPTEANGLDPLATFDTCSRMGSSWNGHEHFCEEVTDDAHSES 1391
             +S++ +I  S  T+ +            A  D C +             E   D  + S
Sbjct: 279  DSSQDSEIQPSVATSLA------------AGSDDCRK-------------ETNVDTKTSS 313

Query: 1392 DNGDDIVEQELKTTEEFQKAKLEESCIVVDVKELPFVPHHTGRQRSYK--KKFRDALASR 1565
             +   + EQ+ +  +      +EESCI+VD  E   V         +K  KK RDA++SR
Sbjct: 314  SS---VSEQKSEILQPLSGRSVEESCILVDRDEFHCVFPDKMENDKHKPYKKIRDAISSR 370

Query: 1566 MRLAKKQENEQLA-TWQGDTSNPRTECSSPSVLTGDLKKSSTHDTS--ESEWELL 1721
            M+  +++E ++LA  W  +      EC       GD  K    + S  ESEWELL
Sbjct: 371  MKQNREKEYKRLARQWYAEDVENGREC-------GDDPKPLEENQSPEESEWELL 418


>ref|XP_004149372.1| PREDICTED: uncharacterized protein LOC101205697 [Cucumis sativus]
          Length = 379

 Score =  102 bits (254), Expect = 4e-19
 Identities = 43/68 (63%), Positives = 58/68 (85%)
 Frame = +3

Query: 327 LKVKGISWVGNVYQKFEAMCLEVEDAVCQETAKYVESQVQTVGSSVKKFYAEVMQDLLPP 506
           + VKGI+WVG +Y+KFE MCLEVED +CQ+T KYVE+QV+ VG+SVK+FY++VMQD LPP
Sbjct: 1   MDVKGIAWVGRLYEKFETMCLEVEDIICQDTVKYVENQVEVVGASVKRFYSDVMQDFLPP 60

Query: 507 SSLDPKEI 530
           S L  +++
Sbjct: 61  SELSDEKV 68


Top