BLASTX nr result

ID: Coptis21_contig00007669

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00007669
         (2160 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus c...   178   6e-42
ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250...   169   4e-39
ref|XP_003525991.1| PREDICTED: uncharacterized protein LOC100803...   137   1e-29
ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana] ...   126   2e-26
ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arab...   125   4e-26

>ref|XP_002525120.1| hypothetical protein RCOM_0553590 [Ricinus communis]
            gi|223535579|gb|EEF37247.1| hypothetical protein
            RCOM_0553590 [Ricinus communis]
          Length = 490

 Score =  178 bits (451), Expect = 6e-42
 Identities = 159/515 (30%), Positives = 243/515 (47%), Gaps = 50/515 (9%)
 Frame = -3

Query: 1819 LKVKGISWVGNVYQKFEAMCLEVEDAVCQETAKYVESQVQTVGSSVKKFYAEVMQDLLPP 1640
            + +KGISWVGN+YQKFEAMCLEVE+ + Q+T KYVE+QVQTVGSSVK+FY++VMQDLLPP
Sbjct: 1    MDLKGISWVGNIYQKFEAMCLEVEEVMYQDTVKYVENQVQTVGSSVKRFYSDVMQDLLPP 60

Query: 1639 SSLDSKENEGT----EQNAAFGTCEKQKLSTEGFEKDNSPYNDGSEIHVPAVKGLCEKQK 1472
            SS+D+ +  G     E  A  G   K K+   G ++     +D           L E  K
Sbjct: 61   SSVDAAKGAGVDVPLELYADLGIYMKPKV---GVKEKQGKVDDRER--------LTEDPK 109

Query: 1471 DEADVCKRSDAGTKKYPINERLLPVDMSEGIILGKSSSQVSLRSRVHSTSQLL------P 1310
               D          +  + E   P  +S+G   G +S Q   RS  + ++          
Sbjct: 110  ITTDKKSMDPLTFHRLGLVENRFP--LSQGNSAGGASRQHGKRSLSNKSNPYTRKNSNRE 167

Query: 1309 SLSVDPVVAAGSRL--------FLE---QNCNDE------------VCRNSTVPIDKGPL 1199
            ++SVD  + A S L        F E   +N  D             + +++++  +    
Sbjct: 168  NMSVDKKLEAISCLDKGLIRASFSERSNENLGDSGGGAPKQYGDSCLPKDTSLGTNGNSE 227

Query: 1198 KANLSLTEVSEIVDPAGEGTCQASSFSCVRKEDNAKPC-DKLTKMTS--SIDFTICNSPE 1028
            + N+ L E + +V P      +ASS  C    +N K C D+  K+T+  S++ T  +S +
Sbjct: 228  RQNIFLHEKARVVIPLYNDLTRASSI-CELSNENHKDCVDQQAKITTPGSVEMTGHDSVD 286

Query: 1027 KTRPLCSNRMVESGHTADXXXXXXXXXSVLPVASRERKIVESGLTTFSGIPTEANGLD-- 854
            +++    N   +     D          V    S   K ++   ++   +  EA+  D  
Sbjct: 287  ESKYEIENASEQIPDIPD---------MVNSTESGASKGMDMTCSSHGSLSAEAHAADDC 337

Query: 853  -------PLATFDTRSRMGSSWNGHEHFCEEV-TDDAHSESDKGDDIVEQELKTAEEFQK 698
                   P  +F   +  G S +  E F     +DD +++  K D  +  E++  ++  K
Sbjct: 338  MSHGADFPADSFVNGNGKGQSSDSDEDFVSNSGSDDCNTDVYKIDFSISHEMEIIQQVDK 397

Query: 697  AKLEESCIVVDVKELPFVPHHTGRQRSYKKKFRDALASRMRLAKKQENEQLATWQGDTSN 518
            AKLEESCI+V+  E  ++P    + +SYKKK RD  + R R  +K  +EQL+   G  SN
Sbjct: 398  AKLEESCILVNRDECHYLPQSERKSKSYKKKIRDVFSPRKRSMRK--HEQLSICPGSDSN 455

Query: 517  P-RTEC---SSPSVLTGDLKKSSTHDTSESEWELL 425
            P + EC   S P     D  + ST D  +SEWE L
Sbjct: 456  PNQEECAKNSMPRHTIKDADRYSTPDCCDSEWEFL 490


>ref|XP_002279986.1| PREDICTED: uncharacterized protein LOC100250516 [Vitis vinifera]
            gi|302143402|emb|CBI21963.3| unnamed protein product
            [Vitis vinifera]
          Length = 451

 Score =  169 bits (427), Expect = 4e-39
 Identities = 154/494 (31%), Positives = 234/494 (47%), Gaps = 32/494 (6%)
 Frame = -3

Query: 1810 KGISWVGNVYQKFEAMCLEVEDAVCQETAKY-------VESQVQTVGSSVKKFYAEVMQD 1652
            KGI+WVGN+YQKFE +CLEVED + Q+T KY       VE QV+TVG SVKKF +E++QD
Sbjct: 4    KGITWVGNMYQKFETICLEVEDIMYQDTVKYFENHVKYVEDQVETVGESVKKFCSEIVQD 63

Query: 1651 LLPPSSLD-SKENEGTEQNAAFGTCEKQKLSTEGFEKDNSPYNDGSEIHVPAVKGLCEKQ 1475
            LL P SL+ +  N   +Q+     C+K K+                              
Sbjct: 64   LLLPDSLEVTDSNLSLDQHDNVKLCKKPKVGI---------------------------- 95

Query: 1474 KDEADVCKRSDAGTKKYPINERLLPVDM------SEGIILGKSSSQVSLRSRVHSTSQLL 1313
            K+EA V  + +    K  I E  +  D+      SE   L +     S  + +H  + L 
Sbjct: 96   KEEAKVGFKEEP---KVSIKEEFIKFDIDRLTEHSEIADLNEDVEHKSSFTGLHGVNNLF 152

Query: 1312 PSLSVDPVVAAGSRLFLEQNCNDEVCRNSTVPIDKGPLKANLSLTEVSEIVDP-AGEGTC 1136
             S S + V  A S L L QN +  +C+N    I + P+K +    EVS ++ P +G+ + 
Sbjct: 153  QSYSGNSVTGACSDLHLVQNDDGVMCKNLDAGIKRNPVKVSQFPIEVSGVIAPISGDVSR 212

Query: 1135 QASSFSCVRKEDNAKPCDKL--TKMTSSIDFTICNSPEKTRPLCSNRMVESGHTADXXXX 962
              SS +    E+    C+++  T   +S++ T CN       +C+         AD    
Sbjct: 213  LPSSLN----ENCENKCNQMAITSSPASVEITDCNLEGA---ICNE-------IADVTAI 258

Query: 961  XXXXXSVLPVASRERKIVESGLTTFSGIPTEANGLDPLATFDTRSRMGSSWNGHEHFCEE 782
                 SV  V S  ++  E   ++  G+ +E N  +        S +GS  +  ++   E
Sbjct: 259  SVDLPSVPLVESVGKEGREMVFSSRGGLSSELNAGNIPMDNGVGSLIGSFRDIQQNETAE 318

Query: 781  VTDD-AHSESDKG--------DDIVEQELKTAEEF-QKAKLEESCIVVDVKELPFVPHHT 632
              D  +HSE   G        +D++EQ ++T ++   K KLE++C++VD  EL  V H  
Sbjct: 319  KKDLLSHSEGSDGWNIDAIEINDVIEQGIETTKDLLDKMKLEDACVMVDGDELHVVSHRE 378

Query: 631  GRQRSYKKKFRDALASRMRLAKKQENEQLATW----QGDTSNPRTECSSPSVLT-GDLKK 467
            G+    KKK R+A  S+ RLA+K E E+LA W      +++ P  E  +PS  T  D + 
Sbjct: 379  GKVWLVKKKLRNAFYSKRRLARK-EYERLAVWHRVIDSESNQPGAEGLTPSPSTDSDKRT 437

Query: 466  SSTHDTSESEWELL 425
            S   D  +SEWELL
Sbjct: 438  SPDDDFCQSEWELL 451


>ref|XP_003525991.1| PREDICTED: uncharacterized protein LOC100803672 [Glycine max]
          Length = 533

 Score =  137 bits (345), Expect = 1e-29
 Identities = 153/540 (28%), Positives = 235/540 (43%), Gaps = 73/540 (13%)
 Frame = -3

Query: 1825 MDLKVKGISWVGNVYQKFEAMCLEVEDAVCQETAKYVESQVQTVGSSVKKFYAEVMQDLL 1646
            MDLK++ I WVGN+YQKFEA+C EV+D V Q+  KY+E+QVQ VG SVKKFY+ V+ +LL
Sbjct: 1    MDLKIQHIKWVGNIYQKFEAVCQEVDDIVGQDAVKYLENQVQNVGDSVKKFYSGVVHELL 60

Query: 1645 P-PSSLDSK---------ENEGTEQNAAFGTCEKQKLSTEGFEKDN----------SPYN 1526
            P P+S DSK          N G    +  G  +  K   E    +N              
Sbjct: 61   PFPTSADSKYESHSVALTNNIGFPVESVVGHKDNNKKRDEENPTNNVIKSLQESSAIDIA 120

Query: 1525 DGSEIHVPAVKGLCEK-------------QKDEADVCKRSDAGTKKYPINERLLPVDMSE 1385
            +  ++ VP    L ++              ++E     R  +G KK  +N  +  V +  
Sbjct: 121  NNQQVGVPIKHKLIDETCSDSLEVEDSYITQEEVGDDSRETSGAKKEKLNTSIEEVSVES 180

Query: 1384 GIILGKSSSQVSLRSR------VHSTSQLLPSLSVDPVVAAGSRLFLEQNCNDEVCRNST 1223
               + KS + +SLR +      +HS S    S S          +  + N +  V +NS 
Sbjct: 181  ---VPKSMNLMSLREKESLEFPIHSESYSDSSDS-----GCEDSIAKKDNIDVTVEQNSC 232

Query: 1222 VPIDKGPLKANLSLTEVSEIVDPAGEGTCQASSFSCVRKEDNAKPCDKLTKMTSSIDFTI 1043
            + ++K  + ++ S    S+ +D  GE + + S FS      +    D L ++  S D ++
Sbjct: 233  LVVEKNAMNSSTSEVLSSQSLD--GEESIKVSLFSESSDAVDEDTHDILAEV--SPDASV 288

Query: 1042 CNSP---EKTRPLCSNRMVESGHTADXXXXXXXXXSVLPVASRERKIVESGL--TTFSGI 878
             +       T PLCS   +    T+D           L + S +    ++ L  +  S +
Sbjct: 289  SSERPIITMTEPLCSRNFI----TSDSLYSKSLGSYPLEIESCKNNSGDATLCISDSSMM 344

Query: 877  PTEANGLDPLATFDTRSRMGSSWNGHEHFCEEV-TDDAHS-------------------- 761
                     +A     S+ G +++G   +C+ + ++  HS                    
Sbjct: 345  HICCESSPHVARQIMESQDGLAFSG---YCQSLESNGCHSYLCCINCVKFAAFASLMLNT 401

Query: 760  -ESDKG-DDIVEQELKTAEEFQKAKLEESCIVVDVKELPFVPHHTGRQRSYKKKFRDALA 587
             ES+K     VE  L+  +     KLEE+C+ VD  EL  V     + RSYKK+  DA +
Sbjct: 402  GESNKSLFSSVESSLEDIDLNDDPKLEENCVFVDDSELYAVSCRAQKLRSYKKRILDAFS 461

Query: 586  SRMRLAKKQENEQLATWQGDTS-NPRTECSSPSV-----LTGDLKKSSTHDTSESEWELL 425
            S+ RL+K  E EQLA W GDT   P+   S  S+        D K       SE+EWELL
Sbjct: 462  SKKRLSK--EYEQLAIWYGDTDIEPKQGFSQTSLPFISRTYMDSKNVQVQRASETEWELL 519


>ref|NP_565715.1| uncharacterized protein [Arabidopsis thaliana]
            gi|16612317|gb|AAL27517.1|AF439849_1 At2g31130/T16B12.6
            [Arabidopsis thaliana] gi|20197328|gb|AAC63838.2|
            expressed protein [Arabidopsis thaliana]
            gi|23506163|gb|AAN31093.1| At2g31130/T16B12.6
            [Arabidopsis thaliana] gi|330253402|gb|AEC08496.1|
            uncharacterized protein [Arabidopsis thaliana]
          Length = 419

 Score =  126 bits (317), Expect = 2e-26
 Identities = 125/475 (26%), Positives = 193/475 (40%), Gaps = 13/475 (2%)
 Frame = -3

Query: 1810 KGISWVGNVYQKFEAMCLEVEDAVCQETAKYVESQVQTVGSSVKKFYAEVMQDLLPPSSL 1631
            KGI WVGNVYQKFEAMCLEVE+ + Q+TAKYVE+QVQTVG+SVKKF ++V+ DLLP  S+
Sbjct: 4    KGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVHDLLPDESV 63

Query: 1630 DSKENEGTEQNAAFGTCEKQKLSTEGFEKDNSPYNDGSEIHVPAVKGLCEKQK----DEA 1463
            DS +         +      K   +   +         E+      G  +K +    D+ 
Sbjct: 64   DSGKPLPVSMLHEYAPVYSFKKKKDSMNRKTKDVTQEQEVTEGKKDGFAKKLRGLDADDY 123

Query: 1462 DVCKRSDAGTKKYPINERLLPVDMSEGIILGKSSSQVSLRSRVHSTSQLLPSLSVDPVVA 1283
            D+C       ++Y          +    I  K      +R  +      L SLS+     
Sbjct: 124  DIC----TSPRQYSYGGPYRRTRIGRKQIFKKEELSQVIRPYIQKD---LTSLSMVHSAR 176

Query: 1282 AGSRLFLEQNCNDEVCRNSTVPIDKGPL-KANLSLTEVSEIVDPAGEGTCQASSFSCVRK 1106
                L    + +  +  ++ V  D G +  ++LS+   + + D  G      S    V K
Sbjct: 177  VKDDLGTVNSSSLSMVHSARVNDDVGTVNSSSLSMVHHASMKDDVGTVKSSDSPPGEVEK 236

Query: 1105 EDNAKPCDKLTKMTSSIDFTICNSPEKTRPLCSNRMVESGHTADXXXXXXXXXSVLPVAS 926
              + K C K  K  +    T+ NS                                 V S
Sbjct: 237  LISKKKCQKDDKAKNQQSLTVVNS---------------------------------VKS 263

Query: 925  RERKIV---ESGLTTFSGIPTEANGLDPLATFDTRSRMGSSWNGHEHFCEEVTDDAHSES 755
             + +++   E GL+    + ++   + P         + +S       C + T+   S S
Sbjct: 264  NDSEVIVDNEHGLSADKSVRSQDLEIQP--------SLATSLPAESDDCRKETNVETSSS 315

Query: 754  DKGDDIVEQELKTAEEFQKAKLEESCIVVDVKELPFVPHHTGRQRSYK--KKFRDALASR 581
                 + E + +  +      +EESCI+VD  E   V         +K  KK RDA++SR
Sbjct: 316  ----SVSEPKSEILQHLSGRSVEESCILVDRDEFHSVFPDKMENDKHKPYKKIRDAISSR 371

Query: 580  MRLAKKQENEQLA-TWQGDTSNPRTECSSPSVLTGDLKK--SSTHDTSESEWELL 425
            M+  +++E ++LA  W  +      EC       GD  K       + ESEWELL
Sbjct: 372  MKQNREKEYKRLARQWYAEDVENGREC-------GDNPKPIEENQSSEESEWELL 419


>ref|XP_002881158.1| hypothetical protein ARALYDRAFT_482041 [Arabidopsis lyrata subsp.
            lyrata] gi|297326997|gb|EFH57417.1| hypothetical protein
            ARALYDRAFT_482041 [Arabidopsis lyrata subsp. lyrata]
          Length = 418

 Score =  125 bits (315), Expect = 4e-26
 Identities = 134/484 (27%), Positives = 202/484 (41%), Gaps = 22/484 (4%)
 Frame = -3

Query: 1810 KGISWVGNVYQKFEAMCLEVEDAVCQETAKYVESQVQTVGSSVKKFYAEVMQDLLPPSSL 1631
            KGI WVGNVYQKFEAMCLEVE+ + Q+TAKYVE+QVQTVG+SVKKF ++V+QDLLP  S+
Sbjct: 4    KGIKWVGNVYQKFEAMCLEVEEIIVQDTAKYVENQVQTVGNSVKKFCSDVVQDLLPDDSV 63

Query: 1630 DSKENEGTEQNAAFGTCEKQKLSTEGFEKDNSPYNDGSEIHVPAVKGLCEK----QKDEA 1463
            DS +         +      K   +   +         E+      G  +K      D+ 
Sbjct: 64   DSGKPLPVSMLHEYAPVCSFKKKRDSMNRKTRDVKQEQEVTEGKKDGCAQKFRGLDADDY 123

Query: 1462 DVC------------KRSDAGTKKYPINERLLPVDMSEGIILGKSSSQVSLRSRVHSTSQ 1319
            D+C            +R+  G K+    E L  V       + K SS +S+   VHS   
Sbjct: 124  DICTSPRQYSYGGPYRRTRVGRKQIFKKEELSQVTRP---YMQKDSSSLSM---VHSA-- 175

Query: 1318 LLPSLSVDPVVAAGSRLFLEQNCNDEVCRNSTVPIDKGPL-KANLSLTEVSEIVDPAGEG 1142
                +  D      S L         +  ++ V  D G +  ++L++   + I D  G  
Sbjct: 176  ---RVKDDVGTVNSSSL--------SMVHSARVKDDVGTVNSSSLTMVHSARIKDDVGTV 224

Query: 1141 TCQASSFSCVRKEDNAKPCDKLTKMTSSIDFTICNSPEKTRPLCSNRMVESGHTADXXXX 962
                S    V K    K C K  K  +    T+ NS ++     S   +++ H       
Sbjct: 225  KSSDSPPGEVEKLIYKKECQKDDKTKNQQSLTVVNSVKRND---SEIRIDNEH------- 274

Query: 961  XXXXXSVLPVASRERKIVESGLTTFSGIPTEANGLDPLATFDTRSRMGSSWNGHEHFCEE 782
                  ++  +S++ +I  S  T+ +                          G +   +E
Sbjct: 275  -----GLMGDSSQDSEIQPSVATSLAA-------------------------GSDDCRKE 304

Query: 781  VTDDAHSESDKGDDIVEQELKTAEEFQKAKLEESCIVVDVKELPFVPHHTGRQRSYK--K 608
               D  + S     + EQ+ +  +      +EESCI+VD  E   V         +K  K
Sbjct: 305  TNVDTKTSS---SSVSEQKSEILQPLSGRSVEESCILVDRDEFHCVFPDKMENDKHKPYK 361

Query: 607  KFRDALASRMRLAKKQENEQLA-TWQGDTSNPRTECSSPSVLTGDLKKSSTHDTS--ESE 437
            K RDA++SRM+  +++E ++LA  W  +      EC       GD  K    + S  ESE
Sbjct: 362  KIRDAISSRMKQNREKEYKRLARQWYAEDVENGREC-------GDDPKPLEENQSPEESE 414

Query: 436  WELL 425
            WELL
Sbjct: 415  WELL 418

