BLASTX nr result

ID: Coptis21_contig00008561 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis21_contig00008561
         (1731 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI17664.3| unnamed protein product [Vitis vinifera]              286   1e-74
ref|XP_003536157.1| PREDICTED: uncharacterized protein LOC100801...   266   2e-68
ref|XP_003556442.1| PREDICTED: uncharacterized protein LOC100793...   261   3e-67
ref|XP_002512619.1| conserved hypothetical protein [Ricinus comm...   238   5e-60
ref|XP_004152835.1| PREDICTED: uncharacterized protein LOC101211...   226   1e-56

>emb|CBI17664.3| unnamed protein product [Vitis vinifera]
          Length = 309

 Score =  286 bits (732), Expect = 1e-74
 Identities = 164/334 (49%), Positives = 218/334 (65%), Gaps = 25/334 (7%)
 Frame = -1

Query: 1650 MSMTQFAMVEELAFLVKDNLSCKHLILSVEEALVDFLQNDTSLDGILELKPMSSYNRLLL 1471
            M+MTQFAMVEELAFL+KDNL CKHL+LS+EEALV+FLQ+D SLDG++EL+PM+ Y RLL+
Sbjct: 1    MTMTQFAMVEELAFLIKDNLPCKHLVLSIEEALVNFLQDDNSLDGVMELEPMNPYERLLV 60

Query: 1470 HRLADIFGLAHESVGEGDDRHLILERSPESSIPSVLVSDILWQCNDYQSPAASHQLLRRT 1291
            HRLADIFG AH S+GEGD+RHL LER PE+SIPS+LVSDI+WQ  + Q P  SHQLLRR 
Sbjct: 61   HRLADIFGFAHVSIGEGDERHLTLERCPETSIPSILVSDIIWQYQESQPPTMSHQLLRRN 120

Query: 1290 EALPAAKNNPTSLGTTXXXXXXXXXXXXXRIFSEENGGIRESVAPKPRNVPMVARRMIAH 1111
            EA P  K    SL  +             RIFS + G IRESV  KPRNVP+VARRMIAH
Sbjct: 121  EASPVLKATEPSLQYSLEEREAAYLAARERIFSMDEGEIRESVQQKPRNVPVVARRMIAH 180

Query: 1110 ALGQKIHTNPSSVKINLSNSKENDEERIKELSTEEDNGNCLSGSLRNPQESSSKSVQKMR 931
            ALG +I++    VK          E +I EL+ ++             ++SS+++ Q   
Sbjct: 181  ALGHRINSCNQDVK--------ECEGQIAELTVQD-------------KDSSAEACQNEN 219

Query: 930  SCERIAPGSKGRNASGSASQGERKIQKKVSDSRTSNGGVLQTGSSGKVVSSENFKQEHMG 751
            S  + + GS   N +G+ S G RK  +K +D   SN     +  +   V +EN K+ H+G
Sbjct: 220  SRSKTSVGS--HNGNGALSHGGRKTPQKSADRSPSNSSSPPSRRNRSRVCTENPKEVHLG 277

Query: 750  AAKRIFANALGLQGVKETNSLLLKCTNSKAIDKQ 649
            AA+R+FA+ALGLQ  K+  +L+ K  N++ ++ +
Sbjct: 278  AARRMFAHALGLQSPKD--ALISKPGNTQQMNTE 309


>ref|XP_003536157.1| PREDICTED: uncharacterized protein LOC100801478 [Glycine max]
          Length = 319

 Score =  266 bits (679), Expect = 2e-68
 Identities = 163/321 (50%), Positives = 206/321 (64%), Gaps = 4/321 (1%)
 Frame = -1

Query: 1650 MSMTQFAMVEELAFLVKDNLSCKHLILSVEEALVDFLQND-TSLDGILELKPMSSYNRLL 1474
            MSMTQFAMVEELAFLVKDNL CKHL+L++EE+LV+FLQ+D TS DGILEL+PM+SYNRLL
Sbjct: 1    MSMTQFAMVEELAFLVKDNLPCKHLVLTMEESLVNFLQDDDTSSDGILELEPMNSYNRLL 60

Query: 1473 LHRLADIFGLAHESVGEGDDRHLILERSPESSIPSVLVSDILWQCNDY-QSPAASHQLLR 1297
            LHRLA+IFG AHESVGEGDDRHLILER P++SIP VLVSDILW+ +D  QS   SHQ+LR
Sbjct: 61   LHRLAEIFGFAHESVGEGDDRHLILERCPDTSIPPVLVSDILWKYDDEPQSLVTSHQILR 120

Query: 1296 RTEALPAAKNNPTSLGTTXXXXXXXXXXXXXRIFSEENGGIRESVAPKPRNVPMVARRMI 1117
            R+E+ P  + N  SL  +             RIFS +   ++E    KPR+VP+VARRMI
Sbjct: 121  RSESSPVLQKNTASLSQSLEERKAAYLIARERIFSMKLEEVKEPGEQKPRSVPVVARRMI 180

Query: 1116 AHALGQKIHTNPSSVKINLSNSKENDEERIKELSTEEDNGNCLSGSLRNPQESSSKSVQK 937
            AHALGQ+IHT   +   +L++    D     EL+  +          +N +ES+ K V +
Sbjct: 181  AHALGQRIHTKNQN---DLASDSMKDGVLTDELNAHD----------KNMEESTLKKVSE 227

Query: 936  MRSCERIAPGSKGRNASGS--ASQGERKIQKKVSDSRTSNGGVLQTGSSGKVVSSENFKQ 763
              S  R    +  RN S S  AS  +R  Q  V           Q    G  VS +  K+
Sbjct: 228  ESSHLRGNSNNSIRNTSSSNAASLNKRNDQTTVDKDLPE---FSQERKQGLSVSKDYIKK 284

Query: 762  EHMGAAKRIFANALGLQGVKE 700
            EH+GAAKR+FA+ALG+   K+
Sbjct: 285  EHLGAAKRMFAHALGVHSGKD 305


>ref|XP_003556442.1| PREDICTED: uncharacterized protein LOC100793409 [Glycine max]
          Length = 333

 Score =  261 bits (668), Expect = 3e-67
 Identities = 162/335 (48%), Positives = 211/335 (62%), Gaps = 18/335 (5%)
 Frame = -1

Query: 1650 MSMTQFAMVEELAFLVKDNLSCKHLILSVEEALVDFLQND-TSLDGILELKPMSSYNRLL 1474
            MSMTQFAMVEELAFLVKDNL CKHL+L++EEALV+FLQ+D TS DGILEL+PM+SYNRLL
Sbjct: 1    MSMTQFAMVEELAFLVKDNLPCKHLVLTMEEALVNFLQDDDTSSDGILELEPMNSYNRLL 60

Query: 1473 LHRLADIFGLAHESVGEGDDRHLILERSPESSIPSVLVSDILWQCNDYQSPAASHQLLRR 1294
            LHRLA+IFG AHESVGEGDDRHLILER P++SIP +LVSDILW+ ++ QS   SHQ+LRR
Sbjct: 61   LHRLAEIFGFAHESVGEGDDRHLILERCPDTSIPPILVSDILWKYDEPQSLVTSHQILRR 120

Query: 1293 TEALP----------------AAKNNPTSLGTTXXXXXXXXXXXXXRIFSEENGGIRESV 1162
            + A P                 ++ N  SL  +             RIFS +   ++E  
Sbjct: 121  SVASPDSCSVLWALIRVLHLLVSQKNTASLSQSLEERKAAYLIARERIFSMKLEEVKEPG 180

Query: 1161 APKPRNVPMVARRMIAHALGQKIHTNPSSVKINLSNSKENDEERIKELSTEEDNGNCLSG 982
              KPR+VP+VARRMIAHALGQ+IHT   +   +L++    D     ELS ++        
Sbjct: 181  EQKPRSVPVVARRMIAHALGQRIHTKNQN---DLASDSMKDGVLTDELSAQD-------- 229

Query: 981  SLRNPQESSSKSVQKMRSCERIAPGSKGR-NASGSASQGERKIQKKVSDSRTSNGGVLQT 805
              +N +ES+ K V +  S  R    ++ R N+S +A+  +RK Q  V           + 
Sbjct: 230  --KNQEESTLKKVSEESSHLRGNSNNRIRNNSSNAATLNKRKDQTTVDKDLPQFSAERKQ 287

Query: 804  GSSGKVVSSENFKQEHMGAAKRIFANALGLQGVKE 700
            G S   VS +  K+EH+GAAKR+FA+ALG+   K+
Sbjct: 288  GLS---VSKDYMKKEHLGAAKRMFAHALGVHSGKD 319


>ref|XP_002512619.1| conserved hypothetical protein [Ricinus communis]
            gi|223548580|gb|EEF50071.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 339

 Score =  238 bits (606), Expect = 5e-60
 Identities = 158/363 (43%), Positives = 212/363 (58%), Gaps = 29/363 (7%)
 Frame = -1

Query: 1650 MSMTQFAMVEELAFLVKDNLSCKHLILSVEEALVDFLQ-NDTSLD-GILELKPMSSYNRL 1477
            MS+TQFAMVEELA+LVKDNL CKHL+LS+E+A V+FLQ N+TS D GILEL+PM SY+RL
Sbjct: 1    MSVTQFAMVEELAYLVKDNLRCKHLVLSMEDAFVNFLQDNNTSTDDGILELEPMDSYSRL 60

Query: 1476 LLHRLADIFGLAHESVGEGDDRHLILERSPESSIPSVLVSDILWQCNDYQSPAASHQLLR 1297
            LLHRLADIFG AH SVGEGD+RHLILER PE+S+PS+LV+DIL+Q ++ +    SHQLLR
Sbjct: 61   LLHRLADIFGFAHVSVGEGDERHLILERCPETSVPSILVNDILFQYDEPEPLTMSHQLLR 120

Query: 1296 RTEALPA------------------------AKNNPTSLGTTXXXXXXXXXXXXXRIFSE 1189
            R  A PA                         + +P+ L  T             RIFS 
Sbjct: 121  RDGASPAIGYVPILDALNVEISYQLMLVSVLKEKSPSFL--TLEEREAAYSAARERIFSV 178

Query: 1188 ENGGIRESVAPKPRNVPMVARRMIAHALGQKIHTNPSSVKINLSNSKENDEERIKELSTE 1009
            + G ++E    KPRNVP+VARRMIAHALGQK+    +     +  +  N +++ K +   
Sbjct: 179  DVGEMKEPQRQKPRNVPVVARRMIAHALGQKLSPQKNCKGYEVQTTLLNVQDKDK-IDPM 237

Query: 1008 EDNGNCLSGSLRNPQ---ESSSKSVQKMRSCERIAPGSKGRNASGSASQGERKIQKKVSD 838
            ED      G++  PQ   +S  K+    R C   +PG +  + + +         K  +D
Sbjct: 238  ED----FEGTIFQPQKYVDSHDKAKSNNRQCSASSPGKRNMSHAPAC--------KMSTD 285

Query: 837  SRTSNGGVLQTGSSGKVVSSENFKQEHMGAAKRIFANALGLQGVKETNSLLLKCTNSKAI 658
              TS+ G      S +    E  K+EH+GAAKR+ A+ALGLQ  K     L +C+ +K +
Sbjct: 286  MSTSHNG------SSRNRVKEYSKEEHVGAAKRMLAHALGLQSSK---GGLARCSATKPV 336

Query: 657  DKQ 649
            D +
Sbjct: 337  DME 339


>ref|XP_004152835.1| PREDICTED: uncharacterized protein LOC101211053 [Cucumis sativus]
          Length = 309

 Score =  226 bits (577), Expect = 1e-56
 Identities = 140/312 (44%), Positives = 185/312 (59%), Gaps = 1/312 (0%)
 Frame = -1

Query: 1650 MSMTQFAMVEELAFLVKDNLSCKHLILSVEEALVDFLQNDTSLDGILELKPMSSYNRLLL 1471
            M++ QFAMVEELAFLVKDNL  KHLILS+EE  ++FL N+TS DGILELKPM SYNRLLL
Sbjct: 1    MTVAQFAMVEELAFLVKDNLPSKHLILSMEETFINFLHNETSSDGILELKPMDSYNRLLL 60

Query: 1470 HRLADIFGLAHESVGEGDDRHLILERSPESSIPSVLVSDILWQCNDYQSPAASHQLLRRT 1291
            HRLADIFGL H SVGEGD+RHL+LER PESSIPS+LVSDILW+ ++ Q     HQLLRR 
Sbjct: 61   HRLADIFGLGHVSVGEGDNRHLVLERYPESSIPSILVSDILWEYDEPQMSTIPHQLLRRK 120

Query: 1290 EALPAAKNNPTSLGTTXXXXXXXXXXXXXRIFSEENGGIRESVAPKPRNVPMVARRMIAH 1111
            E   +A +  +S   +             RIF    G   E + PKPR  P VARRMIAH
Sbjct: 121  EN-SSASSTKSSPQRSLEEREAAYLAVRERIFMTHVGEDNEPLKPKPRCDPAVARRMIAH 179

Query: 1110 ALGQKIHTNPSSVKINLSNSKENDEERIKELSTEEDNGNCLSGSLRNPQESSSKSVQKMR 931
            ALGQ++++         +N  + ++  +   +  +   + L  S     E+ +K++ +  
Sbjct: 180  ALGQRVNSLSED-----TNCHQKEQGGVTNNAYIQARDSKLPDS---TVEAINKTISRSD 231

Query: 930  SCERIAPG-SKGRNASGSASQGERKIQKKVSDSRTSNGGVLQTGSSGKVVSSENFKQEHM 754
             C  +     K  N   S ++G    + K + S                V +E+ K+EH+
Sbjct: 232  QCVNLKNELDKNCNPDVSLARGSTAAKMKPAKSY----------PKASHVDNEHLKREHL 281

Query: 753  GAAKRIFANALG 718
            GAAKR+F+ ALG
Sbjct: 282  GAAKRMFSQALG 293