BLASTX nr result

ID: Coptis24_contig00005036 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00005036
         (1560 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002531255.1| conserved hypothetical protein [Ricinus comm...   348   2e-93
ref|XP_002310564.1| predicted protein [Populus trichocarpa] gi|2...   337   4e-90
ref|XP_002307087.1| predicted protein [Populus trichocarpa] gi|2...   316   1e-83
ref|XP_004137188.1| PREDICTED: uncharacterized protein LOC101220...   315   3e-83
ref|XP_003531351.1| PREDICTED: uncharacterized protein LOC100814...   309   1e-81

>ref|XP_002531255.1| conserved hypothetical protein [Ricinus communis]
            gi|223529140|gb|EEF31119.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 375

 Score =  348 bits (894), Expect = 2e-93
 Identities = 205/375 (54%), Positives = 237/375 (63%), Gaps = 15/375 (4%)
 Frame = +1

Query: 43   KEASFRSKAVHFVSDHLPTVILNPISDDEPSKSYHFA-----DDIKEMDRTHQETITEED 207
            K  SF+SKAVHFVSD L TV LNPISD   SK    A     +D+ E  +   E+I+EED
Sbjct: 3    KAKSFKSKAVHFVSD-LTTVFLNPISDKPSSKPPPHAHHPPTEDLNESIKNQHESISEED 61

Query: 208  SEVAMYGPDTSSFTAFLYSLLSLSESGNCSKVEEQSQRHVDRNELASEPGIKGSSRRKSL 387
            +   + GPDTSSFTAFLYSLLS SESG+ S  +E++    +     S+   K S  RK L
Sbjct: 62   TGDLVDGPDTSSFTAFLYSLLSSSESGDNSNADEKNDNIAEMVNQLSDNVRKESGTRKGL 121

Query: 388  LSWGKQSLGRAINQAVRTNGYKNQTPKGNS----------NHGALEMETLHPPVESVSQI 537
            LS GKQSL +AI  A R  GY+ Q  KGNS          N   LEM+ + P  E V   
Sbjct: 122  LSRGKQSL-KAIYHATRIGGYRGQERKGNSDMIIADESDDNFYGLEMKNMQPVKEPVELG 180

Query: 538  WXXXXXXXXXXXXXKARCSLYASLPVVVQGRKWGLLYSTWRHGISLSTMYRRSMLCPGFS 717
                          KAR  LYASLP +V GRKW +LYSTWRHGISLST+YRRSML PG S
Sbjct: 181  DLPDMSEPSLLLSEKARSVLYASLPALVHGRKWLMLYSTWRHGISLSTLYRRSMLWPGLS 240

Query: 718  LLVVGDRKGAVFGGLVESPLQPTNKRKYQGTNNSFVFTNTSGAPVVFRPTGANRYFTLCS 897
            LLVVGDRKGAVFGGLVE+PL+PTNKRKYQGTN++FVFTNT G PV+FRPTGANRYFTLC+
Sbjct: 241  LLVVGDRKGAVFGGLVEAPLRPTNKRKYQGTNSTFVFTNTPGDPVIFRPTGANRYFTLCT 300

Query: 898  TDXXXXXXXXXXXXXXXXXXXXXXXXVSETFGNPGLAHTEDFTVKEVELWSFNYTPSKYA 1077
            TD                        VSET+GNP LA +EDF VKEVELW F Y  +KY 
Sbjct: 301  TDFLAIGGGGHFALYLDGDLLNGSSSVSETYGNPCLARSEDFEVKEVELWGFVY-GTKYE 359

Query: 1078 ETISSWRTEEPEICR 1122
            E ++  RTE P ICR
Sbjct: 360  EILALSRTEAPGICR 374


>ref|XP_002310564.1| predicted protein [Populus trichocarpa] gi|222853467|gb|EEE91014.1|
            predicted protein [Populus trichocarpa]
          Length = 374

 Score =  337 bits (865), Expect = 4e-90
 Identities = 204/375 (54%), Positives = 239/375 (63%), Gaps = 15/375 (4%)
 Frame = +1

Query: 43   KEASFRSKAVHFVSDHLPTVILNPISDDEPSKS-----YHFADDIKEMDRTHQETITEED 207
            ++ S RSKAVHFVSD L TVILNPISD +PSK      +   +D+ E  R+  E +TEED
Sbjct: 4    QQKSLRSKAVHFVSD-LTTVILNPISD-KPSKHPTPLPHPPPEDVSESKRSQLEFVTEED 61

Query: 208  SEVAMYGPDTSSFTAFLYSLLSLSESGNCSKVEEQSQRHVDRNELASEPGIKGSSRRKSL 387
            +   +  PDTSSFTAFLYSLLS SESGN  K++EQ+       +  SE   K S  +K L
Sbjct: 62   TGHLVEEPDTSSFTAFLYSLLSSSESGNNPKLDEQNDHSAQMGDQLSENVAKESGTKKGL 121

Query: 388  LSWGKQSLGRAINQAVRTNGYKNQTPKGNSN---------HGALEMETLHPPV-ESVSQI 537
             S GKQ+L RA+ QA R  G ++Q  KGNS+          G LE++     + E V+  
Sbjct: 122  FSRGKQTL-RAVYQATRIGGNRSQESKGNSDLKNVDENDDFGGLEVKMPRQNMKEPVALG 180

Query: 538  WXXXXXXXXXXXXXKARCSLYASLPVVVQGRKWGLLYSTWRHGISLSTMYRRSMLCPGFS 717
                          K R +LY SLP +VQGRKW LLYSTWRHGISLST+YRRSML  G S
Sbjct: 181  DLPGISEPSLLLSEKERSTLYVSLPALVQGRKWLLLYSTWRHGISLSTLYRRSMLWSGHS 240

Query: 718  LLVVGDRKGAVFGGLVESPLQPTNKRKYQGTNNSFVFTNTSGAPVVFRPTGANRYFTLCS 897
            LLVVGDRKGAVFGGLVE+PL+PTNK KYQGTN++FVFTN  G PV+FRPTGANRYFTLCS
Sbjct: 241  LLVVGDRKGAVFGGLVEAPLRPTNK-KYQGTNSTFVFTNKPGHPVIFRPTGANRYFTLCS 299

Query: 898  TDXXXXXXXXXXXXXXXXXXXXXXXXVSETFGNPGLAHTEDFTVKEVELWSFNYTPSKYA 1077
            TD                        VSET+GNP LAHTEDF VKEVELW F Y  SKY 
Sbjct: 300  TDFLAIGGGGRFALYMDSDLLNGSSSVSETYGNPCLAHTEDFEVKEVELWGFVY-GSKYE 358

Query: 1078 ETISSWRTEEPEICR 1122
            E ++  RTE P ICR
Sbjct: 359  EILALSRTESPGICR 373


>ref|XP_002307087.1| predicted protein [Populus trichocarpa] gi|222856536|gb|EEE94083.1|
            predicted protein [Populus trichocarpa]
          Length = 368

 Score =  316 bits (809), Expect = 1e-83
 Identities = 198/371 (53%), Positives = 237/371 (63%), Gaps = 11/371 (2%)
 Frame = +1

Query: 43   KEASFRSKAVHFVSDHLPTVILNPISDDEPSKSYHFADDIKEMDRTHQETITEEDSEVAM 222
            ++ S RSK+VHFVSD L TVILNPISD +PSK  H      E   +  E+ITEED+    
Sbjct: 4    QQKSLRSKSVHFVSD-LTTVILNPISD-KPSK--HSTPHASESKTSQLESITEEDTGHLA 59

Query: 223  YGPDTSSFTAFLYSLLSLSESGNCSKVEEQSQRHVDRNELASEPGIKGSSRRKSLLSWGK 402
              PDTSSFTAFL+SLL+ SESGN +K+++Q+      ++  S    K S  +K LLS  K
Sbjct: 60   EEPDTSSFTAFLFSLLTSSESGNNAKLDKQNDNSDQMDDQLSGNVAKESGTKKGLLSRAK 119

Query: 403  QSLGRAINQAVRTNGYKNQTPKGNS-------NHG-ALEMETL--HPPVESVSQIWXXXX 552
             SLG AI QA R + Y++Q  K NS       N G A+E+ ++      E V+       
Sbjct: 120  HSLG-AIYQATRIDRYQSQEHKENSDLKIADDNDGDAVEIRSMLKQNMKEPVALGDISNI 178

Query: 553  XXXXXXXXXKARCSLYASLPVVVQGRKWGLLYSTWRHGISLSTMYRRSMLCPGFSLLVVG 732
                     KAR +LY SLP +VQGRKW LLYSTWRHGISLST+YRRSML PG  LL VG
Sbjct: 179  SEPSLLLSEKARSTLYVSLPALVQGRKWLLLYSTWRHGISLSTLYRRSMLWPGPCLLAVG 238

Query: 733  DRKGAVFGGLVESPLQPTNKRKYQGTNNSFVFTNTSGAPVVFRPT-GANRYFTLCSTDXX 909
            DRKGAVFGGLVE+PL+PTNK KYQG+N++FVFTNT G PV+FRPT GANRYFTLCSTD  
Sbjct: 239  DRKGAVFGGLVEAPLRPTNK-KYQGSNSTFVFTNTPGHPVIFRPTAGANRYFTLCSTDFL 297

Query: 910  XXXXXXXXXXXXXXXXXXXXXXVSETFGNPGLAHTEDFTVKEVELWSFNYTPSKYAETIS 1089
                                  VSET+GNP LAH+EDF VKEVELW F +  SKY E ++
Sbjct: 298  AIGGGGHFALYLDSDLLNGSSSVSETYGNPCLAHSEDFEVKEVELWGFVH-GSKYEEILA 356

Query: 1090 SWRTEEPEICR 1122
              RTE P ICR
Sbjct: 357  LSRTEAPGICR 367


>ref|XP_004137188.1| PREDICTED: uncharacterized protein LOC101220181 [Cucumis sativus]
            gi|449531317|ref|XP_004172633.1| PREDICTED:
            uncharacterized protein LOC101231647 [Cucumis sativus]
          Length = 374

 Score =  315 bits (806), Expect = 3e-83
 Identities = 190/378 (50%), Positives = 232/378 (61%), Gaps = 18/378 (4%)
 Frame = +1

Query: 43   KEASFRSKAVHFVSDHLPTVILNPISDDEPSKS-------YHFADDIKEM-DRTHQETIT 198
            K+ S RSKAVHFVSD + TV+LNPISD +PS +       +H  ++  ++ D +   + +
Sbjct: 3    KQQSLRSKAVHFVSD-ITTVLLNPISD-KPSATASAHPQHHHSPENATDLKDDSDLGSAS 60

Query: 199  EEDSEVAMYGPDTSSFTAFLYSLLSLSESGNCSKVEEQSQRHVDRNELASEPGIKGSSR- 375
            E+ SE +  GPDTSSFTAFLYSLLS SE       +E+S    +    A  P I  +   
Sbjct: 61   EQGSEFSEDGPDTSSFTAFLYSLLSSSELQENLNSDERSDNQAE----AGTPMINAAKET 116

Query: 376  --RKSLLSWGKQSLGRAINQAVRTNGYKNQTPKGN-------SNHGALEMETLHPPVESV 528
              +KSL S GKQSLGRA + A R  GY+NQ  K +       S    +EM       E  
Sbjct: 117  VMKKSLFSRGKQSLGRAFHHAARITGYRNQDRKSDVDLKVNDSKFSGIEMMEKTNVSEVP 176

Query: 529  SQIWXXXXXXXXXXXXXKARCSLYASLPVVVQGRKWGLLYSTWRHGISLSTMYRRSMLCP 708
            S +              KAR  LYASLP +VQGRKW LLYSTWRHGISLST+YRRSML  
Sbjct: 177  SSVVVPDASEPSVLLSEKARTVLYASLPALVQGRKWLLLYSTWRHGISLSTLYRRSMLWS 236

Query: 709  GFSLLVVGDRKGAVFGGLVESPLQPTNKRKYQGTNNSFVFTNTSGAPVVFRPTGANRYFT 888
            GFSLLVVGD+KGAVFGGLVE+PL+P++K+KYQGTNN+FVFT+  G PV++RPTG N YFT
Sbjct: 237  GFSLLVVGDQKGAVFGGLVEAPLKPSSKKKYQGTNNTFVFTSIPGHPVIYRPTGENYYFT 296

Query: 889  LCSTDXXXXXXXXXXXXXXXXXXXXXXXXVSETFGNPGLAHTEDFTVKEVELWSFNYTPS 1068
            LCS D                         SET+GNP LA+TE+F VKEVELW F YT S
Sbjct: 297  LCSPDFLAIGGGSHFALYLDNDLLNGSSSTSETYGNPCLANTEEFEVKEVELWGFVYT-S 355

Query: 1069 KYAETISSWRTEEPEICR 1122
            KY E ++  RTE P ICR
Sbjct: 356  KYEEMLALSRTEAPGICR 373


>ref|XP_003531351.1| PREDICTED: uncharacterized protein LOC100814579 [Glycine max]
          Length = 365

 Score =  309 bits (792), Expect = 1e-81
 Identities = 185/372 (49%), Positives = 234/372 (62%), Gaps = 12/372 (3%)
 Frame = +1

Query: 43   KEASFRSKAVHFVSDHLPTVILNPISDDEPSKSYHFADDIKEMDRTHQETITEE---DSE 213
            K+ SFRSKA +FVSD L T +LNPISD+  + +      +       +E + E      +
Sbjct: 3    KKQSFRSKAAYFVSD-LTTGLLNPISDNNNNNNK--PPSLPPSGEEEKEDVGESKGGSDD 59

Query: 214  VAMYGPDTSSFTAFLYSLLSLSESGNCSKVEEQSQRHVDR----NELASEPGIKGSSR-R 378
            V + GPDTSSFTAFLYSLLS S+SG+  KV E  +++ D     + L S+   K S   +
Sbjct: 60   VIIDGPDTSSFTAFLYSLLSTSDSGD--KVGEADKKNDDEVAGDDSLLSDSATKESFVVK 117

Query: 379  KSLLSWGKQSLGRAINQAVRTNGYKNQTPKGNSNHGALEMETL--HPPVESVSQIWXXXX 552
            KSL S  K SLGRAI+Q     G+ N+     ++ G +EM+ +   P   +VS +     
Sbjct: 118  KSLFSRSKHSLGRAIHQM---GGFSNRD-SNYTDEGGVEMKRIVKEPLAVAVSGVGDHLP 173

Query: 553  XXXXXXXXXK--ARCSLYASLPVVVQGRKWGLLYSTWRHGISLSTMYRRSMLCPGFSLLV 726
                         R ++YASLP +++GRKW +LYSTW+HGISLST+YRRSMLCPG SLLV
Sbjct: 174  QISEPSMLVSEGVRNAVYASLPALIRGRKWFMLYSTWKHGISLSTLYRRSMLCPGMSLLV 233

Query: 727  VGDRKGAVFGGLVESPLQPTNKRKYQGTNNSFVFTNTSGAPVVFRPTGANRYFTLCSTDX 906
            VGDRKGAVFGGLVE+PL+P+NKRKYQGTNNSFVFTNTSG PV++ PTG NRYFTLC+TD 
Sbjct: 234  VGDRKGAVFGGLVEAPLRPSNKRKYQGTNNSFVFTNTSGCPVIYHPTGVNRYFTLCTTDF 293

Query: 907  XXXXXXXXXXXXXXXXXXXXXXXVSETFGNPGLAHTEDFTVKEVELWSFNYTPSKYAETI 1086
                                   VSET+GNP LAH+++F VKEVELW F + PSKY E +
Sbjct: 294  LAIGGGSHFALYLEGDLLNGSSSVSETYGNPCLAHSQEFEVKEVELWGFVF-PSKYEEIV 352

Query: 1087 SSWRTEEPEICR 1122
               RTE P ICR
Sbjct: 353  ELSRTEAPGICR 364


Top