BLASTX nr result

ID: Coptis24_contig00035269 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00035269
         (493 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002311390.1| predicted protein [Populus trichocarpa] gi|2...   234   4e-60
ref|XP_002265079.1| PREDICTED: pentatricopeptide repeat-containi...   177   6e-43
ref|NP_193619.1| pentatricopeptide repeat-containing protein [Ar...   176   2e-42
ref|XP_002867972.1| pentatricopeptide repeat-containing protein ...   170   1e-40
emb|CBI16398.3| unnamed protein product [Vitis vinifera]              169   3e-40

>ref|XP_002311390.1| predicted protein [Populus trichocarpa] gi|222851210|gb|EEE88757.1|
           predicted protein [Populus trichocarpa]
          Length = 594

 Score =  234 bits (598), Expect = 4e-60
 Identities = 110/164 (67%), Positives = 136/164 (82%)
 Frame = +2

Query: 2   PDIVSWNSLLDGYVKSGRVDECTRVFDEMPCRDTVSWTMMLVGCVNAGLLSEARYVFDEM 181
           PD+VSWN L++GYVKSG +DE  R+FDEMP RD VSWT+MLVG  +AG LSEA  +FDEM
Sbjct: 200 PDVVSWNCLINGYVKSGDLDEARRLFDEMPERDVVSWTIMLVGYADAGFLSEASCLFDEM 259

Query: 182 PERNVVSWSAMISGYVKDGCWKEALGLFKEMQVVGVLADKVMLTSVLSACASLGALDQGC 361
           P+RN+VSWSA+I GY++ GC+ +AL LFKEMQV  V  D+V++T++LSACA LGALDQG 
Sbjct: 260 PKRNLVSWSALIKGYIQIGCYSKALELFKEMQVAKVKMDEVIVTTLLSACARLGALDQGR 319

Query: 362 WIHSYIDKHGIEVDAHLSTALVDMYSKCGRVEVALDVFWRAPDK 493
           W+H YIDKHGI+VDAHLSTAL+DMYSKCGR+++A  VF    DK
Sbjct: 320 WLHMYIDKHGIKVDAHLSTALIDMYSKCGRIDMAWKVFQETGDK 363


>ref|XP_002265079.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g18840-like [Vitis vinifera]
          Length = 536

 Score =  177 bits (450), Expect = 6e-43
 Identities = 87/157 (55%), Positives = 110/157 (70%)
 Frame = +2

Query: 5   DIVSWNSLLDGYVKSGRVDECTRVFDEMPCRDTVSWTMMLVGCVNAGLLSEARYVFDEMP 184
           D+VSWN+LL  Y + G ++    +FDEM  R+  SW  M+ G V  GLL EAR VF E P
Sbjct: 173 DVVSWNALLSAYAERGLMELACHLFDEMTERNVESWNFMISGYVGVGLLEEARRVFGETP 232

Query: 185 ERNVVSWSAMISGYVKDGCWKEALGLFKEMQVVGVLADKVMLTSVLSACASLGALDQGCW 364
            +NVVSW+AMI+GY   G + E L LF++MQ  GV  D   L SVLSACA +GAL QG W
Sbjct: 233 VKNVVSWNAMITGYSHAGRFSEVLVLFEDMQHAGVKPDNCTLVSVLSACAHVGALSQGEW 292

Query: 365 IHSYIDKHGIEVDAHLSTALVDMYSKCGRVEVALDVF 475
           +H+YIDK+GI +D  ++TALVDMYSKCG +E AL+VF
Sbjct: 293 VHAYIDKNGISIDGFVATALVDMYSKCGSIEKALEVF 329



 Score = 73.9 bits (180), Expect = 1e-11
 Identities = 51/203 (25%), Positives = 89/203 (43%), Gaps = 40/203 (19%)
 Frame = +2

Query: 5   DIVSWNSLLDGYVKSGRVDECTRVFDEMPCR----DTVSWTMMLVGCVNAGLLSEARYV- 169
           ++VSWN+++ GY  +GR  E   +F++M       D  +   +L  C + G LS+  +V 
Sbjct: 235 NVVSWNAMITGYSHAGRFSEVLVLFEDMQHAGVKPDNCTLVSVLSACAHVGALSQGEWVH 294

Query: 170 ----------------------------------FDEMPERNVVSWSAMISGYVKDGCWK 247
                                             F+    +++ +W+++ISG    G  +
Sbjct: 295 AYIDKNGISIDGFVATALVDMYSKCGSIEKALEVFNSCLRKDISTWNSIISGLSTHGSGQ 354

Query: 248 EALGLFKEMQVVGVLADKVMLTSVLSACASLGALDQGC-WIHSYIDKHGIEVDAHLSTAL 424
            AL +F EM V G   ++V    VLSAC+  G LD+G    +  +  HGI+        +
Sbjct: 355 HALQIFSEMLVEGFKPNEVTFVCVLSACSRAGLLDEGREMFNLMVHVHGIQPTIEHYGCM 414

Query: 425 VDMYSKCGRVEVALDVFWRAPDK 493
           VD+  + G +E A ++  + P K
Sbjct: 415 VDLLGRVGLLEEAEELVQKMPQK 437



 Score = 64.7 bits (156), Expect = 7e-09
 Identities = 32/109 (29%), Positives = 53/109 (48%)
 Frame = +2

Query: 137 NAGLLSEARYVFDEMPERNVVSWSAMISGYVKDGCWKEALGLFKEMQVVGVLADKVMLTS 316
           +A  +  A  +F  +P  N   W+ +I  Y      + AL +F +M    VL DK   T 
Sbjct: 54  HAQAIPYAHSIFSRIPNPNSYMWNTIIRAYANSPTPEAALTIFHQMLHASVLPDKYTFTF 113

Query: 317 VLSACASLGALDQGCWIHSYIDKHGIEVDAHLSTALVDMYSKCGRVEVA 463
            L +C S   +++G  IH ++ K G+  D  +   L+ +Y+ CG +E A
Sbjct: 114 ALKSCGSFSGVEEGRQIHGHVLKTGLGDDLFIQNTLIHLYASCGCIEDA 162


>ref|NP_193619.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75098703|sp|O49399.2|PP321_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g18840 gi|5738365|emb|CAA16741.2| putative protein
           [Arabidopsis thaliana] gi|7268678|emb|CAB78886.1|
           putative protein [Arabidopsis thaliana]
           gi|332658697|gb|AEE84097.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 545

 Score =  176 bits (446), Expect = 2e-42
 Identities = 90/164 (54%), Positives = 113/164 (68%), Gaps = 1/164 (0%)
 Frame = +2

Query: 5   DIVSWNSLLDGYVKSGRVDECTRVFDEMPCRDTVSWTMMLVGCVNAGLLSEARYVFDEMP 184
           D VSWNSLL  Y++ G VDE   +FDEM  R+  SW  M+ G   AGL+ EA+ VFD MP
Sbjct: 205 DAVSWNSLLSAYLEKGLVDEARALFDEMEERNVESWNFMISGYAAAGLVKEAKEVFDSMP 264

Query: 185 ERNVVSWSAMISGYVKDGCWKEALGLFKEMQVVGV-LADKVMLTSVLSACASLGALDQGC 361
            R+VVSW+AM++ Y   GC+ E L +F +M        D   L SVLSACASLG+L QG 
Sbjct: 265 VRDVVSWNAMVTAYAHVGCYNEVLEVFNKMLDDSTEKPDGFTLVSVLSACASLGSLSQGE 324

Query: 362 WIHSYIDKHGIEVDAHLSTALVDMYSKCGRVEVALDVFWRAPDK 493
           W+H YIDKHGIE++  L+TALVDMYSKCG+++ AL+VF RA  K
Sbjct: 325 WVHVYIDKHGIEIEGFLATALVDMYSKCGKIDKALEVF-RATSK 367



 Score = 58.9 bits (141), Expect = 4e-07
 Identities = 34/113 (30%), Positives = 55/113 (48%)
 Frame = +2

Query: 149 LSEARYVFDEMPERNVVSWSAMISGYVKDGCWKEALGLFKEMQVVGVLADKVMLTSVLSA 328
           +S A  + + +   N  + +++I  Y      + AL +F+EM +  V  DK   T VL A
Sbjct: 90  VSYAHSILNRIGSPNGFTHNSVIRAYANSSTPEVALTVFREMLLGPVFPDKYSFTFVLKA 149

Query: 329 CASLGALDQGCWIHSYIDKHGIEVDAHLSTALVDMYSKCGRVEVALDVFWRAP 487
           CA+    ++G  IH    K G+  D  +   LV++Y + G  E+A  V  R P
Sbjct: 150 CAAFCGFEEGRQIHGLFIKSGLVTDVFVENTLVNVYGRSGYFEIARKVLDRMP 202


>ref|XP_002867972.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297313808|gb|EFH44231.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 535

 Score =  170 bits (431), Expect = 1e-40
 Identities = 84/158 (53%), Positives = 108/158 (68%), Gaps = 1/158 (0%)
 Frame = +2

Query: 5   DIVSWNSLLDGYVKSGRVDECTRVFDEMPCRDTVSWTMMLVGCVNAGLLSEARYVFDEMP 184
           D VSWNSLL  Y+  G V+E   +FDEM  R+  SW  M+ G   AGL+ EAR VFD MP
Sbjct: 175 DAVSWNSLLSAYLDKGLVEEARALFDEMEERNVESWNFMISGYAAAGLVKEAREVFDSMP 234

Query: 185 ERNVVSWSAMISGYVKDGCWKEALGLFKEM-QVVGVLADKVMLTSVLSACASLGALDQGC 361
            ++VVSW+AM++ Y   GC+ E L +F  M        D   L +VLSACASLG+L QG 
Sbjct: 235 VKDVVSWNAMVTAYAHVGCYNEVLEVFNMMLDDSAERPDGFTLVNVLSACASLGSLSQGE 294

Query: 362 WIHSYIDKHGIEVDAHLSTALVDMYSKCGRVEVALDVF 475
           W+H YIDKHGIE++  ++TALVDMYSKCG+++ AL+VF
Sbjct: 295 WVHVYIDKHGIEIEGFVATALVDMYSKCGKIDKALEVF 332



 Score = 63.5 bits (153), Expect = 2e-08
 Identities = 50/202 (24%), Positives = 83/202 (41%), Gaps = 41/202 (20%)
 Frame = +2

Query: 5   DIVSWNSLLDGYVKSGRVDECTRVF-----DEMPCRDTVSWTMMLVGCVNAGLLSEARYV 169
           D+VSWN+++  Y   G  +E   VF     D     D  +   +L  C + G LS+  +V
Sbjct: 237 DVVSWNAMVTAYAHVGCYNEVLEVFNMMLDDSAERPDGFTLVNVLSACASLGSLSQGEWV 296

Query: 170 -----------------------------------FDEMPERNVVSWSAMISGYVKDGCW 244
                                              F +  +R+V +W+++I+G    G  
Sbjct: 297 HVYIDKHGIEIEGFVATALVDMYSKCGKIDKALEVFRDTSKRDVSTWNSIITGLSVHGLG 356

Query: 245 KEALGLFKEMQVVGVLADKVMLTSVLSACASLGALDQGCWIHSYIDK-HGIEVDAHLSTA 421
           K+AL +F EM   G   + +    VLSAC  +G LDQ   +   ++  +GIE        
Sbjct: 357 KDALEIFSEMVYEGFKPNGITFIGVLSACNHVGLLDQARKLFEMMNSVYGIEPTIEHYGC 416

Query: 422 LVDMYSKCGRVEVALDVFWRAP 487
           +VD+  + G+ E A ++    P
Sbjct: 417 MVDLLGRMGKFEEAEELVNEVP 438



 Score = 55.8 bits (133), Expect = 3e-06
 Identities = 32/113 (28%), Positives = 54/113 (47%)
 Frame = +2

Query: 149 LSEARYVFDEMPERNVVSWSAMISGYVKDGCWKEALGLFKEMQVVGVLADKVMLTSVLSA 328
           +S A  + + +   N  + +++I  Y      + AL +F+EM +  V  DK   T VL A
Sbjct: 60  VSYAHSILNRIESPNGFTHNSVIRAYANSSTPEIALTVFREMLLGPVFPDKYSFTFVLKA 119

Query: 329 CASLGALDQGCWIHSYIDKHGIEVDAHLSTALVDMYSKCGRVEVALDVFWRAP 487
           CA+    ++G  IH    K  +  D  +   L+++Y + G  E+A  V  R P
Sbjct: 120 CAAFCGFEEGRQIHGLFMKSDLVTDVFVENTLINVYGRSGYFEIARKVLDRMP 172


>emb|CBI16398.3| unnamed protein product [Vitis vinifera]
          Length = 608

 Score =  169 bits (427), Expect = 3e-40
 Identities = 80/163 (49%), Positives = 111/163 (68%)
 Frame = +2

Query: 5   DIVSWNSLLDGYVKSGRVDECTRVFDEMPCRDTVSWTMMLVGCVNAGLLSEARYVFDEMP 184
           D+VSWNS+++GY   G ++   ++FD M  +  VSWT M+VG   +GLL  A  +FDEMP
Sbjct: 216 DLVSWNSMINGYC--GNLESARKLFDSMTNKTMVSWTTMVVGYAQSGLLDMAWKLFDEMP 273

Query: 185 ERNVVSWSAMISGYVKDGCWKEALGLFKEMQVVGVLADKVMLTSVLSACASLGALDQGCW 364
           +++VV W+AMI GYV     KEAL LF EMQ + +  D+V + S LSAC+ LGALD G W
Sbjct: 274 DKDVVPWNAMIGGYVHANRGKEALALFNEMQAMNINPDEVTMVSCLSACSQLGALDVGIW 333

Query: 365 IHSYIDKHGIEVDAHLSTALVDMYSKCGRVEVALDVFWRAPDK 493
           IH YI+KH + ++  L TAL+DMY+KCG++  A+ VF   P +
Sbjct: 334 IHHYIEKHELSLNVALGTALIDMYAKCGKITKAIQVFQELPGR 376



 Score = 68.6 bits (166), Expect = 5e-10
 Identities = 50/201 (24%), Positives = 86/201 (42%), Gaps = 40/201 (19%)
 Frame = +2

Query: 5   DIVSWNSLLDGYVKSGRVDECTRVFDEMPCRDTVSWTMMLVGCVNA-------------- 142
           D+V WN+++ GYV + R  E   +F+EM   +     + +V C++A              
Sbjct: 276 DVVPWNAMIGGYVHANRGKEALALFNEMQAMNINPDEVTMVSCLSACSQLGALDVGIWIH 335

Query: 143 -------------------------GLLSEARYVFDEMPERNVVSWSAMISGYVKDGCWK 247
                                    G +++A  VF E+P RN ++W+A+ISG    G   
Sbjct: 336 HYIEKHELSLNVALGTALIDMYAKCGKITKAIQVFQELPGRNSLTWTAIISGLALHGNAH 395

Query: 248 EALGLFKEMQVVGVLADKVMLTSVLSACASLGALDQGCWIHSYI-DKHGIEVDAHLSTAL 424
            A+  F EM    V+ D+V    +LSAC   G +++G    S +  K  +       + +
Sbjct: 396 GAIAYFSEMIDNSVMPDEVTFLGLLSACCHGGLVEEGRKYFSQMSSKFNLSPKLKHYSCM 455

Query: 425 VDMYSKCGRVEVALDVFWRAP 487
           VD+  + G +E A ++    P
Sbjct: 456 VDLLGRAGLLEEAEELIKSMP 476


Top