BLASTX nr result

ID: Coptis25_contig00038705

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00038705
         (597 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002887548.1| hypothetical protein ARALYDRAFT_339650 [Arab...   256   2e-66
ref|XP_002282084.1| PREDICTED: pentatricopeptide repeat-containi...   256   2e-66
ref|XP_002328557.1| predicted protein [Populus trichocarpa] gi|2...   253   2e-65
ref|NP_177601.1| pentatricopeptide repeat-containing protein [Ar...   251   7e-65
dbj|BAD93880.1| hypothetical protein [Arabidopsis thaliana] gi|6...   251   7e-65

>ref|XP_002887548.1| hypothetical protein ARALYDRAFT_339650 [Arabidopsis lyrata subsp.
           lyrata] gi|297333389|gb|EFH63807.1| hypothetical protein
           ARALYDRAFT_339650 [Arabidopsis lyrata subsp. lyrata]
          Length = 1221

 Score =  256 bits (655), Expect = 2e-66
 Identities = 123/197 (62%), Positives = 151/197 (76%), Gaps = 4/197 (2%)
 Frame = +3

Query: 18  MHCDLVRRGFDAH----GPSLCLDNYCLRCIGSSRKAFDELPEPNVVAWNAIVTACFRRG 185
           MHC  ++ G D+H       + +   C  C+G +RK FDE+P+PN+VAWNA+VTACFR  
Sbjct: 295 MHCQALKHGLDSHLFVATTLIGMYGEC-GCVGFARKVFDEMPQPNLVAWNAVVTACFRGN 353

Query: 186 DFKGAEVFFSEMPLRNLTSWNVMIAGYMKGGELESAKRVFGEMTAKDEVSWSTMIVGFAH 365
           D  GA   F +M +RN TSWNVM+AGY+K GELE AKR+F EM  +D+VSWSTMIVGF+H
Sbjct: 354 DVSGAREIFDKMLVRNHTSWNVMLAGYIKAGELECAKRIFSEMPHRDDVSWSTMIVGFSH 413

Query: 366 NGSFDEAFGFFRELTSVGMRANEVSLTGVLSACAQSGAFEFGKVLHGYIEKAGLGLIVSV 545
           NGSF+E+F +FREL    MR NEVSLTGVLSAC+QSGAFEFGK LHG++EK+G   IVSV
Sbjct: 414 NGSFNESFSYFRELLRAEMRPNEVSLTGVLSACSQSGAFEFGKTLHGFVEKSGYSWIVSV 473

Query: 546 SNALLDTYSRCGNVDMA 596
           +NAL+D YSRCGNV MA
Sbjct: 474 NNALIDMYSRCGNVPMA 490



 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 48/164 (29%), Positives = 79/164 (48%), Gaps = 10/164 (6%)
 Frame = +3

Query: 135 PNVVAWNAIVTACFRRGDFKGAEVFFSEMPLRNLTSW-----NVMIAGYMKGGELESAKR 299
           PN V+   +++AC + G F+  +     +  ++  SW     N +I  Y + G +  A+ 
Sbjct: 434 PNEVSLTGVLSACSQSGAFEFGKTLHGFVE-KSGYSWIVSVNNALIDMYSRCGNVPMARL 492

Query: 300 VFGEMTAKDE-VSWSTMIVGFAHNGSFDEAFGFFRELTSVGMRANEVSLTGVLSACAQSG 476
           VF  M  K   VSW++MI G A +G  +EA   F E+T  G+  +E+S   +L AC+ +G
Sbjct: 493 VFEGMQEKRSIVSWTSMIAGLAMHGHGEEAIRIFNEMTESGVMPDEISFISLLYACSHAG 552

Query: 477 AFEFGKVLHGYIEKA----GLGLIVSVSNALLDTYSRCGNVDMA 596
             + G+   GY  K      +   V     ++D Y R G +  A
Sbjct: 553 LIKEGE---GYFSKMKRVYHIEPAVEHYGCMVDLYGRSGKLQKA 593


>ref|XP_002282084.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g74630-like [Vitis vinifera]
          Length = 643

 Score =  256 bits (654), Expect = 2e-66
 Identities = 126/196 (64%), Positives = 150/196 (76%), Gaps = 3/196 (1%)
 Frame = +3

Query: 18  MHCDLVRRGFDAH---GPSLCLDNYCLRCIGSSRKAFDELPEPNVVAWNAIVTACFRRGD 188
           +HC  +  G D H   G +L         +  ++K F+E+ EPNVVAWNA+VTACFR GD
Sbjct: 128 LHCQAIVHGLDTHLFVGTTLVSMYSECGFVAFAKKVFEEMFEPNVVAWNAVVTACFRCGD 187

Query: 189 FKGAEVFFSEMPLRNLTSWNVMIAGYMKGGELESAKRVFGEMTAKDEVSWSTMIVGFAHN 368
            KGA++ F+ MP RNLTSWNVM+AGY K GELE A+++F EM  KD+VSWSTMIVGFAHN
Sbjct: 188 VKGADMMFNRMPFRNLTSWNVMLAGYTKAGELELARKLFLEMPVKDDVSWSTMIVGFAHN 247

Query: 369 GSFDEAFGFFRELTSVGMRANEVSLTGVLSACAQSGAFEFGKVLHGYIEKAGLGLIVSVS 548
           G F EAFGFFREL  VGMR NEVSLTG LSACA +GA EFGK+LHG+IEK+G   +VSV+
Sbjct: 248 GFFYEAFGFFRELQQVGMRPNEVSLTGALSACADAGAIEFGKILHGFIEKSGFLWMVSVN 307

Query: 549 NALLDTYSRCGNVDMA 596
           NALLDTYS+CGNV MA
Sbjct: 308 NALLDTYSKCGNVGMA 323



 Score = 60.5 bits (145), Expect = 2e-07
 Identities = 42/160 (26%), Positives = 75/160 (46%), Gaps = 6/160 (3%)
 Frame = +3

Query: 135 PNVVAWNAIVTACFRRGDFKGAEVFFSEMP----LRNLTSWNVMIAGYMKGGELESAKRV 302
           PN V+    ++AC   G  +  ++    +     L  ++  N ++  Y K G +  A+ V
Sbjct: 267 PNEVSLTGALSACADAGAIEFGKILHGFIEKSGFLWMVSVNNALLDTYSKCGNVGMARLV 326

Query: 303 FGEMTAKDE-VSWSTMIVGFAHNGSFDEAFGFFRELTSVGMRANEVSLTGVLSACAQSGA 479
           F  M  K   VSW++MI G A +G  +EA   F E+   G+R + ++   +L AC+ +G 
Sbjct: 327 FERMPEKRSIVSWTSMIAGLAMHGYGEEAIQLFHEMEESGIRPDGIAFISILYACSHAGL 386

Query: 480 FEFG-KVLHGYIEKAGLGLIVSVSNALLDTYSRCGNVDMA 596
            E G +  +   +   +   +     ++D Y R G +D A
Sbjct: 387 IEKGYEYFYKMKDIYNIEPAIEHYGCMVDLYGRAGQLDKA 426


>ref|XP_002328557.1| predicted protein [Populus trichocarpa] gi|222838272|gb|EEE76637.1|
           predicted protein [Populus trichocarpa]
          Length = 643

 Score =  253 bits (646), Expect = 2e-65
 Identities = 126/196 (64%), Positives = 146/196 (74%), Gaps = 3/196 (1%)
 Frame = +3

Query: 18  MHCDLVRRGFDAH---GPSLCLDNYCLRCIGSSRKAFDELPEPNVVAWNAIVTACFRRGD 188
           +HC  +  G D H   G +L         +G +RK FDE+PEPN +AWNA+VTAC R GD
Sbjct: 128 LHCQALVHGLDTHLFVGTTLISMYGECGFVGFARKVFDEMPEPNAIAWNAMVTACCRGGD 187

Query: 189 FKGAEVFFSEMPLRNLTSWNVMIAGYMKGGELESAKRVFGEMTAKDEVSWSTMIVGFAHN 368
            KG    F  MP+RNL SWNVM+AGY K GELE A+ +F EM  KD+VSWSTMIVGFAHN
Sbjct: 188 MKGGRELFDLMPVRNLMSWNVMLAGYTKAGELELAREMFLEMPMKDDVSWSTMIVGFAHN 247

Query: 369 GSFDEAFGFFRELTSVGMRANEVSLTGVLSACAQSGAFEFGKVLHGYIEKAGLGLIVSVS 548
           G F+EAF FFREL   GMR NE SLTGVLSACAQ+GA EFGK+LHG+IEK+GL  IVSV+
Sbjct: 248 GYFEEAFSFFRELQRKGMRPNETSLTGVLSACAQAGALEFGKILHGFIEKSGLAWIVSVN 307

Query: 549 NALLDTYSRCGNVDMA 596
           NALLDTYS+CGNV MA
Sbjct: 308 NALLDTYSKCGNVLMA 323



 Score = 58.5 bits (140), Expect = 8e-07
 Identities = 41/164 (25%), Positives = 77/164 (46%), Gaps = 10/164 (6%)
 Frame = +3

Query: 135 PNVVAWNAIVTACFRRGDFKGAEVFFSEMPLRNLTSW-----NVMIAGYMKGGELESAKR 299
           PN  +   +++AC + G  +  ++    +    L +W     N ++  Y K G +  A+ 
Sbjct: 267 PNETSLTGVLSACAQAGALEFGKILHGFIEKSGL-AWIVSVNNALLDTYSKCGNVLMAQL 325

Query: 300 VFGE-MTAKDEVSWSTMIVGFAHNGSFDEAFGFFRELTSVGMRANEVSLTGVLSACAQSG 476
           VF   M  ++ VSW++M+   A +G  +EA G F ++   G+R +E++   +L AC+ +G
Sbjct: 326 VFERIMNERNIVSWTSMMAALAMHGHGEEAIGIFHKMEESGIRPDEIAFISLLYACSHAG 385

Query: 477 AFEFGKVLHGYIEKA----GLGLIVSVSNALLDTYSRCGNVDMA 596
             E G     Y +K      +   +     ++D Y R G +  A
Sbjct: 386 LVEQG---CEYFDKMKGMYNIEPSIEHYGCMVDLYGRAGQLQKA 426


>ref|NP_177601.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75169836|sp|Q9CA54.1|PP122_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g74630 gi|12324801|gb|AAG52363.1|AC011765_15
           hypothetical protein; 86841-88772 [Arabidopsis thaliana]
           gi|332197495|gb|AEE35616.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 643

 Score =  251 bits (641), Expect = 7e-65
 Identities = 121/196 (61%), Positives = 150/196 (76%), Gaps = 3/196 (1%)
 Frame = +3

Query: 18  MHCDLVRRGFDAH---GPSLCLDNYCLRCIGSSRKAFDELPEPNVVAWNAIVTACFRRGD 188
           MHC  ++ G ++H   G +L        C+  +RK FDE+ +PN+VAWNA++TACFR  D
Sbjct: 128 MHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGND 187

Query: 189 FKGAEVFFSEMPLRNLTSWNVMIAGYMKGGELESAKRVFGEMTAKDEVSWSTMIVGFAHN 368
             GA   F +M +RN TSWNVM+AGY+K GELESAKR+F EM  +D+VSWSTMIVG AHN
Sbjct: 188 VAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAHN 247

Query: 369 GSFDEAFGFFRELTSVGMRANEVSLTGVLSACAQSGAFEFGKVLHGYIEKAGLGLIVSVS 548
           GSF+E+F +FREL   GM  NEVSLTGVLSAC+QSG+FEFGK+LHG++EKAG   IVSV+
Sbjct: 248 GSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIVSVN 307

Query: 549 NALLDTYSRCGNVDMA 596
           NAL+D YSRCGNV MA
Sbjct: 308 NALIDMYSRCGNVPMA 323



 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 44/161 (27%), Positives = 78/161 (48%), Gaps = 7/161 (4%)
 Frame = +3

Query: 135 PNVVAWNAIVTACFRRGDFKGAEVFFSEMPLRNLTSW-----NVMIAGYMKGGELESAKR 299
           PN V+   +++AC + G F+  ++    +      SW     N +I  Y + G +  A+ 
Sbjct: 267 PNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGY-SWIVSVNNALIDMYSRCGNVPMARL 325

Query: 300 VFGEMTAKD-EVSWSTMIVGFAHNGSFDEAFGFFRELTSVGMRANEVSLTGVLSACAQSG 476
           VF  M  K   VSW++MI G A +G  +EA   F E+T+ G+  + +S   +L AC+ +G
Sbjct: 326 VFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYGVTPDGISFISLLHACSHAG 385

Query: 477 AFEFGKVLHGYIEKA-GLGLIVSVSNALLDTYSRCGNVDMA 596
             E G+     +++   +   +     ++D Y R G +  A
Sbjct: 386 LIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKA 426


>dbj|BAD93880.1| hypothetical protein [Arabidopsis thaliana]
           gi|62318835|dbj|BAD93890.1| hypothetical protein
           [Arabidopsis thaliana]
          Length = 635

 Score =  251 bits (641), Expect = 7e-65
 Identities = 121/196 (61%), Positives = 150/196 (76%), Gaps = 3/196 (1%)
 Frame = +3

Query: 18  MHCDLVRRGFDAH---GPSLCLDNYCLRCIGSSRKAFDELPEPNVVAWNAIVTACFRRGD 188
           MHC  ++ G ++H   G +L        C+  +RK FDE+ +PN+VAWNA++TACFR  D
Sbjct: 120 MHCQALKHGLESHLFVGTTLIGMYGGCGCVEFARKVFDEMHQPNLVAWNAVITACFRGND 179

Query: 189 FKGAEVFFSEMPLRNLTSWNVMIAGYMKGGELESAKRVFGEMTAKDEVSWSTMIVGFAHN 368
             GA   F +M +RN TSWNVM+AGY+K GELESAKR+F EM  +D+VSWSTMIVG AHN
Sbjct: 180 VAGAREIFDKMLVRNHTSWNVMLAGYIKAGELESAKRIFSEMPHRDDVSWSTMIVGIAHN 239

Query: 369 GSFDEAFGFFRELTSVGMRANEVSLTGVLSACAQSGAFEFGKVLHGYIEKAGLGLIVSVS 548
           GSF+E+F +FREL   GM  NEVSLTGVLSAC+QSG+FEFGK+LHG++EKAG   IVSV+
Sbjct: 240 GSFNESFLYFRELQRAGMSPNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGYSWIVSVN 299

Query: 549 NALLDTYSRCGNVDMA 596
           NAL+D YSRCGNV MA
Sbjct: 300 NALIDMYSRCGNVPMA 315



 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 44/161 (27%), Positives = 78/161 (48%), Gaps = 7/161 (4%)
 Frame = +3

Query: 135 PNVVAWNAIVTACFRRGDFKGAEVFFSEMPLRNLTSW-----NVMIAGYMKGGELESAKR 299
           PN V+   +++AC + G F+  ++    +      SW     N +I  Y + G +  A+ 
Sbjct: 259 PNEVSLTGVLSACSQSGSFEFGKILHGFVEKAGY-SWIVSVNNALIDMYSRCGNVPMARL 317

Query: 300 VFGEMTAKD-EVSWSTMIVGFAHNGSFDEAFGFFRELTSVGMRANEVSLTGVLSACAQSG 476
           VF  M  K   VSW++MI G A +G  +EA   F E+T+ G+  + +S   +L AC+ +G
Sbjct: 318 VFEGMQEKRCIVSWTSMIAGLAMHGQGEEAVRLFNEMTAYGVTPDGISFISLLHACSHAG 377

Query: 477 AFEFGKVLHGYIEKA-GLGLIVSVSNALLDTYSRCGNVDMA 596
             E G+     +++   +   +     ++D Y R G +  A
Sbjct: 378 LIEEGEDYFSEMKRVYHIEPEIEHYGCMVDLYGRSGKLQKA 418

