BLASTX nr result

ID: Coptis24_contig00013079

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis24_contig00013079
         (1902 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN69960.1| hypothetical protein VITISV_032887 [Vitis vinifera]   582   e-164
ref|XP_002521681.1| pentatricopeptide repeat-containing protein,...   540   e-151
ref|XP_003626687.1| Pentatricopeptide repeat-containing protein ...   537   e-150
ref|XP_002306437.1| predicted protein [Populus trichocarpa] gi|2...   523   e-146
ref|XP_003595941.1| Pentatricopeptide repeat-containing protein ...   513   e-143

>emb|CAN69960.1| hypothetical protein VITISV_032887 [Vitis vinifera]
          Length = 472

 Score =  582 bits (1501), Expect = e-164
 Identities = 289/484 (59%), Positives = 367/484 (75%), Gaps = 3/484 (0%)
 Frame = +3

Query: 90   MGKLTPSMRSMVV---TTLQNXXXXXXXXXXXXXXXTTTTQVPNQQEPKSKNNKRPHNSK 260
            MGK+ PS R+  V   T L+N                 +T +   Q+P     KRP  S 
Sbjct: 1    MGKIPPSFRTSTVPVTTLLKNPPAVLPKQ---------STVLETPQKPHHFPKKRPQPSG 51

Query: 261  KPTRNKTLTPSPQPKPIFHTPSLSNAKKTFDQIISTSQIPLDLRHYNSLLKSFSQISNIH 440
            K  + +T    P+   IF++P+L +AKK F  I +TS  PLDLR +N+LL+S+S IS ++
Sbjct: 52   KTKKTRTPIEDPKSPVIFNSPNLLDAKKLFASITTTSTTPLDLRFHNALLQSYSSISTVN 111

Query: 441  DSFSFLQYMTKKNRPNFTPDHSTYNILLTQSCKIPPNHSQDHSDLSVIRKTLDLMVENQV 620
            DS SFL++M K ++P+F+P+ STY+ILL+QSCK P      +SDLS + +TL+LMV +  
Sbjct: 112  DSISFLRHMIK-SQPSFSPERSTYHILLSQSCKSP------NSDLSAVHQTLNLMVTHGF 164

Query: 621  PPDHVAVDLTVRTLCSVSREEDAIQMIKEMGVKYSEPDMFTFNFMVRHLCKTRSLNYVYE 800
            PPD V  D+ VR+LCS  REE AI+++KE+ +K+S PD FT+NF++RHLCKTR+L+ VY 
Sbjct: 165  PPDRVTTDIAVRSLCSAGREEHAIELVKELSLKHSPPDSFTYNFIIRHLCKTRALSTVYN 224

Query: 801  FIDEMKEELSLQPDLVTYTILIENVCNGKNFREATRLLSVLADAGFKPDCFLYNTIMKGY 980
            FIDE++    L+PDLVTYTILI+NVCNGKN REATRLL VL +AGFKPDC++YNTIMKGY
Sbjct: 225  FIDELQNSFQLKPDLVTYTILIDNVCNGKNLREATRLLEVLGEAGFKPDCYVYNTIMKGY 284

Query: 981  CMLDRGSEVIGVYKQMKEEEVEPDLVTYNTMIYGLSKVGRVDEARQFLVVMTEMGHFPDT 1160
            C+LD+GSE IGVYK+MKEE VEPDLVTYNT+I+GLSK GRV EAR+FL +M EMGHFPD 
Sbjct: 285  CILDKGSEAIGVYKKMKEEGVEPDLVTYNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDA 344

Query: 1161 VTYTSLMNGMCRKGDALGAMALLSRMEENGCSPNSCTYNTLLHGLCKSNLLEDGLQLYGV 1340
            VTYTSLMNG+CR+G+ALGA+ALL  ME  GCSPNSCTYNTLLHGLCK  +LE G++LYGV
Sbjct: 345  VTYTSLMNGLCREGNALGALALLEEMEAKGCSPNSCTYNTLLHGLCKLRMLERGIELYGV 404

Query: 1341 MKSAGMQLEASSYATFVRALCRDDRIAEAYQVFDYAVESKSLTDVAAYSALESTLKWAKK 1520
            MKS GM+LE +SYATFVRALC++ R+AEAY+ FDY VESKS  DV AYS LE++LKW +K
Sbjct: 405  MKSGGMKLEKASYATFVRALCKEGRVAEAYEAFDYVVESKSFDDVTAYSTLENSLKWLRK 464

Query: 1521 FKEQ 1532
             +EQ
Sbjct: 465  AREQ 468


>ref|XP_002521681.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539072|gb|EEF40668.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 458

 Score =  540 bits (1391), Expect = e-151
 Identities = 272/481 (56%), Positives = 348/481 (72%)
 Frame = +3

Query: 90   MGKLTPSMRSMVVTTLQNXXXXXXXXXXXXXXXTTTTQVPNQQEPKSKNNKRPHNSKKPT 269
            MGK+ PS+RS V TT                      + PN   P  K +     +K   
Sbjct: 1    MGKVPPSLRSAVSTT-------------------ALLRKPNPFPPPEKPHYLSKKTKLQL 41

Query: 270  RNKTLTPSPQPKPIFHTPSLSNAKKTFDQIISTSQIPLDLRHYNSLLKSFSQISNIHDSF 449
              K  TP  Q K +F +P L+ AK+ F+ +IST+++PLDLR ++S L+S+S IS I DS 
Sbjct: 42   SQKIPTPIQQ-KRLFKSPELNEAKEIFNSLISTTRVPLDLRFHHSFLQSYSSISTIDDSI 100

Query: 450  SFLQYMTKKNRPNFTPDHSTYNILLTQSCKIPPNHSQDHSDLSVIRKTLDLMVENQVPPD 629
            S L++M K   P+FTP  STY+ILL+QSCK P         LS + + L+LMV N   P 
Sbjct: 101  SLLRHMIK-TLPSFTPTISTYHILLSQSCKAPD------PTLSPVHQILNLMVNNGFMPT 153

Query: 630  HVAVDLTVRTLCSVSREEDAIQMIKEMGVKYSEPDMFTFNFMVRHLCKTRSLNYVYEFID 809
             V VD+ VR LCS  +E+DA++++KE+ +K+S+PD FT+NF+V+ LCK R+L+ VY FID
Sbjct: 154  QVTVDIAVRALCSAGKEDDAVKLVKELSLKHSKPDSFTYNFLVKCLCKCRALSNVYSFID 213

Query: 810  EMKEELSLQPDLVTYTILIENVCNGKNFREATRLLSVLADAGFKPDCFLYNTIMKGYCML 989
            EM+    L+P+LVTYTILI+NVCN KN REA RLL +L + GFKPDCF+YNTIMKGYCML
Sbjct: 214  EMRSSFDLEPNLVTYTILIDNVCNSKNLREAMRLLGILRECGFKPDCFVYNTIMKGYCML 273

Query: 990  DRGSEVIGVYKQMKEEEVEPDLVTYNTMIYGLSKVGRVDEARQFLVVMTEMGHFPDTVTY 1169
             +GS+ I V+K+MKEE +EPDL+TYNT+I+GLSK GRV EA+++L +M E GHFPD VTY
Sbjct: 274  SKGSDAIQVFKKMKEEGIEPDLITYNTLIFGLSKGGRVSEAKRYLKIMVESGHFPDAVTY 333

Query: 1170 TSLMNGMCRKGDALGAMALLSRMEENGCSPNSCTYNTLLHGLCKSNLLEDGLQLYGVMKS 1349
            TSLMNG+CRKGDALGA+ALL  ME  GCSPNSCTYNTLL+GLCK  LLE G++LY V+K 
Sbjct: 334  TSLMNGLCRKGDALGALALLEDMEMKGCSPNSCTYNTLLYGLCKERLLEKGIELYNVIKE 393

Query: 1350 AGMQLEASSYATFVRALCRDDRIAEAYQVFDYAVESKSLTDVAAYSALESTLKWAKKFKE 1529
             GM L+ +SYATFVRALCR+ ++AEAY+VFDYAVESKSLT+ AAY+ LESTLKW KK +E
Sbjct: 394  GGMLLDTASYATFVRALCREGKVAEAYEVFDYAVESKSLTNAAAYTTLESTLKWLKKARE 453

Query: 1530 Q 1532
            Q
Sbjct: 454  Q 454


>ref|XP_003626687.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355520709|gb|AET01163.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 501

 Score =  537 bits (1383), Expect = e-150
 Identities = 265/481 (55%), Positives = 356/481 (74%)
 Frame = +3

Query: 90   MGKLTPSMRSMVVTTLQNXXXXXXXXXXXXXXXTTTTQVPNQQEPKSKNNKRPHNSKKPT 269
            MGK+ PS RS     L N                +++ +P+  +P    NK     +K  
Sbjct: 1    MGKIPPSFRS----ALSNPNLIHR----------SSSLIPSSPKPHHFPNKTRKPHQKQQ 46

Query: 270  RNKTLTPSPQPKPIFHTPSLSNAKKTFDQIISTSQIPLDLRHYNSLLKSFSQISNIHDSF 449
            ++++ + SP+P  +F +P+L  AK  F+  +++S  P+D R +NSLL+S++ IS I+DS 
Sbjct: 47   QSQSQSQSPKPVSVFKSPNLQEAKSIFNSFVNSSNAPIDSRFHNSLLQSYASISTINDSI 106

Query: 450  SFLQYMTKKNRPNFTPDHSTYNILLTQSCKIPPNHSQDHSDLSVIRKTLDLMVENQVPPD 629
            +FL++MTK + P+F+PD STY+ILLT  CK   +    +S LS+I +TL+LMV + + PD
Sbjct: 107  AFLRHMTKTH-PSFSPDKSTYHILLTHCCK---STDSKYSTLSLIHQTLNLMVSDGISPD 162

Query: 630  HVAVDLTVRTLCSVSREEDAIQMIKEMGVKYSEPDMFTFNFMVRHLCKTRSLNYVYEFID 809
               VDL VR+LC+  R +DA+++IKE+  K+  PD++++NF+V++LCK+R+L+ VY FID
Sbjct: 163  KGTVDLAVRSLCTADRVDDAVELIKELSSKHCSPDIYSYNFLVKNLCKSRTLSLVYAFID 222

Query: 810  EMKEELSLQPDLVTYTILIENVCNGKNFREATRLLSVLADAGFKPDCFLYNTIMKGYCML 989
            EM+ +  ++P+LVTYTILI+NVCN KN REATRL+ +L + GFKPDCFLYNTIMKGYCML
Sbjct: 223  EMRTKFDVKPNLVTYTILIDNVCNTKNLREATRLVDILEEEGFKPDCFLYNTIMKGYCML 282

Query: 990  DRGSEVIGVYKQMKEEEVEPDLVTYNTMIYGLSKVGRVDEARQFLVVMTEMGHFPDTVTY 1169
             RGSE I VY +MKE+ VEPDL+TYNT+I+GLSK GRV EA++ L VM E GHFPD VTY
Sbjct: 283  SRGSEAIEVYNRMKEKGVEPDLITYNTLIFGLSKSGRVSEAKKLLRVMAEKGHFPDEVTY 342

Query: 1170 TSLMNGMCRKGDALGAMALLSRMEENGCSPNSCTYNTLLHGLCKSNLLEDGLQLYGVMKS 1349
            TSLMNGMCRKG+ L A+ALL  ME  GCSPN+CTYNTLLHGLCKS + +  ++LYG MKS
Sbjct: 343  TSLMNGMCRKGETLAALALLEEMEMKGCSPNTCTYNTLLHGLCKSRMFDKAMELYGAMKS 402

Query: 1350 AGMQLEASSYATFVRALCRDDRIAEAYQVFDYAVESKSLTDVAAYSALESTLKWAKKFKE 1529
             G++L+ +SYATFVRALC   R+A+AY+VFDYAVESKSL+DVAAYS LESTLKW KK KE
Sbjct: 403  DGLKLDMASYATFVRALCSVGRVADAYEVFDYAVESKSLSDVAAYSTLESTLKWFKKAKE 462

Query: 1530 Q 1532
            +
Sbjct: 463  E 463



 Score = 64.7 bits (156), Expect = 8e-08
 Identities = 54/266 (20%), Positives = 104/266 (39%), Gaps = 2/266 (0%)
 Frame = +3

Query: 747  NFMVRHLCKTRSLNYVYEFIDEM-KEELSLQPDLVTYTILIENVCNGKNFREATRLLSVL 923
            N +++      ++N    F+  M K   S  PD  TY IL+ + C   + + +T  L   
Sbjct: 90   NSLLQSYASISTINDSIAFLRHMTKTHPSFSPDKSTYHILLTHCCKSTDSKYSTLSL--- 146

Query: 924  ADAGFKPDCFLYNTIMKGYCMLDRGSEVIGVYKQMKEEEVEPDLVTYNTMIYGLSKVGRV 1103
                      ++ T+                   M  + + PD  T +  +  L    RV
Sbjct: 147  ----------IHQTL-----------------NLMVSDGISPDKGTVDLAVRSLCTADRV 179

Query: 1104 DEARQFLVVMTEMGHFPDTVTYTSLMNGMCRKGDALGAMALLSRMEEN-GCSPNSCTYNT 1280
            D+A + +  ++     PD  +Y  L+  +C+        A +  M       PN  TY  
Sbjct: 180  DDAVELIKELSSKHCSPDIYSYNFLVKNLCKSRTLSLVYAFIDEMRTKFDVKPNLVTYTI 239

Query: 1281 LLHGLCKSNLLEDGLQLYGVMKSAGMQLEASSYATFVRALCRDDRIAEAYQVFDYAVESK 1460
            L+  +C +  L +  +L  +++  G + +   Y T ++  C   R +EA +V++   E  
Sbjct: 240  LIDNVCNTKNLREATRLVDILEEEGFKPDCFLYNTIMKGYCMLSRGSEAIEVYNRMKEKG 299

Query: 1461 SLTDVAAYSALESTLKWAKKFKEQRQ 1538
               D+  Y+ L   L  + +  E ++
Sbjct: 300  VEPDLITYNTLIFGLSKSGRVSEAKK 325


>ref|XP_002306437.1| predicted protein [Populus trichocarpa] gi|222855886|gb|EEE93433.1|
            predicted protein [Populus trichocarpa]
          Length = 462

 Score =  523 bits (1347), Expect = e-146
 Identities = 273/489 (55%), Positives = 341/489 (69%), Gaps = 8/489 (1%)
 Frame = +3

Query: 90   MGKLTPSMRSMVVTTLQNXXXXXXXXXXXXXXXTTTTQVPNQQE------PKSKNNKRPH 251
            MGK  PS RS + +T                  +     P+QQ+      PK    K   
Sbjct: 1    MGKFPPSFRSAISST------------------SLIKNTPSQQQQQPHYFPKKLTKK--- 39

Query: 252  NSKKPTRNKTLTPSPQPKPIFHTPSLSNAKKTFDQIISTSQIPL--DLRHYNSLLKSFSQ 425
            NS KP  ++T TP P  K +F T SL+ AK  F+  IST++ PL  +LR +NS L+S++ 
Sbjct: 40   NSPKP--HETETPPPH-KSLFKTSSLNEAKSLFNSFISTTKAPLLDNLRLHNSFLQSYTS 96

Query: 426  ISNIHDSFSFLQYMTKKNRPNFTPDHSTYNILLTQSCKIPPNHSQDHSDLSVIRKTLDLM 605
            IS + DS S L +M K   P+ +PD STY++LL+QSC+ P       S LS  +K L+LM
Sbjct: 97   ISTLDDSISLLDHMVK-TLPSLSPDRSTYHVLLSQSCREPD------SSLSSAQKVLNLM 149

Query: 606  VENQVPPDHVAVDLTVRTLCSVSREEDAIQMIKEMGVKYSEPDMFTFNFMVRHLCKTRSL 785
            +     P+   VD+ +R+LCS  R +DAI ++KE   K+S+PD FT+NF+V+ LCK+R  
Sbjct: 150  INKGFKPNQFTVDVAIRSLCSAGRVDDAILLVKEFSSKHSKPDTFTYNFLVKCLCKSRIF 209

Query: 786  NYVYEFIDEMKEELSLQPDLVTYTILIENVCNGKNFREATRLLSVLADAGFKPDCFLYNT 965
            N VY FIDEMK    ++PDLVTYTILI+NVCN KN REA RL++VL + G KPD FLYNT
Sbjct: 210  NSVYSFIDEMKSSFDIKPDLVTYTILIDNVCNAKNIREADRLVAVLKECGLKPDAFLYNT 269

Query: 966  IMKGYCMLDRGSEVIGVYKQMKEEEVEPDLVTYNTMIYGLSKVGRVDEARQFLVVMTEMG 1145
            IMKGYC+L++G E + +YKQMKEE VEPDLVTYNT+I+GLSK GRV EA++ L +M E G
Sbjct: 270  IMKGYCLLNKGIEAVRIYKQMKEEGVEPDLVTYNTLIFGLSKCGRVSEAKKLLKIMVESG 329

Query: 1146 HFPDTVTYTSLMNGMCRKGDALGAMALLSRMEENGCSPNSCTYNTLLHGLCKSNLLEDGL 1325
            HFPD VTYTSLMNGMCR+GD LGA ALL  ME  GCSPNSCTYNTLLHG CK   L  G+
Sbjct: 330  HFPDAVTYTSLMNGMCREGDVLGAAALLEEMELKGCSPNSCTYNTLLHGFCKGRRLNKGV 389

Query: 1326 QLYGVMKSAGMQLEASSYATFVRALCRDDRIAEAYQVFDYAVESKSLTDVAAYSALESTL 1505
            +LYGV+K  GM+LE +SYATFVRALCR+ R+AEAY+VFDYAVESKSLTDVAAY+ LESTL
Sbjct: 390  ELYGVIKKGGMKLETASYATFVRALCREGRVAEAYEVFDYAVESKSLTDVAAYTTLESTL 449

Query: 1506 KWAKKFKEQ 1532
            KW KK +EQ
Sbjct: 450  KWLKKAREQ 458


>ref|XP_003595941.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355484989|gb|AES66192.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 472

 Score =  513 bits (1320), Expect = e-143
 Identities = 257/483 (53%), Positives = 349/483 (72%)
 Frame = +3

Query: 84   QKMGKLTPSMRSMVVTTLQNXXXXXXXXXXXXXXXTTTTQVPNQQEPKSKNNKRPHNSKK 263
            ++ GK+ PS RS     L N                +++ +P+  +P    NK     +K
Sbjct: 15   KQYGKIPPSFRS----ALSNPNLIHR----------SSSLIPSSPKPHHFPNKTRKPHQK 60

Query: 264  PTRNKTLTPSPQPKPIFHTPSLSNAKKTFDQIISTSQIPLDLRHYNSLLKSFSQISNIHD 443
              ++++ + SP+P  +F +P+L  AK  F+  +++S  P+D R +NSLL+S++ IS I+D
Sbjct: 61   QQQSQSQSQSPKPVSVFKSPNLQEAKSIFNSFVNSSNAPIDSRFHNSLLQSYASISTIND 120

Query: 444  SFSFLQYMTKKNRPNFTPDHSTYNILLTQSCKIPPNHSQDHSDLSVIRKTLDLMVENQVP 623
            S +FL++MTK + P+F+PD STY+ILLT  CK   +    +S LS+I +TL+LMV + + 
Sbjct: 121  SIAFLRHMTKTH-PSFSPDKSTYHILLTHCCK---STDSKYSTLSLIHQTLNLMVSDGIS 176

Query: 624  PDHVAVDLTVRTLCSVSREEDAIQMIKEMGVKYSEPDMFTFNFMVRHLCKTRSLNYVYEF 803
            PD   VDL VR+LC+  R +DA+++IKE+  K+  PD++++NF+V++LCK+R+L+ VY  
Sbjct: 177  PDKGTVDLAVRSLCTADRVDDAVELIKELSSKHCSPDIYSYNFLVKNLCKSRTLSLVY-- 234

Query: 804  IDEMKEELSLQPDLVTYTILIENVCNGKNFREATRLLSVLADAGFKPDCFLYNTIMKGYC 983
                       P+LVTYTILI+NVCN KN REATRL+ +L + GFKPDCFLYNTIMKGYC
Sbjct: 235  -----------PNLVTYTILIDNVCNTKNLREATRLVDILEEEGFKPDCFLYNTIMKGYC 283

Query: 984  MLDRGSEVIGVYKQMKEEEVEPDLVTYNTMIYGLSKVGRVDEARQFLVVMTEMGHFPDTV 1163
            ML RGSE I VY +MKE+ VEPDL+TYNT+I+GLSK GRV EA++ L VM E GHFPD V
Sbjct: 284  MLSRGSEAIEVYNRMKEKGVEPDLITYNTLIFGLSKSGRVSEAKKLLRVMAEKGHFPDEV 343

Query: 1164 TYTSLMNGMCRKGDALGAMALLSRMEENGCSPNSCTYNTLLHGLCKSNLLEDGLQLYGVM 1343
            TYTSLMNGMCRKG+ L A+ALL  ME  GCSPN+CTYNTLLHGLCKS + +  ++LYG M
Sbjct: 344  TYTSLMNGMCRKGETLAALALLEEMEMKGCSPNTCTYNTLLHGLCKSRMFDKAMELYGAM 403

Query: 1344 KSAGMQLEASSYATFVRALCRDDRIAEAYQVFDYAVESKSLTDVAAYSALESTLKWAKKF 1523
            KS G++L+ +SYATFVRALC   R+A+AY+VFDYAVESKSL+DVAAYS LESTLKW++K 
Sbjct: 404  KSDGLKLDMASYATFVRALCSVGRVADAYEVFDYAVESKSLSDVAAYSTLESTLKWSRKQ 463

Query: 1524 KEQ 1532
            K++
Sbjct: 464  KKK 466

