BLASTX nr result

ID: Coptis23_contig00005171 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis23_contig00005171
         (1467 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI17752.3| unnamed protein product [Vitis vinifera]              636   e-180
ref|XP_002304774.1| predicted protein [Populus trichocarpa] gi|2...   583   e-164
ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containi...   579   e-163
ref|XP_003555182.1| PREDICTED: pentatricopeptide repeat-containi...   574   e-161
ref|XP_003598903.1| Pentatricopeptide repeat-containing protein ...   569   e-160

>emb|CBI17752.3| unnamed protein product [Vitis vinifera]
          Length = 729

 Score =  636 bits (1640), Expect = e-180
 Identities = 306/447 (68%), Positives = 378/447 (84%)
 Frame = -2

Query: 1466 DEAFELLGRMRENLCKPDVFAYTAMIRVLIAEGNLDGCLTVWEEMLKDGVDPDVMAYTTL 1287
            DE  ELL RMR NLCKPDVFAYTAM++VL+AEGNLDGCL VWEEM KD V+PDVMAYTTL
Sbjct: 274  DEVLELLDRMRGNLCKPDVFAYTAMVKVLVAEGNLDGCLRVWEEMRKDKVEPDVMAYTTL 333

Query: 1286 ITALCKGNVVEKGYELFKEMKKKEYLIDRAVYGSLIEAFVAEGKIGSACDLLKDLMSSGY 1107
            + ALC GN V +G+ELFKEMK+K+YLIDRA+YGSLIE FV   ++GSACDLLKDLM SGY
Sbjct: 334  VAALCNGNRVGEGFELFKEMKQKKYLIDRAIYGSLIEGFVVNERVGSACDLLKDLMDSGY 393

Query: 1106 RADLSILNSLIEGLCNANKIGKAYKLFQFTVQEGLSPDFVTVNPILALYAEQSRMDEICK 927
            RADL+I NSLIEG+CN  ++ KAYKLFQ TV E L P+F+TV P+L  YAE  RMD+ C 
Sbjct: 394  RADLAIYNSLIEGMCNVKQVDKAYKLFQVTVHESLEPNFLTVKPMLVSYAEMKRMDDFCS 453

Query: 926  LLEQMQKLGLHVIDDLSNFFSFMVGKGGREYKALETLKYLKARGFCNVSIYSIIIQALHK 747
            LL QMQKLG  VIDDLS FFS M+ KG R   ALE  ++LKA+G+C++SIY+I+++A+H+
Sbjct: 454  LLGQMQKLGFPVIDDLSKFFSVMIEKGERLKLALEVFEHLKAKGYCSISIYNILMEAIHR 513

Query: 746  IGDVKGALSLFEEMKDSEYIPDSSTYSNIIPCFVDGGDVKAACSCYNKMKEKSWVPTVSA 567
             G+VK ALSLF+++KDS + PDSSTYSN I CFV+ GDV+ AC+CYNK+ E   +P+V+A
Sbjct: 514  TGEVKKALSLFDDIKDSNFKPDSSTYSNAIICFVEVGDVQEACACYNKIIEMCQLPSVAA 573

Query: 566  YSSLVYGLCKIREIDAALTLIHDCLGNVTSWPMDFKYTLSIIHACRSCDAKKVIDIVDEM 387
            Y SLV GLCK  EIDAA+ L+ DCL NVTS PM+FKYTL+I+HAC+S +A+KVID+++EM
Sbjct: 574  YRSLVKGLCKSEEIDAAIMLVRDCLANVTSGPMEFKYTLTILHACKSGNAEKVIDVLNEM 633

Query: 386  MEQGCPLNDLIYSAVISGMCDYGTLEEARKVFTSMRERKLLTEANLIVYEELLIDHMKKK 207
            M++GC  +++ YSA+ISGMC +GTLEEARKVF++MRERKLLTEAN+IVY+E+LI+HMKKK
Sbjct: 634  MQEGCTPDEVTYSALISGMCKHGTLEEARKVFSNMRERKLLTEANVIVYDEILIEHMKKK 693

Query: 206  TAGLVLAGLKFFGLESKLKSKGSTILP 126
            TA LVL+GLKFFGLESKL+SKGST+LP
Sbjct: 694  TADLVLSGLKFFGLESKLRSKGSTLLP 720



 Score = 58.9 bits (141), Expect = 3e-06
 Identities = 40/154 (25%), Positives = 69/154 (44%)
 Frame = -2

Query: 719 LFEEMKDSEYIPDSSTYSNIIPCFVDGGDVKAACSCYNKMKEKSWVPTVSAYSSLVYGLC 540
           ++E+MK     P    Y+ I+   V  G +  A S Y   KE   V     Y  LV GLC
Sbjct: 209 VYEKMKKFGIKPRVFLYNRIMDGLVKTGHLDLAMSVYEDFKEDGLVEESVTYMILVKGLC 268

Query: 539 KIREIDAALTLIHDCLGNVTSWPMDFKYTLSIIHACRSCDAKKVIDIVDEMMEQGCPLND 360
           K   ID  L L+    GN+   P  F YT  +       +    + + +EM +     + 
Sbjct: 269 KAGRIDEVLELLDRMRGNLCK-PDVFAYTAMVKVLVAEGNLDGCLRVWEEMRKDKVEPDV 327

Query: 359 LIYSAVISGMCDYGTLEEARKVFTSMRERKLLTE 258
           + Y+ +++ +C+   + E  ++F  M+++K L +
Sbjct: 328 MAYTTLVAALCNGNRVGEGFELFKEMKQKKYLID 361


>ref|XP_002304774.1| predicted protein [Populus trichocarpa] gi|222842206|gb|EEE79753.1|
            predicted protein [Populus trichocarpa]
          Length = 728

 Score =  583 bits (1503), Expect = e-164
 Identities = 287/446 (64%), Positives = 357/446 (80%)
 Frame = -2

Query: 1466 DEAFELLGRMRENLCKPDVFAYTAMIRVLIAEGNLDGCLTVWEEMLKDGVDPDVMAYTTL 1287
            +E  E+LGRMRENLCKPDVFAYTAM+R L  EGNLD CL VWEEM +DGV+PDVMAY TL
Sbjct: 282  EEMMEVLGRMRENLCKPDVFAYTAMVRALAGEGNLDACLRVWEEMKRDGVEPDVMAYVTL 341

Query: 1286 ITALCKGNVVEKGYELFKEMKKKEYLIDRAVYGSLIEAFVAEGKIGSACDLLKDLMSSGY 1107
            +TALCKG  V+KGYE+FKEMK +  LIDR +YG L+EAFVA+GKIG ACDLLKDL+ SGY
Sbjct: 342  VTALCKGGRVDKGYEVFKEMKGRRILIDRGIYGILVEAFVADGKIGLACDLLKDLVDSGY 401

Query: 1106 RADLSILNSLIEGLCNANKIGKAYKLFQFTVQEGLSPDFVTVNPILALYAEQSRMDEICK 927
            RADL I NSLIEG CN  ++ KA+KLFQ TVQEGL  DF TVNP+L  YAE  +MD+ CK
Sbjct: 402  RADLRIYNSLIEGFCNVKRVDKAHKLFQVTVQEGLERDFKTVNPLLMSYAEMKKMDDFCK 461

Query: 926  LLEQMQKLGLHVIDDLSNFFSFMVGKGGREYKALETLKYLKARGFCNVSIYSIIIQALHK 747
            LL+QM+KLG  V DDLS FFS++VGK  R   ALE  + LK +G+ +V IY+I+++AL  
Sbjct: 462  LLKQMEKLGFSVFDDLSKFFSYVVGKPERTMMALEVFEDLKVKGYSSVPIYNILMEALLT 521

Query: 746  IGDVKGALSLFEEMKDSEYIPDSSTYSNIIPCFVDGGDVKAACSCYNKMKEKSWVPTVSA 567
            IG++K ALSLF EMKD    PDS+TYS  I CFV+ G+++ AC  +NK+ E   VP+V+A
Sbjct: 522  IGEMKRALSLFGEMKDLNK-PDSTTYSIAIICFVEDGNIQEACVSHNKIVEMFCVPSVAA 580

Query: 566  YSSLVYGLCKIREIDAALTLIHDCLGNVTSWPMDFKYTLSIIHACRSCDAKKVIDIVDEM 387
            Y SL  GLC   EIDAA+ L+ DCL +V S PM+FKY+L+I+HAC++  A+KVID+++EM
Sbjct: 581  YCSLAKGLCDNGEIDAAMMLVRDCLASVESGPMEFKYSLTILHACKTGGAEKVIDVLNEM 640

Query: 386  MEQGCPLNDLIYSAVISGMCDYGTLEEARKVFTSMRERKLLTEANLIVYEELLIDHMKKK 207
            M++GC  N++IYSA+ISGMC +GT EEARKVFT +R+RK+LTEA  IV++E+LI+HMKKK
Sbjct: 641  MQEGCTPNEVIYSAIISGMCKHGTFEEARKVFTDLRQRKILTEAKTIVFDEILIEHMKKK 700

Query: 206  TAGLVLAGLKFFGLESKLKSKGSTIL 129
            TA LVLAGLKFFGLESKLK+ GST+L
Sbjct: 701  TADLVLAGLKFFGLESKLKAMGSTLL 726



 Score = 61.6 bits (148), Expect = 5e-07
 Identities = 62/383 (16%), Positives = 155/383 (40%), Gaps = 35/383 (9%)
 Frame = -2

Query: 1346 VWEEMLKDGVDPDVMAYTTLITALCKGNVVEKGYELFKEMKKKEYLIDRAVYGSLIEAFV 1167
            V+++M+K GV P V  Y  ++ +L K   ++    ++++ ++   + +   Y  LI+   
Sbjct: 217  VYQKMVKFGVKPRVFLYNRIMDSLIKTGHLDLALSVYEDFRRDGLVEESVTYMILIKGLC 276

Query: 1166 AEGKIGSACDLLKDLMSSGYRADLSILNSLIEGLCNANKIGKAYKLFQFTVQEGLSPDFV 987
              G+I    ++L  +  +  + D+    +++  L     +    ++++   ++G+ PD +
Sbjct: 277  KAGRIEEMMEVLGRMRENLCKPDVFAYTAMVRALAGEGNLDACLRVWEEMKRDGVEPDVM 336

Query: 986  TVNPILALYAEQSRMDEICKLLEQMQKLGLHVIDDLSNFFSFMVGKGGREYKALETLKYL 807
                ++    +  R+D+  ++ ++M+   + +   +           G+   A + LK L
Sbjct: 337  AYVTLVTALCKGGRVDKGYEVFKEMKGRRILIDRGIYGILVEAFVADGKIGLACDLLKDL 396

Query: 806  KARGF-CNVSIYSIIIQALHKIGDVKGALSLFEEMKDSEYIPDSSTYSNIIPCFVDGGDV 630
               G+  ++ IY+ +I+    +  V  A  LF+         D  T + ++  + +   +
Sbjct: 397  VDSGYRADLRIYNSLIEGFCNVKRVDKAHKLFQVTVQEGLERDFKTVNPLLMSYAEMKKM 456

Query: 629  KAACSCYNKMKEKSW----------------------------------VPTVSAYSSLV 552
               C    +M++  +                                    +V  Y+ L+
Sbjct: 457  DDFCKLLKQMEKLGFSVFDDLSKFFSYVVGKPERTMMALEVFEDLKVKGYSSVPIYNILM 516

Query: 551  YGLCKIREIDAALTLIHDCLGNVTSWPMDFKYTLSIIHACRSCDAKKVIDIVDEMMEQGC 372
              L  I E+  AL+L  +      + P    Y+++II      + ++     ++++E  C
Sbjct: 517  EALLTIGEMKRALSLFGEM--KDLNKPDSTTYSIAIICFVEDGNIQEACVSHNKIVEMFC 574

Query: 371  PLNDLIYSAVISGMCDYGTLEEA 303
              +   Y ++  G+CD G ++ A
Sbjct: 575  VPSVAAYCSLAKGLCDNGEIDAA 597


>ref|XP_002269066.2| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Vitis vinifera]
          Length = 1294

 Score =  579 bits (1493), Expect = e-163
 Identities = 282/447 (63%), Positives = 359/447 (80%)
 Frame = -2

Query: 1466 DEAFELLGRMRENLCKPDVFAYTAMIRVLIAEGNLDGCLTVWEEMLKDGVDPDVMAYTTL 1287
            D A  +    +E+    +   Y  +++ L   G +D  L VWEEM KD V+PDVMAYTTL
Sbjct: 778  DLAMSVYEDFKEDGLVEESVTYMILVKGLCKAGRIDEVLEVWEEMRKDKVEPDVMAYTTL 837

Query: 1286 ITALCKGNVVEKGYELFKEMKKKEYLIDRAVYGSLIEAFVAEGKIGSACDLLKDLMSSGY 1107
            + ALC GN V +G+ELFKEMK+K+YLIDRA+YGSLIE FV   ++GSACDLLKDLM SGY
Sbjct: 838  VAALCNGNRVGEGFELFKEMKQKKYLIDRAIYGSLIEGFVVNERVGSACDLLKDLMDSGY 897

Query: 1106 RADLSILNSLIEGLCNANKIGKAYKLFQFTVQEGLSPDFVTVNPILALYAEQSRMDEICK 927
            RADL+I NSLIEG+CN  ++ KAYKLFQ TV E L P+F+TV P+L  YAE  RMD+ C 
Sbjct: 898  RADLAIYNSLIEGMCNVKQVDKAYKLFQVTVHESLEPNFLTVKPMLVSYAEMKRMDDFCS 957

Query: 926  LLEQMQKLGLHVIDDLSNFFSFMVGKGGREYKALETLKYLKARGFCNVSIYSIIIQALHK 747
            LL QMQKLG  VIDDLS FFS M+ KG R   ALE  ++LKA+G+C++SIY+I+++A+H+
Sbjct: 958  LLGQMQKLGFPVIDDLSKFFSVMIEKGERLKLALEVFEHLKAKGYCSISIYNILMEAIHR 1017

Query: 746  IGDVKGALSLFEEMKDSEYIPDSSTYSNIIPCFVDGGDVKAACSCYNKMKEKSWVPTVSA 567
             G+VK ALSLF+++KDS + PDSSTYSN I CFV+ GDV+ AC+CYNK+ E   +P+V+A
Sbjct: 1018 TGEVKKALSLFDDIKDSNFKPDSSTYSNAIICFVEVGDVQEACACYNKIIEMCQLPSVAA 1077

Query: 566  YSSLVYGLCKIREIDAALTLIHDCLGNVTSWPMDFKYTLSIIHACRSCDAKKVIDIVDEM 387
            Y SLV GLCK  EIDAA+ L+ DCL NVTS PM+FKYTL+I+HAC+S +A+KVID+++EM
Sbjct: 1078 YRSLVKGLCKSEEIDAAIMLVRDCLANVTSGPMEFKYTLTILHACKSGNAEKVIDVLNEM 1137

Query: 386  MEQGCPLNDLIYSAVISGMCDYGTLEEARKVFTSMRERKLLTEANLIVYEELLIDHMKKK 207
            M++GC  +++ YSA+ISGMC +GTLEEARKVF++MRERKLLTEAN+IVY+E+LI+HMKKK
Sbjct: 1138 MQEGCTPDEVTYSALISGMCKHGTLEEARKVFSNMRERKLLTEANVIVYDEILIEHMKKK 1197

Query: 206  TAGLVLAGLKFFGLESKLKSKGSTILP 126
            TA LVL+GLKFFGLESKL+SKGST+LP
Sbjct: 1198 TADLVLSGLKFFGLESKLRSKGSTLLP 1224


>ref|XP_003555182.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20740-like
            [Glycine max]
          Length = 733

 Score =  574 bits (1479), Expect = e-161
 Identities = 283/447 (63%), Positives = 354/447 (79%)
 Frame = -2

Query: 1466 DEAFELLGRMRENLCKPDVFAYTAMIRVLIAEGNLDGCLTVWEEMLKDGVDPDVMAYTTL 1287
            DE  E+LGRMRE LCKPDVFAYTA++++L+  GNLD CL VWEEM +D V+PDV AY T+
Sbjct: 287  DEMLEVLGRMRERLCKPDVFAYTALVKILVPAGNLDACLRVWEEMKRDRVEPDVKAYATM 346

Query: 1286 ITALCKGNVVEKGYELFKEMKKKEYLIDRAVYGSLIEAFVAEGKIGSACDLLKDLMSSGY 1107
            I  L KG  V++GYELF+EMK K  L+DR +YG+L+EAFVAEGK+  A DLLKDL+SSGY
Sbjct: 347  IVGLAKGGRVQEGYELFREMKGKGCLVDRVIYGALVEAFVAEGKVELAFDLLKDLVSSGY 406

Query: 1106 RADLSILNSLIEGLCNANKIGKAYKLFQFTVQEGLSPDFVTVNPILALYAEQSRMDEICK 927
            RADL I   LIEGLCN N++ KAYKLFQ TV+EGL PDF+TV P+L  YAE +RM+E CK
Sbjct: 407  RADLGIYICLIEGLCNLNRVQKAYKLFQLTVREGLEPDFLTVKPLLVAYAEANRMEEFCK 466

Query: 926  LLEQMQKLGLHVIDDLSNFFSFMVGKGGREYKALETLKYLKARGFCNVSIYSIIIQALHK 747
            LLEQMQKLG  VI DLS FFS +V K G    ALET   LK +G  +V IY+I + +LHK
Sbjct: 467  LLEQMQKLGFPVIADLSKFFSVLVEKKG-PIMALETFGQLKEKGHVSVEIYNIFMDSLHK 525

Query: 746  IGDVKGALSLFEEMKDSEYIPDSSTYSNIIPCFVDGGDVKAACSCYNKMKEKSWVPTVSA 567
            IG+VK ALSLF+EMK     PDS TY   I C VD G++K AC+C+N++ E S +P+V+A
Sbjct: 526  IGEVKKALSLFDEMKGLSLKPDSFTYCTAILCLVDLGEIKEACACHNRIIEMSCIPSVAA 585

Query: 566  YSSLVYGLCKIREIDAALTLIHDCLGNVTSWPMDFKYTLSIIHACRSCDAKKVIDIVDEM 387
            YSSL  GLC+I EID A+ L+ DCLGNV+  P++FKY+L+IIHAC+S  A+KVID+++EM
Sbjct: 586  YSSLTKGLCQIGEIDEAMLLVRDCLGNVSDGPLEFKYSLTIIHACKSNVAEKVIDVLNEM 645

Query: 386  MEQGCPLNDLIYSAVISGMCDYGTLEEARKVFTSMRERKLLTEANLIVYEELLIDHMKKK 207
            +EQGC L+++IY ++ISGMC +GT+EEARKVF+++RER  LTE+N IVY+ELLIDHMKKK
Sbjct: 646  IEQGCSLDNVIYCSIISGMCKHGTIEEARKVFSNLRERNFLTESNTIVYDELLIDHMKKK 705

Query: 206  TAGLVLAGLKFFGLESKLKSKGSTILP 126
            TA LVL+ LKFFGLESKLK+KG  +LP
Sbjct: 706  TADLVLSSLKFFGLESKLKAKGCKLLP 732


>ref|XP_003598903.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355487951|gb|AES69154.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 767

 Score =  569 bits (1466), Expect = e-160
 Identities = 281/442 (63%), Positives = 353/442 (79%)
 Frame = -2

Query: 1466 DEAFELLGRMRENLCKPDVFAYTAMIRVLIAEGNLDGCLTVWEEMLKDGVDPDVMAYTTL 1287
            DE  E+LGRMRE LCKPDVFAYTA++R+++ EGNLDGCL VW+EM +D VDPDVMAY T+
Sbjct: 277  DEMLEVLGRMREKLCKPDVFAYTALVRIMVKEGNLDGCLRVWKEMKRDRVDPDVMAYGTI 336

Query: 1286 ITALCKGNVVEKGYELFKEMKKKEYLIDRAVYGSLIEAFVAEGKIGSACDLLKDLMSSGY 1107
            I  L KG  V +GYELFKEMK K +LIDRA+YGSL+E+FVA  K+G A DLLKDL+SSGY
Sbjct: 337  IGGLAKGGRVSEGYELFKEMKSKGHLIDRAIYGSLVESFVAGNKVGLAFDLLKDLVSSGY 396

Query: 1106 RADLSILNSLIEGLCNANKIGKAYKLFQFTVQEGLSPDFVTVNPILALYAEQSRMDEICK 927
            RADL + N+LIEGLCN NK+ KAYKLFQ T+QEGL PDF++V P+L  YAE  RM+E   
Sbjct: 397  RADLGMYNNLIEGLCNLNKVEKAYKLFQVTIQEGLEPDFLSVKPLLLAYAEAKRMEEFFM 456

Query: 926  LLEQMQKLGLHVIDDLSNFFSFMVGKGGREYKALETLKYLKARGFCNVSIYSIIIQALHK 747
            LLE+M+KLG  VIDDLS FFS +V K G E  ALE   +LK + + +V IY+I +++LH 
Sbjct: 457  LLEKMKKLGFPVIDDLSKFFSHLVEKKGPE-MALEIFTHLKEKSYVSVEIYNIFMESLHL 515

Query: 746  IGDVKGALSLFEEMKDSEYIPDSSTYSNIIPCFVDGGDVKAACSCYNKMKEKSWVPTVSA 567
             G V+ ALSLF+E+K S+  PDSSTY+  I C VD G +K AC C+NK+ E S +P+V+A
Sbjct: 516  SGKVEKALSLFDEIKGSDLEPDSSTYNIAILCLVDHGQIKEACECHNKIIEMSSIPSVAA 575

Query: 566  YSSLVYGLCKIREIDAALTLIHDCLGNVTSWPMDFKYTLSIIHACRSCDAKKVIDIVDEM 387
            Y+ L  GLC I EID A+ L+ DCLGNVTS PM+FKY L+II  C+S  A+K+ID+++EM
Sbjct: 576  YNCLAKGLCNIGEIDEAMLLVRDCLGNVTSGPMEFKYCLTIIRMCKSNVAEKLIDVLNEM 635

Query: 386  MEQGCPLNDLIYSAVISGMCDYGTLEEARKVFTSMRERKLLTEANLIVYEELLIDHMKKK 207
            M++GC L++++ SA+ISGMC YGT+EEARKVF+ +RERKLLTE++ IVY+ELLIDHMKKK
Sbjct: 636  MQEGCSLDNVVCSAIISGMCKYGTIEEARKVFSILRERKLLTESDTIVYDELLIDHMKKK 695

Query: 206  TAGLVLAGLKFFGLESKLKSKG 141
            TA LV++GLKFFGLESKLKSKG
Sbjct: 696  TADLVISGLKFFGLESKLKSKG 717


Top