BLASTX nr result

ID: Coptis25_contig00004945

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00004945
         (2487 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN70994.1| hypothetical protein VITISV_038698 [Vitis vinifera]   770   0.0  
ref|XP_002271824.1| PREDICTED: pentatricopeptide repeat-containi...   733   0.0  
ref|XP_002326871.1| predicted protein [Populus trichocarpa] gi|2...   702   0.0  
ref|XP_002880144.1| pentatricopeptide repeat-containing protein ...   647   0.0  
ref|XP_003551717.1| PREDICTED: pentatricopeptide repeat-containi...   646   0.0  

>emb|CAN70994.1| hypothetical protein VITISV_038698 [Vitis vinifera]
          Length = 751

 Score =  770 bits (1989), Expect = 0.0
 Identities = 367/591 (62%), Positives = 461/591 (78%), Gaps = 4/591 (0%)
 Frame = +1

Query: 40   QHSSWSSIERKCLSLLQKPNTKNTIPQIHSFILRNGIENNVNIFTKFISACCSV----PT 207
            Q S WS IERKCLSLLQ+  T+  + QIH+F+LRN +E N N+FTKFI+ C S+    P 
Sbjct: 144  QQSLWSPIERKCLSLLQQSKTRANLLQIHAFMLRNALETNPNLFTKFIATCSSIALLAPL 203

Query: 208  TCSFSGVRYARRVFDQRTERDDSYLCNSMIKGYVDVYLFKESMTLYRDLKRGTCFSPDNF 387
                +G+ +ARR+FD R  RDD++LCNSMIK YV +  + ES  LYRDL+R T F+PD+F
Sbjct: 204  YDPLAGIVHARRMFDHRPHRDDAFLCNSMIKAYVGMRQYSESFALYRDLRRNTSFTPDSF 263

Query: 388  TFPSLLKSCGLNLGFNEGFQLHGEVIKMGFCYNLFISTTLVDMYVKFGNMVSSRKVFDEM 567
            TF  L KSC LN+   EG ++H  V+ +GFC +L+ +T LVDMY KFG M  +RK+FDEM
Sbjct: 264  TFSVLAKSCALNMAIWEGQEIHSHVVAVGFCLDLYAATALVDMYAKFGKMDCARKLFDEM 323

Query: 568  PFRNQVSWTAVVVGYARCGDTCMSKELFDCMPEKDSAAYNAMIDAYVKSGNMSFAKELFD 747
              R+QVSWTA++ GY R GD   + +LFD M EKDSAA+N MIDAYVK G+M  A++LFD
Sbjct: 324  IDRSQVSWTALIGGYVRSGDMDNAGKLFDQMIEKDSAAFNTMIDAYVKLGDMCSARKLFD 383

Query: 748  QIPEKNIVSWSTLIDGYCKNGDLGAARLFFDAMPEKNLVSWNTMISGYSQNKQPCQALEL 927
            ++PE+++VSW+ +I GY  NG+L +AR  FDAMPEKNL SWN MISGY QNKQP +AL+L
Sbjct: 384  EMPERSVVSWTIMIYGYSSNGNLDSARSLFDAMPEKNLFSWNAMISGYXQNKQPYEALKL 443

Query: 928  FREMQSSSMFQPDNVTVVSILPAIADLGALDFGCWVHGYVQRKGLNRASNVCTALIDMFA 1107
            F EMQS++  +PD VT+VS+LPAIADLGALD G WVH +V+RK L+RA+NV TALIDM+A
Sbjct: 444  FHEMQSTTSLEPDEVTIVSVLPAIADLGALDLGGWVHRFVRRKKLDRATNVGTALIDMYA 503

Query: 1108 KCGETVKAKQVFGRIHNRETPSWNAMINGFALNGHAKEALEVFSEMLKEGVKPNEITMIG 1287
            KCGE VK++ VF  +  +ET SWNA+IN FA+NG AKEAL +F EM  +G  PNEITMIG
Sbjct: 504  KCGEIVKSRGVFDNMPEKETASWNALINAFAINGRAKEALGLFMEMNHKGFMPNEITMIG 563

Query: 1288 VLSACNHAGLVDEGKKWFKLMENYGIIPRIEHYGCMVDILGRAGRVEEAEELMMNMPYEL 1467
            VLSACNH+GLV+EGK+WFK ME +G+ P+IEHYGCMVD+LGRAG ++EAE+LM +MPYE 
Sbjct: 564  VLSACNHSGLVEEGKRWFKAMEEFGLTPKIEHYGCMVDLLGRAGCLQEAEKLMESMPYEA 623

Query: 1468 NSIVLSSFLFACGCRGDVMRAEKFIRKAFDIEPWNDGNYIMMRNLYAGEKRWKEVEEIKG 1647
            N I+LSSFLFACG   DV RAE+ +++A  +E WNDGNYIM+RNLYA EKRWKE +E+KG
Sbjct: 624  NGIILSSFLFACGYSKDVARAERVLKEAIKMEAWNDGNYIMLRNLYANEKRWKEADEVKG 683

Query: 1648 LMRKYRAKKEVGCSVIEVNNGVWEFVAGDRAHPQWEGMHLVLRQLWLHMKG 1800
            LMR+   KKE GCS IEV++ VWEFVAGDR HP+WE +H VL QLW+HMKG
Sbjct: 684  LMRRNGVKKEAGCSAIEVDSRVWEFVAGDRVHPKWEAIHSVLGQLWVHMKG 734


>ref|XP_002271824.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880
            [Vitis vinifera] gi|297734603|emb|CBI16654.3| unnamed
            protein product [Vitis vinifera]
          Length = 577

 Score =  733 bits (1891), Expect = 0.0
 Identities = 348/560 (62%), Positives = 438/560 (78%), Gaps = 4/560 (0%)
 Frame = +1

Query: 133  ILRNGIENNVNIFTKFISACCSV----PTTCSFSGVRYARRVFDQRTERDDSYLCNSMIK 300
            +LRN +E N N+FTKFI+ C S+    P     +G+ +ARR+FD R  RDD++LCNSMIK
Sbjct: 1    MLRNALETNPNLFTKFIATCSSIALLAPLYDPLAGIVHARRMFDHRPHRDDAFLCNSMIK 60

Query: 301  GYVDVYLFKESMTLYRDLKRGTCFSPDNFTFPSLLKSCGLNLGFNEGFQLHGEVIKMGFC 480
             YV +  + ES  LYRDL+R T F+PD+FTF  L KSC LN+   EG ++H  V+ +GFC
Sbjct: 61   AYVGMRQYSESFALYRDLRRNTSFTPDSFTFSVLAKSCALNMAIWEGQEIHSHVVAVGFC 120

Query: 481  YNLFISTTLVDMYVKFGNMVSSRKVFDEMPFRNQVSWTAVVVGYARCGDTCMSKELFDCM 660
             +L+ +T LVDMY KFG M  +RK+FDEM  R+QVSWTA++ GY R GD   + +LFD M
Sbjct: 121  LDLYAATALVDMYAKFGKMDCARKLFDEMIDRSQVSWTALIGGYVRSGDMDNAGKLFDQM 180

Query: 661  PEKDSAAYNAMIDAYVKSGNMSFAKELFDQIPEKNIVSWSTLIDGYCKNGDLGAARLFFD 840
             EKDSAA+N MIDAYVK G+M  A++LFD++PE+++VSW+ +I GY  NG+L +AR  FD
Sbjct: 181  IEKDSAAFNTMIDAYVKLGDMCSARKLFDEMPERSVVSWTIMIYGYSSNGNLDSARSLFD 240

Query: 841  AMPEKNLVSWNTMISGYSQNKQPCQALELFREMQSSSMFQPDNVTVVSILPAIADLGALD 1020
            AMPEKNL SWN MISGY QNKQP +AL+LF EMQS++  +PD VT+VS+LPAIADLGALD
Sbjct: 241  AMPEKNLFSWNAMISGYRQNKQPYEALKLFHEMQSTTSLEPDEVTIVSVLPAIADLGALD 300

Query: 1021 FGCWVHGYVQRKGLNRASNVCTALIDMFAKCGETVKAKQVFGRIHNRETPSWNAMINGFA 1200
             G WVH +V+RK L+RA+NV TALIDM+AKCGE VK++ VF  +  +ET SWNA+IN FA
Sbjct: 301  LGGWVHRFVRRKKLDRATNVGTALIDMYAKCGEIVKSRGVFDNMPEKETASWNALINAFA 360

Query: 1201 LNGHAKEALEVFSEMLKEGVKPNEITMIGVLSACNHAGLVDEGKKWFKLMENYGIIPRIE 1380
            +NG AKEAL +F EM  +G  PNEITMIGVLSACNH+GLV+EGK+WFK ME +G+ P+IE
Sbjct: 361  INGRAKEALGLFMEMNHKGFMPNEITMIGVLSACNHSGLVEEGKRWFKAMEEFGLTPKIE 420

Query: 1381 HYGCMVDILGRAGRVEEAEELMMNMPYELNSIVLSSFLFACGCRGDVMRAEKFIRKAFDI 1560
            HYGCMVD+LGRAG ++EAE+LM +MPYE N I+LSSFLFACG   DV RAE+ +++A  +
Sbjct: 421  HYGCMVDLLGRAGCLQEAEKLMESMPYEANGIILSSFLFACGYSKDVARAERVLKEAIKM 480

Query: 1561 EPWNDGNYIMMRNLYAGEKRWKEVEEIKGLMRKYRAKKEVGCSVIEVNNGVWEFVAGDRA 1740
            E WNDGNYIM+RNLYA EKRWKE +E+KGLMR+   KKE GCS IEV++ VWEFVAGDR 
Sbjct: 481  EAWNDGNYIMLRNLYANEKRWKEADEVKGLMRRNGVKKEAGCSAIEVDSRVWEFVAGDRV 540

Query: 1741 HPQWEGMHLVLRQLWLHMKG 1800
            HP+WE +H VL QLW+HMKG
Sbjct: 541  HPKWEAIHSVLGQLWVHMKG 560


>ref|XP_002326871.1| predicted protein [Populus trichocarpa] gi|222835186|gb|EEE73621.1|
            predicted protein [Populus trichocarpa]
          Length = 581

 Score =  702 bits (1811), Expect = 0.0
 Identities = 337/582 (57%), Positives = 445/582 (76%), Gaps = 6/582 (1%)
 Frame = +1

Query: 61   IERKCLSLLQKPNTKNTIPQIHSFILRNGIENNVNIFTKFISACCSVPTTCSFSGVRYAR 240
            +ER+CL LLQ+  T+ T+ QIH+ ILRN I+ NVNI TKFI+ C  + +T      R+AR
Sbjct: 1    MERECLFLLQRCRTRKTLLQIHALILRNAIDANVNILTKFITTCGQLSST------RHAR 54

Query: 241  RVFDQRTERDDSYLCNSMIKGYVDVYLFKESMTLYRDLKRGTCFSPDNFTFPSLLKSCGL 420
             +FD R+ R D++LCNSMIK +V +    ++ TLY+DL+R TCF PDNFTF  L K C L
Sbjct: 55   HLFDNRSHRGDTFLCNSMIKSHVVMRQLADAFTLYKDLRRETCFVPDNFTFTVLAKCCAL 114

Query: 421  NLGFNEGFQLHGEVIKMGFCYNLFISTTLVDMYVKFGNMVSSRKVFDEMPFRNQVSWTAV 600
             +   EG + HG V+K+GFC+++++ST LVDMY KFGN+  +RKVF++MP R+ VSWTA+
Sbjct: 115  RMAVWEGLETHGHVVKIGFCFDMYVSTALVDMYAKFGNLGLARKVFNDMPDRSLVSWTAL 174

Query: 601  VVGYARCGDTCMSKELFDCMPEKDSAAYNAMIDAYVKSGNMSFAKELFDQIPEKNIVSWS 780
            + GY R GD   +  LF  MP +DSAA+N +ID YVK G+M  A+ LFD++PE+N++SW+
Sbjct: 175  IGGYVRRGDMGNAWFLFKLMPGRDSAAFNLLIDGYVKVGDMESARSLFDEMPERNVISWT 234

Query: 781  TLIDGYCKNGDLGAARLFFDAMPEKNLVSWNTMISGYSQNKQPCQALELFREMQSSSMFQ 960
            ++I GYC NGD+ +AR  FDAMPEKNLVSWN MI GY QNKQP +AL+LFRE+QSS++F+
Sbjct: 235  SMIYGYCNNGDVLSARFLFDAMPEKNLVSWNAMIGGYCQNKQPHEALKLFRELQSSTVFE 294

Query: 961  PDNVTVVSILPAIADLGALDFGCWVHGYVQRKGLNRASNVCTALIDMFAKCGETVKAKQV 1140
            P+ VTVVSILPAIA LGAL+ G WVH +VQRK L+ A NVCT+L+DM+ KCGE  KA++V
Sbjct: 295  PNEVTVVSILPAIATLGALELGEWVHRFVQRKKLDAAVNVCTSLVDMYLKCGEISKARKV 354

Query: 1141 FGRIHNRETPSWNAMINGFALNGHAKEALEVFSEMLKEGVKPNEITMIGVLSACNHAGLV 1320
            F  I  +ET +WNA+INGFA+NG A EALE FSEM +EG+KPN+ITM GVLSAC+H GLV
Sbjct: 355  FSEIPKKETATWNALINGFAMNGLASEALEAFSEMQQEGIKPNDITMTGVLSACSHGGLV 414

Query: 1321 DEGKKWFKLMENYGIIPRIEHYGCMVDILGRAGRVEEAEELMMNMPYELNSIVLSSFLFA 1500
            +EGK  FK M   G+ P+IEHYGC+VD+LGRAG ++EAE L+ +MP+E N I+LSSF FA
Sbjct: 415  EEGKGQFKAMIESGLSPKIEHYGCLVDLLGRAGCLDEAENLIKSMPFEANGIILSSFSFA 474

Query: 1501 CGCRGDVMRAEKFIRKAFDIEPWNDGNYIMMRNLYAGEKRWKEVEEIKGLMRKYRAKKEV 1680
            CG   DV RA++ + +A ++EP N+G Y+MMRNLYA E+RWK+V+EI GLMR+  AKKEV
Sbjct: 475  CGFSNDVTRAQRVLNQAVNMEPGNNGIYVMMRNLYAMEERWKDVKEINGLMRRRGAKKEV 534

Query: 1681 GCSVIEVNNGVWEFVAGDRAHPQWEGMHLVLRQLWLHMKGEV 1806
            G S IEV++ V EF++G  AHPQ + +  V+ QLW+HM+  V
Sbjct: 535  GSSAIEVDSRVSEFISGGIAHPQLDVIESVIGQLWIHMRDSV 576


>ref|XP_002880144.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297325983|gb|EFH56403.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 555

 Score =  647 bits (1668), Expect = 0.0
 Identities = 298/558 (53%), Positives = 419/558 (75%), Gaps = 1/558 (0%)
 Frame = +1

Query: 133  ILRNGIENNVNIFTKFISACCSVPTTCSFSGVRYARRVFDQRTERDDSYLCNSMIKGYVD 312
            +LR+ IE NV IFTKF+    S        G+ YAR++FDQR  R+DS+LCNSMIK Y++
Sbjct: 1    MLRHAIETNVQIFTKFLVISASAV------GIGYARKLFDQRPHREDSFLCNSMIKAYLE 54

Query: 313  VYLFKESMTLYRDLKRGTCFSPDNFTFPSLLKSCGLNLGFNEGFQLHGEVIKMGFCYNLF 492
               + +S   YRDL++ TC +PDNFTF ++ KSC L++   +G QLH ++ + GFC +++
Sbjct: 55   TRHYNDSFAFYRDLRKETCLAPDNFTFTTMTKSCTLSMCVYQGLQLHSQIWRSGFCADMY 114

Query: 493  ISTTLVDMYVKFGNMVSSRKVFDEMPFRNQVSWTAVVVGYARCGDTCMSKELFDCMPE-K 669
            +ST +VDMY KFG M  +R VFDEMP R++VSWTA++ GY R G+  ++ +LFD MP+ K
Sbjct: 115  VSTGVVDMYAKFGKMGCARNVFDEMPQRSEVSWTALICGYVRFGELDLASKLFDQMPQVK 174

Query: 670  DSAAYNAMIDAYVKSGNMSFAKELFDQIPEKNIVSWSTLIDGYCKNGDLGAARLFFDAMP 849
            D   YNAM+D +VKSG+M+ A+ LFD++  K +++W+T+I GYC + D+ +AR  FDAMP
Sbjct: 175  DVVIYNAMMDGFVKSGDMTSARRLFDEMTHKTVITWTTMIHGYCNSNDIDSARKLFDAMP 234

Query: 850  EKNLVSWNTMISGYSQNKQPCQALELFREMQSSSMFQPDNVTVVSILPAIADLGALDFGC 1029
            E+NLVSWNTMI GY QNKQP +A+ LF+EMQ+++   PD+VT++S+LPAI+D GAL  G 
Sbjct: 235  ERNLVSWNTMIGGYCQNKQPQEAIRLFQEMQATTSLDPDDVTILSVLPAISDTGALSLGE 294

Query: 1030 WVHGYVQRKGLNRASNVCTALIDMFAKCGETVKAKQVFGRIHNRETPSWNAMINGFALNG 1209
            W H +VQRK L++   VCTA++DM++KCGE  KAK++F  +  ++  SWNAMI+G+ALNG
Sbjct: 295  WCHCFVQRKNLDKKVKVCTAILDMYSKCGEIEKAKRIFDEMPEKQVASWNAMIHGYALNG 354

Query: 1210 HAKEALEVFSEMLKEGVKPNEITMIGVLSACNHAGLVDEGKKWFKLMENYGIIPRIEHYG 1389
            +A  AL++F  M KE  KP+EITM+ V+SACNH GLV+EG+KWF++M  +G+  +IEHYG
Sbjct: 355  NAHAALDLFLTMAKE-EKPDEITMLAVISACNHGGLVEEGRKWFQMMRKFGLNAKIEHYG 413

Query: 1390 CMVDILGRAGRVEEAEELMMNMPYELNSIVLSSFLFACGCRGDVMRAEKFIRKAFDIEPW 1569
            CMVD+LGRAG +++AE L+ NMP++ N I+LSSFL ACG   D+ RAE+ ++KA ++EP 
Sbjct: 414  CMVDLLGRAGNLKQAEHLITNMPFKPNGIILSSFLSACGQYKDIERAERILKKAVELEPQ 473

Query: 1570 NDGNYIMMRNLYAGEKRWKEVEEIKGLMRKYRAKKEVGCSVIEVNNGVWEFVAGDRAHPQ 1749
            NDGNY+++RNLYA +KRW +   +K +MRK  AKKEVGCS+IE+N  V EF++GD  HP 
Sbjct: 474  NDGNYVLLRNLYAADKRWDDFGMVKNMMRKNEAKKEVGCSLIEINYIVSEFISGDTTHPH 533

Query: 1750 WEGMHLVLRQLWLHMKGE 1803
             + +HLVL +L +HMK E
Sbjct: 534  RQSIHLVLEKLLVHMKEE 551


>ref|XP_003551717.1| PREDICTED: pentatricopeptide repeat-containing protein At2g44880-like
            [Glycine max]
          Length = 599

 Score =  646 bits (1666), Expect = 0.0
 Identities = 313/595 (52%), Positives = 429/595 (72%), Gaps = 4/595 (0%)
 Frame = +1

Query: 34   ETQHSSWSSIERKCLSLLQ-KPNTKNTIPQIHSFILRNGIENNVNIFTKFISACCSVPTT 210
            + Q + WS+ ER CL +LQ +  +  T+ QIH+FILR+ + +N+N+ T F++ C S+  +
Sbjct: 6    QPQRTLWSNAERTCLHILQCRTKSIPTLLQIHAFILRHSLHSNLNLLTAFVTTCASLAAS 65

Query: 211  CS--FSGVRYARRVFDQRTERDDSYLCNSMIKGYVDVYLFKESMTLYRDLKR-GTCFSPD 381
                 + + +ARR F+  T   D++LCNSMI  +     F +  TL+RDL+R    F+PD
Sbjct: 66   AKRPLAIINHARRFFNA-THTRDTFLCNSMIAAHFAARQFSQPFTLFRDLRRQAPPFTPD 124

Query: 382  NFTFPSLLKSCGLNLGFNEGFQLHGEVIKMGFCYNLFISTTLVDMYVKFGNMVSSRKVFD 561
             +TF +L+K C   +   EG  LHG V+K G C++L+++T LVDMYVKFG + S+RKVFD
Sbjct: 125  GYTFTALVKGCATRVATGEGTLLHGMVLKNGVCFDLYVATALVDMYVKFGVLGSARKVFD 184

Query: 562  EMPFRNQVSWTAVVVGYARCGDTCMSKELFDCMPEKDSAAYNAMIDAYVKSGNMSFAKEL 741
            EM  R++VSWTAV+VGYARCGD   ++ LFD M ++D  A+NAMID YVK G +  A+EL
Sbjct: 185  EMSVRSKVSWTAVIVGYARCGDMSEARRLFDEMEDRDIVAFNAMIDGYVKMGCVGLAREL 244

Query: 742  FDQIPEKNIVSWSTLIDGYCKNGDLGAARLFFDAMPEKNLVSWNTMISGYSQNKQPCQAL 921
            F+++ E+N+VSW++++ GYC NGD+  A+L FD MPEKN+ +WN MI GY QN++   AL
Sbjct: 245  FNEMRERNVVSWTSMVSGYCGNGDVENAKLMFDLMPEKNVFTWNAMIGGYCQNRRSHDAL 304

Query: 922  ELFREMQSSSMFQPDNVTVVSILPAIADLGALDFGCWVHGYVQRKGLNRASNVCTALIDM 1101
            ELFREMQ++S+ +P+ VTVV +LPA+ADLGALD G W+H +  RK L+R++ + TALIDM
Sbjct: 305  ELFREMQTASV-EPNEVTVVCVLPAVADLGALDLGRWIHRFALRKKLDRSARIGTALIDM 363

Query: 1102 FAKCGETVKAKQVFGRIHNRETPSWNAMINGFALNGHAKEALEVFSEMLKEGVKPNEITM 1281
            +AKCGE  KAK  F  +  RET SWNA+INGFA+NG AKEALEVF+ M++EG  PNE+TM
Sbjct: 364  YAKCGEITKAKLAFEGMTERETASWNALINGFAVNGCAKEALEVFARMIEEGFGPNEVTM 423

Query: 1282 IGVLSACNHAGLVDEGKKWFKLMENYGIIPRIEHYGCMVDILGRAGRVEEAEELMMNMPY 1461
            IGVLSACNH GLV+EG++WF  ME +GI P++EHYGCMVD+LGRAG ++EAE L+  MPY
Sbjct: 424  IGVLSACNHCGLVEEGRRWFNAMERFGIAPQVEHYGCMVDLLGRAGCLDEAENLIQTMPY 483

Query: 1462 ELNSIVLSSFLFACGCRGDVMRAEKFIRKAFDIEPWNDGNYIMMRNLYAGEKRWKEVEEI 1641
            + N I+LSSFLFACG   DV+RAE+ +++   ++    GNY+M+RNLYA  +RW +VE++
Sbjct: 484  DANGIILSSFLFACGYFNDVLRAERVLKEVVKMDEDVAGNYVMLRNLYATRQRWTDVEDV 543

Query: 1642 KGLMRKYRAKKEVGCSVIEVNNGVWEFVAGDRAHPQWEGMHLVLRQLWLHMKGEV 1806
            K +M+K    KEV CSVIE+     EF AGD  H   E + L L QL  HMK E+
Sbjct: 544  KQMMKKRGTSKEVACSVIEIGGSFIEFAAGDYLHSHLEVIQLTLGQLSKHMKVEI 598

