BLASTX nr result

ID: Coptis25_contig00012911 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Coptis25_contig00012911
         (1622 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi...   541   e-151
ref|XP_002526948.1| pentatricopeptide repeat-containing protein,...   486   e-135
ref|XP_002324000.1| predicted protein [Populus trichocarpa] gi|2...   479   e-133
ref|XP_002873660.1| pentatricopeptide repeat-containing protein ...   474   e-131
ref|NP_190245.1| pentatricopeptide repeat-containing protein [Ar...   461   e-127

>ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610
            [Vitis vinifera]
          Length = 763

 Score =  541 bits (1394), Expect = e-151
 Identities = 290/509 (56%), Positives = 349/509 (68%), Gaps = 39/509 (7%)
 Frame = +2

Query: 212  MQSLSVWSVRGDFLAVPHFSFGLNTSRTKDKH-----------------------FTCTS 322
            MQ+LSVW  +G F AVP   + L +S    +                         + +S
Sbjct: 1    MQALSVWPSKGVFWAVPQLDYNLGSSSIPSRRRGRRKLWNPEDPVCQYRSLAFLWVSSSS 60

Query: 323  RYHLVFVP----------------SNLKVFKLXXXXXXXXXXXXXALTWAVEQEETGNGV 454
            R   V V                 S LK+F L             AL WA+EQ+  GN  
Sbjct: 61   RSDRVGVYCGSPKFDFGCGLLSGYSKLKIF-LLCERKRGSFGASFALAWALEQQAIGNEF 119

Query: 455  SVESTSLIGELSDKSDSVEVDCGNTDEGADSETVNVVTEGNGIGQNENDLGEQTSIRVDV 634
              E ++ I  L+  +++V++DC   D   D +  +   E     +   ++ E+ S  VDV
Sbjct: 120  VKEDSNSIHSLAGNTETVDIDCLKVDGARDGDEND--NEEEKEAEKNGEVIEEKSRNVDV 177

Query: 635  RALAGRLQFAETADDVEEVLREMEELPLPVYSSMIRGFGLDKRMESALALFEWLKMKKKT 814
            RALA  L+FA TADDVEEVL++  ELPL VYS+MIRGFG DKR+++A+AL EWLK KK+T
Sbjct: 178  RALAHGLEFATTADDVEEVLKDKVELPLQVYSTMIRGFGTDKRLDAAMALVEWLKRKKET 237

Query: 815  TGGRIGPNLFIYNSLLGAMKQCKQFGGVEKVMEDMAEEGVGSNMVTYNTLMAIYLEQGWP 994
             G + GPNLF+YNSLLGA+KQ ++F  VEKVM DMA EG+  N+VTYNTLM+IYLEQG  
Sbjct: 238  NGSK-GPNLFVYNSLLGAVKQSEKFALVEKVMNDMAREGILPNVVTYNTLMSIYLEQGRS 296

Query: 995  NRALTFLEEIEKKGMSPSPVSYSTALVAYRRMEDGNGGLKFFTELRDRYKKGEIGKYDDE 1174
              AL  LEEI+K G+ PSPVSYSTAL+ YRRMEDG+G LKFF ELR+ Y KGEIGK  DE
Sbjct: 297  VEALNILEEIQKNGLCPSPVSYSTALLVYRRMEDGHGALKFFIELRENYLKGEIGKDADE 356

Query: 1175 DWENEFVKLEKFTIRICYQVMRRWLVKGDHSSTKVLQLLSEMDKAGLNPGRAEYERLVWA 1354
            DWENEFVKL+ FTIRICYQVMRRWLVK  + S  +L+LL++MD AGL PGRAEYERLVWA
Sbjct: 357  DWENEFVKLKNFTIRICYQVMRRWLVKEGNQSPILLKLLADMDNAGLQPGRAEYERLVWA 416

Query: 1355 CTLEGHHIVAKDLYKRIRERESEISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPN 1534
            CT E H++VAK+LY RIRER +EISLSVCNH IWLMGKAKKWWAALEIYEDLLDKGPKPN
Sbjct: 417  CTREEHYVVAKELYTRIRERHTEISLSVCNHIIWLMGKAKKWWAALEIYEDLLDKGPKPN 476

Query: 1535 NLSYELVVSHFNVLLTAASRRGIWRWGVR 1621
            NLSYELVVSHFN+LLTAA ++GIWRWGVR
Sbjct: 477  NLSYELVVSHFNILLTAARKKGIWRWGVR 505


>ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223533700|gb|EEF35435.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 671

 Score =  486 bits (1252), Expect = e-135
 Identities = 245/406 (60%), Positives = 304/406 (74%), Gaps = 2/406 (0%)
 Frame = +2

Query: 410  ALTWAVEQEETGNGVSVESTSLIGELSDKSDSVEVDCGNTDEGADSETVNVVTEGNGIG- 586
            A  WA+++++  +       SL   L  KS+  +V+  N     DS+  N   E N    
Sbjct: 9    AFAWALQKQDISSEFHGVEPSLDDGLLGKSEKEDVNPHNLGRLEDSDDDNNNQEDNIELD 68

Query: 587  -QNENDLGEQTSIRVDVRALAGRLQFAETADDVEEVLREMEELPLPVYSSMIRGFGLDKR 763
             +++  +GE+    +DVR+LA  L  A+TADDVEEVL++  ELPL VYSSMI+ FG D +
Sbjct: 69   LRSKEGVGEEKCRSIDVRSLARSLHSAQTADDVEEVLKDKGELPLQVYSSMIKAFGWDNK 128

Query: 764  MESALALFEWLKMKKKTTGGRIGPNLFIYNSLLGAMKQCKQFGGVEKVMEDMAEEGVGSN 943
            MESALAL EWLK ++K  G  IGPNLFIYNSLL A+K+ K F   EK++ DM +EG+  N
Sbjct: 129  MESALALVEWLK-RRKEIGSSIGPNLFIYNSLLSAVKKSKLFEEAEKILNDMTQEGIAPN 187

Query: 944  MVTYNTLMAIYLEQGWPNRALTFLEEIEKKGMSPSPVSYSTALVAYRRMEDGNGGLKFFT 1123
            +VTYNTLM IY+E+G   +AL  LE++ +KG  P+  SYSTAL+AYR MEDG+G L FF 
Sbjct: 188  VVTYNTLMGIYVEKGQATKALNILEQMHEKGFIPTAASYSTALLAYRGMEDGHGALAFFV 247

Query: 1124 ELRDRYKKGEIGKYDDEDWENEFVKLEKFTIRICYQVMRRWLVKGDHSSTKVLQLLSEMD 1303
            +++D+Y KG+IGK  DE+WENEFVKLE F IRICYQVMRRWLV+ D+ ST VL+LL++MD
Sbjct: 248  DIKDKYLKGKIGKNSDENWENEFVKLETFIIRICYQVMRRWLVRHDNFSTDVLKLLTDMD 307

Query: 1304 KAGLNPGRAEYERLVWACTLEGHHIVAKDLYKRIRERESEISLSVCNHAIWLMGKAKKWW 1483
            KAGL P +AEYERLVWACT E H+ V K+LY RIRER S+ISLSVCNH IWLMGKAKKWW
Sbjct: 308  KAGLQPSQAEYERLVWACTREDHYAVGKELYIRIRERHSKISLSVCNHLIWLMGKAKKWW 367

Query: 1484 AALEIYEDLLDKGPKPNNLSYELVVSHFNVLLTAASRRGIWRWGVR 1621
            AALEIYEDLLDKGP PNN+SYEL+VSHFN+LLTAA +RGIWRWGVR
Sbjct: 368  AALEIYEDLLDKGPNPNNMSYELIVSHFNILLTAARKRGIWRWGVR 413


>ref|XP_002324000.1| predicted protein [Populus trichocarpa] gi|222867002|gb|EEF04133.1|
            predicted protein [Populus trichocarpa]
          Length = 709

 Score =  479 bits (1234), Expect = e-133
 Identities = 258/489 (52%), Positives = 328/489 (67%), Gaps = 19/489 (3%)
 Frame = +2

Query: 212  MQSLSVWSVRGDFLAVPHFSF------------GLNTSRTKDKHFT-CTSRYHLV----- 337
            MQ+LSVW + G   AVPH  F            G+      D  F   +S + +V     
Sbjct: 1    MQTLSVWPLSGGSCAVPHLEFEEDSSCFLSTRRGIKRWGLVDNVFQGASSGFPMVSGDLR 60

Query: 338  FVPSNLKV-FKLXXXXXXXXXXXXXALTWAVEQEETGNGVSVESTSLIGELSDKSDSVEV 514
            F+ ++ K+ +               AL  A+EQ++ GN    E   +   L D+S     
Sbjct: 61   FLSNHSKIKYVCFRETKEGSFGSSLALASALEQQKIGN----EFHRVESSLDDRS----- 111

Query: 515  DCGNTDEGADSETVNVVTEGNGIGQNENDLGEQTSIRVDVRALAGRLQFAETADDVEEVL 694
                                        + GE+   ++DV ALA  L FA+T DD+EEVL
Sbjct: 112  --------------------------LGEAGEERDEKIDVPALAQSLYFAKTVDDIEEVL 145

Query: 695  REMEELPLPVYSSMIRGFGLDKRMESALALFEWLKMKKKTTGGRIGPNLFIYNSLLGAMK 874
            ++  ELP+ VY SMI+GFG DK+ME A+AL +WLK+KK+T G  I PNLFIYNSLL A+K
Sbjct: 146  KDKGELPVQVYLSMIKGFGWDKKMEPAIALVDWLKIKKETDG-TIVPNLFIYNSLLSAVK 204

Query: 875  QCKQFGGVEKVMEDMAEEGVGSNMVTYNTLMAIYLEQGWPNRALTFLEEIEKKGMSPSPV 1054
            Q +Q+   EK++E M +EGV  N+VTYN LM IY++QG   +AL  LEE+ + G +PS  
Sbjct: 205  QSEQYEETEKILERMTQEGVAPNVVTYNILMVIYVKQGQAKKALDVLEEMRRNGFTPSAA 264

Query: 1055 SYSTALVAYRRMEDGNGGLKFFTELRDRYKKGEIGKYDDEDWENEFVKLEKFTIRICYQV 1234
            SYS+AL+AYR+MEDG+G LKFF E++D+Y KGEIGK  DEDWE E+VKLE FTIR+CYQV
Sbjct: 265  SYSSALLAYRKMEDGDGALKFFVEIKDKYMKGEIGKDADEDWEREYVKLENFTIRVCYQV 324

Query: 1235 MRRWLVKGDHSSTKVLQLLSEMDKAGLNPGRAEYERLVWACTLEGHHIVAKDLYKRIRER 1414
            MRRWLV+ ++ +T VL+LL++MDKA L PGR++YERLVWACT E H++VAK+LY RIRER
Sbjct: 325  MRRWLVRLENLNTNVLKLLTDMDKAELQPGRSDYERLVWACTREEHYVVAKELYIRIRER 384

Query: 1415 ESEISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPNNLSYELVVSHFNVLLTAASR 1594
             S+ISLSVCNH IWLMGKAKKWWAALE+YEDLLDKGPKPNNLSYEL+VS+FNVLLTAA +
Sbjct: 385  CSDISLSVCNHVIWLMGKAKKWWAALEVYEDLLDKGPKPNNLSYELIVSYFNVLLTAAKK 444

Query: 1595 RGIWRWGVR 1621
            RGIWRWGVR
Sbjct: 445  RGIWRWGVR 453


>ref|XP_002873660.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297319497|gb|EFH49919.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 674

 Score =  474 bits (1221), Expect = e-131
 Identities = 260/491 (52%), Positives = 322/491 (65%), Gaps = 21/491 (4%)
 Frame = +2

Query: 212  MQSLSVWSVRGDFLAVPHFSFGLNTS------RTKDKHFT----CTSRYH-LVFVPSNLK 358
            MQ+LS+W ++   L      F L+ S      +++ +H +    C  R   L+ V SN K
Sbjct: 1    MQALSIWPLKSGLLVGSRLEFELDCSCFVVSHKSRKRHCSAQQGCFGRISSLILVSSNRK 60

Query: 359  VFKLXXXXXXXXXXXXX----------ALTWAVEQEETGNGVSVESTSLIGELSDKSDSV 508
               L                        + WA EQ E G  VS E +S            
Sbjct: 61   FEGLAVNPTSKVLFLCEPKRNLSGSSVGVGWATEQRELGEEVSTEDSSY----------- 109

Query: 509  EVDCGNTDEGADSETVNVVTEGNGIGQNENDLGEQTSIRVDVRALAGRLQFAETADDVEE 688
                         +TVN               GE+T+ RVDVR LA  L+ A+TADDV+ 
Sbjct: 110  ------------PQTVNG--------------GEKTNSRVDVRELAYSLRAAKTADDVDI 143

Query: 689  VLREMEELPLPVYSSMIRGFGLDKRMESALALFEWLKMKKKTTGGRIGPNLFIYNSLLGA 868
            V++EM ELPL VY +MIRGFG DKR++ A+A+ +WL+ KK  +GG IGPNLFIYNSLLGA
Sbjct: 144  VIKEMGELPLQVYCAMIRGFGKDKRLKPAIAVVDWLRRKKSESGGVIGPNLFIYNSLLGA 203

Query: 869  MKQCKQFGGVEKVMEDMAEEGVGSNMVTYNTLMAIYLEQGWPNRALTFLEEIEKKGMSPS 1048
            MKQ    G  EK++ DM EEG+  N+VTYNTLM IY+E+G  ++AL  L+ +++KG  P+
Sbjct: 204  MKQ-SSVGEAEKILSDMEEEGIVPNIVTYNTLMVIYMEKGEFHKALGILDLVKEKGFEPN 262

Query: 1049 PVSYSTALVAYRRMEDGNGGLKFFTELRDRYKKGEIGKYDDEDWENEFVKLEKFTIRICY 1228
            P++YSTAL+ YRRMEDG G L+FF ELR++Y K EIG   D DWE EFVKLE F  RICY
Sbjct: 263  PITYSTALLVYRRMEDGMGALEFFVELREKYSKREIGNDADYDWEFEFVKLENFIGRICY 322

Query: 1229 QVMRRWLVKGDHSSTKVLQLLSEMDKAGLNPGRAEYERLVWACTLEGHHIVAKDLYKRIR 1408
            QVMRRWLVK ++ +T+VL+LL+ MD AG  P R E+ERL+WACT E H+IV K+LYKRIR
Sbjct: 323  QVMRRWLVKDENWTTRVLKLLNAMDNAGPKPSREEHERLIWACTREEHYIVGKELYKRIR 382

Query: 1409 ERESEISLSVCNHAIWLMGKAKKWWAALEIYEDLLDKGPKPNNLSYELVVSHFNVLLTAA 1588
            ER  EISLSVCNH IWLMGKAKKWWAALEIYEDLLD+GP+PNNLSYELVVSHFN+LL+AA
Sbjct: 383  ERFPEISLSVCNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAA 442

Query: 1589 SRRGIWRWGVR 1621
            SRRGIWRWGVR
Sbjct: 443  SRRGIWRWGVR 453



 Score = 57.8 bits (138), Expect = 8e-06
 Identities = 58/262 (22%), Positives = 106/262 (40%), Gaps = 21/262 (8%)
 Frame = +2

Query: 836  NLFIYNSLLGAMKQCKQFGGVEKVMEDMAEEGVGSNMVTY-------NTLMAIYLEQGWP 994
            +L + N L+  M + K++    ++ ED+ +EG   N ++Y       N L++    +G  
Sbjct: 389  SLSVCNHLIWLMGKAKKWWAALEIYEDLLDEGPEPNNLSYELVVSHFNILLSAASRRGIW 448

Query: 995  NRALTFLEEIEKKGMSPSPVSYSTALVAYRRMEDGNGGLKFFTELRDRYKKG-------- 1150
               +  L ++E KG+ P    ++  LVA  +  +    ++ F  + D  +K         
Sbjct: 449  RWGVRLLNKMEDKGLKPQSRHWNAVLVACSKASETTAAIQIFKAMVDNGEKPTVISYGAL 508

Query: 1151 ----EIGKYDDEDWE--NEFVKLEKFTIRICYQVMRRWLVKGDHSSTKVLQLLSEMDKAG 1312
                E GK  DE +   N  +K+        Y  M   ++ G      +  LL EM   G
Sbjct: 509  LSALEKGKLYDEAFRVWNHMIKVGIEPNLYAYTTMAS-VLTGQQKFNLLDTLLKEMASKG 567

Query: 1313 LNPGRAEYERLVWACTLEGHHIVAKDLYKRIRERESEISLSVCNHAIWLMGKAKKWWAAL 1492
            + P    Y  ++  C   G   VA + + R+R  + E +       I  +    K   A 
Sbjct: 568  IEPSVVTYNAVISGCARNGLSGVAYEWFHRMRGEKVEPNEITYEMLIEALANDAKPRLAY 627

Query: 1493 EIYEDLLDKGPKPNNLSYELVV 1558
            E++    + G K ++  Y+ VV
Sbjct: 628  ELHLKAQNDGLKLSSKPYDAVV 649


>ref|NP_190245.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75206903|sp|Q9SNB7.1|PP264_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g46610 gi|6523064|emb|CAB62331.1| hypothetical protein
            [Arabidopsis thaliana] gi|332644660|gb|AEE78181.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 665

 Score =  461 bits (1185), Expect = e-127
 Identities = 225/339 (66%), Positives = 276/339 (81%)
 Frame = +2

Query: 605  GEQTSIRVDVRALAGRLQFAETADDVEEVLREMEELPLPVYSSMIRGFGLDKRMESALAL 784
            GE+ ++RVDVR LA  L+ A+TADDV+ VL++  ELPL V+ +MI+GFG DKR++ A+A+
Sbjct: 109  GEKNNLRVDVRELAFSLRAAKTADDVDAVLKDKGELPLQVFCAMIKGFGKDKRLKPAVAV 168

Query: 785  FEWLKMKKKTTGGRIGPNLFIYNSLLGAMKQCKQFGGVEKVMEDMAEEGVGSNMVTYNTL 964
             +WLK KK  +GG IGPNLFIYNSLLGAM+    FG  EK+++DM EEG+  N+VTYNTL
Sbjct: 169  VDWLKRKKSESGGVIGPNLFIYNSLLGAMRG---FGEAEKILKDMEEEGIVPNIVTYNTL 225

Query: 965  MAIYLEQGWPNRALTFLEEIEKKGMSPSPVSYSTALVAYRRMEDGNGGLKFFTELRDRYK 1144
            M IY+E+G   +AL  L+  ++KG  P+P++YSTAL+ YRRMEDG G L+FF ELR++Y 
Sbjct: 226  MVIYMEEGEFLKALGILDLTKEKGFEPNPITYSTALLVYRRMEDGMGALEFFVELREKYA 285

Query: 1145 KGEIGKYDDEDWENEFVKLEKFTIRICYQVMRRWLVKGDHSSTKVLQLLSEMDKAGLNPG 1324
            K EIG     DWE EFVKLE F  RICYQVMRRWLVK D+ +T+VL+LL+ MD AG+ P 
Sbjct: 286  KREIGNDVGYDWEFEFVKLENFIGRICYQVMRRWLVKDDNWTTRVLKLLNAMDSAGVRPS 345

Query: 1325 RAEYERLVWACTLEGHHIVAKDLYKRIRERESEISLSVCNHAIWLMGKAKKWWAALEIYE 1504
            R E+ERL+WACT E H+IV K+LYKRIRER SEISLSVCNH IWLMGKAKKWWAALEIYE
Sbjct: 346  REEHERLIWACTREEHYIVGKELYKRIRERFSEISLSVCNHLIWLMGKAKKWWAALEIYE 405

Query: 1505 DLLDKGPKPNNLSYELVVSHFNVLLTAASRRGIWRWGVR 1621
            DLLD+GP+PNNLSYELVVSHFN+LL+AAS+RGIWRWGVR
Sbjct: 406  DLLDEGPEPNNLSYELVVSHFNILLSAASKRGIWRWGVR 444

