BLASTX nr result

ID: Cnidium21_contig00035941 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00035941
         (1493 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   524   e-146
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...   475   e-131
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           462   e-127
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   462   e-127
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   462   e-127

>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  524 bits (1349), Expect = e-146
 Identities = 269/449 (59%), Positives = 338/449 (75%)
 Frame = +2

Query: 23   LQLSVPHPWCSNKTHLQLGKSLRSLRLLPSCALSKQGQRFFTSLATSVTSDPLVTDRLIR 202
            LQ+S P PW                 LL  CALSKQGQ F +S+A     DP  ++RLI 
Sbjct: 5    LQVSRPQPWNHRSP------------LLIQCALSKQGQLFLSSVAR----DPSASNRLIC 48

Query: 203  KFIASSSKPVALNALSHLLTPNSSHPHLSSLAFPFYSLLTRAPWFSWNPKLVADVIAMLY 382
            KFIASSSK +ALNALSHLL+P ++HP+LSSLA P YS ++ A WFSWNPKL+ADVIA+LY
Sbjct: 49   KFIASSSKSIALNALSHLLSPTTTHPYLSSLALPLYSRISEASWFSWNPKLIADVIALLY 108

Query: 383  KQERFREADALISETVSKLDIRERELCTFYCCLIEFHSKHKSKQGVFDSYTSLKQILSCS 562
            KQ + +EA+ L+SET+ KL  RER+L +FYC LI+ HSKH S QGVFD  + L +I+S S
Sbjct: 109  KQGQLKEAETLVSETLIKLGSRERDLVSFYCNLIDSHSKHSSNQGVFDVISRLSRIVSES 168

Query: 563  SSVYLKRRAYESMICSLCVIDLPREAENMMEEMKVQGITSKPSLFELRSIVYAYGRVGLF 742
            SSVY+K RAY+SMI SLC + LP EAEN++EEM+V+G+  KPS+FE RS+VY YGRVGL 
Sbjct: 169  SSVYVKERAYKSMISSLCAVGLPLEAENLIEEMRVKGL--KPSVFEFRSVVYGYGRVGLS 226

Query: 743  KDMKRLLFEMESEGIKMDTVCFNMVLSSLGAHAELLEMVSWLQRMKYLNVPLSIRTYNTV 922
            +DM+R+L +M +EG ++DTV  NMVLSS GA+ +  EMVSWLQRMK  ++P SIRTYN+V
Sbjct: 227  EDMQRILLQMGNEGFELDTVVSNMVLSSYGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSV 286

Query: 923  LNSCPSIMSRLQDPKSIPLSLNELQEGLCDNESMLVQELIVSSVLVEAMGWTSLELKLDL 1102
            LNSCP IMS LQD K+ P +++EL E L  +E++LV+ELI S VL E M W   E KLDL
Sbjct: 287  LNSCPMIMSILQDLKTFPPTIDELMETLKGDEALLVKELIGSMVLAELMEWDCSEGKLDL 346

Query: 1103 HGMHLSSAYLIILQWFDELRTRFQPVEADIPAEVRIICGSGKHSTVRGDSPVKHLVKVLM 1282
            HGMHL SAYLI+LQW +ELR R    E  +P E+ ++CGSGKHS+VRG+SPVK +V+ +M
Sbjct: 347  HGMHLGSAYLIMLQWREELRYRLNAAEYVMPVEITVVCGSGKHSSVRGESPVKRMVREMM 406

Query: 1283 VRTESPLRIDRKNIGCFVAKGKVCKSWLC 1369
             RT SP++IDRKNIGCFVAK KV K+WLC
Sbjct: 407  TRTRSPMKIDRKNIGCFVAKAKVVKNWLC 435


>ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539507|gb|EEF41095.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 460

 Score =  475 bits (1222), Expect = e-131
 Identities = 237/455 (52%), Positives = 321/455 (70%), Gaps = 7/455 (1%)
 Frame = +2

Query: 26   QLSVPHPWCSNKTHLQLGKSLRSLRLLPS------CALSKQGQRFFTSLATSVTS-DPLV 184
            QL +  PW     H    +     ++LP        ALSKQGQRF +SLA + T  D + 
Sbjct: 8    QLRIKLPWSEQLRHRDHHRHRNQQQVLPMKWVFRCAALSKQGQRFLSSLAIATTKGDTVA 67

Query: 185  TDRLIRKFIASSSKPVALNALSHLLTPNSSHPHLSSLAFPFYSLLTRAPWFSWNPKLVAD 364
            T+RLI+KF+A+S K +AL+ALSHLL P+SSH HLSSLAF  Y  +  A WF WNPKLVAD
Sbjct: 68   TNRLIKKFVAASPKSIALDALSHLLNPHSSHSHLSSLAFTLYLKIAEARWFQWNPKLVAD 127

Query: 365  VIAMLYKQERFREADALISETVSKLDIRERELCTFYCCLIEFHSKHKSKQGVFDSYTSLK 544
            V+A L KQ R+ E+  L+S+++SKL ++ER+L  FYC L+E  SK  S +G  +S  SL 
Sbjct: 128  VVAFLDKQGRYDESATLVSDSISKLQVKERDLARFYCNLVESQSKQNSIRGFDNSVASLM 187

Query: 545  QILSCSSSVYLKRRAYESMICSLCVIDLPREAENMMEEMKVQGITSKPSLFELRSIVYAY 724
            Q++  S+SVY+KR+ Y+SM+  LC +  PREAE ++EEM  +G+  +PS+FE + +VYAY
Sbjct: 188  QLVCNSNSVYVKRQGYKSMVNGLCEMGRPREAETLIEEMGKEGV--RPSMFEFKCVVYAY 245

Query: 725  GRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGAHAELLEMVSWLQRMKYLNVPLSI 904
            G +G F++M + L +ME  G ++DTVC NM+L+S GAH  L EMV WLQ+MK L +P S+
Sbjct: 246  GSLGSFEEMNKCLHQMERAGFRVDTVCSNMILASYGAHNALPEMVLWLQKMKDLGIPFSL 305

Query: 905  RTYNTVLNSCPSIMSRLQDPKSIPLSLNELQEGLCDNESMLVQELIVSSVLVEAMGWTSL 1084
            RT N+ LNSCP+IMS +Q+    P+S+++L + L ++E++LV+E++ SSVL EAM W   
Sbjct: 306  RTCNSALNSCPTIMSMMQNSNDFPISIHDLMKILSEDEALLVKEIVTSSVLDEAMKWDVA 365

Query: 1085 ELKLDLHGMHLSSAYLIILQWFDELRTRFQPVEADIPAEVRIICGSGKHSTVRGDSPVKH 1264
            E KLDLHG HL SAYLIIL W +E+R RF+ V    P E+ ++CGSG HS VRG+SPVK 
Sbjct: 366  EAKLDLHGTHLCSAYLIILLWIEEMRKRFKSVNYVNPTEITVVCGSGNHSIVRGESPVKC 425

Query: 1265 LVKVLMVRTESPLRIDRKNIGCFVAKGKVCKSWLC 1369
            +VK  MVR  SP+RIDR+NIGCF+AKGKV + WLC
Sbjct: 426  MVKDFMVRARSPMRIDRRNIGCFIAKGKVVEEWLC 460


>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score =  462 bits (1189), Expect = e-127
 Identities = 234/418 (55%), Positives = 309/418 (73%), Gaps = 1/418 (0%)
 Frame = +2

Query: 119  LSKQGQRFFTSLAT-SVTSDPLVTDRLIRKFIASSSKPVALNALSHLLTPNSSHPHLSSL 295
            L K G RF +SL++ ++  DP   +R I+KF+A+S K VALN LSHLL+  +SHPHLS  
Sbjct: 85   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 144

Query: 296  AFPFYSLLTRAPWFSWNPKLVADVIAMLYKQERFREADALISETVSKLDIRERELCTFYC 475
            A   YS +T A WF WNPKL+A++IA+L KQERF E++ L+S  VS+L   ER+   F C
Sbjct: 145  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 204

Query: 476  CLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVIDLPREAENMME 655
             L+E +SK  S QG  ++   L++I+  SSSVY+K +AY+SM+  LC +D P +AE ++E
Sbjct: 205  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 264

Query: 656  EMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGA 835
            EM+++ I  KP LFE +S++Y YGR+GLF DM R++  M +EG K+DTVC NMVLSS GA
Sbjct: 265  EMRMEKI--KPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 322

Query: 836  HAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSRLQDPKSIPLSLNELQEGLCDN 1015
            H  L +M SWLQ++K  NVP SIRTYN+VLNSCP+I+S L+D  S P+SL+EL+  L ++
Sbjct: 323  HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 382

Query: 1016 ESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRTRFQPVEADIP 1195
            E++LV EL  SSVL EA+ W ++E KLDLHGMHLSS+YLI+LQW DE R RF   +  IP
Sbjct: 383  EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 442

Query: 1196 AEVRIICGSGKHSTVRGDSPVKHLVKVLMVRTESPLRIDRKNIGCFVAKGKVCKSWLC 1369
            AE+ ++ GSGKHS VRG+SPVK LVK +MVRT SP+RIDRKN+G F+AKGK  K WLC
Sbjct: 443  AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 500


>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein
            [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown
            protein [Arabidopsis thaliana]
            gi|330251481|gb|AEC06575.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  462 bits (1189), Expect = e-127
 Identities = 234/418 (55%), Positives = 309/418 (73%), Gaps = 1/418 (0%)
 Frame = +2

Query: 119  LSKQGQRFFTSLAT-SVTSDPLVTDRLIRKFIASSSKPVALNALSHLLTPNSSHPHLSSL 295
            L K G RF +SL++ ++  DP   +R I+KF+A+S K VALN LSHLL+  +SHPHLS  
Sbjct: 88   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 147

Query: 296  AFPFYSLLTRAPWFSWNPKLVADVIAMLYKQERFREADALISETVSKLDIRERELCTFYC 475
            A   YS +T A WF WNPKL+A++IA+L KQERF E++ L+S  VS+L   ER+   F C
Sbjct: 148  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 207

Query: 476  CLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVIDLPREAENMME 655
             L+E +SK  S QG  ++   L++I+  SSSVY+K +AY+SM+  LC +D P +AE ++E
Sbjct: 208  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 267

Query: 656  EMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGA 835
            EM+++ I  KP LFE +S++Y YGR+GLF DM R++  M +EG K+DTVC NMVLSS GA
Sbjct: 268  EMRMEKI--KPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 325

Query: 836  HAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSRLQDPKSIPLSLNELQEGLCDN 1015
            H  L +M SWLQ++K  NVP SIRTYN+VLNSCP+I+S L+D  S P+SL+EL+  L ++
Sbjct: 326  HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 385

Query: 1016 ESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRTRFQPVEADIP 1195
            E++LV EL  SSVL EA+ W ++E KLDLHGMHLSS+YLI+LQW DE R RF   +  IP
Sbjct: 386  EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 445

Query: 1196 AEVRIICGSGKHSTVRGDSPVKHLVKVLMVRTESPLRIDRKNIGCFVAKGKVCKSWLC 1369
            AE+ ++ GSGKHS VRG+SPVK LVK +MVRT SP+RIDRKN+G F+AKGK  K WLC
Sbjct: 446  AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503


>ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein
            [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 505

 Score =  462 bits (1189), Expect = e-127
 Identities = 234/418 (55%), Positives = 309/418 (73%), Gaps = 1/418 (0%)
 Frame = +2

Query: 119  LSKQGQRFFTSLAT-SVTSDPLVTDRLIRKFIASSSKPVALNALSHLLTPNSSHPHLSSL 295
            L K G RF +SL++ ++  DP   +R I+KF+A+S K VALN LSHLL+  +SHPHLS  
Sbjct: 89   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 148

Query: 296  AFPFYSLLTRAPWFSWNPKLVADVIAMLYKQERFREADALISETVSKLDIRERELCTFYC 475
            A   YS +T A WF WNPKL+A++IA+L KQERF E++ L+S  VS+L   ER+   F C
Sbjct: 149  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 208

Query: 476  CLIEFHSKHKSKQGVFDSYTSLKQILSCSSSVYLKRRAYESMICSLCVIDLPREAENMME 655
             L+E +SK  S QG  ++   L++I+  SSSVY+K +AY+SM+  LC +D P +AE ++E
Sbjct: 209  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 268

Query: 656  EMKVQGITSKPSLFELRSIVYAYGRVGLFKDMKRLLFEMESEGIKMDTVCFNMVLSSLGA 835
            EM+++ I  KP LFE +S++Y YGR+GLF DM R++  M +EG K+DTVC NMVLSS GA
Sbjct: 269  EMRMEKI--KPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGA 326

Query: 836  HAELLEMVSWLQRMKYLNVPLSIRTYNTVLNSCPSIMSRLQDPKSIPLSLNELQEGLCDN 1015
            H  L +M SWLQ++K  NVP SIRTYN+VLNSCP+I+S L+D  S P+SL+EL+  L ++
Sbjct: 327  HDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNED 386

Query: 1016 ESMLVQELIVSSVLVEAMGWTSLELKLDLHGMHLSSAYLIILQWFDELRTRFQPVEADIP 1195
            E++LV EL  SSVL EA+ W ++E KLDLHGMHLSS+YLI+LQW DE R RF   +  IP
Sbjct: 387  EALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 446

Query: 1196 AEVRIICGSGKHSTVRGDSPVKHLVKVLMVRTESPLRIDRKNIGCFVAKGKVCKSWLC 1369
            AE+ ++ GSGKHS VRG+SPVK LVK +MVRT SP+RIDRKN+G F+AKGK  K WLC
Sbjct: 447  AEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504


Top