BLASTX nr result

ID: Cnidium21_contig00018268

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00018268
         (1193 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002868835.1| pentatricopeptide repeat-containing protein ...   291   2e-76
ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi...   291   3e-76
ref|XP_003516576.1| PREDICTED: pentatricopeptide repeat-containi...   290   5e-76
ref|NP_195528.1| pentatricopeptide repeat-containing protein [Ar...   282   1e-73
gb|AAL77701.1| AT4g38150/F20D10_270 [Arabidopsis thaliana] gi|23...   281   3e-73

>ref|XP_002868835.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297314671|gb|EFH45094.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 301

 Score =  291 bits (745), Expect = 2e-76
 Identities = 152/285 (53%), Positives = 192/285 (67%)
 Frame = -3

Query: 1047 QEGRRMPPDPIPNRPLRAEKPTHQRKTFNQGRASDGGGTGATRMDFNKSGNXXXXXXXXX 868
            QE ++ PP+P+PNRPLR E+ ++  +     +A D G    T  D               
Sbjct: 38   QEKQQNPPEPLPNRPLRGERSSNSHREPPARQAHDLGKIDNTLSDDG------------- 84

Query: 867  XXXXXXANLDFLEKFKLGFDKGGEKKEPPKMVYNNSNQPTALPQPEDADEIFKKMKETGL 688
                      FLE+FKLG ++  ++   P+       Q   LP PED+DEIFKKMKE GL
Sbjct: 85   ----------FLEQFKLGVNQDSQETPKPEQY----PQDPLLP-PEDSDEIFKKMKEGGL 129

Query: 687  IPNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFR 508
            IPNAV+ML GLC+DGLVQEAMKLF LM +KGTIPEV+IYTAV+EGFCKAHK++DAKRIFR
Sbjct: 130  IPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIEDAKRIFR 189

Query: 507  KMQGNGISPNAITYGILIQGLIHKKSLDDALVFSVEMLDAGHSPNLATFTGLVDCFCREK 328
            KMQ NGI+PNA +YG+L+QGL +   LDDA+ F  EML++GHSPN+ TF GLVD  CREK
Sbjct: 190  KMQTNGITPNAFSYGVLVQGLYNCNMLDDAVTFCCEMLESGHSPNIPTFVGLVDALCREK 249

Query: 327  GLEEAQNMIKMLREKSFFFEEKAVREYLDKKGPFSQLVWEAILGK 193
            G+E+AQ+ I  L +K F    KAV+E++DK+ PF  L WEAI  K
Sbjct: 250  GVEQAQSAIDGLNQKGFALNVKAVKEFMDKRAPFPSLAWEAIFKK 294


>ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Glycine max]
          Length = 388

 Score =  291 bits (744), Expect = 3e-76
 Identities = 172/345 (49%), Positives = 208/345 (60%), Gaps = 23/345 (6%)
 Frame = -3

Query: 1158 VRRRSFILDSLARSFSSRG--GDFVPKYNKDGARSRSTDQEGRRMPPDPIPNRPLRAEKP 985
            VR  SF  D   RS    G   DF   + +    S   + E  +   +PIP+RPLR+ KP
Sbjct: 40   VRHFSFTDDCSGRSKQPVGESDDF---FLQQSDSSFKDNGESDQSLSEPIPSRPLRSRKP 96

Query: 984  THQR----KTFNQGRAS------DGGGTGATRMDFNKSGNXXXXXXXXXXXXXXXA---- 847
             +Q     + +++G  S      D  G        NKS                      
Sbjct: 97   VNQPPPRFQEYDRGSHSFPPRFYDNHGGPDELDQTNKSSKIDLAFQNTNVAKTNRDAGQS 156

Query: 846  NLDFLEKFKLGFD-------KGGEKKEPPKMVYNNSNQPTALPQPEDADEIFKKMKETGL 688
               FL KFKLGFD       +    K+  +   +N NQP     P+DADEIFKKMKETGL
Sbjct: 157  GDSFLNKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDADEIFKKMKETGL 216

Query: 687  IPNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFR 508
            IPNAV+ML GLC+DGLVQEA+KLF LM EKGTIPE++IYTAV+EG+ KAHK DDAKRIFR
Sbjct: 217  IPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFR 276

Query: 507  KMQGNGISPNAITYGILIQGLIHKKSLDDALVFSVEMLDAGHSPNLATFTGLVDCFCREK 328
            KMQ +G+SPNA +Y +LIQGL     L DA  F VEML+AGHSPN+ TF GLVD FC EK
Sbjct: 277  KMQSSGVSPNAFSYMVLIQGLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGLVDGFCNEK 336

Query: 327  GLEEAQNMIKMLREKSFFFEEKAVREYLDKKGPFSQLVWEAILGK 193
            G+EEA++ IK L +K F   EKAVR++LDKK PFS  VWEAI GK
Sbjct: 337  GVEEAKSAIKTLTDKGFVVNEKAVRQFLDKKAPFSPSVWEAIFGK 381


>ref|XP_003516576.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Glycine max]
          Length = 397

 Score =  290 bits (742), Expect = 5e-76
 Identities = 172/351 (49%), Positives = 214/351 (60%), Gaps = 29/351 (8%)
 Frame = -3

Query: 1158 VRRRSFILDSLARSFSSRG--GDFVPKYN----KDGARSRSTDQEG-RRMPPDPIPNRPL 1000
            VR  SF  D   RS    G   DF  + +    KD   +R+ +     +   +PIP+RPL
Sbjct: 40   VRHFSFTDDRSGRSKQPVGESDDFFREQSDSSFKDNGSNRTQESYNVEQSLSEPIPSRPL 99

Query: 999  RAEKPTHQR----KTFNQGRAS-----DGGGTGATRMD-FNKSGNXXXXXXXXXXXXXXX 850
            R +KP +Q     + +++G  S     D    G   +D  NKS                 
Sbjct: 100  RGKKPINQPPPRFREYDRGSHSFPPRFDDNHGGPDELDKINKSSQIDLAFQGTTNVAETN 159

Query: 849  ANL-----DFLEKFKLGFD-------KGGEKKEPPKMVYNNSNQPTALPQPEDADEIFKK 706
             ++      FL+KFKLGFD       +    K+  +   +N NQP     P+DA+EIFKK
Sbjct: 160  RDVGKSGGSFLDKFKLGFDDKTVNLSEVAASKQSEEAKRSNPNQPAQESMPQDANEIFKK 219

Query: 705  MKETGLIPNAVSMLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDD 526
            MKETGLIPNAV+ML GLC+DGLVQEA+KLF L+ EKGTIPE++IYTAV+EG+ KAHK DD
Sbjct: 220  MKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVEGYTKAHKADD 279

Query: 525  AKRIFRKMQGNGISPNAITYGILIQGLIHKKSLDDALVFSVEMLDAGHSPNLATFTGLVD 346
            AKRIFRKMQ +GISPNA +Y +LIQGL     L DA  F VEML+AGHSPN+  F GLVD
Sbjct: 280  AKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSPNVTAFVGLVD 339

Query: 345  CFCREKGLEEAQNMIKMLREKSFFFEEKAVREYLDKKGPFSQLVWEAILGK 193
             FC EKG+EEA++ IK L EK F   EKAV ++LDKK PFS  VWEAI GK
Sbjct: 340  GFCNEKGVEEAKSAIKTLTEKGFVVNEKAVGQFLDKKAPFSPSVWEAIFGK 390


>ref|NP_195528.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|79326453|ref|NP_001031806.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|75266764|sp|Q9SZL5.1|PP356_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g38150 gi|4467121|emb|CAB37555.1| putative protein
            [Arabidopsis thaliana] gi|7270799|emb|CAB80480.1|
            putative protein [Arabidopsis thaliana]
            gi|26453272|dbj|BAC43709.1| unknown protein [Arabidopsis
            thaliana] gi|332661484|gb|AEE86884.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|332661485|gb|AEE86885.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 302

 Score =  282 bits (721), Expect = 1e-73
 Identities = 146/279 (52%), Positives = 186/279 (66%)
 Frame = -3

Query: 1029 PPDPIPNRPLRAEKPTHQRKTFNQGRASDGGGTGATRMDFNKSGNXXXXXXXXXXXXXXX 850
            PP+P+PNRPLR E+ ++  +     +A + G +  T  D                     
Sbjct: 45   PPEPLPNRPLRGERSSNSHREPPARQAHNLGKSDTTLSDDG------------------- 85

Query: 849  ANLDFLEKFKLGFDKGGEKKEPPKMVYNNSNQPTALPQPEDADEIFKKMKETGLIPNAVS 670
                FLE+FKLG ++   +   P+       +P  LP PED+DEIFKKMKE GLIPNAV+
Sbjct: 86   ----FLEQFKLGVNQDSRETPKPEQY---PQEP--LPPPEDSDEIFKKMKEGGLIPNAVA 136

Query: 669  MLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFRKMQGNG 490
            ML GLC+DGLVQEAMKLF LM +KGTIPEV+IYTAV+E FCKAHK++DAKRIFRKMQ NG
Sbjct: 137  MLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEAFCKAHKIEDAKRIFRKMQNNG 196

Query: 489  ISPNAITYGILIQGLIHKKSLDDALVFSVEMLDAGHSPNLATFTGLVDCFCREKGLEEAQ 310
            I+PNA +YG+L+QGL +   LDDA+ F  EML++GHSPN+ TF  LVD  CR KG+E+AQ
Sbjct: 197  IAPNAFSYGVLVQGLYNCNMLDDAVAFCSEMLESGHSPNVPTFVELVDALCRVKGVEQAQ 256

Query: 309  NMIKMLREKSFFFEEKAVREYLDKKGPFSQLVWEAILGK 193
            + I  L +K F    KAV+E++DK+ PF  L WEAI  K
Sbjct: 257  SAIDTLNQKGFAVNVKAVKEFMDKRAPFPSLAWEAIFKK 295


>gb|AAL77701.1| AT4g38150/F20D10_270 [Arabidopsis thaliana]
            gi|23505863|gb|AAN28791.1| At4g38150/F20D10_270
            [Arabidopsis thaliana]
          Length = 302

 Score =  281 bits (718), Expect = 3e-73
 Identities = 145/279 (51%), Positives = 186/279 (66%)
 Frame = -3

Query: 1029 PPDPIPNRPLRAEKPTHQRKTFNQGRASDGGGTGATRMDFNKSGNXXXXXXXXXXXXXXX 850
            PP+P+PNRPLR E+ ++  +     +A + G +  T  D                     
Sbjct: 45   PPEPLPNRPLRGERSSNSHREPPARQAHNLGKSDTTLSDDG------------------- 85

Query: 849  ANLDFLEKFKLGFDKGGEKKEPPKMVYNNSNQPTALPQPEDADEIFKKMKETGLIPNAVS 670
                FLE+FKLG ++   +   P+       +P  LP PED+DEIFKKMKE GLIPNAV+
Sbjct: 86   ----FLEQFKLGVNQDSRETPKPEQY---PQEP--LPPPEDSDEIFKKMKEGGLIPNAVA 136

Query: 669  MLHGLCQDGLVQEAMKLFSLMHEKGTIPEVIIYTAVIEGFCKAHKLDDAKRIFRKMQGNG 490
            ML GLC+DGLVQEAMKLF LM +KGTIPEV+IYTAV+E FCKAHK++DAKRIFRKMQ NG
Sbjct: 137  MLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEAFCKAHKIEDAKRIFRKMQNNG 196

Query: 489  ISPNAITYGILIQGLIHKKSLDDALVFSVEMLDAGHSPNLATFTGLVDCFCREKGLEEAQ 310
            I+PNA +YG+L+QGL +   LDDA+ F  +ML++GHSPN+ TF  LVD  CR KG+E+AQ
Sbjct: 197  IAPNAFSYGVLVQGLYNCNMLDDAVAFCSDMLESGHSPNVPTFVELVDALCRVKGVEQAQ 256

Query: 309  NMIKMLREKSFFFEEKAVREYLDKKGPFSQLVWEAILGK 193
            + I  L +K F    KAV+E++DK+ PF  L WEAI  K
Sbjct: 257  SAIDTLNQKGFAVNVKAVKEFMDKRAPFPSLAWEAIFKK 295

