BLASTX nr result

ID: Cephaelis21_contig00029057 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00029057
         (1632 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]   324   4e-86
ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containi...   323   1e-85
ref|XP_002868835.1| pentatricopeptide repeat-containing protein ...   320   5e-85
ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi...   318   2e-84
ref|NP_195528.1| pentatricopeptide repeat-containing protein [Ar...   312   2e-82

>emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]
          Length = 381

 Score =  324 bits (831), Expect = 4e-86
 Identities = 172/315 (54%), Positives = 218/315 (69%), Gaps = 1/315 (0%)
 Frame = +2

Query: 416  KTSASEHNDNNYDNPPEPIPNRPLRGERRTPINPXXXXXXXXXXXXXFGEDESHARQQNE 595
            K+S+S     +  NPP PIPNRPLRGE+R    P              G D +    Q  
Sbjct: 83   KSSSSCGGGGSSSNPPNPIPNRPLRGEQRMNRPPPHIPQRKLGLPKDEGVDRA---SQAS 139

Query: 596  QLNRPRFGENAQRETGALEDSDFLEKFKLGFDRN-KGENPNTRQPSYNKQRGGEKYDQAA 772
              N+P   E    + GA  +  FLE+FKLG  +  + +     QPS  +     K     
Sbjct: 140  PFNQPSPAE----KVGATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGK----- 190

Query: 773  ETPPPSQPPEDADEIFRKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRERGTIPEV 952
                  QPP++ADEIFRKMKE+GLIPNAVAMLDGLCKDGLVQEAMKLFGLMRE+GTIPEV
Sbjct: 191  -----EQPPQNADEIFRKMKESGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEV 245

Query: 953  VIYTAVVEGFCKAHKVDDAVRIFKKMQSIGITPNAFSYGILIQGLLRVKRLDDAREFCLE 1132
            VIYTAVVEGFCKA ++DDAVRIF+KMQ+ GI+PNAFSY +LI+G+ +  RLD A +FC+E
Sbjct: 246  VIYTAVVEGFCKARQLDDAVRIFRKMQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVE 305

Query: 1133 MLEGGHSPNVVTFIGLVDAYCREKGLEEAQTMLSSLREKGFYLNEKAVREHLEKKGPFLP 1312
            MLE GHSPNV T + L+  +C+EKG+EEA+ ++ +L++KG ++++KAVRE+L+KKGP  P
Sbjct: 306  MLEAGHSPNVATLVDLIHEFCKEKGVEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQSP 365

Query: 1313 LVWEAIFGKKSSERS 1357
            LVWEA FGKKS +RS
Sbjct: 366  LVWEAFFGKKSPQRS 380


>ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Vitis vinifera]
          Length = 380

 Score =  323 bits (827), Expect = 1e-85
 Identities = 170/317 (53%), Positives = 218/317 (68%), Gaps = 1/317 (0%)
 Frame = +2

Query: 410  FSKTSASEHNDNNYDNPPEPIPNRPLRGERRTPINPXXXXXXXXXXXXXFGEDESHARQQ 589
            + + S+S     +  NPP PIPNRPLRGE+R    P              G D +    Q
Sbjct: 80   YGRKSSSSCGGGSSSNPPNPIPNRPLRGEQRMNRPPPHIPQRKLGLPKDEGVDRA---SQ 136

Query: 590  NEQLNRPRFGENAQRETGALEDSDFLEKFKLGFDRN-KGENPNTRQPSYNKQRGGEKYDQ 766
                N+P   E    + GA  +  FLE+FKLG  +  + +     QPS  +     K   
Sbjct: 137  ASPFNQPSPAE----KVGATLEDGFLERFKLGVQKKERPQESAAAQPSREQDANHGK--- 189

Query: 767  AAETPPPSQPPEDADEIFRKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRERGTIP 946
                    QPP++ADEIFRKMKE+GLIPNAVAMLDGLCKDGLVQEAMKLFGLMRE+GTIP
Sbjct: 190  -------EQPPQNADEIFRKMKESGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIP 242

Query: 947  EVVIYTAVVEGFCKAHKVDDAVRIFKKMQSIGITPNAFSYGILIQGLLRVKRLDDAREFC 1126
            EVVIYTAVVEGFCKA +++DAVRIF+KMQ+ GI+PNAFSY +LI+G+ +  RLD A +FC
Sbjct: 243  EVVIYTAVVEGFCKARQLNDAVRIFRKMQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFC 302

Query: 1127 LEMLEGGHSPNVVTFIGLVDAYCREKGLEEAQTMLSSLREKGFYLNEKAVREHLEKKGPF 1306
            +EMLE GHSPNV T + L+  +C+EKG+EEA+ ++ +L++KG ++++KAVRE+L+KKGP 
Sbjct: 303  VEMLEAGHSPNVATLVDLIHEFCKEKGVEEAKNVIVTLKQKGLFVDDKAVREYLDKKGPQ 362

Query: 1307 LPLVWEAIFGKKSSERS 1357
             PLVWEA FGKKS +RS
Sbjct: 363  SPLVWEAFFGKKSPQRS 379


>ref|XP_002868835.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297314671|gb|EFH45094.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 301

 Score =  320 bits (821), Expect = 5e-85
 Identities = 170/311 (54%), Positives = 205/311 (65%)
 Frame = +2

Query: 422  SASEHNDNNYDNPPEPIPNRPLRGERRTPINPXXXXXXXXXXXXXFGEDESHARQQNEQL 601
            S  +       NPPEP+PNRPLRGER +                     E  ARQ ++  
Sbjct: 32   STGDKGQEKQQNPPEPLPNRPLRGERSSN-----------------SHREPPARQAHD-- 72

Query: 602  NRPRFGENAQRETGALEDSDFLEKFKLGFDRNKGENPNTRQPSYNKQRGGEKYDQAAETP 781
                      +    L D  FLE+FKLG +++  E P             E+Y Q     
Sbjct: 73   --------LGKIDNTLSDDGFLEQFKLGVNQDSQETPKP-----------EQYPQ----- 108

Query: 782  PPSQPPEDADEIFRKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRERGTIPEVVIY 961
             P  PPED+DEIF+KMKE GLIPNAVAMLDGLCKDGLVQEAMKLFGLMR++GTIPEVVIY
Sbjct: 109  DPLLPPEDSDEIFKKMKEGGLIPNAVAMLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIY 168

Query: 962  TAVVEGFCKAHKVDDAVRIFKKMQSIGITPNAFSYGILIQGLLRVKRLDDAREFCLEMLE 1141
            TAVVEGFCKAHK++DA RIF+KMQ+ GITPNAFSYG+L+QGL     LDDA  FC EMLE
Sbjct: 169  TAVVEGFCKAHKIEDAKRIFRKMQTNGITPNAFSYGVLVQGLYNCNMLDDAVTFCCEMLE 228

Query: 1142 GGHSPNVVTFIGLVDAYCREKGLEEAQTMLSSLREKGFYLNEKAVREHLEKKGPFLPLVW 1321
             GHSPN+ TF+GLVDA CREKG+E+AQ+ +  L +KGF LN KAV+E ++K+ PF  L W
Sbjct: 229  SGHSPNIPTFVGLVDALCREKGVEQAQSAIDGLNQKGFALNVKAVKEFMDKRAPFPSLAW 288

Query: 1322 EAIFGKKSSER 1354
            EAIF KK +++
Sbjct: 289  EAIFKKKPTDK 299


>ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein At4g38150-like
            [Glycine max]
          Length = 388

 Score =  318 bits (816), Expect = 2e-84
 Identities = 193/391 (49%), Positives = 241/391 (61%), Gaps = 25/391 (6%)
 Frame = +2

Query: 257  HSLYSSHFYRKTESYLHELLSNVVPSLLSMKIMRPFSTIRDVDS-----LHTHHSNFSKT 421
            H L S     K  S++H      +P LL  + +R FS   D        +      F + 
Sbjct: 12   HKLVSFSQIEKLVSFVH--CKQYLPPLL--ETVRHFSFTDDCSGRSKQPVGESDDFFLQQ 67

Query: 422  SASEHNDNNYDNPP--EPIPNRPLRGERRTPIN-PXXXXXXXXXXXXXF---------GE 565
            S S   DN   +    EPIP+RPLR   R P+N P             F         G 
Sbjct: 68   SDSSFKDNGESDQSLSEPIPSRPLRS--RKPVNQPPPRFQEYDRGSHSFPPRFYDNHGGP 125

Query: 566  DESHARQQNEQLNRPRFGENA---QRETGALEDSDFLEKFKLGFDRNKGENPNTRQPSYN 736
            DE     ++ +++      N     R+ G   DS FL KFKLGFD    +  N  + + +
Sbjct: 126  DELDQTNKSSKIDLAFQNTNVAKTNRDAGQSGDS-FLNKFKLGFD---DKTVNLSEVAAS 181

Query: 737  KQRGGEKYDQAAETPPPSQP-----PEDADEIFRKMKETGLIPNAVAMLDGLCKDGLVQE 901
            KQ       + A+   P+QP     P+DADEIF+KMKETGLIPNAVAMLDGLCKDGLVQE
Sbjct: 182  KQ------SEEAKRSNPNQPAQESMPQDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQE 235

Query: 902  AMKLFGLMRERGTIPEVVIYTAVVEGFCKAHKVDDAVRIFKKMQSIGITPNAFSYGILIQ 1081
            A+KLFGLMRE+GTIPE+VIYTAVVEG+ KAHK DDA RIF+KMQS G++PNAFSY +LIQ
Sbjct: 236  ALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKADDAKRIFRKMQSSGVSPNAFSYMVLIQ 295

Query: 1082 GLLRVKRLDDAREFCLEMLEGGHSPNVVTFIGLVDAYCREKGLEEAQTMLSSLREKGFYL 1261
            GL +  RL DA EFC+EMLE GHSPNV TF+GLVD +C EKG+EEA++ + +L +KGF +
Sbjct: 296  GLYKCSRLHDAFEFCVEMLEAGHSPNVTTFVGLVDGFCNEKGVEEAKSAIKTLTDKGFVV 355

Query: 1262 NEKAVREHLEKKGPFLPLVWEAIFGKKSSER 1354
            NEKAVR+ L+KK PF P VWEAIFGKK+ +R
Sbjct: 356  NEKAVRQFLDKKAPFSPSVWEAIFGKKAPQR 386


>ref|NP_195528.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|79326453|ref|NP_001031806.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|75266764|sp|Q9SZL5.1|PP356_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At4g38150 gi|4467121|emb|CAB37555.1| putative protein
            [Arabidopsis thaliana] gi|7270799|emb|CAB80480.1|
            putative protein [Arabidopsis thaliana]
            gi|26453272|dbj|BAC43709.1| unknown protein [Arabidopsis
            thaliana] gi|332661484|gb|AEE86884.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
            gi|332661485|gb|AEE86885.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 302

 Score =  312 bits (799), Expect = 2e-82
 Identities = 175/344 (50%), Positives = 215/344 (62%)
 Frame = +2

Query: 323  VVPSLLSMKIMRPFSTIRDVDSLHTHHSNFSKTSASEHNDNNYDNPPEPIPNRPLRGERR 502
            +VPS  ++   R  +    V +     + F  T  +   D    NPPEP+PNRPLRGER 
Sbjct: 1    MVPSSKAVVFARQMAKQIRVTTPSMSATRFLSTGDNGQVDEQ-QNPPEPLPNRPLRGERS 59

Query: 503  TPINPXXXXXXXXXXXXXFGEDESHARQQNEQLNRPRFGENAQRETGALEDSDFLEKFKL 682
            +                     E  ARQ +          N  +    L D  FLE+FKL
Sbjct: 60   SN-----------------SHREPPARQAH----------NLGKSDTTLSDDGFLEQFKL 92

Query: 683  GFDRNKGENPNTRQPSYNKQRGGEKYDQAAETPPPSQPPEDADEIFRKMKETGLIPNAVA 862
            G +++  E P             E+Y Q      P  PPED+DEIF+KMKE GLIPNAVA
Sbjct: 93   GVNQDSRETPKP-----------EQYPQE-----PLPPPEDSDEIFKKMKEGGLIPNAVA 136

Query: 863  MLDGLCKDGLVQEAMKLFGLMRERGTIPEVVIYTAVVEGFCKAHKVDDAVRIFKKMQSIG 1042
            MLDGLCKDGLVQEAMKLFGLMR++GTIPEVVIYTAVVE FCKAHK++DA RIF+KMQ+ G
Sbjct: 137  MLDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEAFCKAHKIEDAKRIFRKMQNNG 196

Query: 1043 ITPNAFSYGILIQGLLRVKRLDDAREFCLEMLEGGHSPNVVTFIGLVDAYCREKGLEEAQ 1222
            I PNAFSYG+L+QGL     LDDA  FC EMLE GHSPNV TF+ LVDA CR KG+E+AQ
Sbjct: 197  IAPNAFSYGVLVQGLYNCNMLDDAVAFCSEMLESGHSPNVPTFVELVDALCRVKGVEQAQ 256

Query: 1223 TMLSSLREKGFYLNEKAVREHLEKKGPFLPLVWEAIFGKKSSER 1354
            + + +L +KGF +N KAV+E ++K+ PF  L WEAIF KK +E+
Sbjct: 257  SAIDTLNQKGFAVNVKAVKEFMDKRAPFPSLAWEAIFKKKPTEK 300


Top