BLASTX nr result

ID: Cephaelis21_contig00044978 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00044978
         (597 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278719.1| PREDICTED: pentatricopeptide repeat-containi...   226   2e-57
ref|XP_002530010.1| pentatricopeptide repeat-containing protein,...   197   1e-48
ref|XP_002324074.1| predicted protein [Populus trichocarpa] gi|2...   184   1e-44
ref|XP_003618546.1| Pentatricopeptide repeat-containing protein ...   176   4e-42
ref|XP_002873720.1| pentatricopeptide repeat-containing protein ...   168   6e-40

>ref|XP_002278719.1| PREDICTED: pentatricopeptide repeat-containing protein At5g15340,
           mitochondrial-like [Vitis vinifera]
          Length = 632

 Score =  226 bits (576), Expect = 2e-57
 Identities = 107/193 (55%), Positives = 136/193 (70%), Gaps = 2/193 (1%)
 Frame = -2

Query: 575 WGHH--VSSLARHYRTLLRSTARHSSFDVGQKLHATAVTTGLLALSLPNAFLRNTILHVY 402
           W  H  +SS++RHYR LLRS AR SS D+G++LHAT +TTG+     P  FL N +L  Y
Sbjct: 3   WSRHTALSSVSRHYRFLLRSCARESSLDIGERLHATIITTGIAGA--PETFLHNALLQFY 60

Query: 401 AACGDCRSARKVFDEIPNRHKDTVDWTTLMNCYTRAGFPLESLNLFIDMRKLGTPIDDIT 222
           A+CG    ARKVFDEIP+ HKDTVDWTTLM C+ R     E+L +F++MR+ G   D++T
Sbjct: 61  ASCGCAWQARKVFDEIPHSHKDTVDWTTLMGCFVRHNVSDEALLIFVEMRRCGVKPDEVT 120

Query: 221 LVSFFSACAKMGKGRLGHQGHACLIKMGFWNTEKTCNATMDMYVKCGLMHEARRVFYDMN 42
           LV  F  CA++G   +G QGH C++KMG    EK CNA MDMY K GLM EARRVFY+M 
Sbjct: 121 LVCLFGGCARLGDVVVGAQGHGCMVKMGLGGVEKACNAVMDMYAKSGLMGEARRVFYEMK 180

Query: 41  ERSIVSWTVLLEG 3
            +S+VSWTV+L+G
Sbjct: 181 GQSVVSWTVILDG 193



 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 48/133 (36%), Positives = 71/133 (53%), Gaps = 6/133 (4%)
 Frame = -2

Query: 383 RSARKVFDEIPNRHKDTVDWTTLMNCYTRAGFPLESLNLFIDMR-KLGTPIDDITLVSFF 207
           R+ R VFDE+P R++  V WT ++  Y  +G   ES  L  +M   L   ++ +TL S  
Sbjct: 201 RNGRVVFDEMPERNE--VAWTIMIAGYLDSGLTQESFALVREMIFDLEMELNYVTLCSIL 258

Query: 206 SACAKMGKGRLGHQGHACLIKMGFWNTEKTCN-----ATMDMYVKCGLMHEARRVFYDMN 42
           +AC++ G   +G   HA  +K      EK  N     A +DMY KCG +H A + F  M 
Sbjct: 259 TACSQSGDLMMGRWVHAYALK----TKEKELNIMVGTAMVDMYAKCGRIHIAFKFFKKMP 314

Query: 41  ERSIVSWTVLLEG 3
           +R++VSW  +L G
Sbjct: 315 QRNVVSWNAMLSG 327


>ref|XP_002530010.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223530489|gb|EEF32372.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 487

 Score =  197 bits (501), Expect = 1e-48
 Identities = 96/184 (52%), Positives = 125/184 (67%), Gaps = 1/184 (0%)
 Frame = -2

Query: 551 ARHYRTLLRSTARHSSFDVGQKLHATAVTTGLLALSLPNAFLRNTILHVYAACGDCRSAR 372
           +RH R+LLRS AR SS   G+KLHA  +TTG+   + PNAFL N +LH+Y+ CG  R A 
Sbjct: 24  SRHLRSLLRSCARESSLSTGKKLHAILLTTGVA--TSPNAFLLNALLHLYSQCGITRYAH 81

Query: 371 KVFDEIPNRHKDTVDWTTLMNCYTR-AGFPLESLNLFIDMRKLGTPIDDITLVSFFSACA 195
            +FD+IPN HKDT DWT+L++C  +    P  + +LF +MRK G  +DD+  V  FS CA
Sbjct: 82  HLFDQIPNSHKDTADWTSLLSCLAKHTSTPRNAFSLFEEMRKRGVILDDVAFVCVFSLCA 141

Query: 194 KMGKGRLGHQGHACLIKMGFWNTEKTCNATMDMYVKCGLMHEARRVFYDMNERSIVSWTV 15
           ++G   +G Q H C++KMGF    K CNA M++YVKC LM EA+ VF +M ER IVSWT 
Sbjct: 142 RVGNLEMGRQAHGCVVKMGFGINVKVCNAVMNVYVKCRLMGEAKGVFSEMGERDIVSWTA 201

Query: 14  LLEG 3
           LLEG
Sbjct: 202 LLEG 205



 Score = 67.8 bits (164), Expect = 1e-09
 Identities = 56/205 (27%), Positives = 90/205 (43%), Gaps = 33/205 (16%)
 Frame = -2

Query: 518 ARHSSFDVGQKLHATAVTTGLLALSLPNAFLRNTILHVYAAC---GDCR----------- 381
           AR  + ++G++ H   V  G       N  + N +++VY  C   G+ +           
Sbjct: 141 ARVGNLEMGRQAHGCVVKMGFGI----NVKVCNAVMNVYVKCRLMGEAKGVFSEMGERDI 196

Query: 380 -----------------SARKVFDEIPNRHKDTVDWTTLMNCYTRAGFPLESLNLFIDM- 255
                            + + VFD++P R++  V WT +++ Y  +GF  E   L  +M 
Sbjct: 197 VSWTALLEGVVNWEGVENGKVVFDQMPERNE--VGWTIMISGYVGSGFCKEGFLLLSEMV 254

Query: 254 RKLGTPIDDITLVSFFSACAKMGKGRLGHQGHA-CLIKMGFWNTEKTCNATMDMYVKCGL 78
             L   ++ +TL S  SACA+ G   +G   H   L KMG         A +DMY KCG 
Sbjct: 255 LGLRLELNYVTLCSILSACAQSGDVVMGRWVHVYALKKMGREIDMMVGTALIDMYAKCGR 314

Query: 77  MHEARRVFYDMNERSIVSWTVLLEG 3
           +  A  VF  +  R++V+W  +L G
Sbjct: 315 IKMAYEVFKYLPRRNVVAWNAILGG 339


>ref|XP_002324074.1| predicted protein [Populus trichocarpa] gi|222867076|gb|EEF04207.1|
           predicted protein [Populus trichocarpa]
          Length = 636

 Score =  184 bits (466), Expect = 1e-44
 Identities = 93/188 (49%), Positives = 122/188 (64%), Gaps = 1/188 (0%)
 Frame = -2

Query: 563 VSSLARHYRTLLRSTARHSSFDVGQKLHATAVTTGLLALSLPNAFLRNTILHVYAACGDC 384
           + SL   +R+LLRS AR+SS   G+KLHA  +T+GL A S PN FL N + H+YA+CG  
Sbjct: 11  LQSLPARFRSLLRSCARNSSLSTGKKLHAVILTSGL-ASSSPNTFLLNALHHLYASCGVT 69

Query: 383 RSARKVFDEIPNRHKDTVDWTTLMNCYTRAGF-PLESLNLFIDMRKLGTPIDDITLVSFF 207
            SAR +F +IP  HKD  DWTTL+    + G  P E    F +MRK G  +DD+ ++S F
Sbjct: 70  SSARHLFYQIPRSHKDVTDWTTLLTSLVQHGTKPSEGFFFFKEMRKEGVVLDDVAMISVF 129

Query: 206 SACAKMGKGRLGHQGHACLIKMGFWNTEKTCNATMDMYVKCGLMHEARRVFYDMNERSIV 27
             C ++    +G Q   CL+KMG     K CNA M+MYVKCGL+ E RRVF +MNER++V
Sbjct: 130 VLCTRVEDLGMGRQAQGCLVKMGLGLGVKVCNAIMNMYVKCGLVEEVRRVFCEMNERNVV 189

Query: 26  SWTVLLEG 3
           SW+ LLEG
Sbjct: 190 SWSTLLEG 197



 Score = 67.0 bits (162), Expect = 2e-09
 Identities = 46/142 (32%), Positives = 71/142 (50%), Gaps = 2/142 (1%)
 Frame = -2

Query: 422 NTILHVYAACGDCRSARKVFDEIPNRHKDTVDWTTLMNCYTRAGFPLESLNLFIDM-RKL 246
           +T+L          + R VFDE+P R++  V WT ++  Y   GF  E   L  +M  + 
Sbjct: 192 STLLEGVVKWEGVENGRVVFDEMPERNE--VGWTIMIAGYVGNGFSREGFLLLDEMVLRF 249

Query: 245 GTPIDDITLVSFFSACAKMGKGRLGHQGHACLIK-MGFWNTEKTCNATMDMYVKCGLMHE 69
              ++ +TL S  SACA+ G   +G   H   +K MG         A +DMY KCG +  
Sbjct: 250 RLGLNFVTLSSILSACAQSGDVLMGRWVHVYALKGMGREMHIMVGTALVDMYAKCGPIDM 309

Query: 68  ARRVFYDMNERSIVSWTVLLEG 3
           A +VF  + +R++V+W  +L G
Sbjct: 310 AFKVFKYLPKRNVVAWNAMLGG 331


>ref|XP_003618546.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355493561|gb|AES74764.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 637

 Score =  176 bits (445), Expect = 4e-42
 Identities = 92/189 (48%), Positives = 128/189 (67%), Gaps = 2/189 (1%)
 Frame = -2

Query: 563 VSSLARHYRTLLRSTARHSSFDVGQKLHATAVTTGLLALSLPNAFLRNTILHVYAACGDC 384
           +SSLA H+R+LLR  +R ++   GQ+LHATA+ TGL+  S PN FLRN +LH+Y +C   
Sbjct: 16  LSSLALHFRSLLRQCSRATALRPGQQLHATAIVTGLI--SSPNHFLRNALLHLYGSCSLP 73

Query: 383 RSARKVFDEIPNRHKDTVDWTTLMNCYTRAGFPLESLNLFIDMRKLGTPIDDITLVSFFS 204
             ARK+FDEIP  HKD+VD+T L+    R   P ESL LFI MR+   P+D + +V   +
Sbjct: 74  SHARKLFDEIPQSHKDSVDYTALI----RHCPPFESLKLFIQMRQFDLPLDGVVMVCALN 129

Query: 203 ACAKMGKG--RLGHQGHACLIKMGFWNTEKTCNATMDMYVKCGLMHEARRVFYDMNERSI 30
           ACA++G G  ++G Q H  ++K GF   +K CNA M++YVK GL+ EAR++F  +  RS+
Sbjct: 130 ACARLGGGDTKVGSQMHVGVVKFGFVKFDKVCNALMNVYVKFGLVGEARKMFEGIEVRSV 189

Query: 29  VSWTVLLEG 3
           VSW+  LEG
Sbjct: 190 VSWSCFLEG 198



 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 44/128 (34%), Positives = 67/128 (52%), Gaps = 2/128 (1%)
 Frame = -2

Query: 380 SARKVFDEIPNRHKDTVDWTTLMNCYTRAGFPLESLNLFIDMR-KLGTPIDDITLVSFFS 204
           S R +FDE+P R++  V WT ++  Y   GF  E+  L  +M    G  +  +TL S  S
Sbjct: 207 SGRVLFDEMPERNE--VAWTVMIVGYVGNGFTKEAFLLLKEMVFGCGFRLSFVTLCSVLS 264

Query: 203 ACAKMGKGRLGHQGHACLIK-MGFWNTEKTCNATMDMYVKCGLMHEARRVFYDMNERSIV 27
           AC++ G   +G   H   +K MG         + +DMY KCG ++ A  VF  M +R++V
Sbjct: 265 ACSQSGDVCVGRWVHCYAVKEMGLDFGVMVGTSLVDMYAKCGRINAALSVFRSMLKRNVV 324

Query: 26  SWTVLLEG 3
           +W  +L G
Sbjct: 325 AWNAMLGG 332


>ref|XP_002873720.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297319557|gb|EFH49979.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 623

 Score =  168 bits (426), Expect = 6e-40
 Identities = 80/181 (44%), Positives = 116/181 (64%)
 Frame = -2

Query: 548 RHYRTLLRSTARHSSFDVGQKLHATAVTTGLLALSLPNAFLRNTILHVYAACGDCRSARK 369
           ++ R LLR +A  S    G++LHA   T+GL     P ++L N +   YA+ G+  +A+K
Sbjct: 7   QNVRLLLRQSAHRSFLHPGRELHAVLTTSGLK--KAPRSYLSNALFQFYASSGEIATAQK 64

Query: 368 VFDEIPNRHKDTVDWTTLMNCYTRAGFPLESLNLFIDMRKLGTPIDDITLVSFFSACAKM 189
           +FDEIP   KD VDWTTL++ ++R G  + S+ LF++MR+    ID ++LV  F  CAK+
Sbjct: 65  LFDEIPLSDKDNVDWTTLLSSFSRFGLLVNSMKLFVEMRRKRVEIDHVSLVCLFGVCAKL 124

Query: 188 GKGRLGHQGHACLIKMGFWNTEKTCNATMDMYVKCGLMHEARRVFYDMNERSIVSWTVLL 9
              R G QGH   +KMGF  + K CNA MDMY KCG + E +R+F  + E+S+VSWTV+L
Sbjct: 125 EDLRFGEQGHGVAVKMGFLTSVKVCNALMDMYGKCGFVSEVKRIFQALEEKSVVSWTVVL 184

Query: 8   E 6
           +
Sbjct: 185 D 185



 Score = 73.9 bits (180), Expect = 2e-11
 Identities = 46/135 (34%), Positives = 69/135 (51%), Gaps = 8/135 (5%)
 Frame = -2

Query: 383 RSARKVFDEIPNRHKDTVDWTTLMNCYTRAGFPLESLNLFIDMR-KLGTPIDDITLVSFF 207
           +  R+VFDE+P R+   V WT ++  Y  AGF  E L L  +M  + G  ++ +TL S  
Sbjct: 194 KRGREVFDEMPERN--VVAWTLMVAGYLGAGFTREVLELLAEMVFRCGHGLNFVTLCSML 251

Query: 206 SACAKMGKGRLGHQGHACLIKMGFWNTEKTC-------NATMDMYVKCGLMHEARRVFYD 48
           SACA+ G   +G   H   +K      E+          A +DMY KCG +  + +VF  
Sbjct: 252 SACAQSGNLVIGRWVHVYALKKAMMMGEEETYDGVMVGTALVDMYAKCGNIDSSIKVFRL 311

Query: 47  MNERSIVSWTVLLEG 3
           M +R++V+W  L  G
Sbjct: 312 MRKRNVVTWNALFSG 326

