BLASTX nr result

ID: Cephaelis21_contig00028555 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00028555
         (1362 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containi...   223   7e-56
ref|XP_002526948.1| pentatricopeptide repeat-containing protein,...   154   4e-35
ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containi...   144   5e-32
ref|XP_002873660.1| pentatricopeptide repeat-containing protein ...   139   1e-30
ref|XP_002324000.1| predicted protein [Populus trichocarpa] gi|2...   139   1e-30

>ref|XP_002268821.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46610
           [Vitis vinifera]
          Length = 763

 Score =  223 bits (569), Expect = 7e-56
 Identities = 138/314 (43%), Positives = 185/314 (58%), Gaps = 3/314 (0%)
 Frame = -2

Query: 935 MQSLSTWSSINESRLVPQSDSELGSSSVSRTTKFIRKRFFLGTPICHFSIPAGLLRVSRD 756
           MQ+LS W S      VPQ D  LGSSS+    +  RK +    P+C +   A  L VS  
Sbjct: 1   MQALSVWPSKGVFWAVPQLDYNLGSSSIPSRRRGRRKLWNPEDPVCQYRSLA-FLWVSSS 59

Query: 755 CRGISSRTCFSGRNWDVKHKLSTKYSK---HLFSEPNRESLGASFALAWVLEERAIGNHA 585
            R            +D    L + YSK    L  E  R S GASFALAW LE++AIGN  
Sbjct: 60  SRSDRVGVYCGSPKFDFGCGLLSGYSKLKIFLLCERKRGSFGASFALAWALEQQAIGNEF 119

Query: 584 GTNNSASLNGVSEKTENGDCHSAGLEEEVRVLDLAFGCDHVETVDGEDHDKEEDASEEGV 405
              +S S++ ++  TE  D     +             D     D  D+++E++A + G 
Sbjct: 120 VKEDSNSIHSLAGNTETVDIDCLKV-------------DGARDGDENDNEEEKEAEKNGE 166

Query: 404 GVIQNRKLVDVRALGRRLHMARTADDVEEVLKKRGKLPLQVYSSLMRGFGKEKRLHAAVA 225
            + +  + VDVRAL   L  A TADDVEEVLK + +LPLQVYS+++RGFG +KRL AA+A
Sbjct: 167 VIEEKSRNVDVRALAHGLEFATTADDVEEVLKDKVELPLQVYSTMIRGFGTDKRLDAAMA 226

Query: 224 LFEWLRRKSQKTNGAIRPNLFIYNSLLGSIKHAQEYGLVELVVNDMVVEDINPDIVTYNT 45
           L EWL+RK ++TNG+  PNLF+YNSLLG++K ++++ LVE V+NDM  E I P++VTYNT
Sbjct: 227 LVEWLKRK-KETNGSKGPNLFVYNSLLGAVKQSEKFALVEKVMNDMAREGILPNVVTYNT 285

Query: 44  LMGIYIGQGREVEA 3
           LM IY+ QGR VEA
Sbjct: 286 LMSIYLEQGRSVEA 299


>ref|XP_002526948.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223533700|gb|EEF35435.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 671

 Score =  154 bits (390), Expect = 4e-35
 Identities = 88/221 (39%), Positives = 140/221 (63%), Gaps = 5/221 (2%)
 Frame = -2

Query: 650 ESLGASFALAWVLEERAIGNHAGTNNSASLNGVSEKTENGDCHSAGLEEEVRVLDLAFGC 471
           +S  +S A AW L+++ I +       +  +G+  K+E  D +   L             
Sbjct: 2   DSFRSSIAFAWALQKQDISSEFHGVEPSLDDGLLGKSEKEDVNPHNL------------- 48

Query: 470 DHVETVDGEDHDKEEDA-----SEEGVGVIQNRKLVDVRALGRRLHMARTADDVEEVLKK 306
             +E  D +++++E++      S+EGVG  + R  +DVR+L R LH A+TADDVEEVLK 
Sbjct: 49  GRLEDSDDDNNNQEDNIELDLRSKEGVGEEKCRS-IDVRSLARSLHSAQTADDVEEVLKD 107

Query: 305 RGKLPLQVYSSLMRGFGKEKRLHAAVALFEWLRRKSQKTNGAIRPNLFIYNSLLGSIKHA 126
           +G+LPLQVYSS+++ FG + ++ +A+AL EWL+R+ ++   +I PNLFIYNSLL ++K +
Sbjct: 108 KGELPLQVYSSMIKAFGWDNKMESALALVEWLKRR-KEIGSSIGPNLFIYNSLLSAVKKS 166

Query: 125 QEYGLVELVVNDMVVEDINPDIVTYNTLMGIYIGQGREVEA 3
           + +   E ++NDM  E I P++VTYNTLMGIY+ +G+  +A
Sbjct: 167 KLFEEAEKILNDMTQEGIAPNVVTYNTLMGIYVEKGQATKA 207


>ref|XP_003548551.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g46610-like [Glycine max]
          Length = 808

 Score =  144 bits (363), Expect = 5e-32
 Identities = 88/218 (40%), Positives = 133/218 (61%), Gaps = 5/218 (2%)
 Frame = -2

Query: 656 NRESLGA-SFALAWVLEERAIGNHAGTNNSASLNG----VSEKTENGDCHSAGLEEEVRV 492
           NRES G  S  L  V +    G   G ++ +  +G    V E+T++ D    G  E V+ 
Sbjct: 126 NRESEGVKSLNLDQVQDSDFEGQIRGYDDDSKESGGNELVEEQTDSNDALVNGDLEGVKS 185

Query: 491 LDLAFGCDHVETVDGEDHDKEEDASEEGVGVIQNRKLVDVRALGRRLHMARTADDVEEVL 312
           L+L    D V+  D E     +D S+EG G  ++   VDVRAL   L   +T +DV  +L
Sbjct: 186 LNL----DQVKDSDCEGKMCGDDNSKEG-GEEESDGKVDVRALALSLQTVKTVEDVGGIL 240

Query: 311 KKRGKLPLQVYSSLMRGFGKEKRLHAAVALFEWLRRKSQKTNGAIRPNLFIYNSLLGSIK 132
           K +G LPLQV+S+++ GFGKEKR+ +A+ LF W++++  +TNG+  PNLFIYN LLG +K
Sbjct: 241 KDKGDLPLQVFSTIISGFGKEKRMDSALILFNWMKKRKIETNGSFGPNLFIYNGLLGVVK 300

Query: 131 HAQEYGLVELVVNDMVVEDINPDIVTYNTLMGIYIGQG 18
            + ++  +E+++N+M  + I  ++VTYNTLM IYI +G
Sbjct: 301 QSGQFAEMEVILNEMAEDGIAYNVVTYNTLMAIYIEKG 338


>ref|XP_002873660.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297319497|gb|EFH49919.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 674

 Score =  139 bits (351), Expect = 1e-30
 Identities = 86/222 (38%), Positives = 120/222 (54%), Gaps = 1/222 (0%)
 Frame = -2

Query: 680 SKHLF-SEPNRESLGASFALAWVLEERAIGNHAGTNNSASLNGVSEKTENGDCHSAGLEE 504
           SK LF  EP R   G+S  + W  E+R +G    T +S+                     
Sbjct: 70  SKVLFLCEPKRNLSGSSVGVGWATEQRELGEEVSTEDSS--------------------- 108

Query: 503 EVRVLDLAFGCDHVETVDGEDHDKEEDASEEGVGVIQNRKLVDVRALGRRLHMARTADDV 324
                       + +TV+G +               +    VDVR L   L  A+TADDV
Sbjct: 109 ------------YPQTVNGGE---------------KTNSRVDVRELAYSLRAAKTADDV 141

Query: 323 EEVLKKRGKLPLQVYSSLMRGFGKEKRLHAAVALFEWLRRKSQKTNGAIRPNLFIYNSLL 144
           + V+K+ G+LPLQVY +++RGFGK+KRL  A+A+ +WLRRK  ++ G I PNLFIYNSLL
Sbjct: 142 DIVIKEMGELPLQVYCAMIRGFGKDKRLKPAIAVVDWLRRKKSESGGVIGPNLFIYNSLL 201

Query: 143 GSIKHAQEYGLVELVVNDMVVEDINPDIVTYNTLMGIYIGQG 18
           G++K +   G  E +++DM  E I P+IVTYNTLM IY+ +G
Sbjct: 202 GAMKQS-SVGEAEKILSDMEEEGIVPNIVTYNTLMVIYMEKG 242


>ref|XP_002324000.1| predicted protein [Populus trichocarpa] gi|222867002|gb|EEF04133.1|
           predicted protein [Populus trichocarpa]
          Length = 709

 Score =  139 bits (351), Expect = 1e-30
 Identities = 104/311 (33%), Positives = 154/311 (49%)
 Frame = -2

Query: 935 MQSLSTWSSINESRLVPQSDSELGSSSVSRTTKFIRKRFFLGTPICHFSIPAGLLRVSRD 756
           MQ+LS W     S  VP  + E  SS    T + I KR+ L   +      +G   VS D
Sbjct: 1   MQTLSVWPLSGGSCAVPHLEFEEDSSCFLSTRRGI-KRWGLVDNVFQ-GASSGFPMVSGD 58

Query: 755 CRGISSRTCFSGRNWDVKHKLSTKYSKHLFSEPNRESLGASFALAWVLEERAIGNHAGTN 576
            R +S+ +               K     F E    S G+S ALA  LE++ IGN     
Sbjct: 59  LRFLSNHS---------------KIKYVCFRETKEGSFGSSLALASALEQQKIGN----- 98

Query: 575 NSASLNGVSEKTENGDCHSAGLEEEVRVLDLAFGCDHVETVDGEDHDKEEDASEEGVGVI 396
                          + H      + R L  A          GE+ D++           
Sbjct: 99  ---------------EFHRVESSLDDRSLGEA----------GEERDEK----------- 122

Query: 395 QNRKLVDVRALGRRLHMARTADDVEEVLKKRGKLPLQVYSSLMRGFGKEKRLHAAVALFE 216
                +DV AL + L+ A+T DD+EEVLK +G+LP+QVY S+++GFG +K++  A+AL +
Sbjct: 123 -----IDVPALAQSLYFAKTVDDIEEVLKDKGELPVQVYLSMIKGFGWDKKMEPAIALVD 177

Query: 215 WLRRKSQKTNGAIRPNLFIYNSLLGSIKHAQEYGLVELVVNDMVVEDINPDIVTYNTLMG 36
           WL+ K ++T+G I PNLFIYNSLL ++K +++Y   E ++  M  E + P++VTYN LM 
Sbjct: 178 WLKIK-KETDGTIVPNLFIYNSLLSAVKQSEQYEETEKILERMTQEGVAPNVVTYNILMV 236

Query: 35  IYIGQGREVEA 3
           IY+ QG+  +A
Sbjct: 237 IYVKQGQAKKA 247