BLASTX nr result

ID: Cephaelis21_contig00016194 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00016194
         (1793 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN69960.1| hypothetical protein VITISV_032887 [Vitis vinifera]   593   e-167
ref|XP_002521681.1| pentatricopeptide repeat-containing protein,...   578   e-162
ref|XP_003626687.1| Pentatricopeptide repeat-containing protein ...   555   e-155
ref|XP_002306437.1| predicted protein [Populus trichocarpa] gi|2...   552   e-155
ref|XP_002884075.1| pentatricopeptide repeat-containing protein ...   536   e-150

>emb|CAN69960.1| hypothetical protein VITISV_032887 [Vitis vinifera]
          Length = 472

 Score =  593 bits (1530), Expect = e-167
 Identities = 299/472 (63%), Positives = 359/472 (76%), Gaps = 9/472 (1%)
 Frame = -2

Query: 1633 MGKFPPSMRGARLPIGALMKKP------LSEPQSPAQKPHYLAGXXXXXXXXKSM---PQ 1481
            MGK PPS R + +P+  L+K P       S      QKPH+                 P 
Sbjct: 1    MGKIPPSFRTSTVPVTTLLKNPPAVLPKQSTVLETPQKPHHFPKKRPQPSGKTKKTRTPI 60

Query: 1480 IAPRVPTITFDSPSLSDAKTIFNQLVSSPKKAPLDLRFCNAILQSFSSVGTLQDSIFLLN 1301
              P+ P I F+SP+L DAK +F  + ++    PLDLRF NA+LQS+SS+ T+ DSI  L 
Sbjct: 61   EDPKSPVI-FNSPNLLDAKKLFASITTT-STTPLDLRFHNALLQSYSSISTVNDSISFLR 118

Query: 1300 HMIKTHPVFSPDSSTYNILLVQCCNAEDFSISSVHQVLNYMSTQGFPPNKVTTDVAVRTL 1121
            HMIK+ P FSP+ STY+ILL Q C + +  +S+VHQ LN M T GFPP++VTTD+AVR+L
Sbjct: 119  HMIKSQPSFSPERSTYHILLSQSCKSPNSDLSAVHQTLNLMVTHGFPPDRVTTDIAVRSL 178

Query: 1120 CFAGREEDAIELVKELSGKNLTPDSYTYNFLVRHLVKNRELSTVNCFIKEMREGFDIKPD 941
            C AGREE AIELVKELS K+  PDS+TYNF++RHL K R LSTV  FI E++  F +KPD
Sbjct: 179  CSAGREEHAIELVKELSLKHSPPDSFTYNFIIRHLCKTRALSTVYNFIDELQNSFQLKPD 238

Query: 940  LVSYTIMIDNVCNRKNLREATRLLGVLREEGYKPDCYVYNTIMKGHCMLNQSGEVLDVYK 761
            LV+YTI+IDNVCN KNLREATRLL VL E G+KPDCYVYNTIMKG+C+L++  E + VYK
Sbjct: 239  LVTYTILIDNVCNGKNLREATRLLEVLGEAGFKPDCYVYNTIMKGYCILDKGSEAIGVYK 298

Query: 760  KMKEEGVQPDLVTYNTLIYGLSKSGRVKEAKKFLGVMTEMGQFPDAATYTSLMNGMCREG 581
            KMKEEGV+PDLVTYNTLI+GLSKSGRVKEA+KFL +M EMG FPDA TYTSLMNG+CREG
Sbjct: 299  KMKEEGVEPDLVTYNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGLCREG 358

Query: 580  DAIRALGLLEEMEAKGCSPNECTYNTLLHGLCKARLVDKGIEFYKVMQEGGMKIEPGSYG 401
            +A+ AL LLEEMEAKGCSPN CTYNTLLHGLCK R++++GIE Y VM+ GGMK+E  SY 
Sbjct: 359  NALGALALLEEMEAKGCSPNSCTYNTLLHGLCKLRMLERGIELYGVMKSGGMKLEKASYA 418

Query: 400  TLVRALCKNGRVAEAYEVFDYAIESKSLTDVSAYSTLESNLKWLKKAREQGL 245
            T VRALCK GRVAEAYE FDY +ESKS  DV+AYSTLE++LKWL+KAREQGL
Sbjct: 419  TFVRALCKEGRVAEAYEAFDYVVESKSFDDVTAYSTLENSLKWLRKAREQGL 470


>ref|XP_002521681.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539072|gb|EEF40668.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 458

 Score =  578 bits (1491), Expect = e-162
 Identities = 291/470 (61%), Positives = 364/470 (77%), Gaps = 5/470 (1%)
 Frame = -2

Query: 1633 MGKFPPSMRGARLPIGALMKKPLSEPQSPAQKPHYLAGXXXXXXXXKSMPQIAPRVPTIT 1454
            MGK PPS+R A +   AL++KP   P  P +KPHYL+         K+  Q++ ++PT  
Sbjct: 1    MGKVPPSLRSA-VSTTALLRKP--NPFPPPEKPHYLS--------KKTKLQLSQKIPTPI 49

Query: 1453 -----FDSPSLSDAKTIFNQLVSSPKKAPLDLRFCNAILQSFSSVGTLQDSIFLLNHMIK 1289
                 F SP L++AK IFN L+S+ +  PLDLRF ++ LQS+SS+ T+ DSI LL HMIK
Sbjct: 50   QQKRLFKSPELNEAKEIFNSLISTTR-VPLDLRFHHSFLQSYSSISTIDDSISLLRHMIK 108

Query: 1288 THPVFSPDSSTYNILLVQCCNAEDFSISSVHQVLNYMSTQGFPPNKVTTDVAVRTLCFAG 1109
            T P F+P  STY+ILL Q C A D ++S VHQ+LN M   GF P +VT D+AVR LC AG
Sbjct: 109  TLPSFTPTISTYHILLSQSCKAPDPTLSPVHQILNLMVNNGFMPTQVTVDIAVRALCSAG 168

Query: 1108 REEDAIELVKELSGKNLTPDSYTYNFLVRHLVKNRELSTVNCFIKEMREGFDIKPDLVSY 929
            +E+DA++LVKELS K+  PDS+TYNFLV+ L K R LS V  FI EMR  FD++P+LV+Y
Sbjct: 169  KEDDAVKLVKELSLKHSKPDSFTYNFLVKCLCKCRALSNVYSFIDEMRSSFDLEPNLVTY 228

Query: 928  TIMIDNVCNRKNLREATRLLGVLREEGYKPDCYVYNTIMKGHCMLNQSGEVLDVYKKMKE 749
            TI+IDNVCN KNLREA RLLG+LRE G+KPDC+VYNTIMKG+CML++  + + V+KKMKE
Sbjct: 229  TILIDNVCNSKNLREAMRLLGILRECGFKPDCFVYNTIMKGYCMLSKGSDAIQVFKKMKE 288

Query: 748  EGVQPDLVTYNTLIYGLSKSGRVKEAKKFLGVMTEMGQFPDAATYTSLMNGMCREGDAIR 569
            EG++PDL+TYNTLI+GLSK GRV EAK++L +M E G FPDA TYTSLMNG+CR+GDA+ 
Sbjct: 289  EGIEPDLITYNTLIFGLSKGGRVSEAKRYLKIMVESGHFPDAVTYTSLMNGLCRKGDALG 348

Query: 568  ALGLLEEMEAKGCSPNECTYNTLLHGLCKARLVDKGIEFYKVMQEGGMKIEPGSYGTLVR 389
            AL LLE+ME KGCSPN CTYNTLL+GLCK RL++KGIE Y V++EGGM ++  SY T VR
Sbjct: 349  ALALLEDMEMKGCSPNSCTYNTLLYGLCKERLLEKGIELYNVIKEGGMLLDTASYATFVR 408

Query: 388  ALCKNGRVAEAYEVFDYAIESKSLTDVSAYSTLESNLKWLKKAREQGLVV 239
            ALC+ G+VAEAYEVFDYA+ESKSLT+ +AY+TLES LKWLKKAREQGL V
Sbjct: 409  ALCREGKVAEAYEVFDYAVESKSLTNAAAYTTLESTLKWLKKAREQGLSV 458


>ref|XP_003626687.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355520709|gb|AET01163.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 501

 Score =  555 bits (1429), Expect = e-155
 Identities = 275/469 (58%), Positives = 344/469 (73%), Gaps = 5/469 (1%)
 Frame = -2

Query: 1633 MGKFPPSMRGARLPIGALMKKPLSEPQSPAQKPHYLAGXXXXXXXXK--SMPQIAPRVPT 1460
            MGK PPS R A      + +     P SP  KPH+           +  S  Q     P 
Sbjct: 1    MGKIPPSFRSALSNPNLIHRSSSLIPSSP--KPHHFPNKTRKPHQKQQQSQSQSQSPKPV 58

Query: 1459 ITFDSPSLSDAKTIFNQLVSSPKKAPLDLRFCNAILQSFSSVGTLQDSIFLLNHMIKTHP 1280
              F SP+L +AK+IFN  V+S   AP+D RF N++LQS++S+ T+ DSI  L HM KTHP
Sbjct: 59   SVFKSPNLQEAKSIFNSFVNS-SNAPIDSRFHNSLLQSYASISTINDSIAFLRHMTKTHP 117

Query: 1279 VFSPDSSTYNILLVQCCNAEDF---SISSVHQVLNYMSTQGFPPNKVTTDVAVRTLCFAG 1109
             FSPD STY+ILL  CC + D    ++S +HQ LN M + G  P+K T D+AVR+LC A 
Sbjct: 118  SFSPDKSTYHILLTHCCKSTDSKYSTLSLIHQTLNLMVSDGISPDKGTVDLAVRSLCTAD 177

Query: 1108 REEDAIELVKELSGKNLTPDSYTYNFLVRHLVKNRELSTVNCFIKEMREGFDIKPDLVSY 929
            R +DA+EL+KELS K+ +PD Y+YNFLV++L K+R LS V  FI EMR  FD+KP+LV+Y
Sbjct: 178  RVDDAVELIKELSSKHCSPDIYSYNFLVKNLCKSRTLSLVYAFIDEMRTKFDVKPNLVTY 237

Query: 928  TIMIDNVCNRKNLREATRLLGVLREEGYKPDCYVYNTIMKGHCMLNQSGEVLDVYKKMKE 749
            TI+IDNVCN KNLREATRL+ +L EEG+KPDC++YNTIMKG+CML++  E ++VY +MKE
Sbjct: 238  TILIDNVCNTKNLREATRLVDILEEEGFKPDCFLYNTIMKGYCMLSRGSEAIEVYNRMKE 297

Query: 748  EGVQPDLVTYNTLIYGLSKSGRVKEAKKFLGVMTEMGQFPDAATYTSLMNGMCREGDAIR 569
            +GV+PDL+TYNTLI+GLSKSGRV EAKK L VM E G FPD  TYTSLMNGMCR+G+ + 
Sbjct: 298  KGVEPDLITYNTLIFGLSKSGRVSEAKKLLRVMAEKGHFPDEVTYTSLMNGMCRKGETLA 357

Query: 568  ALGLLEEMEAKGCSPNECTYNTLLHGLCKARLVDKGIEFYKVMQEGGMKIEPGSYGTLVR 389
            AL LLEEME KGCSPN CTYNTLLHGLCK+R+ DK +E Y  M+  G+K++  SY T VR
Sbjct: 358  ALALLEEMEMKGCSPNTCTYNTLLHGLCKSRMFDKAMELYGAMKSDGLKLDMASYATFVR 417

Query: 388  ALCKNGRVAEAYEVFDYAIESKSLTDVSAYSTLESNLKWLKKAREQGLV 242
            ALC  GRVA+AYEVFDYA+ESKSL+DV+AYSTLES LKW KKA+E+G +
Sbjct: 418  ALCSVGRVADAYEVFDYAVESKSLSDVAAYSTLESTLKWFKKAKEEGAI 466


>ref|XP_002306437.1| predicted protein [Populus trichocarpa] gi|222855886|gb|EEE93433.1|
            predicted protein [Populus trichocarpa]
          Length = 462

 Score =  552 bits (1423), Expect = e-155
 Identities = 282/466 (60%), Positives = 345/466 (74%), Gaps = 1/466 (0%)
 Frame = -2

Query: 1633 MGKFPPSMRGARLPIGALMKKPLSEPQSPAQKPHYLAGXXXXXXXXKSMPQIAPRVPTIT 1454
            MGKFPPS R A +   +L+K   S+ Q   Q+PHY           K      P      
Sbjct: 1    MGKFPPSFRSA-ISSTSLIKNTPSQQQ---QQPHYFPKKLTKKNSPKPHETETPPPHKSL 56

Query: 1453 FDSPSLSDAKTIFNQLVSSPKKAPLD-LRFCNAILQSFSSVGTLQDSIFLLNHMIKTHPV 1277
            F + SL++AK++FN  +S+ K   LD LR  N+ LQS++S+ TL DSI LL+HM+KT P 
Sbjct: 57   FKTSSLNEAKSLFNSFISTTKAPLLDNLRLHNSFLQSYTSISTLDDSISLLDHMVKTLPS 116

Query: 1276 FSPDSSTYNILLVQCCNAEDFSISSVHQVLNYMSTQGFPPNKVTTDVAVRTLCFAGREED 1097
             SPD STY++LL Q C   D S+SS  +VLN M  +GF PN+ T DVA+R+LC AGR +D
Sbjct: 117  LSPDRSTYHVLLSQSCREPDSSLSSAQKVLNLMINKGFKPNQFTVDVAIRSLCSAGRVDD 176

Query: 1096 AIELVKELSGKNLTPDSYTYNFLVRHLVKNRELSTVNCFIKEMREGFDIKPDLVSYTIMI 917
            AI LVKE S K+  PD++TYNFLV+ L K+R  ++V  FI EM+  FDIKPDLV+YTI+I
Sbjct: 177  AILLVKEFSSKHSKPDTFTYNFLVKCLCKSRIFNSVYSFIDEMKSSFDIKPDLVTYTILI 236

Query: 916  DNVCNRKNLREATRLLGVLREEGYKPDCYVYNTIMKGHCMLNQSGEVLDVYKKMKEEGVQ 737
            DNVCN KN+REA RL+ VL+E G KPD ++YNTIMKG+C+LN+  E + +YK+MKEEGV+
Sbjct: 237  DNVCNAKNIREADRLVAVLKECGLKPDAFLYNTIMKGYCLLNKGIEAVRIYKQMKEEGVE 296

Query: 736  PDLVTYNTLIYGLSKSGRVKEAKKFLGVMTEMGQFPDAATYTSLMNGMCREGDAIRALGL 557
            PDLVTYNTLI+GLSK GRV EAKK L +M E G FPDA TYTSLMNGMCREGD + A  L
Sbjct: 297  PDLVTYNTLIFGLSKCGRVSEAKKLLKIMVESGHFPDAVTYTSLMNGMCREGDVLGAAAL 356

Query: 556  LEEMEAKGCSPNECTYNTLLHGLCKARLVDKGIEFYKVMQEGGMKIEPGSYGTLVRALCK 377
            LEEME KGCSPN CTYNTLLHG CK R ++KG+E Y V+++GGMK+E  SY T VRALC+
Sbjct: 357  LEEMELKGCSPNSCTYNTLLHGFCKGRRLNKGVELYGVIKKGGMKLETASYATFVRALCR 416

Query: 376  NGRVAEAYEVFDYAIESKSLTDVSAYSTLESNLKWLKKAREQGLVV 239
             GRVAEAYEVFDYA+ESKSLTDV+AY+TLES LKWLKKAREQGL V
Sbjct: 417  EGRVAEAYEVFDYAVESKSLTDVAAYTTLESTLKWLKKAREQGLAV 462


>ref|XP_002884075.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297329915|gb|EFH60334.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 462

 Score =  536 bits (1380), Expect = e-150
 Identities = 265/466 (56%), Positives = 349/466 (74%), Gaps = 2/466 (0%)
 Frame = -2

Query: 1633 MGKFPPSMRGARLPIGALMKKPLSEPQSPAQKPHYLAGXXXXXXXXKSMPQIAPRVPTIT 1454
            MGK P S R   +P   L+KKP   P +P +     A         ++    APR P++ 
Sbjct: 1    MGKVPSSFRA--MPANLLVKKPTPSPPAPPRDFRNRAAVRDSTKLPENTQ--APREPSLR 56

Query: 1453 --FDSPSLSDAKTIFNQLVSSPKKAPLDLRFCNAILQSFSSVGTLQDSIFLLNHMIKTHP 1280
              F SP+LSDAK++FN + ++  + PLDL+F N++LQS++S+  + D++ L  H++K+ P
Sbjct: 57   NPFKSPNLSDAKSLFNSIAAT-SRIPLDLKFHNSVLQSYASIAAVDDTVKLFQHILKSQP 115

Query: 1279 VFSPDSSTYNILLVQCCNAEDFSISSVHQVLNYMSTQGFPPNKVTTDVAVRTLCFAGREE 1100
             F P  ST+ ILL   C A D SIS+VH+VLN M   G  P++VTTD+AVR+LC  GR +
Sbjct: 116  NFRPGRSTFLILLSHACRAPDSSISNVHRVLNLMVNNGLEPDQVTTDIAVRSLCETGRVD 175

Query: 1099 DAIELVKELSGKNLTPDSYTYNFLVRHLVKNRELSTVNCFIKEMREGFDIKPDLVSYTIM 920
            +A +L+KEL+ K+  PD+YTYNFL++HL K ++L  V  F+ EMR+ FD+KPDLVS+TI+
Sbjct: 176  EAKDLMKELTEKHSPPDTYTYNFLLKHLCKCKDLHVVYEFVDEMRDDFDVKPDLVSFTIL 235

Query: 919  IDNVCNRKNLREATRLLGVLREEGYKPDCYVYNTIMKGHCMLNQSGEVLDVYKKMKEEGV 740
            IDNVCN KNLREA  L+  L   G+KPDC++YNTIMKG C L++  E + VYKKMKEEGV
Sbjct: 236  IDNVCNSKNLREAMYLVSKLGNAGFKPDCFLYNTIMKGFCTLSKGSEAIGVYKKMKEEGV 295

Query: 739  QPDLVTYNTLIYGLSKSGRVKEAKKFLGVMTEMGQFPDAATYTSLMNGMCREGDAIRALG 560
            +PD +TYNTLIYGLSKSGRV+EA+ +L  M + G  PD ATYTSLMNGMCR+G+++ AL 
Sbjct: 296  EPDQITYNTLIYGLSKSGRVEEARMYLKTMVDAGYEPDTATYTSLMNGMCRKGESLGALS 355

Query: 559  LLEEMEAKGCSPNECTYNTLLHGLCKARLVDKGIEFYKVMQEGGMKIEPGSYGTLVRALC 380
            LLEEMEA+GC+PN+CTYNTLLHGLCKARL+DKG+E Y++M+  G+K+E   Y TLVR+L 
Sbjct: 356  LLEEMEARGCAPNDCTYNTLLHGLCKARLMDKGMELYELMKSSGVKLETNGYATLVRSLV 415

Query: 379  KNGRVAEAYEVFDYAIESKSLTDVSAYSTLESNLKWLKKAREQGLV 242
            K+G+VAEAYEVFDYA++SKSLTD SAYSTLE+ LKWLKKA+EQGLV
Sbjct: 416  KSGKVAEAYEVFDYAVDSKSLTDASAYSTLETTLKWLKKAKEQGLV 461


Top