BLASTX nr result
ID: Cephaelis21_contig00016194
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cephaelis21_contig00016194
         (1793 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

emb|CAN69960.1| hypothetical protein VITISV_032887 [Vitis vinifera]   593   e-167
ref|XP_002521681.1| pentatricopeptide repeat-containing protein,...   578   e-162
ref|XP_003626687.1| Pentatricopeptide repeat-containing protein ...   555   e-155
ref|XP_002306437.1| predicted protein [Populus trichocarpa] gi|2...   552   e-155
ref|XP_002884075.1| pentatricopeptide repeat-containing protein ...   536   e-150

>emb|CAN69960.1| hypothetical protein VITISV_032887 [Vitis vinifera]
          Length = 472

 Score = 593 bits (1530), Expect = e-167
 Identities = 299/472 (63%), Positives = 359/472 (76%), Gaps = 9/472 (1%)
 Frame = -2

Query: 1633 MGKFPPSMRGARLPIGALMKKP------LSEPQSPAQKPHYLAGXXXXXXXXKSM---PQ 1481
            MGK PPS R + +P+ L+K P S QKPH+ P
Sbjct: 1    MGKIPPSFRTSTVPVTTLLKNPPAVLPKQSTVLETPQKPHHFPKKRPQPSGKTKKTRTPI 60

Query: 1480 IAPRVPTITFDSPSLSDAKTIFNQLVSSPKKAPLDLRFCNAILQSFSSVGTLQDSIFLLN 1301
            P+ P I F+SP+L DAK +F + ++ PLDLRF NA+LQS+SS+ T+ DSI L
Sbjct: 61   EDPKSPVI-FNSPNLLDAKKLFASITTT-STTPLDLRFHNALLQSYSSISTVNDSISFLR 118

Query: 1300 HMIKTHPVFSPDSSTYNILLVQCCNAEDFSISSVHQVLNYMSTQGFPPNKVTTDVAVRTL 1121
            HMIK+ P FSP+ STY+ILL Q C + + +S+VHQ LN M T GFPP++VTTD+AVR+L
Sbjct: 119  HMIKSQPSFSPERSTYHILLSQSCKSPNSDLSAVHQTLNLMVTHGFPPDRVTTDIAVRSL 178

Query: 1120 CFAGREEDAIELVKELSGKNLTPDSYTYNFLVRHLVKNRELSTVNCFIKEMREGFDIKPD 941
            C AGREE AIELVKELS K+ PDS+TYNF++RHL K R LSTV FI E++ F +KPD
Sbjct: 179  CSAGREEHAIELVKELSLKHSPPDSFTYNFIIRHLCKTRALSTVYNFIDELQNSFQLKPD 238

Query: 940  LVSYTIMIDNVCNRKNLREATRLLGVLREEGYKPDCYVYNTIMKGHCMLNQSGEVLDVYK 761
            LV+YTI+IDNVCN KNLREATRLL VL E G+KPDCYVYNTIMKG+C+L++ E + VYK
Sbjct: 239  LVTYTILIDNVCNGKNLREATRLLEVLGEAGFKPDCYVYNTIMKGYCILDKGSEAIGVYK 298

Query: 760  KMKEEGVQPDLVTYNTLIYGLSKSGRVKEAKKFLGVMTEMGQFPDAATYTSLMNGMCREG 581
            KMKEEGV+PDLVTYNTLI+GLSKSGRVKEA+KFL +M EMG FPDA TYTSLMNG+CREG
Sbjct: 299  KMKEEGVEPDLVTYNTLIFGLSKSGRVKEARKFLDIMAEMGHFPDAVTYTSLMNGLCREG 358

Query: 580  DAIRALGLLEEMEAKGCSPNECTYNTLLHGLCKARLVDKGIEFYKVMQEGGMKIEPGSYG 401
            +A+ AL LLEEMEAKGCSPN CTYNTLLHGLCK R++++GIE Y VM+ GGMK+E SY
Sbjct: 359  NALGALALLEEMEAKGCSPNSCTYNTLLHGLCKLRMLERGIELYGVMKSGGMKLEKASYA 418

Query: 400  TLVRALCKNGRVAEAYEVFDYAIESKSLTDVSAYSTLESNLKWLKKAREQGL 245
            T VRALCK GRVAEAYE FDY +ESKS DV+AYSTLE++LKWL+KAREQGL
Sbjct: 419  TFVRALCKEGRVAEAYEAFDYVVESKSFDDVTAYSTLENSLKWLRKAREQGL 470

>ref|XP_002521681.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
 gi|223539072|gb|EEF40668.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis]
          Length = 458

 Score = 578 bits (1491), Expect = e-162
 Identities = 291/470 (61%), Positives = 364/470 (77%), Gaps = 5/470 (1%)
 Frame = -2

Query: 1633 MGKFPPSMRGARLPIGALMKKPLSEPQSPAQKPHYLAGXXXXXXXXKSMPQIAPRVPTIT 1454
            MGK PPS+R A + AL++KP P P +KPHYL+ K+ Q++ ++PT
Sbjct: 1    MGKVPPSLRSA-VSTTALLRKP--NPFPPPEKPHYLS--------KKTKLQLSQKIPTPI 49

Query: 1453 -----FDSPSLSDAKTIFNQLVSSPKKAPLDLRFCNAILQSFSSVGTLQDSIFLLNHMIK 1289
            F SP L++AK IFN L+S+ + PLDLRF ++ LQS+SS+ T+ DSI LL HMIK
Sbjct: 50   QQKRLFKSPELNEAKEIFNSLISTTR-VPLDLRFHHSFLQSYSSISTIDDSISLLRHMIK 108

Query: 1288 THPVFSPDSSTYNILLVQCCNAEDFSISSVHQVLNYMSTQGFPPNKVTTDVAVRTLCFAG 1109
            T P F+P STY+ILL Q C A D ++S VHQ+LN M GF P +VT D+AVR LC AG
Sbjct: 109  TLPSFTPTISTYHILLSQSCKAPDPTLSPVHQILNLMVNNGFMPTQVTVDIAVRALCSAG 168

Query: 1108 REEDAIELVKELSGKNLTPDSYTYNFLVRHLVKNRELSTVNCFIKEMREGFDIKPDLVSY 929
            +E+DA++LVKELS K+ PDS+TYNFLV+ L K R LS V FI EMR FD++P+LV+Y
Sbjct: 169  KEDDAVKLVKELSLKHSKPDSFTYNFLVKCLCKCRALSNVYSFIDEMRSSFDLEPNLVTY 228

Query: 928  TIMIDNVCNRKNLREATRLLGVLREEGYKPDCYVYNTIMKGHCMLNQSGEVLDVYKKMKE 749
            TI+IDNVCN KNLREA RLLG+LRE G+KPDC+VYNTIMKG+CML++ + + V+KKMKE
Sbjct: 229  TILIDNVCNSKNLREAMRLLGILRECGFKPDCFVYNTIMKGYCMLSKGSDAIQVFKKMKE 288

Query: 748  EGVQPDLVTYNTLIYGLSKSGRVKEAKKFLGVMTEMGQFPDAATYTSLMNGMCREGDAIR 569
            EG++PDL+TYNTLI+GLSK GRV EAK++L +M E G FPDA TYTSLMNG+CR+GDA+
Sbjct: 289  EGIEPDLITYNTLIFGLSKGGRVSEAKRYLKIMVESGHFPDAVTYTSLMNGLCRKGDALG 348

Query: 568  ALGLLEEMEAKGCSPNECTYNTLLHGLCKARLVDKGIEFYKVMQEGGMKIEPGSYGTLVR 389
            AL LLE+ME KGCSPN CTYNTLL+GLCK RL++KGIE Y V++EGGM ++ SY T VR
Sbjct: 349  ALALLEDMEMKGCSPNSCTYNTLLYGLCKERLLEKGIELYNVIKEGGMLLDTASYATFVR 408

Query: 388  ALCKNGRVAEAYEVFDYAIESKSLTDVSAYSTLESNLKWLKKAREQGLVV 239
            ALC+ G+VAEAYEVFDYA+ESKSLT+ +AY+TLES LKWLKKAREQGL V
Sbjct: 409  ALCREGKVAEAYEVFDYAVESKSLTNAAAYTTLESTLKWLKKAREQGLSV 458

>ref|XP_003626687.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
 gi|355520709|gb|AET01163.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
          Length = 501

 Score = 555 bits (1429), Expect = e-155
 Identities = 275/469 (58%), Positives = 344/469 (73%), Gaps = 5/469 (1%)
 Frame = -2

Query: 1633 MGKFPPSMRGARLPIGALMKKPLSEPQSPAQKPHYLAGXXXXXXXXK--SMPQIAPRVPT 1460
            MGK PPS R A + + P SP KPH+ + S Q P
Sbjct: 1    MGKIPPSFRSALSNPNLIHRSSSLIPSSP--KPHHFPNKTRKPHQKQQQSQSQSQSPKPV 58

Query: 1459 ITFDSPSLSDAKTIFNQLVSSPKKAPLDLRFCNAILQSFSSVGTLQDSIFLLNHMIKTHP 1280
            F SP+L +AK+IFN V+S AP+D RF N++LQS++S+ T+ DSI L HM KTHP
Sbjct: 59   SVFKSPNLQEAKSIFNSFVNS-SNAPIDSRFHNSLLQSYASISTINDSIAFLRHMTKTHP 117

Query: 1279 VFSPDSSTYNILLVQCCNAEDF---SISSVHQVLNYMSTQGFPPNKVTTDVAVRTLCFAG 1109
            FSPD STY+ILL CC + D ++S +HQ LN M + G P+K T D+AVR+LC A
Sbjct: 118  SFSPDKSTYHILLTHCCKSTDSKYSTLSLIHQTLNLMVSDGISPDKGTVDLAVRSLCTAD 177

Query: 1108 REEDAIELVKELSGKNLTPDSYTYNFLVRHLVKNRELSTVNCFIKEMREGFDIKPDLVSY 929
            R +DA+EL+KELS K+ +PD Y+YNFLV++L K+R LS V FI EMR FD+KP+LV+Y
Sbjct: 178  RVDDAVELIKELSSKHCSPDIYSYNFLVKNLCKSRTLSLVYAFIDEMRTKFDVKPNLVTY 237

Query: 928  TIMIDNVCNRKNLREATRLLGVLREEGYKPDCYVYNTIMKGHCMLNQSGEVLDVYKKMKE 749
            TI+IDNVCN KNLREATRL+ +L EEG+KPDC++YNTIMKG+CML++ E ++VY +MKE
Sbjct: 238  TILIDNVCNTKNLREATRLVDILEEEGFKPDCFLYNTIMKGYCMLSRGSEAIEVYNRMKE 297

Query: 748  EGVQPDLVTYNTLIYGLSKSGRVKEAKKFLGVMTEMGQFPDAATYTSLMNGMCREGDAIR 569
            +GV+PDL+TYNTLI+GLSKSGRV EAKK L VM E G FPD TYTSLMNGMCR+G+ +
Sbjct: 298  KGVEPDLITYNTLIFGLSKSGRVSEAKKLLRVMAEKGHFPDEVTYTSLMNGMCRKGETLA 357

Query: 568  ALGLLEEMEAKGCSPNECTYNTLLHGLCKARLVDKGIEFYKVMQEGGMKIEPGSYGTLVR 389
            AL LLEEME KGCSPN CTYNTLLHGLCK+R+ DK +E Y M+ G+K++ SY T VR
Sbjct: 358  ALALLEEMEMKGCSPNTCTYNTLLHGLCKSRMFDKAMELYGAMKSDGLKLDMASYATFVR 417

Query: 388  ALCKNGRVAEAYEVFDYAIESKSLTDVSAYSTLESNLKWLKKAREQGLV 242
            ALC GRVA+AYEVFDYA+ESKSL+DV+AYSTLES LKW KKA+E+G +
Sbjct: 418  ALCSVGRVADAYEVFDYAVESKSLSDVAAYSTLESTLKWFKKAKEEGAI 466

>ref|XP_002306437.1| predicted protein [Populus trichocarpa]
 gi|222855886|gb|EEE93433.1| predicted protein [Populus trichocarpa]
          Length = 462

 Score = 552 bits (1423), Expect = e-155
 Identities = 282/466 (60%), Positives = 345/466 (74%), Gaps = 1/466 (0%)
 Frame = -2

Query: 1633 MGKFPPSMRGARLPIGALMKKPLSEPQSPAQKPHYLAGXXXXXXXXKSMPQIAPRVPTIT 1454
            MGKFPPS R A + +L+K S+ Q Q+PHY K P
Sbjct: 1    MGKFPPSFRSA-ISSTSLIKNTPSQQQ---QQPHYFPKKLTKKNSPKPHETETPPPHKSL 56

Query: 1453 FDSPSLSDAKTIFNQLVSSPKKAPLD-LRFCNAILQSFSSVGTLQDSIFLLNHMIKTHPV 1277
            F + SL++AK++FN +S+ K LD LR N+ LQS++S+ TL DSI LL+HM+KT P
Sbjct: 57   FKTSSLNEAKSLFNSFISTTKAPLLDNLRLHNSFLQSYTSISTLDDSISLLDHMVKTLPS 116

Query: 1276 FSPDSSTYNILLVQCCNAEDFSISSVHQVLNYMSTQGFPPNKVTTDVAVRTLCFAGREED 1097
            SPD STY++LL Q C D S+SS +VLN M +GF PN+ T DVA+R+LC AGR +D
Sbjct: 117  LSPDRSTYHVLLSQSCREPDSSLSSAQKVLNLMINKGFKPNQFTVDVAIRSLCSAGRVDD 176

Query: 1096 AIELVKELSGKNLTPDSYTYNFLVRHLVKNRELSTVNCFIKEMREGFDIKPDLVSYTIMI 917
            AI LVKE S K+ PD++TYNFLV+ L K+R ++V FI EM+ FDIKPDLV+YTI+I
Sbjct: 177  AILLVKEFSSKHSKPDTFTYNFLVKCLCKSRIFNSVYSFIDEMKSSFDIKPDLVTYTILI 236

Query: 916  DNVCNRKNLREATRLLGVLREEGYKPDCYVYNTIMKGHCMLNQSGEVLDVYKKMKEEGVQ 737
            DNVCN KN+REA RL+ VL+E G KPD ++YNTIMKG+C+LN+ E + +YK+MKEEGV+
Sbjct: 237  DNVCNAKNIREADRLVAVLKECGLKPDAFLYNTIMKGYCLLNKGIEAVRIYKQMKEEGVE 296

Query: 736  PDLVTYNTLIYGLSKSGRVKEAKKFLGVMTEMGQFPDAATYTSLMNGMCREGDAIRALGL 557
            PDLVTYNTLI+GLSK GRV EAKK L +M E G FPDA TYTSLMNGMCREGD + A L
Sbjct: 297  PDLVTYNTLIFGLSKCGRVSEAKKLLKIMVESGHFPDAVTYTSLMNGMCREGDVLGAAAL 356

Query: 556  LEEMEAKGCSPNECTYNTLLHGLCKARLVDKGIEFYKVMQEGGMKIEPGSYGTLVRALCK 377
            LEEME KGCSPN CTYNTLLHG CK R ++KG+E Y V+++GGMK+E SY T VRALC+
Sbjct: 357  LEEMELKGCSPNSCTYNTLLHGFCKGRRLNKGVELYGVIKKGGMKLETASYATFVRALCR 416

Query: 376  NGRVAEAYEVFDYAIESKSLTDVSAYSTLESNLKWLKKAREQGLVV 239
            GRVAEAYEVFDYA+ESKSLTDV+AY+TLES LKWLKKAREQGL V
Sbjct: 417  EGRVAEAYEVFDYAVESKSLTDVAAYTTLESTLKWLKKAREQGLAV 462

>ref|XP_002884075.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
 gi|297329915|gb|EFH60334.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata]
          Length = 462

 Score = 536 bits (1380), Expect = e-150
 Identities = 265/466 (56%), Positives = 349/466 (74%), Gaps = 2/466 (0%)
 Frame = -2

Query: 1633 MGKFPPSMRGARLPIGALMKKPLSEPQSPAQKPHYLAGXXXXXXXXKSMPQIAPRVPTIT 1454
            MGK P S R +P L+KKP P +P + A ++ APR P++
Sbjct: 1    MGKVPSSFRA--MPANLLVKKPTPSPPAPPRDFRNRAAVRDSTKLPENTQ--APREPSLR 56

Query: 1453 --FDSPSLSDAKTIFNQLVSSPKKAPLDLRFCNAILQSFSSVGTLQDSIFLLNHMIKTHP 1280
            F SP+LSDAK++FN + ++ + PLDL+F N++LQS++S+ + D++ L H++K+ P
Sbjct: 57   NPFKSPNLSDAKSLFNSIAAT-SRIPLDLKFHNSVLQSYASIAAVDDTVKLFQHILKSQP 115

Query: 1279 VFSPDSSTYNILLVQCCNAEDFSISSVHQVLNYMSTQGFPPNKVTTDVAVRTLCFAGREE 1100
            F P ST+ ILL C A D SIS+VH+VLN M G P++VTTD+AVR+LC GR +
Sbjct: 116  NFRPGRSTFLILLSHACRAPDSSISNVHRVLNLMVNNGLEPDQVTTDIAVRSLCETGRVD 175

Query: 1099 DAIELVKELSGKNLTPDSYTYNFLVRHLVKNRELSTVNCFIKEMREGFDIKPDLVSYTIM 920
            +A +L+KEL+ K+ PD+YTYNFL++HL K ++L V F+ EMR+ FD+KPDLVS+TI+
Sbjct: 176  EAKDLMKELTEKHSPPDTYTYNFLLKHLCKCKDLHVVYEFVDEMRDDFDVKPDLVSFTIL 235

Query: 919  IDNVCNRKNLREATRLLGVLREEGYKPDCYVYNTIMKGHCMLNQSGEVLDVYKKMKEEGV 740
            IDNVCN KNLREA L+ L G+KPDC++YNTIMKG C L++ E + VYKKMKEEGV
Sbjct: 236  IDNVCNSKNLREAMYLVSKLGNAGFKPDCFLYNTIMKGFCTLSKGSEAIGVYKKMKEEGV 295

Query: 739  QPDLVTYNTLIYGLSKSGRVKEAKKFLGVMTEMGQFPDAATYTSLMNGMCREGDAIRALG 560
            +PD +TYNTLIYGLSKSGRV+EA+ +L M + G PD ATYTSLMNGMCR+G+++ AL
Sbjct: 296  EPDQITYNTLIYGLSKSGRVEEARMYLKTMVDAGYEPDTATYTSLMNGMCRKGESLGALS 355

Query: 559  LLEEMEAKGCSPNECTYNTLLHGLCKARLVDKGIEFYKVMQEGGMKIEPGSYGTLVRALC 380
            LLEEMEA+GC+PN+CTYNTLLHGLCKARL+DKG+E Y++M+ G+K+E Y TLVR+L
Sbjct: 356  LLEEMEARGCAPNDCTYNTLLHGLCKARLMDKGMELYELMKSSGVKLETNGYATLVRSLV 415

Query: 379  KNGRVAEAYEVFDYAIESKSLTDVSAYSTLESNLKWLKKAREQGLV 242
            K+G+VAEAYEVFDYA++SKSLTD SAYSTLE+ LKWLKKA+EQGLV
Sbjct: 416  KSGKVAEAYEVFDYAVDSKSLTDASAYSTLETTLKWLKKAKEQGLV 461