BLASTX nr result
ID: Cephaelis21_contig00012143
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00012143 (2169 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containi... 728 0.0 emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera] 728 0.0 ref|XP_002521980.1| pentatricopeptide repeat-containing protein,... 684 0.0 ref|XP_002884468.1| pentatricopeptide repeat-containing protein ... 674 0.0 ref|NP_566237.1| pentatricopeptide repeat-containing protein [Ar... 672 0.0 >ref|XP_002266822.1| PREDICTED: pentatricopeptide repeat-containing protein At3g04760, chloroplastic-like [Vitis vinifera] Length = 582 Score = 728 bits (1880), Expect = 0.0 Identities = 355/580 (61%), Positives = 458/580 (78%), Gaps = 3/580 (0%) Frame = -1 Query: 1989 TIFSADVFSLLFPC--VTNPTSNLHKKAVVRCRSLMSNDR-KSESSQNAKVSTETRSTHL 1819 TI+S D F P PTS+ H ++V CR+ ND S ++ VS E R HL Sbjct: 2 TIYSTDFFPRCPPFNPQLKPTSHSHHTSIVTCRNPNPNDGFNSRNAPKVGVSAEARPAHL 61 Query: 1818 QPDDSTEPNFVKLLYRSFKAAKYNEALYFLECMVNRSLKPDVILSTKLIQGFFSSKNAEK 1639 Q D E + +KLL RS KA K+NE+LYFLEC+VN+ PDVIL TKLI+GFF+ KN EK Sbjct: 62 QSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYTPDVILCTKLIKGFFNFKNIEK 121 Query: 1638 AIRVMQILEQYGEPDVFAYNAVISGFCKLNQFDSANKILNRMRARGFSPDVVTYNIMIGS 1459 A RVM+ILE + EPDVFAYNAVISGFCK+N+ ++A ++LNRM+ARGF PD+VTYNIMIGS Sbjct: 122 ASRVMEILESHTEPDVFAYNAVISGFCKVNRIEAATQVLNRMKARGFLPDIVTYNIMIGS 181 Query: 1458 LCNRGKLGLALKVFDQLLEDNCQPSVVTYTILIEATAVEGGTREAMKLFDEMLSKGLQPD 1279 LCNR KLGLALKV DQLL DNC P+V+TYTILIEAT VEGG EAMKL +EML++GL PD Sbjct: 182 LCNRRKLGLALKVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARGLLPD 241 Query: 1278 MYTYNAIIRAMCKDGLMDEAFDFVKSLPARGCKADLVSYNVLLRALLSQGRWKDGEKLVA 1099 MYTYNAIIR MCK+G+++ A + + SL ++GCK D++SYN+LLRA L+QG+W +GEKLVA Sbjct: 242 MYTYNAIIRGMCKEGMVERAAELITSLTSKGCKPDVISYNILLRAFLNQGKWDEGEKLVA 301 Query: 1098 EMLSEDIEPNVVTYTILISALCHLGKLEESLEFLKLMMDEGLTPNTYTYDPLISAFCKEG 919 EM S EPN VTY+ILIS+LC G+++E++ LK+M+++ LTP+TY+YDPLISA CKEG Sbjct: 302 EMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISALCKEG 361 Query: 918 KMDLAIAFLNHMVSSGCLPDIVNYNTLLSAMCKNKNVDQALELFGNLSEVGCPPDVSTYN 739 ++DLAI +++M+S+GCLPDIVNYNT+L+A+CKN N +QALE+F L +GCPP+VS+YN Sbjct: 362 RLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNVSSYN 421 Query: 738 TMISALWNTGERAQALRIASEMISKAIDPDEITFKTLISCLCRDGMVDEARELLTGMESS 559 TMISALW+ G+R++AL + MISK +DPDEIT+ +LISCLCRDG+V+EA LL ME S Sbjct: 422 TMISALWSCGDRSRALGMVPAMISKGVDPDEITYNSLISCLCRDGLVEEAIGLLDDMEQS 481 Query: 558 GFPLTVATYNIVLLGLCKVHRVDDAIEVLELMIQKGCLPNETTYILLLEGIGFAGWQAEA 379 GF TV +YNIVLLGLCKV R+DDAI + MI+KGC PNETTYILL+EGIGFAGW+ EA Sbjct: 482 GFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGWRTEA 541 Query: 378 IETASTLLRKHVITKESVQRLKRTFPTLNADKAVTHTKRK 259 +E A++L + VI+++S +RL +TFP L+ K +++++ K Sbjct: 542 MELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSETK 581 >emb|CAN67401.1| hypothetical protein VITISV_025967 [Vitis vinifera] Length = 592 Score = 728 bits (1879), Expect = 0.0 Identities = 356/582 (61%), Positives = 458/582 (78%), Gaps = 3/582 (0%) Frame = -1 Query: 1995 MTTIFSADVFSLL--FPCVTNPTSNLHKKAVVRCRSLMSNDR-KSESSQNAKVSTETRST 1825 + TI+S D F F PTS+ H ++V CR+ ND S +S VS E R Sbjct: 10 LMTIYSTDFFPHCPPFSPQLKPTSHSHHTSIVTCRNPNPNDGYNSRNSPKVGVSAEARPA 69 Query: 1824 HLQPDDSTEPNFVKLLYRSFKAAKYNEALYFLECMVNRSLKPDVILSTKLIQGFFSSKNA 1645 HLQ D E + +KLL RS KA K+NE+LYFLEC+VN+ PDVIL TKLI+GFF+ KN Sbjct: 70 HLQSYDFRETHLMKLLNRSCKAGKFNESLYFLECLVNKGYTPDVILCTKLIKGFFNFKNI 129 Query: 1644 EKAIRVMQILEQYGEPDVFAYNAVISGFCKLNQFDSANKILNRMRARGFSPDVVTYNIMI 1465 EKA RVM+ILE + EPDVFAYNAVISGFCK+NQ ++A ++LNRM+ARGF PD+VTYNIMI Sbjct: 130 EKASRVMEILESHTEPDVFAYNAVISGFCKVNQIEAATQVLNRMKARGFLPDIVTYNIMI 189 Query: 1464 GSLCNRGKLGLALKVFDQLLEDNCQPSVVTYTILIEATAVEGGTREAMKLFDEMLSKGLQ 1285 GSLCNR KLGLAL V DQLL DNC P+V+TYTILIEAT VEGG EAMKL +EML++GL Sbjct: 190 GSLCNRRKLGLALTVLDQLLLDNCMPTVITYTILIEATIVEGGINEAMKLLEEMLARGLL 249 Query: 1284 PDMYTYNAIIRAMCKDGLMDEAFDFVKSLPARGCKADLVSYNVLLRALLSQGRWKDGEKL 1105 PDMYTYNAIIR MCK+G+++ A + + SL ++GC+ D++SYN+LLRA L+QG+W +GEKL Sbjct: 250 PDMYTYNAIIRGMCKEGMVERAAELITSLTSKGCEPDVISYNILLRAFLNQGKWDEGEKL 309 Query: 1104 VAEMLSEDIEPNVVTYTILISALCHLGKLEESLEFLKLMMDEGLTPNTYTYDPLISAFCK 925 VAEM S EPN VTY+ILIS+LC G+++E++ LK+M+++ LTP+TY+YDPLISA CK Sbjct: 310 VAEMFSRGCEPNKVTYSILISSLCRFGRIDEAISVLKVMIEKELTPDTYSYDPLISALCK 369 Query: 924 EGKMDLAIAFLNHMVSSGCLPDIVNYNTLLSAMCKNKNVDQALELFGNLSEVGCPPDVST 745 EG++DLAI +++M+S+GCLPDIVNYNT+L+A+CKN N +QALE+F L +GCPP+VS+ Sbjct: 370 EGRLDLAIGIMDYMISNGCLPDIVNYNTILAALCKNGNANQALEIFNKLRGMGCPPNVSS 429 Query: 744 YNTMISALWNTGERAQALRIASEMISKAIDPDEITFKTLISCLCRDGMVDEARELLTGME 565 YNTMISALW+ G+R++AL + MISK IDPDEIT+ +LISCLCRDG+V+EA LL ME Sbjct: 430 YNTMISALWSCGDRSRALGMVPAMISKGIDPDEITYNSLISCLCRDGLVEEAIGLLDDME 489 Query: 564 SSGFPLTVATYNIVLLGLCKVHRVDDAIEVLELMIQKGCLPNETTYILLLEGIGFAGWQA 385 SGF TV +YNIVLLGLCKV R+DDAI + MI+KGC PNETTYILL+EGIGFAGW+ Sbjct: 490 QSGFRPTVISYNIVLLGLCKVRRIDDAIGMFAEMIEKGCRPNETTYILLIEGIGFAGWRT 549 Query: 384 EAIETASTLLRKHVITKESVQRLKRTFPTLNADKAVTHTKRK 259 EA+E A++L + VI+++S +RL +TFP L+ K +++++ K Sbjct: 550 EAMELANSLFSRDVISQDSFKRLNKTFPMLDVYKELSNSETK 591 >ref|XP_002521980.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223538784|gb|EEF40384.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 584 Score = 684 bits (1764), Expect = 0.0 Identities = 326/556 (58%), Positives = 440/556 (79%), Gaps = 1/556 (0%) Frame = -1 Query: 1938 PTSNLHKKAVVRC-RSLMSNDRKSESSQNAKVSTETRSTHLQPDDSTEPNFVKLLYRSFK 1762 PTSN +V C R +++ K + Q +VS ETR TH+ D E + +KLL RS + Sbjct: 22 PTSNSLHSTIVSCIRPELNDANKVRNPQKVRVSAETRQTHVLSFDFKEVHLMKLLNRSCR 81 Query: 1761 AAKYNEALYFLECMVNRSLKPDVILSTKLIQGFFSSKNAEKAIRVMQILEQYGEPDVFAY 1582 A KYNE+LYFLECMV++ PDVIL TKLI+GFF+S+N KA RVM+ILE+YG+PDVFAY Sbjct: 82 AGKYNESLYFLECMVDKGYTPDVILCTKLIKGFFNSRNIGKATRVMEILERYGKPDVFAY 141 Query: 1581 NAVISGFCKLNQFDSANKILNRMRARGFSPDVVTYNIMIGSLCNRGKLGLALKVFDQLLE 1402 NA+ISGF K NQ ++AN++L+RM++RGF PDVVTYNIMIGS C+RGKL LAL++F++LL+ Sbjct: 142 NALISGFIKANQLENANRVLDRMKSRGFLPDVVTYNIMIGSFCSRGKLDLALEIFEELLK 201 Query: 1401 DNCQPSVVTYTILIEATAVEGGTREAMKLFDEMLSKGLQPDMYTYNAIIRAMCKDGLMDE 1222 DNC+P+V+TYTILIEAT ++GG AMKL DEMLSKGL+PD TYNAIIR MCK+ ++D+ Sbjct: 202 DNCEPTVITYTILIEATILDGGIDVAMKLLDEMLSKGLEPDTLTYNAIIRGMCKEMMVDK 261 Query: 1221 AFDFVKSLPARGCKADLVSYNVLLRALLSQGRWKDGEKLVAEMLSEDIEPNVVTYTILIS 1042 AF+ ++SL +RGCK D+++YN+LLR LLS+G+W +GEKL++EM+S +PNVVT++ILI Sbjct: 262 AFELLRSLSSRGCKPDIITYNILLRTLLSRGKWSEGEKLISEMISIGCKPNVVTHSILIG 321 Query: 1041 ALCHLGKLEESLEFLKLMMDEGLTPNTYTYDPLISAFCKEGKMDLAIAFLNHMVSSGCLP 862 LC GK+EE++ L+ M ++GL P+ Y YDPLI+ FC+EG++DLA FL +M+S GCLP Sbjct: 322 TLCRDGKVEEAVNLLRSMKEKGLKPDAYCYDPLIAGFCREGRLDLATEFLEYMISDGCLP 381 Query: 861 DIVNYNTLLSAMCKNKNVDQALELFGNLSEVGCPPDVSTYNTMISALWNTGERAQALRIA 682 DIVNYNT+++ +C+ DQALE+F L EVGCPP+VS+YNT+ SALW++G+R +AL + Sbjct: 382 DIVNYNTIMAGLCRTGKADQALEVFEKLDEVGCPPNVSSYNTLFSALWSSGDRYRALEMI 441 Query: 681 SEMISKAIDPDEITFKTLISCLCRDGMVDEARELLTGMESSGFPLTVATYNIVLLGLCKV 502 +++++ IDPDEIT+ +LISCLCRDGMVDEA ELL M+S + V +YNI+LLGLCKV Sbjct: 442 LKLLNQGIDPDEITYNSLISCLCRDGMVDEAIELLVDMQSGRYRPNVVSYNIILLGLCKV 501 Query: 501 HRVDDAIEVLELMIQKGCLPNETTYILLLEGIGFAGWQAEAIETASTLLRKHVITKESVQ 322 +R +DAIEVL M +KGC PNETTYILL+EGIGF+G +AEA+E A++L + I+++S Sbjct: 502 NRANDAIEVLAAMTEKGCQPNETTYILLIEGIGFSGLRAEAMELANSLHGMNAISEDSFN 561 Query: 321 RLKRTFPTLNADKAVT 274 RL +TFP L+ K +T Sbjct: 562 RLNKTFPLLDVYKDLT 577 >ref|XP_002884468.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297330308|gb|EFH60727.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 598 Score = 674 bits (1740), Expect = 0.0 Identities = 314/525 (59%), Positives = 421/525 (80%) Frame = -1 Query: 1866 SSQNAKVSTETRSTHLQPDDSTEPNFVKLLYRSFKAAKYNEALYFLECMVNRSLKPDVIL 1687 ++ +A + TE R H Q + +K+ +RS ++ Y E+L+ LE MV + PDVIL Sbjct: 63 TTTDAAIPTERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYNPDVIL 122 Query: 1686 STKLIQGFFSSKNAEKAIRVMQILEQYGEPDVFAYNAVISGFCKLNQFDSANKILNRMRA 1507 TKLI+GFF+ +N KA+RVM+ILE++G+PDVFAYNA+I+GFCK+N+ D A ++L+RMR+ Sbjct: 123 CTKLIKGFFTLRNVPKAVRVMEILEKFGQPDVFAYNALINGFCKMNRIDDATRVLDRMRS 182 Query: 1506 RGFSPDVVTYNIMIGSLCNRGKLGLALKVFDQLLEDNCQPSVVTYTILIEATAVEGGTRE 1327 + FSPD VTYNIMIGSLC+RGKL LALKV DQLL DNCQP+V+TYTILIEAT +EGG E Sbjct: 183 KDFSPDTVTYNIMIGSLCSRGKLDLALKVLDQLLSDNCQPTVITYTILIEATMLEGGVDE 242 Query: 1326 AMKLFDEMLSKGLQPDMYTYNAIIRAMCKDGLMDEAFDFVKSLPARGCKADLVSYNVLLR 1147 A+KL DEMLS+GL+PDM+TYN IIR MCK+G++D AF+ +++L +GC+ D++SYN+LLR Sbjct: 243 ALKLLDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMIRNLELKGCEPDVISYNILLR 302 Query: 1146 ALLSQGRWKDGEKLVAEMLSEDIEPNVVTYTILISALCHLGKLEESLEFLKLMMDEGLTP 967 ALL+QG+W++GEKL+ +M SE +PNVVTY+ILI+ LC GK+EE++ LKLM ++GLTP Sbjct: 303 ALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKGLTP 362 Query: 966 NTYTYDPLISAFCKEGKMDLAIAFLNHMVSSGCLPDIVNYNTLLSAMCKNKNVDQALELF 787 + Y+YDPLI+AFC+EG++D+AI FL M+S GCLPDIVNYNT+L+ +CKN DQALE+F Sbjct: 363 DAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQALEIF 422 Query: 786 GNLSEVGCPPDVSTYNTMISALWNTGERAQALRIASEMISKAIDPDEITFKTLISCLCRD 607 G L EVGC P+ S+YNTM SALW++G++ +AL + EM+S IDPDEIT+ ++ISCLCR+ Sbjct: 423 GKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMVSNGIDPDEITYNSMISCLCRE 482 Query: 606 GMVDEARELLTGMESSGFPLTVATYNIVLLGLCKVHRVDDAIEVLELMIQKGCLPNETTY 427 GMVD+A ELL M S F +V TYNIVLLG CK HR++DAI+VL+ M+ GC PNETTY Sbjct: 483 GMVDKAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAIDVLDSMVGNGCRPNETTY 542 Query: 426 ILLLEGIGFAGWQAEAIETASTLLRKHVITKESVQRLKRTFPTLN 292 +L+EGIGFAG++AEA+E A+ L+R + I++ S +RL RTFP LN Sbjct: 543 TVLIEGIGFAGYRAEAMELANDLVRINAISEYSFKRLHRTFPLLN 587 >ref|NP_566237.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75207286|sp|Q9SR00.1|PP213_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At3g04760, chloroplastic; Flags: Precursor gi|6175176|gb|AAF04902.1|AC011437_17 hypothetical protein [Arabidopsis thaliana] gi|15810359|gb|AAL07067.1| unknown protein [Arabidopsis thaliana] gi|22136960|gb|AAM91709.1| unknown protein [Arabidopsis thaliana] gi|332640611|gb|AEE74132.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 602 Score = 672 bits (1735), Expect = 0.0 Identities = 316/528 (59%), Positives = 421/528 (79%) Frame = -1 Query: 1875 KSESSQNAKVSTETRSTHLQPDDSTEPNFVKLLYRSFKAAKYNEALYFLECMVNRSLKPD 1696 ++ ++ +A + TE R H Q + +K+ +RS ++ Y E+L+ LE MV + PD Sbjct: 64 QTTTTTDATLPTERRQQHSQSLGFRDTQMLKIFHRSCRSGNYIESLHLLETMVRKGYNPD 123 Query: 1695 VILSTKLIQGFFSSKNAEKAIRVMQILEQYGEPDVFAYNAVISGFCKLNQFDSANKILNR 1516 VIL TKLI+GFF+ +N KA+RVM+ILE++G+PDVFAYNA+I+GFCK+N+ D A ++L+R Sbjct: 124 VILCTKLIKGFFTLRNIPKAVRVMEILEKFGQPDVFAYNALINGFCKMNRIDDATRVLDR 183 Query: 1515 MRARGFSPDVVTYNIMIGSLCNRGKLGLALKVFDQLLEDNCQPSVVTYTILIEATAVEGG 1336 MR++ FSPD VTYNIMIGSLC+RGKL LALKV +QLL DNCQP+V+TYTILIEAT +EGG Sbjct: 184 MRSKDFSPDTVTYNIMIGSLCSRGKLDLALKVLNQLLSDNCQPTVITYTILIEATMLEGG 243 Query: 1335 TREAMKLFDEMLSKGLQPDMYTYNAIIRAMCKDGLMDEAFDFVKSLPARGCKADLVSYNV 1156 EA+KL DEMLS+GL+PDM+TYN IIR MCK+G++D AF+ V++L +GC+ D++SYN+ Sbjct: 244 VDEALKLMDEMLSRGLKPDMFTYNTIIRGMCKEGMVDRAFEMVRNLELKGCEPDVISYNI 303 Query: 1155 LLRALLSQGRWKDGEKLVAEMLSEDIEPNVVTYTILISALCHLGKLEESLEFLKLMMDEG 976 LLRALL+QG+W++GEKL+ +M SE +PNVVTY+ILI+ LC GK+EE++ LKLM ++G Sbjct: 304 LLRALLNQGKWEEGEKLMTKMFSEKCDPNVVTYSILITTLCRDGKIEEAMNLLKLMKEKG 363 Query: 975 LTPNTYTYDPLISAFCKEGKMDLAIAFLNHMVSSGCLPDIVNYNTLLSAMCKNKNVDQAL 796 LTP+ Y+YDPLI+AFC+EG++D+AI FL M+S GCLPDIVNYNT+L+ +CKN DQAL Sbjct: 364 LTPDAYSYDPLIAAFCREGRLDVAIEFLETMISDGCLPDIVNYNTVLATLCKNGKADQAL 423 Query: 795 ELFGNLSEVGCPPDVSTYNTMISALWNTGERAQALRIASEMISKAIDPDEITFKTLISCL 616 E+FG L EVGC P+ S+YNTM SALW++G++ +AL + EM+S IDPDEIT+ ++ISCL Sbjct: 424 EIFGKLGEVGCSPNSSSYNTMFSALWSSGDKIRALHMILEMMSNGIDPDEITYNSMISCL 483 Query: 615 CRDGMVDEARELLTGMESSGFPLTVATYNIVLLGLCKVHRVDDAIEVLELMIQKGCLPNE 436 CR+GMVDEA ELL M S F +V TYNIVLLG CK HR++DAI VLE M+ GC PNE Sbjct: 484 CREGMVDEAFELLVDMRSCEFHPSVVTYNIVLLGFCKAHRIEDAINVLESMVGNGCRPNE 543 Query: 435 TTYILLLEGIGFAGWQAEAIETASTLLRKHVITKESVQRLKRTFPTLN 292 TTY +L+EGIGFAG++AEA+E A+ L+R I++ S +RL RTFP LN Sbjct: 544 TTYTVLIEGIGFAGYRAEAMELANDLVRIDAISEYSFKRLHRTFPLLN 591