BLASTX nr result
ID: Cephaelis21_contig00012120
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Cephaelis21_contig00012120 (3077 letters) Database: ./nr 23,641,837 sequences; 8,123,359,852 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002514778.1| pentatricopeptide repeat-containing protein,... 517 e-144 ref|XP_003546486.1| PREDICTED: pentatricopeptide repeat-containi... 472 e-130 ref|XP_003542095.1| PREDICTED: pentatricopeptide repeat-containi... 466 e-128 ref|NP_001143372.1| uncharacterized protein LOC100276004 [Zea ma... 367 1e-98 gb|ACN34333.1| unknown [Zea mays] gi|414879211|tpg|DAA56342.1| T... 362 4e-97 >ref|XP_002514778.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223545829|gb|EEF47332.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 584 Score = 517 bits (1332), Expect = e-144 Identities = 289/604 (47%), Positives = 384/604 (63%), Gaps = 3/604 (0%) Frame = -2 Query: 2989 FNSWGTLKLPSYPKLRFYSSDPTPENDEFYREAPINYVRNEFDGSPFGANEGFADDSVVS 2810 F++W +P LRF S+ ND + +N +DG +EG D+ + Sbjct: 31 FSNW----VPCQQNLRFLSNLSVNTNDTEIDHSSHGSAQNNYDGD----DEGKVTDTHLH 82 Query: 2809 VDSRLGXXXXXXXXXXXEKAGNLLGGFNEVASQNGDIDET---GVSEGEGDNVVGADELE 2639 S E + F + + + T G EG D V+ AD Sbjct: 83 NFSPQADINEVSHHYSSENGDTHMDNFVQRSPDLAEEANTQIHGEVEGHVDYVIDAD--- 139 Query: 2638 KTDKIEGMAQKLEDVLSLLQSSGYGKSIEPSLEDMGLALNEKYVLRVLETPFIPGENLIG 2459 KLE+VLSLLQSS S+E SL++M L L+E ++++VLETP I G+NLI Sbjct: 140 ----------KLENVLSLLQSST-DASLESSLDNMDLHLHEDFIVKVLETPLIVGDNLIK 188 Query: 2458 FFKWVFRKNEILVTKEALDALVTAISNEFRARNAYALWDLVKEAGEKDIGVVSTETFNEL 2279 FF W ++ +I VT + LV AI +E R ++AYALWDLVK+ GE++ V++ + N+L Sbjct: 189 FFNWAIKQPDINVTTRLVHPLVRAICSELRKKDAYALWDLVKDIGEEENTVLNVDLLNQL 248 Query: 2278 LSLFSRLGKGKVAFEVFNKFEDFGCVPDADTYYFTIDALCKRMIFGWACSVCEKMVSADK 2099 ++LFS+LGKGK AFEVFNKF DFGCVPD++TY++TI+ALC+R IF WA SV EKM+ A+ Sbjct: 249 IALFSKLGKGKAAFEVFNKFGDFGCVPDSETYHYTIEALCRRSIFDWASSVREKMLRAEA 308 Query: 2098 VPESKKVGKIVSHLCKGKKFRDAHTVYLWAKERKIYPSRSSVNFLIRSLCGKDKLMGEDK 1919 +P+++K+GKI+ CKG K DA+ VYL AKE+ YP + SVNFLI LC K+ Sbjct: 309 LPDTEKIGKIICWFCKGDKANDAYLVYLLAKEKNKYPPQPSVNFLIGLLCQKN------- 361 Query: 1918 GRTKNETYSPSQAEVIAQEEKENVYLALKMLDDFSEEERKHAIKPFSFVIKGLLRIRDFG 1739 E V LAL+MLD FS +RK+AIKPFS VI+ L RI+D Sbjct: 362 ---------------------ETVKLALEMLDAFSGPKRKYAIKPFSSVIRALCRIKDLD 400 Query: 1738 GAKKLLFQMIEAGPPPGNMIFNTIIGSLSKAGDMQEAMVTLKIMEDRGLKPDVYSYTVVI 1559 GAK LL +M++ GPPPGN +FN+II SK GDM+EA+ ++M RGLKPD+++Y V++ Sbjct: 401 GAKMLLSKMVDEGPPPGNAVFNSIINGYSKCGDMKEAIKMKQLMVRRGLKPDLFTYAVIM 460 Query: 1558 SGYVEGGAMEEACQVFGEAKKRHSKLSHVTYHSMIRGYCKLEQFEKALELMEEMKNSCLQ 1379 SGY GG MEEAC+V EAKK+HSKLS V YH++IRGYCKLEQF+KAL+L+ EMK +Q Sbjct: 461 SGYASGGQMEEACKVLSEAKKKHSKLSPVMYHTVIRGYCKLEQFDKALDLLAEMKTFGVQ 520 Query: 1378 PNADEYNKMIKSLCVKALDWETAAKLLEEMSESGLHVNEKKRCLVRAVKELQDEIVESEA 1199 NADEYNK+I+SLC+KALDWE A KLLE+M E GLH+N R L+RAVKEL+DE +E E Sbjct: 521 ANADEYNKLIQSLCLKALDWERAEKLLEKMKEDGLHLNGITRGLIRAVKELEDEGIEKEV 580 Query: 1198 VAAA 1187 A A Sbjct: 581 GAEA 584 >ref|XP_003546486.1| PREDICTED: pentatricopeptide repeat-containing protein At3g02650, mitochondrial-like isoform 1 [Glycine max] gi|356556346|ref|XP_003546487.1| PREDICTED: pentatricopeptide repeat-containing protein At3g02650, mitochondrial-like isoform 2 [Glycine max] Length = 538 Score = 472 bits (1215), Expect = e-130 Identities = 248/506 (49%), Positives = 340/506 (67%), Gaps = 2/506 (0%) Frame = -2 Query: 2698 DETGVSEGEGDNVVGADELEKTDKIEGMAQKLEDVLSLLQSSGYGKSIEPSLEDMGLALN 2519 D EGEG G D+ E + KLE VL LLQ+S G S+E L+D+ L L+ Sbjct: 68 DRQLAEEGEGAGGGG-------DRYEVDSDKLESVLRLLQTSADG-SLESCLDDIDLTLH 119 Query: 2518 EKYVLRVLETPFIPGENLIGFFKWVFRKNEILVTKEALDALVTAI-SNEFRARNA-YALW 2345 ++ V ++ ETPF+ ENLI FF W + + + VT +++LV AI N+ R + Y+LW Sbjct: 120 QQLVTKITETPFVLSENLIRFFWWAWSERSLGVTTPMVESLVLAICGNDVRKKEVVYSLW 179 Query: 2344 DLVKEAGEKDIGVVSTETFNELLSLFSRLGKGKVAFEVFNKFEDFGCVPDADTYYFTIDA 2165 DLVKE GEK+ G+++ + NEL+S F RLGKGK A EVFNKFE F CVPDADTYYFTI+A Sbjct: 180 DLVKEIGEKESGILNVKILNELISSFLRLGKGKAALEVFNKFEAFHCVPDADTYYFTIEA 239 Query: 2164 LCKRMIFGWACSVCEKMVSADKVPESKKVGKIVSHLCKGKKFRDAHTVYLWAKERKIYPS 1985 LC+R WAC VC+KMV A +P+ +KVG I+S LCKGKK ++AH VY+ A E+ P Sbjct: 240 LCRRRALDWACGVCQKMVDAQILPDGEKVGAILSWLCKGKKAKEAHGVYVVATEKGKQPP 299 Query: 1984 RSSVNFLIRSLCGKDKLMGEDKGRTKNETYSPSQAEVIAQEEKENVYLALKMLDDFSEEE 1805 + V+FL+ LCG+D E V AL+ML+D EE+ Sbjct: 300 VNVVSFLVVKLCGED----------------------------ETVKFALEMLEDIPEEK 331 Query: 1804 RKHAIKPFSFVIKGLLRIRDFGGAKKLLFQMIEAGPPPGNMIFNTIIGSLSKAGDMQEAM 1625 R+ AIKPF V++ L RI++ AK+L+ +MIE GPPPGN +FN ++ + SKAG+M +A+ Sbjct: 332 RERAIKPFLAVVRALCRIKEVDKAKELVLKMIEDGPPPGNAVFNFVVTAYSKAGEMGKAV 391 Query: 1624 VTLKIMEDRGLKPDVYSYTVVISGYVEGGAMEEACQVFGEAKKRHSKLSHVTYHSMIRGY 1445 +++ME RGL+PDVY+YTV+ S Y GG MEEA ++ E KK+H+KL V +H++IRGY Sbjct: 392 EMMRLMESRGLRPDVYTYTVLASAYSNGGEMEEAQKILAEVKKKHAKLGPVMFHTLIRGY 451 Query: 1444 CKLEQFEKALELMEEMKNSCLQPNADEYNKMIKSLCVKALDWETAAKLLEEMSESGLHVN 1265 CKLEQF++AL+L+ EMK+ + P+ DEY+K+I+SLC+KALDWE A KL EEM ESGLH+ Sbjct: 452 CKLEQFDEALKLLAEMKDYGVHPSVDEYDKLIQSLCLKALDWEMAEKLHEEMKESGLHLK 511 Query: 1264 EKKRCLVRAVKELQDEIVESEAVAAA 1187 R L+RAVKE++ E+VE+ ++ AA Sbjct: 512 GITRGLIRAVKEMEKEVVEAGSITAA 537 >ref|XP_003542095.1| PREDICTED: pentatricopeptide repeat-containing protein At3g02650, mitochondrial-like [Glycine max] Length = 539 Score = 466 bits (1200), Expect = e-128 Identities = 241/485 (49%), Positives = 331/485 (68%), Gaps = 4/485 (0%) Frame = -2 Query: 2632 DKIEGMAQKLEDVLSLLQSSGYGKSIEPSLEDMGLALNEKYVLRVLETPFIPGENLIGFF 2453 D E + LE VL LLQ+S G S+E L+DM L L+++ V ++ ETPF+ ENLI FF Sbjct: 82 DTYEVDSDTLESVLRLLQTSADG-SLESCLDDMDLTLHQQLVTKITETPFVLSENLIRFF 140 Query: 2452 KWVFRKNEILVTKEALDALVTAISNEFRARN----AYALWDLVKEAGEKDIGVVSTETFN 2285 W + + + VT +++LV AI R Y+LWDLVKE GEK+ G+++ N Sbjct: 141 WWAWSERSLEVTTPMVESLVLAICGNDDVRKKKEVVYSLWDLVKEIGEKESGLLNVRILN 200 Query: 2284 ELLSLFSRLGKGKVAFEVFNKFEDFGCVPDADTYYFTIDALCKRMIFGWACSVCEKMVSA 2105 EL+S FSRL KGK A EVF+KFE F CVPDADTYYFTI+ALC+R F WAC VC+KMV A Sbjct: 201 ELISSFSRLRKGKAALEVFDKFEAFHCVPDADTYYFTIEALCRRRAFDWACGVCQKMVDA 260 Query: 2104 DKVPESKKVGKIVSHLCKGKKFRDAHTVYLWAKERKIYPSRSSVNFLIRSLCGKDKLMGE 1925 +P+++KVG I+S LCKGKK ++AH VY+ A E+ P + V+FL+ LCG+D Sbjct: 261 RTLPDAEKVGAILSWLCKGKKAKEAHGVYVVATEKGKLPPVNVVSFLVLKLCGED----- 315 Query: 1924 DKGRTKNETYSPSQAEVIAQEEKENVYLALKMLDDFSEEERKHAIKPFSFVIKGLLRIRD 1745 E V AL++L+D EE+R+ AIKPF V++ L RI++ Sbjct: 316 -----------------------ETVKSALEILEDIPEEKRERAIKPFLAVVRALCRIKE 352 Query: 1744 FGGAKKLLFQMIEAGPPPGNMIFNTIIGSLSKAGDMQEAMVTLKIMEDRGLKPDVYSYTV 1565 AK+LL +MIE GPPPGN +FN ++ + SKAG+M +A+ +++ME RGL+PDVY+YTV Sbjct: 353 VDKAKELLLKMIENGPPPGNAVFNFVVTAYSKAGEMGKAVEMMRLMESRGLRPDVYTYTV 412 Query: 1564 VISGYVEGGAMEEACQVFGEAKKRHSKLSHVTYHSMIRGYCKLEQFEKALELMEEMKNSC 1385 + S Y GG MEEA ++ EAKK+H KL V +H++IRGYCKLEQF++AL+L+ EMK+ Sbjct: 413 LASAYSNGGEMEEAQKILAEAKKKHVKLGPVMFHTLIRGYCKLEQFDEALKLLAEMKDYG 472 Query: 1384 LQPNADEYNKMIKSLCVKALDWETAAKLLEEMSESGLHVNEKKRCLVRAVKELQDEIVES 1205 ++P+ DEY+K+I+SLC+KALDW+ A KL EEM ESGLH+ R L+RAVKE++ E+VE+ Sbjct: 473 VRPSVDEYDKLIQSLCLKALDWKMAEKLQEEMKESGLHLKGITRGLIRAVKEMEKEVVEA 532 Query: 1204 EAVAA 1190 E++ A Sbjct: 533 ESITA 537 >ref|NP_001143372.1| uncharacterized protein LOC100276004 [Zea mays] gi|195619158|gb|ACG31409.1| hypothetical protein [Zea mays] Length = 597 Score = 367 bits (941), Expect = 1e-98 Identities = 229/635 (36%), Positives = 347/635 (54%), Gaps = 16/635 (2%) Frame = -2 Query: 3055 VLQSRETLPPPLALFRYRNFWRFNSWGTLKLPSYPKLRFYSSDPTPENDEFYREAPINYV 2876 +L+S T PPP Y T L P RF SS P P D A Sbjct: 11 LLRSTITRPPPPPPQPYP---------TRTLTRVPPPRFLSSSPDPIPDSSSAAA----- 56 Query: 2875 RNEFDGSPFGANEGFADDSVVSVDSRLGXXXXXXXXXXXEKAGNLLGGFNEVASQNGDID 2696 PF E F+ + S D+ +AG + + + GD D Sbjct: 57 -----ADPFP--EAFSSPTKASQDAA--------------EAGE--DNLSSMWEEAGDAD 93 Query: 2695 ETGVSEGEGDNVVGADELEKTDKIEGMAQKLEDVLSLLQSSGYGKSIEPSLEDMGLALNE 2516 + S G D V +E+ + + + ++S+ + I +L DM + NE Sbjct: 94 DIFASPGSADAVADDEEVAR-------------ICAAVESTPEDE-IASTLADMTVDFNE 139 Query: 2515 KYVLRVL-ETPFIPGENLIGFFKWVFRKNEILVTKEALDALVTAISN--EFRARNAYALW 2345 + VL + LI F + + N + L+ LV+ +++ E +AY LW Sbjct: 140 PLLAAVLLAADQCSCKKLISLFNYAAKNNPTSKSLSNLEVLVSKLADSAEIDKADAYLLW 199 Query: 2344 DLVKEAGEKDIGVVSTETFNELLSLFSRLGKGKVAFEVFNKFEDFGCVPDADTYYFTIDA 2165 D +KE G G VST NE++++F +L K K A EVF+KF++FGC PD+D+YY I+A Sbjct: 200 DSIKEIGSVP-GSVSTPLLNEMIAIFWKLEKSKAALEVFSKFDEFGCTPDSDSYYLVIEA 258 Query: 2164 LCKRMIFGWACSVCEKMVSADKVPESKKVGKIVSHLCKGKKFRDAHTVYLWAKERKIYPS 1985 K+ +F AC VCEKM+ + P +KVG+I+ +LC+GKK + AH++YL KE+KI Sbjct: 259 ARKKSLFRSACEVCEKMIGSACFPNGEKVGRILIYLCEGKKVKMAHSLYLAVKEKKIPVP 318 Query: 1984 RSSVNFLIRSLCGKDKLMG-------EDKGRTKNE------TYSPSQAEVIAQEEKENVY 1844 + +++FL+ +L D+ +G E +G + T + + E+ N+ Sbjct: 319 KLALDFLVGALARNDETIGTALELLEEYQGESLKHAGKSFATVVHALCRLSKMEDANNLL 378 Query: 1843 LALKMLDDFSEEERKHAIKPFSFVIKGLLRIRDFGGAKKLLFQMIEAGPPPGNMIFNTII 1664 + + L+++ E K+A K F+ VI GL R + AK LL +M+ GP PGN +FN +I Sbjct: 379 MRMVQLEEYKGESLKNAGKTFATVIHGLCRKKKLEDAKALLMRMVNVGPAPGNAVFNFVI 438 Query: 1663 GSLSKAGDMQEAMVTLKIMEDRGLKPDVYSYTVVISGYVEGGAMEEACQVFGEAKKRHSK 1484 +LSK G+M++A +++ME +G+ PD+Y+Y+V++SGYV+GG ++EA + EAKK H K Sbjct: 439 TALSKQGEMEDAKGLMRMMESQGISPDIYTYSVLMSGYVKGGMIDEAHDLLREAKKIHPK 498 Query: 1483 LSHVTYHSMIRGYCKLEQFEKALELMEEMKNSCLQPNADEYNKMIKSLCVKALDWETAAK 1304 L+ V YH +IRGYCK+E FEKA E ++EMK LQPN DEY+K+I+SLC+KA+DW A K Sbjct: 499 LNRVAYHILIRGYCKMEDFEKANECLKEMKKDGLQPNVDEYDKLIQSLCLKAMDWRRAEK 558 Query: 1303 LLEEMSESGLHVNEKKRCLVRAVKELQDEIVESEA 1199 LLEEM +SGL + R L+ AVKEL+ E ++S+A Sbjct: 559 LLEEMEDSGLCLRGISRSLITAVKELEGEEMQSKA 593 >gb|ACN34333.1| unknown [Zea mays] gi|414879211|tpg|DAA56342.1| TPA: hypothetical protein ZEAMMB73_618544 [Zea mays] Length = 598 Score = 362 bits (928), Expect = 4e-97 Identities = 228/635 (35%), Positives = 344/635 (54%), Gaps = 16/635 (2%) Frame = -2 Query: 3055 VLQSRETLPPPLALFRYRNFWRFNSWGTLKLPSYPKLRFYSSDPTPENDEFYREAPINYV 2876 +L+S T PPP Y T L P RF SS P P D A Sbjct: 12 LLRSTITRPPPPPPQPYP---------TRTLTRVPPPRFLSSSPDPIPDSSSAAA----- 57 Query: 2875 RNEFDGSPFGANEGFADDSVVSVDSRLGXXXXXXXXXXXEKAGNLLGGFNEVASQNGDID 2696 PF E F+ + S D+ +AG + + + G D Sbjct: 58 -----ADPFP--EAFSSPTKASQDAA--------------EAGE--DNLSSMWEEAGHAD 94 Query: 2695 ETGVSEGEGDNVVGADELEKTDKIEGMAQKLEDVLSLLQSSGYGKSIEPSLEDMGLALNE 2516 + S G D V +E+ + V + ++S+ + I +L DM + NE Sbjct: 95 DIFASPGSADAVADDEEVAR-------------VCAAVESTPEDE-IASTLADMTVDFNE 140 Query: 2515 KYVLRVL-ETPFIPGENLIGFFKWVFRKNEILVTKEALDALVTAISN--EFRARNAYALW 2345 + VL + LI F + + N + L+ LV+ +++ E +AY LW Sbjct: 141 PLLAAVLLAAEQCSCKKLISLFNYAAKNNPASKSLSNLEVLVSKLADSAEIDKADAYLLW 200 Query: 2344 DLVKEAGEKDIGVVSTETFNELLSLFSRLGKGKVAFEVFNKFEDFGCVPDADTYYFTIDA 2165 D +KE G G VST NE++++F ++ K K A EVF+KF++FGC PD+D+YY I+A Sbjct: 201 DSIKEIGSVS-GSVSTPLLNEMIAIFWKVEKSKAALEVFSKFDEFGCTPDSDSYYLVIEA 259 Query: 2164 LCKRMIFGWACSVCEKMVSADKVPESKKVGKIVSHLCKGKKFRDAHTVYLWAKERKIYPS 1985 K+ +F AC VCEKM+ + P +KVG+I+ +LC+GKK + AH++YL KE+KI Sbjct: 260 ARKKSLFRSACEVCEKMIGSACFPNGEKVGRILIYLCEGKKVKMAHSLYLAVKEKKIPVP 319 Query: 1984 RSSVNFLIRSLCGKDKLMG-------EDKGRTKNETYSPSQAEVIA------QEEKENVY 1844 + +++FL+ +L D+ +G E +G + V A E+ N+ Sbjct: 320 KLALDFLVGALARNDETIGTALELLEEYQGESLKHAGKSFATVVHALCRLNKMEDANNLL 379 Query: 1843 LALKMLDDFSEEERKHAIKPFSFVIKGLLRIRDFGGAKKLLFQMIEAGPPPGNMIFNTII 1664 + + L+++ E K+A K F+ VI GL R + AK LL +M+ GP PGN +FN +I Sbjct: 380 MRMVQLEEYKGESLKNAGKTFATVIHGLCRKKKLEDAKALLMRMVNVGPAPGNAVFNFVI 439 Query: 1663 GSLSKAGDMQEAMVTLKIMEDRGLKPDVYSYTVVISGYVEGGAMEEACQVFGEAKKRHSK 1484 +LSK G+M++A +++ME +G+ PD+Y+Y+V++SGY +GG ++EA + EAKK H K Sbjct: 440 TALSKQGEMEDAKGLMRMMESQGISPDIYTYSVLMSGYAKGGMIDEAHDLLREAKKIHPK 499 Query: 1483 LSHVTYHSMIRGYCKLEQFEKALELMEEMKNSCLQPNADEYNKMIKSLCVKALDWETAAK 1304 L+ V YH +IRGYCK+E FEKA E ++EMK LQPN DEY+K+I+SLC+KA+DW A K Sbjct: 500 LNRVAYHILIRGYCKMEDFEKANECLKEMKKDGLQPNVDEYDKLIQSLCLKAMDWRRAEK 559 Query: 1303 LLEEMSESGLHVNEKKRCLVRAVKELQDEIVESEA 1199 LLEEM +SGL + R L+ AVKEL+ E ++S+A Sbjct: 560 LLEEMEDSGLCLRGISRSLITAVKELEGEEMQSKA 594