BLASTX nr result
ID: Akebia24_contig00026684
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00026684
         (1039 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score     E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_002270439.2| PREDICTED: pentatricopeptide repeat-containi...   416   e-114
ref|XP_007034318.1| Tetratricopeptide repeat-like superfamily pr...   390   e-106
ref|XP_004139010.1| PREDICTED: pentatricopeptide repeat-containi...   357   3e-96
ref|XP_002518071.1| pentatricopeptide repeat-containing protein,...   353   8e-95
ref|XP_006420980.1| hypothetical protein CICLE_v10004495mg [Citr...   352   1e-94
ref|XP_006420979.1| hypothetical protein CICLE_v10004495mg [Citr...   352   1e-94
ref|XP_002300166.1| pentatricopeptide repeat-containing family p...   352   2e-94
ref|XP_006489829.1| PREDICTED: pentatricopeptide repeat-containi...   351   2e-94
ref|XP_006489828.1| PREDICTED: pentatricopeptide repeat-containi...   351   2e-94
ref|XP_006489827.1| PREDICTED: pentatricopeptide repeat-containi...   351   2e-94
ref|XP_006489825.1| PREDICTED: pentatricopeptide repeat-containi...   351   2e-94
ref|XP_006360955.1| PREDICTED: pentatricopeptide repeat-containi...   334   3e-89
ref|XP_002892245.1| pentatricopeptide repeat-containing protein ...   317   4e-84
ref|XP_006418090.1| hypothetical protein EUTSA_v10007007mg [Eutr...   313   1e-82
ref|NP_171976.1| pentatricopeptide repeat-containing protein [Ar...   313   1e-82
ref|XP_006306938.1| hypothetical protein CARUB_v10008505mg, part...   301   3e-79
ref|XP_004247960.1| PREDICTED: pentatricopeptide repeat-containi...   275   2e-71
emb|CBI17228.3| unnamed protein product [Vitis vinifera]              231   3e-58
ref|XP_004295870.1| PREDICTED: pentatricopeptide repeat-containi...   209   1e-51
ref|XP_007050341.1| Basic helix-loop-helix DNA-binding superfami...   208   3e-51

>ref|XP_002270439.2| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like [Vitis vinifera]
          Length = 677

 Score = 416 bits (1070), Expect = e-114
 Identities = 208/314 (66%), Positives = 241/314 (76%)
 Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
           E HFI LIH S T  QL QIHAQ+FLHNL                   +DY + IF+ F
Sbjct: 40  ETHFIPLIHASNTLPQLHQIHAQIFLHNLFSNSRVVTQLISSSCSLKSLDYALSIFRCFD 99

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
            PN F+FNA IR L+ENS+F+ S+ HFVL LRLS++PDRLT PFVLKS A+L+   LG
Sbjct: 100 HPNLFVFNALIRGLAENSRFEGSVSHFVLMLRLSIRPDRLTLPFVLKSVAALVDVGLGRC 159

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405
           LHG + KLGL+FDSFV VSLVDMYVK G LGF LQLFDE+ +RNK  SILLWN+LINGCC
Sbjct: 160 LHGGVMKLGLEFDSFVRVSLVDMYVKIGELGFGLQLFDESPQRNKAESILLWNVLINGCC 219

Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225
           K GDL KA  LFEAMPERN GSWN LINGF +N DL++A+ LF QMPEKNVVSWTTM+ G
Sbjct: 220 KVGDLSKAASLFEAMPERNAGSWNSLINGFVRNGDLDRARELFVQMPEKNVVSWTTMING 279

Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45
            SQN DHE+ALSMF+RML+EGVR NDLT+VSAL AC +IGAL+ G +IH+Y+S NGFQLN
Sbjct: 280 FSQNGDHEKALSMFWRMLEEGVRPNDLTVVSALLACTKIGALQVGERIHNYLSSNGFQLN 339

Query: 44  RAIGTSLVDMYAKC 3
           R IGT+LVDMYAKC
Sbjct: 340 RGIGTALVDMYAKC 353

>ref|XP_007034318.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao]
 gi|590656608|ref|XP_007034319.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao]
 gi|590656611|ref|XP_007034320.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao]
 gi|590656614|ref|XP_007034321.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao]
 gi|508713347|gb|EOY05244.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao]
 gi|508713348|gb|EOY05245.1|
Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao] gi|508713349|gb|EOY05246.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao] gi|508713350|gb|EOY05247.1| Tetratricopeptide repeat-like superfamily protein isoform 1 [Theobroma cacao] Length = 682 Score = 390 bits (1002), Expect = e-106 Identities = 198/316 (62%), Positives = 240/316 (75%) Frame = -3 Query: 950 PLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQH 771 PL+ HF SLI +SKTT QL+QIHAQ+F NLS I Y I +F H Sbjct: 43 PLKTHFASLIQSSKTTLQLRQIHAQIFRRNLSSSSNLTTLLISASSSLKSIPYAISLFNH 102 Query: 770 FYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLG 591 F+ + F+FNA IR L++NS +SSI HF+L L L V+PD+LT+PFVLKS A L + LG Sbjct: 103 FHHKSIFLFNALIRGLTDNSLLESSISHFLLMLSLGVRPDKLTYPFVLKSIAGLGLRCLG 162 Query: 590 GTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILING 411 LHG+I K G++FDSFV V+LV+MYVK LGFALQ+FDE+ ERNK SILLWN+LING Sbjct: 163 LILHGRIIKSGVEFDSFVRVALVEMYVKLKELGFALQVFDESPERNKSGSILLWNVLING 222 Query: 410 CCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMV 231 CK G+L KAMELFEA PERN+GSWN LINGF +N DL+KA LFD+M EK+VVSWTTMV Sbjct: 223 YCKDGNLGKAMELFEATPERNIGSWNSLINGFMRNGDLDKAVELFDEMKEKDVVSWTTMV 282 Query: 230 AGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQ 51 G SQN DHE+ALSMF++ML+ +R NDLT+V ALSACA+IGALE+G +IHDYV NGF+ Sbjct: 283 NGFSQNGDHEKALSMFFKMLEAALRPNDLTLVPALSACAKIGALEAGARIHDYVLENGFR 342 Query: 50 LNRAIGTSLVDMYAKC 3 LN+AIG +LVDMYAKC Sbjct: 343 LNKAIGAALVDMYAKC 358 Score = 63.9 bits (154), Expect = 1e-07 Identities = 37/146 (25%), Positives = 70/146 (47%) Frame = -3 Query: 797 DYTILIFQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSA 618 D + +F + + + S+N + ++ F L +++P+ LT L + Sbjct: 261 DKAVELFDEMKEKDVVSWTTMVNGFSQNGDHEKALSMFFKMLEAALRPNDLTLVPALSAC 320 Query: 617 ASLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSI 438 A + G +H + + G + + +LVDMY K G + A ++FDET ER+ I Sbjct: 321 
AKIGALEAGARIHDYVLENGFRLNKAIGAALVDMYAKCGDIQSASKVFDETKERD----I 376 Query: 437 LLWNILINGCCKGGDLKKAMELFEAM 360 L W+++I G G ++A++ F+ M Sbjct: 377 LTWSVMIWGWAIHGYYEQAIQCFKKM 402 >ref|XP_004139010.1| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like [Cucumis sativus] gi|449505311|ref|XP_004162432.1| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like [Cucumis sativus] Length = 679 Score = 357 bits (917), Expect = 3e-96 Identities = 185/315 (58%), Positives = 229/315 (72%) Frame = -3 Query: 947 LENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHF 768 LE HFI LIH S +T +L+QIH QL+ N+ +DY I IFQ F Sbjct: 41 LETHFIDLIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRF 100 Query: 767 YSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGG 588 NS++FNA IR L+ENS+F+SSI FVL L+ + PDRLTFPFVLKSAA+L +G Sbjct: 101 ELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGR 160 Query: 587 TLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGC 408 LH I K GL+FDSFV VSLVDMYVK LG AL++FDE+ E K S+L+WN+LI+G Sbjct: 161 ALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGY 220 Query: 407 CKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVA 228 C+ GDL KA ELF++MP+++ GSWN LINGF K D+ +AK LF +MPEKNVVSWTTMV Sbjct: 221 CRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVN 280 Query: 227 GLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQL 48 G SQN D E+AL F+ ML+EG R ND TIVSALSACA+IGAL++G++IH+Y+S NGF+L Sbjct: 281 GFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKL 340 Query: 47 NRAIGTSLVDMYAKC 3 N IGT+LVDMYAKC Sbjct: 341 NLVIGTALVDMYAKC 355 >ref|XP_002518071.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223542667|gb|EEF44204.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 404 Score = 353 bits (905), Expect = 8e-95 Identities = 179/315 (56%), Positives = 225/315 (71%) Frame = -3 Query: 947 
LENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHF 768 +E H I LIH+SKT QL QIH Q+ LHNLS I Y++ IF + Sbjct: 34 IETHIIPLIHSSKTALQLHQIHTQILLHNLSSSSHITAQLISSSSLRKSIAYSLSIFNSY 93 Query: 767 YSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGG 588 + N ++FNA IR L++N ++ SI HF+L LR ++PD LTF FVLKS ASL + L Sbjct: 94 HPKNLYLFNALIRGLTDNYRYLDSIDHFILLLRSDIKPDHLTFSFVLKSIASLSLKGLAR 153 Query: 587 TLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGC 408 LHG I + GL+FDSFV +S+VD+YVK + AL++FDE+ +R S LLWN+LINGC Sbjct: 154 ALHGMILRCGLEFDSFVRISMVDVYVKLEEVKLALKVFDESPQRFHEGSTLLWNVLINGC 213 Query: 407 CKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVA 228 CK GD++KA+ELFE MP RN SWN LINGFFK DLE+A FD+MP K+VVSWTTMV Sbjct: 214 CKVGDMRKALELFEDMPLRNTASWNSLINGFFKIGDLEQAIEHFDRMPVKDVVSWTTMVN 273 Query: 227 GLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQL 48 G SQN DHE+ALS+F RML E V+ ND TIVSALSACA+IGALE+G++IH Y+ NGF+L Sbjct: 274 GFSQNGDHEKALSVFSRMLDEDVKPNDFTIVSALSACAKIGALEAGLRIHKYLKDNGFRL 333 Query: 47 NRAIGTSLVDMYAKC 3 NRA+G +LVDM+AKC Sbjct: 334 NRAVGNALVDMHAKC 348 >ref|XP_006420980.1| hypothetical protein CICLE_v10004495mg [Citrus clementina] gi|557522853|gb|ESR34220.1| hypothetical protein CICLE_v10004495mg [Citrus clementina] Length = 664 Score = 352 bits (903), Expect = 1e-94 Identities = 180/314 (57%), Positives = 224/314 (71%) Frame = -3 Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765 E H ISLIH+S +T+QL+QIHAQ+ LHNL DY + IF HF Sbjct: 31 ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSTDYALSIFGHFT 90 Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585 N IFN IR L+ENS FQS I HFV LRLSV+P+RLT+PFV KS ASL + LG Sbjct: 91 PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150 Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405 LH I K G+++D+FV V L DMYV+ G A ++FDET E+NK S+LLWN+LINGC Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVQLGKTRGAFKVFDETPEKNKSESVLLWNVLINGCS 210 Query: 404 
KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225 K G L+KA+ELF MP++NV SW LI+GF + DL+KA LF+QMPEK VVSWT M+ G Sbjct: 211 KIGYLRKAVELFGMMPKKNVASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270 Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45 SQN + E+AL+MF++ML GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L Sbjct: 271 FSQNGEAEKALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330 Query: 44 RAIGTSLVDMYAKC 3 AIGT+LVDMYAKC Sbjct: 331 GAIGTALVDMYAKC 344 >ref|XP_006420979.1| hypothetical protein CICLE_v10004495mg [Citrus clementina] gi|557522852|gb|ESR34219.1| hypothetical protein CICLE_v10004495mg [Citrus clementina] Length = 466 Score = 352 bits (903), Expect = 1e-94 Identities = 180/314 (57%), Positives = 224/314 (71%) Frame = -3 Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765 E H ISLIH+S +T+QL+QIHAQ+ LHNL DY + IF HF Sbjct: 31 ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSTDYALSIFGHFT 90 Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585 N IFN IR L+ENS FQS I HFV LRLSV+P+RLT+PFV KS ASL + LG Sbjct: 91 PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150 Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405 LH I K G+++D+FV V L DMYV+ G A ++FDET E+NK S+LLWN+LINGC Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVQLGKTRGAFKVFDETPEKNKSESVLLWNVLINGCS 210 Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225 K G L+KA+ELF MP++NV SW LI+GF + DL+KA LF+QMPEK VVSWT M+ G Sbjct: 211 KIGYLRKAVELFGMMPKKNVASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270 Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45 SQN + E+AL+MF++ML GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L Sbjct: 271 FSQNGEAEKALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330 Query: 44 RAIGTSLVDMYAKC 3 AIGT+LVDMYAKC Sbjct: 331 GAIGTALVDMYAKC 344 >ref|XP_002300166.1| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|222847424|gb|EEE84971.1| pentatricopeptide 
repeat-containing family protein [Populus trichocarpa] Length = 719 Score = 352 bits (902), Expect = 2e-94 Identities = 181/318 (56%), Positives = 225/318 (70%), Gaps = 1/318 (0%) Frame = -3 Query: 953 TPLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQ 774 TP E HFISLIH SKT QL QIHAQ+ +HNLS I++++ +F Sbjct: 78 TPTEAHFISLIHGSKTILQLHQIHAQIIIHNLSSSSLITTQLISSSSLRKSINHSLAVFN 137 Query: 773 HFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRL 594 H N F FNA IR L+ NS F ++IFHF L LR ++PDRLT+PFVLKS A L L Sbjct: 138 HHKPKNLFTFNALIRGLTTNSHFFNAIFHFRLMLRSGIKPDRLTYPFVLKSMAGLFSTEL 197 Query: 593 GGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLER-NKVSSILLWNILI 417 G +H I + G++ DSFV VSLVDMYVK LG A ++FDE+ ER + SS LLWN+LI Sbjct: 198 GMAIHCMILRCGIELDSFVRVSLVDMYVKVEKLGSAFKVFDESPERFDSGSSALLWNVLI 257 Query: 416 NGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTT 237 GCCK G +KKA++LF+AMP++ SW+ LI+GF KN D+++A LFDQMPEKNVVSWTT Sbjct: 258 KGCCKAGSMKKAVKLFKAMPKKENVSWSTLIDGFAKNGDMDRAMELFDQMPEKNVVSWTT 317 Query: 236 MVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNG 57 MV G S+N D E+ALSMF +ML+EGVR N TIVSALSACA+IG LE+G++IH Y+ NG Sbjct: 318 MVDGFSRNGDSEKALSMFSKMLEEGVRPNAFTIVSALSACAKIGGLEAGLRIHKYIKDNG 377 Query: 56 FQLNRAIGTSLVDMYAKC 3 L A+GT+LVDMYAKC Sbjct: 378 LHLTEALGTALVDMYAKC 395 >ref|XP_006489829.1| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like isoform X5 [Citrus sinensis] Length = 457 Score = 351 bits (901), Expect = 2e-94 Identities = 181/314 (57%), Positives = 222/314 (70%) Frame = -3 Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765 E H ISLIH+S +T+QL+QIHAQ+ LHNL IDY + IF HF Sbjct: 31 ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSIDYALSIFDHFT 90 Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585 N IFN IR L+ENS FQS I HFV LRLSV+P+RLT+PFV KS ASL + LG Sbjct: 91 PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150 Query: 584 
LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405 LH I K G+++D+FV V L DMYVK G A ++FDET ERNK S+LLWN+LINGC Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVKLGKTRGAFKMFDETPERNKSESVLLWNVLINGCS 210 Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225 K G L+KA+ELF MP++N SW LI+GF + DL+KA LF+QMPEK VVSWT M+ G Sbjct: 211 KIGYLRKAVELFGVMPKKNAASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270 Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45 SQN + E AL+MF++ML GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L Sbjct: 271 FSQNGEAETALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330 Query: 44 RAIGTSLVDMYAKC 3 AIGT+LV MYAKC Sbjct: 331 GAIGTALVHMYAKC 344 Score = 66.6 bits (161), Expect = 2e-08 Identities = 47/199 (23%), Positives = 91/199 (45%), Gaps = 39/199 (19%) Frame = -3 Query: 533 VSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKGGDLKKAMELFEAMPE 354 VSL+D +++ G L A +LF++ E+ VS W +ING + G+ + A+ +F M + Sbjct: 234 VSLIDGFMRKGDLKKAGELFEQMPEKGVVS----WTAMINGFSQNGEAETALAMFFQMLD 289 Query: 353 RNVGS---------------------------------------WNCLINGFFKNKDLEK 291 V + L++ + K ++E Sbjct: 290 AGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLKGAIGTALVHMYAKCGNIEA 349 Query: 290 AKLLFDQMPEKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACAR 111 A L+F + EK++++WT M+ GL+ + +EQA+ F +M+ G+ + ++ L+AC Sbjct: 350 ASLVFGETKEKDLLTWTAMIWGLAIHGRYEQAIQCFKKMMYSGIEPDGTVFLAILTACWY 409 Query: 110 IGALESGVQIHDYVSRNGF 54 G ++ + D +S + F Sbjct: 410 SGQVKLALNFFDSMSFDYF 428 >ref|XP_006489828.1| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like isoform X4 [Citrus sinensis] Length = 458 Score = 351 bits (901), Expect = 2e-94 Identities = 181/314 (57%), Positives = 222/314 (70%) Frame = -3 Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765 E H ISLIH+S +T+QL+QIHAQ+ LHNL IDY + IF HF Sbjct: 31 ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSIDYALSIFDHFT 90 Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585 N IFN IR L+ENS FQS I HFV 
LRLSV+P+RLT+PFV KS ASL + LG Sbjct: 91 PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150 Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405 LH I K G+++D+FV V L DMYVK G A ++FDET ERNK S+LLWN+LINGC Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVKLGKTRGAFKMFDETPERNKSESVLLWNVLINGCS 210 Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225 K G L+KA+ELF MP++N SW LI+GF + DL+KA LF+QMPEK VVSWT M+ G Sbjct: 211 KIGYLRKAVELFGVMPKKNAASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270 Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45 SQN + E AL+MF++ML GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L Sbjct: 271 FSQNGEAETALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330 Query: 44 RAIGTSLVDMYAKC 3 AIGT+LV MYAKC Sbjct: 331 GAIGTALVHMYAKC 344 Score = 66.6 bits (161), Expect = 2e-08 Identities = 47/199 (23%), Positives = 91/199 (45%), Gaps = 39/199 (19%) Frame = -3 Query: 533 VSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKGGDLKKAMELFEAMPE 354 VSL+D +++ G L A +LF++ E+ VS W +ING + G+ + A+ +F M + Sbjct: 234 VSLIDGFMRKGDLKKAGELFEQMPEKGVVS----WTAMINGFSQNGEAETALAMFFQMLD 289 Query: 353 RNVGS---------------------------------------WNCLINGFFKNKDLEK 291 V + L++ + K ++E Sbjct: 290 AGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLKGAIGTALVHMYAKCGNIEA 349 Query: 290 AKLLFDQMPEKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACAR 111 A L+F + EK++++WT M+ GL+ + +EQA+ F +M+ G+ + ++ L+AC Sbjct: 350 ASLVFGETKEKDLLTWTAMIWGLAIHGRYEQAIQCFKKMMYSGIEPDGTVFLAILTACWY 409 Query: 110 IGALESGVQIHDYVSRNGF 54 G ++ + D +S + F Sbjct: 410 SGQVKLALNFFDSMSFDYF 428 >ref|XP_006489827.1| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like isoform X3 [Citrus sinensis] Length = 466 Score = 351 bits (901), Expect = 2e-94 Identities = 181/314 (57%), Positives = 222/314 (70%) Frame = -3 Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765 E H ISLIH+S +T+QL+QIHAQ+ LHNL IDY + IF HF Sbjct: 31 
ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSIDYALSIFDHFT 90 Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585 N IFN IR L+ENS FQS I HFV LRLSV+P+RLT+PFV KS ASL + LG Sbjct: 91 PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150 Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405 LH I K G+++D+FV V L DMYVK G A ++FDET ERNK S+LLWN+LINGC Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVKLGKTRGAFKMFDETPERNKSESVLLWNVLINGCS 210 Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225 K G L+KA+ELF MP++N SW LI+GF + DL+KA LF+QMPEK VVSWT M+ G Sbjct: 211 KIGYLRKAVELFGVMPKKNAASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270 Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45 SQN + E AL+MF++ML GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L Sbjct: 271 FSQNGEAETALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330 Query: 44 RAIGTSLVDMYAKC 3 AIGT+LV MYAKC Sbjct: 331 GAIGTALVHMYAKC 344 Score = 66.6 bits (161), Expect = 2e-08 Identities = 47/199 (23%), Positives = 91/199 (45%), Gaps = 39/199 (19%) Frame = -3 Query: 533 VSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKGGDLKKAMELFEAMPE 354 VSL+D +++ G L A +LF++ E+ VS W +ING + G+ + A+ +F M + Sbjct: 234 VSLIDGFMRKGDLKKAGELFEQMPEKGVVS----WTAMINGFSQNGEAETALAMFFQMLD 289 Query: 353 RNVGS---------------------------------------WNCLINGFFKNKDLEK 291 V + L++ + K ++E Sbjct: 290 AGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLKGAIGTALVHMYAKCGNIEA 349 Query: 290 AKLLFDQMPEKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACAR 111 A L+F + EK++++WT M+ GL+ + +EQA+ F +M+ G+ + ++ L+AC Sbjct: 350 ASLVFGETKEKDLLTWTAMIWGLAIHGRYEQAIQCFKKMMYSGIEPDGTVFLAILTACWY 409 Query: 110 IGALESGVQIHDYVSRNGF 54 G ++ + D +S + F Sbjct: 410 SGQVKLALNFFDSMSFDYF 428 >ref|XP_006489825.1| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like isoform X1 [Citrus sinensis] gi|568873396|ref|XP_006489826.1| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like isoform X2 [Citrus sinensis] 
Length = 664 Score = 351 bits (901), Expect = 2e-94 Identities = 181/314 (57%), Positives = 222/314 (70%) Frame = -3 Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765 E H ISLIH+S +T+QL+QIHAQ+ LHNL IDY + IF HF Sbjct: 31 ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSIDYALSIFDHFT 90 Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585 N IFN IR L+ENS FQS I HFV LRLSV+P+RLT+PFV KS ASL + LG Sbjct: 91 PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150 Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405 LH I K G+++D+FV V L DMYVK G A ++FDET ERNK S+LLWN+LINGC Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVKLGKTRGAFKMFDETPERNKSESVLLWNVLINGCS 210 Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225 K G L+KA+ELF MP++N SW LI+GF + DL+KA LF+QMPEK VVSWT M+ G Sbjct: 211 KIGYLRKAVELFGVMPKKNAASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270 Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45 SQN + E AL+MF++ML GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L Sbjct: 271 FSQNGEAETALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330 Query: 44 RAIGTSLVDMYAKC 3 AIGT+LV MYAKC Sbjct: 331 GAIGTALVHMYAKC 344 Score = 66.6 bits (161), Expect = 2e-08 Identities = 47/199 (23%), Positives = 91/199 (45%), Gaps = 39/199 (19%) Frame = -3 Query: 533 VSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKGGDLKKAMELFEAMPE 354 VSL+D +++ G L A +LF++ E+ VS W +ING + G+ + A+ +F M + Sbjct: 234 VSLIDGFMRKGDLKKAGELFEQMPEKGVVS----WTAMINGFSQNGEAETALAMFFQMLD 289 Query: 353 RNVGS---------------------------------------WNCLINGFFKNKDLEK 291 V + L++ + K ++E Sbjct: 290 AGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLKGAIGTALVHMYAKCGNIEA 349 Query: 290 AKLLFDQMPEKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACAR 111 A L+F + EK++++WT M+ GL+ + +EQA+ F +M+ G+ + ++ L+AC Sbjct: 350 ASLVFGETKEKDLLTWTAMIWGLAIHGRYEQAIQCFKKMMYSGIEPDGTVFLAILTACWY 409 Query: 110 IGALESGVQIHDYVSRNGF 54 G ++ + D +S + F Sbjct: 410 SGQVKLALNFFDSMSFDYF 428 
>ref|XP_006360955.1| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like isoform X1 [Solanum tuberosum] gi|565390461|ref|XP_006360956.1| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like isoform X2 [Solanum tuberosum] Length = 666 Score = 334 bits (857), Expect = 3e-89 Identities = 167/315 (53%), Positives = 222/315 (70%), Gaps = 1/315 (0%) Frame = -3 Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765 E HFISLIH+SK T QL+QIH Q+ NLS I+Y + IF F Sbjct: 28 EPHFISLIHSSKNTLQLQQIHGQIIRKNLSSNSRIVTQLISSASLHKSINYGLSIFNCFL 87 Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585 N F+FN IR L ENS F+ SI +F +++ V+PD+LT+PFVLKS +L +GG Sbjct: 88 DKNVFLFNVLIRGLKENSLFEKSILYFRKMVKMGVRPDKLTYPFVLKSVTALGEKGVGGG 147 Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405 +H + K+GL++D+FV V LV++YVK + FALQLFDE+ ERNKV S++LWN++INGCC Sbjct: 148 VHCGVLKVGLEYDTFVRVCLVELYVKVELVDFALQLFDESPERNKVESVILWNVVINGCC 207 Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMP-EKNVVSWTTMVA 228 K G + A+ LFE MPERNVGSWN LI+G +N +++KA LFD+MP EKNVVSWT M+ Sbjct: 208 KIGRMSNALALFEEMPERNVGSWNTLISGLLRNGEVDKAMELFDEMPNEKNVVSWTCMIH 267 Query: 227 GLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQL 48 GL N H++AL +F++M++EGV+ N LT+VSALSACA+ GALE+G +IHD + NG L Sbjct: 268 GLMLNGLHQKALDLFFKMVEEGVKPNGLTVVSALSACAKTGALEAGKKIHDNIMNNGLHL 327 Query: 47 NRAIGTSLVDMYAKC 3 N A+G +L+DMYAKC Sbjct: 328 NAAVGNALLDMYAKC 342 >ref|XP_002892245.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297338087|gb|EFH68504.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. 
lyrata] Length = 664 Score = 317 bits (813), Expect = 4e-84 Identities = 161/325 (49%), Positives = 217/325 (66%) Frame = -3 Query: 977 YYINMNRCTPLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXI 798 Y+ R +P E+HFISLIHT K T L+ +HA + + Sbjct: 18 YFPADRRASPDESHFISLIHTCKDTVSLRLVHAHILRRGVLSSRVAAQLVSCSSLLKSP- 76 Query: 797 DYTILIFQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSA 618 DY++ IF++ N F+FNA IR L+EN++F+ S+ HF+L L L V+PDRLTFPFVLKS Sbjct: 77 DYSLSIFRNSEERNPFVFNALIRGLTENARFECSVRHFILMLTLGVKPDRLTFPFVLKSN 136 Query: 617 ASLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSI 438 + L LG LH K +D DSFV VSLVDMY K G L A Q+F+ET +R K SI Sbjct: 137 SKLGFRWLGRALHAATLKNFVDCDSFVRVSLVDMYAKTGQLNHAFQVFEETPDRIKKESI 196 Query: 437 LLWNILINGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEK 258 LLWN+L+NG C+ D++ A LF +MPERN GSW+ LI G+ N +L +AK LF+ MPEK Sbjct: 197 LLWNVLVNGYCRAKDMQMATTLFRSMPERNSGSWSTLIKGYVDNGELNRAKQLFELMPEK 256 Query: 257 NVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIH 78 NVVSWTT++ G SQ D+E A+S ++ ML++G++ N+ T+ + LSAC++ GAL SG++IH Sbjct: 257 NVVSWTTLINGFSQTGDYETAISTYFEMLEKGLKPNEYTVAAVLSACSKSGALGSGIRIH 316 Query: 77 DYVSRNGFQLNRAIGTSLVDMYAKC 3 Y+ NG +L+RAIGTSL+DMYAKC Sbjct: 317 GYILDNGIKLDRAIGTSLLDMYAKC 341 >ref|XP_006418090.1| hypothetical protein EUTSA_v10007007mg [Eutrema salsugineum] gi|557095861|gb|ESQ36443.1| hypothetical protein EUTSA_v10007007mg [Eutrema salsugineum] Length = 665 Score = 313 bits (801), Expect = 1e-82 Identities = 157/320 (49%), Positives = 216/320 (67%) Frame = -3 Query: 962 NRCTPLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTIL 783 +R +P E+H ISLIH K T L+++HA + + DY++ Sbjct: 23 HRASPDESHIISLIHACKDTVCLRRVHAYILRRGVLSSRVAAQLVSSSSLLKSP-DYSLS 81 Query: 782 IFQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLV 603 IF++ N F+FNA IR L+E+++F+ S+ HF+L LRL V+PDRLTFPFVLKS + L Sbjct: 82 IFRYLKEKNLFVFNALIRGLAESARFKCSVRHFILMLRLGVRPDRLTFPFVLKSNSKLGF 141 Query: 602 PRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNI 
423 LG LH K +D DSFV VSLVDMY K G L +A Q+FDE+ + K+ ILLWN+ Sbjct: 142 RWLGRALHAAALKDSVDCDSFVRVSLVDMYAKTGGLKYAFQVFDESPDWIKMERILLWNV 201 Query: 422 LINGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSW 243 LING C+ D++ A LF +MPERN GSW+ LI G+ N DL +A+ LF+ MPEK+VVSW Sbjct: 202 LINGYCRAKDMQMATTLFGSMPERNSGSWSTLIKGYVDNGDLNRARQLFEVMPEKSVVSW 261 Query: 242 TTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSR 63 TT++ G SQN D+E A+S ++ ML+EG++ N+ T+ + LSAC++ GAL SG++IH Y+ Sbjct: 262 TTLINGFSQNGDYESAISTYFEMLEEGMKPNEYTVAAVLSACSKSGALGSGIRIHGYILD 321 Query: 62 NGFQLNRAIGTSLVDMYAKC 3 NG L+RAIGT+L+DMYAKC Sbjct: 322 NGINLDRAIGTALIDMYAKC 341 >ref|NP_171976.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|75192500|sp|Q9MAT2.1|PPR10_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At1g04840 gi|7211995|gb|AAF40466.1|AC004809_24 F13M7.17 [Arabidopsis thaliana] gi|332189629|gb|AEE27750.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 665 Score = 313 bits (801), Expect = 1e-82 Identities = 159/325 (48%), Positives = 217/325 (66%) Frame = -3 Query: 977 YYINMNRCTPLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXI 798 Y+ + +P E+HFISLIH K T L+ +HAQ+ + Sbjct: 18 YFPADRQASPDESHFISLIHACKDTASLRHVHAQILRRGVLSSRVAAQLVSCSSLLKSP- 76 Query: 797 DYTILIFQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSA 618 DY++ IF++ N F+ NA IR L+EN++F+SS+ HF+L LRL V+PDRLTFPFVLKS Sbjct: 77 DYSLSIFRNSEERNPFVLNALIRGLTENARFESSVRHFILMLRLGVKPDRLTFPFVLKSN 136 Query: 617 ASLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSI 438 + L LG LH K +D DSFV +SLVDMY K G L A Q+F+E+ +R K SI Sbjct: 137 SKLGFRWLGRALHAATLKNFVDCDSFVRLSLVDMYAKTGQLKHAFQVFEESPDRIKKESI 196 Query: 437 LLWNILINGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEK 258 L+WN+LING C+ D+ A LF +MPERN GSW+ LI G+ + +L +AK LF+ MPEK Sbjct: 197 LIWNVLINGYCRAKDMHMATTLFRSMPERNSGSWSTLIKGYVDSGELNRAKQLFELMPEK 256 Query: 257 NVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIH 78 NVVSWTT++ 
G SQ D+E A+S ++ ML++G++ N+ TI + LSAC++ GAL SG++IH Sbjct: 257 NVVSWTTLINGFSQTGDYETAISTYFEMLEKGLKPNEYTIAAVLSACSKSGALGSGIRIH 316 Query: 77 DYVSRNGFQLNRAIGTSLVDMYAKC 3 Y+ NG +L+RAIGT+LVDMYAKC Sbjct: 317 GYILDNGIKLDRAIGTALVDMYAKC 341 >ref|XP_006306938.1| hypothetical protein CARUB_v10008505mg, partial [Capsella rubella] gi|482575649|gb|EOA39836.1| hypothetical protein CARUB_v10008505mg, partial [Capsella rubella] Length = 672 Score = 301 bits (771), Expect = 3e-79 Identities = 155/319 (48%), Positives = 212/319 (66%) Frame = -3 Query: 959 RCTPLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILI 780 R +P E+HFISLIH K T L+++HAQ+ + DY + I Sbjct: 31 RASPDESHFISLIHACKDTVSLRRVHAQILRRGVLSSRVAAQLVSCSGLLQSP-DYCLSI 89 Query: 779 FQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVP 600 F++F N F+FN IR L+EN++ SS+ HF+L LRL V+PDRLTFPFVLKS + L Sbjct: 90 FRNFEEKNLFVFNVLIRGLTENARSASSVRHFILMLRLGVRPDRLTFPFVLKSNSKLGFR 149 Query: 599 RLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNIL 420 LG LH K +D DSFV VSLVDMY K L +A Q+FDE+ +R K S LL N+L Sbjct: 150 WLGRALHAATLKNFVDCDSFVRVSLVDMYAKTRQLNYAFQVFDESPDRIKKESTLLSNVL 209 Query: 419 INGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWT 240 I G C+ D++ A +LF +MPERN GSW+ LI G+ L +AK LF+ MPEK+VV+WT Sbjct: 210 IKGYCRAKDMQMATKLFRSMPERNSGSWSTLIKGYADCSQLNRAKQLFELMPEKHVVTWT 269 Query: 239 TMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRN 60 T++ G SQN +E A+S ++ ML++G++ N+ T+ +ALSAC++ GAL SG++IH Y+ N Sbjct: 270 TLINGFSQNGYYETAISTYFEMLEKGLKPNEYTVAAALSACSKSGALGSGIRIHAYILDN 329 Query: 59 GFQLNRAIGTSLVDMYAKC 3 G +L+RAIGT+L+DMYAKC Sbjct: 330 GIRLDRAIGTALIDMYAKC 348 >ref|XP_004247960.1| PREDICTED: pentatricopeptide repeat-containing protein At1g04840-like [Solanum lycopersicum] Length = 547 Score = 275 bits (703), Expect = 2e-71 Identities = 132/223 (59%), Positives = 175/223 (78%), Gaps = 1/223 (0%) Frame = -3 Query: 668 LSVQPDRLTFPFVLKSAASLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGF 489 + V+PD+LT+PFVLKS +L R+GG +H I K+GL++D+FV V 
LV+MYVK + F
Sbjct: 1 MGVRPDKLTYPFVLKSVTALGDKRVGGVVHCGILKMGLEYDTFVRVCLVEMYVKAELVDF 60

Query: 488 ALQLFDETLERNKVSSILLWNILINGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFK 309
ALQLFDE+ ERNKV S++LWN++INGCCK G + KA+ LFE MPERNVGSWN LI+G +
Sbjct: 61 ALQLFDESSERNKVESVILWNVVINGCCKIGRVSKALALFEEMPERNVGSWNTLISGLLR 120

Query: 308 NKDLEKAKLLFDQMP-EKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVS 132
N +++KA LFD+M EKNVVSWT M+ GL NE H++AL +F++M++EGV+ N LT+VS
Sbjct: 121 NGEVDKAMELFDEMTNEKNVVSWTCMIHGLMLNELHQKALDLFFKMVEEGVKPNGLTVVS 180

Query: 131 ALSACARIGALESGVQIHDYVSRNGFQLNRAIGTSLVDMYAKC 3
ALSACA+ GALE+G +IHD + NG LN A+G +L+DMYAKC
Sbjct: 181 ALSACAKTGALEAGKKIHDNIVNNGLHLNAAVGNALLDMYAKC 223

>emb|CBI17228.3| unnamed protein product [Vitis vinifera]
Length = 590

Score = 231 bits (590), Expect = 3e-58
Identities = 128/304 (42%), Positives = 180/304 (59%), Gaps = 14/304 (4%)
Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
E HFI LIH S T QL QIHAQ+FLHNL +DY + IF+ F
Sbjct: 40 ETHFIPLIHASNTLPQLHQIHAQIFLHNLFSNSRVVTQLISSSCSLKSLDYALSIFRCFD 99

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
PN F+FNA IR L+ENS+F+ S+ HFVL LRLS++PDRLT PFVLKS A+L+ LG
Sbjct: 100 HPNLFVFNALIRGLAENSRFEGSVSHFVLMLRLSIRPDRLTLPFVLKSVAALVDVGLGRC 159

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWN------- 426
LHG + KLGL+FDSFV VSLVDMYVK G LGF LQLFDE+ +RNK SILLWN
Sbjct: 160 LHGGVMKLGLEFDSFVRVSLVDMYVKIGELGFGLQLFDESPQRNKAESILLWNGVRPNDL 219

Query: 425 ---ILINGCCKGGDLKKAMELFEAMP----ERNVGSWNCLINGFFKNKDLEKAKLLFDQM 267
+ C K G L+ + + + N G L++ + K +++ A +F +
Sbjct: 220 TVVSALLACTKIGALQVGERIHNYLSSNGFQLNRGIGTALVDMYAKCGNIKSASRVFVET 279

Query: 266 PEKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGV 87
K++++W+ M+ G + + +QAL F +M G+ +++ ++ L+AC+ G ++ G+
Sbjct: 280 KGKDLLTWSVMIWGWAIHGCFDQALQCFVKMKSAGINPDEVIFLAILTACSHSGNVDQGL 339

Query: 86 QIHD 75
Sbjct: 340 NFFE 343

Score = 98.2 bits (243), Expect = 5e-18
Identities = 79/229 (34%), Positives = 109/229 (47%), Gaps = 24/229 (10%)
Frame = -3

Query: 617 ASLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSI 438
AS +P+L +H QI L +S V+ L+ SL +AL +F N +
Sbjct: 49 ASNTLPQLH-QIHAQIFLHNLFSNSRVVTQLISSSCSLKSLDYALSIFRCFDHPN----L 103

Query: 437 LLWNILINGCCKGGDLKKAMELFEAM------PER--------------NVGSWNCLING 318
++N LI G + + ++ F M P+R +VG CL G
Sbjct: 104 FVFNALIRGLAENSRFEGSVSHFVLMLRLSIRPDRLTLPFVLKSVAALVDVGLGRCLHGG 163

Query: 317 FFKNKDLEKAKLLFDQMPEKNVVSWTTMVA----GLSQNEDHEQALSMFYRMLKEGVRAN 150
K L FD ++V + GL ++ Q +L GVR N
Sbjct: 164 VMK------LGLEFDSFVRVSLVDMYVKIGELGFGLQLFDESPQRNKAESILLWNGVRPN 217

Query: 149 DLTIVSALSACARIGALESGVQIHDYVSRNGFQLNRAIGTSLVDMYAKC 3
DLT+VSAL AC +IGAL+ G +IH+Y+S NGFQLNR IGT+LVDMYAKC
Sbjct: 218 DLTVVSALLACTKIGALQVGERIHNYLSSNGFQLNRGIGTALVDMYAKC 266

>ref|XP_004295870.1| PREDICTED: pentatricopeptide repeat-containing protein At5g48910-like [Fragaria vesca subsp. vesca]
Length = 729

Score = 209 bits (533), Expect = 1e-51
Identities = 110/264 (41%), Positives = 168/264 (63%), Gaps = 3/264 (1%)
Frame = -3

Query: 785 LIFQHFY-SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRL--SVQPDRLTFPFVLKSAA 615
LIF+HF +PN F +NA +++ ++N+ + +I +F +L + PD TF VLK+ A
Sbjct: 146 LIFRHFLETPNIFAYNALLKAFAQNNDWHHTILYFNSQLLSPNAPTPDEYTFTSVLKACA 205

Query: 614 SLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSIL 435
LL GG +H + K G + + FV SL DMY KFG +G A +LFDE R+ VS
Sbjct: 206 GLLRVTEGGKVHCLVTKFGCEENLFVRNSLTDMYFKFGKVGVAQKLFDEMRVRDVVS--- 262

Query: 434 LWNILINGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKN 255
WN L+ G C G++ +A +F+ M E++ SW+ +I+ + K +LE+A+ LFD +P++N
Sbjct: 263 -WNTLVAGYCVSGEVGEARRVFDGMVEKSSFSWSTMISAYAKLGELEEAQRLFDAVPQRN 321

Query: 254 VVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHD 75
VVSW M+AG +QNE +++A+ +F M + G+ ND+T+VS LSACA +GAL+ G I
Sbjct: 322 VVSWNAMIAGYAQNEKYDEAVGLFREMQECGLAPNDVTLVSVLSACAHLGALDLGKWIDR 381

Query: 74 YVSRNGFQLNRAIGTSLVDMYAKC 3
++ R+G L +G +L DMYAKC
Sbjct: 382 FIKRSGMDLGLFLGNALADMYAKC 405

Score = 62.4 bits (150), Expect = 3e-07
Identities = 53/238 (22%), Positives = 97/238 (40%), Gaps = 41/238 (17%)
Frame = -3

Query: 758 NSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGTLH 579
N +NA I ++N ++ ++ F + P+ +T VL + A L LG +
Sbjct: 321 NVVSWNAMIAGYAQNEKYDEAVGLFREMQECGLAPNDVTLVSVLSACAHLGALDLGKWID 380

Query: 578 GQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKG 399
I + G+D F+ +L DMY K G + A ++F+ ER+ +S W+I+I G
Sbjct: 381 RFIKRSGMDLGLFLGNALADMYAKCGCITEARRVFNNMQERDVIS----WSIIITGLAMN 436

Query: 398 GDLKKAMELFEAMPER----------------------------------------NVGS 339
G +A E F+ M E +
Sbjct: 437 GHADQAFECFDKMIEHGLKPNEITFMGLLTACTHAGLVDKGLEYFNMMEKAFGISPKIEH 496

Query: 338 WNCLINGFFKNKDLEKAKLLFDQMPEK-NVVSWTTMVAGLSQNEDHEQALSMFYRMLK 168
+ C+++ + L KA+ + + MP K NV+ W ++ G +D ++ + R+L+
Sbjct: 497 YGCVVDLLSRASRLAKAEDMINSMPMKPNVIVWGALLGGCRTYKDTDRGERVVRRILE 554

>ref|XP_007050341.1| Basic helix-loop-helix DNA-binding superfamily protein [Theobroma cacao] gi|508702602|gb|EOX94498.1| Basic helix-loop-helix DNA-binding superfamily protein [Theobroma cacao]
Length = 600

Score = 208 bits (530), Expect = 3e-51
Identities = 111/311 (35%), Positives = 176/311 (56%), Gaps = 1/311 (0%)
Frame = -3

Query: 932 ISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFYSPNS 753
+ I QL+ I+A + N +Q +DY IL F PN
Sbjct: 52 VDQIKKCSNLNQLETIYATMIKTNANQDCFLTNQFVSACATFCRMDYAILAFTQMQKPNV 111

Query: 752 FIFNAFIRSLSE-NSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGTLHG 576
F++NA I+ L ++ FQ+ +H + LR V P TF ++K+ + G ++HG
Sbjct: 112 FVYNALIKGLVHCHNPFQALDYHKHM-LRAGVWPSSFTFSSLVKACGLVSELGFGESVHG 170

Query: 575 QIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKGG 396
Q+ K G + FV +LVD Y G + ++FDE +R+ + W +++G K G
Sbjct: 171 QVWKHGFESHVFVQTALVDFYANVGKFAESKRVFDEMPDRD----VFAWTTMVSGFLKAG 226

Query: 395 DLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAGLSQ 216
DL + LF+ MPERN +WN +I+G+ + D+E A+L F+QMP K+++SWT+M+ S+
Sbjct: 227 DLVSSRRLFDEMPERNTATWNAMIDGYARVGDVESAELFFNQMPVKDIISWTSMINCYSK 286

Query: 215 NEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLNRAI 36
N+ +AL++F M + V +++T+ S +SACA +GAL +G +IH YV +NGF L+ I
Sbjct: 287 NKQFREALAVFEEMRRNKVSPDEVTMASVISACAHLGALNTGKEIHHYVMQNGFYLDVYI 346

Query: 35 GTSLVDMYAKC 3
G++LVDMYAKC
Sbjct: 347 GSALVDMYAKC 357

Score = 66.2 bits (160), Expect = 2e-08
Identities = 42/147 (28%), Positives = 70/147 (47%)
Frame = -3

Query: 785 LIFQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLL 606
L F + + + I S+N QF+ ++ F R V PD +T V+ + A L
Sbjct: 264 LFFNQMPVKDIISWTSMINCYSKNKQFREALAVFEEMRRNKVSPDEVTMASVISACAHLG 323

Query: 605 VPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWN 426
G +H + + G D ++ +LVDMY K GSL +L F + E+N + WN
Sbjct: 324 ALNTGKEIHHYVMQNGFYLDVYIGSALVDMYAKCGSLERSLLAFFKLREKN----LFCWN 379

Query: 425 ILINGCCKGGDLKKAMELFEAMPERNV 345
+I G G ++A+ +F++M +V
Sbjct: 380 SVIEGLAVHGYAQEALAMFDSMERHHV 406