BLASTX nr result
ID: Sinomenium21_contig00020099
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Sinomenium21_contig00020099 (1615 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containi... 424 e-116 ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phas... 419 e-114 ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citr... 414 e-113 ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phas... 411 e-112 ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containi... 407 e-111 ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein... 400 e-109 ref|XP_003623530.1| Pentatricopeptide repeat-containing protein ... 399 e-108 ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containi... 397 e-108 ref|XP_002309173.2| pentatricopeptide repeat-containing family p... 386 e-104 gb|AHB18409.1| pentatricopeptide repeat-containing protein [Goss... 382 e-103 gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis] 378 e-102 ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containi... 373 e-100 ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containi... 343 1e-91 ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutr... 332 4e-88 ref|XP_002533822.1| pentatricopeptide repeat-containing protein,... 325 5e-86 ref|NP_181376.3| pentatricopeptide repeat-containing protein [Ar... 311 5e-82 gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|3137637... 311 5e-82 ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Caps... 311 7e-82 ref|XP_002879744.1| pentatricopeptide repeat-containing protein ... 309 2e-81 ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [A... 274 7e-71 >ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Vitis vinifera] Length = 505 Score = 424 bits (1089), Expect = e-116 Identities = 225/423 (53%), Positives = 288/423 (68%), Gaps = 28/423 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 LI+SF IY DPTP+ Y F+++TLT+ QF LPP+LHR+E +EKF+ PE F +LI+ Y Sbjct: 64 LIDSFRIYNSDPTPNAYRFVISTLTRCRQFHHLPPLLHRLEKVEKFETPEFIFTNLIKVY 123 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G ANM DAVD+FFRIP FRC PS +SLNALL VLCK+REGL MV Q+LLKS MN+RL+ Sbjct: 124 GNANMFEDAVDLFFRIPNFRCVPSVYSLNALLYVLCKRREGLVMVPQILLKSQAMNIRLE 183 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ESSFRIL++ALC IKK +AI ILN M + GY D+ S+ILS+LC+ LS EVL F Sbjct: 184 ESSFRILVAALCRIKKHNYAIRILNYMLNDGYAVDAKMCSIILSSLCEQKGLSGDEVLRF 243 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 + EMRK GF P VD NVI FLVK +GVT+ Sbjct: 244 MEEMRKLGFYPGRVDCNNVIRFLVKEGMVMDALGVFDQMKTDGIKPDTVSYTMILNGVTA 303 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 +++KA+D+FDEMLVLGVVPDI+ YNVY++ L KQNNIE G++ML M ELGCKP+ VT Sbjct: 304 DGDYEKADDLFDEMLVLGVVPDIHAYNVYINSLCKQNNIEEGVRMLASMRELGCKPDYVT 363 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YN LL + ++ ++ E+ R M + GVQ N TYRI++DG + GE+ E+C LLEEM Sbjct: 364 YNMLLEGMSKVRDLGGMRELAREMELE-GVQWNWETYRIMLDGLVGKGEIDESCSLLEEM 422 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEA-LLSMFELSFK 1176 L F TFDE+IC LC+RGL +AL+++ +M+ + IAPG+ AWEA LL E SF Sbjct: 423 LDKYFSCWCSTFDEIICELCQRGLVCKALQLVNKMVRKTIAPGARAWEALLLGSVEFSFA 482 Query: 1177 ETN 1185 ET+ Sbjct: 483 ETS 485 >ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phaseolus vulgaris] gi|561011455|gb|ESW10362.1| hypothetical protein PHAVU_009G202600g [Phaseolus vulgaris] Length = 513 Score = 419 bits (1076), Expect = e-114 Identities = 214/425 (50%), Positives = 293/425 (68%), Gaps = 28/425 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 LI+SF Y+CDPTP Y FL+ TLT SQF +PPVL +EHLEKF+ PE V+LIR Y Sbjct: 66 LIDSFKSYSCDPTPKAYYFLIKTLTCTSQFQDIPPVLDHLEHLEKFETPEFNLVYLIRFY 125 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G ++ + DAVD+F RIPRFRCTP+ SLN +LS+LC++RE L+MV ++LLKS MN+R++ Sbjct: 126 GLSDKVQDAVDLFLRIPRFRCTPTVCSLNLVLSLLCRKRECLKMVPEILLKSQHMNIRVE 185 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+F++LI ALC IK+V +AI +LN M + GY D T SLI+S+LC+ D++S E L Sbjct: 186 ESTFQVLIKALCRIKRVGYAIKMLNYMIEGGYGLDETMCSLIISSLCEQEDMTSVEALVI 245 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 +MRK GF P +DYTN+I FLVK K G+ + Sbjct: 246 WRDMRKLGFCPGIMDYTNMIRFLVKEGKGMDALDILNQQKKDGIKPDVVCYTMVLSGIIA 305 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 E+ K E++FDE+LV G+VPD+YTYNVY++GL KQNN++ LK++ M+EL CKPNVVT Sbjct: 306 EGEYVKLEELFDEILVFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECKPNVVT 365 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 N LLGALC G++R+A V++ M K GV+ +LH+YRI++DG + GE+ EAC LLEEM Sbjct: 366 CNILLGALCVAGDLRKARGVMKEMGWK-GVRLDLHSYRIMLDGLVGKGEIGEACFLLEEM 424 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL-SMFELSFK 1176 L SF PR+ TFD +I +C++GL EA+++ K+++ ++ PG+ AWEALL S +L F Sbjct: 425 LEKSFFPRSSTFDHIIFQMCQKGLIVEAIELTKKIVAKSFVPGARAWEALLKSGSKLGFS 484 Query: 1177 ETNSS 1191 ET S Sbjct: 485 ETTFS 489 >ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citrus clementina] gi|557531581|gb|ESR42764.1| hypothetical protein CICLE_v10013613mg [Citrus clementina] Length = 506 Score = 414 bits (1065), Expect = e-113 Identities = 213/424 (50%), Positives = 294/424 (69%), Gaps = 29/424 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L++SFSIY C+P P Y F++ TL +NSQF + VL IE E F+ PE F+ LI+TY Sbjct: 67 LLHSFSIYNCEPPPEAYHFVIKTLAENSQFCDISSVLDHIEKRENFETPEFIFIDLIKTY 126 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 A+ D+V++F++IP+FRC PS +SLNALLSVLC+ +E ++MV Q+LLKS MN+R++ Sbjct: 127 ADAHRFQDSVNLFYKIPKFRCVPSVYSLNALLSVLCRNKEWVKMVPQILLKSQLMNIRIE 186 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ESSFRILIS LC I +V FAI+ILN M + G+ D S ILS++C+ DLSS E+LGF Sbjct: 187 ESSFRILISTLCRINRVGFAIEILNCMINDGFCVDGKTCSWILSSVCEQRDLSSDELLGF 246 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 + EM+K GF VDYTNVI LVK +K +GV Sbjct: 247 VQEMKKLGFCFGMVDYTNVIRSLVKKEKVFDALGILNQMKSDGIKPDIVCYTMVLNGVIV 306 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 E++ KAE++FDE+LVLG+VPD+YTYNVY++GL KQNN+EAG+KM+ CM+ELG KP+V+T Sbjct: 307 QEDYVKAEELFDELLVLGLVPDVYTYNVYINGLCKQNNVEAGIKMIACMEELGSKPDVIT 366 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YNTLL ALC++ E+ + E+++ M+ KG V NL TY I+IDG S G+++EAC LLEE Sbjct: 367 YNTLLQALCKVRELNRLRELVKEMKWKGIVL-NLQTYSIMIDGLASKGDIIEACGLLEEA 425 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL--SMFELSF 1173 L+ ++ FDE IC LC+RGL +AL++LK+M ++++PG+ WEALL S+ +L F Sbjct: 426 LNKGLCTQSSMFDETICGLCQRGLVRKALELLKQMADKDVSPGARVWEALLLSSVSKLDF 485 Query: 1174 KETN 1185 T+ Sbjct: 486 VNTS 489 >ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phaseolus vulgaris] gi|561013301|gb|ESW12162.1| hypothetical protein PHAVU_008G089500g [Phaseolus vulgaris] Length = 514 Score = 411 bits (1057), Expect = e-112 Identities = 207/426 (48%), Positives = 290/426 (68%), Gaps = 29/426 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L+++F Y+CDPTP Y F++ TLT S +PPVL +E LE F+ PE V+LIR Y Sbjct: 66 LLDAFKAYSCDPTPKAYYFVIKTLTSTSHLQDIPPVLDHLEQLETFETPEFILVYLIRFY 125 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G ++ + DAVD+F RIPRFRCTP+ +SLN +LS+LC++RE L+MV ++LLKS MN+R++ Sbjct: 126 GLSDRVQDAVDLFLRIPRFRCTPTVWSLNLVLSLLCRKRECLKMVPEILLKSQHMNIRVE 185 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+F++LI ALC IK+V +AI +LN M + GY D T SLI+S+LC+ D++S E L Sbjct: 186 ESTFQVLIEALCRIKRVGYAIKMLNYMIEGGYGLDETICSLIISSLCEQEDMTSVEALVI 245 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 +MRK GF P +DYTN+I FLVK K G+ + Sbjct: 246 WRDMRKLGFCPGVMDYTNMIRFLVKEGKGTDALDILNQQKKDGIKPDVVCYTMVLSGIVA 305 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 E+ K E++FDE+LV G+VPD+YTYNVY++GL KQNN++ LK++ M+EL C+PNVVT Sbjct: 306 EGEYVKLEELFDEILVFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECRPNVVT 365 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 NTLLGALC G++R+A V++ M K GV NLH+YRI++DG + GE+ EAC LLEEM Sbjct: 366 CNTLLGALCVAGDLRKARGVMKEMGWK-GVGLNLHSYRIMLDGLVGKGEIGEACFLLEEM 424 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL--SMFELSF 1173 L F PR+ TFD +I +C++GL +EA+++ K+++ ++ PG+ AWEALL S +L F Sbjct: 425 LEKCFFPRSSTFDHIIFQMCQKGLIAEAIELTKKIVAKSFVPGARAWEALLLKSGSKLGF 484 Query: 1174 KETNSS 1191 ET S Sbjct: 485 SETTFS 490 >ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Glycine max] Length = 499 Score = 407 bits (1047), Expect = e-111 Identities = 202/411 (49%), Positives = 282/411 (68%), Gaps = 27/411 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L++SF Y+ DPTP Y F+L TLT SQ +PPVL+ +EHLEKF+ PE V+LIR Y Sbjct: 69 LLDSFKAYSIDPTPKAYFFVLKTLTSTSQLQDIPPVLYHLEHLEKFETPESILVYLIRFY 128 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G ++ + DAVD+FFRIPRFRCTP+ SLN +LS+LC++R+ L+MV ++LLKS MN+R++ Sbjct: 129 GLSDRVQDAVDLFFRIPRFRCTPTVCSLNLVLSLLCRKRDCLEMVPEILLKSQHMNIRVE 188 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+FR+LI ALC IK+V +AI +LN M + GY D SL++SALC+ DL+SAE L Sbjct: 189 ESTFRVLIRALCRIKRVGYAIKMLNFMVEDGYGLDEKICSLVISALCEQKDLTSAEALVV 248 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 +MRK GF P +DYTN+I FLVK + G+ + Sbjct: 249 WRDMRKLGFCPGVMDYTNMIRFLVKEGRGMDALDILNQQKQDGIKLDVVSYTMVLSGIVA 308 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 E+ +++FDEMLV+G++PD YTYNVY++GL KQNN+ L+++ M+ELGCKPNVVT Sbjct: 309 EGEYVMLDELFDEMLVIGLIPDAYTYNVYINGLCKQNNVAEALQIVASMEELGCKPNVVT 368 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YNTLLGAL G+ +A E+++ M K GV NLHTYRI++DG + GE+ E+C LLEEM Sbjct: 369 YNTLLGALSVAGDFVKARELMKEMGWK-GVGLNLHTYRIVLDGLVGKGEIGESCLLLEEM 427 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL 1152 L PR+ TFD +I +C++ L +EA+++ K+++ ++ PG+S WEALL Sbjct: 428 LEKCLFPRSSTFDNIIFQMCQKDLFTEAMELTKKVVAKSFLPGASTWEALL 478 >ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] gi|508715129|gb|EOY07026.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma cacao] Length = 542 Score = 400 bits (1029), Expect = e-109 Identities = 198/424 (46%), Positives = 284/424 (66%), Gaps = 27/424 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L+ SFS+Y PTP Y FL+ TL QN F+ +P VLH +EH+EKF PE F LI TY Sbjct: 105 LVRSFSLYNVHPTPQAYHFLIKTLIQNLHFNHIPSVLHHLEHVEKFQTPEYIFADLITTY 164 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G AN + DAVDIF+RIP+FRC PSA+SLN+LL++LC+ + L++V QVLLKS MN+R++ Sbjct: 165 GIANRIQDAVDIFYRIPKFRCVPSAYSLNSLLALLCRNQYSLKLVPQVLLKSLLMNIRVE 224 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+ RIL+SALC + KV++AIDIL M D G + S ILS++C +DL +V+G Sbjct: 225 ESTLRILVSALCRMNKVSYAIDILQRMIDEGLGVNDKVCSFILSSICAKADLDGEDVMGL 284 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 E+ K GF P DY +I FLVK + +GV + Sbjct: 285 WRELGKLGFCPAMSDYNCLIRFLVKKGRGLDALDFLNQMKSVGIKPGIVSYTMALNGVIA 344 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 ++ A+++FDE+L+LG+VPD+YTYN Y+ L KQN +E G+KM+ CM+EL CKPNV+T Sbjct: 345 EGDYMLADELFDELLMLGLVPDVYTYNAYIDALCKQNKVEEGIKMVACMEELRCKPNVLT 404 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YN LL A+C++GE+ +A E+++ M+ K G++ NL +Y ++IDG +S GE++EA L+EE+ Sbjct: 405 YNMLLEAICKVGEISRAMELVKEMKYK-GIEMNLVSYTVIIDGLVSKGEILEAHGLVEEV 463 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALLSMFELSFKE 1179 LH F ++ FDE+IC LC+RGL EAL++L++M+ +N++PG+ WEALL E Sbjct: 464 LHKCFCHQSLAFDEVICGLCQRGLVCEALELLRKMVAKNVSPGARGWEALLLSSESKINF 523 Query: 1180 TNSS 1191 N++ Sbjct: 524 ANTT 527 >ref|XP_003623530.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] gi|355498545|gb|AES79748.1| Pentatricopeptide repeat-containing protein [Medicago truncatula] Length = 653 Score = 399 bits (1024), Expect = e-108 Identities = 205/425 (48%), Positives = 287/425 (67%), Gaps = 31/425 (7%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQ--NSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIR 174 LI+SF Y DP+P Y FL+ T+T S ++P +L+ +EH EKF+ PE F++LIR Sbjct: 61 LIHSFKAYHTDPSPKAYFFLIKTITNINTSHLHEIPHILNHLEHNEKFETPEFIFMYLIR 120 Query: 175 TYGRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVR 354 YG + + DAVD+FFRIPRFRCTP+ SLN LLS+LC +RE L+MV +LLKS +M +R Sbjct: 121 FYGFNDRVQDAVDLFFRIPRFRCTPTVCSLNLLLSLLCGKRECLRMVPDILLKSRDMKIR 180 Query: 355 LDESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVL 534 L+ESSF +LI ALC IK+V +AI ++N M + GY D SLI+S+LC+ +DL+S E L Sbjct: 181 LEESSFWVLIKALCRIKRVDYAIKMMNCMVEDGYCLDDKICSLIISSLCEQNDLTSVEAL 240 Query: 535 GFLGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGV 633 G MRK GF P +D TN+I FLVK K G+ Sbjct: 241 VVWGNMRKLGFCPGVMDCTNMIRFLVKEGKGMDALEILNQLKEDGIKPDIVCYTIVLSGI 300 Query: 634 TSAEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNV 813 ++ K +++FDE+LVLG+VPD+YTYNVY++GL KQNN + LK++ M++LGCKPNV Sbjct: 301 VKEGDYVKLDELFDEILVLGLVPDVYTYNVYINGLCKQNNFDEALKIVVSMEKLGCKPNV 360 Query: 814 VTYNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLE 993 VTYNTLLGALC G++ +A+ V++ MR K GV+ NLHTYRI++DG + GE+ EAC LLE Sbjct: 361 VTYNTLLGALCMSGDLGKAKRVMKEMRLK-GVELNLHTYRIMLDGLVGKGEIGEACVLLE 419 Query: 994 EMLHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL--SMFEL 1167 EML F PR+ TFD ++ +C++GL S+AL ++ +++ ++ PG+ WEALL S ++ Sbjct: 420 EMLEKCFYPRSSTFDSIVHQMCQKGLISDALVLMNKIVAKSFDPGAKVWEALLLNSESKV 479 Query: 1168 SFKET 1182 ++ ET Sbjct: 480 TYSET 484 >ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Fragaria vesca subsp. vesca] Length = 491 Score = 397 bits (1020), Expect = e-108 Identities = 203/411 (49%), Positives = 273/411 (66%), Gaps = 27/411 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 LI+SF+ + CDPTP Y+F+L TL + SQ +P VL R+E +EKF PE F +LIR Y Sbjct: 58 LIHSFNTFNCDPTPEAYNFVLKTLFKTSQLSHIPSVLDRLESIEKFHPPESIFANLIRFY 117 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G AN + DA+D+F RIP+FRC PSA SLN+LL VLC EGL+MV QVL+ S M +RL+ Sbjct: 118 GSANRVEDAIDVFCRIPKFRCDPSAVSLNSLLYVLCGSSEGLKMVPQVLMNSRAMGIRLE 177 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ESSFRILISALC I V +AI+I+ M GYD D SL+LS+LC+ + EV+GF Sbjct: 178 ESSFRILISALCRIGSVGYAIEIMKCMISNGYDLDVKICSLVLSSLCEQKGVGGLEVVGF 237 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADKD---------------------------GVTS 639 + EM+K GF P +DY+NVI LVK K GV + Sbjct: 238 VEEMKKVGFCPGMLDYSNVIRCLVKQGKGLDALRVLCKMKVEGMKPDIVCYTMVLYGVIA 297 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 +++ A+ VFDE+LVLG+VPD+YTYNVY++GL QNN+EAG+KM+TCMDELGC+PN++T Sbjct: 298 NGDYKNADKVFDELLVLGLVPDVYTYNVYINGLCNQNNVEAGIKMITCMDELGCRPNLIT 357 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YN LL ALC+ E+ +A E++ M + GV NL T+ I++DG G+V EAC +EEM Sbjct: 358 YNLLLKALCKNEELSRARELVSEM-TLNGVGVNLQTHIIMLDGLFCKGDVDEACIFMEEM 416 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL 1152 L R +D++I LC+RGL +A+ +L +M+ +N+ PG+ AWEALL Sbjct: 417 LDKFMCRRCSAYDDVIYGLCQRGLVCKAMDLLLKMVDKNVVPGARAWEALL 467 >ref|XP_002309173.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] gi|550335936|gb|EEE92696.2| pentatricopeptide repeat-containing family protein [Populus trichocarpa] Length = 490 Score = 386 bits (992), Expect = e-104 Identities = 188/395 (47%), Positives = 272/395 (68%), Gaps = 27/395 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 LI+SFSIY +P P + F+ TL + SQF +P VL +E +E F+ PE F +LI Y Sbjct: 64 LIHSFSIYDVEPAPKAFDFIFKTLVKTSQFHHIPSVLDHLEKVESFEPPESTFAYLIEVY 123 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 GR N H+A+++F+RIP+FRC PS +SLN L+SVLC+ +GL++V ++LLKS MN+R++ Sbjct: 124 GRTNKTHEAIELFYRIPKFRCVPSVYSLNTLISVLCRNSKGLKLVPEILLKSQVMNIRVE 183 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+F++LI+ALC I+KV FAI++LN M + G+ ++ YSL+LS LC+ D + EV+GF Sbjct: 184 ESTFQVLITALCRIRKVGFAIEMLNCMVNDGFIVNAEIYSLLLSCLCEQKDATKFEVIGF 243 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 L ++RK GF P VDY+NVI FLVK + GV Sbjct: 244 LEQLRKLGFFPGMVDYSNVIRFLVKGKRGLDALHVLNHMKSDRIKPDIFCYTMVLHGVIE 303 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 +++ KA+++FDE+LV G+VPD YTYNVY++GL KQNN++AG+KM+ M+ELGCKPN++T Sbjct: 304 DKDYLKADELFDELLVFGLVPDAYTYNVYINGLCKQNNVQAGIKMVASMEELGCKPNLIT 363 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YN L+ LC++GE+ +A E++R M K G+ N+ TYRI+IDG SNG+++EAC L EE Sbjct: 364 YNMLVKQLCKVGELSKAGELVREMGLK-GIGLNMQTYRIMIDGLASNGKIVEACGLFEEA 422 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEM 1104 L ++ FDE+IC LC R L+ +ALK+L++M Sbjct: 423 LDKGLCTQSLMFDEIICGLCHRDLSCKALKLLEKM 457 >gb|AHB18409.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum] Length = 480 Score = 382 bits (981), Expect = e-103 Identities = 187/411 (45%), Positives = 275/411 (66%), Gaps = 27/411 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L+ S S+Y +P Y FL+ TL N QF +P +LH ++ L+ F PE F HL++ Y Sbjct: 57 LLQSLSLYNLHQSPQAYHFLIKTLLHNRQFHHIPSLLHHLQ-LQHFQTPEYIFTHLVKFY 115 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G+AN + DAVDIF+RIP+FRC PSA+SLNALL++LC+ + GL+++ QVLL S MN+RL+ Sbjct: 116 GKANRIQDAVDIFYRIPQFRCFPSAYSLNALLALLCRSQRGLKLLPQVLLNSLHMNIRLE 175 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+FR+L+ LC + KVA+AI+IL M D G + +S +LS++C DL +V+GF Sbjct: 176 ESTFRLLVCTLCRMNKVAYAIEILQRMLDDGLGVNDKVFSFVLSSVCAEGDLDGEDVIGF 235 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 +RK GFSP DY V+ FLVK + +GVT+ Sbjct: 236 WRGLRKLGFSPAMGDYDGVVRFLVKKGRGLDAWDVLNQMKSDGIMPGIISYTMVLNGVTA 295 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 ++ A+++FDE+L+LG+VP++YTY Y+ L KQN +E G+KM+ CM+ELGCKPNV+ Sbjct: 296 EGDYILADELFDELLMLGLVPNVYTYKAYIDALCKQNKVEEGIKMVACMEELGCKPNVLI 355 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YNTLL + + GE+ +A E+++ M+ K G++ N +Y I+IDG +SNGE++EAC L+EE+ Sbjct: 356 YNTLLRTISKAGEISRARELVKEMKYK-GIEMNWVSYTIIIDGLVSNGEILEACALVEEV 414 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL 1152 LH ++ TFDE+IC LC+RGL +A ++L +M+ R+I+PG+ WEALL Sbjct: 415 LHKCIFIKSLTFDEVICGLCQRGLVCKARELLGKMVERSISPGARVWEALL 465 >gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis] Length = 494 Score = 378 bits (971), Expect = e-102 Identities = 199/414 (48%), Positives = 276/414 (66%), Gaps = 30/414 (7%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L+NSF+ Y C+PTP Y F+L TL + SQFD + VL RIE +EKF+ PE FF +I Y Sbjct: 56 LLNSFNSYDCNPTPEAYHFVLKTLIKTSQFDHIHSVLDRIEFVEKFETPEYFFAQIIGFY 115 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G + + DA+DIF+RIP+FRC PS++SLN+LL VLC++ EGL+ V +VL+KS +MN+RL+ Sbjct: 116 GFLDRIEDAIDIFWRIPKFRCVPSSYSLNSLLYVLCRRNEGLRFVPEVLIKSRDMNIRLE 175 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALC---KISDLSSAEV 531 E+SFRILI+ALC I KV +AI+IL+ M GYD D+ SLILS LC K DL+ +V Sbjct: 176 EASFRILITALCKIGKVGYAIEILDCMISDGYDIDARICSLILSFLCGKNKELDLAGFDV 235 Query: 532 LGFLGEMRKAGFSPDGVDYTNVIAFLV---------------KAD------------KDG 630 L L +M K GF P DY+ VI LV KAD G Sbjct: 236 LELLQKMEKMGFCPRMGDYSKVIRILVREKRGLEALDILGQMKADGMKPDVVCYTMVLHG 295 Query: 631 VTSAEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPN 810 + + E+ KA+++FDEMLVLG+VPD+YTYN Y++GL KQN+++ L + M+ELGCKPN Sbjct: 296 IVAEGEYSKADEMFDEMLVLGLVPDVYTYNAYINGLCKQNDVDGALDTILRMEELGCKPN 355 Query: 811 VVTYNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLL 990 ++TYN +L ALC+ GE +A+E++ M K G + L TY I++D + GE++EAC L+ Sbjct: 356 LITYNLILRALCKNGEFGRAKELVAEMSLK-GFEDYLQTYIIMLDVLLGKGEIVEACGLM 414 Query: 991 EEMLHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL 1152 EEML R +DE+I LC+RGL +A ++L +M+ +N+APG+ AW+ALL Sbjct: 415 EEMLDKLLCRRCSMYDEIIFGLCRRGLDCKASEMLGKMVGKNVAPGARAWDALL 468 >ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Cucumis sativus] gi|449483740|ref|XP_004156675.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Cucumis sativus] Length = 491 Score = 373 bits (957), Expect = e-100 Identities = 194/423 (45%), Positives = 270/423 (63%), Gaps = 27/423 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L+ SF+ Y+C PTP+ Y F+L TL + SQF +PPVLHR++ LE F PE FV LI+ Y Sbjct: 59 LVTSFTAYSCHPTPNAYYFVLKTLARTSQFHHIPPVLHRLQFLENFQTPEYIFVDLIKLY 118 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 GR N + DAV +F RIP FRC PS SLN+LLS L + +GL ++ ++L SH M +RL+ Sbjct: 119 GRMNRIQDAVTLFRRIPMFRCVPSTLSLNSLLSQLSRNAQGLPIIPDIILNSHSMGIRLE 178 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 S+F+ILI+ALC + KV A+++ N M GY + SLIL++LC+ S VLGF Sbjct: 179 HSTFQILITALCKVNKVGHAMELFNYMITEGYGLNPQICSLILASLCQQKKSSGDVVLGF 238 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLV---------------KAD------------KDGVTS 639 L EMR+ GF P VDY+NVI F V KAD +GV + Sbjct: 239 LEEMRQKGFCPAVVDYSNVIKFFVTRGMGSDAVDLLNKMKADGFKPDIVCYTMVLNGVIA 298 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 +++ A+++FDE+L+ G+VPDIYTYNVY+ GL KQ + AGL+M+ M+ LGC+PNV+T Sbjct: 299 DGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGDSVAGLQMIPHMEALGCQPNVIT 358 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YN +L +LC+ GE+ +A ++ +M+ K G+ NL T+RI+IDG NGEV+EAC LLEEM Sbjct: 359 YNVILKSLCKTGELDEARKLRSKMQLK-GLAENLRTFRIMIDGLFHNGEVIEACVLLEEM 417 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALLSMFELSFKE 1179 L F P+ TF E++ LCKR + +A+++L M+ +N +PG AWE LL E Sbjct: 418 LGSRFPPQISTFSEILSWLCKRHMVGKAVELLALMVGKNFSPGPKAWEILLLSSESELTS 477 Query: 1180 TNS 1188 S Sbjct: 478 VKS 480 >ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420, mitochondrial-like [Solanum lycopersicum] Length = 496 Score = 343 bits (881), Expect = 1e-91 Identities = 184/414 (44%), Positives = 260/414 (62%), Gaps = 28/414 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQN-SQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRT 177 L++SFS Y CDPTP+ Y F+L TLTQN S +D++P +L I E F+ PE F +LI+ Sbjct: 78 LLDSFSAYECDPTPNAYYFILKTLTQNPSTWDEIPLILDYIRKFENFETPEYIFTYLIKF 137 Query: 178 YGRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRL 357 YG +NM H A ++FF +P +RC PS SLN L+ VLCK L++V QVL+KS +N+ + Sbjct: 138 YGDSNMTHLAYEMFFTMPAYRCNPSVKSLNCLIWVLCKNNYDLRIVLQVLVKSQLLNIWV 197 Query: 358 DESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLG 537 +ES+F+ILI ALC I K A+D+L LM D G++ D+ SLILS + + D E+ G Sbjct: 198 EESTFKILIRALCRIGKTNNAVDLLKLMVDSGFNLDANICSLILSTMPDVKDCVGVEIWG 257 Query: 538 FLGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVT 636 L EMRK G+SP VD NVI F V K +G+ Sbjct: 258 VLEEMRKLGYSPKRVDLCNVIRFYVNNGKGIDALEVLNKMKMCGMVPDVVCYNLVLNGLI 317 Query: 637 SAEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVV 816 E+ A+++FDE+LVLG+ PDI TYNVY++GL KQ+ + L++L CM++LGCKP + Sbjct: 318 FEGEYSNADELFDELLVLGLNPDIVTYNVYINGLCKQDKMVEALRVLGCMEDLGCKPEMN 377 Query: 817 TYNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEE 996 TY+T+L LCR G + +EVL +M+SK G+Q + H Y ++I+ I NGEV EA LL E Sbjct: 378 TYHTILDGLCRCGMLSSVKEVLGQMKSK-GLQLSSHIYGVIINCMIRNGEVDEAYNLLHE 436 Query: 997 MLHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALLSM 1158 M+ M FVP++ TFD +I LC +G E +++L M T+N+ PG +WEA + + Sbjct: 437 MVDMGFVPQSITFDGLIGLLCNKGSFYEVMELLSIMSTKNLVPGIRSWEAFVQV 490 >ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum] gi|557112223|gb|ESQ52507.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum] Length = 456 Score = 332 bits (850), Expect = 4e-88 Identities = 173/396 (43%), Positives = 259/396 (65%), Gaps = 27/396 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 LI+SF ++ C+PTP Y F++ TL + SQ + + VL+ IE EKFD PE F +I Y Sbjct: 62 LISSFRLHNCEPTPQAYKFVIKTLAKTSQLENIASVLNHIEISEKFDTPESIFRDVIFAY 121 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G + + +A+D+FF+IP FRC PSA++LNALLSVL ++R+GL+MV +VLLK+ ++ VRL+ Sbjct: 122 GFSGRIEEAIDVFFKIPNFRCVPSAYTLNALLSVLVRKRQGLKMVPEVLLKASKLGVRLE 181 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+ ILI ALC I +V A D++ MSD Y D YSL+LS++CK D S +V+G+ Sbjct: 182 ESTLGILIDALCRIGEVDCATDLVKDMSDDCYIVDPRLYSLLLSSVCKHKDSSCFDVIGY 241 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 L +RK FSPD DYT V+ FLV+ + GV + Sbjct: 242 LEGLRKTRFSPDLRDYTAVMRFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIILQGVIA 301 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 E+++KA+ +FDE+L+LG+VPD+YTYNVY++GL KQ++IE G+KM++CM++LG +PNVVT Sbjct: 302 DEDYKKADKLFDELLLLGLVPDVYTYNVYINGLCKQSDIECGIKMMSCMEKLGSEPNVVT 361 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YN L+ AL + G++ +A+ + M + GV N H+Y I+++ I EV+ A LLEE Sbjct: 362 YNILIKALVKAGDMSRAKIIWEEMET-NGVDRNSHSYDIMVNASIEADEVVCAHGLLEEA 420 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMI 1107 S V ++ +E+IC LC +GL +A+++L ++ Sbjct: 421 FSRSLVVKSSRTEEVICRLCDKGLMDKAVELLVHLV 456 >ref|XP_002533822.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] gi|223526239|gb|EEF28557.1| pentatricopeptide repeat-containing protein, putative [Ricinus communis] Length = 373 Score = 325 bits (832), Expect = 5e-86 Identities = 162/355 (45%), Positives = 237/355 (66%), Gaps = 27/355 (7%) Frame = +1 Query: 196 LHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLDESSFR 375 + +A+ +F+R P FRC PS + LN LLSVLC+ EGL V +VLLKS +MN+R++ESSFR Sbjct: 1 MQNAIHLFYRTPNFRCVPSVYLLNTLLSVLCRTNEGLNFVPEVLLKSQDMNIRMEESSFR 60 Query: 376 ILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGFLGEMR 555 +LI+ALC+I KV +A+++ N M + G+ DS SL+LS+LC +D+SS+EV+ FLGE+R Sbjct: 61 LLINALCSINKVGYAVEMFNCMINDGFSVDSKICSLLLSSLCYQADISSSEVMRFLGELR 120 Query: 556 KAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTSAEEFQ 654 K GF P DY+ VI FLV+ +GV + + Sbjct: 121 KFGFCPGIKDYSKVINFLVRRGMGMEALNVLNQMKLDGIKPDIVCYTTVLNGVIANGVYS 180 Query: 655 KAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVTYNTLL 834 KA+++FDE+LV G+VPD+YTYNVY+ GL KQNN+EAG++M+T M+ELGCKPN++TYN LL Sbjct: 181 KADELFDELLVFGLVPDVYTYNVYIYGLCKQNNVEAGIEMVTSMEELGCKPNLITYNILL 240 Query: 835 GALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEMLHMSF 1014 LC+ GE +A +++R M SK G+ + TY+++I G S G++++AC LLEE L Sbjct: 241 EDLCKNGEDSRARDLVRDMGSK-GIGLGMQTYKVMIHGLTSGGKIVKACSLLEEALDKGL 299 Query: 1015 VPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALLSMFELSFKE 1179 PR FDE+I LC+ G +AL++L++++ +N++PG WE LL ++F E Sbjct: 300 CPRGLRFDEVIYGLCQTGSICKALELLEKVVNKNVSPGVRVWETLLLKSNINFVE 354 >ref|NP_181376.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] gi|218546769|sp|Q8L6Y7.2|PP193_ARATH RecName: Full=Pentatricopeptide repeat-containing protein At2g38420, mitochondrial; Flags: Precursor gi|3395430|gb|AAC28762.1| hypothetical protein [Arabidopsis thaliana] gi|330254441|gb|AEC09535.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana] Length = 453 Score = 311 bits (797), Expect = 5e-82 Identities = 160/396 (40%), Positives = 253/396 (63%), Gaps = 27/396 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L++SF ++ C+PTP Y F++ TL ++SQ + + VL+ +E EKFD PE F +I Y Sbjct: 59 LLSSFQLHNCEPTPQAYRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAY 118 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G + + +A+++FF+IP FRC PSA++LNALL VL ++R+ L++V ++L+K+ M VRL+ Sbjct: 119 GFSGRIEEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMGVRLE 178 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+F ILI ALC I +V A +++ MS D YS +LS++CK D S +V+G+ Sbjct: 179 ESTFGILIDALCRIGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGY 238 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 L ++RK FSP DYT V+ FLV+ + GV + Sbjct: 239 LEDLRKTRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIA 298 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 E++ KA+ +FDE+L+LG+ PD+YTYNVY++GL KQN+IE LKM++ M++LG +PNVVT Sbjct: 299 DEDYPKADKLFDELLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVT 358 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YN L+ AL + G++ +A+ + + M + GV N HT+ I+I +I EV+ A LLEE Sbjct: 359 YNILIKALVKAGDLSRAKTLWKEMET-NGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEA 417 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMI 1107 +M+ ++ +E+I LC++GL +A+++L ++ Sbjct: 418 FNMNVFVKSSRIEEVISRLCEKGLMDQAVELLAHLV 453 >gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|31376375|gb|AAP49514.1| At2g38420 [Arabidopsis thaliana] Length = 444 Score = 311 bits (797), Expect = 5e-82 Identities = 160/396 (40%), Positives = 253/396 (63%), Gaps = 27/396 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L++SF ++ C+PTP Y F++ TL ++SQ + + VL+ +E EKFD PE F +I Y Sbjct: 50 LLSSFQLHNCEPTPQAYRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAY 109 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G + + +A+++FF+IP FRC PSA++LNALL VL ++R+ L++V ++L+K+ M VRL+ Sbjct: 110 GFSGRIEEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMGVRLE 169 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+F ILI ALC I +V A +++ MS D YS +LS++CK D S +V+G+ Sbjct: 170 ESTFGILIDALCRIGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGY 229 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 L ++RK FSP DYT V+ FLV+ + GV + Sbjct: 230 LEDLRKTRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIA 289 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 E++ KA+ +FDE+L+LG+ PD+YTYNVY++GL KQN+IE LKM++ M++LG +PNVVT Sbjct: 290 DEDYPKADKLFDELLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVT 349 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YN L+ AL + G++ +A+ + + M + GV N HT+ I+I +I EV+ A LLEE Sbjct: 350 YNILIKALVKAGDLSRAKTLWKEMET-NGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEA 408 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMI 1107 +M+ ++ +E+I LC++GL +A+++L ++ Sbjct: 409 FNMNVFVKSSRIEEVISRLCEKGLMDQAVELLAHLV 444 >ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Capsella rubella] gi|482562854|gb|EOA27044.1| hypothetical protein CARUB_v10023139mg [Capsella rubella] Length = 470 Score = 311 bits (796), Expect = 7e-82 Identities = 164/396 (41%), Positives = 246/396 (62%), Gaps = 27/396 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L++SF ++ C+PTP Y F++ TL + SQ + + VL +E EKFD PE F +I Y Sbjct: 76 LVSSFRLHNCEPTPQAYRFVIKTLAKTSQLENIASVLSHLEVSEKFDTPESIFRDVIAAY 135 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G A + +A+D+FF+IP FRC PSA++LNALL VL ++RE L++V ++L+K+ M VRL+ Sbjct: 136 GFAGRIGEAIDVFFKIPNFRCVPSAYTLNALLLVLVRKRESLELVPEILVKASRMGVRLE 195 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+F ILI ALC I +V A +++ MS D YS +LS++CK D S +V+G+ Sbjct: 196 ESTFGILIDALCKIGEVDCATELVRYMSIDCVIVDPRLYSQLLSSVCKHKDSSCFDVVGY 255 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639 L ++RK FSP DYT V++FLV+ + GV + Sbjct: 256 LEDLRKTRFSPGLRDYTVVMSFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIVLQGVIA 315 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 E+ KA+ FDE+L+LG+ PD+YTYNVY++GL KQN+IE LKM++ M++LG +PNV+T Sbjct: 316 DAEYSKADKFFDELLLLGLAPDVYTYNVYMNGLCKQNDIEGALKMMSSMNKLGSEPNVIT 375 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YN L+ AL G++ QA+ + M GV N HTY I+I FI G+V+ A LEE Sbjct: 376 YNILIKALVNAGDLSQAKTLWEEM-GINGVNRNSHTYDIMISAFIEVGDVVSAQGFLEEA 434 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMI 1107 +M+ ++ +E+I LC +GL +A+++L ++ Sbjct: 435 FNMNVFAKSSRTEEVISRLCDKGLMDKAVELLAHLV 470 >ref|XP_002879744.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] gi|297325583|gb|EFH56003.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata subsp. lyrata] Length = 444 Score = 309 bits (792), Expect = 2e-81 Identities = 159/396 (40%), Positives = 251/396 (63%), Gaps = 27/396 (6%) Frame = +1 Query: 1 LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180 L++SF ++ C+PTP Y F++ TL + SQ + + VL +E EKFD PE F +I Y Sbjct: 50 LVSSFQLHNCEPTPQAYRFVIETLAKTSQLENIASVLDHLEVSEKFDTPESIFRDVIAAY 109 Query: 181 GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360 G + + +A+D+FF+IP FRC PSA++LNALL VL ++R+ L++V ++L+K+ M VRL+ Sbjct: 110 GFSGRIEEAIDVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKASRMGVRLE 169 Query: 361 ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540 ES+F ILI+ALC I +V A +++ MS+ D YSL+LS++CK D S +V+G+ Sbjct: 170 ESTFGILINALCRIGEVDCATELVRYMSEDSVIVDPRLYSLLLSSVCKHKDSSCFDVIGY 229 Query: 541 LGEMRKAGFSPDGVDYTNVIAFLVKADKD---------------------------GVTS 639 L ++RK F P DYT V+ FLV+ + GV + Sbjct: 230 LEDLRKTRFLPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRIDPDVVCYTIVLLGVIA 289 Query: 640 AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819 E++ KA+ +FDE+L+LG+ PD+YTYNVY++GL KQN+IE +KM++ M++LG +PNVVT Sbjct: 290 DEDYPKADKLFDELLLLGLDPDVYTYNVYINGLCKQNDIEGAIKMMSSMNKLGSEPNVVT 349 Query: 820 YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999 YN ++ L + G++ +A+ + + M GV N HTY I+I +I EV+ A LLEE Sbjct: 350 YNIVIKGLVKAGDLSRAKTLWKEM-EMNGVNRNSHTYDIMISAYIEVDEVVCAQGLLEEA 408 Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMI 1107 +M+ ++ +E+I LC++GL +A+++L ++ Sbjct: 409 FNMNLFVKSSKIEEVISRLCEKGLMDKAVELLAHLV 444 >ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [Amborella trichopoda] gi|548857785|gb|ERN15583.1| hypothetical protein AMTR_s00048p00149840 [Amborella trichopoda] Length = 464 Score = 274 bits (701), Expect = 7e-71 Identities = 153/374 (40%), Positives = 223/374 (59%), Gaps = 27/374 (7%) Frame = +1 Query: 70 LTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTYGRANMLHDAVDIFFRIPRFRCTP 249 L QN QF L +L ++ KF PE + LI++ + M+ +A+D+FF +P RC P Sbjct: 91 LAQNPQFSGLKTLLRCLQSNRKFSTPETRIIGLIQSCASSKMVKEALDLFFAMPHLRCQP 150 Query: 250 SAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLDESSFRILISALCNIKKVAFAIDI 429 S SLNALLSVLC + +V ++L+K+ EMN+RLD SSFRILI +LC I K+ FAI++ Sbjct: 151 STTSLNALLSVLC-DTDSFHLVPELLIKTLEMNIRLDASSFRILIGSLCRIGKLGFAIEL 209 Query: 430 LNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGFLGEMRKAGFSPDGVDYTNVIAFL 609 L LM D G PDS FY+ IL LC+ + S E+ GFL EM+ AGF PD + Y VI L Sbjct: 210 LRLMPDQGCWPDSGFYAEILCKLCEFGEFS--EIYGFLDEMKDAGFFPDKIAYAIVIDSL 267 Query: 610 VKADK---------------------------DGVTSAEEFQKAEDVFDEMLVLGVVPDI 708 K + DG EF++A +VFDEML +G+VPD+ Sbjct: 268 AKGGRLNEARAILNRMKLEGAKPDTITYTSMMDGFYKIGEFKQAGEVFDEMLAMGLVPDV 327 Query: 709 YTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVTYNTLLGALCRLGEVRQAEEVLRR 888 +TY+VY++GL ++ +E ++L M E+GC+PNV+TYNTL+ C G +R+A+E++ Sbjct: 328 FTYSVYINGLCRERKLEEAKEVLCVMREMGCRPNVITYNTLIRTFCSDGNLRRADELVAE 387 Query: 889 MRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEMLHMSFVPRAQTFDEMICALCKRG 1068 M S GV GN TYR LI+ ++ G V+EA LL +M+ F P T++ ++ + + Sbjct: 388 MGS-NGVCGNSVTYRTLINAYLREGMVVEANELLVQMVGKGFFPHFSTWEALLSSTVFKW 446 Query: 1069 LASEALKVLKEMIT 1110 +AL L+E+I+ Sbjct: 447 DILQALNALEELIS 460