BLASTX nr result

ID: Sinomenium21_contig00020099 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00020099
         (1615 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containi...   424   e-116
ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phas...   419   e-114
ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citr...   414   e-113
ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phas...   411   e-112
ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containi...   407   e-111
ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein...   400   e-109
ref|XP_003623530.1| Pentatricopeptide repeat-containing protein ...   399   e-108
ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containi...   397   e-108
ref|XP_002309173.2| pentatricopeptide repeat-containing family p...   386   e-104
gb|AHB18409.1| pentatricopeptide repeat-containing protein [Goss...   382   e-103
gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]     378   e-102
ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containi...   373   e-100
ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containi...   343   1e-91
ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutr...   332   4e-88
ref|XP_002533822.1| pentatricopeptide repeat-containing protein,...   325   5e-86
ref|NP_181376.3| pentatricopeptide repeat-containing protein [Ar...   311   5e-82
gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|3137637...   311   5e-82
ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Caps...   311   7e-82
ref|XP_002879744.1| pentatricopeptide repeat-containing protein ...   309   2e-81
ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [A...   274   7e-71

>ref|XP_002265961.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Vitis vinifera]
          Length = 505

 Score =  424 bits (1089), Expect = e-116
 Identities = 225/423 (53%), Positives = 288/423 (68%), Gaps = 28/423 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            LI+SF IY  DPTP+ Y F+++TLT+  QF  LPP+LHR+E +EKF+ PE  F +LI+ Y
Sbjct: 64   LIDSFRIYNSDPTPNAYRFVISTLTRCRQFHHLPPLLHRLEKVEKFETPEFIFTNLIKVY 123

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G ANM  DAVD+FFRIP FRC PS +SLNALL VLCK+REGL MV Q+LLKS  MN+RL+
Sbjct: 124  GNANMFEDAVDLFFRIPNFRCVPSVYSLNALLYVLCKRREGLVMVPQILLKSQAMNIRLE 183

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ESSFRIL++ALC IKK  +AI ILN M + GY  D+   S+ILS+LC+   LS  EVL F
Sbjct: 184  ESSFRILVAALCRIKKHNYAIRILNYMLNDGYAVDAKMCSIILSSLCEQKGLSGDEVLRF 243

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
            + EMRK GF P  VD  NVI FLVK                              +GVT+
Sbjct: 244  MEEMRKLGFYPGRVDCNNVIRFLVKEGMVMDALGVFDQMKTDGIKPDTVSYTMILNGVTA 303

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
              +++KA+D+FDEMLVLGVVPDI+ YNVY++ L KQNNIE G++ML  M ELGCKP+ VT
Sbjct: 304  DGDYEKADDLFDEMLVLGVVPDIHAYNVYINSLCKQNNIEEGVRMLASMRELGCKPDYVT 363

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YN LL  + ++ ++    E+ R M  + GVQ N  TYRI++DG +  GE+ E+C LLEEM
Sbjct: 364  YNMLLEGMSKVRDLGGMRELAREMELE-GVQWNWETYRIMLDGLVGKGEIDESCSLLEEM 422

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEA-LLSMFELSFK 1176
            L   F     TFDE+IC LC+RGL  +AL+++ +M+ + IAPG+ AWEA LL   E SF 
Sbjct: 423  LDKYFSCWCSTFDEIICELCQRGLVCKALQLVNKMVRKTIAPGARAWEALLLGSVEFSFA 482

Query: 1177 ETN 1185
            ET+
Sbjct: 483  ETS 485


>ref|XP_007138368.1| hypothetical protein PHAVU_009G202600g [Phaseolus vulgaris]
            gi|561011455|gb|ESW10362.1| hypothetical protein
            PHAVU_009G202600g [Phaseolus vulgaris]
          Length = 513

 Score =  419 bits (1076), Expect = e-114
 Identities = 214/425 (50%), Positives = 293/425 (68%), Gaps = 28/425 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            LI+SF  Y+CDPTP  Y FL+ TLT  SQF  +PPVL  +EHLEKF+ PE   V+LIR Y
Sbjct: 66   LIDSFKSYSCDPTPKAYYFLIKTLTCTSQFQDIPPVLDHLEHLEKFETPEFNLVYLIRFY 125

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G ++ + DAVD+F RIPRFRCTP+  SLN +LS+LC++RE L+MV ++LLKS  MN+R++
Sbjct: 126  GLSDKVQDAVDLFLRIPRFRCTPTVCSLNLVLSLLCRKRECLKMVPEILLKSQHMNIRVE 185

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+F++LI ALC IK+V +AI +LN M + GY  D T  SLI+S+LC+  D++S E L  
Sbjct: 186  ESTFQVLIKALCRIKRVGYAIKMLNYMIEGGYGLDETMCSLIISSLCEQEDMTSVEALVI 245

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
              +MRK GF P  +DYTN+I FLVK  K                            G+ +
Sbjct: 246  WRDMRKLGFCPGIMDYTNMIRFLVKEGKGMDALDILNQQKKDGIKPDVVCYTMVLSGIIA 305

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
              E+ K E++FDE+LV G+VPD+YTYNVY++GL KQNN++  LK++  M+EL CKPNVVT
Sbjct: 306  EGEYVKLEELFDEILVFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECKPNVVT 365

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
             N LLGALC  G++R+A  V++ M  K GV+ +LH+YRI++DG +  GE+ EAC LLEEM
Sbjct: 366  CNILLGALCVAGDLRKARGVMKEMGWK-GVRLDLHSYRIMLDGLVGKGEIGEACFLLEEM 424

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL-SMFELSFK 1176
            L  SF PR+ TFD +I  +C++GL  EA+++ K+++ ++  PG+ AWEALL S  +L F 
Sbjct: 425  LEKSFFPRSSTFDHIIFQMCQKGLIVEAIELTKKIVAKSFVPGARAWEALLKSGSKLGFS 484

Query: 1177 ETNSS 1191
            ET  S
Sbjct: 485  ETTFS 489


>ref|XP_006429524.1| hypothetical protein CICLE_v10013613mg [Citrus clementina]
            gi|557531581|gb|ESR42764.1| hypothetical protein
            CICLE_v10013613mg [Citrus clementina]
          Length = 506

 Score =  414 bits (1065), Expect = e-113
 Identities = 213/424 (50%), Positives = 294/424 (69%), Gaps = 29/424 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L++SFSIY C+P P  Y F++ TL +NSQF  +  VL  IE  E F+ PE  F+ LI+TY
Sbjct: 67   LLHSFSIYNCEPPPEAYHFVIKTLAENSQFCDISSVLDHIEKRENFETPEFIFIDLIKTY 126

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
              A+   D+V++F++IP+FRC PS +SLNALLSVLC+ +E ++MV Q+LLKS  MN+R++
Sbjct: 127  ADAHRFQDSVNLFYKIPKFRCVPSVYSLNALLSVLCRNKEWVKMVPQILLKSQLMNIRIE 186

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ESSFRILIS LC I +V FAI+ILN M + G+  D    S ILS++C+  DLSS E+LGF
Sbjct: 187  ESSFRILISTLCRINRVGFAIEILNCMINDGFCVDGKTCSWILSSVCEQRDLSSDELLGF 246

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
            + EM+K GF    VDYTNVI  LVK +K                           +GV  
Sbjct: 247  VQEMKKLGFCFGMVDYTNVIRSLVKKEKVFDALGILNQMKSDGIKPDIVCYTMVLNGVIV 306

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
             E++ KAE++FDE+LVLG+VPD+YTYNVY++GL KQNN+EAG+KM+ CM+ELG KP+V+T
Sbjct: 307  QEDYVKAEELFDELLVLGLVPDVYTYNVYINGLCKQNNVEAGIKMIACMEELGSKPDVIT 366

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YNTLL ALC++ E+ +  E+++ M+ KG V  NL TY I+IDG  S G+++EAC LLEE 
Sbjct: 367  YNTLLQALCKVRELNRLRELVKEMKWKGIVL-NLQTYSIMIDGLASKGDIIEACGLLEEA 425

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL--SMFELSF 1173
            L+     ++  FDE IC LC+RGL  +AL++LK+M  ++++PG+  WEALL  S+ +L F
Sbjct: 426  LNKGLCTQSSMFDETICGLCQRGLVRKALELLKQMADKDVSPGARVWEALLLSSVSKLDF 485

Query: 1174 KETN 1185
              T+
Sbjct: 486  VNTS 489


>ref|XP_007140168.1| hypothetical protein PHAVU_008G089500g [Phaseolus vulgaris]
            gi|561013301|gb|ESW12162.1| hypothetical protein
            PHAVU_008G089500g [Phaseolus vulgaris]
          Length = 514

 Score =  411 bits (1057), Expect = e-112
 Identities = 207/426 (48%), Positives = 290/426 (68%), Gaps = 29/426 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L+++F  Y+CDPTP  Y F++ TLT  S    +PPVL  +E LE F+ PE   V+LIR Y
Sbjct: 66   LLDAFKAYSCDPTPKAYYFVIKTLTSTSHLQDIPPVLDHLEQLETFETPEFILVYLIRFY 125

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G ++ + DAVD+F RIPRFRCTP+ +SLN +LS+LC++RE L+MV ++LLKS  MN+R++
Sbjct: 126  GLSDRVQDAVDLFLRIPRFRCTPTVWSLNLVLSLLCRKRECLKMVPEILLKSQHMNIRVE 185

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+F++LI ALC IK+V +AI +LN M + GY  D T  SLI+S+LC+  D++S E L  
Sbjct: 186  ESTFQVLIEALCRIKRVGYAIKMLNYMIEGGYGLDETICSLIISSLCEQEDMTSVEALVI 245

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
              +MRK GF P  +DYTN+I FLVK  K                            G+ +
Sbjct: 246  WRDMRKLGFCPGVMDYTNMIRFLVKEGKGTDALDILNQQKKDGIKPDVVCYTMVLSGIVA 305

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
              E+ K E++FDE+LV G+VPD+YTYNVY++GL KQNN++  LK++  M+EL C+PNVVT
Sbjct: 306  EGEYVKLEELFDEILVFGLVPDVYTYNVYINGLCKQNNVDEALKIVASMEELECRPNVVT 365

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
             NTLLGALC  G++R+A  V++ M  K GV  NLH+YRI++DG +  GE+ EAC LLEEM
Sbjct: 366  CNTLLGALCVAGDLRKARGVMKEMGWK-GVGLNLHSYRIMLDGLVGKGEIGEACFLLEEM 424

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL--SMFELSF 1173
            L   F PR+ TFD +I  +C++GL +EA+++ K+++ ++  PG+ AWEALL  S  +L F
Sbjct: 425  LEKCFFPRSSTFDHIIFQMCQKGLIAEAIELTKKIVAKSFVPGARAWEALLLKSGSKLGF 484

Query: 1174 KETNSS 1191
             ET  S
Sbjct: 485  SETTFS 490


>ref|XP_003533674.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Glycine max]
          Length = 499

 Score =  407 bits (1047), Expect = e-111
 Identities = 202/411 (49%), Positives = 282/411 (68%), Gaps = 27/411 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L++SF  Y+ DPTP  Y F+L TLT  SQ   +PPVL+ +EHLEKF+ PE   V+LIR Y
Sbjct: 69   LLDSFKAYSIDPTPKAYFFVLKTLTSTSQLQDIPPVLYHLEHLEKFETPESILVYLIRFY 128

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G ++ + DAVD+FFRIPRFRCTP+  SLN +LS+LC++R+ L+MV ++LLKS  MN+R++
Sbjct: 129  GLSDRVQDAVDLFFRIPRFRCTPTVCSLNLVLSLLCRKRDCLEMVPEILLKSQHMNIRVE 188

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+FR+LI ALC IK+V +AI +LN M + GY  D    SL++SALC+  DL+SAE L  
Sbjct: 189  ESTFRVLIRALCRIKRVGYAIKMLNFMVEDGYGLDEKICSLVISALCEQKDLTSAEALVV 248

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
              +MRK GF P  +DYTN+I FLVK  +                            G+ +
Sbjct: 249  WRDMRKLGFCPGVMDYTNMIRFLVKEGRGMDALDILNQQKQDGIKLDVVSYTMVLSGIVA 308

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
              E+   +++FDEMLV+G++PD YTYNVY++GL KQNN+   L+++  M+ELGCKPNVVT
Sbjct: 309  EGEYVMLDELFDEMLVIGLIPDAYTYNVYINGLCKQNNVAEALQIVASMEELGCKPNVVT 368

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YNTLLGAL   G+  +A E+++ M  K GV  NLHTYRI++DG +  GE+ E+C LLEEM
Sbjct: 369  YNTLLGALSVAGDFVKARELMKEMGWK-GVGLNLHTYRIVLDGLVGKGEIGESCLLLEEM 427

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL 1152
            L     PR+ TFD +I  +C++ L +EA+++ K+++ ++  PG+S WEALL
Sbjct: 428  LEKCLFPRSSTFDNIIFQMCQKDLFTEAMELTKKVVAKSFLPGASTWEALL 478


>ref|XP_007026524.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
            cacao] gi|508715129|gb|EOY07026.1| Pentatricopeptide
            repeat superfamily protein, putative [Theobroma cacao]
          Length = 542

 Score =  400 bits (1029), Expect = e-109
 Identities = 198/424 (46%), Positives = 284/424 (66%), Gaps = 27/424 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L+ SFS+Y   PTP  Y FL+ TL QN  F+ +P VLH +EH+EKF  PE  F  LI TY
Sbjct: 105  LVRSFSLYNVHPTPQAYHFLIKTLIQNLHFNHIPSVLHHLEHVEKFQTPEYIFADLITTY 164

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G AN + DAVDIF+RIP+FRC PSA+SLN+LL++LC+ +  L++V QVLLKS  MN+R++
Sbjct: 165  GIANRIQDAVDIFYRIPKFRCVPSAYSLNSLLALLCRNQYSLKLVPQVLLKSLLMNIRVE 224

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+ RIL+SALC + KV++AIDIL  M D G   +    S ILS++C  +DL   +V+G 
Sbjct: 225  ESTLRILVSALCRMNKVSYAIDILQRMIDEGLGVNDKVCSFILSSICAKADLDGEDVMGL 284

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
              E+ K GF P   DY  +I FLVK  +                           +GV +
Sbjct: 285  WRELGKLGFCPAMSDYNCLIRFLVKKGRGLDALDFLNQMKSVGIKPGIVSYTMALNGVIA 344

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
              ++  A+++FDE+L+LG+VPD+YTYN Y+  L KQN +E G+KM+ CM+EL CKPNV+T
Sbjct: 345  EGDYMLADELFDELLMLGLVPDVYTYNAYIDALCKQNKVEEGIKMVACMEELRCKPNVLT 404

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YN LL A+C++GE+ +A E+++ M+ K G++ NL +Y ++IDG +S GE++EA  L+EE+
Sbjct: 405  YNMLLEAICKVGEISRAMELVKEMKYK-GIEMNLVSYTVIIDGLVSKGEILEAHGLVEEV 463

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALLSMFELSFKE 1179
            LH  F  ++  FDE+IC LC+RGL  EAL++L++M+ +N++PG+  WEALL   E     
Sbjct: 464  LHKCFCHQSLAFDEVICGLCQRGLVCEALELLRKMVAKNVSPGARGWEALLLSSESKINF 523

Query: 1180 TNSS 1191
             N++
Sbjct: 524  ANTT 527


>ref|XP_003623530.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355498545|gb|AES79748.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 653

 Score =  399 bits (1024), Expect = e-108
 Identities = 205/425 (48%), Positives = 287/425 (67%), Gaps = 31/425 (7%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQ--NSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIR 174
            LI+SF  Y  DP+P  Y FL+ T+T    S   ++P +L+ +EH EKF+ PE  F++LIR
Sbjct: 61   LIHSFKAYHTDPSPKAYFFLIKTITNINTSHLHEIPHILNHLEHNEKFETPEFIFMYLIR 120

Query: 175  TYGRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVR 354
             YG  + + DAVD+FFRIPRFRCTP+  SLN LLS+LC +RE L+MV  +LLKS +M +R
Sbjct: 121  FYGFNDRVQDAVDLFFRIPRFRCTPTVCSLNLLLSLLCGKRECLRMVPDILLKSRDMKIR 180

Query: 355  LDESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVL 534
            L+ESSF +LI ALC IK+V +AI ++N M + GY  D    SLI+S+LC+ +DL+S E L
Sbjct: 181  LEESSFWVLIKALCRIKRVDYAIKMMNCMVEDGYCLDDKICSLIISSLCEQNDLTSVEAL 240

Query: 535  GFLGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGV 633
               G MRK GF P  +D TN+I FLVK  K                            G+
Sbjct: 241  VVWGNMRKLGFCPGVMDCTNMIRFLVKEGKGMDALEILNQLKEDGIKPDIVCYTIVLSGI 300

Query: 634  TSAEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNV 813
                ++ K +++FDE+LVLG+VPD+YTYNVY++GL KQNN +  LK++  M++LGCKPNV
Sbjct: 301  VKEGDYVKLDELFDEILVLGLVPDVYTYNVYINGLCKQNNFDEALKIVVSMEKLGCKPNV 360

Query: 814  VTYNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLE 993
            VTYNTLLGALC  G++ +A+ V++ MR K GV+ NLHTYRI++DG +  GE+ EAC LLE
Sbjct: 361  VTYNTLLGALCMSGDLGKAKRVMKEMRLK-GVELNLHTYRIMLDGLVGKGEIGEACVLLE 419

Query: 994  EMLHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL--SMFEL 1167
            EML   F PR+ TFD ++  +C++GL S+AL ++ +++ ++  PG+  WEALL  S  ++
Sbjct: 420  EMLEKCFYPRSSTFDSIVHQMCQKGLISDALVLMNKIVAKSFDPGAKVWEALLLNSESKV 479

Query: 1168 SFKET 1182
            ++ ET
Sbjct: 480  TYSET 484


>ref|XP_004305097.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 491

 Score =  397 bits (1020), Expect = e-108
 Identities = 203/411 (49%), Positives = 273/411 (66%), Gaps = 27/411 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            LI+SF+ + CDPTP  Y+F+L TL + SQ   +P VL R+E +EKF  PE  F +LIR Y
Sbjct: 58   LIHSFNTFNCDPTPEAYNFVLKTLFKTSQLSHIPSVLDRLESIEKFHPPESIFANLIRFY 117

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G AN + DA+D+F RIP+FRC PSA SLN+LL VLC   EGL+MV QVL+ S  M +RL+
Sbjct: 118  GSANRVEDAIDVFCRIPKFRCDPSAVSLNSLLYVLCGSSEGLKMVPQVLMNSRAMGIRLE 177

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ESSFRILISALC I  V +AI+I+  M   GYD D    SL+LS+LC+   +   EV+GF
Sbjct: 178  ESSFRILISALCRIGSVGYAIEIMKCMISNGYDLDVKICSLVLSSLCEQKGVGGLEVVGF 237

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADKD---------------------------GVTS 639
            + EM+K GF P  +DY+NVI  LVK  K                            GV +
Sbjct: 238  VEEMKKVGFCPGMLDYSNVIRCLVKQGKGLDALRVLCKMKVEGMKPDIVCYTMVLYGVIA 297

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
              +++ A+ VFDE+LVLG+VPD+YTYNVY++GL  QNN+EAG+KM+TCMDELGC+PN++T
Sbjct: 298  NGDYKNADKVFDELLVLGLVPDVYTYNVYINGLCNQNNVEAGIKMITCMDELGCRPNLIT 357

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YN LL ALC+  E+ +A E++  M +  GV  NL T+ I++DG    G+V EAC  +EEM
Sbjct: 358  YNLLLKALCKNEELSRARELVSEM-TLNGVGVNLQTHIIMLDGLFCKGDVDEACIFMEEM 416

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL 1152
            L      R   +D++I  LC+RGL  +A+ +L +M+ +N+ PG+ AWEALL
Sbjct: 417  LDKFMCRRCSAYDDVIYGLCQRGLVCKAMDLLLKMVDKNVVPGARAWEALL 467


>ref|XP_002309173.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550335936|gb|EEE92696.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 490

 Score =  386 bits (992), Expect = e-104
 Identities = 188/395 (47%), Positives = 272/395 (68%), Gaps = 27/395 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            LI+SFSIY  +P P  + F+  TL + SQF  +P VL  +E +E F+ PE  F +LI  Y
Sbjct: 64   LIHSFSIYDVEPAPKAFDFIFKTLVKTSQFHHIPSVLDHLEKVESFEPPESTFAYLIEVY 123

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            GR N  H+A+++F+RIP+FRC PS +SLN L+SVLC+  +GL++V ++LLKS  MN+R++
Sbjct: 124  GRTNKTHEAIELFYRIPKFRCVPSVYSLNTLISVLCRNSKGLKLVPEILLKSQVMNIRVE 183

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+F++LI+ALC I+KV FAI++LN M + G+  ++  YSL+LS LC+  D +  EV+GF
Sbjct: 184  ESTFQVLITALCRIRKVGFAIEMLNCMVNDGFIVNAEIYSLLLSCLCEQKDATKFEVIGF 243

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
            L ++RK GF P  VDY+NVI FLVK  +                            GV  
Sbjct: 244  LEQLRKLGFFPGMVDYSNVIRFLVKGKRGLDALHVLNHMKSDRIKPDIFCYTMVLHGVIE 303

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
             +++ KA+++FDE+LV G+VPD YTYNVY++GL KQNN++AG+KM+  M+ELGCKPN++T
Sbjct: 304  DKDYLKADELFDELLVFGLVPDAYTYNVYINGLCKQNNVQAGIKMVASMEELGCKPNLIT 363

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YN L+  LC++GE+ +A E++R M  K G+  N+ TYRI+IDG  SNG+++EAC L EE 
Sbjct: 364  YNMLVKQLCKVGELSKAGELVREMGLK-GIGLNMQTYRIMIDGLASNGKIVEACGLFEEA 422

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEM 1104
            L      ++  FDE+IC LC R L+ +ALK+L++M
Sbjct: 423  LDKGLCTQSLMFDEIICGLCHRDLSCKALKLLEKM 457


>gb|AHB18409.1| pentatricopeptide repeat-containing protein [Gossypium hirsutum]
          Length = 480

 Score =  382 bits (981), Expect = e-103
 Identities = 187/411 (45%), Positives = 275/411 (66%), Gaps = 27/411 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L+ S S+Y    +P  Y FL+ TL  N QF  +P +LH ++ L+ F  PE  F HL++ Y
Sbjct: 57   LLQSLSLYNLHQSPQAYHFLIKTLLHNRQFHHIPSLLHHLQ-LQHFQTPEYIFTHLVKFY 115

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G+AN + DAVDIF+RIP+FRC PSA+SLNALL++LC+ + GL+++ QVLL S  MN+RL+
Sbjct: 116  GKANRIQDAVDIFYRIPQFRCFPSAYSLNALLALLCRSQRGLKLLPQVLLNSLHMNIRLE 175

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+FR+L+  LC + KVA+AI+IL  M D G   +   +S +LS++C   DL   +V+GF
Sbjct: 176  ESTFRLLVCTLCRMNKVAYAIEILQRMLDDGLGVNDKVFSFVLSSVCAEGDLDGEDVIGF 235

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
               +RK GFSP   DY  V+ FLVK  +                           +GVT+
Sbjct: 236  WRGLRKLGFSPAMGDYDGVVRFLVKKGRGLDAWDVLNQMKSDGIMPGIISYTMVLNGVTA 295

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
              ++  A+++FDE+L+LG+VP++YTY  Y+  L KQN +E G+KM+ CM+ELGCKPNV+ 
Sbjct: 296  EGDYILADELFDELLMLGLVPNVYTYKAYIDALCKQNKVEEGIKMVACMEELGCKPNVLI 355

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YNTLL  + + GE+ +A E+++ M+ K G++ N  +Y I+IDG +SNGE++EAC L+EE+
Sbjct: 356  YNTLLRTISKAGEISRARELVKEMKYK-GIEMNWVSYTIIIDGLVSNGEILEACALVEEV 414

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL 1152
            LH     ++ TFDE+IC LC+RGL  +A ++L +M+ R+I+PG+  WEALL
Sbjct: 415  LHKCIFIKSLTFDEVICGLCQRGLVCKARELLGKMVERSISPGARVWEALL 465


>gb|EXB42398.1| hypothetical protein L484_021993 [Morus notabilis]
          Length = 494

 Score =  378 bits (971), Expect = e-102
 Identities = 199/414 (48%), Positives = 276/414 (66%), Gaps = 30/414 (7%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L+NSF+ Y C+PTP  Y F+L TL + SQFD +  VL RIE +EKF+ PE FF  +I  Y
Sbjct: 56   LLNSFNSYDCNPTPEAYHFVLKTLIKTSQFDHIHSVLDRIEFVEKFETPEYFFAQIIGFY 115

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G  + + DA+DIF+RIP+FRC PS++SLN+LL VLC++ EGL+ V +VL+KS +MN+RL+
Sbjct: 116  GFLDRIEDAIDIFWRIPKFRCVPSSYSLNSLLYVLCRRNEGLRFVPEVLIKSRDMNIRLE 175

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALC---KISDLSSAEV 531
            E+SFRILI+ALC I KV +AI+IL+ M   GYD D+   SLILS LC   K  DL+  +V
Sbjct: 176  EASFRILITALCKIGKVGYAIEILDCMISDGYDIDARICSLILSFLCGKNKELDLAGFDV 235

Query: 532  LGFLGEMRKAGFSPDGVDYTNVIAFLV---------------KAD------------KDG 630
            L  L +M K GF P   DY+ VI  LV               KAD              G
Sbjct: 236  LELLQKMEKMGFCPRMGDYSKVIRILVREKRGLEALDILGQMKADGMKPDVVCYTMVLHG 295

Query: 631  VTSAEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPN 810
            + +  E+ KA+++FDEMLVLG+VPD+YTYN Y++GL KQN+++  L  +  M+ELGCKPN
Sbjct: 296  IVAEGEYSKADEMFDEMLVLGLVPDVYTYNAYINGLCKQNDVDGALDTILRMEELGCKPN 355

Query: 811  VVTYNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLL 990
            ++TYN +L ALC+ GE  +A+E++  M  K G +  L TY I++D  +  GE++EAC L+
Sbjct: 356  LITYNLILRALCKNGEFGRAKELVAEMSLK-GFEDYLQTYIIMLDVLLGKGEIVEACGLM 414

Query: 991  EEMLHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALL 1152
            EEML      R   +DE+I  LC+RGL  +A ++L +M+ +N+APG+ AW+ALL
Sbjct: 415  EEMLDKLLCRRCSMYDEIIFGLCRRGLDCKASEMLGKMVGKNVAPGARAWDALL 468


>ref|XP_004137893.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Cucumis sativus]
            gi|449483740|ref|XP_004156675.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Cucumis sativus]
          Length = 491

 Score =  373 bits (957), Expect = e-100
 Identities = 194/423 (45%), Positives = 270/423 (63%), Gaps = 27/423 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L+ SF+ Y+C PTP+ Y F+L TL + SQF  +PPVLHR++ LE F  PE  FV LI+ Y
Sbjct: 59   LVTSFTAYSCHPTPNAYYFVLKTLARTSQFHHIPPVLHRLQFLENFQTPEYIFVDLIKLY 118

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            GR N + DAV +F RIP FRC PS  SLN+LLS L +  +GL ++  ++L SH M +RL+
Sbjct: 119  GRMNRIQDAVTLFRRIPMFRCVPSTLSLNSLLSQLSRNAQGLPIIPDIILNSHSMGIRLE 178

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
             S+F+ILI+ALC + KV  A+++ N M   GY  +    SLIL++LC+    S   VLGF
Sbjct: 179  HSTFQILITALCKVNKVGHAMELFNYMITEGYGLNPQICSLILASLCQQKKSSGDVVLGF 238

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLV---------------KAD------------KDGVTS 639
            L EMR+ GF P  VDY+NVI F V               KAD             +GV +
Sbjct: 239  LEEMRQKGFCPAVVDYSNVIKFFVTRGMGSDAVDLLNKMKADGFKPDIVCYTMVLNGVIA 298

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
              +++ A+++FDE+L+ G+VPDIYTYNVY+ GL KQ +  AGL+M+  M+ LGC+PNV+T
Sbjct: 299  DGDYKMADELFDELLLFGLVPDIYTYNVYIHGLCKQGDSVAGLQMIPHMEALGCQPNVIT 358

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YN +L +LC+ GE+ +A ++  +M+ K G+  NL T+RI+IDG   NGEV+EAC LLEEM
Sbjct: 359  YNVILKSLCKTGELDEARKLRSKMQLK-GLAENLRTFRIMIDGLFHNGEVIEACVLLEEM 417

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALLSMFELSFKE 1179
            L   F P+  TF E++  LCKR +  +A+++L  M+ +N +PG  AWE LL   E     
Sbjct: 418  LGSRFPPQISTFSEILSWLCKRHMVGKAVELLALMVGKNFSPGPKAWEILLLSSESELTS 477

Query: 1180 TNS 1188
              S
Sbjct: 478  VKS 480


>ref|XP_004246310.1| PREDICTED: pentatricopeptide repeat-containing protein At2g38420,
            mitochondrial-like [Solanum lycopersicum]
          Length = 496

 Score =  343 bits (881), Expect = 1e-91
 Identities = 184/414 (44%), Positives = 260/414 (62%), Gaps = 28/414 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQN-SQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRT 177
            L++SFS Y CDPTP+ Y F+L TLTQN S +D++P +L  I   E F+ PE  F +LI+ 
Sbjct: 78   LLDSFSAYECDPTPNAYYFILKTLTQNPSTWDEIPLILDYIRKFENFETPEYIFTYLIKF 137

Query: 178  YGRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRL 357
            YG +NM H A ++FF +P +RC PS  SLN L+ VLCK    L++V QVL+KS  +N+ +
Sbjct: 138  YGDSNMTHLAYEMFFTMPAYRCNPSVKSLNCLIWVLCKNNYDLRIVLQVLVKSQLLNIWV 197

Query: 358  DESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLG 537
            +ES+F+ILI ALC I K   A+D+L LM D G++ D+   SLILS +  + D    E+ G
Sbjct: 198  EESTFKILIRALCRIGKTNNAVDLLKLMVDSGFNLDANICSLILSTMPDVKDCVGVEIWG 257

Query: 538  FLGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVT 636
             L EMRK G+SP  VD  NVI F V   K                           +G+ 
Sbjct: 258  VLEEMRKLGYSPKRVDLCNVIRFYVNNGKGIDALEVLNKMKMCGMVPDVVCYNLVLNGLI 317

Query: 637  SAEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVV 816
               E+  A+++FDE+LVLG+ PDI TYNVY++GL KQ+ +   L++L CM++LGCKP + 
Sbjct: 318  FEGEYSNADELFDELLVLGLNPDIVTYNVYINGLCKQDKMVEALRVLGCMEDLGCKPEMN 377

Query: 817  TYNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEE 996
            TY+T+L  LCR G +   +EVL +M+SK G+Q + H Y ++I+  I NGEV EA  LL E
Sbjct: 378  TYHTILDGLCRCGMLSSVKEVLGQMKSK-GLQLSSHIYGVIINCMIRNGEVDEAYNLLHE 436

Query: 997  MLHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALLSM 1158
            M+ M FVP++ TFD +I  LC +G   E +++L  M T+N+ PG  +WEA + +
Sbjct: 437  MVDMGFVPQSITFDGLIGLLCNKGSFYEVMELLSIMSTKNLVPGIRSWEAFVQV 490


>ref|XP_006411054.1| hypothetical protein EUTSA_v10017948mg [Eutrema salsugineum]
            gi|557112223|gb|ESQ52507.1| hypothetical protein
            EUTSA_v10017948mg [Eutrema salsugineum]
          Length = 456

 Score =  332 bits (850), Expect = 4e-88
 Identities = 173/396 (43%), Positives = 259/396 (65%), Gaps = 27/396 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            LI+SF ++ C+PTP  Y F++ TL + SQ + +  VL+ IE  EKFD PE  F  +I  Y
Sbjct: 62   LISSFRLHNCEPTPQAYKFVIKTLAKTSQLENIASVLNHIEISEKFDTPESIFRDVIFAY 121

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G +  + +A+D+FF+IP FRC PSA++LNALLSVL ++R+GL+MV +VLLK+ ++ VRL+
Sbjct: 122  GFSGRIEEAIDVFFKIPNFRCVPSAYTLNALLSVLVRKRQGLKMVPEVLLKASKLGVRLE 181

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+  ILI ALC I +V  A D++  MSD  Y  D   YSL+LS++CK  D S  +V+G+
Sbjct: 182  ESTLGILIDALCRIGEVDCATDLVKDMSDDCYIVDPRLYSLLLSSVCKHKDSSCFDVIGY 241

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
            L  +RK  FSPD  DYT V+ FLV+  +                            GV +
Sbjct: 242  LEGLRKTRFSPDLRDYTAVMRFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIILQGVIA 301

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
             E+++KA+ +FDE+L+LG+VPD+YTYNVY++GL KQ++IE G+KM++CM++LG +PNVVT
Sbjct: 302  DEDYKKADKLFDELLLLGLVPDVYTYNVYINGLCKQSDIECGIKMMSCMEKLGSEPNVVT 361

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YN L+ AL + G++ +A+ +   M +  GV  N H+Y I+++  I   EV+ A  LLEE 
Sbjct: 362  YNILIKALVKAGDMSRAKIIWEEMET-NGVDRNSHSYDIMVNASIEADEVVCAHGLLEEA 420

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMI 1107
               S V ++   +E+IC LC +GL  +A+++L  ++
Sbjct: 421  FSRSLVVKSSRTEEVICRLCDKGLMDKAVELLVHLV 456


>ref|XP_002533822.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223526239|gb|EEF28557.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 373

 Score =  325 bits (832), Expect = 5e-86
 Identities = 162/355 (45%), Positives = 237/355 (66%), Gaps = 27/355 (7%)
 Frame = +1

Query: 196  LHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLDESSFR 375
            + +A+ +F+R P FRC PS + LN LLSVLC+  EGL  V +VLLKS +MN+R++ESSFR
Sbjct: 1    MQNAIHLFYRTPNFRCVPSVYLLNTLLSVLCRTNEGLNFVPEVLLKSQDMNIRMEESSFR 60

Query: 376  ILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGFLGEMR 555
            +LI+ALC+I KV +A+++ N M + G+  DS   SL+LS+LC  +D+SS+EV+ FLGE+R
Sbjct: 61   LLINALCSINKVGYAVEMFNCMINDGFSVDSKICSLLLSSLCYQADISSSEVMRFLGELR 120

Query: 556  KAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTSAEEFQ 654
            K GF P   DY+ VI FLV+                              +GV +   + 
Sbjct: 121  KFGFCPGIKDYSKVINFLVRRGMGMEALNVLNQMKLDGIKPDIVCYTTVLNGVIANGVYS 180

Query: 655  KAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVTYNTLL 834
            KA+++FDE+LV G+VPD+YTYNVY+ GL KQNN+EAG++M+T M+ELGCKPN++TYN LL
Sbjct: 181  KADELFDELLVFGLVPDVYTYNVYIYGLCKQNNVEAGIEMVTSMEELGCKPNLITYNILL 240

Query: 835  GALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEMLHMSF 1014
              LC+ GE  +A +++R M SK G+   + TY+++I G  S G++++AC LLEE L    
Sbjct: 241  EDLCKNGEDSRARDLVRDMGSK-GIGLGMQTYKVMIHGLTSGGKIVKACSLLEEALDKGL 299

Query: 1015 VPRAQTFDEMICALCKRGLASEALKVLKEMITRNIAPGSSAWEALLSMFELSFKE 1179
             PR   FDE+I  LC+ G   +AL++L++++ +N++PG   WE LL    ++F E
Sbjct: 300  CPRGLRFDEVIYGLCQTGSICKALELLEKVVNKNVSPGVRVWETLLLKSNINFVE 354


>ref|NP_181376.3| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|218546769|sp|Q8L6Y7.2|PP193_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g38420, mitochondrial; Flags: Precursor
            gi|3395430|gb|AAC28762.1| hypothetical protein
            [Arabidopsis thaliana] gi|330254441|gb|AEC09535.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 453

 Score =  311 bits (797), Expect = 5e-82
 Identities = 160/396 (40%), Positives = 253/396 (63%), Gaps = 27/396 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L++SF ++ C+PTP  Y F++ TL ++SQ + +  VL+ +E  EKFD PE  F  +I  Y
Sbjct: 59   LLSSFQLHNCEPTPQAYRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAY 118

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G +  + +A+++FF+IP FRC PSA++LNALL VL ++R+ L++V ++L+K+  M VRL+
Sbjct: 119  GFSGRIEEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMGVRLE 178

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+F ILI ALC I +V  A +++  MS      D   YS +LS++CK  D S  +V+G+
Sbjct: 179  ESTFGILIDALCRIGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGY 238

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
            L ++RK  FSP   DYT V+ FLV+  +                            GV +
Sbjct: 239  LEDLRKTRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIA 298

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
             E++ KA+ +FDE+L+LG+ PD+YTYNVY++GL KQN+IE  LKM++ M++LG +PNVVT
Sbjct: 299  DEDYPKADKLFDELLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVT 358

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YN L+ AL + G++ +A+ + + M +  GV  N HT+ I+I  +I   EV+ A  LLEE 
Sbjct: 359  YNILIKALVKAGDLSRAKTLWKEMET-NGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEA 417

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMI 1107
             +M+   ++   +E+I  LC++GL  +A+++L  ++
Sbjct: 418  FNMNVFVKSSRIEEVISRLCEKGLMDQAVELLAHLV 453


>gb|AAM98219.1| unknown protein [Arabidopsis thaliana] gi|31376375|gb|AAP49514.1|
            At2g38420 [Arabidopsis thaliana]
          Length = 444

 Score =  311 bits (797), Expect = 5e-82
 Identities = 160/396 (40%), Positives = 253/396 (63%), Gaps = 27/396 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L++SF ++ C+PTP  Y F++ TL ++SQ + +  VL+ +E  EKFD PE  F  +I  Y
Sbjct: 50   LLSSFQLHNCEPTPQAYRFVIKTLAKSSQLENISSVLYHLEVSEKFDTPESIFRDVIAAY 109

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G +  + +A+++FF+IP FRC PSA++LNALL VL ++R+ L++V ++L+K+  M VRL+
Sbjct: 110  GFSGRIEEAIEVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKACRMGVRLE 169

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+F ILI ALC I +V  A +++  MS      D   YS +LS++CK  D S  +V+G+
Sbjct: 170  ESTFGILIDALCRIGEVDCATELVRYMSQDSVIVDPRLYSRLLSSVCKHKDSSCFDVIGY 229

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
            L ++RK  FSP   DYT V+ FLV+  +                            GV +
Sbjct: 230  LEDLRKTRFSPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRVEPDLVCYTIVLQGVIA 289

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
             E++ KA+ +FDE+L+LG+ PD+YTYNVY++GL KQN+IE  LKM++ M++LG +PNVVT
Sbjct: 290  DEDYPKADKLFDELLLLGLAPDVYTYNVYINGLCKQNDIEGALKMMSSMNKLGSEPNVVT 349

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YN L+ AL + G++ +A+ + + M +  GV  N HT+ I+I  +I   EV+ A  LLEE 
Sbjct: 350  YNILIKALVKAGDLSRAKTLWKEMET-NGVNRNSHTFDIMISAYIEVDEVVCAHGLLEEA 408

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMI 1107
             +M+   ++   +E+I  LC++GL  +A+++L  ++
Sbjct: 409  FNMNVFVKSSRIEEVISRLCEKGLMDQAVELLAHLV 444


>ref|XP_006294146.1| hypothetical protein CARUB_v10023139mg [Capsella rubella]
            gi|482562854|gb|EOA27044.1| hypothetical protein
            CARUB_v10023139mg [Capsella rubella]
          Length = 470

 Score =  311 bits (796), Expect = 7e-82
 Identities = 164/396 (41%), Positives = 246/396 (62%), Gaps = 27/396 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L++SF ++ C+PTP  Y F++ TL + SQ + +  VL  +E  EKFD PE  F  +I  Y
Sbjct: 76   LVSSFRLHNCEPTPQAYRFVIKTLAKTSQLENIASVLSHLEVSEKFDTPESIFRDVIAAY 135

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G A  + +A+D+FF+IP FRC PSA++LNALL VL ++RE L++V ++L+K+  M VRL+
Sbjct: 136  GFAGRIGEAIDVFFKIPNFRCVPSAYTLNALLLVLVRKRESLELVPEILVKASRMGVRLE 195

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+F ILI ALC I +V  A +++  MS      D   YS +LS++CK  D S  +V+G+
Sbjct: 196  ESTFGILIDALCKIGEVDCATELVRYMSIDCVIVDPRLYSQLLSSVCKHKDSSCFDVVGY 255

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADK---------------------------DGVTS 639
            L ++RK  FSP   DYT V++FLV+  +                            GV +
Sbjct: 256  LEDLRKTRFSPGLRDYTVVMSFLVEGGRGKEVVSVLNQMKCDRIEPDIVCYTIVLQGVIA 315

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
              E+ KA+  FDE+L+LG+ PD+YTYNVY++GL KQN+IE  LKM++ M++LG +PNV+T
Sbjct: 316  DAEYSKADKFFDELLLLGLAPDVYTYNVYMNGLCKQNDIEGALKMMSSMNKLGSEPNVIT 375

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YN L+ AL   G++ QA+ +   M    GV  N HTY I+I  FI  G+V+ A   LEE 
Sbjct: 376  YNILIKALVNAGDLSQAKTLWEEM-GINGVNRNSHTYDIMISAFIEVGDVVSAQGFLEEA 434

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMI 1107
             +M+   ++   +E+I  LC +GL  +A+++L  ++
Sbjct: 435  FNMNVFAKSSRTEEVISRLCDKGLMDKAVELLAHLV 470


>ref|XP_002879744.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297325583|gb|EFH56003.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 444

 Score =  309 bits (792), Expect = 2e-81
 Identities = 159/396 (40%), Positives = 251/396 (63%), Gaps = 27/396 (6%)
 Frame = +1

Query: 1    LINSFSIYACDPTPSPYSFLLNTLTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTY 180
            L++SF ++ C+PTP  Y F++ TL + SQ + +  VL  +E  EKFD PE  F  +I  Y
Sbjct: 50   LVSSFQLHNCEPTPQAYRFVIETLAKTSQLENIASVLDHLEVSEKFDTPESIFRDVIAAY 109

Query: 181  GRANMLHDAVDIFFRIPRFRCTPSAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLD 360
            G +  + +A+D+FF+IP FRC PSA++LNALL VL ++R+ L++V ++L+K+  M VRL+
Sbjct: 110  GFSGRIEEAIDVFFKIPNFRCVPSAYTLNALLLVLVRKRQSLELVPEILVKASRMGVRLE 169

Query: 361  ESSFRILISALCNIKKVAFAIDILNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGF 540
            ES+F ILI+ALC I +V  A +++  MS+     D   YSL+LS++CK  D S  +V+G+
Sbjct: 170  ESTFGILINALCRIGEVDCATELVRYMSEDSVIVDPRLYSLLLSSVCKHKDSSCFDVIGY 229

Query: 541  LGEMRKAGFSPDGVDYTNVIAFLVKADKD---------------------------GVTS 639
            L ++RK  F P   DYT V+ FLV+  +                            GV +
Sbjct: 230  LEDLRKTRFLPGLRDYTVVMRFLVEGGRGKEVVSVLNQMKCDRIDPDVVCYTIVLLGVIA 289

Query: 640  AEEFQKAEDVFDEMLVLGVVPDIYTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVT 819
             E++ KA+ +FDE+L+LG+ PD+YTYNVY++GL KQN+IE  +KM++ M++LG +PNVVT
Sbjct: 290  DEDYPKADKLFDELLLLGLDPDVYTYNVYINGLCKQNDIEGAIKMMSSMNKLGSEPNVVT 349

Query: 820  YNTLLGALCRLGEVRQAEEVLRRMRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEM 999
            YN ++  L + G++ +A+ + + M    GV  N HTY I+I  +I   EV+ A  LLEE 
Sbjct: 350  YNIVIKGLVKAGDLSRAKTLWKEM-EMNGVNRNSHTYDIMISAYIEVDEVVCAQGLLEEA 408

Query: 1000 LHMSFVPRAQTFDEMICALCKRGLASEALKVLKEMI 1107
             +M+   ++   +E+I  LC++GL  +A+++L  ++
Sbjct: 409  FNMNLFVKSSKIEEVISRLCEKGLMDKAVELLAHLV 444


>ref|XP_006854116.1| hypothetical protein AMTR_s00048p00149840 [Amborella trichopoda]
            gi|548857785|gb|ERN15583.1| hypothetical protein
            AMTR_s00048p00149840 [Amborella trichopoda]
          Length = 464

 Score =  274 bits (701), Expect = 7e-71
 Identities = 153/374 (40%), Positives = 223/374 (59%), Gaps = 27/374 (7%)
 Frame = +1

Query: 70   LTQNSQFDQLPPVLHRIEHLEKFDIPEPFFVHLIRTYGRANMLHDAVDIFFRIPRFRCTP 249
            L QN QF  L  +L  ++   KF  PE   + LI++   + M+ +A+D+FF +P  RC P
Sbjct: 91   LAQNPQFSGLKTLLRCLQSNRKFSTPETRIIGLIQSCASSKMVKEALDLFFAMPHLRCQP 150

Query: 250  SAFSLNALLSVLCKQREGLQMVRQVLLKSHEMNVRLDESSFRILISALCNIKKVAFAIDI 429
            S  SLNALLSVLC   +   +V ++L+K+ EMN+RLD SSFRILI +LC I K+ FAI++
Sbjct: 151  STTSLNALLSVLC-DTDSFHLVPELLIKTLEMNIRLDASSFRILIGSLCRIGKLGFAIEL 209

Query: 430  LNLMSDYGYDPDSTFYSLILSALCKISDLSSAEVLGFLGEMRKAGFSPDGVDYTNVIAFL 609
            L LM D G  PDS FY+ IL  LC+  + S  E+ GFL EM+ AGF PD + Y  VI  L
Sbjct: 210  LRLMPDQGCWPDSGFYAEILCKLCEFGEFS--EIYGFLDEMKDAGFFPDKIAYAIVIDSL 267

Query: 610  VKADK---------------------------DGVTSAEEFQKAEDVFDEMLVLGVVPDI 708
             K  +                           DG     EF++A +VFDEML +G+VPD+
Sbjct: 268  AKGGRLNEARAILNRMKLEGAKPDTITYTSMMDGFYKIGEFKQAGEVFDEMLAMGLVPDV 327

Query: 709  YTYNVYLSGLFKQNNIEAGLKMLTCMDELGCKPNVVTYNTLLGALCRLGEVRQAEEVLRR 888
            +TY+VY++GL ++  +E   ++L  M E+GC+PNV+TYNTL+   C  G +R+A+E++  
Sbjct: 328  FTYSVYINGLCRERKLEEAKEVLCVMREMGCRPNVITYNTLIRTFCSDGNLRRADELVAE 387

Query: 889  MRSKGGVQGNLHTYRILIDGFISNGEVMEACRLLEEMLHMSFVPRAQTFDEMICALCKRG 1068
            M S  GV GN  TYR LI+ ++  G V+EA  LL +M+   F P   T++ ++ +   + 
Sbjct: 388  MGS-NGVCGNSVTYRTLINAYLREGMVVEANELLVQMVGKGFFPHFSTWEALLSSTVFKW 446

Query: 1069 LASEALKVLKEMIT 1110
               +AL  L+E+I+
Sbjct: 447  DILQALNALEELIS 460


Top