BLASTX nr result

ID: Rehmannia23_contig00016271 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00016271
         (1089 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002283907.2| PREDICTED: pentatricopeptide repeat-containi...   541   e-151
ref|XP_006433856.1| hypothetical protein CICLE_v10000867mg [Citr...   532   e-149
gb|EXC16677.1| hypothetical protein L484_007723 [Morus notabilis]     529   e-148
ref|XP_006472504.1| PREDICTED: pentatricopeptide repeat-containi...   529   e-148
gb|EOY33044.1| Pentatricopeptide repeat superfamily protein isof...   520   e-145
ref|XP_002532248.1| pentatricopeptide repeat-containing protein,...   506   e-141
ref|XP_004237977.1| PREDICTED: pentatricopeptide repeat-containi...   505   e-140
ref|XP_006338085.1| PREDICTED: pentatricopeptide repeat-containi...   499   e-138
ref|XP_004136259.1| PREDICTED: pentatricopeptide repeat-containi...   496   e-138
ref|XP_004301354.1| PREDICTED: LOW QUALITY PROTEIN: pentatricope...   494   e-137
ref|XP_003527867.1| PREDICTED: pentatricopeptide repeat-containi...   481   e-133
ref|XP_004501623.1| PREDICTED: pentatricopeptide repeat-containi...   480   e-133
ref|XP_002301082.1| hypothetical protein POPTR_0002s10380g [Popu...   474   e-131
ref|XP_003602939.1| Pentatricopeptide repeat-containing protein ...   473   e-131
emb|CAN80524.1| hypothetical protein VITISV_030537 [Vitis vinifera]   473   e-131
gb|ESW09636.1| hypothetical protein PHAVU_009G143500g, partial [...   458   e-126
gb|EPS69387.1| hypothetical protein M569_05378, partial [Genlise...   421   e-115
ref|XP_006403586.1| hypothetical protein EUTSA_v10010303mg [Eutr...   420   e-115
ref|XP_002873896.1| pentatricopeptide repeat-containing protein ...   420   e-115
ref|XP_006287559.1| hypothetical protein CARUB_v10000770mg [Caps...   417   e-114

>ref|XP_002283907.2| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like
            [Vitis vinifera]
          Length = 513

 Score =  541 bits (1393), Expect = e-151
 Identities = 261/363 (71%), Positives = 309/363 (85%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF  IRPI R  PSLKAISTCLN+LV++NQ+DL R FLL+++K  +L+PNTCIFNILVKH
Sbjct: 150  MFDAIRPIVREKPSLKAISTCLNLLVESNQVDLTRKFLLNSKKSLNLEPNTCIFNILVKH 209

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HCK GD++SAFEVV+EM+KS +S PNLITYSTLI+GLCG+GRL+EAI LFEEMVSK QI+
Sbjct: 210  HCKNGDIDSAFEVVEEMKKSHVSYPNLITYSTLINGLCGSGRLKEAIELFEEMVSKDQIL 269

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALTYN LINGFC G K DRA KIM+FMKKNGCNPN FNYS+LMNG C++G+ EEAKE+
Sbjct: 270  PDALTYNALINGFCHGEKVDRALKIMEFMKKNGCNPNVFNYSALMNGFCKEGRLEEAKEV 329

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K +GL PDTV YTTLIN  CRAGR DEA+ELLK+M+EN CRAD VTFNVILGGLCR
Sbjct: 330  FDEMKSLGLKPDTVGYTTLINFFCRAGRVDEAMELLKDMRENKCRADTVTFNVILGGLCR 389

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R+ EA  MLERLP +GVYLNKASYRIVLN LC+EGEL+KA +L+ LML RG LPHF T
Sbjct: 390  EGRFEEARGMLERLPYEGVYLNKASYRIVLNSLCREGELQKATQLVGLMLGRGVLPHFAT 449

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SNELLV LCEAG   DA   L GL+E+GFKPEP +W+++++L+  ERKLLPAFELLD L+
Sbjct: 450  SNELLVHLCEAGKVGDAVMALLGLLELGFKPEPNSWALLVELICRERKLLPAFELLDDLV 509

Query: 1081 MKD 1089
            +++
Sbjct: 510  IQE 512



 Score =  137 bits (344), Expect = 1e-29
 Identities = 85/310 (27%), Positives = 153/310 (49%), Gaps = 2/310 (0%)
 Frame = +1

Query: 157  IFNILVKHHCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEE 336
            IF  L+KH  K    E   E+   +       P+L   ST ++ L  + +++        
Sbjct: 130  IFLNLMKHFSKLSLHERVVEMFDAIRPIVREKPSLKAISTCLNLLVESNQVDLTRKFLLN 189

Query: 337  MVSKHQIVPDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCN-PNAFNYSSLMNGLCRD 513
                  + P+   +N+L+   C+ G  D A ++++ MKK+  + PN   YS+L+NGLC  
Sbjct: 190  SKKSLNLEPNTCIFNILVKHHCKNGDIDSAFEVVEEMKKSHVSYPNLITYSTLINGLCGS 249

Query: 514  GKFEEAKEIFIELKGVG-LTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVT 690
            G+ +EA E+F E+     + PD + Y  LIN  C   + D A+++++ MK+N C  +   
Sbjct: 250  GRLKEAIELFEEMVSKDQILPDALTYNALINGFCHGEKVDRALKIMEFMKKNGCNPNVFN 309

Query: 691  FNVILGGLCREYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILML 870
            ++ ++ G C+E R  EA  + + + + G+  +   Y  ++NF C+ G +++A+ELL  M 
Sbjct: 310  YSALMNGFCKEGRLEEAKEVFDEMKSLGLKPDTVGYTTLINFFCRAGRVDEAMELLKDMR 369

Query: 871  HRGFLPHFRTSNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLL 1050
                     T N +L  LC  G   +A  +L  L   G      ++ IV++ +  E +L 
Sbjct: 370  ENKCRADTVTFNVILGGLCREGRFEEARGMLERLPYEGVYLNKASYRIVLNSLCREGELQ 429

Query: 1051 PAFELLDKLL 1080
             A +L+  +L
Sbjct: 430  KATQLVGLML 439


>ref|XP_006433856.1| hypothetical protein CICLE_v10000867mg [Citrus clementina]
            gi|567882597|ref|XP_006433857.1| hypothetical protein
            CICLE_v10000867mg [Citrus clementina]
            gi|557535978|gb|ESR47096.1| hypothetical protein
            CICLE_v10000867mg [Citrus clementina]
            gi|557535979|gb|ESR47097.1| hypothetical protein
            CICLE_v10000867mg [Citrus clementina]
          Length = 521

 Score =  532 bits (1371), Expect = e-149
 Identities = 257/363 (70%), Positives = 307/363 (84%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF+ I PI R  PSLKAISTCLN+L+++NQ+DLA+ FL  + +   L+PNTCIFNIL+KH
Sbjct: 152  MFHKIHPITREKPSLKAISTCLNLLIESNQVDLAQNFLKYSNQHLRLKPNTCIFNILIKH 211

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HCK+G LESAFEV+KEM+KS+MS PNLITYSTLIDGLC NGR  EAI LFEEMVSK QI+
Sbjct: 212  HCKRGTLESAFEVLKEMKKSQMSYPNLITYSTLIDGLCKNGRFREAIELFEEMVSKDQIL 271

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALTYN+LI+GFCRGGK DRA+KIM+FMK NGCNPN FNY++LMNG C++GK +EAKE+
Sbjct: 272  PDALTYNVLIDGFCRGGKVDRAKKIMEFMKNNGCNPNVFNYTTLMNGFCKEGKLQEAKEV 331

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K   L PDT+ YTTLINC CRAGR DEA+ELLKEMKE  C+AD VTFN+ILGGLCR
Sbjct: 332  FDEMKNFLLKPDTIGYTTLINCFCRAGRVDEALELLKEMKERGCKADIVTFNIILGGLCR 391

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E +  EAL MLE+L  DG+YLNKASYRIVLNF C++GELEKA+ELL LML RGFLPH+ T
Sbjct: 392  EGKIEEALGMLEKLWYDGIYLNKASYRIVLNFSCQKGELEKAIELLRLMLCRGFLPHYAT 451

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SNELLV LC+AG A DAA  LFGLVEMGFKPE  +W+++++L+   RKLL AFELLD+L+
Sbjct: 452  SNELLVRLCKAGMAEDAAIALFGLVEMGFKPESDSWALLVELICRGRKLLFAFELLDELV 511

Query: 1081 MKD 1089
            +K+
Sbjct: 512  IKE 514


>gb|EXC16677.1| hypothetical protein L484_007723 [Morus notabilis]
          Length = 513

 Score =  529 bits (1363), Expect = e-148
 Identities = 259/360 (71%), Positives = 307/360 (85%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF+ IR IAR  PSLKAISTCLN+LV+AN+IDLAR FL+ +RK+  L+PNTCIFNILVKH
Sbjct: 153  MFHAIRSIAREKPSLKAISTCLNLLVEANRIDLARQFLMHSRKNLSLKPNTCIFNILVKH 212

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HC+ GDLESAFEVVKEM+K+++S PNLITYSTLIDGLC +GRL+ AI LFEEM+SK QI+
Sbjct: 213  HCRNGDLESAFEVVKEMKKAKISYPNLITYSTLIDGLCVSGRLKGAIELFEEMISKDQIL 272

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALT+N+LINGFCR GK DRARKIM+FMK NGC+PN FNYS+L+NG  + G+FEEA+EI
Sbjct: 273  PDALTFNVLINGFCRDGKVDRARKIMEFMKSNGCSPNVFNYSALINGFFKVGRFEEAEEI 332

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K  G  PD V YTT+INC CR GRTDEA+ELLKEMK  +CRAD VTFNVI GGLCR
Sbjct: 333  FYEMKSFGPKPDKVGYTTIINCFCRTGRTDEAMELLKEMKGGECRADVVTFNVIFGGLCR 392

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R  EAL MLERLP +G++LNKASYRIVLNFLC++GEL+KA  LL LML RGF+PHF T
Sbjct: 393  EGRLEEALRMLERLPYEGMHLNKASYRIVLNFLCQKGELKKATSLLDLMLGRGFVPHFAT 452

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SNELLV LC AG A+DAA  LFGL+EMGFKPEP +W+I++DL+  ERKLL +F+LLD+L+
Sbjct: 453  SNELLVRLCNAGMADDAAMALFGLLEMGFKPEPDSWAILVDLISRERKLLSSFQLLDELI 512


>ref|XP_006472504.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like
            isoform X1 [Citrus sinensis]
            gi|568836969|ref|XP_006472505.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At5g18475-like isoform X2 [Citrus sinensis]
          Length = 521

 Score =  529 bits (1362), Expect = e-148
 Identities = 255/363 (70%), Positives = 305/363 (84%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF+ I PI R  PSLKAISTCLN+L+++NQ+DLA+ FL  + +   L+PNTCIFNIL+KH
Sbjct: 152  MFHKIHPITREKPSLKAISTCLNLLIESNQVDLAQNFLKYSNRHLRLKPNTCIFNILIKH 211

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HCK+G LESAFEV+KEM+KS+MS PNLITYSTLIDGLC NGR  EAI LFEEMVSK QI+
Sbjct: 212  HCKRGTLESAFEVLKEMKKSQMSYPNLITYSTLIDGLCKNGRFREAIELFEEMVSKDQIL 271

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALTYN+LI+GFC GGK DRA+KIM+FMK NGCNPN FNY++LMNG C++GK +EAKE+
Sbjct: 272  PDALTYNVLIDGFCHGGKVDRAKKIMEFMKNNGCNPNVFNYTTLMNGFCKEGKLQEAKEV 331

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K   L PDT+ YTTLINC CRAG  DEA+ELLKEMKE  C+AD VTFN+ILGGLCR
Sbjct: 332  FDEMKNFHLKPDTIGYTTLINCFCRAGGVDEALELLKEMKERGCKADIVTFNIILGGLCR 391

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R  EAL MLE+L  DG+YLNKASYRIVLNFLC++GELEKA+ELL LML RGFLPH+ T
Sbjct: 392  EGRIEEALGMLEKLWYDGIYLNKASYRIVLNFLCQKGELEKAIELLRLMLCRGFLPHYAT 451

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SNELLV LC+AG A DAA  LFGLVEMGFKPE  +W+++++++   RKLL AF LLD+L+
Sbjct: 452  SNELLVRLCKAGMAEDAAIALFGLVEMGFKPESDSWALLVEMICRGRKLLFAFVLLDELV 511

Query: 1081 MKD 1089
            +K+
Sbjct: 512  IKE 514



 Score = 95.1 bits (235), Expect = 4e-17
 Identities = 57/219 (26%), Positives = 116/219 (52%), Gaps = 1/219 (0%)
 Frame = +1

Query: 34  NPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKHHCKKGDLESAF 213
           NP++   +T +N      ++  A+  +    K+FHL+P+T  +  L+   C+ G ++ A 
Sbjct: 306 NPNVFNYTTLMNGFCKEGKLQEAKE-VFDEMKNFHLKPDTIGYTTLINCFCRAGGVDEAL 364

Query: 214 EVVKEMEKSEMSCP-NLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIVPDALTYNLLI 390
           E++KEM+  E  C  +++T++ ++ GLC  GR+EEA+ + E++     I  +  +Y +++
Sbjct: 365 ELLKEMK--ERGCKADIVTFNIILGGLCREGRIEEALGMLEKLWYDG-IYLNKASYRIVL 421

Query: 391 NGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEIFIELKGVGLT 570
           N  C+ G+ ++A +++  M   G  P+    + L+  LC+ G  E+A      L  +G  
Sbjct: 422 NFLCQKGELEKAIELLRLMLCRGFLPHYATSNELLVRLCKAGMAEDAAIALFGLVEMGFK 481

Query: 571 PDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEV 687
           P++  +  L+  +CR  +   A  LL E+   +   ++V
Sbjct: 482 PESDSWALLVEMICRGRKLLFAFVLLDELVIKESGTNQV 520


>gb|EOY33044.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao] gi|508785789|gb|EOY33045.1| Pentatricopeptide
            repeat superfamily protein isoform 1 [Theobroma cacao]
          Length = 530

 Score =  520 bits (1340), Expect = e-145
 Identities = 246/363 (67%), Positives = 305/363 (84%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MFY I+PI R  PSLKAISTCLN+L+++NQ+DLAR FLL+++K   L+PNTCIFNILVKH
Sbjct: 152  MFYAIQPIVREKPSLKAISTCLNLLIESNQVDLARHFLLNSKKSLRLRPNTCIFNILVKH 211

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HCK GDLESAFEVVKEM+KS +S PNLITYSTL+ GLC +GRL+EAI LFEEMV+K QI+
Sbjct: 212  HCKNGDLESAFEVVKEMKKSRVSYPNLITYSTLMGGLCESGRLKEAIELFEEMVAKDQIL 271

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PD LTYN+LINGFC  GK DRARKIM+FMK NGCNPN FNYS+L+NG C++G+++EAKE+
Sbjct: 272  PDVLTYNILINGFCCRGKVDRARKIMEFMKNNGCNPNLFNYSTLINGFCKEGRWQEAKEV 331

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F+E++ +GL PDT+ YTTLINCLCRA + +EA+ELLKEMKE +C+AD VT NV+LGGLCR
Sbjct: 332  FVEMESIGLKPDTIGYTTLINCLCRAAQIEEAMELLKEMKEKECQADVVTLNVLLGGLCR 391

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R+ +AL MLE+LP +GVYLNKASYRIVLN LC++ E+EKA +L+ LML RGF+PH+ T
Sbjct: 392  EGRFQDALQMLEKLPYEGVYLNKASYRIVLNSLCQKDEMEKAAKLVGLMLDRGFVPHYAT 451

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SN+LL+ LC+AG  +DA T L GL E GFKPEP  W  + +L   ERKLL  FELLD+L+
Sbjct: 452  SNDLLIRLCKAGMVDDAVTALVGLAETGFKPEPHCWEFLTELNCKERKLLSVFELLDELV 511

Query: 1081 MKD 1089
            +K+
Sbjct: 512  IKE 514


>ref|XP_002532248.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223528066|gb|EEF30142.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 521

 Score =  506 bits (1302), Expect = e-141
 Identities = 249/363 (68%), Positives = 293/363 (80%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MFY I+PI R  PSLKAISTCLNILV++ QIDLA+  LL   +   ++PNTCIFNILVKH
Sbjct: 149  MFYAIQPIVREKPSLKAISTCLNILVESKQIDLAQKCLLYVNEHLKVRPNTCIFNILVKH 208

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HCK GDLESA EV+ EM+KS  S PN+ITYSTLIDGLCGNGRL+EAI LFEEMVSK QI+
Sbjct: 209  HCKSGDLESALEVMHEMKKSRRSYPNVITYSTLIDGLCGNGRLKEAIELFEEMVSKDQIL 268

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALTY++LI GFC GGK DRARKIM+FM+ NGC+PN FNYS LMNG C++G+ EEAKE+
Sbjct: 269  PDALTYSVLIKGFCHGGKADRARKIMEFMRSNGCDPNVFNYSVLMNGFCKEGRLEEAKEV 328

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K  GL PDTV YTTLINC C  GR DEA+ELLKEM E  C+AD VTFNV+L GLCR
Sbjct: 329  FDEMKSSGLKPDTVGYTTLINCFCGVGRIDEAMELLKEMTEMKCKADAVTFNVLLKGLCR 388

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R+ EAL MLE L  +GVYLNK SYRIVLNFLC++GELEK+  LL LML RGF+PH+ T
Sbjct: 389  EGRFDEALRMLENLAYEGVYLNKGSYRIVLNFLCQKGELEKSCALLGLMLSRGFVPHYAT 448

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SNELLV LCEAG  ++A T LFGL +MGF PEP +W+ +I+ +  ERKLL  FEL+D+L+
Sbjct: 449  SNELLVCLCEAGMVDNAVTALFGLTQMGFTPEPKSWAHLIEYICRERKLLFVFELVDELV 508

Query: 1081 MKD 1089
             K+
Sbjct: 509  EKE 511



 Score =  132 bits (333), Expect = 2e-28
 Identities = 84/312 (26%), Positives = 156/312 (50%), Gaps = 2/312 (0%)
 Frame = +1

Query: 157  IFNILVKHHCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEE 336
            IF  L+KH  K    E   E+   ++      P+L   ST ++ L  + +++ A      
Sbjct: 129  IFLNLMKHFYKSSLHERVLEMFYAIQPIVREKPSLKAISTCLNILVESKQIDLAQKCLLY 188

Query: 337  MVSKHQIVPDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCN-PNAFNYSSLMNGLCRD 513
            +    ++ P+   +N+L+   C+ G  + A ++M  MKK+  + PN   YS+L++GLC +
Sbjct: 189  VNEHLKVRPNTCIFNILVKHHCKSGDLESALEVMHEMKKSRRSYPNVITYSTLIDGLCGN 248

Query: 514  GKFEEAKEIFIELKGVG-LTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVT 690
            G+ +EA E+F E+     + PD + Y+ LI   C  G+ D A ++++ M+ N C  +   
Sbjct: 249  GRLKEAIELFEEMVSKDQILPDALTYSVLIKGFCHGGKADRARKIMEFMRSNGCDPNVFN 308

Query: 691  FNVILGGLCREYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILML 870
            ++V++ G C+E R  EA  + + + + G+  +   Y  ++N  C  G +++A+ELL  M 
Sbjct: 309  YSVLMNGFCKEGRLEEAKEVFDEMKSSGLKPDTVGYTTLINCFCGVGRIDEAMELLKEMT 368

Query: 871  HRGFLPHFRTSNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLL 1050
                     T N LL  LC  G  ++A  +L  L   G      ++ IV++ +  + +L 
Sbjct: 369  EMKCKADAVTFNVLLKGLCREGRFDEALRMLENLAYEGVYLNKGSYRIVLNFLCQKGELE 428

Query: 1051 PAFELLDKLLMK 1086
             +  LL  +L +
Sbjct: 429  KSCALLGLMLSR 440


>ref|XP_004237977.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like
            [Solanum lycopersicum]
          Length = 511

 Score =  505 bits (1301), Expect = e-140
 Identities = 245/362 (67%), Positives = 299/362 (82%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF  I PI R  PSL AISTCLN+LV+A QI+LA+ FLL+ +K  +L+PNTCIFNILVK+
Sbjct: 150  MFDAILPIVREKPSLNAISTCLNLLVEAKQIELAKEFLLNVQKHLYLKPNTCIFNILVKY 209

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HCKKGD+++AF VV+EM KS +S PNLITYSTL+DGLC  GRL++A++LFE+M++K QI 
Sbjct: 210  HCKKGDVDAAFVVVEEMRKSRVSHPNLITYSTLMDGLCRCGRLQDALDLFEKMLAKDQIP 269

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALTYN+LIN FCR GK DRAR I+ FM+KNGC PN  NY++LMNG C++G+ E+AKE+
Sbjct: 270  PDALTYNILINAFCRAGKVDRARNIIGFMRKNGCQPNIVNYTALMNGFCKEGRVEDAKEV 329

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+KGVGL PD V YTTLIN  CRAG+ DE IELL EMK+  C+AD+VT  +ILGGLCR
Sbjct: 330  FHEMKGVGLKPDVVGYTTLINSFCRAGKVDEGIELLDEMKDKGCKADDVTIKIILGGLCR 389

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
              R  EA NMLERLP DGV+L+K SYRIVLNFLCKEGEL KA++LL LML R F+PHF T
Sbjct: 390  ASRSSEAFNMLERLPYDGVHLSKESYRIVLNFLCKEGELVKAMDLLGLMLARRFVPHFAT 449

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SNEL+V LCEAG A DAA  LFGL+EMGFKPEP TWS++ID++  ERKLLPAF+LLD+L+
Sbjct: 450  SNELIVQLCEAGKAADAALALFGLLEMGFKPEPQTWSMLIDVICRERKLLPAFQLLDELV 509

Query: 1081 MK 1086
            ++
Sbjct: 510  LQ 511


>ref|XP_006338085.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like
            [Solanum tuberosum]
          Length = 511

 Score =  499 bits (1284), Expect = e-138
 Identities = 241/361 (66%), Positives = 297/361 (82%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF  I PI R  PSL AISTCLN+L++A QI+LA+ FLL+ +K   L+PNTCIFNILVK+
Sbjct: 150  MFNAILPIVREKPSLNAISTCLNLLIEAKQIELAKEFLLNVQKHLDLKPNTCIFNILVKY 209

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HC+KGD+E+AF VV+EM KS +S PNLITYSTL+DGLC  GRL++A++LFE+M++K QI 
Sbjct: 210  HCRKGDVEAAFVVVEEMRKSRVSHPNLITYSTLMDGLCRCGRLQDALDLFEKMLAKDQIP 269

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALTYN+LIN FCR GK DRAR I+ FM+KNGC PN  NY++LMNG C++G+  +AKE+
Sbjct: 270  PDALTYNILINAFCRAGKVDRARNIIGFMRKNGCQPNIVNYTALMNGFCKEGRVGDAKEV 329

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+KGVGL PD V YTTLIN  CRAG+ D+ IELL+EMK+  C+AD+VT  +ILGGLCR
Sbjct: 330  FHEMKGVGLKPDVVGYTTLINSFCRAGKVDKGIELLEEMKDKGCKADDVTIKIILGGLCR 389

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
              R  EA +MLERLP DGV+L+K SYRIVLNFLCKEGELEKA++LL LML R F+PHF T
Sbjct: 390  ASRSSEAFDMLERLPYDGVHLSKESYRIVLNFLCKEGELEKAMDLLGLMLARRFVPHFAT 449

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SNEL+V LCEAG A DAA  LFGL+EM FKPEP TWS++ID++  ERKLLPAF+LLD+L+
Sbjct: 450  SNELIVQLCEAGKAADAALALFGLLEMSFKPEPRTWSMLIDVICRERKLLPAFQLLDELV 509

Query: 1081 M 1083
            +
Sbjct: 510  L 510



 Score =  139 bits (351), Expect = 2e-30
 Identities = 85/312 (27%), Positives = 157/312 (50%), Gaps = 2/312 (0%)
 Frame = +1

Query: 157  IFNILVKHHCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEE 336
            +F  L+KH+ K    E   E+   +       P+L   ST ++ L    ++E A      
Sbjct: 130  VFTNLMKHYSKSSLHEKVLEMFNAILPIVREKPSLNAISTCLNLLIEAKQIELAKEFLLN 189

Query: 337  MVSKHQIVPDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCN-PNAFNYSSLMNGLCRD 513
            +     + P+   +N+L+   CR G  + A  +++ M+K+  + PN   YS+LM+GLCR 
Sbjct: 190  VQKHLDLKPNTCIFNILVKYHCRKGDVEAAFVVVEEMRKSRVSHPNLITYSTLMDGLCRC 249

Query: 514  GKFEEAKEIFIELKGVG-LTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVT 690
            G+ ++A ++F ++     + PD + Y  LIN  CRAG+ D A  ++  M++N C+ + V 
Sbjct: 250  GRLQDALDLFEKMLAKDQIPPDALTYNILINAFCRAGKVDRARNIIGFMRKNGCQPNIVN 309

Query: 691  FNVILGGLCREYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILML 870
            +  ++ G C+E R  +A  +   +   G+  +   Y  ++N  C+ G+++K +ELL  M 
Sbjct: 310  YTALMNGFCKEGRVGDAKEVFHEMKGVGLKPDVVGYTTLINSFCRAGKVDKGIELLEEMK 369

Query: 871  HRGFLPHFRTSNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLL 1050
             +G      T   +L  LC A  +++A  +L  L   G      ++ IV++ +  E +L 
Sbjct: 370  DKGCKADDVTIKIILGGLCRASRSSEAFDMLERLPYDGVHLSKESYRIVLNFLCKEGELE 429

Query: 1051 PAFELLDKLLMK 1086
             A +LL  +L +
Sbjct: 430  KAMDLLGLMLAR 441


>ref|XP_004136259.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like
            [Cucumis sativus] gi|449497032|ref|XP_004160294.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At5g18475-like [Cucumis sativus]
          Length = 504

 Score =  496 bits (1277), Expect = e-138
 Identities = 235/363 (64%), Positives = 294/363 (80%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MFY I+ I R  PSLKAISTCLN+LV+++++DLAR  L++ R   +L+PNTCIFNILVKH
Sbjct: 140  MFYAIKSIVREKPSLKAISTCLNLLVESDRVDLARKLLVNARSKLNLRPNTCIFNILVKH 199

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HC+ GDL++AFEVVKEM+ + +S PNL+TYSTLI GLC NG+L+EAI  FEEMVSK  I+
Sbjct: 200  HCRNGDLQAAFEVVKEMKSARVSYPNLVTYSTLIGGLCENGKLKEAIEFFEEMVSKDNIL 259

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALTYN+LINGFC+ GK DRAR I++FMK NGC+PN FNYS LMNG C++G+ +EAKE+
Sbjct: 260  PDALTYNILINGFCQRGKVDRARTILEFMKSNGCSPNVFNYSVLMNGYCKEGRLQEAKEV 319

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K +G+ PDT+ YTTLINCLCR GR DEA ELL++MK+ DCRAD VTFNV+LGGLCR
Sbjct: 320  FNEIKSLGMKPDTISYTTLINCLCRTGRVDEATELLQQMKDKDCRADTVTFNVMLGGLCR 379

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R+ EAL+M+++LP +G YLNK SYRIVLNFL ++GEL KA ELL LML+RGF+PH  T
Sbjct: 380  EGRFDEALDMVQKLPFEGFYLNKGSYRIVLNFLTQKGELRKATELLGLMLNRGFVPHHAT 439

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SN LL+ LC  G   DA   L GL+EMGFKPE  +W  ++DL+  ERK+LP FELLD L+
Sbjct: 440  SNTLLLLLCNNGMVKDAVESLLGLLEMGFKPEHESWFTLVDLICRERKMLPVFELLDVLV 499

Query: 1081 MKD 1089
             ++
Sbjct: 500  TQE 502



 Score =  143 bits (361), Expect = 1e-31
 Identities = 93/310 (30%), Positives = 155/310 (50%), Gaps = 2/310 (0%)
 Frame = +1

Query: 157  IFNILVKHHCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEE 336
            IF  L+KH  K    E   ++   ++      P+L   ST ++ L  + R++ A  L   
Sbjct: 120  IFLNLMKHFSKSSMHERVLDMFYAIKSIVREKPSLKAISTCLNLLVESDRVDLARKLLVN 179

Query: 337  MVSKHQIVPDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCN-PNAFNYSSLMNGLCRD 513
              SK  + P+   +N+L+   CR G    A +++  MK    + PN   YS+L+ GLC +
Sbjct: 180  ARSKLNLRPNTCIFNILVKHHCRNGDLQAAFEVVKEMKSARVSYPNLVTYSTLIGGLCEN 239

Query: 514  GKFEEAKEIFIELKGV-GLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVT 690
            GK +EA E F E+     + PD + Y  LIN  C+ G+ D A  +L+ MK N C  +   
Sbjct: 240  GKLKEAIEFFEEMVSKDNILPDALTYNILINGFCQRGKVDRARTILEFMKSNGCSPNVFN 299

Query: 691  FNVILGGLCREYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILML 870
            ++V++ G C+E R  EA  +   + + G+  +  SY  ++N LC+ G +++A ELL  M 
Sbjct: 300  YSVLMNGYCKEGRLQEAKEVFNEIKSLGMKPDTISYTTLINCLCRTGRVDEATELLQQMK 359

Query: 871  HRGFLPHFRTSNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLL 1050
             +       T N +L  LC  G  ++A  ++  L   GF     ++ IV++ +  + +L 
Sbjct: 360  DKDCRADTVTFNVMLGGLCREGRFDEALDMVQKLPFEGFYLNKGSYRIVLNFLTQKGELR 419

Query: 1051 PAFELLDKLL 1080
             A ELL  +L
Sbjct: 420  KATELLGLML 429



 Score =  129 bits (324), Expect = 2e-27
 Identities = 81/283 (28%), Positives = 141/283 (49%)
 Frame = +1

Query: 37   PSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKHHCKKGDLESAFE 216
            P+L   ST +  L +  ++  A  F        ++ P+   +NIL+   C++G ++ A  
Sbjct: 224  PNLVTYSTLIGGLCENGKLKEAIEFFEEMVSKDNILPDALTYNILINGFCQRGKVDRA-R 282

Query: 217  VVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIVPDALTYNLLING 396
             + E  KS    PN+  YS L++G C  GRL+EA  +F E+ S   + PD ++Y  LIN 
Sbjct: 283  TILEFMKSNGCSPNVFNYSVLMNGYCKEGRLQEAKEVFNEIKSLG-MKPDTISYTTLINC 341

Query: 397  FCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEIFIELKGVGLTPD 576
             CR G+ D A +++  MK   C  +   ++ ++ GLCR+G+F+EA ++  +L   G   +
Sbjct: 342  LCRTGRVDEATELLQQMKDKDCRADTVTFNVMLGGLCREGRFDEALDMVQKLPFEGFYLN 401

Query: 577  TVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCREYRYYEALNMLE 756
               Y  ++N L + G   +A ELL  M          T N +L  LC      +A+  L 
Sbjct: 402  KGSYRIVLNFLTQKGELRKATELLGLMLNRGFVPHHATSNTLLLLLCNNGMVKDAVESLL 461

Query: 757  RLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFL 885
             L   G      S+  +++ +C+E ++    ELL +++ + +L
Sbjct: 462  GLLEMGFKPEHESWFTLVDLICRERKMLPVFELLDVLVTQEYL 504


>ref|XP_004301354.1| PREDICTED: LOW QUALITY PROTEIN: pentatricopeptide repeat-containing
            protein At5g18475-like [Fragaria vesca subsp. vesca]
          Length = 568

 Score =  494 bits (1272), Expect = e-137
 Identities = 236/363 (65%), Positives = 297/363 (81%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF+ I+PI R  PSLK ISTCLN+L++ANQ+D+A+ FL+  +K  +L+ NTCI NILVKH
Sbjct: 204  MFHAIQPIVREKPSLKCISTCLNLLIEANQVDMAQQFLMHLKKSLNLKLNTCIANILVKH 263

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            +CK GDLESAFEVVK+M+KS++S PNLITYSTLIDGLC +G+L EA+++F+EM+SK QI+
Sbjct: 264  YCKNGDLESAFEVVKKMKKSKLSYPNLITYSTLIDGLCQSGKLTEAMDMFDEMISKEQIL 323

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PD LTYN+L+ GFCR GK DRARKI+DFMK  GCNPN +NYS+LMNG C++ + +EA+E+
Sbjct: 324  PDVLTYNILMKGFCRAGKVDRARKILDFMKSKGCNPNIYNYSTLMNGFCKEVRLKEAQEL 383

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
              E+K  G+ PDTV+YTTLI+C CR GR DEAIELLKEMKE  C+AD VTFNVILGGLCR
Sbjct: 384  LDEMKSFGIKPDTVVYTTLIDCHCRTGRVDEAIELLKEMKERRCKADTVTFNVILGGLCR 443

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R  +AL ML+ LP +G+YLNK SYRIVLN L ++G+L KA ELL LM+ RGF+PH+ T
Sbjct: 444  ECRIEDALKMLDELPYEGIYLNKGSYRIVLNSLYQKGDLNKAKELLRLMMGRGFVPHYAT 503

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SN LLVSLCEAG  +DA T LFGLVEMGFKP   +W+  ++ +  ERKLLPAFELLD+L+
Sbjct: 504  SNGLLVSLCEAGMIDDATTALFGLVEMGFKPLLDSWAXFVESICRERKLLPAFELLDELV 563

Query: 1081 MKD 1089
             ++
Sbjct: 564  NEE 566



 Score =  130 bits (326), Expect = 1e-27
 Identities = 90/340 (26%), Positives = 165/340 (48%), Gaps = 7/340 (2%)
 Frame = +1

Query: 85   NQIDLARTFLLSTRKDFHLQPNTC-----IFNILVKHHCKKGDLESAFEVVKEMEKSEMS 249
            N++  ++ F       + ++ +TC     IF  L+KH  K    E   E+   ++     
Sbjct: 155  NKLSQSKKFKAVDAVLYQMKYDTCKFHEGIFLNLMKHFSKFSMHERVLEMFHAIQPIVRE 214

Query: 250  CPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIVPDALTYNLLINGFCRGGKTDRAR 429
             P+L   ST ++ L    +++ A      +     +  +    N+L+  +C+ G  + A 
Sbjct: 215  KPSLKCISTCLNLLIEANQVDMAQQFLMHLKKSLNLKLNTCIANILVKHYCKNGDLESAF 274

Query: 430  KIMDFMKKNGCN-PNAFNYSSLMNGLCRDGKFEEAKEIFIE-LKGVGLTPDTVIYTTLIN 603
            +++  MKK+  + PN   YS+L++GLC+ GK  EA ++F E +    + PD + Y  L+ 
Sbjct: 275  EVVKKMKKSKLSYPNLITYSTLIDGLCQSGKLTEAMDMFDEMISKEQILPDVLTYNILMK 334

Query: 604  CLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCREYRYYEALNMLERLPNDGVYL 783
              CRAG+ D A ++L  MK   C  +   ++ ++ G C+E R  EA  +L+ + + G+  
Sbjct: 335  GFCRAGKVDRARKILDFMKSKGCNPNIYNYSTLMNGFCKEVRLKEAQELLDEMKSFGIKP 394

Query: 784  NKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRTSNELLVSLCEAGNANDAATLL 963
            +   Y  +++  C+ G +++A+ELL  M  R       T N +L  LC      DA  +L
Sbjct: 395  DTVVYTTLIDCHCRTGRVDEAIELLKEMKERRCKADTVTFNVILGGLCRECRIEDALKML 454

Query: 964  FGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLLM 1083
              L   G      ++ IV++ ++ +  L  A ELL +L+M
Sbjct: 455  DELPYEGIYLNKGSYRIVLNSLYQKGDLNKAKELL-RLMM 493


>ref|XP_003527867.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like
            [Glycine max]
          Length = 546

 Score =  481 bits (1238), Expect = e-133
 Identities = 231/360 (64%), Positives = 290/360 (80%)
 Frame = +1

Query: 4    FYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKHH 183
            ++ I+PI R  PS KA+STCLN+L+D+N++DLAR  LL  ++D   +PN C+FNILVK+H
Sbjct: 184  YFSIQPIVREKPSPKALSTCLNLLLDSNRVDLARKLLLHAKRDLTRKPNVCVFNILVKYH 243

Query: 184  CKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIVP 363
            CK GDL+SAFE+V+EM  SE S PNL+TYSTL+DGLC NGR++EA +LFEEMVS+  IVP
Sbjct: 244  CKNGDLDSAFEIVEEMRNSEFSYPNLVTYSTLMDGLCRNGRVKEAFDLFEEMVSRDHIVP 303

Query: 364  DALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEIF 543
            D LTYN+LINGFCRGGK DRAR ++ FMK NGC PN +NYS+L++GLC+ GK E+AK + 
Sbjct: 304  DPLTYNVLINGFCRGGKPDRARNVIQFMKSNGCYPNVYNYSALVDGLCKVGKLEDAKGVL 363

Query: 544  IELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCRE 723
             E+KG GL PD V YT+LIN LCR G++DEAIELL+EMKEN C+AD VTFNV+LGGLCRE
Sbjct: 364  AEIKGSGLKPDAVTYTSLINFLCRNGKSDEAIELLEEMKENGCQADSVTFNVLLGGLCRE 423

Query: 724  YRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRTS 903
             ++ EAL+M+E+LP  GVYLNK SYRIVLN L ++ EL++A ELL LML RGF PH+ TS
Sbjct: 424  GKFEEALDMVEKLPQQGVYLNKGSYRIVLNSLTQKCELKRAKELLGLMLRRGFQPHYATS 483

Query: 904  NELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLLM 1083
            NELLV LC+AG  +DAA  LF LVEMGF+P   TW ++I L+  ERKLL  FELLD+L++
Sbjct: 484  NELLVCLCKAGMVDDAAVALFDLVEMGFQPGLETWEVLIGLICRERKLLYVFELLDELVV 543



 Score =  138 bits (347), Expect = 4e-30
 Identities = 89/312 (28%), Positives = 156/312 (50%), Gaps = 2/312 (0%)
 Frame = +1

Query: 157  IFNILVKHHCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEE 336
            IF  L+KH  K    E        ++      P+    ST ++ L  + R++ A  L   
Sbjct: 163  IFVNLMKHFSKSSLHEKLLHAYFSIQPIVREKPSPKALSTCLNLLLDSNRVDLARKLLLH 222

Query: 337  MVSKHQIVPDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCN-PNAFNYSSLMNGLCRD 513
                    P+   +N+L+   C+ G  D A +I++ M+ +  + PN   YS+LM+GLCR+
Sbjct: 223  AKRDLTRKPNVCVFNILVKYHCKNGDLDSAFEIVEEMRNSEFSYPNLVTYSTLMDGLCRN 282

Query: 514  GKFEEAKEIFIELKGVG-LTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVT 690
            G+ +EA ++F E+     + PD + Y  LIN  CR G+ D A  +++ MK N C  +   
Sbjct: 283  GRVKEAFDLFEEMVSRDHIVPDPLTYNVLINGFCRGGKPDRARNVIQFMKSNGCYPNVYN 342

Query: 691  FNVILGGLCREYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILML 870
            ++ ++ GLC+  +  +A  +L  +   G+  +  +Y  ++NFLC+ G+ ++A+ELL  M 
Sbjct: 343  YSALVDGLCKVGKLEDAKGVLAEIKGSGLKPDAVTYTSLINFLCRNGKSDEAIELLEEMK 402

Query: 871  HRGFLPHFRTSNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLL 1050
              G      T N LL  LC  G   +A  ++  L + G      ++ IV++ +  + +L 
Sbjct: 403  ENGCQADSVTFNVLLGGLCREGKFEEALDMVEKLPQQGVYLNKGSYRIVLNSLTQKCELK 462

Query: 1051 PAFELLDKLLMK 1086
             A ELL  +L +
Sbjct: 463  RAKELLGLMLRR 474


>ref|XP_004501623.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475-like
            isoform X1 [Cicer arietinum]
            gi|502133024|ref|XP_004501624.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At5g18475-like isoform X2 [Cicer arietinum]
          Length = 510

 Score =  480 bits (1235), Expect = e-133
 Identities = 233/359 (64%), Positives = 287/359 (79%)
 Frame = +1

Query: 4    FYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKHH 183
            F+ I+PI R  PS KAISTCLN+LVD+NQ+DLAR  LL  ++    +PN CIFNILVK+H
Sbjct: 148  FFSIQPIVREKPSPKAISTCLNLLVDSNQVDLARQLLLHAKRSLIYKPNVCIFNILVKYH 207

Query: 184  CKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIVP 363
            C+ GD+ESAFEVV+EM KS+ S PN+ITYST++DGLC NGRL+EA  LFEEMVSK +IVP
Sbjct: 208  CRNGDIESAFEVVEEMRKSKYSYPNVITYSTMMDGLCRNGRLKEAFELFEEMVSKDRIVP 267

Query: 364  DALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEIF 543
            D LTYN+LINGFCRGGK DRAR +++FMK NGC PN FNYS+L++GLC+ GK ++AK +F
Sbjct: 268  DPLTYNVLINGFCRGGKPDRARNVIEFMKSNGCCPNVFNYSALVDGLCKVGKLQDAKGVF 327

Query: 544  IELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCRE 723
             E+K  GL PDTV YT+LIN  CR  + DEAIELLKEMKEN+C+AD V FNVILGG+CRE
Sbjct: 328  AEMKSSGLKPDTVTYTSLINFFCRNRKIDEAIELLKEMKENECQADTVAFNVILGGMCRE 387

Query: 724  YRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRTS 903
             R+ EAL+M+E+LP  GVYLNK SYRIVLN L ++ EL KA +LL LML RGFLPH+ TS
Sbjct: 388  GRFEEALDMIEKLPQQGVYLNKGSYRIVLNSLTQKCELRKAKKLLELMLSRGFLPHYATS 447

Query: 904  NELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            NELL+S C+ G  +DAA  LF LVEMGF+P    W ++I+L+  +RKLL  FELLD+L+
Sbjct: 448  NELLISFCKEGMVDDAAAALFDLVEMGFQPPLDCWELLIELICRDRKLLYVFELLDELV 506


>ref|XP_002301082.1| hypothetical protein POPTR_0002s10380g [Populus trichocarpa]
            gi|222842808|gb|EEE80355.1| hypothetical protein
            POPTR_0002s10380g [Populus trichocarpa]
          Length = 509

 Score =  474 bits (1221), Expect = e-131
 Identities = 237/359 (66%), Positives = 285/359 (79%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF  I+PI R  PSLKAISTCLN+LV++ Q+DL R FLL   KD  L+PNTCIFNI +K+
Sbjct: 140  MFNKIQPIVREKPSLKAISTCLNLLVESKQVDLLRGFLLDLNKDHMLKPNTCIFNIFIKY 199

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HCK GDLESAF VVKEM+KS +S PNLITYSTL+DGLC +GRL+EAI LFEEMVSK QI+
Sbjct: 200  HCKSGDLESAFAVVKEMKKSSISYPNLITYSTLMDGLCESGRLKEAIELFEEMVSKDQIL 259

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALTYN+LINGF   GK DRA+KIM+FMK NGC+PN FNYS+LM+G C++G+ EEA + 
Sbjct: 260  PDALTYNVLINGFSCWGKVDRAKKIMEFMKSNGCSPNVFNYSALMSGFCKEGRLEEAMDA 319

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K  GL  DTV YT LIN  CR GR DEA+ LL+EMKE  C+AD VT NV+L G C 
Sbjct: 320  FEEMKIFGLKQDTVGYTILINYFCRFGRIDEAMALLEEMKETKCKADIVTVNVLLRGFCG 379

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R  EAL ML RL ++G+YLNKASYRIVLN LC++G+L+KA+ELL L L RGF+PH  T
Sbjct: 380  EGRTEEALGMLNRLSSEGIYLNKASYRIVLNSLCQKGDLDKALELLGLTLSRGFVPHHAT 439

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKL 1077
            SNELLV LC+AG A+DA   L+GL EMGFKPE  +W+++++ V  ERKLL AFELLD+L
Sbjct: 440  SNELLVGLCKAGMADDAVVALYGLAEMGFKPEQDSWALLVEFVCRERKLLLAFELLDEL 498



 Score = 95.1 bits (235), Expect = 4e-17
 Identities = 50/182 (27%), Positives = 97/182 (53%)
 Frame = +1

Query: 127 KDFHLQPNTCIFNILVKHHCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGR 306
           K F L+ +T  + IL+ + C+ G ++ A  +++EM++++    +++T + L+ G CG GR
Sbjct: 324 KIFGLKQDTVGYTILINYFCRFGRIDEAMALLEEMKETKCKA-DIVTVNVLLRGFCGEGR 382

Query: 307 LEEAINLFEEMVSKHQIVPDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYS 486
            EEA+ +   + S   I  +  +Y +++N  C+ G  D+A +++      G  P+    +
Sbjct: 383 TEEALGMLNRL-SSEGIYLNKASYRIVLNSLCQKGDLDKALELLGLTLSRGFVPHHATSN 441

Query: 487 SLMNGLCRDGKFEEAKEIFIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKEN 666
            L+ GLC+ G  ++A      L  +G  P+   +  L+  +CR  +   A ELL E+  N
Sbjct: 442 ELLVGLCKAGMADDAVVALYGLAEMGFKPEQDSWALLVEFVCRERKLLLAFELLDELTAN 501

Query: 667 DC 672
           +C
Sbjct: 502 EC 503



 Score = 73.9 bits (180), Expect = 1e-10
 Identities = 61/314 (19%), Positives = 131/314 (41%), Gaps = 39/314 (12%)
 Frame = +1

Query: 253  PNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIVPDALTYNLLINGFCRGGKTDRARK 432
            P  I++ T ++ +      + A+ +F  +V +     +  TY+ +I+   R  K      
Sbjct: 45   PKFISHETAVNLIKHERDPQHALEIFNLVVEQKGFNHNHATYSTIIDKLARAKKFQAVDA 104

Query: 433  IMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEIF----------------------- 543
            ++  M    C  +   + +LM    +  +FE   E+F                       
Sbjct: 105  LLRQMMYETCKFHESLFLNLMKYFAKSSEFERVVEMFNKIQPIVREKPSLKAISTCLNLL 164

Query: 544  IELKGVG--------------LTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCR-A 678
            +E K V               L P+T I+   I   C++G  + A  ++KEMK++     
Sbjct: 165  VESKQVDLLRGFLLDLNKDHMLKPNTCIFNIFIKYHCKSGDLESAFAVVKEMKKSSISYP 224

Query: 679  DEVTFNVILGGLCREYRYYEALNMLERL-PNDGVYLNKASYRIVLNFLCKEGELEKAVEL 855
            + +T++ ++ GLC   R  EA+ + E +   D +  +  +Y +++N     G++++A ++
Sbjct: 225  NLITYSTLMDGLCESGRLKEAIELFEEMVSKDQILPDALTYNVLINGFSCWGKVDRAKKI 284

Query: 856  LILMLHRGFLPHFRTSNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFT 1035
            +  M   G  P+    + L+   C+ G   +A      +   G K +   ++I+I+    
Sbjct: 285  MEFMKSNGCSPNVFNYSALMSGFCKEGRLEEAMDAFEEMKIFGLKQDTVGYTILINYFCR 344

Query: 1036 ERKLLPAFELLDKL 1077
              ++  A  LL+++
Sbjct: 345  FGRIDEAMALLEEM 358


>ref|XP_003602939.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355491987|gb|AES73190.1| Pentatricopeptide
            repeat-containing protein [Medicago truncatula]
          Length = 586

 Score =  473 bits (1216), Expect = e-131
 Identities = 232/359 (64%), Positives = 284/359 (79%)
 Frame = +1

Query: 4    FYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKHH 183
            F  I+ I R  PS KAIS+CLN+LVD+NQ+DL R  LL  ++    +PN CIFNILVK+H
Sbjct: 224  FLSIQTIVREKPSPKAISSCLNLLVDSNQVDLVRKLLLYAKRSLVYKPNVCIFNILVKYH 283

Query: 184  CKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIVP 363
            C++GD++SAFEVVKEM  S+ S PN+ITYSTL+DGLC NGRL+EA  LFEEMVSK QIVP
Sbjct: 284  CRRGDIDSAFEVVKEMRNSKYSYPNVITYSTLMDGLCRNGRLKEAFELFEEMVSKDQIVP 343

Query: 364  DALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEIF 543
            D LTYN+LINGFCR GK DRAR +++FMK NGC PN FNYS+L++GLC+ GK ++AK + 
Sbjct: 344  DPLTYNVLINGFCREGKADRARNVIEFMKNNGCCPNVFNYSALVDGLCKAGKLQDAKGVL 403

Query: 544  IELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCRE 723
             E+K  GL PD + YT+LIN   R G+ DEAIELL EMKENDC+AD VTFNVILGGLCRE
Sbjct: 404  AEMKSSGLKPDAITYTSLINFFSRNGQIDEAIELLTEMKENDCQADTVTFNVILGGLCRE 463

Query: 724  YRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRTS 903
             R+ EAL+M+E+LP  GVYLNK SYRIVLN L +  EL KA +LL LML RGF+PH+ TS
Sbjct: 464  GRFDEALDMIEKLPQQGVYLNKGSYRIVLNSLTQNCELRKANKLLGLMLSRGFVPHYATS 523

Query: 904  NELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            NELLV LC+ G ANDAAT LF LV+MGF+P+  +W ++IDL+  +RKLL  FELLD+L+
Sbjct: 524  NELLVRLCKEGMANDAATALFDLVDMGFQPQHDSWELLIDLICRDRKLLYVFELLDELV 582



 Score =  135 bits (341), Expect = 2e-29
 Identities = 85/314 (27%), Positives = 162/314 (51%), Gaps = 4/314 (1%)
 Frame = +1

Query: 157  IFNILVKHHCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEE 336
            +F  L+KH+ K G  E  F+    ++      P+    S+ ++ L  + +++    L   
Sbjct: 203  VFINLMKHYSKCGFHEKVFDAFLSIQTIVREKPSPKAISSCLNLLVDSNQVDLVRKLL-- 260

Query: 337  MVSKHQIV--PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCN-PNAFNYSSLMNGLC 507
            + +K  +V  P+   +N+L+   CR G  D A +++  M+ +  + PN   YS+LM+GLC
Sbjct: 261  LYAKRSLVYKPNVCIFNILVKYHCRRGDIDSAFEVVKEMRNSKYSYPNVITYSTLMDGLC 320

Query: 508  RDGKFEEAKEIFIELKGVG-LTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADE 684
            R+G+ +EA E+F E+     + PD + Y  LIN  CR G+ D A  +++ MK N C  + 
Sbjct: 321  RNGRLKEAFELFEEMVSKDQIVPDPLTYNVLINGFCREGKADRARNVIEFMKNNGCCPNV 380

Query: 685  VTFNVILGGLCREYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLIL 864
              ++ ++ GLC+  +  +A  +L  + + G+  +  +Y  ++NF  + G++++A+ELL  
Sbjct: 381  FNYSALVDGLCKAGKLQDAKGVLAEMKSSGLKPDAITYTSLINFFSRNGQIDEAIELLTE 440

Query: 865  MLHRGFLPHFRTSNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERK 1044
            M          T N +L  LC  G  ++A  ++  L + G      ++ IV++ +    +
Sbjct: 441  MKENDCQADTVTFNVILGGLCREGRFDEALDMIEKLPQQGVYLNKGSYRIVLNSLTQNCE 500

Query: 1045 LLPAFELLDKLLMK 1086
            L  A +LL  +L +
Sbjct: 501  LRKANKLLGLMLSR 514



 Score = 92.4 bits (228), Expect = 3e-16
 Identities = 72/310 (23%), Positives = 144/310 (46%), Gaps = 9/310 (2%)
 Frame = +1

Query: 187  KKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIVPD 366
            ++ D + A ++   + + +    N  TY+T++  L    + +    +       HQ+  +
Sbjct: 142  REKDPQHALKIFNMVSEQKGFNHNNATYATILQKLAQFKKFQAVDRVL------HQMTYE 195

Query: 367  ALTYN--LLINGFCRGGKTDRARKIMD-FMKKNGC---NPNAFNYSSLMNGLCRDGKFEE 528
            A  ++  + IN      K     K+ D F+         P+    SS +N L    + + 
Sbjct: 196  ACKFHEGVFINLMKHYSKCGFHEKVFDAFLSIQTIVREKPSPKAISSCLNLLVDSNQVDL 255

Query: 529  AKEIFIELK-GVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEV-TFNVI 702
             +++ +  K  +   P+  I+  L+   CR G  D A E++KEM+ +      V T++ +
Sbjct: 256  VRKLLLYAKRSLVYKPNVCIFNILVKYHCRRGDIDSAFEVVKEMRNSKYSYPNVITYSTL 315

Query: 703  LGGLCREYRYYEALNMLERLPN-DGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRG 879
            + GLCR  R  EA  + E + + D +  +  +Y +++N  C+EG+ ++A  ++  M + G
Sbjct: 316  MDGLCRNGRLKEAFELFEEMVSKDQIVPDPLTYNVLINGFCREGKADRARNVIEFMKNNG 375

Query: 880  FLPHFRTSNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAF 1059
              P+    + L+  LC+AG   DA  +L  +   G KP+  T++ +I+      ++  A 
Sbjct: 376  CCPNVFNYSALVDGLCKAGKLQDAKGVLAEMKSSGLKPDAITYTSLINFFSRNGQIDEAI 435

Query: 1060 ELLDKLLMKD 1089
            ELL ++   D
Sbjct: 436  ELLTEMKEND 445


>emb|CAN80524.1| hypothetical protein VITISV_030537 [Vitis vinifera]
          Length = 714

 Score =  473 bits (1216), Expect = e-131
 Identities = 238/363 (65%), Positives = 280/363 (77%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF  I PI R  PSLKAISTCLN+LV++NQ  +                           
Sbjct: 220  MFDAIXPIVREKPSLKAISTCLNLLVESNQSSIT-------------------------- 253

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
              K GD++SAFEVV+EM+KS +S PNLITYSTLI+GLCG+GRL+EAI LFEEMVSK QI+
Sbjct: 254  -AKNGDIDSAFEVVEEMKKSHVSYPNLITYSTLINGLCGSGRLKEAIELFEEMVSKDQIL 312

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALTYN LINGFC G K DRA KIM+FMKKNGCNPN FNYS+LMNG C++G+ EEAKE+
Sbjct: 313  PDALTYNALINGFCHGXKVDRALKIMEFMKKNGCNPNVFNYSALMNGFCKEGRLEEAKEV 372

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K +GL PDTV YTTLIN  CRAGR DEA+ELLK+M EN CRAD VTFNVILGGLCR
Sbjct: 373  FDEMKSLGLKPDTVGYTTLINFFCRAGRVDEAMELLKDMXENKCRADTVTFNVILGGLCR 432

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R+ EA  MLERLP +GVYLNKASYRIVLN LC+EGEL+KA +L+ LML RG LPHF T
Sbjct: 433  EGRFEEAXGMLERLPYEGVYLNKASYRIVLNSLCREGELQKATQLVGLMLGRGVLPHFAT 492

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
            SNELLV LCEAG   DA   L GL+E+GFKPEP +W+++++L+  ERKLLPAFELLD L+
Sbjct: 493  SNELLVHLCEAGKVGDAVMALLGLLELGFKPEPNSWALLVELICRERKLLPAFELLDDLV 552

Query: 1081 MKD 1089
            +++
Sbjct: 553  IQE 555


>gb|ESW09636.1| hypothetical protein PHAVU_009G143500g, partial [Phaseolus vulgaris]
          Length = 742

 Score =  458 bits (1179), Expect = e-126
 Identities = 227/349 (65%), Positives = 272/349 (77%)
 Frame = +1

Query: 4    FYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKHH 183
            F+ I+PI R  PS KA++TCLN+L+D+N++DLAR  LL  ++    +PN CIFNILVK+H
Sbjct: 155  FFSIQPIVRDKPSPKALTTCLNLLLDSNRVDLARKLLLHAKRGLTHKPNVCIFNILVKYH 214

Query: 184  CKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIVP 363
            CK GDLESAFEVVKEM  SE S PNLITYSTL+DGLC NGRL EA  LFEEMVS+  IVP
Sbjct: 215  CKNGDLESAFEVVKEMRSSEFSYPNLITYSTLMDGLCRNGRLREAFQLFEEMVSRDHIVP 274

Query: 364  DALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEIF 543
            D LTYN+LINGFCR GK D AR +++FMK NGC PN +NYS+L+NGLCR GK E+AK + 
Sbjct: 275  DPLTYNVLINGFCREGKPDHARNVIEFMKSNGCYPNVYNYSALVNGLCRIGKLEDAKGVL 334

Query: 544  IELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCRE 723
             E+K  GL PD V YT+LIN LCR G+  EAI+LL+EMKEN  +AD V FN+ILGGLCRE
Sbjct: 335  AEMKNSGLKPDAVTYTSLINYLCRNGQVGEAIQLLEEMKENKIQADTVVFNLILGGLCRE 394

Query: 724  YRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRTS 903
             R+ EAL+MLE+LP  GVYLNK SYRIVLN L + GEL+ A ELL LML RGFLPH+ +S
Sbjct: 395  DRFEEALDMLEKLPQQGVYLNKGSYRIVLNSLIQNGELKSAKELLGLMLSRGFLPHYASS 454

Query: 904  NELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLL 1050
            NELLV LC+ G A+DAA  LF LVEMGF+P   +W I+I L+  +RKLL
Sbjct: 455  NELLVCLCKGGMADDAARALFDLVEMGFQPGLESWEILIGLICRDRKLL 503



 Score =  113 bits (283), Expect = 1e-22
 Identities = 76/244 (31%), Positives = 131/244 (53%), Gaps = 5/244 (2%)
 Frame = +1

Query: 361  PDALT--YNLLINGFCRGGKTDRARKIMDFMKKNGCN-PNAFNYSSLMNGLCRDGKFEEA 531
            P ALT   NLL++      + D ARK++   K+   + PN   ++ L+   C++G  E A
Sbjct: 168  PKALTTCLNLLLDS----NRVDLARKLLLHAKRGLTHKPNVCIFNILVKYHCKNGDLESA 223

Query: 532  KEIFIELKGVGLT-PDTVIYTTLINCLCRAGRTDEAIELLKEMKEND-CRADEVTFNVIL 705
             E+  E++    + P+ + Y+TL++ LCR GR  EA +L +EM   D    D +T+NV++
Sbjct: 224  FEVVKEMRSSEFSYPNLITYSTLMDGLCRNGRLREAFQLFEEMVSRDHIVPDPLTYNVLI 283

Query: 706  GGLCREYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFL 885
             G CRE +   A N++E + ++G Y N  +Y  ++N LC+ G+LE A  +L  M + G  
Sbjct: 284  NGFCREGKPDHARNVIEFMKSNGCYPNVYNYSALVNGLCRIGKLEDAKGVLAEMKNSGLK 343

Query: 886  PHFRTSNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFEL 1065
            P   T   L+  LC  G   +A  LL  + E   + +   +++++  +  E +   A ++
Sbjct: 344  PDAVTYTSLINYLCRNGQVGEAIQLLEEMKENKIQADTVVFNLILGGLCREDRFEEALDM 403

Query: 1066 LDKL 1077
            L+KL
Sbjct: 404  LEKL 407


>gb|EPS69387.1| hypothetical protein M569_05378, partial [Genlisea aurea]
          Length = 449

 Score =  421 bits (1083), Expect = e-115
 Identities = 216/355 (60%), Positives = 259/355 (72%), Gaps = 8/355 (2%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF+ I PI R+ PS KAI+TCLN+LV+AN+I LAR  LL  R+DF L PN+CIFNILVK+
Sbjct: 95   MFHRIHPITRSKPSPKAITTCLNLLVEANEISLARALLLGLRRDFRLVPNSCIFNILVKY 154

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HC+ GD+ SAF+++KEM+KS +S PNLITYSTL+DGLC  G+L EA+ L EEM+SK +IV
Sbjct: 155  HCRNGDMGSAFDLLKEMDKSHLSSPNLITYSTLMDGLCRTGKLREAVVLLEEMISKRRIV 214

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PDALTYNLLI+GFCR GKTD ARKI+ FM+KNGC PN  NYS+LMNGLC  GK EEA+E 
Sbjct: 215  PDALTYNLLISGFCRTGKTDEARKIIAFMEKNGCPPNVVNYSTLMNGLCSAGKIEEARET 274

Query: 541  FIEL-KGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLC 717
               +    GL PD V YTTLI+ +CRAG T  AIELL+EMK   C  D VTFNVILGGLC
Sbjct: 275  MSRMANSSGLKPDAVAYTTLISSMCRAGETSGAIELLEEMKSVGCEPDAVTFNVILGGLC 334

Query: 718  REYRYYEALNMLERLPNDG-------VYLNKASYRIVLNFLCKEGELEKAVELLILMLHR 876
            RE R  EA+ M+E L + G          NK SYRIVLN L KEG+ E  V L+  M+ R
Sbjct: 335  REGRSGEAVGMVEGLHSGGGGGYYNHRVDNKGSYRIVLNHLVKEGDWEGCVGLVTAMMRR 394

Query: 877  GFLPHFRTSNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTER 1041
            GF+PHF TSNEL+V LC AG  N AA  + GL+ MGF+P   TWS ++D  F ER
Sbjct: 395  GFVPHFGTSNELVVGLCGAGKGNAAAAAVVGLLGMGFQPADDTWSALVDFYFRER 449



 Score =  129 bits (325), Expect = 2e-27
 Identities = 82/275 (29%), Positives = 138/275 (50%), Gaps = 3/275 (1%)
 Frame = +1

Query: 157 IFNILVKHHCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEE 336
           IF  L+KH  K    +   E+   +     S P+    +T ++ L     +  A  L   
Sbjct: 75  IFLTLMKHFAKLSMADRVVEMFHRIHPITRSKPSPKAITTCLNLLVEANEISLARALLLG 134

Query: 337 MVSKHQIVPDALTYNLLINGFCRGGKTDRARKIMDFMKKNG-CNPNAFNYSSLMNGLCRD 513
           +    ++VP++  +N+L+   CR G    A  ++  M K+   +PN   YS+LM+GLCR 
Sbjct: 135 LRRDFRLVPNSCIFNILVKYHCRNGDMGSAFDLLKEMDKSHLSSPNLITYSTLMDGLCRT 194

Query: 514 GKFEEAKEIFIE-LKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVT 690
           GK  EA  +  E +    + PD + Y  LI+  CR G+TDEA +++  M++N C  + V 
Sbjct: 195 GKLREAVVLLEEMISKRRIVPDALTYNLLISGFCRTGKTDEARKIIAFMEKNGCPPNVVN 254

Query: 691 FNVILGGLCREYRYYEALNMLERLPN-DGVYLNKASYRIVLNFLCKEGELEKAVELLILM 867
           ++ ++ GLC   +  EA   + R+ N  G+  +  +Y  +++ +C+ GE   A+ELL  M
Sbjct: 255 YSTLMNGLCSAGKIEEARETMSRMANSSGLKPDAVAYTTLISSMCRAGETSGAIELLEEM 314

Query: 868 LHRGFLPHFRTSNELLVSLCEAGNANDAATLLFGL 972
              G  P   T N +L  LC  G + +A  ++ GL
Sbjct: 315 KSVGCEPDAVTFNVILGGLCREGRSGEAVGMVEGL 349


>ref|XP_006403586.1| hypothetical protein EUTSA_v10010303mg [Eutrema salsugineum]
            gi|557104705|gb|ESQ45039.1| hypothetical protein
            EUTSA_v10010303mg [Eutrema salsugineum]
          Length = 505

 Score =  420 bits (1080), Expect = e-115
 Identities = 210/360 (58%), Positives = 265/360 (73%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF  I  IAR  PSL AISTCLN+L+D+ ++DLAR  LL  +    LQPNTCIFNILVKH
Sbjct: 145  MFNLILMIARVKPSLNAISTCLNLLIDSGEVDLARKLLLYAKNHLGLQPNTCIFNILVKH 204

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HCK GD++SAF VV+EM +  +S PNLITYSTLI+ L  + R +EA+ LFE+M+S   I 
Sbjct: 205  HCKNGDVDSAFRVVEEMRRFGISYPNLITYSTLIECLFAHSRSKEAMELFEDMISNEGIS 264

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PD +T+N++INGFCR G+ +RA+ I++FMKKNGCNPN FNYS+LMNG C++GK +EAK I
Sbjct: 265  PDPVTFNVMINGFCRAGQVERAKMIIEFMKKNGCNPNVFNYSALMNGFCKEGKIQEAKLI 324

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K  GL  DTV YTTL+NCLC+ G+ DEA+ELL EMK + C+AD +T+NVIL GL  
Sbjct: 325  FDEVKETGLKLDTVGYTTLMNCLCKNGQIDEAMELLVEMKASGCKADALTYNVILRGLSS 384

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R  +AL ML +   +GV+LNK SYRI+LN LCK GELEKAVE L LM  +G  PH  T
Sbjct: 385  EGRAEQALEMLGQWGCEGVHLNKGSYRIILNALCKNGELEKAVEFLSLMSKKGVWPHHAT 444

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
             NEL+V LC +GNA+    +L G + +GFKPEP +W  V+  V  ERKLL   EL+D L+
Sbjct: 445  WNELVVQLCGSGNADIGVRVLKGFLGIGFKPEPQSWGAVVGSVCKERKLLHVIELVDSLV 504



 Score =  106 bits (264), Expect = 2e-20
 Identities = 78/329 (23%), Positives = 150/329 (45%), Gaps = 38/329 (11%)
 Frame = +1

Query: 196  DLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRL---------------------- 309
            D + A +V   + + +    N  TYS L+D L  + +                       
Sbjct: 67   DPQCALDVFNILSRQKGFNHNSATYSVLLDNLVRHKKFQAVDAILNQMKYETCRFQEGVF 126

Query: 310  -------------EEAINLFEEMVSKHQIVPDALTYNLLINGFCRGGKTDRARKIMDFMK 450
                         E+ + +F  ++   ++ P     +  +N     G+ D ARK++ + K
Sbjct: 127  LNLMRHYSRFDLHEKVMEMFNLILMIARVKPSLNAISTCLNLLIDSGEVDLARKLLLYAK 186

Query: 451  KN-GCNPNAFNYSSLMNGLCRDGKFEEAKEIFIELKGVGLT-PDTVIYTTLINCLCRAGR 624
             + G  PN   ++ L+   C++G  + A  +  E++  G++ P+ + Y+TLI CL    R
Sbjct: 187  NHLGLQPNTCIFNILVKHHCKNGDVDSAFRVVEEMRRFGISYPNLITYSTLIECLFAHSR 246

Query: 625  TDEAIELLKEMKEND-CRADEVTFNVILGGLCREYRYYEALNMLERLPNDGVYLNKASYR 801
            + EA+EL ++M  N+    D VTFNV++ G CR  +   A  ++E +  +G   N  +Y 
Sbjct: 247  SKEAMELFEDMISNEGISPDPVTFNVMINGFCRAGQVERAKMIIEFMKKNGCNPNVFNYS 306

Query: 802  IVLNFLCKEGELEKAVELLILMLHRGFLPHFRTSNELLVSLCEAGNANDAATLLFGLVEM 981
             ++N  CKEG++++A  +   +   G          L+  LC+ G  ++A  LL  +   
Sbjct: 307  ALMNGFCKEGKIQEAKLIFDEVKETGLKLDTVGYTTLMNCLCKNGQIDEAMELLVEMKAS 366

Query: 982  GFKPEPFTWSIVIDLVFTERKLLPAFELL 1068
            G K +  T+++++  + +E +   A E+L
Sbjct: 367  GCKADALTYNVILRGLSSEGRAEQALEML 395


>ref|XP_002873896.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297319733|gb|EFH50155.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 507

 Score =  420 bits (1080), Expect = e-115
 Identities = 206/359 (57%), Positives = 264/359 (73%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF  I+ IAR  PSL AISTCLN+L+D+ ++DLAR  LL  + +  LQPNTCIFNILVKH
Sbjct: 147  MFNLIQVIARVKPSLNAISTCLNLLIDSGEVDLARKLLLYAKHNLALQPNTCIFNILVKH 206

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HCK GD++SAF VV+EM++S +S PN ITYSTL+D L    R +EA+ LFE+M+SK  I 
Sbjct: 207  HCKNGDIDSAFRVVEEMKRSGISYPNSITYSTLMDCLFAQSRSKEAVELFEDMISKRGIS 266

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PD + +N++INGFCR G+ +RA+ I+DFMKKNGCNPN +NYS+LMNG C++GK +EAK++
Sbjct: 267  PDPVIFNVMINGFCRSGEVERAKMILDFMKKNGCNPNVYNYSALMNGFCKEGKIQEAKQV 326

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K  GL  DTV YTTL+NCLCR G  DEA++LL EMK + CRAD +T+NVIL GL  
Sbjct: 327  FDEVKKTGLKLDTVGYTTLMNCLCRNGEIDEAMKLLGEMKASRCRADALTYNVILRGLSS 386

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R  EAL ML++   +GV+LNK SYRI+LN LC  GELEKAV+ L +M  RG  PH  T
Sbjct: 387  EGRSEEALQMLDQWGCEGVHLNKGSYRIILNALCCNGELEKAVKFLSVMSKRGIWPHHAT 446

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKL 1077
             NEL+V LCE+GN      +L G + +G  P P +W  V++ +  ERKL+  FELLD L
Sbjct: 447  WNELVVRLCESGNTEIGVRVLIGFLGIGLIPAPKSWGAVVESICKERKLVHVFELLDSL 505


>ref|XP_006287559.1| hypothetical protein CARUB_v10000770mg [Capsella rubella]
            gi|565459122|ref|XP_006287560.1| hypothetical protein
            CARUB_v10000770mg [Capsella rubella]
            gi|482556265|gb|EOA20457.1| hypothetical protein
            CARUB_v10000770mg [Capsella rubella]
            gi|482556266|gb|EOA20458.1| hypothetical protein
            CARUB_v10000770mg [Capsella rubella]
          Length = 506

 Score =  417 bits (1071), Expect = e-114
 Identities = 206/360 (57%), Positives = 266/360 (73%)
 Frame = +1

Query: 1    MFYFIRPIARANPSLKAISTCLNILVDANQIDLARTFLLSTRKDFHLQPNTCIFNILVKH 180
            MF  I+ IAR  PSLK+ISTCLN+L+DA +I+LAR  LL  + +  LQPNTCIFNILVKH
Sbjct: 146  MFNLIQVIARVKPSLKSISTCLNLLIDAGEINLARNLLLYAKHNLGLQPNTCIFNILVKH 205

Query: 181  HCKKGDLESAFEVVKEMEKSEMSCPNLITYSTLIDGLCGNGRLEEAINLFEEMVSKHQIV 360
            HCK GD++SAF VV+EM++S +S PN ITYSTL+D L  + R +EA+ LFE+M+SK  I+
Sbjct: 206  HCKNGDIDSAFRVVEEMKRSGISYPNSITYSTLMDCLFAHSRSKEAVELFEDMISKEGIL 265

Query: 361  PDALTYNLLINGFCRGGKTDRARKIMDFMKKNGCNPNAFNYSSLMNGLCRDGKFEEAKEI 540
            PD +T+N++INGFCR G+  RA  I+DFMKKNGCNPN +NYS+LMNG C++G  +EAK I
Sbjct: 266  PDPVTFNVMINGFCRSGEVKRAEMILDFMKKNGCNPNVYNYSALMNGFCKEGNIQEAKRI 325

Query: 541  FIELKGVGLTPDTVIYTTLINCLCRAGRTDEAIELLKEMKENDCRADEVTFNVILGGLCR 720
            F E+K VGL  DTV YTTL+NCLC+ G  DEA++LL EMK + CR D +T NVIL GL  
Sbjct: 326  FNEVKEVGLRLDTVGYTTLMNCLCKNGAIDEAMKLLGEMKASRCRVDALTCNVILKGLSS 385

Query: 721  EYRYYEALNMLERLPNDGVYLNKASYRIVLNFLCKEGELEKAVELLILMLHRGFLPHFRT 900
            E R  EAL ML++   +GV+L+K SYRI+LN LC  G+LEKAV+ L +M  RG  PH  T
Sbjct: 386  EGRSEEALQMLDQWGCEGVHLDKGSYRIILNGLCHNGKLEKAVKFLSVMSERGMWPHHAT 445

Query: 901  SNELLVSLCEAGNANDAATLLFGLVEMGFKPEPFTWSIVIDLVFTERKLLPAFELLDKLL 1080
             NEL+V LC +GNA     +L G +++G +PEP +W  V++    ERKL+  FELLD L+
Sbjct: 446  WNELVVRLCGSGNAEMGVRVLIGFLKIGLQPEPSSWRAVVESSCRERKLVHVFELLDSLV 505

