BLASTX nr result

ID: Cinnamomum23_contig00020553 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cinnamomum23_contig00020553
         (1741 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010260473.1| PREDICTED: pentatricopeptide repeat-containi...   324   2e-85
ref|XP_010908695.1| PREDICTED: pentatricopeptide repeat-containi...   281   2e-72
ref|XP_010644371.1| PREDICTED: pentatricopeptide repeat-containi...   280   2e-72
ref|XP_006437483.1| hypothetical protein CICLE_v10033975mg [Citr...   275   6e-71
gb|KDO53049.1| hypothetical protein CISIN_1g044786mg [Citrus sin...   269   5e-69
ref|XP_012478059.1| PREDICTED: pentatricopeptide repeat-containi...   262   6e-67
ref|XP_010325622.1| PREDICTED: pentatricopeptide repeat-containi...   261   9e-67
ref|XP_007015351.1| Pentatricopeptide repeat-containing protein,...   261   1e-66
ref|XP_012837392.1| PREDICTED: pentatricopeptide repeat-containi...   259   5e-66
ref|XP_010097673.1| hypothetical protein L484_023813 [Morus nota...   258   8e-66
gb|AFK33630.1| unknown [Lotus japonicus]                              256   3e-65
ref|XP_010054002.1| PREDICTED: pentatricopeptide repeat-containi...   252   6e-64
ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containi...   251   1e-63
ref|XP_007158057.1| hypothetical protein PHAVU_002G120500g [Phas...   249   4e-63
ref|XP_002519945.1| pentatricopeptide repeat-containing protein,...   248   1e-62
ref|XP_010680219.1| PREDICTED: pentatricopeptide repeat-containi...   244   2e-61
ref|XP_004512307.1| PREDICTED: pentatricopeptide repeat-containi...   244   2e-61
emb|CDP07175.1| unnamed protein product [Coffea canephora]            243   5e-61
gb|EYU37498.1| hypothetical protein MIMGU_mgv1a021373mg, partial...   239   7e-60
ref|XP_003612457.1| Pentatricopeptide repeat-containing protein ...   238   9e-60

>ref|XP_010260473.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790
            [Nelumbo nucifera] gi|720014365|ref|XP_010260474.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g31790 [Nelumbo nucifera]
            gi|720014368|ref|XP_010260475.1| PREDICTED:
            pentatricopeptide repeat-containing protein At1g31790
            [Nelumbo nucifera]
          Length = 414

 Score =  324 bits (830), Expect = 2e-85
 Identities = 181/372 (48%), Positives = 240/372 (64%), Gaps = 7/372 (1%)
 Frame = -1

Query: 1462 PSRIEIKSVE------STMATDILRLMDSLQLPISADMYATLLKECIESGDVTQGGKVHS 1301
            PS +EIK  +      +T  TD+L LMDSL L I AD+YA+LLKEC ++ D T+G +VH+
Sbjct: 50   PSNVEIKLNQHRSRSTTTTTTDVLCLMDSLHLRIPADIYASLLKECTDARDATRGAEVHA 109

Query: 1300 HMIHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMPRRDSISWATIIAGHEENGDSQ 1121
            HM +                MY +CS+LD A  +FD M  RDSISWAT+IAG+  +GD +
Sbjct: 110  HM-NRSGFRPGLPLANRLLLMYVACSRLDDARKVFDKMTIRDSISWATLIAGYVNHGDCK 168

Query: 1120 NVLRLFAQMRE-NGLDPTAPLLVCVLKACVDSGDLRLGEQVHSWFIKKKTMLRNREDIPV 944
              + LF +M++ +GL+ T  ++V + KACV  G+  LG+Q+H W  K    +   +++  
Sbjct: 169  EAISLFLEMQQGSGLEFTDLIIVSIFKACVHIGEFGLGKQIHGWIFK----VGYHKNLFF 224

Query: 943  VASLINLYSGFGCLESARRVFDEMGRHCDGTVVWTILMDGYCKVGSFEEVLNIFKEMGRA 764
             ++LIN Y  F CLE A+ VF+   R    TV+W  ++ GY + G F+EVL+IFKEMGRA
Sbjct: 225  CSALINFYRKFKCLEDAQFVFNGANRR--DTVLWNDIITGYSREGQFDEVLDIFKEMGRA 282

Query: 763  GRKKKCFTFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESDMFVQSSLVDMYGKCGLLND 584
            G  K  FTF TVLRA GR+ D GQ G+QVHA+ IK+GVE D+FVQSSLVDMYG+ GLL D
Sbjct: 283  GASKNNFTFSTVLRASGRVRDYGQCGKQVHASTIKLGVEMDLFVQSSLVDMYGRQGLLRD 342

Query: 583  ARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLCTEAIKLLYEMKAAGIEPQE 404
            AR+  EM       N++N       +CWNAM   Y  HG   +AIK LYEMKAAG++PQE
Sbjct: 343  ARRVFEMIG-----NERND------VCWNAMFIGYLQHGFYNDAIKFLYEMKAAGLQPQE 391

Query: 403  SMLYQARIACGS 368
            SML + RIACGS
Sbjct: 392  SMLTKLRIACGS 403


>ref|XP_010908695.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790
            [Elaeis guineensis]
          Length = 484

 Score =  281 bits (718), Expect = 2e-72
 Identities = 155/365 (42%), Positives = 217/365 (59%), Gaps = 9/365 (2%)
 Frame = -1

Query: 1435 ESTMATDILRLMDSLQLPISADMYATLLKECIESGDVTQGGKVHSHMIHH----IXXXXX 1268
            E+  ATD+L LMD LQLPI AD+Y +L++EC  S D  QG +VH+H+       +     
Sbjct: 131  ETCTATDVLHLMDGLQLPIEADLYLSLVRECTHSRDAFQGAQVHAHIQRSRPRLLRRAAG 190

Query: 1267 XXXXXXXXXMYASCSQLDTACILFDGMPRRDSISWATIIAGHEENGDSQNVLRLFAQMRE 1088
                     MYA+C Q D+A  +FD MP RD +SWA ++A    +G  +  L+LFA+MR+
Sbjct: 191  LPLANRLLLMYATCGQADSARHMFDRMPFRDPMSWAAMLATLAHHGGHREALQLFAEMRK 250

Query: 1087 NGLDP-----TAPLLVCVLKACVDSGDLRLGEQVHSWFIKKKTMLRNREDIPVVASLINL 923
            + ++       A +LV  L++CV + +L L  QVH   +K            + +SL+  
Sbjct: 251  SAVEAGGRYLDALVLVTTLRSCVRARELGLARQVHGLALKVLGESGAIGCGGIGSSLLQC 310

Query: 922  YSGFGCLESARRVFDEMGRHCDGTVVWTILMDGYCKVGSFEEVLNIFKEMGRAGRKKKCF 743
            YS  GC +SARRVF+ M     G   WT ++ G C VG FEE L +F+EMGRAG ++   
Sbjct: 311  YSMLGCHQSARRVFERMRIGSRGAAAWTCMITGCCGVGRFEEALYVFREMGRAGHRRNSH 370

Query: 742  TFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESDMFVQSSLVDMYGKCGLLNDARKALEM 563
               ++L AC R+GD G  GRQVHA+A+K+GV++D +V SSLVDMY K GLL DAR+A E 
Sbjct: 371  VVSSILAACARIGDSGWGGRQVHASAVKLGVDADRYVGSSLVDMYAKHGLLKDARRAFET 430

Query: 562  TAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLCTEAIKLLYEMKAAGIEPQESMLYQAR 383
             A            +   +CWNAML  YA  G C+EAI LLY+M+AAG+ P+E ++ Q R
Sbjct: 431  IAG-----------SGDPVCWNAMLAGYARGGCCSEAISLLYQMRAAGVRPKELIVNQVR 479

Query: 382  IACGS 368
            +AC +
Sbjct: 480  MACST 484


>ref|XP_010644371.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790
            [Vitis vinifera]
          Length = 469

 Score =  280 bits (717), Expect = 2e-72
 Identities = 164/357 (45%), Positives = 217/357 (60%), Gaps = 6/357 (1%)
 Frame = -1

Query: 1420 TDILRLMDSLQLPISADMYATLLKECIESGDVTQGGKVHSHMIHHIXXXXXXXXXXXXXX 1241
            TDILRLMD L LPI  D+YA+L+KE   +GD TQ  ++ +H I+                
Sbjct: 117  TDILRLMDGLGLPIPPDIYASLIKESSTTGDATQATQLLAH-INRSGLPLSSALLNRILL 175

Query: 1240 MYASCSQLDTACILFDGMP--RRDSISWATIIAGHEENGDSQNVLRLFAQMRENG----L 1079
            MY SC  + TA  +FD M    ++SISWA ++A + +NG  +  + LF QM E      L
Sbjct: 176  MYVSCGLIHTARHMFDKMNVLNKNSISWAIMLAAYMDNGFYEEAIFLFVQMMELHSTIML 235

Query: 1078 DPTAPLLVCVLKACVDSGDLRLGEQVHSWFIKKKTMLRNREDIPVVASLINLYSGFGCLE 899
            +  A + +CVLKACV + +L LG+QVH W +K    +    ++ +   LI+ Y  F CL+
Sbjct: 236  ELPAWIFICVLKACVHTMNLTLGKQVHGWLLK----VGYATNLFLSCYLISFYGKFRCLD 291

Query: 898  SARRVFDEMGRHCDGTVVWTILMDGYCKVGSFEEVLNIFKEMGRAGRKKKCFTFPTVLRA 719
             A  VFD+       TV+WT  M   C+     E L  F EMGRAG K+  FT+ +VLRA
Sbjct: 292  DADFVFDQTSER--NTVIWTAKMVNKCQGEYMHEALVAFTEMGRAGVKRNEFTYSSVLRA 349

Query: 718  CGRLGDDGQVGRQVHAAAIKVGVESDMFVQSSLVDMYGKCGLLNDARKALEMTAAVSSFN 539
            CGR+ D G+ GR +HA+ IK+G+ESD++VQ  LVDMYGKCGLL +AR+  E    VS  N
Sbjct: 350  CGRMKDHGRCGRLIHASTIKLGLESDIYVQCGLVDMYGKCGLLVEARRVFE---TVSDTN 406

Query: 538  KKNTNDNNIVICWNAMLHAYASHGLCTEAIKLLYEMKAAGIEPQESMLYQARIACGS 368
            K N      ++CWNAML  Y  HGL  EAIK LY+MKAAGI+PQES+L + RIACGS
Sbjct: 407  KTN------IVCWNAMLTGYIRHGLYIEAIKFLYQMKAAGIQPQESLLNELRIACGS 457


>ref|XP_006437483.1| hypothetical protein CICLE_v10033975mg [Citrus clementina]
            gi|557539679|gb|ESR50723.1| hypothetical protein
            CICLE_v10033975mg [Citrus clementina]
          Length = 425

 Score =  275 bits (704), Expect = 6e-71
 Identities = 166/374 (44%), Positives = 218/374 (58%), Gaps = 5/374 (1%)
 Frame = -1

Query: 1432 STMATDILRLMDSLQLPISADMYATLLKECIESGDVTQGGKVHSHMIHHIXXXXXXXXXX 1253
            +T + +IL LMD+L LPI+ DMY  L+KEC    D     ++ +H+   +          
Sbjct: 66   NTSSANILHLMDNLCLPITTDMYTCLIKECTFQKDSAGAFELLNHIRKRVNIKPTLLFLN 125

Query: 1252 XXXXMYASCSQLDTACILFDGMPRRDSISWATIIAGHEENGDSQNVLRLFAQM--RENG- 1082
                M+ SC QLDTA  LFD MP RD  SWA +I G+ +  D Q  + LFA+M  R+ G 
Sbjct: 126  RLLLMHVSCGQLDTARQLFDEMPLRDFNSWAVMIVGYVDVADYQECITLFAEMMKRKKGH 185

Query: 1081 --LDPTAPLLVCVLKACVDSGDLRLGEQVHSWFIKKKTMLRNREDIPVVASLINLYSGFG 908
              L   A ++VCVLKACV + ++ LG+QVH    K    L +  +I +  SLIN Y  F 
Sbjct: 186  MLLVFPAWIIVCVLKACVCTMNMELGKQVHGLLFK----LGSSRNISLTGSLINFYGKFR 241

Query: 907  CLESARRVFDEMGRHCDGTVVWTILMDGYCKVGSFEEVLNIFKEMGRAGRKKKCFTFPTV 728
            CLE A  VF ++ RH   TVVWT  +   C+ G F +V N FKEMGR   KK  +TF +V
Sbjct: 242  CLEDADFVFSQLKRH--NTVVWTAKIVNNCREGHFHQVFNDFKEMGRERIKKNSYTFSSV 299

Query: 727  LRACGRLGDDGQVGRQVHAAAIKVGVESDMFVQSSLVDMYGKCGLLNDARKALEMTAAVS 548
            L+ACG + DDG  GRQVHA  +K+G+ESD +VQ  LVDMYGKC LL DA++  E+     
Sbjct: 300  LKACGGVDDDGNCGRQVHANIVKIGLESDEYVQCGLVDMYGKCRLLRDAKRVFELIV--- 356

Query: 547  SFNKKNTNDNNIVICWNAMLHAYASHGLCTEAIKLLYEMKAAGIEPQESMLYQARIACGS 368
              +KKN      +  WNAML  Y  +GL  EA K LY MKA+GI+ QES++   RIAC S
Sbjct: 357  --DKKN------IASWNAMLMGYIRNGLYVEATKFLYLMKASGIQIQESLINDLRIACSS 408

Query: 367  *ISSQINNILLQIV 326
              S  + N + Q V
Sbjct: 409  -SSDTLQNRMEQTV 421


>gb|KDO53049.1| hypothetical protein CISIN_1g044786mg [Citrus sinensis]
          Length = 340

 Score =  269 bits (688), Expect = 5e-69
 Identities = 160/357 (44%), Positives = 209/357 (58%), Gaps = 5/357 (1%)
 Frame = -1

Query: 1402 MDSLQLPISADMYATLLKECIESGDVTQGGKVHSHMIHHIXXXXXXXXXXXXXXMYASCS 1223
            MD+L LPI+ DMY  L+KEC    D     ++ +H+   +              M+ SC 
Sbjct: 1    MDNLCLPITTDMYTCLIKECTFQKDSAGAFELLNHIRKRVNIKPTLLFLNRLLLMHVSCG 60

Query: 1222 QLDTACILFDGMPRRDSISWATIIAGHEENGDSQNVLRLFAQM--RENG---LDPTAPLL 1058
            QLDTA  LFD MP RD  SWA +I G+ +  D Q  + LFA+M  R+ G   L   A ++
Sbjct: 61   QLDTARQLFDEMPLRDFNSWAVMIVGYVDVADYQECITLFAEMMKRKKGHMLLVFPAWII 120

Query: 1057 VCVLKACVDSGDLRLGEQVHSWFIKKKTMLRNREDIPVVASLINLYSGFGCLESARRVFD 878
            VCVLKACV + ++ LG+QVH    K    L +  +I +  SLIN Y  F CLE A  VF 
Sbjct: 121  VCVLKACVCTMNMELGKQVHGLLFK----LGSSRNISLTGSLINFYGKFRCLEDADFVFS 176

Query: 877  EMGRHCDGTVVWTILMDGYCKVGSFEEVLNIFKEMGRAGRKKKCFTFPTVLRACGRLGDD 698
            ++ RH   TVVWT  +   C+ G F +V N FKEMGR   KK  +TF +VL+ACG + DD
Sbjct: 177  QLKRH--NTVVWTAKIVNNCREGHFHQVFNDFKEMGRERIKKNSYTFSSVLKACGGVDDD 234

Query: 697  GQVGRQVHAAAIKVGVESDMFVQSSLVDMYGKCGLLNDARKALEMTAAVSSFNKKNTNDN 518
            G  GRQ+HA  +K+G+ESD +VQ  LVDMYGKC LL DA +  E+       +KKN    
Sbjct: 235  GNCGRQMHANIVKIGLESDEYVQCGLVDMYGKCRLLRDAERVFELIV-----DKKN---- 285

Query: 517  NIVICWNAMLHAYASHGLCTEAIKLLYEMKAAGIEPQESMLYQARIACGS*ISSQIN 347
              +  WNAML  Y  +GL  EA K LY MKA+GI+ QES++   RIAC S  +S+IN
Sbjct: 286  --IASWNAMLVGYIRNGLYVEATKFLYLMKASGIQIQESLINDLRIACSSISASKIN 340


>ref|XP_012478059.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790
            [Gossypium raimondii] gi|763741782|gb|KJB09281.1|
            hypothetical protein B456_001G132700 [Gossypium
            raimondii]
          Length = 397

 Score =  262 bits (670), Expect = 6e-67
 Identities = 159/391 (40%), Positives = 213/391 (54%), Gaps = 3/391 (0%)
 Frame = -1

Query: 1531 LGASRPACIRPASLHHSPKLQHSPSRIEIKSVESTMAT-DILRLMDSLQLPISADMYATL 1355
            L   R   + P  L   PK   +P+   I S   T  T DILRLMDSL +PI  D+YA+L
Sbjct: 19   LKTERQIQLPPRFLKQCPK---TPTAKPISSNPGTGTTSDILRLMDSLSIPIPPDIYASL 75

Query: 1354 LKECIESGDVTQGGKVHSHMIHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMPRRD 1175
            +KEC  S    +  ++H+H I H               M+ SC  L+ A  +FD M  RD
Sbjct: 76   IKECTLSRHSVRALQLHNH-IRHRRIKLSLPLLNRLLLMHVSCGHLEIARQVFDQMFLRD 134

Query: 1174 SISWATIIAGHEENGDSQNVLRLFAQMRENGLDPTAP--LLVCVLKACVDSGDLRLGEQV 1001
              SWA +I    + GDS+  +  F  M         P  ++ C+LK+CV + ++ LG+QV
Sbjct: 135  FNSWAIMIVACLQAGDSEQAISYFVLMERCSSLFKFPAWIITCLLKSCVLTKNMELGKQV 194

Query: 1000 HSWFIKKKTMLRNREDIPVVASLINLYSGFGCLESARRVFDEMGRHCDGTVVWTILMDGY 821
            H   +K   +    +D+ +  SLIN Y  F CL+ A  VF++  R    TV WT  M   
Sbjct: 195  HGQLLKLGVI----DDLSLSGSLINFYGNFKCLDDANVVFNQSSRR--NTVTWTAKMVNS 248

Query: 820  CKVGSFEEVLNIFKEMGRAGRKKKCFTFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESD 641
            C+   F +V + F EMGR G KK  FTF +VL+AC  + D+G  GRQVHA AIK+G+E +
Sbjct: 249  CRENQFHKVFDDFTEMGRQGIKKNSFTFSSVLKACAGMDDEGMSGRQVHAIAIKLGLECE 308

Query: 640  MFVQSSLVDMYGKCGLLNDARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLC 461
             FVQ  L+DMYGKCGL+ DA KA            K   D   + CWNAM+  Y  + LC
Sbjct: 309  AFVQCGLIDMYGKCGLVRDAEKAF-----------KVAGDERNIACWNAMIMGYVHNKLC 357

Query: 460  TEAIKLLYEMKAAGIEPQESMLYQARIACGS 368
             +AIKLLY MK AG+E QES++   RIACG+
Sbjct: 358  IQAIKLLYGMKEAGLEVQESLINDVRIACGN 388


>ref|XP_010325622.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790
            [Solanum lycopersicum]
          Length = 417

 Score =  261 bits (668), Expect = 9e-67
 Identities = 158/383 (41%), Positives = 211/383 (55%), Gaps = 10/383 (2%)
 Frame = -1

Query: 1486 HSPKLQ-HSPSRIEIKSVESTMAT--DILRLMDSLQLPISADMYATLLKECIESGDVTQG 1316
            H P  + H P + EIK       T  D+LRLMDSL   I  D+Y +L+KEC ES D    
Sbjct: 45   HKPHYKIHQPIKPEIKKTTDPSCTISDVLRLMDSLGFNIPVDVYVSLIKECTESRDPLNA 104

Query: 1315 GKVHSHMIHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMPRRDSISWATIIAGHEE 1136
             +V+ H+                  M   C   + A  LFD M  R+S SWA +IAG  E
Sbjct: 105  VEVYEHVCKS-DVIPSLPLLNRLLLMLVLCGCFEQARQLFDKMRVRNSQSWAAMIAGCVE 163

Query: 1135 NGDSQNVLRLFAQMRENGLDPTA-------PLLVCVLKACVDSGDLRLGEQVHSWFIKKK 977
            NG+    LRLF +M+    +           +LVCVLKACV+  +L  G Q+H W +K  
Sbjct: 164  NGECVGALRLFMEMQSEAGNLCKCGDLIDDGILVCVLKACVELMNLEFGRQIHGWLLK-- 221

Query: 976  TMLRNREDIPVVASLINLYSGFGCLESARRVFDEMGRHCDGTVVWTILMDGYCKVGSFEE 797
              L N E + + + LI  Y  FG LESA  VFD +  HC+ TVVWT  +   CK   FE 
Sbjct: 222  --LGNCESMVLNSFLIKFYGEFGYLESADNVFDHVP-HCN-TVVWTARIGNLCKEEQFEG 277

Query: 796  VLNIFKEMGRAGRKKKCFTFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESDMFVQSSLV 617
             + IF+EM   G KK  FTF ++L+ACG+L D G  G+Q+HA ++KVG+++D +V  SL+
Sbjct: 278  AIRIFREMVSEGVKKNSFTFSSILKACGKLRDAGCCGQQIHATSVKVGLDTDSYVLCSLI 337

Query: 616  DMYGKCGLLNDARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLCTEAIKLLY 437
            DMYGK GLL DAR+          FN +    N  + CWNAML     HG   EA+K+LY
Sbjct: 338  DMYGKYGLLKDARRV---------FNAREDKSN--IACWNAMLMGCIQHGFGVEAMKVLY 386

Query: 436  EMKAAGIEPQESMLYQARIACGS 368
            EMK AG++P ES++ +  + CGS
Sbjct: 387  EMKEAGLQPHESLINEVLLVCGS 409


>ref|XP_007015351.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
            cacao] gi|508785714|gb|EOY32970.1| Pentatricopeptide
            repeat-containing protein, putative [Theobroma cacao]
          Length = 413

 Score =  261 bits (667), Expect = 1e-66
 Identities = 157/391 (40%), Positives = 210/391 (53%), Gaps = 2/391 (0%)
 Frame = -1

Query: 1534 PLGASRPACIRPASLHHSPKLQHSPSRIEIKSVESTMATDILRLMDSLQLPISADMYATL 1355
            P+  S+P    P S H +                    +DILRLMDSL LPI  D+YA+L
Sbjct: 60   PISTSKPISSNPCSSHTT--------------------SDILRLMDSLSLPIPPDIYASL 99

Query: 1354 LKECIESGDVTQGGKVHSHMIHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMPRRD 1175
            +KEC  +    +  ++HSH I +               M+ SC  LD A  LFD M  RD
Sbjct: 100  VKECTVTRHSRRALELHSH-IRNSRIKPSLPLLNRLLLMHVSCGHLDIARHLFDQMLLRD 158

Query: 1174 SISWATIIAGHEENGDSQNVLRLFAQMRENGLDPTAP--LLVCVLKACVDSGDLRLGEQV 1001
              SWA +I      GDS+  +  F +M  + L    P  ++VC+LK+CV + ++ LG+QV
Sbjct: 159  FNSWAIMIVACLHAGDSEQAIAYFVRMERHNLLFKCPSWIIVCLLKSCVVTKNMGLGKQV 218

Query: 1000 HSWFIKKKTMLRNREDIPVVASLINLYSGFGCLESARRVFDEMGRHCDGTVVWTILMDGY 821
            H   +K    L    D  +  SLIN Y  F CL+ A  VF+++ R    TV WT  +   
Sbjct: 219  HGQLLK----LGASNDSSLSGSLINFYGKFRCLDDADFVFNQLSRR--NTVTWTARIVNS 272

Query: 820  CKVGSFEEVLNIFKEMGRAGRKKKCFTFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESD 641
            C+   F +V++ F EMGR G KK  FTF  V +AC R+ DDG  GRQVHA A+K+G+ESD
Sbjct: 273  CREDQFGKVIDDFNEMGRQGIKKNNFTFSGVFKACARMDDDGMSGRQVHANALKLGLESD 332

Query: 640  MFVQSSLVDMYGKCGLLNDARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLC 461
            +FVQ  L+ +YGKCG + DA KA E+             D   + CWNAML  Y  + LC
Sbjct: 333  VFVQCGLIHLYGKCGSVRDAEKAFEI-----------VGDKRNIACWNAMLMGYVHNELC 381

Query: 460  TEAIKLLYEMKAAGIEPQESMLYQARIACGS 368
              AIKLLY MK AGI+ QES++   RIAC +
Sbjct: 382  LRAIKLLYRMKEAGIKVQESLINDVRIACAT 412


>ref|XP_012837392.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790
            [Erythranthe guttatus]
          Length = 413

 Score =  259 bits (662), Expect = 5e-66
 Identities = 159/387 (41%), Positives = 209/387 (54%), Gaps = 12/387 (3%)
 Frame = -1

Query: 1492 LHHSPKLQHSPSRIEIKSVESTMAT--DILRLMDSLQLPISADMYATLLKECIESGDVTQ 1319
            +H  P+ +  P    IK V   + T  DIL LMDSL+LPI  D+Y +L+KEC E GD  +
Sbjct: 34   IHRRPEYKPVPKPARIKPVPKPVTTTSDILHLMDSLKLPIPPDIYTSLIKECTELGDPLK 93

Query: 1318 GGKVHSHMIHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMPRRDSISWATIIAGHE 1139
              ++H HM                  MY S   LD A  LFD M  RD  SWA +IAG  
Sbjct: 94   SIELHEHM-RRSGFRFTLPLLNRLLLMYVSSGCLDRARQLFDQMFLRDFNSWAVLIAGFV 152

Query: 1138 ENGDSQNVLRLFAQMREN------GLD----PTAPLLVCVLKACVDSGDLRLGEQVHSWF 989
            ENG+    + LF +M         GLD      + +LVCVLKAC+ + D  LG QVH W 
Sbjct: 153  ENGEHDEAINLFVEMLNRQDMGNVGLDRMGFSVSGILVCVLKACLFTSDFELGTQVHGWL 212

Query: 988  IKKKTMLRNREDIPVVASLINLYSGFGCLESARRVFDEMGRHCDGTVVWTILMDGYCKVG 809
             K    +   E   +   LIN Y    C E A+ VFD +      T VWT  +  +C  G
Sbjct: 213  WK----MGFSESASLSCFLINFYGRLDCFEGAQTVFDHVRN--PNTAVWTSRIVSFCSNG 266

Query: 808  SFEEVLNIFKEMGRAGRKKKCFTFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESDMFVQ 629
            +FEE +++FKEMGR G ++  +TF TVL+AC ++GD  + G+QVHA +IK G+ESD +VQ
Sbjct: 267  NFEEAVSVFKEMGREGVRENSYTFSTVLKACRKMGDI-RCGQQVHANSIKSGLESDSYVQ 325

Query: 628  SSLVDMYGKCGLLNDARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLCTEAI 449
             +LVD YGKCG LNDA +  EM          + +  N   C NAML  Y  HGLC EA 
Sbjct: 326  CALVDFYGKCGFLNDATRVFEM----------DISKRNDASC-NAMLANYVRHGLCIEAN 374

Query: 448  KLLYEMKAAGIEPQESMLYQARIACGS 368
            ++L +MK +G  P ES+  +    CGS
Sbjct: 375  EILRQMKMSGSRPCESVFNEVSFVCGS 401


>ref|XP_010097673.1| hypothetical protein L484_023813 [Morus notabilis]
            gi|587881693|gb|EXB70628.1| hypothetical protein
            L484_023813 [Morus notabilis]
          Length = 453

 Score =  258 bits (660), Expect = 8e-66
 Identities = 149/353 (42%), Positives = 207/353 (58%), Gaps = 2/353 (0%)
 Frame = -1

Query: 1420 TDILRLMDSLQLPISADMYATLLKECIESGDVTQGGKVHSHMIHHIXXXXXXXXXXXXXX 1241
            +D+LRLMD+L LPIS DMY + +KEC  S D      +H+H+  +               
Sbjct: 107  SDVLRLMDALCLPISPDMYISFMKECTISADFCGAEDLHNHISRNSLQHLALPLLNRLLF 166

Query: 1240 MYASCSQLDTACILFDGMPRRDSISWATIIAGHEENGDSQNVLRLFAQMRE--NGLDPTA 1067
            M  SC +LD AC LF  MP +D  SWAT+I  +  N D +    LF +M    N L+  +
Sbjct: 167  MNVSCGRLDLACDLFYRMPFKDFKSWATMIVANVNNSDYEEATSLFLKMLHHINMLEFPS 226

Query: 1066 PLLVCVLKACVDSGDLRLGEQVHSWFIKKKTMLRNREDIPVVASLINLYSGFGCLESARR 887
             ++VC+LK CV + ++ LG+QVH+  +K    L +   + + + LIN Y  +GCLESA  
Sbjct: 227  WIIVCLLKTCVCTRNMELGKQVHACALK----LGHANSLYLASCLINFYGKYGCLESANL 282

Query: 886  VFDEMGRHCDGTVVWTILMDGYCKVGSFEEVLNIFKEMGRAGRKKKCFTFPTVLRACGRL 707
            VF+++ RH   T+ W   +    K   F EVL  F E+G+AG KK    F +VL+ACGR+
Sbjct: 283  VFNQLPRH--DTLTWMTRLINNSKEELFFEVLRDFNEVGKAGIKKNVLMFSSVLKACGRI 340

Query: 706  GDDGQVGRQVHAAAIKVGVESDMFVQSSLVDMYGKCGLLNDARKALEMTAAVSSFNKKNT 527
             D  + G+QVHA AIK+G ESD++VQ  L+DMYG+ GLL DA++          F K + 
Sbjct: 341  HDRRKSGQQVHANAIKLGFESDLYVQCGLIDMYGRSGLLRDAQRV---------FEKSSD 391

Query: 526  NDNNIVICWNAMLHAYASHGLCTEAIKLLYEMKAAGIEPQESMLYQARIACGS 368
              NN   CWNAML  Y  + L  EAIK +Y+MKA G++ Q+SML + RIACGS
Sbjct: 392  RRNN--ACWNAMLGGYIRNELYVEAIKFVYQMKAVGLQLQQSMLDELRIACGS 442


>gb|AFK33630.1| unknown [Lotus japonicus]
          Length = 356

 Score =  256 bits (655), Expect = 3e-65
 Identities = 148/358 (41%), Positives = 206/358 (57%), Gaps = 2/358 (0%)
 Frame = -1

Query: 1414 ILRLMDSLQLPISADMYATLLKECIESGDVTQGGKVHSHMIHHIXXXXXXXXXXXXXXMY 1235
            IL LMD L  PI  D+Y +L+KEC  S D     ++H+H I H               M+
Sbjct: 17   ILHLMDVLPFPIPIDIYTSLIKECTLSPDPQTAIELHTH-IAHSGIKPPLSFINRILVMF 75

Query: 1234 ASCSQLDTACILFDGMPRRDSISWATIIAGHEENGDSQNVLRLF-AQMRENGLDPTAP-L 1061
             SC  LD AC LFD MP +D  SWAT+   + +N D +  + +F A + + G+    P +
Sbjct: 76   VSCGLLDYACQLFDAMPVKDFNSWATLFIAYYDNADYEEAIDVFLAMLHQLGMSEFPPWI 135

Query: 1060 LVCVLKACVDSGDLRLGEQVHSWFIKKKTMLRNREDIPVVASLINLYSGFGCLESARRVF 881
              C LKAC    ++ LG QVH W +K  T     + + + +SLI  Y  F C++ A  VF
Sbjct: 136  CACFLKACACIENIPLGMQVHGWLLKLGTC----DHVLLSSSLIRFYGRFTCVKDANAVF 191

Query: 880  DEMGRHCDGTVVWTILMDGYCKVGSFEEVLNIFKEMGRAGRKKKCFTFPTVLRACGRLGD 701
            +++ RH   T  WT  +   C+   F EV N FKEMGR G KK  +TF +VL+ACG++ D
Sbjct: 192  NKLSRH--NTSTWTAKIVSGCREMDFPEVFNDFKEMGRQGIKKDTYTFSSVLKACGKMMD 249

Query: 700  DGQVGRQVHAAAIKVGVESDMFVQSSLVDMYGKCGLLNDARKALEMTAAVSSFNKKNTND 521
             G+ G QVHA A+K+G+ SD +VQ SL+ MYG+ GLL DA++  E     +S +++N + 
Sbjct: 250  HGRCGEQVHADAMKLGLASDNYVQCSLIAMYGRSGLLRDAKQVFE-----TSRSERNVDS 304

Query: 520  NNIVICWNAMLHAYASHGLCTEAIKLLYEMKAAGIEPQESMLYQARIACGS*ISSQIN 347
                  WNAML  Y  +GL  EA+K LY+MKAAG++P ES+L + RIACGS   S  N
Sbjct: 305  ------WNAMLMGYLENGLYIEAVKFLYQMKAAGLKPHESLLDKVRIACGSVTYSSTN 356


>ref|XP_010054002.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790
            [Eucalyptus grandis] gi|629113423|gb|KCW78383.1|
            hypothetical protein EUGRSUZ_D02554 [Eucalyptus grandis]
          Length = 436

 Score =  252 bits (644), Expect = 6e-64
 Identities = 159/394 (40%), Positives = 218/394 (55%), Gaps = 5/394 (1%)
 Frame = -1

Query: 1534 PLGASRPACIRPASLHH---SPKLQHSPSRIEIKSVESTMATDILRLMDSLQLPISADMY 1364
            PL    P     AS  H   +P+ +  P+R + K+  S  ATD+LRL+D L + +S D+Y
Sbjct: 62   PLELQAPTPRTDASTVHPRPAPEGRKVPTRKK-KNRASGAATDVLRLLDGLGVTVSPDVY 120

Query: 1363 ATLLKECIESGDVTQGGKVHSHMIHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMP 1184
              L+KEC E+GD     ++H H I                 M+ SC  L +A  +F+GM 
Sbjct: 121  VLLIKECTENGDSAGALELHKH-IRRSGLRPGLHLLNRFLLMFVSCGCLVSARKVFEGML 179

Query: 1183 RRDSISWATIIAGHEENGDSQNVLRLFAQMRENGLDPTAPL--LVCVLKACVDSGDLRLG 1010
            +RD  SWA +I G  E G+ +  +RLF +M         PL  +VC+ KACV   +  LG
Sbjct: 180  QRDVSSWAILIVGFMEEGEYEEAMRLFVRMLCCINVSEVPLWTVVCIFKACVHCANKDLG 239

Query: 1009 EQVHSWFIKKKTMLRNREDIPVVASLINLYSGFGCLESARRVFDEMGRHCDGTVVWTILM 830
            EQ+H W +K+ +   N    P+  SLI+ Y  F C E A  +F ++  H   + +WT  +
Sbjct: 240  EQLHGWLLKQGSGGEN----PLPRSLIDFYGKFKCPECANIIFQQLSSH--NSAIWTAKL 293

Query: 829  DGYCKVGSFEEVLNIFKEMGRAGRKKKCFTFPTVLRACGRLGDDGQVGRQVHAAAIKVGV 650
                    ++ V + F+E+ R G +K    F +VL+ACGR+ DDGQ GRQVHA AIKVGV
Sbjct: 294  AYDYSGHQYDTVFSNFREIEREGIQKSRSMFLSVLKACGRVKDDGQCGRQVHAKAIKVGV 353

Query: 649  ESDMFVQSSLVDMYGKCGLLNDARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHAYASH 470
            ESD FVQS L+DMY + GLL DAR   EM       NKK       ++ WNAM+  Y  H
Sbjct: 354  ESDAFVQSGLLDMYAQLGLLRDARTVFEMVG-----NKKG------IVFWNAMIKGYIRH 402

Query: 469  GLCTEAIKLLYEMKAAGIEPQESMLYQARIACGS 368
            GL  EAIKLLY+MK AG+  +E +L + RIA GS
Sbjct: 403  GLSIEAIKLLYQMKEAGLTLREDLLDEVRIASGS 436


>ref|XP_003516509.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790-like
            [Glycine max] gi|734393681|gb|KHN28310.1|
            Pentatricopeptide repeat-containing protein [Glycine
            soja]
          Length = 423

 Score =  251 bits (642), Expect = 1e-63
 Identities = 145/368 (39%), Positives = 208/368 (56%), Gaps = 2/368 (0%)
 Frame = -1

Query: 1444 KSVESTMATDILRLMDSLQLPISADMYATLLKECIESGDVTQGGKVHSHMIHHIXXXXXX 1265
            K  +    +DIL LM++L  P+  D+Y +L+KEC  SGD     ++ +H I         
Sbjct: 74   KKRKGATTSDILHLMEALPFPVPIDIYTSLIKECTVSGDPETAIELATH-ISKSGIKPPL 132

Query: 1264 XXXXXXXXMYASCSQLDTACILFDGMPRRDSISWATIIAGHEENGDSQNVLRLFAQM-RE 1088
                    M+ SC  L+ A  +FD M  RD  +WAT+   + +N D +    +F  M  +
Sbjct: 133  PFLNRILVMFVSCGLLENARHMFDKMRVRDFNTWATLFVAYYDNTDYEEATNVFVNMLTQ 192

Query: 1087 NGLDPTAPLL-VCVLKACVDSGDLRLGEQVHSWFIKKKTMLRNREDIPVVASLINLYSGF 911
             G+    P +  C+L+AC  + ++ LG QVH W +K  T     + + + +SLIN Y  F
Sbjct: 193  LGMMEFPPWIWACLLRACACTVNVPLGMQVHGWLLKLGTC----DHVLLSSSLINFYGRF 248

Query: 910  GCLESARRVFDEMGRHCDGTVVWTILMDGYCKVGSFEEVLNIFKEMGRAGRKKKCFTFPT 731
             CLE A  VFD + RH   T+ WT  +   C+   F EV + FKEMG  G KK CFTF +
Sbjct: 249  TCLEDASVVFDGVSRH--NTLTWTAKIVSGCRERHFSEVFDDFKEMGMRGVKKDCFTFSS 306

Query: 730  VLRACGRLGDDGQVGRQVHAAAIKVGVESDMFVQSSLVDMYGKCGLLNDARKALEMTAAV 551
            VL+ACGR+ +  + G QVH  AIK+G+ SD +VQ SL+ MYG+CGLL DA++  EM    
Sbjct: 307  VLKACGRMLNQERCGEQVHVDAIKLGLVSDHYVQCSLIAMYGRCGLLEDAKRVFEM---- 362

Query: 550  SSFNKKNTNDNNIVICWNAMLHAYASHGLCTEAIKLLYEMKAAGIEPQESMLYQARIACG 371
                   + +   V CWNAML  Y  +GL  EA+K LY+M+AAG++P+ES+L + R+ACG
Sbjct: 363  -------SQEERKVDCWNAMLMGYIQNGLYIEAVKFLYQMQAAGMQPRESLLKKLRMACG 415

Query: 370  S*ISSQIN 347
            S   S +N
Sbjct: 416  SISYSNMN 423


>ref|XP_007158057.1| hypothetical protein PHAVU_002G120500g [Phaseolus vulgaris]
            gi|561031472|gb|ESW30051.1| hypothetical protein
            PHAVU_002G120500g [Phaseolus vulgaris]
          Length = 420

 Score =  249 bits (637), Expect = 4e-63
 Identities = 149/372 (40%), Positives = 210/372 (56%), Gaps = 4/372 (1%)
 Frame = -1

Query: 1450 EIKSVESTMAT--DILRLMDSLQLPISADMYATLLKECIESGDVTQGGKVHSHMIHHIXX 1277
            EIK  +   AT  DIL LMD+L  PI+ D+Y +L+KEC  SGD     ++++H I     
Sbjct: 67   EIKKKKRKEATTLDILHLMDALPFPITIDIYTSLIKECTVSGDPETAIELYTH-ISKSDI 125

Query: 1276 XXXXXXXXXXXXMYASCSQLDTACILFDGMPRRDSISWATIIAGHEENGDSQNVLRLFAQ 1097
                        M+ SC  L+ A  +F+ M  RD  SWAT+   + +N + +    +F  
Sbjct: 126  KPPLPFLNRILIMFVSCGMLENARHMFEKMRVRDFNSWATLFVAYYDNAEYEEATAVFVN 185

Query: 1096 MR-ENGLDPTAPLL-VCVLKACVDSGDLRLGEQVHSWFIKKKTMLRNREDIPVVASLINL 923
            M  + G+    P +  C+L+AC  + ++ LG QVH W +K    L   + + + +SLIN 
Sbjct: 186  MLGQLGMLQFPPWIWACLLRACACTLNVPLGLQVHGWLLK----LGACDHVLLSSSLINF 241

Query: 922  YSGFGCLESARRVFDEMGRHCDGTVVWTILMDGYCKVGSFEEVLNIFKEMGRAGRKKKCF 743
            Y  F CLE A  VF+ + RH   T+ WT  +   C+   F EV   F+EMG  G KK CF
Sbjct: 242  YGRFTCLEDASAVFNGVSRH--NTLTWTAKIVSGCRERHFSEVFGDFREMGMRGVKKDCF 299

Query: 742  TFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESDMFVQSSLVDMYGKCGLLNDARKALEM 563
            TF +VL+ACG++ +  + G QVHA AIK+G+ SD +VQ SL+ MYG+CGLL DA+   EM
Sbjct: 300  TFSSVLKACGKMLNQERCGEQVHADAIKLGLISDHYVQCSLIAMYGRCGLLTDAKDVFEM 359

Query: 562  TAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLCTEAIKLLYEMKAAGIEPQESMLYQAR 383
                       T +   V CWNAML  Y  +G   EA+K LY+M+AAG++P ES+L + R
Sbjct: 360  -----------TREERKVDCWNAMLMGYTQNGFHIEAVKFLYQMQAAGMQPWESLLKKLR 408

Query: 382  IACGS*ISSQIN 347
            IACGS   S +N
Sbjct: 409  IACGSITYSNMN 420


>ref|XP_002519945.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223540991|gb|EEF42549.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 403

 Score =  248 bits (632), Expect = 1e-62
 Identities = 147/379 (38%), Positives = 207/379 (54%), Gaps = 10/379 (2%)
 Frame = -1

Query: 1474 LQHSPSRIEIKSVESTMATDILRLMDSLQLPISADMYATLLKECIESGDVTQGGKVHSHM 1295
            + H P++      +S  ++DI+RLMDSL  PI  D+Y +L+KEC  + D T+   +HSH+
Sbjct: 46   INHLPAK------KSCSSSDIMRLMDSLCHPIPPDIYTSLIKECTLTSDSTEALCLHSHL 99

Query: 1294 IHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMP-RRDSISWATIIAGHEENGDSQN 1118
            I                 M+ SC QLD A  LFD MP ++D ISW  +I G   N   + 
Sbjct: 100  ISQTNLKLTPPLVHRLLLMHVSCGQLDIARNLFDKMPLKKDFISWVIVIVGCFSNSKYEA 159

Query: 1117 VLRLFAQMREN---------GLDPTAPLLVCVLKACVDSGDLRLGEQVHSWFIKKKTMLR 965
             + LF  M             L+    +++C++K C+ S ++ LG+QVH    K    + 
Sbjct: 160  GINLFIDMLLQHSVYDGLMFDLNTWNIIILCIIKCCIYSMNISLGKQVHGILFK----VG 215

Query: 964  NREDIPVVASLINLYSGFGCLESARRVFDEMGRHCDGTVVWTILMDGYCKVGSFEEVLNI 785
               +I    SL++ Y   GCLE    VF+++  H   T  WT  +   C+   F EV+  
Sbjct: 216  LTSEISFNVSLMDFYGKLGCLEDVNSVFNKLDNH--NTATWTAKIVNSCRNQRFYEVIED 273

Query: 784  FKEMGRAGRKKKCFTFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESDMFVQSSLVDMYG 605
            FKEMG AG K+  FT  +VLRAC R+GD G  G+QVH   IK+G+ESD FVQ  L+ MYG
Sbjct: 274  FKEMGEAGIKRNSFTVSSVLRACARMGDGGNCGKQVHVIVIKLGLESDAFVQCGLIAMYG 333

Query: 604  KCGLLNDARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLCTEAIKLLYEMKA 425
            KCG++  A+K  E+       +K NT       CWNA+L AY  + L  EA+KLLY+M+A
Sbjct: 334  KCGMIRKAKKVFELV-----IDKTNT------ACWNALLMAYVRNELFIEAMKLLYQMEA 382

Query: 424  AGIEPQESMLYQARIACGS 368
            A I+  ES+L   RIACG+
Sbjct: 383  AKIQVNESLLDHVRIACGT 401


>ref|XP_010680219.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790
            [Beta vulgaris subsp. vulgaris]
            gi|870857620|gb|KMT09168.1| hypothetical protein
            BVRB_6g132730 [Beta vulgaris subsp. vulgaris]
          Length = 439

 Score =  244 bits (622), Expect = 2e-61
 Identities = 151/398 (37%), Positives = 218/398 (54%), Gaps = 14/398 (3%)
 Frame = -1

Query: 1519 RPACIRPASLHHSPKLQHSPSRIEIKSVESTMATDILRLMDSLQLPISADMYATLLKECI 1340
            +P   +  ++   PK Q     +   +  +T++ D+LRLMD L+L +S D+Y +L  EC 
Sbjct: 46   KPDYYKNTAISAPPKQQIQLKNLSNNNNNNTIS-DVLRLMDGLKLIVSPDIYISLANECT 104

Query: 1339 ESGDVTQGGKVHSH------MIHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMPRR 1178
               D  Q  ++++H      MI  I              M  +C   D A  LFD MP R
Sbjct: 105  RDRDCVQAAQLYTHLKKNTRMITFINSSSGLFLLNRILLMLVTCGCFDVAHQLFDEMPHR 164

Query: 1177 DSISWATIIAGHEENGDSQNVLRLFAQMRENGLDPTAPL-----LVCVLKACVDSGDLRL 1013
            +SIS A +IA   +N   +  L LF +M    +   + +     +V  LKACV   ++ L
Sbjct: 165  NSISLAIVIADLIDNHLYEQALGLFVKMHPCFVYQESDMGLQLVVVSFLKACVHLKEVYL 224

Query: 1012 GEQVHSWFIKKKTMLRNREDIPVVASLINLYSGFGCLESARRVFDEMGRHCDG--TVVWT 839
            G+ VH+W IK    +     + V  +L+  Y   GCL  A RVF +  R  D   TV+WT
Sbjct: 225  GKMVHAWLIK----MGYDRGLFVDTALVEFYGKSGCLREANRVFFDHVRIIDDSDTVLWT 280

Query: 838  ILMDGYCKVGSFEEVLNIFKEMGRAGRKKKCFTFPTVLRACGRL-GDDGQVGRQVHAAAI 662
              +   C+   FEEV+ IF EMG AG +   +TF TVL+ACGR+ GDDG  G+QVHA AI
Sbjct: 281  GAIVNNCRERCFEEVIRIFGEMGEAGVRMNEYTFSTVLKACGRVVGDDGSFGKQVHANAI 340

Query: 661  KVGVESDMFVQSSLVDMYGKCGLLNDARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHA 482
            K+ V++ +FV  +L+DMYG+CGLL DA K          F++K  +      CWNAM+  
Sbjct: 341  KLAVDTRVFVMCALIDMYGRCGLLKDAIKV---------FDRKGYHSKRNGACWNAMIAG 391

Query: 481  YASHGLCTEAIKLLYEMKAAGIEPQESMLYQARIACGS 368
            +  HGL  EAIK++Y+MKAAG+ PQ+SM+ + R+ACG+
Sbjct: 392  FIHHGLYIEAIKMMYQMKAAGLHPQKSMIDELRLACGT 429


>ref|XP_004512307.1| PREDICTED: pentatricopeptide repeat-containing protein At1g31790
            [Cicer arietinum]
          Length = 418

 Score =  244 bits (622), Expect = 2e-61
 Identities = 152/390 (38%), Positives = 217/390 (55%), Gaps = 4/390 (1%)
 Frame = -1

Query: 1525 ASRPACIRPASLHHSPKLQHSPSRIEIKSVESTMATDILRLMDSLQLPISADMYATLLKE 1346
            +S+P  + P    ++ K +++  R      +S   + IL LMD+L  PI  D+Y +L+KE
Sbjct: 46   SSQPLTVTPPRNKNNTKNKNNNKR------KSATTSHILPLMDALHFPIPIDIYTSLVKE 99

Query: 1345 CIESGDVTQGGKVHSHMIHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMPRRDSIS 1166
            C  SGD     ++HSH I                 M+ SC  L +A  +FD MP R+  S
Sbjct: 100  CTLSGDPETATELHSH-ITRSGIGPPLTLLNRILIMFVSCGLLQSARHVFDEMPVRNFHS 158

Query: 1165 WATIIAGHEENGDSQNVLRLFAQM-RENGLD--PTAPLL-VCVLKACVDSGDLRLGEQVH 998
            WA +   + EN D +N + +F +M R+ G+   P  P    C+L AC  + ++ LG QVH
Sbjct: 159  WAILFVAYYENSDYENAIDVFMRMLRQLGVMEFPFLPWFWSCLLTACACTVNVPLGMQVH 218

Query: 997  SWFIKKKTMLRNREDIPVVASLINLYSGFGCLESARRVFDEMGRHCDGTVVWTILMDGYC 818
                   T L   + + + +SLI  Y  F CLE A  VF+ + RH   T+ WT  +   C
Sbjct: 219  G----SLTKLGACDHVLISSSLIRFYGRFKCLEDANVVFNRVSRH--NTLTWTAKIVSGC 272

Query: 817  KVGSFEEVLNIFKEMGRAGRKKKCFTFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESDM 638
            +   F +VL  FKEMGR G KK  FTF +VL+ACGR+ + G  G QVHA +IK+G++SD 
Sbjct: 273  RERHFTQVLGDFKEMGRVGIKKDSFTFSSVLKACGRMQNYGSCGEQVHADSIKLGLDSDN 332

Query: 637  FVQSSLVDMYGKCGLLNDARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLCT 458
            +VQ SL+ MYG+ GLL DA+   E T      N++N +       WNAML  Y  +GL  
Sbjct: 333  YVQCSLIAMYGRSGLLRDAKLVFETT-----LNERNVDS------WNAMLMGYIQNGLYI 381

Query: 457  EAIKLLYEMKAAGIEPQESMLYQARIACGS 368
            +A+K +Y+MKAAG+ P ES+L + RIACGS
Sbjct: 382  KAVKFVYQMKAAGVHPHESLLEKLRIACGS 411


>emb|CDP07175.1| unnamed protein product [Coffea canephora]
          Length = 430

 Score =  243 bits (619), Expect = 5e-61
 Identities = 138/382 (36%), Positives = 211/382 (55%), Gaps = 9/382 (2%)
 Frame = -1

Query: 1486 HSPKLQHSPSRIEIKSVESTMATDILRLMDSLQLPISADMYATLLKECIESGDVTQGGKV 1307
            H P  +   +  + K    +  +D+L L+D L++P+S D+Y + + EC +SGD     ++
Sbjct: 59   HKPIQKTHKNSNDCKPSNRSTVSDVLGLLDCLKIPVSLDLYTSFIDECTKSGDPLLAIEL 118

Query: 1306 HSHMIHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMPRRDSISWATIIAGHEENGD 1127
            H+H I                 MY SC+ +  A  LFD M  R S +WA ++AG+ ENGD
Sbjct: 119  HNH-IKTSCLRPSLSIFNRLLLMYVSCNLIGYARELFDKMTVRSSCTWAVMVAGYFENGD 177

Query: 1126 SQNVLRLFAQMR-------ENGLDP--TAPLLVCVLKACVDSGDLRLGEQVHSWFIKKKT 974
               V+ LF +MR       +  +D    + ++VCVLKAC  + ++ LG+QVH+W +K   
Sbjct: 178  YGEVIDLFLEMRCSERAKVDGDMDDIVASAIVVCVLKACAKTVNVELGKQVHAWVVK--- 234

Query: 973  MLRNREDIPVVASLINLYSGFGCLESARRVFDEMGRHCDGTVVWTILMDGYCKVGSFEEV 794
             +   E++     L++ Y   GCLE + +VFD++       V+WT  +  +C    F+E 
Sbjct: 235  -MGYGENLVFSGCLMSFYGKAGCLEGSDQVFDQVPYR--NKVIWTTKIVNHCYEEQFDEA 291

Query: 793  LNIFKEMGRAGRKKKCFTFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESDMFVQSSLVD 614
             ++FK+MGR G KK  +TF +VL+AC  + D    G+QVHA  +K+G+E +  VQ  LV+
Sbjct: 292  FDVFKQMGREGVKKNSYTFSSVLKACASMRDGRCCGQQVHANVVKLGLELNEHVQCGLVN 351

Query: 613  MYGKCGLLNDARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLCTEAIKLLYE 434
            MYGK GL+ DA+K             K   +N  V CWNAML  Y   G   EA +++ +
Sbjct: 352  MYGKGGLIKDAQKVF-----------KICGNNRNVACWNAMLTGYIQQGFGIEAFRIICD 400

Query: 433  MKAAGIEPQESMLYQARIACGS 368
            MKAAG++PQES+L + R  CGS
Sbjct: 401  MKAAGLQPQESLLNEVRFICGS 422


>gb|EYU37498.1| hypothetical protein MIMGU_mgv1a021373mg, partial [Erythranthe
            guttata]
          Length = 345

 Score =  239 bits (609), Expect = 7e-60
 Identities = 145/352 (41%), Positives = 191/352 (54%), Gaps = 10/352 (2%)
 Frame = -1

Query: 1393 LQLPISADMYATLLKECIESGDVTQGGKVHSHMIHHIXXXXXXXXXXXXXXMYASCSQLD 1214
            L+LPI  D+Y +L+KEC E GD  +  ++H HM                  MY S   LD
Sbjct: 1    LKLPIPPDIYTSLIKECTELGDPLKSIELHEHM-RRSGFRFTLPLLNRLLLMYVSSGCLD 59

Query: 1213 TACILFDGMPRRDSISWATIIAGHEENGDSQNVLRLFAQMREN------GLD----PTAP 1064
             A  LFD M  RD  SWA +IAG  ENG+    + LF +M         GLD      + 
Sbjct: 60   RARQLFDQMFLRDFNSWAVLIAGFVENGEHDEAINLFVEMLNRQDMGNVGLDRMGFSVSG 119

Query: 1063 LLVCVLKACVDSGDLRLGEQVHSWFIKKKTMLRNREDIPVVASLINLYSGFGCLESARRV 884
            +LVCVLKAC+ + D  LG QVH W  K    +   E   +   LIN Y    C E A+ V
Sbjct: 120  ILVCVLKACLFTSDFELGTQVHGWLWK----MGFSESASLSCFLINFYGRLDCFEGAQTV 175

Query: 883  FDEMGRHCDGTVVWTILMDGYCKVGSFEEVLNIFKEMGRAGRKKKCFTFPTVLRACGRLG 704
            FD +      T VWT  +  +C  G+FEE +++FKEMGR G ++  +TF TVL+AC ++G
Sbjct: 176  FDHVRN--PNTAVWTSRIVSFCSNGNFEEAVSVFKEMGREGVRENSYTFSTVLKACRKMG 233

Query: 703  DDGQVGRQVHAAAIKVGVESDMFVQSSLVDMYGKCGLLNDARKALEMTAAVSSFNKKNTN 524
            D  + G+QVHA +IK G+ESD +VQ +LVD YGKCG LNDA +  EM          + +
Sbjct: 234  DI-RCGQQVHANSIKSGLESDSYVQCALVDFYGKCGFLNDATRVFEM----------DIS 282

Query: 523  DNNIVICWNAMLHAYASHGLCTEAIKLLYEMKAAGIEPQESMLYQARIACGS 368
              N   C NAML  Y  HGLC EA ++L +MK +G  P ES+  +    CGS
Sbjct: 283  KRNDASC-NAMLANYVRHGLCIEANEILRQMKMSGSRPCESVFNEVSFVCGS 333


>ref|XP_003612457.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
            gi|355513792|gb|AES95415.1| PPR repeat protein [Medicago
            truncatula]
          Length = 418

 Score =  238 bits (608), Expect = 9e-60
 Identities = 148/387 (38%), Positives = 210/387 (54%), Gaps = 4/387 (1%)
 Frame = -1

Query: 1495 SLHHSPKLQHSPSRIEIKSVESTMATDILRLMDSLQLPISADMYATLLKECIESGDVTQG 1316
            SL H      +P +   +  +    + IL LMD+L  PI+ D+Y +L+KEC  S D    
Sbjct: 50   SLIHPSSQPITPPKKSKRRRKCDTTSHILPLMDALHFPITIDIYTSLVKECTLSTDPETA 109

Query: 1315 GKVHSHMIHHIXXXXXXXXXXXXXXMYASCSQLDTACILFDGMPRRDSISWATIIAGHEE 1136
             ++H+ +I                 M+ SC  L+ A  +FD M  RD  SWAT+   + E
Sbjct: 110  IELHTQIITR-GIELPLTLLNRILIMFVSCGLLENARRVFDVMSVRDFHSWATLFVSYYE 168

Query: 1135 NGDSQNVLRLFA----QMRENGLDPTAPLLVCVLKACVDSGDLRLGEQVHSWFIKKKTML 968
            NG+ +N + +F     Q+   G      +  C+LKAC  + ++ LG QVH   +K    L
Sbjct: 169  NGEYENAIDVFVSMLCQLDVMGFSFPPWIWSCLLKACACTMNVPLGMQVHGCLLK----L 224

Query: 967  RNREDIPVVASLINLYSGFGCLESARRVFDEMGRHCDGTVVWTILMDGYCKVGSFEEVLN 788
               + + + +SLI  Y  F CLE A  VF+ + RH   T+ WT  +   C+   F E L 
Sbjct: 225  GACDHVLISSSLIRFYGRFKCLEDANMVFNRVSRH--NTLTWTAKIVSSCRERHFSEALG 282

Query: 787  IFKEMGRAGRKKKCFTFPTVLRACGRLGDDGQVGRQVHAAAIKVGVESDMFVQSSLVDMY 608
             FK+MGR G KK  FTF +VL+ACGR+ + G  G QVHA AIK+G++SD +VQ SL+ MY
Sbjct: 283  DFKKMGRVGVKKDSFTFSSVLKACGRMQNRGSCGEQVHADAIKLGLDSDSYVQCSLIAMY 342

Query: 607  GKCGLLNDARKALEMTAAVSSFNKKNTNDNNIVICWNAMLHAYASHGLCTEAIKLLYEMK 428
            G+ GLL DA    EMT      N++N +        NAML  Y  +GL  EA+K +Y+MK
Sbjct: 343  GRSGLLRDAELVFEMTR-----NERNVDS------LNAMLMGYIQNGLYIEAVKFVYQMK 391

Query: 427  AAGIEPQESMLYQARIACGS*ISSQIN 347
            AAG++P E +L + RIACGS   S +N
Sbjct: 392  AAGVQPHEPLLEKLRIACGSSNFSSMN 418