BLASTX nr result

ID: Ephedra26_contig00002206 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00002206
         (2339 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006840662.1| hypothetical protein AMTR_s00096p00019400 [A...   320   2e-84
ref|XP_004136469.1| PREDICTED: pentatricopeptide repeat-containi...   309   3e-81
ref|XP_004292965.1| PREDICTED: pentatricopeptide repeat-containi...   309   4e-81
ref|XP_002516618.1| pentatricopeptide repeat-containing protein,...   308   7e-81
ref|XP_002282419.1| PREDICTED: pentatricopeptide repeat-containi...   305   4e-80
ref|XP_004498635.1| PREDICTED: pentatricopeptide repeat-containi...   304   1e-79
gb|EMJ23233.1| hypothetical protein PRUPE_ppa003040mg [Prunus pe...   303   3e-79
ref|XP_006353639.1| PREDICTED: pentatricopeptide repeat-containi...   301   6e-79
ref|XP_004241813.1| PREDICTED: pentatricopeptide repeat-containi...   296   2e-77
ref|NP_196692.1| pentatricopeptide repeat-containing protein [Ar...   292   5e-76
ref|XP_006399632.1| hypothetical protein EUTSA_v10013015mg [Eutr...   291   9e-76
gb|ESW33251.1| hypothetical protein PHAVU_001G055200g [Phaseolus...   289   3e-75
ref|XP_002871469.1| pentatricopeptide repeat-containing protein ...   289   4e-75
gb|EOY31375.1| Pentatricopeptide repeat (PPR) superfamily protei...   288   6e-75
gb|EOY31373.1| Pentatricopeptide repeat superfamily protein isof...   288   6e-75
gb|EOY31372.1| Pentatricopeptide repeat superfamily protein isof...   288   6e-75
ref|XP_003549241.1| PREDICTED: pentatricopeptide repeat-containi...   287   1e-74
gb|EXC20787.1| hypothetical protein L484_007369 [Morus notabilis]     287   2e-74
ref|XP_002308636.1| hypothetical protein POPTR_0006s26360g [Popu...   286   3e-74
ref|XP_006450554.1| hypothetical protein CICLE_v10008018mg [Citr...   285   6e-74

>ref|XP_006840662.1| hypothetical protein AMTR_s00096p00019400 [Amborella trichopoda]
            gi|548842407|gb|ERN02337.1| hypothetical protein
            AMTR_s00096p00019400 [Amborella trichopoda]
          Length = 513

 Score =  320 bits (820), Expect = 2e-84
 Identities = 167/436 (38%), Positives = 265/436 (60%)
 Frame = +3

Query: 174  ENALENSDIIIDSQSLENILENNKYRLEFLLRIFQHATQNQGFIPSSRACNLLINSMSKY 353
            +++L+   + I    +  + +N  +       +F  A +   +  ++   N ++N   + 
Sbjct: 11   QSSLDELQLKISPDLINRVFQNCIFSSNSAYALFVWAQKQPNYSHTTAVLNSMVNLFGRM 70

Query: 354  REFDLAWMLIKEMKEHGLLNLDTFVILFRRYARAGMNNAALRSFDLMEYFGITRDLEALT 533
            REFD AW+LI  +K   L+N  TF IL RRYARAG+  AALR+ DL+ +FG+T   ++L 
Sbjct: 71   REFDSAWLLIDSLKP--LVNAQTFTILLRRYARAGLPQAALRTLDLIPHFGLTLTPDSLN 128

Query: 534  AFMKALCKEKKIQFASLVFSMRKDRFGSNTEMYNSLISSWCRIHGIKKAEVIFGQMVVSG 713
              + ALCKE +++ A+  F  RK +       YN L+  W R+  ++KAE ++  M V  
Sbjct: 129  NILDALCKEGQVREAARYFEARKSKLAL-ASTYNILLHGWFRLRNLRKAERLWEDMQVRN 187

Query: 714  IAPDVVTFASLVKGYCSAKRLEDAINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAH 893
            +   VVT+ +L++GYC  +R+E A+ LL+ MK +G +PN + +NP+V+ALGEAGR  +A 
Sbjct: 188  VPASVVTYGTLIEGYCRMRRVERALELLEDMKFRGIEPNVITYNPIVDALGEAGRHGDAM 247

Query: 894  LMLDEMAIAGCQPNISTFNSLIKAYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYF 1073
             M D +   G  P IST+NSL+K + ++ D+  A+ + KMM+ + CLPT TTYN F ++F
Sbjct: 248  AMTDRIFTLGLTPTISTYNSLVKGFSKHGDMVGASKVLKMMIGRGCLPTPTTYNYFFKFF 307

Query: 1074 SKHGNIKEGMELYMKLAESGCSPDSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHL 1253
            S+ G I EGM LY KL +SG S D L+YQ LI+MLCEK  +E+ LQV +D+ + G +  L
Sbjct: 308  SRAGKIDEGMNLYTKLIKSGYSLDRLSYQLLIKMLCEKGRLEMTLQVIEDMHSKGYDSDL 367

Query: 1254 EAFTILIDTLCKEGRLVDAFQLFERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEM 1433
               ++L+  LC  G+L +A + FE M+ +GI P+  T+++L D +      E+ + L EM
Sbjct: 368  ATSSMLVHLLCNLGKLEEACEEFEGMINKGIVPQYLTFQMLVDEVRRLGLVERARRLTEM 427

Query: 1434 MRALNPYFRRRPPSSK 1481
            M ++ P+ ++ P S K
Sbjct: 428  MDSV-PHSKKLPNSYK 442


>ref|XP_004136469.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Cucumis sativus]
            gi|449503560|ref|XP_004162063.1| PREDICTED:
            pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Cucumis sativus]
          Length = 615

 Score =  309 bits (792), Expect = 3e-81
 Identities = 182/466 (39%), Positives = 271/466 (58%), Gaps = 14/466 (3%)
 Frame = +3

Query: 114  NPNDNKTACSVKIFNSSEDA---ENALENSDIIIDSQSLENILENNKYRLEFLLRIFQHA 284
            +PND  T  S+    S       E+AL+ + I+  S  LE + ++     +FL  +F  A
Sbjct: 85   SPNDLSTISSILSDRSVRPGAALEDALDRTGIVPSSSLLEAVFDHFDSSPKFLHSLFLWA 144

Query: 285  TQNQGFIPSSRACNLLINSMSKYREFDLAWMLIKEMKEHG----LLNLDTFVILFRRYAR 452
             +  GF PS+   N LIN ++K REFD AW LI      G    L++++ FVIL RRYAR
Sbjct: 145  AKKSGFRPSAALFNRLINVLAKSREFDSAWSLITSRLRGGEESFLVSVEVFVILIRRYAR 204

Query: 453  AGMNNAALRSFDL---MEYFGITRDLEALTAFMKALCKEKKIQFASLVFSMRKDRFGSNT 623
            AGM   A+R+++    +E    T         + +LCKE  ++ AS  F+ RK   GS+ 
Sbjct: 205  AGMVQPAIRTYEFACNLETISGTGSEGLFEILLDSLCKEGHVRVASEYFN-RKREMGSSF 263

Query: 624  E----MYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLEDAIN 791
            E     YN LI+ W R   +K A+ ++ +M  + I+P VVT+ +L++GYC  + +E AI 
Sbjct: 264  EPSIRAYNILINGWFRSRKLKHAQRLWFEMKKNKISPTVVTYGTLIEGYCRMRSVEIAIE 323

Query: 792  LLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIKAYC 971
            L+  M+ +G +PNA+V+NP+V+ALGEAGR  EA  M++   +    P IST+NSL+K YC
Sbjct: 324  LVDEMRREGIEPNAIVYNPIVDALGEAGRFKEALGMMERFMVLEQGPTISTYNSLVKGYC 383

Query: 972  ENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSPDSL 1151
            +  DL  A+ I KMM+ +   PT TTYN F R+FSK+G I+E M LY K+ ESG +PD L
Sbjct: 384  KAGDLSGASKILKMMIGRGFTPTPTTYNYFFRFFSKYGKIEESMSLYNKMIESGYAPDKL 443

Query: 1152 TYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLFERM 1331
            TY  L++MLCE+  + LA+QV  +++  G ++ L   T+L+  LCK  +  +AF  FE M
Sbjct: 444  TYHLLLKMLCEEERLNLAVQVCNEMKARGFDMDLATSTMLMHLLCKMHKFEEAFAEFEHM 503

Query: 1332 VQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRALNPYFRRRP 1469
            + RGI P+  T+  L D   +    +    L EMM ++ P+  + P
Sbjct: 504  IHRGIVPQYLTFCRLHDEFMKRGLTKMASKLQEMMSSV-PHSEKLP 548


>ref|XP_004292965.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 582

 Score =  309 bits (791), Expect = 4e-81
 Identities = 182/530 (34%), Positives = 293/530 (55%), Gaps = 44/530 (8%)
 Frame = +3

Query: 102  PKLPNPN------DNKTACSVKIFNSSE-----DAENALENSDIIIDSQSLENILENNKY 248
            P  PNPN      +N  +   K+             +AL+   I      ++ + ++   
Sbjct: 41   PPNPNPNFDPKFSENDFSTITKLLTDPSIFPGASLRSALDRVGIDPSPSLVQAVFDHFDS 100

Query: 249  RLEFLLRIFQHATQNQGFIPSSRACNLLINSMSKYREFDLAWMLIKEM----KEHGLLNL 416
              + L  +F  A +  GF  S +    +IN ++K REF+ AW +I +     KE GL+++
Sbjct: 101  SPKLLHTLFVWAEEQPGFRCSVKLFTSVINVLAKAREFESAWSMILDRIGGDKEAGLVSV 160

Query: 417  DTFVILFRRYARAGMNNAALRSFD----LMEYFGITRDLEALTAFMKALCKEKKIQFASL 584
            D FVI+ RRYARAG   +A+R+F+    L  +     ++      + +LCKE  ++ A+ 
Sbjct: 161  DAFVIMIRRYARAGQPQSAIRAFEFATNLDSFLSSESEMSLFEILLDSLCKEGLVRVATE 220

Query: 585  VFSMRKDRFGS---NTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKG 755
             F  ++        +  +YN L++ W R   +KKAE ++ +M   G+ P VVT+ +LV+G
Sbjct: 221  YFDGKRKSHRDWIPSVRVYNILLNGWFRSRKLKKAERLWVEMKSDGVKPSVVTYGTLVEG 280

Query: 756  YCSAKRLEDAINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPN 935
            YC  +R E A+ L+  M+ +G +PNA+VFNP+++ALGEAGR  EA  M++  ++    P 
Sbjct: 281  YCRMRRPEIAMELVGEMRREGVEPNAIVFNPIIDALGEAGRFKEAWGMMERFSVLESGPT 340

Query: 936  ISTFNSLIKAYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYM 1115
            IST+NSL+K YC+  +L +A+ I KMM+ +  +PT  TYN F RYFSK G I+EGM LY 
Sbjct: 341  ISTYNSLVKGYCKAGNLVEASRILKMMISRGIVPTPATYNYFFRYFSKSGKIEEGMNLYT 400

Query: 1116 KLAESGCSPDSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEG 1295
            K+ ESG +PD LT+  L++MLCE+  ++LA+QV K++ T GC++ L   T+LI  LCK  
Sbjct: 401  KMIESGYTPDRLTFHLLLKMLCEEGRLDLAVQVSKEMRTRGCDMDLATSTMLIHLLCKMN 460

Query: 1296 RLVDAFQLFERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRAL-------NPY 1454
            +  +A   FE M+++G+ P+  T++ ++D L ++   E  + L  +M ++       N Y
Sbjct: 461  KFKEALSEFEDMIRKGLVPQYLTFQNMNDELRKQGMTEMARKLCALMSSVPHSTKLPNTY 520

Query: 1455 FRRRPPS---SKSFIK------------DDTKEPVKYIAASNIFEKKKNR 1559
             + R  S    KS IK             D +E VK+ ++    E + NR
Sbjct: 521  VKDRDESHERRKSIIKKAEAMSKVLKTCSDPRELVKHRSSPESVESRANR 570


>ref|XP_002516618.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223544438|gb|EEF45959.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 577

 Score =  308 bits (789), Expect = 7e-81
 Identities = 171/467 (36%), Positives = 270/467 (57%), Gaps = 10/467 (2%)
 Frame = +3

Query: 75   TPPLIFKFQPKLPNPN----DNKTACSVKIFNSSEDAENALENSDIIIDSQSLENILENN 242
            TPPL  K      NPN    D  T C++    + +  E AL+ + I  ++  L  + ++ 
Sbjct: 40   TPPLPEKLPSPTANPNYSHSDFSTLCNLLSDPNLKPLETALDQTGIKPETSLLNAVFDHF 99

Query: 243  KYRLEFLLRIFQHATQNQGFIPSSRACNLLINSMSKYREFDLAWMLIKEMKEHGLLNLDT 422
                + L  +F  A +   F  S+   N +IN++ K +EFD AW L+  +   GL++ DT
Sbjct: 100  NSSPKLLHSLFVWADKQPEFESSTTLFNSVINALGKMKEFDSAWCLV--LDRTGLVSSDT 157

Query: 423  FVILFRRYARAGMNNAALRSFDLMEYFGITRDLEA---LTAFMKALCKEKKIQFASLVFS 593
            F IL RRY RAGM  +A+R+F+         D      L   + +LCKE  ++ A   F 
Sbjct: 158  FAILIRRYTRAGMPQSAIRTFEYAISLDFICDYNCDALLEILLDSLCKEGHVRVAKEYFD 217

Query: 594  MRKDRFGS---NTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCS 764
             RK        +  +YN +++ W R   +K AE ++ +M  + ++P VVT+ +LV+GYC 
Sbjct: 218  SRKQLDSCWIPHVRIYNIMLNGWFRSRKLKHAERLWLEMKKNNVSPSVVTYGTLVEGYCR 277

Query: 765  AKRLEDAINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNIST 944
             +R+E AI L+  M+ +G +PNALV+NP+++AL E GR  E   M++    +   P IST
Sbjct: 278  MRRVERAIELVDVMRKEGIEPNALVYNPIIDALAEEGRFKEVSGMMEYFLQSESGPTIST 337

Query: 945  FNSLIKAYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLA 1124
            +NSL+K YC+  D   A+ + KMM+ +  +PT TTYN F R+FSK G I+EGM LY K+ 
Sbjct: 338  YNSLVKGYCKAKDPVGASKVLKMMISRGFVPTPTTYNYFFRHFSKFGMIEEGMNLYTKMI 397

Query: 1125 ESGCSPDSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRLV 1304
            ESG +PD LT+  L++MLCE+  ++LA+Q+ K++ + GC++ L   T+LI   C+  R  
Sbjct: 398  ESGYTPDRLTFHLLLKMLCEEERLDLAVQISKEMRSRGCDMDLATSTMLIHLFCRMHRFE 457

Query: 1305 DAFQLFERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRAL 1445
            +AF  FE M+Q+GI P+  T++ L+D L +    E+ + L +MM ++
Sbjct: 458  EAFMEFEDMIQKGIVPQYLTFQRLNDELRKRGMVERARKLSDMMSSV 504


>ref|XP_002282419.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial [Vitis vinifera]
            gi|296081989|emb|CBI20994.3| unnamed protein product
            [Vitis vinifera]
          Length = 597

 Score =  305 bits (782), Expect = 4e-80
 Identities = 188/534 (35%), Positives = 296/534 (55%), Gaps = 22/534 (4%)
 Frame = +3

Query: 111  PNPN----DNKTACSV---KIFNSSEDAENALENSDIIIDSQSLENILENNKYRLEFLLR 269
            PNPN    D  T C++      +S    E+AL  + I   S  L+ I  +     + L  
Sbjct: 63   PNPNFSQSDFSTICALLTDPALSSGAPLEDALNRTGIKPCSGLLQAIFSHFDASPKPLFT 122

Query: 270  IFQHATQNQGFIPSSRACNLLINSMSKYREFDLAWMLIKEMKEHG----LLNLDTFVILF 437
            +F+ A +  GF  S    N +I+ ++K R FD AW+L+ +  E G    L++ +TF +L 
Sbjct: 123  LFRWAMKQPGFESSMTLFNSMIDVLAKSRAFDSAWLLVLDRIEGGEEPELVSSNTFAVLI 182

Query: 438  RRYARAGMNNAALRSFDLMEYFGITRDLEALTAFMK----ALCKEKKIQFASLVFSMRKD 605
            RRYARAGM  +A+R+F+        RD ++  +  K    +LCKE  ++ AS  F  ++ 
Sbjct: 183  RRYARAGMTLSAIRTFEFAFSLDSIRDRDSEWSLFKILLDSLCKEGHVRVASEYFDQQRG 242

Query: 606  RFGS---NTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRL 776
               S   +  +YN L++ W R   +K+AE ++  M    + P VVT+ +LV+GYC  +R 
Sbjct: 243  LDPSWVPSIRVYNVLLNGWFRSRKLKRAEQLWRTMKRENVKPTVVTYGTLVEGYCRMRRS 302

Query: 777  EDAINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSL 956
            E AI L+  M+ KG +PN +V+NP++++L EAGR  EA  M++   ++   P IST+NSL
Sbjct: 303  EKAIELVGEMRGKGIEPNVIVYNPIIDSLAEAGRFKEAMGMMERCLVSETGPTISTYNSL 362

Query: 957  IKAYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAESGC 1136
            +K +C+  DL  A+ + KMM+ +   PT TTYN F RYFS+ G  +EGM LY K+ ESG 
Sbjct: 363  VKGFCKAGDLVGASKVLKMMISRGFDPTLTTYNYFFRYFSRCGKTEEGMNLYTKMIESGH 422

Query: 1137 SPDSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVDAFQ 1316
            +PD LTY  LI+M+CE+  ++LA+QV K++   GC+L L   T+L+  LCK  RL +AF 
Sbjct: 423  TPDRLTYHLLIKMMCEEERLDLAVQVSKEMRARGCDLDLATSTMLVHLLCKMHRLEEAFA 482

Query: 1317 LFERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRALNPYFRRRPPSSKSFIKD 1496
             FE M++RGI P+  T++ +++ L +    E  + L +MM ++       P SSK     
Sbjct: 483  EFEDMIRRGIVPQYLTFERMNNALRKRGLTEMARKLCDMMASV-------PHSSKL---- 531

Query: 1497 DTKEPVKYIAASNIFEKKK----NRRRHLSYIKEEVDDGSDIVEVANRFTKTLM 1646
                P  Y    +    +K     R   +S I +  +D  ++V+  + F  T++
Sbjct: 532  ----PNTYSGDGDASRARKTSIIQRAEAMSDILKTCNDPRELVKRRSSFENTVL 581


>ref|XP_004498635.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Cicer arietinum]
          Length = 596

 Score =  304 bits (779), Expect = 1e-79
 Identities = 181/528 (34%), Positives = 295/528 (55%), Gaps = 23/528 (4%)
 Frame = +3

Query: 102  PKLPNPNDNKTACSVKIFNSS----EDAENALENSDIIIDSQSLENILENNKYRLEFLLR 269
            P  P P+ N +  S    N S          L  + I  DS  L  + ++     + L  
Sbjct: 62   PNTPTPDSNFSLISTLFTNPSISPGSQLHAQLNRTGIKPDSPLLRAVFDHFASSPKLLHS 121

Query: 270  IFQHATQNQGFIPSSRACNLLINSMSKYREFDLAWMLI-------KEMKEHGLLNLDTFV 428
            +F  A +  GF P     + ++N+++K +EFD AW L+       +E KE  L+++ TF 
Sbjct: 122  LFLWADKQPGFKPDPTLFDSMVNALAKIKEFDSAWTLVLDRIHREEEEKEDKLVSIGTFA 181

Query: 429  ILFRRYARAGMNNAALRSFDLM-EYFGITRDLEALTAF---MKALCKEKKIQFASLVFSM 596
            IL RRYARAGM+ AA+R+F+   +   I   +  ++ F   + +LCKE  ++ AS  F  
Sbjct: 182  ILIRRYARAGMHEAAIRTFEFAKDKKSIVDSMSEMSLFGILIDSLCKEGSVREASEYFLR 241

Query: 597  RKDR---FGSNTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSA 767
            RK+    +  +T +YN +++ W R   +K AE ++ +M    + P VVT+ +LV+GYC  
Sbjct: 242  RKETDLGWVPSTRVYNIMLNGWFRARKLKHAERLWEEMKKENVKPSVVTYGTLVEGYCRM 301

Query: 768  KRLEDAINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTF 947
            +R+E A+ ++  M  +G + NA+V+NP+++AL EAGR  EA  M++   +    P +ST+
Sbjct: 302  RRVEKALEMVGEMTKEGIEANAIVYNPIIDALAEAGRFKEALGMMERFHVLQIGPTLSTY 361

Query: 948  NSLIKAYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAE 1127
            NSL+K +C+  DL+ A+ I K M+ +  LP  TTYN F RYFS+ G I+EGM LY K+ E
Sbjct: 362  NSLVKGFCKAGDLEGASKILKKMISRGFLPIPTTYNYFFRYFSRCGKIEEGMNLYTKMIE 421

Query: 1128 SGCSPDSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVD 1307
            SG +PD LTY  +++MLCE+  ++LA+QV K++  +G ++ L   T+LI  LCK  RL +
Sbjct: 422  SGHTPDRLTYHLVLKMLCEEERLDLAVQVSKEMRHNGYDMDLATSTMLIHLLCKMHRLEE 481

Query: 1308 AFQLFERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRALNPYFRRRPPSSKSF 1487
            AF  FE M++RGI P+  T++ L+  L ++   E +Q L  +M   N       P++   
Sbjct: 482  AFAEFEDMIRRGIVPQYLTFQKLNVELKKQGMTEMSQKLCHLMS--NVPHSTNLPNTYGE 539

Query: 1488 IKDDTKEPVKYI-----AASNIFEKKKNRRRHLSYIKEEVDDGSDIVE 1616
            ++D+     K I     A S++ +  K   +  S  + +V   + ++E
Sbjct: 540  VRDNAHAHRKSIIQKAQAVSDLLKDPKELDKFRSSSENDVSIANCLIE 587


>gb|EMJ23233.1| hypothetical protein PRUPE_ppa003040mg [Prunus persica]
          Length = 609

 Score =  303 bits (775), Expect = 3e-79
 Identities = 178/515 (34%), Positives = 285/515 (55%), Gaps = 45/515 (8%)
 Frame = +3

Query: 111  PNPNDNKTACSVKIFNS------------SEDAENALENSDIIIDSQSLENILENNKYRL 254
            PNPN +    S   F++                ++AL+ + I      L+ + ++     
Sbjct: 70   PNPNSSGPNFSQNDFSTIANVLADPSISPGSSLQSALDRTGIEPGPCLLQAVFDHFDSSP 129

Query: 255  EFLLRIFQHATQNQGFIPSSRACNLLINSMSKYREFDLAWMLIKEM----KEHGLLNLDT 422
            + L  +F  A +  GF  S+     +IN ++K REF+ AW LI       +E GL+++DT
Sbjct: 130  KLLHTLFLWAEKRPGFRSSATLFGCMINVLAKSREFESAWSLILNRIGGDEEPGLVSVDT 189

Query: 423  FVILFRRYARAGMNNAALRSFD----LMEYFGITRDLEALTAFMKALCKEKKIQFASLVF 590
            FVI+ RRY+RAGM+ +A+R+F+    L  +     ++      + +LCKE  ++ AS  F
Sbjct: 190  FVIMIRRYSRAGMSQSAIRTFEFASNLDSFLNSESEMSLFEVLLDSLCKEGLVRVASEYF 249

Query: 591  SMRKDRFGS---NTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYC 761
             M++        +  +YN L++ W R   +K+AE ++ +M    + P VVT+ +L++GYC
Sbjct: 250  DMKRKLHPDWIPSVRVYNILLNGWFRSRKLKRAERLWAEMKRDNVKPSVVTYGTLIEGYC 309

Query: 762  SAKRLEDAINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNIS 941
              +R E AI L+  M+++G +PNA+V+N +++ALGEAG+  EA  M++   +    P IS
Sbjct: 310  RMRRAEIAIELVSEMRSEGIEPNAIVYNAIIDALGEAGKFKEALGMMEHFLVLESGPTIS 369

Query: 942  TFNSLIKAYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKL 1121
            T+NSL K +C+  DL  A+ I KMM+ K C+PT TTYN F RYFSK G I+EGM LY K+
Sbjct: 370  TYNSLAKGFCKAGDLVGASKILKMMISKGCVPTPTTYNYFFRYFSKFGKIEEGMNLYTKM 429

Query: 1122 AESGCSPDSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRL 1301
             ESG +PD LT+  L++MLC++  + LA+QV K++ + G ++ L   T+LI  LC   + 
Sbjct: 430  IESGYTPDRLTFHLLLKMLCDEGRLGLAVQVSKEMRSRGLDMDLATSTMLIHLLCNVHKF 489

Query: 1302 VDAFQLFERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRAL-------NPYFR 1460
             +AF  FE M++RG+ P+  T++ ++  L ++   E    +  MM ++       N Y R
Sbjct: 490  KEAFAEFEDMIRRGLVPQYLTFQRMNVELRKQGMTEMAHKMCNMMSSVPHSTNLPNTYVR 549

Query: 1461 RRPPS---SKSFIK------------DDTKEPVKY 1520
             R  S    KS I+             D +E VKY
Sbjct: 550  ERDASHARRKSIIQKAEAMSDLLKTCSDPRELVKY 584


>ref|XP_006353639.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Solanum tuberosum]
          Length = 604

 Score =  301 bits (772), Expect = 6e-79
 Identities = 176/478 (36%), Positives = 273/478 (57%), Gaps = 12/478 (2%)
 Frame = +3

Query: 99   QPKLPN--PNDNKTACSV---KIFNSSEDAENALENSDIIIDSQSLENILENNKYRLEFL 263
            QP+ PN  P D  T C +    I  +    ENAL+ + + ++      +  +     + L
Sbjct: 72   QPETPNYCPTDFTTLCEILRDPIIPAGPVLENALDRAGVEVNECMFLQLFNHFDSSPKPL 131

Query: 264  LRIFQHATQNQGFIPSSRACNLLINSMSKYREFDLAWMLIKE-MKEHGLLNLDTFVILFR 440
              ++  A + + F  S    N ++N++ K REFD AW LI + +      NLDTF I+ R
Sbjct: 132  FTLYLWAEKKEWFKFSLPVFNAVVNALGKEREFDSAWNLILDRLNSTERPNLDTFAIMIR 191

Query: 441  RYARAGMNNAALRSFDL---MEYFGITRDLEALTAFMKALCKEKKIQFASLVFSMRKDR- 608
            RYARAGM   A+R+++    +E   +  +       + +LCKE  I+ AS  F  RK + 
Sbjct: 192  RYARAGMLLPAVRTYEFSSNLEIHALGLEDNLFEILLDSLCKEGLIREASDYFYRRKGQD 251

Query: 609  --FGSNTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLED 782
              +  +  +YN L++ W R   +KKAE ++ +M   GI P VVT+ +LV+G C  +R+E 
Sbjct: 252  SNWSPSIRVYNILLNGWFRSRKLKKAERLWTEMKKEGIKPSVVTYGTLVEGLCRMRRVEM 311

Query: 783  AINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIK 962
            AI L+  MK +G  PNA+V+NP+++ALGEAGR  EA  M++ + +    P +ST+NSL+K
Sbjct: 312  AIELIDEMKEEGIPPNAVVYNPVIDALGEAGRFKEASGMMERLLVLESGPTLSTYNSLVK 371

Query: 963  AYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSP 1142
             +C+  D+  A+ I KMM+ +  +PT TTYN F RYFSK G I+EG+ LY KL ESG   
Sbjct: 372  GFCKAGDIVGASKILKMMINRGLMPTPTTYNYFFRYFSKFGKIEEGLNLYTKLIESGYVA 431

Query: 1143 DSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLF 1322
            D LTY  L++MLCE+  + LALQ+ +++ T G +L L   T+LI   CK  +  +A + F
Sbjct: 432  DRLTYHLLVKMLCEQDRLNLALQIIQEMRTKGFDLDLATSTMLIHLFCKMHQFDEAVEWF 491

Query: 1323 ERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRALNPYFRRRPPSSKSFIKD 1496
              M++RG+ P+  TY+ L + L ++   +K + L   M    PY  + P    ++I+D
Sbjct: 492  HDMIRRGLVPQYLTYQRLCNDLAKQGMNDKAEKLRNTM-VSTPYSEKLP---NTYIRD 545


>ref|XP_004241813.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like [Solanum lycopersicum]
          Length = 602

 Score =  296 bits (759), Expect = 2e-77
 Identities = 173/478 (36%), Positives = 271/478 (56%), Gaps = 12/478 (2%)
 Frame = +3

Query: 99   QPKLPN--PNDNKTACSV---KIFNSSEDAENALENSDIIIDSQSLENILENNKYRLEFL 263
            QP+ PN  P D  T   +            ENAL+ + I ++      +  +     + L
Sbjct: 72   QPETPNYCPTDFTTLSEILRDPTIPPGPALENALDRAGIEVNECMFLQLFNHFDSSPKPL 131

Query: 264  LRIFQHATQNQGFIPSSRACNLLINSMSKYREFDLAWMLIKE-MKEHGLLNLDTFVILFR 440
              ++  A + + F  S    N ++N++ K REFD AW LI + +      NL TF I+ R
Sbjct: 132  FTLYLWAEKKEWFKFSLPVFNAVVNALGKEREFDSAWNLILDRLNSTERPNLGTFAIMIR 191

Query: 441  RYARAGMNNAALRSFDL---MEYFGITRDLEALTAFMKALCKEKKIQFASLVFSMRKDR- 608
            RY+RAGM   A+R+++    +E  G+  +       + +LCKE  I+ AS  F  RK + 
Sbjct: 192  RYSRAGMLLPAIRTYEFSTNLEIHGLGLEDNLFEILLDSLCKEGHIREASDYFYRRKGKD 251

Query: 609  --FGSNTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLED 782
              +  +  +YN L++ W R   +KKAE ++ +M   GI P VVT+ +LV+G C  +R+E 
Sbjct: 252  LNWSPSIRVYNILLNGWFRSRKLKKAERLWTEMKKEGIKPSVVTYGTLVEGLCRMRRVEM 311

Query: 783  AINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIK 962
            AI L+  MK +G  PN +V+NP+++ALGEAGR  EA  M++ + +    P +ST+NSL+K
Sbjct: 312  AIELIDEMKEEGIHPNVVVYNPVIDALGEAGRFKEASGMMERLLVLESGPTLSTYNSLVK 371

Query: 963  AYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSP 1142
             +C+  D+  A+ I KMM+++  +PT TTYN F RYFSK G I+EG+ LY KL ESG   
Sbjct: 372  GFCKAGDIAGASKILKMMIDRGFMPTPTTYNYFFRYFSKFGKIEEGLNLYTKLIESGYVA 431

Query: 1143 DSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLF 1322
            D LTY  L++MLCE+  ++LALQ+ +++ T G +L L   T+LI   CK  +  +A + F
Sbjct: 432  DRLTYHLLVKMLCEQDRLDLALQIIQEMRTKGFDLDLATSTMLIHLFCKMHQFDEAVEWF 491

Query: 1323 ERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRALNPYFRRRPPSSKSFIKD 1496
              M++RG+ P+  TY+ L + L ++   +  + L  MM    PY  + P    ++I+D
Sbjct: 492  HDMIRRGVVPQYLTYQRLCNDLAKQGMNDNAEKLRNMM-VSTPYAEKLP---NTYIRD 545


>ref|NP_196692.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75174149|sp|Q9LFM6.1|PP375_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At5g11310, mitochondrial; Flags: Precursor
            gi|8953393|emb|CAB96666.1| putative protein [Arabidopsis
            thaliana] gi|110738090|dbj|BAF00978.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332004275|gb|AED91658.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 602

 Score =  292 bits (747), Expect = 5e-76
 Identities = 187/576 (32%), Positives = 306/576 (53%), Gaps = 42/576 (7%)
 Frame = +3

Query: 15   KGFFQRRNAASTANFLTIDATPPLIFKFQ-PKLPNPNDNKTACSVKIFNSSEDAENALEN 191
            + FF  R  +S+     +    PLI + Q P +P+        +           N LEN
Sbjct: 20   RNFFLHRLLSSSRRSSPLIPVEPLIQRIQSPAVPDSTCTPPQQNTVSKTDLSTISNLLEN 79

Query: 192  SDIIIDSQSLENILENNKY------------RLE----FLLRIFQHATQNQGFIPSSRAC 323
            +D++  S SLE+ L+                RL      L  +F+ A    GF  S    
Sbjct: 80   TDVVPGS-SLESALDETGIEPSVELVHALFDRLSSSPMLLHSVFKWAEMKPGFTLSPSLF 138

Query: 324  NLLINSMSKYREFDLAWMLI----KEMKEHGLLNLDTFVILFRRYARAGMNNAALRSFDL 491
            + ++NS+ K REF++AW L+    +  +   L++ DTF++L RRYARAGM   A+R+F+ 
Sbjct: 139  DSVVNSLCKAREFEIAWSLVFDRVRSDEGSNLVSADTFIVLIRRYARAGMVQQAIRAFEF 198

Query: 492  MEYFG----ITRDLEALTAFMKALCKEKKIQFASLVFSMRKDRFGSN----TEMYNSLIS 647
               +        +L  L   + ALCKE  ++ AS+          SN      ++N L++
Sbjct: 199  ARSYEPVCKSATELRLLEVLLDALCKEGHVREASMYLERIGGTMDSNWVPSVRIFNILLN 258

Query: 648  SWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLEDAINLLKSMKTKGCQP 827
             W R   +K+AE ++ +M    + P VVT+ +L++GYC  +R++ A+ +L+ MK    + 
Sbjct: 259  GWFRSRKLKQAEKLWEEMKAMNVKPTVVTYGTLIEGYCRMRRVQIAMEVLEEMKMAEMEI 318

Query: 828  NALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIKAYCENDDLDQATMIF 1007
            N +VFNP+++ LGEAGR+ EA  M++   +    P I T+NSL+K +C+  DL  A+ I 
Sbjct: 319  NFMVFNPIIDGLGEAGRLSEALGMMERFFVCESGPTIVTYNSLVKNFCKAGDLPGASKIL 378

Query: 1008 KMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSPDSLTYQSLIQMLCEK 1187
            KMM+ +   PTTTTYN F +YFSKH   +EGM LY KL E+G SPD LTY  +++MLCE 
Sbjct: 379  KMMMTRGVDPTTTTYNHFFKYFSKHNKTEEGMNLYFKLIEAGHSPDRLTYHLILKMLCED 438

Query: 1188 SSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLFERMVQRGISPELTTY 1367
              + LA+QV+K+++  G +  L   T+LI  LC+   L +AF+ F+  V+RGI P+  T+
Sbjct: 439  GKLSLAMQVNKEMKNRGIDPDLLTTTMLIHLLCRLEMLEEAFEEFDNAVRRGIIPQYITF 498

Query: 1368 KILSDCLDEEKQYEKTQNLWEMMRALNPYFRRRPPSSKSFI-----KDDTKEPV-KYIAA 1529
            K++ + L  +   +  + L  +M +L P+ ++ P + +  +     KD  K  + +  A 
Sbjct: 499  KMIDNGLRSKGMSDMAKRLSSLMSSL-PHSKKLPNTYREAVDAPPDKDRRKSILHRAEAM 557

Query: 1530 SNIFEKKKNRRR-------HLSYIKEEVDDGSDIVE 1616
            S++ +  +N R+       H   + E+++   DI E
Sbjct: 558  SDVLKGCRNPRKLVKMRGSHKKAVGEDINLIDDINE 593


>ref|XP_006399632.1| hypothetical protein EUTSA_v10013015mg [Eutrema salsugineum]
            gi|557100722|gb|ESQ41085.1| hypothetical protein
            EUTSA_v10013015mg [Eutrema salsugineum]
          Length = 603

 Score =  291 bits (745), Expect = 9e-76
 Identities = 167/482 (34%), Positives = 283/482 (58%), Gaps = 18/482 (3%)
 Frame = +3

Query: 174  ENALENSDIIIDSQSLENILENNKYRLEFLLRIFQHATQNQGFIPSSRACNLLINSMSKY 353
            E+AL+ + I    Q ++ + +  +     L  +F+ A    GF PS    + +IN++ K 
Sbjct: 91   ESALDETGIEPSIQLIQALFDRLRSSPMLLHSLFKWAEMKPGFTPSPSMFDSVINALCKA 150

Query: 354  REFDLAWMLIKE-MKEHG---LLNLDTFVILFRRYARAGMNNAALRSFDLMEYFG----I 509
            REF++AW LI + ++  G   L++ DTFV+L RRYARAGM   A+R+F+    +      
Sbjct: 151  REFEIAWSLIFDRVRSDGGSDLVSADTFVVLIRRYARAGMVQQAIRAFEFARSYDPVCKS 210

Query: 510  TRDLEALTAFMKALCKEKKIQFASLVFSMRKDRFGSN----TEMYNSLISSWCRIHGIKK 677
              +L+ L   + ALCKE  ++ AS+    R+ R  SN      ++N L++ W R   +K+
Sbjct: 211  ASELKLLEVLLDALCKEGHVREASMYLERRR-RIDSNWVPSVRIFNILLNGWFRSRKLKQ 269

Query: 678  AEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLEDAINLLKSMKTKGCQPNALVFNPLVN 857
            AE ++ +M V  + P VVT+ +L++G+C  +R+E A+ +L+ MK    + N +VFNP+++
Sbjct: 270  AENLWAEMKVMNVKPTVVTYGTLIEGFCRMRRVEIAMEVLEEMKMAEMELNFMVFNPIID 329

Query: 858  ALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIKAYCENDDLDQATMIFKMMVEKDCLP 1037
             LGE+GR+ EA  M++   ++   P I T+NSL+K++C+  DL  A+ I KMM+ +   P
Sbjct: 330  GLGESGRLQEALGMMERFFVSESGPTIVTYNSLVKSFCKAGDLTGASKILKMMMNRGVDP 389

Query: 1038 TTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSPDSLTYQSLIQMLCEKSSIELALQVH 1217
            T TTYN F ++FSKH   ++GM LY KL E+G SPD  TY  +++MLCE   + LA+QV+
Sbjct: 390  TPTTYNHFFKFFSKHNKTEQGMNLYFKLIEAGHSPDRFTYHLILKMLCEDGKLSLAMQVN 449

Query: 1218 KDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLFERMVQRGISPELTTYKILSDCLDEE 1397
            K+++  G +  L   T++I  LC+   L +AF  FE+ V+RGI P+  T+K++ + L  +
Sbjct: 450  KEMKNRGIDPDLLTTTMMIHLLCRLDMLEEAFGEFEKAVRRGIVPQYITFKMIDNGLRSK 509

Query: 1398 KQYEKTQNLWEMMRALNPYFRRRPPSSKSFI----KDDTKEPV--KYIAASNIFEKKKNR 1559
               +  + L  +M +L P+ ++ P + +  +      D K+ +  K  A S++ +  +N 
Sbjct: 510  GMIDMAKRLSSVMSSL-PHSKKLPNTYREVVDAPPDRDRKKSILHKAEAMSDVLKGCRNP 568

Query: 1560 RR 1565
            R+
Sbjct: 569  RK 570


>gb|ESW33251.1| hypothetical protein PHAVU_001G055200g [Phaseolus vulgaris]
          Length = 606

 Score =  289 bits (740), Expect = 3e-75
 Identities = 157/419 (37%), Positives = 247/419 (58%), Gaps = 14/419 (3%)
 Frame = +3

Query: 255  EFLLRIFQHATQNQGFIPSSRACNLLINSMSKYREFDLAWMLIK-------EMKEHGLLN 413
            + L  +F  A    GF P  +  + ++N+++K +EFD AW L+        E +   L++
Sbjct: 128  KLLHSLFLWAQTRPGFRPGPKLFDAVVNALAKAKEFDAAWKLVLDNVDGDGEEENESLVS 187

Query: 414  LDTFVILFRRYARAGMNNAALRSFDLME----YFGITRDLEALTAFMKALCKEKKIQFAS 581
            + TF I+ RRYARAGM+  A+R+++             ++      M +LCKE  ++ AS
Sbjct: 188  VGTFAIMIRRYARAGMSKLAIRTYEFARNNKSIVDSGSEMSLFEILMDSLCKEGSVREAS 247

Query: 582  LVFSMRKD---RFGSNTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVK 752
              F  RK+    +  +  +YN +++ W R   +K+ E ++ +M    + P VVT+ +LV+
Sbjct: 248  EYFLWRKELDLSWVPSIRVYNIMLNGWFRSRKLKQGERLWEEMKKENVRPSVVTYGTLVE 307

Query: 753  GYCSAKRLEDAINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQP 932
            GYC  +R+E A+ ++  M  +G  PN +V+NP+++AL EAGR  EA  ML+   I    P
Sbjct: 308  GYCRMRRVEKALEMVGDMTKEGIAPNVIVYNPIIDALAEAGRFKEALGMLERFHILEIGP 367

Query: 933  NISTFNSLIKAYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELY 1112
              ST+NSLIK YC+  DL  A+ I KMM+ +  +P+ TTYN F RYFS+ G I+EGM LY
Sbjct: 368  TDSTYNSLIKGYCKAADLAGASKILKMMISRGFIPSPTTYNYFFRYFSRCGKIEEGMNLY 427

Query: 1113 MKLAESGCSPDSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKE 1292
             K+ ESG +PD LTY  L++MLCE+  ++LA+QV K++  +G ++ L   T+LI  LCK 
Sbjct: 428  RKMIESGYTPDRLTYHLLVKMLCEEGKLDLAVQVSKEMRHNGYDMDLATSTMLIHLLCKM 487

Query: 1293 GRLVDAFQLFERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRALNPYFRRRP 1469
             RL +AF  FE M++RGI P+  T++ +   L ++   E  Q L ++M ++ PY    P
Sbjct: 488  HRLEEAFAEFEDMIRRGIVPQYLTFQGMKAELKKQGMTEMAQKLCKLMSSV-PYSDNLP 545


>ref|XP_002871469.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297317306|gb|EFH47728.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 602

 Score =  289 bits (739), Expect = 4e-75
 Identities = 158/444 (35%), Positives = 259/444 (58%), Gaps = 12/444 (2%)
 Frame = +3

Query: 174  ENALENSDIIIDSQSLENILENNKYRLEFLLRIFQHATQNQGFIPSSRACNLLINSMSKY 353
            E+AL+ + I    Q ++ + +        L  +F+ A    GF  S    + +INS+ K 
Sbjct: 89   ESALDETGIEPSLQLVQALFDRLSSSPMLLHSVFKWAEMKPGFTLSPSLFDSVINSLCKA 148

Query: 354  REFDLAWMLI----KEMKEHGLLNLDTFVILFRRYARAGMNNAALRSFDLMEYFG----I 509
            REF++AW L+    +  +   L++ DTF++L RRYARAGM   A+R+F+    +      
Sbjct: 149  REFEIAWSLVFDRVRSDEGSNLVSADTFIVLIRRYARAGMVQQAIRAFEFARSYEPVCKS 208

Query: 510  TRDLEALTAFMKALCKEKKIQFASLVFSMRKDRFGSN----TEMYNSLISSWCRIHGIKK 677
              +L+ L   + ALCKE  ++ AS+    R+    SN      ++N L++ W R   +K+
Sbjct: 209  ASELKLLEVLLDALCKEGYVREASVYLERRRGMMDSNWVPSVRIFNILLNGWFRSRKLKQ 268

Query: 678  AEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLEDAINLLKSMKTKGCQPNALVFNPLVN 857
            AE ++ +M    + P VVT+ +L++GYC  +R+E A+ +L+ MK    +   +VFNP+++
Sbjct: 269  AEKLWEEMKAMNVKPTVVTYGTLIEGYCRMRRVEIAMEILEEMKMAEMELTFMVFNPIID 328

Query: 858  ALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIKAYCENDDLDQATMIFKMMVEKDCLP 1037
             LGEAGR+ EA  M++   +    P I T+NSL+K +C+  DL  A+ I KMM+ +   P
Sbjct: 329  GLGEAGRLSEALGMMERFFVCESGPTIVTYNSLVKNFCKAGDLPGASKILKMMMTRGVEP 388

Query: 1038 TTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSPDSLTYQSLIQMLCEKSSIELALQVH 1217
            TT+TYN F +YFSKH   +EGM LY KL E+G SPD LTY  +++MLCE   + LA+QV+
Sbjct: 389  TTSTYNHFFKYFSKHNKTEEGMNLYFKLIEAGHSPDRLTYHLILKMLCEDGKLSLAIQVN 448

Query: 1218 KDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLFERMVQRGISPELTTYKILSDCLDEE 1397
            K+++  G +  L   T+L+  LC+   L +AF+ F+  V+RGI P+  T+K++ + L  +
Sbjct: 449  KEMKNRGIDPDLLTTTMLMHLLCRLDMLEEAFEEFDNAVRRGIIPQYITFKMIDNGLRSK 508

Query: 1398 KQYEKTQNLWEMMRALNPYFRRRP 1469
               +  + L  +M +L P+ ++ P
Sbjct: 509  GMTDMAKRLSSLMSSL-PHSKKLP 531


>gb|EOY31375.1| Pentatricopeptide repeat (PPR) superfamily protein, putative isoform
            4, partial [Theobroma cacao] gi|508784121|gb|EOY31377.1|
            Pentatricopeptide repeat (PPR) superfamily protein,
            putative isoform 4, partial [Theobroma cacao]
          Length = 560

 Score =  288 bits (738), Expect = 6e-75
 Identities = 162/441 (36%), Positives = 260/441 (58%), Gaps = 12/441 (2%)
 Frame = +3

Query: 159  SSEDAENALENSDIIIDSQSLENILENNKYRLEFLLRIFQHATQNQGFIPSSRACNLLIN 338
            S    E+AL+ ++I  D   L+ I E      + L  +F  A +  GF  S+   + ++N
Sbjct: 44   SGSSLESALDQTEIDPDPGLLQAIFECFDSSPKLLHHLFLWAEKKPGFKSSATLFDSMVN 103

Query: 339  SMSKYREFDLAWMLIKE-----MKEHGLLNLDTFVILFRRYARAGMNNAALRSFD----L 491
             + K R F+ AW L+ +     M+   L++++TFVIL RRYARAGM   A+R+F+    L
Sbjct: 104  VLGKARGFEDAWSLVLDRIGDGMEGSTLVSVNTFVILIRRYARAGMPQPAIRTFEFAKSL 163

Query: 492  MEYFGITRDLEALTAFMKALCKEKKIQFASLVFSMRKDR---FGSNTEMYNSLISSWCRI 662
             +      +       + +LCKE  ++  S   + +++    +  + ++YN L++ W R 
Sbjct: 164  EQICNSDEETNLFEIMLDSLCKEGHVRVVSEYLTRKRETDLGWVPSIKVYNILLNGWFRS 223

Query: 663  HGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLEDAINLLKSMKTKGCQPNALVF 842
              +K AE ++  M   G+ P VVT+ +LV+GYC+ +R+E AI L+  MK  G +PNA V+
Sbjct: 224  RKLKHAERLWLDMKKEGVLPSVVTYGTLVEGYCTMRRVERAIQLVDEMKGVGIEPNAKVY 283

Query: 843  NPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIKAYCENDDLDQATMIFKMMVE 1022
            NP+++ALGEAGR+ EA  M++ + +    PNIS ++SL+K YC+  DL  A+ I KMM+ 
Sbjct: 284  NPIIDALGEAGRLKEALGMMERVFLCESGPNISMYSSLVKGYCKARDLVGASKILKMMIS 343

Query: 1023 KDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSPDSLTYQSLIQMLCEKSSIEL 1202
            +  +PT TTYN F RYFS+   I+E M LY K+ ESG +PD LTY  L++ML E+  ++L
Sbjct: 344  RGFIPTPTTYNYFFRYFSQFRKIEEAMNLYTKMIESGHTPDRLTYHLLLKMLFEEERLDL 403

Query: 1203 ALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLFERMVQRGISPELTTYKILSD 1382
            A+Q+ K++   G +  L   T+LI  LCK  R  DAF  FE M++RG++P+  T++ ++D
Sbjct: 404  AVQISKEMRARGYDRDLATSTMLIHLLCKMHRFEDAFGEFEDMIRRGMAPQYLTFQRMND 463

Query: 1383 CLDEEKQYEKTQNLWEMMRAL 1445
             L +    +    L +MM ++
Sbjct: 464  ELKKRGMTDMASKLCDMMSSV 484


>gb|EOY31373.1| Pentatricopeptide repeat superfamily protein isoform 2, partial
            [Theobroma cacao]
          Length = 584

 Score =  288 bits (738), Expect = 6e-75
 Identities = 162/441 (36%), Positives = 260/441 (58%), Gaps = 12/441 (2%)
 Frame = +3

Query: 159  SSEDAENALENSDIIIDSQSLENILENNKYRLEFLLRIFQHATQNQGFIPSSRACNLLIN 338
            S    E+AL+ ++I  D   L+ I E      + L  +F  A +  GF  S+   + ++N
Sbjct: 68   SGSSLESALDQTEIDPDPGLLQAIFECFDSSPKLLHHLFLWAEKKPGFKSSATLFDSMVN 127

Query: 339  SMSKYREFDLAWMLIKE-----MKEHGLLNLDTFVILFRRYARAGMNNAALRSFD----L 491
             + K R F+ AW L+ +     M+   L++++TFVIL RRYARAGM   A+R+F+    L
Sbjct: 128  VLGKARGFEDAWSLVLDRIGDGMEGSTLVSVNTFVILIRRYARAGMPQPAIRTFEFAKSL 187

Query: 492  MEYFGITRDLEALTAFMKALCKEKKIQFASLVFSMRKDR---FGSNTEMYNSLISSWCRI 662
             +      +       + +LCKE  ++  S   + +++    +  + ++YN L++ W R 
Sbjct: 188  EQICNSDEETNLFEIMLDSLCKEGHVRVVSEYLTRKRETDLGWVPSIKVYNILLNGWFRS 247

Query: 663  HGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLEDAINLLKSMKTKGCQPNALVF 842
              +K AE ++  M   G+ P VVT+ +LV+GYC+ +R+E AI L+  MK  G +PNA V+
Sbjct: 248  RKLKHAERLWLDMKKEGVLPSVVTYGTLVEGYCTMRRVERAIQLVDEMKGVGIEPNAKVY 307

Query: 843  NPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIKAYCENDDLDQATMIFKMMVE 1022
            NP+++ALGEAGR+ EA  M++ + +    PNIS ++SL+K YC+  DL  A+ I KMM+ 
Sbjct: 308  NPIIDALGEAGRLKEALGMMERVFLCESGPNISMYSSLVKGYCKARDLVGASKILKMMIS 367

Query: 1023 KDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSPDSLTYQSLIQMLCEKSSIEL 1202
            +  +PT TTYN F RYFS+   I+E M LY K+ ESG +PD LTY  L++ML E+  ++L
Sbjct: 368  RGFIPTPTTYNYFFRYFSQFRKIEEAMNLYTKMIESGHTPDRLTYHLLLKMLFEEERLDL 427

Query: 1203 ALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLFERMVQRGISPELTTYKILSD 1382
            A+Q+ K++   G +  L   T+LI  LCK  R  DAF  FE M++RG++P+  T++ ++D
Sbjct: 428  AVQISKEMRARGYDRDLATSTMLIHLLCKMHRFEDAFGEFEDMIRRGMAPQYLTFQRMND 487

Query: 1383 CLDEEKQYEKTQNLWEMMRAL 1445
             L +    +    L +MM ++
Sbjct: 488  ELKKRGMTDMASKLCDMMSSV 508


>gb|EOY31372.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao] gi|508784118|gb|EOY31374.1| Pentatricopeptide
            repeat superfamily protein isoform 1 [Theobroma cacao]
            gi|508784120|gb|EOY31376.1| Pentatricopeptide repeat
            superfamily protein isoform 1 [Theobroma cacao]
          Length = 595

 Score =  288 bits (738), Expect = 6e-75
 Identities = 162/441 (36%), Positives = 260/441 (58%), Gaps = 12/441 (2%)
 Frame = +3

Query: 159  SSEDAENALENSDIIIDSQSLENILENNKYRLEFLLRIFQHATQNQGFIPSSRACNLLIN 338
            S    E+AL+ ++I  D   L+ I E      + L  +F  A +  GF  S+   + ++N
Sbjct: 79   SGSSLESALDQTEIDPDPGLLQAIFECFDSSPKLLHHLFLWAEKKPGFKSSATLFDSMVN 138

Query: 339  SMSKYREFDLAWMLIKE-----MKEHGLLNLDTFVILFRRYARAGMNNAALRSFD----L 491
             + K R F+ AW L+ +     M+   L++++TFVIL RRYARAGM   A+R+F+    L
Sbjct: 139  VLGKARGFEDAWSLVLDRIGDGMEGSTLVSVNTFVILIRRYARAGMPQPAIRTFEFAKSL 198

Query: 492  MEYFGITRDLEALTAFMKALCKEKKIQFASLVFSMRKDR---FGSNTEMYNSLISSWCRI 662
             +      +       + +LCKE  ++  S   + +++    +  + ++YN L++ W R 
Sbjct: 199  EQICNSDEETNLFEIMLDSLCKEGHVRVVSEYLTRKRETDLGWVPSIKVYNILLNGWFRS 258

Query: 663  HGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLEDAINLLKSMKTKGCQPNALVF 842
              +K AE ++  M   G+ P VVT+ +LV+GYC+ +R+E AI L+  MK  G +PNA V+
Sbjct: 259  RKLKHAERLWLDMKKEGVLPSVVTYGTLVEGYCTMRRVERAIQLVDEMKGVGIEPNAKVY 318

Query: 843  NPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIKAYCENDDLDQATMIFKMMVE 1022
            NP+++ALGEAGR+ EA  M++ + +    PNIS ++SL+K YC+  DL  A+ I KMM+ 
Sbjct: 319  NPIIDALGEAGRLKEALGMMERVFLCESGPNISMYSSLVKGYCKARDLVGASKILKMMIS 378

Query: 1023 KDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSPDSLTYQSLIQMLCEKSSIEL 1202
            +  +PT TTYN F RYFS+   I+E M LY K+ ESG +PD LTY  L++ML E+  ++L
Sbjct: 379  RGFIPTPTTYNYFFRYFSQFRKIEEAMNLYTKMIESGHTPDRLTYHLLLKMLFEEERLDL 438

Query: 1203 ALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLFERMVQRGISPELTTYKILSD 1382
            A+Q+ K++   G +  L   T+LI  LCK  R  DAF  FE M++RG++P+  T++ ++D
Sbjct: 439  AVQISKEMRARGYDRDLATSTMLIHLLCKMHRFEDAFGEFEDMIRRGMAPQYLTFQRMND 498

Query: 1383 CLDEEKQYEKTQNLWEMMRAL 1445
             L +    +    L +MM ++
Sbjct: 499  ELKKRGMTDMASKLCDMMSSV 519


>ref|XP_003549241.1| PREDICTED: pentatricopeptide repeat-containing protein At5g11310,
            mitochondrial-like isoform X1 [Glycine max]
          Length = 622

 Score =  287 bits (735), Expect = 1e-74
 Identities = 182/542 (33%), Positives = 299/542 (55%), Gaps = 29/542 (5%)
 Frame = +3

Query: 102  PKLPNPNDNKTACSVKIFN----SSEDAENA-LENSDIIIDSQSLENILENNKYRLEFLL 266
            P  P+PN N  +    +F     S   A +A L+ + I  D   L  + +      + L 
Sbjct: 83   PDPPSPNPNALSVISNLFADPSLSPGPALHAELDRAGIEPDPALLLAVFDRFGSSPKLLH 142

Query: 267  RIFQHATQNQGFIPSSRACNLLINSMSKYREFDLAWMLI-----KEMKEHG----LLNLD 419
             +F  A     F P  +  + ++N+++K REFD AW L+     K+ +E G    L+++ 
Sbjct: 143  SLFLWAQTRPAFRPGPKLFDAVVNALAKAREFDAAWKLVLHHAEKDGEEEGEKERLVSVG 202

Query: 420  TFVILFRRYARAGMNNAALRSFDLM----EYFGITRDLEALTAFMKALCKEKKIQFASLV 587
            TF I+ RRYARAGM+  A+R+++             ++  L   M +LCKE  ++ AS  
Sbjct: 203  TFAIMIRRYARAGMSKLAIRTYEFATNNKSIVDSGSEMSLLEILMDSLCKEGSVREASEY 262

Query: 588  FSMRKD---RFGSNTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGY 758
            F  +K+    +  +  +YN +++ W R+  +K+ E ++ +M    + P VVT+ +LV+GY
Sbjct: 263  FLWKKELDLSWVPSIRVYNIMLNGWFRLRKLKQGERLWAEMK-ENMRPTVVTYGTLVEGY 321

Query: 759  CSAKRLEDAINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNI 938
            C  +R+E A+ ++  M  +G  PNA+V+NP+++AL EAGR  EA  ML+   +    P  
Sbjct: 322  CRMRRVEKALEMVGDMTKEGIAPNAIVYNPIIDALAEAGRFKEALGMLERFHVLEIGPTD 381

Query: 939  STFNSLIKAYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMK 1118
            ST+NSL+K +C+  DL  A+ I KMM+ +  LP+ TTYN F RYFS+   I+EGM LY K
Sbjct: 382  STYNSLVKGFCKAGDLVGASKILKMMISRGFLPSATTYNYFFRYFSRCRKIEEGMNLYTK 441

Query: 1119 LAESGCSPDSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGR 1298
            L +SG +PD LTY  L++MLCE+  ++LA+QV K++  +G ++ L   T+L+  LCK  R
Sbjct: 442  LIQSGYTPDRLTYHLLVKMLCEEEKLDLAVQVSKEMRHNGYDMDLATSTMLVHLLCKVRR 501

Query: 1299 LVDAFQLFERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRALNPYFRRRPPSS 1478
            L +AF  FE M++RGI P+  T++ +   L ++   E  Q L ++M ++ PY     P++
Sbjct: 502  LEEAFVEFEDMIRRGIVPQYLTFQRMKADLKKQGMTEMAQKLCKLMSSV-PY-SPNLPNT 559

Query: 1479 KSFIKDDTKEPVKYI-----AASNIFEKKKN---RRRHLSYIKEEVDDGSDIVEVANRFT 1634
               +++D     K I     A S++ +  K+    R+H S  +  V   + ++E   R  
Sbjct: 560  YGEVREDAYARRKSIIRKAKAFSDMLKDCKDPSELRKHRSSSENTVSSTNSLIEDIERKR 619

Query: 1635 KT 1640
             T
Sbjct: 620  NT 621


>gb|EXC20787.1| hypothetical protein L484_007369 [Morus notabilis]
          Length = 612

 Score =  287 bits (734), Expect = 2e-74
 Identities = 174/513 (33%), Positives = 281/513 (54%), Gaps = 31/513 (6%)
 Frame = +3

Query: 24   FQRRNAASTANFLTIDATPPLIFKFQP-KLPNP--------------NDNKTACSVKIFN 158
            F   ++AS  ++L++   P + +  +P  +PNP              + N+ A   ++  
Sbjct: 39   FFSSSSASGLSWLSVPGKPLIRWPHEPCSVPNPQPDPNPSPNPGAEFSQNEFAAISEVLT 98

Query: 159  SSE-----DAENALENSDIIIDSQSLENILENNKYRLEFLLRIFQHATQNQGFIPSSRAC 323
            +           AL+ + I      L+ + ++     + L  +F  A +  G+  S+   
Sbjct: 99   NPNISGGFSLHTALDRTGIEPSPSLLQAVFDHFDSSPKLLYSLFLWAEKQPGYRSSASLF 158

Query: 324  NLLINSMSKYREFDLAWMLIKEM----KEHGLLNLDTFVILFRRYARAGMNNAALRSFDL 491
              +IN ++K REFD AW LI       +E  L+  DTFVI+ RRYAR GM  +A+R+F+ 
Sbjct: 159  ASVINVLAKSREFDSAWSLILHRIGKEEEPRLVCEDTFVIMIRRYAREGMPQSAVRTFEF 218

Query: 492  ----MEYFGITRDLEALTAFMKALCKEKKIQFASLVFSMRKDRFGS---NTEMYNSLISS 650
                +       ++      + ALCKE  ++ AS  F+ +K    S   +   YN L++ 
Sbjct: 219  ASNSVPICSYISEISLFGILLDALCKEGHVRAASDYFNEKKKLDPSWIPSIRAYNILLNG 278

Query: 651  WCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLEDAINLLKSMKTKGCQPN 830
            W R   +K+AE ++ +M    +   VVT+ +LV+GYC  +R E A+ L+K M+T+G +PN
Sbjct: 279  WFRSRKLKRAERLWMEMKRDNVRSTVVTYGTLVEGYCRMRRAEIAVELVKEMRTEGIEPN 338

Query: 831  ALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIKAYCENDDLDQATMIFK 1010
            A+V+NP+++ALGEAGR  EA  M++   +    P IST+NSL+K +C+  +L  A+ I K
Sbjct: 339  AIVYNPIIDALGEAGRFKEALGMMERFLVLESGPTISTYNSLVKGFCKAGNLAGASKIIK 398

Query: 1011 MMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSPDSLTYQSLIQMLCEKS 1190
            MM+ +  +PT TTYN F +YFSK G I+EGM LY K+  SG SPD LTY  L++MLCE+ 
Sbjct: 399  MMIGRGIIPTPTTYNYFFKYFSKFGKIEEGMNLYTKMIGSGHSPDRLTYHLLLKMLCEEG 458

Query: 1191 SIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLFERMVQRGISPELTTYK 1370
             ++LA+QV K++ + G ++ L   T+LI   C   R  +A+  F  M++RGI P+  TY 
Sbjct: 459  KLDLAVQVGKEMRSRGFDMDLATSTMLIHLFCNMRRFEEAYLEFGDMIRRGIVPQYLTYH 518

Query: 1371 ILSDCLDEEKQYEKTQNLWEMMRALNPYFRRRP 1469
             + D L +    E    L ++M ++ P+  + P
Sbjct: 519  RMKDELKKRGMTEMVSKLRDLMSSV-PHSTKLP 550


>ref|XP_002308636.1| hypothetical protein POPTR_0006s26360g [Populus trichocarpa]
            gi|222854612|gb|EEE92159.1| hypothetical protein
            POPTR_0006s26360g [Populus trichocarpa]
          Length = 607

 Score =  286 bits (732), Expect = 3e-74
 Identities = 165/458 (36%), Positives = 265/458 (57%), Gaps = 15/458 (3%)
 Frame = +3

Query: 120  NDNKTACSV----KIFNSSEDAENALENSDIIIDSQSLENILENNKYRLEFLLRIFQHAT 287
            ND  T C++    KI         AL+ + I  +   ++++ ++     + L  +F  A 
Sbjct: 80   NDFFTLCNILKDPKI-QLGPSLRTALDRTGIEPELGLIQSVFDHFDSSPKLLHSVFLWAE 138

Query: 288  QNQGFIPSSRACNLLINSMSKYREFDLAWMLIKEM---KEHG-LLNLDTFVILFRRYARA 455
            +  GF  S+   N ++N + K REF  AW L+ +     E G L++ DTF IL RRY RA
Sbjct: 139  KKPGFQSSAALFNSMVNFLGKAREFGSAWCLLLDRIGGNEGGDLVSSDTFAILIRRYTRA 198

Query: 456  GMNNAALRSFDLMEYFGITRDLEALTAFMK----ALCKEKKIQFASLVFSMRKDR---FG 614
            GM+ AA+R+F+      +  + EA T+  +    +LCKE  ++ A+  F  + ++   + 
Sbjct: 199  GMSEAAIRTFEYASSLDLIHNSEAGTSLFEILLDSLCKEGHVRVATDYFDRKVEKDPCWV 258

Query: 615  SNTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSAKRLEDAINL 794
             +  +YN L++ W R   +K AE ++ +M    + P VVT+ +LV+GY   +R+E AI L
Sbjct: 259  PSVRIYNILLNGWFRSRKLKHAERLWLEMKKKNVKPSVVTYGTLVEGYSRMRRVERAIEL 318

Query: 795  LKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTFNSLIKAYCE 974
            +  MK +G + NA+V+NP+++AL EAGR  E   M++   +    P IST+NSL+K YC+
Sbjct: 319  VDEMKREGIKSNAIVYNPIIDALAEAGRFKEVLGMMEHFFLCEEGPTISTYNSLVKGYCK 378

Query: 975  NDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAESGCSPDSLT 1154
              DL  A+ I KMM+ ++  PT TTYN F R+FSK   I+EGM LY K+ ESG +PD LT
Sbjct: 379  AGDLVGASKILKMMISREVFPTPTTYNYFFRHFSKCRKIEEGMNLYTKMIESGYTPDRLT 438

Query: 1155 YQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVDAFQLFERMV 1334
            Y  L++MLCE+  ++LA+Q+ K++   GC++ L   T+    LCK  R  +AF  FE M+
Sbjct: 439  YHLLLKMLCEEERLDLAVQISKEMRARGCDMDLATSTMFTHLLCKMQRFEEAFAEFEDML 498

Query: 1335 QRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRALN 1448
            +RGI P+  T+  L+D   ++   E  + L ++M +++
Sbjct: 499  RRGIVPQYLTFHRLNDEFRKQGLTELARRLCKLMSSVS 536


>ref|XP_006450554.1| hypothetical protein CICLE_v10008018mg [Citrus clementina]
            gi|557553780|gb|ESR63794.1| hypothetical protein
            CICLE_v10008018mg [Citrus clementina]
          Length = 517

 Score =  285 bits (729), Expect = 6e-74
 Identities = 158/466 (33%), Positives = 263/466 (56%), Gaps = 18/466 (3%)
 Frame = +3

Query: 102  PKLPNPNDNKTACSV-------KIFNSSEDAENALENSDIIIDSQSLENILENNKYRLEF 260
            P  P  N ++T  SV          +S    E+ L  + +  +   L  + E+  +  + 
Sbjct: 39   PSTPPHNFSQTDFSVISGLLRNTAISSGPSLESELNQTGVEPEPALLLAVFEHFDHSPKL 98

Query: 261  LLRIFQHATQNQGFIPSSRACNLLINSMSKYREFDLAWMLIKEM----KEHGLLNLDTFV 428
            L  +F+ A     F  S+   N +I  ++K +EFD AW L+ +     +    ++ DTFV
Sbjct: 99   LHTLFRWAESKPEFKCSAALFNCVIKVLAKAKEFDSAWCLLLDKIGGHEAPDFVSKDTFV 158

Query: 429  ILFRRYARAGMNNAALRSFDLMEYFGITRDLEA----LTAFMKALCKEKKIQFASLVFSM 596
            IL RRYARAGM  AA+R+F+      + ++ ++        + +LCK+ +++ AS  F  
Sbjct: 159  ILIRRYARAGMVEAAIRTFEFANNLDMVKNFDSGASLFEILLDSLCKQGRVKAASEYFHK 218

Query: 597  RKD---RFGSNTEMYNSLISSWCRIHGIKKAEVIFGQMVVSGIAPDVVTFASLVKGYCSA 767
            RK+    +     +YN L++ W R   +K AE  + +M    + P+VVT+ +LV+GYC  
Sbjct: 219  RKELDQSWAPTVRVYNILLNGWFRSKNVKDAERFWLEMRKENVTPNVVTYGTLVEGYCRL 278

Query: 768  KRLEDAINLLKSMKTKGCQPNALVFNPLVNALGEAGRIFEAHLMLDEMAIAGCQPNISTF 947
            +R++ AI L+K M+ +G +PNA+V+N +++ L EAGR  E   M++   +    P + T+
Sbjct: 279  RRVDRAIRLVKEMRKEGIEPNAIVYNTVIDGLVEAGRFEEVSGMMERFLVCEPGPTMVTY 338

Query: 948  NSLIKAYCENDDLDQATMIFKMMVEKDCLPTTTTYNCFIRYFSKHGNIKEGMELYMKLAE 1127
             SL+K YC+  DL+ A+ I KMM+ +  LP+ TTYN F RYFSK G +++ M LY K+ E
Sbjct: 339  TSLVKGYCKAGDLEGASKILKMMISRGFLPSPTTYNYFFRYFSKFGKVEDAMNLYRKMIE 398

Query: 1128 SGCSPDSLTYQSLIQMLCEKSSIELALQVHKDIETSGCELHLEAFTILIDTLCKEGRLVD 1307
            SG +PD LTY  L++MLC++  ++LA+QV K+++  GC++ L+  T+LI  LC+  +  +
Sbjct: 399  SGYTPDRLTYHILLKMLCKEDKLDLAIQVSKEMKCRGCDIDLDTSTMLIHLLCRMYKFDE 458

Query: 1308 AFQLFERMVQRGISPELTTYKILSDCLDEEKQYEKTQNLWEMMRAL 1445
            A   FE M++RG+ P   T+K L+D   +       Q L  +M ++
Sbjct: 459  ASAEFEDMIRRGLVPHYLTFKRLNDEFKKRGMTALAQKLCNVMSSV 504


Top