BLASTX nr result

ID: Anemarrhena21_contig00030220 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Anemarrhena21_contig00030220
         (340 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010926974.1| PREDICTED: pentatricopeptide repeat-containi...   133   4e-29
ref|XP_009384372.1| PREDICTED: pentatricopeptide repeat-containi...   129   1e-27
ref|XP_010276650.1| PREDICTED: pentatricopeptide repeat-containi...   100   5e-19
ref|XP_006854804.1| PREDICTED: pentatricopeptide repeat-containi...    96   9e-18
ref|XP_012492637.1| PREDICTED: pentatricopeptide repeat-containi...    93   6e-17
ref|XP_008803361.1| PREDICTED: pentatricopeptide repeat-containi...    91   4e-16
ref|XP_002512645.1| pentatricopeptide repeat-containing protein,...    90   5e-16
ref|XP_011096712.1| PREDICTED: pentatricopeptide repeat-containi...    90   7e-16
ref|XP_008243776.1| PREDICTED: putative pentatricopeptide repeat...    89   1e-15
ref|XP_007226966.1| hypothetical protein PRUPE_ppa025241mg [Prun...    89   1e-15
ref|XP_010942245.1| PREDICTED: putative pentatricopeptide repeat...    88   2e-15
ref|XP_012827479.1| PREDICTED: pentatricopeptide repeat-containi...    88   3e-15
ref|XP_007029475.1| Pentatricopeptide repeat superfamily protein...    87   4e-15
ref|XP_012088740.1| PREDICTED: putative pentatricopeptide repeat...    87   6e-15
gb|KDP23280.1| hypothetical protein JCGZ_23113 [Jatropha curcas]       87   6e-15
ref|XP_010930665.1| PREDICTED: pentatricopeptide repeat-containi...    86   7e-15
ref|XP_008797615.1| PREDICTED: putative pentatricopeptide repeat...    86   7e-15
ref|XP_007212977.1| hypothetical protein PRUPE_ppa022473mg [Prun...    86   7e-15
ref|NP_200075.1| pentatricopeptide repeat protein MEF1 [Arabidop...    86   7e-15
ref|XP_010927442.1| PREDICTED: pentatricopeptide repeat-containi...    86   1e-14

>ref|XP_010926974.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g39530-like [Elaeis guineensis]
          Length = 781

 Score =  133 bits (335), Expect = 4e-29
 Identities = 64/110 (58%), Positives = 82/110 (74%)
 Frame = -2

Query: 330 GCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYSCV 151
           G T+LAHK+F  IP RDVV  T ML G  EAG   +AL IF++M+E+  L++NEHAYSC 
Sbjct: 170 GFTELAHKLFCGIPHRDVVAFTCMLMGYAEAGRYAEALRIFEEMIENDQLVMNEHAYSCA 229

Query: 150 LHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
           LHACAG+  LF G+QIHA+V+KS M S+ FVGT LVD+Y K  +M+S +K
Sbjct: 230 LHACAGIPFLFDGRQIHAQVIKSTMASNPFVGTSLVDMYAKSGDMESTKK 279



 Score = 69.7 bits (169), Expect = 7e-10
 Identities = 37/112 (33%), Positives = 66/112 (58%), Gaps = 3/112 (2%)
 Frame = -2

Query: 327 CTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNL-MINEHAYSCV 151
           C D A ++F  I   D+V  TS++ G + +G++++A+ ++  MV   ++   N + Y+ +
Sbjct: 472 CLDDALRLFEQIHHPDLVLWTSLISGFSRSGKSQEAINLYVRMVAEGSVGPPNHYTYATI 531

Query: 150 LHACAGLCCLFGGQQIHARVVKSVMG--SDVFVGTGLVDLYVKCHEMDSARK 1
           L +CA L  L  G+QIHA+++KS      D FV +GL  +Y KC  ++ A +
Sbjct: 532 LSSCAQLAALGEGRQIHAQIIKSDFNFEHDTFVASGLSYMYAKCGYLEEASR 583


>ref|XP_009384372.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g02330-like [Musa acuminata subsp. malaccensis]
          Length = 608

 Score =  129 bits (323), Expect = 1e-27
 Identities = 65/110 (59%), Positives = 77/110 (70%)
 Frame = -2

Query: 330 GCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYSCV 151
           G  DLAHK+F  +P++DVVT TSML G  + G + +AL IF+ MVES    +NEH YSC 
Sbjct: 155 GYVDLAHKVFCGMPDQDVVTFTSMLTGYVQDGRHVEALRIFQGMVESGRFRLNEHVYSCA 214

Query: 150 LHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
           L ACAG   L   QQIHA V+KS M SDVF GT LVDLYVKC EM+ AR+
Sbjct: 215 LRACAGNSTLSDAQQIHAHVLKSGMASDVFTGTSLVDLYVKCDEMECARR 264



 Score = 60.1 bits (144), Expect = 6e-07
 Identities = 34/103 (33%), Positives = 60/103 (58%), Gaps = 3/103 (2%)
 Frame = -2

Query: 306 MFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMI-NEHAYSCVLHACAGL 130
           +F  + +RD+   TS++ G +  GE++ AL ++  MV   ++   N + +S VL +CA +
Sbjct: 464 VFEKLHQRDLALWTSLISGFSRIGESDAALKLYVRMVTEESVEPPNHYMFSAVLSSCAQI 523

Query: 129 CCLFGGQQIHARVVKS--VMGSDVFVGTGLVDLYVKCHEMDSA 7
             L  G+QIHA+V+KS   +  D FV + L+ +Y K   ++ A
Sbjct: 524 AALEEGKQIHAQVIKSDHRVKCDTFVVSSLLHMYAKSGHIEEA 566


>ref|XP_010276650.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g39530-like [Nelumbo nucifera]
          Length = 765

 Score =  100 bits (248), Expect = 5e-19
 Identities = 47/109 (43%), Positives = 76/109 (69%)
 Frame = -2

Query: 330 GCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYSCV 151
           G  ++A ++F  +P+ DVV  T+M+ G T+AG+ E+AL +F++M+E   L+ NE   + +
Sbjct: 158 GPIEIAREVFDRMPQPDVVAYTAMMVGYTDAGDYEEALNLFRNMIEVERLVPNEFTLTSI 217

Query: 150 LHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSAR 4
           L ACAG   L  G+Q+HA ++K+ + S+VFVGT LV+LY KC+ M+ A+
Sbjct: 218 LSACAGNSSLLEGKQMHAYILKTSLQSNVFVGTALVNLYAKCNRMECAK 266



 Score = 74.3 bits (181), Expect = 3e-11
 Identities = 41/109 (37%), Positives = 67/109 (61%), Gaps = 3/109 (2%)
 Frame = -2

Query: 321 DLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMI-NEHAYSCVLH 145
           D A ++F  +   D+V  T+++ G + +GE+E AL ++  M+E     I N + YS VL 
Sbjct: 461 DDAVRVFNKVHSPDLVLWTTIISGFSRSGESEDALKLYTLMLEEELAEIPNNYTYSSVLC 520

Query: 144 ACAGLCCLFGGQQIHARVVKS--VMGSDVFVGTGLVDLYVKCHEMDSAR 4
           +CA L  +  G+QIHA+++KS   +G+D F+ + LVD+Y KC  +  AR
Sbjct: 521 SCANLAAVEEGKQIHAQIIKSNYKIGADPFIASSLVDMYAKCGYITEAR 569


>ref|XP_006854804.1| PREDICTED: pentatricopeptide repeat-containing protein At2g13600
           [Amborella trichopoda] gi|548858508|gb|ERN16271.1|
           hypothetical protein AMTR_s00063p00172440 [Amborella
           trichopoda]
          Length = 914

 Score = 95.9 bits (237), Expect = 9e-18
 Identities = 48/103 (46%), Positives = 70/103 (67%)
 Frame = -2

Query: 315 AHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYSCVLHACA 136
           A ++F    ERDVV  T+M++G TE    E+AL +F++MVE   +  NE ++SCVL ACA
Sbjct: 226 ARQLFDRTVERDVVVFTAMMRGYTEEENCEEALRLFREMVEEP-IPPNEFSFSCVLRACA 284

Query: 135 GLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSA 7
            L  L  G+Q+H+ ++K  + SDVFVGT LV+LY  C +M+S+
Sbjct: 285 NLLALEEGKQVHSYIIKEAVNSDVFVGTALVNLYANCGDMESS 327



 Score = 66.6 bits (161), Expect = 6e-09
 Identities = 38/107 (35%), Positives = 62/107 (57%), Gaps = 2/107 (1%)
 Frame = -2

Query: 315 AHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYSCVLHACA 136
           A + F  + + DVV  T+++ G  + GE  +AL  +  MV    +  N + YS +L A +
Sbjct: 490 AQEAFNRVSQPDVVLWTAIISGFAQGGEAARALQFYAKMVFEGFVTPNHYTYSSILSASS 549

Query: 135 GLCCLFGGQQIHARVVKS--VMGSDVFVGTGLVDLYVKCHEMDSARK 1
            L  +  G+QIH++++KS   + SDVFV +GLVD+Y +   +  ARK
Sbjct: 550 ELVAIEEGKQIHSQILKSGNDVHSDVFVASGLVDMYARSGFIMEARK 596


>ref|XP_012492637.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20770
           [Gossypium raimondii] gi|823195495|ref|XP_012492638.1|
           PREDICTED: pentatricopeptide repeat-containing protein
           At4g20770 [Gossypium raimondii]
           gi|823195498|ref|XP_012492640.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g20770
           [Gossypium raimondii] gi|823195501|ref|XP_012492641.1|
           PREDICTED: pentatricopeptide repeat-containing protein
           At4g20770 [Gossypium raimondii]
           gi|823195504|ref|XP_012492642.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g20770
           [Gossypium raimondii] gi|823195507|ref|XP_012492643.1|
           PREDICTED: pentatricopeptide repeat-containing protein
           At4g20770 [Gossypium raimondii]
           gi|823195510|ref|XP_012492644.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g20770
           [Gossypium raimondii] gi|823195513|ref|XP_012492645.1|
           PREDICTED: pentatricopeptide repeat-containing protein
           At4g20770 [Gossypium raimondii]
           gi|763777577|gb|KJB44700.1| hypothetical protein
           B456_007G267400 [Gossypium raimondii]
          Length = 773

 Score = 93.2 bits (230), Expect = 6e-17
 Identities = 48/112 (42%), Positives = 70/112 (62%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           +CG  + A  +F ++PE D+V   SM+ GLT    N +A  +FK M + R ++  E +Y+
Sbjct: 464 KCGKIERAEHIFSSMPELDIVCWNSMIAGLTLNSLNREAFMLFKQMRQGR-MLPTEFSYA 522

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
            VL  C  L  LF G+Q+H+++VK    SDVFVGT LVD+Y KC ++D A K
Sbjct: 523 TVLSCCTKLSSLFQGRQVHSQIVKDGYESDVFVGTALVDMYCKCGDIDGAWK 574



 Score = 61.2 bits (147), Expect = 3e-07
 Identities = 35/110 (31%), Positives = 59/110 (53%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           + G    A K+F  +PER+VV+  +++  + + G  EKAL ++K MV     +     ++
Sbjct: 79  KTGNLTFARKVFEQMPERNVVSWNNLISLMLKNGHQEKALDVYKLMV-LEGFLPTHITFA 137

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSA 7
            VL AC  +  L  G++ H  V+K  +  ++FV  GL+ +Y KC  M  A
Sbjct: 138 SVLSACGSVFDLQLGKRCHGLVIKIGLDKNIFVCNGLLSVYAKCGVMREA 187



 Score = 60.5 bits (145), Expect = 4e-07
 Identities = 28/111 (25%), Positives = 67/111 (60%)
 Frame = -2

Query: 339 LRCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAY 160
           ++ G  + A +MF  +    V++  +++ G ++   + +A+ +F++M + +++  +    
Sbjct: 362 IKGGDVETARQMFDNMSCPSVISWNAIISGYSQNENHREAIDLFREM-QFQSVKPDRTTV 420

Query: 159 SCVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSA 7
           + +L +CAG+  L GG+Q+HA + K+ + +D +V  GL+ +Y KC +++ A
Sbjct: 421 TVILGSCAGMAFLEGGKQVHAALQKAALHTDNYVAGGLIGMYSKCGKIERA 471


>ref|XP_008803361.1| PREDICTED: pentatricopeptide repeat-containing protein At3g53360,
           mitochondrial-like [Phoenix dactylifera]
          Length = 398

 Score = 90.5 bits (223), Expect = 4e-16
 Identities = 45/84 (53%), Positives = 59/84 (70%)
 Frame = -2

Query: 252 GLTEAGENEKALGIFKDMVESRNLMINEHAYSCVLHACAGLCCLFGGQQIHARVVKSVMG 73
           G  EAGE+ +AL IFK+MVE+  L++NEH YSC L ACA    LF  ++I A+V+KS M 
Sbjct: 2   GYAEAGEHAEALRIFKEMVENDQLVMNEHVYSCALRACARTLFLFDRRRIQAQVIKSKMA 61

Query: 72  SDVFVGTGLVDLYVKCHEMDSARK 1
           S+ FVGT LVD+Y K  +  SA+K
Sbjct: 62  SNAFVGTSLVDVYEKSGDKQSAKK 85


>ref|XP_002512645.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223548606|gb|EEF50097.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 716

 Score = 90.1 bits (222), Expect = 5e-16
 Identities = 43/111 (38%), Positives = 71/111 (63%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           +CG  D + K+F+ +P R+ VT  +M+ G  ++G+ +KAL ++K+M+E + +  +E  YS
Sbjct: 290 KCGRLDNSMKLFMELPNRNEVTWNTMIVGYVQSGDGDKALSLYKNMLECQ-VQASEVTYS 348

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSAR 4
            VL ACA L  +  G QIH+  +K++   DV VG  L+D+Y KC  + +AR
Sbjct: 349 SVLRACASLAAMELGTQIHSLSLKTIYDKDVVVGNALIDMYAKCGSIKNAR 399



 Score = 60.5 bits (145), Expect = 4e-07
 Identities = 29/103 (28%), Positives = 61/103 (59%)
 Frame = -2

Query: 309 KMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYSCVLHACAGL 130
           ++F  +P+ DV+  + M+    ++ ++ +A+ +F  M  +  ++ N+  ++ VL +CA +
Sbjct: 198 RVFEEMPKHDVIPWSFMISRYAQSNQSREAVELFGQMRRAF-VLPNQFTFASVLQSCASI 256

Query: 129 CCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
             L  G+Q+H  V+K  +  +VFV   L+D+Y KC  +D++ K
Sbjct: 257 ENLQLGKQVHCHVLKVGLDGNVFVSNALMDVYAKCGRLDNSMK 299


>ref|XP_011096712.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20770
           [Sesamum indicum]
          Length = 762

 Score = 89.7 bits (221), Expect = 7e-16
 Identities = 42/112 (37%), Positives = 72/112 (64%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           +CG  + A  +F T+P+ D+V   SML GL+    ++ +L  F+ M+  + L+  E +Y+
Sbjct: 465 KCGNIEAAKHIFNTVPQHDIVCWNSMLSGLSLNSLDKDSLNFFQQML-GKGLLPTEFSYT 523

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
            VL+ C+ L  L  G+Q+H+ +VK+   +DV+VGT L+D+Y KC ++D AR+
Sbjct: 524 TVLNCCSSLTSLLQGRQVHSLIVKNGHANDVYVGTALIDMYCKCGDVDGARQ 575



 Score = 60.1 bits (144), Expect = 6e-07
 Identities = 31/112 (27%), Positives = 69/112 (61%)
 Frame = -2

Query: 339 LRCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAY 160
           L+ G  +   K+F ++    + +  ++L G ++   +++AL +F++M + R +  +   +
Sbjct: 363 LKSGDVEAGLKIFNSMSLPSLTSWNAILSGYSQNEYHQEALMLFREM-QFRKVRPDRTTF 421

Query: 159 SCVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSAR 4
           + VL +CA +  L GG+QIHA ++K+   +D++V +GL+ +Y KC  +++A+
Sbjct: 422 AIVLSSCAVMGLLEGGKQIHAALLKAEFCTDLYVTSGLIGVYSKCGNIEAAK 473


>ref|XP_008243776.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At1g68930 [Prunus mume]
          Length = 743

 Score = 89.0 bits (219), Expect = 1e-15
 Identities = 41/111 (36%), Positives = 72/111 (64%)
 Frame = -2

Query: 339 LRCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAY 160
           LRCG  + +  +F  +PE+D ++ T+M+ GLT+ G   KAL  F++M+    L ++++ +
Sbjct: 215 LRCGLIEDSECLFSKMPEKDSISWTTMITGLTQNGSGSKALDKFREMI-LEGLSMDQYTF 273

Query: 159 SCVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSA 7
             VL AC GL  L  G+Q+HA ++++ +  ++FVG+ LVD+Y KC  + +A
Sbjct: 274 GSVLTACGGLFALEEGKQVHAYIIRTELIDNIFVGSALVDMYCKCRSIKAA 324


>ref|XP_007226966.1| hypothetical protein PRUPE_ppa025241mg [Prunus persica]
           gi|462423902|gb|EMJ28165.1| hypothetical protein
           PRUPE_ppa025241mg [Prunus persica]
          Length = 743

 Score = 89.0 bits (219), Expect = 1e-15
 Identities = 41/111 (36%), Positives = 72/111 (64%)
 Frame = -2

Query: 339 LRCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAY 160
           LRCG  + +  +F  +PE+D ++ T+M+ GLT+ G   KAL  F++M+    L ++++ +
Sbjct: 215 LRCGLIEDSECLFSKMPEKDSISWTTMITGLTQNGSGSKALDKFREMI-LEGLSMDQYTF 273

Query: 159 SCVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSA 7
             VL AC GL  L  G+Q+HA ++++ +  ++FVG+ LVD+Y KC  + +A
Sbjct: 274 GSVLTACGGLFALEEGKQVHAYIIRTELIDNIFVGSALVDMYCKCRSIKAA 324


>ref|XP_010942245.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At1g68930 [Elaeis guineensis]
           gi|743857987|ref|XP_010942246.1| PREDICTED: putative
           pentatricopeptide repeat-containing protein At1g68930
           [Elaeis guineensis]
          Length = 744

 Score = 88.2 bits (217), Expect = 2e-15
 Identities = 42/108 (38%), Positives = 73/108 (67%)
 Frame = -2

Query: 339 LRCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAY 160
           LRCG  + + ++F  + ERD ++ T+M+ GLT+ G   +ALG F+DM  ++++ I+++ +
Sbjct: 216 LRCGMVEDSKQLFREMSERDSISWTTMVTGLTQNGLELEALGFFRDM-RAQDVGIDQYTF 274

Query: 159 SCVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEM 16
             VL AC GL  L  G+Q+HA ++++   ++VFVG+ LVD+Y KC  +
Sbjct: 275 GSVLTACGGLSALEQGKQVHAYIIRTHYDNNVFVGSALVDMYSKCRSV 322



 Score = 59.3 bits (142), Expect = 1e-06
 Identities = 34/112 (30%), Positives = 57/112 (50%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           + GC     +MF ++PERD V+  S++ G    G  +KA+  ++ M+       N   +S
Sbjct: 84  KSGCLSEMEQMFDSMPERDCVSWNSLVSGFAGHGSPKKAVEAYRAMLREAQAAPNRITFS 143

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
            +L   +       G+QIH +++K      VFVG+ LVD+Y K   +  AR+
Sbjct: 144 TMLIFSSARSSSDLGRQIHCQIIKYGFERYVFVGSPLVDMYSKVGLLGEARQ 195


>ref|XP_012827479.1| PREDICTED: pentatricopeptide repeat-containing protein At4g20770
           [Erythranthe guttatus] gi|604299153|gb|EYU19088.1|
           hypothetical protein MIMGU_mgv1a001760mg [Erythranthe
           guttata]
          Length = 763

 Score = 87.8 bits (216), Expect = 3e-15
 Identities = 42/112 (37%), Positives = 72/112 (64%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           +CG  + A ++F T+P+ D+V   SML GL+    +++A   F+ M+  + +   E +Y+
Sbjct: 466 KCGKIEAAKRIFNTVPQYDIVCWNSMLSGLSLNSLDKEAFTFFQLML-GKGMSPTEFSYA 524

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
            VL+ C+ L  L  G+Q+H  +VK+   +DV+VGTGL+D+Y KC ++D AR+
Sbjct: 525 TVLNCCSSLSSLSQGRQVHGLIVKNGYANDVYVGTGLIDMYCKCGDVDGARQ 576



 Score = 62.4 bits (150), Expect = 1e-07
 Identities = 29/113 (25%), Positives = 71/113 (62%)
 Frame = -2

Query: 339 LRCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAY 160
           L+ G  +   ++F ++    + +  ++L G ++   + +A+ +F++M + R +  +   +
Sbjct: 364 LKSGDVETGLRIFNSMSLPSLTSWNAILSGFSQNEYHWEAVMLFREM-QFRKVRPDRTTF 422

Query: 159 SCVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
           + +L  CAG+  L GG+QIHA ++KS + +D++V +G++ +Y KC ++++A++
Sbjct: 423 AIILSCCAGMGLLEGGKQIHASLLKSDVSTDLYVASGMIGVYSKCGKIEAAKR 475


>ref|XP_007029475.1| Pentatricopeptide repeat superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590638741|ref|XP_007029476.1|
           Pentatricopeptide repeat superfamily protein, putative
           isoform 1 [Theobroma cacao]
           gi|590638744|ref|XP_007029477.1| Pentatricopeptide
           repeat superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590638748|ref|XP_007029478.1|
           Pentatricopeptide repeat superfamily protein, putative
           isoform 1 [Theobroma cacao] gi|508718080|gb|EOY09977.1|
           Pentatricopeptide repeat superfamily protein, putative
           isoform 1 [Theobroma cacao] gi|508718081|gb|EOY09978.1|
           Pentatricopeptide repeat superfamily protein, putative
           isoform 1 [Theobroma cacao] gi|508718082|gb|EOY09979.1|
           Pentatricopeptide repeat superfamily protein, putative
           isoform 1 [Theobroma cacao] gi|508718083|gb|EOY09980.1|
           Pentatricopeptide repeat superfamily protein, putative
           isoform 1 [Theobroma cacao]
          Length = 777

 Score = 87.0 bits (214), Expect = 4e-15
 Identities = 44/112 (39%), Positives = 69/112 (61%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           +CG   +A  +F  +PE D+V   SM+ GLT    +++A  +FK M +   ++  E +Y+
Sbjct: 468 KCGKIKMAECIFSYVPELDIVCWNSMIAGLTLNSLDKEAFMLFKQMQQG-GMLPTEFSYT 526

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
            +L  CA L   F G+Q+H+++VK    + VFVGT LVD+Y KC ++D ARK
Sbjct: 527 AILSCCAKLSSSFQGRQVHSQIVKDGFMNYVFVGTALVDMYCKCGDIDGARK 578



 Score = 63.2 bits (152), Expect = 7e-08
 Identities = 30/110 (27%), Positives = 68/110 (61%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           +CG  + A +MF ++    V++  +++ G ++   +++A+ +F++M + +N+  +    +
Sbjct: 367 KCGDVETARRMFDSMLCPSVISWNAIISGYSQNENHKEAIELFREM-QFQNVKPDRTTVA 425

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSA 7
            +L +CAG+  L GG+Q+HA   K+ + +D +V +GL+ +Y KC ++  A
Sbjct: 426 VILGSCAGMEFLEGGKQVHAASQKAALYTDNYVASGLIGMYSKCGKIKMA 475



 Score = 60.8 bits (146), Expect = 3e-07
 Identities = 35/112 (31%), Positives = 59/112 (52%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           + G    A K+F  +PER+V +  +++  + + G  EKAL ++K MV     +     ++
Sbjct: 83  KAGNLTFARKVFEQMPERNVASWNNLISLMVKNGFQEKALDVYKLMV-FEGFLPTHVTFA 141

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
            VL AC  +  L  G++ H  V+K  +  ++FV  GL+ +Y KC  M  A K
Sbjct: 142 SVLSACGSVVHLELGKRCHGLVIKIGLDKNIFVCNGLLSVYAKCGVMKEAIK 193


>ref|XP_012088740.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At5g13230, mitochondrial [Jatropha curcas]
           gi|802754635|ref|XP_012088741.1| PREDICTED: putative
           pentatricopeptide repeat-containing protein At5g13230,
           mitochondrial [Jatropha curcas]
          Length = 830

 Score = 86.7 bits (213), Expect = 6e-15
 Identities = 43/111 (38%), Positives = 66/111 (59%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           +CG  + + ++FV +P R+ VT  +M+ G  ++G  EKAL +FK M+E + +   E  YS
Sbjct: 404 KCGRVENSMELFVELPNRNDVTWNTMIVGYVQSGSGEKALSLFKTMLECQ-VQATEVTYS 462

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSAR 4
             L ACA L  +  G QIH+  VK++   D+ VG  L+D+Y KC  +  AR
Sbjct: 463 STLRACASLAAMEPGIQIHSLSVKTLYDKDIVVGNALIDMYAKCGSIKDAR 513



 Score = 58.9 bits (141), Expect = 1e-06
 Identities = 30/111 (27%), Positives = 68/111 (61%)
 Frame = -2

Query: 339 LRCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAY 160
           ++ G T  A ++F  +P+ DV+  + M+    ++ ++++A+ +F  M ++  ++ N+  +
Sbjct: 302 IKSGDTGGAVQVFEEMPKIDVIPWSFMIARYAQSNQSKEAVDLFCQMRQAF-VLPNQFTF 360

Query: 159 SCVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSA 7
           + VL ACA +  L  G+QIH+ ++K  +  ++FV   L+D+Y KC  ++++
Sbjct: 361 ASVLQACATMESLDFGRQIHSHILKVGLDINLFVANALMDVYAKCGRVENS 411


>gb|KDP23280.1| hypothetical protein JCGZ_23113 [Jatropha curcas]
          Length = 773

 Score = 86.7 bits (213), Expect = 6e-15
 Identities = 43/111 (38%), Positives = 66/111 (59%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           +CG  + + ++FV +P R+ VT  +M+ G  ++G  EKAL +FK M+E + +   E  YS
Sbjct: 347 KCGRVENSMELFVELPNRNDVTWNTMIVGYVQSGSGEKALSLFKTMLECQ-VQATEVTYS 405

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSAR 4
             L ACA L  +  G QIH+  VK++   D+ VG  L+D+Y KC  +  AR
Sbjct: 406 STLRACASLAAMEPGIQIHSLSVKTLYDKDIVVGNALIDMYAKCGSIKDAR 456



 Score = 58.9 bits (141), Expect = 1e-06
 Identities = 30/111 (27%), Positives = 68/111 (61%)
 Frame = -2

Query: 339 LRCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAY 160
           ++ G T  A ++F  +P+ DV+  + M+    ++ ++++A+ +F  M ++  ++ N+  +
Sbjct: 245 IKSGDTGGAVQVFEEMPKIDVIPWSFMIARYAQSNQSKEAVDLFCQMRQAF-VLPNQFTF 303

Query: 159 SCVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSA 7
           + VL ACA +  L  G+QIH+ ++K  +  ++FV   L+D+Y KC  ++++
Sbjct: 304 ASVLQACATMESLDFGRQIHSHILKVGLDINLFVANALMDVYAKCGRVENS 354


>ref|XP_010930665.1| PREDICTED: pentatricopeptide repeat-containing protein At2g13600
           [Elaeis guineensis]
          Length = 690

 Score = 86.3 bits (212), Expect = 7e-15
 Identities = 45/112 (40%), Positives = 68/112 (60%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           + GC D A ++F++IPE D  + +SM+ G  + G  E+AL  F  M  + + ++N H++S
Sbjct: 100 KSGCFDEAKQLFLSIPESDQCSWSSMVSGFAQHGRFEEALDFFVAM-HAEDFVLNAHSFS 158

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
             L ACAGL     G QIHA + KS +  DV++G+ LVD+Y KC     AR+
Sbjct: 159 SALSACAGLMDSIIGVQIHALISKSRLAYDVYMGSALVDMYSKCRRPLDARR 210



 Score = 69.3 bits (168), Expect = 9e-10
 Identities = 40/111 (36%), Positives = 64/111 (57%), Gaps = 6/111 (5%)
 Frame = -2

Query: 315 AHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYSCVLHACA 136
           A  MF+ + ER++V   +++ G T+ GE+E+AL +F  + +  ++    + +  +L+ACA
Sbjct: 341 ARLMFLRMRERNIVAWNALIAGYTQNGEDEEALRLFLRL-KRESVWPTHYTFGNILNACA 399

Query: 135 GLCCLFGGQQIHARVVK------SVMGSDVFVGTGLVDLYVKCHEMDSARK 1
            L  L  GQQ  A V+K      S   SD+FVG  LVD+Y+KC  +D   K
Sbjct: 400 NLADLSLGQQAQAHVLKHGFRFESGPESDIFVGNSLVDMYLKCGSIDDGGK 450



 Score = 61.6 bits (148), Expect = 2e-07
 Identities = 37/105 (35%), Positives = 59/105 (56%), Gaps = 1/105 (0%)
 Frame = -2

Query: 315 AHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYSCVLHACA 136
           A ++F  +P+R+VV+  S++    + G   +AL +F  M++S  L  +E   + V+ ACA
Sbjct: 208 ARRVFEGMPDRNVVSWNSLITCYEQNGPVNEALLLFVTMMDS-GLEPDEVTLASVVSACA 266

Query: 135 GLCCLFGGQQIHARVVK-SVMGSDVFVGTGLVDLYVKCHEMDSAR 4
            L  L  G QIHA+ +K      D+ +G  LVD+Y KC  +  AR
Sbjct: 267 SLMALREGMQIHAQAIKFDKFRDDLVLGNALVDMYAKCRRIREAR 311


>ref|XP_008797615.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At1g68930 [Phoenix dactylifera]
          Length = 744

 Score = 86.3 bits (212), Expect = 7e-15
 Identities = 41/108 (37%), Positives = 71/108 (65%)
 Frame = -2

Query: 339 LRCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAY 160
           LRCG  + + ++F  + ERD ++ T+M+ GLT+ G   +AL  F+DM  ++ + I+++ +
Sbjct: 216 LRCGMVEDSKQLFGEMSERDSISWTTMVTGLTQNGLELEALSFFRDM-RAQGVGIDQYTF 274

Query: 159 SCVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEM 16
             VL AC GL  L  G+Q+HA ++++   ++VFVG+ LVD+Y KC  +
Sbjct: 275 GSVLTACGGLSALEQGKQVHAYIIRTYYDNNVFVGSALVDMYSKCRSI 322



 Score = 57.4 bits (137), Expect = 4e-06
 Identities = 33/111 (29%), Positives = 55/111 (49%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           + GC     +MF ++PERD V+  S++ G    G  +KA   ++ M+       N   +S
Sbjct: 84  KSGCLYDMEQMFDSMPERDCVSWNSLVSGFAGHGSAKKAFEAYRAMLREGQAAPNRITFS 143

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSAR 4
            +L   +       G+Q+H +++K      VFVG+ LVD+Y K   +  AR
Sbjct: 144 TMLILASARSSADLGRQVHCQIIKYGFERYVFVGSPLVDMYSKVGLLGEAR 194


>ref|XP_007212977.1| hypothetical protein PRUPE_ppa022473mg [Prunus persica]
           gi|462408842|gb|EMJ14176.1| hypothetical protein
           PRUPE_ppa022473mg [Prunus persica]
          Length = 589

 Score = 86.3 bits (212), Expect = 7e-15
 Identities = 45/103 (43%), Positives = 63/103 (61%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           +C   D A K+F  +PER +V+  +M+  LT++GE EKALG+F  M    N   +E   S
Sbjct: 62  KCSLVDCAGKVFDEMPERSLVSWNTMIGSLTQSGEEEKALGLFLQMRREAN-HFSEFTVS 120

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVK 28
            VL ACA  C +F  +Q+HA  VK  M  +V+VGT L+D+Y K
Sbjct: 121 SVLCACAAKCAVFECKQLHALAVKLAMNLNVYVGTALLDVYAK 163



 Score = 65.5 bits (158), Expect = 1e-08
 Identities = 36/103 (34%), Positives = 57/103 (55%)
 Frame = -2

Query: 315 AHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYSCVLHACA 136
           A  +F ++PER  VT +SM+ G  +    E+AL  F+   +   L  N+   S  + ACA
Sbjct: 170 ASSVFASLPERSEVTWSSMVAGYVQNELYEEALMFFR-RAKMIGLKQNQFTISSAICACA 228

Query: 135 GLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSA 7
           GL  L  G+Q+HA + K+  G + F+ + L+D+Y KC  +  A
Sbjct: 229 GLAALIEGKQVHAVLCKTGFGLNKFIVSSLIDMYAKCGSIKEA 271


>ref|NP_200075.1| pentatricopeptide repeat protein MEF1 [Arabidopsis thaliana]
           gi|75180446|sp|Q9LTF4.1|PP429_ARATH RecName:
           Full=Putative pentatricopeptide repeat-containing
           protein At5g52630 gi|8953718|dbj|BAA98081.1|
           selenium-binding protein-like [Arabidopsis thaliana]
           gi|332008860|gb|AED96243.1| pentatricopeptide repeat
           protein MEF1 [Arabidopsis thaliana]
          Length = 588

 Score = 86.3 bits (212), Expect = 7e-15
 Identities = 45/104 (43%), Positives = 65/104 (62%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           +CG    A KMF  +P+R+VVT + M+ G  + GENE+AL +FK+ +   NL +N++++S
Sbjct: 163 KCGEIVYARKMFDEMPQRNVVTWSGMMYGYAQMGENEEALWLFKEAL-FENLAVNDYSFS 221

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKC 25
            V+  CA    L  G+QIH   +KS   S  FVG+ LV LY KC
Sbjct: 222 SVISVCANSTLLELGRQIHGLSIKSSFDSSSFVGSSLVSLYSKC 265


>ref|XP_010927442.1| PREDICTED: pentatricopeptide repeat-containing protein At4g02750
           [Elaeis guineensis]
          Length = 777

 Score = 85.9 bits (211), Expect = 1e-14
 Identities = 44/112 (39%), Positives = 67/112 (59%)
 Frame = -2

Query: 336 RCGCTDLAHKMFVTIPERDVVTCTSMLKGLTEAGENEKALGIFKDMVESRNLMINEHAYS 157
           + G  + AH+MF  +P+RD V+  +M+ GL++ G  E+AL +F +M  S   M N  +++
Sbjct: 351 QAGMIERAHEMFDMMPQRDSVSWAAMIAGLSQGGFGEEALRLFVEMGRSGERM-NRSSFT 409

Query: 156 CVLHACAGLCCLFGGQQIHARVVKSVMGSDVFVGTGLVDLYVKCHEMDSARK 1
           CVL  CA +  L  G Q+H R+VK+  G   FVG  L+ +Y KC  +D A K
Sbjct: 410 CVLSTCADIAMLECGTQVHGRLVKAGYGMGCFVGNALLAMYCKCGSIDEAYK 461

