BLASTX nr result

ID: Akebia24_contig00026684 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00026684
         (1039 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270439.2| PREDICTED: pentatricopeptide repeat-containi...   416   e-114
ref|XP_007034318.1| Tetratricopeptide repeat-like superfamily pr...   390   e-106
ref|XP_004139010.1| PREDICTED: pentatricopeptide repeat-containi...   357   3e-96
ref|XP_002518071.1| pentatricopeptide repeat-containing protein,...   353   8e-95
ref|XP_006420980.1| hypothetical protein CICLE_v10004495mg [Citr...   352   1e-94
ref|XP_006420979.1| hypothetical protein CICLE_v10004495mg [Citr...   352   1e-94
ref|XP_002300166.1| pentatricopeptide repeat-containing family p...   352   2e-94
ref|XP_006489829.1| PREDICTED: pentatricopeptide repeat-containi...   351   2e-94
ref|XP_006489828.1| PREDICTED: pentatricopeptide repeat-containi...   351   2e-94
ref|XP_006489827.1| PREDICTED: pentatricopeptide repeat-containi...   351   2e-94
ref|XP_006489825.1| PREDICTED: pentatricopeptide repeat-containi...   351   2e-94
ref|XP_006360955.1| PREDICTED: pentatricopeptide repeat-containi...   334   3e-89
ref|XP_002892245.1| pentatricopeptide repeat-containing protein ...   317   4e-84
ref|XP_006418090.1| hypothetical protein EUTSA_v10007007mg [Eutr...   313   1e-82
ref|NP_171976.1| pentatricopeptide repeat-containing protein [Ar...   313   1e-82
ref|XP_006306938.1| hypothetical protein CARUB_v10008505mg, part...   301   3e-79
ref|XP_004247960.1| PREDICTED: pentatricopeptide repeat-containi...   275   2e-71
emb|CBI17228.3| unnamed protein product [Vitis vinifera]              231   3e-58
ref|XP_004295870.1| PREDICTED: pentatricopeptide repeat-containi...   209   1e-51
ref|XP_007050341.1| Basic helix-loop-helix DNA-binding superfami...   208   3e-51

>ref|XP_002270439.2| PREDICTED: pentatricopeptide repeat-containing protein
           At1g04840-like [Vitis vinifera]
          Length = 677

 Score =  416 bits (1070), Expect = e-114
 Identities = 208/314 (66%), Positives = 241/314 (76%)
 Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
           E HFI LIH S T  QL QIHAQ+FLHNL                   +DY + IF+ F 
Sbjct: 40  ETHFIPLIHASNTLPQLHQIHAQIFLHNLFSNSRVVTQLISSSCSLKSLDYALSIFRCFD 99

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
            PN F+FNA IR L+ENS+F+ S+ HFVL LRLS++PDRLT PFVLKS A+L+   LG  
Sbjct: 100 HPNLFVFNALIRGLAENSRFEGSVSHFVLMLRLSIRPDRLTLPFVLKSVAALVDVGLGRC 159

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405
           LHG + KLGL+FDSFV VSLVDMYVK G LGF LQLFDE+ +RNK  SILLWN+LINGCC
Sbjct: 160 LHGGVMKLGLEFDSFVRVSLVDMYVKIGELGFGLQLFDESPQRNKAESILLWNVLINGCC 219

Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225
           K GDL KA  LFEAMPERN GSWN LINGF +N DL++A+ LF QMPEKNVVSWTTM+ G
Sbjct: 220 KVGDLSKAASLFEAMPERNAGSWNSLINGFVRNGDLDRARELFVQMPEKNVVSWTTMING 279

Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45
            SQN DHE+ALSMF+RML+EGVR NDLT+VSAL AC +IGAL+ G +IH+Y+S NGFQLN
Sbjct: 280 FSQNGDHEKALSMFWRMLEEGVRPNDLTVVSALLACTKIGALQVGERIHNYLSSNGFQLN 339

Query: 44  RAIGTSLVDMYAKC 3
           R IGT+LVDMYAKC
Sbjct: 340 RGIGTALVDMYAKC 353


>ref|XP_007034318.1| Tetratricopeptide repeat-like superfamily protein isoform 1
           [Theobroma cacao] gi|590656608|ref|XP_007034319.1|
           Tetratricopeptide repeat-like superfamily protein
           isoform 1 [Theobroma cacao]
           gi|590656611|ref|XP_007034320.1| Tetratricopeptide
           repeat-like superfamily protein isoform 1 [Theobroma
           cacao] gi|590656614|ref|XP_007034321.1|
           Tetratricopeptide repeat-like superfamily protein
           isoform 1 [Theobroma cacao] gi|508713347|gb|EOY05244.1|
           Tetratricopeptide repeat-like superfamily protein
           isoform 1 [Theobroma cacao] gi|508713348|gb|EOY05245.1|
           Tetratricopeptide repeat-like superfamily protein
           isoform 1 [Theobroma cacao] gi|508713349|gb|EOY05246.1|
           Tetratricopeptide repeat-like superfamily protein
           isoform 1 [Theobroma cacao] gi|508713350|gb|EOY05247.1|
           Tetratricopeptide repeat-like superfamily protein
           isoform 1 [Theobroma cacao]
          Length = 682

 Score =  390 bits (1002), Expect = e-106
 Identities = 198/316 (62%), Positives = 240/316 (75%)
 Frame = -3

Query: 950 PLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQH 771
           PL+ HF SLI +SKTT QL+QIHAQ+F  NLS                  I Y I +F H
Sbjct: 43  PLKTHFASLIQSSKTTLQLRQIHAQIFRRNLSSSSNLTTLLISASSSLKSIPYAISLFNH 102

Query: 770 FYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLG 591
           F+  + F+FNA IR L++NS  +SSI HF+L L L V+PD+LT+PFVLKS A L +  LG
Sbjct: 103 FHHKSIFLFNALIRGLTDNSLLESSISHFLLMLSLGVRPDKLTYPFVLKSIAGLGLRCLG 162

Query: 590 GTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILING 411
             LHG+I K G++FDSFV V+LV+MYVK   LGFALQ+FDE+ ERNK  SILLWN+LING
Sbjct: 163 LILHGRIIKSGVEFDSFVRVALVEMYVKLKELGFALQVFDESPERNKSGSILLWNVLING 222

Query: 410 CCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMV 231
            CK G+L KAMELFEA PERN+GSWN LINGF +N DL+KA  LFD+M EK+VVSWTTMV
Sbjct: 223 YCKDGNLGKAMELFEATPERNIGSWNSLINGFMRNGDLDKAVELFDEMKEKDVVSWTTMV 282

Query: 230 AGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQ 51
            G SQN DHE+ALSMF++ML+  +R NDLT+V ALSACA+IGALE+G +IHDYV  NGF+
Sbjct: 283 NGFSQNGDHEKALSMFFKMLEAALRPNDLTLVPALSACAKIGALEAGARIHDYVLENGFR 342

Query: 50  LNRAIGTSLVDMYAKC 3
           LN+AIG +LVDMYAKC
Sbjct: 343 LNKAIGAALVDMYAKC 358



 Score = 63.9 bits (154), Expect = 1e-07
 Identities = 37/146 (25%), Positives = 70/146 (47%)
 Frame = -3

Query: 797 DYTILIFQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSA 618
           D  + +F      +   +   +   S+N   + ++  F   L  +++P+ LT    L + 
Sbjct: 261 DKAVELFDEMKEKDVVSWTTMVNGFSQNGDHEKALSMFFKMLEAALRPNDLTLVPALSAC 320

Query: 617 ASLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSI 438
           A +     G  +H  + + G   +  +  +LVDMY K G +  A ++FDET ER+    I
Sbjct: 321 AKIGALEAGARIHDYVLENGFRLNKAIGAALVDMYAKCGDIQSASKVFDETKERD----I 376

Query: 437 LLWNILINGCCKGGDLKKAMELFEAM 360
           L W+++I G    G  ++A++ F+ M
Sbjct: 377 LTWSVMIWGWAIHGYYEQAIQCFKKM 402


>ref|XP_004139010.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g04840-like [Cucumis sativus]
           gi|449505311|ref|XP_004162432.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At1g04840-like [Cucumis sativus]
          Length = 679

 Score =  357 bits (917), Expect = 3e-96
 Identities = 185/315 (58%), Positives = 229/315 (72%)
 Frame = -3

Query: 947 LENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHF 768
           LE HFI LIH S +T +L+QIH QL+  N+                   +DY I IFQ F
Sbjct: 41  LETHFIDLIHASNSTHKLRQIHGQLYRCNVFSSSRVVTQFISSCSSLNSVDYAISIFQRF 100

Query: 767 YSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGG 588
              NS++FNA IR L+ENS+F+SSI  FVL L+  + PDRLTFPFVLKSAA+L    +G 
Sbjct: 101 ELKNSYLFNALIRGLAENSRFESSISFFVLMLKWKISPDRLTFPFVLKSAAALSNGGVGR 160

Query: 587 TLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGC 408
            LH  I K GL+FDSFV VSLVDMYVK   LG AL++FDE+ E  K  S+L+WN+LI+G 
Sbjct: 161 ALHCGILKFGLEFDSFVRVSLVDMYVKVEELGSALKVFDESPESVKNGSVLIWNVLIHGY 220

Query: 407 CKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVA 228
           C+ GDL KA ELF++MP+++ GSWN LINGF K  D+ +AK LF +MPEKNVVSWTTMV 
Sbjct: 221 CRMGDLVKATELFDSMPKKDTGSWNSLINGFMKMGDMGRAKELFVKMPEKNVVSWTTMVN 280

Query: 227 GLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQL 48
           G SQN D E+AL  F+ ML+EG R ND TIVSALSACA+IGAL++G++IH+Y+S NGF+L
Sbjct: 281 GFSQNGDPEKALETFFCMLEEGARPNDYTIVSALSACAKIGALDAGLRIHNYLSGNGFKL 340

Query: 47  NRAIGTSLVDMYAKC 3
           N  IGT+LVDMYAKC
Sbjct: 341 NLVIGTALVDMYAKC 355


>ref|XP_002518071.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223542667|gb|EEF44204.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 404

 Score =  353 bits (905), Expect = 8e-95
 Identities = 179/315 (56%), Positives = 225/315 (71%)
 Frame = -3

Query: 947 LENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHF 768
           +E H I LIH+SKT  QL QIH Q+ LHNLS                  I Y++ IF  +
Sbjct: 34  IETHIIPLIHSSKTALQLHQIHTQILLHNLSSSSHITAQLISSSSLRKSIAYSLSIFNSY 93

Query: 767 YSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGG 588
           +  N ++FNA IR L++N ++  SI HF+L LR  ++PD LTF FVLKS ASL +  L  
Sbjct: 94  HPKNLYLFNALIRGLTDNYRYLDSIDHFILLLRSDIKPDHLTFSFVLKSIASLSLKGLAR 153

Query: 587 TLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGC 408
            LHG I + GL+FDSFV +S+VD+YVK   +  AL++FDE+ +R    S LLWN+LINGC
Sbjct: 154 ALHGMILRCGLEFDSFVRISMVDVYVKLEEVKLALKVFDESPQRFHEGSTLLWNVLINGC 213

Query: 407 CKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVA 228
           CK GD++KA+ELFE MP RN  SWN LINGFFK  DLE+A   FD+MP K+VVSWTTMV 
Sbjct: 214 CKVGDMRKALELFEDMPLRNTASWNSLINGFFKIGDLEQAIEHFDRMPVKDVVSWTTMVN 273

Query: 227 GLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQL 48
           G SQN DHE+ALS+F RML E V+ ND TIVSALSACA+IGALE+G++IH Y+  NGF+L
Sbjct: 274 GFSQNGDHEKALSVFSRMLDEDVKPNDFTIVSALSACAKIGALEAGLRIHKYLKDNGFRL 333

Query: 47  NRAIGTSLVDMYAKC 3
           NRA+G +LVDM+AKC
Sbjct: 334 NRAVGNALVDMHAKC 348


>ref|XP_006420980.1| hypothetical protein CICLE_v10004495mg [Citrus clementina]
           gi|557522853|gb|ESR34220.1| hypothetical protein
           CICLE_v10004495mg [Citrus clementina]
          Length = 664

 Score =  352 bits (903), Expect = 1e-94
 Identities = 180/314 (57%), Positives = 224/314 (71%)
 Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
           E H ISLIH+S +T+QL+QIHAQ+ LHNL                    DY + IF HF 
Sbjct: 31  ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSTDYALSIFGHFT 90

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
             N  IFN  IR L+ENS FQS I HFV  LRLSV+P+RLT+PFV KS ASL +  LG  
Sbjct: 91  PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405
           LH  I K G+++D+FV V L DMYV+ G    A ++FDET E+NK  S+LLWN+LINGC 
Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVQLGKTRGAFKVFDETPEKNKSESVLLWNVLINGCS 210

Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225
           K G L+KA+ELF  MP++NV SW  LI+GF +  DL+KA  LF+QMPEK VVSWT M+ G
Sbjct: 211 KIGYLRKAVELFGMMPKKNVASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270

Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45
            SQN + E+AL+MF++ML  GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L 
Sbjct: 271 FSQNGEAEKALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330

Query: 44  RAIGTSLVDMYAKC 3
            AIGT+LVDMYAKC
Sbjct: 331 GAIGTALVDMYAKC 344


>ref|XP_006420979.1| hypothetical protein CICLE_v10004495mg [Citrus clementina]
           gi|557522852|gb|ESR34219.1| hypothetical protein
           CICLE_v10004495mg [Citrus clementina]
          Length = 466

 Score =  352 bits (903), Expect = 1e-94
 Identities = 180/314 (57%), Positives = 224/314 (71%)
 Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
           E H ISLIH+S +T+QL+QIHAQ+ LHNL                    DY + IF HF 
Sbjct: 31  ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSTDYALSIFGHFT 90

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
             N  IFN  IR L+ENS FQS I HFV  LRLSV+P+RLT+PFV KS ASL +  LG  
Sbjct: 91  PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405
           LH  I K G+++D+FV V L DMYV+ G    A ++FDET E+NK  S+LLWN+LINGC 
Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVQLGKTRGAFKVFDETPEKNKSESVLLWNVLINGCS 210

Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225
           K G L+KA+ELF  MP++NV SW  LI+GF +  DL+KA  LF+QMPEK VVSWT M+ G
Sbjct: 211 KIGYLRKAVELFGMMPKKNVASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270

Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45
            SQN + E+AL+MF++ML  GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L 
Sbjct: 271 FSQNGEAEKALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330

Query: 44  RAIGTSLVDMYAKC 3
            AIGT+LVDMYAKC
Sbjct: 331 GAIGTALVDMYAKC 344


>ref|XP_002300166.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|222847424|gb|EEE84971.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 719

 Score =  352 bits (902), Expect = 2e-94
 Identities = 181/318 (56%), Positives = 225/318 (70%), Gaps = 1/318 (0%)
 Frame = -3

Query: 953  TPLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQ 774
            TP E HFISLIH SKT  QL QIHAQ+ +HNLS                  I++++ +F 
Sbjct: 78   TPTEAHFISLIHGSKTILQLHQIHAQIIIHNLSSSSLITTQLISSSSLRKSINHSLAVFN 137

Query: 773  HFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRL 594
            H    N F FNA IR L+ NS F ++IFHF L LR  ++PDRLT+PFVLKS A L    L
Sbjct: 138  HHKPKNLFTFNALIRGLTTNSHFFNAIFHFRLMLRSGIKPDRLTYPFVLKSMAGLFSTEL 197

Query: 593  GGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLER-NKVSSILLWNILI 417
            G  +H  I + G++ DSFV VSLVDMYVK   LG A ++FDE+ ER +  SS LLWN+LI
Sbjct: 198  GMAIHCMILRCGIELDSFVRVSLVDMYVKVEKLGSAFKVFDESPERFDSGSSALLWNVLI 257

Query: 416  NGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTT 237
             GCCK G +KKA++LF+AMP++   SW+ LI+GF KN D+++A  LFDQMPEKNVVSWTT
Sbjct: 258  KGCCKAGSMKKAVKLFKAMPKKENVSWSTLIDGFAKNGDMDRAMELFDQMPEKNVVSWTT 317

Query: 236  MVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNG 57
            MV G S+N D E+ALSMF +ML+EGVR N  TIVSALSACA+IG LE+G++IH Y+  NG
Sbjct: 318  MVDGFSRNGDSEKALSMFSKMLEEGVRPNAFTIVSALSACAKIGGLEAGLRIHKYIKDNG 377

Query: 56   FQLNRAIGTSLVDMYAKC 3
              L  A+GT+LVDMYAKC
Sbjct: 378  LHLTEALGTALVDMYAKC 395


>ref|XP_006489829.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g04840-like isoform X5 [Citrus sinensis]
          Length = 457

 Score =  351 bits (901), Expect = 2e-94
 Identities = 181/314 (57%), Positives = 222/314 (70%)
 Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
           E H ISLIH+S +T+QL+QIHAQ+ LHNL                   IDY + IF HF 
Sbjct: 31  ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSIDYALSIFDHFT 90

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
             N  IFN  IR L+ENS FQS I HFV  LRLSV+P+RLT+PFV KS ASL +  LG  
Sbjct: 91  PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405
           LH  I K G+++D+FV V L DMYVK G    A ++FDET ERNK  S+LLWN+LINGC 
Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVKLGKTRGAFKMFDETPERNKSESVLLWNVLINGCS 210

Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225
           K G L+KA+ELF  MP++N  SW  LI+GF +  DL+KA  LF+QMPEK VVSWT M+ G
Sbjct: 211 KIGYLRKAVELFGVMPKKNAASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270

Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45
            SQN + E AL+MF++ML  GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L 
Sbjct: 271 FSQNGEAETALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330

Query: 44  RAIGTSLVDMYAKC 3
            AIGT+LV MYAKC
Sbjct: 331 GAIGTALVHMYAKC 344



 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 47/199 (23%), Positives = 91/199 (45%), Gaps = 39/199 (19%)
 Frame = -3

Query: 533 VSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKGGDLKKAMELFEAMPE 354
           VSL+D +++ G L  A +LF++  E+  VS    W  +ING  + G+ + A+ +F  M +
Sbjct: 234 VSLIDGFMRKGDLKKAGELFEQMPEKGVVS----WTAMINGFSQNGEAETALAMFFQMLD 289

Query: 353 RNVGS---------------------------------------WNCLINGFFKNKDLEK 291
             V +                                          L++ + K  ++E 
Sbjct: 290 AGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLKGAIGTALVHMYAKCGNIEA 349

Query: 290 AKLLFDQMPEKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACAR 111
           A L+F +  EK++++WT M+ GL+ +  +EQA+  F +M+  G+  +    ++ L+AC  
Sbjct: 350 ASLVFGETKEKDLLTWTAMIWGLAIHGRYEQAIQCFKKMMYSGIEPDGTVFLAILTACWY 409

Query: 110 IGALESGVQIHDYVSRNGF 54
            G ++  +   D +S + F
Sbjct: 410 SGQVKLALNFFDSMSFDYF 428


>ref|XP_006489828.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g04840-like isoform X4 [Citrus sinensis]
          Length = 458

 Score =  351 bits (901), Expect = 2e-94
 Identities = 181/314 (57%), Positives = 222/314 (70%)
 Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
           E H ISLIH+S +T+QL+QIHAQ+ LHNL                   IDY + IF HF 
Sbjct: 31  ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSIDYALSIFDHFT 90

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
             N  IFN  IR L+ENS FQS I HFV  LRLSV+P+RLT+PFV KS ASL +  LG  
Sbjct: 91  PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405
           LH  I K G+++D+FV V L DMYVK G    A ++FDET ERNK  S+LLWN+LINGC 
Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVKLGKTRGAFKMFDETPERNKSESVLLWNVLINGCS 210

Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225
           K G L+KA+ELF  MP++N  SW  LI+GF +  DL+KA  LF+QMPEK VVSWT M+ G
Sbjct: 211 KIGYLRKAVELFGVMPKKNAASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270

Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45
            SQN + E AL+MF++ML  GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L 
Sbjct: 271 FSQNGEAETALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330

Query: 44  RAIGTSLVDMYAKC 3
            AIGT+LV MYAKC
Sbjct: 331 GAIGTALVHMYAKC 344



 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 47/199 (23%), Positives = 91/199 (45%), Gaps = 39/199 (19%)
 Frame = -3

Query: 533 VSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKGGDLKKAMELFEAMPE 354
           VSL+D +++ G L  A +LF++  E+  VS    W  +ING  + G+ + A+ +F  M +
Sbjct: 234 VSLIDGFMRKGDLKKAGELFEQMPEKGVVS----WTAMINGFSQNGEAETALAMFFQMLD 289

Query: 353 RNVGS---------------------------------------WNCLINGFFKNKDLEK 291
             V +                                          L++ + K  ++E 
Sbjct: 290 AGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLKGAIGTALVHMYAKCGNIEA 349

Query: 290 AKLLFDQMPEKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACAR 111
           A L+F +  EK++++WT M+ GL+ +  +EQA+  F +M+  G+  +    ++ L+AC  
Sbjct: 350 ASLVFGETKEKDLLTWTAMIWGLAIHGRYEQAIQCFKKMMYSGIEPDGTVFLAILTACWY 409

Query: 110 IGALESGVQIHDYVSRNGF 54
            G ++  +   D +S + F
Sbjct: 410 SGQVKLALNFFDSMSFDYF 428


>ref|XP_006489827.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g04840-like isoform X3 [Citrus sinensis]
          Length = 466

 Score =  351 bits (901), Expect = 2e-94
 Identities = 181/314 (57%), Positives = 222/314 (70%)
 Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
           E H ISLIH+S +T+QL+QIHAQ+ LHNL                   IDY + IF HF 
Sbjct: 31  ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSIDYALSIFDHFT 90

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
             N  IFN  IR L+ENS FQS I HFV  LRLSV+P+RLT+PFV KS ASL +  LG  
Sbjct: 91  PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405
           LH  I K G+++D+FV V L DMYVK G    A ++FDET ERNK  S+LLWN+LINGC 
Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVKLGKTRGAFKMFDETPERNKSESVLLWNVLINGCS 210

Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225
           K G L+KA+ELF  MP++N  SW  LI+GF +  DL+KA  LF+QMPEK VVSWT M+ G
Sbjct: 211 KIGYLRKAVELFGVMPKKNAASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270

Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45
            SQN + E AL+MF++ML  GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L 
Sbjct: 271 FSQNGEAETALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330

Query: 44  RAIGTSLVDMYAKC 3
            AIGT+LV MYAKC
Sbjct: 331 GAIGTALVHMYAKC 344



 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 47/199 (23%), Positives = 91/199 (45%), Gaps = 39/199 (19%)
 Frame = -3

Query: 533 VSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKGGDLKKAMELFEAMPE 354
           VSL+D +++ G L  A +LF++  E+  VS    W  +ING  + G+ + A+ +F  M +
Sbjct: 234 VSLIDGFMRKGDLKKAGELFEQMPEKGVVS----WTAMINGFSQNGEAETALAMFFQMLD 289

Query: 353 RNVGS---------------------------------------WNCLINGFFKNKDLEK 291
             V +                                          L++ + K  ++E 
Sbjct: 290 AGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLKGAIGTALVHMYAKCGNIEA 349

Query: 290 AKLLFDQMPEKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACAR 111
           A L+F +  EK++++WT M+ GL+ +  +EQA+  F +M+  G+  +    ++ L+AC  
Sbjct: 350 ASLVFGETKEKDLLTWTAMIWGLAIHGRYEQAIQCFKKMMYSGIEPDGTVFLAILTACWY 409

Query: 110 IGALESGVQIHDYVSRNGF 54
            G ++  +   D +S + F
Sbjct: 410 SGQVKLALNFFDSMSFDYF 428


>ref|XP_006489825.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g04840-like isoform X1 [Citrus sinensis]
           gi|568873396|ref|XP_006489826.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At1g04840-like isoform X2 [Citrus sinensis]
          Length = 664

 Score =  351 bits (901), Expect = 2e-94
 Identities = 181/314 (57%), Positives = 222/314 (70%)
 Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
           E H ISLIH+S +T+QL+QIHAQ+ LHNL                   IDY + IF HF 
Sbjct: 31  ETHIISLIHSSNSTKQLRQIHAQIILHNLFASSRITTQLISSASLHKSIDYALSIFDHFT 90

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
             N  IFN  IR L+ENS FQS I HFV  LRLSV+P+RLT+PFV KS ASL +  LG  
Sbjct: 91  PKNLHIFNVLIRGLAENSHFQSCISHFVFMLRLSVRPNRLTYPFVSKSVASLSLLSLGRG 150

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405
           LH  I K G+++D+FV V L DMYVK G    A ++FDET ERNK  S+LLWN+LINGC 
Sbjct: 151 LHCLIVKSGVEYDAFVRVHLADMYVKLGKTRGAFKMFDETPERNKSESVLLWNVLINGCS 210

Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAG 225
           K G L+KA+ELF  MP++N  SW  LI+GF +  DL+KA  LF+QMPEK VVSWT M+ G
Sbjct: 211 KIGYLRKAVELFGVMPKKNAASWVSLIDGFMRKGDLKKAGELFEQMPEKGVVSWTAMING 270

Query: 224 LSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLN 45
            SQN + E AL+MF++ML  GVRAND T+VSALSACA++GALE+GV++H+Y+S N F L 
Sbjct: 271 FSQNGEAETALAMFFQMLDAGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLK 330

Query: 44  RAIGTSLVDMYAKC 3
            AIGT+LV MYAKC
Sbjct: 331 GAIGTALVHMYAKC 344



 Score = 66.6 bits (161), Expect = 2e-08
 Identities = 47/199 (23%), Positives = 91/199 (45%), Gaps = 39/199 (19%)
 Frame = -3

Query: 533 VSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKGGDLKKAMELFEAMPE 354
           VSL+D +++ G L  A +LF++  E+  VS    W  +ING  + G+ + A+ +F  M +
Sbjct: 234 VSLIDGFMRKGDLKKAGELFEQMPEKGVVS----WTAMINGFSQNGEAETALAMFFQMLD 289

Query: 353 RNVGS---------------------------------------WNCLINGFFKNKDLEK 291
             V +                                          L++ + K  ++E 
Sbjct: 290 AGVRANDFTVVSALSACAKVGALEAGVRVHNYISCNDFGLKGAIGTALVHMYAKCGNIEA 349

Query: 290 AKLLFDQMPEKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACAR 111
           A L+F +  EK++++WT M+ GL+ +  +EQA+  F +M+  G+  +    ++ L+AC  
Sbjct: 350 ASLVFGETKEKDLLTWTAMIWGLAIHGRYEQAIQCFKKMMYSGIEPDGTVFLAILTACWY 409

Query: 110 IGALESGVQIHDYVSRNGF 54
            G ++  +   D +S + F
Sbjct: 410 SGQVKLALNFFDSMSFDYF 428


>ref|XP_006360955.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g04840-like isoform X1 [Solanum tuberosum]
           gi|565390461|ref|XP_006360956.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At1g04840-like isoform X2 [Solanum tuberosum]
          Length = 666

 Score =  334 bits (857), Expect = 3e-89
 Identities = 167/315 (53%), Positives = 222/315 (70%), Gaps = 1/315 (0%)
 Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
           E HFISLIH+SK T QL+QIH Q+   NLS                  I+Y + IF  F 
Sbjct: 28  EPHFISLIHSSKNTLQLQQIHGQIIRKNLSSNSRIVTQLISSASLHKSINYGLSIFNCFL 87

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
             N F+FN  IR L ENS F+ SI +F   +++ V+PD+LT+PFVLKS  +L    +GG 
Sbjct: 88  DKNVFLFNVLIRGLKENSLFEKSILYFRKMVKMGVRPDKLTYPFVLKSVTALGEKGVGGG 147

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCC 405
           +H  + K+GL++D+FV V LV++YVK   + FALQLFDE+ ERNKV S++LWN++INGCC
Sbjct: 148 VHCGVLKVGLEYDTFVRVCLVELYVKVELVDFALQLFDESPERNKVESVILWNVVINGCC 207

Query: 404 KGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMP-EKNVVSWTTMVA 228
           K G +  A+ LFE MPERNVGSWN LI+G  +N +++KA  LFD+MP EKNVVSWT M+ 
Sbjct: 208 KIGRMSNALALFEEMPERNVGSWNTLISGLLRNGEVDKAMELFDEMPNEKNVVSWTCMIH 267

Query: 227 GLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQL 48
           GL  N  H++AL +F++M++EGV+ N LT+VSALSACA+ GALE+G +IHD +  NG  L
Sbjct: 268 GLMLNGLHQKALDLFFKMVEEGVKPNGLTVVSALSACAKTGALEAGKKIHDNIMNNGLHL 327

Query: 47  NRAIGTSLVDMYAKC 3
           N A+G +L+DMYAKC
Sbjct: 328 NAAVGNALLDMYAKC 342


>ref|XP_002892245.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297338087|gb|EFH68504.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 664

 Score =  317 bits (813), Expect = 4e-84
 Identities = 161/325 (49%), Positives = 217/325 (66%)
 Frame = -3

Query: 977 YYINMNRCTPLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXI 798
           Y+    R +P E+HFISLIHT K T  L+ +HA +    +                    
Sbjct: 18  YFPADRRASPDESHFISLIHTCKDTVSLRLVHAHILRRGVLSSRVAAQLVSCSSLLKSP- 76

Query: 797 DYTILIFQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSA 618
           DY++ IF++    N F+FNA IR L+EN++F+ S+ HF+L L L V+PDRLTFPFVLKS 
Sbjct: 77  DYSLSIFRNSEERNPFVFNALIRGLTENARFECSVRHFILMLTLGVKPDRLTFPFVLKSN 136

Query: 617 ASLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSI 438
           + L    LG  LH    K  +D DSFV VSLVDMY K G L  A Q+F+ET +R K  SI
Sbjct: 137 SKLGFRWLGRALHAATLKNFVDCDSFVRVSLVDMYAKTGQLNHAFQVFEETPDRIKKESI 196

Query: 437 LLWNILINGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEK 258
           LLWN+L+NG C+  D++ A  LF +MPERN GSW+ LI G+  N +L +AK LF+ MPEK
Sbjct: 197 LLWNVLVNGYCRAKDMQMATTLFRSMPERNSGSWSTLIKGYVDNGELNRAKQLFELMPEK 256

Query: 257 NVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIH 78
           NVVSWTT++ G SQ  D+E A+S ++ ML++G++ N+ T+ + LSAC++ GAL SG++IH
Sbjct: 257 NVVSWTTLINGFSQTGDYETAISTYFEMLEKGLKPNEYTVAAVLSACSKSGALGSGIRIH 316

Query: 77  DYVSRNGFQLNRAIGTSLVDMYAKC 3
            Y+  NG +L+RAIGTSL+DMYAKC
Sbjct: 317 GYILDNGIKLDRAIGTSLLDMYAKC 341


>ref|XP_006418090.1| hypothetical protein EUTSA_v10007007mg [Eutrema salsugineum]
           gi|557095861|gb|ESQ36443.1| hypothetical protein
           EUTSA_v10007007mg [Eutrema salsugineum]
          Length = 665

 Score =  313 bits (801), Expect = 1e-82
 Identities = 157/320 (49%), Positives = 216/320 (67%)
 Frame = -3

Query: 962 NRCTPLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTIL 783
           +R +P E+H ISLIH  K T  L+++HA +    +                    DY++ 
Sbjct: 23  HRASPDESHIISLIHACKDTVCLRRVHAYILRRGVLSSRVAAQLVSSSSLLKSP-DYSLS 81

Query: 782 IFQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLV 603
           IF++    N F+FNA IR L+E+++F+ S+ HF+L LRL V+PDRLTFPFVLKS + L  
Sbjct: 82  IFRYLKEKNLFVFNALIRGLAESARFKCSVRHFILMLRLGVRPDRLTFPFVLKSNSKLGF 141

Query: 602 PRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNI 423
             LG  LH    K  +D DSFV VSLVDMY K G L +A Q+FDE+ +  K+  ILLWN+
Sbjct: 142 RWLGRALHAAALKDSVDCDSFVRVSLVDMYAKTGGLKYAFQVFDESPDWIKMERILLWNV 201

Query: 422 LINGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSW 243
           LING C+  D++ A  LF +MPERN GSW+ LI G+  N DL +A+ LF+ MPEK+VVSW
Sbjct: 202 LINGYCRAKDMQMATTLFGSMPERNSGSWSTLIKGYVDNGDLNRARQLFEVMPEKSVVSW 261

Query: 242 TTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSR 63
           TT++ G SQN D+E A+S ++ ML+EG++ N+ T+ + LSAC++ GAL SG++IH Y+  
Sbjct: 262 TTLINGFSQNGDYESAISTYFEMLEEGMKPNEYTVAAVLSACSKSGALGSGIRIHGYILD 321

Query: 62  NGFQLNRAIGTSLVDMYAKC 3
           NG  L+RAIGT+L+DMYAKC
Sbjct: 322 NGINLDRAIGTALIDMYAKC 341


>ref|NP_171976.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75192500|sp|Q9MAT2.1|PPR10_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At1g04840 gi|7211995|gb|AAF40466.1|AC004809_24 F13M7.17
           [Arabidopsis thaliana] gi|332189629|gb|AEE27750.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           thaliana]
          Length = 665

 Score =  313 bits (801), Expect = 1e-82
 Identities = 159/325 (48%), Positives = 217/325 (66%)
 Frame = -3

Query: 977 YYINMNRCTPLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXI 798
           Y+    + +P E+HFISLIH  K T  L+ +HAQ+    +                    
Sbjct: 18  YFPADRQASPDESHFISLIHACKDTASLRHVHAQILRRGVLSSRVAAQLVSCSSLLKSP- 76

Query: 797 DYTILIFQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSA 618
           DY++ IF++    N F+ NA IR L+EN++F+SS+ HF+L LRL V+PDRLTFPFVLKS 
Sbjct: 77  DYSLSIFRNSEERNPFVLNALIRGLTENARFESSVRHFILMLRLGVKPDRLTFPFVLKSN 136

Query: 617 ASLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSI 438
           + L    LG  LH    K  +D DSFV +SLVDMY K G L  A Q+F+E+ +R K  SI
Sbjct: 137 SKLGFRWLGRALHAATLKNFVDCDSFVRLSLVDMYAKTGQLKHAFQVFEESPDRIKKESI 196

Query: 437 LLWNILINGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEK 258
           L+WN+LING C+  D+  A  LF +MPERN GSW+ LI G+  + +L +AK LF+ MPEK
Sbjct: 197 LIWNVLINGYCRAKDMHMATTLFRSMPERNSGSWSTLIKGYVDSGELNRAKQLFELMPEK 256

Query: 257 NVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIH 78
           NVVSWTT++ G SQ  D+E A+S ++ ML++G++ N+ TI + LSAC++ GAL SG++IH
Sbjct: 257 NVVSWTTLINGFSQTGDYETAISTYFEMLEKGLKPNEYTIAAVLSACSKSGALGSGIRIH 316

Query: 77  DYVSRNGFQLNRAIGTSLVDMYAKC 3
            Y+  NG +L+RAIGT+LVDMYAKC
Sbjct: 317 GYILDNGIKLDRAIGTALVDMYAKC 341


>ref|XP_006306938.1| hypothetical protein CARUB_v10008505mg, partial [Capsella rubella]
           gi|482575649|gb|EOA39836.1| hypothetical protein
           CARUB_v10008505mg, partial [Capsella rubella]
          Length = 672

 Score =  301 bits (771), Expect = 3e-79
 Identities = 155/319 (48%), Positives = 212/319 (66%)
 Frame = -3

Query: 959 RCTPLENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILI 780
           R +P E+HFISLIH  K T  L+++HAQ+    +                    DY + I
Sbjct: 31  RASPDESHFISLIHACKDTVSLRRVHAQILRRGVLSSRVAAQLVSCSGLLQSP-DYCLSI 89

Query: 779 FQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVP 600
           F++F   N F+FN  IR L+EN++  SS+ HF+L LRL V+PDRLTFPFVLKS + L   
Sbjct: 90  FRNFEEKNLFVFNVLIRGLTENARSASSVRHFILMLRLGVRPDRLTFPFVLKSNSKLGFR 149

Query: 599 RLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNIL 420
            LG  LH    K  +D DSFV VSLVDMY K   L +A Q+FDE+ +R K  S LL N+L
Sbjct: 150 WLGRALHAATLKNFVDCDSFVRVSLVDMYAKTRQLNYAFQVFDESPDRIKKESTLLSNVL 209

Query: 419 INGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWT 240
           I G C+  D++ A +LF +MPERN GSW+ LI G+     L +AK LF+ MPEK+VV+WT
Sbjct: 210 IKGYCRAKDMQMATKLFRSMPERNSGSWSTLIKGYADCSQLNRAKQLFELMPEKHVVTWT 269

Query: 239 TMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRN 60
           T++ G SQN  +E A+S ++ ML++G++ N+ T+ +ALSAC++ GAL SG++IH Y+  N
Sbjct: 270 TLINGFSQNGYYETAISTYFEMLEKGLKPNEYTVAAALSACSKSGALGSGIRIHAYILDN 329

Query: 59  GFQLNRAIGTSLVDMYAKC 3
           G +L+RAIGT+L+DMYAKC
Sbjct: 330 GIRLDRAIGTALIDMYAKC 348


>ref|XP_004247960.1| PREDICTED: pentatricopeptide repeat-containing protein
           At1g04840-like [Solanum lycopersicum]
          Length = 547

 Score =  275 bits (703), Expect = 2e-71
 Identities = 132/223 (59%), Positives = 175/223 (78%), Gaps = 1/223 (0%)
 Frame = -3

Query: 668 LSVQPDRLTFPFVLKSAASLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGF 489
           + V+PD+LT+PFVLKS  +L   R+GG +H  I K+GL++D+FV V LV+MYVK   + F
Sbjct: 1   MGVRPDKLTYPFVLKSVTALGDKRVGGVVHCGILKMGLEYDTFVRVCLVEMYVKAELVDF 60

Query: 488 ALQLFDETLERNKVSSILLWNILINGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFK 309
           ALQLFDE+ ERNKV S++LWN++INGCCK G + KA+ LFE MPERNVGSWN LI+G  +
Sbjct: 61  ALQLFDESSERNKVESVILWNVVINGCCKIGRVSKALALFEEMPERNVGSWNTLISGLLR 120

Query: 308 NKDLEKAKLLFDQMP-EKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVS 132
           N +++KA  LFD+M  EKNVVSWT M+ GL  NE H++AL +F++M++EGV+ N LT+VS
Sbjct: 121 NGEVDKAMELFDEMTNEKNVVSWTCMIHGLMLNELHQKALDLFFKMVEEGVKPNGLTVVS 180

Query: 131 ALSACARIGALESGVQIHDYVSRNGFQLNRAIGTSLVDMYAKC 3
           ALSACA+ GALE+G +IHD +  NG  LN A+G +L+DMYAKC
Sbjct: 181 ALSACAKTGALEAGKKIHDNIVNNGLHLNAAVGNALLDMYAKC 223


>emb|CBI17228.3| unnamed protein product [Vitis vinifera]
          Length = 590

 Score =  231 bits (590), Expect = 3e-58
 Identities = 128/304 (42%), Positives = 180/304 (59%), Gaps = 14/304 (4%)
 Frame = -3

Query: 944 ENHFISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFY 765
           E HFI LIH S T  QL QIHAQ+FLHNL                   +DY + IF+ F 
Sbjct: 40  ETHFIPLIHASNTLPQLHQIHAQIFLHNLFSNSRVVTQLISSSCSLKSLDYALSIFRCFD 99

Query: 764 SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGT 585
            PN F+FNA IR L+ENS+F+ S+ HFVL LRLS++PDRLT PFVLKS A+L+   LG  
Sbjct: 100 HPNLFVFNALIRGLAENSRFEGSVSHFVLMLRLSIRPDRLTLPFVLKSVAALVDVGLGRC 159

Query: 584 LHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWN------- 426
           LHG + KLGL+FDSFV VSLVDMYVK G LGF LQLFDE+ +RNK  SILLWN       
Sbjct: 160 LHGGVMKLGLEFDSFVRVSLVDMYVKIGELGFGLQLFDESPQRNKAESILLWNGVRPNDL 219

Query: 425 ---ILINGCCKGGDLKKAMELFEAMP----ERNVGSWNCLINGFFKNKDLEKAKLLFDQM 267
                +  C K G L+    +   +     + N G    L++ + K  +++ A  +F + 
Sbjct: 220 TVVSALLACTKIGALQVGERIHNYLSSNGFQLNRGIGTALVDMYAKCGNIKSASRVFVET 279

Query: 266 PEKNVVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGV 87
             K++++W+ M+ G + +   +QAL  F +M   G+  +++  ++ L+AC+  G ++ G+
Sbjct: 280 KGKDLLTWSVMIWGWAIHGCFDQALQCFVKMKSAGINPDEVIFLAILTACSHSGNVDQGL 339

Query: 86  QIHD 75
              +
Sbjct: 340 NFFE 343



 Score = 98.2 bits (243), Expect = 5e-18
 Identities = 79/229 (34%), Positives = 109/229 (47%), Gaps = 24/229 (10%)
 Frame = -3

Query: 617 ASLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSI 438
           AS  +P+L   +H QI    L  +S V+  L+       SL +AL +F      N    +
Sbjct: 49  ASNTLPQLH-QIHAQIFLHNLFSNSRVVTQLISSSCSLKSLDYALSIFRCFDHPN----L 103

Query: 437 LLWNILINGCCKGGDLKKAMELFEAM------PER--------------NVGSWNCLING 318
            ++N LI G  +    + ++  F  M      P+R              +VG   CL  G
Sbjct: 104 FVFNALIRGLAENSRFEGSVSHFVLMLRLSIRPDRLTLPFVLKSVAALVDVGLGRCLHGG 163

Query: 317 FFKNKDLEKAKLLFDQMPEKNVVSWTTMVA----GLSQNEDHEQALSMFYRMLKEGVRAN 150
             K        L FD     ++V     +     GL   ++  Q       +L  GVR N
Sbjct: 164 VMK------LGLEFDSFVRVSLVDMYVKIGELGFGLQLFDESPQRNKAESILLWNGVRPN 217

Query: 149 DLTIVSALSACARIGALESGVQIHDYVSRNGFQLNRAIGTSLVDMYAKC 3
           DLT+VSAL AC +IGAL+ G +IH+Y+S NGFQLNR IGT+LVDMYAKC
Sbjct: 218 DLTVVSALLACTKIGALQVGERIHNYLSSNGFQLNRGIGTALVDMYAKC 266


>ref|XP_004295870.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g48910-like [Fragaria vesca subsp. vesca]
          Length = 729

 Score =  209 bits (533), Expect = 1e-51
 Identities = 110/264 (41%), Positives = 168/264 (63%), Gaps = 3/264 (1%)
 Frame = -3

Query: 785 LIFQHFY-SPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRL--SVQPDRLTFPFVLKSAA 615
           LIF+HF  +PN F +NA +++ ++N+ +  +I +F  +L    +  PD  TF  VLK+ A
Sbjct: 146 LIFRHFLETPNIFAYNALLKAFAQNNDWHHTILYFNSQLLSPNAPTPDEYTFTSVLKACA 205

Query: 614 SLLVPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSIL 435
            LL    GG +H  + K G + + FV  SL DMY KFG +G A +LFDE   R+ VS   
Sbjct: 206 GLLRVTEGGKVHCLVTKFGCEENLFVRNSLTDMYFKFGKVGVAQKLFDEMRVRDVVS--- 262

Query: 434 LWNILINGCCKGGDLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKN 255
            WN L+ G C  G++ +A  +F+ M E++  SW+ +I+ + K  +LE+A+ LFD +P++N
Sbjct: 263 -WNTLVAGYCVSGEVGEARRVFDGMVEKSSFSWSTMISAYAKLGELEEAQRLFDAVPQRN 321

Query: 254 VVSWTTMVAGLSQNEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHD 75
           VVSW  M+AG +QNE +++A+ +F  M + G+  ND+T+VS LSACA +GAL+ G  I  
Sbjct: 322 VVSWNAMIAGYAQNEKYDEAVGLFREMQECGLAPNDVTLVSVLSACAHLGALDLGKWIDR 381

Query: 74  YVSRNGFQLNRAIGTSLVDMYAKC 3
           ++ R+G  L   +G +L DMYAKC
Sbjct: 382 FIKRSGMDLGLFLGNALADMYAKC 405



 Score = 62.4 bits (150), Expect = 3e-07
 Identities = 53/238 (22%), Positives = 97/238 (40%), Gaps = 41/238 (17%)
 Frame = -3

Query: 758  NSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGTLH 579
            N   +NA I   ++N ++  ++  F       + P+ +T   VL + A L    LG  + 
Sbjct: 321  NVVSWNAMIAGYAQNEKYDEAVGLFREMQECGLAPNDVTLVSVLSACAHLGALDLGKWID 380

Query: 578  GQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKG 399
              I + G+D   F+  +L DMY K G +  A ++F+   ER+ +S    W+I+I G    
Sbjct: 381  RFIKRSGMDLGLFLGNALADMYAKCGCITEARRVFNNMQERDVIS----WSIIITGLAMN 436

Query: 398  GDLKKAMELFEAMPER----------------------------------------NVGS 339
            G   +A E F+ M E                                          +  
Sbjct: 437  GHADQAFECFDKMIEHGLKPNEITFMGLLTACTHAGLVDKGLEYFNMMEKAFGISPKIEH 496

Query: 338  WNCLINGFFKNKDLEKAKLLFDQMPEK-NVVSWTTMVAGLSQNEDHEQALSMFYRMLK 168
            + C+++   +   L KA+ + + MP K NV+ W  ++ G    +D ++   +  R+L+
Sbjct: 497  YGCVVDLLSRASRLAKAEDMINSMPMKPNVIVWGALLGGCRTYKDTDRGERVVRRILE 554


>ref|XP_007050341.1| Basic helix-loop-helix DNA-binding superfamily protein [Theobroma
           cacao] gi|508702602|gb|EOX94498.1| Basic
           helix-loop-helix DNA-binding superfamily protein
           [Theobroma cacao]
          Length = 600

 Score =  208 bits (530), Expect = 3e-51
 Identities = 111/311 (35%), Positives = 176/311 (56%), Gaps = 1/311 (0%)
 Frame = -3

Query: 932 ISLIHTSKTTQQLKQIHAQLFLHNLSQXXXXXXXXXXXXXXXXXIDYTILIFQHFYSPNS 753
           +  I       QL+ I+A +   N +Q                 +DY IL F     PN 
Sbjct: 52  VDQIKKCSNLNQLETIYATMIKTNANQDCFLTNQFVSACATFCRMDYAILAFTQMQKPNV 111

Query: 752 FIFNAFIRSLSE-NSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLLVPRLGGTLHG 576
           F++NA I+ L   ++ FQ+  +H  + LR  V P   TF  ++K+   +     G ++HG
Sbjct: 112 FVYNALIKGLVHCHNPFQALDYHKHM-LRAGVWPSSFTFSSLVKACGLVSELGFGESVHG 170

Query: 575 QIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWNILINGCCKGG 396
           Q+ K G +   FV  +LVD Y   G    + ++FDE  +R+    +  W  +++G  K G
Sbjct: 171 QVWKHGFESHVFVQTALVDFYANVGKFAESKRVFDEMPDRD----VFAWTTMVSGFLKAG 226

Query: 395 DLKKAMELFEAMPERNVGSWNCLINGFFKNKDLEKAKLLFDQMPEKNVVSWTTMVAGLSQ 216
           DL  +  LF+ MPERN  +WN +I+G+ +  D+E A+L F+QMP K+++SWT+M+   S+
Sbjct: 227 DLVSSRRLFDEMPERNTATWNAMIDGYARVGDVESAELFFNQMPVKDIISWTSMINCYSK 286

Query: 215 NEDHEQALSMFYRMLKEGVRANDLTIVSALSACARIGALESGVQIHDYVSRNGFQLNRAI 36
           N+   +AL++F  M +  V  +++T+ S +SACA +GAL +G +IH YV +NGF L+  I
Sbjct: 287 NKQFREALAVFEEMRRNKVSPDEVTMASVISACAHLGALNTGKEIHHYVMQNGFYLDVYI 346

Query: 35  GTSLVDMYAKC 3
           G++LVDMYAKC
Sbjct: 347 GSALVDMYAKC 357



 Score = 66.2 bits (160), Expect = 2e-08
 Identities = 42/147 (28%), Positives = 70/147 (47%)
 Frame = -3

Query: 785 LIFQHFYSPNSFIFNAFIRSLSENSQFQSSIFHFVLRLRLSVQPDRLTFPFVLKSAASLL 606
           L F      +   + + I   S+N QF+ ++  F    R  V PD +T   V+ + A L 
Sbjct: 264 LFFNQMPVKDIISWTSMINCYSKNKQFREALAVFEEMRRNKVSPDEVTMASVISACAHLG 323

Query: 605 VPRLGGTLHGQIAKLGLDFDSFVLVSLVDMYVKFGSLGFALQLFDETLERNKVSSILLWN 426
               G  +H  + + G   D ++  +LVDMY K GSL  +L  F +  E+N    +  WN
Sbjct: 324 ALNTGKEIHHYVMQNGFYLDVYIGSALVDMYAKCGSLERSLLAFFKLREKN----LFCWN 379

Query: 425 ILINGCCKGGDLKKAMELFEAMPERNV 345
            +I G    G  ++A+ +F++M   +V
Sbjct: 380 SVIEGLAVHGYAQEALAMFDSMERHHV 406

