BLASTX nr result

ID: Akebia24_contig00006894 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00006894
         (2202 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285611.1| PREDICTED: pentatricopeptide repeat-containi...   897   0.0  
gb|EXC01179.1| hypothetical protein L484_025557 [Morus notabilis]     885   0.0  
ref|XP_007018302.1| Pentatricopeptide repeat (PPR-like) superfam...   883   0.0  
ref|XP_004288876.1| PREDICTED: pentatricopeptide repeat-containi...   880   0.0  
ref|XP_006364562.1| PREDICTED: pentatricopeptide repeat-containi...   874   0.0  
ref|XP_006433766.1| hypothetical protein CICLE_v10000605mg [Citr...   871   0.0  
emb|CAN61988.1| hypothetical protein VITISV_026694 [Vitis vinifera]   870   0.0  
ref|XP_006472405.1| PREDICTED: pentatricopeptide repeat-containi...   868   0.0  
ref|XP_002302359.2| hypothetical protein POPTR_0002s11020g [Popu...   865   0.0  
ref|XP_004240633.1| PREDICTED: pentatricopeptide repeat-containi...   863   0.0  
ref|XP_007137661.1| hypothetical protein PHAVU_009G145100g [Phas...   847   0.0  
ref|XP_004170776.1| PREDICTED: pentatricopeptide repeat-containi...   843   0.0  
ref|XP_003523769.1| PREDICTED: pentatricopeptide repeat-containi...   843   0.0  
ref|XP_003527866.1| PREDICTED: pentatricopeptide repeat-containi...   835   0.0  
ref|NP_172461.1| pentatricopeptide repeat-containing protein [Ar...   823   0.0  
ref|XP_002889775.1| pentatricopeptide repeat-containing protein ...   821   0.0  
ref|XP_006306156.1| hypothetical protein CARUB_v10011677mg, part...   785   0.0  
ref|XP_004501057.1| PREDICTED: pentatricopeptide repeat-containi...   782   0.0  
gb|EPS61251.1| hypothetical protein M569_13548, partial [Genlise...   699   0.0  
ref|XP_006856585.1| hypothetical protein AMTR_s00046p00202770 [A...   680   0.0  

>ref|XP_002285611.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            isoform 1 [Vitis vinifera]
          Length = 610

 Score =  897 bits (2319), Expect = 0.0
 Identities = 459/627 (73%), Positives = 505/627 (80%), Gaps = 2/627 (0%)
 Frame = +1

Query: 55   MDFVIPAKQTRGWFSSFNCVQRDTTEKNSLGSWVKTRVYSDFSLNNNNCSLTKVSLGYKX 234
            MD ++P  QT     S   V R+ T K  LG+  + R              + + LGYK 
Sbjct: 1    MDLIVPVSQTHEGLYSLQHVHRENTTKTCLGTRARFR--------------SNLVLGYKA 46

Query: 235  XXXXXXXXXXX--RKPYGYRNNGRTQTLVVSKLETVSSNGRLPKLAKTTHGDLXXXXXXX 408
                         +K  G RN  R Q     + +T SSN +LP   K  H  L       
Sbjct: 47   RFLALSDGTSNECKKIGGSRNQRRNQVFAALRADTFSSNDKLPYAEKNQHVHLSGGNYTS 106

Query: 409  XXXXXXXXFEDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVG 588
                     E+ E+NNHLRRLVRNGELE+GFKFL +MVY GDIPDIIPCTSLIRGFC++G
Sbjct: 107  NSSSS---IEEHESNNHLRRLVRNGELEDGFKFLESMVYRGDIPDIIPCTSLIRGFCRIG 163

Query: 589  KTKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTIL 768
            KTKKAT V+EILE SGAVPDVITYNVLISGYCKSGEIDNALQVLDRM+VAPDVVTYNTIL
Sbjct: 164  KTKKATWVMEILEQSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMNVAPDVVTYNTIL 223

Query: 769  RSLCDSGKLKQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCK 948
            R+LCDSGKLKQAMEVLD QLQ+ECYPDVITYTILIEATCKESGVGQAMKLLDEMR KG K
Sbjct: 224  RTLCDSGKLKQAMEVLDRQLQKECYPDVITYTILIEATCKESGVGQAMKLLDEMRNKGSK 283

Query: 949  PDVVTYNVLINGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKL 1128
            PDVVTYNVLINGICKEGRLDEAIKFLNNM SYGCQPNVITHNIILRSMCSTGRWMDAEKL
Sbjct: 284  PDVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKL 343

Query: 1129 LADMLGKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCK 1308
            L+DML KGCSPSVVTFNILINFLCR+GLLGRAIDILEKMP HGC PNSLSYNPLLHGFCK
Sbjct: 344  LSDMLRKGCSPSVVTFNILINFLCRQGLLGRAIDILEKMPMHGCTPNSLSYNPLLHGFCK 403

Query: 1309 EKKMDRAIQYLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLIT 1488
            EKKMDRAI+YLD+MVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+L+QL SKGC+PVLIT
Sbjct: 404  EKKMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSSKGCSPVLIT 463

Query: 1489 YNTVIDGLSKMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELE 1668
            YNTVIDGLSK+G TERAI+LL+EM  KGL+PD+ITYSSLVSGLSREGKV+EAI+FFH+LE
Sbjct: 464  YNTVIDGLSKVGKTERAIKLLDEMRRKGLKPDIITYSSLVSGLSREGKVDEAIKFFHDLE 523

Query: 1669 GLGVKPNAITYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXXXX 1848
            GLG++PNAITYNSIMLGLCK+RQTDRAID LA M+SK CKPTEATYTILIEG+AY     
Sbjct: 524  GLGIRPNAITYNSIMLGLCKSRQTDRAIDFLAYMISKRCKPTEATYTILIEGIAYEGLAK 583

Query: 1849 XXXXXXXXXCYRGVVQKSSAQHVTIKM 1929
                     C RG+V+KSSA+ V +KM
Sbjct: 584  EALDLLNELCSRGLVKKSSAEQVAVKM 610


>gb|EXC01179.1| hypothetical protein L484_025557 [Morus notabilis]
          Length = 610

 Score =  885 bits (2286), Expect = 0.0
 Identities = 459/626 (73%), Positives = 499/626 (79%), Gaps = 1/626 (0%)
 Frame = +1

Query: 55   MDFVIPAKQTRGWFSSFNCVQRDTTEKNSLGSWVKTRVYSDFSLNNNNCSLTKVSLGYKX 234
            MD V+P +QT   F SF+        K SL S   T  +          SL  V++G K 
Sbjct: 1    MDSVVPIRQTPDGFCSFH--------KTSLESTRNTSTFGG--------SLVGVAVGCKA 44

Query: 235  XXXXXXXXXXXRKPYGYRNNGRTQTLVVSKLETVSSNGRLPK-LAKTTHGDLXXXXXXXX 411
                       R   G   + R   L VSK+E +SSNGRL +   K     L        
Sbjct: 45   RLLVMSHNIQCRINEGSSKHRRKHVLAVSKIEALSSNGRLQENFEKNPFDHLNGTNSLAN 104

Query: 412  XXXXXXXFEDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGK 591
                   FE+FE N  LRR VRNGELEEGFK L  MVYHGDIPDII CTSLIRGFCK+GK
Sbjct: 105  SGHSTRNFEEFEGNKRLRRFVRNGELEEGFKVLERMVYHGDIPDIIACTSLIRGFCKIGK 164

Query: 592  TKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILR 771
            TKKA+RV+EILE+SGA PDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILR
Sbjct: 165  TKKASRVMEILEESGAAPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILR 224

Query: 772  SLCDSGKLKQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKP 951
            +LCDSGKLK+AMEVLD QL+RECYPDVITYTILIEATCKESGVGQAMKLLDEMR+KGCKP
Sbjct: 225  TLCDSGKLKEAMEVLDRQLRRECYPDVITYTILIEATCKESGVGQAMKLLDEMRSKGCKP 284

Query: 952  DVVTYNVLINGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLL 1131
            DVVTYNVLINGICKEGRLDEAIKFLNNM SYGC  NVITHNIILRSMCSTGRWMDAEKLL
Sbjct: 285  DVVTYNVLINGICKEGRLDEAIKFLNNMPSYGCHSNVITHNIILRSMCSTGRWMDAEKLL 344

Query: 1132 ADMLGKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKE 1311
            A+M+ KGCSPSVVTFNILINFLCRKGLLGRAIDILEKMP+HGC PNSLSYNPLLHGFCKE
Sbjct: 345  AEMVRKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPKHGCTPNSLSYNPLLHGFCKE 404

Query: 1312 KKMDRAIQYLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITY 1491
            KKM RAI+YLDVMVSRGCYPDIVTYNTLLTALCKDGKV++AV +L+QL SKGC+PVLITY
Sbjct: 405  KKMARAIEYLDVMVSRGCYPDIVTYNTLLTALCKDGKVDIAVVILNQLSSKGCSPVLITY 464

Query: 1492 NTVIDGLSKMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEG 1671
            NTVIDGLSK G+TERAI+LL EM  KGL+PD+ITYSSLV GLSREGKV+EAI+FFH+LEG
Sbjct: 465  NTVIDGLSKAGETERAIKLLYEMQRKGLKPDIITYSSLVGGLSREGKVDEAIKFFHDLEG 524

Query: 1672 LGVKPNAITYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXXXXX 1851
             G+KPNAIT+NSIMLGLCKARQT RAID LA MVSKGCKPTEATYTILIEGLAY      
Sbjct: 525  FGIKPNAITFNSIMLGLCKARQTSRAIDFLAHMVSKGCKPTEATYTILIEGLAYEGLAKE 584

Query: 1852 XXXXXXXXCYRGVVQKSSAQHVTIKM 1929
                    C RGVV+KSSA  V ++M
Sbjct: 585  ALELLSELCARGVVKKSSADQVAVRM 610


>ref|XP_007018302.1| Pentatricopeptide repeat (PPR-like) superfamily protein [Theobroma
            cacao] gi|508723630|gb|EOY15527.1| Pentatricopeptide
            repeat (PPR-like) superfamily protein [Theobroma cacao]
          Length = 606

 Score =  883 bits (2282), Expect = 0.0
 Identities = 434/553 (78%), Positives = 482/553 (87%)
 Frame = +1

Query: 271  KPYGYRNNGRTQTLVVSKLETVSSNGRLPKLAKTTHGDLXXXXXXXXXXXXXXXFEDFET 450
            KP G++   R + L VSK+E+   NGR   L  ++ G L               FE+  +
Sbjct: 54   KPVGFQKQRRCRVLAVSKVESAGVNGRFQNLDSSSQGHLGNGHVSSSPLKSLHNFEESGS 113

Query: 451  NNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVLEILED 630
            NN LR+ VRNGELEEGFK L  MVYHG+IPDII CTSLIRGFCK GKT+KATRV+EI+ED
Sbjct: 114  NNQLRKFVRNGELEEGFKLLEGMVYHGEIPDIIACTSLIRGFCKKGKTRKATRVMEIIED 173

Query: 631  SGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKLKQAME 810
            SGAVPDVITYNVLISGYCK+GEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKLKQAME
Sbjct: 174  SGAVPDVITYNVLISGYCKAGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKLKQAME 233

Query: 811  VLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVLINGIC 990
            V+D QLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMR++GCKPDVVTYNVL+NGIC
Sbjct: 234  VMDRQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRSRGCKPDVVTYNVLVNGIC 293

Query: 991  KEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGCSPSVV 1170
            KEGRLDEAIKFLNNM SYGCQPNVITHNIILRSMCSTGRWMDAE+LLADML KGCSPSVV
Sbjct: 294  KEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAERLLADMLRKGCSPSVV 353

Query: 1171 TFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQYLDVM 1350
            TFNILINFLCRKGLLGRAIDILEKMP+HGC PNSLSYNPLLHGFCKEKKM+RAI+YL++M
Sbjct: 354  TFNILINFLCRKGLLGRAIDILEKMPKHGCTPNSLSYNPLLHGFCKEKKMERAIEYLEIM 413

Query: 1351 VSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGLSKMGDT 1530
            VSRGCYPDIVTYNTLLTALCKDGKV+VAVE+L+QL +KGC+PVLITYNTVIDGLSK+G T
Sbjct: 414  VSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSTKGCSPVLITYNTVIDGLSKVGKT 473

Query: 1531 ERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVKPNAITYNSI 1710
            ++AI+LLEEM AKGL+PD+ITYSSLV GLSREGKV++AI+FFH+ E +G++PNAITYNSI
Sbjct: 474  DQAIKLLEEMRAKGLKPDIITYSSLVGGLSREGKVDDAIKFFHDFERMGIRPNAITYNSI 533

Query: 1711 MLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXXXXXXXXXXXXXCYRGV 1890
            MLGLCKARQTDRAID LA MV +GCKPTE+TYTILIEGLAY              C RGV
Sbjct: 534  MLGLCKARQTDRAIDFLAYMVMRGCKPTESTYTILIEGLAYEGFANEALELLNELCSRGV 593

Query: 1891 VQKSSAQHVTIKM 1929
            V+KSSA+ V +KM
Sbjct: 594  VKKSSAEQVAVKM 606


>ref|XP_004288876.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Fragaria vesca subsp. vesca]
          Length = 605

 Score =  880 bits (2273), Expect = 0.0
 Identities = 447/625 (71%), Positives = 508/625 (81%)
 Frame = +1

Query: 55   MDFVIPAKQTRGWFSSFNCVQRDTTEKNSLGSWVKTRVYSDFSLNNNNCSLTKVSLGYKX 234
            MD + P KQT   F SF+ +       +  G  +  ++       +   S ++V +G K 
Sbjct: 1    MDLIAPTKQTPDGFCSFHKLH------SGFGGRISNKL-------SRTPSSSRVFVGCKG 47

Query: 235  XXXXXXXXXXXRKPYGYRNNGRTQTLVVSKLETVSSNGRLPKLAKTTHGDLXXXXXXXXX 414
                       R   GY+ + R Q + VSK+E +SSNGRL  + KT +G+L         
Sbjct: 48   RVLVLSDSIRCRAHDGYQKHRRKQVVAVSKVEQLSSNGRLKNVEKTPYGNLNGGDSSNG- 106

Query: 415  XXXXXXFEDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKT 594
                   E+FE+NN LRRLVRNGELEEGF+ L +MVY GDIPDII CTSLIRGFCK GKT
Sbjct: 107  ------LEEFESNNQLRRLVRNGELEEGFRLLESMVYQGDIPDIIACTSLIRGFCKSGKT 160

Query: 595  KKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRS 774
            +KATR++ ILE+SGAV DVITYNVLISGYC++GEIDNAL+VLDRMSV+PDVVTYNTILR+
Sbjct: 161  RKATRIMNILEESGAVLDVITYNVLISGYCRAGEIDNALRVLDRMSVSPDVVTYNTILRT 220

Query: 775  LCDSGKLKQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPD 954
            LCDSGKLKQAMEVLD QLQRECYPDVITYTILIEATCKESGV QAMKLLDEM++KGCKPD
Sbjct: 221  LCDSGKLKQAMEVLDRQLQRECYPDVITYTILIEATCKESGVEQAMKLLDEMKSKGCKPD 280

Query: 955  VVTYNVLINGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLA 1134
            VVTYNVLINGICKEGRLDEAI+FLNNM    CQPNVITHNIILRSMCSTGRWMDAE+LLA
Sbjct: 281  VVTYNVLINGICKEGRLDEAIEFLNNMPPSDCQPNVITHNIILRSMCSTGRWMDAERLLA 340

Query: 1135 DMLGKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEK 1314
            +M+GKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMP+HGC PNSLSYNPLLHGFCKEK
Sbjct: 341  EMVGKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPKHGCTPNSLSYNPLLHGFCKEK 400

Query: 1315 KMDRAIQYLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYN 1494
            KMDRAI+YLD+MVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+L+QL SKGC+PVLITYN
Sbjct: 401  KMDRAIEYLDIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSSKGCSPVLITYN 460

Query: 1495 TVIDGLSKMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGL 1674
            TVIDGLSK+G TERAIELLEEM  KGL+PD+ITYSSLV GLSREGKV+EAI+F  +LEG+
Sbjct: 461  TVIDGLSKVGKTERAIELLEEMRKKGLKPDIITYSSLVGGLSREGKVDEAIKFVRDLEGM 520

Query: 1675 GVKPNAITYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXXXXXX 1854
            GV+PNAIT+N IMLGLCKARQT RAID LA M+SKGCKPTEATYTILIEG+AY       
Sbjct: 521  GVRPNAITFNCIMLGLCKARQTSRAIDFLAHMISKGCKPTEATYTILIEGIAYEGLAEEA 580

Query: 1855 XXXXXXXCYRGVVQKSSAQHVTIKM 1929
                   CYRGVV++SSA+ V +K+
Sbjct: 581  LELLNELCYRGVVKRSSAEQVAVKI 605


>ref|XP_006364562.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Solanum tuberosum]
          Length = 623

 Score =  874 bits (2257), Expect = 0.0
 Identities = 451/638 (70%), Positives = 510/638 (79%), Gaps = 13/638 (2%)
 Frame = +1

Query: 55   MDFVIPAKQTRGWFSSFNCVQRDTTE--------KNS---LGSWVKTRVYSDFSLNNNNC 201
            M+ ++P KQT   F SF+  +++ T         KNS   +G   K R    FS++    
Sbjct: 1    MELIVPTKQTHEGFCSFHSTRKEITVNCCNNRRFKNSSLLVGQLRKQRQDMVFSISKIET 60

Query: 202  SLTKVSLGYKXXXXXXXXXXXXRKPYGYRNNGRTQTLVVSKLETV--SSNGRLPKLAKTT 375
              +    G+              +    R N   +   +S++E +  ++NGRL  + K T
Sbjct: 61   LSSVEKRGFGSSNRRCKNSLLGGQLRKQRQN---KVFAISRIEILGGTTNGRLSSVEKKT 117

Query: 376  HGDLXXXXXXXXXXXXXXXFEDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPC 555
            +G +                E+FE+NN+LRRLVRNGELEE FK L +MVY GDIPDIIPC
Sbjct: 118  NGSISEN------------IEEFESNNYLRRLVRNGELEESFKHLESMVYRGDIPDIIPC 165

Query: 556  TSLIRGFCKVGKTKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSV 735
            TSLIRGFC++G+TKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNAL+VLDRMSV
Sbjct: 166  TSLIRGFCRIGQTKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALKVLDRMSV 225

Query: 736  APDVVTYNTILRSLCDSGKLKQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMK 915
            APDVVTYNTILRSLCDSGKLKQAM VLD  LQ+ECYPDVITYTILIEATCKESGVGQAMK
Sbjct: 226  APDVVTYNTILRSLCDSGKLKQAMHVLDRMLQKECYPDVITYTILIEATCKESGVGQAMK 285

Query: 916  LLDEMRTKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMC 1095
            LLDEMR+KGC PDVVTYNVLINGICKEGRL+EAIKFLNNM SYGCQPNVITHNIILRSMC
Sbjct: 286  LLDEMRSKGCVPDVVTYNVLINGICKEGRLNEAIKFLNNMPSYGCQPNVITHNIILRSMC 345

Query: 1096 STGRWMDAEKLLADMLGKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSL 1275
            STGRWMDAEKLLADM+ KGCSPSVVTFNILINFLCRKGLLGRAID+LEKMP++GC PNSL
Sbjct: 346  STGRWMDAEKLLADMVRKGCSPSVVTFNILINFLCRKGLLGRAIDLLEKMPKYGCTPNSL 405

Query: 1276 SYNPLLHGFCKEKKMDRAIQYLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQL 1455
            SYNPLLH FCKEKKMDRAI+YL+VMVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+L+QL
Sbjct: 406  SYNPLLHAFCKEKKMDRAIEYLEVMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQL 465

Query: 1456 GSKGCTPVLITYNTVIDGLSKMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKV 1635
              KGC+PVLITYNTVIDGLSK+G TE AIELL EM  KGLQPD+ITYSS V+GLSREGKV
Sbjct: 466  SDKGCSPVLITYNTVIDGLSKVGKTELAIELLNEMREKGLQPDIITYSSFVAGLSREGKV 525

Query: 1636 NEAIEFFHELEGLGVKPNAITYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTIL 1815
            +EAI+FFH++EGL V+PNAITYN+IMLGLCKARQTDRAID LA M+SKGCKPTE+TYTIL
Sbjct: 526  DEAIKFFHDIEGLDVRPNAITYNAIMLGLCKARQTDRAIDFLAYMISKGCKPTESTYTIL 585

Query: 1816 IEGLAYXXXXXXXXXXXXXXCYRGVVQKSSAQHVTIKM 1929
            IEG+AY              C RGVV+KSSA+ V +KM
Sbjct: 586  IEGIAYEGLAEEALELLNELCSRGVVKKSSAEQVVVKM 623


>ref|XP_006433766.1| hypothetical protein CICLE_v10000605mg [Citrus clementina]
            gi|557535888|gb|ESR47006.1| hypothetical protein
            CICLE_v10000605mg [Citrus clementina]
          Length = 619

 Score =  871 bits (2251), Expect = 0.0
 Identities = 443/629 (70%), Positives = 504/629 (80%), Gaps = 4/629 (0%)
 Frame = +1

Query: 55   MDFVIPAKQTRGWFSSFNCVQRDTTEKN---SLGSWVKTRVYSDFSLNNNNCSLTKVSLG 225
            MD ++PA      F SF    R++       S+G+    R  +          + KV +G
Sbjct: 1    MDIIVPANHAHEGFCSFQHFTRESCRNTGPFSVGAGDIARARA----------IRKVHVG 50

Query: 226  YKXXXXXXXXXXXXRKPY-GYRNNGRTQTLVVSKLETVSSNGRLPKLAKTTHGDLXXXXX 402
             K             K + G++   + +   +SK+ET+S NG++        G L     
Sbjct: 51   CKVSFSVQSADSVDFKKHKGFQKQRQNRVFAISKVETLSFNGKMKHGEAFVQGHLNNGHI 110

Query: 403  XXXXXXXXXXFEDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCK 582
                      FEDFE+NNHLRRLVRNGELEEGFKFL +MVYHGDIPDIIPCTSLIRGFCK
Sbjct: 111  SSGMENSSLNFEDFESNNHLRRLVRNGELEEGFKFLESMVYHGDIPDIIPCTSLIRGFCK 170

Query: 583  VGKTKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNT 762
            VGKT+KATRV+EI+EDSGAVPDVITYNVLISGYC+ GEIDNALQVL+RMSVAPDVVTYNT
Sbjct: 171  VGKTRKATRVMEIVEDSGAVPDVITYNVLISGYCRLGEIDNALQVLERMSVAPDVVTYNT 230

Query: 763  ILRSLCDSGKLKQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKG 942
            ILR+LCDSGKL  AMEVL  QL++ECYPDVITYTILIEATCKESGVGQAMKLLDEMR KG
Sbjct: 231  ILRTLCDSGKLNLAMEVLHKQLEKECYPDVITYTILIEATCKESGVGQAMKLLDEMRNKG 290

Query: 943  CKPDVVTYNVLINGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAE 1122
            C PDVVTYNVL+NGICKEGRLDEAIKFLN+M SYGCQPNVITHNIILRSMCSTGRWMDAE
Sbjct: 291  CIPDVVTYNVLVNGICKEGRLDEAIKFLNDMPSYGCQPNVITHNIILRSMCSTGRWMDAE 350

Query: 1123 KLLADMLGKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGF 1302
            +LLA+M+ KGCSPSVVTFNILINFLCRKGLLGRAIDILEKMP+HGC PNSLSYNP+LHGF
Sbjct: 351  RLLAEMVLKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPKHGCTPNSLSYNPVLHGF 410

Query: 1303 CKEKKMDRAIQYLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVL 1482
            CKEKKMDRAI+YL++MVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+L+QL +K C+PVL
Sbjct: 411  CKEKKMDRAIEYLEIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSNKHCSPVL 470

Query: 1483 ITYNTVIDGLSKMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHE 1662
            ITYNTVIDGLSK+G TE+A++LLEEM  KGL+PD ITYSSLV GLSREGKV+EAI+ FH+
Sbjct: 471  ITYNTVIDGLSKVGKTEQAMKLLEEMRTKGLKPDTITYSSLVGGLSREGKVDEAIKLFHD 530

Query: 1663 LEGLGVKPNAITYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXX 1842
            LE LGV+PN ITYNSI+LGLCKARQT RAID+LA MV++GCKPTEATYTILIEG+AY   
Sbjct: 531  LERLGVRPNVITYNSIILGLCKARQTYRAIDILADMVTRGCKPTEATYTILIEGIAYEGL 590

Query: 1843 XXXXXXXXXXXCYRGVVQKSSAQHVTIKM 1929
                       C RGVV+KSSA+ V +KM
Sbjct: 591  AKEALDLLNQLCSRGVVKKSSAEQVAVKM 619


>emb|CAN61988.1| hypothetical protein VITISV_026694 [Vitis vinifera]
          Length = 553

 Score =  870 bits (2249), Expect = 0.0
 Identities = 427/498 (85%), Positives = 462/498 (92%)
 Frame = +1

Query: 436  EDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVL 615
            E+ E+NNHLRRLVRNGELE+GFKFL +MVY GDIPDIIPCTSLIRGFC++GKTKKAT V+
Sbjct: 56   EEHESNNHLRRLVRNGELEDGFKFLESMVYRGDIPDIIPCTSLIRGFCRIGKTKKATWVM 115

Query: 616  EILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKL 795
            EILE SGAVPDVITYNVLISGYCKSGEIDNALQVLDRM+VAPDVVTYNTILR+LCDSGKL
Sbjct: 116  EILEQSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMNVAPDVVTYNTILRTLCDSGKL 175

Query: 796  KQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVL 975
            KQAMEVLD QLQ+ECYPDVITYTILIEATCKESGVGQAMKLLDEMR KG KPDVVTYNVL
Sbjct: 176  KQAMEVLDRQLQKECYPDVITYTILIEATCKESGVGQAMKLLDEMRNKGSKPDVVTYNVL 235

Query: 976  INGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGC 1155
            INGICKEGRLDEAIKFLNNM SYGCQPNVITHNIILRSMCSTGRWMDAEKLL+DML KGC
Sbjct: 236  INGICKEGRLDEAIKFLNNMPSYGCQPNVITHNIILRSMCSTGRWMDAEKLLSDMLRKGC 295

Query: 1156 SPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQ 1335
            SPSVVTFNILINFLCR+GLLGRAIDILEKMP HGC PNSLSYNPLLHGFCKEKKMDRAI+
Sbjct: 296  SPSVVTFNILINFLCRQGLLGRAIDILEKMPMHGCTPNSLSYNPLLHGFCKEKKMDRAIE 355

Query: 1336 YLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGLS 1515
            YLD+MVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+L+QL SKGC+PVLITYNTVIDGLS
Sbjct: 356  YLDIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSSKGCSPVLITYNTVIDGLS 415

Query: 1516 KMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVKPNAI 1695
            K+G TERAI+LL+EM  KGL+PD+ITYSSLVSGLSREGKV+EAI+FFH+LEGLG++PNAI
Sbjct: 416  KVGKTERAIKLLDEMRRKGLKPDIITYSSLVSGLSREGKVDEAIKFFHDLEGLGIRPNAI 475

Query: 1696 TYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXXXXXXXXXXXXX 1875
            TYNSIMLGLCK+RQTDRAID LA M+SK CKPTEATYTILIEG+AY              
Sbjct: 476  TYNSIMLGLCKSRQTDRAIDFLAYMISKRCKPTEATYTILIEGIAYEGLAKEALDLLNEL 535

Query: 1876 CYRGVVQKSSAQHVTIKM 1929
            C RG+V+KSSA+ V +KM
Sbjct: 536  CSRGLVKKSSAEQVAVKM 553


>ref|XP_006472405.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Citrus sinensis]
          Length = 619

 Score =  868 bits (2243), Expect = 0.0
 Identities = 442/631 (70%), Positives = 500/631 (79%), Gaps = 6/631 (0%)
 Frame = +1

Query: 55   MDFVIPAKQTRGWFSSFNCVQRDTTEKNS-----LGSWVKTRVYSDFSLNNNNCSLTKVS 219
            MD ++PA      F SF    R++           G   + R            ++ KV 
Sbjct: 1    MDVIVPANHAHEGFCSFQHFTRESCRNTGPFSAGAGDIARAR------------AIRKVH 48

Query: 220  LGYKXXXXXXXXXXXXRKPY-GYRNNGRTQTLVVSKLETVSSNGRLPKLAKTTHGDLXXX 396
            +G K             K + G++   R +   +SK+ET+S N ++        G L   
Sbjct: 49   VGCKVSFSVQSADSVDFKKHKGFQKQRRNRVFAISKVETLSFNVKMKHGEAFVQGHLNNG 108

Query: 397  XXXXXXXXXXXXFEDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGF 576
                        FEDFE+NNHLRRLVRNGELEEGFKFL +MVYHGDIPDIIPCTSLIRGF
Sbjct: 109  HISSGMENSSLNFEDFESNNHLRRLVRNGELEEGFKFLESMVYHGDIPDIIPCTSLIRGF 168

Query: 577  CKVGKTKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTY 756
            CKVGKT+KATRV+EI+EDSGAVPDVITYNVLISGYC+ GEIDNALQVL+RMSVAPDVVTY
Sbjct: 169  CKVGKTRKATRVMEIVEDSGAVPDVITYNVLISGYCRLGEIDNALQVLERMSVAPDVVTY 228

Query: 757  NTILRSLCDSGKLKQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRT 936
            NTILR+LCDSGKL  AMEVL  QL++ECYPDVITYTILIEATCKESGVGQAMKLLDEMR 
Sbjct: 229  NTILRTLCDSGKLNLAMEVLHKQLEKECYPDVITYTILIEATCKESGVGQAMKLLDEMRN 288

Query: 937  KGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMD 1116
            KGC PDVVTYNVL+NGICKEGRLDEAIKFLN+M SYGCQPNVITHNIILRSMCSTGRWMD
Sbjct: 289  KGCIPDVVTYNVLVNGICKEGRLDEAIKFLNDMPSYGCQPNVITHNIILRSMCSTGRWMD 348

Query: 1117 AEKLLADMLGKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLH 1296
            AE+LLA+M+ KGCSPSVVTFNILINFLCRKGLLGRAIDILEKMP+HGC PNSLSYNP+LH
Sbjct: 349  AERLLAEMVHKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPKHGCTPNSLSYNPVLH 408

Query: 1297 GFCKEKKMDRAIQYLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTP 1476
            GFCKEKKMDRAI+YL++MVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+L+QL +K C+P
Sbjct: 409  GFCKEKKMDRAIEYLEIMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLSNKHCSP 468

Query: 1477 VLITYNTVIDGLSKMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFF 1656
            VLITYNTVIDGLSK+G TE+A++LLEEM  KGL+PD ITYSSLV GLSREGKV+EAI+ F
Sbjct: 469  VLITYNTVIDGLSKVGKTEQAMKLLEEMRTKGLKPDTITYSSLVGGLSREGKVDEAIKLF 528

Query: 1657 HELEGLGVKPNAITYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYX 1836
            H+LE LGV+PN ITYNSIMLGLCKARQT RAID+LA MV++ CKPTEATYTILIEG+AY 
Sbjct: 529  HDLERLGVRPNVITYNSIMLGLCKARQTYRAIDILADMVTRSCKPTEATYTILIEGIAYE 588

Query: 1837 XXXXXXXXXXXXXCYRGVVQKSSAQHVTIKM 1929
                         C RGVV+KSSA+ V +KM
Sbjct: 589  GLAKEALDLLNQLCSRGVVKKSSAEQVAVKM 619


>ref|XP_002302359.2| hypothetical protein POPTR_0002s11020g [Populus trichocarpa]
            gi|550344756|gb|EEE81632.2| hypothetical protein
            POPTR_0002s11020g [Populus trichocarpa]
          Length = 637

 Score =  865 bits (2234), Expect = 0.0
 Identities = 436/633 (68%), Positives = 503/633 (79%)
 Frame = +1

Query: 31   C*LLLKKKMDFVIPAKQTRGWFSSFNCVQRDTTEKNSLGSWVKTRVYSDFSLNNNNCSLT 210
            C +L+ + MD ++P   T     SF    R TT ++S         ++   +  N+ S  
Sbjct: 39   CSILVVQVMDLIVPTSHTHEGLRSFQYFNRYTTRRSS---------FAGARIRGNDGSSR 89

Query: 211  KVSLGYKXXXXXXXXXXXXRKPYGYRNNGRTQTLVVSKLETVSSNGRLPKLAKTTHGDLX 390
            KV +G+                   R   +++   VS +ET  SNG+L  L K  +G + 
Sbjct: 90   KVHVGF-------------------RKLRKSRVFAVSGVETFRSNGKLQNLDKPLNGHMG 130

Query: 391  XXXXXXXXXXXXXXFEDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIR 570
                           E+FE+NNHLR+LVRNGELEEGF+FL NMVY G+IPDII  TSLIR
Sbjct: 131  NGHVSSSS------IEEFESNNHLRKLVRNGELEEGFRFLENMVYRGEIPDIIASTSLIR 184

Query: 571  GFCKVGKTKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVV 750
            GFCK+GKT+KATR++EI+EDSGAVPDVITYNVLISGYCK+GEIDNAL+VLDRMSVAPDVV
Sbjct: 185  GFCKIGKTRKATRIMEIIEDSGAVPDVITYNVLISGYCKAGEIDNALRVLDRMSVAPDVV 244

Query: 751  TYNTILRSLCDSGKLKQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEM 930
            TYNTILR+LCDSGKLKQAMEVLD QL++ECYPDVITYTILIEATC ESGVGQAMKLLDEM
Sbjct: 245  TYNTILRTLCDSGKLKQAMEVLDRQLEKECYPDVITYTILIEATCAESGVGQAMKLLDEM 304

Query: 931  RTKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRW 1110
             ++GCKPDVVTYNVL+NG+CKEGRLDEAIKFLN+M SYG QPNVITHNIILRSMCSTGRW
Sbjct: 305  GSRGCKPDVVTYNVLVNGMCKEGRLDEAIKFLNSMPSYGSQPNVITHNIILRSMCSTGRW 364

Query: 1111 MDAEKLLADMLGKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPL 1290
            MDAEKLL +M+ KGCSPSVVTFNILINFLCRKGLLGRAIDILEKMP HGC PNSLSYNPL
Sbjct: 365  MDAEKLLTEMVRKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPTHGCTPNSLSYNPL 424

Query: 1291 LHGFCKEKKMDRAIQYLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGC 1470
            LHGFCKEKKMDRAIQYL++MVSRGCYPDIVTYNT+LTALCKDGKV+ AVELL+QL SKGC
Sbjct: 425  LHGFCKEKKMDRAIQYLEIMVSRGCYPDIVTYNTMLTALCKDGKVDAAVELLNQLSSKGC 484

Query: 1471 TPVLITYNTVIDGLSKMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIE 1650
            +PVLITYNTVIDGLSK+G T++A+ELL EM  KGL+PDVITYSSL++GLSREGKV EAI+
Sbjct: 485  SPVLITYNTVIDGLSKVGKTDQAVELLHEMRGKGLKPDVITYSSLIAGLSREGKVEEAIK 544

Query: 1651 FFHELEGLGVKPNAITYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLA 1830
            FFH++EG GVKPNA TYNSIM GLCKA+QTDRAID LA M+SKGCKPTE +YTILIEG+A
Sbjct: 545  FFHDVEGFGVKPNAFTYNSIMFGLCKAQQTDRAIDFLAYMISKGCKPTEVSYTILIEGIA 604

Query: 1831 YXXXXXXXXXXXXXXCYRGVVQKSSAQHVTIKM 1929
                           C RGVV+KSSA+ V +++
Sbjct: 605  NEGLAKEALELLNELCSRGVVKKSSAEQVVVRL 637


>ref|XP_004240633.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Solanum lycopersicum]
          Length = 739

 Score =  863 bits (2231), Expect = 0.0
 Identities = 451/631 (71%), Positives = 505/631 (80%), Gaps = 12/631 (1%)
 Frame = +1

Query: 55   MDFVIPAKQTRGWFSSFNCVQRDTTE--------KNS---LGSWVKTRVYSDFSLNNNNC 201
            M+ ++P KQT   F SF+  ++D T         KNS   +G   K R    F ++    
Sbjct: 1    MELIVPTKQTHEGFCSFHSTRKDITVNCCNNRRFKNSSLLVGQLRKQRQDKVFPVSKIE- 59

Query: 202  SLTKVSLGYKXXXXXXXXXXXXRKPYGYRNNGRTQ-TLVVSKLETVSSNGRLPKLAKTTH 378
            +L+ V                 ++ +G  +N R + +L+V +L     N     L  TT+
Sbjct: 60   TLSSVE----------------KRGFGSSSNRRCKNSLLVGQLRKQRQNKVFAILGGTTN 103

Query: 379  GDLXXXXXXXXXXXXXXXFEDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCT 558
            G L                E+FE+NN+LRRLVRNGELEE FK L +MVY GDIPDIIPCT
Sbjct: 104  GRLSSVEKRTNGSVSEN-IEEFESNNYLRRLVRNGELEESFKHLESMVYRGDIPDIIPCT 162

Query: 559  SLIRGFCKVGKTKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVA 738
            SLIRGFC++G+TKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNAL+VLDRMSVA
Sbjct: 163  SLIRGFCRIGQTKKATRVLEILEDSGAVPDVITYNVLISGYCKSGEIDNALKVLDRMSVA 222

Query: 739  PDVVTYNTILRSLCDSGKLKQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKL 918
            PDVVTYNTILRSLCDSGKLKQAM VLD  LQ+ECYPDVITYTILIEATCKESGVGQAMKL
Sbjct: 223  PDVVTYNTILRSLCDSGKLKQAMHVLDRMLQKECYPDVITYTILIEATCKESGVGQAMKL 282

Query: 919  LDEMRTKGCKPDVVTYNVLINGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCS 1098
            LDEMR+KGC PDVVTYNVLINGICKEGRL+EAIKFLNNM SYGCQPNVITHNIILRSMCS
Sbjct: 283  LDEMRSKGCVPDVVTYNVLINGICKEGRLNEAIKFLNNMPSYGCQPNVITHNIILRSMCS 342

Query: 1099 TGRWMDAEKLLADMLGKGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLS 1278
            TGRWMDAEKLLADM+ KGCSPSVVTFNILINFLCRKGLLGRAID+LEKMP++GC PNSLS
Sbjct: 343  TGRWMDAEKLLADMVRKGCSPSVVTFNILINFLCRKGLLGRAIDLLEKMPKYGCTPNSLS 402

Query: 1279 YNPLLHGFCKEKKMDRAIQYLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLG 1458
            YNPLLH FCKEKKMDRAIQYL+VMVSRGCYPDIVTYNTLLTALCKDGKV+VAVE+L+QL 
Sbjct: 403  YNPLLHAFCKEKKMDRAIQYLEVMVSRGCYPDIVTYNTLLTALCKDGKVDVAVEILNQLS 462

Query: 1459 SKGCTPVLITYNTVIDGLSKMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVN 1638
             KGC+PVLITYNTVIDGLSK+G TE AIELL EM  KGLQPD+ITYSS V+GLSREGKV+
Sbjct: 463  DKGCSPVLITYNTVIDGLSKVGKTELAIELLNEMREKGLQPDIITYSSFVAGLSREGKVD 522

Query: 1639 EAIEFFHELEGLGVKPNAITYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILI 1818
            EAI+FFH++EGL V+PNAITYN+IMLGLCKARQTDRAID LA M+SKGCKPTE+TYTILI
Sbjct: 523  EAIKFFHDIEGLDVRPNAITYNAIMLGLCKARQTDRAIDFLAYMISKGCKPTESTYTILI 582

Query: 1819 EGLAYXXXXXXXXXXXXXXCYRGVVQKSSAQ 1911
            EG+AY              C RGVV+KSSA+
Sbjct: 583  EGIAYEGLAEEALELLNELCSRGVVKKSSAE 613


>ref|XP_007137661.1| hypothetical protein PHAVU_009G145100g [Phaseolus vulgaris]
            gi|561010748|gb|ESW09655.1| hypothetical protein
            PHAVU_009G145100g [Phaseolus vulgaris]
          Length = 600

 Score =  847 bits (2187), Expect = 0.0
 Identities = 417/551 (75%), Positives = 468/551 (84%), Gaps = 2/551 (0%)
 Frame = +1

Query: 283  YRNNGRTQTLVVSKLETVSSNGRLPKLAKTTHGDLXXXXXXXXXXXXXXX--FEDFETNN 456
            +R    ++   VSK ET   NGRL ++ +T +GDL                 FE+F +N 
Sbjct: 50   FRKRSESRVFAVSKSETSGLNGRLQQIVRTPNGDLNGIAMESSGNGVNCSRNFEEFASNI 109

Query: 457  HLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVLEILEDSG 636
            HLR+LVRNGELEEG KFL  M+Y GDIPD+I CTSLIRGFCK GKTKKATRV+EILE+SG
Sbjct: 110  HLRKLVRNGELEEGLKFLERMIYQGDIPDVIACTSLIRGFCKGGKTKKATRVMEILENSG 169

Query: 637  AVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKLKQAMEVL 816
            AVPDVITYNVLISGYCKSG+ID ALQVL+RMSVAPDVVTYNTILRSLC SGKLK+AMEVL
Sbjct: 170  AVPDVITYNVLISGYCKSGDIDRALQVLERMSVAPDVVTYNTILRSLCSSGKLKEAMEVL 229

Query: 817  DHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVLINGICKE 996
            D QLQRECYPDVITYTILIEATC ESGVGQAMKLLDEMR KGCKPDVVTYNVLINGICKE
Sbjct: 230  DRQLQRECYPDVITYTILIEATCNESGVGQAMKLLDEMRNKGCKPDVVTYNVLINGICKE 289

Query: 997  GRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGCSPSVVTF 1176
            GRLDEAIKFLN+M SYGCQPNVITHNIILRSMCSTGRWMDAE+LLADML KGCSPSVVTF
Sbjct: 290  GRLDEAIKFLNSMPSYGCQPNVITHNIILRSMCSTGRWMDAERLLADMLRKGCSPSVVTF 349

Query: 1177 NILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQYLDVMVS 1356
            NILINFLCRK LLGRAID+LEKMP+HGC+PNSLSYNPLLHGFC+EKKMDRAI+YL++MVS
Sbjct: 350  NILINFLCRKRLLGRAIDVLEKMPKHGCVPNSLSYNPLLHGFCQEKKMDRAIEYLEIMVS 409

Query: 1357 RGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGLSKMGDTER 1536
            RGCYPDIVTYNTLLTALCKDGKV+ A+E+L+QL SKGC+PVL+TYNTVIDGL+K+G TE 
Sbjct: 410  RGCYPDIVTYNTLLTALCKDGKVDAAIEILNQLSSKGCSPVLVTYNTVIDGLAKVGKTES 469

Query: 1537 AIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVKPNAITYNSIML 1716
            A+ELLEEM  KGL+PD+ITYSSL+ GL REGKV++AI+ F ++EGL +KPNAITYNSIM 
Sbjct: 470  AVELLEEMRRKGLKPDIITYSSLLRGLGREGKVDKAIKIFRDMEGLSIKPNAITYNSIMF 529

Query: 1717 GLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXXXXXXXXXXXXXCYRGVVQ 1896
            GLCKA+QT RAID LA MV +GC+PTE TYTILIEG+A               C RG V+
Sbjct: 530  GLCKAQQTSRAIDFLAYMVEQGCRPTEVTYTILIEGIADEGLAEEALELLNVLCSRGFVK 589

Query: 1897 KSSAQHVTIKM 1929
            KSSA+ V +KM
Sbjct: 590  KSSAEQVAVKM 600


>ref|XP_004170776.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Cucumis sativus]
          Length = 665

 Score =  843 bits (2179), Expect = 0.0
 Identities = 421/539 (78%), Positives = 465/539 (86%), Gaps = 1/539 (0%)
 Frame = +1

Query: 316  VSKLETVSSNGRLPKLAKTTHGDLXXXXXXXXXXXXXXXF-EDFETNNHLRRLVRNGELE 492
            V +++T SSNGRL    K  H  L                 E+ E NNHLRRLVRNGELE
Sbjct: 68   VPRVDTFSSNGRLSHGEKNLHTHLNGSSSSSSSYSNHSQSSEEVENNNHLRRLVRNGELE 127

Query: 493  EGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVLEILEDSGAVPDVITYNVLI 672
            EGFKFL +MV  GDIPDII CTSLIRG CK GKT KATRV+EILEDSGAVPDVITYNVLI
Sbjct: 128  EGFKFLEDMVCRGDIPDIIACTSLIRGLCKTGKTWKATRVMEILEDSGAVPDVITYNVLI 187

Query: 673  SGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKLKQAMEVLDHQLQRECYPDV 852
            SGYCK+GEI +ALQ+LDRMSV+PDVVTYNTILR+LCDSGKLK+AMEVLD Q+QRECYPDV
Sbjct: 188  SGYCKTGEIGSALQLLDRMSVSPDVVTYNTILRTLCDSGKLKEAMEVLDRQMQRECYPDV 247

Query: 853  ITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVLINGICKEGRLDEAIKFLNN 1032
            ITYTILIEATCKESGVGQAMKLLDEMR KGCKPDVVTYNVLINGICKEGRLDEAI+FLN+
Sbjct: 248  ITYTILIEATCKESGVGQAMKLLDEMRDKGCKPDVVTYNVLINGICKEGRLDEAIRFLNH 307

Query: 1033 MSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGCSPSVVTFNILINFLCRKGL 1212
            M SYGCQPNVITHNIILRSMCSTGRWMDAEK LA+M+ KGCSPSVVTFNILINFLCRKGL
Sbjct: 308  MPSYGCQPNVITHNIILRSMCSTGRWMDAEKFLAEMIRKGCSPSVVTFNILINFLCRKGL 367

Query: 1213 LGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQYLDVMVSRGCYPDIVTYNT 1392
            +GRAID+LEKMPQHGC PNSLSYNPLLH  CK+KKM+RAI+YLD+MVSRGCYPDIVTYNT
Sbjct: 368  IGRAIDVLEKMPQHGCTPNSLSYNPLLHALCKDKKMERAIEYLDIMVSRGCYPDIVTYNT 427

Query: 1393 LLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGLSKMGDTERAIELLEEMLAKG 1572
            LLTALCKDGKV+VAVE+L+QLGSKGC+PVLITYNTVIDGLSK+G T+ AI+LL+EM  KG
Sbjct: 428  LLTALCKDGKVDVAVEILNQLGSKGCSPVLITYNTVIDGLSKVGKTDDAIKLLDEMKGKG 487

Query: 1573 LQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVKPNAITYNSIMLGLCKARQTDRAI 1752
            L+PD+ITYS+LV GLSREGKV+EAI FFH+LE +GVKPNAITYNSIMLGLCKARQT RAI
Sbjct: 488  LKPDIITYSTLVGGLSREGKVDEAIAFFHDLEEMGVKPNAITYNSIMLGLCKARQTVRAI 547

Query: 1753 DLLASMVSKGCKPTEATYTILIEGLAYXXXXXXXXXXXXXXCYRGVVQKSSAQHVTIKM 1929
            D LA MV++GCKPTE +Y ILIEGLAY              C RGVV+KSSA+ V +K+
Sbjct: 548  DFLAYMVARGCKPTETSYMILIEGLAYEGLAKEALELLNELCSRGVVKKSSAEQVVVKI 606


>ref|XP_003523769.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Glycine max]
          Length = 602

 Score =  843 bits (2179), Expect = 0.0
 Identities = 415/540 (76%), Positives = 465/540 (86%), Gaps = 2/540 (0%)
 Frame = +1

Query: 316  VSKLETVSSNGRLPKLAKTTHGDLXXXXXXXXXXXXXXX--FEDFETNNHLRRLVRNGEL 489
            VSK E    NGRL ++  T +GDL                 FE+F +N HLR+LVRNGEL
Sbjct: 63   VSKSEASGLNGRLQQIVSTPNGDLNVIGMESSPIGVNGSRSFEEFASNIHLRKLVRNGEL 122

Query: 490  EEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVLEILEDSGAVPDVITYNVL 669
            EEG KFL  M+Y GDIPD+I CTSLIRGFC+ GKTKKATR++EILE+SGAVPDVITYNVL
Sbjct: 123  EEGLKFLERMIYQGDIPDVIACTSLIRGFCRSGKTKKATRIMEILENSGAVPDVITYNVL 182

Query: 670  ISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKLKQAMEVLDHQLQRECYPD 849
            I GYCKSGEID AL+VL+RMSVAPDVVTYNTILRSLCDSGKLK+AMEVLD QLQRECYPD
Sbjct: 183  IGGYCKSGEIDKALEVLERMSVAPDVVTYNTILRSLCDSGKLKEAMEVLDRQLQRECYPD 242

Query: 850  VITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVLINGICKEGRLDEAIKFLN 1029
            VITYTILIEATC +SGVGQAMKLLDEMR KGCKPDVVTYNVLINGICKEGRLDEAIKFLN
Sbjct: 243  VITYTILIEATCNDSGVGQAMKLLDEMRKKGCKPDVVTYNVLINGICKEGRLDEAIKFLN 302

Query: 1030 NMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGCSPSVVTFNILINFLCRKG 1209
            NM SYGC+PNVITHNIILRSMCSTGRWMDAE+LL+DML KGCSPSVVTFNILINFLCRK 
Sbjct: 303  NMPSYGCKPNVITHNIILRSMCSTGRWMDAERLLSDMLRKGCSPSVVTFNILINFLCRKR 362

Query: 1210 LLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQYLDVMVSRGCYPDIVTYN 1389
            LLGRAID+LEKMP+HGC+PNSLSYNPLLHGFC+EKKMDRAI+YL++MVSRGCYPDIVTYN
Sbjct: 363  LLGRAIDVLEKMPKHGCVPNSLSYNPLLHGFCQEKKMDRAIEYLEIMVSRGCYPDIVTYN 422

Query: 1390 TLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGLSKMGDTERAIELLEEMLAK 1569
            TLLTALCKDGKV+ AVE+L+QL SKGC+PVLITYNTVIDGL+K+G TE A+ELLEEM  K
Sbjct: 423  TLLTALCKDGKVDAAVEILNQLSSKGCSPVLITYNTVIDGLTKVGKTEYAVELLEEMRRK 482

Query: 1570 GLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVKPNAITYNSIMLGLCKARQTDRA 1749
            GL+PD+ITYS+L+ GL REGKV+EAI+ FH++EGL +KP+A+TYN+IMLGLCKA+QT RA
Sbjct: 483  GLKPDIITYSTLLRGLGREGKVDEAIKIFHDMEGLSIKPSAVTYNAIMLGLCKAQQTSRA 542

Query: 1750 IDLLASMVSKGCKPTEATYTILIEGLAYXXXXXXXXXXXXXXCYRGVVQKSSAQHVTIKM 1929
            ID LA MV KGCKPTEATYTILIEG+A               C RG V+KSSA+ V +KM
Sbjct: 543  IDFLAYMVEKGCKPTEATYTILIEGIADEGLAEEALELLNELCSRGFVKKSSAEQVVVKM 602


>ref|XP_003527866.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Glycine max]
          Length = 603

 Score =  835 bits (2156), Expect = 0.0
 Identities = 412/541 (76%), Positives = 461/541 (85%), Gaps = 3/541 (0%)
 Frame = +1

Query: 316  VSKLETVSSNGRLPKLAKTTHGDLXXXXXXXXXXXXXXX---FEDFETNNHLRRLVRNGE 486
            VSK E    NGRL ++  T +GDL                  FE+F +N HLR+LVRNGE
Sbjct: 63   VSKSEASGMNGRLQQIVSTPNGDLNGIGMESSSPNGVNGSRSFEEFASNIHLRKLVRNGE 122

Query: 487  LEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVLEILEDSGAVPDVITYNV 666
            LEEG KFL  M+Y GDIPD+I CTSLIRGFC+ GKT+KATR++EILE+SGAVPDVITYNV
Sbjct: 123  LEEGLKFLERMIYQGDIPDVIACTSLIRGFCRSGKTRKATRIMEILENSGAVPDVITYNV 182

Query: 667  LISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKLKQAMEVLDHQLQRECYP 846
            LI GYCKSGEID ALQVL+RMSVAPDVVTYNTILRSLCDSGKLK+AMEVLD Q+QRECYP
Sbjct: 183  LIGGYCKSGEIDKALQVLERMSVAPDVVTYNTILRSLCDSGKLKEAMEVLDRQMQRECYP 242

Query: 847  DVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVLINGICKEGRLDEAIKFL 1026
            DVITYTILIEATC +SGVGQAMKLLDEMR KGCKPDVVTYNVLINGICKEGRLDEAIKFL
Sbjct: 243  DVITYTILIEATCNDSGVGQAMKLLDEMRKKGCKPDVVTYNVLINGICKEGRLDEAIKFL 302

Query: 1027 NNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGCSPSVVTFNILINFLCRK 1206
            NNM  YGCQPNVITHNIILRSMCSTGRWMDAE+LLADML KGCSPSVVTFNILINFLCRK
Sbjct: 303  NNMPLYGCQPNVITHNIILRSMCSTGRWMDAERLLADMLRKGCSPSVVTFNILINFLCRK 362

Query: 1207 GLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQYLDVMVSRGCYPDIVTY 1386
             LLGRAID+LEKMP+HGC+PNSLSYNPLLHGFC+EKKMDRAI+YL++MVSRGCYPDIVTY
Sbjct: 363  RLLGRAIDVLEKMPKHGCMPNSLSYNPLLHGFCQEKKMDRAIEYLEIMVSRGCYPDIVTY 422

Query: 1387 NTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGLSKMGDTERAIELLEEMLA 1566
            NTLLTALCKDGK + AVE+L+QL SKGC+PVLITYNTVIDGL+K+G TE A ELLEEM  
Sbjct: 423  NTLLTALCKDGKADAAVEILNQLSSKGCSPVLITYNTVIDGLTKVGKTEYAAELLEEMRR 482

Query: 1567 KGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVKPNAITYNSIMLGLCKARQTDR 1746
            KGL+PD+ITYS+L+ GL  EGKV+EAI+ FH++EGL +KP+A+TYN+IMLGLCKA+QT R
Sbjct: 483  KGLKPDIITYSTLLRGLGCEGKVDEAIKIFHDMEGLSIKPSAVTYNAIMLGLCKAQQTSR 542

Query: 1747 AIDLLASMVSKGCKPTEATYTILIEGLAYXXXXXXXXXXXXXXCYRGVVQKSSAQHVTIK 1926
            AID LA MV KGCKPT+ATYTILIEG+A               C RG V+KSSA+ V +K
Sbjct: 543  AIDFLAYMVEKGCKPTKATYTILIEGIADEGLAEEALELLNELCSRGFVKKSSAEQVAVK 602

Query: 1927 M 1929
            M
Sbjct: 603  M 603


>ref|NP_172461.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|122215618|sp|Q3EDF8.1|PPR28_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g09900 gi|332190391|gb|AEE28512.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 598

 Score =  823 bits (2127), Expect = 0.0
 Identities = 410/554 (74%), Positives = 463/554 (83%), Gaps = 2/554 (0%)
 Frame = +1

Query: 274  PYGYRNNGRTQTL-VVSKLETVSSNGRLPKLAKTTHG-DLXXXXXXXXXXXXXXXFEDFE 447
            P G R   R   +   SK+E+   NGR  K    + G                   ED E
Sbjct: 45   PLGSRKRNRLVLVSAASKVESSGLNGRAQKFETLSSGYSNSNGNGHYSSVNSSFALEDVE 104

Query: 448  TNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVLEILE 627
            +NNHLR++VR GELEEGFKFL NMVYHG++PDIIPCT+LIRGFC++GKT+KA ++LEILE
Sbjct: 105  SNNHLRQMVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRLGKTRKAAKILEILE 164

Query: 628  DSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKLKQAM 807
             SGAVPDVITYNV+ISGYCK+GEI+NAL VLDRMSV+PDVVTYNTILRSLCDSGKLKQAM
Sbjct: 165  GSGAVPDVITYNVMISGYCKAGEINNALSVLDRMSVSPDVVTYNTILRSLCDSGKLKQAM 224

Query: 808  EVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVLINGI 987
            EVLD  LQR+CYPDVITYTILIEATC++SGVG AMKLLDEMR +GC PDVVTYNVL+NGI
Sbjct: 225  EVLDRMLQRDCYPDVITYTILIEATCRDSGVGHAMKLLDEMRDRGCTPDVVTYNVLVNGI 284

Query: 988  CKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGCSPSV 1167
            CKEGRLDEAIKFLN+M S GCQPNVITHNIILRSMCSTGRWMDAEKLLADML KG SPSV
Sbjct: 285  CKEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPSV 344

Query: 1168 VTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQYLDV 1347
            VTFNILINFLCRKGLLGRAIDILEKMPQHGC PNSLSYNPLLHGFCKEKKMDRAI+YL+ 
Sbjct: 345  VTFNILINFLCRKGLLGRAIDILEKMPQHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLER 404

Query: 1348 MVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGLSKMGD 1527
            MVSRGCYPDIVTYNT+LTALCKDGKVE AVE+L+QL SKGC+PVLITYNTVIDGL+K G 
Sbjct: 405  MVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAGK 464

Query: 1528 TERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVKPNAITYNS 1707
            T +AI+LL+EM AK L+PD ITYSSLV GLSREGKV+EAI+FFHE E +G++PNA+T+NS
Sbjct: 465  TGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFNS 524

Query: 1708 IMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXXXXXXXXXXXXXCYRG 1887
            IMLGLCK+RQTDRAID L  M+++GCKP E +YTILIEGLAY              C +G
Sbjct: 525  IMLGLCKSRQTDRAIDFLVFMINRGCKPNETSYTILIEGLAYEGMAKEALELLNELCNKG 584

Query: 1888 VVQKSSAQHVTIKM 1929
            +++KSSA+ V  KM
Sbjct: 585  LMKKSSAEQVAGKM 598


>ref|XP_002889775.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297335617|gb|EFH66034.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 598

 Score =  821 bits (2121), Expect = 0.0
 Identities = 399/498 (80%), Positives = 448/498 (89%)
 Frame = +1

Query: 436  EDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVL 615
            ED E+NNHLR+LVR GELEEGFKFL NMVYHG++PDIIPCT+LIRGFC++GKT+KA ++L
Sbjct: 101  EDVESNNHLRQLVRTGELEEGFKFLENMVYHGNVPDIIPCTTLIRGFCRMGKTRKAAKIL 160

Query: 616  EILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKL 795
            E+LE SGAVPDVITYNV+ISGYCK+GEI+NAL VLDRMSV+PDVVTYNTILRSLCDSGKL
Sbjct: 161  EVLEGSGAVPDVITYNVMISGYCKAGEINNALSVLDRMSVSPDVVTYNTILRSLCDSGKL 220

Query: 796  KQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVL 975
            KQAMEVLD  LQR+CYPDVITYTILIEATC++SGVGQAMKLLDEMR +GC PDVVTYNVL
Sbjct: 221  KQAMEVLDRMLQRDCYPDVITYTILIEATCRDSGVGQAMKLLDEMRDRGCTPDVVTYNVL 280

Query: 976  INGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGC 1155
            +NGICKEGRLDEAIKFLN+M S GCQPNVITHNIILRSMCSTGRWMDAEKLLADML KG 
Sbjct: 281  VNGICKEGRLDEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGF 340

Query: 1156 SPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQ 1335
            SPSVVTFNILINFLCRKGLLGRAIDILEKMP+HGC PNSLSYNPLLHGFCKEKKMDRAI+
Sbjct: 341  SPSVVTFNILINFLCRKGLLGRAIDILEKMPKHGCQPNSLSYNPLLHGFCKEKKMDRAIE 400

Query: 1336 YLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGLS 1515
            YL+ MVSRGCYPDIVTYNT+LTALCKDGKVE AVE+L+QL SKGC+PVLITYNTVIDGL+
Sbjct: 401  YLERMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLA 460

Query: 1516 KMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVKPNAI 1695
            K G T +AI+LL+EM AK L+PD ITYSSLV GLSREGKV+EAI+FFHE E +GV+PNA+
Sbjct: 461  KAGKTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGVRPNAV 520

Query: 1696 TYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXXXXXXXXXXXXX 1875
            T+NSIMLGLCK RQTDRAID L  M+++GCKPTE +YTILIEGLAY              
Sbjct: 521  TFNSIMLGLCKTRQTDRAIDFLVYMINRGCKPTETSYTILIEGLAYEGMAKEALELLNEL 580

Query: 1876 CYRGVVQKSSAQHVTIKM 1929
            C +G++++SSA+ V  KM
Sbjct: 581  CNKGLMKRSSAEQVAGKM 598


>ref|XP_006306156.1| hypothetical protein CARUB_v10011677mg, partial [Capsella rubella]
            gi|482574867|gb|EOA39054.1| hypothetical protein
            CARUB_v10011677mg, partial [Capsella rubella]
          Length = 609

 Score =  785 bits (2027), Expect = 0.0
 Identities = 396/555 (71%), Positives = 451/555 (81%), Gaps = 3/555 (0%)
 Frame = +1

Query: 274  PYGYRNNGRTQTL-VVSKLETVSSNGRLPKLAKTTHGDLXXXXXXXXXXXXXXXF--EDF 444
            P G R   R  T+   SK+E+   NGR  K   ++                   F  ED 
Sbjct: 46   PLGSRKRNRLVTVFAASKVESSGLNGRAQKFETSSASGHTNSNGNGHYSTANSSFALEDV 105

Query: 445  ETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVLEIL 624
            E+NNHLR+LVR GELEEGF+FL NMVYHG++PDIIPCT+LIRGFC++GKT+KA ++LEIL
Sbjct: 106  ESNNHLRQLVRTGELEEGFRFLENMVYHGNVPDIIPCTTLIRGFCRMGKTRKAAKILEIL 165

Query: 625  EDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKLKQA 804
            E SGAVPDVITYNV+ISGYCK+GEI NAL VLDRMSV+PDVVTYNTILRSLCDSGKLKQA
Sbjct: 166  EGSGAVPDVITYNVMISGYCKAGEISNALSVLDRMSVSPDVVTYNTILRSLCDSGKLKQA 225

Query: 805  MEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVLING 984
            MEVLD  LQR+             +TC++SGVGQAMKLLDEMR +GC PDVVTYNVL+NG
Sbjct: 226  MEVLDRMLQRD-------------STCRDSGVGQAMKLLDEMRDRGCTPDVVTYNVLVNG 272

Query: 985  ICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGCSPS 1164
            ICKEGRL+EAIKFLN+M S GCQPNVITHNIILRSMCSTGRWMDAEKLLADML KG SPS
Sbjct: 273  ICKEGRLNEAIKFLNDMPSSGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLRKGFSPS 332

Query: 1165 VVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQYLD 1344
            VVTFNILINFLCRKGLLGRAIDILEKMP HGC PNSLSYNPLLHGFCKEKKMDRAI+YL+
Sbjct: 333  VVTFNILINFLCRKGLLGRAIDILEKMPNHGCQPNSLSYNPLLHGFCKEKKMDRAIEYLE 392

Query: 1345 VMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGLSKMG 1524
             MVSRGCYPDIVTYNT+LTALCKDGKVE AVE+L+QL SKGC+PVLITYNTVIDGL+K G
Sbjct: 393  RMVSRGCYPDIVTYNTMLTALCKDGKVEDAVEILNQLSSKGCSPVLITYNTVIDGLAKAG 452

Query: 1525 DTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVKPNAITYN 1704
             T +AI+LL+EM AK L+PD ITYSSLV GLSREGKV+EAI+FFHE E +G++PNA+T+N
Sbjct: 453  KTGKAIKLLDEMRAKDLKPDTITYSSLVGGLSREGKVDEAIKFFHEFERMGIRPNAVTFN 512

Query: 1705 SIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXXXXXXXXXXXXXCYR 1884
            SIMLGLCK RQTDRAID L  M+++GCKPTE +YTILIEGLAY              C +
Sbjct: 513  SIMLGLCKTRQTDRAIDFLVYMINRGCKPTETSYTILIEGLAYEGMAKEALELLNELCNK 572

Query: 1885 GVVQKSSAQHVTIKM 1929
            G+++KSSA+ V  K+
Sbjct: 573  GLMKKSSAEQVAGKI 587


>ref|XP_004501057.1| PREDICTED: pentatricopeptide repeat-containing protein At1g09900-like
            [Cicer arietinum]
          Length = 591

 Score =  782 bits (2020), Expect = 0.0
 Identities = 383/494 (77%), Positives = 432/494 (87%)
 Frame = +1

Query: 436  EDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVL 615
            E+ + N++L +LVR G+LE+GF+FL  M Y GD+PD+I CT+LIR FCK GKTKKATRVL
Sbjct: 89   EEIDNNSYLVKLVRIGKLEQGFRFLERMSYQGDMPDVIACTNLIRQFCKTGKTKKATRVL 148

Query: 616  EILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGKL 795
            +ILEDSGAVPDVITYNVLISGYCKSGE++ ALQVL+RMSV+PDVVTYNTILRSLCDSGKL
Sbjct: 149  QILEDSGAVPDVITYNVLISGYCKSGEVEEALQVLERMSVSPDVVTYNTILRSLCDSGKL 208

Query: 796  KQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVL 975
            KQAMEVLD QL+R CYPDVITYTILIEA CKESGVG+AMKL D MR KGCKPDV T+NVL
Sbjct: 209  KQAMEVLDRQLERVCYPDVITYTILIEAICKESGVGEAMKLFDAMRIKGCKPDVFTFNVL 268

Query: 976  INGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGC 1155
            ING CKEGRLD+AIKFLN+MSSYGC+PNVITHNIILRS+C TGRW DAE LL+DML KGC
Sbjct: 269  INGFCKEGRLDKAIKFLNDMSSYGCEPNVITHNIILRSLCGTGRWRDAESLLSDMLRKGC 328

Query: 1156 SPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQ 1335
            SPSVVTFNILINFLCRKGLLGRAIDILEKM  HGC PNSLSYNPLLHGFC+EKKMDRAI+
Sbjct: 329  SPSVVTFNILINFLCRKGLLGRAIDILEKMKNHGCTPNSLSYNPLLHGFCQEKKMDRAIE 388

Query: 1336 YLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGLS 1515
            YL+VMVSRGCYPDIVTYNTLLTALCKDGKV+VA+ELL+QL SKGC+PV ITYNTVI GLS
Sbjct: 389  YLEVMVSRGCYPDIVTYNTLLTALCKDGKVDVALELLNQLSSKGCSPVAITYNTVIGGLS 448

Query: 1516 KMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVKPNAI 1695
            K+G TERA++LL+EM  KGL+PDV+TYSSL++G  REGKV+ AI+ FHELE LG++ NA+
Sbjct: 449  KVGATERAMKLLDEMCRKGLKPDVVTYSSLIAGFIREGKVDVAIKIFHELERLGIRANAV 508

Query: 1696 TYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLAYXXXXXXXXXXXXXX 1875
            TYNSIM GLCKAR+T  AIDLLA M++KGCKPTEATYTILIEG+AY              
Sbjct: 509  TYNSIMSGLCKARRTSHAIDLLARMIAKGCKPTEATYTILIEGIAYEGLAEEALGLLNEL 568

Query: 1876 CYRGVVQKSSAQHV 1917
              RG V+KSSA  V
Sbjct: 569  SSRGFVKKSSADKV 582


>gb|EPS61251.1| hypothetical protein M569_13548, partial [Genlisea aurea]
          Length = 488

 Score =  699 bits (1805), Expect = 0.0
 Identities = 340/469 (72%), Positives = 408/469 (86%), Gaps = 9/469 (1%)
 Frame = +1

Query: 451  NNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVLEILED 630
            N+ LRRLVR+G+LE+  + +  MV   +IPDIIPCTSLIRGFC+ GKT KAT V++ILE+
Sbjct: 1    NSSLRRLVRHGQLEKALRHIQGMVSQREIPDIIPCTSLIRGFCRAGKTNKATVVMQILEE 60

Query: 631  SGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSG------- 789
            SGA PD+ITYNVLISG+CK GE+ NALQ+L+ M+VAPDVVTYNTILR+LC+ G       
Sbjct: 61   SGAAPDLITYNVLISGFCKLGEVGNALQLLESMTVAPDVVTYNTILRALCNGGGGGGGGR 120

Query: 790  -KLKQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTY 966
             +L +AMEV+D  L +EC+PDVITYTILIEAT KE+GV QAM+LLD+M+ +GCKPD+VTY
Sbjct: 121  GRLSEAMEVIDRMLLKECHPDVITYTILIEATLKENGVDQAMELLDDMKRRGCKPDIVTY 180

Query: 967  NVLINGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLG 1146
            NVLI+GICKEG+LDEAIKFL+ MSSYGC+PNVITHNIILRSMCSTGRWMDAEKLL++ML 
Sbjct: 181  NVLIDGICKEGKLDEAIKFLDTMSSYGCRPNVITHNIILRSMCSTGRWMDAEKLLSEMLV 240

Query: 1147 KGCSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDR 1326
            KGCSPSVVTFNILINFLCRKGLL RA+D+LE+MP++GC PNSLSYN LLH FCKEKKMD 
Sbjct: 241  KGCSPSVVTFNILINFLCRKGLLLRAVDVLERMPENGCTPNSLSYNSLLHTFCKEKKMDS 300

Query: 1327 AIQYLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKG-CTPVLITYNTVI 1503
            A++YL++MVSRGCYPDIVTYNT+LTALC+DGKV+ AV +L++L SKG C+PVLITYNTVI
Sbjct: 301  ALEYLELMVSRGCYPDIVTYNTMLTALCRDGKVDAAVAILNRLRSKGRCSPVLITYNTVI 360

Query: 1504 DGLSKMGDTERAIELLEEMLAKGLQPDVITYSSLVSGLSREGKVNEAIEFFHELEGLGVK 1683
            DGLSKMG T+ A+ELL EM  +GL+PDVIT SS++ GLS+EGKV E++EFF  LEG G++
Sbjct: 361  DGLSKMGRTDEAMELLVEMRGRGLRPDVITCSSIMMGLSKEGKVEESVEFFESLEGSGIR 420

Query: 1684 PNAITYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGLA 1830
            PNA  YNS+MLG+CKAR+TDRAID L  MV  GCKPTE+TYTILIEGL+
Sbjct: 421  PNANIYNSMMLGMCKARRTDRAIDFLDRMVDGGCKPTESTYTILIEGLS 469



 Score =  234 bits (596), Expect = 2e-58
 Identities = 120/337 (35%), Positives = 200/337 (59%), Gaps = 4/337 (1%)
 Frame = +1

Query: 469  LVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRVLEILEDSGAVPD 648
            L  NG +++  + L +M   G  PDI+    LI G CK GK  +A + L+ +   G  P+
Sbjct: 153  LKENG-VDQAMELLDDMKRRGCKPDIVTYNVLIDGICKEGKLDEAIKFLDTMSSYGCRPN 211

Query: 649  VITYNVLISGYCKSGEIDNALQVLDRMSV---APDVVTYNTILRSLCDSGKLKQAMEVLD 819
            VIT+N+++   C +G   +A ++L  M V   +P VVT+N ++  LC  G L +A++VL+
Sbjct: 212  VITHNIILRSMCSTGRWMDAEKLLSEMLVKGCSPSVVTFNILINFLCRKGLLLRAVDVLE 271

Query: 820  HQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNVLINGICKEG 999
               +  C P+ ++Y  L+   CKE  +  A++ L+ M ++GC PD+VTYN ++  +C++G
Sbjct: 272  RMPENGCTPNSLSYNSLLHTFCKEKKMDSALEYLELMVSRGCYPDIVTYNTMLTALCRDG 331

Query: 1000 RLDEAIKFLNNMSSYG-CQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKGCSPSVVTF 1176
            ++D A+  LN + S G C P +IT+N ++  +   GR  +A +LL +M G+G  P V+T 
Sbjct: 332  KVDAAVAILNRLRSKGRCSPVLITYNTVIDGLSKMGRTDEAMELLVEMRGRGLRPDVITC 391

Query: 1177 NILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAIQYLDVMVS 1356
            + ++  L ++G +  +++  E +   G  PN+  YN ++ G CK ++ DRAI +LD MV 
Sbjct: 392  SSIMMGLSKEGKVEESVEFFESLEGSGIRPNANIYNSMMLGMCKARRTDRAIDFLDRMVD 451

Query: 1357 RGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKG 1467
             GC P   TY  L+  L K+G  E A+ELL++L S+G
Sbjct: 452  GGCKPTESTYTILIEGLSKEGLSEEALELLNELRSRG 488


>ref|XP_006856585.1| hypothetical protein AMTR_s00046p00202770 [Amborella trichopoda]
            gi|548860466|gb|ERN18052.1| hypothetical protein
            AMTR_s00046p00202770 [Amborella trichopoda]
          Length = 585

 Score =  680 bits (1754), Expect = 0.0
 Identities = 341/514 (66%), Positives = 397/514 (77%), Gaps = 15/514 (2%)
 Frame = +1

Query: 433  FEDFETNNHLRRLVRNGELEEGFKFLVNMVYHGDIPDIIPCTSLIRGFCKVGKTKKATRV 612
            F+DFE+N+ L+R VRNGELEE   FL NM  +G+IPDIIPCTSLIRGFCK+GKTKK TRV
Sbjct: 107  FDDFESNDLLKRHVRNGELEEALVFLENMARNGEIPDIIPCTSLIRGFCKIGKTKKGTRV 166

Query: 613  LEILEDSGAVPDVITYNVLISGYCKSGEIDNALQVLDRMSVAPDVVTYNTILRSLCDSGK 792
            +EI+ +SGAVPDVITYNVLISGYCKSGE+DNAL VL+RMS +PDVVTYNTILRSLCD GK
Sbjct: 167  MEIIHESGAVPDVITYNVLISGYCKSGEVDNALLVLERMSCSPDVVTYNTILRSLCDEGK 226

Query: 793  LKQAMEVLDHQLQRECYPDVITYTILIEATCKESGVGQAMKLLDEMRTKGCKPDVVTYNV 972
            LKQAMEVLD  + R C+PDVITYTILIEATCKESGVGQAMKLLDEMR+KGCKPDVVTYNV
Sbjct: 227  LKQAMEVLDRMMNRGCFPDVITYTILIEATCKESGVGQAMKLLDEMRSKGCKPDVVTYNV 286

Query: 973  LINGICKEGRLDEAIKFLNNMSSYGCQPNVITHNIILRSMCSTGRWMDAEKLLADMLGKG 1152
            LINGICKEG+L+EAIKFLN+M SYGC+PNVITHNIILRSMCSTGRWMDAEKLL++M+  G
Sbjct: 287  LINGICKEGKLNEAIKFLNSMPSYGCRPNVITHNIILRSMCSTGRWMDAEKLLSEMIENG 346

Query: 1153 CSPSVVTFNILINFLCRKGLLGRAIDILEKMPQHGCIPNSLSYNPLLHGFCKEKKMDRAI 1332
            CSPSVVTFNILINFLCRKGL+ RAID+LE+MP+HGC PNSLSYNP+LHGFCKEK MDR I
Sbjct: 347  CSPSVVTFNILINFLCRKGLMRRAIDVLERMPEHGCTPNSLSYNPILHGFCKEKNMDRVI 406

Query: 1333 QYLDVMVSRGCYPDIVTYNTLLTALCKDGKVEVAVELLHQLGSKGCTPVLITYNTVIDGL 1512
            +YL+VMV RGC+PDIVTYNTLLTALCKDGKV+ A+E+L QL SKGC+PVLITYNTVIDGL
Sbjct: 407  EYLEVMVLRGCFPDIVTYNTLLTALCKDGKVDAALEILRQLRSKGCSPVLITYNTVIDGL 466

Query: 1513 SKMGDTERAIEL---------------LEEMLAKGLQPDVITYSSLVSGLSREGKVNEAI 1647
            SKMG TE AIEL                 EM  KG+ P+ ITY++L+ GL +  +  +AI
Sbjct: 467  SKMGKTEEAIELQMRCREGKVDKAIEFFFEMEGKGIGPNAITYNALILGLCKAQRTGQAI 526

Query: 1648 EFFHELEGLGVKPNAITYNSIMLGLCKARQTDRAIDLLASMVSKGCKPTEATYTILIEGL 1827
            +F   +   G KP   TY  ++ G+    +   A++LL  +                   
Sbjct: 527  DFLAHMVSKGCKPTESTYTILIEGVANEGRPKEALNLLNEL------------------- 567

Query: 1828 AYXXXXXXXXXXXXXXCYRGVVQKSSAQHVTIKM 1929
                            C RGVV++SSAQ+V + M
Sbjct: 568  ----------------CERGVVKRSSAQNVAVNM 585


Top