BLASTX nr result

ID: Catharanthus23_contig00013890 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00013890
         (2032 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006338488.1| PREDICTED: putative pentatricopeptide repeat...   636   e-179
emb|CBI35029.3| unnamed protein product [Vitis vinifera]              613   e-173
ref|XP_002276684.1| PREDICTED: putative pentatricopeptide repeat...   613   e-173
ref|XP_004233685.1| PREDICTED: putative pentatricopeptide repeat...   608   e-171
gb|EMJ17608.1| hypothetical protein PRUPE_ppa022709mg, partial [...   600   e-169
ref|XP_004304947.1| PREDICTED: pentatricopeptide repeat-containi...   569   e-159
ref|NP_189507.2| pentatricopeptide repeat-containing protein [Ar...   526   e-146
ref|NP_189505.2| putative pentatricopeptide repeat-containing pr...   521   e-145
ref|XP_006290938.1| hypothetical protein CARUB_v10017051mg [Caps...   521   e-145
gb|EOY03349.1| Tetratricopeptide repeat (TPR)-like superfamily p...   516   e-143
ref|XP_006395353.1| hypothetical protein EUTSA_v10005682mg [Eutr...   513   e-142
gb|EXB36666.1| hypothetical protein L484_002079 [Morus notabilis]     509   e-141
ref|XP_002877120.1| pentatricopeptide repeat-containing protein ...   447   e-123
ref|XP_002517451.1| pentatricopeptide repeat-containing protein,...   410   e-111
ref|NP_001173276.1| Os03g0158900 [Oryza sativa Japonica Group] g...   365   4e-98
gb|EMT01880.1| hypothetical protein F775_14784 [Aegilops tauschii]    363   2e-97
ref|XP_006650550.1| PREDICTED: pentatricopeptide repeat-containi...   362   4e-97
gb|EMS68422.1| hypothetical protein TRIUR3_23855 [Triticum urartu]    361   6e-97
ref|XP_002439151.1| hypothetical protein SORBIDRAFT_09g001360 [S...   359   2e-96
ref|XP_004981863.1| PREDICTED: putative pentatricopeptide repeat...   358   4e-96

>ref|XP_006338488.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At3g28640-like [Solanum tuberosum]
          Length = 512

 Score =  636 bits (1640), Expect = e-179
 Identities = 307/498 (61%), Positives = 382/498 (76%), Gaps = 2/498 (0%)
 Frame = +2

Query: 149  SDRSIKAWRWCMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLISFCTLSENGDISY 328
            ++ S + W+WCMS+A++C N+GQLK IHAI+I  G+HRN+YAV KL+ FC LS  GD+SY
Sbjct: 13   ANNSFQIWKWCMSMAEKCTNIGQLKAIHAIYITLGLHRNTYAVSKLLDFCALSNTGDLSY 72

Query: 329  GSFLFSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLML--NSRLRPDGYTFPFVFMAC 502
             S +F+Q+  PN+F+YN LIRAYS S QPQ ++NYFNLML  ++   PD +TFPF+ +AC
Sbjct: 73   ASRIFAQVQTPNTFLYNALIRAYSSSSQPQFSLNYFNLMLQTSNAAAPDSFTFPFLIIAC 132

Query: 503  ANGFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQC 682
            ANG L  EG+Q+H+WVIKN    SNAHVQ+AL+RFY   K LDDARKVFDEIT++D IQC
Sbjct: 133  ANGPLEVEGKQIHSWVIKNSFSASNAHVQTALIRFYTNCKALDDARKVFDEITDIDVIQC 192

Query: 683  NILINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKN 862
            N+L++G+++ G+A EA  +F++ML RG+ PDE+C+TTAL ACAQLG L+QGKWIHE+V  
Sbjct: 193  NVLMSGHLQSGLAKEALSIFQDMLGRGVGPDEYCVTTALGACAQLGALEQGKWIHEHVTK 252

Query: 863  RKNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAF 1042
             + L   D F+G+ALVDMYAKCG I+ A EVF  MPKRNK SWA +I G+AVHG    A 
Sbjct: 253  SEWLEY-DVFIGSALVDMYAKCGCINMASEVFESMPKRNKHSWATMIRGFAVHGRPELAI 311

Query: 1043 QCLERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVD 1222
             CLERMQ  DG+ PDG+ +L VLAAC H+GL K+GQ LL+ MESLYG+ PEHEH+SCVVD
Sbjct: 312  SCLERMQVADGLKPDGVVILAVLAACAHSGLQKEGQGLLDEMESLYGVTPEHEHFSCVVD 371

Query: 1223 LLCRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXX 1402
            LLCRAG+L +A++LIRRMPMKP ASVWGALLSGCR+HNNV                    
Sbjct: 372  LLCRAGRLDDALKLIRRMPMKPRASVWGALLSGCRNHNNVNLAELAVKEILLVEDGNEAE 431

Query: 1403 XXXAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSR 1582
               AYVQLSNIYL ARQ DDARRIRR IG++GL+KTPGYSA+EIDG+ +EFISGDVSH+ 
Sbjct: 432  EDSAYVQLSNIYLAARQCDDARRIRRRIGDRGLRKTPGYSAIEIDGMINEFISGDVSHTC 491

Query: 1583 LGEIHAVLYLMSLEVSID 1636
            L +IH VL L  L+  ID
Sbjct: 492  LADIHKVLDLTYLDPEID 509


>emb|CBI35029.3| unnamed protein product [Vitis vinifera]
          Length = 1596

 Score =  613 bits (1581), Expect = e-173
 Identities = 298/491 (60%), Positives = 371/491 (75%), Gaps = 1/491 (0%)
 Frame = +2

Query: 161  IKAWRWCMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLISFCTLSENGDISYGSFL 340
            ++AW+ C+SLAQ C NM Q K IHA+FI +G+H N+YA+ KLISFC LS +G +SY S +
Sbjct: 1    MEAWKRCISLAQSCSNMRQFKAIHALFIVNGLHLNNYAISKLISFCALSNSGSLSYASLI 60

Query: 341  FSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSR-LRPDGYTFPFVFMACANGFL 517
            FSQ+ NPN F YN LIRAYSRS  PQLA++YF LML+   + PD +TFPF+  AC N   
Sbjct: 61   FSQIQNPNLFAYNTLIRAYSRSSTPQLALHYFQLMLDDENVGPDQHTFPFIISACTNSLW 120

Query: 518  WSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNILIN 697
               G+Q+H WV+KNG+  S+ HVQ+ALVRFYAE   + DARK+FDEI N+D +Q N+L+N
Sbjct: 121  MLLGKQIHNWVLKNGVASSDRHVQTALVRFYAECCAMGDARKLFDEIPNLDVVQWNVLLN 180

Query: 698  GYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLR 877
            GYV+ G+A EA + FRNMLV G+EPDEFCLTTAL  CAQLG L QGKWIHEYV  RK L 
Sbjct: 181  GYVRRGLAPEALNAFRNMLVSGVEPDEFCLTTALKGCAQLGALQQGKWIHEYVTKRKWLE 240

Query: 878  VADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLER 1057
             AD F+GTALVDMYAKCG ID + EVF  M KRN FSW+A+IGG+A+HG   +A QCLER
Sbjct: 241  -ADVFIGTALVDMYAKCGCIDRSVEVFEGMTKRNVFSWSAMIGGFALHGHVRKAMQCLER 299

Query: 1058 MQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRA 1237
            MQ EDG+ PDG+ +LGV+ AC HAGL ++GQFLLENME+ YG++P+HEHYSC+VDLLCRA
Sbjct: 300  MQVEDGLRPDGVVLLGVIMACAHAGLQEEGQFLLENMEARYGILPKHEHYSCMVDLLCRA 359

Query: 1238 GQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAY 1417
            GQL EA++LIRRMPMKP A+VWGALLSGCR+HNNV                       AY
Sbjct: 360  GQLDEALKLIRRMPMKPRAAVWGALLSGCRTHNNVDLAELAARELLMVGNGDGTEEDGAY 419

Query: 1418 VQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIH 1597
            VQLSNIYL A++ +DA RIRRMIG+K +K  PG S +E++G  ++F+SGD+SH  L +IH
Sbjct: 420  VQLSNIYLAAQKCEDACRIRRMIGDKRIKTKPGCSLIEVEGEVNQFVSGDISHPCLAQIH 479

Query: 1598 AVLYLMSLEVS 1630
             +L L+SL+ S
Sbjct: 480  EMLDLVSLQHS 490


>ref|XP_002276684.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At3g28640-like [Vitis vinifera]
          Length = 511

 Score =  613 bits (1581), Expect = e-173
 Identities = 298/491 (60%), Positives = 371/491 (75%), Gaps = 1/491 (0%)
 Frame = +2

Query: 161  IKAWRWCMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLISFCTLSENGDISYGSFL 340
            ++AW+ C+SLAQ C NM Q K IHA+FI +G+H N+YA+ KLISFC LS +G +SY S +
Sbjct: 1    MEAWKRCISLAQSCSNMRQFKAIHALFIVNGLHLNNYAISKLISFCALSNSGSLSYASLI 60

Query: 341  FSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSR-LRPDGYTFPFVFMACANGFL 517
            FSQ+ NPN F YN LIRAYSRS  PQLA++YF LML+   + PD +TFPF+  AC N   
Sbjct: 61   FSQIQNPNLFAYNTLIRAYSRSSTPQLALHYFQLMLDDENVGPDQHTFPFIISACTNSLW 120

Query: 518  WSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNILIN 697
               G+Q+H WV+KNG+  S+ HVQ+ALVRFYAE   + DARK+FDEI N+D +Q N+L+N
Sbjct: 121  MLLGKQIHNWVLKNGVASSDRHVQTALVRFYAECCAMGDARKLFDEIPNLDVVQWNVLLN 180

Query: 698  GYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLR 877
            GYV+ G+A EA + FRNMLV G+EPDEFCLTTAL  CAQLG L QGKWIHEYV  RK L 
Sbjct: 181  GYVRRGLAPEALNAFRNMLVSGVEPDEFCLTTALKGCAQLGALQQGKWIHEYVTKRKWLE 240

Query: 878  VADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLER 1057
             AD F+GTALVDMYAKCG ID + EVF  M KRN FSW+A+IGG+A+HG   +A QCLER
Sbjct: 241  -ADVFIGTALVDMYAKCGCIDRSVEVFEGMTKRNVFSWSAMIGGFALHGHVRKAMQCLER 299

Query: 1058 MQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRA 1237
            MQ EDG+ PDG+ +LGV+ AC HAGL ++GQFLLENME+ YG++P+HEHYSC+VDLLCRA
Sbjct: 300  MQVEDGLRPDGVVLLGVIMACAHAGLQEEGQFLLENMEARYGILPKHEHYSCMVDLLCRA 359

Query: 1238 GQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAY 1417
            GQL EA++LIRRMPMKP A+VWGALLSGCR+HNNV                       AY
Sbjct: 360  GQLDEALKLIRRMPMKPRAAVWGALLSGCRTHNNVDLAELAARELLMVGNGDGTEEDGAY 419

Query: 1418 VQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIH 1597
            VQLSNIYL A++ +DA RIRRMIG+K +K  PG S +E++G  ++F+SGD+SH  L +IH
Sbjct: 420  VQLSNIYLAAQKCEDACRIRRMIGDKRIKTKPGCSLIEVEGEVNQFVSGDISHPCLAQIH 479

Query: 1598 AVLYLMSLEVS 1630
             +L L+SL+ S
Sbjct: 480  EMLDLVSLQHS 490


>ref|XP_004233685.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At3g28640-like [Solanum lycopersicum]
          Length = 487

 Score =  608 bits (1569), Expect = e-171
 Identities = 297/485 (61%), Positives = 370/485 (76%), Gaps = 2/485 (0%)
 Frame = +2

Query: 188  LAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLISFCTLSENGDISYGSFLFSQLPNPNS 367
            +A++C NM QLK IHAI+I  G+ RN+YAV KL+ FC LS +GD+SY S +F+Q+  PN+
Sbjct: 1    MAEKCNNMRQLKAIHAIYITLGLQRNTYAVSKLLDFCALSNSGDLSYASRIFAQVQTPNA 60

Query: 368  FVYNNLIRAYSRSPQPQLAVNYFNLML--NSRLRPDGYTFPFVFMACANGFLWSEGRQLH 541
            F+YN LIRAYS SPQPQ+++NYFNLM+  ++   PD +TFPF+ +ACANG L  EG+Q+H
Sbjct: 61   FLYNALIRAYSSSPQPQVSLNYFNLMVQTSNAAAPDSFTFPFLLIACANGPLEVEGKQIH 120

Query: 542  TWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNILINGYVKCGMA 721
            +W+IKN    SNAHVQ+AL+RFY   K LDDARKVFDEIT++D IQCN+L++G+++ G+A
Sbjct: 121  SWIIKNSFSASNAHVQTALIRFYTNCKALDDARKVFDEITDIDVIQCNVLMSGHLQSGLA 180

Query: 722  LEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGT 901
             EA  +F++ML RG+ PDE+C+TTAL ACAQLG L+QGKWIHE+V   + L   D F+G+
Sbjct: 181  KEALSIFQDMLGRGVGPDEYCVTTALGACAQLGALEQGKWIHEHVTKSEWLEY-DVFIGS 239

Query: 902  ALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIM 1081
            ALVDMYAKCG I+ A EVF  MP RNK SWA +I G+AVHG    A  CLERMQ  DG+ 
Sbjct: 240  ALVDMYAKCGSINLASEVFESMPTRNKHSWATMIRGFAVHGRPELALSCLERMQVADGLK 299

Query: 1082 PDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVE 1261
            PDG+ +L VLAAC H+GL K+GQ LL+ MESLYG+ PEHEH+SCVVDLLCRAG+L +A++
Sbjct: 300  PDGVVILAVLAACAHSGLQKEGQGLLDEMESLYGVTPEHEHFSCVVDLLCRAGRLDDALK 359

Query: 1262 LIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYL 1441
            LIRRMPMKP ASVWGALLSGCR+HNNV                       AYVQLSNIYL
Sbjct: 360  LIRRMPMKPRASVWGALLSGCRNHNNVNLAELAVKEILLVEDGNEAEEDSAYVQLSNIYL 419

Query: 1442 GARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLMSL 1621
             ARQ DDARRIRR IG++GL+KTPGYSA+EIDG+ +EFISGDVSH  L +IH VL L  L
Sbjct: 420  AARQCDDARRIRRRIGDRGLRKTPGYSAIEIDGMVNEFISGDVSHICLADIHKVLDLTYL 479

Query: 1622 EVSID 1636
            +   D
Sbjct: 480  DPHFD 484


>gb|EMJ17608.1| hypothetical protein PRUPE_ppa022709mg, partial [Prunus persica]
          Length = 541

 Score =  600 bits (1548), Expect = e-169
 Identities = 299/479 (62%), Positives = 360/479 (75%), Gaps = 1/479 (0%)
 Frame = +2

Query: 182  MSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLISFCTLSENGDISYGSFLFSQLPNP 361
            MSLAQ C NM +LK  HAIFI +G+H N+YA+ KLI+FC LS +GD+SY S LF+Q+  P
Sbjct: 1    MSLAQGCSNMRKLKATHAIFITNGLHLNNYAISKLIAFCALSNSGDLSYASLLFNQIQTP 60

Query: 362  NSFVYNNLIRAYSRSPQPQLAVNYFNLMLN-SRLRPDGYTFPFVFMACANGFLWSEGRQL 538
            NS++YN LIRAYSRS QP LAV+YF LML  S L PD YTF FV +ACAN      GRQ+
Sbjct: 61   NSYLYNTLIRAYSRSSQPHLAVHYFLLMLKQSSLGPDNYTFNFVILACANCSWLVSGRQI 120

Query: 539  HTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNILINGYVKCGM 718
            H WV+KNG+F  +AHVQ+ALVR YAE KVLDD++KVFDEI   D IQ N+L+NGYV+CG+
Sbjct: 121  HNWVVKNGLFLVDAHVQTALVRLYAECKVLDDSKKVFDEIPERDVIQWNVLMNGYVRCGL 180

Query: 719  ALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLG 898
            A EA  VFR+MLV G EPD FC+ T L ACA LG L QGKWI EYVK R  L+ +D F+G
Sbjct: 181  ASEALKVFRDMLVTGFEPDNFCVATGLAACAHLGALRQGKWIDEYVKKRTGLK-SDVFIG 239

Query: 899  TALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGI 1078
            TALVDMYAKCG ID A E F  MPKRN  SWAA+IGG+A HGCA  A   LERMQ +DG+
Sbjct: 240  TALVDMYAKCGCIDLAVEAFEGMPKRNVVSWAAMIGGFAAHGCATNAIHSLERMQVDDGL 299

Query: 1079 MPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAV 1258
             PDG+ +L VL AC HAGL+++G+ LL+NM++ YG+VP+HEHYSCV+DLLC+AG+L EA+
Sbjct: 300  RPDGVVLLVVLMACTHAGLLEKGKLLLDNMKTQYGIVPKHEHYSCVIDLLCKAGRLNEAL 359

Query: 1259 ELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIY 1438
            +LIR+MPMKPLASVWGALLSGCR HNNV                       AYVQLSNIY
Sbjct: 360  KLIRKMPMKPLASVWGALLSGCRIHNNVDLAELAVKELLQLENDVRGEEVGAYVQLSNIY 419

Query: 1439 LGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLM 1615
            LGAR+ +DA RIR+MIGE G+KKTPG S +E+DG  +EF+SGDVSHS    I A+L L+
Sbjct: 420  LGARRGEDAIRIRKMIGESGIKKTPGCSMIEVDGKVNEFVSGDVSHSHQAWICAMLDLI 478


>ref|XP_004304947.1| PREDICTED: pentatricopeptide repeat-containing protein At3g28660-like
            [Fragaria vesca subsp. vesca]
          Length = 501

 Score =  569 bits (1467), Expect = e-159
 Identities = 280/489 (57%), Positives = 352/489 (71%), Gaps = 1/489 (0%)
 Frame = +2

Query: 161  IKAWRWCMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLISFCTLSENGDISYGSFL 340
            I+AW+ CMSLAQ C  M  LK  HA+FI HG+H N++AV KL++FC LS++G + Y S +
Sbjct: 9    IQAWKRCMSLAQCCTTMRSLKPTHAVFITHGLHLNNFAVSKLLAFCALSDSGSLRYASLI 68

Query: 341  FSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNS-RLRPDGYTFPFVFMACANGFL 517
            F Q+P PN+++YN LIRA+S S  P LA+ YF LM     L PD +TF F  + C N   
Sbjct: 69   FHQVPAPNAYMYNTLIRAHSASSDPHLAMYYFQLMSKQIDLEPDNFTFHFAILGCVNCGW 128

Query: 518  WSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNILIN 697
               GRQ+H  V+KNG+  ++AHVQ+A+VR Y E  VL DA KVFDEI   D +Q N+++N
Sbjct: 129  IGPGRQMHCLVVKNGLVAADAHVQTAVVRLYVECGVLGDAHKVFDEIPERDMVQWNVIMN 188

Query: 698  GYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLR 877
            GYVK G+A EA  VF++MLVRG EPD FC+ T L ACA LG L QGKWIHEYV+ R+ L 
Sbjct: 189  GYVKRGLASEALRVFQDMLVRGFEPDGFCVATGLAACAHLGALWQGKWIHEYVRKREGLN 248

Query: 878  VADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLER 1057
             +D F+GTALVDMYAKCG ID A E F  M KRN  SW+A+IG Y VHG A +A  CLER
Sbjct: 249  -SDVFIGTALVDMYAKCGCIDLAVEAFEGMGKRNVVSWSAMIGAYGVHGYATEAISCLER 307

Query: 1058 MQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRA 1237
            MQ +DG+ PDG+ +LGVL AC H GL+++G+ LL+NM++ YG+VP+HEHYSCV+DLLC+A
Sbjct: 308  MQVDDGVKPDGVVLLGVLTACNHGGLLEKGKALLDNMKAKYGIVPKHEHYSCVIDLLCKA 367

Query: 1238 GQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAY 1417
            G+L +A ELIRRMPMKPLASVWGALLSGCR HNNV                       AY
Sbjct: 368  GRLSDAFELIRRMPMKPLASVWGALLSGCRIHNNVDLAEIAVEQLLQVANDDRGEEVGAY 427

Query: 1418 VQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIH 1597
            VQLSNIYLGA++ +DA RIR+MIGEKG+KKTPG S +E+DG  +EF+SGDVSHS   +I 
Sbjct: 428  VQLSNIYLGAQRSEDALRIRKMIGEKGIKKTPGCSMLEVDGKVNEFVSGDVSHSHCVQIC 487

Query: 1598 AVLYLMSLE 1624
             +L L+S +
Sbjct: 488  TMLDLISAD 496


>ref|NP_189507.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75273574|sp|Q9LJI9.1|PP260_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g28660 gi|9294280|dbj|BAB02182.1| unnamed protein
            product [Arabidopsis thaliana] gi|20259531|gb|AAM13885.1|
            unknown protein [Arabidopsis thaliana]
            gi|24030460|gb|AAN41382.1| unknown protein [Arabidopsis
            thaliana] gi|332643950|gb|AEE77471.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  526 bits (1354), Expect = e-146
 Identities = 251/494 (50%), Positives = 353/494 (71%), Gaps = 5/494 (1%)
 Frame = +2

Query: 164  KAWRWCMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLIS-FCTLSE-NGDISYGSF 337
            ++W+  +  +QRC  + Q+K+ H++FI HG+HRN+YA+ KL++ F  L   N    Y S 
Sbjct: 9    QSWKSLILASQRCNTVKQIKSTHSLFIIHGLHRNTYAISKLLTAFLHLPNLNKHFHYASS 68

Query: 338  LFSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSR---LRPDGYTFPFVFMACAN 508
            +F  +  PNSFVY+ +IR  SRS QP L + YF LM+      + P   TF F+ +AC  
Sbjct: 69   IFDSIEIPNSFVYDTMIRICSRSSQPHLGLRYFLLMVKEEEEDITPSYLTFHFLIVACLK 128

Query: 509  GFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNI 688
               +S G+Q+H WV+KNG+F S+ HVQ+ ++R Y E K+L DARKVFDEI   D ++ ++
Sbjct: 129  ACFFSVGKQIHCWVVKNGVFLSDGHVQTGVLRIYVEDKLLFDARKVFDEIPQPDVVKWDV 188

Query: 689  LINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRK 868
            L+NGYV+CG+  E  +VF+ MLVRGIEPDEF +TTALTACAQ+G L QGKWIHE+VK ++
Sbjct: 189  LMNGYVRCGLGSEGLEVFKEMLVRGIEPDEFSVTTALTACAQVGALAQGKWIHEFVKKKR 248

Query: 869  NLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQC 1048
             +  +D F+GTALVDMYAKCG I+TA EVF ++ +RN FSWAA+IGGYA +G A +A  C
Sbjct: 249  WIE-SDVFVGTALVDMYAKCGCIETAVEVFEKLTRRNVFSWAALIGGYAAYGYAKKATTC 307

Query: 1049 LERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLL 1228
            L+R++ EDGI PD + +LGVLAAC H G +++G+ +LENME+ YG+ P+HEHYSC+VDL+
Sbjct: 308  LDRIEREDGIKPDSVVLLGVLAACAHGGFLEEGRTMLENMEARYGITPKHEHYSCIVDLM 367

Query: 1229 CRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXX 1408
            CRAG+L +A++LI +MPMKPLASVWGALL+GCR+H NV                      
Sbjct: 368  CRAGRLDDALDLIEKMPMKPLASVWGALLNGCRTHKNVELGELAVQNLLDLEKGNVEEEE 427

Query: 1409 XAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLG 1588
             A VQLSNIY   ++  +A ++R MI ++G++KTPG+S +E+DG+  +F+SGDVSH  L 
Sbjct: 428  AALVQLSNIYFSVQRNPEAFKVRGMIEQRGIRKTPGWSLLEVDGIVTKFVSGDVSHPNLL 487

Query: 1589 EIHAVLYLMSLEVS 1630
            +IH +++L+S++ S
Sbjct: 488  QIHTLIHLLSVDAS 501


>ref|NP_189505.2| putative pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana] gi|75273576|sp|Q9LJJ1.1|PP259_ARATH RecName:
            Full=Putative pentatricopeptide repeat-containing protein
            At3g28640 gi|9294278|dbj|BAB02180.1| unnamed protein
            product [Arabidopsis thaliana]
            gi|332643948|gb|AEE77469.1| putative pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  521 bits (1343), Expect = e-145
 Identities = 251/492 (51%), Positives = 350/492 (71%), Gaps = 5/492 (1%)
 Frame = +2

Query: 164  KAWRWCMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLIS-FCTLSE-NGDISYGSF 337
            ++W+  +  +QRC  + Q+K+ H++FI HG+HRN+YA+ KL++ F  L   N    Y S 
Sbjct: 9    QSWKSLILASQRCNTVKQIKSTHSLFIIHGLHRNTYAISKLLTAFLHLPNLNKHFHYASS 68

Query: 338  LFSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSR---LRPDGYTFPFVFMACAN 508
            +F  +  PNSFVY+ +IR  SRS QP L + YF LM+      + P   TF F+ +AC  
Sbjct: 69   IFDSIEIPNSFVYDTMIRICSRSSQPHLGLRYFLLMVKEEEEDIAPSYLTFHFLIVACLK 128

Query: 509  GFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNI 688
               +S G+Q+H WV+KNG+F S++HVQ+ ++R Y E K+L DARKVFDEI   D ++ ++
Sbjct: 129  ACFFSVGKQIHCWVVKNGVFLSDSHVQTGVLRIYVEDKLLLDARKVFDEIPQPDVVKWDV 188

Query: 689  LINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRK 868
            L+NGYV+CG+  E  +VFR MLV+G+EPDEF +TTALTACAQ+G L QGKWIHE+VK +K
Sbjct: 189  LMNGYVRCGLGSEGLEVFREMLVKGLEPDEFSVTTALTACAQVGALAQGKWIHEFVK-KK 247

Query: 869  NLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQC 1048
            +   +D F+GTALVDMYAKCG I+TA EVF ++ +RN FSWAA+IGGYA +G A +A  C
Sbjct: 248  SWIESDVFVGTALVDMYAKCGCIETAVEVFKKLTRRNVFSWAALIGGYAAYGYAKKAMTC 307

Query: 1049 LERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLL 1228
            LER++ EDGI PD + +LGVLAAC H G +++G+ +LENME+ Y + P+HEHYSC+VDL+
Sbjct: 308  LERLEREDGIKPDSVVLLGVLAACAHGGFLEEGRSMLENMEARYEITPKHEHYSCIVDLM 367

Query: 1229 CRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXX 1408
            CRAG+L +A+ LI +MPMKPLASVWGALL+GCR+H NV                      
Sbjct: 368  CRAGRLDDALNLIEKMPMKPLASVWGALLNGCRTHKNVELGELAVKNLLDLEKGNVEEEE 427

Query: 1409 XAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLG 1588
             A VQLSNIY   ++  +A ++R MI ++G++KTPG+S +E+DG   +F+SGDVSH  L 
Sbjct: 428  AALVQLSNIYFSVQRNPEASKVRGMIEQRGVRKTPGWSVLEVDGNVTKFVSGDVSHPNLL 487

Query: 1589 EIHAVLYLMSLE 1624
            +IH V++L+S++
Sbjct: 488  QIHTVIHLLSVD 499


>ref|XP_006290938.1| hypothetical protein CARUB_v10017051mg [Capsella rubella]
            gi|482559645|gb|EOA23836.1| hypothetical protein
            CARUB_v10017051mg [Capsella rubella]
          Length = 507

 Score =  521 bits (1343), Expect = e-145
 Identities = 254/495 (51%), Positives = 348/495 (70%), Gaps = 8/495 (1%)
 Frame = +2

Query: 164  KAWRWCMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLIS-FCTLSE-NGDISYGSF 337
            ++W+  +  +QRC  + Q+K+ HA+FI HG+HRN+YA+ KL++ F  L   N    Y S 
Sbjct: 9    QSWKTLILASQRCNTVKQIKSTHALFIIHGLHRNTYAISKLLTAFLHLPNLNKHFHYAST 68

Query: 338  LFSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSR------LRPDGYTFPFVFMA 499
            +F  +   N+FVY+ +IR  SRS  PQL + YF LM++        + P   TF F+ +A
Sbjct: 69   IFDSIEIRNTFVYDTMIRICSRSSLPQLGLRYFRLMVSEDEKEEEDIAPSYLTFHFLIVA 128

Query: 500  CANGFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQ 679
            C    L+S G+Q+H WV+KNG+F S+ HVQ+ ++R Y E +VL DARKVFDEI   D ++
Sbjct: 129  CLKACLFSVGKQIHCWVVKNGVFLSDGHVQTGVLRIYVEDRVLVDARKVFDEIPQPDVVK 188

Query: 680  CNILINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVK 859
             ++L+NGYV+CG+  E  +VFR MLVRGIEPDEF +TTALTACAQ+G L QGKWIHE+VK
Sbjct: 189  WDVLMNGYVRCGLGSEGLEVFREMLVRGIEPDEFSVTTALTACAQVGALAQGKWIHEFVK 248

Query: 860  NRKNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQA 1039
             +K ++ +D F+GTALVDMYAKCG I+TA EVF ++ +RN FSWAA+IGGYA +G A +A
Sbjct: 249  KKKWVK-SDVFVGTALVDMYAKCGCIETAVEVFEKLTRRNVFSWAALIGGYAAYGYAREA 307

Query: 1040 FQCLERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVV 1219
              CL+RM+ ED I PD + +LGVLAAC H G +++G+ +L+NMES YG+ P+HEHYSC+V
Sbjct: 308  IMCLDRMEREDAIKPDSVVLLGVLAACAHGGFLQEGRSMLDNMESRYGITPKHEHYSCIV 367

Query: 1220 DLLCRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXX 1399
            DL+CRAG+L  A++LI +MPMKPLASVWGALL+GCR+H NV                   
Sbjct: 368  DLMCRAGRLDGALDLIEKMPMKPLASVWGALLNGCRTHKNVELGELAVKNLLELEKGNVD 427

Query: 1400 XXXXAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHS 1579
                A VQLSNIY   ++  +A +IR MI +KG+KK PG S +E+DG    F+SGD+SH 
Sbjct: 428  EEEAALVQLSNIYFSVQRNPEASKIRGMIDQKGIKKAPGCSVLEVDGDVTRFVSGDLSHP 487

Query: 1580 RLGEIHAVLYLMSLE 1624
             L +IH V++L+S++
Sbjct: 488  NLLQIHTVIHLLSVD 502


>gb|EOY03349.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
            [Theobroma cacao]
          Length = 499

 Score =  516 bits (1328), Expect = e-143
 Identities = 252/485 (51%), Positives = 331/485 (68%), Gaps = 1/485 (0%)
 Frame = +2

Query: 164  KAWRWCMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLISFCTLSENGDISYGSFLF 343
            + W  C++L QRC    Q++ IHA+ I  G+HRN   + KLISF + S   ++ Y S LF
Sbjct: 10   QCWTRCLTLLQRCTKASQIEPIHALLITQGLHRNPCIISKLISFLS-SPPTNLHYSSLLF 68

Query: 344  SQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSR-LRPDGYTFPFVFMACANGFLW 520
            +QL     F+YN LI+A+S SP PQ + +YFN +L    +RP+  T  F+ ++CA     
Sbjct: 69   NQLHKSTLFIYNTLIKAHSNSPHPQTSFHYFNHLLEEETIRPNCQTLNFILVSCAKTCSL 128

Query: 521  SEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNILING 700
              G+Q+  WV KNG+F S+++VQ+ ++R Y E ++  DARKVFDEI  VD ++ N+L++G
Sbjct: 129  RSGKQIQNWVFKNGMFSSDSYVQTGVIRLYVEARLWVDARKVFDEIAYVDVVKWNVLMSG 188

Query: 701  YVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRV 880
              +C +  +A  VF+ +LV GI+PDEFCLTTALTACAQ G L +GKWIHEY++ R+    
Sbjct: 189  LARCRLGTQALSVFKELLVFGIQPDEFCLTTALTACAQNGSLREGKWIHEYLRKREKCLE 248

Query: 881  ADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERM 1060
             D F+GTALVDMYAKCG +D A EVF  M KRN +SWAA+IGG+AVHG A +A  C ERM
Sbjct: 249  LDVFIGTALVDMYAKCGCLDLAVEVFEGMSKRNVYSWAAMIGGFAVHGHARKAIHCFERM 308

Query: 1061 QAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAG 1240
            Q  DGI PDG+ +LGVL AC HAGL ++G FLL NME  Y +VP+HEHYSCVVDLLCR G
Sbjct: 309  Q-NDGIRPDGVVLLGVLTACTHAGLAEEGLFLLNNMEGQYRIVPKHEHYSCVVDLLCRTG 367

Query: 1241 QLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYV 1420
            +  EA++LIRRMPM+PLASVWGALL+ CR +NNV                       A V
Sbjct: 368  KFDEALKLIRRMPMRPLASVWGALLNSCRIYNNVQLAELAVKELLELEDCDGDEEDAALV 427

Query: 1421 QLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHA 1600
            QLSNIY  A++ +D  RIRRMIG++GLKK PG S +E+DG   EF+SGD+SH    +IH 
Sbjct: 428  QLSNIYFSAQKSEDGHRIRRMIGDRGLKKAPGCSMIEVDGRMTEFVSGDISHPLHSQIHT 487

Query: 1601 VLYLM 1615
            +L L+
Sbjct: 488  ILRLL 492


>ref|XP_006395353.1| hypothetical protein EUTSA_v10005682mg [Eutrema salsugineum]
            gi|557091992|gb|ESQ32639.1| hypothetical protein
            EUTSA_v10005682mg [Eutrema salsugineum]
          Length = 505

 Score =  513 bits (1321), Expect = e-142
 Identities = 254/493 (51%), Positives = 342/493 (69%), Gaps = 6/493 (1%)
 Frame = +2

Query: 164  KAWRWCMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLISFCTLSENGD--ISYGSF 337
            ++WR  +  +QRC  + Q+K+ HA+FI HGIHRN+YA+ KL++      N D    Y S 
Sbjct: 9    QSWRSLILASQRCTTLRQIKSTHALFIIHGIHRNTYAISKLLTAFLPLPNLDKHFHYASI 68

Query: 338  LFSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSR----LRPDGYTFPFVFMACA 505
            +F  +   NSFVY+ +IR  SRS +P L V YF LML       + P   TF F+ +A  
Sbjct: 69   IFDSIELRNSFVYDTMIRICSRSSRPHLGVRYFRLMLTEDDEEDIAPSYLTFHFLLVAFL 128

Query: 506  NGFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCN 685
            N  L+S G+Q+H WVIKNG+  S+ HVQ+ ++R Y E KVL DARKVFDEI + D ++ +
Sbjct: 129  NASLFSVGKQIHCWVIKNGVLSSDGHVQTGIIRLYIEGKVLPDARKVFDEIPHPDVVKWD 188

Query: 686  ILINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNR 865
            +L+NGYV+CG+  E   VFR ML RG EPD+F +TTALTACAQ+G L QGK IH+ +K +
Sbjct: 189  VLMNGYVRCGLGSEGLHVFREMLARGTEPDKFSVTTALTACAQVGALAQGKLIHKLLKKK 248

Query: 866  KNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQ 1045
            K L  +D ++GTALVDMYAKCG I+TA EVF  + +RN FSWA +IGGYA +G A +A  
Sbjct: 249  KLLE-SDIYVGTALVDMYAKCGCIETALEVFENLSRRNVFSWAVLIGGYAAYGYAKKAIM 307

Query: 1046 CLERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDL 1225
            CL++M+ EDGI PD + +L VLAAC H G +++G+ LL+NME+ YG+ P+HEHYSC+VDL
Sbjct: 308  CLDQMEREDGIKPDSVVLLTVLAACAHGGFLQEGRALLDNMEARYGITPKHEHYSCIVDL 367

Query: 1226 LCRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXX 1405
            +CRAG+L +AV+LI  MPMKPLASVWGALL+GCR+H NV                     
Sbjct: 368  ICRAGRLDDAVDLIEGMPMKPLASVWGALLNGCRTHKNVELGELAVKNLLDLEKGNADEE 427

Query: 1406 XXAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRL 1585
              A VQLSNIYL A++  +A  IRRMIG+KG++K PG S +E+DG   +F+SGDVSH  L
Sbjct: 428  EAALVQLSNIYLIAQRNTEASNIRRMIGQKGIRKAPGCSVLEVDGNVTKFVSGDVSHQNL 487

Query: 1586 GEIHAVLYLMSLE 1624
             +IH +++L+S++
Sbjct: 488  LQIHTMIHLLSVD 500


>gb|EXB36666.1| hypothetical protein L484_002079 [Morus notabilis]
          Length = 487

 Score =  509 bits (1311), Expect = e-141
 Identities = 266/489 (54%), Positives = 331/489 (67%), Gaps = 2/489 (0%)
 Frame = +2

Query: 158  SIKAWRW--CMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLISFCTLSENGDISYG 331
            SI++W W   +SLAQRC NM QLK IHA+FI  G+H N+YA+ KLI+FC LS++GD+ + 
Sbjct: 7    SIQSWNWKRFISLAQRCANMRQLKPIHALFITTGLHLNNYAISKLIAFCALSDSGDLRHA 66

Query: 332  SFLFSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSRLRPDGYTFPFVFMACANG 511
            S +F+Q+  PNSF+YN LIRAYSRS QP LA+ YF   +  ++  D  TF FV +AC NG
Sbjct: 67   SLMFNQIQTPNSFIYNTLIRAYSRSSQPHLALRYFQPTVKDKVA-DNLTFSFVLLACVNG 125

Query: 512  FLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNIL 691
             L  EG Q+H +    G  ES        VR +                   D  Q N L
Sbjct: 126  GLVLEGTQVHCY---GGCKES--------VRGHR------------------DLFQWNAL 156

Query: 692  INGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKN 871
            ++GY++C +A EA  VFR+ML  G+E DE C  TALTACAQ G L  GKWIHEY++ R+ 
Sbjct: 157  MDGYIRCSLASEALGVFRDMLKFGVELDECCAVTALTACAQSGALWWGKWIHEYIEKREG 216

Query: 872  LRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCL 1051
               +D F+GTALVDMY KCG +D A EVF +MP RN FSWAAIIGG+AVHG   +A +CL
Sbjct: 217  FE-SDVFVGTALVDMYTKCGCLDMAVEVFEKMPTRNAFSWAAIIGGFAVHGQVMEAIRCL 275

Query: 1052 ERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLC 1231
            ERMQA+DG+ PDG+ +LGVL AC HAGL K+GQ LL NMES YG++P+HEHYSCVVDLLC
Sbjct: 276  ERMQADDGLKPDGVVLLGVLTACTHAGLQKEGQLLLHNMESQYGILPKHEHYSCVVDLLC 335

Query: 1232 RAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXX 1411
            RAG+L EA +LIRRMPMKPLASVWGALLSGCR  N V                       
Sbjct: 336  RAGKLREAYQLIRRMPMKPLASVWGALLSGCRIRNYVDLAELAVKELVLLENEDKRGQDG 395

Query: 1412 AYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGE 1591
             YVQLSNIYL A++ +DA  IR+MIG+KG++KTPG S VE+DG  +EF+SGD+ HS   +
Sbjct: 396  VYVQLSNIYLAAQRTEDAVLIRKMIGDKGIRKTPGCSTVEVDGRVNEFVSGDIVHSCQAK 455

Query: 1592 IHAVLYLMS 1618
            I  +LYL+S
Sbjct: 456  ICVMLYLLS 464


>ref|XP_002877120.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297322958|gb|EFH53379.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 399

 Score =  447 bits (1150), Expect = e-123
 Identities = 209/389 (53%), Positives = 286/389 (73%)
 Frame = +2

Query: 458  LRPDGYTFPFVFMACANGFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDA 637
            + P   TF F+ +AC    L+S G+Q+H WV+KNG+F S+ HVQ+ ++R Y E KVL DA
Sbjct: 7    IAPSYLTFYFLIVACFKACLFSVGKQIHCWVVKNGVFLSDGHVQTGILRIYVEDKVLLDA 66

Query: 638  RKVFDEITNVDAIQCNILINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQL 817
             KVFDEI   D ++ ++L+NGYV+CG+  E  +VFR MLVRG+EPDEF +TTALTACAQ+
Sbjct: 67   HKVFDEIPKPDVVKWDVLMNGYVRCGLGSEGLEVFREMLVRGVEPDEFSVTTALTACAQV 126

Query: 818  GDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAA 997
            G L QGKWIHE+VK ++ +  +D F+GTALVDMYAKCG I+ A EVF ++ +RN FSWAA
Sbjct: 127  GALAQGKWIHEFVKKKRWIE-SDVFVGTALVDMYAKCGCIEMAVEVFEKLSRRNVFSWAA 185

Query: 998  IIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESL 1177
            +IGGYA +G A +A  CL+RM+ EDGI PD + +LGVLAAC H G +++G+ +L NME+ 
Sbjct: 186  LIGGYAAYGYAKKAMTCLDRMEREDGIKPDSVVLLGVLAACAHGGFLQEGRAMLGNMEAR 245

Query: 1178 YGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXX 1357
            YG+ P+HEHYSC+VDL+CRAG+L +A++LI +MPMKPLASVWGALL+GCR+H NV     
Sbjct: 246  YGITPKHEHYSCIVDLMCRAGRLDDALDLIEKMPMKPLASVWGALLNGCRTHKNVELGEL 305

Query: 1358 XXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEID 1537
                              A VQLSNIY   ++  +A ++R MI ++G++KTPG+S +E+D
Sbjct: 306  AVKNLLDLEKGNAEEEEAALVQLSNIYFSVQRNPEASKVRGMIEQRGIRKTPGWSVLEVD 365

Query: 1538 GLFHEFISGDVSHSRLGEIHAVLYLMSLE 1624
            G   +F+SGDVSH  L +IH V++L+S++
Sbjct: 366  GNVTKFVSGDVSHPNLLQIHTVIHLLSVD 394



 Score = 91.3 bits (225), Expect = 1e-15
 Identities = 63/230 (27%), Positives = 109/230 (47%), Gaps = 2/230 (0%)
 Frame = +2

Query: 338  LFSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSRLRPDGYTFPFVFMACANGFL 517
            +F ++P P+   ++ L+  Y R       +  F  ML   + PD ++      ACA    
Sbjct: 69   VFDEIPKPDVVKWDVLMNGYVRCGLGSEGLEVFREMLVRGVEPDEFSVTTALTACAQVGA 128

Query: 518  WSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNILIN 697
             ++G+ +H +V K    ES+  V +ALV  YA+   ++ A +VF++++  +      LI 
Sbjct: 129  LAQGKWIHEFVKKKRWIESDVFVGTALVDMYAKCGCIEMAVEVFEKLSRRNVFSWAALIG 188

Query: 698  GYVKCGMALEAQDVFRNM-LVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNL 874
            GY   G A +A      M    GI+PD   L   L ACA  G L +G+ +   ++ R  +
Sbjct: 189  GYAAYGYAKKAMTCLDRMEREDGIKPDSVVLLGVLAACAHGGFLQEGRAMLGNMEARYGI 248

Query: 875  RVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFS-WAAIIGGYAVH 1021
                E   + +VD+  + G +D A ++  +MP +   S W A++ G   H
Sbjct: 249  TPKHEHY-SCIVDLMCRAGRLDDALDLIEKMPMKPLASVWGALLNGCRTH 297


>ref|XP_002517451.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223543462|gb|EEF44993.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 428

 Score =  410 bits (1053), Expect = e-111
 Identities = 198/347 (57%), Positives = 256/347 (73%)
 Frame = +2

Query: 575  NAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNILINGYVKCGMALEAQDVFRNML 754
            + H+Q+A+VR YA+ K++ DA K+FDEI   D IQ N+L+NGY++  +  EA  VFR M 
Sbjct: 63   DGHIQTAVVRLYAKCKIMSDAHKMFDEIHRPDVIQWNVLMNGYIESNLESEALRVFRFMF 122

Query: 755  VRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGY 934
            V+G+EPDEFC+TTAL ACA+ G L QGKWIHEYVK  K     D F+GTALVDMYAKCG+
Sbjct: 123  VKGVEPDEFCVTTALAACAKSGALWQGKWIHEYVK--KTTLGFDVFIGTALVDMYAKCGW 180

Query: 935  IDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLA 1114
            I+ A +VF  MPKR+ FSWAA+IGGYA+HG A +A   LERM AEDG+ PDG+ +LGVL 
Sbjct: 181  INMAVQVFEEMPKRSAFSWAAMIGGYAIHGYAREAIHYLERMHAEDGLRPDGVVLLGVLT 240

Query: 1115 ACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLA 1294
            AC HAGL ++G+FLL+NM++ YG+VP HEHYSCVVDLLCRAG+  EA+ LI+RMPMKPLA
Sbjct: 241  ACTHAGLQEEGRFLLDNMKARYGIVPRHEHYSCVVDLLCRAGRWDEALALIKRMPMKPLA 300

Query: 1295 SVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQYDDARRI 1474
            SVWGA+LS CR+H N                        A+VQL NIYL   + +DA +I
Sbjct: 301  SVWGAVLSSCRTHKNAELAEFAVQELLQLENGNGNEEDAAFVQLWNIYLSTGRGEDASKI 360

Query: 1475 RRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSHSRLGEIHAVLYLM 1615
             R+IGE+GLKKTPG S +E++G+ +EF+SGDVS+  + ++HA+L L+
Sbjct: 361  HRLIGERGLKKTPGCSMIEVNGMVNEFVSGDVSNKDVAQMHAILELL 407



 Score =  149 bits (375), Expect = 6e-33
 Identities = 106/325 (32%), Positives = 156/325 (48%), Gaps = 19/325 (5%)
 Frame = +2

Query: 158  SIKAWRWCMSLAQRCINMGQLKTIHAIFIAHGIHRNSYAVGKLISFCTLSENGD------ 319
            SIKAW+ CMSL Q+C NM QLK IHA FI +GIH N+YA+ KLI FC LS +        
Sbjct: 11   SIKAWKHCMSLVQQCANMRQLKAIHATFIVNGIHTNTYAISKLIDFCALSPHDGHIQTAV 70

Query: 320  ---------ISYGSFLFSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSRLRPDG 472
                     +S    +F ++  P+   +N L+  Y  S     A+  F  M    + PD 
Sbjct: 71   VRLYAKCKIMSDAHKMFDEIHRPDVIQWNVLMNGYIESNLESEALRVFRFMFVKGVEPDE 130

Query: 473  YTFPFVFMACA-NGFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVF 649
            +       ACA +G LW +G+ +H +V K      +  + +ALV  YA+   ++ A +VF
Sbjct: 131  FCVTTALAACAKSGALW-QGKWIHEYV-KKTTLGFDVFIGTALVDMYAKCGWINMAVQVF 188

Query: 650  DEITNVDAIQCNILINGYVKCGMALEAQDVFRNMLVR-GIEPDEFCLTTALTACAQLGDL 826
            +E+    A     +I GY   G A EA      M    G+ PD   L   LTAC   G  
Sbjct: 189  EEMPKRSAFSWAAMIGGYAIHGYAREAIHYLERMHAEDGLRPDGVVLLGVLTACTHAGLQ 248

Query: 827  DQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFS-WAAII 1003
            ++G+++ + +K R  +    E   + +VD+  + G  D A  +  RMP +   S W A++
Sbjct: 249  EEGRFLLDNMKARYGIVPRHEHY-SCVVDLLCRAGRWDEALALIKRMPMKPLASVWGAVL 307

Query: 1004 GGYAVHGCANQA-FQCLERMQAEDG 1075
                 H  A  A F   E +Q E+G
Sbjct: 308  SSCRTHKNAELAEFAVQELLQLENG 332


>ref|NP_001173276.1| Os03g0158900 [Oryza sativa Japonica Group] gi|22773237|gb|AAN06843.1|
            Hypothetical protein [Oryza sativa Japonica Group]
            gi|108706287|gb|ABF94082.1| pentatricopeptide, putative,
            expressed [Oryza sativa Japonica Group]
            gi|125584986|gb|EAZ25650.1| hypothetical protein
            OsJ_09480 [Oryza sativa Japonica Group]
            gi|255674223|dbj|BAH92004.1| Os03g0158900 [Oryza sativa
            Japonica Group]
          Length = 490

 Score =  365 bits (937), Expect = 4e-98
 Identities = 196/434 (45%), Positives = 267/434 (61%), Gaps = 7/434 (1%)
 Frame = +2

Query: 338  LFSQLPNPNSFVYNNLIRAYSR---SP-QPQLAVNYFNLMLNSRLR---PDGYTFPFVFM 496
            L  + P+ +    N+L+R  SR   SP  P LA+    LML+       PD  +FPF   
Sbjct: 46   LLPRHPDLSLVALNSLLRVLSRRASSPAHPLLALRLLLLMLSPASPLPPPDHLSFPFALS 105

Query: 497  ACANGFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAI 676
            A A     S G QLH  ++KNG F S+ +V +AL++        DDAR+VFDE+   +AI
Sbjct: 106  AAAT-VSPSPGAQLHALLVKNGHFPSDHYVTTALLQLQLHAARPDDARRVFDELPRREAI 164

Query: 677  QCNILINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYV 856
              +++I  Y + GMA E   VFR M V G+ PD   LTTA+ ACAQ G L+ G+W+H YV
Sbjct: 165  HYDLVIGAYTRTGMAGEGLGVFRAMFVDGVAPDAVVLTTAIAACAQAGALECGEWVHRYV 224

Query: 857  KNRKNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQ 1036
            +      + D F+G+ALV MYAKCG ++ A  VF  MP+RN + W  ++G +AVHG A +
Sbjct: 225  EASAPWLLGDAFVGSALVSMYAKCGCLEQAVRVFDGMPERNDYVWGTMVGAFAVHGMAEE 284

Query: 1037 AFQCLERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCV 1216
            A  CL+RM  EDG+ PDG+AVLG L+AC HAG V+ G  LL+ M   YG+ P HEHY+C 
Sbjct: 285  AVSCLDRMAREDGVRPDGVAVLGALSACAHAGKVEDGLRLLKEMRRRYGVAPGHEHYACT 344

Query: 1217 VDLLCRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXX 1396
            VD+LCR G+L +AV LI  MPM PLASVWG++L+GCR++ NV                  
Sbjct: 345  VDMLCRVGRLEDAVALIETMPMAPLASVWGSVLTGCRTYANV-----ELAEVAAAELGKL 399

Query: 1397 XXXXXAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSH 1576
                  YVQLSNIYL + + DDARR+R++IG +G++K P YSAVE+DG+   F++ D +H
Sbjct: 400  GADEGVYVQLSNIYLDSNRKDDARRVRKLIGSRGIRKVPAYSAVEVDGVVRSFVADDQAH 459

Query: 1577 SRLGEIHAVLYLMS 1618
             +  EI  VL L++
Sbjct: 460  PQRVEIWEVLGLLA 473


>gb|EMT01880.1| hypothetical protein F775_14784 [Aegilops tauschii]
          Length = 426

 Score =  363 bits (931), Expect = 2e-97
 Identities = 190/416 (45%), Positives = 260/416 (62%), Gaps = 1/416 (0%)
 Frame = +2

Query: 464  PDGYTFPFVFMACANGFLWSE-GRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDAR 640
            PD  +FPF   A A   +    G QLH  ++KN +F S+ +V +AL++ +A     DDAR
Sbjct: 11   PDHLSFPFALSAAAAAPVAPPPGPQLHALLVKNALFPSDHYVTTALLQLHAPRP--DDAR 68

Query: 641  KVFDEITNVDAIQCNILINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLG 820
            +VFDE+   +AI  +++I  Y + GMA E   +FR M V G+ PD   LTTA+ ACAQ G
Sbjct: 69   RVFDELPRREAIHYDLVIGAYARAGMAAEGLALFRAMFVDGVAPDAVVLTTAVAACAQSG 128

Query: 821  DLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAI 1000
             L+ G+W+H YV++     +AD F+G+ALV MYAKCG +  A  VF  MP+RN++ W  +
Sbjct: 129  ALECGEWVHRYVESNAPGLLADAFVGSALVSMYAKCGCLQEAVRVFEGMPERNEYVWGTM 188

Query: 1001 IGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLY 1180
            +G +AVHG A +A  CLERM  EDG+ PDG+AVLG L+AC HAG V+ G  LL+ M   Y
Sbjct: 189  VGAFAVHGMAREAVACLERMAGEDGVRPDGVAVLGALSACAHAGKVEDGLRLLKEMRRRY 248

Query: 1181 GLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXX 1360
            G+ P HEHYSC VD+LCR G+L +AV LI  MPM PLASVWG+LL+GCR + NV      
Sbjct: 249  GVTPGHEHYSCTVDMLCRVGRLEDAVGLIGTMPMTPLASVWGSLLAGCRMYGNV---KLA 305

Query: 1361 XXXXXXXXXXXXXXXXXAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDG 1540
                              YVQLSNIYL A + DDARR+R++IG +GLKK P YSAVE+DG
Sbjct: 306  EVAAKELEKLGVGADEGVYVQLSNIYLDANRKDDARRVRKLIGSRGLKKVPAYSAVEVDG 365

Query: 1541 LFHEFISGDVSHSRLGEIHAVLYLMSLEVSIDQNK*RRHETSKSLLLSVNCIHKKK 1708
                F++ D +H R  EI  +L L++ ++ +  ++    E  ++      C +K+K
Sbjct: 366  ELSSFVADDQAHPRRFEIWDLLGLLADQMGLKSDE--EDEEEETTFSLRPCSNKRK 419



 Score = 79.0 bits (193), Expect = 7e-12
 Identities = 67/256 (26%), Positives = 116/256 (45%), Gaps = 9/256 (3%)
 Frame = +2

Query: 338  LFSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSRLRPDGYTFPFVFMACANGFL 517
            +F +LP   +  Y+ +I AY+R+      +  F  M    + PD         ACA    
Sbjct: 70   VFDELPRREAIHYDLVIGAYARAGMAAEGLALFRAMFVDGVAPDAVVLTTAVAACAQSGA 129

Query: 518  WSEGRQLHTWVIKN--GIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNIL 691
               G  +H +V  N  G+  ++A V SALV  YA+   L +A +VF+ +   +      +
Sbjct: 130  LECGEWVHRYVESNAPGLL-ADAFVGSALVSMYAKCGCLQEAVRVFEGMPERNEYVWGTM 188

Query: 692  INGYVKCGMALEAQDVFRNMLVR-GIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRK 868
            +  +   GMA EA      M    G+ PD   +  AL+ACA  G ++ G  + + ++ R 
Sbjct: 189  VGAFAVHGMAREAVACLERMAGEDGVRPDGVAVLGALSACAHAGKVEDGLRLLKEMRRRY 248

Query: 869  NLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFS-WAAIIGGYAVHG---CANQ 1036
             +    E   +  VDM  + G ++ A  +   MP     S W +++ G  ++G    A  
Sbjct: 249  GVTPGHEHY-SCTVDMLCRVGRLEDAVGLIGTMPMTPLASVWGSLLAGCRMYGNVKLAEV 307

Query: 1037 AFQCLERM--QAEDGI 1078
            A + LE++   A++G+
Sbjct: 308  AAKELEKLGVGADEGV 323


>ref|XP_006650550.1| PREDICTED: pentatricopeptide repeat-containing protein At3g28660-like
            [Oryza brachyantha]
          Length = 493

 Score =  362 bits (928), Expect = 4e-97
 Identities = 196/434 (45%), Positives = 265/434 (61%), Gaps = 7/434 (1%)
 Frame = +2

Query: 338  LFSQLPNPNSFVYNNLIRAYSR---SP-QPQLAVNYFNLMLNSRL---RPDGYTFPFVFM 496
            L  + P+ +    N+L+R  SR   SP  P LA+     ML+       PD  +FPF   
Sbjct: 50   LLPRHPDLSLVALNSLLRVLSRRAASPAHPMLALRLLVRMLSPGSPLPSPDHLSFPFALS 109

Query: 497  ACANGFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAI 676
            A A G     G QLH  V++NG+F S+ +V +AL++ +A     DDAR+VFDE+   +AI
Sbjct: 110  AAA-GAAPPPGDQLHALVVRNGLFPSDHYVTTALLQLHAPRP--DDARRVFDELPRREAI 166

Query: 677  QCNILINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGKWIHEYV 856
              +++I  Y + GMA E    FR M   G+ PD   LTTA+ ACAQ G L+ G+W+H YV
Sbjct: 167  HYDLVIGAYARAGMAAEGLGGFRAMFADGVLPDAVVLTTAIAACAQAGALECGEWMHRYV 226

Query: 857  KNRKNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAVHGCANQ 1036
            +      + D F+G+ALV MYAKCG ++ A  VF  MP+RN + W  ++G +AVHG A +
Sbjct: 227  ERTAPGLLGDAFVGSALVSMYAKCGCLEQAVRVFDGMPERNDYVWGTMVGAFAVHGMAEE 286

Query: 1037 AFQCLERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEHEHYSCV 1216
            A  CL+RM  EDG+ PDG+AVLG L+AC HAG V+ G  LL+ M   YG+ P HEHYSC 
Sbjct: 287  AVACLDRMAREDGVRPDGVAVLGALSACAHAGKVEDGLRLLKEMRRRYGVAPGHEHYSCT 346

Query: 1217 VDLLCRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXXXXXXXX 1396
            VD+LCR G+L +AV LI+ MPM PL SVWG++L+GCR + NV                  
Sbjct: 347  VDMLCRVGRLEDAVALIKTMPMAPLTSVWGSVLTGCRIYANV---ELAEVAAGELAKLGA 403

Query: 1397 XXXXXAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFISGDVSH 1576
                  YVQLSNIYL A + DDARR+R++IG +G++K P YSAVE+DG    F++ D +H
Sbjct: 404  GADEGVYVQLSNIYLDANRKDDARRVRKLIGSRGIRKVPAYSAVEVDGEVSSFVADDQAH 463

Query: 1577 SRLGEIHAVLYLMS 1618
             +  EI  VL L+S
Sbjct: 464  PQRVEIWGVLGLLS 477


>gb|EMS68422.1| hypothetical protein TRIUR3_23855 [Triticum urartu]
          Length = 425

 Score =  361 bits (927), Expect = 6e-97
 Identities = 189/405 (46%), Positives = 255/405 (62%), Gaps = 1/405 (0%)
 Frame = +2

Query: 464  PDGYTFPFVFMACANGFLWSE-GRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDAR 640
            PD  +FPF   A A   +    G QLH  ++KN +F S+ +V +AL++ +A     DDAR
Sbjct: 11   PDHLSFPFALSAAAVAPVAPPPGPQLHALLLKNALFPSDHYVTTALLQLHAPRP--DDAR 68

Query: 641  KVFDEITNVDAIQCNILINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLG 820
            +VFDE+   +AI  +++I  Y + GMA E   +FR M   G+ PD   LTTA+ ACAQ G
Sbjct: 69   RVFDELPRREAIHYDLVIGAYARAGMAAEGLALFRAMFADGVAPDAVVLTTAIAACAQSG 128

Query: 821  DLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAI 1000
             L+ G+W+H YV++     +AD F+G+ALV MYAKCG +  A  VF  MP+RN++ W  +
Sbjct: 129  ALECGEWVHRYVESNAPGLLADAFVGSALVSMYAKCGCLQEAVRVFEGMPERNEYVWGTM 188

Query: 1001 IGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLY 1180
            +G +AVHG A +A  CLERM  EDG+ PDG+AVLG L+AC HAG V+ G  LL+ M   Y
Sbjct: 189  VGAFAVHGMAREAVACLERMAGEDGVRPDGVAVLGALSACAHAGKVEDGLRLLKEMRRRY 248

Query: 1181 GLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXX 1360
            G+ P HEH+SC VD+LCR G+L +AV LI  MPM PLASVWG+LL+GCR + NV      
Sbjct: 249  GVTPGHEHFSCTVDMLCRVGRLEDAVGLIGTMPMTPLASVWGSLLAGCRMYGNV---ELA 305

Query: 1361 XXXXXXXXXXXXXXXXXAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDG 1540
                              YVQLSNIYL A + DDARR+R++IG +GLKK P YSAVEIDG
Sbjct: 306  EVAAKELEKLGMGADEGVYVQLSNIYLDANRKDDARRVRKLIGNRGLKKVPAYSAVEIDG 365

Query: 1541 LFHEFISGDVSHSRLGEIHAVLYLMSLEVSIDQNK*RRHETSKSL 1675
                F++ D +H R  EI  +L L++ ++    ++    ET+ SL
Sbjct: 366  ELSSFVADDQAHPRRFEIWDLLGLLADQMGRKPDEEEEEETTFSL 410



 Score = 81.3 bits (199), Expect = 2e-12
 Identities = 68/256 (26%), Positives = 116/256 (45%), Gaps = 9/256 (3%)
 Frame = +2

Query: 338  LFSQLPNPNSFVYNNLIRAYSRSPQPQLAVNYFNLMLNSRLRPDGYTFPFVFMACANGFL 517
            +F +LP   +  Y+ +I AY+R+      +  F  M    + PD         ACA    
Sbjct: 70   VFDELPRREAIHYDLVIGAYARAGMAAEGLALFRAMFADGVAPDAVVLTTAIAACAQSGA 129

Query: 518  WSEGRQLHTWVIKN--GIFESNAHVQSALVRFYAEHKVLDDARKVFDEITNVDAIQCNIL 691
               G  +H +V  N  G+  ++A V SALV  YA+   L +A +VF+ +   +      +
Sbjct: 130  LECGEWVHRYVESNAPGLL-ADAFVGSALVSMYAKCGCLQEAVRVFEGMPERNEYVWGTM 188

Query: 692  INGYVKCGMALEAQDVFRNMLVR-GIEPDEFCLTTALTACAQLGDLDQGKWIHEYVKNRK 868
            +  +   GMA EA      M    G+ PD   +  AL+ACA  G ++ G  + + ++ R 
Sbjct: 189  VGAFAVHGMAREAVACLERMAGEDGVRPDGVAVLGALSACAHAGKVEDGLRLLKEMRRRY 248

Query: 869  NLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFS-WAAIIGGYAVHG---CANQ 1036
             +    E   +  VDM  + G ++ A  +   MP     S W +++ G  ++G    A  
Sbjct: 249  GVTPGHEHF-SCTVDMLCRVGRLEDAVGLIGTMPMTPLASVWGSLLAGCRMYGNVELAEV 307

Query: 1037 AFQCLER--MQAEDGI 1078
            A + LE+  M A++G+
Sbjct: 308  AAKELEKLGMGADEGV 323


>ref|XP_002439151.1| hypothetical protein SORBIDRAFT_09g001360 [Sorghum bicolor]
            gi|190688727|gb|ACE86390.1| pentatricopeptide (PPR)
            repeat-containing protein [Sorghum bicolor]
            gi|241944436|gb|EES17581.1| hypothetical protein
            SORBIDRAFT_09g001360 [Sorghum bicolor]
          Length = 517

 Score =  359 bits (922), Expect = 2e-96
 Identities = 201/473 (42%), Positives = 274/473 (57%), Gaps = 21/473 (4%)
 Frame = +2

Query: 338  LFSQLPNPNSFVYNNLIRAYSRSP----QPQLAVNYFNLMLNSRLR-------PDGYTFP 484
            L  + P+      N+L+ + SR       P LA+    LML+           PD  +FP
Sbjct: 44   LLPRHPDLALLALNSLLHSLSRRAACPAHPHLALRLLRLMLSPTTTTPPALPAPDHLSFP 103

Query: 485  FVFMACA-------NGFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARK 643
            F   A A       +   +S G QLH  +++N +F ++ +V +AL++ Y+ H   D AR+
Sbjct: 104  FALSAAAALDATTPSSSSYSTGPQLHALLVRNALFPADHYVTTALLQLYSPHP--DLARR 161

Query: 644  VFDEITNVDAIQCNILINGYVKCGMALEAQDVFRNMLVRG---IEPDEFCLTTALTACAQ 814
            VFDE+   +AI  ++LI  Y + G   E   VFR M       + PD   LTTA+ ACAQ
Sbjct: 162  VFDELPRREAIHYDLLIGSYARAGAPTEGLAVFRAMFDDDDGVVVPDAVVLTTAVAACAQ 221

Query: 815  LGDLDQGKWIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWA 994
             G LD G W+H YV+      +AD FLG+ALV MYAKCG +D A  VF  MP+RN + W 
Sbjct: 222  AGALDHGAWVHRYVERTAPGLLADAFLGSALVGMYAKCGCLDDAVRVFDGMPERNAYVWG 281

Query: 995  AIIGGYAVHGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMES 1174
             ++G  AVHG A +A  CL+RM  EDG+ PDG+AVLG L+AC HAG V++G  LL  M  
Sbjct: 282  TMVGALAVHGMAAEAVACLDRMAVEDGVRPDGVAVLGALSACAHAGNVEEGLCLLREMRP 341

Query: 1175 LYGLVPEHEHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXX 1354
             YG+VP HEHYSC VD+LCR G+L +AV L+  MPM PLASVWG++L+GCRS+ NV    
Sbjct: 342  RYGVVPGHEHYSCAVDMLCRVGRLEDAVGLVETMPMAPLASVWGSVLAGCRSYGNV---E 398

Query: 1355 XXXXXXXXXXXXXXXXXXXAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEI 1534
                                YVQLSNIYL A + DDARR+R++IG +G+KK P YSAVE+
Sbjct: 399  LAEVAVRELEKLGGTADEGVYVQLSNIYLDANRKDDARRVRKLIGSRGIKKVPAYSAVEV 458

Query: 1535 DGLFHEFISGDVSHSRLGEIHAVLYLMSLEVSIDQNK*RRHETSKSLLLSVNC 1693
            DG    F++ D +H R  EI  VL L++ ++   + K +  E    L+ ++ C
Sbjct: 459  DGEVSSFVADDQAHPRRFEIWEVLRLLADQM---RQKPKEEEEFVELICNIAC 508


>ref|XP_004981863.1| PREDICTED: putative pentatricopeptide repeat-containing protein
            At3g28640-like [Setaria italica]
          Length = 495

 Score =  358 bits (920), Expect = 4e-96
 Identities = 204/462 (44%), Positives = 270/462 (58%), Gaps = 15/462 (3%)
 Frame = +2

Query: 338  LFSQLPNPNSFVYNNLIRAYSR---SP-QPQLAVNYFNLMLNSRL---RPDGYTFPFVFM 496
            L  + P+ +    N+L+ A SR   SP  P+LA+     ML+       PD  +FPF   
Sbjct: 44   LLPRHPDLSLLALNSLLHALSRRAASPAHPRLALGLLRDMLSPATPLPAPDHLSFPFALS 103

Query: 497  ACA------NGFLWSEGRQLHTWVIKNGIFESNAHVQSALVRFYAEHKVLDDARKVFDEI 658
            A A      +      G QLH  +++N +F  + +V +AL++ YA    L  AR+VFDE+
Sbjct: 104  AAAAVDAPDSSSDAGAGAQLHALLVRNALFPVDHYVTTALLQLYAPRPEL--ARRVFDEL 161

Query: 659  TNVDAIQCNILINGYVKCGMALEAQDVFRNMLVRGIEPDEFCLTTALTACAQLGDLDQGK 838
               +AI  +++I  Y + GM  E   VFR M   G+ PD   LTTA+ ACAQ G LD G 
Sbjct: 162  PRREAIHYDLVIGAYARAGMPAEGLAVFRAMFEDGVAPDAVVLTTAVAACAQAGALDCGA 221

Query: 839  WIHEYVKNRKNLRVADEFLGTALVDMYAKCGYIDTAEEVFSRMPKRNKFSWAAIIGGYAV 1018
            W+H YV+      + D F+G+ALV MYAKCG +D A  VF  MP+RN++ W  ++G +AV
Sbjct: 222  WVHRYVERAAPGLLGDAFVGSALVSMYAKCGCLDEAVRVFDGMPERNEYVWGTMVGAFAV 281

Query: 1019 HGCANQAFQCLERMQAEDGIMPDGIAVLGVLAACKHAGLVKQGQFLLENMESLYGLVPEH 1198
            HG A +A  CLERM  EDG+ PDG+AVLG L+AC HAG V  G  LL  M   YG+ P H
Sbjct: 282  HGMAAEAVACLERMAREDGVRPDGVAVLGALSACAHAGKVDDGLRLLREMRGRYGVAPGH 341

Query: 1199 EHYSCVVDLLCRAGQLGEAVELIRRMPMKPLASVWGALLSGCRSHNNVIFXXXXXXXXXX 1378
            EHYSC VD+LCR G+L +AV LI  MPM PL SVWG++L+GCRS+ NV            
Sbjct: 342  EHYSCTVDMLCRVGRLEDAVGLIGTMPMAPLESVWGSVLAGCRSYGNV---ELAEVAARE 398

Query: 1379 XXXXXXXXXXXAYVQLSNIYLGARQYDDARRIRRMIGEKGLKKTPGYSAVEIDGLFHEFI 1558
                        YVQLSNIYL A + DDARR+R++IG +G+KK P YSAVE+DG    F+
Sbjct: 399  LEKLGGTADEGVYVQLSNIYLDANRKDDARRVRKLIGSRGIKKAPAYSAVEVDGEVSSFV 458

Query: 1559 SGDVSHSRLGEIHAVLYLMSLEVSIDQ--NK*RRHETSKSLL 1678
            + D +H R  EI  VL L+     +DQ   K    ET ++LL
Sbjct: 459  ADDQAHPRCFEIWEVLRLL-----VDQMAQKPDEEETLRALL 495

