BLASTX nr result

ID: Sinomenium21_contig00010581 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium21_contig00010581
         (1633 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   433   e-118
ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prun...   430   e-118
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   426   e-116
ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing pr...   424   e-116
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   416   e-113
gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]     413   e-112
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...   406   e-110
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   405   e-110
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           405   e-110
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   405   e-110
ref|XP_002884032.1| pentatricopeptide repeat-containing protein ...   405   e-110
ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Popu...   397   e-107
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   396   e-107
ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps...   391   e-106
gb|EYU37991.1| hypothetical protein MIMGU_mgv1a006093mg [Mimulus...   384   e-104
gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucu...   384   e-104
ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223...   384   e-104
ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204...   383   e-103
ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi...   383   e-103
gb|AGH33847.1| PPR [Cucumis melo]                                     383   e-103

>ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina]
            gi|568866680|ref|XP_006486677.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g17033-like [Citrus sinensis]
            gi|557524456|gb|ESR35762.1| hypothetical protein
            CICLE_v10028424mg [Citrus clementina]
          Length = 451

 Score =  433 bits (1114), Expect = e-118
 Identities = 225/396 (56%), Positives = 279/396 (70%), Gaps = 3/396 (0%)
 Frame = +2

Query: 143  RLIRKFVASSPKXXXXXXXXXXXXXXXH--RYSSLALPMYERISETRWFNWNPKLVSEVI 316
            RLI KFVASSP+                  R SSLA P+Y RI+E  WF WNPKLV+E+I
Sbjct: 59   RLISKFVASSPQFIALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEII 118

Query: 317  ASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQL 496
            A L+KQG+ +EAE LI E++ KLG RE E+ LFYC+LIDS  K  S RG  D Y RL QL
Sbjct: 119  AFLDKQGQREEAETLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQL 178

Query: 497  LSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAYG 676
            ++         VKR+A +SMI+GLC +  PHEA  ++EEMRV GL PS FE++ I+  YG
Sbjct: 179  VNSSSSVY---VKRQALKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYG 235

Query: 677  KLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSIR 856
            +L L EDM + +++ME+    +DT+CSN+VLSSYG H ELS M  W+QK+K    PFS+R
Sbjct: 236  RLGLLEDMERIVNQMESDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVR 295

Query: 857  TYNSVLNSCPSIMSMLQNL-SNPLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWCS 1033
            TYNSVLNSC +IMSMLQ+L SN  PLSI ELT  L E+E  +V+EL  SSVL E+++W S
Sbjct: 296  TYNSVLNSCSTIMSMLQDLNSNDFPLSILELTEVLNEEEVSVVKELEDSSVLDEAMKWDS 355

Query: 1034 SEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPLK 1213
             E KLDLHGMHLGSAY IILQW++E+R+RF  E   IP EI VVCG GKHS +RGES +K
Sbjct: 356  GETKLDLHGMHLGSAYFIILQWMDEMRNRFNNEKHVIPAEITVVCGSGKHSTVRGESSVK 415

Query: 1214 SLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            ++V +MMVR  SPMR+ R NI  FIAKG  VK+WLC
Sbjct: 416  AMVKKMMVRTSSPMRVHRNNIGCFIAKGHVVKDWLC 451


>ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica]
            gi|462396130|gb|EMJ01929.1| hypothetical protein
            PRUPE_ppa021547mg [Prunus persica]
          Length = 447

 Score =  430 bits (1106), Expect = e-118
 Identities = 226/421 (53%), Positives = 285/421 (67%), Gaps = 2/421 (0%)
 Frame = +2

Query: 65   KCALSKQSCRLXXXXXXXXXXXXXXHRLIRKFVASSPKXXXXXXXXXXXXXXXH--RYSS 238
            +CA++KQ  R               ++LI KF+ SS K                    SS
Sbjct: 31   QCAVTKQGQRFLTKLAANARDAKVTNKLIAKFLTSSTKSIALNTLSYLLSPDTTLPHLSS 90

Query: 239  LALPMYERISETRWFNWNPKLVSEVIASLEKQGRFDEAENLISESVEKLGFREGEVALFY 418
            LALP Y +I+E  WF WNPKLV+ ++A L+KQG+ +EAE LISE++ KLG RE E+ALF+
Sbjct: 91   LALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETISKLGSRERELALFH 150

Query: 419  CDLIDSLSKQASSRGVFDAYGRLKQLLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAG 598
            C L++S SK +S  G   +Y  L QLL          VK RAFESM++GLC +D P EA 
Sbjct: 151  CQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVY---VKNRAFESMVSGLCEMDRPREAD 207

Query: 599  KMMEEMRVAGLRPSAFEFRSIVNAYGKLALFEDMRKTLDEMETSDYALDTICSNIVLSSY 778
             ++EEMRV GL+PS FEFRS+V  YG+L LFEDM K +++ME    A+DTICSN+VLSSY
Sbjct: 208  NLIEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDTICSNMVLSSY 267

Query: 779  GAHGELSAMASWMQKIKSLDTPFSIRTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKL 958
            GAH EL+AM  W++K+KSL  PFSIRTYNSVLNSC +IM+MLQ      P SIEEL   L
Sbjct: 268  GAHSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAMLQE-PKDFPCSIEELNGVL 326

Query: 959  PEDEALLVRELTGSSVLVESLQWCSSEAKLDLHGMHLGSAYVIILQWVEELRSRFAREST 1138
              DEALLV+EL  S+VL E + W   EAKLDLHGMHLGSAY+I+L+W E +R RF     
Sbjct: 327  NGDEALLVKELVESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFEAMRCRFNSGKD 386

Query: 1139 DIPTEIKVVCGKGKHSNIRGESPLKSLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWL 1318
             IP E+ V+CG GKHS++RGESP+K LV +MM+R +SPMRIDR N+  F+AKG  VK+WL
Sbjct: 387  VIPAEVVVICGSGKHSSVRGESPVKGLVKQMMLRMESPMRIDRKNVGCFVAKGRAVKDWL 446

Query: 1319 C 1321
            C
Sbjct: 447  C 447


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  426 bits (1094), Expect = e-116
 Identities = 233/440 (52%), Positives = 299/440 (67%), Gaps = 2/440 (0%)
 Frame = +2

Query: 8    LQLTLHFKWDGRHSPVIVPKCALSKQSCRLXXXXXXXXXXXXXXHRLIRKFVASSPKXXX 187
            LQ++    W+ R SP+++ +CALSKQ                  +RLI KF+ASS K   
Sbjct: 5    LQVSRPQPWNHR-SPLLI-QCALSKQG---QLFLSSVARDPSASNRLICKFIASSSKSIA 59

Query: 188  XXXXXXXXXXXX-HRY-SSLALPMYERISETRWFNWNPKLVSEVIASLEKQGRFDEAENL 361
                         H Y SSLALP+Y RISE  WF+WNPKL+++VIA L KQG+  EAE L
Sbjct: 60   LNALSHLLSPTTTHPYLSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETL 119

Query: 362  ISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQLLSGFXXXXXXTVKRR 541
            +SE++ KLG RE ++  FYC+LIDS SK +S++GVFD   RL +++S         VK R
Sbjct: 120  VSETLIKLGSRERDLVSFYCNLIDSHSKHSSNQGVFDVISRLSRIVS---ESSSVYVKER 176

Query: 542  AFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAYGKLALFEDMRKTLDEM 721
            A++SMI+ LC + LP EA  ++EEMRV GL+PS FEFRS+V  YG++ L EDM++ L +M
Sbjct: 177  AYKSMISSLCAVGLPLEAENLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQM 236

Query: 722  ETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSIRTYNSVLNSCPSIMSM 901
                + LDT+ SN+VLSSYGA+ + S M SW+Q++K+   PFSIRTYNSVLNSCP IMS+
Sbjct: 237  GNEGFELDTVVSNMVLSSYGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSI 296

Query: 902  LQNLSNPLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWCSSEAKLDLHGMHLGSAY 1081
            LQ+L    P +I+EL   L  DEALLV+EL GS VL E ++W  SE KLDLHGMHLGSAY
Sbjct: 297  LQDLKT-FPPTIDELMETLKGDEALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAY 355

Query: 1082 VIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPLKSLVSEMMVRWKSPMRI 1261
            +I+LQW EELR R       +P EI VVCG GKHS++RGESP+K +V EMM R +SPM+I
Sbjct: 356  LIMLQWREELRYRLNAAEYVMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKI 415

Query: 1262 DRTNIATFIAKGSTVKNWLC 1321
            DR NI  F+AK   VKNWLC
Sbjct: 416  DRKNIGCFVAKAKVVKNWLC 435


>ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing protein, putative
            [Theobroma cacao] gi|508705664|gb|EOX97560.1|
            Pentatricopeptide (PPR) repeat-containing protein,
            putative [Theobroma cacao]
          Length = 456

 Score =  424 bits (1089), Expect = e-116
 Identities = 220/396 (55%), Positives = 282/396 (71%), Gaps = 3/396 (0%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSSLALPMYERISETRWFNWNPKLVSEV 313
            +RLI+KFVASSPK                    S+LA P+Y +ISET W+NWNPKLV+E+
Sbjct: 58   NRLIKKFVASSPKSIALNALSHLLSPRNSHPHLSALAFPLYTKISETSWYNWNPKLVAEL 117

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            IA L KQGR+DE+E LIS++V KL FRE ++  FYC+ I+S SK  S  G  DAY  L +
Sbjct: 118  IALLVKQGRYDESEALISQAVSKLKFRERDLVQFYCNWIESCSKHNSKEGFNDAYCYLSE 177

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            L+          VKR+ ++SM++ LC +D P+EA  ++EEMR  GL P+ FEFR I   Y
Sbjct: 178  LICNSSSVY---VKRQGYKSMVSSLCEMDRPNEAENLVEEMRKNGLTPTLFEFRFISYGY 234

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSI 853
            G+L LFEDM + + EME   + +DTICSN+VLSSYGA+   S M  W+QK+K+L  PFSI
Sbjct: 235  GQLGLFEDMERMVCEMEIEGFEVDTICSNMVLSSYGAYNAFSKMVPWLQKMKTLQIPFSI 294

Query: 854  RTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKLPEDEALLVRELT-GSSVLVESLQWC 1030
            RTYNSVLNSCP IMS++Q L + +PLS+ EL   L EDEALLV+EL   SSVL E+++W 
Sbjct: 295  RTYNSVLNSCPEIMSLVQGLDS-VPLSLGELAKILNEDEALLVQELVKSSSVLDEAMEWN 353

Query: 1031 SSEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPL 1210
             SE KLDLHGMHLGSAY+I+LQW+EE++ RF  E   IP +I +VCG GKHS++RGESP+
Sbjct: 354  GSEGKLDLHGMHLGSAYLIMLQWIEEMKCRFKVEECVIPAQITIVCGSGKHSSVRGESPV 413

Query: 1211 KSLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWL 1318
            K+L+ +MMV+ KSPM+IDR NI  FIAKG  VKNWL
Sbjct: 414  KTLMRKMMVKMKSPMKIDRKNIGCFIAKGQVVKNWL 449


>ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Fragaria vesca subsp. vesca]
          Length = 448

 Score =  416 bits (1070), Expect = e-113
 Identities = 215/421 (51%), Positives = 286/421 (67%), Gaps = 2/421 (0%)
 Frame = +2

Query: 65   KCALSKQSCRLXXXXXXXXXXXXXXHRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSS 238
            +CAL+KQ  R               ++LI KF+++SPK                    SS
Sbjct: 32   QCALTKQGQRFLTKLAANAGNPSVANKLISKFLSTSPKSTALTTLSYLLSPHTAHPHLSS 91

Query: 239  LALPMYERISETRWFNWNPKLVSEVIASLEKQGRFDEAENLISESVEKLGFREGEVALFY 418
            LALPMY +I+E  WF WNPKLV+ ++A L KQG+  ++E LISE++ KLG +E E+  F+
Sbjct: 92   LALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQSEALISETISKLGNKERELVQFH 151

Query: 419  CDLIDSLSKQASSRGVFDAYGRLKQLLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAG 598
            C L++S SK +S  G   A   L QLL          VKRRAFESM+ GLC +D P EA 
Sbjct: 152  CQLVESHSKMSSKCGFDRACTYLHQLLQNSSSVY---VKRRAFESMVGGLCAMDRPGEAD 208

Query: 599  KMMEEMRVAGLRPSAFEFRSIVNAYGKLALFEDMRKTLDEMETSDYALDTICSNIVLSSY 778
            +++EEMRV GL+ S FEFRS+V  YG+L +FE+M K +D+ME   +  DTIC N+VLSSY
Sbjct: 209  ELIEEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVDQMEKQGFGDDTICCNMVLSSY 268

Query: 779  GAHGELSAMASWMQKIKSLDTPFSIRTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKL 958
            GAH EL+AMA+W++K+K    PFS+RTYNSVLNSCP+IM+MLQ     +P S+ EL+  L
Sbjct: 269  GAHNELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIMAMLQE-PKAVPCSVGELSGVL 327

Query: 959  PEDEALLVRELTGSSVLVESLQWCSSEAKLDLHGMHLGSAYVIILQWVEELRSRFAREST 1138
              DEAL+V+EL GS+V+ E++ W S+EAKLDLHGMHLGSAY+++L+W E + +RF     
Sbjct: 328  DGDEALVVKELVGSAVVDEAMVWDSAEAKLDLHGMHLGSAYLVMLEWFEAMGNRFKSAEC 387

Query: 1139 DIPTEIKVVCGKGKHSNIRGESPLKSLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWL 1318
             +P E+ +VCG GKHS++RGESP+K LV EMM + +SPMRIDR N+  FIAKG  VK+WL
Sbjct: 388  VVPAEVVIVCGLGKHSSVRGESPVKDLVKEMMHQMESPMRIDRKNVGCFIAKGRAVKDWL 447

Query: 1319 C 1321
            C
Sbjct: 448  C 448


>gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]
          Length = 517

 Score =  413 bits (1061), Expect = e-112
 Identities = 222/421 (52%), Positives = 284/421 (67%), Gaps = 2/421 (0%)
 Frame = +2

Query: 65   KCALSKQSCRLXXXXXXXXXXXXXXHRLIRKFVASSPKXXXXXXXXXXXXXXX-HRY-SS 238
            +CAL+KQ  R               ++LI KFVASSPK                H + +S
Sbjct: 101  QCALTKQGHRFLSTLSINAGNASAANKLIGKFVASSPKSISLNALSHLLSPDTTHTHLTS 160

Query: 239  LALPMYERISETRWFNWNPKLVSEVIASLEKQGRFDEAENLISESVEKLGFREGEVALFY 418
             +L +Y +I E  WF ++PKLV+ + A L+KQGR+ EAE LI+E+V KLG R+ E+A+FY
Sbjct: 161  HSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIAEAVSKLGHRQRELAVFY 220

Query: 419  CDLIDSLSKQASSRGVFDAYGRLKQLLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAG 598
            C L++S SKQ+S  G   +Y  L QLL          VK RAFE+M+  LC +D P EA 
Sbjct: 221  CSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAY---VKCRAFETMVGALCTMDRPCEAE 277

Query: 599  KMMEEMRVAGLRPSAFEFRSIVNAYGKLALFEDMRKTLDEMETSDYALDTICSNIVLSSY 778
             +MEEMR  GL+PS FEFRS+V  YG+L L+EDM +T+++ME     +DTICSN+VLSSY
Sbjct: 278  SLMEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGLVIDTICSNMVLSSY 337

Query: 779  GAHGELSAMASWMQKIKSLDTPFSIRTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKL 958
            GAH EL  M  W+QK+++   PFSIRTYNSVLN CP+I +MLQ+L + +PLS+ EL   L
Sbjct: 338  GAHNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLKD-IPLSMYELNATL 396

Query: 959  PEDEALLVRELTGSSVLVESLQWCSSEAKLDLHGMHLGSAYVIILQWVEELRSRFAREST 1138
              DE LLV EL GSSVL E L W S E KLDLHGMHLGSAY+I+L+W+EE+  RF   + 
Sbjct: 397  RGDEGLLVMELVGSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEWMEEMTRRFNDGNH 456

Query: 1139 DIPTEIKVVCGKGKHSNIRGESPLKSLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWL 1318
             IP E+ VVCG GKHSN+RG SP+K LV EMMV+ KSPM+IDR N   F+AKG TV++WL
Sbjct: 457  GIPAEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKNAGCFLAKGKTVRDWL 516

Query: 1319 C 1321
            C
Sbjct: 517  C 517


>ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539507|gb|EEF41095.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 460

 Score =  406 bits (1044), Expect = e-110
 Identities = 208/396 (52%), Positives = 274/396 (69%), Gaps = 2/396 (0%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSSLALPMYERISETRWFNWNPKLVSEV 313
            +RLI+KFVA+SPK                    SSLA  +Y +I+E RWF WNPKLV++V
Sbjct: 69   NRLIKKFVAASPKSIALDALSHLLNPHSSHSHLSSLAFTLYLKIAEARWFQWNPKLVADV 128

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            +A L+KQGR+DE+  L+S+S+ KL  +E ++A FYC+L++S SKQ S RG  ++   L Q
Sbjct: 129  VAFLDKQGRYDESATLVSDSISKLQVKERDLARFYCNLVESQSKQNSIRGFDNSVASLMQ 188

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            L+          VKR+ ++SM+NGLC +  P EA  ++EEM   G+RPS FEF+ +V AY
Sbjct: 189  LVCNSNSVY---VKRQGYKSMVNGLCEMGRPREAETLIEEMGKEGVRPSMFEFKCVVYAY 245

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSI 853
            G L  FE+M K L +ME + + +DT+CSN++L+SYGAH  L  M  W+QK+K L  PFS+
Sbjct: 246  GSLGSFEEMNKCLHQMERAGFRVDTVCSNMILASYGAHNALPEMVLWLQKMKDLGIPFSL 305

Query: 854  RTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWCS 1033
            RT NS LNSCP+IMSM+QN SN  P+SI +L   L EDEALLV+E+  SSVL E+++W  
Sbjct: 306  RTCNSALNSCPTIMSMMQN-SNDFPISIHDLMKILSEDEALLVKEIVTSSVLDEAMKWDV 364

Query: 1034 SEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPLK 1213
            +EAKLDLHG HL SAY+IIL W+EE+R RF   +   PTEI VVCG G HS +RGESP+K
Sbjct: 365  AEAKLDLHGTHLCSAYLIILLWIEEMRKRFKSVNYVNPTEITVVCGSGNHSIVRGESPVK 424

Query: 1214 SLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
             +V + MVR +SPMRIDR NI  FIAKG  V+ WLC
Sbjct: 425  CMVKDFMVRARSPMRIDRRNIGCFIAKGKVVEEWLC 460


>ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein
            [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 505

 Score =  405 bits (1042), Expect = e-110
 Identities = 204/396 (51%), Positives = 279/396 (70%), Gaps = 2/396 (0%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSSLALPMYERISETRWFNWNPKLVSEV 313
            +R I+KFVA+SPK                    S  AL +Y  I+E  WF+WNPKL++E+
Sbjct: 113  NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 172

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            IA L KQ RFDE+E L+S +V +L   E +  LF C+L++S SKQ S +G  +A  RL++
Sbjct: 173  IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 232

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            ++          VK +A++SM++GLC +D PH+A +++EEMR+  ++P  FE++S++  Y
Sbjct: 233  IIQ---RSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGY 289

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSI 853
            G+L LF+DM + +  M T  + +DT+CSN+VLSSYGAH  L  M SW+QK+K  + PFSI
Sbjct: 290  GRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSI 349

Query: 854  RTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWCS 1033
            RTYNSVLNSCP+I+SML++L +  P+S+ EL   L EDEALLV ELT SSVL E+++W +
Sbjct: 350  RTYNSVLNSCPTIISMLKDLDS-CPVSLSELRTFLNEDEALLVHELTQSSVLDEAIEWNA 408

Query: 1034 SEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPLK 1213
             E KLDLHGMHL S+Y+I+LQW++E R RF+ E   IP EI VV G GKHSN+RGESP+K
Sbjct: 409  VEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPAEIVVVSGSGKHSNVRGESPVK 468

Query: 1214 SLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            +LV ++MVR  SPMRIDR N+ +FIAKG TVK WLC
Sbjct: 469  ALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504


>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score =  405 bits (1042), Expect = e-110
 Identities = 204/396 (51%), Positives = 279/396 (70%), Gaps = 2/396 (0%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSSLALPMYERISETRWFNWNPKLVSEV 313
            +R I+KFVA+SPK                    S  AL +Y  I+E  WF+WNPKL++E+
Sbjct: 109  NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 168

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            IA L KQ RFDE+E L+S +V +L   E +  LF C+L++S SKQ S +G  +A  RL++
Sbjct: 169  IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 228

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            ++          VK +A++SM++GLC +D PH+A +++EEMR+  ++P  FE++S++  Y
Sbjct: 229  IIQ---RSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGY 285

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSI 853
            G+L LF+DM + +  M T  + +DT+CSN+VLSSYGAH  L  M SW+QK+K  + PFSI
Sbjct: 286  GRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSI 345

Query: 854  RTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWCS 1033
            RTYNSVLNSCP+I+SML++L +  P+S+ EL   L EDEALLV ELT SSVL E+++W +
Sbjct: 346  RTYNSVLNSCPTIISMLKDLDS-CPVSLSELRTFLNEDEALLVHELTQSSVLDEAIEWNA 404

Query: 1034 SEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPLK 1213
             E KLDLHGMHL S+Y+I+LQW++E R RF+ E   IP EI VV G GKHSN+RGESP+K
Sbjct: 405  VEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPAEIVVVSGSGKHSNVRGESPVK 464

Query: 1214 SLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            +LV ++MVR  SPMRIDR N+ +FIAKG TVK WLC
Sbjct: 465  ALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 500


>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein
            [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown
            protein [Arabidopsis thaliana]
            gi|330251481|gb|AEC06575.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  405 bits (1042), Expect = e-110
 Identities = 204/396 (51%), Positives = 279/396 (70%), Gaps = 2/396 (0%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSSLALPMYERISETRWFNWNPKLVSEV 313
            +R I+KFVA+SPK                    S  AL +Y  I+E  WF+WNPKL++E+
Sbjct: 112  NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 171

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            IA L KQ RFDE+E L+S +V +L   E +  LF C+L++S SKQ S +G  +A  RL++
Sbjct: 172  IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 231

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            ++          VK +A++SM++GLC +D PH+A +++EEMR+  ++P  FE++S++  Y
Sbjct: 232  IIQ---RSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGY 288

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSI 853
            G+L LF+DM + +  M T  + +DT+CSN+VLSSYGAH  L  M SW+QK+K  + PFSI
Sbjct: 289  GRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSI 348

Query: 854  RTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWCS 1033
            RTYNSVLNSCP+I+SML++L +  P+S+ EL   L EDEALLV ELT SSVL E+++W +
Sbjct: 349  RTYNSVLNSCPTIISMLKDLDS-CPVSLSELRTFLNEDEALLVHELTQSSVLDEAIEWNA 407

Query: 1034 SEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPLK 1213
             E KLDLHGMHL S+Y+I+LQW++E R RF+ E   IP EI VV G GKHSN+RGESP+K
Sbjct: 408  VEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPAEIVVVSGSGKHSNVRGESPVK 467

Query: 1214 SLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            +LV ++MVR  SPMRIDR N+ +FIAKG TVK WLC
Sbjct: 468  ALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503


>ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297329872|gb|EFH60291.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 504

 Score =  405 bits (1041), Expect = e-110
 Identities = 204/396 (51%), Positives = 278/396 (70%), Gaps = 2/396 (0%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXH--RYSSLALPMYERISETRWFNWNPKLVSEV 313
            HR I+KFVA+SPK                    S  AL +Y  I+E  WF+WNPKL++E+
Sbjct: 112  HRHIKKFVAASPKSVTLNVLSHLLSDQTSYPHLSFFALSLYSEITEASWFDWNPKLIAEL 171

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            +A L  Q RFDE+E L+S +V +L   E + ALF C+L++S SKQ S +G  +A  RL++
Sbjct: 172  VAVLNNQERFDESETLLSTAVSRLKSNERDFALFLCNLVESNSKQGSIQGFNEACFRLRE 231

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
             +          VK +A++SM+ GLC +D PH+A +++EEMRV  ++P +FE +S++  Y
Sbjct: 232  RIQ---RSSSVYVKTQAYKSMVAGLCNMDQPHDAERVIEEMRVEKIKPGSFEHKSVLYGY 288

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSI 853
            G+L LF+DM + +  MET  + +DT+CSN+VLSSYGAH  L  M SW+QK+K  + PFSI
Sbjct: 289  GRLGLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSI 348

Query: 854  RTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWCS 1033
            RTYNSVLNSCP+IMS+L++L N  P+S+ EL   L EDEALLV ELT S+VL E+++W +
Sbjct: 349  RTYNSVLNSCPTIMSLLKDL-NSCPVSLSELRTFLNEDEALLVLELTQSTVLDEAIEWNA 407

Query: 1034 SEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPLK 1213
             E KLDLHGMHL S+Y+I+LQW++E+R RF  +   IP EI VV G GKHSN+RGESP+K
Sbjct: 408  VEGKLDLHGMHLSSSYLILLQWMDEIRLRFRDQKCVIPAEIVVVSGSGKHSNVRGESPVK 467

Query: 1214 SLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            +LV ++MVR +SPMRIDR N+ +FIAKG  VK WLC
Sbjct: 468  ALVKKIMVRTESPMRIDRKNVGSFIAKGKNVKEWLC 503


>ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa]
            gi|550331693|gb|EEE86893.2| hypothetical protein
            POPTR_0009s14120g [Populus trichocarpa]
          Length = 473

 Score =  397 bits (1019), Expect = e-107
 Identities = 201/397 (50%), Positives = 274/397 (69%), Gaps = 3/397 (0%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHRYSSL---ALPMYERISETRWFNWNPKLVSE 310
            +RLI+KFVASSPK                 +  L    LP+Y +ISE  WF+WNPKLV++
Sbjct: 80   NRLIKKFVASSPKSIALDALSNLLSPDSTHHPLLYLLTLPLYLKISEASWFSWNPKLVAQ 139

Query: 311  VIASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLK 490
            V+  L+KQG   E + L+SE+V +L F+E E+ LFYC+LI   SK    RG  D+Y RL 
Sbjct: 140  VVVLLDKQGLDKELKALMSETVSRLQFKERELVLFYCNLIGFNSKHNWVRGFDDSYSRLN 199

Query: 491  QLLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNA 670
            Q +S         VK++ +++MI+GLC +    EA  ++ EMR  GL+P  FEFR ++  
Sbjct: 200  QFVSDSNSVY---VKKQGYKAMISGLCEMGRAREAEDLIGEMRERGLKPKLFEFRCVLYG 256

Query: 671  YGKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFS 850
            YG+L LF+DM + LD+ME+ +  +DT+C+N+VL+SYGAH  L  M  W++K+K+L  P S
Sbjct: 257  YGRLGLFKDMERILDKMESGEIEVDTVCANMVLASYGAHNALPEMGLWLRKMKTLGIPLS 316

Query: 851  IRTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWC 1030
            IRT NSVLNSCP+IM++++NL    P+SI+EL   L E+EA+LV+EL  SSVL E+ +W 
Sbjct: 317  IRTCNSVLNSCPTIMALMRNLDASYPVSIQELLKILSEEEAMLVKELIESSVLKEATKWD 376

Query: 1031 SSEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPL 1210
            +SE KLDLHGMHLGSAYVI+LQW+EE R+R +     IP EI VVCG G HS +RGESP+
Sbjct: 377  TSEGKLDLHGMHLGSAYVIMLQWMEETRNRLSDGEHVIPAEITVVCGSGNHSTVRGESPV 436

Query: 1211 KSLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            KS+++E+M + +SPMRIDR NI  F+AKG+ VK WLC
Sbjct: 437  KSMITEIMAQTRSPMRIDRKNIGCFVAKGNVVKKWLC 473


>ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum]
            gi|557110519|gb|ESQ50810.1| hypothetical protein
            EUTSA_v10022675mg [Eutrema salsugineum]
          Length = 469

 Score =  396 bits (1018), Expect = e-107
 Identities = 198/396 (50%), Positives = 277/396 (69%), Gaps = 2/396 (0%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSSLALPMYERISETRWFNWNPKLVSEV 313
            +R I+KFVA+SPK                    S  AL +Y  I+E  WF+WNPKL++E+
Sbjct: 77   NRHIKKFVAASPKSVSLNVLSHLLSAQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 136

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            +A L KQ R  E+E L+S +V +L   E ++ALFYC+L++S SKQ S +G  +A  RL++
Sbjct: 137  VALLNKQERSHESETLLSNAVSRLKSNERDIALFYCNLVESNSKQGSIQGFNEACVRLRE 196

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            +           VK +A++SM++GLC +D PH+A  ++EEMR+A ++P  FE++S++  Y
Sbjct: 197  ITR---RSTSVYVKTQAYKSMVSGLCNMDQPHDAESVIEEMRIAKIKPGLFEYKSVLYGY 253

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSI 853
            G+L LFEDM + +  MET  + +DT+CSN+VLSSYGAH  L  M SW+QK+K  + P S 
Sbjct: 254  GRLGLFEDMNRVVHRMETEGHKIDTVCSNMVLSSYGAHNALPQMGSWLQKLKDSNVPLSE 313

Query: 854  RTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWCS 1033
            RTYNSVLNSCP+I+S+L++L +  P+S+ EL   L +DE +LVR LT SSVL E+++W S
Sbjct: 314  RTYNSVLNSCPTILSLLKDLDS-CPVSLSELLTFLNKDEEVLVRGLTQSSVLDEAIEWSS 372

Query: 1034 SEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPLK 1213
             E KLDLHGMHL S+Y+I++QW++E+R RF+     +P EI +V G GKHSN+RGESP+K
Sbjct: 373  LEGKLDLHGMHLSSSYLIMMQWMDEMRIRFSEGKCVVPAEIVLVSGSGKHSNVRGESPVK 432

Query: 1214 SLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            +LV ++MVR  SPMRIDR NI +FIAKG TVK WLC
Sbjct: 433  ALVKKIMVRTGSPMRIDRKNIGSFIAKGKTVKEWLC 468


>ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella]
            gi|482566151|gb|EOA30340.1| hypothetical protein
            CARUB_v10013465mg [Capsella rubella]
          Length = 516

 Score =  391 bits (1005), Expect = e-106
 Identities = 199/397 (50%), Positives = 279/397 (70%), Gaps = 3/397 (0%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXX-HRYSSLALP-MYERISETRWFNWNPKLVSEV 313
            +RLI+KFVA+SPK                H + S   P +Y  I+E  WF+WNPKL+ E+
Sbjct: 123  NRLIKKFVAASPKSVALNVLSHLLSDNTSHPHLSYFAPQLYLEITEASWFDWNPKLIGEL 182

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            ++ L KQ RF E+E L+S +V +L   E + ALF C+L++S SKQ S +G  DA  RL++
Sbjct: 183  VSLLNKQERFVESETLLSTAVSRLESNERDFALFLCNLVESNSKQGSIQGFSDACSRLRE 242

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            ++          VK +A++SM++GLC +D P +A +++EEMR+  ++P  FE++S++  Y
Sbjct: 243  IIQ---RSSSVYVKTQAYKSMVSGLCNMDQPLDAERVIEEMRMETIKPGLFEYKSVLYGY 299

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSI 853
            G+L LF+DM + +  MET  + +DT+CSN+VLSSYGAH  L  M SW+QK+K  + P SI
Sbjct: 300  GRLGLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGYNVPLSI 359

Query: 854  RTYNSVLNSCPSIMSMLQNLSNPLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWCS 1033
            RTYNSVLNSCP+I+S+L++L +  PLS+ EL   L EDEALLVRELT S VL E+++W +
Sbjct: 360  RTYNSVLNSCPTIISLLKDLDS-CPLSLSELLPILNEDEALLVRELTQSLVLDEAIEWNA 418

Query: 1034 SEAKLDLHGMHLGSAYVIILQWVEELRSRFARE-STDIPTEIKVVCGKGKHSNIRGESPL 1210
             E KLDLHGMHL ++Y+I+LQW++E R RF+ +    +P EI VV G GKHSN+RGESP+
Sbjct: 419  VEGKLDLHGMHLSASYLIMLQWMDETRLRFSEDKKCVVPAEIVVVSGSGKHSNVRGESPV 478

Query: 1211 KSLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            K++V ++MVR KSPMRIDR N+ +FIAKG  VK WLC
Sbjct: 479  KAMVKKIMVRTKSPMRIDRKNVGSFIAKGKNVKEWLC 515


>gb|EYU37991.1| hypothetical protein MIMGU_mgv1a006093mg [Mimulus guttatus]
          Length = 458

 Score =  384 bits (986), Expect = e-104
 Identities = 206/429 (48%), Positives = 276/429 (64%), Gaps = 2/429 (0%)
 Frame = +2

Query: 41   RHSPVIVPKCALSKQSCRLXXXXXXXXXXXXXXHRLIRKFVASSPKXXXXXXXXXXXXXX 220
            R  P +V  C L+KQ  RL                L+RKFVASS K              
Sbjct: 21   RQLPPLV--CVLTKQGQRLLSSIATSEQPSAAIS-LLRKFVASSSKHVALSTLSHLLSPS 77

Query: 221  XH--RYSSLALPMYERISETRWFNWNPKLVSEVIASLEKQGRFDEAENLISESVEKLGFR 394
                R SSLA P+Y  I +  WF WN KLV+++I+ L K  RFDEA+NL  E+V KLGF+
Sbjct: 78   TSHPRLSSLAFPLYGIIEQESWFTWNSKLVADLISLLYKAERFDEADNLFGETVSKLGFK 137

Query: 395  EGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQLLSGFXXXXXXTVKRRAFESMINGLCR 574
            E ++  FYC+L+DS +K  S RGV D+  RLKQL+          VK++ +ESMI G C 
Sbjct: 138  ERDLCTFYCNLVDSHAKHMSERGVSDSCTRLKQLILASSSVY---VKQKGYESMIAGFCE 194

Query: 575  LDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAYGKLALFEDMRKTLDEMETSDYALDTIC 754
            +  P +A  +MEEMR  GL+PSAFE R++V  YG++ L EDM++++ +ME   + LDT+C
Sbjct: 195  IGSPDKAENLMEEMRQNGLKPSAFELRTLVYGYGQMGLLEDMKRSVGQMEKEGFELDTVC 254

Query: 755  SNIVLSSYGAHGELSAMASWMQKIKSLDTPFSIRTYNSVLNSCPSIMSMLQNLSNPLPLS 934
             N+VLSS+GA  E   M  W++K+++   PFSIRTYNSVLNSCP+++ +L+++ + LPLS
Sbjct: 255  YNMVLSSFGARNEFLDMLLWLKKMRNSGIPFSIRTYNSVLNSCPTVILLLEDMKS-LPLS 313

Query: 935  IEELTHKLPEDEALLVRELTGSSVLVESLQWCSSEAKLDLHGMHLGSAYVIILQWVEELR 1114
            + EL   L   EA LV EL  S VL + ++W S+E KLD+HGMHL +AY+I+LQW +EL+
Sbjct: 314  VNELVDNLKTGEADLVLELMKSDVLDQVMEWKSTELKLDMHGMHLSTAYLILLQWFKELK 373

Query: 1115 SRFARESTDIPTEIKVVCGKGKHSNIRGESPLKSLVSEMMVRWKSPMRIDRTNIATFIAK 1294
             RF   + + PTEI VVCG GKHS+ RGESP+K L  EM+ R K P+RIDR NI  FI K
Sbjct: 374  VRFGDGNHETPTEILVVCGSGKHSSKRGESPVKVLAKEMVTRMKCPLRIDRKNIGCFIGK 433

Query: 1295 GSTVKNWLC 1321
            G T K+WLC
Sbjct: 434  GKTFKDWLC 442


>gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucumis melo]
          Length = 488

 Score =  384 bits (986), Expect = e-104
 Identities = 205/400 (51%), Positives = 273/400 (68%), Gaps = 6/400 (1%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSSLALPMYERISETRWFNWNPKLVSEV 313
            +RLIRKFVASSPK                +    S AL +Y RI+E  WF WN KLV+++
Sbjct: 69   NRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADL 128

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            +A L + G + E+E LISE++ KLG +E ++  FY  L++S SK    RG  D+Y RL +
Sbjct: 129  VAFLGQNGLYSESEALISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFE 188

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            LL          VKRRA+ESM+ GLC +  PHEA  +++EMR  G+ P+A+E+RSI+ AY
Sbjct: 189  LLYNSPSVY---VKRRAYESMVTGLCSMKRPHEAESLVKEMRSKGITPTAYEYRSIIYAY 245

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIK-SLDTPFS 850
            G L LFE+M+++L +ME  +  LDT+CSN+VLSSYGAH +L  M  W+Q++K S     S
Sbjct: 246  GTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMLLWLQRMKTSSHCKSS 305

Query: 851  IRTYNSVLNSCPSIMSMLQN-LSNPLPLSIEELTHKLPEDE-ALLVREL-TGSSVLVESL 1021
            +RTYNSVLNSCP I SMLQ+  S  LP+ IE+L   L  DE ALLV+EL  GSSVL E +
Sbjct: 306  VRTYNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILDGDEEALLVKELLVGSSVLNEIM 365

Query: 1022 QWCSSEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGE 1201
             W + E KLDLHG H+G+AYVI+LQW++E+R  F  ES  IP ++ ++CG GKHS +RGE
Sbjct: 366  VWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDESNVIPAQVTLICGSGKHSIVRGE 425

Query: 1202 SPLKSLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            SP+K+L+ E+MVR +SP+RIDR N   FI+KG  VKNWLC
Sbjct: 426  SPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNWLC 465


>ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223617 [Cucumis sativus]
          Length = 1296

 Score =  384 bits (985), Expect = e-104
 Identities = 205/410 (50%), Positives = 277/410 (67%), Gaps = 6/410 (1%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSSLALPMYERISETRWFNWNPKLVSEV 313
            +RLIRKFVASSPK                +    S AL +Y RI+E  WF WN KLV+++
Sbjct: 69   NRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADL 128

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            +A L++ G + E+E LISE++ KLG +E ++  FY  L++S SK    RG  D+Y RL +
Sbjct: 129  VAFLDQNGLYSESEVLISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLE 188

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            LL          VKRRA+ESM+ GLC +  PHEA  +++EMR  G+ P+A+E+RSI+ AY
Sbjct: 189  LLYNSPSVY---VKRRAYESMVTGLCSMKRPHEAENLVKEMRSKGITPTAYEYRSIIYAY 245

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIK-SLDTPFS 850
            G L LFE+M+++L +ME  +  LDT+CSN+VLSSYGAH +L  M  W+Q++K S     S
Sbjct: 246  GTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSS 305

Query: 851  IRTYNSVLNSCPSIMSMLQN-LSNPLPLSIEELTHKLPEDE-ALLVREL-TGSSVLVESL 1021
            +RTYNSVLNSCP I +MLQ+  S  LP+ IE+L   L  DE ALLV EL  GSSVL E +
Sbjct: 306  VRTYNSVLNSCPKITAMLQDHKSTNLPVLIEDLIAVLDGDEEALLVEELLAGSSVLNEIM 365

Query: 1022 QWCSSEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGE 1201
             W + E KLDLHG H+G+AYVI+LQW++E+R  F  ES  IP ++ ++CG GKHS +RGE
Sbjct: 366  VWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDESYVIPAQVTLICGSGKHSIVRGE 425

Query: 1202 SPLKSLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC*IRNDLLISD 1351
            SP+K+L+ E+MVR +SP+RIDR N   FI+KG  VKNWLC +    ++ D
Sbjct: 426  SPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNWLCSLPGKRILYD 475


>ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204365 [Cucumis sativus]
          Length = 1913

 Score =  383 bits (984), Expect = e-103
 Identities = 204/400 (51%), Positives = 273/400 (68%), Gaps = 6/400 (1%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSSLALPMYERISETRWFNWNPKLVSEV 313
            +RLIRKFVASSPK                +    S AL +Y RI+E  WF WN KLV+++
Sbjct: 69   NRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADL 128

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            +A L++ G + E+E LISE++ KLG +E ++  FY  L++S SK    RG  D+Y RL +
Sbjct: 129  VAFLDQNGLYSESEVLISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLE 188

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            LL          VKRRA+ESM+ GLC +  PHEA  +++EMR  G+ P+A+E+RSI+ AY
Sbjct: 189  LLYNSPSVY---VKRRAYESMVTGLCSMKRPHEAENLVKEMRSKGITPTAYEYRSIIYAY 245

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIK-SLDTPFS 850
            G L LFE+M+++L +ME  +  LDT+CSN+VLSSYGAH +L  M  W+Q++K S     S
Sbjct: 246  GTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSS 305

Query: 851  IRTYNSVLNSCPSIMSMLQN-LSNPLPLSIEELTHKLPEDE-ALLVREL-TGSSVLVESL 1021
            +RTYNSVLNSCP I +MLQ+  S  LP+ IE+L   L  DE ALLV EL  GSSVL E +
Sbjct: 306  VRTYNSVLNSCPKITAMLQDHKSTNLPVLIEDLIAVLDGDEEALLVEELLAGSSVLNEIM 365

Query: 1022 QWCSSEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGE 1201
             W + E KLDLHG H+G+AYVI+LQW++E+R  F  ES  IP ++ ++CG GKHS +RGE
Sbjct: 366  VWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDESYVIPAQVTLICGSGKHSIVRGE 425

Query: 1202 SPLKSLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            SP+K+L+ E+MVR +SP+RIDR N   FI+KG  VKNWLC
Sbjct: 426  SPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNWLC 465


>ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum tuberosum]
          Length = 459

 Score =  383 bits (983), Expect = e-103
 Identities = 213/433 (49%), Positives = 280/433 (64%), Gaps = 4/433 (0%)
 Frame = +2

Query: 32   WDGRHSPVIVPKCALSKQSCR-LXXXXXXXXXXXXXXHRLIRKFVASSPKXXXXXXXXXX 208
            W+ R  P   P+C+LSKQ  R L                L+RKFVASS K          
Sbjct: 21   WNRRPRPC--PRCSLSKQGHRFLSTLIAADSEDISATRHLLRKFVASSSKHVALSTLSHL 78

Query: 209  XXXXX---HRYSSLALPMYERISETRWFNWNPKLVSEVIASLEKQGRFDEAENLISESVE 379
                    +R  SLALP+Y  ISE  WF+WN KLV++++A L K  RFDEAE L++E+V 
Sbjct: 79   VSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETLVTETVS 138

Query: 380  KLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQLLSGFXXXXXXTVKRRAFESMI 559
            KLG RE ++  FY  LI S SK  S RGV D   +LK +L          +K+R + SM+
Sbjct: 139  KLGSRERDLCSFYSQLIHSQSKHNSERGVLDFCTKLKLVL---LRSSSVYLKQRGYASMV 195

Query: 560  NGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAYGKLALFEDMRKTLDEMETSDYA 739
             G C + LP +A ++MEEM+  GL+ S FEFRS+V +YGK     DM++ + EME+  + 
Sbjct: 196  EGFCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMESMGFQ 255

Query: 740  LDTICSNIVLSSYGAHGELSAMASWMQKIKSLDTPFSIRTYNSVLNSCPSIMSMLQNLSN 919
            LDT+ SN+VL+S+G+H ELS + S +QKI++   PFSIRTYNSVLNSCP+I  +LQ+L +
Sbjct: 256  LDTVSSNMVLNSFGSHNELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQDLKS 315

Query: 920  PLPLSIEELTHKLPEDEALLVRELTGSSVLVESLQWCSSEAKLDLHGMHLGSAYVIILQW 1099
             +PLS+EEL   L E+EA+LV  L GSSVL E++QW  SE KLDLHGMHL SAYVIILQW
Sbjct: 316  -VPLSLEELMGNLDENEAVLVNILVGSSVLEETMQWKPSELKLDLHGMHLTSAYVIILQW 374

Query: 1100 VEELRSRFARESTDIPTEIKVVCGKGKHSNIRGESPLKSLVSEMMVRWKSPMRIDRTNIA 1279
              +L+ +F  E+  +P EI VVCG GKHS +RGESP+K L+ E+++R   P+RIDR NI 
Sbjct: 375  FHQLQCKFLAENRVLPGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRKNIG 434

Query: 1280 TFIAKGSTVKNWL 1318
             FIAKG +   WL
Sbjct: 435  CFIAKGKSFMEWL 447


>gb|AGH33847.1| PPR [Cucumis melo]
          Length = 488

 Score =  383 bits (983), Expect = e-103
 Identities = 205/400 (51%), Positives = 273/400 (68%), Gaps = 6/400 (1%)
 Frame = +2

Query: 140  HRLIRKFVASSPKXXXXXXXXXXXXXXXHR--YSSLALPMYERISETRWFNWNPKLVSEV 313
            +RLIRKFVASSPK                +    S AL +Y RI+E  WF WN KLV+++
Sbjct: 69   NRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADL 128

Query: 314  IASLEKQGRFDEAENLISESVEKLGFREGEVALFYCDLIDSLSKQASSRGVFDAYGRLKQ 493
            +A L + G + E+E LISE++ KLG +E ++  FY  L++S SK    RG  D+Y RL +
Sbjct: 129  VAFLGQNGLYSESEALISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFE 188

Query: 494  LLSGFXXXXXXTVKRRAFESMINGLCRLDLPHEAGKMMEEMRVAGLRPSAFEFRSIVNAY 673
            LL          VKRRA+ESM+ GLC +  PHEA  +++EMR  G+ P+A+E+RSI+ AY
Sbjct: 189  LLYNSPSVY---VKRRAYESMVTGLCSMKRPHEAESLVKEMRSKGITPTAYEYRSIIYAY 245

Query: 674  GKLALFEDMRKTLDEMETSDYALDTICSNIVLSSYGAHGELSAMASWMQKIK-SLDTPFS 850
            G L LFE+M+++L +ME  +  LDT+CSN+VLSSYGAH +L  M  W+Q++K S     S
Sbjct: 246  GTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMLLWLQRMKTSPHCKSS 305

Query: 851  IRTYNSVLNSCPSIMSMLQN-LSNPLPLSIEELTHKLPEDE-ALLVREL-TGSSVLVESL 1021
            +RTYNSVLNSCP I SMLQ+  S  LP+ IE+L   L  DE ALLV+EL  GSSVL E +
Sbjct: 306  VRTYNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILDGDEEALLVKELLVGSSVLNEIM 365

Query: 1022 QWCSSEAKLDLHGMHLGSAYVIILQWVEELRSRFARESTDIPTEIKVVCGKGKHSNIRGE 1201
             W + E KLDLHG H+G+AYVI+LQW++E+R  F  ES  IP ++ ++CG GKHS +RGE
Sbjct: 366  VWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDESYVIPAQVTLICGSGKHSIVRGE 425

Query: 1202 SPLKSLVSEMMVRWKSPMRIDRTNIATFIAKGSTVKNWLC 1321
            SP+K+L+ E+MVR +SP+RIDR N   FI+KG  VKNWLC
Sbjct: 426  SPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNWLC 465


Top