BLASTX nr result

ID: Rehmannia23_contig00015253 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00015253
         (1150 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi...   425   e-116
ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containi...   419   e-115
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   403   e-110
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   379   e-103
gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein...   378   e-102
gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus pe...   374   e-101
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   373   e-101
gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]     366   1e-98
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...   352   2e-94
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   347   4e-93
ref|XP_002884032.1| pentatricopeptide repeat-containing protein ...   346   1e-92
gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucu...   344   3e-92
gb|AGH33847.1| PPR [Cucumis melo]                                     343   1e-91
ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps...   341   4e-91
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   340   5e-91
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           340   5e-91
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   340   5e-91
ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223...   340   6e-91
ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204...   340   6e-91
ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Popu...   322   2e-85

>ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum tuberosum]
          Length = 459

 Score =  425 bits (1093), Expect = e-116
 Identities = 219/361 (60%), Positives = 278/361 (77%), Gaps = 3/361 (0%)
 Frame = +3

Query: 72   CALTKQGHRFLSSL--ATTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDP-RLS 242
            C+L+KQGHRFLS+L  A +++ SA   LLRKFVASSSKHVA                RL 
Sbjct: 31   CSLSKQGHRFLSTLIAADSEDISATRHLLRKFVASSSKHVALSTLSHLVSPTTTSHYRLC 90

Query: 243  SLAFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCVF 422
            SLA P Y  I   SWF WN+KLVADL+ALL+K ERFDEAE L++ETV KLG +ERDLC F
Sbjct: 91   SLALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETLVTETVSKLGSRERDLCSF 150

Query: 423  YCNLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEKL 602
            Y  L+ S +KH SERGVLD+CT+L+ ++L+SSSVY+KQRGY SM+ GFC IGLP KAE+L
Sbjct: 151  YSQLIHSQSKHNSERGVLDFCTKLKLVLLRSSSVYLKQRGYASMVEGFCLIGLPRKAEEL 210

Query: 603  IEEMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFGA 782
            +EEM+E GLK S FE RSLVY YG+ G+L DMKR +V++E  GF+LDT+  NMVL+SFG+
Sbjct: 211  MEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMESMGFQLDTVSSNMVLNSFGS 270

Query: 783  HNELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKIN 962
            HNEL +++S L+K+  SG+PFS+RTYNSVLNSCPTI LLL+D+KS+PLS++EL+ NL  N
Sbjct: 271  HNELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQDLKSVPLSLEELMGNLDEN 330

Query: 963  GEANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTA 1142
             EA LV  L+ S+VL++ M+W  SELKLDLHG HL++AY+I+LQWF  L+ +F + NR  
Sbjct: 331  -EAVLVNILVGSSVLEETMQWKPSELKLDLHGMHLTSAYVIILQWFHQLQCKFLAENRVL 389

Query: 1143 P 1145
            P
Sbjct: 390  P 390


>ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum lycopersicum]
          Length = 459

 Score =  419 bits (1078), Expect = e-115
 Identities = 219/368 (59%), Positives = 275/368 (74%), Gaps = 3/368 (0%)
 Frame = +3

Query: 51   RQYPPLVCALTKQGHRFLSSLATTDEP--SAATGLLRKFVASSSKHVAXXXXXXXXXXXX 224
            R  P   C+L+KQGHRFLS+L  TD    SA   LLRKFV SSSKHVA            
Sbjct: 24   RPRPGPRCSLSKQGHRFLSTLIATDSDDISATRHLLRKFVGSSSKHVALSTLSHLVSPTT 83

Query: 225  XDP-RLSSLAFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFK 401
                RL SLA P Y  I   SWF WN+KLVA+L+ALL+K ERFDEAE L++E+V KLG +
Sbjct: 84   TSHYRLCSLALPLYLEISEASWFDWNSKLVAELVALLYKLERFDEAETLVTESVSKLGSR 143

Query: 402  ERDLCVFYCNLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGL 581
            ERDLC FY  L+ S +KH SERGVLDYCT+L+ ++L SSSVY+KQRGY SM+ GFC IGL
Sbjct: 144  ERDLCSFYSQLIYSQSKHNSERGVLDYCTKLKLVLLHSSSVYLKQRGYASMVEGFCLIGL 203

Query: 582  PNKAEKLIEEMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNM 761
            P KAE+L+EEM+E GLK S FE RSLVY YG+ G+L DMKR +V++E+ GF+LDT+  NM
Sbjct: 204  PRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMERMGFQLDTVGSNM 263

Query: 762  VLSSFGAHNELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDEL 941
            VL+SFG+HNEL +++S L+K+  SG+ FS+RTYNSVLNSCPTI LLL+D+KS+PLS++EL
Sbjct: 264  VLNSFGSHNELSELVSSLQKIEASGVLFSIRTYNSVLNSCPTISLLLQDLKSVPLSLEEL 323

Query: 942  VDNLKINGEANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRF 1121
            + NL  N EA LV  L+ S+VL++ M+W   ELKLDLHG HL++AYLI+LQWF  L+ +F
Sbjct: 324  MGNLDEN-EAVLVKILVGSSVLEETMQWKPKELKLDLHGMHLTSAYLIILQWFHQLQCKF 382

Query: 1122 ESGNRTAP 1145
             + NR  P
Sbjct: 383  LAENRVLP 390


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  403 bits (1036), Expect = e-110
 Identities = 203/358 (56%), Positives = 265/358 (74%)
 Frame = +3

Query: 72   CALTKQGHRFLSSLATTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDPRLSSLA 251
            CAL+KQG  FLSS+A   +PSA+  L+ KF+ASSSK +A              P LSSLA
Sbjct: 23   CALSKQGQLFLSSVAR--DPSASNRLICKFIASSSKSIALNALSHLLSPTTTHPYLSSLA 80

Query: 252  FPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCVFYCN 431
             P YS I   SWF WN KL+AD+IALL+K+ +  EAE L+SET++KLG +ERDL  FYCN
Sbjct: 81   LPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDLVSFYCN 140

Query: 432  LVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEKLIEE 611
            L+DSH+KH S +GV D  ++L +++ +SSSVYVK+R Y+SMI+  C +GLP +AE LIEE
Sbjct: 141  LIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEAENLIEE 200

Query: 612  MREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFGAHNE 791
            MR KGLKPSVFE RS+VYGYG+ G  EDM+R ++Q+  EGFELDT+  NMVLSS+GA+N+
Sbjct: 201  MRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLSSYGAYNK 260

Query: 792  LLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKINGEA 971
              +M+SWL++M+NS IPFS+RTYNSVLNSCP I  +L+D+K+ P +IDEL++ LK   EA
Sbjct: 261  QSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELMETLK-GDEA 319

Query: 972  NLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
             LV EL+ S VL ++MEW+ SE KLDLHG HL +AYLI+LQW + L+ R  +     P
Sbjct: 320  LLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAAEYVMP 377


>ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina]
            gi|568866680|ref|XP_006486677.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g17033-like [Citrus sinensis]
            gi|557524456|gb|ESR35762.1| hypothetical protein
            CICLE_v10028424mg [Citrus clementina]
          Length = 451

 Score =  379 bits (974), Expect = e-103
 Identities = 199/364 (54%), Positives = 255/364 (70%), Gaps = 4/364 (1%)
 Frame = +3

Query: 66   LVCALTKQGHRFLSSLA--TTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDPRL 239
            L   LTKQG RFLSSLA   T +  AA+ L+ KFVASS + +A              PRL
Sbjct: 31   LTARLTKQGQRFLSSLALAVTRDSKAASRLISKFVASSPQFIALNALSHLLSPDTTHPRL 90

Query: 240  SSLAFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCV 419
            SSLAFP Y  I  ESWF WN KLVA++IA L K+ + +EAE LI ET+ KLG +ER+L +
Sbjct: 91   SSLAFPLYMRITEESWFQWNPKLVAEIIAFLDKQGQREEAETLILETLSKLGSRERELVL 150

Query: 420  FYCNLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEK 599
            FYCNL+DS  KH S+RG  D   +L QL+  SSSVYVK++  +SMI+G CE+G P++AE 
Sbjct: 151  FYCNLIDSFCKHDSKRGFDDTYARLNQLVNSSSSVYVKRQALKSMISGLCEMGQPHEAEN 210

Query: 600  LIEEMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFG 779
            LIEEMR KGL+PS FE + ++YGYG+ G LEDM+R + Q+E +G  +DT+C NMVLSS+G
Sbjct: 211  LIEEMRVKGLEPSGFEYKCIIYGYGRLGLLEDMERIVNQMESDGTRVDTVCSNMVLSSYG 270

Query: 780  AHNELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKS--LPLSIDELVDNL 953
             HNEL  M+ WL+KM++SGIPFSVRTYNSVLNSC TI  +L+D+ S   PLSI EL + L
Sbjct: 271  DHNELSRMVLWLQKMKDSGIPFSVRTYNSVLNSCSTIMSMLQDLNSNDFPLSILELTEVL 330

Query: 954  KINGEANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGN 1133
                E ++V EL  S+VLD+ M+W+S E KLDLHG HL +AY I+LQW D ++ RF +  
Sbjct: 331  N-EEEVSVVKELEDSSVLDEAMKWDSGETKLDLHGMHLGSAYFIILQWMDEMRNRFNNEK 389

Query: 1134 RTAP 1145
               P
Sbjct: 390  HVIP 393


>gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein, putative
            [Theobroma cacao]
          Length = 456

 Score =  378 bits (970), Expect = e-102
 Identities = 195/353 (55%), Positives = 251/353 (71%), Gaps = 4/353 (1%)
 Frame = +3

Query: 78   LTKQGHRFLSSLATT---DEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDPRLSSL 248
            LTKQGHRF SSLA T   ++P+ A  L++KFVASS K +A              P LS+L
Sbjct: 34   LTKQGHRFFSSLAATADVNDPATANRLIKKFVASSPKSIALNALSHLLSPRNSHPHLSAL 93

Query: 249  AFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCVFYC 428
            AFP Y+ I   SW+ WN KLVA+LIALL K+ R+DE+E LIS+ V KL F+ERDL  FYC
Sbjct: 94   AFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEALISQAVSKLKFRERDLVQFYC 153

Query: 429  NLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEKLIE 608
            N ++S +KH S+ G  D    L +LI  SSSVYVK++GY+SM++  CE+  PN+AE L+E
Sbjct: 154  NWIESCSKHNSKEGFNDAYCYLSELICNSSSVYVKRQGYKSMVSSLCEMDRPNEAENLVE 213

Query: 609  EMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFGAHN 788
            EMR+ GL P++FE R + YGYGQ G  EDM+R + ++E EGFE+DTIC NMVLSS+GA+N
Sbjct: 214  EMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIEGFEVDTICSNMVLSSYGAYN 273

Query: 789  ELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKINGE 968
                M+ WL+KM+   IPFS+RTYNSVLNSCP I  L++ + S+PLS+ EL   L    E
Sbjct: 274  AFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQGLDSVPLSLGELAKILN-EDE 332

Query: 969  ANLVLELLK-SNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFE 1124
            A LV EL+K S+VLD+ MEWN SE KLDLHG HL +AYLI+LQW + +K RF+
Sbjct: 333  ALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIMLQWIEEMKCRFK 385


>gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica]
          Length = 447

 Score =  374 bits (959), Expect = e-101
 Identities = 198/374 (52%), Positives = 257/374 (68%), Gaps = 2/374 (0%)
 Frame = +3

Query: 30   PPISAGFRQYPPLVCALTKQGHRFLSSLATTDEPSAATG-LLRKFVASSSKHVAXXXXXX 206
            PP+++      P+ CA+TKQG RFL+ LA     +  T  L+ KF+ SS+K +A      
Sbjct: 24   PPLTS------PIQCAVTKQGQRFLTKLAANARDAKVTNKLIAKFLTSSTKSIALNTLSY 77

Query: 207  XXXXXXXDPRLSSLAFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVL 386
                    P LSSLA PFYS I   SWF WN KLVA L+ALL K+ + +EAE LISET+ 
Sbjct: 78   LLSPDTTLPHLSSLALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETIS 137

Query: 387  KLGFKERDLCVFYCNLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGF 566
            KLG +ER+L +F+C LV+SH+K  S+ G     + L QL+  SSSVYVK R +ESM++G 
Sbjct: 138  KLGSRERELALFHCQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVKNRAFESMVSGL 197

Query: 567  CEIGLPNKAEKLIEEMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDT 746
            CE+  P +A+ LIEEMR +GLKPSVFE RS+VYGYG+ G  EDM + + Q+E +G  +DT
Sbjct: 198  CEMDRPREADNLIEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDT 257

Query: 747  ICCNMVLSSFGAHNELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPL 926
            IC NMVLSS+GAH+EL  ML WL+KM++  +PFS+RTYNSVLNSC TI  +L++ K  P 
Sbjct: 258  ICSNMVLSSYGAHSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAMLQEPKDFPC 317

Query: 927  SIDELVDNLKING-EANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFD 1103
            SI+EL  N  +NG EA LV EL++S VLD+VM W   E KLDLHG HL +AYLILL+WF+
Sbjct: 318  SIEEL--NGVLNGDEALLVKELVESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFE 375

Query: 1104 NLKRRFESGNRTAP 1145
             ++ RF SG    P
Sbjct: 376  AMRCRFNSGKDVIP 389


>ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Fragaria vesca subsp. vesca]
          Length = 448

 Score =  373 bits (958), Expect = e-101
 Identities = 199/390 (51%), Positives = 259/390 (66%), Gaps = 9/390 (2%)
 Frame = +3

Query: 3    GGRGLQLSAPPISAGFRQYPP--------LVCALTKQGHRFLSSLATT-DEPSAATGLLR 155
            GG G    +  ++  +R  PP        + CALTKQG RFL+ LA     PS A  L+ 
Sbjct: 2    GGLGSAQLSFSVALPWRHDPPQHSKLSLQIQCALTKQGQRFLTKLAANAGNPSVANKLIS 61

Query: 156  KFVASSSKHVAXXXXXXXXXXXXXDPRLSSLAFPFYSMIKRESWFCWNAKLVADLIALLH 335
            KF+++S K  A              P LSSLA P YS I   SWF WN KLVA L+ALL 
Sbjct: 62   KFLSTSPKSTALTTLSYLLSPHTAHPHLSSLALPMYSKITEASWFEWNPKLVAALVALLA 121

Query: 336  KEERFDEAENLISETVLKLGFKERDLCVFYCNLVDSHAKHRSERGVLDYCTQLRQLILQS 515
            K+ +  ++E LISET+ KLG KER+L  F+C LV+SH+K  S+ G    CT L QL+  S
Sbjct: 122  KQGQQSQSEALISETISKLGNKERELVQFHCQLVESHSKMSSKCGFDRACTYLHQLLQNS 181

Query: 516  SSVYVKQRGYESMIAGFCEIGLPNKAEKLIEEMREKGLKPSVFELRSLVYGYGQKGFLED 695
            SSVYVK+R +ESM+ G C +  P +A++LIEEMR KGLK SVFE RS+VYGYG+ G  E+
Sbjct: 182  SSVYVKRRAFESMVGGLCAMDRPGEADELIEEMRVKGLKASVFEFRSVVYGYGRLGMFEE 241

Query: 696  MKRSIVQIEKEGFELDTICCNMVLSSFGAHNELLDMLSWLKKMRNSGIPFSVRTYNSVLN 875
            M + + Q+EK+GF  DTICCNMVLSS+GAHNEL  M +WL+KM+ S +PFSVRTYNSVLN
Sbjct: 242  MLKIVDQMEKQGFGDDTICCNMVLSSYGAHNELAAMANWLRKMKESSVPFSVRTYNSVLN 301

Query: 876  SCPTIFLLLEDMKSLPLSIDELVDNLKINGEANLVLELLKSNVLDQVMEWNSSELKLDLH 1055
            SCPTI  +L++ K++P S+ EL   L    EA +V EL+ S V+D+ M W+S+E KLDLH
Sbjct: 302  SCPTIMAMLQEPKAVPCSVGELSGVLD-GDEALVVKELVGSAVVDEAMVWDSAEAKLDLH 360

Query: 1056 GTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
            G HL +AYL++L+WF+ +  RF+S     P
Sbjct: 361  GMHLGSAYLVMLEWFEAMGNRFKSAECVVP 390


>gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]
          Length = 517

 Score =  366 bits (939), Expect = 1e-98
 Identities = 192/359 (53%), Positives = 247/359 (68%), Gaps = 1/359 (0%)
 Frame = +3

Query: 72   CALTKQGHRFLSSLA-TTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDPRLSSL 248
            CALTKQGHRFLS+L+      SAA  L+ KFVASS K ++                L+S 
Sbjct: 102  CALTKQGHRFLSTLSINAGNASAANKLIGKFVASSPKSISLNALSHLLSPDTTHTHLTSH 161

Query: 249  AFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCVFYC 428
            +   YS I+  SWF ++ KLVA L ALL K+ R+ EAE LI+E V KLG ++R+L VFYC
Sbjct: 162  SLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIAEAVSKLGHRQRELAVFYC 221

Query: 429  NLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEKLIE 608
            +LV+SH+K  S+ G       L QL+  SSS YVK R +E+M+   C +  P +AE L+E
Sbjct: 222  SLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYVKCRAFETMVGALCTMDRPCEAESLME 281

Query: 609  EMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFGAHN 788
            EMR KGLKPSVFE RSLVYGYG+ G  EDM R++ Q+E EG  +DTIC NMVLSS+GAHN
Sbjct: 282  EMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGLVIDTICSNMVLSSYGAHN 341

Query: 789  ELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKINGE 968
            EL  M+ WL+KMR S IPFS+RTYNSVLN CPTI  +L+D+K +PLS+ EL   L+   E
Sbjct: 342  ELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLKDIPLSMYELNATLR-GDE 400

Query: 969  ANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
              LV+EL+ S+VL++V+ W+S E+KLDLHG HL +AYLI+L+W + + RRF  GN   P
Sbjct: 401  GLLVMELVGSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEWMEEMTRRFNDGNHGIP 459


>ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539507|gb|EEF41095.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 460

 Score =  352 bits (902), Expect = 2e-94
 Identities = 179/361 (49%), Positives = 249/361 (68%), Gaps = 3/361 (0%)
 Frame = +3

Query: 75   ALTKQGHRFLSSLA--TTDEPSAATG-LLRKFVASSSKHVAXXXXXXXXXXXXXDPRLSS 245
            AL+KQG RFLSSLA  TT   + AT  L++KFVA+S K +A                LSS
Sbjct: 44   ALSKQGQRFLSSLAIATTKGDTVATNRLIKKFVAASPKSIALDALSHLLNPHSSHSHLSS 103

Query: 246  LAFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCVFY 425
            LAF  Y  I    WF WN KLVAD++A L K+ R+DE+  L+S+++ KL  KERDL  FY
Sbjct: 104  LAFTLYLKIAEARWFQWNPKLVADVVAFLDKQGRYDESATLVSDSISKLQVKERDLARFY 163

Query: 426  CNLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEKLI 605
            CNLV+S +K  S RG  +    L QL+  S+SVYVK++GY+SM+ G CE+G P +AE LI
Sbjct: 164  CNLVESQSKQNSIRGFDNSVASLMQLVCNSNSVYVKRQGYKSMVNGLCEMGRPREAETLI 223

Query: 606  EEMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFGAH 785
            EEM ++G++PS+FE + +VY YG  G  E+M + + Q+E+ GF +DT+C NM+L+S+GAH
Sbjct: 224  EEMGKEGVRPSMFEFKCVVYAYGSLGSFEEMNKCLHQMERAGFRVDTVCSNMILASYGAH 283

Query: 786  NELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKING 965
            N L +M+ WL+KM++ GIPFS+RT NS LNSCPTI  ++++    P+SI +L+  L    
Sbjct: 284  NALPEMVLWLQKMKDLGIPFSLRTCNSALNSCPTIMSMMQNSNDFPISIHDLMKILS-ED 342

Query: 966  EANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
            EA LV E++ S+VLD+ M+W+ +E KLDLHGTHL +AYLI+L W + +++RF+S N   P
Sbjct: 343  EALLVKEIVTSSVLDEAMKWDVAEAKLDLHGTHLCSAYLIILLWIEEMRKRFKSVNYVNP 402

Query: 1146 T 1148
            T
Sbjct: 403  T 403


>ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum]
            gi|557110519|gb|ESQ50810.1| hypothetical protein
            EUTSA_v10022675mg [Eutrema salsugineum]
          Length = 469

 Score =  347 bits (891), Expect = 4e-93
 Identities = 177/359 (49%), Positives = 244/359 (67%), Gaps = 3/359 (0%)
 Frame = +3

Query: 78   LTKQGHRFLSSL---ATTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDPRLSSL 248
            L KQGHRFLSSL   A   +PSA    ++KFVA+S K V+              P LS  
Sbjct: 53   LMKQGHRFLSSLSSPALAGDPSATNRHIKKFVAASPKSVSLNVLSHLLSAQTSHPHLSFF 112

Query: 249  AFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCVFYC 428
            A   YS I   SWF WN KL+A+L+ALL+K+ER  E+E L+S  V +L   ERD+ +FYC
Sbjct: 113  ALSLYSEITEASWFDWNPKLIAELVALLNKQERSHESETLLSNAVSRLKSNERDIALFYC 172

Query: 429  NLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEKLIE 608
            NLV+S++K  S +G  + C +LR++  +S+SVYVK + Y+SM++G C +  P+ AE +IE
Sbjct: 173  NLVESNSKQGSIQGFNEACVRLREITRRSTSVYVKTQAYKSMVSGLCNMDQPHDAESVIE 232

Query: 609  EMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFGAHN 788
            EMR   +KP +FE +S++YGYG+ G  EDM R + ++E EG ++DT+C NMVLSS+GAHN
Sbjct: 233  EMRIAKIKPGLFEYKSVLYGYGRLGLFEDMNRVVHRMETEGHKIDTVCSNMVLSSYGAHN 292

Query: 789  ELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKINGE 968
             L  M SWL+K+++S +P S RTYNSVLNSCPTI  LL+D+ S P+S+ EL+  L  + E
Sbjct: 293  ALPQMGSWLQKLKDSNVPLSERTYNSVLNSCPTILSLLKDLDSCPVSLSELLTFLNKDEE 352

Query: 969  ANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
              LV  L +S+VLD+ +EW+S E KLDLHG HLS++YLI++QW D ++ RF  G    P
Sbjct: 353  V-LVRGLTQSSVLDEAIEWSSLEGKLDLHGMHLSSSYLIMMQWMDEMRIRFSEGKCVVP 410


>ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297329872|gb|EFH60291.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 504

 Score =  346 bits (887), Expect = 1e-92
 Identities = 181/362 (50%), Positives = 242/362 (66%), Gaps = 3/362 (0%)
 Frame = +3

Query: 69   VCALTKQGHRFLSSL---ATTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDPRL 239
            V  L KQG RFLSSL   A   +PSA    ++KFVA+S K V               P L
Sbjct: 85   VVPLMKQGDRFLSSLSSPALAGDPSATHRHIKKFVAASPKSVTLNVLSHLLSDQTSYPHL 144

Query: 240  SSLAFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCV 419
            S  A   YS I   SWF WN KL+A+L+A+L+ +ERFDE+E L+S  V +L   ERD  +
Sbjct: 145  SFFALSLYSEITEASWFDWNPKLIAELVAVLNNQERFDESETLLSTAVSRLKSNERDFAL 204

Query: 420  FYCNLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEK 599
            F CNLV+S++K  S +G  + C +LR+ I +SSSVYVK + Y+SM+AG C +  P+ AE+
Sbjct: 205  FLCNLVESNSKQGSIQGFNEACFRLRERIQRSSSVYVKTQAYKSMVAGLCNMDQPHDAER 264

Query: 600  LIEEMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFG 779
            +IEEMR + +KP  FE +S++YGYG+ G  +DM R + ++E EG ++DT+C NMVLSS+G
Sbjct: 265  VIEEMRVEKIKPGSFEHKSVLYGYGRLGLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYG 324

Query: 780  AHNELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKI 959
            AH+ L  M SWL+K++   +PFS+RTYNSVLNSCPTI  LL+D+ S P+S+ EL   L  
Sbjct: 325  AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIMSLLKDLNSCPVSLSELRTFLN- 383

Query: 960  NGEANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGNRT 1139
              EA LVLEL +S VLD+ +EWN+ E KLDLHG HLS++YLILLQW D ++ RF      
Sbjct: 384  EDEALLVLELTQSTVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDEIRLRFRDQKCV 443

Query: 1140 AP 1145
             P
Sbjct: 444  IP 445


>gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucumis melo]
          Length = 488

 Score =  344 bits (883), Expect = 3e-92
 Identities = 191/395 (48%), Positives = 253/395 (64%), Gaps = 15/395 (3%)
 Frame = +3

Query: 6    GRGLQLSAPPISA--GFRQYPPL------VCALTKQGHRFLSSLATTD---EPSAATGLL 152
            G G++L   P+    GFR YP L         LTKQ HRFLS+L+TT    + SA   L+
Sbjct: 13   GDGVRLFLQPLKRLDGFRSYPFLPNLQVKCTTLTKQTHRFLSTLSTTGATGDQSATNRLI 72

Query: 153  RKFVASSSKHVAXXXXXXXXXXXXXDPRLSSLAFPFYSMIKRESWFCWNAKLVADLIALL 332
            RKFVASS K +               P L S A   YS I   SWF WN+KLVADL+A L
Sbjct: 73   RKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFL 132

Query: 333  HKEERFDEAENLISETVLKLGFKERDLCVFYCNLVDSHAKHRSERGVLDYCTQLRQLILQ 512
             +   + E+E LISE + KLG +ER L  FY  LV+S +KH  ERG  D  ++L +L+  
Sbjct: 133  GQNGLYSESEALISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFELLYN 192

Query: 513  SSSVYVKQRGYESMIAGFCEIGLPNKAEKLIEEMREKGLKPSVFELRSLVYGYGQKGFLE 692
            S SVYVK+R YESM+ G C +  P++AE L++EMR KG+ P+ +E RS++Y YG  G  E
Sbjct: 193  SPSVYVKRRAYESMVTGLCSMKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTLGLFE 252

Query: 693  DMKRSIVQIEKEGFELDTICCNMVLSSFGAHNELLDMLSWLKKMRNSG-IPFSVRTYNSV 869
            +MKRS+ Q+E +  ELDT+C NMVLSS+GAHN+L DML WL++M+ S     SVRTYNSV
Sbjct: 253  EMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMLLWLQRMKTSSHCKSSVRTYNSV 312

Query: 870  LNSCPTIFLLLEDMKS--LPLSIDELVDNLKINGEANLVLELL-KSNVLDQVMEWNSSEL 1040
            LNSCP I  +L+D KS  LP+ I++L+  L  + EA LV ELL  S+VL+++M W++ EL
Sbjct: 313  LNSCPKITSMLQDHKSGDLPVLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWDAMEL 372

Query: 1041 KLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
            KLDLHG H+  AY+I+LQW   ++  FE  +   P
Sbjct: 373  KLDLHGAHVGAAYVIMLQWIKEMRLNFEDESNVIP 407


>gb|AGH33847.1| PPR [Cucumis melo]
          Length = 488

 Score =  343 bits (879), Expect = 1e-91
 Identities = 191/395 (48%), Positives = 253/395 (64%), Gaps = 15/395 (3%)
 Frame = +3

Query: 6    GRGLQLSAPPISA--GFRQYPPL------VCALTKQGHRFLSSLATT---DEPSAATGLL 152
            G G++L   P+    GFR YP L         LTKQ HRFLS+L+TT    + SA   L+
Sbjct: 13   GDGVRLLLQPLKRLDGFRSYPFLPNLQVKCTTLTKQTHRFLSTLSTTAATGDQSATNRLI 72

Query: 153  RKFVASSSKHVAXXXXXXXXXXXXXDPRLSSLAFPFYSMIKRESWFCWNAKLVADLIALL 332
            RKFVASS K +               P L S A   YS I   SWF WN+KLVADL+A L
Sbjct: 73   RKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFL 132

Query: 333  HKEERFDEAENLISETVLKLGFKERDLCVFYCNLVDSHAKHRSERGVLDYCTQLRQLILQ 512
             +   + E+E LISE + KLG +ER L  FY  LV+S +KH  ERG  D  ++L +L+  
Sbjct: 133  GQNGLYSESEALISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFELLYN 192

Query: 513  SSSVYVKQRGYESMIAGFCEIGLPNKAEKLIEEMREKGLKPSVFELRSLVYGYGQKGFLE 692
            S SVYVK+R YESM+ G C +  P++AE L++EMR KG+ P+ +E RS++Y YG  G  E
Sbjct: 193  SPSVYVKRRAYESMVTGLCSMKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTLGLFE 252

Query: 693  DMKRSIVQIEKEGFELDTICCNMVLSSFGAHNELLDMLSWLKKMRNS-GIPFSVRTYNSV 869
            +MKRS+ Q+E +  ELDT+C NMVLSS+GAHN+L DML WL++M+ S     SVRTYNSV
Sbjct: 253  EMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMLLWLQRMKTSPHCKSSVRTYNSV 312

Query: 870  LNSCPTIFLLLEDMKS--LPLSIDELVDNLKINGEANLVLELL-KSNVLDQVMEWNSSEL 1040
            LNSCP I  +L+D KS  LP+ I++L+  L  + EA LV ELL  S+VL+++M W++ EL
Sbjct: 313  LNSCPKITSMLQDHKSGDLPVLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWDAMEL 372

Query: 1041 KLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
            KLDLHG H+  AY+I+LQW   ++  FE  +   P
Sbjct: 373  KLDLHGAHVGAAYVIMLQWIKEMRLNFEDESYVIP 407


>ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella]
            gi|482566151|gb|EOA30340.1| hypothetical protein
            CARUB_v10013465mg [Capsella rubella]
          Length = 516

 Score =  341 bits (874), Expect = 4e-91
 Identities = 175/356 (49%), Positives = 240/356 (67%), Gaps = 3/356 (0%)
 Frame = +3

Query: 78   LTKQGHRFLSSL---ATTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDPRLSSL 248
            L KQGH+FLSSL   A   +P A   L++KFVA+S K VA              P LS  
Sbjct: 99   LMKQGHQFLSSLSSPALAGDPPATNRLIKKFVAASPKSVALNVLSHLLSDNTSHPHLSYF 158

Query: 249  AFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCVFYC 428
            A   Y  I   SWF WN KL+ +L++LL+K+ERF E+E L+S  V +L   ERD  +F C
Sbjct: 159  APQLYLEITEASWFDWNPKLIGELVSLLNKQERFVESETLLSTAVSRLESNERDFALFLC 218

Query: 429  NLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEKLIE 608
            NLV+S++K  S +G  D C++LR++I +SSSVYVK + Y+SM++G C +  P  AE++IE
Sbjct: 219  NLVESNSKQGSIQGFSDACSRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPLDAERVIE 278

Query: 609  EMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFGAHN 788
            EMR + +KP +FE +S++YGYG+ G  +DM R + ++E +G ++DT+C NMVLSS+GAH+
Sbjct: 279  EMRMETIKPGLFEYKSVLYGYGRLGLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYGAHD 338

Query: 789  ELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKINGE 968
             L  M SWL+K++   +P S+RTYNSVLNSCPTI  LL+D+ S PLS+ EL+  L    E
Sbjct: 339  ALPQMGSWLQKLKGYNVPLSIRTYNSVLNSCPTIISLLKDLDSCPLSLSELLPILN-EDE 397

Query: 969  ANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGNR 1136
            A LV EL +S VLD+ +EWN+ E KLDLHG HLS +YLI+LQW D  + RF    +
Sbjct: 398  ALLVRELTQSLVLDEAIEWNAVEGKLDLHGMHLSASYLIMLQWMDETRLRFSEDKK 453


>ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein
            [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 505

 Score =  340 bits (873), Expect = 5e-91
 Identities = 178/359 (49%), Positives = 241/359 (67%), Gaps = 3/359 (0%)
 Frame = +3

Query: 78   LTKQGHRFLSSL---ATTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDPRLSSL 248
            L K G RFLSSL   A   +PSA    ++KFVA+S K VA              P LS  
Sbjct: 89   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 148

Query: 249  AFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCVFYC 428
            A   YS I   SWF WN KL+A+LIALL+K+ERFDE+E L+S  V +L   ERD  +F C
Sbjct: 149  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 208

Query: 429  NLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEKLIE 608
            NLV+S++K  S +G  +   +LR++I +SSSVYVK + Y+SM++G C +  P+ AE++IE
Sbjct: 209  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 268

Query: 609  EMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFGAHN 788
            EMR + +KP +FE +S++YGYG+ G  +DM R + ++  EG ++DT+C NMVLSS+GAH+
Sbjct: 269  EMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHD 328

Query: 789  ELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKINGE 968
             L  M SWL+K++   +PFS+RTYNSVLNSCPTI  +L+D+ S P+S+ EL   L    E
Sbjct: 329  ALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLN-EDE 387

Query: 969  ANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
            A LV EL +S+VLD+ +EWN+ E KLDLHG HLS++YLILLQW D  + RF       P
Sbjct: 388  ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 446


>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score =  340 bits (873), Expect = 5e-91
 Identities = 178/359 (49%), Positives = 241/359 (67%), Gaps = 3/359 (0%)
 Frame = +3

Query: 78   LTKQGHRFLSSL---ATTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDPRLSSL 248
            L K G RFLSSL   A   +PSA    ++KFVA+S K VA              P LS  
Sbjct: 85   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 144

Query: 249  AFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCVFYC 428
            A   YS I   SWF WN KL+A+LIALL+K+ERFDE+E L+S  V +L   ERD  +F C
Sbjct: 145  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 204

Query: 429  NLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEKLIE 608
            NLV+S++K  S +G  +   +LR++I +SSSVYVK + Y+SM++G C +  P+ AE++IE
Sbjct: 205  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 264

Query: 609  EMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFGAHN 788
            EMR + +KP +FE +S++YGYG+ G  +DM R + ++  EG ++DT+C NMVLSS+GAH+
Sbjct: 265  EMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHD 324

Query: 789  ELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKINGE 968
             L  M SWL+K++   +PFS+RTYNSVLNSCPTI  +L+D+ S P+S+ EL   L    E
Sbjct: 325  ALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLN-EDE 383

Query: 969  ANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
            A LV EL +S+VLD+ +EWN+ E KLDLHG HLS++YLILLQW D  + RF       P
Sbjct: 384  ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 442


>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein
            [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown
            protein [Arabidopsis thaliana]
            gi|330251481|gb|AEC06575.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  340 bits (873), Expect = 5e-91
 Identities = 178/359 (49%), Positives = 241/359 (67%), Gaps = 3/359 (0%)
 Frame = +3

Query: 78   LTKQGHRFLSSL---ATTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXDPRLSSL 248
            L K G RFLSSL   A   +PSA    ++KFVA+S K VA              P LS  
Sbjct: 88   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 147

Query: 249  AFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLCVFYC 428
            A   YS I   SWF WN KL+A+LIALL+K+ERFDE+E L+S  V +L   ERD  +F C
Sbjct: 148  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 207

Query: 429  NLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAEKLIE 608
            NLV+S++K  S +G  +   +LR++I +SSSVYVK + Y+SM++G C +  P+ AE++IE
Sbjct: 208  NLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIE 267

Query: 609  EMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSFGAHN 788
            EMR + +KP +FE +S++YGYG+ G  +DM R + ++  EG ++DT+C NMVLSS+GAH+
Sbjct: 268  EMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHD 327

Query: 789  ELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDMKSLPLSIDELVDNLKINGE 968
             L  M SWL+K++   +PFS+RTYNSVLNSCPTI  +L+D+ S P+S+ EL   L    E
Sbjct: 328  ALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLN-EDE 386

Query: 969  ANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
            A LV EL +S+VLD+ +EWN+ E KLDLHG HLS++YLILLQW D  + RF       P
Sbjct: 387  ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIP 445


>ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223617 [Cucumis sativus]
          Length = 1296

 Score =  340 bits (872), Expect = 6e-91
 Identities = 189/395 (47%), Positives = 254/395 (64%), Gaps = 15/395 (3%)
 Frame = +3

Query: 6    GRGLQLSAPPISA--GFRQYP-----PLVC-ALTKQGHRFLSSLATT---DEPSAATGLL 152
            G G++L   P      FR YP      + C +LTKQ HRFLS+L+TT    + SA   L+
Sbjct: 13   GDGVRLFLHPFKRLHAFRSYPFVPNLQVKCTSLTKQTHRFLSTLSTTAATGDQSATNRLI 72

Query: 153  RKFVASSSKHVAXXXXXXXXXXXXXDPRLSSLAFPFYSMIKRESWFCWNAKLVADLIALL 332
            RKFVASS K +               P L S A   YS I   SWF WN+KLVADL+A L
Sbjct: 73   RKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFL 132

Query: 333  HKEERFDEAENLISETVLKLGFKERDLCVFYCNLVDSHAKHRSERGVLDYCTQLRQLILQ 512
             +   + E+E LISE + KLG +ER L  FY  LV+S +KH  ERG +D  ++L +L+  
Sbjct: 133  DQNGLYSESEVLISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLELLYN 192

Query: 513  SSSVYVKQRGYESMIAGFCEIGLPNKAEKLIEEMREKGLKPSVFELRSLVYGYGQKGFLE 692
            S SVYVK+R YESM+ G C +  P++AE L++EMR KG+ P+ +E RS++Y YG  G  E
Sbjct: 193  SPSVYVKRRAYESMVTGLCSMKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTLGLFE 252

Query: 693  DMKRSIVQIEKEGFELDTICCNMVLSSFGAHNELLDMLSWLKKMRNS-GIPFSVRTYNSV 869
            +MKRS+ Q+E +  ELDT+C NMVLSS+GAHN+L DM+ WL++M+ S     SVRTYNSV
Sbjct: 253  EMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRTYNSV 312

Query: 870  LNSCPTIFLLLEDMKS--LPLSIDELVDNLKINGEANLVLELLK-SNVLDQVMEWNSSEL 1040
            LNSCP I  +L+D KS  LP+ I++L+  L  + EA LV ELL  S+VL+++M W++ EL
Sbjct: 313  LNSCPKITAMLQDHKSTNLPVLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWDAMEL 372

Query: 1041 KLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
            KLDLHG H+  AY+I+LQW   ++  FE  +   P
Sbjct: 373  KLDLHGAHVGAAYVIMLQWIKEMRLNFEDESYVIP 407


>ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204365 [Cucumis sativus]
          Length = 1913

 Score =  340 bits (872), Expect = 6e-91
 Identities = 189/395 (47%), Positives = 254/395 (64%), Gaps = 15/395 (3%)
 Frame = +3

Query: 6    GRGLQLSAPPISA--GFRQYP-----PLVC-ALTKQGHRFLSSLATT---DEPSAATGLL 152
            G G++L   P      FR YP      + C +LTKQ HRFLS+L+TT    + SA   L+
Sbjct: 13   GDGVRLFLHPFKRLHAFRSYPFVPNLQVKCTSLTKQTHRFLSTLSTTAATGDQSATNRLI 72

Query: 153  RKFVASSSKHVAXXXXXXXXXXXXXDPRLSSLAFPFYSMIKRESWFCWNAKLVADLIALL 332
            RKFVASS K +               P L S A   YS I   SWF WN+KLVADL+A L
Sbjct: 73   RKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFL 132

Query: 333  HKEERFDEAENLISETVLKLGFKERDLCVFYCNLVDSHAKHRSERGVLDYCTQLRQLILQ 512
             +   + E+E LISE + KLG +ER L  FY  LV+S +KH  ERG +D  ++L +L+  
Sbjct: 133  DQNGLYSESEVLISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLELLYN 192

Query: 513  SSSVYVKQRGYESMIAGFCEIGLPNKAEKLIEEMREKGLKPSVFELRSLVYGYGQKGFLE 692
            S SVYVK+R YESM+ G C +  P++AE L++EMR KG+ P+ +E RS++Y YG  G  E
Sbjct: 193  SPSVYVKRRAYESMVTGLCSMKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTLGLFE 252

Query: 693  DMKRSIVQIEKEGFELDTICCNMVLSSFGAHNELLDMLSWLKKMRNS-GIPFSVRTYNSV 869
            +MKRS+ Q+E +  ELDT+C NMVLSS+GAHN+L DM+ WL++M+ S     SVRTYNSV
Sbjct: 253  EMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRTYNSV 312

Query: 870  LNSCPTIFLLLEDMKS--LPLSIDELVDNLKINGEANLVLELLK-SNVLDQVMEWNSSEL 1040
            LNSCP I  +L+D KS  LP+ I++L+  L  + EA LV ELL  S+VL+++M W++ EL
Sbjct: 313  LNSCPKITAMLQDHKSTNLPVLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWDAMEL 372

Query: 1041 KLDLHGTHLSTAYLILLQWFDNLKRRFESGNRTAP 1145
            KLDLHG H+  AY+I+LQW   ++  FE  +   P
Sbjct: 373  KLDLHGAHVGAAYVIMLQWIKEMRLNFEDESYVIP 407


>ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa]
            gi|550331693|gb|EEE86893.2| hypothetical protein
            POPTR_0009s14120g [Populus trichocarpa]
          Length = 473

 Score =  322 bits (824), Expect = 2e-85
 Identities = 170/364 (46%), Positives = 236/364 (64%), Gaps = 5/364 (1%)
 Frame = +3

Query: 69   VCALTKQGHRFLSSL---ATTDEPSAATGLLRKFVASSSKHVAXXXXXXXXXXXXXD-PR 236
            + A++KQ  RF S++     T + SA   L++KFVASS K +A               P 
Sbjct: 53   LAAISKQAQRFFSAVLPTVATSDTSATNRLIKKFVASSPKSIALDALSNLLSPDSTHHPL 112

Query: 237  LSSLAFPFYSMIKRESWFCWNAKLVADLIALLHKEERFDEAENLISETVLKLGFKERDLC 416
            L  L  P Y  I   SWF WN KLVA ++ LL K+    E + L+SETV +L FKER+L 
Sbjct: 113  LYLLTLPLYLKISEASWFSWNPKLVAQVVVLLDKQGLDKELKALMSETVSRLQFKERELV 172

Query: 417  VFYCNLVDSHAKHRSERGVLDYCTQLRQLILQSSSVYVKQRGYESMIAGFCEIGLPNKAE 596
            +FYCNL+  ++KH   RG  D  ++L Q +  S+SVYVK++GY++MI+G CE+G   +AE
Sbjct: 173  LFYCNLIGFNSKHNWVRGFDDSYSRLNQFVSDSNSVYVKKQGYKAMISGLCEMGRAREAE 232

Query: 597  KLIEEMREKGLKPSVFELRSLVYGYGQKGFLEDMKRSIVQIEKEGFELDTICCNMVLSSF 776
             LI EMRE+GLKP +FE R ++YGYG+ G  +DM+R + ++E    E+DT+C NMVL+S+
Sbjct: 233  DLIGEMRERGLKPKLFEFRCVLYGYGRLGLFKDMERILDKMESGEIEVDTVCANMVLASY 292

Query: 777  GAHNELLDMLSWLKKMRNSGIPFSVRTYNSVLNSCPTIFLLLEDM-KSLPLSIDELVDNL 953
            GAHN L +M  WL+KM+  GIP S+RT NSVLNSCPTI  L+ ++  S P+SI EL+  L
Sbjct: 293  GAHNALPEMGLWLRKMKTLGIPLSIRTCNSVLNSCPTIMALMRNLDASYPVSIQELLKIL 352

Query: 954  KINGEANLVLELLKSNVLDQVMEWNSSELKLDLHGTHLSTAYLILLQWFDNLKRRFESGN 1133
                EA LV EL++S+VL +  +W++SE KLDLHG HL +AY+I+LQW +  + R   G 
Sbjct: 353  S-EEEAMLVKELIESSVLKEATKWDTSEGKLDLHGMHLGSAYVIMLQWMEETRNRLSDGE 411

Query: 1134 RTAP 1145
               P
Sbjct: 412  HVIP 415

