BLASTX nr result

ID: Papaver25_contig00014215 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver25_contig00014215
         (1521 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prun...   455   e-125
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   433   e-118
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   426   e-116
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   424   e-116
gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]     419   e-114
ref|XP_002884032.1| pentatricopeptide repeat-containing protein ...   408   e-111
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   406   e-110
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           406   e-110
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   406   e-110
ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing pr...   405   e-110
gb|EYU37991.1| hypothetical protein MIMGU_mgv1a006093mg [Mimulus...   402   e-109
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   401   e-109
ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps...   397   e-108
ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Popu...   387   e-105
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...   386   e-104
ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223...   378   e-102
ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204...   378   e-102
ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi...   377   e-101
gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucu...   374   e-101
gb|AGH33847.1| PPR [Cucumis melo]                                     373   e-100

>ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica]
            gi|462396130|gb|EMJ01929.1| hypothetical protein
            PRUPE_ppa021547mg [Prunus persica]
          Length = 447

 Score =  455 bits (1171), Expect = e-125
 Identities = 242/445 (54%), Positives = 316/445 (71%), Gaps = 3/445 (0%)
 Frame = +1

Query: 55   LQFSVSLKWGNXXXXXXNQSSAAIICSLSKKGQRXXXXXXXXXXXXXXXHRLIRKFVASS 234
            L FS++L W          S   I C+++K+GQR               ++LI KF+ SS
Sbjct: 9    LSFSIALPWNPLRPLPPLTSP--IQCAVTKQGQRFLTKLAANARDAKVTNKLIAKFLTSS 66

Query: 235  SKSIALNTLSHLLSSD--IRHHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFD 408
            +KSIALNTLS+LLS D  + H SSLALP Y  ITEASWFEWNPKLV+ ++A L+ QGQ +
Sbjct: 67   TKSIALNTLSYLLSPDTTLPHLSSLALPFYSKITEASWFEWNPKLVAALVALLDKQGQHN 126

Query: 409  VSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXX-N 585
             +E LIS+++ KL S+ERELALF+C L+ S+SK  SK G   SY                
Sbjct: 127  EAEVLISETISKLGSRERELALFHCQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVK 186

Query: 586  RQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLD 765
             +A+ESM++GLC +D P +A+ ++E+MR+ G KPS FEFRSVV  YGRLGL  DM +V++
Sbjct: 187  NRAFESMVSGLCEMDRPREADNLIEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVE 246

Query: 766  QMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIM 945
            QME  G  +DTI SN+VLSSYG + EL+ M+VW++KMKS ++PFS+RTYNSVLNSC TIM
Sbjct: 247  QMENQGIAIDTICSNMVLSSYGAHSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIM 306

Query: 946  AMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGT 1125
            AML+EP+  P SIE+L   +   E +LV+EL++ S+VL E + W+ LE KLDLHGMHLG+
Sbjct: 307  AMLQEPKDFPCSIEELNGVLNGDEALLVKELVE-STVLDEVMVWEPLEAKLDLHGMHLGS 365

Query: 1126 AYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKS 1305
            AY+ILLEW + ++ RF  G D  VIPAE+ V+CG GKHSSV G+SP+K LV +MM+R++S
Sbjct: 366  AYLILLEWFEAMRCRFNSGKD--VIPAEVVVICGSGKHSSVRGESPVKGLVKQMMLRMES 423

Query: 1306 PMRIGRKNDVGSFVGKGKAVKDWLC 1380
            PMRI RKN VG FV KG+AVKDWLC
Sbjct: 424  PMRIDRKN-VGCFVAKGRAVKDWLC 447


>ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Fragaria vesca subsp. vesca]
          Length = 448

 Score =  433 bits (1113), Expect = e-118
 Identities = 233/446 (52%), Positives = 310/446 (69%), Gaps = 4/446 (0%)
 Frame = +1

Query: 55   LQFSVSLKWGNXXXXXXNQSSAAIICSLSKKGQRXXXXXXXXXXXXXXXHRLIRKFVASS 234
            L FSV+L W +      ++ S  I C+L+K+GQR               ++LI KF+++S
Sbjct: 9    LSFSVALPWRHDPPQH-SKLSLQIQCALTKQGQRFLTKLAANAGNPSVANKLISKFLSTS 67

Query: 235  SKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFD 408
             KS AL TLS+LLS    H   SSLALPMY  ITEASWFEWNPKLV+ ++A L  QGQ  
Sbjct: 68   PKSTALTTLSYLLSPHTAHPHLSSLALPMYSKITEASWFEWNPKLVAALVALLAKQGQQS 127

Query: 409  VSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXX-- 582
             SE LIS+++ KL ++EREL  F+C L+ S+SK  SK G FD                  
Sbjct: 128  QSEALISETISKLGNKERELVQFHCQLVESHSKMSSKCG-FDRACTYLHQLLQNSSSVYV 186

Query: 583  NRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVL 762
             R+A+ESM+ GLC +D P +A+E++E+MR++G K S FEFRSVV  YGRLG+  +M +++
Sbjct: 187  KRRAFESMVGGLCAMDRPGEADELIEEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIV 246

Query: 763  DQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTI 942
            DQMEK G+  DTI  N+VLSSYG + EL+ M  W++KMK S+VPFSVRTYNSVLNSCPTI
Sbjct: 247  DQMEKQGFGDDTICCNMVLSSYGAHNELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTI 306

Query: 943  MAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLG 1122
            MAML+EP+ +P S+ +L   +   E ++V+EL+ GS+V+ E + WDS E KLDLHGMHLG
Sbjct: 307  MAMLQEPKAVPCSVGELSGVLDGDEALVVKELV-GSAVVDEAMVWDSAEAKLDLHGMHLG 365

Query: 1123 TAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLK 1302
            +AY+++LEW + + +RF       V+PAE+ +VCGLGKHSSV G+SP+K LV EMM +++
Sbjct: 366  SAYLVMLEWFEAMGNRFKSA--ECVVPAEVVIVCGLGKHSSVRGESPVKDLVKEMMHQME 423

Query: 1303 SPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            SPMRI RKN VG F+ KG+AVKDWLC
Sbjct: 424  SPMRIDRKN-VGCFIAKGRAVKDWLC 448


>ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina]
            gi|568866680|ref|XP_006486677.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g17033-like [Citrus sinensis]
            gi|557524456|gb|ESR35762.1| hypothetical protein
            CICLE_v10028424mg [Citrus clementina]
          Length = 451

 Score =  426 bits (1095), Expect = e-116
 Identities = 232/451 (51%), Positives = 305/451 (67%), Gaps = 9/451 (1%)
 Frame = +1

Query: 55   LQFSVSLKWGNXXXXXXNQSSAAIIC---SLSKKGQRXXXXXXXXXXXXXXX-HRLIRKF 222
            L   +   W +       Q    + C    L+K+GQR                 RLI KF
Sbjct: 5    LHMRIPPPWNSRCCRLRQQRLTLVQCLTARLTKQGQRFLSSLALAVTRDSKAASRLISKF 64

Query: 223  VASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQ 396
            VASS + IALN LSHLLS D  H   SSLA P+Y  ITE SWF+WNPKLV+ +IA L+ Q
Sbjct: 65   VASSPQFIALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEIIAFLDKQ 124

Query: 397  GQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXX 576
            GQ + +E LI +++ KL S+EREL LFYCNLI+S+ K+ SK+G  D+Y            
Sbjct: 125  GQREEAETLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQLVNSSSS 184

Query: 577  XX-NRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMR 753
                RQA +SMI+GLC +  PH+AE ++E+MR++G +PS FE++ ++  YGRLGLL DM 
Sbjct: 185  VYVKRQALKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLGLLEDME 244

Query: 754  RVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSC 933
            R+++QME  G  +DT+ SN+VLSSYGD+ ELS MV+W++KMK S +PFSVRTYNSVLNSC
Sbjct: 245  RIVNQMESDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYNSVLNSC 304

Query: 934  PTIMAMLKE--PEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLH 1107
             TIM+ML++      PLSI +L + + + E  +V+EL D SSVL E +KWDS E KLDLH
Sbjct: 305  STIMSMLQDLNSNDFPLSILELTEVLNEEEVSVVKELED-SSVLDEAMKWDSGETKLDLH 363

Query: 1108 GMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEM 1287
            GMHLG+AY I+L+WMD +++RF    +  VIPAEI VVCG GKHS+V G+S +KA+V +M
Sbjct: 364  GMHLGSAYFIILQWMDEMRNRF--NNEKHVIPAEITVVCGSGKHSTVRGESSVKAMVKKM 421

Query: 1288 MVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            MVR  SPMR+ R N++G F+ KG  VKDWLC
Sbjct: 422  MVRTSSPMRVHR-NNIGCFIAKGHVVKDWLC 451


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  424 bits (1091), Expect = e-116
 Identities = 227/428 (53%), Positives = 303/428 (70%), Gaps = 3/428 (0%)
 Frame = +1

Query: 106  NQSSAAIICSLSKKGQRXXXXXXXXXXXXXXXHRLIRKFVASSSKSIALNTLSHLLSSDI 285
            ++S   I C+LSK+GQ                +RLI KF+ASSSKSIALN LSHLLS   
Sbjct: 15   HRSPLLIQCALSKQGQ---LFLSSVARDPSASNRLICKFIASSSKSIALNALSHLLSPTT 71

Query: 286  RHH--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQE 459
             H   SSLALP+Y  I+EASWF WNPKL+++VIA L  QGQ   +E L+S+++ KL S+E
Sbjct: 72   THPYLSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRE 131

Query: 460  RELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXXNRQ-AYESMINGLCTLDLP 636
            R+L  FYCNLI+S+SK+ S +GVFD                 ++ AY+SMI+ LC + LP
Sbjct: 132  RDLVSFYCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLP 191

Query: 637  HDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIV 816
             +AE ++E+MR++G KPS FEFRSVV  YGR+GL  DM+R+L QM   G+ LDT+ SN+V
Sbjct: 192  LEAENLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMV 251

Query: 817  LSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLI 996
            LSSYG   + SEMV W+++MK+S++PFS+RTYNSVLNSCP IM++L++ +  P +I++L+
Sbjct: 252  LSSYGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELM 311

Query: 997  DNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFG 1176
            + +K  E +LV+ELI GS VL E ++WD  EGKLDLHGMHLG+AY+I+L+W + L+ R  
Sbjct: 312  ETLKGDEALLVKELI-GSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLN 370

Query: 1177 GGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKG 1356
                  V+P EI VVCG GKHSSV G+SP+K +V EMM R +SPM+I RKN +G FV K 
Sbjct: 371  AA--EYVMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKN-IGCFVAKA 427

Query: 1357 KAVKDWLC 1380
            K VK+WLC
Sbjct: 428  KVVKNWLC 435


>gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]
          Length = 517

 Score =  419 bits (1078), Expect = e-114
 Identities = 224/426 (52%), Positives = 301/426 (70%), Gaps = 3/426 (0%)
 Frame = +1

Query: 112  SSAAIICSLSKKGQRXXXXXXXXXXXXXXXHRLIRKFVASSSKSIALNTLSHLLSSDIRH 291
            +S++I C+L+K+G R               ++LI KFVASS KSI+LN LSHLLS D  H
Sbjct: 96   ASSSIQCALTKQGHRFLSTLSINAGNASAANKLIGKFVASSPKSISLNALSHLLSPDTTH 155

Query: 292  H--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQERE 465
               +S +L +Y  I EASWF ++PKLV+ + A L+ QG++  +E LI+++V KL  ++RE
Sbjct: 156  THLTSHSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIAEAVSKLGHRQRE 215

Query: 466  LALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXXNR-QAYESMINGLCTLDLPHD 642
            LA+FYC+L+ S+SK  SK G   SY               + +A+E+M+  LCT+D P +
Sbjct: 216  LAVFYCSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYVKCRAFETMVGALCTMDRPCE 275

Query: 643  AEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIVLS 822
            AE +ME+MR +G KPS FEFRS+V  YGRLGL  DM R ++QME  G  +DTI SN+VLS
Sbjct: 276  AESLMEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGLVIDTICSNMVLS 335

Query: 823  SYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLIDN 1002
            SYG + EL +MV+W++KM++S++PFS+RTYNSVLN CPTI AML++ + +PLS+ +L   
Sbjct: 336  SYGAHNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLKDIPLSMYELNAT 395

Query: 1003 VKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGG 1182
            ++  EG+LV EL+ GSSVL E L WDSLE KLDLHGMHLG+AY+I+LEWM+ +  RF  G
Sbjct: 396  LRGDEGLLVMELV-GSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEWMEEMTRRFNDG 454

Query: 1183 GDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKA 1362
                 IPAE+ VVCG GKHS+V G SP+K LV EMMV++KSPM+I RKN  G F+ KGK 
Sbjct: 455  NHG--IPAEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKN-AGCFLAKGKT 511

Query: 1363 VKDWLC 1380
            V+DWLC
Sbjct: 512  VRDWLC 517


>ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297329872|gb|EFH60291.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 504

 Score =  408 bits (1049), Expect = e-111
 Identities = 212/396 (53%), Positives = 283/396 (71%), Gaps = 3/396 (0%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIR--HHSSLALPMYEWITEASWFEWNPKLVSNV 375
            HR I+KFVA+S KS+ LN LSHLLS      H S  AL +Y  ITEASWF+WNPKL++ +
Sbjct: 112  HRHIKKFVAASPKSVTLNVLSHLLSDQTSYPHLSFFALSLYSEITEASWFDWNPKLIAEL 171

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDS-YXXXX 552
            +A L NQ +FD SE L+S +V +L+S ER+ ALF CNL+ S SK GS +G  ++ +    
Sbjct: 172  VAVLNNQERFDESETLLSTAVSRLKSNERDFALFLCNLVESNSKQGSIQGFNEACFRLRE 231

Query: 553  XXXXXXXXXXNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                        QAY+SM+ GLC +D PHDAE ++E+MR+E  KP  FE +SV+  YGRL
Sbjct: 232  RIQRSSSVYVKTQAYKSMVAGLCNMDQPHDAERVIEEMRVEKIKPGSFEHKSVLYGYGRL 291

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 912
            GL +DM RV+ +ME  G+ +DT+ SN+VLSSYG +  L +M  W++K+K  NVPFS+RTY
Sbjct: 292  GLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 351

Query: 913  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 1092
            NSVLNSCPTIM++LK+    P+S+ +L   + + E +LV EL   S+VL E ++W+++EG
Sbjct: 352  NSVLNSCPTIMSLLKDLNSCPVSLSELRTFLNEDEALLVLELTQ-STVLDEAIEWNAVEG 410

Query: 1093 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 1272
            KLDLHGMHL ++Y+ILL+WMD ++ RF       VIPAEI VV G GKHS+V G+SP+KA
Sbjct: 411  KLDLHGMHLSSSYLILLQWMDEIRLRF--RDQKCVIPAEIVVVSGSGKHSNVRGESPVKA 468

Query: 1273 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            LV ++MVR +SPMRI RKN VGSF+ KGK VK+WLC
Sbjct: 469  LVKKIMVRTESPMRIDRKN-VGSFIAKGKNVKEWLC 503


>ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein
            [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 505

 Score =  406 bits (1044), Expect = e-110
 Identities = 212/396 (53%), Positives = 283/396 (71%), Gaps = 3/396 (0%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 375
            +R I+KFVA+S KS+ALN LSHLLS    H   S  AL +Y  ITEASWF+WNPKL++ +
Sbjct: 113  NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 172

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFD-SYXXXX 552
            IA L  Q +FD SE L+S +V +L+S ER+  LF CNL+ S SK GS +G  + S+    
Sbjct: 173  IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 232

Query: 553  XXXXXXXXXXNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                        QAY+SM++GLC +D PHDAE ++E+MR+E  KP  FE++SV+  YGRL
Sbjct: 233  IIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGYGRL 292

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 912
            GL +DM RV+ +M   G+ +DT+ SN+VLSSYG +  L +M  W++K+K  NVPFS+RTY
Sbjct: 293  GLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 352

Query: 913  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 1092
            NSVLNSCPTI++MLK+ +  P+S+ +L   + + E +LV EL   SSVL E ++W+++EG
Sbjct: 353  NSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQ-SSVLDEAIEWNAVEG 411

Query: 1093 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 1272
            KLDLHGMHL ++Y+ILL+WMD  + RF    +  VIPAEI VV G GKHS+V G+SP+KA
Sbjct: 412  KLDLHGMHLSSSYLILLQWMDETRLRF--SEEKCVIPAEIVVVSGSGKHSNVRGESPVKA 469

Query: 1273 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            LV ++MVR  SPMRI RKN VGSF+ KGK VK+WLC
Sbjct: 470  LVKKIMVRTGSPMRIDRKN-VGSFIAKGKTVKEWLC 504


>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score =  406 bits (1044), Expect = e-110
 Identities = 212/396 (53%), Positives = 283/396 (71%), Gaps = 3/396 (0%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 375
            +R I+KFVA+S KS+ALN LSHLLS    H   S  AL +Y  ITEASWF+WNPKL++ +
Sbjct: 109  NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 168

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFD-SYXXXX 552
            IA L  Q +FD SE L+S +V +L+S ER+  LF CNL+ S SK GS +G  + S+    
Sbjct: 169  IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 228

Query: 553  XXXXXXXXXXNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                        QAY+SM++GLC +D PHDAE ++E+MR+E  KP  FE++SV+  YGRL
Sbjct: 229  IIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGYGRL 288

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 912
            GL +DM RV+ +M   G+ +DT+ SN+VLSSYG +  L +M  W++K+K  NVPFS+RTY
Sbjct: 289  GLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 348

Query: 913  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 1092
            NSVLNSCPTI++MLK+ +  P+S+ +L   + + E +LV EL   SSVL E ++W+++EG
Sbjct: 349  NSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQ-SSVLDEAIEWNAVEG 407

Query: 1093 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 1272
            KLDLHGMHL ++Y+ILL+WMD  + RF    +  VIPAEI VV G GKHS+V G+SP+KA
Sbjct: 408  KLDLHGMHLSSSYLILLQWMDETRLRF--SEEKCVIPAEIVVVSGSGKHSNVRGESPVKA 465

Query: 1273 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            LV ++MVR  SPMRI RKN VGSF+ KGK VK+WLC
Sbjct: 466  LVKKIMVRTGSPMRIDRKN-VGSFIAKGKTVKEWLC 500


>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein
            [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown
            protein [Arabidopsis thaliana]
            gi|330251481|gb|AEC06575.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  406 bits (1044), Expect = e-110
 Identities = 212/396 (53%), Positives = 283/396 (71%), Gaps = 3/396 (0%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 375
            +R I+KFVA+S KS+ALN LSHLLS    H   S  AL +Y  ITEASWF+WNPKL++ +
Sbjct: 112  NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 171

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFD-SYXXXX 552
            IA L  Q +FD SE L+S +V +L+S ER+  LF CNL+ S SK GS +G  + S+    
Sbjct: 172  IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 231

Query: 553  XXXXXXXXXXNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                        QAY+SM++GLC +D PHDAE ++E+MR+E  KP  FE++SV+  YGRL
Sbjct: 232  IIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGYGRL 291

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 912
            GL +DM RV+ +M   G+ +DT+ SN+VLSSYG +  L +M  W++K+K  NVPFS+RTY
Sbjct: 292  GLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 351

Query: 913  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 1092
            NSVLNSCPTI++MLK+ +  P+S+ +L   + + E +LV EL   SSVL E ++W+++EG
Sbjct: 352  NSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQ-SSVLDEAIEWNAVEG 410

Query: 1093 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 1272
            KLDLHGMHL ++Y+ILL+WMD  + RF    +  VIPAEI VV G GKHS+V G+SP+KA
Sbjct: 411  KLDLHGMHLSSSYLILLQWMDETRLRF--SEEKCVIPAEIVVVSGSGKHSNVRGESPVKA 468

Query: 1273 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            LV ++MVR  SPMRI RKN VGSF+ KGK VK+WLC
Sbjct: 469  LVKKIMVRTGSPMRIDRKN-VGSFIAKGKTVKEWLC 503


>ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing protein, putative
            [Theobroma cacao] gi|508705664|gb|EOX97560.1|
            Pentatricopeptide (PPR) repeat-containing protein,
            putative [Theobroma cacao]
          Length = 456

 Score =  405 bits (1041), Expect = e-110
 Identities = 207/397 (52%), Positives = 285/397 (71%), Gaps = 3/397 (0%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLS--SDIRHHSSLALPMYEWITEASWFEWNPKLVSNV 375
            +RLI+KFVASS KSIALN LSHLLS  +   H S+LA P+Y  I+E SW+ WNPKLV+ +
Sbjct: 58   NRLIKKFVASSPKSIALNALSHLLSPRNSHPHLSALAFPLYTKISETSWYNWNPKLVAEL 117

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXX 555
            IA L  QG++D SE LIS +V KL+ +ER+L  FYCN I S SK+ SK+G  D+Y     
Sbjct: 118  IALLVKQGRYDESEALISQAVSKLKFRERDLVQFYCNWIESCSKHNSKEGFNDAYCYLSE 177

Query: 556  XXXXXXXXX-NRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                       RQ Y+SM++ LC +D P++AE ++E+MR  G  P+ FEFR +   YG+L
Sbjct: 178  LICNSSSVYVKRQGYKSMVSSLCEMDRPNEAENLVEEMRKNGLTPTLFEFRFISYGYGQL 237

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 912
            GL  DM R++ +ME  G+ +DTI SN+VLSSYG     S+MV W++KMK+  +PFS+RTY
Sbjct: 238  GLFEDMERMVCEMEIEGFEVDTICSNMVLSSYGAYNAFSKMVPWLQKMKTLQIPFSIRTY 297

Query: 913  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 1092
            NSVLNSCP IM++++  + +PLS+ +L   + + E +LVQEL+  SSVL E ++W+  EG
Sbjct: 298  NSVLNSCPEIMSLVQGLDSVPLSLGELAKILNEDEALLVQELVKSSSVLDEAMEWNGSEG 357

Query: 1093 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 1272
            KLDLHGMHLG+AY+I+L+W++ +K RF    +  VIPA+I +VCG GKHSSV G+SP+K 
Sbjct: 358  KLDLHGMHLGSAYLIMLQWIEEMKCRF--KVEECVIPAQITIVCGSGKHSSVRGESPVKT 415

Query: 1273 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLCP 1383
            L+ +MMV++KSPM+I RKN +G F+ KG+ VK+WL P
Sbjct: 416  LMRKMMVKMKSPMKIDRKN-IGCFIAKGQVVKNWLIP 451


>gb|EYU37991.1| hypothetical protein MIMGU_mgv1a006093mg [Mimulus guttatus]
          Length = 458

 Score =  402 bits (1033), Expect = e-109
 Identities = 213/433 (49%), Positives = 288/433 (66%), Gaps = 3/433 (0%)
 Frame = +1

Query: 124  IICSLSKKGQRXXXXXXXXXXXXXXXHRLIRKFVASSSKSIALNTLSHLLSSDIRHH--S 297
            ++C L+K+GQR                 L+RKFVASSSK +AL+TLSHLLS    H   S
Sbjct: 26   LVCVLTKQGQRLLSSIATSEQPSAAIS-LLRKFVASSSKHVALSTLSHLLSPSTSHPRLS 84

Query: 298  SLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQERELALF 477
            SLA P+Y  I + SWF WN KLV+++I+ L    +FD ++ L  ++V KL  +ER+L  F
Sbjct: 85   SLAFPLYGIIEQESWFTWNSKLVADLISLLYKAERFDEADNLFGETVSKLGFKERDLCTF 144

Query: 478  YCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXXNRQ-AYESMINGLCTLDLPHDAEEI 654
            YCNL++S++K+ S++GV DS                +Q  YESMI G C +  P  AE +
Sbjct: 145  YCNLVDSHAKHMSERGVSDSCTRLKQLILASSSVYVKQKGYESMIAGFCEIGSPDKAENL 204

Query: 655  MEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGD 834
            ME+MR  G KPS FE R++V  YG++GLL DM+R + QMEK G+ LDT+  N+VLSS+G 
Sbjct: 205  MEEMRQNGLKPSAFELRTLVYGYGQMGLLEDMKRSVGQMEKEGFELDTVCYNMVLSSFGA 264

Query: 835  NKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKV 1014
              E  +M++W+KKM++S +PFS+RTYNSVLNSCPT++ +L++ + LPLS+ +L+DN+K  
Sbjct: 265  RNEFLDMLLWLKKMRNSGIPFSIRTYNSVLNSCPTVILLLEDMKSLPLSVNELVDNLKTG 324

Query: 1015 EGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSV 1194
            E  LV EL+  S VL + ++W S E KLD+HGMHL TAY+ILL+W   LK RFG G    
Sbjct: 325  EADLVLELMK-SDVLDQVMEWKSTELKLDMHGMHLSTAYLILLQWFKELKVRFGDGNHET 383

Query: 1195 VIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDW 1374
              P EI VVCG GKHSS  G+SP+K L  EM+ R+K P+RI RKN +G F+GKGK  KDW
Sbjct: 384  --PTEILVVCGSGKHSSKRGESPVKVLAKEMVTRMKCPLRIDRKN-IGCFIGKGKTFKDW 440

Query: 1375 LCPVVSNETLARL 1413
            LC   SN+  A +
Sbjct: 441  LCNEDSNKNPAEI 453


>ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum]
            gi|557110519|gb|ESQ50810.1| hypothetical protein
            EUTSA_v10022675mg [Eutrema salsugineum]
          Length = 469

 Score =  401 bits (1031), Expect = e-109
 Identities = 206/396 (52%), Positives = 284/396 (71%), Gaps = 3/396 (0%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 375
            +R I+KFVA+S KS++LN LSHLLS+   H   S  AL +Y  ITEASWF+WNPKL++ +
Sbjct: 77   NRHIKKFVAASPKSVSLNVLSHLLSAQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 136

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDS-YXXXX 552
            +A L  Q +   SE L+S++V +L+S ER++ALFYCNL+ S SK GS +G  ++      
Sbjct: 137  VALLNKQERSHESETLLSNAVSRLKSNERDIALFYCNLVESNSKQGSIQGFNEACVRLRE 196

Query: 553  XXXXXXXXXXNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                        QAY+SM++GLC +D PHDAE ++E+MR+   KP  FE++SV+  YGRL
Sbjct: 197  ITRRSTSVYVKTQAYKSMVSGLCNMDQPHDAESVIEEMRIAKIKPGLFEYKSVLYGYGRL 256

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 912
            GL  DM RV+ +ME  G+ +DT+ SN+VLSSYG +  L +M  W++K+K SNVP S RTY
Sbjct: 257  GLFEDMNRVVHRMETEGHKIDTVCSNMVLSSYGAHNALPQMGSWLQKLKDSNVPLSERTY 316

Query: 913  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 1092
            NSVLNSCPTI+++LK+ +  P+S+ +L+  + K E +LV+ L   SSVL E ++W SLEG
Sbjct: 317  NSVLNSCPTILSLLKDLDSCPVSLSELLTFLNKDEEVLVRGLTQ-SSVLDEAIEWSSLEG 375

Query: 1093 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 1272
            KLDLHGMHL ++Y+I+++WMD ++ RF  G    V+PAEI +V G GKHS+V G+SP+KA
Sbjct: 376  KLDLHGMHLSSSYLIMMQWMDEMRIRFSEG--KCVVPAEIVLVSGSGKHSNVRGESPVKA 433

Query: 1273 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            LV ++MVR  SPMRI RKN +GSF+ KGK VK+WLC
Sbjct: 434  LVKKIMVRTGSPMRIDRKN-IGSFIAKGKTVKEWLC 468


>ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella]
            gi|482566151|gb|EOA30340.1| hypothetical protein
            CARUB_v10013465mg [Capsella rubella]
          Length = 516

 Score =  397 bits (1019), Expect = e-108
 Identities = 207/396 (52%), Positives = 283/396 (71%), Gaps = 3/396 (0%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIRH-HSSLALP-MYEWITEASWFEWNPKLVSNV 375
            +RLI+KFVA+S KS+ALN LSHLLS +  H H S   P +Y  ITEASWF+WNPKL+  +
Sbjct: 123  NRLIKKFVAASPKSVALNVLSHLLSDNTSHPHLSYFAPQLYLEITEASWFDWNPKLIGEL 182

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXX 555
            ++ L  Q +F  SE L+S +V +LES ER+ ALF CNL+ S SK GS +G  D+      
Sbjct: 183  VSLLNKQERFVESETLLSTAVSRLESNERDFALFLCNLVESNSKQGSIQGFSDACSRLRE 242

Query: 556  XXXXXXXXXNR-QAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                      + QAY+SM++GLC +D P DAE ++E+MR+E  KP  FE++SV+  YGRL
Sbjct: 243  IIQRSSSVYVKTQAYKSMVSGLCNMDQPLDAERVIEEMRMETIKPGLFEYKSVLYGYGRL 302

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 912
            GL +DM R++ +ME  G+ +DT+ SN+VLSSYG +  L +M  W++K+K  NVP S+RTY
Sbjct: 303  GLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGYNVPLSIRTY 362

Query: 913  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 1092
            NSVLNSCPTI+++LK+ +  PLS+ +L+  + + E +LV+EL   S VL E ++W+++EG
Sbjct: 363  NSVLNSCPTIISLLKDLDSCPLSLSELLPILNEDEALLVRELTQ-SLVLDEAIEWNAVEG 421

Query: 1093 KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 1272
            KLDLHGMHL  +Y+I+L+WMD  + RF       V+PAEI VV G GKHS+V G+SP+KA
Sbjct: 422  KLDLHGMHLSASYLIMLQWMDETRLRF-SEDKKCVVPAEIVVVSGSGKHSNVRGESPVKA 480

Query: 1273 LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            +V ++MVR KSPMRI RKN VGSF+ KGK VK+WLC
Sbjct: 481  MVKKIMVRTKSPMRIDRKN-VGSFIAKGKNVKEWLC 515


>ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa]
            gi|550331693|gb|EEE86893.2| hypothetical protein
            POPTR_0009s14120g [Populus trichocarpa]
          Length = 473

 Score =  387 bits (993), Expect = e-105
 Identities = 204/398 (51%), Positives = 277/398 (69%), Gaps = 5/398 (1%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIRHHSSL---ALPMYEWITEASWFEWNPKLVSN 372
            +RLI+KFVASS KSIAL+ LS+LLS D  HH  L    LP+Y  I+EASWF WNPKLV+ 
Sbjct: 80   NRLIKKFVASSPKSIALDALSNLLSPDSTHHPLLYLLTLPLYLKISEASWFSWNPKLVAQ 139

Query: 373  VIASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXX 552
            V+  L+ QG     + L+S++V +L+ +EREL LFYCNLI   SK+   +G  DSY    
Sbjct: 140  VVVLLDKQGLDKELKALMSETVSRLQFKERELVLFYCNLIGFNSKHNWVRGFDDSYSRLN 199

Query: 553  XXXXXXXXXX-NRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGR 729
                        +Q Y++MI+GLC +    +AE+++ +MR  G KP  FEFR V+  YGR
Sbjct: 200  QFVSDSNSVYVKKQGYKAMISGLCEMGRAREAEDLIGEMRERGLKPKLFEFRCVLYGYGR 259

Query: 730  LGLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRT 909
            LGL  DM R+LD+ME     +DT+ +N+VL+SYG +  L EM +W++KMK+  +P S+RT
Sbjct: 260  LGLFKDMERILDKMESGEIEVDTVCANMVLASYGAHNALPEMGLWLRKMKTLGIPLSIRT 319

Query: 910  YNSVLNSCPTIMAMLKEPE-FLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSL 1086
             NSVLNSCPTIMA+++  +   P+SI++L+  + + E MLV+ELI+ SSVL E  KWD+ 
Sbjct: 320  CNSVLNSCPTIMALMRNLDASYPVSIQELLKILSEEEAMLVKELIE-SSVLKEATKWDTS 378

Query: 1087 EGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPI 1266
            EGKLDLHGMHLG+AYVI+L+WM+  ++R   G    VIPAEI VVCG G HS+V G+SP+
Sbjct: 379  EGKLDLHGMHLGSAYVIMLQWMEETRNRLSDG--EHVIPAEITVVCGSGNHSTVRGESPV 436

Query: 1267 KALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            K++++E+M + +SPMRI RKN +G FV KG  VK WLC
Sbjct: 437  KSMITEIMAQTRSPMRIDRKN-IGCFVAKGNVVKKWLC 473


>ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539507|gb|EEF41095.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 460

 Score =  386 bits (992), Expect = e-104
 Identities = 205/421 (48%), Positives = 283/421 (67%), Gaps = 5/421 (1%)
 Frame = +1

Query: 133  SLSKKGQRXXXXXXXXXXXXXXX--HRLIRKFVASSSKSIALNTLSHLLS--SDIRHHSS 300
            +LSK+GQR                 +RLI+KFVA+S KSIAL+ LSHLL+  S   H SS
Sbjct: 44   ALSKQGQRFLSSLAIATTKGDTVATNRLIKKFVAASPKSIALDALSHLLNPHSSHSHLSS 103

Query: 301  LALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQERELALFY 480
            LA  +Y  I EA WF+WNPKLV++V+A L+ QG++D S  L+SDS+ KL+ +ER+LA FY
Sbjct: 104  LAFTLYLKIAEARWFQWNPKLVADVVAFLDKQGRYDESATLVSDSISKLQVKERDLARFY 163

Query: 481  CNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXX-NRQAYESMINGLCTLDLPHDAEEIM 657
            CNL+ S SK  S +G  +S                 RQ Y+SM+NGLC +  P +AE ++
Sbjct: 164  CNLVESQSKQNSIRGFDNSVASLMQLVCNSNSVYVKRQGYKSMVNGLCEMGRPREAETLI 223

Query: 658  EKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDN 837
            E+M  EG +PS FEF+ VV AYG LG   +M + L QME++G+ +DT+ SN++L+SYG +
Sbjct: 224  EEMGKEGVRPSMFEFKCVVYAYGSLGSFEEMNKCLHQMERAGFRVDTVCSNMILASYGAH 283

Query: 838  KELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVE 1017
              L EMV+W++KMK   +PFS+RT NS LNSCPTIM+M++     P+SI DL+  + + E
Sbjct: 284  NALPEMVLWLQKMKDLGIPFSLRTCNSALNSCPTIMSMMQNSNDFPISIHDLMKILSEDE 343

Query: 1018 GMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVV 1197
             +LV+E++  SSVL E +KWD  E KLDLHG HL +AY+I+L W++ ++ RF     + V
Sbjct: 344  ALLVKEIVT-SSVLDEAMKWDVAEAKLDLHGTHLCSAYLIILLWIEEMRKRF--KSVNYV 400

Query: 1198 IPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWL 1377
             P EI VVCG G HS V G+SP+K +V + MVR +SPMRI R+N +G F+ KGK V++WL
Sbjct: 401  NPTEITVVCGSGNHSIVRGESPVKCMVKDFMVRARSPMRIDRRN-IGCFIAKGKVVEEWL 459

Query: 1378 C 1380
            C
Sbjct: 460  C 460


>ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223617 [Cucumis sativus]
          Length = 1296

 Score =  378 bits (971), Expect = e-102
 Identities = 203/408 (49%), Positives = 283/408 (69%), Gaps = 7/408 (1%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIRHHS--SLALPMYEWITEASWFEWNPKLVSNV 375
            +RLIRKFVASS KSI L+ LS+++S+        S AL +Y  ITEASWF WN KLV+++
Sbjct: 69   NRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADL 128

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXX 555
            +A L+  G +  SE LIS+++ KL SQER+L  FY  L+ S SK+G ++G  DSY     
Sbjct: 129  VAFLDQNGLYSESEVLISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLE 188

Query: 556  XXXXXXXXX-NRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                       R+AYESM+ GLC++  PH+AE ++++MR +G  P+ +E+RS++ AYG L
Sbjct: 189  LLYNSPSVYVKRRAYESMVTGLCSMKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTL 248

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSS-NVPFSVRT 909
            GL  +M+R L QME     LDT+ SN+VLSSYG + +L +MV+W+++MK+S +   SVRT
Sbjct: 249  GLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRT 308

Query: 910  YNSVLNSCPTIMAMLKEPEF--LPLSIEDLIDNVK-KVEGMLVQELIDGSSVLLENLKWD 1080
            YNSVLNSCP I AML++ +   LP+ IEDLI  +    E +LV+EL+ GSSVL E + WD
Sbjct: 309  YNSVLNSCPKITAMLQDHKSTNLPVLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWD 368

Query: 1081 SLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKS 1260
            ++E KLDLHG H+G AYVI+L+W+  ++  F    +S VIPA++ ++CG GKHS V G+S
Sbjct: 369  AMELKLDLHGAHVGAAYVIMLQWIKEMRLNF--EDESYVIPAQVTLICGSGKHSIVRGES 426

Query: 1261 PIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLCPVVSNETL 1404
            P+KAL+ E+MVR +SP+RI RKN  G F+ KGKAVK+WLC +     L
Sbjct: 427  PVKALIKEIMVRTESPLRIDRKN-TGCFISKGKAVKNWLCSLPGKRIL 473


>ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204365 [Cucumis sativus]
          Length = 1913

 Score =  378 bits (970), Expect = e-102
 Identities = 202/400 (50%), Positives = 281/400 (70%), Gaps = 7/400 (1%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIRHHS--SLALPMYEWITEASWFEWNPKLVSNV 375
            +RLIRKFVASS KSI L+ LS+++S+        S AL +Y  ITEASWF WN KLV+++
Sbjct: 69   NRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADL 128

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXX 555
            +A L+  G +  SE LIS+++ KL SQER+L  FY  L+ S SK+G ++G  DSY     
Sbjct: 129  VAFLDQNGLYSESEVLISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLE 188

Query: 556  XXXXXXXXX-NRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                       R+AYESM+ GLC++  PH+AE ++++MR +G  P+ +E+RS++ AYG L
Sbjct: 189  LLYNSPSVYVKRRAYESMVTGLCSMKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTL 248

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSS-NVPFSVRT 909
            GL  +M+R L QME     LDT+ SN+VLSSYG + +L +MV+W+++MK+S +   SVRT
Sbjct: 249  GLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRT 308

Query: 910  YNSVLNSCPTIMAMLKEPEF--LPLSIEDLIDNVK-KVEGMLVQELIDGSSVLLENLKWD 1080
            YNSVLNSCP I AML++ +   LP+ IEDLI  +    E +LV+EL+ GSSVL E + WD
Sbjct: 309  YNSVLNSCPKITAMLQDHKSTNLPVLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWD 368

Query: 1081 SLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKS 1260
            ++E KLDLHG H+G AYVI+L+W+  ++  F    +S VIPA++ ++CG GKHS V G+S
Sbjct: 369  AMELKLDLHGAHVGAAYVIMLQWIKEMRLNF--EDESYVIPAQVTLICGSGKHSIVRGES 426

Query: 1261 PIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            P+KAL+ E+MVR +SP+RI RKN  G F+ KGKAVK+WLC
Sbjct: 427  PVKALIKEIMVRTESPLRIDRKN-TGCFISKGKAVKNWLC 465


>ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum tuberosum]
          Length = 459

 Score =  377 bits (967), Expect = e-101
 Identities = 204/421 (48%), Positives = 288/421 (68%), Gaps = 5/421 (1%)
 Frame = +1

Query: 130  CSLSKKGQRXXXXXXXXXXXXXXXHR-LIRKFVASSSKSIALNTLSHLLSSDIRHH---S 297
            CSLSK+G R                R L+RKFVASSSK +AL+TLSHL+S     H    
Sbjct: 31   CSLSKQGHRFLSTLIAADSEDISATRHLLRKFVASSSKHVALSTLSHLVSPTTTSHYRLC 90

Query: 298  SLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQERELALF 477
            SLALP+Y  I+EASWF+WN KLV++++A L    +FD +E L++++V KL S+ER+L  F
Sbjct: 91   SLALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETLVTETVSKLGSRERDLCSF 150

Query: 478  YCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXXNRQ-AYESMINGLCTLDLPHDAEEI 654
            Y  LI+S SK+ S++GV D                 +Q  Y SM+ G C + LP  AEE+
Sbjct: 151  YSQLIHSQSKHNSERGVLDFCTKLKLVLLRSSSVYLKQRGYASMVEGFCLIGLPRKAEEL 210

Query: 655  MEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGD 834
            ME+M+  G K S FEFRS+V +YG+ G L DM+R++ +ME  G+ LDT+SSN+VL+S+G 
Sbjct: 211  MEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMESMGFQLDTVSSNMVLNSFGS 270

Query: 835  NKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKV 1014
            + ELSE+V  ++K+++S VPFS+RTYNSVLNSCPTI  +L++ + +PLS+E+L+ N+ + 
Sbjct: 271  HNELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQDLKSVPLSLEELMGNLDEN 330

Query: 1015 EGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSV 1194
            E +LV  L+ GSSVL E ++W   E KLDLHGMHL +AYVI+L+W   L+ +F    ++ 
Sbjct: 331  EAVLVNILV-GSSVLEETMQWKPSELKLDLHGMHLTSAYVIILQWFHQLQCKF--LAENR 387

Query: 1195 VIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDW 1374
            V+P EI VVCG GKHS V G+SP+K L+ E+++R+  P+RI RKN +G F+ KGK+  +W
Sbjct: 388  VLPGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRKN-IGCFIAKGKSFMEW 446

Query: 1375 L 1377
            L
Sbjct: 447  L 447


>gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucumis melo]
          Length = 488

 Score =  374 bits (959), Expect = e-101
 Identities = 201/400 (50%), Positives = 280/400 (70%), Gaps = 7/400 (1%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIRHHS--SLALPMYEWITEASWFEWNPKLVSNV 375
            +RLIRKFVASS KSI L+ LS+++S+        S AL +Y  ITEASWF WN KLV+++
Sbjct: 69   NRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADL 128

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXX 555
            +A L   G +  SE LIS+++ KL SQER+L  FY  L+ S SK+G ++G  DSY     
Sbjct: 129  VAFLGQNGLYSESEALISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFE 188

Query: 556  XXXXXXXXX-NRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                       R+AYESM+ GLC++  PH+AE ++++MR +G  P+ +E+RS++ AYG L
Sbjct: 189  LLYNSPSVYVKRRAYESMVTGLCSMKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTL 248

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMK-SSNVPFSVRT 909
            GL  +M+R L QME     LDT+ SN+VLSSYG + +L +M++W+++MK SS+   SVRT
Sbjct: 249  GLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMLLWLQRMKTSSHCKSSVRT 308

Query: 910  YNSVLNSCPTIMAMLKEPEF--LPLSIEDLIDNVK-KVEGMLVQELIDGSSVLLENLKWD 1080
            YNSVLNSCP I +ML++ +   LP+ IEDLI  +    E +LV+EL+ GSSVL E + WD
Sbjct: 309  YNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWD 368

Query: 1081 SLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKS 1260
            ++E KLDLHG H+G AYVI+L+W+  ++  F    +S VIPA++ ++CG GKHS V G+S
Sbjct: 369  AMELKLDLHGAHVGAAYVIMLQWIKEMRLNF--EDESNVIPAQVTLICGSGKHSIVRGES 426

Query: 1261 PIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            P+KAL+ E+MVR +SP+RI RKN  G F+ KGKAVK+WLC
Sbjct: 427  PVKALIKEIMVRTESPLRIDRKN-TGCFISKGKAVKNWLC 465


>gb|AGH33847.1| PPR [Cucumis melo]
          Length = 488

 Score =  373 bits (958), Expect = e-100
 Identities = 200/400 (50%), Positives = 280/400 (70%), Gaps = 7/400 (1%)
 Frame = +1

Query: 202  HRLIRKFVASSSKSIALNTLSHLLSSDIRHHS--SLALPMYEWITEASWFEWNPKLVSNV 375
            +RLIRKFVASS KSI L+ LS+++S+        S AL +Y  ITEASWF WN KLV+++
Sbjct: 69   NRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSAALTLYSRITEASWFTWNSKLVADL 128

Query: 376  IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXX 555
            +A L   G +  SE LIS+++ KL SQER+L  FY  L+ S SK+G ++G  DSY     
Sbjct: 129  VAFLGQNGLYSESEALISEAISKLGSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFE 188

Query: 556  XXXXXXXXX-NRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 732
                       R+AYESM+ GLC++  PH+AE ++++MR +G  P+ +E+RS++ AYG L
Sbjct: 189  LLYNSPSVYVKRRAYESMVTGLCSMKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTL 248

Query: 733  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSS-NVPFSVRT 909
            GL  +M+R L QME     LDT+ SN+VLSSYG + +L +M++W+++MK+S +   SVRT
Sbjct: 249  GLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAHNKLGDMLLWLQRMKTSPHCKSSVRT 308

Query: 910  YNSVLNSCPTIMAMLKEPEF--LPLSIEDLIDNVK-KVEGMLVQELIDGSSVLLENLKWD 1080
            YNSVLNSCP I +ML++ +   LP+ IEDLI  +    E +LV+EL+ GSSVL E + WD
Sbjct: 309  YNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWD 368

Query: 1081 SLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKS 1260
            ++E KLDLHG H+G AYVI+L+W+  ++  F    +S VIPA++ ++CG GKHS V G+S
Sbjct: 369  AMELKLDLHGAHVGAAYVIMLQWIKEMRLNF--EDESYVIPAQVTLICGSGKHSIVRGES 426

Query: 1261 PIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 1380
            P+KAL+ E+MVR +SP+RI RKN  G F+ KGKAVK+WLC
Sbjct: 427  PVKALIKEIMVRTESPLRIDRKN-TGCFISKGKAVKNWLC 465