BLASTX nr result

ID: Papaver31_contig00027633 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Papaver31_contig00027633
         (1515 letters)

Database: ./nr 
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containi...   494   e-137
ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prun...   458   e-126
ref|XP_008358363.1| PREDICTED: pentatricopeptide repeat-containi...   444   e-122
ref|XP_008358358.1| PREDICTED: pentatricopeptide repeat-containi...   444   e-122
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   437   e-119
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   426   e-116
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   425   e-116
ref|XP_010086846.1| hypothetical protein L484_006076 [Morus nota...   419   e-114
ref|XP_002884032.1| pentatricopeptide repeat-containing protein ...   408   e-111
ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing pr...   407   e-110
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   406   e-110
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           406   e-110
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   406   e-110
ref|XP_012837220.1| PREDICTED: pentatricopeptide repeat-containi...   402   e-109
ref|XP_012077696.1| PREDICTED: pentatricopeptide repeat-containi...   401   e-109
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   401   e-109
ref|XP_010548124.1| PREDICTED: pentatricopeptide repeat-containi...   398   e-108
ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps...   397   e-107
ref|XP_013622437.1| PREDICTED: pentatricopeptide repeat-containi...   396   e-107
emb|CDP14534.1| unnamed protein product [Coffea canephora]            396   e-107

>ref|XP_010266067.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Nelumbo nucifera]
          Length = 451

 Score =  494 bits (1273), Expect = e-137
 Identities = 261/444 (58%), Positives = 330/444 (74%), Gaps = 2/444 (0%)
 Frame = -1

Query: 1467 MLLQFSVSLKWGNHHHHNQSSAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASS 1288
            + L FS  L   +HHHH +      C+LSKKG R              ++RLIRKFVASS
Sbjct: 8    LALAFSADLL--HHHHHRRPLFLPWCALSKKGHRFFTSLAAAAGDSAAANRLIRKFVASS 65

Query: 1287 SKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFD 1114
            SKS ALN LSHL+SS+  H   SSL LPMY  I E  WF WNPKLV++VIA L+ QGQ +
Sbjct: 66   SKSDALNALSHLISSNTTHFHLSSLVLPMYRRIAETPWFNWNPKLVASVIAYLDKQGQPE 125

Query: 1113 VSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXLNR 934
             +E LIS+SVQKL  QER++ALFYC+LI+SYSK  S+ GVF+SY             L+R
Sbjct: 126  EAEALISESVQKLGFQERDVALFYCDLIDSYSKQRSRIGVFESYARLKQLFSDSSSSLSR 185

Query: 933  QAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQ 754
            +AYE++I  LC++DLP DAE ++E+M + GFKPS FEFRS+V  YGRLGL  DMRRVL +
Sbjct: 186  RAYETIICSLCSVDLPRDAENMVEEMTISGFKPSAFEFRSLVSGYGRLGLFTDMRRVLRK 245

Query: 753  MEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMA 574
            ME +GY LDTI SN+VLSS+G + ELSEM  W++KMK SN+ FS+RTYNSV+NSCPTI +
Sbjct: 246  MEDAGYCLDTICSNMVLSSFGAHSELSEMASWLRKMKDSNISFSIRTYNSVMNSCPTITS 305

Query: 573  MLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTA 394
            +LK+ +F+PLS+EDL   ++K E +LV++LI GSSVL++ LKW   EGKLDLHGMHL TA
Sbjct: 306  LLKDLKFVPLSMEDLKGRLQKDETLLVEQLI-GSSVLMDALKWCPSEGKLDLHGMHLATA 364

Query: 393  YVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSP 214
            Y+I+L+W+ VL+SRF  G  + VIP E RV+CG GKHSSV G+SP+KALV +MMVR+KSP
Sbjct: 365  YLIMLQWVQVLRSRFSAG--NWVIPTEFRVICGSGKHSSVRGESPVKALVKQMMVRMKSP 422

Query: 213  MRIGRKNDVGSFVGKGKAVKDWLC 142
            M+I R N+VG FVG+GKAV+DWLC
Sbjct: 423  MKIDR-NNVGCFVGRGKAVRDWLC 445


>ref|XP_007200730.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica]
            gi|462396130|gb|EMJ01929.1| hypothetical protein
            PRUPE_ppa021547mg [Prunus persica]
          Length = 447

 Score =  458 bits (1179), Expect = e-126
 Identities = 241/443 (54%), Positives = 318/443 (71%), Gaps = 3/443 (0%)
 Frame = -1

Query: 1461 LQFSVSLKWGNHHHHNQSSAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASSSK 1282
            L FS++L W         ++ I C+++K+GQR              +++LI KF+ SS+K
Sbjct: 9    LSFSIALPWNPLRPLPPLTSPIQCAVTKQGQRFLTKLAANARDAKVTNKLIAKFLTSSTK 68

Query: 1281 SIALNTLSHLLSSD--IRHHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVS 1108
            SIALNTLS+LLS D  + H SSLALP Y  ITEASWFEWNPKLV+ ++A L+ QGQ + +
Sbjct: 69   SIALNTLSYLLSPDTTLPHLSSLALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEA 128

Query: 1107 EKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-NRQ 931
            E LIS+++ KL S+ERELALF+C L+ S+SK  SK G   SY                 +
Sbjct: 129  EVLISETISKLGSRERELALFHCQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVKNR 188

Query: 930  AYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQM 751
            A+ESM++GLC +D P +A+ ++E+MR+ G KPS FEFRSVV  YGRLGL  DM +V++QM
Sbjct: 189  AFESMVSGLCEMDRPREADNLIEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQM 248

Query: 750  EKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAM 571
            E  G  +DTI SN+VLSSYG + EL+ M+VW++KMKS ++PFS+RTYNSVLNSC TIMAM
Sbjct: 249  ENQGIAIDTICSNMVLSSYGAHSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAM 308

Query: 570  LKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAY 391
            L+EP+  P SIE+L   +   E +LV+EL++ S+VL E + W+ LE KLDLHGMHLG+AY
Sbjct: 309  LQEPKDFPCSIEELNGVLNGDEALLVKELVE-STVLDEVMVWEPLEAKLDLHGMHLGSAY 367

Query: 390  VILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPM 211
            +ILLEW + ++ RF  G D  VIPAE+ V+CG GKHSSV G+SP+K LV +MM+R++SPM
Sbjct: 368  LILLEWFEAMRCRFNSGKD--VIPAEVVVICGSGKHSSVRGESPVKGLVKQMMLRMESPM 425

Query: 210  RIGRKNDVGSFVGKGKAVKDWLC 142
            RI RKN VG FV KG+AVKDWLC
Sbjct: 426  RIDRKN-VGCFVAKGRAVKDWLC 447


>ref|XP_008358363.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Malus domestica]
          Length = 461

 Score =  444 bits (1143), Expect = e-122
 Identities = 235/445 (52%), Positives = 315/445 (70%), Gaps = 5/445 (1%)
 Frame = -1

Query: 1461 LQFSVSLKWGNHHHHNQS--SAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASS 1288
            L FSV+  W +H        ++++ C L+K+GQR              +++LI KF++SS
Sbjct: 18   LSFSVASPWKHHQPRPTPPLASSVQCVLTKQGQRFLTKLAANARDPKFTNKLISKFLSSS 77

Query: 1287 SKSIALNTLSHLLSSDIR--HHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFD 1114
             KSIAL+TLS+LLS D    H SSLA P+Y  ITE SWFEWNPKLV++++A L+NQG + 
Sbjct: 78   PKSIALSTLSYLLSPDSTPPHLSSLAFPLYSKITEESWFEWNPKLVASLVALLDNQGLYS 137

Query: 1113 VSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-N 937
             SE LIS+++ KL S+ERELALF+C L+ S+SK  SK G   +Y                
Sbjct: 138  QSEALISETISKLGSRERELALFHCQLLESHSKLSSKHGFDSTYSYLHQLLHNSSSVYVK 197

Query: 936  RQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLD 757
            R+A+ESM+ GLC +D P +A+ ++E+M ++G KPS FEFRSVV  YGRLGL  +M +V++
Sbjct: 198  RRAFESMVGGLCAMDRPQEADILIEEMMVKGLKPSVFEFRSVVYGYGRLGLFEEMLKVVE 257

Query: 756  QMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIM 577
            +ME  G  +DTI SN+VLSSYG   EL+ MV+W++KMK   +PFS+RTYNSVLNSCPTIM
Sbjct: 258  KMEGQGLAVDTICSNMVLSSYGAYSELAAMVLWLRKMKILRLPFSIRTYNSVLNSCPTIM 317

Query: 576  AMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGT 397
            AML++P+ +P SIE L   +   EG++V+EL+ GS+VL E + W+SLE KLDLHG+HLG+
Sbjct: 318  AMLQDPKDVPCSIEQLNGVLNGDEGLVVKELV-GSTVLEEVMVWESLEAKLDLHGLHLGS 376

Query: 396  AYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKS 217
            AY+I+LEW + ++ RF  G    VIPAE+ +VCGLGKHSSV G+SP+K LV  MM R+ S
Sbjct: 377  AYLIMLEWFEAMRHRFNCG--ECVIPAEVVIVCGLGKHSSVRGESPVKGLVKVMMHRMGS 434

Query: 216  PMRIGRKNDVGSFVGKGKAVKDWLC 142
            PMRI RKN VG F+ KG+AVKDWLC
Sbjct: 435  PMRIDRKN-VGCFIAKGRAVKDWLC 458


>ref|XP_008358358.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Malus domestica]
          Length = 461

 Score =  444 bits (1143), Expect = e-122
 Identities = 235/445 (52%), Positives = 315/445 (70%), Gaps = 5/445 (1%)
 Frame = -1

Query: 1461 LQFSVSLKWGNHHHHNQS--SAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASS 1288
            L FSV+  W +H        ++++ C L+K+GQR              +++LI KF++SS
Sbjct: 18   LSFSVASPWKHHQPRPTPPLASSVQCVLTKQGQRFLTKLAANARDPKFTNKLISKFLSSS 77

Query: 1287 SKSIALNTLSHLLSSDIR--HHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFD 1114
             KSIAL+TLS+LLS D    H SSLA P+Y  ITE SWFEWNPKLV++++A L+NQG + 
Sbjct: 78   PKSIALSTLSYLLSPDSTPPHLSSLAFPLYSKITEESWFEWNPKLVASLVALLDNQGLYS 137

Query: 1113 VSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-N 937
             SE LIS+++ KL S+ERELALF+C L+ S+SK  SK G   +Y                
Sbjct: 138  QSEALISETISKLGSRERELALFHCQLLESHSKLSSKHGFDSTYSYLHQLLHNSSSVYVK 197

Query: 936  RQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLD 757
            R+A+ESM+ GLC +D P +A+ ++E+M ++G KPS FEFRSVV  YGRLGL  +M +V++
Sbjct: 198  RRAFESMVGGLCAMDRPQEADILIEEMMVKGLKPSVFEFRSVVYGYGRLGLFEEMLKVVE 257

Query: 756  QMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIM 577
            +ME  G  +DTI SN+VLSSYG   EL+ MV+W++KMK   +PFS+RTYNSVLNSCPTIM
Sbjct: 258  KMEGQGLAVDTICSNMVLSSYGAYSELAAMVLWLRKMKILRLPFSIRTYNSVLNSCPTIM 317

Query: 576  AMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGT 397
            AML++P+ +P SIE L   +   EG++V+EL+ GS+VL E + W+SLE KLDLHG+HLG+
Sbjct: 318  AMLQDPKDVPCSIEQLNGVLNGDEGLVVKELV-GSTVLEEVMVWESLEAKLDLHGLHLGS 376

Query: 396  AYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKS 217
            AY+I+LEW + ++ RF  G    VIPAE+ +VCGLGKHSSV G+SP+K LV  MM R+ S
Sbjct: 377  AYLIMLEWFEAMRHRFNCG--ECVIPAEVVIVCGLGKHSSVRGESPVKGLVKVMMHRMGS 434

Query: 216  PMRIGRKNDVGSFVGKGKAVKDWLC 142
            PMRI RKN VG F+ KG+AVKDWLC
Sbjct: 435  PMRIDRKN-VGCFIAKGRAVKDWLC 458


>ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Fragaria vesca subsp. vesca]
          Length = 448

 Score =  437 bits (1123), Expect = e-119
 Identities = 234/445 (52%), Positives = 313/445 (70%), Gaps = 5/445 (1%)
 Frame = -1

Query: 1461 LQFSVSLKWGNHH-HHNQSSAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASSS 1285
            L FSV+L W +    H++ S  I C+L+K+GQR              +++LI KF+++S 
Sbjct: 9    LSFSVALPWRHDPPQHSKLSLQIQCALTKQGQRFLTKLAANAGNPSVANKLISKFLSTSP 68

Query: 1284 KSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDV 1111
            KS AL TLS+LLS    H   SSLALPMY  ITEASWFEWNPKLV+ ++A L  QGQ   
Sbjct: 69   KSTALTTLSYLLSPHTAHPHLSSLALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQ 128

Query: 1110 SEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL--N 937
            SE LIS+++ KL ++EREL  F+C L+ S+SK  SK G FD               +   
Sbjct: 129  SEALISETISKLGNKERELVQFHCQLVESHSKMSSKCG-FDRACTYLHQLLQNSSSVYVK 187

Query: 936  RQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLD 757
            R+A+ESM+ GLC +D P +A+E++E+MR++G K S FEFRSVV  YGRLG+  +M +++D
Sbjct: 188  RRAFESMVGGLCAMDRPGEADELIEEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVD 247

Query: 756  QMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIM 577
            QMEK G+  DTI  N+VLSSYG + EL+ M  W++KMK S+VPFSVRTYNSVLNSCPTIM
Sbjct: 248  QMEKQGFGDDTICCNMVLSSYGAHNELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIM 307

Query: 576  AMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGT 397
            AML+EP+ +P S+ +L   +   E ++V+EL+ GS+V+ E + WDS E KLDLHGMHLG+
Sbjct: 308  AMLQEPKAVPCSVGELSGVLDGDEALVVKELV-GSAVVDEAMVWDSAEAKLDLHGMHLGS 366

Query: 396  AYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKS 217
            AY+++LEW + + +RF       V+PAE+ +VCGLGKHSSV G+SP+K LV EMM +++S
Sbjct: 367  AYLVMLEWFEAMGNRFKSA--ECVVPAEVVIVCGLGKHSSVRGESPVKDLVKEMMHQMES 424

Query: 216  PMRIGRKNDVGSFVGKGKAVKDWLC 142
            PMRI RKN VG F+ KG+AVKDWLC
Sbjct: 425  PMRIDRKN-VGCFIAKGRAVKDWLC 448


>ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina]
            gi|568866680|ref|XP_006486677.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g17033-like [Citrus sinensis]
            gi|557524456|gb|ESR35762.1| hypothetical protein
            CICLE_v10028424mg [Citrus clementina]
          Length = 451

 Score =  426 bits (1094), Expect = e-116
 Identities = 223/397 (56%), Positives = 291/397 (73%), Gaps = 5/397 (1%)
 Frame = -1

Query: 1317 RLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNVI 1144
            RLI KFVASS + IALN LSHLLS D  H   SSLA P+Y  ITE SWF+WNPKLV+ +I
Sbjct: 59   RLISKFVASSPQFIALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEII 118

Query: 1143 ASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXX 964
            A L+ QGQ + +E LI +++ KL S+EREL LFYCNLI+S+ K+ SK+G  D+Y      
Sbjct: 119  AFLDKQGQREEAETLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQL 178

Query: 963  XXXXXXXL-NRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLG 787
                      RQA +SMI+GLC +  PH+AE ++E+MR++G +PS FE++ ++  YGRLG
Sbjct: 179  VNSSSSVYVKRQALKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLG 238

Query: 786  LLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYN 607
            LL DM R+++QME  G  +DT+ SN+VLSSYGD+ ELS MV+W++KMK S +PFSVRTYN
Sbjct: 239  LLEDMERIVNQMESDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYN 298

Query: 606  SVLNSCPTIMAMLKE--PEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLE 433
            SVLNSC TIM+ML++      PLSI +L + + + E  +V+EL D SSVL E +KWDS E
Sbjct: 299  SVLNSCSTIMSMLQDLNSNDFPLSILELTEVLNEEEVSVVKELED-SSVLDEAMKWDSGE 357

Query: 432  GKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIK 253
             KLDLHGMHLG+AY I+L+WMD +++RF    +  VIPAEI VVCG GKHS+V G+S +K
Sbjct: 358  TKLDLHGMHLGSAYFIILQWMDEMRNRF--NNEKHVIPAEITVVCGSGKHSTVRGESSVK 415

Query: 252  ALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142
            A+V +MMVR  SPMR+ R N++G F+ KG  VKDWLC
Sbjct: 416  AMVKKMMVRTSSPMRVHR-NNIGCFIAKGHVVKDWLC 451


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  425 bits (1093), Expect = e-116
 Identities = 234/443 (52%), Positives = 309/443 (69%), Gaps = 3/443 (0%)
 Frame = -1

Query: 1461 LQFSVSLKWGNHHHHNQSSAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASSSK 1282
            LQ S    W NH    +S   I C+LSK+GQ               S+RLI KF+ASSSK
Sbjct: 5    LQVSRPQPW-NH----RSPLLIQCALSKQGQ---LFLSSVARDPSASNRLICKFIASSSK 56

Query: 1281 SIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVS 1108
            SIALN LSHLLS    H   SSLALP+Y  I+EASWF WNPKL+++VIA L  QGQ   +
Sbjct: 57   SIALNALSHLLSPTTTHPYLSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEA 116

Query: 1107 EKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXLNRQ- 931
            E L+S+++ KL S+ER+L  FYCNLI+S+SK+ S +GVFD                 ++ 
Sbjct: 117  ETLVSETLIKLGSRERDLVSFYCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKER 176

Query: 930  AYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQM 751
            AY+SMI+ LC + LP +AE ++E+MR++G KPS FEFRSVV  YGR+GL  DM+R+L QM
Sbjct: 177  AYKSMISSLCAVGLPLEAENLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQM 236

Query: 750  EKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAM 571
               G+ LDT+ SN+VLSSYG   + SEMV W+++MK+S++PFS+RTYNSVLNSCP IM++
Sbjct: 237  GNEGFELDTVVSNMVLSSYGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSI 296

Query: 570  LKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAY 391
            L++ +  P +I++L++ +K  E +LV+ELI GS VL E ++WD  EGKLDLHGMHLG+AY
Sbjct: 297  LQDLKTFPPTIDELMETLKGDEALLVKELI-GSMVLAELMEWDCSEGKLDLHGMHLGSAY 355

Query: 390  VILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPM 211
            +I+L+W + L+ R        V+P EI VVCG GKHSSV G+SP+K +V EMM R +SPM
Sbjct: 356  LIMLQWREELRYRLNAA--EYVMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPM 413

Query: 210  RIGRKNDVGSFVGKGKAVKDWLC 142
            +I RKN +G FV K K VK+WLC
Sbjct: 414  KIDRKN-IGCFVAKAKVVKNWLC 435


>ref|XP_010086846.1| hypothetical protein L484_006076 [Morus notabilis]
            gi|587833217|gb|EXB24044.1| hypothetical protein
            L484_006076 [Morus notabilis]
          Length = 517

 Score =  419 bits (1078), Expect = e-114
 Identities = 224/426 (52%), Positives = 302/426 (70%), Gaps = 3/426 (0%)
 Frame = -1

Query: 1410 SSAAIICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASSSKSIALNTLSHLLSSDIRH 1231
            +S++I C+L+K+G R              +++LI KFVASS KSI+LN LSHLLS D  H
Sbjct: 96   ASSSIQCALTKQGHRFLSTLSINAGNASAANKLIGKFVASSPKSISLNALSHLLSPDTTH 155

Query: 1230 H--SSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQERE 1057
               +S +L +Y  I EASWF ++PKLV+ + A L+ QG++  +E LI+++V KL  ++RE
Sbjct: 156  THLTSHSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIAEAVSKLGHRQRE 215

Query: 1056 LALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXLNR-QAYESMINGLCTLDLPHD 880
            LA+FYC+L+ S+SK  SK G   SY               + +A+E+M+  LCT+D P +
Sbjct: 216  LAVFYCSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYVKCRAFETMVGALCTMDRPCE 275

Query: 879  AEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIVLS 700
            AE +ME+MR +G KPS FEFRS+V  YGRLGL  DM R ++QME  G  +DTI SN+VLS
Sbjct: 276  AESLMEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGLVIDTICSNMVLS 335

Query: 699  SYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLIDN 520
            SYG + EL +MV+W++KM++S++PFS+RTYNSVLN CPTI AML++ + +PLS+ +L   
Sbjct: 336  SYGAHNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLKDIPLSMYELNAT 395

Query: 519  VKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGG 340
            ++  EG+LV EL+ GSSVL E L WDSLE KLDLHGMHLG+AY+I+LEWM+ +  RF  G
Sbjct: 396  LRGDEGLLVMELV-GSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEWMEEMTRRFNDG 454

Query: 339  GDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKA 160
                 IPAE+ VVCG GKHS+V G SP+K LV EMMV++KSPM+I RKN  G F+ KGK 
Sbjct: 455  NHG--IPAEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKN-AGCFLAKGKT 511

Query: 159  VKDWLC 142
            V+DWLC
Sbjct: 512  VRDWLC 517


>ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297329872|gb|EFH60291.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 504

 Score =  408 bits (1049), Expect = e-111
 Identities = 212/396 (53%), Positives = 284/396 (71%), Gaps = 3/396 (0%)
 Frame = -1

Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIR--HHSSLALPMYEWITEASWFEWNPKLVSNV 1147
            HR I+KFVA+S KS+ LN LSHLLS      H S  AL +Y  ITEASWF+WNPKL++ +
Sbjct: 112  HRHIKKFVAASPKSVTLNVLSHLLSDQTSYPHLSFFALSLYSEITEASWFDWNPKLIAEL 171

Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDS-YXXXX 970
            +A L NQ +FD SE L+S +V +L+S ER+ ALF CNL+ S SK GS +G  ++ +    
Sbjct: 172  VAVLNNQERFDESETLLSTAVSRLKSNERDFALFLCNLVESNSKQGSIQGFNEACFRLRE 231

Query: 969  XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790
                     +  QAY+SM+ GLC +D PHDAE ++E+MR+E  KP  FE +SV+  YGRL
Sbjct: 232  RIQRSSSVYVKTQAYKSMVAGLCNMDQPHDAERVIEEMRVEKIKPGSFEHKSVLYGYGRL 291

Query: 789  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610
            GL +DM RV+ +ME  G+ +DT+ SN+VLSSYG +  L +M  W++K+K  NVPFS+RTY
Sbjct: 292  GLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 351

Query: 609  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430
            NSVLNSCPTIM++LK+    P+S+ +L   + + E +LV EL   S+VL E ++W+++EG
Sbjct: 352  NSVLNSCPTIMSLLKDLNSCPVSLSELRTFLNEDEALLVLELTQ-STVLDEAIEWNAVEG 410

Query: 429  KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250
            KLDLHGMHL ++Y+ILL+WMD ++ RF       VIPAEI VV G GKHS+V G+SP+KA
Sbjct: 411  KLDLHGMHLSSSYLILLQWMDEIRLRF--RDQKCVIPAEIVVVSGSGKHSNVRGESPVKA 468

Query: 249  LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142
            LV ++MVR +SPMRI RKN VGSF+ KGK VK+WLC
Sbjct: 469  LVKKIMVRTESPMRIDRKN-VGSFIAKGKNVKEWLC 503


>ref|XP_007041729.1| Pentatricopeptide (PPR) repeat-containing protein, putative
            [Theobroma cacao] gi|508705664|gb|EOX97560.1|
            Pentatricopeptide (PPR) repeat-containing protein,
            putative [Theobroma cacao]
          Length = 456

 Score =  407 bits (1047), Expect = e-110
 Identities = 216/441 (48%), Positives = 299/441 (67%), Gaps = 10/441 (2%)
 Frame = -1

Query: 1431 NHHHHNQSSAAIICS-----LSKKGQRXXXXXXXXXXXXXXS--HRLIRKFVASSSKSIA 1273
            NH H   +  +I C      L+K+G R              +  +RLI+KFVASS KSIA
Sbjct: 14   NHRHLRPTRPSIKCESGGVPLTKQGHRFFSSLAATADVNDPATANRLIKKFVASSPKSIA 73

Query: 1272 LNTLSHLLS--SDIRHHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKL 1099
            LN LSHLLS  +   H S+LA P+Y  I+E SW+ WNPKLV+ +IA L  QG++D SE L
Sbjct: 74   LNALSHLLSPRNSHPHLSALAFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEAL 133

Query: 1098 ISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-NRQAYE 922
            IS +V KL+ +ER+L  FYCN I S SK+ SK+G  D+Y                RQ Y+
Sbjct: 134  ISQAVSKLKFRERDLVQFYCNWIESCSKHNSKEGFNDAYCYLSELICNSSSVYVKRQGYK 193

Query: 921  SMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKS 742
            SM++ LC +D P++AE ++E+MR  G  P+ FEFR +   YG+LGL  DM R++ +ME  
Sbjct: 194  SMVSSLCEMDRPNEAENLVEEMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIE 253

Query: 741  GYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKE 562
            G+ +DTI SN+VLSSYG     S+MV W++KMK+  +PFS+RTYNSVLNSCP IM++++ 
Sbjct: 254  GFEVDTICSNMVLSSYGAYNAFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQG 313

Query: 561  PEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVIL 382
             + +PLS+ +L   + + E +LVQEL+  SSVL E ++W+  EGKLDLHGMHLG+AY+I+
Sbjct: 314  LDSVPLSLGELAKILNEDEALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIM 373

Query: 381  LEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIG 202
            L+W++ +K RF    +  VIPA+I +VCG GKHSSV G+SP+K L+ +MMV++KSPM+I 
Sbjct: 374  LQWIEEMKCRF--KVEECVIPAQITIVCGSGKHSSVRGESPVKTLMRKMMVKMKSPMKID 431

Query: 201  RKNDVGSFVGKGKAVKDWLCP 139
            RKN +G F+ KG+ VK+WL P
Sbjct: 432  RKN-IGCFIAKGQVVKNWLIP 451


>ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein
            [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 505

 Score =  406 bits (1044), Expect = e-110
 Identities = 212/396 (53%), Positives = 284/396 (71%), Gaps = 3/396 (0%)
 Frame = -1

Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 1147
            +R I+KFVA+S KS+ALN LSHLLS    H   S  AL +Y  ITEASWF+WNPKL++ +
Sbjct: 113  NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 172

Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFD-SYXXXX 970
            IA L  Q +FD SE L+S +V +L+S ER+  LF CNL+ S SK GS +G  + S+    
Sbjct: 173  IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 232

Query: 969  XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790
                     +  QAY+SM++GLC +D PHDAE ++E+MR+E  KP  FE++SV+  YGRL
Sbjct: 233  IIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGYGRL 292

Query: 789  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610
            GL +DM RV+ +M   G+ +DT+ SN+VLSSYG +  L +M  W++K+K  NVPFS+RTY
Sbjct: 293  GLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 352

Query: 609  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430
            NSVLNSCPTI++MLK+ +  P+S+ +L   + + E +LV EL   SSVL E ++W+++EG
Sbjct: 353  NSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQ-SSVLDEAIEWNAVEG 411

Query: 429  KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250
            KLDLHGMHL ++Y+ILL+WMD  + RF    +  VIPAEI VV G GKHS+V G+SP+KA
Sbjct: 412  KLDLHGMHLSSSYLILLQWMDETRLRF--SEEKCVIPAEIVVVSGSGKHSNVRGESPVKA 469

Query: 249  LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142
            LV ++MVR  SPMRI RKN VGSF+ KGK VK+WLC
Sbjct: 470  LVKKIMVRTGSPMRIDRKN-VGSFIAKGKTVKEWLC 504


>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score =  406 bits (1044), Expect = e-110
 Identities = 212/396 (53%), Positives = 284/396 (71%), Gaps = 3/396 (0%)
 Frame = -1

Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 1147
            +R I+KFVA+S KS+ALN LSHLLS    H   S  AL +Y  ITEASWF+WNPKL++ +
Sbjct: 109  NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 168

Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFD-SYXXXX 970
            IA L  Q +FD SE L+S +V +L+S ER+  LF CNL+ S SK GS +G  + S+    
Sbjct: 169  IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 228

Query: 969  XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790
                     +  QAY+SM++GLC +D PHDAE ++E+MR+E  KP  FE++SV+  YGRL
Sbjct: 229  IIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGYGRL 288

Query: 789  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610
            GL +DM RV+ +M   G+ +DT+ SN+VLSSYG +  L +M  W++K+K  NVPFS+RTY
Sbjct: 289  GLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 348

Query: 609  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430
            NSVLNSCPTI++MLK+ +  P+S+ +L   + + E +LV EL   SSVL E ++W+++EG
Sbjct: 349  NSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQ-SSVLDEAIEWNAVEG 407

Query: 429  KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250
            KLDLHGMHL ++Y+ILL+WMD  + RF    +  VIPAEI VV G GKHS+V G+SP+KA
Sbjct: 408  KLDLHGMHLSSSYLILLQWMDETRLRF--SEEKCVIPAEIVVVSGSGKHSNVRGESPVKA 465

Query: 249  LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142
            LV ++MVR  SPMRI RKN VGSF+ KGK VK+WLC
Sbjct: 466  LVKKIMVRTGSPMRIDRKN-VGSFIAKGKTVKEWLC 500


>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein
            [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown
            protein [Arabidopsis thaliana]
            gi|330251481|gb|AEC06575.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  406 bits (1044), Expect = e-110
 Identities = 212/396 (53%), Positives = 284/396 (71%), Gaps = 3/396 (0%)
 Frame = -1

Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 1147
            +R I+KFVA+S KS+ALN LSHLLS    H   S  AL +Y  ITEASWF+WNPKL++ +
Sbjct: 112  NRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 171

Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFD-SYXXXX 970
            IA L  Q +FD SE L+S +V +L+S ER+  LF CNL+ S SK GS +G  + S+    
Sbjct: 172  IALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEASFRLRE 231

Query: 969  XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790
                     +  QAY+SM++GLC +D PHDAE ++E+MR+E  KP  FE++SV+  YGRL
Sbjct: 232  IIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVLYGYGRL 291

Query: 789  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610
            GL +DM RV+ +M   G+ +DT+ SN+VLSSYG +  L +M  W++K+K  NVPFS+RTY
Sbjct: 292  GLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVPFSIRTY 351

Query: 609  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430
            NSVLNSCPTI++MLK+ +  P+S+ +L   + + E +LV EL   SSVL E ++W+++EG
Sbjct: 352  NSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQ-SSVLDEAIEWNAVEG 410

Query: 429  KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250
            KLDLHGMHL ++Y+ILL+WMD  + RF    +  VIPAEI VV G GKHS+V G+SP+KA
Sbjct: 411  KLDLHGMHLSSSYLILLQWMDETRLRF--SEEKCVIPAEIVVVSGSGKHSNVRGESPVKA 468

Query: 249  LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142
            LV ++MVR  SPMRI RKN VGSF+ KGK VK+WLC
Sbjct: 469  LVKKIMVRTGSPMRIDRKN-VGSFIAKGKTVKEWLC 503


>ref|XP_012837220.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Erythranthe guttatus] gi|604333640|gb|EYU37991.1|
            hypothetical protein MIMGU_mgv1a006093mg [Erythranthe
            guttata]
          Length = 458

 Score =  402 bits (1033), Expect = e-109
 Identities = 213/433 (49%), Positives = 288/433 (66%), Gaps = 3/433 (0%)
 Frame = -1

Query: 1398 IICSLSKKGQRXXXXXXXXXXXXXXSHRLIRKFVASSSKSIALNTLSHLLSSDIRHH--S 1225
            ++C L+K+GQR                 L+RKFVASSSK +AL+TLSHLLS    H   S
Sbjct: 26   LVCVLTKQGQRLLSSIATSEQPSAAIS-LLRKFVASSSKHVALSTLSHLLSPSTSHPRLS 84

Query: 1224 SLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQERELALF 1045
            SLA P+Y  I + SWF WN KLV+++I+ L    +FD ++ L  ++V KL  +ER+L  F
Sbjct: 85   SLAFPLYGIIEQESWFTWNSKLVADLISLLYKAERFDEADNLFGETVSKLGFKERDLCTF 144

Query: 1044 YCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXLNRQ-AYESMINGLCTLDLPHDAEEI 868
            YCNL++S++K+ S++GV DS                +Q  YESMI G C +  P  AE +
Sbjct: 145  YCNLVDSHAKHMSERGVSDSCTRLKQLILASSSVYVKQKGYESMIAGFCEIGSPDKAENL 204

Query: 867  MEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGD 688
            ME+MR  G KPS FE R++V  YG++GLL DM+R + QMEK G+ LDT+  N+VLSS+G 
Sbjct: 205  MEEMRQNGLKPSAFELRTLVYGYGQMGLLEDMKRSVGQMEKEGFELDTVCYNMVLSSFGA 264

Query: 687  NKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKV 508
              E  +M++W+KKM++S +PFS+RTYNSVLNSCPT++ +L++ + LPLS+ +L+DN+K  
Sbjct: 265  RNEFLDMLLWLKKMRNSGIPFSIRTYNSVLNSCPTVILLLEDMKSLPLSVNELVDNLKTG 324

Query: 507  EGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSV 328
            E  LV EL+  S VL + ++W S E KLD+HGMHL TAY+ILL+W   LK RFG G    
Sbjct: 325  EADLVLELMK-SDVLDQVMEWKSTELKLDMHGMHLSTAYLILLQWFKELKVRFGDGNHET 383

Query: 327  VIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDW 148
              P EI VVCG GKHSS  G+SP+K L  EM+ R+K P+RI RKN +G F+GKGK  KDW
Sbjct: 384  --PTEILVVCGSGKHSSKRGESPVKVLAKEMVTRMKCPLRIDRKN-IGCFIGKGKTFKDW 440

Query: 147  LCPVVSNETLARL 109
            LC   SN+  A +
Sbjct: 441  LCNEDSNKNPAEI 453


>ref|XP_012077696.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Jatropha curcas]
          Length = 473

 Score =  401 bits (1031), Expect = e-109
 Identities = 224/447 (50%), Positives = 302/447 (67%), Gaps = 19/447 (4%)
 Frame = -1

Query: 1428 HHHHNQ--------SSAAII---CSLSKKGQRXXXXXXXXXXXXXXS--HRLIRKFVASS 1288
            HHHH Q        SS   +    +LSK+GQR              S  + LI+KFVA+S
Sbjct: 29   HHHHIQVGPLETKLSSKWRVFECAALSKQGQRFLSSLATATAARDNSATNSLIKKFVAAS 88

Query: 1287 SKSIALNTLSHLLS--SDIRHHSSLALPMYEWITEASWFEWNPKLVSNVIASLENQGQFD 1114
             KSIAL+ LSHLLS  S   H SSLA P+Y  I EA WF+WNPKLV+ V+A L+ QGQ++
Sbjct: 89   PKSIALDALSHLLSPNSSYSHLSSLAFPLYLKIQEAHWFDWNPKLVAEVVALLDKQGQYN 148

Query: 1113 VSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-N 937
             S  LISDS+ KL+ +ER+LALFYCNL+ S+SK    +G  DS+                
Sbjct: 149  ESGTLISDSISKLKLRERDLALFYCNLVESHSKQNCVQGFEDSFARLNQLVFSSNSVYIK 208

Query: 936  RQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLD 757
            +QAY+SMI+GLC +  P +A++++E+MR +G KPS +EFR V+ AYG+LGL  +M+ +LD
Sbjct: 209  KQAYKSMISGLCEMGRPKEAQDLIEEMRGKGVKPSVYEFRCVLHAYGKLGLFQEMQMILD 268

Query: 756  QMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIM 577
            QME  G+ +DT+ SN+VLSSYG    L E+V W+KKMK   +PFS RT NSVLNSCPT+M
Sbjct: 269  QMESGGFKVDTVCSNMVLSSYGVYNALPEIVSWLKKMKDLGIPFSSRTCNSVLNSCPTMM 328

Query: 576  AMLK--EPEFLPLSIEDLIDNVKKVEGMLVQELIDG-SSVLLENLKWDSLEGKLDLHGMH 406
            + ++       P+SI++L+  ++  E M+V ELI G SSVL E ++WD+LE KLDLHGMH
Sbjct: 329  STVQNSNANTYPISIQELMKILRGDEAMVVNELIIGSSSVLEEAMQWDALESKLDLHGMH 388

Query: 405  LGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVR 226
            L +AY+I+L W + +K+RF GG  + VIPAEI VVCG G HS V G+SP+K ++  +MV+
Sbjct: 389  LCSAYLIMLLWFEEMKNRFNGG--NYVIPAEITVVCGSGNHSIVRGESPVKRMIKSIMVQ 446

Query: 225  LKSPMRIGRKNDVGSFVGKGKAVKDWL 145
             +SPMR+ RKN +G F+ KGK VK+WL
Sbjct: 447  TRSPMRVDRKN-LGCFIAKGKVVKEWL 472


>ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum]
            gi|557110519|gb|ESQ50810.1| hypothetical protein
            EUTSA_v10022675mg [Eutrema salsugineum]
          Length = 469

 Score =  401 bits (1031), Expect = e-109
 Identities = 206/396 (52%), Positives = 285/396 (71%), Gaps = 3/396 (0%)
 Frame = -1

Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIRHH--SSLALPMYEWITEASWFEWNPKLVSNV 1147
            +R I+KFVA+S KS++LN LSHLLS+   H   S  AL +Y  ITEASWF+WNPKL++ +
Sbjct: 77   NRHIKKFVAASPKSVSLNVLSHLLSAQTSHPHLSFFALSLYSEITEASWFDWNPKLIAEL 136

Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDS-YXXXX 970
            +A L  Q +   SE L+S++V +L+S ER++ALFYCNL+ S SK GS +G  ++      
Sbjct: 137  VALLNKQERSHESETLLSNAVSRLKSNERDIALFYCNLVESNSKQGSIQGFNEACVRLRE 196

Query: 969  XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790
                     +  QAY+SM++GLC +D PHDAE ++E+MR+   KP  FE++SV+  YGRL
Sbjct: 197  ITRRSTSVYVKTQAYKSMVSGLCNMDQPHDAESVIEEMRIAKIKPGLFEYKSVLYGYGRL 256

Query: 789  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610
            GL  DM RV+ +ME  G+ +DT+ SN+VLSSYG +  L +M  W++K+K SNVP S RTY
Sbjct: 257  GLFEDMNRVVHRMETEGHKIDTVCSNMVLSSYGAHNALPQMGSWLQKLKDSNVPLSERTY 316

Query: 609  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430
            NSVLNSCPTI+++LK+ +  P+S+ +L+  + K E +LV+ L   SSVL E ++W SLEG
Sbjct: 317  NSVLNSCPTILSLLKDLDSCPVSLSELLTFLNKDEEVLVRGLTQ-SSVLDEAIEWSSLEG 375

Query: 429  KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250
            KLDLHGMHL ++Y+I+++WMD ++ RF  G    V+PAEI +V G GKHS+V G+SP+KA
Sbjct: 376  KLDLHGMHLSSSYLIMMQWMDEMRIRFSEG--KCVVPAEIVLVSGSGKHSNVRGESPVKA 433

Query: 249  LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142
            LV ++MVR  SPMRI RKN +GSF+ KGK VK+WLC
Sbjct: 434  LVKKIMVRTGSPMRIDRKN-IGSFIAKGKTVKEWLC 468


>ref|XP_010548124.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Tarenaya hassleriana] gi|729371006|ref|XP_010548125.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At2g17033 [Tarenaya hassleriana]
          Length = 462

 Score =  398 bits (1022), Expect = e-108
 Identities = 206/396 (52%), Positives = 281/396 (70%), Gaps = 3/396 (0%)
 Frame = -1

Query: 1320 HRLIRKFVASSSKSIALNTLSHLLS--SDIRHHSSLALPMYEWITEASWFEWNPKLVSNV 1147
            +R IRKFVA+S KS+ALN LSHLLS  +   H SS+AL +Y  I EA WF+WNPKLV+++
Sbjct: 70   NRQIRKFVAASPKSVALNVLSHLLSPLNSHPHLSSIALNLYSEIAEAPWFDWNPKLVADL 129

Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXX 967
            +A L  Q QF  SE L+S +V +L+  ER LALF+CNL+ S SK GS +G  DSY     
Sbjct: 130  VALLNKQEQFPESESLLSAAVSRLKPNERGLALFHCNLVESNSKQGSTRGFNDSYSCLRE 189

Query: 966  XXXXXXXXLNR-QAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790
                      + Q Y+S+++GLC +D P+DAE ++ +M+ EG KP  FE+RSV+  YGRL
Sbjct: 190  IIQRSSSVYVKSQGYKSIVSGLCNMDRPYDAERVLAEMKTEGIKPELFEYRSVLYGYGRL 249

Query: 789  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610
            GL  DM R + +ME  G+ +DT+ SN+VLSSYG    L EM  W++K+K   +P S+RTY
Sbjct: 250  GLFFDMNRTVHEMESDGHKIDTVCSNMVLSSYGARDALPEMGSWLQKLKGFGIPLSIRTY 309

Query: 609  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430
            NSVLNSCPTI ++LK+ +  P+S+ +L   + + E +L +EL+  SSVL E ++W++LEG
Sbjct: 310  NSVLNSCPTITSLLKDLDSCPVSLSELTGLLNEDEMLLTRELVQ-SSVLDEAMEWNALEG 368

Query: 429  KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250
            KLDLHGMHL ++Y+I+++WMD ++ RF  G    VIP EI +V G GKHS+V G+SP+KA
Sbjct: 369  KLDLHGMHLSSSYLIMMQWMDKVRIRFEEG--KHVIPVEIVIVSGSGKHSNVRGESPVKA 426

Query: 249  LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142
            LV ++MVR  SPMRI RKN +GSF+ KGKAVK+WLC
Sbjct: 427  LVKKIMVRTGSPMRIDRKN-IGSFIAKGKAVKEWLC 461


>ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella]
            gi|482566151|gb|EOA30340.1| hypothetical protein
            CARUB_v10013465mg [Capsella rubella]
          Length = 516

 Score =  397 bits (1019), Expect = e-107
 Identities = 207/396 (52%), Positives = 283/396 (71%), Gaps = 3/396 (0%)
 Frame = -1

Query: 1320 HRLIRKFVASSSKSIALNTLSHLLSSDIRH-HSSLALP-MYEWITEASWFEWNPKLVSNV 1147
            +RLI+KFVA+S KS+ALN LSHLLS +  H H S   P +Y  ITEASWF+WNPKL+  +
Sbjct: 123  NRLIKKFVAASPKSVALNVLSHLLSDNTSHPHLSYFAPQLYLEITEASWFDWNPKLIGEL 182

Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDSYXXXXX 967
            ++ L  Q +F  SE L+S +V +LES ER+ ALF CNL+ S SK GS +G  D+      
Sbjct: 183  VSLLNKQERFVESETLLSTAVSRLESNERDFALFLCNLVESNSKQGSIQGFSDACSRLRE 242

Query: 966  XXXXXXXXLNR-QAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790
                      + QAY+SM++GLC +D P DAE ++E+MR+E  KP  FE++SV+  YGRL
Sbjct: 243  IIQRSSSVYVKTQAYKSMVSGLCNMDQPLDAERVIEEMRMETIKPGLFEYKSVLYGYGRL 302

Query: 789  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610
            GL +DM R++ +ME  G+ +DT+ SN+VLSSYG +  L +M  W++K+K  NVP S+RTY
Sbjct: 303  GLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGYNVPLSIRTY 362

Query: 609  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430
            NSVLNSCPTI+++LK+ +  PLS+ +L+  + + E +LV+EL   S VL E ++W+++EG
Sbjct: 363  NSVLNSCPTIISLLKDLDSCPLSLSELLPILNEDEALLVRELTQ-SLVLDEAIEWNAVEG 421

Query: 429  KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250
            KLDLHGMHL  +Y+I+L+WMD  + RF       V+PAEI VV G GKHS+V G+SP+KA
Sbjct: 422  KLDLHGMHLSASYLIMLQWMDETRLRF-SEDKKCVVPAEIVVVSGSGKHSNVRGESPVKA 480

Query: 249  LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142
            +V ++MVR KSPMRI RKN VGSF+ KGK VK+WLC
Sbjct: 481  MVKKIMVRTKSPMRIDRKN-VGSFIAKGKNVKEWLC 515


>ref|XP_013622437.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Brassica oleracea var. oleracea]
            gi|922432660|ref|XP_013622438.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g17033
            [Brassica oleracea var. oleracea]
            gi|922432662|ref|XP_013622439.1| PREDICTED:
            pentatricopeptide repeat-containing protein At2g17033
            [Brassica oleracea var. oleracea]
          Length = 461

 Score =  396 bits (1018), Expect = e-107
 Identities = 205/396 (51%), Positives = 282/396 (71%), Gaps = 3/396 (0%)
 Frame = -1

Query: 1320 HRLIRKFVASSSKSIALNTLSHLLS--SDIRHHSSLALPMYEWITEASWFEWNPKLVSNV 1147
            +R I+KFVA+S KS++LN LSHLLS  + + H S  AL +Y  IT+ASWF+WNPKL++++
Sbjct: 69   NRHIKKFVAASPKSVSLNVLSHLLSPHTSLPHLSFFALSLYSEITDASWFDWNPKLIADL 128

Query: 1146 IASLENQGQFDVSEKLISDSVQKLESQERELALFYCNLINSYSKYGSKKGVFDS-YXXXX 970
            +A L  Q +F  SE L+S +V  L+S ER+ ALF CNL  S SK GS +G  ++      
Sbjct: 129  VALLNKQERFHESETLLSTAVTNLKSNERDFALFLCNLAESNSKQGSAQGFKEACLRLRE 188

Query: 969  XXXXXXXXXLNRQAYESMINGLCTLDLPHDAEEIMEKMRLEGFKPSDFEFRSVVLAYGRL 790
                     +  QAY+SM++GLC +D P+DAE ++E+MRLE  KP  FE++SV+  YGRL
Sbjct: 189  VLQTSSSVYVKTQAYKSMVSGLCNMDQPNDAETVIEEMRLEKLKPGVFEYKSVLYGYGRL 248

Query: 789  GLLNDMRRVLDQMEKSGYTLDTISSNIVLSSYGDNKELSEMVVWIKKMKSSNVPFSVRTY 610
            GL +DM R++ +ME  G+ +DT+ SN+VLSSYG +  L +M  W++++K  NVP S+RTY
Sbjct: 249  GLFDDMNRIVHRMETEGHRVDTVCSNMVLSSYGAHDALPQMGSWLQRLKDFNVPLSLRTY 308

Query: 609  NSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVKKVEGMLVQELIDGSSVLLENLKWDSLEG 430
            N+VLNSCPT+ +MLK+    PLS+ +++  + + E +LV+ L   SSVL E ++W SLEG
Sbjct: 309  NTVLNSCPTVTSMLKDLNSCPLSVSEVLTFLNEDEVVLVRALTQ-SSVLHEAMEWSSLEG 367

Query: 429  KLDLHGMHLGTAYVILLEWMDVLKSRFGGGGDSVVIPAEIRVVCGLGKHSSVIGKSPIKA 250
            KLDLHGMHL +AY+I+++WMD +K RF   G+  V+PAEI VV G GKHSSV G+SP+KA
Sbjct: 368  KLDLHGMHLSSAYLIMMQWMDEIKVRF--SGEKCVVPAEIVVVSGSGKHSSVRGESPVKA 425

Query: 249  LVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVKDWLC 142
            LV +MMVR  SPMRI RKN VG F+ KGK VK+W C
Sbjct: 426  LVKKMMVRTGSPMRIDRKN-VGCFIAKGKTVKEWFC 460


>emb|CDP14534.1| unnamed protein product [Coffea canephora]
          Length = 449

 Score =  396 bits (1017), Expect = e-107
 Identities = 213/424 (50%), Positives = 295/424 (69%), Gaps = 5/424 (1%)
 Frame = -1

Query: 1398 IICSLSKKGQRXXXXXXXXXXXXXXSH-RLIRKFVASSSKSIALNTLSHLLSSDIRH-HS 1225
            + CSL K+GQR              +H R +RKFV +SSK +AL+TLSHLLS    H H 
Sbjct: 30   VCCSLCKQGQRFLSSLATTDESSSAAHHRSLRKFVKTSSKHVALDTLSHLLSPTTAHPHL 89

Query: 1224 S--LALPMYEWITEASWFEWNPKLVSNVIASLENQGQFDVSEKLISDSVQKLESQERELA 1051
            S  LALP+Y  I++ASWF WN KL+++V A +  Q +F  +E LI  +++KL + +R+L 
Sbjct: 90   SYHLALPLYLIISQASWFSWNAKLLADVTALMYKQERFIEAEALILQALKKLPAHDRDLC 149

Query: 1050 LFYCNLINSYSKYGSKKGVFDSYXXXXXXXXXXXXXL-NRQAYESMINGLCTLDLPHDAE 874
             FYC+L++S +K+ S+KGVFDS                 ++AYESMI+GLC + LP +AE
Sbjct: 150  NFYCHLLHSNAKHRSRKGVFDSLTSLKQLLARSSSVYVQKRAYESMISGLCEIGLPGEAE 209

Query: 873  EIMEKMRLEGFKPSDFEFRSVVLAYGRLGLLNDMRRVLDQMEKSGYTLDTISSNIVLSSY 694
             +ME+MR  G KPS FEF+S+V AYGRLGL  DM+R + QME +G  LDT+ SN+VLSS 
Sbjct: 210  NLMEEMRGVGLKPSGFEFKSLVHAYGRLGLFEDMKRSVTQMEDAGVELDTVCSNMVLSSL 269

Query: 693  GDNKELSEMVVWIKKMKSSNVPFSVRTYNSVLNSCPTIMAMLKEPEFLPLSIEDLIDNVK 514
            G +K  SEMV W+++MK S V FS+RTYNSVLNSCPT++ +L++P+ +PLS+EDL+ N+ 
Sbjct: 270  GSHKVFSEMVSWLRRMKDSEVSFSIRTYNSVLNSCPTLILLLQDPKTIPLSMEDLMGNLS 329

Query: 513  KVEGMLVQELIDGSSVLLENLKWDSLEGKLDLHGMHLGTAYVILLEWMDVLKSRFGGGGD 334
            + E  LV+EL+  SSVL E ++ +S E KLDLHGMHL T+ +I L+W+D L+ RF  G +
Sbjct: 330  QEEADLVRELV-ASSVLDEAMECNSAELKLDLHGMHLSTSCLIFLQWIDRLRLRFSAGDN 388

Query: 333  SVVIPAEIRVVCGLGKHSSVIGKSPIKALVSEMMVRLKSPMRIGRKNDVGSFVGKGKAVK 154
              ++P +I VVCG GKHS+  G+SP+K L+ EM++R+K P+RI R+N +G FV KGK   
Sbjct: 389  --MVPTQITVVCGSGKHSASRGESPVKGLLREMILRIKCPLRIDRRN-LGCFVAKGKVFS 445

Query: 153  DWLC 142
            DWLC
Sbjct: 446  DWLC 449

