BLASTX nr result

ID: Rauwolfia21_contig00001061 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rauwolfia21_contig00001061
         (1645 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi...   468   e-129
ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containi...   462   e-127
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   447   e-123
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   445   e-122
gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein...   444   e-122
gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus pe...   426   e-116
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...   417   e-114
ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Popu...   414   e-113
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   410   e-112
gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]     409   e-111
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   407   e-111
ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps...   405   e-110
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   405   e-110
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           405   e-110
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   405   e-110
ref|XP_002884032.1| pentatricopeptide repeat-containing protein ...   402   e-109
gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucu...   392   e-106
gb|AGH33847.1| PPR [Cucumis melo]                                     390   e-106
ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223...   390   e-105
ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204...   390   e-105

>ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum tuberosum]
          Length = 459

 Score =  468 bits (1203), Expect = e-129
 Identities = 243/436 (55%), Positives = 316/436 (72%), Gaps = 4/436 (0%)
 Frame = +1

Query: 10   LSRRCSLDQRAL--CP-LALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVAL 180
            LS R SL  R    CP  +LSKQGHRFL++L A  S  D SAT+   RKFV SS KHVAL
Sbjct: 14   LSHRLSLWNRRPRPCPRCSLSKQGHRFLSTLIAADS-EDISATRHLLRKFVASSSKHVAL 72

Query: 181  DXXXXXXXXXXXXXXXX-AMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETL 357
                              ++A PLYL I++ SWF+WN+KLVAD++A++YK E F +AETL
Sbjct: 73   STLSHLVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAETL 132

Query: 358  IMETMKKLDVQERDMCKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAY 537
            + ET+ KL  +ERD+C FY  LI S +KH  +  V D    +K      SSS+Y+K+R Y
Sbjct: 133  VTETVSKLGSRERDLCSFYSQLIHSQSKHNSERGVLDFCTKLKLVLL-RSSSVYLKQRGY 191

Query: 538  VSMISSLCEIGQPREAEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQS 717
             SM+   C IG PR+AEE+MEEM+ LGLK S FEFRSLVY+YG+ G + DMKR VV+M+S
Sbjct: 192  ASMVEGFCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMES 251

Query: 718  QGFELDTVCANMVLASLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMK 897
             GF+LDTV +NMVL S G+H EL ++VS L+ +++SG+PFSIRTYNSVLNSCP I L+++
Sbjct: 252  MGFQLDTVSSNMVLNSFGSHNELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQ 311

Query: 898  DIKSIPISTEELLNSLSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIF 1077
            D+KS+P+S EEL+ +L E+EA +V  LVGSSVL+E ++W  SELKLDLHGMHL+S+Y+I 
Sbjct: 312  DLKSVPLSLEELMGNLDENEAVLVNILVGSSVLEETMQWKPSELKLDLHGMHLTSAYVII 371

Query: 1078 LQWIDHLRSRLTPGNQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARN 1257
            LQW   L+ +    N++LP EI VVCG+GKHS VRG+SPVK LI+E++LR+ CPL+I R 
Sbjct: 372  LQWFHQLQCKFLAENRVLPGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRK 431

Query: 1258 NAGCFVAKGKVFMDWL 1305
            N GCF+AKGK FM+WL
Sbjct: 432  NIGCFIAKGKSFMEWL 447


>ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum lycopersicum]
          Length = 459

 Score =  462 bits (1188), Expect = e-127
 Identities = 240/436 (55%), Positives = 315/436 (72%), Gaps = 4/436 (0%)
 Frame = +1

Query: 10   LSRRCSLDQRALCP---LALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVAL 180
            LS R SL  R   P    +LSKQGHRFL++L AT S  D SAT+   RKFV SS KHVAL
Sbjct: 14   LSHRLSLWNRRPRPGPRCSLSKQGHRFLSTLIATDS-DDISATRHLLRKFVGSSSKHVAL 72

Query: 181  DXXXXXXXXXXXXXXXX-AMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETL 357
                              ++A PLYL I++ SWF+WN+KLVA+++A++YK E F +AETL
Sbjct: 73   STLSHLVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVAELVALLYKLERFDEAETL 132

Query: 358  IMETMKKLDVQERDMCKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAY 537
            + E++ KL  +ERD+C FY  LI S +KH  +  V D    +K      SSS+Y+K+R Y
Sbjct: 133  VTESVSKLGSRERDLCSFYSQLIYSQSKHNSERGVLDYCTKLKLVLL-HSSSVYLKQRGY 191

Query: 538  VSMISSLCEIGQPREAEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQS 717
             SM+   C IG PR+AEE+MEEM+ LGLK S FEFRSLVY+YG+ G + DMKR VV+M+ 
Sbjct: 192  ASMVEGFCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMER 251

Query: 718  QGFELDTVCANMVLASLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMK 897
             GF+LDTV +NMVL S G+H EL ++VS L+ +++SG+ FSIRTYNSVLNSCP I L+++
Sbjct: 252  MGFQLDTVGSNMVLNSFGSHNELSELVSSLQKIEASGVLFSIRTYNSVLNSCPTISLLLQ 311

Query: 898  DIKSIPISTEELLNSLSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIF 1077
            D+KS+P+S EEL+ +L E+EA +V+ LVGSSVL+E ++W   ELKLDLHGMHL+S+YLI 
Sbjct: 312  DLKSVPLSLEELMGNLDENEAVLVKILVGSSVLEETMQWKPKELKLDLHGMHLTSAYLII 371

Query: 1078 LQWIDHLRSRLTPGNQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARN 1257
            LQW   L+ +    N++LP EI VVCG+GKHS VRG+SPVK LI+E++LR+ CPL+I R 
Sbjct: 372  LQWFHQLQCKFLAENRVLPGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRK 431

Query: 1258 NAGCFVAKGKVFMDWL 1305
            N GCF+AKGKVFM+WL
Sbjct: 432  NVGCFIAKGKVFMEWL 447


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  447 bits (1151), Expect = e-123
 Identities = 229/422 (54%), Positives = 303/422 (71%)
 Frame = +1

Query: 43   LCPLALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXX 222
            L   ALSKQG  FL+S+A     RD SA+ R   KF+ SS K +AL+             
Sbjct: 20   LIQCALSKQGQLFLSSVA-----RDPSASNRLICKFIASSSKSIALNALSHLLSPTTTHP 74

Query: 223  XXXAMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDM 402
               ++A PLY  I++ SWF+WN KL+ADVIA++YK     +AETL+ ET+ KL  +ERD+
Sbjct: 75   YLSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDL 134

Query: 403  CKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPRE 582
              FYC LI+S +KH   + V D+   + +  +  SSS+YVK+RAY SMISSLC +G P E
Sbjct: 135  VSFYCNLIDSHSKHSSNQGVFDVISRLSRIVS-ESSSVYVKERAYKSMISSLCAVGLPLE 193

Query: 583  AEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLA 762
            AE ++EEMR  GLK S FEFRS+VY YGR+GL EDM+R +++M ++GFELDTV +NMVL+
Sbjct: 194  AENLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLS 253

Query: 763  SLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNS 942
            S GA+ +  +MVSWL+ MK+S IPFSIRTYNSVLNSCP I+ +++D+K+ P + +EL+ +
Sbjct: 254  SYGAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELMET 313

Query: 943  LSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGN 1122
            L  DEA +V+EL+GS VL E +EW+ SE KLDLHGMHL S+YLI LQW + LR RL    
Sbjct: 314  LKGDEALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAAE 373

Query: 1123 QMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDW 1302
             ++P+EITVVCGSGKHS+VRG+SPVK ++REM+ R + P+KI R N GCFVAK KV  +W
Sbjct: 374  YVMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGCFVAKAKVVKNW 433

Query: 1303 LC 1308
            LC
Sbjct: 434  LC 435


>ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina]
            gi|568866680|ref|XP_006486677.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g17033-like [Citrus sinensis]
            gi|557524456|gb|ESR35762.1| hypothetical protein
            CICLE_v10028424mg [Citrus clementina]
          Length = 451

 Score =  445 bits (1144), Expect = e-122
 Identities = 230/439 (52%), Positives = 304/439 (69%), Gaps = 7/439 (1%)
 Frame = +1

Query: 13   SRRCSLDQRAL----CPLA-LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVA 177
            SR C L Q+ L    C  A L+KQG RFL+SLA   + RD  A  R   KFV SSP+ +A
Sbjct: 15   SRCCRLRQQRLTLVQCLTARLTKQGQRFLSSLALAVT-RDSKAASRLISKFVASSPQFIA 73

Query: 178  LDXXXXXXXXXXXXXXXXAMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETL 357
            L+                ++AFPLY+ IT+ SWF WN KLVA++IA + K     +AETL
Sbjct: 74   LNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEIIAFLDKQGQREEAETL 133

Query: 358  IMETMKKLDVQERDMCKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAY 537
            I+ET+ KL  +ER++  FYC LI+S  KH  K    D Y  + Q    SSSS+YVK++A 
Sbjct: 134  ILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQL-VNSSSSVYVKRQAL 192

Query: 538  VSMISSLCEIGQPREAEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQS 717
             SMIS LCE+GQP EAE ++EEMR  GL+ SGFE++ ++Y YGR+GL+EDM+R V +M+S
Sbjct: 193  KSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLGLLEDMERIVNQMES 252

Query: 718  QGFELDTVCANMVLASLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMK 897
             G  +DTVC+NMVL+S G H EL +MV WL+ MK SGIPFS+RTYNSVLNSC  I+ M++
Sbjct: 253  DGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYNSVLNSCSTIMSMLQ 312

Query: 898  DIKS--IPISTEELLNSLSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYL 1071
            D+ S   P+S  EL   L+E+E ++V+EL  SSVLDEA++W+S E KLDLHGMHL S+Y 
Sbjct: 313  DLNSNDFPLSILELTEVLNEEEVSVVKELEDSSVLDEAMKWDSGETKLDLHGMHLGSAYF 372

Query: 1072 IFLQWIDHLRSRLTPGNQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIA 1251
            I LQW+D +R+R      ++P EITVVCGSGKHS VRG+S VK ++++M++R   P+++ 
Sbjct: 373  IILQWMDEMRNRFNNEKHVIPAEITVVCGSGKHSTVRGESSVKAMVKKMMVRTSSPMRVH 432

Query: 1252 RNNAGCFVAKGKVFMDWLC 1308
            RNN GCF+AKG V  DWLC
Sbjct: 433  RNNIGCFIAKGHVVKDWLC 451


>gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein, putative
            [Theobroma cacao]
          Length = 456

 Score =  444 bits (1142), Expect = e-122
 Identities = 220/417 (52%), Positives = 301/417 (72%), Gaps = 1/417 (0%)
 Frame = +1

Query: 58   LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237
            L+KQGHRF +SLAAT+ V D +   R  +KFV SSPK +AL+                A+
Sbjct: 34   LTKQGHRFFSSLAATADVNDPATANRLIKKFVASSPKSIALNALSHLLSPRNSHPHLSAL 93

Query: 238  AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417
            AFPLY  I++TSW+NWN KLVA++IA++ K   + ++E LI + + KL  +ERD+ +FYC
Sbjct: 94   AFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEALISQAVSKLKFRERDLVQFYC 153

Query: 418  YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597
              IES +KH  KE  +D Y C       +SSS+YVK++ Y SM+SSLCE+ +P EAE ++
Sbjct: 154  NWIESCSKHNSKEGFNDAY-CYLSELICNSSSVYVKRQGYKSMVSSLCEMDRPNEAENLV 212

Query: 598  EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777
            EEMR  GL  + FEFR + Y YG++GL EDM+R V +M+ +GFE+DT+C+NMVL+S GA+
Sbjct: 213  EEMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIEGFEVDTICSNMVLSSYGAY 272

Query: 778  GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957
                +MV WL+ MK+  IPFSIRTYNSVLNSCP I+ +++ + S+P+S  EL   L+EDE
Sbjct: 273  NAFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQGLDSVPLSLGELAKILNEDE 332

Query: 958  ANMVRELV-GSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLP 1134
            A +V+ELV  SSVLDEA+EWN SE KLDLHGMHL S+YLI LQWI+ ++ R      ++P
Sbjct: 333  ALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIMLQWIEEMKCRFKVEECVIP 392

Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWL 1305
             +IT+VCGSGKHS+VRG+SPVK L+R+M++++K P+KI R N GCF+AKG+V  +WL
Sbjct: 393  AQITIVCGSGKHSSVRGESPVKTLMRKMMVKMKSPMKIDRKNIGCFIAKGQVVKNWL 449


>gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica]
          Length = 447

 Score =  426 bits (1094), Expect = e-116
 Identities = 213/418 (50%), Positives = 285/418 (68%)
 Frame = +1

Query: 55   ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234
            A++KQG RFLT LAA +  RD   T +   KF+ SS K +AL+                +
Sbjct: 33   AVTKQGQRFLTKLAANA--RDAKVTNKLIAKFLTSSTKSIALNTLSYLLSPDTTLPHLSS 90

Query: 235  MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414
            +A P Y  IT+ SWF WN KLVA ++A++ K     +AE LI ET+ KL  +ER++  F+
Sbjct: 91   LALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETISKLGSRERELALFH 150

Query: 415  CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594
            C L+ES +K   K      Y  + Q    +SSS+YVK RA+ SM+S LCE+ +PREA+ +
Sbjct: 151  CQLVESHSKLSSKHGFDSSYSYLYQLLH-NSSSVYVKNRAFESMVSGLCEMDRPREADNL 209

Query: 595  MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774
            +EEMR  GLK S FEFRS+VY YGR+GL EDM + V +M++QG  +DT+C+NMVL+S GA
Sbjct: 210  IEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDTICSNMVLSSYGA 269

Query: 775  HGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSED 954
            H EL  M+ WLR MKS  +PFSIRTYNSVLNSC  I+ M+++ K  P S EEL   L+ D
Sbjct: 270  HSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAMLQEPKDFPCSIEELNGVLNGD 329

Query: 955  EANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLP 1134
            EA +V+ELV S+VLDE + W   E KLDLHGMHL S+YLI L+W + +R R   G  ++P
Sbjct: 330  EALLVKELVESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFEAMRCRFNSGKDVIP 389

Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308
             E+ V+CGSGKHS+VRG+SPVKGL+++M+LR++ P++I R N GCFVAKG+   DWLC
Sbjct: 390  AEVVVICGSGKHSSVRGESPVKGLVKQMMLRMESPMRIDRKNVGCFVAKGRAVKDWLC 447


>ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539507|gb|EEF41095.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 460

 Score =  417 bits (1073), Expect = e-114
 Identities = 206/418 (49%), Positives = 286/418 (68%)
 Frame = +1

Query: 55   ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234
            ALSKQG RFL+SLA  ++  D  AT R  +KFV +SPK +ALD                +
Sbjct: 44   ALSKQGQRFLSSLAIATTKGDTVATNRLIKKFVAASPKSIALDALSHLLNPHSSHSHLSS 103

Query: 235  MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414
            +AF LYL I +  WF WN KLVADV+A + K   + ++ TL+ +++ KL V+ERD+ +FY
Sbjct: 104  LAFTLYLKIAEARWFQWNPKLVADVVAFLDKQGRYDESATLVSDSISKLQVKERDLARFY 163

Query: 415  CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594
            C L+ES +K        +    + Q    +S+S+YVK++ Y SM++ LCE+G+PREAE +
Sbjct: 164  CNLVESQSKQNSIRGFDNSVASLMQLVC-NSNSVYVKRQGYKSMVNGLCEMGRPREAETL 222

Query: 595  MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774
            +EEM   G++ S FEF+ +VYAYG +G  E+M + + +M+  GF +DTVC+NM+LAS GA
Sbjct: 223  IEEMGKEGVRPSMFEFKCVVYAYGSLGSFEEMNKCLHQMERAGFRVDTVCSNMILASYGA 282

Query: 775  HGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSED 954
            H  L +MV WL+ MK  GIPFS+RT NS LNSCP I+ MM++    PIS  +L+  LSED
Sbjct: 283  HNALPEMVLWLQKMKDLGIPFSLRTCNSALNSCPTIMSMMQNSNDFPISIHDLMKILSED 342

Query: 955  EANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLP 1134
            EA +V+E+V SSVLDEA++W+ +E KLDLHG HL S+YLI L WI+ +R R    N + P
Sbjct: 343  EALLVKEIVTSSVLDEAMKWDVAEAKLDLHGTHLCSAYLIILLWIEEMRKRFKSVNYVNP 402

Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308
             EITVVCGSG HS VRG+SPVK ++++ ++R + P++I R N GCF+AKGKV  +WLC
Sbjct: 403  TEITVVCGSGNHSIVRGESPVKCMVKDFMVRARSPMRIDRRNIGCFIAKGKVVEEWLC 460


>ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa]
            gi|550331693|gb|EEE86893.2| hypothetical protein
            POPTR_0009s14120g [Populus trichocarpa]
          Length = 473

 Score =  414 bits (1063), Expect = e-113
 Identities = 213/423 (50%), Positives = 288/423 (68%), Gaps = 2/423 (0%)
 Frame = +1

Query: 46   CPLALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXX 225
            C  A+SKQ  RF +++  T +  D SAT R  +KFV SSPK +ALD              
Sbjct: 52   CLAAISKQAQRFFSAVLPTVATSDTSATNRLIKKFVASSPKSIALDALSNLLSPDSTHHP 111

Query: 226  XX-AMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDM 402
                +  PLYL I++ SWF+WN KLVA V+ ++ K     + + L+ ET+ +L  +ER++
Sbjct: 112  LLYLLTLPLYLKISEASWFSWNPKLVAQVVVLLDKQGLDKELKALMSETVSRLQFKEREL 171

Query: 403  CKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPRE 582
              FYC LI  ++KH       D Y  + QF +  S+S+YVKK+ Y +MIS LCE+G+ RE
Sbjct: 172  VLFYCNLIGFNSKHNWVRGFDDSYSRLNQFVS-DSNSVYVKKQGYKAMISGLCEMGRARE 230

Query: 583  AEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLA 762
            AE+++ EMR  GLK   FEFR ++Y YGR+GL +DM+R + KM+S   E+DTVCANMVLA
Sbjct: 231  AEDLIGEMRERGLKPKLFEFRCVLYGYGRLGLFKDMERILDKMESGEIEVDTVCANMVLA 290

Query: 763  SLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIK-SIPISTEELLN 939
            S GAH  L +M  WLR MK+ GIP SIRT NSVLNSCP I+ +M+++  S P+S +ELL 
Sbjct: 291  SYGAHNALPEMGLWLRKMKTLGIPLSIRTCNSVLNSCPTIMALMRNLDASYPVSIQELLK 350

Query: 940  SLSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPG 1119
             LSE+EA +V+EL+ SSVL EA +W++SE KLDLHGMHL S+Y+I LQW++  R+RL+ G
Sbjct: 351  ILSEEEAMLVKELIESSVLKEATKWDTSEGKLDLHGMHLGSAYVIMLQWMEETRNRLSDG 410

Query: 1120 NQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMD 1299
              ++P EITVVCGSG HS VRG+SPVK +I E++ + + P++I R N GCFVAKG V   
Sbjct: 411  EHVIPAEITVVCGSGNHSTVRGESPVKSMITEIMAQTRSPMRIDRKNIGCFVAKGNVVKK 470

Query: 1300 WLC 1308
            WLC
Sbjct: 471  WLC 473


>ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum]
            gi|557110519|gb|ESQ50810.1| hypothetical protein
            EUTSA_v10022675mg [Eutrema salsugineum]
          Length = 469

 Score =  410 bits (1054), Expect = e-112
 Identities = 208/435 (47%), Positives = 293/435 (67%)
 Frame = +1

Query: 4    TSLSRRCSLDQRALCPLALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALD 183
            TS+  RC         + L KQGHRFL+SL++ +   D SAT R  +KFV +SPK V+L+
Sbjct: 40   TSMEVRCKAGT-----VPLMKQGHRFLSSLSSPALAGDPSATNRHIKKFVAASPKSVSLN 94

Query: 184  XXXXXXXXXXXXXXXXAMAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIM 363
                              A  LY  IT+ SWF+WN KL+A+++A++ K E   ++ETL+ 
Sbjct: 95   VLSHLLSAQTSHPHLSFFALSLYSEITEASWFDWNPKLIAELVALLNKQERSHESETLLS 154

Query: 364  ETMKKLDVQERDMCKFYCYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVS 543
              + +L   ERD+  FYC L+ES++K    +  ++    +++  T  S+S+YVK +AY S
Sbjct: 155  NAVSRLKSNERDIALFYCNLVESNSKQGSIQGFNEACVRLREI-TRRSTSVYVKTQAYKS 213

Query: 544  MISSLCEIGQPREAEEVMEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQG 723
            M+S LC + QP +AE V+EEMR   +K   FE++S++Y YGR+GL EDM R V +M+++G
Sbjct: 214  MVSGLCNMDQPHDAESVIEEMRIAKIKPGLFEYKSVLYGYGRLGLFEDMNRVVHRMETEG 273

Query: 724  FELDTVCANMVLASLGAHGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDI 903
             ++DTVC+NMVL+S GAH  L QM SWL+ +K S +P S RTYNSVLNSCP I+ ++KD+
Sbjct: 274  HKIDTVCSNMVLSSYGAHNALPQMGSWLQKLKDSNVPLSERTYNSVLNSCPTILSLLKDL 333

Query: 904  KSIPISTEELLNSLSEDEANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQ 1083
             S P+S  ELL  L++DE  +VR L  SSVLDEA+EW+S E KLDLHGMHLSSSYLI +Q
Sbjct: 334  DSCPVSLSELLTFLNKDEEVLVRGLTQSSVLDEAIEWSSLEGKLDLHGMHLSSSYLIMMQ 393

Query: 1084 WIDHLRSRLTPGNQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNA 1263
            W+D +R R + G  ++P EI +V GSGKHS VRG+SPVK L++++++R   P++I R N 
Sbjct: 394  WMDEMRIRFSEGKCVVPAEIVLVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNI 453

Query: 1264 GCFVAKGKVFMDWLC 1308
            G F+AKGK   +WLC
Sbjct: 454  GSFIAKGKTVKEWLC 468


>gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]
          Length = 517

 Score =  409 bits (1051), Expect = e-111
 Identities = 211/418 (50%), Positives = 284/418 (67%)
 Frame = +1

Query: 55   ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234
            AL+KQGHRFL++L+  +   + SA  +   KFV SSPK ++L+                +
Sbjct: 103  ALTKQGHRFLSTLSINAG--NASAANKLIGKFVASSPKSISLNALSHLLSPDTTHTHLTS 160

Query: 235  MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414
             +  LY  I + SWF ++ KLVA + A++ K   + +AE LI E + KL  ++R++  FY
Sbjct: 161  HSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIAEAVSKLGHRQRELAVFY 220

Query: 415  CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594
            C L+ES +K   K      Y  + Q    SSS+ YVK RA+ +M+ +LC + +P EAE +
Sbjct: 221  CSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSA-YVKCRAFETMVGALCTMDRPCEAESL 279

Query: 595  MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774
            MEEMR  GLK S FEFRSLVY YGR+GL EDM R+V +M+ +G  +DT+C+NMVL+S GA
Sbjct: 280  MEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGLVIDTICSNMVLSSYGA 339

Query: 775  HGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSED 954
            H EL QMV WL+ M++S IPFSIRTYNSVLN CP I  M++D+K IP+S  EL  +L  D
Sbjct: 340  HNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLKDIPLSMYELNATLRGD 399

Query: 955  EANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLP 1134
            E  +V ELVGSSVL+E + W+S E+KLDLHGMHL S+YLI L+W++ +  R   GN  +P
Sbjct: 400  EGLLVMELVGSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEWMEEMTRRFNDGNHGIP 459

Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308
             E+ VVCGSGKHS VRG SPVK L++EM++++K P+KI R NAGCF+AKGK   DWLC
Sbjct: 460  AEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKNAGCFLAKGKTVRDWLC 517


>ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Fragaria vesca subsp. vesca]
          Length = 448

 Score =  407 bits (1045), Expect = e-111
 Identities = 202/418 (48%), Positives = 284/418 (67%)
 Frame = +1

Query: 55   ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234
            AL+KQG RFLT LAA +   + S   +   KF+++SPK  AL                 +
Sbjct: 34   ALTKQGQRFLTKLAANAG--NPSVANKLISKFLSTSPKSTALTTLSYLLSPHTAHPHLSS 91

Query: 235  MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414
            +A P+Y  IT+ SWF WN KLVA ++A++ K     Q+E LI ET+ KL  +ER++ +F+
Sbjct: 92   LALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQSEALISETISKLGNKERELVQFH 151

Query: 415  CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594
            C L+ES +K   K         + Q    +SSS+YVK+RA+ SM+  LC + +P EA+E+
Sbjct: 152  CQLVESHSKMSSKCGFDRACTYLHQLLQ-NSSSVYVKRRAFESMVGGLCAMDRPGEADEL 210

Query: 595  MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774
            +EEMR  GLK S FEFRS+VY YGR+G+ E+M + V +M+ QGF  DT+C NMVL+S GA
Sbjct: 211  IEEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVDQMEKQGFGDDTICCNMVLSSYGA 270

Query: 775  HGELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSED 954
            H EL  M +WLR MK S +PFS+RTYNSVLNSCP I+ M+++ K++P S  EL   L  D
Sbjct: 271  HNELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIMAMLQEPKAVPCSVGELSGVLDGD 330

Query: 955  EANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLP 1134
            EA +V+ELVGS+V+DEA+ W+S+E KLDLHGMHL S+YL+ L+W + + +R      ++P
Sbjct: 331  EALVVKELVGSAVVDEAMVWDSAEAKLDLHGMHLGSAYLVMLEWFEAMGNRFKSAECVVP 390

Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308
             E+ +VCG GKHS+VRG+SPVK L++EM+ +++ P++I R N GCF+AKG+   DWLC
Sbjct: 391  AEVVIVCGLGKHSSVRGESPVKDLVKEMMHQMESPMRIDRKNVGCFIAKGRAVKDWLC 448


>ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella]
            gi|482566151|gb|EOA30340.1| hypothetical protein
            CARUB_v10013465mg [Capsella rubella]
          Length = 516

 Score =  405 bits (1042), Expect = e-110
 Identities = 206/418 (49%), Positives = 286/418 (68%), Gaps = 1/418 (0%)
 Frame = +1

Query: 58   LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237
            L KQGH+FL+SL++ +   D  AT R  +KFV +SPK VAL+                  
Sbjct: 99   LMKQGHQFLSSLSSPALAGDPPATNRLIKKFVAASPKSVALNVLSHLLSDNTSHPHLSYF 158

Query: 238  AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417
            A  LYL IT+ SWF+WN KL+ ++++++ K E F ++ETL+   + +L+  ERD   F C
Sbjct: 159  APQLYLEITEASWFDWNPKLIGELVSLLNKQERFVESETLLSTAVSRLESNERDFALFLC 218

Query: 418  YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597
             L+ES++K    +  SD    +++     SSS+YVK +AY SM+S LC + QP +AE V+
Sbjct: 219  NLVESNSKQGSIQGFSDACSRLREIIQ-RSSSVYVKTQAYKSMVSGLCNMDQPLDAERVI 277

Query: 598  EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777
            EEMR   +K   FE++S++Y YGR+GL +DM R V +M++QG ++DTVC+NMVL+S GAH
Sbjct: 278  EEMRMETIKPGLFEYKSVLYGYGRLGLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYGAH 337

Query: 778  GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957
              L QM SWL+ +K   +P SIRTYNSVLNSCP I+ ++KD+ S P+S  ELL  L+EDE
Sbjct: 338  DALPQMGSWLQKLKGYNVPLSIRTYNSVLNSCPTIISLLKDLDSCPLSLSELLPILNEDE 397

Query: 958  ANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQ-MLP 1134
            A +VREL  S VLDEA+EWN+ E KLDLHGMHLS+SYLI LQW+D  R R +   + ++P
Sbjct: 398  ALLVRELTQSLVLDEAIEWNAVEGKLDLHGMHLSASYLIMLQWMDETRLRFSEDKKCVVP 457

Query: 1135 IEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308
             EI VV GSGKHS VRG+SPVK +++++++R K P++I R N G F+AKGK   +WLC
Sbjct: 458  AEIVVVSGSGKHSNVRGESPVKAMVKKIMVRTKSPMRIDRKNVGSFIAKGKNVKEWLC 515


>ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein
            [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 505

 Score =  405 bits (1040), Expect = e-110
 Identities = 207/417 (49%), Positives = 280/417 (67%)
 Frame = +1

Query: 58   LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237
            L K G RFL+SL++ +   D SA  R  +KFV +SPK VAL+                  
Sbjct: 89   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 148

Query: 238  AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417
            A  LY  IT+ SWF+WN KL+A++IA++ K E F ++ETL+   + +L   ERD   F C
Sbjct: 149  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 208

Query: 418  YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597
             L+ES++K    +  S+    +++     SSS+YVK +AY SM+S LC + QP +AE V+
Sbjct: 209  NLVESNSKQGSIQGFSEASFRLREIIQ-RSSSVYVKTQAYKSMVSGLCNMDQPHDAERVI 267

Query: 598  EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777
            EEMR   +K   FE++S++Y YGR+GL +DM R V +M ++G ++DTVC+NMVL+S GAH
Sbjct: 268  EEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAH 327

Query: 778  GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957
              L QM SWL+ +K   +PFSIRTYNSVLNSCP I+ M+KD+ S P+S  EL   L+EDE
Sbjct: 328  DALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDE 387

Query: 958  ANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLPI 1137
            A +V EL  SSVLDEA+EWN+ E KLDLHGMHLSSSYLI LQW+D  R R +    ++P 
Sbjct: 388  ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPA 447

Query: 1138 EITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308
            EI VV GSGKHS VRG+SPVK L++++++R   P++I R N G F+AKGK   +WLC
Sbjct: 448  EIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504


>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score =  405 bits (1040), Expect = e-110
 Identities = 207/417 (49%), Positives = 280/417 (67%)
 Frame = +1

Query: 58   LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237
            L K G RFL+SL++ +   D SA  R  +KFV +SPK VAL+                  
Sbjct: 85   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 144

Query: 238  AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417
            A  LY  IT+ SWF+WN KL+A++IA++ K E F ++ETL+   + +L   ERD   F C
Sbjct: 145  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 204

Query: 418  YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597
             L+ES++K    +  S+    +++     SSS+YVK +AY SM+S LC + QP +AE V+
Sbjct: 205  NLVESNSKQGSIQGFSEASFRLREIIQ-RSSSVYVKTQAYKSMVSGLCNMDQPHDAERVI 263

Query: 598  EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777
            EEMR   +K   FE++S++Y YGR+GL +DM R V +M ++G ++DTVC+NMVL+S GAH
Sbjct: 264  EEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAH 323

Query: 778  GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957
              L QM SWL+ +K   +PFSIRTYNSVLNSCP I+ M+KD+ S P+S  EL   L+EDE
Sbjct: 324  DALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDE 383

Query: 958  ANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLPI 1137
            A +V EL  SSVLDEA+EWN+ E KLDLHGMHLSSSYLI LQW+D  R R +    ++P 
Sbjct: 384  ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPA 443

Query: 1138 EITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308
            EI VV GSGKHS VRG+SPVK L++++++R   P++I R N G F+AKGK   +WLC
Sbjct: 444  EIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 500


>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein
            [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown
            protein [Arabidopsis thaliana]
            gi|330251481|gb|AEC06575.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  405 bits (1040), Expect = e-110
 Identities = 207/417 (49%), Positives = 280/417 (67%)
 Frame = +1

Query: 58   LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237
            L K G RFL+SL++ +   D SA  R  +KFV +SPK VAL+                  
Sbjct: 88   LMKHGDRFLSSLSSPALAGDPSAINRHIKKFVAASPKSVALNVLSHLLSDQTSHPHLSFF 147

Query: 238  AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417
            A  LY  IT+ SWF+WN KL+A++IA++ K E F ++ETL+   + +L   ERD   F C
Sbjct: 148  ALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLC 207

Query: 418  YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597
             L+ES++K    +  S+    +++     SSS+YVK +AY SM+S LC + QP +AE V+
Sbjct: 208  NLVESNSKQGSIQGFSEASFRLREIIQ-RSSSVYVKTQAYKSMVSGLCNMDQPHDAERVI 266

Query: 598  EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777
            EEMR   +K   FE++S++Y YGR+GL +DM R V +M ++G ++DTVC+NMVL+S GAH
Sbjct: 267  EEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAH 326

Query: 778  GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957
              L QM SWL+ +K   +PFSIRTYNSVLNSCP I+ M+KD+ S P+S  EL   L+EDE
Sbjct: 327  DALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDE 386

Query: 958  ANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLPI 1137
            A +V EL  SSVLDEA+EWN+ E KLDLHGMHLSSSYLI LQW+D  R R +    ++P 
Sbjct: 387  ALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSEEKCVIPA 446

Query: 1138 EITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308
            EI VV GSGKHS VRG+SPVK L++++++R   P++I R N G F+AKGK   +WLC
Sbjct: 447  EIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503


>ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297329872|gb|EFH60291.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 504

 Score =  402 bits (1032), Expect = e-109
 Identities = 203/417 (48%), Positives = 281/417 (67%)
 Frame = +1

Query: 58   LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237
            L KQG RFL+SL++ +   D SAT R  +KFV +SPK V L+                  
Sbjct: 88   LMKQGDRFLSSLSSPALAGDPSATHRHIKKFVAASPKSVTLNVLSHLLSDQTSYPHLSFF 147

Query: 238  AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417
            A  LY  IT+ SWF+WN KL+A+++AV+   E F ++ETL+   + +L   ERD   F C
Sbjct: 148  ALSLYSEITEASWFDWNPKLIAELVAVLNNQERFDESETLLSTAVSRLKSNERDFALFLC 207

Query: 418  YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597
             L+ES++K    +  ++    +++     SSS+YVK +AY SM++ LC + QP +AE V+
Sbjct: 208  NLVESNSKQGSIQGFNEACFRLRERIQ-RSSSVYVKTQAYKSMVAGLCNMDQPHDAERVI 266

Query: 598  EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777
            EEMR   +K   FE +S++Y YGR+GL +DM R V +M+++G ++DTVC+NMVL+S GAH
Sbjct: 267  EEMRVEKIKPGSFEHKSVLYGYGRLGLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYGAH 326

Query: 778  GELMQMVSWLRIMKSSGIPFSIRTYNSVLNSCPRIVLMMKDIKSIPISTEELLNSLSEDE 957
              L QM SWL+ +K   +PFSIRTYNSVLNSCP I+ ++KD+ S P+S  EL   L+EDE
Sbjct: 327  DALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIMSLLKDLNSCPVSLSELRTFLNEDE 386

Query: 958  ANMVRELVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGNQMLPI 1137
            A +V EL  S+VLDEA+EWN+ E KLDLHGMHLSSSYLI LQW+D +R R      ++P 
Sbjct: 387  ALLVLELTQSTVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDEIRLRFRDQKCVIPA 446

Query: 1138 EITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDWLC 1308
            EI VV GSGKHS VRG+SPVK L++++++R + P++I R N G F+AKGK   +WLC
Sbjct: 447  EIVVVSGSGKHSNVRGESPVKALVKKIMVRTESPMRIDRKNVGSFIAKGKNVKEWLC 503


>gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucumis melo]
          Length = 488

 Score =  392 bits (1006), Expect = e-106
 Identities = 199/422 (47%), Positives = 286/422 (67%), Gaps = 5/422 (1%)
 Frame = +1

Query: 58   LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237
            L+KQ HRFL++L+ T +  D SAT R  RKFV SSPK + L                 + 
Sbjct: 45   LTKQTHRFLSTLSTTGATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSA 104

Query: 238  AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417
            A  LY  IT+ SWF WN+KLVAD++A + ++  + ++E LI E + KL  QER +  FY 
Sbjct: 105  ALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKLGSQERKLVNFYS 164

Query: 418  YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597
             L+ES +KH  +    D Y  + +    +S S+YVK+RAY SM++ LC + +P EAE ++
Sbjct: 165  QLVESQSKHGFERGFGDSYSRLFELLY-NSPSVYVKRRAYESMVTGLCSMKRPHEAESLV 223

Query: 598  EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777
            +EMR  G+  + +E+RS++YAYG +GL E+MKRS+ +M++   ELDTVC+NMVL+S GAH
Sbjct: 224  KEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAH 283

Query: 778  GELMQMVSWLRIMK-SSGIPFSIRTYNSVLNSCPRIVLMMKDIKS--IPISTEELLNSLS 948
             +L  M+ WL+ MK SS    S+RTYNSVLNSCP+I  M++D KS  +P+  E+L+  L 
Sbjct: 284  NKLGDMLLWLQRMKTSSHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILD 343

Query: 949  EDE-ANMVREL-VGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGN 1122
             DE A +V+EL VGSSVL+E + W++ ELKLDLHG H+ ++Y+I LQWI  +R      +
Sbjct: 344  GDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDES 403

Query: 1123 QMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDW 1302
             ++P ++T++CGSGKHS VRG+SPVK LI+E+++R + PL+I R N GCF++KGK   +W
Sbjct: 404  NVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNW 463

Query: 1303 LC 1308
            LC
Sbjct: 464  LC 465


>gb|AGH33847.1| PPR [Cucumis melo]
          Length = 488

 Score =  390 bits (1003), Expect = e-106
 Identities = 198/422 (46%), Positives = 287/422 (68%), Gaps = 5/422 (1%)
 Frame = +1

Query: 58   LSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXAM 237
            L+KQ HRFL++L+ T++  D SAT R  RKFV SSPK + L                 + 
Sbjct: 45   LTKQTHRFLSTLSTTAATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCSA 104

Query: 238  AFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFYC 417
            A  LY  IT+ SWF WN+KLVAD++A + ++  + ++E LI E + KL  QER +  FY 
Sbjct: 105  ALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKLGSQERKLVNFYS 164

Query: 418  YLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEVM 597
             L+ES +KH  +    D Y  + +    +S S+YVK+RAY SM++ LC + +P EAE ++
Sbjct: 165  QLVESQSKHGFERGFGDSYSRLFELLY-NSPSVYVKRRAYESMVTGLCSMKRPHEAESLV 223

Query: 598  EEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGAH 777
            +EMR  G+  + +E+RS++YAYG +GL E+MKRS+ +M++   ELDTVC+NMVL+S GAH
Sbjct: 224  KEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGAH 283

Query: 778  GELMQMVSWLRIMKSSG-IPFSIRTYNSVLNSCPRIVLMMKDIKS--IPISTEELLNSLS 948
             +L  M+ WL+ MK+S     S+RTYNSVLNSCP+I  M++D KS  +P+  E+L+  L 
Sbjct: 284  NKLGDMLLWLQRMKTSPHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLPVLIEDLIAILD 343

Query: 949  EDE-ANMVREL-VGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPGN 1122
             DE A +V+EL VGSSVL+E + W++ ELKLDLHG H+ ++Y+I LQWI  +R      +
Sbjct: 344  GDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDES 403

Query: 1123 QMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMDW 1302
             ++P ++T++CGSGKHS VRG+SPVK LI+E+++R + PL+I R N GCF++KGK   +W
Sbjct: 404  YVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKNW 463

Query: 1303 LC 1308
            LC
Sbjct: 464  LC 465


>ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223617 [Cucumis sativus]
          Length = 1296

 Score =  390 bits (1001), Expect = e-105
 Identities = 196/423 (46%), Positives = 285/423 (67%), Gaps = 5/423 (1%)
 Frame = +1

Query: 55   ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234
            +L+KQ HRFL++L+ T++  D SAT R  RKFV SSPK + L                 +
Sbjct: 44   SLTKQTHRFLSTLSTTAATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCS 103

Query: 235  MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414
             A  LY  IT+ SWF WN+KLVAD++A + ++  + ++E LI E + KL  QER +  FY
Sbjct: 104  AALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKLGSQERKLVNFY 163

Query: 415  CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594
              L+ES +KH  +    D Y  + +    +S S+YVK+RAY SM++ LC + +P EAE +
Sbjct: 164  SQLVESQSKHGFERGFVDSYSRLLELLY-NSPSVYVKRRAYESMVTGLCSMKRPHEAENL 222

Query: 595  MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774
            ++EMR  G+  + +E+RS++YAYG +GL E+MKRS+ +M++   ELDTVC+NMVL+S GA
Sbjct: 223  VKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGA 282

Query: 775  HGELMQMVSWLRIMKSSG-IPFSIRTYNSVLNSCPRIVLMMKDIKS--IPISTEELLNSL 945
            H +L  MV WL+ MK+S     S+RTYNSVLNSCP+I  M++D KS  +P+  E+L+  L
Sbjct: 283  HNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLPVLIEDLIAVL 342

Query: 946  SEDEANMVRE--LVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPG 1119
              DE  ++ E  L GSSVL+E + W++ ELKLDLHG H+ ++Y+I LQWI  +R      
Sbjct: 343  DGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDE 402

Query: 1120 NQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMD 1299
            + ++P ++T++CGSGKHS VRG+SPVK LI+E+++R + PL+I R N GCF++KGK   +
Sbjct: 403  SYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKN 462

Query: 1300 WLC 1308
            WLC
Sbjct: 463  WLC 465


>ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204365 [Cucumis sativus]
          Length = 1913

 Score =  390 bits (1001), Expect = e-105
 Identities = 196/423 (46%), Positives = 285/423 (67%), Gaps = 5/423 (1%)
 Frame = +1

Query: 55   ALSKQGHRFLTSLAATSSVRDFSATQRSFRKFVNSSPKHVALDXXXXXXXXXXXXXXXXA 234
            +L+KQ HRFL++L+ T++  D SAT R  RKFV SSPK + L                 +
Sbjct: 44   SLTKQTHRFLSTLSTTAATGDQSATNRLIRKFVASSPKSITLSVLSNIVSTHTPQPELCS 103

Query: 235  MAFPLYLSITQTSWFNWNAKLVADVIAVMYKHENFGQAETLIMETMKKLDVQERDMCKFY 414
             A  LY  IT+ SWF WN+KLVAD++A + ++  + ++E LI E + KL  QER +  FY
Sbjct: 104  AALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKLGSQERKLVNFY 163

Query: 415  CYLIESSAKHQLKERVSDLYKCMKQFFTGSSSSIYVKKRAYVSMISSLCEIGQPREAEEV 594
              L+ES +KH  +    D Y  + +    +S S+YVK+RAY SM++ LC + +P EAE +
Sbjct: 164  SQLVESQSKHGFERGFVDSYSRLLELLY-NSPSVYVKRRAYESMVTGLCSMKRPHEAENL 222

Query: 595  MEEMRGLGLKQSGFEFRSLVYAYGRIGLVEDMKRSVVKMQSQGFELDTVCANMVLASLGA 774
            ++EMR  G+  + +E+RS++YAYG +GL E+MKRS+ +M++   ELDTVC+NMVL+S GA
Sbjct: 223  VKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVCSNMVLSSYGA 282

Query: 775  HGELMQMVSWLRIMKSSG-IPFSIRTYNSVLNSCPRIVLMMKDIKS--IPISTEELLNSL 945
            H +L  MV WL+ MK+S     S+RTYNSVLNSCP+I  M++D KS  +P+  E+L+  L
Sbjct: 283  HNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLPVLIEDLIAVL 342

Query: 946  SEDEANMVRE--LVGSSVLDEAVEWNSSELKLDLHGMHLSSSYLIFLQWIDHLRSRLTPG 1119
              DE  ++ E  L GSSVL+E + W++ ELKLDLHG H+ ++Y+I LQWI  +R      
Sbjct: 343  DGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWIKEMRLNFEDE 402

Query: 1120 NQMLPIEITVVCGSGKHSAVRGQSPVKGLIREMILRLKCPLKIARNNAGCFVAKGKVFMD 1299
            + ++P ++T++CGSGKHS VRG+SPVK LI+E+++R + PL+I R N GCF++KGK   +
Sbjct: 403  SYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTGCFISKGKAVKN 462

Query: 1300 WLC 1308
            WLC
Sbjct: 463  WLC 465


Top