BLASTX nr result

ID: Catharanthus22_contig00014370 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00014370
         (2679 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi...   466   e-128
ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containi...   461   e-127
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   449   e-123
gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein...   444   e-122
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   431   e-118
gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus pe...   431   e-117
gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]     418   e-114
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   417   e-113
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...   409   e-111
ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223...   408   e-111
ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204...   408   e-111
gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucu...   407   e-110
gb|AGH33847.1| PPR [Cucumis melo]                                     406   e-110
ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps...   400   e-108
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   395   e-107
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   393   e-106
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           393   e-106
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   393   e-106
ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Popu...   389   e-105
ref|XP_002884032.1| pentatricopeptide repeat-containing protein ...   388   e-105

>ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum tuberosum]
          Length = 459

 Score =  466 bits (1199), Expect = e-128
 Identities = 241/437 (55%), Positives = 318/437 (72%), Gaps = 1/437 (0%)
 Frame = -1

Query: 2466 RRCSPCRRPGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVA 2287
            RR  PC R        +LSKQGHRF             D SAT R L+RKFV SS KHVA
Sbjct: 23   RRPRPCPRC-------SLSKQGHRFLSTLIAADS---EDISAT-RHLLRKFVASSSKHVA 71

Query: 2286 LDXXXXXXXXXXXXXXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAET 2110
            L                  ++A+PLYL I+EASWF+WN+KLVAD++A++YK E+FDEAET
Sbjct: 72   LSTLSHLVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAET 131

Query: 2109 LILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAY 1930
            L+ ET+ K+G +ER++C+FY  LI S +KH  +  V D    +K +   SSS Y+K+R Y
Sbjct: 132  LVTETVSKLGSRERDLCSFYSQLIHSQSKHNSERGVLDFCTKLKLVLLRSSSVYLKQRGY 191

Query: 1929 ESMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQN 1750
             SMV   C IG PR+AE+LMEEM+ELGLK S FE R+LVY+YGK G + DMKR V+E+++
Sbjct: 192  ASMVEGFCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMES 251

Query: 1749 QGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQ 1570
             GF+LDTV +NMVL+S G+H ELSE+VS LQ++++  + FS+RTYNSVLNSCPTI L+LQ
Sbjct: 252  MGFQLDTVSSNMVLNSFGSHNELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQ 311

Query: 1569 DIKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIF 1390
            D+KSVP+S+E L+ NL ++E ++V  L+GSSVL+E M+W  SE+KLDLHGMHL+ +Y+I 
Sbjct: 312  DLKSVPLSLEELMGNLDENEAVLVNILVGSSVLEETMQWKPSELKLDLHGMHLTSAYVII 371

Query: 1389 LQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDR 1210
            LQW   ++ +F + N++L P EI VVCG GKHS VRG+SPVK L+KE++LR+ CPL+IDR
Sbjct: 372  LQWFHQLQCKFLAENRVL-PGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDR 430

Query: 1209 KNVGCFIAKGKVFRDWL 1159
            KN+GCFIAKGK F +WL
Sbjct: 431  KNIGCFIAKGKSFMEWL 447


>ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum lycopersicum]
          Length = 459

 Score =  461 bits (1187), Expect = e-127
 Identities = 240/431 (55%), Positives = 316/431 (73%), Gaps = 1/431 (0%)
 Frame = -1

Query: 2448 RRPGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXX 2269
            RRP    R  +LSKQGHRF             D SAT R L+RKFV SS KHVAL     
Sbjct: 23   RRPRPGPRC-SLSKQGHRFLSTLIATDSD---DISAT-RHLLRKFVGSSSKHVALSTLSH 77

Query: 2268 XXXXXXXXXXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETM 2092
                         ++A+PLYL I+EASWF+WN+KLVA+++A++YK E+FDEAETL+ E++
Sbjct: 78   LVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVAELVALLYKLERFDEAETLVTESV 137

Query: 2091 KKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRS 1912
             K+G +ER++C+FY  LI S +KH  +  V D    +K +   SSS Y+K+R Y SMV  
Sbjct: 138  SKLGSRERDLCSFYSQLIYSQSKHNSERGVLDYCTKLKLVLLHSSSVYLKQRGYASMVEG 197

Query: 1911 LCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELD 1732
             C IG PR+AE+LMEEM+ELGLK S FE R+LVY+YGK G + DMKR V+E++  GF+LD
Sbjct: 198  FCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMERMGFQLD 257

Query: 1731 TVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVP 1552
            TV +NMVL+S G+H ELSE+VS LQ++++  + FS+RTYNSVLNSCPTI L+LQD+KSVP
Sbjct: 258  TVGSNMVLNSFGSHNELSELVSSLQKIEASGVLFSIRTYNSVLNSCPTISLLLQDLKSVP 317

Query: 1551 ISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDV 1372
            +S+E L+ NL ++E ++V  L+GSSVL+E M+W   E+KLDLHGMHL+ +YLI LQW   
Sbjct: 318  LSLEELMGNLDENEAVLVKILVGSSVLEETMQWKPKELKLDLHGMHLTSAYLIILQWFHQ 377

Query: 1371 MRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCF 1192
            ++ +F + N++L P EI VVCG GKHS VRG+SPVK L+KE++LR+ CPL+IDRKNVGCF
Sbjct: 378  LQCKFLAENRVL-PGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRKNVGCF 436

Query: 1191 IAKGKVFRDWL 1159
            IAKGKVF +WL
Sbjct: 437  IAKGKVFMEWL 447


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  449 bits (1155), Expect = e-123
 Identities = 228/421 (54%), Positives = 309/421 (73%)
 Frame = -1

Query: 2418 ALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXX 2239
            ALSKQG  F            RD SA+ R LI KF+ SS K +AL+              
Sbjct: 24   ALSKQGQLFLSSV-------ARDPSASNR-LICKFIASSSKSIALNALSHLLSPTTTHPY 75

Query: 2238 XSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVC 2059
             S++A+PLY  I+EASWF+WN KL+ADVIA++YK  Q  EAETL+ ET+ K+G +ER++ 
Sbjct: 76   LSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDLV 135

Query: 2058 NFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAE 1879
            +FYCNLI+S +KH   + V D+ + +  I + SSS YVK+RAY+SM+ SLC +G P EAE
Sbjct: 136  SFYCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEAE 195

Query: 1878 DLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSL 1699
            +L+EEMR  GLK S FE R++VY YG++GL EDM+R ++++ N+GFELDTV +NMVLSS 
Sbjct: 196  NLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLSSY 255

Query: 1698 GTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLT 1519
            G + + SEMVSWLQRMK+  I FS+RTYNSVLNSCP I+ +LQD+K+ P +++ L++ L 
Sbjct: 256  GAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELMETLK 315

Query: 1518 KDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 1339
             DE L+V EL+GS VL E+MEW+ SE KLDLHGMHL  +YLI LQW + +R+R ++  + 
Sbjct: 316  GDEALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAA-EY 374

Query: 1338 LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 1159
            ++P EITVVCG GKHS+VRG+SPVK +++EM+ R + P+KIDRKN+GCF+AK KV ++WL
Sbjct: 375  VMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGCFVAKAKVVKNWL 434

Query: 1158 C 1156
            C
Sbjct: 435  C 435


>gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein, putative
            [Theobroma cacao]
          Length = 456

 Score =  444 bits (1143), Expect = e-122
 Identities = 221/420 (52%), Positives = 302/420 (71%), Gaps = 1/420 (0%)
 Frame = -1

Query: 2415 LSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 2236
            L+KQGHRF             +  AT   LI+KFV SSPK +AL+               
Sbjct: 34   LTKQGHRFFSSLAATADV---NDPATANRLIKKFVASSPKSIALNALSHLLSPRNSHPHL 90

Query: 2235 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 2056
            SA+A PLY  I+E SW+NWN KLVA++IA++ K  ++DE+E LI + + K+  +ER++  
Sbjct: 91   SALAFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEALISQAVSKLKFRERDLVQ 150

Query: 2055 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAED 1876
            FYCN IES +KH  KE  +D Y Y+  +   SSS YVK++ Y+SMV SLC++ +P EAE+
Sbjct: 151  FYCNWIESCSKHNSKEGFNDAYCYLSELICNSSSVYVKRQGYKSMVSSLCEMDRPNEAEN 210

Query: 1875 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 1696
            L+EEMR+ GL  + FE R + Y YG++GL EDM+R V E++ +GFE+DT+C+NMVLSS G
Sbjct: 211  LVEEMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIEGFEVDTICSNMVLSSYG 270

Query: 1695 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 1516
             +   S+MV WLQ+MK+L+I FS+RTYNSVLNSCP I+ ++Q + SVP+S+  L K L +
Sbjct: 271  AYNAFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQGLDSVPLSLGELAKILNE 330

Query: 1515 DEVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 1339
            DE L+V EL+  SSVLDE MEWN SE KLDLHGMHL  +YLI LQWI+ M+ RF    + 
Sbjct: 331  DEALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIMLQWIEEMKCRFKV-EEC 389

Query: 1338 LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 1159
            ++P +IT+VCG GKHS+VRG+SPVK+LM++M+++MK P+KIDRKN+GCFIAKG+V ++WL
Sbjct: 390  VIPAQITIVCGSGKHSSVRGESPVKTLMRKMMVKMKSPMKIDRKNIGCFIAKGQVVKNWL 449


>ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina]
            gi|568866680|ref|XP_006486677.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g17033-like [Citrus sinensis]
            gi|557524456|gb|ESR35762.1| hypothetical protein
            CICLE_v10028424mg [Citrus clementina]
          Length = 451

 Score =  431 bits (1109), Expect = e-118
 Identities = 228/441 (51%), Positives = 299/441 (67%), Gaps = 5/441 (1%)
 Frame = -1

Query: 2463 RCSPCRRPGLSL---RLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKH 2293
            RC   R+  L+L       L+KQG RF            RDS A  R LI KFV SSP+ 
Sbjct: 16   RCCRLRQQRLTLVQCLTARLTKQGQRFLSSLALAV---TRDSKAASR-LISKFVASSPQF 71

Query: 2292 VALDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAE 2113
            +AL+               S++A PLY+ ITE SWF WN KLVA++IA + K  Q +EAE
Sbjct: 72   IALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEIIAFLDKQGQREEAE 131

Query: 2112 TLILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRA 1933
            TLILET+ K+G +ER +  FYCNLI+S  KH  K    D Y  +  +   SSS YVK++A
Sbjct: 132  TLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQLVNSSSSVYVKRQA 191

Query: 1932 YESMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQ 1753
             +SM+  LC++GQP EAE+L+EEMR  GL+ S FE + ++Y YG++GL+EDM+R V +++
Sbjct: 192  LKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLGLLEDMERIVNQME 251

Query: 1752 NQGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLML 1573
            + G  +DTVC+NMVLSS G H ELS MV WLQ+MK   I FSVRTYNSVLNSC TI+ ML
Sbjct: 252  SDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYNSVLNSCSTIMSML 311

Query: 1572 QDIKS--VPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSY 1399
            QD+ S   P+S+  L + L ++EV VV EL  SSVLDE M+W+S E KLDLHGMHL  +Y
Sbjct: 312  QDLNSNDFPLSILELTEVLNEEEVSVVKELEDSSVLDEAMKWDSGETKLDLHGMHLGSAY 371

Query: 1398 LIFLQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLK 1219
             I LQW+D MR RF++  + ++P EITVVCG GKHS VRG+S VK+++K+M++R   P++
Sbjct: 372  FIILQWMDEMRNRFNN-EKHVIPAEITVVCGSGKHSTVRGESSVKAMVKKMMVRTSSPMR 430

Query: 1218 IDRKNVGCFIAKGKVFRDWLC 1156
            + R N+GCFIAKG V +DWLC
Sbjct: 431  VHRNNIGCFIAKGHVVKDWLC 451


>gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica]
          Length = 447

 Score =  431 bits (1107), Expect = e-117
 Identities = 218/421 (51%), Positives = 292/421 (69%)
 Frame = -1

Query: 2418 ALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXX 2239
            A++KQG RF            RD+  T + LI KF+ SS K +AL+              
Sbjct: 33   AVTKQGQRFLTKLAAN----ARDAKVTNK-LIAKFLTSSTKSIALNTLSYLLSPDTTLPH 87

Query: 2238 XSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVC 2059
             S++A+P Y  ITEASWF WN KLVA ++A++ K  Q +EAE LI ET+ K+G +ER + 
Sbjct: 88   LSSLALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETISKLGSRERELA 147

Query: 2058 NFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAE 1879
             F+C L+ES +K   K      Y+Y+  +   SSS YVK RA+ESMV  LC++ +PREA+
Sbjct: 148  LFHCQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVKNRAFESMVSGLCEMDRPREAD 207

Query: 1878 DLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSL 1699
            +L+EEMR  GLK S FE R++VY YG++GL EDM + V +++NQG  +DT+C+NMVLSS 
Sbjct: 208  NLIEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDTICSNMVLSSY 267

Query: 1698 GTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLT 1519
            G H EL+ M+ WL++MKSL + FS+RTYNSVLNSC TI+ MLQ+ K  P S+E L   L 
Sbjct: 268  GAHSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAMLQEPKDFPCSIEELNGVLN 327

Query: 1518 KDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 1339
             DE L+V EL+ S+VLDEVM W   E KLDLHGMHL  +YLI L+W + MR RF+SG   
Sbjct: 328  GDEALLVKELVESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFEAMRCRFNSGKD- 386

Query: 1338 LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 1159
            ++P E+ V+CG GKHS+VRG+SPVK L+K+M+LRM+ P++IDRKNVGCF+AKG+  +DWL
Sbjct: 387  VIPAEVVVICGSGKHSSVRGESPVKGLVKQMMLRMESPMRIDRKNVGCFVAKGRAVKDWL 446

Query: 1158 C 1156
            C
Sbjct: 447  C 447


>gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]
          Length = 517

 Score =  418 bits (1075), Expect = e-114
 Identities = 219/435 (50%), Positives = 291/435 (66%), Gaps = 1/435 (0%)
 Frame = -1

Query: 2457 SPCRRPGLSLRLR-ALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALD 2281
            SP R    S  ++ AL+KQGHRF              +++    LI KFV SSPK ++L+
Sbjct: 89   SPTRSAAASSSIQCALTKQGHRFLSTLSINAG-----NASAANKLIGKFVASSPKSISLN 143

Query: 2280 XXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLIL 2101
                           ++ ++ LY  I EASWF ++ KLVA + A++ K  ++ EAE LI 
Sbjct: 144  ALSHLLSPDTTHTHLTSHSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIA 203

Query: 2100 ETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESM 1921
            E + K+G ++R +  FYC+L+ES +K   K      Y Y+  +   SSS YVK RA+E+M
Sbjct: 204  EAVSKLGHRQRELAVFYCSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYVKCRAFETM 263

Query: 1920 VRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGF 1741
            V +LC + +P EAE LMEEMR  GLK S FE R+LVY YG++GL EDM R V +++ +G 
Sbjct: 264  VGALCTMDRPCEAESLMEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGL 323

Query: 1740 ELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIK 1561
             +DT+C+NMVLSS G H EL +MV WLQ+M++  I FS+RTYNSVLN CPTI  MLQD+K
Sbjct: 324  VIDTICSNMVLSSYGAHNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLK 383

Query: 1560 SVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQW 1381
             +P+SM  L   L  DE L+V EL+GSSVL+EV+ W+S E+KLDLHGMHL  +YLI L+W
Sbjct: 384  DIPLSMYELNATLRGDEGLLVMELVGSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEW 443

Query: 1380 IDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNV 1201
            ++ M  RF+ GN   +P E+ VVCG GKHS VRG SPVK L+KEM+++MK P+KIDRKN 
Sbjct: 444  MEEMTRRFNDGNHG-IPAEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKNA 502

Query: 1200 GCFIAKGKVFRDWLC 1156
            GCF+AKGK  RDWLC
Sbjct: 503  GCFLAKGKTVRDWLC 517


>ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Fragaria vesca subsp. vesca]
          Length = 448

 Score =  417 bits (1071), Expect = e-113
 Identities = 215/437 (49%), Positives = 294/437 (67%), Gaps = 1/437 (0%)
 Frame = -1

Query: 2463 RCSPCRRPGLSLRLR-ALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVA 2287
            R  P +   LSL+++ AL+KQG RF              + +    LI KF+++SPK  A
Sbjct: 18   RHDPPQHSKLSLQIQCALTKQGQRFLTKLAANAG-----NPSVANKLISKFLSTSPKSTA 72

Query: 2286 LDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETL 2107
            L                S++A+P+Y  ITEASWF WN KLVA ++A++ K  Q  ++E L
Sbjct: 73   LTTLSYLLSPHTAHPHLSSLALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQSEAL 132

Query: 2106 ILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYE 1927
            I ET+ K+G +ER +  F+C L+ES +K   K        Y+  +   SSS YVK+RA+E
Sbjct: 133  ISETISKLGNKERELVQFHCQLVESHSKMSSKCGFDRACTYLHQLLQNSSSVYVKRRAFE 192

Query: 1926 SMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQ 1747
            SMV  LC + +P EA++L+EEMR  GLK S FE R++VY YG++G+ E+M + V +++ Q
Sbjct: 193  SMVGGLCAMDRPGEADELIEEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVDQMEKQ 252

Query: 1746 GFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQD 1567
            GF  DT+C NMVLSS G H EL+ M +WL++MK   + FSVRTYNSVLNSCPTI+ MLQ+
Sbjct: 253  GFGDDTICCNMVLSSYGAHNELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIMAMLQE 312

Query: 1566 IKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFL 1387
             K+VP S+  L   L  DE LVV EL+GS+V+DE M W+S+E KLDLHGMHL  +YL+ L
Sbjct: 313  PKAVPCSVGELSGVLDGDEALVVKELVGSAVVDEAMVWDSAEAKLDLHGMHLGSAYLVML 372

Query: 1386 QWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRK 1207
            +W + M  RF S  + +VP E+ +VCGLGKHS+VRG+SPVK L+KEM+ +M+ P++IDRK
Sbjct: 373  EWFEAMGNRFKSA-ECVVPAEVVIVCGLGKHSSVRGESPVKDLVKEMMHQMESPMRIDRK 431

Query: 1206 NVGCFIAKGKVFRDWLC 1156
            NVGCFIAKG+  +DWLC
Sbjct: 432  NVGCFIAKGRAVKDWLC 448


>ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539507|gb|EEF41095.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 460

 Score =  409 bits (1050), Expect = e-111
 Identities = 208/424 (49%), Positives = 287/424 (67%)
 Frame = -1

Query: 2427 RLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXX 2248
            R  ALSKQG RF             D+ AT R LI+KFV +SPK +ALD           
Sbjct: 41   RCAALSKQGQRFLSSLAIATTKG--DTVATNR-LIKKFVAASPKSIALDALSHLLNPHSS 97

Query: 2247 XXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQER 2068
                S++A  LYL I EA WF WN KLVADV+A + K  ++DE+ TL+ +++ K+ ++ER
Sbjct: 98   HSHLSSLAFTLYLKIAEARWFQWNPKLVADVVAFLDKQGRYDESATLVSDSISKLQVKER 157

Query: 2067 NVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPR 1888
            ++  FYCNL+ES +K        +    +  +   S+S YVK++ Y+SMV  LC++G+PR
Sbjct: 158  DLARFYCNLVESQSKQNSIRGFDNSVASLMQLVCNSNSVYVKRQGYKSMVNGLCEMGRPR 217

Query: 1887 EAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVL 1708
            EAE L+EEM + G++ S FE + +VYAYG +G  E+M + + +++  GF +DTVC+NM+L
Sbjct: 218  EAETLIEEMGKEGVRPSMFEFKCVVYAYGSLGSFEEMNKCLHQMERAGFRVDTVCSNMIL 277

Query: 1707 SSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLK 1528
            +S G H  L EMV WLQ+MK L I FS+RT NS LNSCPTI+ M+Q+    PIS+  L+K
Sbjct: 278  ASYGAHNALPEMVLWLQKMKDLGIPFSLRTCNSALNSCPTIMSMMQNSNDFPISIHDLMK 337

Query: 1527 NLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSG 1348
             L++DE L+V E++ SSVLDE M+W+ +E KLDLHG HL  +YLI L WI+ MR RF S 
Sbjct: 338  ILSEDEALLVKEIVTSSVLDEAMKWDVAEAKLDLHGTHLCSAYLIILLWIEEMRKRFKSV 397

Query: 1347 NQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFR 1168
            N  + PTEITVVCG G HS VRG+SPVK ++K+ ++R + P++IDR+N+GCFIAKGKV  
Sbjct: 398  N-YVNPTEITVVCGSGNHSIVRGESPVKCMVKDFMVRARSPMRIDRRNIGCFIAKGKVVE 456

Query: 1167 DWLC 1156
            +WLC
Sbjct: 457  EWLC 460


>ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223617 [Cucumis sativus]
          Length = 1296

 Score =  408 bits (1048), Expect = e-111
 Identities = 214/436 (49%), Positives = 291/436 (66%), Gaps = 5/436 (1%)
 Frame = -1

Query: 2442 PGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 2263
            P L ++  +L+KQ HRF             D SAT R LIRKFV SSPK + L       
Sbjct: 36   PNLQVKCTSLTKQTHRFLSTLSTTAATG--DQSATNR-LIRKFVASSPKSITLSVLSNIV 92

Query: 2262 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 2083
                      + A+ LY  ITEASWF WN+KLVAD++A + ++  + E+E LI E + K+
Sbjct: 93   STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKL 152

Query: 2082 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 1903
            G QER + NFY  L+ES +KH  +    D Y+ +  +   S S YVK+RAYESMV  LC 
Sbjct: 153  GSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLELLYNSPSVYVKRRAYESMVTGLCS 212

Query: 1902 IGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 1723
            + +P EAE+L++EMR  G+  + +E R+++YAYG +GL E+MKR++ +++N   ELDTVC
Sbjct: 213  MKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272

Query: 1722 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 1552
            +NMVLSS G H +L +MV WLQRMK S     SVRTYNSVLNSCP I  MLQD KS  +P
Sbjct: 273  SNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLP 332

Query: 1551 ISMEHLLKNLTKD-EVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 1378
            + +E L+  L  D E L+V EL+ GSSVL+E+M W++ E+KLDLHG H+  +Y+I LQWI
Sbjct: 333  VLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392

Query: 1377 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 1198
              MR  F      ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G
Sbjct: 393  KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451

Query: 1197 CFIAKGKVFRDWLCCL 1150
            CFI+KGK  ++WLC L
Sbjct: 452  CFISKGKAVKNWLCSL 467


>ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204365 [Cucumis sativus]
          Length = 1913

 Score =  408 bits (1048), Expect = e-111
 Identities = 214/436 (49%), Positives = 291/436 (66%), Gaps = 5/436 (1%)
 Frame = -1

Query: 2442 PGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 2263
            P L ++  +L+KQ HRF             D SAT R LIRKFV SSPK + L       
Sbjct: 36   PNLQVKCTSLTKQTHRFLSTLSTTAATG--DQSATNR-LIRKFVASSPKSITLSVLSNIV 92

Query: 2262 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 2083
                      + A+ LY  ITEASWF WN+KLVAD++A + ++  + E+E LI E + K+
Sbjct: 93   STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKL 152

Query: 2082 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 1903
            G QER + NFY  L+ES +KH  +    D Y+ +  +   S S YVK+RAYESMV  LC 
Sbjct: 153  GSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLELLYNSPSVYVKRRAYESMVTGLCS 212

Query: 1902 IGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 1723
            + +P EAE+L++EMR  G+  + +E R+++YAYG +GL E+MKR++ +++N   ELDTVC
Sbjct: 213  MKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272

Query: 1722 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 1552
            +NMVLSS G H +L +MV WLQRMK S     SVRTYNSVLNSCP I  MLQD KS  +P
Sbjct: 273  SNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLP 332

Query: 1551 ISMEHLLKNLTKD-EVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 1378
            + +E L+  L  D E L+V EL+ GSSVL+E+M W++ E+KLDLHG H+  +Y+I LQWI
Sbjct: 333  VLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392

Query: 1377 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 1198
              MR  F      ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G
Sbjct: 393  KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451

Query: 1197 CFIAKGKVFRDWLCCL 1150
            CFI+KGK  ++WLC L
Sbjct: 452  CFISKGKAVKNWLCSL 467


>gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucumis melo]
          Length = 488

 Score =  407 bits (1045), Expect = e-110
 Identities = 212/436 (48%), Positives = 290/436 (66%), Gaps = 5/436 (1%)
 Frame = -1

Query: 2442 PGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 2263
            P L ++   L+KQ HRF             D SAT R LIRKFV SSPK + L       
Sbjct: 36   PNLQVKCTTLTKQTHRFLSTLSTTGATG--DQSATNR-LIRKFVASSPKSITLSVLSNIV 92

Query: 2262 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 2083
                      + A+ LY  ITEASWF WN+KLVAD++A + ++  + E+E LI E + K+
Sbjct: 93   STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKL 152

Query: 2082 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 1903
            G QER + NFY  L+ES +KH  +    D Y+ +  +   S S YVK+RAYESMV  LC 
Sbjct: 153  GSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFELLYNSPSVYVKRRAYESMVTGLCS 212

Query: 1902 IGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 1723
            + +P EAE L++EMR  G+  + +E R+++YAYG +GL E+MKR++ +++N   ELDTVC
Sbjct: 213  MKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272

Query: 1722 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 1552
            +NMVLSS G H +L +M+ WLQRMK S   + SVRTYNSVLNSCP I  MLQD KS  +P
Sbjct: 273  SNMVLSSYGAHNKLGDMLLWLQRMKTSSHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLP 332

Query: 1551 ISMEHLLKNLTKDE--VLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 1378
            + +E L+  L  DE  +LV   L+GSSVL+E+M W++ E+KLDLHG H+  +Y+I LQWI
Sbjct: 333  VLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392

Query: 1377 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 1198
              MR  F   +  ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G
Sbjct: 393  KEMRLNFEDESN-VIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451

Query: 1197 CFIAKGKVFRDWLCCL 1150
            CFI+KGK  ++WLC L
Sbjct: 452  CFISKGKAVKNWLCSL 467


>gb|AGH33847.1| PPR [Cucumis melo]
          Length = 488

 Score =  406 bits (1044), Expect = e-110
 Identities = 212/436 (48%), Positives = 289/436 (66%), Gaps = 5/436 (1%)
 Frame = -1

Query: 2442 PGLSLRLRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 2263
            P L ++   L+KQ HRF             D SAT R LIRKFV SSPK + L       
Sbjct: 36   PNLQVKCTTLTKQTHRFLSTLSTTAATG--DQSATNR-LIRKFVASSPKSITLSVLSNIV 92

Query: 2262 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 2083
                      + A+ LY  ITEASWF WN+KLVAD++A + ++  + E+E LI E + K+
Sbjct: 93   STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKL 152

Query: 2082 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 1903
            G QER + NFY  L+ES +KH  +    D Y+ +  +   S S YVK+RAYESMV  LC 
Sbjct: 153  GSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFELLYNSPSVYVKRRAYESMVTGLCS 212

Query: 1902 IGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 1723
            + +P EAE L++EMR  G+  + +E R+++YAYG +GL E+MKR++ +++N   ELDTVC
Sbjct: 213  MKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272

Query: 1722 ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 1552
            +NMVLSS G H +L +M+ WLQRMK S   + SVRTYNSVLNSCP I  MLQD KS  +P
Sbjct: 273  SNMVLSSYGAHNKLGDMLLWLQRMKTSPHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLP 332

Query: 1551 ISMEHLLKNLTKDE--VLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 1378
            + +E L+  L  DE  +LV   L+GSSVL+E+M W++ E+KLDLHG H+  +Y+I LQWI
Sbjct: 333  VLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392

Query: 1377 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 1198
              MR  F      ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G
Sbjct: 393  KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451

Query: 1197 CFIAKGKVFRDWLCCL 1150
            CFI+KGK  ++WLC L
Sbjct: 452  CFISKGKAVKNWLCSL 467


>ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella]
            gi|482566151|gb|EOA30340.1| hypothetical protein
            CARUB_v10013465mg [Capsella rubella]
          Length = 516

 Score =  400 bits (1028), Expect = e-108
 Identities = 207/420 (49%), Positives = 283/420 (67%)
 Frame = -1

Query: 2415 LSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 2236
            L KQGH+F             D  AT R LI+KFV +SPK VAL+               
Sbjct: 99   LMKQGHQFLSSLSSPALAG--DPPATNR-LIKKFVAASPKSVALNVLSHLLSDNTSHPHL 155

Query: 2235 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 2056
            S  A  LYL ITEASWF+WN KL+ ++++++ K E+F E+ETL+   + ++   ER+   
Sbjct: 156  SYFAPQLYLEITEASWFDWNPKLIGELVSLLNKQERFVESETLLSTAVSRLESNERDFAL 215

Query: 2055 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAED 1876
            F CNL+ES++K    +  SD  + ++ I   SSS YVK +AY+SMV  LC++ QP +AE 
Sbjct: 216  FLCNLVESNSKQGSIQGFSDACSRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPLDAER 275

Query: 1875 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 1696
            ++EEMR   +K   FE ++++Y YG++GL +DM R V  ++ QG ++DTVC+NMVLSS G
Sbjct: 276  VIEEMRMETIKPGLFEYKSVLYGYGRLGLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYG 335

Query: 1695 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 1516
             H  L +M SWLQ++K   +  S+RTYNSVLNSCPTI+ +L+D+ S P+S+  LL  L +
Sbjct: 336  AHDALPQMGSWLQKLKGYNVPLSIRTYNSVLNSCPTIISLLKDLDSCPLSLSELLPILNE 395

Query: 1515 DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 1336
            DE L+V EL  S VLDE +EWN+ E KLDLHGMHLS SYLI LQW+D  R RFS   + +
Sbjct: 396  DEALLVRELTQSLVLDEAIEWNAVEGKLDLHGMHLSASYLIMLQWMDETRLRFSEDKKCV 455

Query: 1335 VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 1156
            VP EI VV G GKHS VRG+SPVK+++K++++R K P++IDRKNVG FIAKGK  ++WLC
Sbjct: 456  VPAEIVVVSGSGKHSNVRGESPVKAMVKKIMVRTKSPMRIDRKNVGSFIAKGKNVKEWLC 515


>ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum]
            gi|557110519|gb|ESQ50810.1| hypothetical protein
            EUTSA_v10022675mg [Eutrema salsugineum]
          Length = 469

 Score =  395 bits (1015), Expect = e-107
 Identities = 207/434 (47%), Positives = 290/434 (66%), Gaps = 4/434 (0%)
 Frame = -1

Query: 2445 RPGLSLRLRA----LSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDX 2278
            R  + +R +A    L KQGHRF             D SAT R  I+KFV +SPK V+L+ 
Sbjct: 39   RTSMEVRCKAGTVPLMKQGHRFLSSLSSPALAG--DPSATNRH-IKKFVAASPKSVSLNV 95

Query: 2277 XXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILE 2098
                          S  A+ LY  ITEASWF+WN KL+A+++A++ K E+  E+ETL+  
Sbjct: 96   LSHLLSAQTSHPHLSFFALSLYSEITEASWFDWNPKLIAELVALLNKQERSHESETLLSN 155

Query: 2097 TMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMV 1918
             + ++   ER++  FYCNL+ES++K    +  ++    ++ I   S+S YVK +AY+SMV
Sbjct: 156  AVSRLKSNERDIALFYCNLVESNSKQGSIQGFNEACVRLREITRRSTSVYVKTQAYKSMV 215

Query: 1917 RSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFE 1738
              LC++ QP +AE ++EEMR   +K   FE ++++Y YG++GL EDM R V  ++ +G +
Sbjct: 216  SGLCNMDQPHDAESVIEEMRIAKIKPGLFEYKSVLYGYGRLGLFEDMNRVVHRMETEGHK 275

Query: 1737 LDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKS 1558
            +DTVC+NMVLSS G H  L +M SWLQ++K   +  S RTYNSVLNSCPTI+ +L+D+ S
Sbjct: 276  IDTVCSNMVLSSYGAHNALPQMGSWLQKLKDSNVPLSERTYNSVLNSCPTILSLLKDLDS 335

Query: 1557 VPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 1378
             P+S+  LL  L KDE ++V  L  SSVLDE +EW+S E KLDLHGMHLS SYLI +QW+
Sbjct: 336  CPVSLSELLTFLNKDEEVLVRGLTQSSVLDEAIEWSSLEGKLDLHGMHLSSSYLIMMQWM 395

Query: 1377 DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 1198
            D MR RFS G + +VP EI +V G GKHS VRG+SPVK+L+K++++R   P++IDRKN+G
Sbjct: 396  DEMRIRFSEG-KCVVPAEIVLVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNIG 454

Query: 1197 CFIAKGKVFRDWLC 1156
             FIAKGK  ++WLC
Sbjct: 455  SFIAKGKTVKEWLC 468


>ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein
            [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 505

 Score =  393 bits (1010), Expect = e-106
 Identities = 202/399 (50%), Positives = 275/399 (68%)
 Frame = -1

Query: 2352 DSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNA 2173
            D SA  R  I+KFV +SPK VAL+               S  A+ LY  ITEASWF+WN 
Sbjct: 108  DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNP 166

Query: 2172 KLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDL 1993
            KL+A++IA++ K E+FDE+ETL+   + ++   ER+   F CNL+ES++K    +  S+ 
Sbjct: 167  KLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEA 226

Query: 1992 YNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALV 1813
               ++ I   SSS YVK +AY+SMV  LC++ QP +AE ++EEMR   +K   FE ++++
Sbjct: 227  SFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVL 286

Query: 1812 YAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQ 1633
            Y YG++GL +DM R V  +  +G ++DTVC+NMVLSS G H  L +M SWLQ++K   + 
Sbjct: 287  YGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVP 346

Query: 1632 FSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEW 1453
            FS+RTYNSVLNSCPTI+ ML+D+ S P+S+  L   L +DE L+V EL  SSVLDE +EW
Sbjct: 347  FSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQSSVLDEAIEW 406

Query: 1452 NSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQS 1273
            N+ E KLDLHGMHLS SYLI LQW+D  R RFS   + ++P EI VV G GKHS VRG+S
Sbjct: 407  NAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCVIPAEIVVVSGSGKHSNVRGES 465

Query: 1272 PVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 1156
            PVK+L+K++++R   P++IDRKNVG FIAKGK  ++WLC
Sbjct: 466  PVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504


>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score =  393 bits (1010), Expect = e-106
 Identities = 202/399 (50%), Positives = 275/399 (68%)
 Frame = -1

Query: 2352 DSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNA 2173
            D SA  R  I+KFV +SPK VAL+               S  A+ LY  ITEASWF+WN 
Sbjct: 104  DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNP 162

Query: 2172 KLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDL 1993
            KL+A++IA++ K E+FDE+ETL+   + ++   ER+   F CNL+ES++K    +  S+ 
Sbjct: 163  KLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEA 222

Query: 1992 YNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALV 1813
               ++ I   SSS YVK +AY+SMV  LC++ QP +AE ++EEMR   +K   FE ++++
Sbjct: 223  SFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVL 282

Query: 1812 YAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQ 1633
            Y YG++GL +DM R V  +  +G ++DTVC+NMVLSS G H  L +M SWLQ++K   + 
Sbjct: 283  YGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVP 342

Query: 1632 FSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEW 1453
            FS+RTYNSVLNSCPTI+ ML+D+ S P+S+  L   L +DE L+V EL  SSVLDE +EW
Sbjct: 343  FSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQSSVLDEAIEW 402

Query: 1452 NSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQS 1273
            N+ E KLDLHGMHLS SYLI LQW+D  R RFS   + ++P EI VV G GKHS VRG+S
Sbjct: 403  NAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCVIPAEIVVVSGSGKHSNVRGES 461

Query: 1272 PVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 1156
            PVK+L+K++++R   P++IDRKNVG FIAKGK  ++WLC
Sbjct: 462  PVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 500


>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein
            [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown
            protein [Arabidopsis thaliana]
            gi|330251481|gb|AEC06575.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  393 bits (1010), Expect = e-106
 Identities = 202/399 (50%), Positives = 275/399 (68%)
 Frame = -1

Query: 2352 DSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNA 2173
            D SA  R  I+KFV +SPK VAL+               S  A+ LY  ITEASWF+WN 
Sbjct: 107  DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHLSFFALSLYSEITEASWFDWNP 165

Query: 2172 KLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDL 1993
            KL+A++IA++ K E+FDE+ETL+   + ++   ER+   F CNL+ES++K    +  S+ 
Sbjct: 166  KLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTLFLCNLVESNSKQGSIQGFSEA 225

Query: 1992 YNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAEDLMEEMRELGLKQSDFEIRALV 1813
               ++ I   SSS YVK +AY+SMV  LC++ QP +AE ++EEMR   +K   FE ++++
Sbjct: 226  SFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAERVIEEMRMEKIKPGLFEYKSVL 285

Query: 1812 YAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQ 1633
            Y YG++GL +DM R V  +  +G ++DTVC+NMVLSS G H  L +M SWLQ++K   + 
Sbjct: 286  YGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYGAHDALPQMGSWLQKLKGFNVP 345

Query: 1632 FSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEW 1453
            FS+RTYNSVLNSCPTI+ ML+D+ S P+S+  L   L +DE L+V EL  SSVLDE +EW
Sbjct: 346  FSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNEDEALLVHELTQSSVLDEAIEW 405

Query: 1452 NSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQS 1273
            N+ E KLDLHGMHLS SYLI LQW+D  R RFS   + ++P EI VV G GKHS VRG+S
Sbjct: 406  NAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCVIPAEIVVVSGSGKHSNVRGES 464

Query: 1272 PVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 1156
            PVK+L+K++++R   P++IDRKNVG FIAKGK  ++WLC
Sbjct: 465  PVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503


>ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa]
            gi|550331693|gb|EEE86893.2| hypothetical protein
            POPTR_0009s14120g [Populus trichocarpa]
          Length = 473

 Score =  389 bits (1000), Expect = e-105
 Identities = 201/425 (47%), Positives = 287/425 (67%), Gaps = 2/425 (0%)
 Frame = -1

Query: 2424 LRALSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXX 2245
            L A+SKQ  RF             D+SAT R LI+KFV SSPK +ALD            
Sbjct: 53   LAAISKQAQRFFSAVLPTVA--TSDTSATNR-LIKKFVASSPKSIALDALSNLLSPDSTH 109

Query: 2244 XXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQER 2068
                  + +PLYL I+EASWF+WN KLVA V+ ++ K     E + L+ ET+ ++  +ER
Sbjct: 110  HPLLYLLTLPLYLKISEASWFSWNPKLVAQVVVLLDKQGLDKELKALMSETVSRLQFKER 169

Query: 2067 NVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPR 1888
             +  FYCNLI  ++KH       D Y+ +    + S+S YVKK+ Y++M+  LC++G+ R
Sbjct: 170  ELVLFYCNLIGFNSKHNWVRGFDDSYSRLNQFVSDSNSVYVKKQGYKAMISGLCEMGRAR 229

Query: 1887 EAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVL 1708
            EAEDL+ EMRE GLK   FE R ++Y YG++GL +DM+R + ++++   E+DTVCANMVL
Sbjct: 230  EAEDLIGEMRERGLKPKLFEFRCVLYGYGRLGLFKDMERILDKMESGEIEVDTVCANMVL 289

Query: 1707 SSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDI-KSVPISMEHLL 1531
            +S G H  L EM  WL++MK+L I  S+RT NSVLNSCPTI+ +++++  S P+S++ LL
Sbjct: 290  ASYGAHNALPEMGLWLRKMKTLGIPLSIRTCNSVLNSCPTIMALMRNLDASYPVSIQELL 349

Query: 1530 KNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSS 1351
            K L+++E ++V EL+ SSVL E  +W++SE KLDLHGMHL  +Y+I LQW++  R R S 
Sbjct: 350  KILSEEEAMLVKELIESSVLKEATKWDTSEGKLDLHGMHLGSAYVIMLQWMEETRNRLSD 409

Query: 1350 GNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVF 1171
            G + ++P EITVVCG G HS VRG+SPVKS++ E++ + + P++IDRKN+GCF+AKG V 
Sbjct: 410  G-EHVIPAEITVVCGSGNHSTVRGESPVKSMITEIMAQTRSPMRIDRKNIGCFVAKGNVV 468

Query: 1170 RDWLC 1156
            + WLC
Sbjct: 469  KKWLC 473


>ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297329872|gb|EFH60291.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 504

 Score =  388 bits (997), Expect = e-105
 Identities = 202/420 (48%), Positives = 281/420 (66%)
 Frame = -1

Query: 2415 LSKQGHRFXXXXXXXXXXAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 2236
            L KQG RF             D SAT R  I+KFV +SPK V L+               
Sbjct: 88   LMKQGDRFLSSLSSPALAG--DPSATHRH-IKKFVAASPKSVTLNVLSHLLSDQTSYPHL 144

Query: 2235 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 2056
            S  A+ LY  ITEASWF+WN KL+A+++AV+   E+FDE+ETL+   + ++   ER+   
Sbjct: 145  SFFALSLYSEITEASWFDWNPKLIAELVAVLNNQERFDESETLLSTAVSRLKSNERDFAL 204

Query: 2055 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPREAED 1876
            F CNL+ES++K    +  ++    ++     SSS YVK +AY+SMV  LC++ QP +AE 
Sbjct: 205  FLCNLVESNSKQGSIQGFNEACFRLRERIQRSSSVYVKTQAYKSMVAGLCNMDQPHDAER 264

Query: 1875 LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 1696
            ++EEMR   +K   FE ++++Y YG++GL +DM R V  ++ +G ++DTVC+NMVLSS G
Sbjct: 265  VIEEMRVEKIKPGSFEHKSVLYGYGRLGLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYG 324

Query: 1695 THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 1516
             H  L +M SWLQ++K   + FS+RTYNSVLNSCPTI+ +L+D+ S P+S+  L   L +
Sbjct: 325  AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIMSLLKDLNSCPVSLSELRTFLNE 384

Query: 1515 DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 1336
            DE L+V EL  S+VLDE +EWN+ E KLDLHGMHLS SYLI LQW+D +R RF    + +
Sbjct: 385  DEALLVLELTQSTVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDEIRLRFRD-QKCV 443

Query: 1335 VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 1156
            +P EI VV G GKHS VRG+SPVK+L+K++++R + P++IDRKNVG FIAKGK  ++WLC
Sbjct: 444  IPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTESPMRIDRKNVGSFIAKGKNVKEWLC 503


Top