BLASTX nr result

ID: Catharanthus23_contig00006008 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00006008
         (1615 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containi...   470   e-130
ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containi...   468   e-129
ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containi...   456   e-125
gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein...   454   e-125
ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citr...   441   e-121
gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus pe...   438   e-120
ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containi...   426   e-116
gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]     426   e-116
ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223...   421   e-115
ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204...   421   e-115
gb|AGH33847.1| PPR [Cucumis melo]                                     420   e-114
gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucu...   420   e-114
ref|XP_002521239.1| pentatricopeptide repeat-containing protein,...   419   e-114
ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Caps...   410   e-111
ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutr...   405   e-110
ref|NP_849962.1| pentatricopeptide repeat-containing protein [Ar...   403   e-109
dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]           403   e-109
ref|NP_565402.1| pentatricopeptide repeat-containing protein [Ar...   403   e-109
ref|XP_002884032.1| pentatricopeptide repeat-containing protein ...   398   e-108
ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Popu...   395   e-107

>ref|XP_006360892.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum tuberosum]
          Length = 459

 Score =  470 bits (1210), Expect = e-130
 Identities = 242/437 (55%), Positives = 322/437 (73%), Gaps = 1/437 (0%)
 Frame = -3

Query: 1430 RRCSPCRRPGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVA 1251
            RR  PC R        +LSKQGHRFL++L         D SAT R L+RKFV SS KHVA
Sbjct: 23   RRPRPCPRC-------SLSKQGHRFLSTLIAADSE---DISAT-RHLLRKFVASSSKHVA 71

Query: 1250 LDXXXXXXXXXXXXXXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAET 1074
            L                  ++A+PLYL I+EASWF+WN+KLVAD++A++YK E+FDEAET
Sbjct: 72   LSTLSHLVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVADLVALLYKLERFDEAET 131

Query: 1073 LILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAY 894
            L+ ET+ K+G +ER++C+FY  LI S +KH  +  V D    +K +   SSS Y+K+R Y
Sbjct: 132  LVTETVSKLGSRERDLCSFYSQLIHSQSKHNSERGVLDFCTKLKLVLLRSSSVYLKQRGY 191

Query: 893  ESMVRSLCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQN 714
             SMV   C IG P++AE+LMEEM+ELGLK S FE R+LVY+YGK G + DMKR V+E+++
Sbjct: 192  ASMVEGFCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMES 251

Query: 713  QGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQ 534
             GF+LDTV +NMVL+S G+H ELSE+VS LQ++++  + FS+RTYNSVLNSCPTI L+LQ
Sbjct: 252  MGFQLDTVSSNMVLNSFGSHNELSEVVSSLQKIEASGVPFSIRTYNSVLNSCPTISLLLQ 311

Query: 533  DIKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIF 354
            D+KSVP+S+E L+ NL ++E ++V  L+GSSVL+E M+W  SE+KLDLHGMHL+ +Y+I 
Sbjct: 312  DLKSVPLSLEELMGNLDENEAVLVNILVGSSVLEETMQWKPSELKLDLHGMHLTSAYVII 371

Query: 353  LQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDR 174
            LQW   ++ +F + N++L P EI VVCG GKHS VRG+SPVK L+KE++LR+ CPL+IDR
Sbjct: 372  LQWFHQLQCKFLAENRVL-PGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDR 430

Query: 173  KNVGCFIAKGKVFRDWL 123
            KN+GCFIAKGK F +WL
Sbjct: 431  KNIGCFIAKGKSFMEWL 447


>ref|XP_004248641.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Solanum lycopersicum]
          Length = 459

 Score =  468 bits (1204), Expect = e-129
 Identities = 242/431 (56%), Positives = 321/431 (74%), Gaps = 1/431 (0%)
 Frame = -3

Query: 1412 RRPGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXX 1233
            RRP    R  +LSKQGHRFL++L  T      D SAT R L+RKFV SS KHVAL     
Sbjct: 23   RRPRPGPRC-SLSKQGHRFLSTLIATDSD---DISAT-RHLLRKFVGSSSKHVALSTLSH 77

Query: 1232 XXXXXXXXXXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETM 1056
                         ++A+PLYL I+EASWF+WN+KLVA+++A++YK E+FDEAETL+ E++
Sbjct: 78   LVSPTTTSHYRLCSLALPLYLEISEASWFDWNSKLVAELVALLYKLERFDEAETLVTESV 137

Query: 1055 KKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRS 876
             K+G +ER++C+FY  LI S +KH  +  V D    +K +   SSS Y+K+R Y SMV  
Sbjct: 138  SKLGSRERDLCSFYSQLIYSQSKHNSERGVLDYCTKLKLVLLHSSSVYLKQRGYASMVEG 197

Query: 875  LCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELD 696
             C IG P++AE+LMEEM+ELGLK S FE R+LVY+YGK G + DMKR V+E++  GF+LD
Sbjct: 198  FCLIGLPRKAEELMEEMKELGLKLSKFEFRSLVYSYGKSGYLRDMKRIVVEMERMGFQLD 257

Query: 695  TVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVP 516
            TV +NMVL+S G+H ELSE+VS LQ++++  + FS+RTYNSVLNSCPTI L+LQD+KSVP
Sbjct: 258  TVGSNMVLNSFGSHNELSELVSSLQKIEASGVLFSIRTYNSVLNSCPTISLLLQDLKSVP 317

Query: 515  ISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDV 336
            +S+E L+ NL ++E ++V  L+GSSVL+E M+W   E+KLDLHGMHL+ +YLI LQW   
Sbjct: 318  LSLEELMGNLDENEAVLVKILVGSSVLEETMQWKPKELKLDLHGMHLTSAYLIILQWFHQ 377

Query: 335  MRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCF 156
            ++ +F + N++L P EI VVCG GKHS VRG+SPVK L+KE++LR+ CPL+IDRKNVGCF
Sbjct: 378  LQCKFLAENRVL-PGEIIVVCGAGKHSVVRGESPVKRLIKEILLRIGCPLRIDRKNVGCF 436

Query: 155  IAKGKVFRDWL 123
            IAKGKVF +WL
Sbjct: 437  IAKGKVFMEWL 447


>ref|XP_002278390.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033
            [Vitis vinifera] gi|297744557|emb|CBI37819.3| unnamed
            protein product [Vitis vinifera]
          Length = 435

 Score =  456 bits (1173), Expect = e-125
 Identities = 231/421 (54%), Positives = 314/421 (74%)
 Frame = -3

Query: 1382 ALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXX 1203
            ALSKQG  FL+S+A       RD SA+ R LI KF+ SS K +AL+              
Sbjct: 24   ALSKQGQLFLSSVA-------RDPSASNR-LICKFIASSSKSIALNALSHLLSPTTTHPY 75

Query: 1202 XSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVC 1023
             S++A+PLY  I+EASWF+WN KL+ADVIA++YK  Q  EAETL+ ET+ K+G +ER++ 
Sbjct: 76   LSSLALPLYSRISEASWFSWNPKLIADVIALLYKQGQLKEAETLVSETLIKLGSRERDLV 135

Query: 1022 NFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAE 843
            +FYCNLI+S +KH   + V D+ + +  I + SSS YVK+RAY+SM+ SLC +G P EAE
Sbjct: 136  SFYCNLIDSHSKHSSNQGVFDVISRLSRIVSESSSVYVKERAYKSMISSLCAVGLPLEAE 195

Query: 842  DLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSL 663
            +L+EEMR  GLK S FE R++VY YG++GL EDM+R ++++ N+GFELDTV +NMVLSS 
Sbjct: 196  NLIEEMRVKGLKPSVFEFRSVVYGYGRVGLSEDMQRILLQMGNEGFELDTVVSNMVLSSY 255

Query: 662  GTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLT 483
            G + + SEMVSWLQRMK+  I FS+RTYNSVLNSCP I+ +LQD+K+ P +++ L++ L 
Sbjct: 256  GAYNKQSEMVSWLQRMKNSSIPFSIRTYNSVLNSCPMIMSILQDLKTFPPTIDELMETLK 315

Query: 482  KDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 303
             DE L+V EL+GS VL E+MEW+ SE KLDLHGMHL  +YLI LQW + +R+R ++  + 
Sbjct: 316  GDEALLVKELIGSMVLAELMEWDCSEGKLDLHGMHLGSAYLIMLQWREELRYRLNAA-EY 374

Query: 302  LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 123
            ++P EITVVCG GKHS+VRG+SPVK +++EM+ R + P+KIDRKN+GCF+AK KV ++WL
Sbjct: 375  VMPVEITVVCGSGKHSSVRGESPVKRMVREMMTRTRSPMKIDRKNIGCFVAKAKVVKNWL 434

Query: 122  C 120
            C
Sbjct: 435  C 435


>gb|EOX97560.1| Pentatricopeptide (PPR) repeat-containing protein, putative
            [Theobroma cacao]
          Length = 456

 Score =  454 bits (1169), Expect = e-125
 Identities = 226/420 (53%), Positives = 308/420 (73%), Gaps = 1/420 (0%)
 Frame = -3

Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200
            L+KQGHRF +SLA T   A  +  AT   LI+KFV SSPK +AL+               
Sbjct: 34   LTKQGHRFFSSLAAT---ADVNDPATANRLIKKFVASSPKSIALNALSHLLSPRNSHPHL 90

Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020
            SA+A PLY  I+E SW+NWN KLVA++IA++ K  ++DE+E LI + + K+  +ER++  
Sbjct: 91   SALAFPLYTKISETSWYNWNPKLVAELIALLVKQGRYDESEALISQAVSKLKFRERDLVQ 150

Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840
            FYCN IES +KH  KE  +D Y Y+  +   SSS YVK++ Y+SMV SLC++ +P EAE+
Sbjct: 151  FYCNWIESCSKHNSKEGFNDAYCYLSELICNSSSVYVKRQGYKSMVSSLCEMDRPNEAEN 210

Query: 839  LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660
            L+EEMR+ GL  + FE R + Y YG++GL EDM+R V E++ +GFE+DT+C+NMVLSS G
Sbjct: 211  LVEEMRKNGLTPTLFEFRFISYGYGQLGLFEDMERMVCEMEIEGFEVDTICSNMVLSSYG 270

Query: 659  THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480
             +   S+MV WLQ+MK+L+I FS+RTYNSVLNSCP I+ ++Q + SVP+S+  L K L +
Sbjct: 271  AYNAFSKMVPWLQKMKTLQIPFSIRTYNSVLNSCPEIMSLVQGLDSVPLSLGELAKILNE 330

Query: 479  DEVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 303
            DE L+V EL+  SSVLDE MEWN SE KLDLHGMHL  +YLI LQWI+ M+ RF    + 
Sbjct: 331  DEALLVQELVKSSSVLDEAMEWNGSEGKLDLHGMHLGSAYLIMLQWIEEMKCRFKV-EEC 389

Query: 302  LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 123
            ++P +IT+VCG GKHS+VRG+SPVK+LM++M+++MK P+KIDRKN+GCFIAKG+V ++WL
Sbjct: 390  VIPAQITIVCGSGKHSSVRGESPVKTLMRKMMVKMKSPMKIDRKNIGCFIAKGQVVKNWL 449


>ref|XP_006422522.1| hypothetical protein CICLE_v10028424mg [Citrus clementina]
            gi|568866680|ref|XP_006486677.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At2g17033-like [Citrus sinensis]
            gi|557524456|gb|ESR35762.1| hypothetical protein
            CICLE_v10028424mg [Citrus clementina]
          Length = 451

 Score =  441 bits (1134), Expect = e-121
 Identities = 232/441 (52%), Positives = 305/441 (69%), Gaps = 5/441 (1%)
 Frame = -3

Query: 1427 RCSPCRRPGLSL---RLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKH 1257
            RC   R+  L+L       L+KQG RFL+SLA    +  RDS A  R LI KFV SSP+ 
Sbjct: 16   RCCRLRQQRLTLVQCLTARLTKQGQRFLSSLAL---AVTRDSKAASR-LISKFVASSPQF 71

Query: 1256 VALDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAE 1077
            +AL+               S++A PLY+ ITE SWF WN KLVA++IA + K  Q +EAE
Sbjct: 72   IALNALSHLLSPDTTHPRLSSLAFPLYMRITEESWFQWNPKLVAEIIAFLDKQGQREEAE 131

Query: 1076 TLILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRA 897
            TLILET+ K+G +ER +  FYCNLI+S  KH  K    D Y  +  +   SSS YVK++A
Sbjct: 132  TLILETLSKLGSRERELVLFYCNLIDSFCKHDSKRGFDDTYARLNQLVNSSSSVYVKRQA 191

Query: 896  YESMVRSLCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQ 717
             +SM+  LC++GQP EAE+L+EEMR  GL+ S FE + ++Y YG++GL+EDM+R V +++
Sbjct: 192  LKSMISGLCEMGQPHEAENLIEEMRVKGLEPSGFEYKCIIYGYGRLGLLEDMERIVNQME 251

Query: 716  NQGFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLML 537
            + G  +DTVC+NMVLSS G H ELS MV WLQ+MK   I FSVRTYNSVLNSC TI+ ML
Sbjct: 252  SDGTRVDTVCSNMVLSSYGDHNELSRMVLWLQKMKDSGIPFSVRTYNSVLNSCSTIMSML 311

Query: 536  QDIKS--VPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSY 363
            QD+ S   P+S+  L + L ++EV VV EL  SSVLDE M+W+S E KLDLHGMHL  +Y
Sbjct: 312  QDLNSNDFPLSILELTEVLNEEEVSVVKELEDSSVLDEAMKWDSGETKLDLHGMHLGSAY 371

Query: 362  LIFLQWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLK 183
             I LQW+D MR RF++  + ++P EITVVCG GKHS VRG+S VK+++K+M++R   P++
Sbjct: 372  FIILQWMDEMRNRFNN-EKHVIPAEITVVCGSGKHSTVRGESSVKAMVKKMMVRTSSPMR 430

Query: 182  IDRKNVGCFIAKGKVFRDWLC 120
            + R N+GCFIAKG V +DWLC
Sbjct: 431  VHRNNIGCFIAKGHVVKDWLC 451


>gb|EMJ01929.1| hypothetical protein PRUPE_ppa021547mg [Prunus persica]
          Length = 447

 Score =  438 bits (1127), Expect = e-120
 Identities = 221/421 (52%), Positives = 296/421 (70%)
 Frame = -3

Query: 1382 ALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXX 1203
            A++KQG RFLT LA       RD+  T + LI KF+ SS K +AL+              
Sbjct: 33   AVTKQGQRFLTKLAANA----RDAKVTNK-LIAKFLTSSTKSIALNTLSYLLSPDTTLPH 87

Query: 1202 XSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVC 1023
             S++A+P Y  ITEASWF WN KLVA ++A++ K  Q +EAE LI ET+ K+G +ER + 
Sbjct: 88   LSSLALPFYSKITEASWFEWNPKLVAALVALLDKQGQHNEAEVLISETISKLGSRERELA 147

Query: 1022 NFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAE 843
             F+C L+ES +K   K      Y+Y+  +   SSS YVK RA+ESMV  LC++ +P+EA+
Sbjct: 148  LFHCQLVESHSKLSSKHGFDSSYSYLYQLLHNSSSVYVKNRAFESMVSGLCEMDRPREAD 207

Query: 842  DLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSL 663
            +L+EEMR  GLK S FE R++VY YG++GL EDM + V +++NQG  +DT+C+NMVLSS 
Sbjct: 208  NLIEEMRVRGLKPSVFEFRSVVYGYGRLGLFEDMLKVVEQMENQGIAIDTICSNMVLSSY 267

Query: 662  GTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLT 483
            G H EL+ M+ WL++MKSL + FS+RTYNSVLNSC TI+ MLQ+ K  P S+E L   L 
Sbjct: 268  GAHSELAAMLVWLRKMKSLSLPFSIRTYNSVLNSCLTIMAMLQEPKDFPCSIEELNGVLN 327

Query: 482  KDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQM 303
             DE L+V EL+ S+VLDEVM W   E KLDLHGMHL  +YLI L+W + MR RF+SG   
Sbjct: 328  GDEALLVKELVESTVLDEVMVWEPLEAKLDLHGMHLGSAYLILLEWFEAMRCRFNSGKD- 386

Query: 302  LVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWL 123
            ++P E+ V+CG GKHS+VRG+SPVK L+K+M+LRM+ P++IDRKNVGCF+AKG+  +DWL
Sbjct: 387  VIPAEVVVICGSGKHSSVRGESPVKGLVKQMMLRMESPMRIDRKNVGCFVAKGRAVKDWL 446

Query: 122  C 120
            C
Sbjct: 447  C 447


>ref|XP_004292639.1| PREDICTED: pentatricopeptide repeat-containing protein At2g17033-like
            [Fragaria vesca subsp. vesca]
          Length = 448

 Score =  426 bits (1096), Expect = e-116
 Identities = 219/437 (50%), Positives = 298/437 (68%), Gaps = 1/437 (0%)
 Frame = -3

Query: 1427 RCSPCRRPGLSLRLR-ALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVA 1251
            R  P +   LSL+++ AL+KQG RFLT LA         + +    LI KF+++SPK  A
Sbjct: 18   RHDPPQHSKLSLQIQCALTKQGQRFLTKLAANA-----GNPSVANKLISKFLSTSPKSTA 72

Query: 1250 LDXXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETL 1071
            L                S++A+P+Y  ITEASWF WN KLVA ++A++ K  Q  ++E L
Sbjct: 73   LTTLSYLLSPHTAHPHLSSLALPMYSKITEASWFEWNPKLVAALVALLAKQGQQSQSEAL 132

Query: 1070 ILETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYE 891
            I ET+ K+G +ER +  F+C L+ES +K   K        Y+  +   SSS YVK+RA+E
Sbjct: 133  ISETISKLGNKERELVQFHCQLVESHSKMSSKCGFDRACTYLHQLLQNSSSVYVKRRAFE 192

Query: 890  SMVRSLCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQ 711
            SMV  LC + +P EA++L+EEMR  GLK S FE R++VY YG++G+ E+M + V +++ Q
Sbjct: 193  SMVGGLCAMDRPGEADELIEEMRVKGLKASVFEFRSVVYGYGRLGMFEEMLKIVDQMEKQ 252

Query: 710  GFELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQD 531
            GF  DT+C NMVLSS G H EL+ M +WL++MK   + FSVRTYNSVLNSCPTI+ MLQ+
Sbjct: 253  GFGDDTICCNMVLSSYGAHNELAAMANWLRKMKESSVPFSVRTYNSVLNSCPTIMAMLQE 312

Query: 530  IKSVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFL 351
             K+VP S+  L   L  DE LVV EL+GS+V+DE M W+S+E KLDLHGMHL  +YL+ L
Sbjct: 313  PKAVPCSVGELSGVLDGDEALVVKELVGSAVVDEAMVWDSAEAKLDLHGMHLGSAYLVML 372

Query: 350  QWIDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRK 171
            +W + M  RF S  + +VP E+ +VCGLGKHS+VRG+SPVK L+KEM+ +M+ P++IDRK
Sbjct: 373  EWFEAMGNRFKSA-ECVVPAEVVIVCGLGKHSSVRGESPVKDLVKEMMHQMESPMRIDRK 431

Query: 170  NVGCFIAKGKVFRDWLC 120
            NVGCFIAKG+  +DWLC
Sbjct: 432  NVGCFIAKGRAVKDWLC 448


>gb|EXB24044.1| hypothetical protein L484_006076 [Morus notabilis]
          Length = 517

 Score =  426 bits (1094), Expect = e-116
 Identities = 222/435 (51%), Positives = 296/435 (68%), Gaps = 1/435 (0%)
 Frame = -3

Query: 1421 SPCRRPGLSLRLR-ALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALD 1245
            SP R    S  ++ AL+KQGHRFL++L+    +A     +    LI KFV SSPK ++L+
Sbjct: 89   SPTRSAAASSSIQCALTKQGHRFLSTLSINAGNA-----SAANKLIGKFVASSPKSISLN 143

Query: 1244 XXXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLIL 1065
                           ++ ++ LY  I EASWF ++ KLVA + A++ K  ++ EAE LI 
Sbjct: 144  ALSHLLSPDTTHTHLTSHSLHLYSKIREASWFVYSPKLVAALAALLDKQGRYSEAEALIA 203

Query: 1064 ETMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESM 885
            E + K+G ++R +  FYC+L+ES +K   K      Y Y+  +   SSS YVK RA+E+M
Sbjct: 204  EAVSKLGHRQRELAVFYCSLVESHSKQSSKHGFDSSYAYLYQLLRDSSSAYVKCRAFETM 263

Query: 884  VRSLCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGF 705
            V +LC + +P EAE LMEEMR  GLK S FE R+LVY YG++GL EDM R V +++ +G 
Sbjct: 264  VGALCTMDRPCEAESLMEEMRHKGLKPSVFEFRSLVYGYGRLGLWEDMLRTVNQMEIEGL 323

Query: 704  ELDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIK 525
             +DT+C+NMVLSS G H EL +MV WLQ+M++  I FS+RTYNSVLN CPTI  MLQD+K
Sbjct: 324  VIDTICSNMVLSSYGAHNELQQMVLWLQKMRTSSIPFSIRTYNSVLNWCPTITAMLQDLK 383

Query: 524  SVPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQW 345
             +P+SM  L   L  DE L+V EL+GSSVL+EV+ W+S E+KLDLHGMHL  +YLI L+W
Sbjct: 384  DIPLSMYELNATLRGDEGLLVMELVGSSVLEEVLVWDSLEVKLDLHGMHLGSAYLIMLEW 443

Query: 344  IDVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNV 165
            ++ M  RF+ GN   +P E+ VVCG GKHS VRG SPVK L+KEM+++MK P+KIDRKN 
Sbjct: 444  MEEMTRRFNDGNHG-IPAEVVVVCGSGKHSNVRGVSPVKILVKEMMVQMKSPMKIDRKNA 502

Query: 164  GCFIAKGKVFRDWLC 120
            GCF+AKGK  RDWLC
Sbjct: 503  GCFLAKGKTVRDWLC 517


>ref|XP_004156246.1| PREDICTED: uncharacterized protein LOC101223617 [Cucumis sativus]
          Length = 1296

 Score =  421 bits (1083), Expect = e-115
 Identities = 219/436 (50%), Positives = 300/436 (68%), Gaps = 5/436 (1%)
 Frame = -3

Query: 1406 PGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 1227
            P L ++  +L+KQ HRFL++L+TT  +A  D SAT R LIRKFV SSPK + L       
Sbjct: 36   PNLQVKCTSLTKQTHRFLSTLSTT--AATGDQSATNR-LIRKFVASSPKSITLSVLSNIV 92

Query: 1226 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 1047
                      + A+ LY  ITEASWF WN+KLVAD++A + ++  + E+E LI E + K+
Sbjct: 93   STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKL 152

Query: 1046 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 867
            G QER + NFY  L+ES +KH  +    D Y+ +  +   S S YVK+RAYESMV  LC 
Sbjct: 153  GSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLELLYNSPSVYVKRRAYESMVTGLCS 212

Query: 866  IGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 687
            + +P EAE+L++EMR  G+  + +E R+++YAYG +GL E+MKR++ +++N   ELDTVC
Sbjct: 213  MKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272

Query: 686  ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 516
            +NMVLSS G H +L +MV WLQRMK S     SVRTYNSVLNSCP I  MLQD KS  +P
Sbjct: 273  SNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLP 332

Query: 515  ISMEHLLKNLTKD-EVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 342
            + +E L+  L  D E L+V EL+ GSSVL+E+M W++ E+KLDLHG H+  +Y+I LQWI
Sbjct: 333  VLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392

Query: 341  DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 162
              MR  F      ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G
Sbjct: 393  KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451

Query: 161  CFIAKGKVFRDWLCCL 114
            CFI+KGK  ++WLC L
Sbjct: 452  CFISKGKAVKNWLCSL 467


>ref|XP_004141623.1| PREDICTED: uncharacterized protein LOC101204365 [Cucumis sativus]
          Length = 1913

 Score =  421 bits (1083), Expect = e-115
 Identities = 219/436 (50%), Positives = 300/436 (68%), Gaps = 5/436 (1%)
 Frame = -3

Query: 1406 PGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 1227
            P L ++  +L+KQ HRFL++L+TT  +A  D SAT R LIRKFV SSPK + L       
Sbjct: 36   PNLQVKCTSLTKQTHRFLSTLSTT--AATGDQSATNR-LIRKFVASSPKSITLSVLSNIV 92

Query: 1226 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 1047
                      + A+ LY  ITEASWF WN+KLVAD++A + ++  + E+E LI E + K+
Sbjct: 93   STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLDQNGLYSESEVLISEAISKL 152

Query: 1046 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 867
            G QER + NFY  L+ES +KH  +    D Y+ +  +   S S YVK+RAYESMV  LC 
Sbjct: 153  GSQERKLVNFYSQLVESQSKHGFERGFVDSYSRLLELLYNSPSVYVKRRAYESMVTGLCS 212

Query: 866  IGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 687
            + +P EAE+L++EMR  G+  + +E R+++YAYG +GL E+MKR++ +++N   ELDTVC
Sbjct: 213  MKRPHEAENLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272

Query: 686  ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 516
            +NMVLSS G H +L +MV WLQRMK S     SVRTYNSVLNSCP I  MLQD KS  +P
Sbjct: 273  SNMVLSSYGAHNKLGDMVLWLQRMKTSPHCNSSVRTYNSVLNSCPKITAMLQDHKSTNLP 332

Query: 515  ISMEHLLKNLTKD-EVLVVGELM-GSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 342
            + +E L+  L  D E L+V EL+ GSSVL+E+M W++ E+KLDLHG H+  +Y+I LQWI
Sbjct: 333  VLIEDLIAVLDGDEEALLVEELLAGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392

Query: 341  DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 162
              MR  F      ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G
Sbjct: 393  KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451

Query: 161  CFIAKGKVFRDWLCCL 114
            CFI+KGK  ++WLC L
Sbjct: 452  CFISKGKAVKNWLCSL 467


>gb|AGH33847.1| PPR [Cucumis melo]
          Length = 488

 Score =  420 bits (1079), Expect = e-114
 Identities = 217/436 (49%), Positives = 298/436 (68%), Gaps = 5/436 (1%)
 Frame = -3

Query: 1406 PGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 1227
            P L ++   L+KQ HRFL++L+TT  +A  D SAT R LIRKFV SSPK + L       
Sbjct: 36   PNLQVKCTTLTKQTHRFLSTLSTT--AATGDQSATNR-LIRKFVASSPKSITLSVLSNIV 92

Query: 1226 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 1047
                      + A+ LY  ITEASWF WN+KLVAD++A + ++  + E+E LI E + K+
Sbjct: 93   STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKL 152

Query: 1046 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 867
            G QER + NFY  L+ES +KH  +    D Y+ +  +   S S YVK+RAYESMV  LC 
Sbjct: 153  GSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFELLYNSPSVYVKRRAYESMVTGLCS 212

Query: 866  IGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 687
            + +P EAE L++EMR  G+  + +E R+++YAYG +GL E+MKR++ +++N   ELDTVC
Sbjct: 213  MKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272

Query: 686  ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 516
            +NMVLSS G H +L +M+ WLQRMK S   + SVRTYNSVLNSCP I  MLQD KS  +P
Sbjct: 273  SNMVLSSYGAHNKLGDMLLWLQRMKTSPHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLP 332

Query: 515  ISMEHLLKNLTKDE--VLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 342
            + +E L+  L  DE  +LV   L+GSSVL+E+M W++ E+KLDLHG H+  +Y+I LQWI
Sbjct: 333  VLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392

Query: 341  DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 162
              MR  F      ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G
Sbjct: 393  KEMRLNFED-ESYVIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451

Query: 161  CFIAKGKVFRDWLCCL 114
            CFI+KGK  ++WLC L
Sbjct: 452  CFISKGKAVKNWLCSL 467


>gb|AAU04769.1| pentatricopeptide (PPR) repeat protein-like [Cucumis melo]
          Length = 488

 Score =  420 bits (1079), Expect = e-114
 Identities = 217/436 (49%), Positives = 298/436 (68%), Gaps = 5/436 (1%)
 Frame = -3

Query: 1406 PGLSLRLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXX 1227
            P L ++   L+KQ HRFL++L+TT   A  D SAT R LIRKFV SSPK + L       
Sbjct: 36   PNLQVKCTTLTKQTHRFLSTLSTT--GATGDQSATNR-LIRKFVASSPKSITLSVLSNIV 92

Query: 1226 XXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKI 1047
                      + A+ LY  ITEASWF WN+KLVAD++A + ++  + E+E LI E + K+
Sbjct: 93   STHTPQPELCSAALTLYSRITEASWFTWNSKLVADLVAFLGQNGLYSESEALISEAISKL 152

Query: 1046 GIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCD 867
            G QER + NFY  L+ES +KH  +    D Y+ +  +   S S YVK+RAYESMV  LC 
Sbjct: 153  GSQERKLVNFYSQLVESQSKHGFERGFGDSYSRLFELLYNSPSVYVKRRAYESMVTGLCS 212

Query: 866  IGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVC 687
            + +P EAE L++EMR  G+  + +E R+++YAYG +GL E+MKR++ +++N   ELDTVC
Sbjct: 213  MKRPHEAESLVKEMRSKGITPTAYEYRSIIYAYGTLGLFEEMKRSLKQMENDNIELDTVC 272

Query: 686  ANMVLSSLGTHGELSEMVSWLQRMK-SLRIQFSVRTYNSVLNSCPTIVLMLQDIKS--VP 516
            +NMVLSS G H +L +M+ WLQRMK S   + SVRTYNSVLNSCP I  MLQD KS  +P
Sbjct: 273  SNMVLSSYGAHNKLGDMLLWLQRMKTSSHCKSSVRTYNSVLNSCPKITSMLQDHKSGDLP 332

Query: 515  ISMEHLLKNLTKDE--VLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 342
            + +E L+  L  DE  +LV   L+GSSVL+E+M W++ E+KLDLHG H+  +Y+I LQWI
Sbjct: 333  VLIEDLIAILDGDEEALLVKELLVGSSVLNEIMVWDAMELKLDLHGAHVGAAYVIMLQWI 392

Query: 341  DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 162
              MR  F   +  ++P ++T++CG GKHS VRG+SPVK+L+KE+++R + PL+IDRKN G
Sbjct: 393  KEMRLNFEDESN-VIPAQVTLICGSGKHSIVRGESPVKALIKEIMVRTESPLRIDRKNTG 451

Query: 161  CFIAKGKVFRDWLCCL 114
            CFI+KGK  ++WLC L
Sbjct: 452  CFISKGKAVKNWLCSL 467


>ref|XP_002521239.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223539507|gb|EEF41095.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 460

 Score =  419 bits (1076), Expect = e-114
 Identities = 212/424 (50%), Positives = 293/424 (69%)
 Frame = -3

Query: 1391 RLRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXX 1212
            R  ALSKQG RFL+SLA  T     D+ AT R LI+KFV +SPK +ALD           
Sbjct: 41   RCAALSKQGQRFLSSLAIATTKG--DTVATNR-LIKKFVAASPKSIALDALSHLLNPHSS 97

Query: 1211 XXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQER 1032
                S++A  LYL I EA WF WN KLVADV+A + K  ++DE+ TL+ +++ K+ ++ER
Sbjct: 98   HSHLSSLAFTLYLKIAEARWFQWNPKLVADVVAFLDKQGRYDESATLVSDSISKLQVKER 157

Query: 1031 NVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQ 852
            ++  FYCNL+ES +K        +    +  +   S+S YVK++ Y+SMV  LC++G+P+
Sbjct: 158  DLARFYCNLVESQSKQNSIRGFDNSVASLMQLVCNSNSVYVKRQGYKSMVNGLCEMGRPR 217

Query: 851  EAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVL 672
            EAE L+EEM + G++ S FE + +VYAYG +G  E+M + + +++  GF +DTVC+NM+L
Sbjct: 218  EAETLIEEMGKEGVRPSMFEFKCVVYAYGSLGSFEEMNKCLHQMERAGFRVDTVCSNMIL 277

Query: 671  SSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLK 492
            +S G H  L EMV WLQ+MK L I FS+RT NS LNSCPTI+ M+Q+    PIS+  L+K
Sbjct: 278  ASYGAHNALPEMVLWLQKMKDLGIPFSLRTCNSALNSCPTIMSMMQNSNDFPISIHDLMK 337

Query: 491  NLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSG 312
             L++DE L+V E++ SSVLDE M+W+ +E KLDLHG HL  +YLI L WI+ MR RF S 
Sbjct: 338  ILSEDEALLVKEIVTSSVLDEAMKWDVAEAKLDLHGTHLCSAYLIILLWIEEMRKRFKSV 397

Query: 311  NQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFR 132
            N  + PTEITVVCG G HS VRG+SPVK ++K+ ++R + P++IDR+N+GCFIAKGKV  
Sbjct: 398  N-YVNPTEITVVCGSGNHSIVRGESPVKCMVKDFMVRARSPMRIDRRNIGCFIAKGKVVE 456

Query: 131  DWLC 120
            +WLC
Sbjct: 457  EWLC 460


>ref|XP_006297442.1| hypothetical protein CARUB_v10013465mg [Capsella rubella]
            gi|482566151|gb|EOA30340.1| hypothetical protein
            CARUB_v10013465mg [Capsella rubella]
          Length = 516

 Score =  410 bits (1053), Expect = e-111
 Identities = 210/420 (50%), Positives = 290/420 (69%)
 Frame = -3

Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200
            L KQGH+FL+SL++   +   D  AT R LI+KFV +SPK VAL+               
Sbjct: 99   LMKQGHQFLSSLSSPALAG--DPPATNR-LIKKFVAASPKSVALNVLSHLLSDNTSHPHL 155

Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020
            S  A  LYL ITEASWF+WN KL+ ++++++ K E+F E+ETL+   + ++   ER+   
Sbjct: 156  SYFAPQLYLEITEASWFDWNPKLIGELVSLLNKQERFVESETLLSTAVSRLESNERDFAL 215

Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840
            F CNL+ES++K    +  SD  + ++ I   SSS YVK +AY+SMV  LC++ QP +AE 
Sbjct: 216  FLCNLVESNSKQGSIQGFSDACSRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPLDAER 275

Query: 839  LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660
            ++EEMR   +K   FE ++++Y YG++GL +DM R V  ++ QG ++DTVC+NMVLSS G
Sbjct: 276  VIEEMRMETIKPGLFEYKSVLYGYGRLGLFDDMNRIVHRMETQGHKIDTVCSNMVLSSYG 335

Query: 659  THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480
             H  L +M SWLQ++K   +  S+RTYNSVLNSCPTI+ +L+D+ S P+S+  LL  L +
Sbjct: 336  AHDALPQMGSWLQKLKGYNVPLSIRTYNSVLNSCPTIISLLKDLDSCPLSLSELLPILNE 395

Query: 479  DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 300
            DE L+V EL  S VLDE +EWN+ E KLDLHGMHLS SYLI LQW+D  R RFS   + +
Sbjct: 396  DEALLVRELTQSLVLDEAIEWNAVEGKLDLHGMHLSASYLIMLQWMDETRLRFSEDKKCV 455

Query: 299  VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 120
            VP EI VV G GKHS VRG+SPVK+++K++++R K P++IDRKNVG FIAKGK  ++WLC
Sbjct: 456  VPAEIVVVSGSGKHSNVRGESPVKAMVKKIMVRTKSPMRIDRKNVGSFIAKGKNVKEWLC 515


>ref|XP_006409357.1| hypothetical protein EUTSA_v10022675mg [Eutrema salsugineum]
            gi|557110519|gb|ESQ50810.1| hypothetical protein
            EUTSA_v10022675mg [Eutrema salsugineum]
          Length = 469

 Score =  405 bits (1040), Expect = e-110
 Identities = 210/434 (48%), Positives = 297/434 (68%), Gaps = 4/434 (0%)
 Frame = -3

Query: 1409 RPGLSLRLRA----LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDX 1242
            R  + +R +A    L KQGHRFL+SL++   +   D SAT R  I+KFV +SPK V+L+ 
Sbjct: 39   RTSMEVRCKAGTVPLMKQGHRFLSSLSSPALAG--DPSATNRH-IKKFVAASPKSVSLNV 95

Query: 1241 XXXXXXXXXXXXXXSAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILE 1062
                          S  A+ LY  ITEASWF+WN KL+A+++A++ K E+  E+ETL+  
Sbjct: 96   LSHLLSAQTSHPHLSFFALSLYSEITEASWFDWNPKLIAELVALLNKQERSHESETLLSN 155

Query: 1061 TMKKIGIQERNVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMV 882
             + ++   ER++  FYCNL+ES++K    +  ++    ++ I   S+S YVK +AY+SMV
Sbjct: 156  AVSRLKSNERDIALFYCNLVESNSKQGSIQGFNEACVRLREITRRSTSVYVKTQAYKSMV 215

Query: 881  RSLCDIGQPQEAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFE 702
              LC++ QP +AE ++EEMR   +K   FE ++++Y YG++GL EDM R V  ++ +G +
Sbjct: 216  SGLCNMDQPHDAESVIEEMRIAKIKPGLFEYKSVLYGYGRLGLFEDMNRVVHRMETEGHK 275

Query: 701  LDTVCANMVLSSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKS 522
            +DTVC+NMVLSS G H  L +M SWLQ++K   +  S RTYNSVLNSCPTI+ +L+D+ S
Sbjct: 276  IDTVCSNMVLSSYGAHNALPQMGSWLQKLKDSNVPLSERTYNSVLNSCPTILSLLKDLDS 335

Query: 521  VPISMEHLLKNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWI 342
             P+S+  LL  L KDE ++V  L  SSVLDE +EW+S E KLDLHGMHLS SYLI +QW+
Sbjct: 336  CPVSLSELLTFLNKDEEVLVRGLTQSSVLDEAIEWSSLEGKLDLHGMHLSSSYLIMMQWM 395

Query: 341  DVMRFRFSSGNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVG 162
            D MR RFS G + +VP EI +V G GKHS VRG+SPVK+L+K++++R   P++IDRKN+G
Sbjct: 396  DEMRIRFSEG-KCVVPAEIVLVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNIG 454

Query: 161  CFIAKGKVFRDWLC 120
             FIAKGK  ++WLC
Sbjct: 455  SFIAKGKTVKEWLC 468


>ref|NP_849962.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|75244359|sp|Q8GWA9.1|PP157_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At2g17033 gi|26452937|dbj|BAC43545.1| unknown protein
            [Arabidopsis thaliana] gi|330251482|gb|AEC06576.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 505

 Score =  403 bits (1035), Expect = e-109
 Identities = 210/420 (50%), Positives = 287/420 (68%)
 Frame = -3

Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200
            L K G RFL+SL++   +   D SA  R  I+KFV +SPK VAL+               
Sbjct: 89   LMKHGDRFLSSLSSPALAG--DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHL 145

Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020
            S  A+ LY  ITEASWF+WN KL+A++IA++ K E+FDE+ETL+   + ++   ER+   
Sbjct: 146  SFFALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTL 205

Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840
            F CNL+ES++K    +  S+    ++ I   SSS YVK +AY+SMV  LC++ QP +AE 
Sbjct: 206  FLCNLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAER 265

Query: 839  LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660
            ++EEMR   +K   FE ++++Y YG++GL +DM R V  +  +G ++DTVC+NMVLSS G
Sbjct: 266  VIEEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYG 325

Query: 659  THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480
             H  L +M SWLQ++K   + FS+RTYNSVLNSCPTI+ ML+D+ S P+S+  L   L +
Sbjct: 326  AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNE 385

Query: 479  DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 300
            DE L+V EL  SSVLDE +EWN+ E KLDLHGMHLS SYLI LQW+D  R RFS   + +
Sbjct: 386  DEALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCV 444

Query: 299  VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 120
            +P EI VV G GKHS VRG+SPVK+L+K++++R   P++IDRKNVG FIAKGK  ++WLC
Sbjct: 445  IPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 504


>dbj|BAF01049.1| hypothetical protein [Arabidopsis thaliana]
          Length = 501

 Score =  403 bits (1035), Expect = e-109
 Identities = 210/420 (50%), Positives = 287/420 (68%)
 Frame = -3

Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200
            L K G RFL+SL++   +   D SA  R  I+KFV +SPK VAL+               
Sbjct: 85   LMKHGDRFLSSLSSPALAG--DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHL 141

Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020
            S  A+ LY  ITEASWF+WN KL+A++IA++ K E+FDE+ETL+   + ++   ER+   
Sbjct: 142  SFFALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTL 201

Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840
            F CNL+ES++K    +  S+    ++ I   SSS YVK +AY+SMV  LC++ QP +AE 
Sbjct: 202  FLCNLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAER 261

Query: 839  LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660
            ++EEMR   +K   FE ++++Y YG++GL +DM R V  +  +G ++DTVC+NMVLSS G
Sbjct: 262  VIEEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYG 321

Query: 659  THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480
             H  L +M SWLQ++K   + FS+RTYNSVLNSCPTI+ ML+D+ S P+S+  L   L +
Sbjct: 322  AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNE 381

Query: 479  DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 300
            DE L+V EL  SSVLDE +EWN+ E KLDLHGMHLS SYLI LQW+D  R RFS   + +
Sbjct: 382  DEALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCV 440

Query: 299  VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 120
            +P EI VV G GKHS VRG+SPVK+L+K++++R   P++IDRKNVG FIAKGK  ++WLC
Sbjct: 441  IPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 500


>ref|NP_565402.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|13877877|gb|AAK44016.1|AF370201_1 unknown protein
            [Arabidopsis thaliana] gi|21280879|gb|AAM44931.1| unknown
            protein [Arabidopsis thaliana]
            gi|330251481|gb|AEC06575.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 504

 Score =  403 bits (1035), Expect = e-109
 Identities = 210/420 (50%), Positives = 287/420 (68%)
 Frame = -3

Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200
            L K G RFL+SL++   +   D SA  R  I+KFV +SPK VAL+               
Sbjct: 88   LMKHGDRFLSSLSSPALAG--DPSAINRH-IKKFVAASPKSVALNVLSHLLSDQTSHPHL 144

Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020
            S  A+ LY  ITEASWF+WN KL+A++IA++ K E+FDE+ETL+   + ++   ER+   
Sbjct: 145  SFFALSLYSEITEASWFDWNPKLIAELIALLNKQERFDESETLLSTAVSRLKSNERDFTL 204

Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840
            F CNL+ES++K    +  S+    ++ I   SSS YVK +AY+SMV  LC++ QP +AE 
Sbjct: 205  FLCNLVESNSKQGSIQGFSEASFRLREIIQRSSSVYVKTQAYKSMVSGLCNMDQPHDAER 264

Query: 839  LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660
            ++EEMR   +K   FE ++++Y YG++GL +DM R V  +  +G ++DTVC+NMVLSS G
Sbjct: 265  VIEEMRMEKIKPGLFEYKSVLYGYGRLGLFDDMNRVVHRMGTEGHKIDTVCSNMVLSSYG 324

Query: 659  THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480
             H  L +M SWLQ++K   + FS+RTYNSVLNSCPTI+ ML+D+ S P+S+  L   L +
Sbjct: 325  AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIISMLKDLDSCPVSLSELRTFLNE 384

Query: 479  DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 300
            DE L+V EL  SSVLDE +EWN+ E KLDLHGMHLS SYLI LQW+D  R RFS   + +
Sbjct: 385  DEALLVHELTQSSVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDETRLRFSE-EKCV 443

Query: 299  VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 120
            +P EI VV G GKHS VRG+SPVK+L+K++++R   P++IDRKNVG FIAKGK  ++WLC
Sbjct: 444  IPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTGSPMRIDRKNVGSFIAKGKTVKEWLC 503


>ref|XP_002884032.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297329872|gb|EFH60291.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 504

 Score =  398 bits (1022), Expect = e-108
 Identities = 205/420 (48%), Positives = 288/420 (68%)
 Frame = -3

Query: 1379 LSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXXXXX 1200
            L KQG RFL+SL++   +   D SAT R  I+KFV +SPK V L+               
Sbjct: 88   LMKQGDRFLSSLSSPALAG--DPSATHRH-IKKFVAASPKSVTLNVLSHLLSDQTSYPHL 144

Query: 1199 SAVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQERNVCN 1020
            S  A+ LY  ITEASWF+WN KL+A+++AV+   E+FDE+ETL+   + ++   ER+   
Sbjct: 145  SFFALSLYSEITEASWFDWNPKLIAELVAVLNNQERFDESETLLSTAVSRLKSNERDFAL 204

Query: 1019 FYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQEAED 840
            F CNL+ES++K    +  ++    ++     SSS YVK +AY+SMV  LC++ QP +AE 
Sbjct: 205  FLCNLVESNSKQGSIQGFNEACFRLRERIQRSSSVYVKTQAYKSMVAGLCNMDQPHDAER 264

Query: 839  LMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVLSSLG 660
            ++EEMR   +K   FE ++++Y YG++GL +DM R V  ++ +G ++DTVC+NMVLSS G
Sbjct: 265  VIEEMRVEKIKPGSFEHKSVLYGYGRLGLFDDMNRVVHRMETEGHKIDTVCSNMVLSSYG 324

Query: 659  THGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDIKSVPISMEHLLKNLTK 480
             H  L +M SWLQ++K   + FS+RTYNSVLNSCPTI+ +L+D+ S P+S+  L   L +
Sbjct: 325  AHDALPQMGSWLQKLKGFNVPFSIRTYNSVLNSCPTIMSLLKDLNSCPVSLSELRTFLNE 384

Query: 479  DEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSSGNQML 300
            DE L+V EL  S+VLDE +EWN+ E KLDLHGMHLS SYLI LQW+D +R RF    + +
Sbjct: 385  DEALLVLELTQSTVLDEAIEWNAVEGKLDLHGMHLSSSYLILLQWMDEIRLRFRD-QKCV 443

Query: 299  VPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVFRDWLC 120
            +P EI VV G GKHS VRG+SPVK+L+K++++R + P++IDRKNVG FIAKGK  ++WLC
Sbjct: 444  IPAEIVVVSGSGKHSNVRGESPVKALVKKIMVRTESPMRIDRKNVGSFIAKGKNVKEWLC 503


>ref|XP_002312938.2| hypothetical protein POPTR_0009s14120g [Populus trichocarpa]
            gi|550331693|gb|EEE86893.2| hypothetical protein
            POPTR_0009s14120g [Populus trichocarpa]
          Length = 473

 Score =  395 bits (1015), Expect = e-107
 Identities = 202/425 (47%), Positives = 292/425 (68%), Gaps = 2/425 (0%)
 Frame = -3

Query: 1388 LRALSKQGHRFLTSLATTTYSAVRDSSATGRSLIRKFVNSSPKHVALDXXXXXXXXXXXX 1209
            L A+SKQ  RF +++  T   A  D+SAT R LI+KFV SSPK +ALD            
Sbjct: 53   LAAISKQAQRFFSAVLPTV--ATSDTSATNR-LIKKFVASSPKSIALDALSNLLSPDSTH 109

Query: 1208 XXXS-AVAIPLYLSITEASWFNWNAKLVADVIAVMYKHEQFDEAETLILETMKKIGIQER 1032
                  + +PLYL I+EASWF+WN KLVA V+ ++ K     E + L+ ET+ ++  +ER
Sbjct: 110  HPLLYLLTLPLYLKISEASWFSWNPKLVAQVVVLLDKQGLDKELKALMSETVSRLQFKER 169

Query: 1031 NVCNFYCNLIESSAKHQLKESVSDLYNYMKHIFTGSSSNYVKKRAYESMVRSLCDIGQPQ 852
             +  FYCNLI  ++KH       D Y+ +    + S+S YVKK+ Y++M+  LC++G+ +
Sbjct: 170  ELVLFYCNLIGFNSKHNWVRGFDDSYSRLNQFVSDSNSVYVKKQGYKAMISGLCEMGRAR 229

Query: 851  EAEDLMEEMRELGLKQSDFEIRALVYAYGKIGLVEDMKRNVIELQNQGFELDTVCANMVL 672
            EAEDL+ EMRE GLK   FE R ++Y YG++GL +DM+R + ++++   E+DTVCANMVL
Sbjct: 230  EAEDLIGEMRERGLKPKLFEFRCVLYGYGRLGLFKDMERILDKMESGEIEVDTVCANMVL 289

Query: 671  SSLGTHGELSEMVSWLQRMKSLRIQFSVRTYNSVLNSCPTIVLMLQDI-KSVPISMEHLL 495
            +S G H  L EM  WL++MK+L I  S+RT NSVLNSCPTI+ +++++  S P+S++ LL
Sbjct: 290  ASYGAHNALPEMGLWLRKMKTLGIPLSIRTCNSVLNSCPTIMALMRNLDASYPVSIQELL 349

Query: 494  KNLTKDEVLVVGELMGSSVLDEVMEWNSSEMKLDLHGMHLSCSYLIFLQWIDVMRFRFSS 315
            K L+++E ++V EL+ SSVL E  +W++SE KLDLHGMHL  +Y+I LQW++  R R S 
Sbjct: 350  KILSEEEAMLVKELIESSVLKEATKWDTSEGKLDLHGMHLGSAYVIMLQWMEETRNRLSD 409

Query: 314  GNQMLVPTEITVVCGLGKHSAVRGQSPVKSLMKEMILRMKCPLKIDRKNVGCFIAKGKVF 135
            G + ++P EITVVCG G HS VRG+SPVKS++ E++ + + P++IDRKN+GCF+AKG V 
Sbjct: 410  G-EHVIPAEITVVCGSGNHSTVRGESPVKSMITEIMAQTRSPMRIDRKNIGCFVAKGNVV 468

Query: 134  RDWLC 120
            + WLC
Sbjct: 469  KKWLC 473

