BLASTX nr result

ID: Catharanthus22_contig00007143 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00007143
         (2318 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004229730.1| PREDICTED: pentatricopeptide repeat-containi...   712   0.0  
ref|XP_006347992.1| PREDICTED: pentatricopeptide repeat-containi...   710   0.0  
ref|XP_006375170.1| pentatricopeptide repeat-containing family p...   698   0.0  
ref|XP_002327026.1| predicted protein [Populus trichocarpa]           698   0.0  
ref|XP_002270492.1| PREDICTED: pentatricopeptide repeat-containi...   687   0.0  
ref|XP_002301239.2| pentatricopeptide repeat-containing family p...   687   0.0  
ref|XP_002526471.1| pentatricopeptide repeat-containing protein,...   682   0.0  
gb|EMJ20170.1| hypothetical protein PRUPE_ppa003822mg [Prunus pe...   679   0.0  
gb|EOX96514.1| Tetratricopeptide repeat (TPR)-like superfamily p...   677   0.0  
ref|XP_004307244.1| PREDICTED: pentatricopeptide repeat-containi...   670   0.0  
ref|XP_004133941.1| PREDICTED: pentatricopeptide repeat-containi...   666   0.0  
ref|XP_006445447.1| hypothetical protein CICLE_v10019658mg [Citr...   663   0.0  
gb|EXB38379.1| hypothetical protein L484_008037 [Morus notabilis]     660   0.0  
ref|XP_003530115.1| PREDICTED: pentatricopeptide repeat-containi...   636   e-179
ref|XP_006418504.1| hypothetical protein EUTSA_v10007383mg [Eutr...   627   e-177
ref|XP_003520417.1| PREDICTED: pentatricopeptide repeat-containi...   625   e-176
ref|XP_006306047.1| hypothetical protein CARUB_v10011354mg [Caps...   620   e-175
ref|XP_002892022.1| pentatricopeptide repeat-containing protein ...   617   e-174
gb|ESW06405.1| hypothetical protein PHAVU_010G045500g [Phaseolus...   613   e-173
ref|NP_171717.2| pentatricopeptide repeat-containing protein [Ar...   607   e-171

>ref|XP_004229730.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Solanum lycopersicum]
          Length = 545

 Score =  712 bits (1837), Expect = 0.0
 Identities = 360/547 (65%), Positives = 434/547 (79%), Gaps = 12/547 (2%)
 Frame = +3

Query: 96   MLLQPSTSKLPLNSHVXXXXXXXXXXXXXXXXXFPYGLSD-STILKPI---SNHKNPVFT 263
            MLLQP+T+  P   H                  FP G  +     KP+    NH + +  
Sbjct: 1    MLLQPTTTVKP--PHQKTEKYVSFSSSLSYSLSFPSGFCNLGGFTKPLMCSKNHHSVISC 58

Query: 264  CSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQWENEGKRLSKWEICR 443
             S SQVHSYGT+DYERRP +KWNA+YKRISM + PE+GS SVLNQWENEGK+++KWE+ R
Sbjct: 59   SSTSQVHSYGTVDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWELSR 118

Query: 444  IVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIESAEEYFLKLPD 623
            ++KELRKFRR+KLA EVYEWMNNR ERFRLT+SDTAI LDLIAKV GI SAEEYF KLPD
Sbjct: 119  VIKELRKFRRYKLAFEVYEWMNNRPERFRLTTSDTAIQLDLIAKVHGISSAEEYFDKLPD 178

Query: 624  SLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLYMNLKQFDKVE 803
            +LKDKRIYG+LLNA+V SR  E+AESL++KMR RGY  HALPFNV+MTLYMNLK +DKVE
Sbjct: 179  TLKDKRIYGSLLNAFVRSRKKEQAESLLDKMRNRGYTDHALPFNVMMTLYMNLKDYDKVE 238

Query: 804  SVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPNWTTYSTLATL 983
            SVVSEM EK+I LD+YSYNIWLS  GSQGS EKME+VLEQM LDT INPNWTT+ST+AT+
Sbjct: 239  SVVSEMKEKRIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINPNWTTFSTMATM 298

Query: 984  YIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWDVYKSTFPTIP 1163
            YI+LGQ++KAE+ LK +E RITGRDRIPYHYLISLYGS+GKKE+VLR+W  Y+S FP IP
Sbjct: 299  YIKLGQMKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEDVLRIWKTYQSQFPNIP 358

Query: 1164 NLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQG-SDKAAAFYD 1340
            NLGYH++ISSL+RL+DIE AEKIY+EW+ VK  YDPRIGNLLLG+YVR+G  DKA+AF+D
Sbjct: 359  NLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFVDKASAFFD 418

Query: 1341 KMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSWKPKPRNVTNILEIC 1520
            +MI  GGKPNS T EILAE  IR+RRI EALS LKDA S++GSKSW+PKP  V++IL +C
Sbjct: 419  QMIGAGGKPNSMTCEILAEGHIRDRRISEALSCLKDAVSSEGSKSWRPKPATVSSILRLC 478

Query: 1521 EEEGDVASKEALLGMLRQVGCLDNENY-----ASGGGETPTAKDRTED--DNNDSADMLL 1679
            E+E D+ +KE LL +L+QVGCLD+E Y      S G  T + ++  +D  DN++ +D+LL
Sbjct: 479  EQEDDIQNKEVLLEVLKQVGCLDDEKYMSYIPLSNGSFTSSEREIEKDTSDNDEGSDILL 538

Query: 1680 NQLQGSL 1700
            NQLQ SL
Sbjct: 539  NQLQESL 545


>ref|XP_006347992.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Solanum tuberosum]
          Length = 545

 Score =  710 bits (1832), Expect = 0.0
 Identities = 361/547 (65%), Positives = 430/547 (78%), Gaps = 12/547 (2%)
 Frame = +3

Query: 96   MLLQPSTSKLPLNSHVXXXXXXXXXXXXXXXXXFPYGLSD-STILKPI---SNHKNPVFT 263
            MLLQP+T+  P   H                  FP G  +     KP+    NH + +  
Sbjct: 1    MLLQPTTTVKP--PHQKTENYVSFSSSLSYSLSFPSGFCNLGGFTKPLMCSKNHHSVISC 58

Query: 264  CSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQWENEGKRLSKWEICR 443
             S  QVHSYGT+DYERRP +KWNA+YKRISM + PE+GS SVLNQWENEGK+++KWE+ R
Sbjct: 59   SSTPQVHSYGTVDYERRPIVKWNAIYKRISMNDGPERGSVSVLNQWENEGKKVTKWELSR 118

Query: 444  IVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIESAEEYFLKLPD 623
            ++KELRKFRR+KLA EVYEWMNNR ERFRLT+SDTAI LDLIAKV GI SAEEYF KLPD
Sbjct: 119  VIKELRKFRRYKLAFEVYEWMNNRPERFRLTTSDTAIQLDLIAKVHGISSAEEYFEKLPD 178

Query: 624  SLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLYMNLKQFDKVE 803
            +LKDKRIYG+LLNA+V SR  E+AESL++KMR RGY  HALPFNV+MTLYMNLK ++KVE
Sbjct: 179  TLKDKRIYGSLLNAFVRSRKKEQAESLLDKMRNRGYTDHALPFNVMMTLYMNLKDYNKVE 238

Query: 804  SVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPNWTTYSTLATL 983
            SVVSEM EKKI LD+YSYNIWLS  GSQGS EKME+VLEQM LDT INPNWTT+ST+AT+
Sbjct: 239  SVVSEMKEKKIPLDIYSYNIWLSSCGSQGSIEKMEKVLEQMNLDTDINPNWTTFSTMATM 298

Query: 984  YIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWDVYKSTFPTIP 1163
            YI+LG+L+KAE+ LK +E RITGRDRIPYHYLISLYGS+GKKEEVLR+W  Y+S FP IP
Sbjct: 299  YIKLGELKKAEDSLKSVESRITGRDRIPYHYLISLYGSLGKKEEVLRIWKTYQSQFPNIP 358

Query: 1164 NLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQG-SDKAAAFYD 1340
            NLGYH++ISSL+RL+DIE AEKIY+EW+ VK  YDPRIGNLLLG+YVR+G  DKA+AF+D
Sbjct: 359  NLGYHSVISSLVRLDDIEGAEKIYDEWLPVKVHYDPRIGNLLLGYYVRKGFVDKASAFFD 418

Query: 1341 KMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSWKPKPRNVTNILEIC 1520
            +MI  GGKPNS T EILAE  IR+RRI EALS LKDA ST+GSKSW+PKP  V++IL +C
Sbjct: 419  QMIGAGGKPNSMTCEILAEGHIRDRRISEALSCLKDAVSTEGSKSWRPKPATVSSILRLC 478

Query: 1521 EEEGDVASKEALLGMLRQVGCLDNENYAS----GGGETPTAKDRTE---DDNNDSADMLL 1679
            E+E D  +KEALL +L+QVGCLD+E Y S      G   +++   E    DN + +D+LL
Sbjct: 479  EQEDDTQNKEALLEVLKQVGCLDDEKYMSYIPLSNGTITSSEPEIEKDTSDNGEGSDILL 538

Query: 1680 NQLQGSL 1700
            NQLQ SL
Sbjct: 539  NQLQESL 545


>ref|XP_006375170.1| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550323489|gb|ERP52967.1|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 539

 Score =  698 bits (1801), Expect = 0.0
 Identities = 334/515 (64%), Positives = 420/515 (81%), Gaps = 14/515 (2%)
 Frame = +3

Query: 198  PYGLSDSTILKPISNHKNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKG 377
            P+    ST+ + ++  K PV  CSISQVH+YGT+DYERRP +KWN +Y+RIS+ME+PE G
Sbjct: 25   PWKSPKSTLHQTVNYKKLPVIICSISQVHNYGTVDYERRPMIKWNGIYRRISLMENPELG 84

Query: 378  SASVLNQWENEGKRLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIH 557
            S SVLN+WENEGKRL+KWE+CR+VKELRK++R++ ALEVY+WM NR+ERFRL+ SD AI 
Sbjct: 85   SGSVLNRWENEGKRLTKWELCRVVKELRKYKRYQQALEVYDWMKNRQERFRLSPSDAAIQ 144

Query: 558  LDLIAKVRGIESAEEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAI 737
            LDLIAKVRG+ +AE++FL LP++ KD+R+YGALLNAYV +RM EKAE+L ++MR +GY  
Sbjct: 145  LDLIAKVRGVSTAEDFFLSLPNTFKDRRVYGALLNAYVQNRMREKAETLFDEMRDKGYVT 204

Query: 738  HALPFNVIMTLYMNLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVL 917
            HALPFNV MTLYMN+K++DKV+ ++SEM EK I+LD+YSYNIWLS  GSQGSA+KMEQV 
Sbjct: 205  HALPFNVTMTLYMNIKEYDKVDLMISEMNEKNIKLDIYSYNIWLSSCGSQGSADKMEQVY 264

Query: 918  EQMKLDTSINPNWTTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGS 1097
            EQMK D SINPNWTT+ST+AT+YI++GQ EKAE+CL+++E RITGRDRIPYHYL+SLYG+
Sbjct: 265  EQMKSDRSINPNWTTFSTMATMYIKMGQFEKAEDCLRRVESRITGRDRIPYHYLLSLYGN 324

Query: 1098 VGKKEEVLRVWDVYKSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRI 1277
            VG KEEV RVW++YKS FP+IPNLGYHA+ISSL+RL+DIE AEKI+EEW+S+K+ YDPRI
Sbjct: 325  VGNKEEVYRVWNIYKSIFPSIPNLGYHAIISSLVRLDDIEGAEKIFEEWLSIKTSYDPRI 384

Query: 1278 GNLLLGWYVRQGS-DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAF 1454
             NL +  YV QG+ D+A +F+D M+E GGKPNS TWEILA+  I ERR  EALS LK+AF
Sbjct: 385  ANLFIAAYVYQGNLDEAKSFFDHMLEDGGKPNSNTWEILAQGHISERRTSEALSCLKEAF 444

Query: 1455 STDGSKSWKPKPRNVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYAS------GGG 1616
             T GSKSWKP P NVT+  ++CEEE D+A+KEAL G LRQ G L ++ YAS       G 
Sbjct: 445  VTPGSKSWKPNPANVTSFFKLCEEEADMANKEALEGFLRQSGHLKDKAYASLLGMPVTGD 504

Query: 1617 ETPTAKDRT-------EDDNNDSADMLLNQLQGSL 1700
            E  T +DRT       EDD +D A+ML++ LQGSL
Sbjct: 505  ELSTKEDRTGDQIDNEEDDEDDGAEMLVSHLQGSL 539


>ref|XP_002327026.1| predicted protein [Populus trichocarpa]
          Length = 539

 Score =  698 bits (1801), Expect = 0.0
 Identities = 334/515 (64%), Positives = 420/515 (81%), Gaps = 14/515 (2%)
 Frame = +3

Query: 198  PYGLSDSTILKPISNHKNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKG 377
            P+    ST+ + +++ K PV  CSISQVH+YGT+DYERRP +KWN +Y+RIS+ME+PE G
Sbjct: 25   PWKSPKSTLHQTVNHKKLPVIICSISQVHNYGTVDYERRPMIKWNGIYRRISLMENPELG 84

Query: 378  SASVLNQWENEGKRLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIH 557
            S SVLN+WENEGKRL+KWE+CR+VKELRK++R++ ALEVY+WM NR+ERFRL+ SD AI 
Sbjct: 85   SGSVLNRWENEGKRLTKWELCRVVKELRKYKRYQQALEVYDWMKNRQERFRLSPSDAAIQ 144

Query: 558  LDLIAKVRGIESAEEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAI 737
            LDLIAKVRG+ +AE++FL LP++ KD+R+YGALLNAYV +RM EKAE+L ++MR +GY  
Sbjct: 145  LDLIAKVRGVSTAEDFFLSLPNTFKDRRVYGALLNAYVQNRMREKAETLFDEMRDKGYVT 204

Query: 738  HALPFNVIMTLYMNLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVL 917
            HALPFNV MTLYMN+K++DKV+ ++SEM EK I+LD+YSYNIWLS  GSQGSA+KMEQV 
Sbjct: 205  HALPFNVTMTLYMNIKEYDKVDLMISEMNEKNIKLDIYSYNIWLSSCGSQGSADKMEQVY 264

Query: 918  EQMKLDTSINPNWTTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGS 1097
            EQMK D SINPNWTT+ST+AT+YI++GQ EKAE+CL+++E RITGRDRIPYHYL+SLYG+
Sbjct: 265  EQMKSDRSINPNWTTFSTMATMYIKMGQFEKAEDCLRRVESRITGRDRIPYHYLLSLYGN 324

Query: 1098 VGKKEEVLRVWDVYKSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRI 1277
            VG KEEV RVW++YKS FP+IPNLGYHA+ISSL+RL+DIE AEKIYEEW+S+K+ YDPRI
Sbjct: 325  VGNKEEVYRVWNIYKSIFPSIPNLGYHAIISSLVRLDDIEGAEKIYEEWLSIKTSYDPRI 384

Query: 1278 GNLLLGWYVRQGS-DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAF 1454
             NL +  YV QG+ D+A +F+D M+E GGKPNS TWEILA+  I ERR  EALS LK+AF
Sbjct: 385  ANLFIAAYVYQGNLDEAKSFFDHMLEDGGKPNSNTWEILAQGHISERRTSEALSCLKEAF 444

Query: 1455 STDGSKSWKPKPRNVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYAS------GGG 1616
             T GSKSWKP P NVT+  ++CEEE D+A+KEAL G LRQ G L ++ YAS       G 
Sbjct: 445  VTPGSKSWKPNPANVTSFFKLCEEEADMANKEALEGFLRQSGHLKDKAYASLLGMPVTGD 504

Query: 1617 ETPTAKDRT-------EDDNNDSADMLLNQLQGSL 1700
            E  T +D T       EDD +D A+ML++ LQGSL
Sbjct: 505  ELSTKEDGTGDQIDNEEDDEDDGAEMLVSHLQGSL 539


>ref|XP_002270492.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150
            [Vitis vinifera]
          Length = 527

 Score =  687 bits (1774), Expect = 0.0
 Identities = 338/503 (67%), Positives = 405/503 (80%), Gaps = 16/503 (3%)
 Frame = +3

Query: 240  NHKNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQWENEGKR 419
            N++    TCSISQ+HSYGT+DYERRP +KWNAVY+RIS+ME+PE GSASVLNQWENEGKR
Sbjct: 28   NYRKHSITCSISQIHSYGTVDYERRPLVKWNAVYRRISLMENPEMGSASVLNQWENEGKR 87

Query: 420  LSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIESAE 599
            L+KWE+CR+VKELRKF+RFK+ALEVYEWMNNR ERFRL+SSD AI LDLIAKV G+ SAE
Sbjct: 88   LTKWELCRVVKELRKFKRFKMALEVYEWMNNRGERFRLSSSDAAIQLDLIAKVCGVSSAE 147

Query: 600  EYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLYMN 779
            +YF +LPD+LKDKRIYGALLNAYV ++M +KAE L+EK+R +GYA   LPFNV+MTLYMN
Sbjct: 148  DYFSRLPDTLKDKRIYGALLNAYVQAKMRDKAEILIEKLRNKGYATTPLPFNVMMTLYMN 207

Query: 780  LKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPNWT 959
            LK+ DKV+S++SEM  K I+LD+YSYNIWLS   S  S E+MEQV EQMKL+ +INPNWT
Sbjct: 208  LKELDKVQSMISEMMNKNIQLDIYSYNIWLS---SCESTERMEQVFEQMKLERTINPNWT 264

Query: 960  TYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWDVY 1139
            T+ST+AT+YI+LGQ EKAEECLK++E RIT RDR+PYHYLISLYGS G K EV R W++Y
Sbjct: 265  TFSTMATMYIKLGQFEKAEECLKKVESRITNRDRMPYHYLISLYGSTGNKAEVYRAWNIY 324

Query: 1140 KSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQG-S 1316
            KS FP IPNLGYHALISSL+R+ D+E AEKIYEEW+SVKS YDPRIGNLLLG YV++G  
Sbjct: 325  KSKFPNIPNLGYHALISSLVRVGDLEGAEKIYEEWLSVKSSYDPRIGNLLLGCYVKEGFL 384

Query: 1317 DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSWKPKPRN 1496
            +KA  F D MIE GGKPNSTTWEILAE     ++I +ALS  K A   +GS  WKPKP N
Sbjct: 385  EKAEGFLDHMIEAGGKPNSTTWEILAEGNTGVKKISDALSCFKRAVLAEGSNGWKPKPVN 444

Query: 1497 VTNILEICEEEGDVASKEALLGMLRQVGCLDNENYAS---------GGGETPTAKDRT-- 1643
            V+  L++CEEE D A+KEAL+G+LRQ+GCL++E YAS          G E    KDRT  
Sbjct: 445  VSAFLDLCEEEADTATKEALMGLLRQMGCLEDEPYASLFGLHTGSVTGNELSNEKDRTGA 504

Query: 1644 ----EDDNNDSADMLLNQLQGSL 1700
                ++D +D A+MLLNQ Q  L
Sbjct: 505  DKDIDEDEDDGAEMLLNQFQSGL 527


>ref|XP_002301239.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550344984|gb|EEE80512.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 539

 Score =  687 bits (1773), Expect = 0.0
 Identities = 336/551 (60%), Positives = 434/551 (78%), Gaps = 16/551 (2%)
 Frame = +3

Query: 96   MLLQPSTS--KLPLNSHVXXXXXXXXXXXXXXXXXFPYGLSDSTILKPISNHKNPVFTCS 269
            MLLQPS    K+ L+S +                  P+   + T+ + ++  K PV TCS
Sbjct: 1    MLLQPSLHHHKVSLSSTISYSHP------------LPWKNPNFTLHQTVNYKKLPVITCS 48

Query: 270  ISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQWENEGKRLSKWEICRIV 449
            ISQ+H+YGT+DYERRP +KWNA+Y+RIS+ME+PE GS SVLNQWEN+GKRL+KWE+CR+V
Sbjct: 49   ISQIHNYGTVDYERRPMMKWNAIYRRISLMENPELGSGSVLNQWENDGKRLTKWELCRVV 108

Query: 450  KELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIESAEEYFLKLPDSL 629
            KELRK++R++ ALEVY+WMNNR+ERF L+ SD AI LDLIAKVRG+ SAE++FL+LP++ 
Sbjct: 109  KELRKYKRYQQALEVYDWMNNRQERFGLSPSDAAIQLDLIAKVRGVSSAEDFFLRLPNTF 168

Query: 630  KDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLYMNLKQFDKVESV 809
            KD+RIYGALLNAYV +RM EKAESL+++MR + Y  HALP+NV+MTLYMN+ ++DKV+ +
Sbjct: 169  KDRRIYGALLNAYVRNRMREKAESLIDEMRGKDYVTHALPYNVMMTLYMNINEYDKVDLI 228

Query: 810  VSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPNWTTYSTLATLYI 989
            +SEM EK I+LD+YSYNIWLS  G QGSA+KMEQV EQMK D SINPNWTT+ST+AT+YI
Sbjct: 229  ISEMNEKNIKLDIYSYNIWLSSCGLQGSADKMEQVFEQMKSDGSINPNWTTFSTMATMYI 288

Query: 990  RLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWDVYKSTFPTIPNL 1169
            ++G+ EKAE+CL+++E RITGRDRIPYHYL+SLYG+VG KEEV RVW++YKS FP+IPNL
Sbjct: 289  KMGKFEKAEDCLRRVESRITGRDRIPYHYLLSLYGNVGNKEEVYRVWNIYKSIFPSIPNL 348

Query: 1170 GYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQGS-DKAAAFYDKM 1346
            GYHA+ISSL+R++DIE AEKIYEEW+S+K+ YDPRI NL +  +V QG+ DKA +F+D M
Sbjct: 349  GYHAMISSLVRMDDIEGAEKIYEEWLSIKTSYDPRIANLFMAAFVYQGNLDKAESFFDHM 408

Query: 1347 IEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSWKPKPRNVTNILEICEE 1526
            +E GGKPNS +WEILA+  I ERR  EALS LK+AF+T GSKSWKP P NV++  ++CEE
Sbjct: 409  LEEGGKPNSHSWEILAQGHISERRTSEALSCLKEAFATPGSKSWKPNPANVSSFFKLCEE 468

Query: 1527 EGDVASKEALLGMLRQVGCLDNENYA------SGGGETPTAKDRTED-------DNNDSA 1667
            E D+ASKEAL   LRQ G L ++ YA        G E  T ++RTED       D ++ +
Sbjct: 469  EVDMASKEALASFLRQSGHLKDKAYALLLGMPVTGDELSTKEERTEDQIDNEENDGDNGS 528

Query: 1668 DMLLNQLQGSL 1700
            +ML++QLQGSL
Sbjct: 529  EMLVSQLQGSL 539


>ref|XP_002526471.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223534146|gb|EEF35862.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 533

 Score =  682 bits (1761), Expect = 0.0
 Identities = 326/501 (65%), Positives = 415/501 (82%), Gaps = 8/501 (1%)
 Frame = +3

Query: 219  TILKPISNHKNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQ 398
            T+ + ++  K P+ TCSIS+VHSYGT+DYERRP +KWN+VY+RIS+ME PE G+A+VLN+
Sbjct: 33   TLHQTVNYRKLPI-TCSISKVHSYGTVDYERRPMIKWNSVYRRISLMEKPELGAATVLNE 91

Query: 399  WENEGKRLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKV 578
             E +GK+L+KWE+CR+VKELRK++R K ALEVY+WMNNREERFRL++SD AI LDL+AKV
Sbjct: 92   MEKDGKKLTKWELCRVVKELRKYKRHKQALEVYDWMNNREERFRLSASDAAIQLDLVAKV 151

Query: 579  RGIESAEEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNV 758
            RG+ SAE+YF++L D++KD+R+YGALLN+YV +RM EKAESL+EKMR + Y  HALPFNV
Sbjct: 152  RGVSSAEDYFMRLSDNVKDRRVYGALLNSYVKARMREKAESLIEKMRKKDYTTHALPFNV 211

Query: 759  IMTLYMNLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDT 938
            +MTLYMNLK++DKV+ ++SEM  K IRLD+YSYNIWLS RGSQGS E+ME+V EQMKLD+
Sbjct: 212  MMTLYMNLKEYDKVDMMISEMMAKNIRLDIYSYNIWLSSRGSQGSIERMEEVYEQMKLDS 271

Query: 939  SINPNWTTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEV 1118
            +INPNWTT+ST+AT+YI++GQLEKAE+CL+++E RITGRDRIPYHYL+SLYG+VG KEE+
Sbjct: 272  TINPNWTTFSTMATMYIKMGQLEKAEDCLRRVESRITGRDRIPYHYLLSLYGNVGNKEEI 331

Query: 1119 LRVWDVYKSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGW 1298
             RVW++YKS F TIPNLGYHA+ISSL+R++DIE AEKIYEEW+ VKS YDPRIGNLL+GW
Sbjct: 332  YRVWNIYKSIFATIPNLGYHAIISSLVRMDDIEGAEKIYEEWLPVKSSYDPRIGNLLMGW 391

Query: 1299 YVRQGS-DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKS 1475
            YVR G+ DKA +F+D M+E GGKPNS+TWEILA+   RE+RI EALS  K+AF   GSKS
Sbjct: 392  YVRGGNLDKAESFFDHMMEVGGKPNSSTWEILADGHTREKRISEALSCFKEAFLAQGSKS 451

Query: 1476 WKPKPRNVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYAS------GGGETPTAKD 1637
            WKPKP  +++  ++CEEE D+AS   L  +L Q G L+++ YAS         E  T KD
Sbjct: 452  WKPKPVIISSFFKLCEEEADMASTGVLEDLLAQSGYLEDKTYASLIGSSVPSNELSTEKD 511

Query: 1638 RTEDDNN-DSADMLLNQLQGS 1697
            RT D N  +  +  LNQLQG+
Sbjct: 512  RTGDRNEVEENETFLNQLQGN 532


>gb|EMJ20170.1| hypothetical protein PRUPE_ppa003822mg [Prunus persica]
          Length = 546

 Score =  679 bits (1751), Expect = 0.0
 Identities = 318/506 (62%), Positives = 415/506 (82%), Gaps = 18/506 (3%)
 Frame = +3

Query: 234  ISNHKNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQWENEG 413
            I+  + P  +CSISQVH+YGT+DYERRP +KWNA+Y++IS+ +DPE  SA VLNQWE EG
Sbjct: 39   INFQRLPSISCSISQVHNYGTVDYERRPMVKWNAIYRKISLTDDPEVRSADVLNQWEKEG 98

Query: 414  KRLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIES 593
            ++L+KWE+CR+VKELRK++R+  ALEVY+WM+NR ERFR+++SD AI LDL+AKVRG+ S
Sbjct: 99   RKLTKWELCRVVKELRKYKRYDRALEVYDWMSNRGERFRISTSDAAIQLDLVAKVRGVAS 158

Query: 594  AEEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLY 773
            AE YFL LPD+LKD+RIYGALLNAYV +RM EKAESL++KMR++G+A+ +LPFNV+MTLY
Sbjct: 159  AENYFLSLPDTLKDRRIYGALLNAYVRTRMKEKAESLLDKMRSKGHALQSLPFNVMMTLY 218

Query: 774  MNLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPN 953
            MNLK++DKV+S++SEM EK I+LD+YSYNIWLS RGSQGS E+MEQV EQMKLD ++NPN
Sbjct: 219  MNLKEYDKVDSIISEMMEKNIQLDIYSYNIWLSSRGSQGSEERMEQVFEQMKLDRTVNPN 278

Query: 954  WTTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWD 1133
            WTT+ST+AT+YI++GQLEKAE CLK++E RITGRDRIPYHYL+SLYG+VG KEE+ RVW+
Sbjct: 279  WTTFSTMATMYIKMGQLEKAEACLKKVESRITGRDRIPYHYLLSLYGNVGNKEELYRVWN 338

Query: 1134 VYKSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQG 1313
            +YKS+FP+IPNLGYHA++SSL+R+ D+E AEKIYEEW++VKS YDPRI N+ + +Y++ G
Sbjct: 339  IYKSSFPSIPNLGYHAIMSSLLRVGDVEGAEKIYEEWLTVKSTYDPRIANVFIAYYIKDG 398

Query: 1314 S-DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSWKPKP 1490
              +KA +FYD M++ GGKPNSTTWE LAE  I E+RI EALS  K+AFS +GSKSW+PKP
Sbjct: 399  DFEKAQSFYDHMVDVGGKPNSTTWETLAEGHIEEQRISEALSCWKEAFSAEGSKSWRPKP 458

Query: 1491 RNVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYAS---------GGGETPTAKDRT 1643
             NV+  LE+CE+E +  SKE  +G+L+Q G L N++YAS            +    KDRT
Sbjct: 459  VNVSAFLELCEQEANSVSKEFFMGLLKQSGQLKNKSYASLIGLADEDVSDDDLSLKKDRT 518

Query: 1644 --------EDDNNDSADMLLNQLQGS 1697
                    E +  D +++LLN+LQG+
Sbjct: 519  NITKDDDDEKEAGDGSELLLNELQGT 544


>gb|EOX96514.1| Tetratricopeptide repeat (TPR)-like superfamily protein isoform 1
            [Theobroma cacao]
          Length = 549

 Score =  677 bits (1746), Expect = 0.0
 Identities = 322/500 (64%), Positives = 410/500 (82%), Gaps = 8/500 (1%)
 Frame = +3

Query: 222  ILKPISNHKNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQW 401
            IL    +++    TCSISQ+HSYGT+DYERRP +KWNA+YK+IS+ME+PE GSASVLN+W
Sbjct: 33   ILSQTQSYQKLPVTCSISQIHSYGTVDYERRPMIKWNAIYKKISLMENPELGSASVLNEW 92

Query: 402  ENEGKRLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVR 581
            E  G++L+KWE+CR+VKELRK++R+K ALEVY+WMNNR ERFRL++SD AI LDLIAKVR
Sbjct: 93   EKGGRKLTKWELCRVVKELRKYKRYKQALEVYDWMNNRGERFRLSASDAAIQLDLIAKVR 152

Query: 582  GIESAEEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVI 761
            G+ SAE++F++LPD++KDKRIYGALLNAYV ++M +KAE+L++ MR +GYA+H LPFNV+
Sbjct: 153  GVSSAEDFFVQLPDTMKDKRIYGALLNAYVRAKMRDKAETLIDNMRGKGYAMHPLPFNVM 212

Query: 762  MTLYMNLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTS 941
            MTLYMNLK++DKVES+VSEM EK IRLD+YSYNIWLS  GSQGS EKME+V EQMK D S
Sbjct: 213  MTLYMNLKEYDKVESMVSEMMEKNIRLDIYSYNIWLSSCGSQGSVEKMEEVYEQMKQDQS 272

Query: 942  INPNWTTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVL 1121
            INPNWTT+ST+AT+YI++G  EKAEECL+ +E RITGRDRIPYHYLISLYG VG +EEV 
Sbjct: 273  INPNWTTFSTMATMYIKMGLTEKAEECLRNVESRITGRDRIPYHYLISLYGGVGNREEVY 332

Query: 1122 RVWDVYKSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWY 1301
            RVW VYKS FP+IPNLG+HA+ISSL+R  DI+ AE+IYEEW++VK+ YDPRI NLL+GWY
Sbjct: 333  RVWKVYKSIFPSIPNLGFHAVISSLVRAGDIQGAERIYEEWLTVKTSYDPRIANLLMGWY 392

Query: 1302 VRQGS-DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSW 1478
            V++G+ DKA + +  + E GGKPNS++WEILAE  I E+RIP+ALS LKDAF+T+GS+ W
Sbjct: 393  VKEGNLDKAESLFSHIAEVGGKPNSSSWEILAEGHILEKRIPDALSCLKDAFATEGSRGW 452

Query: 1479 KPKPRNVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYASGGG-------ETPTAKD 1637
            +PKP +V+    +CEE+ D+AS+E  +G+LRQ GCL NE YAS  G       E+   +D
Sbjct: 453  RPKPTSVSAFFNLCEEKVDMASREVFVGLLRQSGCLKNEAYASLIGLSEEALSESELPRD 512

Query: 1638 RTEDDNNDSADMLLNQLQGS 1697
            +    +  S+D   NQ  GS
Sbjct: 513  KNRKSSYSSSDE--NQDDGS 530


>ref|XP_004307244.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Fragaria vesca subsp. vesca]
          Length = 541

 Score =  670 bits (1729), Expect = 0.0
 Identities = 321/504 (63%), Positives = 406/504 (80%), Gaps = 17/504 (3%)
 Frame = +3

Query: 240  NHKNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMM-EDPEKGSASVLNQWENEGK 416
            N++    + SISQVH+YGT+DYERRP +KWNA+Y++IS++ +DPE  ++SVLNQWE EGK
Sbjct: 38   NYQRLTISSSISQVHNYGTVDYERRPIVKWNAIYRKISLLADDPELNASSVLNQWEKEGK 97

Query: 417  RLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIESA 596
            +LSKWE+CR+VKELRKF+R+  ALEVY+WM NR ERFR +SSD AI LDL+ KVRG+ SA
Sbjct: 98   KLSKWELCRVVKELRKFKRYGRALEVYDWMINRAERFRFSSSDAAIQLDLVGKVRGVSSA 157

Query: 597  EEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLYM 776
            E YFL LPD+LKDKRIYGALLNAYV ++M EKAESL++KMR++G+A+H LPFNV+MTLYM
Sbjct: 158  ENYFLSLPDNLKDKRIYGALLNAYVRAKMQEKAESLLDKMRSKGHALHPLPFNVMMTLYM 217

Query: 777  NLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPNW 956
            NLK+++KVES++SEM EK I+LD+YSYNIWLS RGSQGSAE+MEQV EQMKLD +INPNW
Sbjct: 218  NLKEYEKVESIISEMMEKNIQLDIYSYNIWLSSRGSQGSAERMEQVFEQMKLDRTINPNW 277

Query: 957  TTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWDV 1136
            TT+ST+AT+YI++G  EKAE CLK++E RITGRDRIPYHYL+SLYG VG K+E+ RVW+V
Sbjct: 278  TTFSTMATMYIKMGLFEKAEACLKKVESRITGRDRIPYHYLLSLYGGVGNKDEIYRVWNV 337

Query: 1137 YKSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQGS 1316
            YKS+FP+IPNLGYHA+I++LIR+ D+E AEKI+EEW++VK  YDPRI NL +  Y+ +G 
Sbjct: 338  YKSSFPSIPNLGYHAIIAALIRVGDVEGAEKIFEEWLTVKPSYDPRIVNLFIVSYIEEGD 397

Query: 1317 -DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSWKPKPR 1493
             DKA +F+D M+E GGKPNS+TWE LAE  I E+RI EALS  K+AF  +GSKSW+PKP 
Sbjct: 398  FDKAQSFFDNMVEAGGKPNSSTWEALAEGHIEEKRISEALSCWKEAFMAEGSKSWRPKPV 457

Query: 1494 NVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYA---------SGGGETPTAKDRTE 1646
            NVT   E CE+EGD+ SKE  LG+LRQ G L N++YA         S   +    KD   
Sbjct: 458  NVTTFYEFCEQEGDLRSKEIFLGLLRQSGQLKNKSYALLVGLSDEDSSDNDISLEKDSIN 517

Query: 1647 DD------NNDSADMLLNQLQGSL 1700
            D+      ++D +DMLLNQL  +L
Sbjct: 518  DNQDGDEKSDDGSDMLLNQLHSTL 541


>ref|XP_004133941.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Cucumis sativus] gi|449525818|ref|XP_004169913.1|
            PREDICTED: pentatricopeptide repeat-containing protein
            At1g02150-like [Cucumis sativus]
          Length = 537

 Score =  666 bits (1719), Expect = 0.0
 Identities = 314/469 (66%), Positives = 394/469 (84%), Gaps = 1/469 (0%)
 Frame = +3

Query: 261  TCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQWENEGKRLSKWEIC 440
            TCSISQVHSYGT+D+ERRP  KWNA+Y+RIS+ME+PE GSASVLNQWENEGK ++KWE+ 
Sbjct: 47   TCSISQVHSYGTVDFERRPMFKWNAIYRRISLMENPELGSASVLNQWENEGKNITKWELS 106

Query: 441  RIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIESAEEYFLKLP 620
            R+VKELRK++RF+ ALE+Y+WM+NREERFRLT+SD AI LDLI+KVRGI+SAEEYFL+LP
Sbjct: 107  RVVKELRKYKRFERALEIYDWMSNREERFRLTTSDAAIQLDLISKVRGIKSAEEYFLRLP 166

Query: 621  DSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLYMNLKQFDKV 800
            + LKD+RIYGALLNAY   R  EKAE+L+EKMRT+G+  H LPFNV+MTLYMN+K+++KV
Sbjct: 167  NHLKDRRIYGALLNAYAKGRQREKAENLLEKMRTKGFTTHPLPFNVMMTLYMNVKEYEKV 226

Query: 801  ESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPNWTTYSTLAT 980
            ES+VSEMTE  I+LD+YSYNIWLS  G QGS EKME+V EQMK D +IN NWTT+ST+AT
Sbjct: 227  ESLVSEMTENSIQLDIYSYNIWLSSCGLQGSTEKMEEVYEQMKQDRTINANWTTFSTMAT 286

Query: 981  LYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWDVYKSTFPTI 1160
            +YI++G +EKAEECL+++E RI GRDRIPYHYLISLYGSVG KEE+ RVW++YK+ FPTI
Sbjct: 287  MYIKMGLMEKAEECLRRVESRIVGRDRIPYHYLISLYGSVGNKEEMYRVWNIYKNVFPTI 346

Query: 1161 PNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQG-SDKAAAFY 1337
            PNLGYHA+IS+LIR+ D+E AEKIYEEW++VKS YDPRI NL +GWYV++G + KA +F+
Sbjct: 347  PNLGYHAIISALIRVGDVEGAEKIYEEWLTVKSTYDPRIANLFIGWYVKEGNTSKAESFF 406

Query: 1338 DKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSWKPKPRNVTNILEI 1517
            D M+E GGKPNS+TWEIL +   +E R+ +AL+  K+AFS +GSKSW+PKP NV    ++
Sbjct: 407  DHMVEVGGKPNSSTWEILVDRHTKEGRVSDALASWKEAFSAEGSKSWRPKPYNVLAYFDL 466

Query: 1518 CEEEGDVASKEALLGMLRQVGCLDNENYASGGGETPTAKDRTEDDNNDS 1664
            CE+EGD+ASKE L+G+LRQ   L ++ YAS  G      D T D+N  S
Sbjct: 467  CEKEGDIASKEVLVGLLRQPKYLQDKTYASLIG----LLDETIDNNEVS 511


>ref|XP_006445447.1| hypothetical protein CICLE_v10019658mg [Citrus clementina]
            gi|568819745|ref|XP_006464406.1| PREDICTED:
            pentatricopeptide repeat-containing protein
            At1g02150-like [Citrus sinensis]
            gi|557547709|gb|ESR58687.1| hypothetical protein
            CICLE_v10019658mg [Citrus clementina]
          Length = 535

 Score =  663 bits (1710), Expect = 0.0
 Identities = 311/487 (63%), Positives = 403/487 (82%), Gaps = 5/487 (1%)
 Frame = +3

Query: 246  KNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQWENEGKRLS 425
            K PV  CS+SQ+HSYGT+D+ERRP +KWNA+++++S+M++P+ GSASVLN WE  G+ L+
Sbjct: 46   KLPVIKCSMSQIHSYGTVDFERRPMIKWNAIFRKLSLMDNPQLGSASVLNDWEKGGRSLT 105

Query: 426  KWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIESAEEY 605
            KWE+CR+VKELRKFRR+K ALEVY+WMNNR ERFRL++SD AI LDLIAKV G+ SAE++
Sbjct: 106  KWELCRVVKELRKFRRYKHALEVYDWMNNRGERFRLSASDAAIQLDLIAKVHGVASAEDF 165

Query: 606  FLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLYMNLK 785
            FL LPD+LKD+R+YGALLNAYV +RM   AE L++KMR +GYA+H+LP+NV+MTLYM +K
Sbjct: 166  FLSLPDTLKDRRVYGALLNAYVRARMRGNAELLIDKMRDKGYAVHSLPYNVMMTLYMKIK 225

Query: 786  QFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPNWTTY 965
            ++D+VES+VSEM EK IRLD+YSYNIWLS  GSQGS EKME V E MK+D ++NPNWTT+
Sbjct: 226  EYDEVESMVSEMKEKGIRLDVYSYNIWLSSCGSQGSTEKMEGVFELMKVDKAVNPNWTTF 285

Query: 966  STLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWDVYKS 1145
            ST+AT+YI++GQ+EKAEE L+++E RITGRDR+PYHYL+SLYGSVGKKEEV RVW++Y+S
Sbjct: 286  STMATMYIKMGQVEKAEESLRRVESRITGRDRVPYHYLLSLYGSVGKKEEVYRVWNLYRS 345

Query: 1146 TFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQGS-DK 1322
             FP + NLGYHA+ISSL R+ DIE  EKI+EEW+SVKS YDPRI NL++ WYV++G+ DK
Sbjct: 346  VFPGVTNLGYHAMISSLARIGDIEGMEKIFEEWLSVKSSYDPRIANLMMSWYVKEGNFDK 405

Query: 1323 AAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSWKPKPRNVT 1502
            A AF++ +IE GGKPNST+WE LAE  IRERRI EALS LK AF+ +G+KSW+PKP NV 
Sbjct: 406  AEAFFNSIIEEGGKPNSTSWETLAEGHIRERRILEALSCLKGAFAAEGAKSWRPKPVNVI 465

Query: 1503 NILEICEEEGDVASKEALLGMLRQVGCLDNENYASGGGETPTA----KDRTEDDNNDSAD 1670
            N  + CEEE D+ SKEA + +LRQ G    ++Y S  G T  A      + ++D+++ ++
Sbjct: 466  NFFKACEEESDMGSKEAFVALLRQPGYRKEKDYMSLIGLTDEAVAENNKKNDEDSDEDSE 525

Query: 1671 MLLNQLQ 1691
            MLL+QLQ
Sbjct: 526  MLLSQLQ 532


>gb|EXB38379.1| hypothetical protein L484_008037 [Morus notabilis]
          Length = 546

 Score =  660 bits (1702), Expect = 0.0
 Identities = 317/494 (64%), Positives = 400/494 (80%), Gaps = 16/494 (3%)
 Frame = +3

Query: 264  CSISQ--VHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQWENEGKRLSKWEI 437
            CSISQ  +HSYGT+DYERRP +KWNA+YKRIS+ME PE GS +VL+QWE EG++LSKWE+
Sbjct: 52   CSISQSQIHSYGTVDYERRPMVKWNAIYKRISLMEKPELGSGTVLSQWEREGRQLSKWEL 111

Query: 438  CRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIESAEEYFLKL 617
            CR+VKELRK++RF  ALEVY+WMNNR ERFRL+SSD AI LDLI KVRGI SAE +FL L
Sbjct: 112  CRVVKELRKYKRFDRALEVYDWMNNRGERFRLSSSDAAIQLDLIGKVRGISSAENFFLSL 171

Query: 618  PDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLYMNLKQFDK 797
             D+ KD+RIYGALLNAYV +RM EKAESL+++MR +GYAIH+LPFNV+MTLYMNLK++ K
Sbjct: 172  SDTSKDRRIYGALLNAYVQARMKEKAESLLDRMRGKGYAIHSLPFNVMMTLYMNLKEYKK 231

Query: 798  VESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPNWTTYSTLA 977
            V+++VSEM +K I+LD+YSYNIWLSC GSQGSAE MEQV EQM+ D SINPNWTT+ST+A
Sbjct: 232  VDAMVSEMMDKNIQLDVYSYNIWLSCCGSQGSAEGMEQVFEQMQQDKSINPNWTTFSTMA 291

Query: 978  TLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWDVYKSTFPT 1157
            T+YI++GQ +KAEECL+++E RITGRDRIPYHYL+SLYGSVG KEE+ RVW VYK+ FP+
Sbjct: 292  TMYIKMGQFQKAEECLRKVESRITGRDRIPYHYLLSLYGSVGNKEEIYRVWKVYKAIFPS 351

Query: 1158 IPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQGS-DKAAAF 1334
            IPNLGYHA+ISSL+R+ DIE AE IY EW+ VKS YDPRI NL + +YVR G+ +KA + 
Sbjct: 352  IPNLGYHAIISSLLRIGDIEGAENIYNEWLPVKSSYDPRIANLFMSYYVRNGNLEKATSL 411

Query: 1335 YDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSWKPKPRNVTNILE 1514
             D +IE GGKPNS TWEILA     ERRI EALSY K+AF+ +G+K+W+PKP NV+  L+
Sbjct: 412  VDHIIEVGGKPNSATWEILAAGHTGERRISEALSYWKEAFAAEGAKNWRPKPVNVSAFLD 471

Query: 1515 ICEEEGDVASKEALLGMLRQVGCLDNENYASGGGETPTA-------------KDRTEDDN 1655
            +CE+E D+  KE L+G+LR+ G L +++YAS  G +  A             ++  +++ 
Sbjct: 472  LCEQEADLECKEVLVGLLREAGYLKDQSYASFVGFSHEAINDNGITSVDVSFENDNDENK 531

Query: 1656 NDSADMLLNQLQGS 1697
            +D + +L NQLQGS
Sbjct: 532  DDESGILFNQLQGS 545


>ref|XP_003530115.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Glycine max]
          Length = 546

 Score =  636 bits (1641), Expect = e-179
 Identities = 312/490 (63%), Positives = 394/490 (80%), Gaps = 9/490 (1%)
 Frame = +3

Query: 255  VFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQWENEGKRLSKWE 434
            V TCSIS++HSYGT+DYERRP + WN VY+RIS+  +P+ GSA VLNQWENEG+ L+KWE
Sbjct: 56   VTTCSISKIHSYGTVDYERRPIVGWNDVYRRISLNPNPQVGSAEVLNQWENEGRHLTKWE 115

Query: 435  ICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIESAEEYFLK 614
            + R+VKELRK++RF+ ALEVY+WMNNR ERFR++ SD AI LDLIAKVRG+ SAE +FL 
Sbjct: 116  LSRVVKELRKYKRFRRALEVYDWMNNRPERFRVSESDAAIQLDLIAKVRGLSSAEAFFLS 175

Query: 615  LPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLYMNLKQFD 794
            L D LKDK+ YGALLN YVHSR  EKAESL + MR++GY IHALPFNV+MTLYMNL ++ 
Sbjct: 176  LEDKLKDKKTYGALLNVYVHSRSKEKAESLFDTMRSKGYVIHALPFNVMMTLYMNLNEYA 235

Query: 795  KVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPNWTTYSTL 974
            KV+ + SEM EK I+LD+Y+YNIWLS  GSQGS EKMEQV EQM+ D SI PNW+T+ST+
Sbjct: 236  KVDILASEMMEKNIQLDIYTYNIWLSSCGSQGSVEKMEQVFEQMEKDPSIIPNWSTFSTM 295

Query: 975  ATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWDVYKSTFP 1154
            A++YIR+ Q EKAEECL+++EGRI GRDRIP+HYL+SLYGSVGKK+EV RVW+ YKS FP
Sbjct: 296  ASMYIRMDQNEKAEECLRKVEGRIKGRDRIPFHYLLSLYGSVGKKDEVCRVWNTYKSIFP 355

Query: 1155 TIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQG-SDKAAA 1331
            +IPNLGYHA+ISSL++L+DIEVAEK+YEEWISVKS YDPRIGNLL+GWYV++G +DKA +
Sbjct: 356  SIPNLGYHAIISSLVKLDDIEVAEKLYEEWISVKSSYDPRIGNLLIGWYVKKGDTDKALS 415

Query: 1332 FYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAF-STDGSKSWKPKPRNVTNI 1508
            F+++M+  G  PNS TWEIL+E  I ++RI EA+S LK+AF +  GSKSW+PKP  ++  
Sbjct: 416  FFEQMLNDGCIPNSNTWEILSEGHIADKRISEAMSCLKEAFMAAGGSKSWRPKPSYLSAF 475

Query: 1509 LEICEEEGDVASKEALLGMLRQVGCLDNENYAS---GGGETP--TAKDRTED--DNNDSA 1667
            LE+C+E+ D+ S E L+G+LRQ     ++ YAS      E P     DRT+D  D+ +  
Sbjct: 476  LELCQEQDDMESAEVLIGLLRQSKFNKSKVYASLIGSSDELPKIDTADRTDDAVDSENMD 535

Query: 1668 DMLLNQLQGS 1697
            + LLNQL  S
Sbjct: 536  NDLLNQLGSS 545


>ref|XP_006418504.1| hypothetical protein EUTSA_v10007383mg [Eutrema salsugineum]
            gi|557096275|gb|ESQ36857.1| hypothetical protein
            EUTSA_v10007383mg [Eutrema salsugineum]
          Length = 517

 Score =  627 bits (1618), Expect = e-177
 Identities = 301/499 (60%), Positives = 393/499 (78%), Gaps = 2/499 (0%)
 Frame = +3

Query: 210  SDSTILKPISNHKNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASV 389
            S S ++    + K     CSISQV+ YGT+DYERRP ++WNA+YK+IS+ME PE G+ASV
Sbjct: 25   SRSPVVSVALSKKKTAIVCSISQVYGYGTVDYERRPIIQWNAIYKKISLMEKPELGAASV 84

Query: 390  LNQWENEGKRLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLI 569
            LNQWE  G++L+KWE+CR+VKELRK++R   ALEVY+WMNNR ERFRL++SD AI LDLI
Sbjct: 85   LNQWEKGGRKLTKWELCRVVKELRKYKRPNQALEVYDWMNNRGERFRLSASDAAIQLDLI 144

Query: 570  AKVRGIESAEEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALP 749
             KVRGI  AEE+FL LP++ KD+R+YG+LLNAYV ++  EKAE+L++KMR +GYA+H LP
Sbjct: 145  GKVRGISDAEEFFLSLPENFKDRRVYGSLLNAYVRAKSREKAEALIDKMREKGYALHPLP 204

Query: 750  FNVIMTLYMNLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMK 929
            FNV+MTLYMNL+++DKV+++V EM +K IRLD+YSYNIWLS  GS GS EKMEQV +QMK
Sbjct: 205  FNVMMTLYMNLREYDKVDAMVYEMKQKDIRLDIYSYNIWLSSCGSHGSVEKMEQVYQQMK 264

Query: 930  LDTSINPNWTTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKK 1109
             D SINPNWTT+ST+AT+YI++G+ EKAE+ L+++E RITGR+RIPYHYL+SLYGSVG K
Sbjct: 265  SDVSINPNWTTFSTMATMYIKMGENEKAEDALRKVEARITGRNRIPYHYLLSLYGSVGNK 324

Query: 1110 EEVLRVWDVYKSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLL 1289
            +E+ RVW+VYKS  P+IPNLGYHAL+SSL+R+ DI+ AEK+YEEW+ VKS YDPRI NLL
Sbjct: 325  KELYRVWNVYKSVVPSIPNLGYHALVSSLVRMGDIQGAEKVYEEWLPVKSSYDPRIPNLL 384

Query: 1290 LGWYVRQGS-DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDG 1466
            +  YV+    DKA   +D MIE GGKP+S+TWEILA    R+R I EAL+ LK+AFS +G
Sbjct: 385  MNVYVKNDQLDKAEGLFDHMIEMGGKPSSSTWEILAHGHTRKRNITEALTCLKEAFSAEG 444

Query: 1467 SKSWKPKPRNVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYASGGGETPTAKDRTE 1646
            S +W+PK   ++   ++CEEE DVASKEA+L +LRQ G L +++Y +         D  E
Sbjct: 445  SSNWRPKVFMLSGFFKLCEEESDVASKEAVLELLRQSGHLQDKSYQA------LIDDAQE 498

Query: 1647 DDN-NDSADMLLNQLQGSL 1700
             ++ ++  D+LL QLQ  L
Sbjct: 499  SESESEGTDVLLTQLQDDL 517


>ref|XP_003520417.1| PREDICTED: pentatricopeptide repeat-containing protein At1g02150-like
            [Glycine max]
          Length = 555

 Score =  625 bits (1613), Expect = e-176
 Identities = 315/520 (60%), Positives = 396/520 (76%), Gaps = 22/520 (4%)
 Frame = +3

Query: 195  FPYGLSDSTILKPISNHKNP---VFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMED 365
            F +  S ST L       +P   V TCSIS +HSYGT+DYERRP ++WN VY+RIS+ ++
Sbjct: 32   FNHSSSSSTTLTHAHTCFHPRLSVVTCSISNIHSYGTVDYERRPIVRWNDVYRRISLNQN 91

Query: 366  PEKGSASVLNQWENEGKRLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSD 545
            P+ GSA VLNQWENEG+ L+KWE+ R+VKELRK++RF  ALEVY+WMNNR ERFR++ SD
Sbjct: 92   PQVGSAEVLNQWENEGRHLTKWELSRVVKELRKYKRFPRALEVYDWMNNRPERFRVSESD 151

Query: 546  TAIHLDLIAKVRGIESAEEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTR 725
             AI LDLIAKVRG+ SAE +FL L D LKDKR YGALLN YVHSR  EKAESL + MR++
Sbjct: 152  AAIQLDLIAKVRGVSSAEAFFLSLEDKLKDKRTYGALLNVYVHSRSKEKAESLFDTMRSK 211

Query: 726  GYAIHALPFNVIMTLYMNLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKM 905
            GY IHALP NV+MTLYMNL ++ KV+ + SEM EK I+LD+Y+YNIWLS  GSQGS EKM
Sbjct: 212  GYVIHALPINVMMTLYMNLNEYAKVDMLASEMMEKNIQLDIYTYNIWLSSCGSQGSVEKM 271

Query: 906  EQVLEQMKLDTSINPNWTTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLIS 1085
            EQV EQM+ D +I PNW+T+STLA++YIR+ Q EKAE+CL+++EGRI GRDRIP+HYL+S
Sbjct: 272  EQVFEQMERDPTIVPNWSTFSTLASMYIRMNQNEKAEKCLRKVEGRIKGRDRIPFHYLLS 331

Query: 1086 LYGSVGKKEEVLRVWDVYKSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRY 1265
            LYGSVGKK+EV RVW+ YKS FP IPNLGYHA+ISSL++L+DIE AEK+YEEWISVKS Y
Sbjct: 332  LYGSVGKKDEVYRVWNTYKSIFPRIPNLGYHAIISSLVKLDDIEGAEKLYEEWISVKSSY 391

Query: 1266 DPRIGNLLLGWYVRQ-GSDKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYL 1442
            DPRIGNLL+GWYV++  +DKA +F++++   G  PNS TWEIL+E  I ++RI EALS L
Sbjct: 392  DPRIGNLLMGWYVKKDDTDKALSFFEQISNDGCIPNSNTWEILSEGHIADKRISEALSCL 451

Query: 1443 KDAFS-TDGSKSWKPKPRNVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYAS---- 1607
            K+AF    GSKSW+PKP  ++  LE+C+E+ D+ S E L+G+LRQ      + YAS    
Sbjct: 452  KEAFMVAGGSKSWRPKPSYLSAFLELCQEQNDMESAEVLIGLLRQSKFSKIKVYASIIGS 511

Query: 1608 -----GGGETPT---AKDRTED-----DNNDSADMLLNQL 1688
                   GE  +     DRT+D     + +D + MLLNQL
Sbjct: 512  PDCTIDNGELQSKIDITDRTDDAVDSENMDDDSQMLLNQL 551


>ref|XP_006306047.1| hypothetical protein CARUB_v10011354mg [Capsella rubella]
            gi|482574758|gb|EOA38945.1| hypothetical protein
            CARUB_v10011354mg [Capsella rubella]
          Length = 524

 Score =  620 bits (1600), Expect = e-175
 Identities = 295/496 (59%), Positives = 390/496 (78%), Gaps = 4/496 (0%)
 Frame = +3

Query: 216  STILKPISNHKNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLN 395
            S ++    + K     CSISQV+ YGT+DYERRP ++WNA+YK+IS+ME PE G+ASVLN
Sbjct: 27   SPVICVAPSKKTAAIVCSISQVYGYGTVDYERRPIIQWNAIYKKISLMEKPELGAASVLN 86

Query: 396  QWENEGKRLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAK 575
            QWE  G++L+KWE+CR+VKELRK++R   ALEVY+WMNNR ERFRL++SD AI LDLI K
Sbjct: 87   QWEKGGRKLTKWELCRVVKELRKYKRPNQALEVYDWMNNRGERFRLSASDAAIQLDLIGK 146

Query: 576  VRGIESAEEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFN 755
            VRGI  AEE+FL LP++ KD+R+YG+LLNAYV ++  EKAE+L+  MR +GYA+H LPFN
Sbjct: 147  VRGISDAEEFFLTLPETFKDRRVYGSLLNAYVRAKSREKAEALLNTMREKGYALHPLPFN 206

Query: 756  VIMTLYMNLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLD 935
            V+MTLYMNL+++DKV+++V EM +K IRLD+YSYNIWLS  GS GS EKME V +QMK D
Sbjct: 207  VMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSD 266

Query: 936  TSINPNWTTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEE 1115
             +INPNWTT+ST+AT+YI++G++EKAE+ L+++E RITGR+RIPYHYL+SLYGSVG K+E
Sbjct: 267  VAINPNWTTFSTMATMYIKMGEIEKAEDALRKVEARITGRNRIPYHYLLSLYGSVGNKKE 326

Query: 1116 VLRVWDVYKSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLG 1295
            + RVW+VYKS  P+IPNLGYHAL+SSL+R+ DIE AEK+YEEW+ VKS YDPRI NLL+ 
Sbjct: 327  LYRVWNVYKSVAPSIPNLGYHALVSSLVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMN 386

Query: 1296 WYVRQGS-DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSK 1472
             YV+    +KA   +D M+E GGKP+S+TWEILA+   R+R IPEAL+ L+ AFS +GS 
Sbjct: 387  VYVKNDQLEKAEGLFDHMVEMGGKPSSSTWEILADGHTRKRCIPEALTCLRKAFSAEGSS 446

Query: 1473 SWKPKPRNVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYAS--GGGETPTAKDRTE 1646
            +W+PK   ++   ++CEEE D+ SKEA+L +LRQ G L +++Y +     E  T  +   
Sbjct: 447  NWRPKVLMLSGFFKLCEEESDITSKEAVLELLRQAGHLQDKSYQALIDVDENRTVNNSEN 506

Query: 1647 D-DNNDSADMLLNQLQ 1691
            D   +D  D+LL+QLQ
Sbjct: 507  DAHESDGTDVLLSQLQ 522


>ref|XP_002892022.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
            subsp. lyrata] gi|297337864|gb|EFH68281.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 523

 Score =  617 bits (1592), Expect = e-174
 Identities = 298/499 (59%), Positives = 390/499 (78%), Gaps = 4/499 (0%)
 Frame = +3

Query: 216  STILKPISNHKNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLN 395
            S +L    + K     CSISQV+ YGT+DYERRP ++WNA+YK+IS+ME PE G+ASVLN
Sbjct: 28   SPVLSVALSKKTAAIVCSISQVYGYGTVDYERRPIVQWNAIYKKISLMEKPELGAASVLN 87

Query: 396  QWENEGKRLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAK 575
            QWE  G++L+KWE+CR+VKELRK++R   ALEVY+WMNNR ERFRL++SD AI LDLI K
Sbjct: 88   QWEKGGRKLTKWELCRVVKELRKYKRPNQALEVYDWMNNRGERFRLSASDAAIQLDLIGK 147

Query: 576  VRGIESAEEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFN 755
            VRGI  AE++FL LP++ KD+R+YG+LLNAYV ++  EKAE+L+  MR +GYA+H LPFN
Sbjct: 148  VRGISDAEQFFLTLPENFKDRRVYGSLLNAYVRAKSREKAEALLHTMRDKGYALHPLPFN 207

Query: 756  VIMTLYMNLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLD 935
            V+MTLYMNL+++DKV+++V EM +K IRLD+YSYNIWLS  GS GS EKME V +QMK D
Sbjct: 208  VMMTLYMNLREYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSD 267

Query: 936  TSINPNWTTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEE 1115
             SINPNWTT+ST+AT+YI++G+ EKAE+ L+++E RITGR+RIPYHYL+SLYGSVG K+E
Sbjct: 268  VSINPNWTTFSTMATMYIKMGETEKAEDALRKVEARITGRNRIPYHYLLSLYGSVGNKKE 327

Query: 1116 VLRVWDVYKSTFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLG 1295
            + RVW+VYKS  P+IPNLGYHAL+SSL R+ DIE AEK+YEEW+ VKS YDPRI NLL+ 
Sbjct: 328  LYRVWNVYKSVVPSIPNLGYHALVSSLARMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMN 387

Query: 1296 WYVRQGS-DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSK 1472
             YV+    +KA   +D M+E GGKP+S+TWEILA+   R+R IPEAL+ L+ AFS +GS 
Sbjct: 388  VYVKNDQLEKAEGLFDHMVEMGGKPSSSTWEILADGHTRKRCIPEALTCLRKAFSAEGSS 447

Query: 1473 SWKPKPRNVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYASGGGETPTAKDRTEDD 1652
            +W+PK   ++   ++CEEE DV SKEA+L +LRQ G L+++ Y +        ++RTE++
Sbjct: 448  NWRPKVLMLSGFFKLCEEESDVTSKEAVLELLRQSGHLEDKAYQA---LIDVDENRTENN 504

Query: 1653 NNDSA---DMLLNQLQGSL 1700
            +   A   D LL QLQ  L
Sbjct: 505  SEIDAHETDALLTQLQDDL 523


>gb|ESW06405.1| hypothetical protein PHAVU_010G045500g [Phaseolus vulgaris]
          Length = 542

 Score =  613 bits (1582), Expect = e-173
 Identities = 302/517 (58%), Positives = 396/517 (76%), Gaps = 16/517 (3%)
 Frame = +3

Query: 195  FPYGLSDSTILKPISNHKN-PVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPE 371
            FP+ LS ST+   ++     P+ TCS+S+VHSYGT+DYERRP ++WN VY+RI++  DP+
Sbjct: 25   FPFKLSSSTLPFALTRATCLPLITCSVSKVHSYGTVDYERRPIVRWNEVYRRITLNPDPD 84

Query: 372  KGSASVLNQWENEGKRLSKWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTA 551
              SA VLN+WENEGK L+KWE+ R+VKELRK+++F+ ALEVY+W+NNR ERFR++ SD A
Sbjct: 85   MSSAEVLNRWENEGKHLTKWELSRVVKELRKYKKFRRALEVYDWINNRPERFRVSESDAA 144

Query: 552  IHLDLIAKVRGIESAEEYFLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGY 731
            I LDLIAKVRG  SAE +FL L D LK+KR YGALLN YVHSR+ EKAESL + MR++GY
Sbjct: 145  IQLDLIAKVRGFSSAEVFFLSLEDQLKNKRTYGALLNVYVHSRLKEKAESLFDTMRSKGY 204

Query: 732  AIHALPFNVIMTLYMNLKQFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQ 911
             +HALPFNV+MTLYMN+K++ KV+ +VSEM EKKI+LD+Y+YNIWLS  GSQGS EKMEQ
Sbjct: 205  VVHALPFNVMMTLYMNVKEYVKVDMLVSEMMEKKIQLDIYTYNIWLSSSGSQGSIEKMEQ 264

Query: 912  VLEQMKLDTSINPNWTTYSTLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLY 1091
            V EQM+ D +I PNW+T+ST+A++YIR+ Q EKAEECL+++E RI GRDRIP+HYL+SLY
Sbjct: 265  VFEQMEKDATIIPNWSTFSTMASMYIRMDQTEKAEECLRKVESRIKGRDRIPFHYLLSLY 324

Query: 1092 GSVGKKEEVLRVWDVYKSTFP-TIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYD 1268
            G V  K+EV RVW+ YK+ FP   PNLGYHA+I+SL++L+DI  AEK+Y+EW+SVKS YD
Sbjct: 325  GRVRNKDEVYRVWNSYKTVFPKNTPNLGYHAIIASLVKLDDIAGAEKLYQEWVSVKSSYD 384

Query: 1269 PRIGNLLLGWYVRQGS-DKAAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLK 1445
            PR+GNLLLGWYV  G   KA +F+ +M E GG PNS TWEIL+E  I ++RI EALS LK
Sbjct: 385  PRVGNLLLGWYVEAGDIHKALSFFKQMKEDGGFPNSNTWEILSEGYIADKRISEALSCLK 444

Query: 1446 DAFS-TDGSKSWKPKPRNVTNILEICEEEGDVASKEALLGMLRQVGCLDNENYAS---GG 1613
            DAF   D S+SW+PKP N++  LE+C+E+GD+ S E  + +L+      ++ YAS     
Sbjct: 445  DAFMVADNSRSWRPKPLNLSAFLELCQEQGDMESAETFIVLLKLSKFSKHKTYASLIGSS 504

Query: 1614 GETPTAKDRTEDDNND---------SADMLLNQLQGS 1697
            G+  ++K  T   N+D          ++MLLN+L+ S
Sbjct: 505  GDGLSSKIDTIGRNDDIHDREDMDGESEMLLNELESS 541


>ref|NP_171717.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|193806400|sp|Q8LPS6.2|PPR3_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g02150 gi|2317908|gb|AAC24372.1| Unknown protein
            [Arabidopsis thaliana] gi|332189272|gb|AEE27393.1|
            pentatricopeptide repeat-containing protein [Arabidopsis
            thaliana]
          Length = 524

 Score =  607 bits (1565), Expect = e-171
 Identities = 289/486 (59%), Positives = 378/486 (77%), Gaps = 1/486 (0%)
 Frame = +3

Query: 246  KNPVFTCSISQVHSYGTMDYERRPELKWNAVYKRISMMEDPEKGSASVLNQWENEGKRLS 425
            K     CSISQV+ YGT+DYERRP ++WNA+YK+IS+ME PE G+ASVLNQWE  G++L+
Sbjct: 39   KTAAIVCSISQVYGYGTVDYERRPIVQWNAIYKKISLMEKPELGAASVLNQWEKAGRKLT 98

Query: 426  KWEICRIVKELRKFRRFKLALEVYEWMNNREERFRLTSSDTAIHLDLIAKVRGIESAEEY 605
            KWE+CR+VKELRK++R   ALEVY+WMNNR ERFRL++SD AI LDLI KVRGI  AEE+
Sbjct: 99   KWELCRVVKELRKYKRANQALEVYDWMNNRGERFRLSASDAAIQLDLIGKVRGIPDAEEF 158

Query: 606  FLKLPDSLKDKRIYGALLNAYVHSRMAEKAESLMEKMRTRGYAIHALPFNVIMTLYMNLK 785
            FL+LP++ KD+R+YG+LLNAYV ++  EKAE+L+  MR +GYA+H LPFNV+MTLYMNL+
Sbjct: 159  FLQLPENFKDRRVYGSLLNAYVRAKSREKAEALLNTMRDKGYALHPLPFNVMMTLYMNLR 218

Query: 786  QFDKVESVVSEMTEKKIRLDLYSYNIWLSCRGSQGSAEKMEQVLEQMKLDTSINPNWTTY 965
            ++DKV+++V EM +K IRLD+YSYNIWLS  GS GS EKME V +QMK D SI PNWTT+
Sbjct: 219  EYDKVDAMVFEMKQKDIRLDIYSYNIWLSSCGSLGSVEKMELVYQQMKSDVSIYPNWTTF 278

Query: 966  STLATLYIRLGQLEKAEECLKQIEGRITGRDRIPYHYLISLYGSVGKKEEVLRVWDVYKS 1145
            ST+AT+YI++G+ EKAE+ L+++E RITGR+RIPYHYL+SLYGS+G K+E+ RVW VYKS
Sbjct: 279  STMATMYIKMGETEKAEDALRKVEARITGRNRIPYHYLLSLYGSLGNKKELYRVWHVYKS 338

Query: 1146 TFPTIPNLGYHALISSLIRLNDIEVAEKIYEEWISVKSRYDPRIGNLLLGWYVRQGS-DK 1322
              P+IPNLGYHAL+SSL+R+ DIE AEK+YEEW+ VKS YDPRI NLL+  YV+    + 
Sbjct: 339  VVPSIPNLGYHALVSSLVRMGDIEGAEKVYEEWLPVKSSYDPRIPNLLMNAYVKNDQLET 398

Query: 1323 AAAFYDKMIEHGGKPNSTTWEILAESRIRERRIPEALSYLKDAFSTDGSKSWKPKPRNVT 1502
            A   +D M+E GGKP+S+TWEILA    R+R I EAL+ L++AFS +GS +W+PK   ++
Sbjct: 399  AEGLFDHMVEMGGKPSSSTWEILAVGHTRKRCISEALTCLRNAFSAEGSSNWRPKVLMLS 458

Query: 1503 NILEICEEEGDVASKEALLGMLRQVGCLDNENYASGGGETPTAKDRTEDDNNDSADMLLN 1682
               ++CEEE DV SKEA+L +LRQ G L++++Y +             + +    D LL 
Sbjct: 459  GFFKLCEEESDVTSKEAVLELLRQSGDLEDKSYLALIDVDENRTVNNSEIDAHETDALLT 518

Query: 1683 QLQGSL 1700
            QLQ  L
Sbjct: 519  QLQDDL 524


Top