BLASTX nr result

ID: Cocculus23_contig00005458 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00005458
         (2120 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containi...   468   e-129
ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containi...   445   e-122
ref|XP_007042227.1| Pentatricopeptide repeat superfamily protein...   443   e-121
ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citr...   439   e-120
ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containi...   436   e-119
ref|XP_006826767.1| hypothetical protein AMTR_s00136p00085920 [A...   436   e-119
ref|XP_007042228.1| Pentatricopeptide repeat (PPR) superfamily p...   436   e-119
ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containi...   435   e-119
ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containi...   433   e-118
ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutr...   433   e-118
ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containi...   432   e-118
ref|XP_006343484.1| PREDICTED: pentatricopeptide repeat-containi...   432   e-118
ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containi...   432   e-118
ref|XP_007225150.1| hypothetical protein PRUPE_ppa002505mg [Prun...   431   e-118
ref|NP_172560.2| pentatricopeptide repeat-containing protein [Ar...   429   e-117
ref|XP_002529286.1| pentatricopeptide repeat-containing protein,...   429   e-117
ref|XP_007148512.1| hypothetical protein PHAVU_006G214900g [Phas...   428   e-117
ref|XP_002889841.1| hypothetical protein ARALYDRAFT_888388 [Arab...   428   e-117
ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Popu...   422   e-115
gb|EYU26539.1| hypothetical protein MIMGU_mgv1a002527mg [Mimulus...   419   e-114

>ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic [Vitis vinifera]
            gi|298204537|emb|CBI23812.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  468 bits (1204), Expect = e-129
 Identities = 236/318 (74%), Positives = 269/318 (84%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA DEMPYCLLMDGLAK+ RI EAK+IF+E++ K VKSDGY +SIMISAFCRSG L+EA
Sbjct: 341  GYAEDEMPYCLLMDGLAKSRRILEAKSIFEEMKKKQVKSDGYCYSIMISAFCRSGLLKEA 400

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            KQLARDFEA YDKYDLVMLNTML AYC+AGEM SVMQM+RKMDE  ISPDWNTFHILIKY
Sbjct: 401  KQLARDFEATYDKYDLVMLNTMLCAYCRAGEMESVMQMMRKMDELAISPDWNTFHILIKY 460

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKLY LAYRTM DMH+KGH  +EE+ SSLI  LGKI A S+AFSVYNMLR+SKRTMC
Sbjct: 461  FCKEKLYLLAYRTMEDMHNKGHQPEEELCSSLISHLGKIRAHSQAFSVYNMLRYSKRTMC 520

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            KALHEK+L ILVAG LLKDAYVVVKDN   IS+ ++KKFA +FMK GN+NLINDVM+A+H
Sbjct: 521  KALHEKILHILVAGRLLKDAYVVVKDNEGLISKPSIKKFATAFMKFGNVNLINDVMKAIH 580

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
             SG+KIDQE+F +A++RYI EP          QWM GQGYVVDSS+RN++LKNSHLFGR+
Sbjct: 581  GSGYKIDQELFQMAVTRYIAEPEKKELLLHLLQWMPGQGYVVDSSTRNMILKNSHLFGRQ 640

Query: 1219 LIAETLSKQQRMSKMLIT 1166
            LIAE LSKQ   +K LI+
Sbjct: 641  LIAEMLSKQHARAKALIS 658


>ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Cucumis sativus]
          Length = 668

 Score =  445 bits (1145), Expect = e-122
 Identities = 220/314 (70%), Positives = 259/314 (82%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GY  +EMPYCLLMDGLAKAG I EAK +FDE++ K VK+DGY+HSIMISAFCR G LEEA
Sbjct: 343  GYGENEMPYCLLMDGLAKAGSIREAKTVFDEMKAKNVKTDGYAHSIMISAFCRGGLLEEA 402

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            K LA+DFEA YD+YD+V+LNTML AYC+AGEM SVMQMLRKMD+  ISPD+NTFHILIKY
Sbjct: 403  KLLAKDFEATYDRYDIVILNTMLCAYCRAGEMESVMQMLRKMDDLAISPDYNTFHILIKY 462

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F+KEKLY L YRT+ DMH KGH  +EE+ SSLI+ LG I A SEAFSVYN+L++SKRTMC
Sbjct: 463  FFKEKLYLLCYRTLEDMHRKGHQPEEELCSSLILSLGNIRAYSEAFSVYNILKYSKRTMC 522

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            KALHEK+L IL+AG LLKDAYVVVKDNA  IS+  ++KFA  FMK GN+NLINDVM+A+H
Sbjct: 523  KALHEKILHILIAGRLLKDAYVVVKDNAGVISKPAIRKFAFGFMKFGNVNLINDVMKAIH 582

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
             SG+KIDQ++F +A SRYI  P          +WM GQGYVVDSS+RNL+LKN+HLFGR+
Sbjct: 583  GSGYKIDQDLFMIATSRYIELPEKKDLFIQLLKWMPGQGYVVDSSTRNLILKNAHLFGRQ 642

Query: 1219 LIAETLSKQQRMSK 1178
            LIAE LSK   +SK
Sbjct: 643  LIAEILSKHSLLSK 656


>ref|XP_007042227.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao] gi|508706162|gb|EOX98058.1| Pentatricopeptide
            repeat superfamily protein isoform 1 [Theobroma cacao]
          Length = 717

 Score =  443 bits (1140), Expect = e-121
 Identities = 220/315 (69%), Positives = 262/315 (83%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA DEMP+CLLMDGL+KAGR+ EA+++F E++ K VKSDGYSHSIMISA CR+G  EEA
Sbjct: 335  GYAEDEMPFCLLMDGLSKAGRLDEARSVFVEMQQKCVKSDGYSHSIMISALCRAGLFEEA 394

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            K+LA+DFEA+Y+KYDLVMLNTML AYC+AGEM SVMQ ++KMDE  ISPD+NTFHILIKY
Sbjct: 395  KELAQDFEAQYNKYDLVMLNTMLCAYCRAGEMESVMQTMKKMDELAISPDYNTFHILIKY 454

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKLY LAY+TM DMH KG+H +EE+ SSLI QLGK+ A  EAFSVYNMLR+SKRTMC
Sbjct: 455  FCKEKLYLLAYKTMEDMHGKGYHPEEELCSSLIFQLGKMKAHLEAFSVYNMLRYSKRTMC 514

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            KALHEK+L IL+AG LLKDAYVVVKDNAE IS+  + KFA +FMK GNIN+INDV++ +H
Sbjct: 515  KALHEKILHILIAGQLLKDAYVVVKDNAELISQPAITKFATAFMKLGNINMINDVLKVLH 574

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
             SG+KIDQ +F +AISRY+G+P          QWM G GYVVDSS+RN++LKNS L GR+
Sbjct: 575  GSGYKIDQGLFQMAISRYLGQPEKKELLLQLLQWMPGHGYVVDSSTRNMILKNSQLLGRQ 634

Query: 1219 LIAETLSKQQRMSKM 1175
            L AE LSKQ  MSK+
Sbjct: 635  LTAEILSKQHMMSKV 649


>ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citrus clementina]
            gi|557534005|gb|ESR45123.1| hypothetical protein
            CICLE_v10000525mg [Citrus clementina]
          Length = 660

 Score =  439 bits (1128), Expect = e-120
 Identities = 213/314 (67%), Positives = 261/314 (83%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA +EMPYCLLMDGL+KAG + EA+ +F+E++ K VKSDGY+HSIMISAFCR GC EEA
Sbjct: 340  GYAENEMPYCLLMDGLSKAGCLDEARVVFNEMQEKCVKSDGYAHSIMISAFCRGGCFEEA 399

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            KQLA DFEA+YDKYD+V+LN+ML AYC+ G+M SVM ++RK+DE  ISPD+NTFHILIKY
Sbjct: 400  KQLAGDFEAKYDKYDVVLLNSMLCAYCRTGDMESVMHVMRKLDELAISPDYNTFHILIKY 459

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEK+Y LAYRTMVDMH KGH  +EE+ SSLI  LGK+ A SEA SVYNMLR+SKR+MC
Sbjct: 460  FCKEKMYILAYRTMVDMHRKGHQPEEELCSSLIFHLGKMRAHSEALSVYNMLRYSKRSMC 519

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            KALHEK+L IL++G LLKDAYVVVKDN+E IS   +KKFA +F++ GNINL+NDVM+A+H
Sbjct: 520  KALHEKILHILISGKLLKDAYVVVKDNSESISHPVIKKFASAFVRLGNINLVNDVMKAIH 579

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
            ++G++IDQ +F +AI+RYI E           +WMTGQGYVVDSS+RNL+LKNSHL GR+
Sbjct: 580  TTGYRIDQGIFHIAIARYIAEREKKELLLKLLEWMTGQGYVVDSSTRNLILKNSHLLGRQ 639

Query: 1219 LIAETLSKQQRMSK 1178
            LIA+ LSKQ   SK
Sbjct: 640  LIADILSKQHMKSK 653


>ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X2 [Solanum tuberosum]
          Length = 651

 Score =  436 bits (1122), Expect = e-119
 Identities = 214/314 (68%), Positives = 261/314 (83%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA DEMP+CLLMDGLAK+G + EAK++FDE+  K VK+DGYS+SIMISAFCRSG LE+A
Sbjct: 328  GYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMMEKHVKTDGYSYSIMISAFCRSGLLEDA 387

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            K++A +FE +YDKYD+V+LN ML AYC+AG+M +VM M++KMD+  ISPDWNTF+ILI+Y
Sbjct: 388  KKVASEFEEKYDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRY 447

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKLY LAYRTM DMHSKGH  +E + SSLI  LGK GA SEAFSVYNMLR+SKRT+ 
Sbjct: 448  FCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTIS 507

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
             ALHE +L IL+AG LLKDAYVVVKDNA  IS+  +KKF+++FM+SGN+NLINDVM A+H
Sbjct: 508  NALHEHILHILIAGRLLKDAYVVVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMH 567

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
            SSGHKIDQE+FDLAI+RYI +P          +WM G+GY +DSS+RNL+LKNSHLFG +
Sbjct: 568  SSGHKIDQELFDLAIARYIAKPEKKELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGHQ 627

Query: 1219 LIAETLSKQQRMSK 1178
            LIAE+LSK   MSK
Sbjct: 628  LIAESLSKHLVMSK 641


>ref|XP_006826767.1| hypothetical protein AMTR_s00136p00085920 [Amborella trichopoda]
            gi|548831187|gb|ERM94004.1| hypothetical protein
            AMTR_s00136p00085920 [Amborella trichopoda]
          Length = 690

 Score =  436 bits (1122), Expect = e-119
 Identities = 218/315 (69%), Positives = 261/315 (82%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            G+A DEMPYCLLMDGLAKAG I EAKA+F++++ K VKSDGYSHSI+ISA+CR G LEEA
Sbjct: 368  GFARDEMPYCLLMDGLAKAGHIDEAKAVFEDMKQKNVKSDGYSHSIIISAYCREGLLEEA 427

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            K LA+DFE+   KYDLVMLNT+LRAYCK GEM  VMQ ++KMDE  ISPD +TF ILIKY
Sbjct: 428  KLLAKDFESTSGKYDLVMLNTLLRAYCKGGEMQYVMQTMKKMDELAISPDLHTFSILIKY 487

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKLY LAYRT+ DMH++G  +DEE+ +SLI++LGK GA SEA+SVYN LR++KRT+C
Sbjct: 488  FSKEKLYNLAYRTVEDMHARGLQIDEELCTSLILELGKAGAASEAYSVYNKLRYTKRTLC 547

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            KALHEKVLKILVAG LLKDAYV+VKDN+E IS++ L KF  SFMK GNINLINDV+RA+H
Sbjct: 548  KALHEKVLKILVAGRLLKDAYVLVKDNSELISKSALDKFVTSFMKFGNINLINDVLRALH 607

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
            ++G+ I+Q VF LA+SRY+GEP          +WM+GQGYVVDS SRNLLLKN  LFG++
Sbjct: 608  NNGYLINQGVFSLAVSRYVGEPEKKELLLHMLEWMSGQGYVVDSESRNLLLKNCDLFGKQ 667

Query: 1219 LIAETLSKQQRMSKM 1175
            LIAE LSKQ  MSK+
Sbjct: 668  LIAEGLSKQHAMSKI 682


>ref|XP_007042228.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2
            [Theobroma cacao] gi|508706163|gb|EOX98059.1|
            Pentatricopeptide repeat (PPR) superfamily protein
            isoform 2 [Theobroma cacao]
          Length = 649

 Score =  436 bits (1120), Expect = e-119
 Identities = 219/315 (69%), Positives = 260/315 (82%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA DEMP+CLLMDGL+KAGR+ EA+++F E++ K VKSDGYSHSIMISA CR+G  EEA
Sbjct: 335  GYAEDEMPFCLLMDGLSKAGRLDEARSVFVEMQQKCVKSDGYSHSIMISALCRAGLFEEA 394

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            K+LA+DFEA+Y+KYDLVMLNTML AYC+AGEM SVMQ ++KMDE  ISPD+NTFHILIKY
Sbjct: 395  KELAQDFEAQYNKYDLVMLNTMLCAYCRAGEMESVMQTMKKMDELAISPDYNTFHILIKY 454

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKLY LAY+TM DMH KG+H +EE+ SSLI QLGK+ A  EAFSVYNMLR+SKRTMC
Sbjct: 455  FCKEKLYLLAYKTMEDMHGKGYHPEEELCSSLIFQLGKMKAHLEAFSVYNMLRYSKRTMC 514

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            KALHEK+L IL+AG LLKDAYVVVKDNAE IS+  + KFA +FMK GNIN+INDV++ +H
Sbjct: 515  KALHEKILHILIAGQLLKDAYVVVKDNAELISQPAITKFATAFMKLGNINMINDVLKVLH 574

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
             SG+KIDQ    +AISRY+G+P          QWM G GYVVDSS+RN++LKNS L GR+
Sbjct: 575  GSGYKIDQ----MAISRYLGQPEKKELLLQLLQWMPGHGYVVDSSTRNMILKNSQLLGRQ 630

Query: 1219 LIAETLSKQQRMSKM 1175
            L AE LSKQ  MSK+
Sbjct: 631  LTAEILSKQHMMSKV 645


>ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 642

 Score =  435 bits (1119), Expect = e-119
 Identities = 219/314 (69%), Positives = 255/314 (81%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA DEMPYC+LMD  AKAGRI +AK +FDEI+ K V+SDGYS+SIMISAFCR G +++A
Sbjct: 324  GYAEDEMPYCILMDAFAKAGRIEDAKLVFDEIKEKSVRSDGYSYSIMISAFCRGGLVDDA 383

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            KQLA+DFE  YDKYDLVMLNTM+ AYC+AGEM SVM+MLRKMDE KI+PD NTFHILIKY
Sbjct: 384  KQLAKDFERTYDKYDLVMLNTMICAYCRAGEMDSVMEMLRKMDELKITPDNNTFHILIKY 443

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKLY LAY+TM DMH+KG+  DEE+ SSL+  LGKI A SEA+S+YN+LR+SKRTMC
Sbjct: 444  FCKEKLYMLAYKTMEDMHNKGYPPDEELCSSLMFHLGKIRAYSEAYSIYNILRYSKRTMC 503

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            KALHEK+L ILVAG LLKDAYVVVKDN   IS+    KFA +FMK GNINLINDV++A+ 
Sbjct: 504  KALHEKILHILVAGRLLKDAYVVVKDNPRLISKAATMKFATAFMKLGNINLINDVLKAID 563

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
             SG KIDQ +F +AISRYI +P          QWM GQGY VDSS+RNL+LKNSHLF R+
Sbjct: 564  GSGCKIDQGIFQMAISRYISDPDKKDLLLQLLQWMPGQGYTVDSSTRNLILKNSHLFDRQ 623

Query: 1219 LIAETLSKQQRMSK 1178
             IAE LSKQ  +SK
Sbjct: 624  HIAEMLSKQHMISK 637


>ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 646

 Score =  433 bits (1113), Expect = e-118
 Identities = 214/319 (67%), Positives = 264/319 (82%), Gaps = 1/319 (0%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKS-DGYSHSIMISAFCRSGCLEE 1943
            GYA DEMP+CLLMDGLAK+G + EAK++FDE+  K VK+ DGYS+SIMISAFCRSG LE+
Sbjct: 328  GYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLED 387

Query: 1942 AKQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIK 1763
            AK++A +FE +YDKYD+V+LN ML AYC+AG+M +VM M++KMD+  ISPDWNTF+ILI+
Sbjct: 388  AKKVASEFEEKYDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIR 447

Query: 1762 YFYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTM 1583
            YF KEKLY LAYRTM DMHSKGH  +E + SSLI  LGK GA SEAFSVYNMLR+SKRT+
Sbjct: 448  YFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTI 507

Query: 1582 CKALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAV 1403
              ALHE +L IL+AG LLKDAYVVVKDNA  IS+  +KKF+++FM+SGN+NLINDVM A+
Sbjct: 508  SNALHEHILHILIAGRLLKDAYVVVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAM 567

Query: 1402 HSSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGR 1223
            HSSGHKIDQE+FDLAI+RYI +P          +WM G+GY +DSS+RNL+LKNSHLFG 
Sbjct: 568  HSSGHKIDQELFDLAIARYIAKPEKKELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGH 627

Query: 1222 RLIAETLSKQQRMSKMLIT 1166
            +LIAE+LSK   MSK +++
Sbjct: 628  QLIAESLSKHLVMSKKVLS 646


>ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutrema salsugineum]
            gi|557095175|gb|ESQ35757.1| hypothetical protein
            EUTSA_v10007006mg [Eutrema salsugineum]
          Length = 666

 Score =  433 bits (1113), Expect = e-118
 Identities = 215/317 (67%), Positives = 263/317 (82%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA +EMPYC+LMDGL+KAG+  EA++IFDE++ KGVKSDGY++SIMISA CRS   EEA
Sbjct: 338  GYAENEMPYCMLMDGLSKAGKFEEARSIFDEMKGKGVKSDGYANSIMISALCRSKRFEEA 397

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            KQLARD E+ Y+K DLVMLNTML AYC+AGEM SVM+M++KMDE  +SPD+NTFHILIKY
Sbjct: 398  KQLARDSESTYEKCDLVMLNTMLCAYCRAGEMESVMRMMKKMDEQAVSPDYNTFHILIKY 457

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKL+ LAY+T++DMHSKGH L+EE+ SSLI  LGKI A SEAFSVY+MLR+SKRT+C
Sbjct: 458  FIKEKLHLLAYQTLLDMHSKGHRLEEELCSSLIYHLGKIRAHSEAFSVYSMLRYSKRTIC 517

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            K LHEK+L IL+ G LLKDAYVVVKDNA+ IS+ TLK+F  +FM SGN+NL+NDV++ +H
Sbjct: 518  KDLHEKILHILIHGKLLKDAYVVVKDNAKMISQPTLKRFGRAFMNSGNVNLVNDVLKVLH 577

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
             SGHKIDQ  F++AISRYI +P          QWM GQGYVVDSS+RNL+LKNS+LFGR+
Sbjct: 578  GSGHKIDQVQFEIAISRYISQPDKKELLLQLLQWMPGQGYVVDSSTRNLILKNSNLFGRQ 637

Query: 1219 LIAETLSKQQRMSKMLI 1169
            LIAE LSK    S+ ++
Sbjct: 638  LIAEILSKHHIASRTMV 654


>ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Solanum lycopersicum]
          Length = 642

 Score =  432 bits (1112), Expect = e-118
 Identities = 213/314 (67%), Positives = 259/314 (82%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA DEMP+CLLMDGLAK+G + EAK++FDE+  K VK+DGYS+SIMISAFCR G LE+A
Sbjct: 328  GYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMMEKQVKTDGYSYSIMISAFCRRGLLEDA 387

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            K+LA +FE +YDKYD+V+LN ML AYC+AG+M +VM M++KMD+  ISPDWNTF+ILI+Y
Sbjct: 388  KKLASEFEEKYDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIRY 447

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKLY LAYRTM DMHSKGH  +E + SSLI  LGK GA SEAFSVYNMLR+SKRT+ 
Sbjct: 448  FCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTIS 507

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
             ALHE +L IL+AG LLKDAYVVVKDNA  IS+  +KKF+++FM+SGN+NLINDVM A+H
Sbjct: 508  NALHENILHILIAGRLLKDAYVVVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAMH 567

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
            SSGHKIDQE+FDLAI+RYI +P          +WM  +GY +DSS+RNL+LKNSHLFG +
Sbjct: 568  SSGHKIDQELFDLAIARYIAKPEKKELLLWLLKWMPVKGYAIDSSTRNLILKNSHLFGHQ 627

Query: 1219 LIAETLSKQQRMSK 1178
            LIAE+LSK   MSK
Sbjct: 628  LIAESLSKHLVMSK 641


>ref|XP_006343484.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X4 [Solanum tuberosum]
          Length = 539

 Score =  432 bits (1110), Expect = e-118
 Identities = 214/315 (67%), Positives = 261/315 (82%), Gaps = 1/315 (0%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKS-DGYSHSIMISAFCRSGCLEE 1943
            GYA DEMP+CLLMDGLAK+G + EAK++FDE+  K VK+ DGYS+SIMISAFCRSG LE+
Sbjct: 215  GYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLED 274

Query: 1942 AKQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIK 1763
            AK++A +FE +YDKYD+V+LN ML AYC+AG+M +VM M++KMD+  ISPDWNTF+ILI+
Sbjct: 275  AKKVASEFEEKYDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIR 334

Query: 1762 YFYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTM 1583
            YF KEKLY LAYRTM DMHSKGH  +E + SSLI  LGK GA SEAFSVYNMLR+SKRT+
Sbjct: 335  YFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTI 394

Query: 1582 CKALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAV 1403
              ALHE +L IL+AG LLKDAYVVVKDNA  IS+  +KKF+++FM+SGN+NLINDVM A+
Sbjct: 395  SNALHEHILHILIAGRLLKDAYVVVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAM 454

Query: 1402 HSSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGR 1223
            HSSGHKIDQE+FDLAI+RYI +P          +WM G+GY +DSS+RNL+LKNSHLFG 
Sbjct: 455  HSSGHKIDQELFDLAIARYIAKPEKKELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGH 514

Query: 1222 RLIAETLSKQQRMSK 1178
            +LIAE+LSK   MSK
Sbjct: 515  QLIAESLSKHLVMSK 529


>ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X1 [Solanum tuberosum]
          Length = 652

 Score =  432 bits (1110), Expect = e-118
 Identities = 214/315 (67%), Positives = 261/315 (82%), Gaps = 1/315 (0%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKS-DGYSHSIMISAFCRSGCLEE 1943
            GYA DEMP+CLLMDGLAK+G + EAK++FDE+  K VK+ DGYS+SIMISAFCRSG LE+
Sbjct: 328  GYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMMEKHVKTADGYSYSIMISAFCRSGLLED 387

Query: 1942 AKQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIK 1763
            AK++A +FE +YDKYD+V+LN ML AYC+AG+M +VM M++KMD+  ISPDWNTF+ILI+
Sbjct: 388  AKKVASEFEEKYDKYDIVILNAMLSAYCRAGKMENVMSMMKKMDDSAISPDWNTFNILIR 447

Query: 1762 YFYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTM 1583
            YF KEKLY LAYRTM DMHSKGH  +E + SSLI  LGK GA SEAFSVYNMLR+SKRT+
Sbjct: 448  YFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIYHLGKTGAHSEAFSVYNMLRYSKRTI 507

Query: 1582 CKALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAV 1403
              ALHE +L IL+AG LLKDAYVVVKDNA  IS+  +KKF+++FM+SGN+NLINDVM A+
Sbjct: 508  SNALHEHILHILIAGRLLKDAYVVVKDNAGFISQPAIKKFSVNFMRSGNVNLINDVMNAM 567

Query: 1402 HSSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGR 1223
            HSSGHKIDQE+FDLAI+RYI +P          +WM G+GY +DSS+RNL+LKNSHLFG 
Sbjct: 568  HSSGHKIDQELFDLAIARYIAKPEKKELLLWLLKWMPGKGYAIDSSTRNLILKNSHLFGH 627

Query: 1222 RLIAETLSKQQRMSK 1178
            +LIAE+LSK   MSK
Sbjct: 628  QLIAESLSKHLVMSK 642


>ref|XP_007225150.1| hypothetical protein PRUPE_ppa002505mg [Prunus persica]
            gi|462422086|gb|EMJ26349.1| hypothetical protein
            PRUPE_ppa002505mg [Prunus persica]
          Length = 664

 Score =  431 bits (1108), Expect = e-118
 Identities = 215/314 (68%), Positives = 259/314 (82%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA DEMPYCLLMD LAKAGRI EAK +FDE++ K ++S+GYS+SIMISAFCR G LE+A
Sbjct: 340  GYAEDEMPYCLLMDALAKAGRIHEAKLVFDEMKEKSIRSNGYSYSIMISAFCRGGLLEDA 399

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            KQL++D E  +DK+DLVMLNTM+ AYC+AGEM SVM+M+RKMDE KI+PD+NTFHILIKY
Sbjct: 400  KQLSKDVERTHDKFDLVMLNTMICAYCRAGEMDSVMEMMRKMDEQKITPDYNTFHILIKY 459

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKLY LAY+TM DMH+KGH  DEE+ SSL+  LGKI A SEA+SVYN+LR+SKRTMC
Sbjct: 460  FCKEKLYLLAYQTMEDMHNKGHQPDEELCSSLMFLLGKIRAYSEAYSVYNILRYSKRTMC 519

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            KALHEK+L IL+AG LLKDAYVVVKDNA  IS+  +KKF+ +F+K GNINLINDV++ + 
Sbjct: 520  KALHEKILHILLAGQLLKDAYVVVKDNAGLISKPAVKKFSTAFLKLGNINLINDVLKVID 579

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
            +SG KIDQ +F +AISRYI  P           WM GQGYVVDS++RNL+LKNSHLFGR+
Sbjct: 580  ASGCKIDQGLFQMAISRYIALPEKKELLIQMLLWMPGQGYVVDSATRNLILKNSHLFGRQ 639

Query: 1219 LIAETLSKQQRMSK 1178
             IA+ LSKQ  +SK
Sbjct: 640  HIADVLSKQHMISK 653


>ref|NP_172560.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|122242678|sp|Q0WVV0.1|PPR31_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g10910, chloroplastic; Flags: Precursor
            gi|110741600|dbj|BAE98748.1| membrane-associated
            salt-inducible protein isolog [Arabidopsis thaliana]
            gi|332190541|gb|AEE28662.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 664

 Score =  429 bits (1104), Expect = e-117
 Identities = 212/317 (66%), Positives = 261/317 (82%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA +EMPYC+LMDGL+KAG++ EA++IFD+++ KGV+SDGY++SIMISA CRS   +EA
Sbjct: 337  GYAENEMPYCMLMDGLSKAGKLEEARSIFDDMKGKGVRSDGYANSIMISALCRSKRFKEA 396

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            K+L+RD E  Y+K DLVMLNTML AYC+AGEM SVM+M++KMDE  +SPD+NTFHILIKY
Sbjct: 397  KELSRDSETTYEKCDLVMLNTMLCAYCRAGEMESVMRMMKKMDEQAVSPDYNTFHILIKY 456

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKL+ LAY+T +DMHSKGH L+EE+ SSLI  LGKI A +EAFSVYNMLR+SKRT+C
Sbjct: 457  FIKEKLHLLAYQTTLDMHSKGHRLEEELCSSLIYHLGKIRAQAEAFSVYNMLRYSKRTIC 516

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            K LHEK+L IL+ G LLKDAY+VVKDNA+ IS+ TLKKF  +FM SGNINL+NDV++ +H
Sbjct: 517  KELHEKILHILIQGNLLKDAYIVVKDNAKMISQPTLKKFGRAFMISGNINLVNDVLKVLH 576

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
             SGHKIDQ  F++AISRYI +P          QWM GQGYVVDSS+RNL+LKNSH+FGR 
Sbjct: 577  GSGHKIDQVQFEIAISRYISQPDKKELLLQLLQWMPGQGYVVDSSTRNLILKNSHMFGRL 636

Query: 1219 LIAETLSKQQRMSKMLI 1169
            LIAE LSK    S+ +I
Sbjct: 637  LIAEILSKHHVASRPMI 653


>ref|XP_002529286.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223531275|gb|EEF33118.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 672

 Score =  429 bits (1103), Expect = e-117
 Identities = 216/318 (67%), Positives = 259/318 (81%), Gaps = 2/318 (0%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA DEMPYCLLMDGL+KAGR+ EA++ FDE++ K VKSDGY++SIMISA+CR   LEEA
Sbjct: 352  GYAEDEMPYCLLMDGLSKAGRVDEARSFFDEMKEKNVKSDGYAYSIMISAYCRGRLLEEA 411

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            KQLA++FEA+YDKYD+V+LNTML AYC+AG+M SVMQ +RKMDE  ISP + TFHILIKY
Sbjct: 412  KQLAKEFEAKYDKYDVVILNTMLCAYCRAGDMESVMQTMRKMDELAISPSYCTFHILIKY 471

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F K+KLY LAY+TM DMH KGH  +EE+ S LI  LGK  A +EAFSVY ML++ KRTMC
Sbjct: 472  FCKQKLYLLAYQTMEDMHRKGHQPEEELCSMLIFHLGKAKAYTEAFSVYTMLKYGKRTMC 531

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            KALHEK+L +L+ G LLKDAYVVVKDNAE IS+  +KKFA +FMK GNINLINDVM+ +H
Sbjct: 532  KALHEKILHVLLGGQLLKDAYVVVKDNAELISQAAIKKFANAFMKLGNINLINDVMKVIH 591

Query: 1399 SSGHKIDQ--EVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFG 1226
            SSG+KIDQ  E+F +AISRYI +P          QWM G GYVVD+S+RNL+LK+SHLFG
Sbjct: 592  SSGYKIDQASELFQMAISRYIAQPEKKDLLVQLLQWMPGHGYVVDASTRNLILKSSHLFG 651

Query: 1225 RRLIAETLSKQQRMSKML 1172
            R+LIAE LSKQ  +SK L
Sbjct: 652  RQLIAEILSKQHIISKTL 669


>ref|XP_007148512.1| hypothetical protein PHAVU_006G214900g [Phaseolus vulgaris]
            gi|561021735|gb|ESW20506.1| hypothetical protein
            PHAVU_006G214900g [Phaseolus vulgaris]
          Length = 639

 Score =  428 bits (1101), Expect = e-117
 Identities = 212/310 (68%), Positives = 255/310 (82%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA DEMPYC+LMDGLAKAG+I EAK IFDE+    V+SDGY+HSIMISA CRS    EA
Sbjct: 320  GYAEDEMPYCILMDGLAKAGQIHEAKLIFDEMMKNHVRSDGYAHSIMISALCRSKLFREA 379

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            KQLA+DFE   +KYD+V+LN+ML A+C+ GEM SVM+ L+KMDE  ISP +NTFHILIKY
Sbjct: 380  KQLAKDFETTSNKYDIVILNSMLCAFCRVGEMESVMETLKKMDELAISPSYNTFHILIKY 439

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F +EK+Y LAYRTM DMHSKGH   EE+ S+LI  LG++ A SEAFSVYNMLR+ KRTMC
Sbjct: 440  FCREKMYLLAYRTMKDMHSKGHQPGEELCSTLISHLGQVNAYSEAFSVYNMLRYGKRTMC 499

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            K+LHEK+L IL+AG LLKDAYVVVKDNA+ ISR   KKFAI+FMKSGNIN INDV++ +H
Sbjct: 500  KSLHEKILYILLAGHLLKDAYVVVKDNAKYISRPPTKKFAIAFMKSGNINYINDVLKTLH 559

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
             SG+K+DQ++F +A+SRY+GEP          QWM+GQGY+VDSS+RNL+LK+SHLFGR+
Sbjct: 560  DSGYKLDQDLFAMAVSRYLGEPEKKDLLLHLLQWMSGQGYMVDSSTRNLILKHSHLFGRQ 619

Query: 1219 LIAETLSKQQ 1190
            LIAE LSKQQ
Sbjct: 620  LIAEVLSKQQ 629


>ref|XP_002889841.1| hypothetical protein ARALYDRAFT_888388 [Arabidopsis lyrata subsp.
            lyrata] gi|297335683|gb|EFH66100.1| hypothetical protein
            ARALYDRAFT_888388 [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  428 bits (1101), Expect = e-117
 Identities = 214/314 (68%), Positives = 258/314 (82%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA +EMPYC+LMDGL+KAG++ EA++IFD+++ KGVKSDGY++SIMISA CRS   EEA
Sbjct: 338  GYAENEMPYCMLMDGLSKAGKLEEARSIFDDMKGKGVKSDGYANSIMISALCRSKRFEEA 397

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            K+L+RD E  Y+K DLVMLNTML AYC+AGEM SVM+M++KMDE  I PD+NTFHILIKY
Sbjct: 398  KELSRDSETTYEKCDLVMLNTMLCAYCRAGEMESVMRMMKKMDEQAIIPDYNTFHILIKY 457

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKL+ LAY+T +DMHSKGH L+EE+ SSLI  LGKI APSEAFSVYNMLR+SKRT+C
Sbjct: 458  FIKEKLHLLAYQTTLDMHSKGHRLEEELCSSLIYHLGKIRAPSEAFSVYNMLRYSKRTIC 517

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            K LHEK+L IL+ G LLKDAY+VVKDNA+ IS+ TLKKF  +FM SGNINL+NDV++ +H
Sbjct: 518  KELHEKILHILIHGDLLKDAYIVVKDNAKMISQPTLKKFGRAFMISGNINLVNDVLKVLH 577

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
             SGHKIDQ  F++AISRYI  P          QWM GQGY+VDSS+RNL+LKNSH+FGR 
Sbjct: 578  GSGHKIDQVQFEIAISRYILLPDKKELLLQLLQWMPGQGYIVDSSTRNLILKNSHMFGRL 637

Query: 1219 LIAETLSKQQRMSK 1178
            LIAE LSK    S+
Sbjct: 638  LIAEILSKHHVASR 651


>ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Populus trichocarpa]
            gi|550347847|gb|EEE84472.2| hypothetical protein
            POPTR_0001s21880g [Populus trichocarpa]
          Length = 673

 Score =  422 bits (1086), Expect = e-115
 Identities = 214/316 (67%), Positives = 255/316 (80%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            G+A +EMPYCLLMDGLAK G + EA+++F+E++ K VKS GYS+SIMIS+FCR G  EEA
Sbjct: 355  GFAKNEMPYCLLMDGLAKNGLLDEARSVFNEMKEKRVKSGGYSYSIMISSFCRGGLFEEA 414

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            K+LA +FEA+YDKYD+V+LNT+L AYC+ GE  SVM+ +RKMDE  ISPD+NTFHILIKY
Sbjct: 415  KELAEEFEAKYDKYDVVILNTILCAYCRTGEKESVMRTMRKMDELAISPDYNTFHILIKY 474

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKLY LAY+TM DMH KGH   EE+ SSLI+ LGKI A +EAFSVY+ML+ SKRTM 
Sbjct: 475  FCKEKLYMLAYQTMEDMHRKGHQPMEELCSSLILHLGKIKAHAEAFSVYSMLKSSKRTMS 534

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            KA HE +L IL+AG LLKDAYVVVKDNAE IS   +KKFA SF+K G+INLINDVM+ +H
Sbjct: 535  KAFHEDILHILIAGRLLKDAYVVVKDNAELISPAAIKKFASSFVKLGDINLINDVMKVIH 594

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
             SG+KIDQE+F +A+SRYI EP          QWM GQGYVVDSS+RNL+LKNSHLFGR+
Sbjct: 595  GSGYKIDQELFLMAVSRYIAEPEKKDLLIQLLQWMPGQGYVVDSSTRNLILKNSHLFGRQ 654

Query: 1219 LIAETLSKQQRMSKML 1172
            LIAE LSKQ   SK L
Sbjct: 655  LIAEILSKQHMTSKAL 670


>gb|EYU26539.1| hypothetical protein MIMGU_mgv1a002527mg [Mimulus guttatus]
          Length = 663

 Score =  419 bits (1077), Expect = e-114
 Identities = 203/314 (64%), Positives = 254/314 (80%)
 Frame = -2

Query: 2119 GYAVDEMPYCLLMDGLAKAGRIPEAKAIFDEIETKGVKSDGYSHSIMISAFCRSGCLEEA 1940
            GYA DEMPYCLLMDGLAK+G++PEAK++FDE+  K VK+DG+S+SIMISA CRSG +EEA
Sbjct: 348  GYAEDEMPYCLLMDGLAKSGKVPEAKSLFDEMRQKEVKNDGFSYSIMISALCRSGLIEEA 407

Query: 1939 KQLARDFEARYDKYDLVMLNTMLRAYCKAGEMGSVMQMLRKMDEFKISPDWNTFHILIKY 1760
            K LA +FE +YDKYD+V+LN+ML AYC++GEM +VM+ ++KMDE  ISPDWNTFHILIKY
Sbjct: 408  KMLACEFETKYDKYDVVILNSMLCAYCRSGEMENVMKTMKKMDESSISPDWNTFHILIKY 467

Query: 1759 FYKEKLYQLAYRTMVDMHSKGHHLDEEISSSLIVQLGKIGAPSEAFSVYNMLRFSKRTMC 1580
            F KEKLY LAYRTMVDMH KGH L+E++   LI  LGK GA +EAFSVY+ML++SKRT+ 
Sbjct: 468  FCKEKLYLLAYRTMVDMHKKGHQLEEDLCVFLIHHLGKTGAHAEAFSVYSMLKYSKRTIN 527

Query: 1579 KALHEKVLKILVAGGLLKDAYVVVKDNAEQISRNTLKKFAISFMKSGNINLINDVMRAVH 1400
            K LHEK+L  L+AGGL KDAYV+VKDNA+ IS + ++KF  +FM+ GNINLINDV++++H
Sbjct: 528  KTLHEKILHTLLAGGLFKDAYVLVKDNAKYISESAIRKFTTTFMRKGNINLINDVIKSIH 587

Query: 1399 SSGHKIDQEVFDLAISRYIGEPXXXXXXXXXXQWMTGQGYVVDSSSRNLLLKNSHLFGRR 1220
            SS +KIDQ++F +AISRYI +P          QWM GQGY VDSS+RNL+L+N+ LFGR 
Sbjct: 588  SSSYKIDQDIFHMAISRYIEQPEKKELLLHLLQWMRGQGYPVDSSTRNLILENAELFGRN 647

Query: 1219 LIAETLSKQQRMSK 1178
             I E LSK    SK
Sbjct: 648  SITEILSKHYAASK 661

