BLASTX nr result

ID: Rheum21_contig00008219 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00008219
         (2603 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containi...   815   0.0  
gb|EMJ26349.1| hypothetical protein PRUPE_ppa002505mg [Prunus pe...   786   0.0  
ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containi...   781   0.0  
ref|XP_002529286.1| pentatricopeptide repeat-containing protein,...   779   0.0  
ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containi...   771   0.0  
ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Popu...   763   0.0  
ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citr...   754   0.0  
gb|EOX98058.1| Pentatricopeptide repeat superfamily protein isof...   749   0.0  
gb|EOX98059.1| Pentatricopeptide repeat (PPR) superfamily protei...   740   0.0  
ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containi...   736   0.0  
ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containi...   733   0.0  
ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containi...   731   0.0  
ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containi...   729   0.0  
ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutr...   729   0.0  
gb|ESW20506.1| hypothetical protein PHAVU_006G214900g [Phaseolus...   722   0.0  
ref|XP_004486236.1| PREDICTED: pentatricopeptide repeat-containi...   720   0.0  
ref|XP_003531588.2| PREDICTED: pentatricopeptide repeat-containi...   719   0.0  
ref|NP_172560.2| pentatricopeptide repeat-containing protein [Ar...   718   0.0  
ref|XP_002889841.1| hypothetical protein ARALYDRAFT_888388 [Arab...   717   0.0  
ref|XP_006826767.1| hypothetical protein AMTR_s00136p00085920 [A...   705   0.0  

>ref|XP_002270184.2| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic [Vitis vinifera]
            gi|298204537|emb|CBI23812.3| unnamed protein product
            [Vitis vinifera]
          Length = 659

 Score =  815 bits (2104), Expect = 0.0
 Identities = 398/577 (68%), Positives = 490/577 (84%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            IL+++ S  L  ALAR G++LK  DLN+ILR FG+L R+ DLSQLF+WM+KH K+  +SY
Sbjct: 79   ILEVQQSSDLGSALARLGDMLKVQDLNVILRHFGKLCRWQDLSQLFDWMQKHEKITFSSY 138

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            S+YIKF G+SLNP KA+E YN I DE+VRN+ SVCNSVLSCLI+N KF+++  LF+QMKQ
Sbjct: 139  STYIKFMGKSLNPIKALEIYNSIQDESVRNNVSVCNSVLSCLIRNGKFENSLKLFHQMKQ 198

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PD VTYSTLLAGC+K   GYSKA++LV+E+E S L MD V YGTL+AV A+N +CK
Sbjct: 199  DGLRPDAVTYSTLLAGCMKVKHGYSKALELVQEMERSRLPMDSVIYGTLLAVCASNNRCK 258

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EAE  F +MK EGH PNV+HYSSLLNA+SADG+Y+K D LV DMKSAGLVPNKV+LTTLL
Sbjct: 259  EAENYFNQMKDEGHLPNVFHYSSLLNAYSADGDYKKADMLVQDMKSAGLVPNKVILTTLL 318

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY R GLFEKSR+LL+ELE  GYA DE+PYCLLMD  +KS +I EAK + +EM +++VK
Sbjct: 319  KVYVRGGLFEKSRELLAELEDLGYAEDEMPYCLLMDGLAKSRRILEAKSIFEEMKKKQVK 378

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGY YSIMISAFCRSG L+EAK+LA+D+E T+DK+D+V+ NTMLCAYCR G+MESVMQ+
Sbjct: 379  SDGYCYSIMISAFCRSGLLKEAKQLARDFEATYDKYDLVMLNTMLCAYCRAGEMESVMQM 438

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            +RKMD LAISPD NTF ILIKYF KEKLY LAYRTMEDMH+KGHQP EE C+ L+ +LGK
Sbjct: 439  MRKMDELAISPDWNTFHILIKYFCKEKLYLLAYRTMEDMHNKGHQPEEELCSSLISHLGK 498

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            + AH++A++VYN+LRYS+RT+CK LHEKIL IL+AG+LLK+AYVV KDN   IS+P++KK
Sbjct: 499  IRAHSQAFSVYNMLRYSKRTMCKALHEKILHILVAGRLLKDAYVVVKDNEGLISKPSIKK 558

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F TAFM+ GN+NLINDV+K I  SG KIDQ++FQ+A++RYIA+PEKK+LLL LL+WMP Q
Sbjct: 559  FATAFMKFGNVNLINDVMKAIHGSGYKIDQELFQMAVTRYIAEPEKKELLLHLLQWMPGQ 618

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARS 2050
            GYVVDS+TRN+ILKNSHL+G QLIAEMLS QHA A++
Sbjct: 619  GYVVDSSTRNMILKNSHLFGRQLIAEMLSKQHARAKA 655


>gb|EMJ26349.1| hypothetical protein PRUPE_ppa002505mg [Prunus persica]
          Length = 664

 Score =  786 bits (2030), Expect = 0.0
 Identities = 386/580 (66%), Positives = 481/580 (82%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            IL+++ S  L  AL R G  LK  DLN I+R FG LKR+ DLSQLFEWM+++GK++ +SY
Sbjct: 78   ILEVQESSDLDSALTRLGGSLKVQDLNAIIRHFGILKRWHDLSQLFEWMQQNGKISASSY 137

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSYIKF G+SLNP KA+E YN+I D + + +  +CNSVL  LI++ KFD +  LF+QMKQ
Sbjct: 138  SSYIKFMGKSLNPVKALEIYNNIQDASTKKNVHICNSVLGSLIRSGKFDGSFKLFHQMKQ 197

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PD VTYSTLLAGC K   GYSKA++LV+EL+ + LQMD V YGTL+AV A+N + +
Sbjct: 198  DGLTPDAVTYSTLLAGCNKVKHGYSKALELVQELQRNELQMDSVIYGTLLAVCASNNKLE 257

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EAE  F +MK+EG+ PNV+HYS++LNA+S  GNY++ D+LV DMKSAGLVPNKV+LTTLL
Sbjct: 258  EAEGYFKQMKNEGYLPNVFHYSAMLNAYSISGNYKEADDLVQDMKSAGLVPNKVILTTLL 317

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY R GLFEKSR+LL+ELEA GYA DE+PYCLLMDA +K+G+I EAKLV DEM  + ++
Sbjct: 318  KVYVRGGLFEKSRELLAELEALGYAEDEMPYCLLMDALAKAGRIHEAKLVFDEMKEKSIR 377

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            S+GY+YSIMISAFCR G LE+AK+L+KD E THDKFD+V+ NTM+CAYCR G+M+SVM++
Sbjct: 378  SNGYSYSIMISAFCRGGLLEDAKQLSKDVERTHDKFDLVMLNTMICAYCRAGEMDSVMEM 437

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            +RKMD   I+PD NTF ILIKYF KEKLY LAY+TMEDMH+KGHQP EE C+ LM  LGK
Sbjct: 438  MRKMDEQKITPDYNTFHILIKYFCKEKLYLLAYQTMEDMHNKGHQPDEELCSSLMFLLGK 497

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            + A++EAY+VYNILRYS+RT+CK LHEKIL IL+AGQLLK+AYVV KDNA  IS+PA+KK
Sbjct: 498  IRAYSEAYSVYNILRYSKRTMCKALHEKILHILLAGQLLKDAYVVVKDNAGLISKPAVKK 557

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F TAF++ GNINLINDV+KVI ASGCKIDQ +FQ+AISRYIA PEKK+LL+Q+L WMP Q
Sbjct: 558  FSTAFLKLGNINLINDVLKVIDASGCKIDQGLFQMAISRYIALPEKKELLIQMLLWMPGQ 617

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARSKST 2059
            GYVVDSATRNLILKNSHL+G Q IA++LS QH ++++  +
Sbjct: 618  GYVVDSATRNLILKNSHLFGRQHIADVLSKQHMISKASKS 657


>ref|XP_004299940.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 642

 Score =  781 bits (2018), Expect = 0.0
 Identities = 379/580 (65%), Positives = 476/580 (82%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            IL +++S  L  AL R G  L   DLN I+R FG LKR+ DLSQLFEWM+++GKV+ +SY
Sbjct: 62   ILQVQHSSDLESALTRLGGSLNVQDLNAIIRHFGMLKRWHDLSQLFEWMQQNGKVSASSY 121

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSYIKF G+SLNP KA+E YN I DE+ + +  +CNSVL  L+++ KFD +  LF+QMKQ
Sbjct: 122  SSYIKFMGKSLNPVKALEIYNSIQDESTKKNVHICNSVLGSLVRSGKFDGSIKLFHQMKQ 181

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PD VTYSTLLAGCIK   GYSKA++LV+EL+++ LQMD V YGTL+A+ A+N + +
Sbjct: 182  DGLTPDAVTYSTLLAGCIKFKHGYSKALELVQELQNNELQMDSVIYGTLLAICASNNKWE 241

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EAE  F +MK EGH PN +HYSSLLNA+S  GNY+K D++V DMKSAGLVPNKV LTTLL
Sbjct: 242  EAESYFKQMKDEGHLPNEFHYSSLLNAYSISGNYKKADDVVQDMKSAGLVPNKVTLTTLL 301

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            K Y R GLFEKSR+LL+ELEA GYA DE+PYC+LMDAF+K+G+I++AKLV DE+  + V+
Sbjct: 302  KAYVRGGLFEKSRELLTELEALGYAEDEMPYCILMDAFAKAGRIEDAKLVFDEIKEKSVR 361

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGY+YSIMISAFCR G +++AK+LAKD+E T+DK+D+V+ NTM+CAYCR G+M+SVM++
Sbjct: 362  SDGYSYSIMISAFCRGGLVDDAKQLAKDFERTYDKYDLVMLNTMICAYCRAGEMDSVMEM 421

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            LRKMD L I+PD NTF ILIKYF KEKLY LAY+TMEDMH+KG+ P EE C+ LM +LGK
Sbjct: 422  LRKMDELKITPDNNTFHILIKYFCKEKLYMLAYKTMEDMHNKGYPPDEELCSSLMFHLGK 481

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            + A++EAY++YNILRYS+RT+CK LHEKIL IL+AG+LLK+AYVV KDN + IS+ A  K
Sbjct: 482  IRAYSEAYSIYNILRYSKRTMCKALHEKILHILVAGRLLKDAYVVVKDNPRLISKAATMK 541

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F TAFM+ GNINLINDV+K I  SGCKIDQ IFQ+AISRYI+ P+KKDLLLQLL+WMP Q
Sbjct: 542  FATAFMKLGNINLINDVLKAIDGSGCKIDQGIFQMAISRYISDPDKKDLLLQLLQWMPGQ 601

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARSKST 2059
            GY VDS+TRNLILKNSHL+  Q IAEMLS QH ++++  +
Sbjct: 602  GYTVDSSTRNLILKNSHLFDRQHIAEMLSKQHMISKASKS 641


>ref|XP_002529286.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis] gi|223531275|gb|EEF33118.1| pentatricopeptide
            repeat-containing protein, putative [Ricinus communis]
          Length = 672

 Score =  779 bits (2011), Expect = 0.0
 Identities = 378/579 (65%), Positives = 477/579 (82%), Gaps = 2/579 (0%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            IL+++ SP L  AL R G ILKA DLN+ILR  G+  R+ DLS+LF+WM++H K++++SY
Sbjct: 90   ILEVQQSPDLDSALRRLGAILKAQDLNVILRNLGKQSRWQDLSKLFDWMQQHSKISVSSY 149

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            +SY+KF G+SLNP KA+E YN I+DE+V+N+  +CNSVLSCL+++ KFD +  LF++MKQ
Sbjct: 150  TSYMKFMGKSLNPAKALEIYNSIADESVKNNVFICNSVLSCLVRSGKFDISLKLFHKMKQ 209

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            NGL PD +TYSTLL+GCIK  DGYSK +  V+EL+ + LQMD V YGT++AV A++ +C+
Sbjct: 210  NGLTPDTITYSTLLSGCIKAKDGYSKTLDFVQELKYNGLQMDTVIYGTILAVCASHNRCE 269

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EAE  F +MK+EGH PNV+HYSSLLNA+++ GNY+K +ELV DMKS GLVPNKV+ TTLL
Sbjct: 270  EAESYFSQMKNEGHLPNVFHYSSLLNAYASSGNYKKAEELVQDMKSLGLVPNKVIWTTLL 329

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY R GLFEKS+QLL ELE  GYA DE+PYCLLMD  SK+G++DEA+   DEM  + VK
Sbjct: 330  KVYVRGGLFEKSQQLLLELETLGYAEDEMPYCLLMDGLSKAGRVDEARSFFDEMKEKNVK 389

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGYAYSIMISA+CR   LEEAK+LAK++E  +DK+DVVI NTMLCAYCR GDMESVMQ 
Sbjct: 390  SDGYAYSIMISAYCRGRLLEEAKQLAKEFEAKYDKYDVVILNTMLCAYCRAGDMESVMQT 449

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            +RKMD LAISP   TF ILIKYF K+KLY LAY+TMEDMH KGHQP EE C++L+ +LGK
Sbjct: 450  MRKMDELAISPSYCTFHILIKYFCKQKLYLLAYQTMEDMHRKGHQPEEELCSMLIFHLGK 509

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
              A+ EA++VY +L+Y +RT+CK LHEKIL +L+ GQLLK+AYVV KDNA+ ISQ A+KK
Sbjct: 510  AKAYTEAFSVYTMLKYGKRTMCKALHEKILHVLLGGQLLKDAYVVVKDNAELISQAAIKK 569

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQ--DIFQLAISRYIAQPEKKDLLLQLLKWMP 1933
            F  AFM+ GNINLINDV+KVI +SG KIDQ  ++FQ+AISRYIAQPEKKDLL+QLL+WMP
Sbjct: 570  FANAFMKLGNINLINDVMKVIHSSGYKIDQASELFQMAISRYIAQPEKKDLLVQLLQWMP 629

Query: 1934 SQGYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARS 2050
              GYVVD++TRNLILK+SHL+G QLIAE+LS QH ++++
Sbjct: 630  GHGYVVDASTRNLILKSSHLFGRQLIAEILSKQHIISKT 668


>ref|XP_004141206.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Cucumis sativus]
          Length = 668

 Score =  771 bits (1992), Expect = 0.0
 Identities = 374/577 (64%), Positives = 471/577 (81%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            I  +++  +L  ALAR G +LKA DLN+ILR FG L R+ DLSQLFEWM++ GK N++SY
Sbjct: 81   IAQVKDCSELAPALARYGGLLKAQDLNVILRHFGMLSRWKDLSQLFEWMQETGKTNVSSY 140

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSYIKF G  LNP KA+E YN+I + +++N   +CNS+L+CL++N KFD++  LF+QMK 
Sbjct: 141  SSYIKFMGRGLNPLKALEVYNNIEEVSIKNSIFICNSILNCLVRNGKFDTSVKLFHQMKN 200

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PD VTYST+L GCI+   GY+KAM+L+KEL+ + L MD V+YGTLIA+ A++ + +
Sbjct: 201  DGLCPDTVTYSTMLTGCIRVKHGYAKAMELLKELQDNGLCMDCVSYGTLIAICASHNRLE 260

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            +AE  F +M++EGHSPN++HY SLLNA+S +G+Y+K DEL+ DMK  GLVPNKV+LTTLL
Sbjct: 261  DAERFFNQMRAEGHSPNMFHYGSLLNAYSINGDYKKADELIEDMKLTGLVPNKVILTTLL 320

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY R GLFEKSR+LLSELE+ GY  +E+PYCLLMD  +K+G I EAK V DEM  + VK
Sbjct: 321  KVYVRGGLFEKSRKLLSELESLGYGENEMPYCLLMDGLAKAGSIREAKTVFDEMKAKNVK 380

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            +DGYA+SIMISAFCR G LEEAK LAKD+E T+D++D+VI NTMLCAYCR G+MESVMQ+
Sbjct: 381  TDGYAHSIMISAFCRGGLLEEAKLLAKDFEATYDRYDIVILNTMLCAYCRAGEMESVMQM 440

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            LRKMD LAISPD NTF ILIKYFFKEKLY L YRT+EDMH KGHQP EE C+ L+++LG 
Sbjct: 441  LRKMDDLAISPDYNTFHILIKYFFKEKLYLLCYRTLEDMHRKGHQPEEELCSSLILSLGN 500

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            + A++EA++VYNIL+YS+RT+CK LHEKIL ILIAG+LLK+AYVV KDNA  IS+PA++K
Sbjct: 501  IRAYSEAFSVYNILKYSKRTMCKALHEKILHILIAGRLLKDAYVVVKDNAGVISKPAIRK 560

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F   FM+ GN+NLINDV+K I  SG KIDQD+F +A SRYI  PEKKDL +QLLKWMP Q
Sbjct: 561  FAFGFMKFGNVNLINDVMKAIHGSGYKIDQDLFMIATSRYIELPEKKDLFIQLLKWMPGQ 620

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARS 2050
            GYVVDS+TRNLILKN+HL+G QLIAE+LS    L++S
Sbjct: 621  GYVVDSSTRNLILKNAHLFGRQLIAEILSKHSLLSKS 657


>ref|XP_002299667.2| hypothetical protein POPTR_0001s21880g [Populus trichocarpa]
            gi|550347847|gb|EEE84472.2| hypothetical protein
            POPTR_0001s21880g [Populus trichocarpa]
          Length = 673

 Score =  763 bits (1969), Expect = 0.0
 Identities = 370/577 (64%), Positives = 470/577 (81%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            IL+++ SP L  AL R G +LK  DLNIILR FG   R+ DLSQLF+WM++H K++ +SY
Sbjct: 93   ILEVQQSPHLDSALQRLGGMLKVQDLNIILRNFGEQCRWQDLSQLFDWMQRHNKISASSY 152

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSYIKF G SLNP KA+E Y+ I DE+ + +  +CNS+L CL++N KFDS+   F++MK 
Sbjct: 153  SSYIKFMGTSLNPAKALEIYHSIPDESKKTNVFICNSLLRCLVRNTKFDSSMKFFHKMKN 212

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            NGL PD +TYSTLLAGC+K  DGYSKA+ LV+EL  + LQMD + YGTL+AV A+N +C+
Sbjct: 213  NGLTPDAITYSTLLAGCMKIKDGYSKALDLVQELNYNGLQMDSIMYGTLLAVCASNNRCE 272

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EA+  F +MK EGHSPN++HYSSLLNA+S+DGNY+K +ELV DMKS+GLVPNKV+LTTLL
Sbjct: 273  EAQSYFNQMKDEGHSPNIFHYSSLLNAYSSDGNYKKAEELVQDMKSSGLVPNKVILTTLL 332

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY R GLFEKSR LL EL+  G+A +E+PYCLLMD  +K+G +DEA+ V +EM  +RVK
Sbjct: 333  KVYVRGGLFEKSRDLLVELDTLGFAKNEMPYCLLMDGLAKNGLLDEARSVFNEMKEKRVK 392

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            S GY+YSIMIS+FCR G  EEAK LA+++E  +DK+DVVI NT+LCAYCR G+ ESVM+ 
Sbjct: 393  SGGYSYSIMISSFCRGGLFEEAKELAEEFEAKYDKYDVVILNTILCAYCRTGEKESVMRT 452

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            +RKMD LAISPD NTF ILIKYF KEKLY LAY+TMEDMH KGHQP EE C+ L+++LGK
Sbjct: 453  MRKMDELAISPDYNTFHILIKYFCKEKLYMLAYQTMEDMHRKGHQPMEELCSSLILHLGK 512

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            + AH EA++VY++L+ S+RT+ K  HE IL ILIAG+LLK+AYVV KDNA+ IS  A+KK
Sbjct: 513  IKAHAEAFSVYSMLKSSKRTMSKAFHEDILHILIAGRLLKDAYVVVKDNAELISPAAIKK 572

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F ++F++ G+INLINDV+KVI  SG KIDQ++F +A+SRYIA+PEKKDLL+QLL+WMP Q
Sbjct: 573  FASSFVKLGDINLINDVMKVIHGSGYKIDQELFLMAVSRYIAEPEKKDLLIQLLQWMPGQ 632

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARS 2050
            GYVVDS+TRNLILKNSHL+G QLIAE+LS QH  +++
Sbjct: 633  GYVVDSSTRNLILKNSHLFGRQLIAEILSKQHMTSKA 669


>ref|XP_006431883.1| hypothetical protein CICLE_v10000525mg [Citrus clementina]
            gi|557534005|gb|ESR45123.1| hypothetical protein
            CICLE_v10000525mg [Citrus clementina]
          Length = 660

 Score =  754 bits (1946), Expect = 0.0
 Identities = 371/580 (63%), Positives = 469/580 (80%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            IL+++ S  L  +L R G ILK  DLN ILR FG L R  D+ QLFEWM++HGK +I+SY
Sbjct: 78   ILEVQQSSDLTSSLERLGGILKVPDLNAILRHFGDLGRGRDVLQLFEWMQQHGKTSISSY 137

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSYIKF G+S N  KA+E YN I+DE+ + +  +CNS+LSCL++N KF+S+  LF +MKQ
Sbjct: 138  SSYIKFLGKSGNSLKALEIYNSITDESDKVNVFICNSILSCLVRNGKFESSLKLFDKMKQ 197

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PD VTY+TLL GCIK+ +GYSKA++LV+EL+ +  QMD+V YG L+A+ A+N  C 
Sbjct: 198  SGLTPDAVTYNTLLTGCIKDKNGYSKALELVQELKYNGAQMDNVMYGILLAICASNNLCA 257

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            +A+  F +MK EGHSPNVYHYSSLLNA+S+ G+Y K DEL+ DMKS+GLVPNKV+LTTLL
Sbjct: 258  KAQSYFNQMKVEGHSPNVYHYSSLLNAYSSGGDYTKADELIQDMKSSGLVPNKVILTTLL 317

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY R GLFEKSR+LL+EL+  GYA +E+PYCLLMD  SK+G +DEA++V +EM  + VK
Sbjct: 318  KVYVRGGLFEKSRELLAELDTLGYAENEMPYCLLMDGLSKAGCLDEARVVFNEMQEKCVK 377

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGYA+SIMISAFCR G  EEAK+LA D+E  +DK+DVV+ N+MLCAYCR GDMESVM +
Sbjct: 378  SDGYAHSIMISAFCRGGCFEEAKQLAGDFEAKYDKYDVVLLNSMLCAYCRTGDMESVMHV 437

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            +RK+D LAISPD NTF ILIKYF KEK+Y LAYRTM DMH KGHQP EE C+ L+ +LGK
Sbjct: 438  MRKLDELAISPDYNTFHILIKYFCKEKMYILAYRTMVDMHRKGHQPEEELCSSLIFHLGK 497

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            M AH+EA +VYN+LRYS+R++CK LHEKIL ILI+G+LLK+AYVV KDN++ IS P +KK
Sbjct: 498  MRAHSEALSVYNMLRYSKRSMCKALHEKILHILISGKLLKDAYVVVKDNSESISHPVIKK 557

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F +AF+R GNINL+NDV+K I  +G +IDQ IF +AI+RYIA+ EKK+LLL+LL+WM  Q
Sbjct: 558  FASAFVRLGNINLVNDVMKAIHTTGYRIDQGIFHIAIARYIAEREKKELLLKLLEWMTGQ 617

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARSKST 2059
            GYVVDS+TRNLILKNSHL G QLIA++LS QH  ++S  T
Sbjct: 618  GYVVDSSTRNLILKNSHLLGRQLIADILSKQHMKSKSSKT 657


>gb|EOX98058.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma
            cacao]
          Length = 717

 Score =  749 bits (1935), Expect = 0.0
 Identities = 371/576 (64%), Positives = 465/576 (80%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            +L+++ S  L  AL   G ILK  DLN+I+R FG+L ++  LS+LF WM++HGK N +SY
Sbjct: 73   LLEVQQSSDLNSALQNFGGILKPQDLNVIIRHFGKLGKWHHLSELFAWMQQHGKTNGSSY 132

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSYIK  G+ L+P KA+E YN I DE+ R +  +CNS+LS L++N KF+S   LF +MKQ
Sbjct: 133  SSYIKIMGKKLSPIKALEIYNSIPDESTRINVFICNSLLSSLVRNGKFESGIKLFDKMKQ 192

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PD VTY+TLLAGCIK   G+SKA++L+KEL+ + L+MD V YGTL+AV A++G  +
Sbjct: 193  DGLTPDSVTYNTLLAGCIKIKHGHSKALELIKELKYNGLKMDSVMYGTLLAVCASSGLHE 252

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EA+  F +M+ EGHSPN+YHYSSLLNA+S DGNY K DELV  MKS+GLVPNKV+LTTLL
Sbjct: 253  EAQNYFNQMREEGHSPNLYHYSSLLNAYSYDGNYCKADELVEQMKSSGLVPNKVILTTLL 312

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY R GLFEKS +LL+ELEA GYA DE+P+CLLMD  SK+G++DEA+ V  EM ++ VK
Sbjct: 313  KVYVRGGLFEKSTKLLAELEALGYAEDEMPFCLLMDGLSKAGRLDEARSVFVEMQQKCVK 372

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGY++SIMISA CR+G  EEAK LA+D+E  ++K+D+V+ NTMLCAYCR G+MESVMQ 
Sbjct: 373  SDGYSHSIMISALCRAGLFEEAKELAQDFEAQYNKYDLVMLNTMLCAYCRAGEMESVMQT 432

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            ++KMD LAISPD NTF ILIKYF KEKLY LAY+TMEDMH KG+ P EE C+ L+  LGK
Sbjct: 433  MKKMDELAISPDYNTFHILIKYFCKEKLYLLAYKTMEDMHGKGYHPEEELCSSLIFQLGK 492

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            M AH EA++VYN+LRYS+RT+CK LHEKIL ILIAGQLLK+AYVV KDNA+ ISQPA+ K
Sbjct: 493  MKAHLEAFSVYNMLRYSKRTMCKALHEKILHILIAGQLLKDAYVVVKDNAELISQPAITK 552

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F TAFM+ GNIN+INDV+KV+  SG KIDQ +FQ+AISRY+ QPEKK+LLLQLL+WMP  
Sbjct: 553  FATAFMKLGNINMINDVLKVLHGSGYKIDQGLFQMAISRYLGQPEKKELLLQLLQWMPGH 612

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALAR 2047
            GYVVDS+TRN+ILKNS L G QL AE+LS QH +++
Sbjct: 613  GYVVDSSTRNMILKNSQLLGRQLTAEILSKQHMMSK 648


>gb|EOX98059.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2
            [Theobroma cacao]
          Length = 649

 Score =  740 bits (1911), Expect = 0.0
 Identities = 372/581 (64%), Positives = 466/581 (80%), Gaps = 1/581 (0%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            +L+++ S  L  AL   G ILK  DLN+I+R FG+L ++  LS+LF WM++HGK N +SY
Sbjct: 73   LLEVQQSSDLNSALQNFGGILKPQDLNVIIRHFGKLGKWHHLSELFAWMQQHGKTNGSSY 132

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSYIK  G+ L+P KA+E YN I DE+ R +  +CNS+LS L++N KF+S   LF +MKQ
Sbjct: 133  SSYIKIMGKKLSPIKALEIYNSIPDESTRINVFICNSLLSSLVRNGKFESGIKLFDKMKQ 192

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PD VTY+TLLAGCIK   G+SKA++L+KEL+ + L+MD V YGTL+AV A++G  +
Sbjct: 193  DGLTPDSVTYNTLLAGCIKIKHGHSKALELIKELKYNGLKMDSVMYGTLLAVCASSGLHE 252

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EA+  F +M+ EGHSPN+YHYSSLLNA+S DGNY K DELV  MKS+GLVPNKV+LTTLL
Sbjct: 253  EAQNYFNQMREEGHSPNLYHYSSLLNAYSYDGNYCKADELVEQMKSSGLVPNKVILTTLL 312

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY R GLFEKS +LL+ELEA GYA DE+P+CLLMD  SK+G++DEA+ V  EM ++ VK
Sbjct: 313  KVYVRGGLFEKSTKLLAELEALGYAEDEMPFCLLMDGLSKAGRLDEARSVFVEMQQKCVK 372

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGY++SIMISA CR+G  EEAK LA+D+E  ++K+D+V+ NTMLCAYCR G+MESVMQ 
Sbjct: 373  SDGYSHSIMISALCRAGLFEEAKELAQDFEAQYNKYDLVMLNTMLCAYCRAGEMESVMQT 432

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            ++KMD LAISPD NTF ILIKYF KEKLY LAY+TMEDMH KG+ P EE C+ L+  LGK
Sbjct: 433  MKKMDELAISPDYNTFHILIKYFCKEKLYLLAYKTMEDMHGKGYHPEEELCSSLIFQLGK 492

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            M AH EA++VYN+LRYS+RT+CK LHEKIL ILIAGQLLK+AYVV KDNA+ ISQPA+ K
Sbjct: 493  MKAHLEAFSVYNMLRYSKRTMCKALHEKILHILIAGQLLKDAYVVVKDNAELISQPAITK 552

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F TAFM+ GNIN+INDV+KV+  SG KID    Q+AISRY+ QPEKK+LLLQLL+WMP  
Sbjct: 553  FATAFMKLGNINMINDVLKVLHGSGYKID----QMAISRYLGQPEKKELLLQLLQWMPGH 608

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALAR-SKST 2059
            GYVVDS+TRN+ILKNS L G QL AE+LS QH +++ S+ST
Sbjct: 609  GYVVDSSTRNMILKNSQLLGRQLTAEILSKQHMMSKVSRST 649


>ref|XP_006343482.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X2 [Solanum tuberosum]
          Length = 651

 Score =  736 bits (1900), Expect = 0.0
 Identities = 366/581 (62%), Positives = 459/581 (79%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            IL I++S  L  ALAR G+ LK  D+N+ILR FG+L R  +L Q FEWM+++ K+N+ SY
Sbjct: 66   ILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLSRRRELYQAFEWMQQNQKINVASY 125

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSY+KF G+SL+   AVE Y  I D +++ + SVCN+ LS LIKN K +S+  LF QMK+
Sbjct: 126  SSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLFTQMKR 185

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PDV TYSTLLAGC K   GY KA++LV+EL S+ LQMD VTYG+L++V A++ +C 
Sbjct: 186  DGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGSLLSVCASHKECN 245

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EA + F +MK EGHSPNVYHYSSLLNA+SAD NY+K + L+ +M+SAGLV NKV+ TTLL
Sbjct: 246  EAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAGLVLNKVIYTTLL 305

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY + GLFEKS++LL ELEA GYA DE+P+CLLMD  +KSG + EAK V DEM  + VK
Sbjct: 306  KVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMMEKHVK 365

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            +DGY+YSIMISAFCRSG LE+AK++A ++E  +DK+D+VI N ML AYCR G ME+VM +
Sbjct: 366  TDGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSAYCRAGKMENVMSM 425

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            ++KMD  AISPD NTF ILI+YF KEKLY LAYRTMEDMHSKGHQP E  C+ L+ +LGK
Sbjct: 426  MKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIYHLGK 485

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
             GAH+EA++VYN+LRYS+RTI   LHE IL ILIAG+LLK+AYVV KDNA  ISQPA+KK
Sbjct: 486  TGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVVKDNAGFISQPAIKK 545

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F   FMRSGN+NLINDV+  + +SG KIDQ++F LAI+RYIA+PEKK+LLL LLKWMP +
Sbjct: 546  FSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKKELLLWLLKWMPGK 605

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARSKSTH 2062
            GY +DS+TRNLILKNSHL+GHQLIAE LS    +++    H
Sbjct: 606  GYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSKKVKLH 646


>ref|XP_004250704.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Solanum lycopersicum]
          Length = 642

 Score =  733 bits (1891), Expect = 0.0
 Identities = 363/576 (63%), Positives = 459/576 (79%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            IL I++S  L  ALAR G+ LK  D+N+ILR FG+L R  +L Q+FEWM+++ K+N+ SY
Sbjct: 66   ILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLNRRPELCQVFEWMQQNQKINVASY 125

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSY+KF G+SL+   AVE Y  I D +++ + SVCN+ LS LIKN K +S+  LF QMK+
Sbjct: 126  SSYVKFMGKSLSCVDAVEMYRDIKDRSIKFNVSVCNAFLSSLIKNGKSESSLKLFTQMKR 185

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PDV TYSTLLAGC K   GY KA++LV+E+ S+ L+MD VTYG+L++V A++ +C 
Sbjct: 186  DGLVPDVFTYSTLLAGCAKVNGGYYKALELVQEMMSNGLEMDSVTYGSLLSVCASHKECN 245

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EA + F +MK EGHSPNVYHYSSLLNA+SAD NY+K + L+ +M+SAGLV NKV+ TTLL
Sbjct: 246  EAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEALIEEMRSAGLVLNKVIYTTLL 305

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY + GLFEKS++LL ELEA GYA DE+P+CLLMD  +KSG + EAK V DEM  ++VK
Sbjct: 306  KVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMMEKQVK 365

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            +DGY+YSIMISAFCR G LE+AK+LA ++E  +DK+D+VI N ML AYCR G ME+VM +
Sbjct: 366  TDGYSYSIMISAFCRRGLLEDAKKLASEFEEKYDKYDIVILNAMLSAYCRAGKMENVMSM 425

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            ++KMD  AISPD NTF ILI+YF KEKLY LAYRTMEDMHSKGHQP E  C+ L+ +LGK
Sbjct: 426  MKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIYHLGK 485

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
             GAH+EA++VYN+LRYS+RTI   LHE IL ILIAG+LLK+AYVV KDNA  ISQPA+KK
Sbjct: 486  TGAHSEAFSVYNMLRYSKRTISNALHENILHILIAGRLLKDAYVVVKDNAGFISQPAIKK 545

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F   FMRSGN+NLINDV+  + +SG KIDQ++F LAI+RYIA+PEKK+LLL LLKWMP +
Sbjct: 546  FSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKKELLLWLLKWMPVK 605

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALAR 2047
            GY +DS+TRNLILKNSHL+GHQLIAE LS    +++
Sbjct: 606  GYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSK 641


>ref|XP_006343481.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X1 [Solanum tuberosum]
          Length = 652

 Score =  731 bits (1888), Expect = 0.0
 Identities = 366/582 (62%), Positives = 459/582 (78%), Gaps = 1/582 (0%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            IL I++S  L  ALAR G+ LK  D+N+ILR FG+L R  +L Q FEWM+++ K+N+ SY
Sbjct: 66   ILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLSRRRELYQAFEWMQQNQKINVASY 125

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSY+KF G+SL+   AVE Y  I D +++ + SVCN+ LS LIKN K +S+  LF QMK+
Sbjct: 126  SSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLFTQMKR 185

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PDV TYSTLLAGC K   GY KA++LV+EL S+ LQMD VTYG+L++V A++ +C 
Sbjct: 186  DGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGSLLSVCASHKECN 245

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EA + F +MK EGHSPNVYHYSSLLNA+SAD NY+K + L+ +M+SAGLV NKV+ TTLL
Sbjct: 246  EAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAGLVLNKVIYTTLL 305

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY + GLFEKS++LL ELEA GYA DE+P+CLLMD  +KSG + EAK V DEM  + VK
Sbjct: 306  KVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMMEKHVK 365

Query: 1220 S-DGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQ 1396
            + DGY+YSIMISAFCRSG LE+AK++A ++E  +DK+D+VI N ML AYCR G ME+VM 
Sbjct: 366  TADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSAYCRAGKMENVMS 425

Query: 1397 ILRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLG 1576
            +++KMD  AISPD NTF ILI+YF KEKLY LAYRTMEDMHSKGHQP E  C+ L+ +LG
Sbjct: 426  MMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIYHLG 485

Query: 1577 KMGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMK 1756
            K GAH+EA++VYN+LRYS+RTI   LHE IL ILIAG+LLK+AYVV KDNA  ISQPA+K
Sbjct: 486  KTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVVKDNAGFISQPAIK 545

Query: 1757 KFLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPS 1936
            KF   FMRSGN+NLINDV+  + +SG KIDQ++F LAI+RYIA+PEKK+LLL LLKWMP 
Sbjct: 546  KFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKKELLLWLLKWMPG 605

Query: 1937 QGYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARSKSTH 2062
            +GY +DS+TRNLILKNSHL+GHQLIAE LS    +++    H
Sbjct: 606  KGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSKKVKLH 647


>ref|XP_006343483.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like isoform X3 [Solanum tuberosum]
          Length = 646

 Score =  729 bits (1883), Expect = 0.0
 Identities = 365/577 (63%), Positives = 458/577 (79%), Gaps = 1/577 (0%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            IL I++S  L  ALAR G+ LK  D+N+ILR FG+L R  +L Q FEWM+++ K+N+ SY
Sbjct: 66   ILHIQDSSDLASALARHGDTLKVQDMNVILRYFGKLSRRRELYQAFEWMQQNQKINVASY 125

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SSY+KF G+SL+   AVE Y  I D +++ + SVCN+ LS LIKN K +S+  LF QMK+
Sbjct: 126  SSYVKFMGKSLSCVDAVEMYRDIKDRSIKYNVSVCNAFLSSLIKNGKSESSLKLFTQMKR 185

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PDV TYSTLLAGC K   GY KA++LV+EL S+ LQMD VTYG+L++V A++ +C 
Sbjct: 186  DGLVPDVFTYSTLLAGCAKVNGGYYKALELVQELMSNGLQMDSVTYGSLLSVCASHKECN 245

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EA + F +MK EGHSPNVYHYSSLLNA+SAD NY+K + L+ +M+SAGLV NKV+ TTLL
Sbjct: 246  EAAKYFQKMKDEGHSPNVYHYSSLLNAYSADRNYEKAEVLIEEMRSAGLVLNKVIYTTLL 305

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY + GLFEKS++LL ELEA GYA DE+P+CLLMD  +KSG + EAK V DEM  + VK
Sbjct: 306  KVYVKGGLFEKSKELLKELEALGYAKDEMPFCLLMDGLAKSGHLLEAKSVFDEMMEKHVK 365

Query: 1220 S-DGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQ 1396
            + DGY+YSIMISAFCRSG LE+AK++A ++E  +DK+D+VI N ML AYCR G ME+VM 
Sbjct: 366  TADGYSYSIMISAFCRSGLLEDAKKVASEFEEKYDKYDIVILNAMLSAYCRAGKMENVMS 425

Query: 1397 ILRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLG 1576
            +++KMD  AISPD NTF ILI+YF KEKLY LAYRTMEDMHSKGHQP E  C+ L+ +LG
Sbjct: 426  MMKKMDDSAISPDWNTFNILIRYFCKEKLYLLAYRTMEDMHSKGHQPEEGLCSSLIYHLG 485

Query: 1577 KMGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMK 1756
            K GAH+EA++VYN+LRYS+RTI   LHE IL ILIAG+LLK+AYVV KDNA  ISQPA+K
Sbjct: 486  KTGAHSEAFSVYNMLRYSKRTISNALHEHILHILIAGRLLKDAYVVVKDNAGFISQPAIK 545

Query: 1757 KFLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPS 1936
            KF   FMRSGN+NLINDV+  + +SG KIDQ++F LAI+RYIA+PEKK+LLL LLKWMP 
Sbjct: 546  KFSVNFMRSGNVNLINDVMNAMHSSGHKIDQELFDLAIARYIAKPEKKELLLWLLKWMPG 605

Query: 1937 QGYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALAR 2047
            +GY +DS+TRNLILKNSHL+GHQLIAE LS    +++
Sbjct: 606  KGYAIDSSTRNLILKNSHLFGHQLIAESLSKHLVMSK 642


>ref|XP_006417404.1| hypothetical protein EUTSA_v10007006mg [Eutrema salsugineum]
            gi|557095175|gb|ESQ35757.1| hypothetical protein
            EUTSA_v10007006mg [Eutrema salsugineum]
          Length = 666

 Score =  729 bits (1881), Expect = 0.0
 Identities = 356/577 (61%), Positives = 466/577 (80%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            I ++  SP  L +L R   +LK  DLN+ILR+FG   R+ DL QLF+WM++ GK+++++Y
Sbjct: 77   ISEVERSPDFLSSLQRLAGVLKVQDLNVILRDFGISGRWQDLIQLFDWMQQQGKISVSTY 136

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SS IKF G   + +KA+E Y  I DE+ + +  +CNS+LSCL+KN K +S   LF QMK+
Sbjct: 137  SSCIKFVGAK-SVSKALEIYQSIPDESTKINVYICNSILSCLVKNGKLESCFKLFDQMKR 195

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL+PDV+TY+TLLAGCIK  +GYSKAM+LV EL  + +QMD V YGT++A+ A+NG+C+
Sbjct: 196  DGLKPDVITYNTLLAGCIKVKNGYSKAMELVGELPHNGIQMDGVMYGTVLAICASNGRCE 255

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EAE    +MK +GHSPN+YHYSSLLN++S  G+Y+K DEL+ +MKS G+VPNKVM+TTLL
Sbjct: 256  EAESFIQQMKVKGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSVGIVPNKVMMTTLL 315

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY R GLFE+SR+LLSELE++GYA +E+PYC+LMD  SK+G+ +EA+ + DEM  + VK
Sbjct: 316  KVYIRGGLFERSRELLSELESAGYAENEMPYCMLMDGLSKAGKFEEARSIFDEMKGKGVK 375

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGYA SIMISA CRS   EEAK+LA+D E T++K D+V+ NTMLCAYCR G+MESVM++
Sbjct: 376  SDGYANSIMISALCRSKRFEEAKQLARDSESTYEKCDLVMLNTMLCAYCRAGEMESVMRM 435

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            ++KMD  A+SPD NTF ILIKYF KEKL+ LAY+T+ DMHSKGH+  EE C+ L+ +LGK
Sbjct: 436  MKKMDEQAVSPDYNTFHILIKYFIKEKLHLLAYQTLLDMHSKGHRLEEELCSSLIYHLGK 495

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            + AH+EA++VY++LRYS+RTICK LHEKIL ILI G+LLK+AYVV KDNAK ISQP +K+
Sbjct: 496  IRAHSEAFSVYSMLRYSKRTICKDLHEKILHILIHGKLLKDAYVVVKDNAKMISQPTLKR 555

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F  AFM SGN+NL+NDV+KV+  SG KIDQ  F++AISRYI+QP+KK+LLLQLL+WMP Q
Sbjct: 556  FGRAFMNSGNVNLVNDVLKVLHGSGHKIDQVQFEIAISRYISQPDKKELLLQLLQWMPGQ 615

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARS 2050
            GYVVDS+TRNLILKNS+L+G QLIAE+LS  H  +R+
Sbjct: 616  GYVVDSSTRNLILKNSNLFGRQLIAEILSKHHIASRT 652


>gb|ESW20506.1| hypothetical protein PHAVU_006G214900g [Phaseolus vulgaris]
          Length = 639

 Score =  722 bits (1863), Expect = 0.0
 Identities = 355/578 (61%), Positives = 456/578 (78%)
 Frame = +2

Query: 323  LDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSYS 502
            L+I+ S  L  ALAR GE L   DLN  L  F    ++  +SQLF+WM+++ K++++SYS
Sbjct: 59   LEIQRSSDLPSALARLGETLTVKDLNAALYHFKNSNKFNHISQLFKWMQENNKLDVSSYS 118

Query: 503  SYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQN 682
             Y++F   +L+  + ++ Y+ I DE+ R +  VCNSVL CLIK  KFDS   LF QM+ +
Sbjct: 119  HYMRFMANNLDAAEMLQLYHSIQDESARKNILVCNSVLGCLIKKGKFDSGMKLFRQMQLD 178

Query: 683  GLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCKE 862
            GL PD VTYSTLLAGCIK  +GY KA++L++EL+ S LQMD V YGT++AV A+NG+ +E
Sbjct: 179  GLVPDPVTYSTLLAGCIKIENGYPKALELIQELQHSKLQMDGVIYGTILAVCASNGKWEE 238

Query: 863  AEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLLK 1042
            AE+ F +MK EGHS NVYHYSSLLNA+S  GNY+K D L  DMKS GLVPNKV+LTTLLK
Sbjct: 239  AEKYFNQMKDEGHSRNVYHYSSLLNAYSTCGNYKKADILFQDMKSEGLVPNKVILTTLLK 298

Query: 1043 VYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVKS 1222
            VY + GLF+KSR+LL+EL++ GYA DE+PYC+LMD  +K+GQI EAKL+ DEM +  V+S
Sbjct: 299  VYVKGGLFDKSRELLAELKSLGYAEDEMPYCILMDGLAKAGQIHEAKLIFDEMMKNHVRS 358

Query: 1223 DGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQIL 1402
            DGYA+SIMISA CRS    EAK+LAKD+E T +K+D+VI N+MLCA+CRVG+MESVM+ L
Sbjct: 359  DGYAHSIMISALCRSKLFREAKQLAKDFETTSNKYDIVILNSMLCAFCRVGEMESVMETL 418

Query: 1403 RKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGKM 1582
            +KMD LAISP  NTF ILIKYF +EK+Y LAYRTM+DMHSKGHQP EE C+ L+ +LG++
Sbjct: 419  KKMDELAISPSYNTFHILIKYFCREKMYLLAYRTMKDMHSKGHQPGEELCSTLISHLGQV 478

Query: 1583 GAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKKF 1762
             A++EA++VYN+LRY +RT+CK+LHEKIL IL+AG LLK+AYVV KDNAK IS+P  KKF
Sbjct: 479  NAYSEAFSVYNMLRYGKRTMCKSLHEKILYILLAGHLLKDAYVVVKDNAKYISRPPTKKF 538

Query: 1763 LTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQG 1942
              AFM+SGNIN INDV+K +  SG K+DQD+F +A+SRY+ +PEKKDLLL LL+WM  QG
Sbjct: 539  AIAFMKSGNINYINDVLKTLHDSGYKLDQDLFAMAVSRYLGEPEKKDLLLHLLQWMSGQG 598

Query: 1943 YVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARSKS 2056
            Y+VDS+TRNLILK+SHL+G QLIAE+LS Q    + K+
Sbjct: 599  YMVDSSTRNLILKHSHLFGRQLIAEVLSKQQVQLKHKN 636


>ref|XP_004486236.1| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Cicer arietinum]
          Length = 642

 Score =  720 bits (1859), Expect = 0.0
 Identities = 350/578 (60%), Positives = 459/578 (79%)
 Frame = +2

Query: 323  LDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSYS 502
            L +  +  L   L++ G+ L   +LN  L  FG   ++  +SQLF WM+++ K+++ SYS
Sbjct: 62   LQLHRASDLNSVLSKVGKTLTVKELNSTLHHFGNSNKFNHISQLFLWMQENKKLDVYSYS 121

Query: 503  SYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQN 682
            +YIKF    L+ +  ++ YN+I DE+ +++  VCNSVLSCLIK  KFD+A  LF+QMKQ+
Sbjct: 122  NYIKFMANKLDASTVLKLYNNIQDESAKDNVYVCNSVLSCLIKKGKFDTAIKLFHQMKQD 181

Query: 683  GLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCKE 862
            GL PD+VTYS L+AGC+K  DGYSKA+QL++EL+ + L+MD+V YG ++AV A+NG+ +E
Sbjct: 182  GLVPDLVTYSMLIAGCVKVKDGYSKALQLIQELQDNKLRMDNVIYGAILAVCASNGKWEE 241

Query: 863  AEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLLK 1042
            AE  F  MK+EGHSPNVYHYSSLLNA+SA GN++K D L+ DMKS GLVPNKV+LTTLLK
Sbjct: 242  AEHYFNGMKNEGHSPNVYHYSSLLNAYSASGNFKKADSLIQDMKSEGLVPNKVILTTLLK 301

Query: 1043 VYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVKS 1222
            VY R GL EKSR+LL++LE+  YA DE+PYC+LMD  +K+GQ+ EAK+V DEM ++ V+S
Sbjct: 302  VYVRGGLLEKSRELLTKLESLSYAEDEMPYCVLMDGLAKAGQVHEAKIVFDEMMKKHVRS 361

Query: 1223 DGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQIL 1402
            DGYA+SIMISAFCR+   EEAK+LAK+++ T +K+DVVI N+MLCA+CR G+MESVM+ L
Sbjct: 362  DGYAHSIMISAFCRAKLFEEAKQLAKNFQTTFNKYDVVIMNSMLCAFCRAGEMESVMETL 421

Query: 1403 RKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGKM 1582
            RKMD LAISPD NTF ILIKYF ++ +Y LAY+TMEDMHSKG+QP EE C+ L+ +LG+ 
Sbjct: 422  RKMDELAISPDYNTFNILIKYFCRQNMYLLAYQTMEDMHSKGYQPVEELCSSLIYHLGQA 481

Query: 1583 GAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKKF 1762
             A++EA++VYN+L+YS+RTI KTLHEKIL IL+AG+LLK+AYVV KDNA  IS    KKF
Sbjct: 482  NAYSEAFSVYNMLKYSKRTIRKTLHEKILHILLAGKLLKDAYVVFKDNATFISGHTTKKF 541

Query: 1763 LTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQG 1942
             +AFM+ GNINLINDV+K +   G KIDQD+F++A++RY+ QPEKKDLLL LL+WMP QG
Sbjct: 542  ASAFMKLGNINLINDVMKTLHNCGYKIDQDLFEMAVTRYLGQPEKKDLLLHLLQWMPGQG 601

Query: 1943 YVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARSKS 2056
            YVVD +TRNLILKNSHL+G QLIAE+LS Q    + K+
Sbjct: 602  YVVDPSTRNLILKNSHLFGRQLIAEVLSKQRVSLKPKN 639


>ref|XP_003531588.2| PREDICTED: pentatricopeptide repeat-containing protein At1g10910,
            chloroplastic-like [Glycine max]
          Length = 646

 Score =  719 bits (1855), Expect = 0.0
 Identities = 352/579 (60%), Positives = 462/579 (79%), Gaps = 1/579 (0%)
 Frame = +2

Query: 323  LDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSYS 502
            L++RN+  L  ALAR G+ L   DLN  L  F +  ++  +SQLF WM+++ K++  SYS
Sbjct: 65   LEVRNASDLASALARVGDALTVKDLNAALYHFKKSNKFNHISQLFSWMQENNKLDALSYS 124

Query: 503  SYIKF-TGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
             YI+F    +L+  K ++ Y+ I +++ + +  VCNSVLSCLIK  KF+SA +LF QMK 
Sbjct: 125  HYIRFMASHNLDAAKMLQLYHSIQNQSAKINVLVCNSVLSCLIKKAKFNSALNLFQQMKL 184

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL PD+VTY+TLLAGCIK  +GY+KA++L++EL+ + LQMD V YGT++AV A+N + +
Sbjct: 185  DGLLPDLVTYTTLLAGCIKIENGYAKALELIQELQHNKLQMDGVIYGTIMAVCASNTKWE 244

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EAE  F +MK EGH+PNVYHYSSL+NA+SA GNY+K D L+ DMKS GLVPNKV+LTTLL
Sbjct: 245  EAEYYFNQMKDEGHTPNVYHYSSLINAYSACGNYKKADMLIQDMKSEGLVPNKVILTTLL 304

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY + GLFEKSR+LL+EL++ GYA DE+PYC+ MD  +K+GQI EAKL+ DEM +  V+
Sbjct: 305  KVYVKGGLFEKSRELLAELKSLGYAEDEMPYCIFMDGLAKAGQIHEAKLIFDEMMKNHVR 364

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGYA+SIMISAFCR+    EAK+LAKD+E T +K+D+VI N+MLCA+CRVG+ME VM+ 
Sbjct: 365  SDGYAHSIMISAFCRAKLFREAKQLAKDFETTSNKYDLVILNSMLCAFCRVGEMERVMET 424

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            L+KMD LAI+P  NTF ILIKYF +EK+Y LAYRTM+DMHSKGHQP EE C+ L+ +LG+
Sbjct: 425  LKKMDELAINPGYNTFHILIKYFCREKMYLLAYRTMKDMHSKGHQPVEELCSSLISHLGQ 484

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            + A++EA++VYN+L+YS+RT+CK+LHEKIL IL+AGQLLK+AYVV KDNAK IS+PA KK
Sbjct: 485  VNAYSEAFSVYNMLKYSKRTMCKSLHEKILHILLAGQLLKDAYVVVKDNAKFISRPATKK 544

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F +AFM+SGN+N INDV+K +   G K+DQD+F +A+SRY+ QPEKKDLLL LL+WM  Q
Sbjct: 545  FASAFMKSGNLNYINDVLKTLHDCGYKLDQDLFAMAVSRYLDQPEKKDLLLHLLQWMAGQ 604

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARSKS 2056
            GY VDS+TRNLILKNSHL+G QLIAE+LS Q    + K+
Sbjct: 605  GYAVDSSTRNLILKNSHLFGRQLIAEVLSKQQVKLKHKN 643


>ref|NP_172560.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
            gi|122242678|sp|Q0WVV0.1|PPR31_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At1g10910, chloroplastic; Flags: Precursor
            gi|110741600|dbj|BAE98748.1| membrane-associated
            salt-inducible protein isolog [Arabidopsis thaliana]
            gi|332190541|gb|AEE28662.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 664

 Score =  718 bits (1853), Expect = 0.0
 Identities = 352/576 (61%), Positives = 459/576 (79%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            I +++ S   L +L R   +LK  DLN+ILR+FG   R+ DL QLFEWM++HGK+++++Y
Sbjct: 76   ISEVQRSSDFLSSLQRLATVLKVQDLNVILRDFGISGRWQDLIQLFEWMQQHGKISVSTY 135

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SS IKF G   N +KA+E Y  I DE+ + +  +CNS+LSCL+KN K DS   LF QMK+
Sbjct: 136  SSCIKFVGAK-NVSKALEIYQSIPDESTKINVYICNSILSCLVKNGKLDSCIKLFDQMKR 194

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
            +GL+PDVVTY+TLLAGCIK  +GY KA++L+ EL  + +QMD V YGT++A+ A+NG+ +
Sbjct: 195  DGLKPDVVTYNTLLAGCIKVKNGYPKAIELIGELPHNGIQMDSVMYGTVLAICASNGRSE 254

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EAE    +MK EGHSPN+YHYSSLLN++S  G+Y+K DEL+ +MKS GLVPNKVM+TTLL
Sbjct: 255  EAENFIQQMKVEGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMTTLL 314

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY + GLF++SR+LLSELE++GYA +E+PYC+LMD  SK+G+++EA+ + D+M  + V+
Sbjct: 315  KVYIKGGLFDRSRELLSELESAGYAENEMPYCMLMDGLSKAGKLEEARSIFDDMKGKGVR 374

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGYA SIMISA CRS   +EAK L++D E T++K D+V+ NTMLCAYCR G+MESVM++
Sbjct: 375  SDGYANSIMISALCRSKRFKEAKELSRDSETTYEKCDLVMLNTMLCAYCRAGEMESVMRM 434

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            ++KMD  A+SPD NTF ILIKYF KEKL+ LAY+T  DMHSKGH+  EE C+ L+ +LGK
Sbjct: 435  MKKMDEQAVSPDYNTFHILIKYFIKEKLHLLAYQTTLDMHSKGHRLEEELCSSLIYHLGK 494

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            + A  EA++VYN+LRYS+RTICK LHEKIL ILI G LLK+AY+V KDNAK ISQP +KK
Sbjct: 495  IRAQAEAFSVYNMLRYSKRTICKELHEKILHILIQGNLLKDAYIVVKDNAKMISQPTLKK 554

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F  AFM SGNINL+NDV+KV+  SG KIDQ  F++AISRYI+QP+KK+LLLQLL+WMP Q
Sbjct: 555  FGRAFMISGNINLVNDVLKVLHGSGHKIDQVQFEIAISRYISQPDKKELLLQLLQWMPGQ 614

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALAR 2047
            GYVVDS+TRNLILKNSH++G  LIAE+LS  H  +R
Sbjct: 615  GYVVDSSTRNLILKNSHMFGRLLIAEILSKHHVASR 650


>ref|XP_002889841.1| hypothetical protein ARALYDRAFT_888388 [Arabidopsis lyrata subsp.
            lyrata] gi|297335683|gb|EFH66100.1| hypothetical protein
            ARALYDRAFT_888388 [Arabidopsis lyrata subsp. lyrata]
          Length = 665

 Score =  717 bits (1851), Expect = 0.0
 Identities = 351/576 (60%), Positives = 458/576 (79%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            I +++ S   L +L R   +LK  DLN+ILR+FG   R+ DL QLF+WM++HGK+++++Y
Sbjct: 77   ISEVQRSSDFLSSLHRLERVLKVQDLNVILRDFGISGRWQDLIQLFDWMQQHGKISVSTY 136

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SS IKF G   N +KA+E Y  I DE+ + +  +CNS+LSCL+KN K DS   LF QMK+
Sbjct: 137  SSCIKFVGAK-NVSKALEIYQSIPDESTKINVYICNSILSCLVKNGKLDSCIKLFDQMKR 195

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
             GL+PDV+TY+TLLAGCIK  +GY KA++L+ EL  + +QMD V YGT++A+ A+NG+C+
Sbjct: 196  GGLKPDVITYNTLLAGCIKVKNGYPKAVELIGELPHNGIQMDSVMYGTVLAICASNGRCE 255

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EAE    +MK+EGHSPN+YHYSSLLN++S  G+Y+K DEL+ +MKS GLVPNKVM+TTLL
Sbjct: 256  EAENFIQQMKAEGHSPNIYHYSSLLNSYSWKGDYKKADELMTEMKSIGLVPNKVMMTTLL 315

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY + GLF++SR+LLSELE++GYA +E+PYC+LMD  SK+G+++EA+ + D+M  + VK
Sbjct: 316  KVYIKGGLFDRSRELLSELESAGYAENEMPYCMLMDGLSKAGKLEEARSIFDDMKGKGVK 375

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGYA SIMISA CRS   EEAK L++D E T++K D+V+ NTMLCAYCR G+MESVM++
Sbjct: 376  SDGYANSIMISALCRSKRFEEAKELSRDSETTYEKCDLVMLNTMLCAYCRAGEMESVMRM 435

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            ++KMD  AI PD NTF ILIKYF KEKL+ LAY+T  DMHSKGH+  EE C+ L+ +LGK
Sbjct: 436  MKKMDEQAIIPDYNTFHILIKYFIKEKLHLLAYQTTLDMHSKGHRLEEELCSSLIYHLGK 495

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
            + A +EA++VYN+LRYS+RTICK LHEKIL ILI G LLK+AY+V KDNAK ISQP +KK
Sbjct: 496  IRAPSEAFSVYNMLRYSKRTICKELHEKILHILIHGDLLKDAYIVVKDNAKMISQPTLKK 555

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F  AFM SGNINL+NDV+KV+  SG KIDQ  F++AISRYI  P+KK+LLLQLL+WMP Q
Sbjct: 556  FGRAFMISGNINLVNDVLKVLHGSGHKIDQVQFEIAISRYILLPDKKELLLQLLQWMPGQ 615

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALAR 2047
            GY+VDS+TRNLILKNSH++G  LIAE+LS  H  +R
Sbjct: 616  GYIVDSSTRNLILKNSHMFGRLLIAEILSKHHVASR 651


>ref|XP_006826767.1| hypothetical protein AMTR_s00136p00085920 [Amborella trichopoda]
            gi|548831187|gb|ERM94004.1| hypothetical protein
            AMTR_s00136p00085920 [Amborella trichopoda]
          Length = 690

 Score =  705 bits (1820), Expect = 0.0
 Identities = 344/580 (59%), Positives = 459/580 (79%)
 Frame = +2

Query: 320  ILDIRNSPKLLDALARSGEILKAHDLNIILREFGRLKRYTDLSQLFEWMEKHGKVNITSY 499
            I +I+ +  L  AL+R G  L+  DLNIILR FG+  ++ ++SQLF WM+K GKVNI+SY
Sbjct: 106  ITEIQGASDLGSALSRLGGKLQLQDLNIILRNFGKSNKWREISQLFNWMQKLGKVNISSY 165

Query: 500  SSYIKFTGESLNPTKAVEAYNHISDEAVRNHTSVCNSVLSCLIKNRKFDSASSLFYQMKQ 679
            SS+IK+ G S N  KA++ Y  I DE      +VCNS+L CL +N KF+S+  LF QMK+
Sbjct: 166  SSFIKYMGRSGNTVKALQVYQSIKDEPTLYDVTVCNSILGCLARNGKFESSIKLFEQMKK 225

Query: 680  NGLEPDVVTYSTLLAGCIKNMDGYSKAMQLVKELESSSLQMDDVTYGTLIAVYAANGQCK 859
             GL PD VTYS+LLAGC KN +GYS+A+QL+KEL+ S L MD V YG+L+A+ A+N QC+
Sbjct: 226  GGLTPDTVTYSSLLAGCNKNKNGYSQALQLIKELKISGLCMDSVIYGSLLAICASNNQCE 285

Query: 860  EAEECFIRMKSEGHSPNVYHYSSLLNAFSADGNYQKGDELVNDMKSAGLVPNKVMLTTLL 1039
            EAE  F +M++EG SPN++HYSSLLNA++ +GN++K D+LV D+KSAGLVPNKV+LTTLL
Sbjct: 286  EAETFFQQMRAEGFSPNIFHYSSLLNAYAVEGNHKKADKLVEDIKSAGLVPNKVILTTLL 345

Query: 1040 KVYARAGLFEKSRQLLSELEASGYAGDEIPYCLLMDAFSKSGQIDEAKLVLDEMCRRRVK 1219
            KVY R   F+KSR+LL+EL+  G+A DE+PYCLLMD  +K+G IDEAK V ++M ++ VK
Sbjct: 346  KVYVRGCFFDKSRELLAELDTLGFARDEMPYCLLMDGLAKAGHIDEAKAVFEDMKQKNVK 405

Query: 1220 SDGYAYSIMISAFCRSGHLEEAKRLAKDYEFTHDKFDVVISNTMLCAYCRVGDMESVMQI 1399
            SDGY++SI+ISA+CR G LEEAK LAKD+E T  K+D+V+ NT+L AYC+ G+M+ VMQ 
Sbjct: 406  SDGYSHSIIISAYCREGLLEEAKLLAKDFESTSGKYDLVMLNTLLRAYCKGGEMQYVMQT 465

Query: 1400 LRKMDALAISPDKNTFGILIKYFFKEKLYPLAYRTMEDMHSKGHQPSEETCTLLMINLGK 1579
            ++KMD LAISPD +TF ILIKYF KEKLY LAYRT+EDMH++G Q  EE CT L++ LGK
Sbjct: 466  MKKMDELAISPDLHTFSILIKYFSKEKLYNLAYRTVEDMHARGLQIDEELCTSLILELGK 525

Query: 1580 MGAHNEAYAVYNILRYSRRTICKTLHEKILRILIAGQLLKEAYVVAKDNAKCISQPAMKK 1759
             GA +EAY+VYN LRY++RT+CK LHEK+L+IL+AG+LLK+AYV+ KDN++ IS+ A+ K
Sbjct: 526  AGAASEAYSVYNKLRYTKRTLCKALHEKVLKILVAGRLLKDAYVLVKDNSELISKSALDK 585

Query: 1760 FLTAFMRSGNINLINDVVKVITASGCKIDQDIFQLAISRYIAQPEKKDLLLQLLKWMPSQ 1939
            F+T+FM+ GNINLINDV++ +  +G  I+Q +F LA+SRY+ +PEKK+LLL +L+WM  Q
Sbjct: 586  FVTSFMKFGNINLINDVLRALHNNGYLINQGVFSLAVSRYVGEPEKKELLLHMLEWMSGQ 645

Query: 1940 GYVVDSATRNLILKNSHLYGHQLIAEMLSTQHALARSKST 2059
            GYVVDS +RNL+LKN  L+G QLIAE LS QHA+++ + T
Sbjct: 646  GYVVDSESRNLLLKNCDLFGKQLIAEGLSKQHAMSKIRRT 685

