BLASTX nr result

ID: Mentha28_contig00010589 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha28_contig00010589
         (4497 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU35938.1| hypothetical protein MIMGU_mgv1a001219mg [Mimulus...  1352   0.0  
ref|XP_006363206.1| PREDICTED: pentatricopeptide repeat-containi...  1221   0.0  
ref|XP_004233766.1| PREDICTED: pentatricopeptide repeat-containi...  1215   0.0  
ref|XP_002275546.2| PREDICTED: pentatricopeptide repeat-containi...  1181   0.0  
ref|XP_007223989.1| hypothetical protein PRUPE_ppa014757mg [Prun...  1157   0.0  
ref|XP_006468579.1| PREDICTED: pentatricopeptide repeat-containi...  1152   0.0  
ref|XP_006448595.1| hypothetical protein CICLE_v10014221mg [Citr...  1145   0.0  
ref|XP_007040995.1| Tetratricopeptide repeat (TPR)-like superfam...  1131   0.0  
ref|XP_002299387.2| pentatricopeptide repeat-containing family p...  1130   0.0  
gb|EXB83263.1| hypothetical protein L484_011557 [Morus notabilis]    1125   0.0  
ref|XP_004295518.1| PREDICTED: pentatricopeptide repeat-containi...  1109   0.0  
ref|XP_004487896.1| PREDICTED: pentatricopeptide repeat-containi...  1102   0.0  
ref|XP_002878152.1| hypothetical protein ARALYDRAFT_486188 [Arab...  1089   0.0  
ref|XP_006290586.1| hypothetical protein CARUB_v10016675mg [Caps...  1077   0.0  
ref|NP_191302.2| protein ORGANELLE TRANSCRIPT PROCESSING 84 [Ara...  1077   0.0  
gb|AAP40452.1| unknown protein [Arabidopsis thaliana]                1077   0.0  
ref|XP_007138858.1| hypothetical protein PHAVU_009G243400g [Phas...  1075   0.0  
ref|XP_006402877.1| hypothetical protein EUTSA_v10005782mg [Eutr...  1074   0.0  
ref|XP_006597752.1| PREDICTED: pentatricopeptide repeat-containi...  1070   0.0  
emb|CAB66100.1| putative protein [Arabidopsis thaliana]              1051   0.0  

>gb|EYU35938.1| hypothetical protein MIMGU_mgv1a001219mg [Mimulus guttatus]
          Length = 863

 Score = 1352 bits (3500), Expect = 0.0
 Identities = 658/855 (76%), Positives = 749/855 (87%), Gaps = 1/855 (0%)
 Frame = -3

Query: 4378 SLPNSVQTQNSPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKA 4199
            S+P+S+QT NS       WI+SLRS AR+NSF++AI TF+QMQ +G++PDN+A+PAVLKA
Sbjct: 19   SVPSSLQTHNSIVL----WIDSLRSQARANSFQEAIATFIQMQASGVVPDNFAFPAVLKA 74

Query: 4198 ATALQDLHLGKQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQV 4022
             TALQDL LGKQIHAS+VKLGY SHS TVSNT+LHMYA+CG DV  VFKVFDRIPQRDQV
Sbjct: 75   TTALQDLDLGKQIHASVVKLGYDSHSVTVSNTLLHMYARCGDDVRQVFKVFDRIPQRDQV 134

Query: 4021 SWNSLINALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLH 3842
            SWNS+INALCK+Q+WELALEAFRLMGLE+IEPSSFTLVSVALACSNLNR DGLRLG+Q+H
Sbjct: 135  SWNSMINALCKFQEWELALEAFRLMGLERIEPSSFTLVSVALACSNLNRHDGLRLGRQVH 194

Query: 3841 GYILRVDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYE 3662
            GY LRVD+ KTFT+NSLMAMYAKLGR++DAK++FE F + DMVSWNT+IS+FSQ+DRF E
Sbjct: 195  GYSLRVDDMKTFTNNSLMAMYAKLGRIEDAKVVFESFGNNDMVSWNTVISAFSQNDRFNE 254

Query: 3661 ALEYFRYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSAL 3482
            ALEYF +M D+G KPDG T+SSVLPACSHLEL+D GKEIHA++FRN  D +RNS+V SAL
Sbjct: 255  ALEYFSFMVDEGLKPDGVTISSVLPACSHLELIDAGKEIHAYVFRNG-DLLRNSYVASAL 313

Query: 3481 VDMYCNCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPN 3302
            VDMYCNCKQVVSG RVFD A+DR+L  WNAM  GY QNGFY+EAV+LFM LM V GL PN
Sbjct: 314  VDMYCNCKQVVSGRRVFDTAVDRRLALWNAMLTGYTQNGFYTEAVLLFMNLMTVLGLLPN 373

Query: 3301 PTTMASVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFH 3122
            PTTMASVLPACVHC+AF DKE MHGYVLKLGL +DRYVQNALMDLYSR+GK+D  +Y+FH
Sbjct: 374  PTTMASVLPACVHCKAFADKEAMHGYVLKLGLGKDRYVQNALMDLYSRIGKIDNTKYMFH 433

Query: 3121 NMESKDIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNS 2942
            +MESKD+VS+NTMITG VVCGYHEDAL+LLH+MQIA  K  E  +D+F    EVSF+PNS
Sbjct: 434  DMESKDMVSWNTMITGCVVCGYHEDALVLLHEMQIAGGKGAE--EDRFDGKIEVSFKPNS 491

Query: 2941 VTLMTILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDI 2762
            VTLMT+LPGCAALAALTKGKEIH YA RNGLE DV VGSALVDMYAKCGCL MARRVFD 
Sbjct: 492  VTLMTVLPGCAALAALTKGKEIHNYAIRNGLESDVAVGSALVDMYAKCGCLYMARRVFDR 551

Query: 2761 MPNRNVITWNAVILAYGMHGEGDGALTLFRKMVAAGGELKPNEVTFIALFAACSHSGMVD 2582
            MP RNVITWN +I+AYGMHGEG+ ALTLF  MVA   E+KPN VTFI++FAACSHSGMVD
Sbjct: 552  MPIRNVITWNVIIMAYGMHGEGEEALTLFENMVA---EVKPNGVTFISVFAACSHSGMVD 608

Query: 2581 EGKQLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLG 2402
            +G++LF +MK +HG++P  DHYACVVDLLGRAGRLDEA +II  +P G+DK+GAWSSLLG
Sbjct: 609  KGRELFHRMKNEHGLEPNGDHYACVVDLLGRAGRLDEACEIIDSMPSGLDKVGAWSSLLG 668

Query: 2401 ACRIHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKE 2222
            ACR+HQNV+LGEISA  L + EPNVASHYVLLSNIYSSAGLWEKAN+VR+ MK   ++KE
Sbjct: 669  ACRVHQNVQLGEISAMKLLELEPNVASHYVLLSNIYSSAGLWEKANKVRKNMKETGVRKE 728

Query: 2221 PGCSWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEK 2042
            PGCSWIE  ++VHKF+AGDT HPQ EQLY YLNDL  RMK +GYV DTSCVLHNVDE+EK
Sbjct: 729  PGCSWIESGEKVHKFLAGDTSHPQSEQLYGYLNDLFGRMKREGYVADTSCVLHNVDEQEK 788

Query: 2041 ENLLCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRR 1862
            ENLLCGHSERLAIAFGLLNTPPG  IRV+KNLRVCNDCHSATKFIS+I  REIVVRDVRR
Sbjct: 789  ENLLCGHSERLAIAFGLLNTPPGTPIRVAKNLRVCNDCHSATKFISRIVDREIVVRDVRR 848

Query: 1861 FHHFKDGACSCGDYW 1817
            FHHFKDGAC+C DYW
Sbjct: 849  FHHFKDGACTCRDYW 863


>ref|XP_006363206.1| PREDICTED: pentatricopeptide repeat-containing protein At3g57430,
            chloroplastic-like [Solanum tuberosum]
          Length = 889

 Score = 1221 bits (3158), Expect = 0.0
 Identities = 581/852 (68%), Positives = 714/852 (83%), Gaps = 1/852 (0%)
 Frame = -3

Query: 4369 NSVQTQNSPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATA 4190
            N  Q   S +    SWI++LRS  R N F++AI T++QM + G+ PDN+ +PAVLKAAT 
Sbjct: 46   NFQQEPTSETPSAASWIDALRSQVRLNCFKEAIFTYIQMTSEGVRPDNFVFPAVLKAATG 105

Query: 4189 LQDLHLGKQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWN 4013
            LQDL+LGKQI+ ++VK GY + S TV+N+V+H+  +CGG +D V+KVFDRI QRDQVSWN
Sbjct: 106  LQDLNLGKQIYGAVVKFGYDTTSVTVANSVIHLLGRCGGSIDDVYKVFDRITQRDQVSWN 165

Query: 4012 SLINALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYI 3833
            SLINALCK++ WELALEAFRL+GL+  E SSFTLVS+ALACSNL R DGLRLGKQ+HG+ 
Sbjct: 166  SLINALCKFEKWELALEAFRLIGLDGFEASSFTLVSIALACSNLPRTDGLRLGKQVHGHS 225

Query: 3832 LRVDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALE 3653
            LR+D+R+T+T+N+LM+MYAKLGRVDD++ +FE FA RD+VSWNTIISSFSQ+D+F EAL+
Sbjct: 226  LRIDDRRTYTNNALMSMYAKLGRVDDSRAVFELFADRDIVSWNTIISSFSQNDQFREALD 285

Query: 3652 YFRYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDM 3473
             FR M  +  KPDG T+SSV+PACSHL LLD+GKEIH ++ +ND D + NSFV S+LVDM
Sbjct: 286  CFRVMIQEEIKPDGVTISSVVPACSHLTLLDVGKEIHCYVLKND-DLIGNSFVDSSLVDM 344

Query: 3472 YCNCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTT 3293
            YCNC+QV SGSRVFD+AL R +G WNAM AGY QNGF++EA+ LF+++M   GL PNPTT
Sbjct: 345  YCNCQQVESGSRVFDSALKRSIGIWNAMLAGYTQNGFFTEALTLFIEMMEFSGLSPNPTT 404

Query: 3292 MASVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNME 3113
            +ASV PACVHCEAF  KEV+HGYV+KLG   ++YVQNALMDLYSR+GK++I++YIF NME
Sbjct: 405  VASVFPACVHCEAFTLKEVIHGYVIKLGFSDEKYVQNALMDLYSRMGKINISKYIFDNME 464

Query: 3112 SKDIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVTL 2933
            SKDIVS+NTMITG+VVCGYHEDALI+LH+MQ  + +H + ++     N E   +PNS+TL
Sbjct: 465  SKDIVSWNTMITGFVVCGYHEDALIMLHEMQTTK-RHNDSEN-----NVEFLLKPNSITL 518

Query: 2932 MTILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPN 2753
            MT+LPGCA+L AL KGKEIHAYA RN L +D+ VGSALVDMYAKCGCL +ARRVFD M  
Sbjct: 519  MTVLPGCASLVALAKGKEIHAYAIRNALAMDIAVGSALVDMYAKCGCLDIARRVFDSMTT 578

Query: 2752 RNVITWNAVILAYGMHGEGDGALTLFRKMVAAGGELKPNEVTFIALFAACSHSGMVDEGK 2573
            +NVITWN +I+AYGMHG+G+ AL LFR MV    ++KPN VTFIA+FA CSHSGMVD+G+
Sbjct: 579  KNVITWNVLIMAYGMHGKGEEALELFRMMVLER-KVKPNNVTFIAIFAGCSHSGMVDQGR 637

Query: 2572 QLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGACR 2393
            +LF++MK  +G++PT+DHYAC+VDLLGR+G L+EAY ++  +P   +KIGAWSSLLGACR
Sbjct: 638  ELFREMKNAYGIEPTADHYACIVDLLGRSGHLEEAYQLVNEMPSKYNKIGAWSSLLGACR 697

Query: 2392 IHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPGC 2213
            IH+NVELGEISA NLF+ + +VASHYVLLSNIYSSAG+WEKAN VRR MK + ++KEPGC
Sbjct: 698  IHRNVELGEISARNLFELDSHVASHYVLLSNIYSSAGIWEKANMVRRNMKKVGVRKEPGC 757

Query: 2212 SWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKENL 2033
            SWIE+ DEVHKFVAGD  HPQ EQLY YL  L  +MK++GYVPDTSCVLHNV+E+EKENL
Sbjct: 758  SWIEFGDEVHKFVAGDASHPQSEQLYGYLETLSEKMKKEGYVPDTSCVLHNVNEDEKENL 817

Query: 2032 LCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFHH 1853
            LCGHSE+LAIAFG+LNTPPG  IR++KNLRVCNDCH ATKFISKI  REI+VRDVRRFHH
Sbjct: 818  LCGHSEKLAIAFGILNTPPGTPIRIAKNLRVCNDCHEATKFISKIVNREIIVRDVRRFHH 877

Query: 1852 FKDGACSCGDYW 1817
            F++G CSCGDYW
Sbjct: 878  FRNGTCSCGDYW 889


>ref|XP_004233766.1| PREDICTED: pentatricopeptide repeat-containing protein At3g57430,
            chloroplastic-like [Solanum lycopersicum]
          Length = 889

 Score = 1215 bits (3144), Expect = 0.0
 Identities = 576/848 (67%), Positives = 713/848 (84%), Gaps = 1/848 (0%)
 Frame = -3

Query: 4357 TQNSPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQDL 4178
            T  +PS    SWI++LRS  R N F++AI T++QM + G+ PDN+ +PAVLKAAT LQDL
Sbjct: 52   TSETPSSA--SWIDTLRSQVRLNCFKEAIFTYIQMTSEGVRPDNFVFPAVLKAATGLQDL 109

Query: 4177 HLGKQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLIN 4001
            +LGKQI+ ++VK GY + S TVSN+V+H+  +CGG +D V+K+FDRI QRDQVSWNSLIN
Sbjct: 110  NLGKQIYGAVVKFGYDTISVTVSNSVIHLLGRCGGSIDDVYKLFDRITQRDQVSWNSLIN 169

Query: 4000 ALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYILRVD 3821
            ALCK++ WELALEAFRLMG +  E SSFTLVS+ALACSNL R DGLRLGKQ+HGY LR+D
Sbjct: 170  ALCKFEKWELALEAFRLMGFDGFEASSFTLVSIALACSNLPRTDGLRLGKQVHGYSLRID 229

Query: 3820 ERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYFRY 3641
            +R+T+T+N+LM+MYAKLGRVDD++ +FE FA RD+VSWNTIISSFSQ+D+F EAL+ FR 
Sbjct: 230  DRRTYTNNALMSMYAKLGRVDDSRAVFELFADRDIVSWNTIISSFSQNDQFREALDSFRV 289

Query: 3640 MNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDMYCNC 3461
            M  +  KPDG T+SSV+PACSHL LLD+GK+IH ++ +ND D + NSFV S+LVDMYCNC
Sbjct: 290  MIQEEIKPDGVTISSVVPACSHLTLLDVGKQIHCYVLKND-DLIGNSFVDSSLVDMYCNC 348

Query: 3460 KQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTTMASV 3281
            +QV SG RVFD+AL R +G WNAM AGY QNGF++EA+MLF++++   GL PNPTT+ASV
Sbjct: 349  QQVESGRRVFDSALKRSIGIWNAMLAGYTQNGFFTEALMLFIEMLEFSGLSPNPTTVASV 408

Query: 3280 LPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNMESKDI 3101
             PACVHCEAF  KEV+HGYV+KLG   ++YVQNALMDLYSR+GK++I++YIF NMESKDI
Sbjct: 409  FPACVHCEAFTLKEVIHGYVIKLGFADEKYVQNALMDLYSRMGKINISKYIFDNMESKDI 468

Query: 3100 VSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVTLMTIL 2921
            VS+NTMITG+VVCGYHEDALI+LH+MQ  + +H + ++     N E   +PNS+TL+T+L
Sbjct: 469  VSWNTMITGFVVCGYHEDALIMLHEMQTTK-RHNDSEN-----NVEFRLKPNSITLITVL 522

Query: 2920 PGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPNRNVI 2741
            PGCA+L AL KGKEIHAYA RN L +D+ VGSALVDMYAKCGCL +ARRVF+ M  +NVI
Sbjct: 523  PGCASLVALAKGKEIHAYAIRNALAMDIAVGSALVDMYAKCGCLDIARRVFNSMTTKNVI 582

Query: 2740 TWNAVILAYGMHGEGDGALTLFRKMVAAGGELKPNEVTFIALFAACSHSGMVDEGKQLFQ 2561
            TWN +I+AYGMHG+G+ AL LFR MV    ++KPN VTFIA+FA CSHSGMVD+G++LF+
Sbjct: 583  TWNVLIMAYGMHGKGEEALQLFRMMVLER-KVKPNNVTFIAIFAGCSHSGMVDQGRELFR 641

Query: 2560 QMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGACRIHQN 2381
            +MK  +G++PT+DHYAC+VDLLGR+G L+EAY ++  +P   +KIGAWSSLLGACRIH N
Sbjct: 642  EMKNAYGIEPTADHYACIVDLLGRSGHLEEAYQLVNEMPSKYNKIGAWSSLLGACRIHGN 701

Query: 2380 VELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPGCSWIE 2201
            +ELGEISA NLF+ +P+VASHYVLLSNIYSSAG+WEKAN VRR MK + ++KEPGCSWIE
Sbjct: 702  IELGEISARNLFELDPHVASHYVLLSNIYSSAGIWEKANMVRRNMKKVGVRKEPGCSWIE 761

Query: 2200 YNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKENLLCGH 2021
            + DEVHKFVAGD  HPQ EQLY YL  L  +MK++GYVPDTSCVLHNV+E+EKENLLCGH
Sbjct: 762  FGDEVHKFVAGDASHPQSEQLYGYLETLSEKMKKEGYVPDTSCVLHNVNEDEKENLLCGH 821

Query: 2020 SERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFHHFKDG 1841
            SE+LAIAFG+LNTPPG  IR++KNLRVCNDCH A+K+IS I  REI+VRDVRRFHHF++G
Sbjct: 822  SEKLAIAFGILNTPPGTPIRIAKNLRVCNDCHEASKYISNIVNREIIVRDVRRFHHFRNG 881

Query: 1840 ACSCGDYW 1817
            ACSCGDYW
Sbjct: 882  ACSCGDYW 889


>ref|XP_002275546.2| PREDICTED: pentatricopeptide repeat-containing protein At3g57430,
            chloroplastic-like [Vitis vinifera]
          Length = 896

 Score = 1181 bits (3056), Expect = 0.0
 Identities = 575/852 (67%), Positives = 700/852 (82%), Gaps = 8/852 (0%)
 Frame = -3

Query: 4348 SPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQDLHLG 4169
            SPS+   SW+++LRS  RSN FR+AI+T+++M  +G  PDN+A+PAVLKA + LQDL  G
Sbjct: 52   SPSRSTASWVDALRSRTRSNDFREAISTYIEMTVSGARPDNFAFPAVLKAVSGLQDLKTG 111

Query: 4168 KQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLINALC 3992
            +QIHA+ VK GY S S TV+NT+++MY KCGG +  V KVFDRI  RDQVSWNS I ALC
Sbjct: 112  EQIHAAAVKFGYGSSSVTVANTLVNMYGKCGG-IGDVCKVFDRITDRDQVSWNSFIAALC 170

Query: 3991 KYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYILRVDERK 3812
            +++ WE ALEAFR M +E +E SSFTLVSVALACSNL    GLRLGKQLHGY LRV ++K
Sbjct: 171  RFEKWEQALEAFRAMQMENMELSSFTLVSVALACSNLGVMHGLRLGKQLHGYSLRVGDQK 230

Query: 3811 TFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYFRYMND 3632
            TFT+N+LMAMYAKLGRVDD+K +FE F  RDMVSWNT+ISSFSQSDRF EAL +FR M  
Sbjct: 231  TFTNNALMAMYAKLGRVDDSKALFESFVDRDMVSWNTMISSFSQSDRFSEALAFFRLMVL 290

Query: 3631 DGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDMYCNCKQV 3452
            +G + DG T++SVLPACSHLE LD+GKEIHA++ RN+ D + NSFV SALVDMYCNC+QV
Sbjct: 291  EGVELDGVTIASVLPACSHLERLDVGKEIHAYVLRNN-DLIENSFVGSALVDMYCNCRQV 349

Query: 3451 VSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTTMASVLPA 3272
             SG RVFD  L R++  WNAM +GYA+NG   +A++LF++++ V GL PN TTMASV+PA
Sbjct: 350  ESGRRVFDHILGRRIELWNAMISGYARNGLDEKALILFIEMIKVAGLLPNTTTMASVMPA 409

Query: 3271 CVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNMESKDIVSY 3092
            CVHCEAF +KE +HGY +KLG + DRYVQNALMD+YSR+GK+DI+E IF +ME +D VS+
Sbjct: 410  CVHCEAFSNKESIHGYAVKLGFKEDRYVQNALMDMYSRMGKMDISETIFDSMEVRDRVSW 469

Query: 3091 NTMITGYVVCGYHEDALILLHDMQIAE----MKHEEGDDDQFAKNSEVSFRPNSVTLMTI 2924
            NTMITGYV+ G + +AL+LLH+MQ  E    +K ++ DD++        ++PN++TLMT+
Sbjct: 470  NTMITGYVLSGRYSNALVLLHEMQRMENTKDVKKDDNDDEKGGP-----YKPNAITLMTV 524

Query: 2923 LPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPNRNV 2744
            LPGCAALAA+ KGKEIHAYA RN L  D+ VGSALVDMYAKCGCL ++RRVF+ MPN+NV
Sbjct: 525  LPGCAALAAIAKGKEIHAYAIRNMLASDITVGSALVDMYAKCGCLNLSRRVFNEMPNKNV 584

Query: 2743 ITWNAVILAYGMHGEGDGALTLFRKMVAA---GGELKPNEVTFIALFAACSHSGMVDEGK 2573
            ITWN +I+A GMHG+G+ AL LF+ MVA    GGE KPNEVTFI +FAACSHSG++ EG 
Sbjct: 585  ITWNVLIMACGMHGKGEEALELFKNMVAEAGRGGEAKPNEVTFITVFAACSHSGLISEGL 644

Query: 2572 QLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGACR 2393
             LF +MK DHGV+PTSDHYACVVDLLGRAG+L+EAY+++  +P   DK+GAWSSLLGACR
Sbjct: 645  NLFYRMKHDHGVEPTSDHYACVVDLLGRAGQLEEAYELVNTMPAEFDKVGAWSSLLGACR 704

Query: 2392 IHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPGC 2213
            IHQNVELGE++A NL   EPNVASHYVLLSNIYSSAGLW KA EVR+ M+ + +KKEPGC
Sbjct: 705  IHQNVELGEVAAKNLLHLEPNVASHYVLLSNIYSSAGLWNKAMEVRKNMRQMGVKKEPGC 764

Query: 2212 SWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKENL 2033
            SWIE+ DEVHKF+AGD  HPQ EQL+ +L  L  +M+++GYVPDTSCVLHNVDE+EKENL
Sbjct: 765  SWIEFRDEVHKFMAGDVSHPQSEQLHGFLETLSEKMRKEGYVPDTSCVLHNVDEDEKENL 824

Query: 2032 LCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFHH 1853
            LCGHSE+LAIAFG+LNTPPG TIRV+KNLRVCNDCH+ATKFISKI  REI+VRDVRRFHH
Sbjct: 825  LCGHSEKLAIAFGILNTPPGTTIRVAKNLRVCNDCHAATKFISKIMEREIIVRDVRRFHH 884

Query: 1852 FKDGACSCGDYW 1817
            FK+G CSCGDYW
Sbjct: 885  FKEGTCSCGDYW 896


>ref|XP_007223989.1| hypothetical protein PRUPE_ppa014757mg [Prunus persica]
            gi|462420925|gb|EMJ25188.1| hypothetical protein
            PRUPE_ppa014757mg [Prunus persica]
          Length = 901

 Score = 1157 bits (2993), Expect = 0.0
 Identities = 563/856 (65%), Positives = 696/856 (81%), Gaps = 9/856 (1%)
 Frame = -3

Query: 4357 TQNSPSQLKHS-----WIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAAT 4193
            T + P  L HS     WIE+LRS  RSN FR+AI T+++M  +GI+PDN+A+PAVLKA T
Sbjct: 49   TTSPPKLLSHSRTPASWIETLRSQTRSNHFREAILTYIEMTLSGIVPDNFAFPAVLKAVT 108

Query: 4192 ALQDLHLGKQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSW 4016
            +LQDL+LGKQIHA +VK GY S S TV+NT++++Y KCG D+    KVFD I +RDQVSW
Sbjct: 109  SLQDLNLGKQIHAHIVKFGYGSSSVTVANTLVNVYGKCG-DIGDACKVFDGIIERDQVSW 167

Query: 4015 NSLINALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGY 3836
            NS+I ALC++++WELALEAFR M +E +EPSSFTLVSVALACSNL++RDGLRLGKQ+H Y
Sbjct: 168  NSMIAALCRFEEWELALEAFRSMLMENMEPSSFTLVSVALACSNLHKRDGLRLGKQVHAY 227

Query: 3835 ILRVDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEAL 3656
             +R+ E KTFT N+L+AMY+KLG  + ++ +FE +   DMVSWNT+ISS SQ+D+F EAL
Sbjct: 228  SVRMSECKTFTINALLAMYSKLGEAEYSRALFELYEDCDMVSWNTMISSLSQNDQFMEAL 287

Query: 3655 EYFRYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVD 3476
            E+FR M   GFKPDG TV+SVLPACSHLE+LD GKEIHA+  R + + + NS+V SALVD
Sbjct: 288  EFFRLMVLAGFKPDGVTVASVLPACSHLEMLDTGKEIHAYALRTN-ELIENSYVGSALVD 346

Query: 3475 MYCNCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPT 3296
            MYCNC+QV SG RVF+A L+RK+  WNAM  GYAQN +  EA+ LF+++    GL PN T
Sbjct: 347  MYCNCRQVSSGCRVFNAVLERKIALWNAMITGYAQNEYNKEALNLFLEMCAASGLSPNST 406

Query: 3295 TMASVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNM 3116
            TM+S++PA V CEAF DKE +HGYV+K GLE++RYVQNALMD+YSR+GK  I+E IF++M
Sbjct: 407  TMSSIVPASVRCEAFSDKESIHGYVIKRGLEKNRYVQNALMDMYSRMGKTQISETIFNSM 466

Query: 3115 ESKDIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVT 2936
            E +DIVS+NTMITGYV+CG H DAL L++DMQ  + K +  +D+ +     V  +PNS+T
Sbjct: 467  EVRDIVSWNTMITGYVICGRHGDALNLIYDMQRVKEK-KNMNDNAYDDEGRVPLKPNSIT 525

Query: 2935 LMTILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMP 2756
             MTILPGCAALAAL KGKEIH+YA ++ L  DV VGSALVDMYAKCGC+ +AR VF+ +P
Sbjct: 526  FMTILPGCAALAALAKGKEIHSYAIKHLLAFDVAVGSALVDMYAKCGCIDLARAVFNQIP 585

Query: 2755 NRNVITWNAVILAYGMHGEGDGALTLFRKMVAAG---GELKPNEVTFIALFAACSHSGMV 2585
             +NVITWN +I+AYGMHG G+ AL LF+ MV  G    E++PNEVTFIALFAACSHSGMV
Sbjct: 586  IKNVITWNVLIMAYGMHGRGEEALELFKNMVDEGCRNKEVRPNEVTFIALFAACSHSGMV 645

Query: 2584 DEGKQLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLL 2405
            DEG  LF +MK DHGV+P +DHYACVVDLLGRAG ++EAY ++  +P  +DK GAWSSLL
Sbjct: 646  DEGLNLFHKMKSDHGVEPATDHYACVVDLLGRAGNVEEAYQLVNTMPSELDKAGAWSSLL 705

Query: 2404 GACRIHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKK 2225
            GACRIHQNVE+GEI+A+ L + EP+VASHYVLLSNIYSS+GLW+KA +VRRKMK + +KK
Sbjct: 706  GACRIHQNVEIGEIAANQLLELEPSVASHYVLLSNIYSSSGLWDKAMDVRRKMKEMGVKK 765

Query: 2224 EPGCSWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEE 2045
            EPGCSWIE+ DEVHKF+AGD  HPQ EQL+E+L  L  +MK++GYVPDTSCVLHNVDEEE
Sbjct: 766  EPGCSWIEFGDEVHKFLAGDLSHPQSEQLHEFLETLSEKMKKEGYVPDTSCVLHNVDEEE 825

Query: 2044 KENLLCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVR 1865
            KE LLCGHSE+LA+AFG+LNT PG TIRV+KNLRVCNDCH A+K+ISKI  REI++RDVR
Sbjct: 826  KETLLCGHSEKLALAFGILNTRPGTTIRVAKNLRVCNDCHMASKYISKILDREIILRDVR 885

Query: 1864 RFHHFKDGACSCGDYW 1817
            RFHHFK+G CSCGDYW
Sbjct: 886  RFHHFKNGTCSCGDYW 901


>ref|XP_006468579.1| PREDICTED: pentatricopeptide repeat-containing protein At3g57430,
            chloroplastic-like [Citrus sinensis]
          Length = 882

 Score = 1152 bits (2979), Expect = 0.0
 Identities = 566/862 (65%), Positives = 691/862 (80%), Gaps = 5/862 (0%)
 Frame = -3

Query: 4387 TSVSLPNSVQTQNSPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAV 4208
            TS+ LP S QT++     K SWIESLRS  RSN FR+AI ++++M  + I PDN+A+P+V
Sbjct: 30   TSLPLPGS-QTRS-----KESWIESLRSQTRSNQFREAILSYIEMTRSDIQPDNFAFPSV 83

Query: 4207 LKAATALQDLHLGKQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQR 4031
            LKA   +QDL LGKQIHA +VK GY   S TV+NT+++MY KCG D+  V+KVFDRI ++
Sbjct: 84   LKAVAGIQDLSLGKQIHAHVVKYGYGLSSVTVANTLVNMYGKCGSDMWDVYKVFDRITEK 143

Query: 4030 DQVSWNSLINALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGK 3851
            DQVSWNS+I  LC++  W+LALEAFR+M    +EPSSFTLVSVALACSNL+RRDGLRLG+
Sbjct: 144  DQVSWNSMIATLCRFGKWDLALEAFRMMLYSNVEPSSFTLVSVALACSNLSRRDGLRLGR 203

Query: 3850 QLHGYILRVDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDR 3671
            Q+HG  LRV E  TF  N+LMAMYAKLGRVDDAK +F+ F  RD+VSWNTI+SS SQ+D+
Sbjct: 204  QVHGNSLRVGEWNTFIMNALMAMYAKLGRVDDAKTLFKSFEDRDLVSWNTIVSSLSQNDK 263

Query: 3670 FYEALEYFRYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVT 3491
            F EA+ + R M   G KPDG +++SVLPACSHLE+LD GKEIHA+  RND   + NSFV 
Sbjct: 264  FLEAVMFLRQMALRGIKPDGVSIASVLPACSHLEMLDTGKEIHAYALRNDI-LIDNSFVG 322

Query: 3490 SALVDMYCNCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGL 3311
            SALVDMYCNC++V  G RVFD   D+K+  WNAM  GY QN +  EA+MLF+K+  V GL
Sbjct: 323  SALVDMYCNCREVECGRRVFDFISDKKIALWNAMITGYGQNEYDEEALMLFIKMEEVAGL 382

Query: 3310 FPNPTTMASVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEY 3131
            +PN TTM+SV+PACV  EAF DKE +HG+ +KLGL RDRYVQNALMD+YSR+G+++I++ 
Sbjct: 383  WPNATTMSSVVPACVRSEAFPDKEGIHGHAIKLGLGRDRYVQNALMDMYSRMGRIEISKT 442

Query: 3130 IFHNMESKDIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSF- 2954
            IF +ME +D VS+NTMITGY +CG H DAL+LL +MQ   M+ ++  ++ +  +  V   
Sbjct: 443  IFDDMEVRDTVSWNTMITGYTICGQHGDALMLLREMQ--NMEEDKNRNNVYDLDETVLRP 500

Query: 2953 RPNSVTLMTILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARR 2774
            +PNS+TLMT+LPGC AL+AL KGKEIHAYA RN L  DV VGSALVDMYAKCGCL  ARR
Sbjct: 501  KPNSITLMTVLPGCGALSALAKGKEIHAYAIRNMLATDVVVGSALVDMYAKCGCLNFARR 560

Query: 2773 VFDIMPNRNVITWNAVILAYGMHGEGDGALTLFRKMVAAG---GELKPNEVTFIALFAAC 2603
            VFD+MP RNVITWN +I+AYGMHGEG   L L + MVA G   GE+KPNEVTFIALFAAC
Sbjct: 561  VFDLMPVRNVITWNVIIMAYGMHGEGQEVLELLKNMVAEGSRGGEVKPNEVTFIALFAAC 620

Query: 2602 SHSGMVDEGKQLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIG 2423
            SHSGMV EG  LF +MK D+G++P+ DHYACVVDLLGRAG++++AY +I  +P   DK G
Sbjct: 621  SHSGMVSEGMDLFYKMKDDYGIEPSPDHYACVVDLLGRAGKVEDAYQLINMMPPEFDKAG 680

Query: 2422 AWSSLLGACRIHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMK 2243
            AWSSLLGACRIHQNVE+GEI+A NLF  EP+VASHYVLLSNIYSSA LW+KA +VR+KMK
Sbjct: 681  AWSSLLGACRIHQNVEIGEIAAQNLFLLEPDVASHYVLLSNIYSSAQLWDKAMDVRKKMK 740

Query: 2242 SLRLKKEPGCSWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLH 2063
             + ++KEPGCSWIE+ DE+HKF+AGD  H Q EQL+ +L +L  RM+++GYVPDTSCVLH
Sbjct: 741  EMGVRKEPGCSWIEFGDEIHKFLAGDGSHQQSEQLHGFLENLSERMRKEGYVPDTSCVLH 800

Query: 2062 NVDEEEKENLLCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREI 1883
            NV+EEEKE LLCGHSE+LAIAFG+LNTPPG TIRV+KNLRVCNDCH ATKFISKI  REI
Sbjct: 801  NVNEEEKETLLCGHSEKLAIAFGILNTPPGTTIRVAKNLRVCNDCHQATKFISKIESREI 860

Query: 1882 VVRDVRRFHHFKDGACSCGDYW 1817
            ++RDVRRFHHFK+G CSCGDYW
Sbjct: 861  ILRDVRRFHHFKNGTCSCGDYW 882


>ref|XP_006448595.1| hypothetical protein CICLE_v10014221mg [Citrus clementina]
            gi|557551206|gb|ESR61835.1| hypothetical protein
            CICLE_v10014221mg [Citrus clementina]
          Length = 882

 Score = 1145 bits (2961), Expect = 0.0
 Identities = 564/862 (65%), Positives = 689/862 (79%), Gaps = 5/862 (0%)
 Frame = -3

Query: 4387 TSVSLPNSVQTQNSPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAV 4208
            TS+ LP S QT++     K SWIESLRS  RSN FR+AI ++++M  + I PDN+A+PAV
Sbjct: 30   TSLPLPGS-QTRS-----KESWIESLRSQTRSNQFREAILSYIEMTRSDIQPDNFAFPAV 83

Query: 4207 LKAATALQDLHLGKQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQR 4031
            LKA   +QDL LGKQIHA +VK GY   S TV+NT+++MY KCG D+  V+KVFDRI ++
Sbjct: 84   LKAVAGIQDLSLGKQIHAHVVKYGYGLSSVTVANTLVNMYGKCGSDMWDVYKVFDRITEK 143

Query: 4030 DQVSWNSLINALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGK 3851
            DQVSWNS+I  LC+++ W+LALEAFR+M    +EPSSFTLVSVALACSNL+RRDGLRLG+
Sbjct: 144  DQVSWNSMIATLCRFEKWDLALEAFRMMLYSNVEPSSFTLVSVALACSNLSRRDGLRLGR 203

Query: 3850 QLHGYILRVDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDR 3671
            Q+HG  LRV E  TF  N+LMAMYAKLGRVDDAK +F+ F   D+VSWNTIISS SQ+D+
Sbjct: 204  QVHGNSLRVGEWNTFIMNALMAMYAKLGRVDDAKTLFKSFEDCDLVSWNTIISSSSQNDK 263

Query: 3670 FYEALEYFRYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVT 3491
            F EA+ + R M   G KPDG +++SVLPACSHLE+LD GKEIHA+  RND   + NSFV 
Sbjct: 264  FLEAVMFLRQMALRGIKPDGVSIASVLPACSHLEMLDTGKEIHAYALRNDI-LIDNSFVG 322

Query: 3490 SALVDMYCNCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGL 3311
            SALVDMYCNC++V  G RVFD   D+K+  WNAM  GYAQN +  EA+MLF+K+  V GL
Sbjct: 323  SALVDMYCNCREVECGRRVFDFISDKKIALWNAMITGYAQNEYDEEALMLFIKMEEVAGL 382

Query: 3310 FPNPTTMASVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEY 3131
            +PN TT++SV+P CV  EAF DKE +HG+ +KLGL RDRYVQNALMD+YSR+G+++I++ 
Sbjct: 383  WPNATTLSSVVPVCVRSEAFPDKEGIHGHAIKLGLGRDRYVQNALMDMYSRMGRIEISKT 442

Query: 3130 IFHNMESKDIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSF- 2954
            IF +ME +D VS+NTMITGY +C  H DAL+LL +MQ   M+ E+  ++ +  +  V   
Sbjct: 443  IFDDMEVRDTVSWNTMITGYTICSQHGDALMLLREMQ--NMEEEKNRNNVYDLDERVLRP 500

Query: 2953 RPNSVTLMTILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARR 2774
            +PNS+TLMT+LPGC AL+AL KGKEIHAYA RN L  DV VGSALVDMYAKCGCL  ARR
Sbjct: 501  KPNSITLMTVLPGCGALSALAKGKEIHAYAIRNMLATDVVVGSALVDMYAKCGCLNFARR 560

Query: 2773 VFDIMPNRNVITWNAVILAYGMHGEGDGALTLFRKMV---AAGGELKPNEVTFIALFAAC 2603
            VFD+MP RNVI+WN +I+AYGMHGEG   L L + MV   + GGE+KPNEVTFIALFAAC
Sbjct: 561  VFDLMPVRNVISWNVIIMAYGMHGEGREVLELLKNMVTEGSRGGEVKPNEVTFIALFAAC 620

Query: 2602 SHSGMVDEGKQLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIG 2423
            SHSGMV EG  LF +MK D+G++P+ DHYACVVDLLGRAG++++AY +I  +P   DK G
Sbjct: 621  SHSGMVSEGMDLFYKMKDDYGIEPSPDHYACVVDLLGRAGQVEDAYQLINMMPPEFDKAG 680

Query: 2422 AWSSLLGACRIHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMK 2243
            AWSSLLGACRIHQNVE+GEI A NLF  EP+VASHYVLLSNIYSSA LW+KA +VR+KMK
Sbjct: 681  AWSSLLGACRIHQNVEIGEIGAQNLFLLEPDVASHYVLLSNIYSSAQLWDKAMDVRKKMK 740

Query: 2242 SLRLKKEPGCSWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLH 2063
             + ++KEPGCSWIE+ DE+HKF+AGD  H Q EQL+ +L +L  RM+++GYVPDTSCVLH
Sbjct: 741  EMGVRKEPGCSWIEFGDEIHKFLAGDGSHQQSEQLHGFLENLSERMRKEGYVPDTSCVLH 800

Query: 2062 NVDEEEKENLLCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREI 1883
            NV+EEEKE LLCGHSE+LAIAFG+LNTPPG TIRV+KNLRVCNDCH ATKFISKI  REI
Sbjct: 801  NVNEEEKETLLCGHSEKLAIAFGILNTPPGTTIRVAKNLRVCNDCHQATKFISKIESREI 860

Query: 1882 VVRDVRRFHHFKDGACSCGDYW 1817
            ++RDVRRFHHFK+G CSCGDYW
Sbjct: 861  ILRDVRRFHHFKNGTCSCGDYW 882


>ref|XP_007040995.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
            cacao] gi|508704930|gb|EOX96826.1| Tetratricopeptide
            repeat (TPR)-like superfamily protein [Theobroma cacao]
          Length = 955

 Score = 1131 bits (2925), Expect = 0.0
 Identities = 546/843 (64%), Positives = 676/843 (80%), Gaps = 6/843 (0%)
 Frame = -3

Query: 4327 SWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQDLHLGKQIHASL 4148
            SW ESLRS+ RSN F QAI T+V M ++GI PD++A+PAVLKA TAL DL LGKQIHA +
Sbjct: 118  SWTESLRSNTRSNRFHQAILTYVSMSSSGIPPDHFAFPAVLKAVTALHDLALGKQIHAQV 177

Query: 4147 VKLGYH---SHSTVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLINALCKYQDW 3977
            +K GY    S  TV+NT+++ Y KCG D+  V+KVFDRI QRD VSWNS I+A C+ +DW
Sbjct: 178  LKFGYGFGTSSVTVANTLVNFYGKCG-DIWDVYKVFDRIHQRDTVSWNSFISAFCRLEDW 236

Query: 3976 ELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYILRVDERKTFTDN 3797
            E ALEAFRLM L+ +EPSSFTLVS+A ACSNL  RDGL LGKQLH Y LR+ + KTFT N
Sbjct: 237  EAALEAFRLMLLDNVEPSSFTLVSIAHACSNLPSRDGLHLGKQLHAYSLRIGDAKTFTYN 296

Query: 3796 SLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYFRYMNDDGFKP 3617
            +LM MY+KLG ++DAK++FE F  RD++SWNT++SS SQ+D+F EAL     M  +G KP
Sbjct: 297  ALMTMYSKLGHLNDAKLLFELFKERDLISWNTMLSSLSQNDKFTEALLLLHRMVLEGLKP 356

Query: 3616 DGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDMYCNCKQVVSGSR 3437
            DG T++SVLPACSHLELLD+GK++HA+  R+D   + NSFV SALVDMYCNC++  SG +
Sbjct: 357  DGVTIASVLPACSHLELLDIGKQLHAYALRHDI-LIDNSFVGSALVDMYCNCRKAQSGRQ 415

Query: 3436 VFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTTMASVLPACVHCE 3257
            VFD  +D+K G WNAM  GY+QN    +A++LF+++  V GL PN TTMAS++PACV  E
Sbjct: 416  VFDCVIDKKTGLWNAMITGYSQNEHDEDALILFIEMEAVAGLCPNATTMASIVPACVRSE 475

Query: 3256 AFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNMESKDIVSYNTMIT 3077
            AFV K+ +HGYV+K GL  D YVQNALMD+Y R+GK+ I++ IF NME +DIVS+NTMIT
Sbjct: 476  AFVHKQGIHGYVVKRGLASDPYVQNALMDMYCRMGKIQISKTIFDNMEVRDIVSWNTMIT 535

Query: 3076 GYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVTLMTILPGCAALAA 2897
            GYV+CG+H++AL+LLH+MQ  E   +E   D +     +  +PNS+TLMT+LPGCA L+A
Sbjct: 536  GYVICGHHDNALLLLHEMQRVE---QEKSADYYEDEKRIPLKPNSITLMTVLPGCATLSA 592

Query: 2896 LTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPNRNVITWNAVILA 2717
            L+KGKEIHAYA RN L  DVGVGSALVDMYAKCGCL   R+VFDI+P RNVITWN +I+A
Sbjct: 593  LSKGKEIHAYAIRNMLASDVGVGSALVDMYAKCGCLNFCRKVFDIIPLRNVITWNVIIMA 652

Query: 2716 YGMHGEGDGALTLFRKMVAAGG---ELKPNEVTFIALFAACSHSGMVDEGKQLFQQMKVD 2546
            YGMHG+G  AL LF  MVA      E+KPNEVTFIA+FAACSHSGMV EG  LF +MK +
Sbjct: 653  YGMHGKGAEALELFNCMVAEASKVKEVKPNEVTFIAIFAACSHSGMVREGLNLFYRMKDE 712

Query: 2545 HGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGACRIHQNVELGE 2366
            +G++PT DHYAC+VDLLGRAG+++E+Y +I  +P   DK GAWSSLLG+CRIHQNVE+GE
Sbjct: 713  YGIEPTPDHYACIVDLLGRAGQVEESYQLINTMPSQFDKAGAWSSLLGSCRIHQNVEIGE 772

Query: 2365 ISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPGCSWIEYNDEV 2186
            I+A NLF  EP+VASHYVLLSNIYSSA LW+KAN+VR+KMK + ++KEPGCSWIE+ DEV
Sbjct: 773  IAARNLFYLEPDVASHYVLLSNIYSSAQLWDKANDVRKKMKEMGVRKEPGCSWIEFGDEV 832

Query: 2185 HKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKENLLCGHSERLA 2006
            HKF+AGD  H Q  QL+++L  L  +M+++GYVPDTSCVLHNVDEEEKE LLCGHSE+LA
Sbjct: 833  HKFLAGDASHAQSGQLHKFLETLSEKMRKEGYVPDTSCVLHNVDEEEKETLLCGHSEKLA 892

Query: 2005 IAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFHHFKDGACSCG 1826
            IA+GLLN PPG TIRV+KNLRVCNDCH ATK+IS+I+ REI++RDVRRFHHF++G CSCG
Sbjct: 893  IAYGLLNYPPGTTIRVAKNLRVCNDCHEATKYISRITDREIILRDVRRFHHFRNGRCSCG 952

Query: 1825 DYW 1817
            DYW
Sbjct: 953  DYW 955


>ref|XP_002299387.2| pentatricopeptide repeat-containing family protein [Populus
            trichocarpa] gi|550347073|gb|EEE84192.2|
            pentatricopeptide repeat-containing family protein
            [Populus trichocarpa]
          Length = 894

 Score = 1130 bits (2922), Expect = 0.0
 Identities = 550/884 (62%), Positives = 693/884 (78%), Gaps = 9/884 (1%)
 Frame = -3

Query: 4441 PASSSADVAMXXXXXXXPTSVSLPNSVQTQN-SPSQLKHS---WIESLRSHARSNSFRQA 4274
            P S S  +         P S S P  + + +  P  + HS   WIESLRS +RSN FR+A
Sbjct: 15   PFSPSTSLPSHQQPTKPPISSSSPKPISSSSPKPISISHSQASWIESLRSRSRSNLFREA 74

Query: 4273 ITTFVQMQTAGILPDNYAYPAVLKAATALQDLHLGKQIHASLVKLGYHSHS--TVSNTVL 4100
            I+T+++M  +G+ PDN+A+PAVLKA   +Q+L+LGKQIHA + K GY S S  T+ NT++
Sbjct: 75   ISTYIEMIGSGVSPDNFAFPAVLKAVAGIQELYLGKQIHAHVFKFGYGSFSSVTIDNTLV 134

Query: 4099 HMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLINALCKYQDWELALEAFRLMGLEKIEPSS 3920
            +MY KCGG  D  +KVFDRI +RDQVSWNS+I+ALC++++WE+A++AFRLM +E  EPSS
Sbjct: 135  NMYGKCGGLGD-AYKVFDRITERDQVSWNSIISALCRFEEWEVAIKAFRLMLMEGFEPSS 193

Query: 3919 FTLVSVALACSNLNRRDGLRLGKQLHGYILRVDERKTFTDNSLMAMYAKLGRVDDAKIIF 3740
            FTLVS+ALACSNL +RDGL LGKQ+HG   R    +TF++N+LMAMYAKLGR+DDAK + 
Sbjct: 194  FTLVSMALACSNLRKRDGLWLGKQIHGCCFRKGHWRTFSNNALMAMYAKLGRLDDAKSLL 253

Query: 3739 EYFAHRDMVSWNTIISSFSQSDRFYEALEYFRYMNDDGFKPDGFTVSSVLPACSHLELLD 3560
              F  RD+V+WN++ISSFSQ++RF EAL + R M  +G KPDG T +SVLPACSHL+LL 
Sbjct: 254  VLFEDRDLVTWNSMISSFSQNERFMEALMFLRLMVLEGVKPDGVTFASVLPACSHLDLLR 313

Query: 3559 LGKEIHAFLFRNDHDFMRNSFVTSALVDMYCNCKQVVSGSRVFDAALDRKLGTWNAMFAG 3380
             GKEIHA+  R D D + NSFV SALVDMYCNC QV SG  VFD  LDRK+G WNAM AG
Sbjct: 314  TGKEIHAYALRTD-DVIENSFVGSALVDMYCNCGQVESGRLVFDGVLDRKIGLWNAMIAG 372

Query: 3379 YAQNGFYSEAVMLFMKLMVVPGLFPNPTTMASVLPACVHCEAFVDKEVMHGYVLKLGLER 3200
            YAQ+    +A+MLF+++    GL+ N TTM+S++PA V CE    KE +HGYV+K GLE 
Sbjct: 373  YAQSEHDEKALMLFIEMEAAAGLYSNATTMSSIVPAYVRCEGISRKEGIHGYVIKRGLET 432

Query: 3199 DRYVQNALMDLYSRVGKVDIAEYIFHNMESKDIVSYNTMITGYVVCGYHEDALILLHDMQ 3020
            +RY+QNAL+D+YSR+G +  ++ IF +ME +DIVS+NT+IT YV+CG   DAL+LLH+MQ
Sbjct: 433  NRYLQNALIDMYSRMGDIKTSKRIFDSMEDRDIVSWNTIITSYVICGRSSDALLLLHEMQ 492

Query: 3019 IAEMKHEEGDDDQFAKNSEVSFRPNSVTLMTILPGCAALAALTKGKEIHAYAFRNGLELD 2840
              E K     D  +    +V F+PNS+TLMT+LPGCA+L+AL KGKEIHAYA RN L   
Sbjct: 493  RIEEKSTY--DGDYNDEKQVPFKPNSITLMTVLPGCASLSALAKGKEIHAYAIRNLLASQ 550

Query: 2839 VGVGSALVDMYAKCGCLTMARRVFDIMPNRNVITWNAVILAYGMHGEGDGALTLFRKMVA 2660
            V VGSALVDMYAKCGCL +ARRVFD MP RNVITWN +I+AYGMHG+G  +L LF  MVA
Sbjct: 551  VTVGSALVDMYAKCGCLNLARRVFDQMPIRNVITWNVIIMAYGMHGKGKESLELFEDMVA 610

Query: 2659 AG---GELKPNEVTFIALFAACSHSGMVDEGKQLFQQMKVDHGVKPTSDHYACVVDLLGR 2489
             G   GE+KP EVTFIALFA+CSHSGMVDEG  LF +MK +HG++P  DHYAC+VDL+GR
Sbjct: 611  EGAKGGEVKPTEVTFIALFASCSHSGMVDEGLSLFHKMKNEHGIEPAPDHYACIVDLVGR 670

Query: 2488 AGRLDEAYDIIKFIPVGVDKIGAWSSLLGACRIHQNVELGEISASNLFKSEPNVASHYVL 2309
            AG+++EAY ++  +P G DK+GAWSSLLGACRI+ N+E+GEI+A NL + +P+VASHYVL
Sbjct: 671  AGKVEEAYGLVNTMPSGFDKVGAWSSLLGACRIYHNIEIGEIAAENLLQLQPDVASHYVL 730

Query: 2308 LSNIYSSAGLWEKANEVRRKMKSLRLKKEPGCSWIEYNDEVHKFVAGDTRHPQREQLYEY 2129
            LSNIYSSAGLW+KA  +RR+MK++ +KKEPGCSWIEY DEVHKF+AGD  HPQ E+L+++
Sbjct: 731  LSNIYSSAGLWDKAMNLRRRMKAMGVKKEPGCSWIEYGDEVHKFLAGDLSHPQSEKLHDF 790

Query: 2128 LNDLLVRMKEDGYVPDTSCVLHNVDEEEKENLLCGHSERLAIAFGLLNTPPGKTIRVSKN 1949
            L  L  R+K++GYVPDT+CVLH++DEEEKE +LCGHSE+LAIAFG+LNTPPG TIRV+KN
Sbjct: 791  LETLSERLKKEGYVPDTACVLHDIDEEEKETILCGHSEKLAIAFGILNTPPGTTIRVAKN 850

Query: 1948 LRVCNDCHSATKFISKISGREIVVRDVRRFHHFKDGACSCGDYW 1817
            LRVCNDCH+A+KFISKI  REI++RD RRFHHFKDG CSCGDYW
Sbjct: 851  LRVCNDCHTASKFISKIEDREIILRDARRFHHFKDGTCSCGDYW 894


>gb|EXB83263.1| hypothetical protein L484_011557 [Morus notabilis]
          Length = 877

 Score = 1125 bits (2909), Expect = 0.0
 Identities = 555/853 (65%), Positives = 674/853 (79%), Gaps = 5/853 (0%)
 Frame = -3

Query: 4360 QTQNSPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQD 4181
            Q+Q+  +  + SWIESLRS  R+N FR A++T+  M T  I PDN+A+P +LKAAT+L+D
Sbjct: 32   QSQSQTNNPQSSWIESLRSQVRNNLFRDAVSTYTSM-TMAIPPDNFAFPPILKAATSLRD 90

Query: 4180 LHLGKQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLI 4004
            L LG+QIHA + K GY S S TV+NT+++MY KCG D+    KVFDRIPQRDQVSWNS+I
Sbjct: 91   LSLGRQIHAHVFKFGYASSSVTVANTLVNMYGKCG-DIGDAHKVFDRIPQRDQVSWNSMI 149

Query: 4003 NALCKYQDWELALEAFRLM-GLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYILR 3827
             ALC + +W LALEAFR M   E ++PSSFTLVSV+LACSNL R  GL LGKQ+HGY LR
Sbjct: 150  AALCHFGEWALALEAFRAMLAEENVDPSSFTLVSVSLACSNLERFYGLWLGKQVHGYSLR 209

Query: 3826 VDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYF 3647
             D+RKTFT N+LMAMYAKLGRVDD+  +FE F +RD+VSWNT+ISS SQ+D F EAL   
Sbjct: 210  KDDRKTFTINALMAMYAKLGRVDDSVALFELFENRDLVSWNTVISSLSQNDMFVEALALL 269

Query: 3646 RYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDMYC 3467
            R M  +G   DG T++SVLPACSHLE+LDLGKEIHA+  RND D + NSFV SALVDMYC
Sbjct: 270  RRMVREGVGLDGVTIASVLPACSHLEMLDLGKEIHAYAVRND-DLIENSFVGSALVDMYC 328

Query: 3466 NCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTTMA 3287
            NC++V +G RVFD+ L+RK   WNAM AGYAQN F  EA+ LF++++ V GL PN TTMA
Sbjct: 329  NCRRVKTGRRVFDSILERKTALWNAMIAGYAQNEFDEEALNLFLEMLAVLGLSPNATTMA 388

Query: 3286 SVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNMESK 3107
            S++PAC  C+A  DKE +HGYV+K+GLE DRYVQNALMD YSR+GK++I+  IF  ME K
Sbjct: 389  SIVPACARCKALCDKESIHGYVVKMGLEGDRYVQNALMDFYSRIGKIEISRSIFKTMEEK 448

Query: 3106 DIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVTLMT 2927
            DIVS+NTMITGYV+CG+H +AL +LH+M     K +  D +  ++      + NSVTLMT
Sbjct: 449  DIVSWNTMITGYVICGFHNEALCMLHEMT----KEKISDAELKSETGRNMLKLNSVTLMT 504

Query: 2926 ILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPNRN 2747
            ILPGCAAL+ L KG+EIHAYA R+ L  DV VGSALVDMYAKCGC  +AR VF+ MP RN
Sbjct: 505  ILPGCAALSVLAKGREIHAYAIRHLLASDVAVGSALVDMYAKCGCSDIARAVFEEMPMRN 564

Query: 2746 VITWNAVILAYGMHGEGDGALTLFRKMVAAG---GELKPNEVTFIALFAACSHSGMVDEG 2576
            VITWN +I+AYGMHG G  AL LF  MV  G    E +P EVTFIA+FAACSHS MV EG
Sbjct: 565  VITWNVLIMAYGMHGRGREALELFENMVKEGMRNKEARPTEVTFIAVFAACSHSKMVTEG 624

Query: 2575 KQLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGAC 2396
              LF +MK D+GV+P +DHYAC+VDLLGRAG+++EAY +I  +P+  DK GAWSSLLG C
Sbjct: 625  LDLFHRMKKDYGVEPLADHYACIVDLLGRAGKVEEAYQLINTMPLDFDKTGAWSSLLGTC 684

Query: 2395 RIHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPG 2216
            R+H +VE+GEI+A NL + EPNVASHYVLLSNIYSSAGLW++A +VRR+MK + ++KEPG
Sbjct: 685  RVHHSVEIGEIAAENLLQVEPNVASHYVLLSNIYSSAGLWDEAMDVRRRMKEMGVRKEPG 744

Query: 2215 CSWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKEN 2036
            CSWIE+ DEVHKF+AGD  HPQ E+L+E+L +L +RMK+ GYVPDTSCVLH+VDEE KE 
Sbjct: 745  CSWIEFGDEVHKFLAGDGSHPQSEKLHEFLENLAMRMKKAGYVPDTSCVLHDVDEEAKET 804

Query: 2035 LLCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFH 1856
            LLCGHSE+LAIAFG+LNTPPG TIRV+KNLRVCNDCH+A K ISKI  REI++RDVRRFH
Sbjct: 805  LLCGHSEKLAIAFGILNTPPGTTIRVAKNLRVCNDCHAAAKVISKIMDREIILRDVRRFH 864

Query: 1855 HFKDGACSCGDYW 1817
            HFK G CSCGDYW
Sbjct: 865  HFKSGTCSCGDYW 877


>ref|XP_004295518.1| PREDICTED: pentatricopeptide repeat-containing protein At3g57430,
            chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 893

 Score = 1109 bits (2869), Expect = 0.0
 Identities = 534/853 (62%), Positives = 682/853 (79%), Gaps = 6/853 (0%)
 Frame = -3

Query: 4357 TQNSPSQLKHS--WIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQ 4184
            T ++P  +  S  WI+++R+  RS  + +AI+T++ M  +GI PDN+A+PAVLKA  AL 
Sbjct: 44   TTSTPKPISDSRTWIDTIRTQTRSGHYNEAISTYINMTRSGIRPDNFAFPAVLKAVAALH 103

Query: 4183 DLHLGKQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSL 4007
            DL LG+Q+HA +VK GY S S TV+N+++++Y KCG D+   +KVFD + +RDQVSWNS+
Sbjct: 104  DLRLGQQVHACVVKFGYESGSVTVANSLVNVYGKCG-DIGDAYKVFDGMTERDQVSWNSM 162

Query: 4006 INALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYILR 3827
            I ALC++++WELALEAFR M  + + PSSFTLVS ALACSNL++RDGLRLGKQ+HGY +R
Sbjct: 163  IAALCRFEEWELALEAFRSMFEDNVVPSSFTLVSAALACSNLDKRDGLRLGKQVHGYSVR 222

Query: 3826 VDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYF 3647
            + E KTFT N+LM+MYAKLG V  ++ +FE F   D+VSWNT++SS SQ+DRF EALE+F
Sbjct: 223  MCESKTFTVNALMSMYAKLGMVGYSRGVFELFEECDLVSWNTMVSSLSQNDRFMEALEFF 282

Query: 3646 RYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDMYC 3467
            R M  +G +PDG T++SVLPACSHLE+L+ GKEIHA+  R + +   NS+V SALVDMYC
Sbjct: 283  RLMILEGIRPDGVTIASVLPACSHLEMLEAGKEIHAYALRAN-ELTGNSYVGSALVDMYC 341

Query: 3466 NCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTTMA 3287
            NC++V SG RVFDA ++ K+  WNAM  GYAQN +  EA+ LF+++  V GL PN TTM+
Sbjct: 342  NCREVESGRRVFDAVMEWKVPLWNAMITGYAQNEYDEEALDLFLEMYAVSGLNPNATTMS 401

Query: 3286 SVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNMESK 3107
            S++PACV CEAF  KE +H +V+K  LE++RY+QNALMD+YSR+G+  I+E IF++ME K
Sbjct: 402  SIVPACVRCEAFSGKESIHAFVIKRSLEKNRYIQNALMDMYSRMGRTGISETIFNSMEGK 461

Query: 3106 DIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVTLMT 2927
            DIVS+NTMITGYV+ G H+DAL LL++MQ  E +++  D   +     V  +PN++TLMT
Sbjct: 462  DIVSWNTMITGYVISGRHDDALNLLYEMQRVE-ENKNTDSTGYDDERRVPLKPNTITLMT 520

Query: 2926 ILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPNRN 2747
            +LP CA L+AL KGKEIHAYA R+ L LD+ VGSALVDMYAKCGCL ++R +F+ MP +N
Sbjct: 521  LLPSCAVLSALAKGKEIHAYATRHLLALDIAVGSALVDMYAKCGCLDLSRAMFNQMPLKN 580

Query: 2746 VITWNAVILAYGMHGEGDGALTLFRKMVAAGG---ELKPNEVTFIALFAACSHSGMVDEG 2576
            VITWN +I+AYGMHG G+ AL LF+ MV  G    EL+PNEVTFIA+FAACSHSGMV+EG
Sbjct: 581  VITWNVLIMAYGMHGRGEEALELFKNMVDEGRWNKELRPNEVTFIAIFAACSHSGMVEEG 640

Query: 2575 KQLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGAC 2396
              LF  MK +HG++P  DHYACVVDLLGRAG ++ AY+I+K +P   DK GAWSSLLGAC
Sbjct: 641  LNLFHTMKQEHGIEPAPDHYACVVDLLGRAGSVERAYEIVKTMPSKFDKAGAWSSLLGAC 700

Query: 2395 RIHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPG 2216
            R+HQNVE+GEI+A +L + EP+VASHYVLLSNIYSS+GLWEKA ++RRKMK + ++KEPG
Sbjct: 701  RLHQNVEIGEIAAHHLLQLEPDVASHYVLLSNIYSSSGLWEKAMDIRRKMKEMGVRKEPG 760

Query: 2215 CSWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKEN 2036
            CSWIE+ DEVHKF+AGD  HPQ EQL+EYL  L  RMK++GYVPDTSCVLHNVDE+EKE 
Sbjct: 761  CSWIEFEDEVHKFLAGDMSHPQSEQLHEYLETLSERMKKEGYVPDTSCVLHNVDEDEKET 820

Query: 2035 LLCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFH 1856
            LLCGHSE+LA+AFGLLNT PG TIRV+KNLRVCNDCH A K+ISK+  REI++RDVRRFH
Sbjct: 821  LLCGHSEKLAMAFGLLNTRPGTTIRVAKNLRVCNDCHLAAKYISKMLDREIILRDVRRFH 880

Query: 1855 HFKDGACSCGDYW 1817
            HF++G CSCGDYW
Sbjct: 881  HFRNGNCSCGDYW 893


>ref|XP_004487896.1| PREDICTED: pentatricopeptide repeat-containing protein At3g57430,
            chloroplastic-like isoform X1 [Cicer arietinum]
            gi|502085351|ref|XP_004487897.1| PREDICTED:
            pentatricopeptide repeat-containing protein At3g57430,
            chloroplastic-like isoform X2 [Cicer arietinum]
          Length = 872

 Score = 1102 bits (2851), Expect = 0.0
 Identities = 539/858 (62%), Positives = 674/858 (78%), Gaps = 6/858 (0%)
 Frame = -3

Query: 4372 PNSVQTQNSPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAAT 4193
            P++    +SPS    +WI+ LRS  +S+SF QAI+T+  M TAG+ PDN+A+PAVLKA  
Sbjct: 23   PSTSAEPHSPS----AWIDRLRSQVQSSSFHQAISTYTNMVTAGVPPDNFAFPAVLKATA 78

Query: 4192 ALQDLHLGKQIHASLVKLGY---HSHSTVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQV 4022
            A QDL+LGKQIH  + K G     S + V+N++++MY KCG D+D   +VFD I  RD V
Sbjct: 79   ATQDLNLGKQIHGHVFKFGQALPSSAAAVANSLVNMYGKCG-DIDDARRVFDEISHRDDV 137

Query: 4021 SWNSLINALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLH 3842
            SWNS+I A C+++ WEL++  FRLM LE + P+SFTLVSVA ACSNL  R+GL LGKQ+H
Sbjct: 138  SWNSMIAAACRFEKWELSIHLFRLMLLEHVGPTSFTLVSVAHACSNL--RNGLLLGKQVH 195

Query: 3841 GYILRVDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYE 3662
             ++LR D+ +TFT+N+L+ MYAKLGRV +AK +F+ F  +D+VSWNTIISS SQ+DRF E
Sbjct: 196  AFMLRNDDWRTFTNNALVTMYAKLGRVFEAKALFDVFDDKDLVSWNTIISSLSQNDRFEE 255

Query: 3661 ALEYFRYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSAL 3482
            AL Y  +M   G +PDG T++S LPACSHLE+L  GKEIH+F+ RN+ D + NSFV SAL
Sbjct: 256  ALLYLHFMLQSGVRPDGVTLASALPACSHLEMLSYGKEIHSFVLRNN-DLIENSFVGSAL 314

Query: 3481 VDMYCNCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPN 3302
            VDMYCNC Q   G  VFD    + +  WNAM AGY +N F  EA+ LF++++   G+ PN
Sbjct: 315  VDMYCNCNQPEKGRIVFDGMFRKTVAVWNAMIAGYVRNEFDYEAIELFVEMVFELGMSPN 374

Query: 3301 PTTMASVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFH 3122
              T++SVLPACV CEAF+DKE +HG V+K G E+D+YVQNALMD+YSR+G ++I++ IF 
Sbjct: 375  SVTLSSVLPACVRCEAFLDKEGIHGCVVKWGFEKDKYVQNALMDMYSRMGMIEISKSIFG 434

Query: 3121 NMESKDIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNS 2942
            +M  +DIVS+NTMITGYVVCG H DAL LLHDMQ  + +      D +  N  V  +PNS
Sbjct: 435  SMSRRDIVSWNTMITGYVVCGRHNDALNLLHDMQRGQEEDRINTFDDYEVNRSVPIKPNS 494

Query: 2941 VTLMTILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDI 2762
            VTLMT+LPGCAALAAL KGKEIHAYA +  +  DV VGSALVDMYAKCGCL ++R VF+ 
Sbjct: 495  VTLMTVLPGCAALAALGKGKEIHAYAVKQMISKDVAVGSALVDMYAKCGCLNLSRTVFEQ 554

Query: 2761 MPNRNVITWNAVILAYGMHGEGDGALTLFRKMVAAGG---ELKPNEVTFIALFAACSHSG 2591
            M  RNVITWN +I+AYGMHG+G+ AL LFR+MVA G    E++PNEVT+IA+FAACSHSG
Sbjct: 555  MSVRNVITWNVLIMAYGMHGKGEEALKLFRRMVAEGDKNIEIRPNEVTYIAIFAACSHSG 614

Query: 2590 MVDEGKQLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSS 2411
            MVDEG  LF  MK  HG++PTSDHYAC+VDLLGR+G+++E+Y +IK +P  ++K+ AWSS
Sbjct: 615  MVDEGLNLFHTMKAKHGIEPTSDHYACLVDLLGRSGQIEESYKLIKTMPSNMNKVDAWSS 674

Query: 2410 LLGACRIHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRL 2231
            LLGA +IHQN+E+GEI+A +LF  EPNVASHYVLLSNIYSSAGLW+KA +VR+KMK + +
Sbjct: 675  LLGASKIHQNLEIGEIAAKHLFVLEPNVASHYVLLSNIYSSAGLWDKAMDVRKKMKEMGV 734

Query: 2230 KKEPGCSWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDE 2051
            +KEPGCSWIE+ DEVHKF+AGDT HPQ ++L+EYL  L  RMK++GYVPDTSCVLHNVDE
Sbjct: 735  RKEPGCSWIEHGDEVHKFLAGDTSHPQSKELHEYLETLSQRMKKEGYVPDTSCVLHNVDE 794

Query: 2050 EEKENLLCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRD 1871
            EEKE++LCGHSERLAIAFGLLNT  G TIRV+KNLRVCNDCH ATKFISKI  REI+VRD
Sbjct: 795  EEKESMLCGHSERLAIAFGLLNTSHGTTIRVAKNLRVCNDCHVATKFISKIVDREIIVRD 854

Query: 1870 VRRFHHFKDGACSCGDYW 1817
            VRRFHHF++G CSCGDYW
Sbjct: 855  VRRFHHFRNGTCSCGDYW 872


>ref|XP_002878152.1| hypothetical protein ARALYDRAFT_486188 [Arabidopsis lyrata subsp.
            lyrata] gi|297323990|gb|EFH54411.1| hypothetical protein
            ARALYDRAFT_486188 [Arabidopsis lyrata subsp. lyrata]
          Length = 886

 Score = 1089 bits (2817), Expect = 0.0
 Identities = 533/845 (63%), Positives = 657/845 (77%), Gaps = 1/845 (0%)
 Frame = -3

Query: 4348 SPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQDLHLG 4169
            S S     WI+ LRS  RSN  R+A+ T++ M   GI PDN+A+PA+LKA   LQD+ LG
Sbjct: 53   SQSHSPEWWIDLLRSKVRSNLLREAVLTYIDMIVLGIKPDNFAFPALLKAVADLQDMDLG 112

Query: 4168 KQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLINALC 3992
            KQIHA + K GY   S TV+NT++++Y KCG D   V+KVFDRI +R+QVSWNSLI++LC
Sbjct: 113  KQIHAHVYKFGYGVDSVTVANTLVNLYRKCG-DFGAVYKVFDRISERNQVSWNSLISSLC 171

Query: 3991 KYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYILRVDERK 3812
             ++ WE+ALEAFR M  E +EPSSFTLVSVALACSN    +GL +GKQ+H Y LR  E  
Sbjct: 172  SFEKWEMALEAFRCMLDEDVEPSSFTLVSVALACSNFPMPEGLLMGKQVHAYGLRKGELN 231

Query: 3811 TFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYFRYMND 3632
            +F  N+L+AMY K+G++  +K++   F  RD+V+WNT++SS  Q+++F EALEY R M  
Sbjct: 232  SFIINTLVAMYGKMGKLASSKVLLGSFEGRDLVTWNTVLSSLCQNEQFLEALEYLREMVL 291

Query: 3631 DGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDMYCNCKQV 3452
            +G +PDGFT+SSVLPACSHLE+L  GKE+HA+  +N      NSFV SALVDMYCNCKQV
Sbjct: 292  EGVEPDGFTISSVLPACSHLEMLRTGKELHAYALKNG-SLDENSFVGSALVDMYCNCKQV 350

Query: 3451 VSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTTMASVLPA 3272
            +SG RVFD   DRK+G WNAM  GYAQN +  EA++LF+++    GL  N TTMA V+PA
Sbjct: 351  LSGCRVFDGMFDRKIGLWNAMITGYAQNEYDEEALLLFIEMEESAGLLANSTTMAGVVPA 410

Query: 3271 CVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNMESKDIVSY 3092
            CV   AF  KE +HG+V+K GL+RDR+VQNALMD+YSR+GK+DIA+ IF  ME +D+V++
Sbjct: 411  CVRSGAFSKKEAIHGFVVKRGLDRDRFVQNALMDMYSRLGKIDIAKRIFGKMEDRDLVTW 470

Query: 3091 NTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVTLMTILPGC 2912
            NT+ITGYV    HEDAL++LH MQI E K  E       + S VS +PNS+TLMTILP C
Sbjct: 471  NTIITGYVFSERHEDALLMLHKMQILERKASE-------RASRVSLKPNSITLMTILPSC 523

Query: 2911 AALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPNRNVITWN 2732
            AAL+AL KGKEIHAYA +N L  DV VGSALVDMYAKCGCL M+R+VFD +P RNVITWN
Sbjct: 524  AALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPIRNVITWN 583

Query: 2731 AVILAYGMHGEGDGALTLFRKMVAAGGELKPNEVTFIALFAACSHSGMVDEGKQLFQQMK 2552
             +++AYGMHG    A+ + R M+  G  +KPNEVTFI++FAACSHSGMV+EG ++F  MK
Sbjct: 584  VIVMAYGMHGNSQDAIDMLRMMMVQG--VKPNEVTFISVFAACSHSGMVNEGLKIFYNMK 641

Query: 2551 VDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGACRIHQNVEL 2372
             D+GV+P+SDHYACVVDLLGRAGR+ EAY +I  IP   DK GAWSSLLGACRIH N+E+
Sbjct: 642  KDYGVEPSSDHYACVVDLLGRAGRVKEAYQLINLIPRNFDKAGAWSSLLGACRIHNNLEI 701

Query: 2371 GEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPGCSWIEYND 2192
            GEI+A NL + EPNVASHYVLL+NIYSSAGLW KA EVRR MK+  ++KEPGCSWIE+ D
Sbjct: 702  GEIAAQNLIQLEPNVASHYVLLANIYSSAGLWYKATEVRRNMKAQGVRKEPGCSWIEHGD 761

Query: 2191 EVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKENLLCGHSER 2012
            EVHKFVAGD+ HPQ E+L  YL  L  RM+++GY+PDTSCVLHNV+E+EKE LLCGHSE+
Sbjct: 762  EVHKFVAGDSSHPQSEKLRGYLETLWERMRKEGYIPDTSCVLHNVEEDEKEILLCGHSEK 821

Query: 2011 LAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFHHFKDGACS 1832
            LAIAFG+LNT PG  IRV+KNLRVCNDCH ATKFISK+  REI++RDVRRFHHFK+G CS
Sbjct: 822  LAIAFGILNTSPGTIIRVAKNLRVCNDCHLATKFISKVVDREIILRDVRRFHHFKNGTCS 881

Query: 1831 CGDYW 1817
            CGDYW
Sbjct: 882  CGDYW 886


>ref|XP_006290586.1| hypothetical protein CARUB_v10016675mg [Capsella rubella]
            gi|482559293|gb|EOA23484.1| hypothetical protein
            CARUB_v10016675mg [Capsella rubella]
          Length = 882

 Score = 1077 bits (2786), Expect = 0.0
 Identities = 529/845 (62%), Positives = 653/845 (77%), Gaps = 1/845 (0%)
 Frame = -3

Query: 4348 SPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQDLHLG 4169
            S S+    WI+SLRS  R++  R+A+ T++ M   GI PD +A+PA+LKA   LQD+ LG
Sbjct: 49   SQSRSPEWWIDSLRSKVRASLLREAVLTYIDMIVLGIKPDKFAFPALLKAVADLQDMDLG 108

Query: 4168 KQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLINALC 3992
            KQIHA + K GY   S TV+NT++++Y KCG D   V+KVFDRI +R+QVSWNSLI++LC
Sbjct: 109  KQIHAHVYKFGYGVDSVTVANTLVNLYRKCG-DFGAVYKVFDRISERNQVSWNSLISSLC 167

Query: 3991 KYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYILRVDERK 3812
             ++ WE+ALEAFR M  E +EPSSFTLVSVALACSN+   +GLRLGKQ+H Y LR  E  
Sbjct: 168  SFEKWEMALEAFRCMLDENVEPSSFTLVSVALACSNVPMPEGLRLGKQVHAYSLRKGELN 227

Query: 3811 TFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYFRYMND 3632
            +F  N+L+AMY KLG++  +K +   F  RD+V+WNT++SS  Q+++F EALEY R M  
Sbjct: 228  SFIINTLVAMYGKLGKLASSKSLLGSFEGRDLVTWNTLLSSLCQNEQFLEALEYLREMVL 287

Query: 3631 DGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDMYCNCKQV 3452
             G +PDGFT+SSVLP CSHLE+L  GKE+HA+  +N      NSFV SALVDMYCNCK+V
Sbjct: 288  KGVEPDGFTISSVLPVCSHLEMLRTGKELHAYALKNG-SLDENSFVGSALVDMYCNCKRV 346

Query: 3451 VSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTTMASVLPA 3272
            +S  RVFD   DRK+G WNAM  GYAQN    EA++LF+++    GL  N TTMA V+PA
Sbjct: 347  LSARRVFDGMFDRKIGLWNAMITGYAQNEHDVEALLLFIEMEQSAGLLANTTTMAGVVPA 406

Query: 3271 CVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNMESKDIVSY 3092
            CV  +AF  KE +HG+V+K GL+RDR+V+NALMD+YSR+GK+DIA+ IF  ME +D+V++
Sbjct: 407  CVRSDAFSKKEAIHGFVVKRGLDRDRFVKNALMDMYSRLGKIDIAKQIFSKMEDRDLVTW 466

Query: 3091 NTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVTLMTILPGC 2912
            NTMITGYV    HEDAL++LH MQ  E K  EG          V  +PNS+TLMTILP C
Sbjct: 467  NTMITGYVFLERHEDALLVLHKMQNLERKASEGA-------IRVGLKPNSITLMTILPSC 519

Query: 2911 AALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPNRNVITWN 2732
            AAL+AL KGKEIHAYA +N L  DV VGSA+VDMYAKCGCL M+R+VFD +P RNVITWN
Sbjct: 520  AALSALAKGKEIHAYAIKNNLATDVAVGSAIVDMYAKCGCLHMSRKVFDQIPFRNVITWN 579

Query: 2731 AVILAYGMHGEGDGALTLFRKMVAAGGELKPNEVTFIALFAACSHSGMVDEGKQLFQQMK 2552
             +I+AYGMHG G  A+ L R M+  G   KPNEVTFI++FAACSHSGMVDEG ++F  MK
Sbjct: 580  VIIMAYGMHGNGQDAIDLLRMMMVQGA--KPNEVTFISVFAACSHSGMVDEGLRIFYNMK 637

Query: 2551 VDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGACRIHQNVEL 2372
             ++GV+P+SDHYACVVDLLGRAGR+ EAY ++  +P+  DK GAWSSLLGACRIH N+E+
Sbjct: 638  NNYGVEPSSDHYACVVDLLGRAGRVKEAYQLMNMMPLDFDKAGAWSSLLGACRIHNNLEI 697

Query: 2371 GEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPGCSWIEYND 2192
            GE+ A NL + EP VASHYVLL+NIYSSAG W+KA EVRRKMK   ++KEPGCSWIE+ D
Sbjct: 698  GEVVAQNLIQLEPKVASHYVLLANIYSSAGHWDKATEVRRKMKEQGVRKEPGCSWIEHGD 757

Query: 2191 EVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKENLLCGHSER 2012
            EVHKFVAGD+ HPQ E+L+ YL  L  +M+E+GYVPDTSCVLHNV+E+EKE LLCGHSE+
Sbjct: 758  EVHKFVAGDSSHPQSEKLHGYLETLWEKMREEGYVPDTSCVLHNVEEDEKEVLLCGHSEK 817

Query: 2011 LAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFHHFKDGACS 1832
            LAIAFG+LNT PG  IRV+KNLRVCNDCH ATKFISKI  REI++RDVRRFHHFK+G CS
Sbjct: 818  LAIAFGILNTSPGTIIRVAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHHFKNGICS 877

Query: 1831 CGDYW 1817
            CGDYW
Sbjct: 878  CGDYW 882


>ref|NP_191302.2| protein ORGANELLE TRANSCRIPT PROCESSING 84 [Arabidopsis thaliana]
            gi|218525905|sp|Q7Y211.2|PP285_ARATH RecName:
            Full=Pentatricopeptide repeat-containing protein
            At3g57430, chloroplastic; Flags: Precursor
            gi|332646133|gb|AEE79654.1| pentatricopeptide
            repeat-containing protein [Arabidopsis thaliana]
          Length = 890

 Score = 1077 bits (2785), Expect = 0.0
 Identities = 533/845 (63%), Positives = 651/845 (77%), Gaps = 1/845 (0%)
 Frame = -3

Query: 4348 SPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQDLHLG 4169
            S S+    WI+ LRS  RSN  R+A+ T+V M   GI PDNYA+PA+LKA   LQD+ LG
Sbjct: 57   SQSRSPEWWIDLLRSKVRSNLLREAVLTYVDMIVLGIKPDNYAFPALLKAVADLQDMELG 116

Query: 4168 KQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLINALC 3992
            KQIHA + K GY   S TV+NT++++Y KCG D   V+KVFDRI +R+QVSWNSLI++LC
Sbjct: 117  KQIHAHVYKFGYGVDSVTVANTLVNLYRKCG-DFGAVYKVFDRISERNQVSWNSLISSLC 175

Query: 3991 KYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYILRVDERK 3812
             ++ WE+ALEAFR M  E +EPSSFTLVSV  ACSNL   +GL +GKQ+H Y LR  E  
Sbjct: 176  SFEKWEMALEAFRCMLDENVEPSSFTLVSVVTACSNLPMPEGLMMGKQVHAYGLRKGELN 235

Query: 3811 TFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYFRYMND 3632
            +F  N+L+AMY KLG++  +K++   F  RD+V+WNT++SS  Q+++  EALEY R M  
Sbjct: 236  SFIINTLVAMYGKLGKLASSKVLLGSFGGRDLVTWNTVLSSLCQNEQLLEALEYLREMVL 295

Query: 3631 DGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDMYCNCKQV 3452
            +G +PD FT+SSVLPACSHLE+L  GKE+HA+  +N      NSFV SALVDMYCNCKQV
Sbjct: 296  EGVEPDEFTISSVLPACSHLEMLRTGKELHAYALKNG-SLDENSFVGSALVDMYCNCKQV 354

Query: 3451 VSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTTMASVLPA 3272
            +SG RVFD   DRK+G WNAM AGY+QN    EA++LF+ +    GL  N TTMA V+PA
Sbjct: 355  LSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPA 414

Query: 3271 CVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNMESKDIVSY 3092
            CV   AF  KE +HG+V+K GL+RDR+VQN LMD+YSR+GK+DIA  IF  ME +D+V++
Sbjct: 415  CVRSGAFSRKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTW 474

Query: 3091 NTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVTLMTILPGC 2912
            NTMITGYV   +HEDAL+LLH MQ  E K  +G        S VS +PNS+TLMTILP C
Sbjct: 475  NTMITGYVFSEHHEDALLLLHKMQNLERKVSKGA-------SRVSLKPNSITLMTILPSC 527

Query: 2911 AALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPNRNVITWN 2732
            AAL+AL KGKEIHAYA +N L  DV VGSALVDMYAKCGCL M+R+VFD +P +NVITWN
Sbjct: 528  AALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWN 587

Query: 2731 AVILAYGMHGEGDGALTLFRKMVAAGGELKPNEVTFIALFAACSHSGMVDEGKQLFQQMK 2552
             +I+AYGMHG G  A+ L R M+  G  +KPNEVTFI++FAACSHSGMVDEG ++F  MK
Sbjct: 588  VIIMAYGMHGNGQEAIDLLRMMMVQG--VKPNEVTFISVFAACSHSGMVDEGLRIFYVMK 645

Query: 2551 VDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGACRIHQNVEL 2372
             D+GV+P+SDHYACVVDLLGRAGR+ EAY ++  +P   +K GAWSSLLGA RIH N+E+
Sbjct: 646  PDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEI 705

Query: 2371 GEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPGCSWIEYND 2192
            GEI+A NL + EPNVASHYVLL+NIYSSAGLW+KA EVRR MK   ++KEPGCSWIE+ D
Sbjct: 706  GEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGD 765

Query: 2191 EVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKENLLCGHSER 2012
            EVHKFVAGD+ HPQ E+L  YL  L  RM+++GYVPDTSCVLHNV+E+EKE LLCGHSE+
Sbjct: 766  EVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEK 825

Query: 2011 LAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFHHFKDGACS 1832
            LAIAFG+LNT PG  IRV+KNLRVCNDCH ATKFISKI  REI++RDVRRFH FK+G CS
Sbjct: 826  LAIAFGILNTSPGTIIRVAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCS 885

Query: 1831 CGDYW 1817
            CGDYW
Sbjct: 886  CGDYW 890


>gb|AAP40452.1| unknown protein [Arabidopsis thaliana]
          Length = 890

 Score = 1077 bits (2785), Expect = 0.0
 Identities = 533/845 (63%), Positives = 651/845 (77%), Gaps = 1/845 (0%)
 Frame = -3

Query: 4348 SPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQDLHLG 4169
            S S+    WI+ LRS  RSN  R+A+ T+V M   GI PDNYA+PA+LKA   LQD+ LG
Sbjct: 57   SQSRSPEWWIDLLRSKVRSNLLREAVLTYVDMIVLGIKPDNYAFPALLKAVADLQDMELG 116

Query: 4168 KQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLINALC 3992
            KQIHA + K GY   S TV+NT++++Y KCG D   V+KVFDRI +R+QVSWNSLI++LC
Sbjct: 117  KQIHAHVYKFGYGVDSVTVANTLVNLYRKCG-DFGAVYKVFDRISERNQVSWNSLISSLC 175

Query: 3991 KYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYILRVDERK 3812
             ++ WE+ALEAFR M  E +EPSSFTLVSV  ACSNL   +GL +GKQ+H Y LR  E  
Sbjct: 176  SFEKWEMALEAFRCMLDENVEPSSFTLVSVVTACSNLPMPEGLMMGKQVHAYGLRKGELN 235

Query: 3811 TFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYFRYMND 3632
            +F  N+L+AMY KLG++  +K++   F  RD+V+WNT++SS  Q+++  EALEY R M  
Sbjct: 236  SFIINTLVAMYGKLGKLASSKVLLGSFGGRDLVTWNTVLSSLCQNEQLLEALEYLREMVL 295

Query: 3631 DGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDMYCNCKQV 3452
            +G +PD FT+SSVLPACSHLE+L  GKE+HA+  +N      NSFV SALVDMYCNCKQV
Sbjct: 296  EGVEPDEFTISSVLPACSHLEMLRTGKELHAYALKNG-SLDENSFVGSALVDMYCNCKQV 354

Query: 3451 VSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTTMASVLPA 3272
            +SG RVFD   DRK+G WNAM AGY+QN    EA++LF+ +    GL  N TTMA V+PA
Sbjct: 355  LSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKEALLLFIGMEESAGLLANSTTMAGVVPA 414

Query: 3271 CVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNMESKDIVSY 3092
            CV   AF  KE +HG+V+K GL+RDR+VQN LMD+YSR+GK+DIA  IF  ME +D+V++
Sbjct: 415  CVRSGAFSRKEAIHGFVVKRGLDRDRFVQNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTW 474

Query: 3091 NTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVTLMTILPGC 2912
            NTMITGYV   +HEDAL+LLH MQ  E K  +G        S VS +PNS+TLMTILP C
Sbjct: 475  NTMITGYVFSEHHEDALLLLHKMQNLERKVSKGA-------SRVSLKPNSITLMTILPSC 527

Query: 2911 AALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPNRNVITWN 2732
            AAL+AL KGKEIHAYA +N L  DV VGSALVDMYAKCGCL M+R+VFD +P +NVITWN
Sbjct: 528  AALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLQMSRKVFDQIPQKNVITWN 587

Query: 2731 AVILAYGMHGEGDGALTLFRKMVAAGGELKPNEVTFIALFAACSHSGMVDEGKQLFQQMK 2552
             +I+AYGMHG G  A+ L R M+  G  +KPNEVTFI++FAACSHSGMVDEG ++F  MK
Sbjct: 588  VIIMAYGMHGNGQEAIDLLRMMMVQG--VKPNEVTFISVFAACSHSGMVDEGLRIFYVMK 645

Query: 2551 VDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGACRIHQNVEL 2372
             D+GV+P+SDHYACVVDLLGRAGR+ EAY ++  +P   +K GAWSSLLGA RIH N+E+
Sbjct: 646  PDYGVEPSSDHYACVVDLLGRAGRIKEAYQLMNMMPRDFNKAGAWSSLLGASRIHNNLEI 705

Query: 2371 GEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPGCSWIEYND 2192
            GEI+A NL + EPNVASHYVLL+NIYSSAGLW+KA EVRR MK   ++KEPGCSWIE+ D
Sbjct: 706  GEIAAQNLIQLEPNVASHYVLLANIYSSAGLWDKATEVRRNMKEQGVRKEPGCSWIEHGD 765

Query: 2191 EVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKENLLCGHSER 2012
            EVHKFVAGD+ HPQ E+L  YL  L  RM+++GYVPDTSCVLHNV+E+EKE LLCGHSE+
Sbjct: 766  EVHKFVAGDSSHPQSEKLSGYLETLWERMRKEGYVPDTSCVLHNVEEDEKEILLCGHSEK 825

Query: 2011 LAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFHHFKDGACS 1832
            LAIAFG+LNT PG  IRV+KNLRVCNDCH ATKFISKI  REI++RDVRRFH FK+G CS
Sbjct: 826  LAIAFGILNTSPGTIIRVAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHRFKNGTCS 885

Query: 1831 CGDYW 1817
            CGDYW
Sbjct: 886  CGDYW 890


>ref|XP_007138858.1| hypothetical protein PHAVU_009G243400g [Phaseolus vulgaris]
            gi|561011945|gb|ESW10852.1| hypothetical protein
            PHAVU_009G243400g [Phaseolus vulgaris]
          Length = 882

 Score = 1075 bits (2781), Expect = 0.0
 Identities = 528/857 (61%), Positives = 662/857 (77%), Gaps = 5/857 (0%)
 Frame = -3

Query: 4372 PNSVQTQNSPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAAT 4193
            P +   + SPSQ    WI+ LRS  +S+SFR AI T+  M  A   PDN+A+PAVLKAAT
Sbjct: 36   PTAAVERRSPSQ----WIDLLRSQTQSSSFRDAIATYAAMLAAAAAPDNFAFPAVLKAAT 91

Query: 4192 ALQDLHLGKQIHASLVKLGYHSHSTVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWN 4013
            A+ DL LGKQ+HA + K G      V+NT+L+MY KCG D+    ++FD IP+RD VSWN
Sbjct: 92   AVHDLSLGKQLHAHVFKFGQAPSVAVANTLLNMYGKCG-DLAAARRLFDEIPERDHVSWN 150

Query: 4012 SLINALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYI 3833
            S+I  LC++++WEL+L  FRLM  E +EPSSFTLVSVA ACS L  R G RLGKQ+H + 
Sbjct: 151  SMIATLCRFEEWELSLHLFRLMLSENVEPSSFTLVSVAHACSYL--RGGTRLGKQVHAFT 208

Query: 3832 LRVDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALE 3653
            LR D+ +T+T+N+L++MYA+LGRV+DAK +F+ F  +D+VSWNT+ISS SQ+DRF EAL 
Sbjct: 209  LRNDDLRTYTNNALVSMYARLGRVNDAKALFDVFDGKDIVSWNTVISSLSQNDRFEEALM 268

Query: 3652 YFRYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDM 3473
            Y   M  DG +PDG T++SVLPACS LE L +G+EIH +  +N  D + NSFV +ALVDM
Sbjct: 269  YMYLMIVDGVRPDGVTLASVLPACSQLERLRIGREIHCYALKNG-DLIENSFVGTALVDM 327

Query: 3472 YCNCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTT 3293
            YCNCKQ V G  VFD    + +  WNAM AGYA+N F  +A+ LF++++      PN TT
Sbjct: 328  YCNCKQAVKGRLVFDRVWRKTVAVWNAMLAGYARNEFDDQALRLFIEMISESEFCPNATT 387

Query: 3292 MASVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNME 3113
            ++SVLPACV CE+F+DKE +HGY++K G  +D+YV+NALMD+YSR+G++ I++ IF  M 
Sbjct: 388  LSSVLPACVRCESFLDKEGIHGYIVKRGFGKDKYVKNALMDMYSRMGRIQISKMIFGGMG 447

Query: 3112 SKDIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDD--DQFAKNSEVSFRPNSV 2939
             +DIVS+NTMITG VVCG +EDAL LLH+MQ  +   E+G D  D       +  +PNSV
Sbjct: 448  RRDIVSWNTMITGCVVCGQYEDALNLLHEMQRGQ--GEDGGDTFDDCEDEESLPLKPNSV 505

Query: 2938 TLMTILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIM 2759
            TLMT+LPGCAALAAL KGKEIHAYA +  L +DV VGSALVDMYAKCGCL +AR VFD M
Sbjct: 506  TLMTVLPGCAALAALGKGKEIHAYAIKEMLAMDVAVGSALVDMYAKCGCLNLARIVFDQM 565

Query: 2758 PNRNVITWNAVILAYGMHGEGDGALTLFRKMVAAGGE---LKPNEVTFIALFAACSHSGM 2588
            P RNVITWN +I+AYGMHG+G+ AL LFR+M   G     ++PNEVT+IA+FAACSHSGM
Sbjct: 566  PIRNVITWNVLIMAYGMHGKGEEALKLFRRMTEGGSNREVIRPNEVTYIAIFAACSHSGM 625

Query: 2587 VDEGKQLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSL 2408
            V+EG  LF  MK  HG++  +DHYAC+VDLLGR+GR+ EA +++  +P  ++KI AWSSL
Sbjct: 626  VNEGLHLFHTMKASHGIEARADHYACLVDLLGRSGRIKEACELVHTMPSSLNKIDAWSSL 685

Query: 2407 LGACRIHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLK 2228
            LGACRIHQ+VE+GEI+A NL   EPNVASHYVLLSNIYSSAGLWE+A EVR+KMK + ++
Sbjct: 686  LGACRIHQSVEIGEIAAKNLLVLEPNVASHYVLLSNIYSSAGLWEQAIEVRKKMKEMGVR 745

Query: 2227 KEPGCSWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEE 2048
            KEPGCSWIE+ DEVHKF+AGD  HPQ ++L+EY+  L  RM+++GYVPDTSCVLHNVD+E
Sbjct: 746  KEPGCSWIEHGDEVHKFLAGDASHPQSKELHEYIETLSQRMRKEGYVPDTSCVLHNVDDE 805

Query: 2047 EKENLLCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDV 1868
            EKE +LCGHSERLAIAFGLLNT PG TIRV+KNLRVCNDCH ATK ISKI  REI++RDV
Sbjct: 806  EKETMLCGHSERLAIAFGLLNTLPGTTIRVAKNLRVCNDCHIATKIISKIVDREIILRDV 865

Query: 1867 RRFHHFKDGACSCGDYW 1817
            RRFHHF++G CSCGDYW
Sbjct: 866  RRFHHFRNGTCSCGDYW 882


>ref|XP_006402877.1| hypothetical protein EUTSA_v10005782mg [Eutrema salsugineum]
            gi|557103976|gb|ESQ44330.1| hypothetical protein
            EUTSA_v10005782mg [Eutrema salsugineum]
          Length = 888

 Score = 1074 bits (2778), Expect = 0.0
 Identities = 528/845 (62%), Positives = 652/845 (77%), Gaps = 1/845 (0%)
 Frame = -3

Query: 4348 SPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQDLHLG 4169
            S S+    WI+SLRS  RSN  R+A+ T++ M   GI PDN+ +PA+LKA   LQD+ LG
Sbjct: 55   SRSRSTEWWIDSLRSKVRSNLLREAVFTYIDMVLLGIKPDNFVFPALLKAVADLQDMDLG 114

Query: 4168 KQIHASLVKLGYHSHS-TVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLINALC 3992
            KQIHA + K GY   S TV+NT+++ Y KCG D   V+KVFDRI +R+QVSWNS+I++LC
Sbjct: 115  KQIHAHVYKFGYGVDSVTVANTLVNFYRKCG-DFGAVYKVFDRISERNQVSWNSMISSLC 173

Query: 3991 KYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYILRVDERK 3812
             ++ WE+ALEAFR M  E +EPSSFTLVSVA+ACSNL   +GL +GKQ+H Y LR  +  
Sbjct: 174  SFEKWEMALEAFRCMLDENVEPSSFTLVSVAIACSNLPIPEGLMMGKQVHAYSLRKGDLN 233

Query: 3811 TFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYFRYMND 3632
            +F  N+L+AMY KLG++  +KI+   F  R++V+WNT++SS  Q+++F EALEY R M  
Sbjct: 234  SFIINTLVAMYGKLGKLASSKILLGTFEGRNLVTWNTVLSSLCQNEQFLEALEYLREMVL 293

Query: 3631 DGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVDMYCNCKQV 3452
             G +PDGFT+SSVLP CSHLE+L  GKE+HA+  +N      NSFV SALVDMYCNCKQV
Sbjct: 294  KGVEPDGFTISSVLPVCSHLEMLRTGKEMHAYALKNG-SLDENSFVGSALVDMYCNCKQV 352

Query: 3451 VSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPTTMASVLPA 3272
            +S  RVFD   DR++G WNAM AGYAQN    EA+ LF+++    GL  N TTMAS++PA
Sbjct: 353  LSARRVFDVIFDRRIGLWNAMIAGYAQNEHDEEALSLFIEMEETTGLLANTTTMASIVPA 412

Query: 3271 CVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHNMESKDIVSY 3092
            CV   AF  KE +HG+V+K GL+ DR+VQNALMD+YSR+GK+DIAE IF  ME +D+V++
Sbjct: 413  CVRSNAFSRKEAIHGFVMKRGLDGDRFVQNALMDMYSRLGKIDIAEMIFCKMEDRDLVTW 472

Query: 3091 NTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDDQFAKNSEVSFRPNSVTLMTILPGC 2912
            NTMITGYV    HEDAL++LH MQ  E K  EG        S V  +PNS+TLMTILP C
Sbjct: 473  NTMITGYVFSECHEDALLVLHKMQNIERKVGEGV-------SRVGLKPNSITLMTILPSC 525

Query: 2911 AALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFDIMPNRNVITWN 2732
            AAL+AL KGKEIHAYA +N L  DV VGSALVDMYAKCGCL M+R+VFD +P +NVITWN
Sbjct: 526  AALSALAKGKEIHAYAIKNNLATDVAVGSALVDMYAKCGCLHMSRKVFDQIPIKNVITWN 585

Query: 2731 AVILAYGMHGEGDGALTLFRKMVAAGGELKPNEVTFIALFAACSHSGMVDEGKQLFQQMK 2552
             +I+AYGMHG G  A+ L + M+    ++KPNEVT I++FAACSHSGMVDEG ++F  MK
Sbjct: 586  VIIMAYGMHGNGQDAIELLKMMMVQ--KVKPNEVTLISVFAACSHSGMVDEGLKIFYNMK 643

Query: 2551 VDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAWSSLLGACRIHQNVEL 2372
              +GV+P+SDHYACVVDLLGRAGR+ EAY+++  +P+G DK GAWSSLLGACRI  N E+
Sbjct: 644  KHYGVEPSSDHYACVVDLLGRAGRVKEAYELMNMMPLGFDKAGAWSSLLGACRIQNNQEI 703

Query: 2371 GEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSLRLKKEPGCSWIEYND 2192
            GEI+A NL + EP VASHYVLL+NIYSSAGLW+KA EVRRKMK   ++KEPGCSWIEY D
Sbjct: 704  GEIAAQNLIQLEPKVASHYVLLANIYSSAGLWDKATEVRRKMKEQGVRKEPGCSWIEYGD 763

Query: 2191 EVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNVDEEEKENLLCGHSER 2012
             VHKFVAGD+ HPQ E+L+ YL  L  +M+++GYVPDTSCVLHNV+E+EKE LLCGHSE+
Sbjct: 764  GVHKFVAGDSSHPQSEKLHGYLESLWEKMRKEGYVPDTSCVLHNVEEDEKEVLLCGHSEK 823

Query: 2011 LAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVVRDVRRFHHFKDGACS 1832
            LAIAFG+LNT PG  IRV+KNLRVCNDCH ATKFISKI  REI++RDVRRFHHFK+G CS
Sbjct: 824  LAIAFGILNTSPGTVIRVAKNLRVCNDCHLATKFISKIVDREIILRDVRRFHHFKNGTCS 883

Query: 1831 CGDYW 1817
            CGDYW
Sbjct: 884  CGDYW 888


>ref|XP_006597752.1| PREDICTED: pentatricopeptide repeat-containing protein At3g57430,
            chloroplastic-like [Glycine max]
          Length = 880

 Score = 1070 bits (2766), Expect = 0.0
 Identities = 529/860 (61%), Positives = 662/860 (76%), Gaps = 8/860 (0%)
 Frame = -3

Query: 4372 PNSVQTQNSPSQLKHSWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAAT 4193
            P +   + SPSQ    WI+ LRS   S+SFR AI+T+  M  A   PDN+A+PAVLKAA 
Sbjct: 31   PPTTVERRSPSQ----WIDLLRSQTHSSSFRDAISTYAAMLAAPAPPDNFAFPAVLKAAA 86

Query: 4192 ALQDLHLGKQIHASLVKLGYHSHSTVS--NTVLHMYAKCGGDVDHVFKVFDRIPQRDQVS 4019
            A+ DL LGKQIHA + K G+   S+V+  N++++MY KCG D+    +VFD IP RD VS
Sbjct: 87   AVHDLCLGKQIHAHVFKFGHAPPSSVAVANSLVNMYGKCG-DLTAARQVFDDIPDRDHVS 145

Query: 4018 WNSLINALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVALACSNLNRRDGLRLGKQLHG 3839
            WNS+I  LC++++WEL+L  FRLM  E ++P+SFTLVSVA ACS++  R G+RLGKQ+H 
Sbjct: 146  WNSMIATLCRFEEWELSLHLFRLMLSENVDPTSFTLVSVAHACSHV--RGGVRLGKQVHA 203

Query: 3838 YILRVDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEA 3659
            Y LR  + +T+T+N+L+ MYA+LGRV+DAK +F  F  +D+VSWNT+ISS SQ+DRF EA
Sbjct: 204  YTLRNGDLRTYTNNALVTMYARLGRVNDAKALFGVFDGKDLVSWNTVISSLSQNDRFEEA 263

Query: 3658 LEYFRYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALV 3479
            L Y   M  DG +PDG T++SVLPACS LE L +G+EIH +  RN  D + NSFV +ALV
Sbjct: 264  LMYVYLMIVDGVRPDGVTLASVLPACSQLERLRIGREIHCYALRNG-DLIENSFVGTALV 322

Query: 3478 DMYCNCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNP 3299
            DMYCNCKQ   G  VFD  + R +  WNA+ AGYA+N F  +A+ LF++++      PN 
Sbjct: 323  DMYCNCKQPKKGRLVFDGVVRRTVAVWNALLAGYARNEFDDQALRLFVEMISESEFCPNA 382

Query: 3298 TTMASVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNALMDLYSRVGKVDIAEYIFHN 3119
            TT ASVLPACV C+ F DKE +HGY++K G  +D+YVQNALMD+YSR+G+V+I++ IF  
Sbjct: 383  TTFASVLPACVRCKVFSDKEGIHGYIVKRGFGKDKYVQNALMDMYSRMGRVEISKTIFGR 442

Query: 3118 MESKDIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHEEGDDD--QFAKNSEVSFRPN 2945
            M  +DIVS+NTMITG +VCG ++DAL LLH+MQ    + E+G D    +  +  V F+PN
Sbjct: 443  MNKRDIVSWNTMITGCIVCGRYDDALNLLHEMQ--RRQGEDGSDTFVDYEDDGGVPFKPN 500

Query: 2944 SVTLMTILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSALVDMYAKCGCLTMARRVFD 2765
            SVTLMT+LPGCAALAAL KGKEIHAYA +  L +DV VGSALVDMYAKCGCL +A RVFD
Sbjct: 501  SVTLMTVLPGCAALAALGKGKEIHAYAVKQKLAMDVAVGSALVDMYAKCGCLNLASRVFD 560

Query: 2764 IMPNRNVITWNAVILAYGMHGEGDGALTLFRKMVAAGGE----LKPNEVTFIALFAACSH 2597
             MP RNVITWN +I+AYGMHG+G+ AL LFR M A GG     ++PNEVT+IA+FAACSH
Sbjct: 561  QMPIRNVITWNVLIMAYGMHGKGEEALELFRIMTAGGGSNREVIRPNEVTYIAIFAACSH 620

Query: 2596 SGMVDEGKQLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDIIKFIPVGVDKIGAW 2417
            SGMVDEG  LF  MK  HGV+P  DHYAC+VDLLGR+GR+ EAY++I  +P  ++K+ AW
Sbjct: 621  SGMVDEGLHLFHTMKASHGVEPRGDHYACLVDLLGRSGRVKEAYELINTMPSNLNKVDAW 680

Query: 2416 SSLLGACRIHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGLWEKANEVRRKMKSL 2237
            SSLLGACRIHQ+VE GEI+A +LF  EPNVASHYVL+SNIYSSAGLW++A  VR+KMK +
Sbjct: 681  SSLLGACRIHQSVEFGEIAAKHLFVLEPNVASHYVLMSNIYSSAGLWDQALGVRKKMKEM 740

Query: 2236 RLKKEPGCSWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKEDGYVPDTSCVLHNV 2057
             ++KEPGCSWIE+ DEVHKF++GD  HPQ ++L+EYL  L  RM+++GYVPD SCVLHNV
Sbjct: 741  GVRKEPGCSWIEHGDEVHKFLSGDASHPQSKELHEYLETLSQRMRKEGYVPDISCVLHNV 800

Query: 2056 DEEEKENLLCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSATKFISKISGREIVV 1877
            D+EEKE +LCGHSERLAIAFGLLNTPPG TIRV+KNLRVCNDCH ATK ISKI  REI++
Sbjct: 801  DDEEKETMLCGHSERLAIAFGLLNTPPGTTIRVAKNLRVCNDCHVATKIISKIVDREIIL 860

Query: 1876 RDVRRFHHFKDGACSCGDYW 1817
            RDVRRFHHF +G CSCGDYW
Sbjct: 861  RDVRRFHHFANGTCSCGDYW 880


>emb|CAB66100.1| putative protein [Arabidopsis thaliana]
          Length = 803

 Score = 1051 bits (2718), Expect = 0.0
 Identities = 519/814 (63%), Positives = 632/814 (77%), Gaps = 1/814 (0%)
 Frame = -3

Query: 4255 MQTAGILPDNYAYPAVLKAATALQDLHLGKQIHASLVKLGYHSHS-TVSNTVLHMYAKCG 4079
            M   GI PDNYA+PA+LKA   LQD+ LGKQIHA + K GY   S TV+NT++++Y KCG
Sbjct: 1    MIVLGIKPDNYAFPALLKAVADLQDMELGKQIHAHVYKFGYGVDSVTVANTLVNLYRKCG 60

Query: 4078 GDVDHVFKVFDRIPQRDQVSWNSLINALCKYQDWELALEAFRLMGLEKIEPSSFTLVSVA 3899
             D   V+KVFDRI +R+QVSWNSLI++LC ++ WE+ALEAFR M  E +EPSSFTLVSV 
Sbjct: 61   -DFGAVYKVFDRISERNQVSWNSLISSLCSFEKWEMALEAFRCMLDENVEPSSFTLVSVV 119

Query: 3898 LACSNLNRRDGLRLGKQLHGYILRVDERKTFTDNSLMAMYAKLGRVDDAKIIFEYFAHRD 3719
             ACSNL   +GL +GKQ+H Y LR  E  +F  N+L+AMY KLG++  +K++   F  RD
Sbjct: 120  TACSNLPMPEGLMMGKQVHAYGLRKGELNSFIINTLVAMYGKLGKLASSKVLLGSFGGRD 179

Query: 3718 MVSWNTIISSFSQSDRFYEALEYFRYMNDDGFKPDGFTVSSVLPACSHLELLDLGKEIHA 3539
            +V+WNT++SS  Q+++  EALEY R M  +G +PD FT+SSVLPACSHLE+L  GKE+HA
Sbjct: 180  LVTWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEFTISSVLPACSHLEMLRTGKELHA 239

Query: 3538 FLFRNDHDFMRNSFVTSALVDMYCNCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFY 3359
            +  +N      NSFV SALVDMYCNCKQV+SG RVFD   DRK+G WNAM AGY+QN   
Sbjct: 240  YALKNG-SLDENSFVGSALVDMYCNCKQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHD 298

Query: 3358 SEAVMLFMKLMVVPGLFPNPTTMASVLPACVHCEAFVDKEVMHGYVLKLGLERDRYVQNA 3179
             EA++LF+ +    GL  N TTMA V+PACV   AF  KE +HG+V+K GL+RDR+VQN 
Sbjct: 299  KEALLLFIGMEESAGLLANSTTMAGVVPACVRSGAFSRKEAIHGFVVKRGLDRDRFVQNT 358

Query: 3178 LMDLYSRVGKVDIAEYIFHNMESKDIVSYNTMITGYVVCGYHEDALILLHDMQIAEMKHE 2999
            LMD+YSR+GK+DIA  IF  ME +D+V++NTMITGYV   +HEDAL+LLH MQ  E K  
Sbjct: 359  LMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKMQNLERKVS 418

Query: 2998 EGDDDQFAKNSEVSFRPNSVTLMTILPGCAALAALTKGKEIHAYAFRNGLELDVGVGSAL 2819
            +G        S VS +PNS+TLMTILP CAAL+AL KGKEIHAYA +N L  DV VGSAL
Sbjct: 419  KGA-------SRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKNNLATDVAVGSAL 471

Query: 2818 VDMYAKCGCLTMARRVFDIMPNRNVITWNAVILAYGMHGEGDGALTLFRKMVAAGGELKP 2639
            VDMYAKCGCL M+R+VFD +P +NVITWN +I+AYGMHG G  A+ L R M+  G  +KP
Sbjct: 472  VDMYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDLLRMMMVQG--VKP 529

Query: 2638 NEVTFIALFAACSHSGMVDEGKQLFQQMKVDHGVKPTSDHYACVVDLLGRAGRLDEAYDI 2459
            NEVTFI++FAACSHSGMVDEG ++F  MK D+GV+P+SDHYACVVDLLGRAGR+ EAY +
Sbjct: 530  NEVTFISVFAACSHSGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEAYQL 589

Query: 2458 IKFIPVGVDKIGAWSSLLGACRIHQNVELGEISASNLFKSEPNVASHYVLLSNIYSSAGL 2279
            +  +P   +K GAWSSLLGA RIH N+E+GEI+A NL + EPNVASHYVLL+NIYSSAGL
Sbjct: 590  MNMMPRDFNKAGAWSSLLGASRIHNNLEIGEIAAQNLIQLEPNVASHYVLLANIYSSAGL 649

Query: 2278 WEKANEVRRKMKSLRLKKEPGCSWIEYNDEVHKFVAGDTRHPQREQLYEYLNDLLVRMKE 2099
            W+KA EVRR MK   ++KEPGCSWIE+ DEVHKFVAGD+ HPQ E+L  YL  L  RM++
Sbjct: 650  WDKATEVRRNMKEQGVRKEPGCSWIEHGDEVHKFVAGDSSHPQSEKLSGYLETLWERMRK 709

Query: 2098 DGYVPDTSCVLHNVDEEEKENLLCGHSERLAIAFGLLNTPPGKTIRVSKNLRVCNDCHSA 1919
            +GYVPDTSCVLHNV+E+EKE LLCGHSE+LAIAFG+LNT PG  IRV+KNLRVCNDCH A
Sbjct: 710  EGYVPDTSCVLHNVEEDEKEILLCGHSEKLAIAFGILNTSPGTIIRVAKNLRVCNDCHLA 769

Query: 1918 TKFISKISGREIVVRDVRRFHHFKDGACSCGDYW 1817
            TKFISKI  REI++RDVRRFH FK+G CSCGDYW
Sbjct: 770  TKFISKIVDREIILRDVRRFHRFKNGTCSCGDYW 803



 Score =  176 bits (447), Expect = 7e-41
 Identities = 132/426 (30%), Positives = 213/426 (50%), Gaps = 18/426 (4%)
 Frame = -3

Query: 4327 SWIESLRSHARSNSFRQAITTFVQMQTAGILPDNYAYPAVLKAATALQDLHLGKQIHASL 4148
            +W   L S  ++    +A+    +M   G+ PD +   +VL A + L+ L  GK++HA  
Sbjct: 182  TWNTVLSSLCQNEQLLEALEYLREMVLEGVEPDEFTISSVLPACSHLEMLRTGKELHAYA 241

Query: 4147 VKLG-YHSHSTVSNTVLHMYAKCGGDVDHVFKVFDRIPQRDQVSWNSLINALCKYQDWEL 3971
            +K G    +S V + ++ MY  C   V    +VFD +  R    WN++I    + +  + 
Sbjct: 242  LKNGSLDENSFVGSALVDMYCNC-KQVLSGRRVFDGMFDRKIGLWNAMIAGYSQNEHDKE 300

Query: 3970 ALEAFRLMGLEK---IEPSSFTLVSVALACSNLNRRDGLRLGKQLHGYIL-RVDERKTFT 3803
            AL  F  +G+E+   +  +S T+  V  AC    R       + +HG+++ R  +R  F 
Sbjct: 301  ALLLF--IGMEESAGLLANSTTMAGVVPACV---RSGAFSRKEAIHGFVVKRGLDRDRFV 355

Query: 3802 DNSLMAMYAKLGRVDDAKIIFEYFAHRDMVSWNTIISSFSQSDRFYEALEYFRYMND--- 3632
             N+LM MY++LG++D A  IF     RD+V+WNT+I+ +  S+   +AL     M +   
Sbjct: 356  QNTLMDMYSRLGKIDIAMRIFGKMEDRDLVTWNTMITGYVFSEHHEDALLLLHKMQNLER 415

Query: 3631 --------DGFKPDGFTVSSVLPACSHLELLDLGKEIHAFLFRNDHDFMRNSFVTSALVD 3476
                       KP+  T+ ++LP+C+ L  L  GKEIHA+  +N  +   +  V SALVD
Sbjct: 416  KVSKGASRVSLKPNSITLMTILPSCAALSALAKGKEIHAYAIKN--NLATDVAVGSALVD 473

Query: 3475 MYCNCKQVVSGSRVFDAALDRKLGTWNAMFAGYAQNGFYSEAVMLFMKLMVVPGLFPNPT 3296
            MY  C  +    +VFD    + + TWN +   Y  +G   EA+ L +++M+V G+ PN  
Sbjct: 474  MYAKCGCLQMSRKVFDQIPQKNVITWNVIIMAYGMHGNGQEAIDL-LRMMMVQGVKPNEV 532

Query: 3295 TMASVLPACVHCEAFVDKEVMHGYVLK--LGLERDRYVQNALMDLYSRVGKVDIAEYIFH 3122
            T  SV  AC H    VD+ +   YV+K   G+E        ++DL  R G++  A Y   
Sbjct: 533  TFISVFAACSH-SGMVDEGLRIFYVMKPDYGVEPSSDHYACVVDLLGRAGRIKEA-YQLM 590

Query: 3121 NMESKD 3104
            NM  +D
Sbjct: 591  NMMPRD 596


Top