BLASTX nr result

ID: Chrysanthemum21_contig00002179 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Chrysanthemum21_contig00002179
         (3587 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_022000416.1| pentatricopeptide repeat-containing protein ...  1224   0.0  
gb|KVH88994.1| Pentatricopeptide repeat-containing protein [Cyna...  1119   0.0  
ref|XP_023739512.1| pentatricopeptide repeat-containing protein ...  1077   0.0  
ref|XP_019265161.1| PREDICTED: pentatricopeptide repeat-containi...   913   0.0  
ref|XP_016494016.1| PREDICTED: pentatricopeptide repeat-containi...   912   0.0  
ref|XP_009589366.1| PREDICTED: pentatricopeptide repeat-containi...   911   0.0  
ref|XP_009766339.1| PREDICTED: pentatricopeptide repeat-containi...   910   0.0  
gb|PHT56486.1| hypothetical protein CQW23_04972 [Capsicum baccatum]   907   0.0  
gb|PHU26943.1| hypothetical protein BC332_05275 [Capsicum chinense]   900   0.0  
gb|EOY31969.1| Pentatricopeptide repeat (PPR) superfamily protei...   899   0.0  
ref|XP_017982149.1| PREDICTED: pentatricopeptide repeat-containi...   898   0.0  
ref|XP_021654749.1| pentatricopeptide repeat-containing protein ...   897   0.0  
ref|XP_016560496.1| PREDICTED: pentatricopeptide repeat-containi...   896   0.0  
ref|XP_006340744.2| PREDICTED: pentatricopeptide repeat-containi...   889   0.0  
ref|XP_015065860.1| PREDICTED: pentatricopeptide repeat-containi...   887   0.0  
ref|XP_021276401.1| pentatricopeptide repeat-containing protein ...   886   0.0  
ref|XP_010316424.1| PREDICTED: pentatricopeptide repeat-containi...   884   0.0  
ref|XP_022755453.1| pentatricopeptide repeat-containing protein ...   885   0.0  
ref|XP_002529510.1| PREDICTED: pentatricopeptide repeat-containi...   882   0.0  
gb|OMP06484.1| hypothetical protein COLO4_08106 [Corchorus olito...   881   0.0  

>ref|XP_022000416.1| pentatricopeptide repeat-containing protein At1g79540 [Helianthus
            annuus]
 ref|XP_022000417.1| pentatricopeptide repeat-containing protein At1g79540 [Helianthus
            annuus]
 ref|XP_022000419.1| pentatricopeptide repeat-containing protein At1g79540 [Helianthus
            annuus]
 gb|OTG00877.1| putative pentatricopeptide repeat (PPR) superfamily protein
            [Helianthus annuus]
          Length = 796

 Score = 1224 bits (3168), Expect = 0.0
 Identities = 606/796 (76%), Positives = 685/796 (86%), Gaps = 13/796 (1%)
 Frame = -3

Query: 3519 MKHITSLFPKPKSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQ 3340
            MK + S   +  + SS SR +TISTEVSNIVHS+DPMEP+L+Q+A FL+PDVITHV+Q Q
Sbjct: 1    MKKLHSHLLRHFTSSSTSRTSTISTEVSNIVHSVDPMEPALDQIASFLSPDVITHVIQTQ 60

Query: 3339 QDPFLCFRFFVWAAKRKHFRSWASHNLMISMLVG----------NKIDTFWSVLDDVRKC 3190
            Q+P LCFRFF+W+++RK FRSW SHNL+I+ML+           +  D + +VLD+V K 
Sbjct: 61   QNPHLCFRFFIWSSQRKQFRSWVSHNLIINMLISPNSTIPTKKLDLFDEYLTVLDEVNKL 120

Query: 3189 GFRVPSDAFAVLIGGYXXXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLA 3010
            G+RV SDAFA LIGGY            FG+MKE+ CE NLFTYNLVL VLV KGMVLLA
Sbjct: 121  GYRVSSDAFAALIGGYWGVKNGEKVMEVFGKMKEYGCEANLFTYNLVLHVLVSKGMVLLA 180

Query: 3009 LAVYNLMLKLNCHLNCSTYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTG 2830
            LAVYN+MLK+NCHLNC+TYSIL+DGLCKSGK++DAL LFDEM  RGI PSK+TYTVVL G
Sbjct: 181  LAVYNVMLKVNCHLNCATYSILVDGLCKSGKISDALELFDEMSKRGIAPSKVTYTVVLNG 240

Query: 2829 LCNAKRIDDAYRLFENMKSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLD 2650
            LC AKRIDDAYRLFE+M+    + D +  NALLNGVCKLGRM+EA  LLK F KDGF+LD
Sbjct: 241  LCQAKRIDDAYRLFEDMRRGFGEVDSVACNALLNGVCKLGRMEEALGLLKRFEKDGFELD 300

Query: 2649 LNSYSSLIDGLFRTKRFKEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVR 2470
            LN YSSLIDGLFRTKRFKEGH+M+++M+E DI PDVVL+TIMIRGLCDEGRVHDAFK+++
Sbjct: 301  LNGYSSLIDGLFRTKRFKEGHDMFKQMIEADITPDVVLYTIMIRGLCDEGRVHDAFKMIK 360

Query: 2469 EMTDRGLIPDTQAYNTLIKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYG 2290
            EMTDR L+PDTQAYNTLIKGFCD GLLDEARSLKLE+SGVDEFPD  TYTILISGMCRYG
Sbjct: 361  EMTDRNLMPDTQAYNTLIKGFCDAGLLDEARSLKLEISGVDEFPDACTYTILISGMCRYG 420

Query: 2289 LVGEAQNIFNEMEKAGCVPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLT 2110
            LVG+A+NIFNEMEK GCVPSVVTFN+LIDGLCKSGELQKAHL+FYKMEIGRNPSLFLRL 
Sbjct: 421  LVGDAENIFNEMEKHGCVPSVVTFNSLIDGLCKSGELQKAHLLFYKMEIGRNPSLFLRLA 480

Query: 2109 QGSDRVVDSGSLQTMVTKLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGIL 1930
            QGSDRVVDSGSLQT+VTKLCESGSTLKAYKLL Q+ADTAIMPTITTYNILINGLC+ G L
Sbjct: 481  QGSDRVVDSGSLQTLVTKLCESGSTLKAYKLLNQLADTAIMPTITTYNILINGLCKTGRL 540

Query: 1929 DGAFNLFKELQLKGISPDSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTL 1750
            DGAF LFKEL+LKGISPDSVTYGTLINGLQ  GREDDAF LLEEMV NGCKPTPAVY+TL
Sbjct: 541  DGAFKLFKELRLKGISPDSVTYGTLINGLQSVGREDDAFTLLEEMVGNGCKPTPAVYKTL 600

Query: 1749 MKWSCRRKKTFAAFNLWLRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQD 1570
            MKWSCRR+KTFAAFNLWL+YLK++PKRDE +I LVE+ L+K E+ER VRLLLDMD+KL D
Sbjct: 601  MKWSCRRRKTFAAFNLWLKYLKTIPKRDETVIKLVEDYLQKGEIERPVRLLLDMDLKLHD 660

Query: 1569 LDSAPYTIWLIGFCQGRKTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEV 1390
             DSAPYTIWLIGFCQG+KT EAL LLSIL EYNI+LTP SCVMLISTL REGKLS AVEV
Sbjct: 661  FDSAPYTIWLIGFCQGQKTNEALTLLSILNEYNINLTPPSCVMLISTLCREGKLSLAVEV 720

Query: 1389 FLYALQKGFILKPRICNNLLKSLLRSRYI---DDAFDLMEKMDSCGYNLDDYLDVGTRFL 1219
            FLYAL+KGF LKPRICN+L+KSLLRSR+I   DDAF LMEKM SCGY+LD YLD  T FL
Sbjct: 721  FLYALEKGFTLKPRICNSLIKSLLRSRFIAANDDAFALMEKMVSCGYDLDGYLDDDTNFL 780

Query: 1218 VRNHQRKQEMQNLLTK 1171
            + N  RKQE+Q++LT+
Sbjct: 781  LNNRLRKQEVQHVLTR 796


>gb|KVH88994.1| Pentatricopeptide repeat-containing protein [Cynara cardunculus var.
            scolymus]
          Length = 793

 Score = 1119 bits (2895), Expect = 0.0
 Identities = 554/793 (69%), Positives = 660/793 (83%), Gaps = 10/793 (1%)
 Frame = -3

Query: 3519 MKHITSLFPKPK------SFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVIT 3358
            MK  +SL  K +      S  S SR   IS+EV NIV+S++PME +LEQV PFL+P +IT
Sbjct: 1    MKKFSSLIYKSRCHCFSSSSPSSSRGAAISSEVLNIVNSLEPMETALEQVVPFLSPGIIT 60

Query: 3357 HVLQDQQDPFLCFRFFVWAAKRKHFRSWASHNLMISMLVGNKIDTF---WSVLDDVRKCG 3187
             VLQ+QQ+P LCFRF+VWAAKRK FRSW SHNL+I MLV + +D F   W VL++++ CG
Sbjct: 61   SVLQEQQNPSLCFRFYVWAAKRKQFRSWESHNLLIDMLVSSTLDMFDAYWKVLEEIKSCG 120

Query: 3186 FRVPSDAFAVLIGGYXXXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLAL 3007
             R+PSDAF VLI GY           +FG+MK+FDCEPNLFTYNL+L +L+ +GMVLLAL
Sbjct: 121  IRIPSDAFTVLIDGYWKMNNAEKAVESFGKMKDFDCEPNLFTYNLILHILINRGMVLLAL 180

Query: 3006 AVYNLMLKLNCHLNCSTYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGL 2827
            AVYN+MLKLN HLNCSTYSILI+GLCKS K +DAL LFDEMM +GI+PSK+TYT+VL+GL
Sbjct: 181  AVYNMMLKLNSHLNCSTYSILINGLCKSEKTSDALELFDEMMQKGIMPSKVTYTIVLSGL 240

Query: 2826 CNAKRIDDAYRLFENMKSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDL 2647
            C AKR+DDAYRLF NM+SS  KPD ITYN L+NGVCKLGRM+EAF LLK F KDG+DLDL
Sbjct: 241  CQAKRMDDAYRLFNNMRSSHCKPDFITYNTLVNGVCKLGRMEEAFVLLKAFNKDGYDLDL 300

Query: 2646 NSYSSLIDGLFRTKRFKEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVRE 2467
            N YS LIDGLFR + FKE H+M++KM+E  I PDV+L+TIMIRGLCD GRV DAF+ +R 
Sbjct: 301  NGYSCLIDGLFRARMFKEAHDMFQKMMEAGITPDVILYTIMIRGLCDAGRVQDAFEFLRN 360

Query: 2466 MTDRGLIPDTQAYNTLIKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGL 2287
            M+ RGL+PDT+AYNTLIKGFCD+GLLDEARSLKLE+S V++F D+ TYTILISGMC++GL
Sbjct: 361  MSSRGLVPDTRAYNTLIKGFCDKGLLDEARSLKLEISEVNQFADSCTYTILISGMCKHGL 420

Query: 2286 VGEAQNIFNEMEKAGCVPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQ 2107
            VGEAQNIF+EMEK GC+PSVVTFNAL+DGLCKSGELQKAH +FY+MEIGRNPSLFLRLTQ
Sbjct: 421  VGEAQNIFDEMEKLGCIPSVVTFNALMDGLCKSGELQKAHYLFYRMEIGRNPSLFLRLTQ 480

Query: 2106 GSDRVVDSGSLQTMVTKLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILD 1927
            GSDRVVDSGSLQT+VTKLCESG TLKAYKLLTQ+ADT I+P ITTYNILINGLC++G ++
Sbjct: 481  GSDRVVDSGSLQTLVTKLCESGLTLKAYKLLTQLADTTILPNITTYNILINGLCKSGKIN 540

Query: 1926 GAFNLFKELQLKGISPDSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLM 1747
            GA  L KELQLKG SPDSVTYGTLI+GLQ  GRE+DAF+LLEEMV NGC PT A+YR+LM
Sbjct: 541  GALKLLKELQLKGKSPDSVTYGTLIDGLQSIGRENDAFMLLEEMVKNGCTPTAAIYRSLM 600

Query: 1746 KWSCRRKKTFAAFNLWLRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDL 1567
            KWSCRRKKTFAAF+LWL++L S  KR+EK + LVEE L+K EVER VRLLLDMDIKL D 
Sbjct: 601  KWSCRRKKTFAAFSLWLKFLSSTLKREEKTMKLVEEQLQKGEVERPVRLLLDMDIKLGDF 660

Query: 1566 DSAPYTIWLIGFCQGRKTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVF 1387
            DSAPYTIWLIG CQ   T EAL L  ILKEYNI++TP SCV+LI+TL ++ KL+ A+EVF
Sbjct: 661  DSAPYTIWLIGLCQAHNTVEALKLFDILKEYNINVTPPSCVILIATLCKQKKLNLAIEVF 720

Query: 1386 LYALQKGFILKPRICNNLLKSLLRSRY-IDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRN 1210
            LY+L+KGFILKPRICNNLLKSL+ S+Y  + AF+L++KM+SCGY+LD  L+  TRFL+ +
Sbjct: 721  LYSLKKGFILKPRICNNLLKSLVHSQYEKEHAFELIKKMESCGYDLDACLNDHTRFLLSS 780

Query: 1209 HQRKQEMQNLLTK 1171
              R Q+   + T+
Sbjct: 781  CWRTQDTGIISTR 793


>ref|XP_023739512.1| pentatricopeptide repeat-containing protein At1g79540 [Lactuca
            sativa]
 gb|PLY69402.1| hypothetical protein LSAT_5X161280 [Lactuca sativa]
          Length = 768

 Score = 1077 bits (2785), Expect = 0.0
 Identities = 541/789 (68%), Positives = 648/789 (82%), Gaps = 6/789 (0%)
 Frame = -3

Query: 3519 MKHITSLFPKPK--SFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQ 3346
            MK ++ L  KP+   FSS SRVT IS+EVS+IVHS+DPMEP+LEQV PFL+P++IT VLQ
Sbjct: 1    MKKVSYLIHKPRFRCFSSSSRVTAISSEVSSIVHSVDPMEPALEQVVPFLSPEIITSVLQ 60

Query: 3345 DQQDPFLCFRFFVWAAKRKHFRSWASHNLMISMLVGNK---IDTFWSVLDDVRKCGFRVP 3175
            +Q++P+LCFRFFVWAAKRK FRSW SHNLMI++LV +     D +W  L++++  G R+P
Sbjct: 61   EQRNPYLCFRFFVWAAKRKQFRSWGSHNLMINLLVSDNSHLFDAYWRALEEIKSYGLRIP 120

Query: 3174 SDAFAVLIGGYXXXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYN 2995
            SDAFAVLI GY           +FG+MK+FDC+PNLFTYNL+L +LV KGM+LLALAVYN
Sbjct: 121  SDAFAVLIAGYWETKNAEKAVESFGKMKDFDCQPNLFTYNLILHILVNKGMILLALAVYN 180

Query: 2994 LMLKLNCHLNCSTYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAK 2815
            LMLKLNCH+NCSTY+ILIDGLCKS  ++DAL LF+EM  RGIVPSKITYTVVL+GLC AK
Sbjct: 181  LMLKLNCHMNCSTYTILIDGLCKSDDMSDALNLFNEMSQRGIVPSKITYTVVLSGLCQAK 240

Query: 2814 RIDDAYRLFENMKSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYS 2635
            RID+AY+LFENMK++  KPD ITYNALLNGVCKLG+MDEAF LLK F+KDGFDLDLNSYS
Sbjct: 241  RIDEAYQLFENMKTTV-KPDFITYNALLNGVCKLGKMDEAFDLLKSFKKDGFDLDLNSYS 299

Query: 2634 SLIDGLFRTKRFKEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDR 2455
            SLIDGLFR  RFKE H+ + K   + I PD +L+TIMIRGLCDEGRV DAFKL++EMTD+
Sbjct: 300  SLIDGLFRAHRFKEAHDTFHKATNSGITPDGILYTIMIRGLCDEGRVDDAFKLIKEMTDK 359

Query: 2454 GLIPDTQAYNTLIKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEA 2275
             L+PDTQAYNTLIKGFCD+GLLDEA+SLKLE+SGVD FPD+ TYTILIS MCRYGLV +A
Sbjct: 360  DLVPDTQAYNTLIKGFCDKGLLDEAQSLKLEISGVDNFPDSCTYTILISSMCRYGLVDDA 419

Query: 2274 QNIFNEMEKAGCVPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDR 2095
            QNIFNEMEK GCVPSV+TFNALIDGL KS + +K+  +F++MEIG+NP LFLR  QGSDR
Sbjct: 420  QNIFNEMEKYGCVPSVITFNALIDGLYKSKQPEKSFHLFHRMEIGKNPFLFLRFNQGSDR 479

Query: 2094 VVDSGSLQTMVTKLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFN 1915
            VV+       VT+LCESGS L+AYKLLTQI   + +PTITT NILINGLC+AG+LD A N
Sbjct: 480  VVE------RVTELCESGSPLEAYKLLTQI---SFLPTITTNNILINGLCKAGLLDAALN 530

Query: 1914 LFKELQLKGISPDSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSC 1735
            LFK+L++KG +PDSVTY TLI+GL+  GRE DA +L++EMV NGCKPTPAVY+TLMKWSC
Sbjct: 531  LFKKLKVKGNTPDSVTYSTLIHGLESVGREKDAIMLVKEMVENGCKPTPAVYKTLMKWSC 590

Query: 1734 RRKKTFAAFNLWLRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAP 1555
            RR KTFAAFNLWL YLK++ KRDE++I LVE  L+  E+ER +RLLLDMDIK +DLD AP
Sbjct: 591  RRNKTFAAFNLWLEYLKTVSKRDEEVIKLVENHLQNGEIERPMRLLLDMDIKSRDLDPAP 650

Query: 1554 YTIWLIGFCQGRKTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYAL 1375
            YTIWLIGFCQ  KTTEAL L +ILKEY+I+LTPASCVMLI+TL  E  L+ A+EVF Y+L
Sbjct: 651  YTIWLIGFCQVHKTTEALKLFAILKEYDITLTPASCVMLITTLLGESNLNMALEVFHYSL 710

Query: 1374 QKGFILKPRICNNLLKSLLRSRYIDD-AFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRK 1198
            QKGFILKPRIC+NLL+SL+RS  + + A +LM+KMDSCGY++D                K
Sbjct: 711  QKGFILKPRICSNLLRSLIRSHDMSNHAIELMKKMDSCGYDMDSI-----------KWFK 759

Query: 1197 QEMQNLLTK 1171
            Q +QN+LTK
Sbjct: 760  QGIQNVLTK 768


>ref|XP_019265161.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana attenuata]
 ref|XP_019265162.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana attenuata]
 ref|XP_019265163.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana attenuata]
 ref|XP_019265164.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana attenuata]
 ref|XP_019265165.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana attenuata]
 ref|XP_019265166.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana attenuata]
 gb|OIT35901.1| pentatricopeptide repeat-containing protein [Nicotiana attenuata]
          Length = 789

 Score =  913 bits (2359), Expect = 0.0
 Identities = 456/768 (59%), Positives = 585/768 (76%), Gaps = 5/768 (0%)
 Frame = -3

Query: 3486 KSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQ---QDPFLCFR 3316
            KSFSS SR  ++S EV NI+  ++PMEP+L+++  FL P+ I+ +L++Q   Q+P L FR
Sbjct: 15   KSFSS-SREMSVSNEVLNIIERVNPMEPALDKIVHFLCPNTISSILEEQRQNQNPQLGFR 73

Query: 3315 FFVWAAKRKHFRSWASHNLMISMLVGNK-IDTFWSVLDDVRKCGFRVPSDAFAVLIGGYX 3139
            FF+WAAKRK FRSW S NL++ MLV       +W+VLD+++  G  + SDAF  LI GY 
Sbjct: 74   FFIWAAKRKRFRSWVSQNLIVDMLVKEGGFGLYWNVLDELKVNGVSISSDAFGALIWGYW 133

Query: 3138 XXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCS 2959
                      AFG+M++++C+P+LFTYN++L + V K  +LLALAVYN+MLKLN   N S
Sbjct: 134  KVNKAEKAVEAFGKMRDYECKPDLFTYNMILHITVRKDAILLALAVYNVMLKLNSRPNSS 193

Query: 2958 TYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENM 2779
            T+SILIDGLCKSGK  DAL LFDEM +RG++PSKITYTV+L+GLC AKR DDAYRL   M
Sbjct: 194  TFSILIDGLCKSGKTHDALKLFDEMSERGVLPSKITYTVILSGLCQAKRTDDAYRLLNVM 253

Query: 2778 KSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRF 2599
            KS G +PD +TYNALLNG CKLGR++EA ALLK F  +G+ +DL  Y+ L+DG  R KR 
Sbjct: 254  KSRGCRPDFVTYNALLNGFCKLGRINEAQALLKSFENEGYLVDLKGYTCLVDGFVRIKRI 313

Query: 2598 KEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTL 2419
             E   +++K+ E ++ PDVVL+T MIRGL   GRV +A  L+R+MT RG++PDTQ YNTL
Sbjct: 314  DEAQSVFKKLFENNVVPDVVLYTTMIRGLSGAGRVKEALNLLRDMTGRGVLPDTQCYNTL 373

Query: 2418 IKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGC 2239
            IKGFCD GLLD+A+SL+LE+S  D FPDT TY+ILI GMCR GLV EA+ IFNEMEK GC
Sbjct: 374  IKGFCDMGLLDQAQSLRLEISENDCFPDTCTYSILICGMCRNGLVEEARLIFNEMEKLGC 433

Query: 2238 VPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVT 2059
             PSVVTFN LIDGLCK+GEL++AHLMFYKMEIG+NPSLFLRL+QG+DRV+DS SLQ MV 
Sbjct: 434  FPSVVTFNTLIDGLCKAGELKEAHLMFYKMEIGKNPSLFLRLSQGADRVLDSASLQKMVE 493

Query: 2058 KLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKGISP 1879
            KLCESG  LKAYKLL Q+AD  ++P I TYNILINGLC++G + GAFNLF+ELQLKG  P
Sbjct: 494  KLCESGKILKAYKLLLQLADCGVVPNIITYNILINGLCKSGKISGAFNLFEELQLKGHFP 553

Query: 1878 DSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLW 1699
            D++TYGTLI+GLQ A RE++AF LL++M  NGC P+  VY++LM WSCRR K   AF+LW
Sbjct: 554  DTITYGTLIDGLQRADREEEAFKLLDQMSKNGCMPSAEVYQSLMTWSCRRGKISIAFSLW 613

Query: 1698 LRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGR 1519
            L+YLK+   R+ ++I L+E+ + K ++E+AVR LL+MD+KL+D +S+PY IWLIG CQ R
Sbjct: 614  LKYLKTQAVRESEVIRLIEKHIEKGDLEKAVRGLLEMDLKLEDFNSSPYNIWLIGLCQAR 673

Query: 1518 KTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPRICN 1339
            K  + L + S+LKE+ +S++  SCVMLI +L  EG L  AVEVFLY +++G  L PRICN
Sbjct: 674  KPGDGLKIFSLLKEFGVSISAPSCVMLIHSLCEEGNLDQAVEVFLYTVERGVRLMPRICN 733

Query: 1338 NLLKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRK 1198
             LL++LLRS+     A DL+E+M S GYNL+DYL  GTR L +   R+
Sbjct: 734  KLLQTLLRSQDKAHHAVDLLERMRSTGYNLNDYLHSGTRSLFQRWNRR 781


>ref|XP_016494016.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like
            [Nicotiana tabacum]
 ref|XP_016494017.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like
            [Nicotiana tabacum]
 ref|XP_016494018.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like
            [Nicotiana tabacum]
          Length = 803

 Score =  912 bits (2358), Expect = 0.0
 Identities = 464/802 (57%), Positives = 596/802 (74%), Gaps = 7/802 (0%)
 Frame = -3

Query: 3582 PHKKITKTPMK--QQQQW*QNPKMKHITSLFPKPKSFSSLSRVTTISTEVSNIVHSIDPM 3409
            P  +I K+ +   Q  +   +P  + I+S     KSFSS      +S EV NI+  ++PM
Sbjct: 5    PETRIRKSTLHFLQAMKLLHSPLFRSISS-----KSFSS------VSNEVLNIIERVNPM 53

Query: 3408 EPSLEQVAPFLTPDVITHVLQDQ---QDPFLCFRFFVWAAKRKHFRSWASHNLMISMLVG 3238
            EP+L+++  FL P+ I+ +LQ+Q   Q+P L FRFF+W AKRK FRSW S NL++ MLV 
Sbjct: 54   EPALDKLVHFLCPNTISSILQEQRQNQNPQLGFRFFIWTAKRKRFRSWVSQNLILDMLVK 113

Query: 3237 NK-IDTFWSVLDDVRKCGFRVPSDAFAVLIGGYXXXXXXXXXXXAFGRMKEFDCEPNLFT 3061
                D +W+VLD+++  G  + SDAF  LI GY           AFG+MK+++C+P+LFT
Sbjct: 114  EGGFDLYWNVLDELKVNGVSISSDAFGALIWGYWKVNKAEKAVEAFGKMKDYECKPDLFT 173

Query: 3060 YNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCSTYSILIDGLCKSGKVADALVLFDEMM 2881
            YN++L + V K  +LLALAVYN+MLKLN   N ST+SILIDGLCKSGK  DAL LFDEM 
Sbjct: 174  YNMILHITVRKDAILLALAVYNVMLKLNSRPNSSTFSILIDGLCKSGKTHDALKLFDEMS 233

Query: 2880 DRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENMKSSGNKPDGITYNALLNGVCKLGRMD 2701
            +RG++PSKITYTV+L+GLC AKR DDAYRL   MKS G +PD +TYNALLNG CKLGR++
Sbjct: 234  ERGVLPSKITYTVILSGLCQAKRTDDAYRLLNVMKSRGCRPDFVTYNALLNGFCKLGRIN 293

Query: 2700 EAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRFKEGHEMYRKMLETDIQPDVVLHTIMI 2521
            EA ALLK F  +G+ +DL  Y+ L+DG  R KR  E   +++K+ E ++ PDVVL+T MI
Sbjct: 294  EAQALLKSFENEGYLVDLKGYTCLVDGFVRIKRIDEAQSVFKKLFENNVVPDVVLYTTMI 353

Query: 2520 RGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTLIKGFCDEGLLDEARSLKLEVSGVDEF 2341
            RGL   GRV +A  L+ +MT RG++PDTQ YNTLIKGFCD GLLD+ARSL+LE+S  D F
Sbjct: 354  RGLSGAGRVKEALSLLGDMTGRGVLPDTQCYNTLIKGFCDMGLLDQARSLRLEISENDCF 413

Query: 2340 PDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGCVPSVVTFNALIDGLCKSGELQKAHLM 2161
            PDT TY+ILI GMCR+GLV EA+ IFNEMEK GC PSVVTFN LIDGLCK+GEL++AHLM
Sbjct: 414  PDTCTYSILICGMCRHGLVEEARLIFNEMEKLGCFPSVVTFNTLIDGLCKAGELKEAHLM 473

Query: 2160 FYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVTKLCESGSTLKAYKLLTQIADTAIMPT 1981
            FYKMEIG+NPSLFLRL+QG DRV+DS SLQ MV KLCESG  LKAYKLL Q+AD  ++P 
Sbjct: 474  FYKMEIGKNPSLFLRLSQGVDRVLDSASLQKMVEKLCESGKILKAYKLLMQLADCGVVPN 533

Query: 1980 ITTYNILINGLCEAGILDGAFNLFKELQLKGISPDSVTYGTLINGLQIAGREDDAFLLLE 1801
            I TYNILINGLC++G + GAFNLF+ELQLKG  PD++TYGTLI+GLQ A RE++AF LL+
Sbjct: 534  IITYNILINGLCKSGKISGAFNLFEELQLKGHFPDTITYGTLIDGLQRADREEEAFKLLD 593

Query: 1800 EMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLWLRYLKSLPKRDEKIITLVEECLRKDE 1621
            +M  NGC P+  VY++LM WSCRR K   AF+LWL+YLK+   R+ ++I L+E+ + K +
Sbjct: 594  QMSKNGCMPSAEVYQSLMTWSCRRGKISIAFSLWLKYLKTQAVRESEVIGLIEKHIEKGD 653

Query: 1620 VERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGRKTTEALMLLSILKEYNISLTPASCVM 1441
            +E+AVR LL+MD+KL+D +S+PY IWLIG CQ RK  +AL + S+LKE+ +S++  SCVM
Sbjct: 654  LEKAVRGLLEMDLKLEDFNSSPYNIWLIGLCQARKPGDALKIFSLLKEFGVSISAPSCVM 713

Query: 1440 LISTLYREGKLSFAVEVFLYALQKGFILKPRICNNLLKSLLRSR-YIDDAFDLMEKMDSC 1264
            LI +L  EG L  AVEVFLY +++G  L PRICN LL++LLRS+     A DL+E+M S 
Sbjct: 714  LIHSLCEEGNLDQAVEVFLYTVERGVRLMPRICNRLLQTLLRSQDKAQHAVDLLERMRST 773

Query: 1263 GYNLDDYLDVGTRFLVRNHQRK 1198
            GYNL+DYL  GTR L +   R+
Sbjct: 774  GYNLNDYLHSGTRSLFQRWNRR 795


>ref|XP_009589366.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana tomentosiformis]
 ref|XP_018623078.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana tomentosiformis]
 ref|XP_018623079.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana tomentosiformis]
 ref|XP_018623080.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana tomentosiformis]
 ref|XP_018623081.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana tomentosiformis]
 ref|XP_018623082.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana tomentosiformis]
 ref|XP_018623083.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana tomentosiformis]
          Length = 803

 Score =  911 bits (2355), Expect = 0.0
 Identities = 463/802 (57%), Positives = 596/802 (74%), Gaps = 7/802 (0%)
 Frame = -3

Query: 3582 PHKKITKTPMK--QQQQW*QNPKMKHITSLFPKPKSFSSLSRVTTISTEVSNIVHSIDPM 3409
            P  +I K+ +   Q  +   +P  + I+S     KSFSS      +S EV NI+  ++PM
Sbjct: 5    PETRIRKSTLHFLQAMKLLHSPLFRSISS-----KSFSS------VSNEVLNIIERVNPM 53

Query: 3408 EPSLEQVAPFLTPDVITHVLQDQ---QDPFLCFRFFVWAAKRKHFRSWASHNLMISMLVG 3238
            EP+L+++  FL P+ I+ +LQ+Q   Q+P L FRFF+W AKRK FRSW S NL++ MLV 
Sbjct: 54   EPALDKLVHFLCPNTISSILQEQRQNQNPQLGFRFFIWTAKRKRFRSWVSQNLILDMLVK 113

Query: 3237 NK-IDTFWSVLDDVRKCGFRVPSDAFAVLIGGYXXXXXXXXXXXAFGRMKEFDCEPNLFT 3061
                D +W+VLD+++  G  + SDAF  LI GY           AFG+MK+++C+P+LFT
Sbjct: 114  EGGFDLYWNVLDELKVNGVSISSDAFGALIWGYWKVNKAEKAVEAFGKMKDYECKPDLFT 173

Query: 3060 YNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCSTYSILIDGLCKSGKVADALVLFDEMM 2881
            YN++L + V K  +LLALAVYN+MLKLN   N ST+SILIDGLCKSGK  DAL LFDEM 
Sbjct: 174  YNMILHITVRKDAILLALAVYNVMLKLNSRPNSSTFSILIDGLCKSGKTHDALKLFDEMS 233

Query: 2880 DRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENMKSSGNKPDGITYNALLNGVCKLGRMD 2701
            +RG++PSKITYTV+L+GLC AKR DDAYRL   MKS G +PD +TYNALLNG CKLGR++
Sbjct: 234  ERGVLPSKITYTVILSGLCQAKRTDDAYRLLNVMKSRGCRPDFVTYNALLNGFCKLGRIN 293

Query: 2700 EAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRFKEGHEMYRKMLETDIQPDVVLHTIMI 2521
            EA ALLK F  +G+ +DL  Y+ L+DG  R KR  E   +++K+ E ++ PDVVL+T MI
Sbjct: 294  EAQALLKSFENEGYLVDLKGYTCLVDGFVRIKRIDEAQSVFKKLFENNVVPDVVLYTTMI 353

Query: 2520 RGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTLIKGFCDEGLLDEARSLKLEVSGVDEF 2341
            RGL   GRV +A  L+ +MT RG++PDTQ YNTLIKGFCD GLLD+ARSL+LE+S  D F
Sbjct: 354  RGLSGAGRVKEALSLLGDMTGRGVLPDTQCYNTLIKGFCDMGLLDQARSLRLEISENDCF 413

Query: 2340 PDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGCVPSVVTFNALIDGLCKSGELQKAHLM 2161
            PDT TY+ILI GMCR+GLV EA+ IFNEMEK GC PSVVTFN LIDGLCK+GEL++AHLM
Sbjct: 414  PDTCTYSILICGMCRHGLVEEARLIFNEMEKLGCFPSVVTFNTLIDGLCKAGELKEAHLM 473

Query: 2160 FYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVTKLCESGSTLKAYKLLTQIADTAIMPT 1981
            FYKMEIG+NPSLFLRL+QG+DRV+DS SLQ  V KLCESG  LKAYKLL Q+AD  ++P 
Sbjct: 474  FYKMEIGKNPSLFLRLSQGADRVLDSASLQKTVEKLCESGKILKAYKLLMQLADCGVVPN 533

Query: 1980 ITTYNILINGLCEAGILDGAFNLFKELQLKGISPDSVTYGTLINGLQIAGREDDAFLLLE 1801
            I TYNILINGLC++G + GAFNLF+ELQLKG  PD++TYGTLI+GLQ A RE++AF LL+
Sbjct: 534  IITYNILINGLCKSGKISGAFNLFEELQLKGHFPDTITYGTLIDGLQRADREEEAFKLLD 593

Query: 1800 EMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLWLRYLKSLPKRDEKIITLVEECLRKDE 1621
            +M  NGC P+  VY++LM WSCRR K   AF+LWL+YLK+   R+ ++I L+E+ + K +
Sbjct: 594  QMSKNGCMPSAEVYQSLMTWSCRRGKISIAFSLWLKYLKTQAVRESEVIGLIEKHIEKGD 653

Query: 1620 VERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGRKTTEALMLLSILKEYNISLTPASCVM 1441
            +E+AVR LL+MD+KL+D +S+PY IWLIG CQ RK  +AL + S+LKE+ +S++  SCVM
Sbjct: 654  LEKAVRGLLEMDLKLEDFNSSPYNIWLIGLCQARKPGDALKIFSLLKEFGVSISAPSCVM 713

Query: 1440 LISTLYREGKLSFAVEVFLYALQKGFILKPRICNNLLKSLLRSR-YIDDAFDLMEKMDSC 1264
            LI +L  EG L  AVEVFLY +++G  L PRICN LL++LLRS+     A DL+E+M S 
Sbjct: 714  LIHSLCEEGNLDQAVEVFLYTVERGVRLMPRICNRLLQTLLRSQDKAQHAVDLLERMRST 773

Query: 1263 GYNLDDYLDVGTRFLVRNHQRK 1198
            GYNL+DYL  GTR L +   R+
Sbjct: 774  GYNLNDYLHSGTRSLFQRWNRR 795


>ref|XP_009766339.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana sylvestris]
 ref|XP_009766340.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana sylvestris]
 ref|XP_009766341.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana sylvestris]
 ref|XP_009766343.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana sylvestris]
 ref|XP_009766344.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana sylvestris]
 ref|XP_009766345.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Nicotiana sylvestris]
 ref|XP_016461853.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like
            [Nicotiana tabacum]
 ref|XP_016461854.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like
            [Nicotiana tabacum]
 ref|XP_016461855.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like
            [Nicotiana tabacum]
 ref|XP_016461856.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like
            [Nicotiana tabacum]
 ref|XP_016461857.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540-like
            [Nicotiana tabacum]
          Length = 789

 Score =  910 bits (2353), Expect = 0.0
 Identities = 456/768 (59%), Positives = 585/768 (76%), Gaps = 5/768 (0%)
 Frame = -3

Query: 3486 KSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQ---QDPFLCFR 3316
            KSFSS SR  ++S +V NI+  ++PMEP+L+++  FL P+ I+ +L++Q   Q+P L FR
Sbjct: 15   KSFSS-SREMSVSNKVLNIIERVNPMEPALDKLVHFLCPNTISSILEEQRQNQNPQLGFR 73

Query: 3315 FFVWAAKRKHFRSWASHNLMISMLVGNK-IDTFWSVLDDVRKCGFRVPSDAFAVLIGGYX 3139
            FF+W AKRK FRSW   NL++ MLV     D +W+VLD+++  G  + SDAF  LI GY 
Sbjct: 74   FFIWTAKRKRFRSWVLQNLIVDMLVKEGGFDLYWNVLDELKVNGVSISSDAFGALIWGYW 133

Query: 3138 XXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCS 2959
                      AFG+MK+++C+P+LFTYN++L ++V K  +LLALAVYN+MLKLN   N S
Sbjct: 134  KVNKAEKAVEAFGKMKDYECKPDLFTYNMILHIMVRKDAILLALAVYNVMLKLNSRPNSS 193

Query: 2958 TYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENM 2779
            T+SILIDGLCKS K  DAL LFDEM +RG++PSKITYTV+L+GLC AKR DDAYRL   M
Sbjct: 194  TFSILIDGLCKSRKTHDALKLFDEMSERGVLPSKITYTVILSGLCQAKRADDAYRLLNVM 253

Query: 2778 KSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRF 2599
            KS G +PD +TYNALLNG CKLGR++EA ALLK F  +G+ +DL  Y+ L+DG  R KR 
Sbjct: 254  KSRGCRPDFVTYNALLNGFCKLGRINEAQALLKSFENEGYLVDLKGYTCLVDGFVRIKRI 313

Query: 2598 KEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTL 2419
             E   +++K+ E ++ PDVVL+T MIRGL   GRV +A  L+R+MT RG++PDTQ YNTL
Sbjct: 314  DEAQSVFKKLFENNVVPDVVLYTTMIRGLSGAGRVKEALSLLRDMTGRGVLPDTQCYNTL 373

Query: 2418 IKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGC 2239
            IKGFCD GLLD+ARSL+LE+S  D FPDT TY+ILI GMCR GLV EA+ IFNEMEK GC
Sbjct: 374  IKGFCDMGLLDQARSLRLEISENDCFPDTFTYSILICGMCRNGLVEEARLIFNEMEKLGC 433

Query: 2238 VPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVT 2059
             PSVVTFN LIDGLCK+GEL++AHLMFYKMEIG+NPSLFLRL+QG+DRV+DS SLQ MV 
Sbjct: 434  FPSVVTFNTLIDGLCKAGELKEAHLMFYKMEIGKNPSLFLRLSQGADRVLDSASLQKMVE 493

Query: 2058 KLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKGISP 1879
            KLCESG  LKAYKLL Q+AD  ++P I TYNILINGLC++G + GAFNLF+ELQLKG  P
Sbjct: 494  KLCESGKILKAYKLLMQLADCGVVPNIITYNILINGLCKSGKISGAFNLFEELQLKGHFP 553

Query: 1878 DSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLW 1699
            D++TYGTLI+GLQ A RE++AF LL++M  NGC P+  VY++LM WSCRR K   AF+LW
Sbjct: 554  DTITYGTLIDGLQRADREEEAFKLLDQMSKNGCMPSAEVYQSLMTWSCRRGKISIAFSLW 613

Query: 1698 LRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGR 1519
            L+YLK+   R+ ++I L+E+ + K ++E+AVR LL+MD+KL+D +S+PY IWLIG CQ R
Sbjct: 614  LKYLKTQAVRESEMIGLIEKHIEKGDLEKAVRGLLEMDLKLEDFNSSPYNIWLIGLCQAR 673

Query: 1518 KTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPRICN 1339
            K  +AL + S+LKE+ +S++  SCVMLI +L  EG L  AVEVFLY +++G  L PRICN
Sbjct: 674  KPGDALKIFSLLKEFGVSISAPSCVMLIHSLCEEGNLDQAVEVFLYTVERGVRLMPRICN 733

Query: 1338 NLLKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRK 1198
             LL++LLRS+     A DL+E+M S GYNL+DYL  GTR L +   R+
Sbjct: 734  RLLQTLLRSQDKAQHAVDLLERMRSTGYNLNDYLHSGTRSLFQRWNRR 781


>gb|PHT56486.1| hypothetical protein CQW23_04972 [Capsicum baccatum]
          Length = 814

 Score =  907 bits (2344), Expect = 0.0
 Identities = 451/767 (58%), Positives = 579/767 (75%), Gaps = 3/767 (0%)
 Frame = -3

Query: 3486 KSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVL-QDQQDPFLCFRFF 3310
            K++   SR   IS EV NI+   DP+EP+L+++  F+ P+V++ +L Q +++  L FRFF
Sbjct: 41   KTYIPSSRGMGISNEVLNIIEKEDPIEPALDELVDFMCPNVVSFILEQKRENRELSFRFF 100

Query: 3309 VWAAKRKHFRSWASHNLMISMLVGNK-IDTFWSVLDDVRKCGFRVPSDAFAVLIGGYXXX 3133
            +WAAKR  F SW S N++  ML+ +   D +WSVLD ++  G  + S AF  LI GY   
Sbjct: 101  IWAAKRNRFISWVSENMIEDMLLRDGGFDLYWSVLDKLKFSGIDITSRAFGTLIWGYWKV 160

Query: 3132 XXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCSTY 2953
                    AFGRM +FDC+PNLFTYN++L + V K  +LLALAVYN+MLKLN + NCST+
Sbjct: 161  NKAEKAVEAFGRMNDFDCKPNLFTYNMILHITVQKDAILLALAVYNVMLKLNSNPNCSTF 220

Query: 2952 SILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENMKS 2773
            SILIDGLCKSGK  DAL LFDEM +RG++P+KITYTV+L+GLC AKR DDA+RL   MKS
Sbjct: 221  SILIDGLCKSGKTQDALKLFDEMSERGVLPNKITYTVILSGLCQAKRTDDAHRLLNVMKS 280

Query: 2772 SGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRFKE 2593
             G KPD + YNALLNG CKLGR+DEA+ LL+ F  +G+  D+  ++ L+DG  RTKR  E
Sbjct: 281  RGCKPDFVAYNALLNGFCKLGRVDEAYTLLRSFESEGYVADIKGFTCLVDGFVRTKRIDE 340

Query: 2592 GHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTLIK 2413
               +++K+LETD+ PDVVL+T MIRGL   GRV +A  L+R+MT RG+ PDTQ YNTLIK
Sbjct: 341  AQSVFKKLLETDVVPDVVLYTTMIRGLSGAGRVKEALNLLRDMTGRGVQPDTQCYNTLIK 400

Query: 2412 GFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGCVP 2233
            GFCD  LLD+ARSL+LE+S  + FPDT TY+ILI GMCR GLV EA+NIFNEME  GC P
Sbjct: 401  GFCDMDLLDQARSLQLEISENECFPDTCTYSILICGMCRNGLVEEARNIFNEMENLGCFP 460

Query: 2232 SVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVTKL 2053
            SVVTFN LIDGLCK+GEL++AHLMFYKMEIG+NPSLFLRL+QG+DRV+D+ SLQ MV KL
Sbjct: 461  SVVTFNTLIDGLCKAGELEEAHLMFYKMEIGKNPSLFLRLSQGADRVLDNVSLQKMVEKL 520

Query: 2052 CESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKGISPDS 1873
            CESG  LKAYKLL Q+AD  ++P + TYNILINGLC++G ++GAF LF+ELQ+KG  PDS
Sbjct: 521  CESGKILKAYKLLMQLADCGVVPNLVTYNILINGLCKSGKINGAFKLFQELQVKGHLPDS 580

Query: 1872 VTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLWLR 1693
            +TYGTLI+GLQ   RE++AF LL++M  NGC P+  VY++LM WSCRR +   AFNLWL+
Sbjct: 581  ITYGTLIDGLQRVDREEEAFKLLDQMSKNGCMPSAEVYKSLMTWSCRRGQISIAFNLWLK 640

Query: 1692 YLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGRKT 1513
            YL++   RD+++I L+E+ L K E+E+ VR LL+MD+KL+D DS+PY IWLIG CQ RK 
Sbjct: 641  YLRNQAIRDDEVIGLIEKHLEKGELEKVVRGLLEMDLKLEDFDSSPYNIWLIGMCQERKP 700

Query: 1512 TEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPRICNNL 1333
            ++AL + S+L+E+N+ ++  SCVMLI +L  EG L  AVEVFLY L++G  L PRICN L
Sbjct: 701  SDALKIFSLLEEFNVMISAPSCVMLIHSLCEEGNLDQAVEVFLYTLERGVRLMPRICNRL 760

Query: 1332 LKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRKQ 1195
            L+SLLRS+     A  L+E+M S GYNL+DYL  GTRFL R   R++
Sbjct: 761  LQSLLRSQDKAQHAVGLLERMRSVGYNLNDYLHSGTRFLFRRWNRRE 807


>gb|PHU26943.1| hypothetical protein BC332_05275 [Capsicum chinense]
          Length = 788

 Score =  900 bits (2326), Expect = 0.0
 Identities = 448/767 (58%), Positives = 576/767 (75%), Gaps = 3/767 (0%)
 Frame = -3

Query: 3486 KSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVL-QDQQDPFLCFRFF 3310
            K++   SR   IS EV NI+   DP+EP+L+ +  F+ P+V++ +L Q +++  L FRFF
Sbjct: 15   KTYIPSSRGMGISNEVLNIIEKEDPIEPALDDLVDFMCPNVVSFILEQKRENRELSFRFF 74

Query: 3309 VWAAKRKHFRSWASHNLMISMLVGNK-IDTFWSVLDDVRKCGFRVPSDAFAVLIGGYXXX 3133
            +WAAKR  F SW S N++  ML+ +   D +WSVLD ++  G  + S AF  LI GY   
Sbjct: 75   IWAAKRNRFISWVSENMIEDMLLRDGGFDLYWSVLDKLKLSGIDITSRAFGTLIWGYWKV 134

Query: 3132 XXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCSTY 2953
                    AFGRM +FDC+PNLFTYN++L + V K  +LLALAVYN+MLKLN + NCST+
Sbjct: 135  NKAEKAVEAFGRMNDFDCKPNLFTYNMILHITVQKDAILLALAVYNVMLKLNSNPNCSTF 194

Query: 2952 SILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENMKS 2773
            SILIDGLCKSGK  DAL LFDEM +RG++P+KITYTV+L+GLC AKR DDA+RL   MKS
Sbjct: 195  SILIDGLCKSGKTQDALKLFDEMSERGVLPNKITYTVILSGLCQAKRTDDAHRLLNVMKS 254

Query: 2772 SGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRFKE 2593
             G KPD + YNALLNG CKLGR+DEA+ LL+ F  +G+  D+  ++ L+DG  RTKR  E
Sbjct: 255  RGCKPDFVAYNALLNGFCKLGRVDEAYTLLRSFESEGYVADIKGFTCLVDGFVRTKRIDE 314

Query: 2592 GHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTLIK 2413
               +++K+LETD+ PDVVL+T MIRGL   GRV +A  L+R+MT RG+ PDTQ YNTLIK
Sbjct: 315  AQSVFKKLLETDVVPDVVLYTTMIRGLSGAGRVKEALNLLRDMTGRGVQPDTQCYNTLIK 374

Query: 2412 GFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGCVP 2233
            GFCD  LLD+AR+L+LE+S  + FPDT TY+ILI GMCR GLV EA+NIFNEME  GC P
Sbjct: 375  GFCDMDLLDQARALQLEISENECFPDTCTYSILICGMCRNGLVEEARNIFNEMENLGCFP 434

Query: 2232 SVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVTKL 2053
            SVVTFN LIDGLCK+GEL++AHLMFYKMEIG+NPSLFLRL+QG+DRV+D+ SLQ MV KL
Sbjct: 435  SVVTFNTLIDGLCKAGELEEAHLMFYKMEIGKNPSLFLRLSQGADRVLDNVSLQKMVEKL 494

Query: 2052 CESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKGISPDS 1873
            CESG  LKAYKLL Q+AD  ++P + TYNILINGLC++G ++GAF LF+ELQ+KG  PDS
Sbjct: 495  CESGKILKAYKLLMQLADCGVVPNLVTYNILINGLCKSGKINGAFKLFQELQVKGHLPDS 554

Query: 1872 VTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLWLR 1693
            +TYGTLI+GLQ   RE++AF LL++M  NGC P+  VY++LM WSCRR +   AF+LWL+
Sbjct: 555  ITYGTLIDGLQRVDREEEAFKLLDQMSKNGCMPSAEVYKSLMTWSCRRGQISIAFSLWLK 614

Query: 1692 YLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGRKT 1513
            YL++   RD+++I L+E+ L K E+E+ VR LL+MD+KL+D DS+PY IWLIG CQ RK 
Sbjct: 615  YLRNQAVRDDEVIGLIEKHLEKGELEKVVRGLLEMDLKLEDFDSSPYNIWLIGMCQERKP 674

Query: 1512 TEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPRICNNL 1333
             +AL + S+L+E+N+ ++  SCVMLI +L  EG L  AVEVFLY L++G  L PRICN L
Sbjct: 675  RDALKIFSLLEEFNVMISAPSCVMLIHSLCEEGNLDQAVEVFLYTLERGVRLMPRICNRL 734

Query: 1332 LKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRKQ 1195
            L+SLLRS+     A  L+E+M S GYNL+DYL  GTR L R   R++
Sbjct: 735  LQSLLRSQDKAQHAVGLLERMRSVGYNLNDYLHSGTRSLFRRWNRRE 781


>gb|EOY31969.1| Pentatricopeptide repeat (PPR) superfamily protein, putative
            [Theobroma cacao]
          Length = 800

 Score =  899 bits (2324), Expect = 0.0
 Identities = 456/776 (58%), Positives = 575/776 (74%), Gaps = 3/776 (0%)
 Frame = -3

Query: 3501 LFPKPKSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQQDPFLC 3322
            L P   SFSSL   + +S E+ +I+  ++PMEP+LE + PFL+PD++T ++QDQ +P L 
Sbjct: 22   LSPNFSSFSSLQDFS-VSNEIHSILDIVNPMEPALEPLLPFLSPDIVTSIIQDQPNPQLG 80

Query: 3321 FRFFVWAAKRKHFRSWASHNLMISMLV--GNKIDTFWSVLDDVRKCGFRVPSDAFAVLIG 3148
            FRFF+WA +RK  RS AS  L++ ML+   N  D +W  L++++KCG  + SDAF VLI 
Sbjct: 81   FRFFIWAMQRKRLRSSASDKLVVDMLLRKDNGFDMYWQTLEEIKKCGALIVSDAFKVLIS 140

Query: 3147 GYXXXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHL 2968
            GY            FG+MK+FDC+P++FTYN +L V+V + ++LLALAVYN MLK N   
Sbjct: 141  GYSKLGLDEKAVECFGKMKDFDCKPDVFTYNTILYVMVRRKVLLLALAVYNQMLKNNYKP 200

Query: 2967 NCSTYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLF 2788
            N +T+SILIDGLCK+GK  DAL +FDEM  RGI P++ +YT++++GLC A R DDA RL 
Sbjct: 201  NRATFSILIDGLCKNGKTEDALNMFDEMTQRGIEPNRCSYTIIVSGLCQADRADDACRLL 260

Query: 2787 ENMKSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRT 2608
              MK SG  PD + YNALLNG C+LGR+DEAFALL+ F+KDGF L L  YSS I+GLFR 
Sbjct: 261  NKMKESGCSPDFVAYNALLNGFCQLGRVDEAFALLQSFQKDGFVLGLRGYSSFINGLFRA 320

Query: 2607 KRFKEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAY 2428
            +RF+E +  Y KM E +++PDVVL+ IM+RGL   G+V DA KL+ EMT+RGL+PDT  Y
Sbjct: 321  RRFEEAYAWYTKMFEENVKPDVVLYAIMLRGLSVAGKVEDAMKLLSEMTERGLVPDTYCY 380

Query: 2427 NTLIKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEK 2248
            N +IKGFCD GLLD+ARSL+LE+S  D FP+  TYTILISGMC+ GLVGEAQ IF+EMEK
Sbjct: 381  NAVIKGFCDTGLLDQARSLQLEISSYDCFPNACTYTILISGMCQNGLVGEAQQIFDEMEK 440

Query: 2247 AGCVPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQT 2068
             GC PSVVTFNALIDGL K+G+L+KAHL+FYKMEIGRNPSLFLRL+ GS  V+DS SLQT
Sbjct: 441  LGCFPSVVTFNALIDGLSKAGQLEKAHLLFYKMEIGRNPSLFLRLSHGSSGVLDSSSLQT 500

Query: 2067 MVTKLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKG 1888
            MV +L ESG  LKAY++L Q+AD   +P I TYNILI+G C+AG ++GAF LFKELQLKG
Sbjct: 501  MVEQLYESGRILKAYRILMQLADGGNVPDIFTYNILIHGFCKAGNINGAFKLFKELQLKG 560

Query: 1887 ISPDSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAF 1708
            ISPDSVTYGTLING Q+AGRE+DAF + ++MV NGCKP+ AVYR+LM WSCRR+K   AF
Sbjct: 561  ISPDSVTYGTLINGFQMAGREEDAFRIFDQMVKNGCKPSVAVYRSLMTWSCRRRKVSLAF 620

Query: 1707 NLWLRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFC 1528
            NLWL YL+SLP R + +I  VE+   + +VE+AVR LL MD KL     APYTIWLIG C
Sbjct: 621  NLWLMYLRSLPGRQDTVIKEVEKYFDEGQVEKAVRGLLRMDFKLNSFSVAPYTIWLIGLC 680

Query: 1527 QGRKTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPR 1348
            Q  +  EAL +  IL+E  + +TP SCV LI  L +EG L  AV+VFLY L++GF L PR
Sbjct: 681  QAGRVEEALKIFYILEECKVVVTPPSCVRLIVGLCKEGNLDLAVDVFLYTLEQGFKLMPR 740

Query: 1347 ICNNLLKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRKQEMQN 1183
            ICN LLKSLLRS+     AF L+ KM+S  Y+LD YL   T+ L+  H    +M+N
Sbjct: 741  ICNYLLKSLLRSKDKRMHAFGLLSKMNSQRYDLDAYLHKTTKSLLYRHWHTWKMEN 796


>ref|XP_017982149.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Theobroma cacao]
          Length = 800

 Score =  898 bits (2321), Expect = 0.0
 Identities = 454/777 (58%), Positives = 576/777 (74%), Gaps = 3/777 (0%)
 Frame = -3

Query: 3501 LFPKPKSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQQDPFLC 3322
            L P   SFSSL   + +S E+ +I+  ++PMEP+LE + PFL+PD++T ++QDQ +P L 
Sbjct: 22   LSPNFSSFSSLQDFS-VSNEIHSILDIVNPMEPALEPLLPFLSPDIVTSIIQDQPNPQLG 80

Query: 3321 FRFFVWAAKRKHFRSWASHNLMISMLV--GNKIDTFWSVLDDVRKCGFRVPSDAFAVLIG 3148
            FRFF+WA + K  RS AS  L++ ML+   N  D +W  L++++KCG  + SDAF VLI 
Sbjct: 81   FRFFIWAMQSKRLRSSASDKLVVDMLLRKDNGFDMYWQTLEEIKKCGALIVSDAFKVLIS 140

Query: 3147 GYXXXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHL 2968
            GY            FG+MK+FDC+P++FTYN +L V+V + ++LLALAVYN MLK N   
Sbjct: 141  GYSKLGLDEKAVECFGKMKDFDCKPDVFTYNTILYVMVRRKVLLLALAVYNQMLKNNYKA 200

Query: 2967 NCSTYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLF 2788
            N +T+SILIDGLCK+GK  DAL +FDEM  RGI P++ +YT++++GLC A R DDA RL 
Sbjct: 201  NRATFSILIDGLCKNGKTEDALNMFDEMTQRGIEPNRCSYTIIVSGLCQADRADDACRLL 260

Query: 2787 ENMKSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRT 2608
              MK SG  PD + YNALLNG C+LGR+DEAFALL+ F+KDGF L L  YSS I+GLFR 
Sbjct: 261  NKMKESGCSPDFVAYNALLNGFCQLGRVDEAFALLQSFQKDGFVLGLRGYSSFINGLFRA 320

Query: 2607 KRFKEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAY 2428
            +RF+E +  Y KM E +++PDVVL+ IM+RGL   G+V DA KL+ EMT+RGL+PDT  Y
Sbjct: 321  RRFEEAYAWYTKMFEENVKPDVVLYAIMLRGLSVAGKVEDAMKLLSEMTERGLVPDTYCY 380

Query: 2427 NTLIKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEK 2248
            N +IKGFCD GLLD+ARSL+LE+S  D FP+  TYTILISGMC+ GLVGEAQ IF+EMEK
Sbjct: 381  NAVIKGFCDTGLLDQARSLQLEISSYDCFPNACTYTILISGMCQNGLVGEAQQIFDEMEK 440

Query: 2247 AGCVPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQT 2068
             GC PSVVTFNALIDGL K+G+L+KAHL+FYKMEIGRNPSLFLRL+ GS  V+DS SLQT
Sbjct: 441  LGCFPSVVTFNALIDGLSKAGQLEKAHLLFYKMEIGRNPSLFLRLSHGSSGVLDSSSLQT 500

Query: 2067 MVTKLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKG 1888
            MV +L ESG  LKAY++L Q+AD   +P I TYNILI+G C+AG ++GAF LFKELQLKG
Sbjct: 501  MVEQLYESGRILKAYRILMQLADGGNVPDIFTYNILIHGFCKAGNINGAFKLFKELQLKG 560

Query: 1887 ISPDSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAF 1708
            ISPDSVTYGTLING Q+AGRE+DAF + ++MV NGCKP+ AVYR+LM WSCRR+K   AF
Sbjct: 561  ISPDSVTYGTLINGFQMAGREEDAFRIFDQMVKNGCKPSVAVYRSLMTWSCRRRKVSLAF 620

Query: 1707 NLWLRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFC 1528
            NLWL YL+SLP R + +I  VE+   + +VE+AVR LL MD KL     APYTIWLIG C
Sbjct: 621  NLWLMYLRSLPGRQDTVIKEVEKYFDEGQVEKAVRGLLKMDFKLNSFSVAPYTIWLIGLC 680

Query: 1527 QGRKTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPR 1348
            Q  +  EAL +  IL+E  + ++P SCV LI  L +EG L  AV+VFLY L++GF L PR
Sbjct: 681  QAGRVEEALKIFYILEECKVVVSPPSCVRLIVGLCKEGNLDLAVDVFLYTLEQGFKLMPR 740

Query: 1347 ICNNLLKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRKQEMQNL 1180
            ICN+LLKSLLRS+     AF L+ KM+S  Y+LD YL   T+ L+  H    +M+N+
Sbjct: 741  ICNHLLKSLLRSKDKRMHAFGLLSKMNSQRYDLDAYLHKTTKSLLYRHWHTWKMENV 797


>ref|XP_021654749.1| pentatricopeptide repeat-containing protein At1g79540 [Hevea
            brasiliensis]
          Length = 790

 Score =  897 bits (2317), Expect = 0.0
 Identities = 440/767 (57%), Positives = 575/767 (74%), Gaps = 5/767 (0%)
 Frame = -3

Query: 3477 SSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQQDPFLCFRFFVWAA 3298
            +S +     S++V  IV ++DPMEP+LE + PFL+P ++T ++++   P L FRFF+WA+
Sbjct: 23   TSCTEKIATSSDVLPIVATVDPMEPALEPMVPFLSPGLVTSIIKNPPSPQLGFRFFIWAS 82

Query: 3297 KRKHFRSWASHNLMISMLV-GNKIDTFWSVLDDVRKCGFRVPSDAFAVLIGGYXXXXXXX 3121
            K K FRSW  H +++ ML+  N ++ +W VL+D++KC   + + AF VLI  Y       
Sbjct: 83   KYKRFRSWLFHTVILDMLIKDNGLELYWQVLNDIKKCDLSISAAAFTVLIQAYAKMGMVE 142

Query: 3120 XXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCSTYSILI 2941
                AF RMK+ DCEP++FT N +LSV+V K + LLAL +YN MLK+NC  N  TYS+LI
Sbjct: 143  KAVEAFERMKDVDCEPDIFTCNTILSVIVRKEVFLLALGIYNRMLKINCLPNSYTYSMLI 202

Query: 2940 DGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENMKSSGNK 2761
            D LCKSGK  +AL + DEM  RGI P+K+TYT++++GLC A+R DDAYRLF  MK SG +
Sbjct: 203  DVLCKSGKTHNALQMLDEMTQRGISPNKVTYTIIISGLCQAQRTDDAYRLFNTMKDSGCR 262

Query: 2760 PDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRFKEGHEM 2581
            PD +TYNALL+G CKLGR+DEA ALLK F+KDG+ L+   YS LIDGLFR ++F++    
Sbjct: 263  PDFVTYNALLDGFCKLGRVDEAMALLKFFKKDGYVLNKEGYSCLIDGLFRARKFEDAQLW 322

Query: 2580 YRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTLIKGFCD 2401
            YR+M+E +I+ DV+L+TIM++GL   G+V DA KL+ EMT+RG++PD+  YN LIKGFCD
Sbjct: 323  YRQMIEDNIKVDVILYTIMMKGLLKAGKVKDALKLLSEMTERGVVPDSHCYNALIKGFCD 382

Query: 2400 EGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGCVPSVVT 2221
             GLLDEA+SL+LE+S  D FP+T TYTILI GMCR GLVG+AQ +FNEMEK GC PSV+T
Sbjct: 383  MGLLDEAKSLRLEISNHDSFPNTCTYTILICGMCRKGLVGDAQQMFNEMEKLGCYPSVIT 442

Query: 2220 FNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVTKLCESG 2041
            FNALIDGLCK+G+L++A L+FYKMEIGRNPSLFLRL+QG+DRV+D+G LQTMV +LC SG
Sbjct: 443  FNALIDGLCKAGKLEEAQLLFYKMEIGRNPSLFLRLSQGADRVLDTGGLQTMVEQLCVSG 502

Query: 2040 STLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKGISPDSVTYG 1861
              LKAYK+L Q+AD+   P I TYNILING C+AG ++GAF L KELQLKG+SPDSVTYG
Sbjct: 503  LILKAYKILMQLADSGFAPNINTYNILINGFCKAGNINGAFKLVKELQLKGLSPDSVTYG 562

Query: 1860 TLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLWLRYLKS 1681
            TLINGL    RE+DAF +L++M  +GC PT AVY++LM WSCRRKK   AFN+WL+YL++
Sbjct: 563  TLINGLLSIKREEDAFRVLDQMFKSGCTPTTAVYKSLMTWSCRRKKVSLAFNIWLQYLQN 622

Query: 1680 LPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGRKTTEAL 1501
            +  RD +++ ++EE   K EVE+AVR LL+MD KL D + APY+IWLIG CQ  +  EAL
Sbjct: 623  VSGRDNEVVKIIEEYFEKGEVEKAVRKLLEMDFKLNDFELAPYSIWLIGLCQAGRVEEAL 682

Query: 1500 MLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPRICNNLLKSL 1321
             +  IL+E  + +TP SCV LI  L REG L  AVE+FLY ++KG+IL PRICN LLK L
Sbjct: 683  NIFFILQECKVVVTPPSCVKLIRGLCREGNLDLAVEMFLYTIEKGYILMPRICNLLLKLL 742

Query: 1320 LRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRN---HQRKQE 1192
            LRS    D AFDL+ +M+S GYNLD +L   T+FL+ +   H R ++
Sbjct: 743  LRSEDKRDHAFDLLSRMESLGYNLDAHLRPTTKFLLHSDTVHHRHEK 789


>ref|XP_016560496.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            isoform X1 [Capsicum annuum]
 ref|XP_016560497.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            isoform X1 [Capsicum annuum]
 ref|XP_016560498.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            isoform X1 [Capsicum annuum]
 ref|XP_016560499.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            isoform X1 [Capsicum annuum]
 ref|XP_016560500.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            isoform X1 [Capsicum annuum]
 gb|PHT91103.1| hypothetical protein T459_06216 [Capsicum annuum]
          Length = 788

 Score =  896 bits (2315), Expect = 0.0
 Identities = 447/767 (58%), Positives = 575/767 (74%), Gaps = 3/767 (0%)
 Frame = -3

Query: 3486 KSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVL-QDQQDPFLCFRFF 3310
            K++   SR   IS EV NI+   DP+EP+L+ +  F+ P+V++ +L Q +++  L FRFF
Sbjct: 15   KTYIPSSRGMGISNEVLNIIEKEDPIEPALDDLVDFMCPNVVSFILEQKRENRELSFRFF 74

Query: 3309 VWAAKRKHFRSWASHNLMISMLVGNK-IDTFWSVLDDVRKCGFRVPSDAFAVLIGGYXXX 3133
            +WAAKR  F SW S N++  ML+ +   D +WSVLD ++  G  + S AF  LI GY   
Sbjct: 75   IWAAKRNRFISWVSENMIEDMLLRDGGFDLYWSVLDKLKLSGIDITSRAFGTLIWGYWKV 134

Query: 3132 XXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCSTY 2953
                    AFGRM +FDC+PNLFTYN++L + V K  +LLALAVYN+MLKLN + NCST+
Sbjct: 135  NKAEKAVEAFGRMNDFDCKPNLFTYNMILHITVQKDAILLALAVYNVMLKLNSNPNCSTF 194

Query: 2952 SILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENMKS 2773
            SILIDGLCKSGK  DAL LFDEM +RG++P+KITYTV+L+GLC AKR DDA+RL   MKS
Sbjct: 195  SILIDGLCKSGKTQDALKLFDEMSERGVLPNKITYTVILSGLCQAKRTDDAHRLLNVMKS 254

Query: 2772 SGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRFKE 2593
             G KPD + YNALLNG CKLGR+DEA+ LL+ F  +G+  D+  ++ L+DG  RTKR  E
Sbjct: 255  RGCKPDFVAYNALLNGFCKLGRVDEAYTLLRSFESEGYVADIKGFTCLVDGFVRTKRIDE 314

Query: 2592 GHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTLIK 2413
               +++K+LETD+ PDVVL+T MIRGL   GRV +A  L+R+MT RG+ PDTQ YNTLIK
Sbjct: 315  AQSVFKKLLETDVVPDVVLYTTMIRGLSGAGRVKEALNLLRDMTGRGVQPDTQCYNTLIK 374

Query: 2412 GFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGCVP 2233
            GFCD  LLD+A+SL+LE+S  + FPDT TY+ILI  MCR GLV EA+NIFNEME  GC P
Sbjct: 375  GFCDMDLLDQAQSLQLEISENECFPDTCTYSILICRMCRNGLVEEARNIFNEMENLGCFP 434

Query: 2232 SVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVTKL 2053
            SVVTFN LIDGLCK+GEL++AHLMFYKMEIG+NPSLFLRL+QG+DRV+D+ SLQ MV KL
Sbjct: 435  SVVTFNTLIDGLCKAGELEEAHLMFYKMEIGKNPSLFLRLSQGADRVLDNVSLQKMVEKL 494

Query: 2052 CESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKGISPDS 1873
            CESG  LKAYKLL Q+AD  ++P + TYNILINGLC++G ++GAF LF+ELQ+KG  PDS
Sbjct: 495  CESGKILKAYKLLMQLADCGVVPNLVTYNILINGLCKSGKINGAFKLFQELQVKGHLPDS 554

Query: 1872 VTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLWLR 1693
            +TYGTLI+GLQ   RE++AF LL++M  NGC P+  VY++LM WSCRR +   AF+LWL+
Sbjct: 555  ITYGTLIDGLQRVDREEEAFKLLDQMSKNGCMPSAEVYKSLMTWSCRRGQIPIAFSLWLK 614

Query: 1692 YLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGRKT 1513
            YL++   RD+++I L+E+ L K E+E+ VR LL+MD+KL+D DS+PY IWLIG CQ RK 
Sbjct: 615  YLRNQAVRDDEVIGLIEKHLEKGELEKVVRGLLEMDLKLEDFDSSPYNIWLIGMCQERKP 674

Query: 1512 TEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPRICNNL 1333
             +AL + S+L+E+N+ ++  SCVMLI +L  EG L  AVEVFLY L++G  L PRICN L
Sbjct: 675  RDALKIFSLLEEFNVMISAPSCVMLIHSLCEEGNLDQAVEVFLYTLERGVRLMPRICNRL 734

Query: 1332 LKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRKQ 1195
            L+SLLRS+     A  L+E+M S GYNL+DYL  GTR L R   R++
Sbjct: 735  LQSLLRSQDKAQHAVGLLERMRSVGYNLNDYLHSGTRSLFRRWNRRE 781


>ref|XP_006340744.2| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Solanum tuberosum]
          Length = 775

 Score =  889 bits (2297), Expect = 0.0
 Identities = 444/761 (58%), Positives = 573/761 (75%), Gaps = 3/761 (0%)
 Frame = -3

Query: 3486 KSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQQ-DPFLCFRFF 3310
            KSFS+ SR   +S EV NI+  +DP+EP+L+++  FL P++I+ +L++++ +P L FRFF
Sbjct: 15   KSFST-SREMAVSNEVLNIIERVDPLEPALDKLVRFLCPNIISFILEEKRKNPELGFRFF 73

Query: 3309 VWAAKRKHFRSWASHNLMISMLVGNK-IDTFWSVLDDVRKCGFRVPSDAFAVLIGGYXXX 3133
            +WAAKRK F+SW   NL+  ML  +   D +W+VLD ++  G  + S+AFA LI GY   
Sbjct: 74   IWAAKRKRFQSWVPKNLIADMLAQDGGFDLYWNVLDKLKFSGIPIASNAFAALIWGYWKV 133

Query: 3132 XXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCSTY 2953
                    AFGRMK+FDC+PN++TYN++L + V K  +LLALAVYN+MLKLN   N ST+
Sbjct: 134  NKAEKAVEAFGRMKDFDCKPNIYTYNMILHIAVQKDAILLALAVYNVMLKLNSQPNSSTF 193

Query: 2952 SILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENMKS 2773
            SILIDGLCKSG+  DAL LFDEM +RG++PSKITYTV+L+GLC AKR DDAYRL   MK+
Sbjct: 194  SILIDGLCKSGRTHDALALFDEMTERGVLPSKITYTVILSGLCQAKRTDDAYRLLNVMKT 253

Query: 2772 SGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRFKE 2593
             G +PD +TYNALLNG CKLGR+DE  ALL+ F  +G+ +D+  Y+ LIDG  RTKR  E
Sbjct: 254  RGCRPDFVTYNALLNGFCKLGRVDETHALLRSFENEGYLMDIKGYTCLIDGFVRTKRIDE 313

Query: 2592 GHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTLIK 2413
               +++K+ E ++ PDVVL+T MIRGL   GRV +A  L+R+MT RG+ PDTQ YNTLIK
Sbjct: 314  AQSVFKKLFEKNVVPDVVLYTTMIRGLSGAGRVKEALSLLRDMTGRGVQPDTQCYNTLIK 373

Query: 2412 GFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGCVP 2233
            GFCD G+LD+ARSL+LE+S  D FPDT TY+I+I GMCR GLV EA++IFNEMEK GC P
Sbjct: 374  GFCDVGILDQARSLQLEISENDCFPDTYTYSIVICGMCRNGLVEEARHIFNEMEKLGCFP 433

Query: 2232 SVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVTKL 2053
            SVVTFN LIDGLCK+GEL++AHLMFYKMEIG+NPSLFLRL+QG+DRV+DS SLQ M+ KL
Sbjct: 434  SVVTFNTLIDGLCKAGELEEAHLMFYKMEIGKNPSLFLRLSQGADRVLDSVSLQKMIEKL 493

Query: 2052 CESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKGISPDS 1873
            CE+G  LKAYKLL Q+AD   +P I TYNILINGLC++GI++GA  LF+ELQ+KG  PDS
Sbjct: 494  CETGKILKAYKLLMQLADCGFVPNIVTYNILINGLCKSGIINGALKLFQELQVKGHFPDS 553

Query: 1872 VTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLWLR 1693
            +TYGTLI+GLQ  GR D++F L ++M  NGC P+  VY++LM WSCRR +   AF+LW +
Sbjct: 554  ITYGTLIDGLQRVGRVDESFKLFDQMSKNGCMPSAEVYKSLMTWSCRRGQISIAFSLWFQ 613

Query: 1692 YLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGRKT 1513
            YL++   RD ++I L+E+ L K ++E+ VR LL++D+K  D DS+PY IWLIG CQ  K 
Sbjct: 614  YLRNHAVRDGEVIGLIEKHLEKGDLEKVVRGLLEIDLKRVDFDSSPYNIWLIGMCQECKP 673

Query: 1512 TEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPRICNNL 1333
             EAL + S+L E+++ ++  SCVMLI +L  EG L  AVEVFLY L++G  L PRICN L
Sbjct: 674  HEALKIFSLLVEFHVMVSAPSCVMLIHSLCEEGNLDQAVEVFLYTLERGVRLMPRICNKL 733

Query: 1332 LKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVR 1213
            L+SLL S+     AF L+E+M S GYNLDDYL  GTR L R
Sbjct: 734  LQSLLHSQDKAHHAFGLLERMRSTGYNLDDYLHRGTRSLFR 774


>ref|XP_015065860.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            isoform X1 [Solanum pennellii]
          Length = 775

 Score =  887 bits (2293), Expect = 0.0
 Identities = 444/761 (58%), Positives = 570/761 (74%), Gaps = 3/761 (0%)
 Frame = -3

Query: 3486 KSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQQ-DPFLCFRFF 3310
            KSFS+ SR   +S EV NI+  +DP+EP+L+++  FL P++I+ +L++++ +P L FRFF
Sbjct: 15   KSFST-SREMAVSNEVLNIIDRVDPLEPALDELVRFLCPNIISFILEEKRKNPELGFRFF 73

Query: 3309 VWAAKRKHFRSWASHNLMISMLVGNK-IDTFWSVLDDVRKCGFRVPSDAFAVLIGGYXXX 3133
            +WAAKRK F+ W   NL+  ML  +   D +W+VLD ++  G  + S+AFA LI GY   
Sbjct: 74   IWAAKRKRFQRWIPKNLIADMLSKDGGFDLYWNVLDKLKFSGIPIASNAFAALIWGYWKV 133

Query: 3132 XXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCSTY 2953
                    AF RMK+FDC+PN++TYN++L + V K  +LLALAVYN+MLKLN   N ST+
Sbjct: 134  NKAEKAIEAFSRMKDFDCKPNIYTYNMILHIAVQKDAILLALAVYNVMLKLNSQPNSSTF 193

Query: 2952 SILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENMKS 2773
            SILIDGLCKSG+  DAL LFDEM +RG++PSKITYTV+L+GLC AKR DDAYRL   MK+
Sbjct: 194  SILIDGLCKSGRTHDALALFDEMTERGVLPSKITYTVILSGLCQAKRTDDAYRLLNVMKT 253

Query: 2772 SGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRFKE 2593
             G KPD +TYN LLNG CKLGR+DEA  LL+ F K+G+ +D+  Y+ LIDG  RTKR  E
Sbjct: 254  RGCKPDFVTYNTLLNGFCKLGRVDEAHVLLRSFEKEGYLMDIKGYTCLIDGFVRTKRIDE 313

Query: 2592 GHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTLIK 2413
               +++ + E ++ PDVVL+T MIRGL   GRV +A  L+R+MT RG+ PDTQ YNTLIK
Sbjct: 314  AQSIFKNLFEKNVVPDVVLYTTMIRGLSGAGRVKEALSLLRDMTGRGVQPDTQCYNTLIK 373

Query: 2412 GFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGCVP 2233
            GFCD G+LD+ARSL+LE+S  D FPDT TY+I+I GMCR GLV EA++IFNEMEK GC P
Sbjct: 374  GFCDMGVLDQARSLQLEISENDCFPDTYTYSIVICGMCRNGLVEEARHIFNEMEKLGCFP 433

Query: 2232 SVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVTKL 2053
            SVVTFN LIDGLCK+GEL++AHLMFYKMEIG+NPSLFLRL+QG+DRV+DS SLQ M+ KL
Sbjct: 434  SVVTFNTLIDGLCKAGELEEAHLMFYKMEIGKNPSLFLRLSQGADRVLDSASLQKMIEKL 493

Query: 2052 CESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKGISPDS 1873
            CE+G  LKAYKLL Q+AD   +P I TYNILINGLC++GI++GA  LF+ELQ+KG  PDS
Sbjct: 494  CETGKILKAYKLLMQLADCGFVPNIVTYNILINGLCKSGIINGALKLFQELQVKGHFPDS 553

Query: 1872 VTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLWLR 1693
            +TYGTLI+GLQ  GR D++F L ++M  NGC P+  VY++LM WSCRR +   AF+LW +
Sbjct: 554  ITYGTLIDGLQRVGRVDESFKLFDQMSKNGCMPSAEVYKSLMTWSCRRGQISIAFSLWFQ 613

Query: 1692 YLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGRKT 1513
            YL++   RD ++I L+EE L K ++E+ VR LL+ D+K  D DS+PY IWLIG CQ  K 
Sbjct: 614  YLRNHAFRDGEVIGLIEEHLEKGDLEKVVRGLLEFDLKRADFDSSPYNIWLIGMCQECKP 673

Query: 1512 TEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPRICNNL 1333
             EAL + S+L E+++ ++  SCVMLI +L  EG L  AVEVFLY L++G  L PRICN L
Sbjct: 674  HEALKIFSLLVEFHVMVSAPSCVMLIHSLCEEGNLDQAVEVFLYTLERGVRLMPRICNKL 733

Query: 1332 LKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVR 1213
            L+SLLRS+     AF L+E+M S GYNLDDYL  GTR L R
Sbjct: 734  LQSLLRSQDKAQHAFGLLERMRSTGYNLDDYLHRGTRSLFR 774


>ref|XP_021276401.1| pentatricopeptide repeat-containing protein At1g79540 [Herrania
            umbratica]
          Length = 800

 Score =  886 bits (2290), Expect = 0.0
 Identities = 449/781 (57%), Positives = 576/781 (73%), Gaps = 5/781 (0%)
 Frame = -3

Query: 3507 TSLFPKPKSFSSLSRVT--TISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQQD 3334
            TS F  P +FSS S +   ++S E+ +I+ +++PMEP+LE + PFL+  ++T ++QDQ +
Sbjct: 18   TSKFLSP-NFSSFSCLQDFSVSNEIHSILDTVNPMEPALEPLLPFLSSGIVTSIIQDQPN 76

Query: 3333 PFLCFRFFVWAAKRKHFRSWASHNLMISMLV--GNKIDTFWSVLDDVRKCGFRVPSDAFA 3160
            P L FRFF+WA +RK  R  AS  L++ ML+   N  D +W  L++++KCG  + SDAF 
Sbjct: 77   PQLGFRFFIWAIQRKRLRCSASDKLVVDMLLRKDNAFDMYWQTLEEIKKCGVLIVSDAFK 136

Query: 3159 VLIGGYXXXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKL 2980
            VLI GY            FG+MK+FDC+P++FTYN +L V++ K ++LLALAVYN MLK 
Sbjct: 137  VLISGYSKLGLDEKAVECFGKMKDFDCKPDVFTYNTILYVMIRKKVLLLALAVYNQMLKN 196

Query: 2979 NCHLNCSTYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDA 2800
            N   N +T+ ILIDGLCK+GK  DAL +FDEM  RGI P++ +YT++++GLC A R DDA
Sbjct: 197  NYKPNRATFGILIDGLCKNGKTEDALNMFDEMTRRGIEPNRSSYTIIISGLCQADRADDA 256

Query: 2799 YRLFENMKSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDG 2620
             RL   MK SG  PD + YNALLNG C+LGR+DEAFALL+ F+KDG+ L L  YSS I+G
Sbjct: 257  CRLLNKMKGSGCSPDFVAYNALLNGFCQLGRVDEAFALLRSFQKDGYVLGLRGYSSFING 316

Query: 2619 LFRTKRFKEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPD 2440
            LFR +RFKE +  Y KM E +++PDVVL+ IM+RGL   GRV DA KL+ EMT+RGL+PD
Sbjct: 317  LFRARRFKEAYAWYTKMSEENVKPDVVLYAIMLRGLSVAGRVEDAMKLLSEMTERGLVPD 376

Query: 2439 TQAYNTLIKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFN 2260
            T  YN +IKGFCD GLLD+A SL+LE+S  D FP+  TYTILISGMC+ GLV EAQ IFN
Sbjct: 377  TYCYNAVIKGFCDIGLLDQAWSLQLEISSYDCFPNACTYTILISGMCQNGLVREAQQIFN 436

Query: 2259 EMEKAGCVPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSG 2080
            EMEK GC PSVVTFNALIDGL K+G+L+KAHL+FYKMEIGRNPSLFLRL+ GS  V+DS 
Sbjct: 437  EMEKLGCFPSVVTFNALIDGLSKAGQLEKAHLLFYKMEIGRNPSLFLRLSHGSSGVLDSS 496

Query: 2079 SLQTMVTKLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKEL 1900
            SLQTMV ++ ESG  LKAY++L Q+AD   +P I TYNILI+G C+AG ++GAF +FKEL
Sbjct: 497  SLQTMVEQIYESGRILKAYRILMQLADGGNVPDIFTYNILIHGFCKAGNINGAFKVFKEL 556

Query: 1899 QLKGISPDSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKT 1720
            Q+KG+SPDSVTYGTLING Q+AGRE+DAF + ++MV NGCKP+ AVYR+LM WSCRR+K 
Sbjct: 557  QIKGLSPDSVTYGTLINGFQMAGREEDAFRIFDQMVKNGCKPSVAVYRSLMTWSCRRRKV 616

Query: 1719 FAAFNLWLRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWL 1540
              AFNLWL YL+SLP R + +I  VE+   + +VERAVR LL+MD KL +   APYTIWL
Sbjct: 617  SLAFNLWLMYLRSLPGRQDTVIKEVEKYFDEGQVERAVRGLLEMDFKLNNFSVAPYTIWL 676

Query: 1539 IGFCQGRKTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFI 1360
            IG C   +  EAL +  IL+E N+ ++P SCVMLI  L  EG L  AV+VFLY L++GF 
Sbjct: 677  IGLCHAGRVEEALKIFYILEECNVIVSPPSCVMLIGGLCEEGNLDLAVDVFLYTLEQGFK 736

Query: 1359 LKPRICNNLLKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRKQEMQN 1183
            L PRICN+LL+SLLRS+     AF L+ KM S  Y+LD YL   T+ L+  H    +M+N
Sbjct: 737  LMPRICNHLLRSLLRSKDKRMHAFGLLSKMKSQRYDLDAYLHQTTKSLLYRHWDTWKMEN 796

Query: 1182 L 1180
            +
Sbjct: 797  V 797


>ref|XP_010316424.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Solanum lycopersicum]
 ref|XP_010316425.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Solanum lycopersicum]
          Length = 775

 Score =  884 bits (2285), Expect = 0.0
 Identities = 443/761 (58%), Positives = 569/761 (74%), Gaps = 3/761 (0%)
 Frame = -3

Query: 3486 KSFSSLSRVTTISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQQ-DPFLCFRFF 3310
            KSFS+ SR   +S EV NI+  +DP+EP+L+++  FL PD+I+ +L++++ +P L FRFF
Sbjct: 15   KSFST-SREMAVSNEVLNIIDRVDPLEPALDELVRFLCPDIISFILEEKRKNPELGFRFF 73

Query: 3309 VWAAKRKHFRSWASHNLMISMLVGNK-IDTFWSVLDDVRKCGFRVPSDAFAVLIGGYXXX 3133
            +WAAKRK F+ W   NL+  ML  +   D +W+VLD ++  G  + S+AFA LI GY   
Sbjct: 74   IWAAKRKRFQRWIPKNLIADMLSKDGGFDLYWNVLDKLKFSGIPIASNAFAALIWGYWKV 133

Query: 3132 XXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNCSTY 2953
                    AF RMK+FDC+PN++TYN++L + V K  +LLALAVYN+MLKLN   N ST+
Sbjct: 134  NKAEKAIEAFSRMKDFDCKPNIYTYNMILHIAVQKDAILLALAVYNVMLKLNSQPNSSTF 193

Query: 2952 SILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLFENMKS 2773
            SILIDGLCKSG+  DAL LFDEM +RG++PSKITYTV+L+GLC AKR DDAYRL   MK+
Sbjct: 194  SILIDGLCKSGRTHDALALFDEMTERGVLPSKITYTVILSGLCQAKRTDDAYRLLNVMKT 253

Query: 2772 SGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKRFKE 2593
             G KPD +TYNALLNG CKLGR+DEA  LL+ F  +G+ +D+  Y+ LIDG  RTKR  E
Sbjct: 254  RGCKPDFVTYNALLNGFCKLGRVDEAHVLLRSFENEGYLMDIKGYTCLIDGFVRTKRIDE 313

Query: 2592 GHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNTLIK 2413
               +++ + E ++ PDVVL+T MIRGL   GRV +A  L+R+MT RG+ PDTQ YNTLIK
Sbjct: 314  AQSVFKNLFEKNVVPDVVLYTTMIRGLSGAGRVKEALSLLRDMTGRGVQPDTQCYNTLIK 373

Query: 2412 GFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEKAGCVP 2233
            GFCD G+LD+ARSL+LE+S  D FPDT TY+I+I GMCR GLV EA++IFNEMEK GC P
Sbjct: 374  GFCDMGVLDQARSLQLEISENDCFPDTYTYSIVICGMCRNGLVEEARHIFNEMEKLGCFP 433

Query: 2232 SVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMVTKL 2053
            SVVTFN LIDGLCK+GEL++AHLMFYKMEIG+NPSLFLRL+QG+DRV+DS SLQ M+ KL
Sbjct: 434  SVVTFNTLIDGLCKAGELEEAHLMFYKMEIGKNPSLFLRLSQGADRVLDSVSLQKMIEKL 493

Query: 2052 CESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKGISPDS 1873
            CE+G   KAYKLL Q+AD   +P I TYNILINGLC++G+++GA  LF+ELQ+KG  PDS
Sbjct: 494  CETGKIHKAYKLLMQLADCGFVPNIVTYNILINGLCKSGLINGALKLFQELQVKGHFPDS 553

Query: 1872 VTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNLWLR 1693
            +TYGTLI+GLQ  GR D++F L ++M  NGC P+  VY++LM WSCRR +   AF+LW +
Sbjct: 554  ITYGTLIDGLQRVGRVDESFKLFDQMSKNGCMPSAEVYKSLMTWSCRRGQISIAFSLWFQ 613

Query: 1692 YLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQGRKT 1513
            YL++   RD ++I L+EE L K ++E+ VR LL+ D+K  D DS+PY IWLIG CQ  K 
Sbjct: 614  YLRNHAFRDGEVIGLIEEHLEKGDLEKVVRGLLEFDLKRADFDSSPYNIWLIGMCQECKP 673

Query: 1512 TEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPRICNNL 1333
             EAL + S+L E+++ ++  SCVMLI +L  EG L  AVEVFLY L++G  L PRICN L
Sbjct: 674  HEALKIFSLLVEFDVMVSAPSCVMLIHSLCEEGNLDQAVEVFLYTLERGVRLMPRICNKL 733

Query: 1332 LKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVR 1213
            L+SLLRS+     AF L+E+M S GYNLDDYL  GTR L R
Sbjct: 734  LQSLLRSQDKAQHAFGLLERMRSTGYNLDDYLHRGTRSLFR 774


>ref|XP_022755453.1| pentatricopeptide repeat-containing protein At1g79540 [Durio
            zibethinus]
          Length = 800

 Score =  885 bits (2286), Expect = 0.0
 Identities = 444/780 (56%), Positives = 575/780 (73%), Gaps = 4/780 (0%)
 Frame = -3

Query: 3507 TSLFPKPKSFSSLSRVT-TISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQQDP 3331
            T+ F      SS+S +  ++S E+ +I+ +++PMEP+LE + PFL+PD++T ++QDQ +P
Sbjct: 18   TAKFHSTNLSSSVSLLDFSVSNEIHSILDTVNPMEPALEPLLPFLSPDIVTSIIQDQPNP 77

Query: 3330 FLCFRFFVWAAKRKHFRSWASHNLMISMLV--GNKIDTFWSVLDDVRKCGFRVPSDAFAV 3157
             L FRFF+WA +RK  RS+AS  L++ ML+   N  D +W  LD+ +KCG  +  D F +
Sbjct: 78   QLGFRFFIWATQRKRLRSYASVKLVLDMLLRKDNAFDIYWQTLDEAKKCGVLIAPDVFQI 137

Query: 3156 LIGGYXXXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLN 2977
            LI GY            FG+MK+FDC+P++FTYN +L V++ K ++LLALAVYN MLK N
Sbjct: 138  LISGYSKKDLDEKAVECFGQMKDFDCKPDVFTYNAILYVMIRKKVLLLALAVYNQMLKFN 197

Query: 2976 CHLNCSTYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAY 2797
               N +T+SILI+GLCK+GK   AL +FDEM  R + P++  Y +V++GLC + R DDA 
Sbjct: 198  YKPNRATFSILINGLCKNGKTDYALKMFDEMTMRDVEPNRCIYAIVISGLCRSDRADDAC 257

Query: 2796 RLFENMKSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGL 2617
            RL   MK SG  PD + YNALLNG C+LGR+DEAFALL  F+KDGF L L  YSSLI+GL
Sbjct: 258  RLLIKMKESGCSPDFVAYNALLNGFCELGRVDEAFALLLSFQKDGFVLGLRGYSSLINGL 317

Query: 2616 FRTKRFKEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDT 2437
            F+ +R+KE +E Y KM E +++PDVVL+ IM+RGL + G+V DA KL+ EMT+RGL+PDT
Sbjct: 318  FKARRYKEAYEWYTKMFEDNVKPDVVLYAIMLRGLSEAGKVEDAMKLLTEMTERGLVPDT 377

Query: 2436 QAYNTLIKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNE 2257
              YN +IKGFCD GLLD+ARSL+LE+S  D FP+T TYTILISGMCR GLVGEAQ +FNE
Sbjct: 378  YCYNAVIKGFCDIGLLDQARSLQLEISSHDCFPNTCTYTILISGMCRNGLVGEAQQMFNE 437

Query: 2256 MEKAGCVPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGS 2077
            MEK GC PSVVTFNALIDGL K+G+L+KAHL+FYKMEIGRNPSLFLRL+ GS RV+DS S
Sbjct: 438  MEKLGCFPSVVTFNALIDGLSKAGQLEKAHLLFYKMEIGRNPSLFLRLSHGSTRVLDSSS 497

Query: 2076 LQTMVTKLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQ 1897
            LQTMV +L ESG  LKAY++L Q+AD   +P I TYNILI+G C++G ++GAF LFKELQ
Sbjct: 498  LQTMVEQLYESGKILKAYRILMQLADGGNVPDIFTYNILIHGFCKSGNINGAFKLFKELQ 557

Query: 1896 LKGISPDSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTF 1717
            LKG+SPDSVTYGTLING Q+AGRE+DAF + ++M  NGCKP+ A+YR+LM WS RR K  
Sbjct: 558  LKGLSPDSVTYGTLINGFQMAGREEDAFRIFDQMEKNGCKPSMAIYRSLMTWSSRRGKVS 617

Query: 1716 AAFNLWLRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLI 1537
             AFNLWL YL SLP R   +I  +E+   + EVE+AVR LL+MD KL +   APYTIWLI
Sbjct: 618  LAFNLWLNYLSSLPGRQATVIKEIEKHFVEGEVEKAVRGLLEMDFKLNNFSLAPYTIWLI 677

Query: 1536 GFCQGRKTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFIL 1357
            GFCQ  +  EA+ + S+L+E N+ ++P SCV LI  L +E  L  AV+VFLY L+KGF L
Sbjct: 678  GFCQAGRVEEAVKIFSVLEESNVIVSPPSCVSLIVGLCKERNLDLAVDVFLYTLEKGFKL 737

Query: 1356 KPRICNNLLKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRKQEMQNL 1180
             PR+CN LL  LL S+     AFDL+ KM+S GY+LD YLD  T+ L+   +   EM+++
Sbjct: 738  MPRVCNYLLGCLLHSKDKRMHAFDLLSKMNSQGYDLDAYLDETTKSLLYRQRHTWEMESV 797


>ref|XP_002529510.1| PREDICTED: pentatricopeptide repeat-containing protein At1g79540
            [Ricinus communis]
 gb|EEF32879.1| pentatricopeptide repeat-containing protein, putative [Ricinus
            communis]
          Length = 804

 Score =  882 bits (2278), Expect = 0.0
 Identities = 434/764 (56%), Positives = 561/764 (73%), Gaps = 3/764 (0%)
 Frame = -3

Query: 3495 PKPKSFSSLSRVT-TISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQQDPFLCF 3319
            P  + F + S V   IS EV  I+ S++P+EP+LE   PFL+P ++T+++++  +  L F
Sbjct: 17   PWKQHFHTYSAVDFAISNEVLTIIDSVNPIEPALESKVPFLSPSIVTYIIKNPPNSLLGF 76

Query: 3318 RFFVWAAKRKHFRSWASHNLMISMLV-GNKIDTFWSVLDDVRKCGFRVPSDAFAVLIGGY 3142
            RFF+WA+K +  RSW SHN++I ML+  N  + +W VL ++++CGF + +DAF VLI  Y
Sbjct: 77   RFFIWASKFRRLRSWVSHNMIIDMLIKDNGFELYWQVLKEIKRCGFSISADAFTVLIQAY 136

Query: 3141 XXXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLNCHLNC 2962
                       +F  MK+FDC+P++FTYN VL V+V K +VLLAL +YN MLKLNC  N 
Sbjct: 137  AKMDMIEKAVESFEMMKDFDCKPDVFTYNTVLHVMVRKEVVLLALGIYNRMLKLNCLPNI 196

Query: 2961 STYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAYRLFEN 2782
            +T+SILIDG+CKSGK  +AL +FDEM  R I+P+KITYT++++GLC A++ D AYRLF  
Sbjct: 197  ATFSILIDGMCKSGKTQNALQMFDEMTQRRILPNKITYTIIISGLCQAQKADVAYRLFIA 256

Query: 2781 MKSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGLFRTKR 2602
            MK  G  PD +TYNALL+G CKLGR+DEA  LLK F KD + LD   YS LIDGLFR +R
Sbjct: 257  MKDHGCIPDSVTYNALLHGFCKLGRVDEALGLLKYFEKDRYVLDKQGYSCLIDGLFRARR 316

Query: 2601 FKEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDTQAYNT 2422
            F++    YRKM E +I+PDV+L+TIM++GL   G+  DA +L+ EMT+RGL+PDT  YN 
Sbjct: 317  FEDAQVWYRKMTEHNIKPDVILYTIMMKGLSKAGKFKDALRLLNEMTERGLVPDTHCYNA 376

Query: 2421 LIKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNEMEKAG 2242
            LIKG+CD GLLDEA+SL LE+S  D F    TYTILI GMCR GLVG+AQ IFNEMEK G
Sbjct: 377  LIKGYCDLGLLDEAKSLHLEISKNDCFSSACTYTILICGMCRSGLVGDAQQIFNEMEKHG 436

Query: 2241 CVPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGSLQTMV 2062
            C PSVVTFNALIDG CK+G ++KA L+FYKMEIGRNPSLFLRL+QG++RV+D+ SLQTMV
Sbjct: 437  CYPSVVTFNALIDGFCKAGNIEKAQLLFYKMEIGRNPSLFLRLSQGANRVLDTASLQTMV 496

Query: 2061 TKLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQLKGIS 1882
             +LC+SG  LKAY +L Q+ D+   P I TYNILI+G C+AG ++GAF LFKELQLKG+S
Sbjct: 497  EQLCDSGLILKAYNILMQLTDSGFAPNIITYNILIHGFCKAGNINGAFKLFKELQLKGLS 556

Query: 1881 PDSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTFAAFNL 1702
            PDSVTYGTLINGL  A RE+DAF +L++++ NGC P   VY++ M WSCRR K   AF+L
Sbjct: 557  PDSVTYGTLINGLLSANREEDAFTVLDQILKNGCTPITEVYKSFMTWSCRRNKITLAFSL 616

Query: 1701 WLRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLIGFCQG 1522
            WL+YL+S+P RD +++  VEE   K EVE AVR LL+MD KL D   APYTIWLIG CQ 
Sbjct: 617  WLKYLRSIPGRDSEVLKSVEENFEKGEVEEAVRGLLEMDFKLNDFQLAPYTIWLIGLCQA 676

Query: 1521 RKTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFILKPRIC 1342
             +  EAL +   L+E+N+ +TP SCV LI  L + G L  A E+FLY + KG++L PRIC
Sbjct: 677  GRLEEALKIFFTLEEHNVLVTPPSCVKLIYRLLKVGNLDLAAEIFLYTIDKGYMLMPRIC 736

Query: 1341 NNLLKSLLRSR-YIDDAFDLMEKMDSCGYNLDDYLDVGTRFLVR 1213
            N LLKSLLRS    + AFDL+ +M S GY+LD +L   T+FL++
Sbjct: 737  NRLLKSLLRSEDKRNRAFDLLSRMKSLGYDLDSHLHQTTKFLLQ 780


>gb|OMP06484.1| hypothetical protein COLO4_08106 [Corchorus olitorius]
          Length = 799

 Score =  881 bits (2277), Expect = 0.0
 Identities = 444/780 (56%), Positives = 571/780 (73%), Gaps = 4/780 (0%)
 Frame = -3

Query: 3507 TSLFPKPKSFSSLSRVT-TISTEVSNIVHSIDPMEPSLEQVAPFLTPDVITHVLQDQQDP 3331
            TS F  P  F S S    T++ EV +I+HS++PMEP+LE + PFL+PDV+T +++DQ +P
Sbjct: 17   TSKFQSPNLFPSSSHQDYTVAHEVHSILHSVNPMEPALEPLLPFLSPDVVTSIIRDQPNP 76

Query: 3330 FLCFRFFVWAAKRKHFRSWASHNLMISMLV--GNKIDTFWSVLDDVRKCGFRVPSDAFAV 3157
             L FRFF+WA +R+  RS AS  L+  ML+   N  D +W  L++ ++CG  +  D F V
Sbjct: 77   QLGFRFFMWAIQRERLRSSASDKLVFDMLLRKDNAFDMYWQTLEEAKQCGVLIVPDVFRV 136

Query: 3156 LIGGYXXXXXXXXXXXAFGRMKEFDCEPNLFTYNLVLSVLVGKGMVLLALAVYNLMLKLN 2977
            LI GY            FG+MK+FDC+P++FTYN +L VLV K ++LLALAVYN +LK N
Sbjct: 137  LISGYAKMGLDEKAVECFGKMKDFDCQPDVFTYNAILYVLVRKKVLLLALAVYNKLLKSN 196

Query: 2976 CHLNCSTYSILIDGLCKSGKVADALVLFDEMMDRGIVPSKITYTVVLTGLCNAKRIDDAY 2797
            C  +  T+SILIDGLCKSGK  DAL + DEM  RGI P+   YT++++GLC A R DDA 
Sbjct: 197  CKPSIGTFSILIDGLCKSGKTEDALNMLDEMTQRGIEPNTYVYTIIISGLCRADRADDAC 256

Query: 2796 RLFENMKSSGNKPDGITYNALLNGVCKLGRMDEAFALLKDFRKDGFDLDLNSYSSLIDGL 2617
            RL   MK  G  PD +TYNALLNG C+LGR+DEA ALL+ FRKDGF L L  YSS I+  
Sbjct: 257  RLLTKMKDCGCPPDFVTYNALLNGFCELGRVDEALALLRSFRKDGFVLGLRGYSSFINSF 316

Query: 2616 FRTKRFKEGHEMYRKMLETDIQPDVVLHTIMIRGLCDEGRVHDAFKLVREMTDRGLIPDT 2437
            FR  R+K+ +  Y KM E +IQPD+VL+ IM++ L + G+V DA KL+ EMT+RGL+PDT
Sbjct: 317  FRAGRYKDAYAWYTKMFEENIQPDIVLYGIMLQRLSEVGKVEDALKLLSEMTERGLVPDT 376

Query: 2436 QAYNTLIKGFCDEGLLDEARSLKLEVSGVDEFPDTSTYTILISGMCRYGLVGEAQNIFNE 2257
              YN +IKGFCD GLLDEA+SL+LE+S  D FP+  TYTILISGMCR GLV EAQ IF E
Sbjct: 377  HCYNAVIKGFCDIGLLDEAQSLQLEISSHDCFPNAYTYTILISGMCRNGLVVEAQQIFKE 436

Query: 2256 MEKAGCVPSVVTFNALIDGLCKSGELQKAHLMFYKMEIGRNPSLFLRLTQGSDRVVDSGS 2077
            MEK GC PSVVTFN LIDGL K+GEL++AHL+F KMEIGRNPS+FLRL+ GS RV+DS S
Sbjct: 437  MEKLGCSPSVVTFNTLIDGLSKAGELEEAHLLFCKMEIGRNPSVFLRLSHGSTRVLDSSS 496

Query: 2076 LQTMVTKLCESGSTLKAYKLLTQIADTAIMPTITTYNILINGLCEAGILDGAFNLFKELQ 1897
            L+TMV +L ESG  LKAY++L Q+ D   +P I TYNILI+G C+AG ++GAF LFKELQ
Sbjct: 497  LKTMVDQLYESGRILKAYRILMQLVDGGNVPDIFTYNILIHGFCKAGNMNGAFKLFKELQ 556

Query: 1896 LKGISPDSVTYGTLINGLQIAGREDDAFLLLEEMVSNGCKPTPAVYRTLMKWSCRRKKTF 1717
            LKG+SPDSVTYGTLING Q+AGR++D+F + ++M  NGCK + AVYR+LM WSCRR+K  
Sbjct: 557  LKGLSPDSVTYGTLINGFQMAGRDEDSFSMFDQMEKNGCKSSVAVYRSLMTWSCRRRKVS 616

Query: 1716 AAFNLWLRYLKSLPKRDEKIITLVEECLRKDEVERAVRLLLDMDIKLQDLDSAPYTIWLI 1537
             AFNLWL+YL+S+P  ++ +I  VE+   + EVE+A+R LL+MD KL     APYTIWLI
Sbjct: 617  FAFNLWLKYLRSVPGCEDTVINEVEKHFDEGEVEKAIRGLLEMDFKLSKFSLAPYTIWLI 676

Query: 1536 GFCQGRKTTEALMLLSILKEYNISLTPASCVMLISTLYREGKLSFAVEVFLYALQKGFIL 1357
            G CQ R+  EAL + ++L+E  + ++P SCVMLI+ L +EG L  AV+VFLY L+KGF L
Sbjct: 677  GLCQARRVEEALKIFNVLEECKVIVSPPSCVMLIAGLCKEGNLDAAVDVFLYTLEKGFKL 736

Query: 1356 KPRICNNLLKSLLRSRYID-DAFDLMEKMDSCGYNLDDYLDVGTRFLVRNHQRKQEMQNL 1180
             PR+CN LLKSLLRS+     AFDL+ KM+S  Y+LD YLD  T++L+  +QR  E +N+
Sbjct: 737  MPRVCNYLLKSLLRSKDKKMHAFDLLGKMESQRYDLDAYLDKTTKYLLYGNQRTWEKENV 796


Top