BLASTX nr result

ID: Astragalus22_contig00000147 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00000147
         (1488 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004501623.1| PREDICTED: pentatricopeptide repeat-containi...   728   0.0  
dbj|GAU23693.1| hypothetical protein TSUD_304680 [Trifolium subt...   714   0.0  
ref|XP_019438511.1| PREDICTED: pentatricopeptide repeat-containi...   710   0.0  
ref|XP_003602939.2| PPR containing plant-like protein [Medicago ...   702   0.0  
gb|KHN09023.1| Pentatricopeptide repeat-containing protein [Glyc...   677   0.0  
ref|XP_003527867.1| PREDICTED: pentatricopeptide repeat-containi...   677   0.0  
ref|XP_016163692.1| pentatricopeptide repeat-containing protein ...   671   0.0  
ref|XP_015934861.1| pentatricopeptide repeat-containing protein ...   670   0.0  
ref|XP_017421782.1| PREDICTED: pentatricopeptide repeat-containi...   658   0.0  
ref|XP_020211825.1| pentatricopeptide repeat-containing protein ...   656   0.0  
ref|XP_014502211.1| pentatricopeptide repeat-containing protein ...   653   0.0  
gb|PNY15267.1| pentatricopeptide repeat-containing protein at5g1...   650   0.0  
ref|XP_007137642.1| hypothetical protein PHAVU_009G143500g, part...   648   0.0  
gb|POF09155.1| pentatricopeptide repeat-containing protein [Quer...   613   0.0  
ref|XP_023892775.1| pentatricopeptide repeat-containing protein ...   613   0.0  
gb|OMO77960.1| hypothetical protein COLO4_24918 [Corchorus olito...   607   0.0  
ref|XP_021808283.1| pentatricopeptide repeat-containing protein ...   600   0.0  
gb|OMO98439.1| hypothetical protein CCACVL1_04225 [Corchorus cap...   598   0.0  
ref|XP_008219391.1| PREDICTED: pentatricopeptide repeat-containi...   597   0.0  
ref|XP_018812757.1| PREDICTED: pentatricopeptide repeat-containi...   596   0.0  

>ref|XP_004501623.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475
            isoform X1 [Cicer arietinum]
 ref|XP_004501624.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475
            isoform X1 [Cicer arietinum]
 ref|XP_012571621.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475
            isoform X1 [Cicer arietinum]
 ref|XP_012571623.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475
            isoform X1 [Cicer arietinum]
          Length = 510

 Score =  728 bits (1880), Expect = 0.0
 Identities = 359/411 (87%), Positives = 380/411 (92%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQFKKFQA+DRVLHQMTYETC+FHEGIFINLMKHYSK S HEKVL  FFSIQPIVREKP
Sbjct: 100  LAQFKKFQAVDRVLHQMTYETCQFHEGIFINLMKHYSKCSFHEKVLDAFFSIQPIVREKP 159

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA STCLNLL+DSNQVDLARQLLLHAKRSLI KPNVCIFNILVK+HC+NGD+ESAFEV
Sbjct: 160  SPKAISTCLNLLVDSNQVDLARQLLLHAKRSLIYKPNVCIFNILVKYHCRNGDIESAFEV 219

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            V+EMR SK SYPN+ITYST+MDGLCRNGRLKEAFELFEEMVS+DRIVPDPLTYNVLINGF
Sbjct: 220  VEEMRKSKYSYPNVITYSTMMDGLCRNGRLKEAFELFEEMVSKDRIVPDPLTYNVLINGF 279

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+PDRARNVIEFMK+NGC PNVFNYSAL+DGLCK GKLQDAK VFAEMKSSGLKPDT
Sbjct: 280  CRGGKPDRARNVIEFMKSNGCCPNVFNYSALVDGLCKVGKLQDAKGVFAEMKSSGLKPDT 339

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            +TYTSLINFFCRN +IDEAIELLKEMKENECQADTV FNVILGG+CREGRFEEALDMIEK
Sbjct: 340  VTYTSLINFFCRNRKIDEAIELLKEMKENECQADTVAFNVILGGMCREGRFEEALDMIEK 399

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP QGVYLNKGSYRIVLNSLTQK +L++A KLL+LML RGFLPHYATSNELLIS CKEGM
Sbjct: 400  LPQQGVYLNKGSYRIVLNSLTQKCELRKAKKLLELMLSRGFLPHYATSNELLISFCKEGM 459

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LFDLV+MGFQP LD WELLIELICRDRKLL VFELLDELV  NS
Sbjct: 460  VDDAAAALFDLVEMGFQPPLDCWELLIELICRDRKLLYVFELLDELVTANS 510


>dbj|GAU23693.1| hypothetical protein TSUD_304680 [Trifolium subterraneum]
          Length = 512

 Score =  714 bits (1843), Expect = 0.0
 Identities = 349/411 (84%), Positives = 376/411 (91%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQFKKFQA++RVLHQMTYETCKFHEG+FINLMKHYSK   HEKVL TF SIQ IVREKP
Sbjct: 102  LAQFKKFQAVERVLHQMTYETCKFHEGVFINLMKHYSKCCFHEKVLDTFLSIQTIVREKP 161

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA S+CLNLL+DSN++DLAR+LLLHAKRSL  KPNVCIFNILVK+HCKNGDLESAFEV
Sbjct: 162  SPKAISSCLNLLVDSNRIDLARKLLLHAKRSLTYKPNVCIFNILVKYHCKNGDLESAFEV 221

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEMR SK SYPN+ITYSTLMDGLCRNG+LKEAFELFEEMVS+DRI+PDPLTYNVLINGF
Sbjct: 222  VKEMRNSKYSYPNVITYSTLMDGLCRNGKLKEAFELFEEMVSKDRIMPDPLTYNVLINGF 281

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+PDRARNVIEFMKNNGC PNVFNYSAL+DGLCKAGK QDAK VFAEMKSSGLKPDT
Sbjct: 282  CRGGKPDRARNVIEFMKNNGCCPNVFNYSALVDGLCKAGKPQDAKEVFAEMKSSGLKPDT 341

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            +TYTSLINFFCRNGQIDEAIELLKEMKEN+C+ADTVTFNV+LGGLCREGRF EALDMIEK
Sbjct: 342  VTYTSLINFFCRNGQIDEAIELLKEMKENKCEADTVTFNVMLGGLCREGRFYEALDMIEK 401

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LPHQG+YLNKGSYRIVLNSLTQK +L++A  LL LML RGF+PHYATSNELL+  CKEGM
Sbjct: 402  LPHQGIYLNKGSYRIVLNSLTQKCELRKAKNLLGLMLTRGFVPHYATSNELLVRFCKEGM 461

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LFDLVDMGFQP+ D WEL+IELICRDRKLL VFELLDELV  NS
Sbjct: 462  VDDAAAALFDLVDMGFQPQPDCWELIIELICRDRKLLYVFELLDELVTANS 512


>ref|XP_019438511.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475
            [Lupinus angustifolius]
 gb|OIW14551.1| hypothetical protein TanjilG_14937 [Lupinus angustifolius]
          Length = 511

 Score =  710 bits (1833), Expect = 0.0
 Identities = 346/411 (84%), Positives = 377/411 (91%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQ KKFQA+DRVLHQMTYE CKFHEGIF+NLMKH+SKSS++EKVL TFFSI+PIVREKP
Sbjct: 101  LAQSKKFQAVDRVLHQMTYEACKFHEGIFVNLMKHFSKSSMYEKVLQTFFSIKPIVREKP 160

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA STCLNLL+DSNQVDL RQLLLHAKRSL +KPNVC+FNILVK+HCKNGDL+SAFEV
Sbjct: 161  SPKAISTCLNLLVDSNQVDLVRQLLLHAKRSLTHKPNVCVFNILVKYHCKNGDLDSAFEV 220

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            V+EMR SK SYPN+ITYSTLMDGLC+NGRLKEAFE+FEEMVS+DRIVPDP+TYNVLINGF
Sbjct: 221  VEEMRNSKFSYPNLITYSTLMDGLCQNGRLKEAFEIFEEMVSKDRIVPDPMTYNVLINGF 280

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            C  G+PDRARNVIEFMKNNGCRPNV NYSAL++GLCK GKLQDAK VFAEM+SSGLKPDT
Sbjct: 281  CCGGKPDRARNVIEFMKNNGCRPNVINYSALVNGLCKVGKLQDAKEVFAEMRSSGLKPDT 340

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            + YTSLIN+ CRNG+IDEAIELLKEMKENECQ DTVTFNVILGGLCREGRFEEALDM+E 
Sbjct: 341  VCYTSLINYVCRNGKIDEAIELLKEMKENECQPDTVTFNVILGGLCREGRFEEALDMVEN 400

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LPH+GVYLNKGSYRIVLNSLTQ  DL +A KLL LMLGRGFLPHYATSNELL+SLCK GM
Sbjct: 401  LPHEGVYLNKGSYRIVLNSLTQNCDLNKAKKLLGLMLGRGFLPHYATSNELLVSLCKAGM 460

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LF LV+MGFQP  DSWELLIELICR+RKLL VFELLDELV+TNS
Sbjct: 461  ADDAAMALFGLVEMGFQPGPDSWELLIELICRERKLLYVFELLDELVITNS 511


>ref|XP_003602939.2| PPR containing plant-like protein [Medicago truncatula]
 gb|AES73190.2| PPR containing plant-like protein [Medicago truncatula]
          Length = 550

 Score =  702 bits (1812), Expect = 0.0
 Identities = 346/411 (84%), Positives = 372/411 (90%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQFKKFQA+DRVLHQMTYE CKFHEG+FINLMKHYSK   HEKV   F SIQ IVREKP
Sbjct: 140  LAQFKKFQAVDRVLHQMTYEACKFHEGVFINLMKHYSKCGFHEKVFDAFLSIQTIVREKP 199

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA S+CLNLL+DSNQVDL R+LLL+AKRSL+ KPNVCIFNILVK+HC+ GD++SAFEV
Sbjct: 200  SPKAISSCLNLLVDSNQVDLVRKLLLYAKRSLVYKPNVCIFNILVKYHCRRGDIDSAFEV 259

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEMR SK SYPN+ITYSTLMDGLCRNGRLKEAFELFEEMVS+D+IVPDPLTYNVLINGF
Sbjct: 260  VKEMRNSKYSYPNVITYSTLMDGLCRNGRLKEAFELFEEMVSKDQIVPDPLTYNVLINGF 319

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+ DRARNVIEFMKNNGC PNVFNYSAL+DGLCKAGKLQDAK V AEMKSSGLKPD 
Sbjct: 320  CREGKADRARNVIEFMKNNGCCPNVFNYSALVDGLCKAGKLQDAKGVLAEMKSSGLKPDA 379

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            ITYTSLINFF RNGQIDEAIELL EMKEN+CQADTVTFNVILGGLCREGRF+EALDMIEK
Sbjct: 380  ITYTSLINFFSRNGQIDEAIELLTEMKENDCQADTVTFNVILGGLCREGRFDEALDMIEK 439

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP QGVYLNKGSYRIVLNSLTQ  +L++ANKLL LML RGF+PHYATSNELL+ LCKEGM
Sbjct: 440  LPQQGVYLNKGSYRIVLNSLTQNCELRKANKLLGLMLSRGFVPHYATSNELLVRLCKEGM 499

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LFDLVDMGFQP+ DSWELLI+LICRDRKLL VFELLDELV +NS
Sbjct: 500  ANDAATALFDLVDMGFQPQHDSWELLIDLICRDRKLLYVFELLDELVTSNS 550


>gb|KHN09023.1| Pentatricopeptide repeat-containing protein [Glycine soja]
          Length = 507

 Score =  677 bits (1748), Expect = 0.0
 Identities = 326/411 (79%), Positives = 370/411 (90%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LA+   F A+DRVLHQMTYETCKFHEGIF+NLMKH+SKSSLHEK+L  +FSIQPIVREKP
Sbjct: 97   LARCNNFHAVDRVLHQMTYETCKFHEGIFVNLMKHFSKSSLHEKLLHAYFSIQPIVREKP 156

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA STCLNLL+DSN+VDLAR+LLLHAKR L  KPNVC+FNILVK+HCKNGDL+SAFE+
Sbjct: 157  SPKALSTCLNLLLDSNRVDLARKLLLHAKRDLTRKPNVCVFNILVKYHCKNGDLDSAFEI 216

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            V+EMR S+ SYPN++TYSTLMDGLCRNGR+KEAF+LFEEMVSRD IVPDPLTYNVLINGF
Sbjct: 217  VEEMRNSEFSYPNLVTYSTLMDGLCRNGRVKEAFDLFEEMVSRDHIVPDPLTYNVLINGF 276

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+PDRARNVI+FMK+NGC PNV+NYSAL+DGLCK GKL+DAK V AE+K SGLKPD 
Sbjct: 277  CRGGKPDRARNVIQFMKSNGCYPNVYNYSALVDGLCKVGKLEDAKGVLAEIKGSGLKPDA 336

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            +TYTSLINF CRNG+ DEAIELL+EMKEN CQAD+VTFNV+LGGLCREG+FEEALDM+EK
Sbjct: 337  VTYTSLINFLCRNGKSDEAIELLEEMKENGCQADSVTFNVLLGGLCREGKFEEALDMVEK 396

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP QGVYLNKGSYRIVLNSLTQK +L+RA +LL LML RGF PHYATSNELL+ LCK GM
Sbjct: 397  LPQQGVYLNKGSYRIVLNSLTQKCELKRAKELLGLMLRRGFQPHYATSNELLVCLCKAGM 456

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LFDLV+MGFQP L++WE+LI LICR+RKLL VFELLDELV+TN+
Sbjct: 457  VDDAAVALFDLVEMGFQPGLETWEVLIGLICRERKLLYVFELLDELVVTNT 507


>ref|XP_003527867.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475
            [Glycine max]
 gb|KRH52886.1| hypothetical protein GLYMA_06G093000 [Glycine max]
          Length = 546

 Score =  677 bits (1748), Expect = 0.0
 Identities = 326/411 (79%), Positives = 370/411 (90%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LA+   F A+DRVLHQMTYETCKFHEGIF+NLMKH+SKSSLHEK+L  +FSIQPIVREKP
Sbjct: 136  LARCNNFHAVDRVLHQMTYETCKFHEGIFVNLMKHFSKSSLHEKLLHAYFSIQPIVREKP 195

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA STCLNLL+DSN+VDLAR+LLLHAKR L  KPNVC+FNILVK+HCKNGDL+SAFE+
Sbjct: 196  SPKALSTCLNLLLDSNRVDLARKLLLHAKRDLTRKPNVCVFNILVKYHCKNGDLDSAFEI 255

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            V+EMR S+ SYPN++TYSTLMDGLCRNGR+KEAF+LFEEMVSRD IVPDPLTYNVLINGF
Sbjct: 256  VEEMRNSEFSYPNLVTYSTLMDGLCRNGRVKEAFDLFEEMVSRDHIVPDPLTYNVLINGF 315

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+PDRARNVI+FMK+NGC PNV+NYSAL+DGLCK GKL+DAK V AE+K SGLKPD 
Sbjct: 316  CRGGKPDRARNVIQFMKSNGCYPNVYNYSALVDGLCKVGKLEDAKGVLAEIKGSGLKPDA 375

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            +TYTSLINF CRNG+ DEAIELL+EMKEN CQAD+VTFNV+LGGLCREG+FEEALDM+EK
Sbjct: 376  VTYTSLINFLCRNGKSDEAIELLEEMKENGCQADSVTFNVLLGGLCREGKFEEALDMVEK 435

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP QGVYLNKGSYRIVLNSLTQK +L+RA +LL LML RGF PHYATSNELL+ LCK GM
Sbjct: 436  LPQQGVYLNKGSYRIVLNSLTQKCELKRAKELLGLMLRRGFQPHYATSNELLVCLCKAGM 495

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LFDLV+MGFQP L++WE+LI LICR+RKLL VFELLDELV+TN+
Sbjct: 496  VDDAAVALFDLVEMGFQPGLETWEVLIGLICRERKLLYVFELLDELVVTNT 546


>ref|XP_016163692.1| pentatricopeptide repeat-containing protein At5g18475 [Arachis
            ipaensis]
 ref|XP_016163694.1| pentatricopeptide repeat-containing protein At5g18475 [Arachis
            ipaensis]
 ref|XP_020962154.1| pentatricopeptide repeat-containing protein At5g18475 [Arachis
            ipaensis]
 ref|XP_020962156.1| pentatricopeptide repeat-containing protein At5g18475 [Arachis
            ipaensis]
 ref|XP_020962157.1| pentatricopeptide repeat-containing protein At5g18475 [Arachis
            ipaensis]
 ref|XP_020962158.1| pentatricopeptide repeat-containing protein At5g18475 [Arachis
            ipaensis]
 ref|XP_020962159.1| pentatricopeptide repeat-containing protein At5g18475 [Arachis
            ipaensis]
          Length = 517

 Score =  671 bits (1730), Expect = 0.0
 Identities = 325/411 (79%), Positives = 373/411 (90%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQ KKFQA+DRVLHQMTYETCKFHEGIFINLMKH SKSSLHEKVL  FFSIQPIVREKP
Sbjct: 107  LAQSKKFQAVDRVLHQMTYETCKFHEGIFINLMKHLSKSSLHEKVLQIFFSIQPIVREKP 166

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA STCLNLL+DSNQVDLARQLLLHAKRSL ++PNVCIFNILVK+HCKNGDL+SAFEV
Sbjct: 167  SPKAISTCLNLLVDSNQVDLARQLLLHAKRSLTHRPNVCIFNILVKYHCKNGDLDSAFEV 226

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            V+EM+ SKSSYPN+ITYSTLMDGLC+NGRLKEAFELFEEMVS+D+IVPDPLTYNVLINGF
Sbjct: 227  VEEMKNSKSSYPNLITYSTLMDGLCQNGRLKEAFELFEEMVSKDQIVPDPLTYNVLINGF 286

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+PDRARN+IEFM++NGC PNV+NYSAL++GLCKAGKLQ+AK V  EMKS+GLKPD 
Sbjct: 287  CRGGKPDRARNIIEFMRSNGCHPNVYNYSALVNGLCKAGKLQEAKGVLDEMKSAGLKPDA 346

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            ++YT+LINF CRN +IDEA+ELL+EMKEN C+ADTVTFNVILGGLCRE R++EALDM+E+
Sbjct: 347  VSYTALINFCCRNKRIDEALELLEEMKENSCRADTVTFNVILGGLCREERYKEALDMLER 406

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP +G+Y+NKGSYRIVLN LTQ+ +L++A +LL LM+GRGF+PHYATSNELL+ LCK GM
Sbjct: 407  LPLEGIYMNKGSYRIVLNCLTQRGELKKAKELLGLMVGRGFVPHYATSNELLVQLCKAGM 466

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LF LV+MGFQP  DSW  LI+ IC+DRKLL VFELLDELV++NS
Sbjct: 467  VNDAAMALFALVEMGFQPGSDSWGHLIDQICKDRKLLYVFELLDELVISNS 517


>ref|XP_015934861.1| pentatricopeptide repeat-containing protein At5g18475 [Arachis
            duranensis]
 ref|XP_020984505.1| pentatricopeptide repeat-containing protein At5g18475 [Arachis
            duranensis]
          Length = 517

 Score =  670 bits (1728), Expect = 0.0
 Identities = 325/411 (79%), Positives = 371/411 (90%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQ KKFQA+DRVLHQMTYETCKFHEGIFINLMKH SKSSLHEKVL  FFSIQPIVREKP
Sbjct: 107  LAQSKKFQAVDRVLHQMTYETCKFHEGIFINLMKHLSKSSLHEKVLQIFFSIQPIVREKP 166

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA STCLNLL+DSNQVDLARQLLLHAKRSL ++PNVCIFNILVK+HCKNGDL+SAFEV
Sbjct: 167  SPKAISTCLNLLVDSNQVDLARQLLLHAKRSLTHRPNVCIFNILVKYHCKNGDLDSAFEV 226

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            V+EM+ SKSSYPN+ITYSTLMDGLC+N RLKEAFELFEEMVS+D+IVPDPLTYNVLINGF
Sbjct: 227  VEEMKNSKSSYPNLITYSTLMDGLCQNERLKEAFELFEEMVSKDQIVPDPLTYNVLINGF 286

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+PDRARN+IEFMK+NGC PN++NYSAL++GLCK GKLQ+AK V  EMKSSGLKPD 
Sbjct: 287  CRGGKPDRARNIIEFMKSNGCHPNIYNYSALVNGLCKVGKLQEAKGVLDEMKSSGLKPDA 346

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            ++YT+LINF CRN +IDEA+ELL+EMKEN C+ADTVTFNVILGGLCRE R++EALDM+E+
Sbjct: 347  VSYTALINFCCRNKRIDEALELLEEMKENSCRADTVTFNVILGGLCREERYKEALDMLER 406

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP +G+Y+NKGSYRIVLN LTQ+ +L++A +LL LMLGRGF+PHYATSNELL+ LCK GM
Sbjct: 407  LPLEGIYMNKGSYRIVLNCLTQRGELKKAKELLDLMLGRGFVPHYATSNELLVQLCKAGM 466

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LF LV+MGFQP  DSW  LI+ IC+DRKLL VFELLDELV++NS
Sbjct: 467  VNDAAMALFALVEMGFQPGSDSWGHLIDQICKDRKLLYVFELLDELVISNS 517


>ref|XP_017421782.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475
            [Vigna angularis]
 gb|KOM41794.1| hypothetical protein LR48_Vigan04g199200 [Vigna angularis]
 dbj|BAT78437.1| hypothetical protein VIGAN_02111300 [Vigna angularis var. angularis]
          Length = 519

 Score =  658 bits (1698), Expect = 0.0
 Identities = 318/411 (77%), Positives = 366/411 (89%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LA+  KF  +D+V+HQMTYETCKFHEGIF+NLM H+SKSSLHEKVL  FFSIQPIVR+KP
Sbjct: 109  LARCNKFHTVDQVIHQMTYETCKFHEGIFVNLMNHFSKSSLHEKVLQAFFSIQPIVRDKP 168

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA +TCLNLL++SN+VDLAR+LLLHAKR L +KPNVCIFNILVK+HCKNGDLESAFEV
Sbjct: 169  SPKALATCLNLLLESNRVDLARKLLLHAKRGLTHKPNVCIFNILVKYHCKNGDLESAFEV 228

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEMR S+ SYPN++TYSTLMDGLCRNGRL+EAF+LFEEMVSRD IVPDPLTYNVLINGF
Sbjct: 229  VKEMRNSEFSYPNLVTYSTLMDGLCRNGRLREAFQLFEEMVSRDHIVPDPLTYNVLINGF 288

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+PD ARNVIEFMK+NGC PNV+NYSAL++GLCK GKL+DAK V AEMK++GL PD 
Sbjct: 289  CREGKPDHARNVIEFMKSNGCYPNVYNYSALVNGLCKIGKLEDAKGVLAEMKNAGLTPDA 348

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            + YTSLI++ C NGQ+ E I+LL+EMKEN+ QADTVTFNVILGGLCREGRFEEALDM+ K
Sbjct: 349  VIYTSLISYLCTNGQVGEGIQLLEEMKENKIQADTVTFNVILGGLCREGRFEEALDMLWK 408

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP QGVYLNKGSYRIVLNSLT+K +L+RA +LL LML RGFLPHYATSNELL+ LCK GM
Sbjct: 409  LPQQGVYLNKGSYRIVLNSLTRKGELKRAKELLGLMLSRGFLPHYATSNELLVCLCKGGM 468

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LFDLV+MGFQP L++WE+LI LICRDRKLL VFELLDEL++T++
Sbjct: 469  ADDAARALFDLVEMGFQPELETWEVLIGLICRDRKLLHVFELLDELLVTDT 519


>ref|XP_020211825.1| pentatricopeptide repeat-containing protein At5g18475 isoform X1
            [Cajanus cajan]
 gb|KYP70082.1| Pentatricopeptide repeat-containing protein At5g18475 family [Cajanus
            cajan]
          Length = 504

 Score =  656 bits (1692), Expect = 0.0
 Identities = 316/411 (76%), Positives = 365/411 (88%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LA+  +F A++RVLHQMTYETCKFHEGIF+NLMKH+S++SLHE VL  FFSIQPIVREKP
Sbjct: 94   LARRDQFHAVNRVLHQMTYETCKFHEGIFVNLMKHFSRASLHENVLQAFFSIQPIVREKP 153

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA +TCL+LL+ S+++DLAR LLLHAKR+L +KPNVC+FNILVK+HCK GDLESAF V
Sbjct: 154  SPKALATCLDLLLHSDRLDLARNLLLHAKRTLTHKPNVCVFNILVKYHCKKGDLESAFHV 213

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEMR S  SYPN++TYSTLMDGLC+ GRLKEAF++FEEMVSRDR+VPDPLTYN LINGF
Sbjct: 214  VKEMRNSVLSYPNLVTYSTLMDGLCKIGRLKEAFQVFEEMVSRDRVVPDPLTYNTLINGF 273

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G P+RARNVI+FMK+NGCRPN+FNYSAL++G CKAGKLQDAK V+ EMK  GLKPDT
Sbjct: 274  CRGGDPERARNVIQFMKSNGCRPNLFNYSALVNGFCKAGKLQDAKGVWDEMKECGLKPDT 333

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            +TYTSLI+F CRNG+  EA++LL+EMKENEC+ADTVT NVILGGLCREGR EEALDM+EK
Sbjct: 334  VTYTSLIDFLCRNGETGEAMDLLEEMKENECRADTVTINVILGGLCREGRLEEALDMLEK 393

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LPHQGVYLNK SYRIVLNSLTQK +L++A  LL LMLGRGFLPHYATSNELL+ LC+ GM
Sbjct: 394  LPHQGVYLNKASYRIVLNSLTQKCELEKAKGLLGLMLGRGFLPHYATSNELLVLLCEAGM 453

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LFDLV+MGFQP L+SWE+LI LICRDRKLL VFELLDELV+TN+
Sbjct: 454  VDAAAMALFDLVEMGFQPGLESWEVLIGLICRDRKLLYVFELLDELVVTNA 504


>ref|XP_014502211.1| pentatricopeptide repeat-containing protein At5g18475 [Vigna radiata
            var. radiata]
 ref|XP_014502213.1| pentatricopeptide repeat-containing protein At5g18475 [Vigna radiata
            var. radiata]
 ref|XP_014502214.1| pentatricopeptide repeat-containing protein At5g18475 [Vigna radiata
            var. radiata]
 ref|XP_022636464.1| pentatricopeptide repeat-containing protein At5g18475 [Vigna radiata
            var. radiata]
 ref|XP_022636465.1| pentatricopeptide repeat-containing protein At5g18475 [Vigna radiata
            var. radiata]
 ref|XP_022636466.1| pentatricopeptide repeat-containing protein At5g18475 [Vigna radiata
            var. radiata]
          Length = 519

 Score =  653 bits (1684), Expect = 0.0
 Identities = 315/411 (76%), Positives = 364/411 (88%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LA+  KF  +D+V+HQMTYETCKFHEGIF+NLM ++SKSSLHEKVL  FFSIQPIVR+KP
Sbjct: 109  LARCNKFHTVDQVIHQMTYETCKFHEGIFVNLMNYFSKSSLHEKVLQAFFSIQPIVRDKP 168

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA +TCLNLL++SN+VDLARQLLLHAKR L +KPNVC+FNILVK+HCKNGDLESAFEV
Sbjct: 169  SPKALTTCLNLLLESNRVDLARQLLLHAKRGLTHKPNVCVFNILVKYHCKNGDLESAFEV 228

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEMR  + SYPN++TYSTLMDGLCRNGRL+EAF+LFEEMVSRD IVPDPLTYNVLINGF
Sbjct: 229  VKEMRNCEFSYPNLVTYSTLMDGLCRNGRLREAFQLFEEMVSRDHIVPDPLTYNVLINGF 288

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+PD ARNVIEFMK+NGC PNV+NYSAL++GLCK GKL+DAK + AEMK++GL PD 
Sbjct: 289  CREGKPDHARNVIEFMKSNGCYPNVYNYSALVNGLCKIGKLEDAKGLLAEMKNAGLTPDA 348

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            + YTSLIN+ C NGQ+ E I+LL+EMKEN+ QADTVTFNVILGGLCREGRFEEALDM+ K
Sbjct: 349  VIYTSLINYLCTNGQVGEGIQLLEEMKENKIQADTVTFNVILGGLCREGRFEEALDMLGK 408

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP QGVYLNKGSYRIVLNSLT+K +L+RA +LL LM  RGFLPHYATSNELL+ LCK GM
Sbjct: 409  LPQQGVYLNKGSYRIVLNSLTRKGELKRAKELLGLMQSRGFLPHYATSNELLVCLCKGGM 468

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LFDLV+MGFQP L++WE+LI LICRDRKLL VFELLDEL++T++
Sbjct: 469  ADDAARALFDLVEMGFQPGLETWEVLIGLICRDRKLLYVFELLDELLVTDT 519


>gb|PNY15267.1| pentatricopeptide repeat-containing protein at5g18475-like protein
            [Trifolium pratense]
          Length = 497

 Score =  650 bits (1678), Expect = 0.0
 Identities = 316/372 (84%), Positives = 342/372 (91%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQFKKFQA++RVLHQMTYETCKFHEG+FINLMKHYSK   HEKVL TFF IQ IVREKP
Sbjct: 102  LAQFKKFQAVERVLHQMTYETCKFHEGVFINLMKHYSKCCFHEKVLDTFFVIQTIVREKP 161

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA S+CLNLL+DSN+VDLAR+LLLHAKRSL  KPNVCIFNILVK+HCKNGDLESAFEV
Sbjct: 162  SPKAISSCLNLLVDSNRVDLARKLLLHAKRSLTYKPNVCIFNILVKYHCKNGDLESAFEV 221

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEMR SK SYPN+ITYSTLMDGLC+NG+LKEAFELFEEM+S+DRIVPDPLTYNVLINGF
Sbjct: 222  VKEMRNSKYSYPNVITYSTLMDGLCQNGKLKEAFELFEEMISKDRIVPDPLTYNVLINGF 281

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+ DRARNVIEFMKNNGC PNVFNYSAL+DGLCKAGK QDAK V AEMKSSGLKPDT
Sbjct: 282  CRGGKADRARNVIEFMKNNGCCPNVFNYSALVDGLCKAGKPQDAKEVLAEMKSSGLKPDT 341

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            +TYTSLINF CRNGQ+DEAIELLKEMKENEC+ADTVTFNVILGGLCREGRF+EALDM+EK
Sbjct: 342  VTYTSLINFLCRNGQVDEAIELLKEMKENECEADTVTFNVILGGLCREGRFDEALDMVEK 401

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LPHQG+YLNKGSYRIVLNSLTQK +L++A KLL LML RGF+PHYATSNELL+ LCKEGM
Sbjct: 402  LPHQGIYLNKGSYRIVLNSLTQKCELRKAKKLLGLMLSRGFVPHYATSNELLVCLCKEGM 461

Query: 1082 XXXXXXXLFDLV 1117
                   LFDLV
Sbjct: 462  ADDAAAALFDLV 473



 Score =  103 bits (256), Expect = 8e-20
 Identities = 68/265 (25%), Positives = 137/265 (51%), Gaps = 3/265 (1%)
 Frame = +2

Query: 434  CRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGFCRAGQPDRARNVIEFMKNN-GCRP 610
            C + ++ + F + + +V   R  P P   +  +N    + + D AR ++   K +   +P
Sbjct: 141  CFHEKVLDTFFVIQTIV---REKPSPKAISSCLNLLVDSNRVDLARKLLLHAKRSLTYKP 197

Query: 611  NVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLK-PDTITYTSLINFFCRNGQIDEAIEL 787
            NV  ++ L+   CK G L+ A  V  EM++S    P+ ITY++L++  C+NG++ EA EL
Sbjct: 198  NVCIFNILVKYHCKNGDLESAFEVVKEMRNSKYSYPNVITYSTLMDGLCQNGKLKEAFEL 257

Query: 788  LKEM-KENECQADTVTFNVILGGLCREGRFEEALDMIEKLPHQGVYLNKGSYRIVLNSLT 964
             +EM  ++    D +T+NV++ G CR G+ + A ++IE + + G   N  +Y  +++ L 
Sbjct: 258  FEEMISKDRIVPDPLTYNVLINGFCRGGKADRARNVIEFMKNNGCCPNVFNYSALVDGLC 317

Query: 965  QKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGMXXXXXXXLFDLVDMGFQPRLD 1144
            +    Q A ++L  M   G  P   T   L+  LC+ G        L ++ +   +    
Sbjct: 318  KAGKPQDAKEVLAEMKSSGLKPDTVTYTSLINFLCRNGQVDEAIELLKEMKENECEADTV 377

Query: 1145 SWELLIELICRDRKLLKVFELLDEL 1219
            ++ +++  +CR+ +  +  +++++L
Sbjct: 378  TFNVILGGLCREGRFDEALDMVEKL 402


>ref|XP_007137642.1| hypothetical protein PHAVU_009G143500g, partial [Phaseolus vulgaris]
 gb|ESW09636.1| hypothetical protein PHAVU_009G143500g, partial [Phaseolus vulgaris]
          Length = 742

 Score =  648 bits (1671), Expect = 0.0
 Identities = 314/399 (78%), Positives = 355/399 (88%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LA+  KF A+DRVLHQMTYETCKFHEGIF+NLM H+SKSSLH+KVL  FFSIQPIVR+KP
Sbjct: 107  LARCNKFHAVDRVLHQMTYETCKFHEGIFVNLMSHFSKSSLHDKVLQAFFSIQPIVRDKP 166

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            SPKA +TCLNLL+DSN+VDLAR+LLLHAKR L +KPNVCIFNILVK+HCKNGDLESAFEV
Sbjct: 167  SPKALTTCLNLLLDSNRVDLARKLLLHAKRGLTHKPNVCIFNILVKYHCKNGDLESAFEV 226

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEMR S+ SYPN+ITYSTLMDGLCRNGRL+EAF+LFEEMVSRD IVPDPLTYNVLINGF
Sbjct: 227  VKEMRSSEFSYPNLITYSTLMDGLCRNGRLREAFQLFEEMVSRDHIVPDPLTYNVLINGF 286

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+PD ARNVIEFMK+NGC PNV+NYSAL++GLC+ GKL+DAK V AEMK+SGLKPD 
Sbjct: 287  CREGKPDHARNVIEFMKSNGCYPNVYNYSALVNGLCRIGKLEDAKGVLAEMKNSGLKPDA 346

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            +TYTSLIN+ CRNGQ+ EAI+LL+EMKEN+ QADTV FN+ILGGLCRE RFEEALDM+EK
Sbjct: 347  VTYTSLINYLCRNGQVGEAIQLLEEMKENKIQADTVVFNLILGGLCREDRFEEALDMLEK 406

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP QGVYLNKGSYRIVLNSL Q  +L+ A +LL LML RGFLPHYA+SNELL+ LCK GM
Sbjct: 407  LPQQGVYLNKGSYRIVLNSLIQNGELKSAKELLGLMLSRGFLPHYASSNELLVCLCKGGM 466

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKV 1198
                   LFDLV+MGFQP L+SWE+LI LICRDRKLL +
Sbjct: 467  ADDAARALFDLVEMGFQPGLESWEILIGLICRDRKLLYI 505



 Score =  119 bits (298), Expect = 8e-25
 Identities = 81/306 (26%), Positives = 149/306 (48%), Gaps = 2/306 (0%)
 Frame = +2

Query: 299  IFNILVKHHCKNGDLESAFEVVKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEE 478
            IF  L+ H  K+   +   +    ++      P+    +T ++ L  + R+  A +L   
Sbjct: 134  IFVNLMSHFSKSSLHDKVLQAFFSIQPIVRDKPSPKALTTCLNLLLDSNRVDLARKLLLH 193

Query: 479  MVSRDRIVPDPLTYNVLINGFCRAGQPDRARNVIEFMKNNGCR-PNVFNYSALMDGLCKA 655
                    P+   +N+L+   C+ G  + A  V++ M+++    PN+  YS LMDGLC+ 
Sbjct: 194  AKRGLTHKPNVCIFNILVKYHCKNGDLESAFEVVKEMRSSEFSYPNLITYSTLMDGLCRN 253

Query: 656  GKLQDAKRVFAEMKSSG-LKPDTITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVT 832
            G+L++A ++F EM S   + PD +TY  LIN FCR G+ D A  +++ MK N C  +   
Sbjct: 254  GRLREAFQLFEEMVSRDHIVPDPLTYNVLINGFCREGKPDHARNVIEFMKSNGCYPNVYN 313

Query: 833  FNVILGGLCREGRFEEALDMIEKLPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLML 1012
            ++ ++ GLCR G+ E+A  ++ ++ + G+  +  +Y  ++N L +   +  A +LL+ M 
Sbjct: 314  YSALVNGLCRIGKLEDAKGVLAEMKNSGLKPDAVTYTSLINYLCRNGQVGEAIQLLEEMK 373

Query: 1013 GRGFLPHYATSNELLISLCKEGMXXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLL 1192
                       N +L  LC+E         L  L   G      S+ +++  + ++ +L 
Sbjct: 374  ENKIQADTVVFNLILGGLCREDRFEEALDMLEKLPQQGVYLNKGSYRIVLNSLIQNGELK 433

Query: 1193 KVFELL 1210
               ELL
Sbjct: 434  SAKELL 439



 Score =  112 bits (280), Expect = 2e-22
 Identities = 76/261 (29%), Positives = 134/261 (51%), Gaps = 3/261 (1%)
 Frame = +2

Query: 446  RLKEAFELFEEMVSRDRIVPDPLTYNVLINGFCRAGQPDRARNVIEFMKNNGC-RPNVFN 622
            ++ +AF   + +V RD+  P P      +N    + + D AR ++   K     +PNV  
Sbjct: 150  KVLQAFFSIQPIV-RDK--PSPKALTTCLNLLLDSNRVDLARKLLLHAKRGLTHKPNVCI 206

Query: 623  YSALMDGLCKAGKLQDAKRVFAEMKSSGLK-PDTITYTSLINFFCRNGQIDEAIELLKEM 799
            ++ L+   CK G L+ A  V  EM+SS    P+ ITY++L++  CRNG++ EA +L +EM
Sbjct: 207  FNILVKYHCKNGDLESAFEVVKEMRSSEFSYPNLITYSTLMDGLCRNGRLREAFQLFEEM 266

Query: 800  -KENECQADTVTFNVILGGLCREGRFEEALDMIEKLPHQGVYLNKGSYRIVLNSLTQKLD 976
               +    D +T+NV++ G CREG+ + A ++IE +   G Y N  +Y  ++N L +   
Sbjct: 267  VSRDHIVPDPLTYNVLINGFCREGKPDHARNVIEFMKSNGCYPNVYNYSALVNGLCRIGK 326

Query: 977  LQRANKLLQLMLGRGFLPHYATSNELLISLCKEGMXXXXXXXLFDLVDMGFQPRLDSWEL 1156
            L+ A  +L  M   G  P   T   L+  LC+ G        L ++ +   Q     + L
Sbjct: 327  LEDAKGVLAEMKNSGLKPDAVTYTSLINYLCRNGQVGEAIQLLEEMKENKIQADTVVFNL 386

Query: 1157 LIELICRDRKLLKVFELLDEL 1219
            ++  +CR+ +  +  ++L++L
Sbjct: 387  ILGGLCREDRFEEALDMLEKL 407


>gb|POF09155.1| pentatricopeptide repeat-containing protein [Quercus suber]
 gb|POF21441.1| pentatricopeptide repeat-containing protein [Quercus suber]
          Length = 511

 Score =  613 bits (1580), Expect = 0.0
 Identities = 296/407 (72%), Positives = 348/407 (85%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LA+ KKF+AID VLHQMTYETCKFHEG+F+NLMKH+SKSSLHE+VL  F++IQP VREKP
Sbjct: 104  LARSKKFEAIDAVLHQMTYETCKFHEGVFLNLMKHFSKSSLHERVLKMFYAIQPYVREKP 163

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            S KA STCLNLL++SNQ++LAR+ LLH K+SL  +PN CIFNILVKHHCK+ DLESAFEV
Sbjct: 164  SLKAISTCLNLLVESNQINLAREFLLHTKKSLNLRPNTCIFNILVKHHCKSRDLESAFEV 223

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            V EM+ SK SYPN+ITYSTLMDGLC +GRLKEA +LFEEMVS+D+I+PD LTYN+LINGF
Sbjct: 224  VNEMKKSKISYPNLITYSTLMDGLCESGRLKEAIDLFEEMVSKDQILPDALTYNILINGF 283

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+ DRAR ++EFMK+NGC PNVFNYSALM+G CK G++Q+AK +F EMKSSGLK DT
Sbjct: 284  CRGGKVDRARKIMEFMKSNGCSPNVFNYSALMNGFCKEGRVQEAKELFYEMKSSGLKADT 343

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            I YT+LIN FCR G+IDEA ELLKEMKE EC+ADTVTFNVIL GLC EG+FEEALDM+EK
Sbjct: 344  IGYTTLINCFCRAGKIDEATELLKEMKERECRADTVTFNVILKGLCGEGKFEEALDMLEK 403

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP+ GVYLNK SYRIVLN L QK +L+RA +LL LMLGRGF+PHYATSNELL+ LCK  M
Sbjct: 404  LPYDGVYLNKASYRIVLNFLCQKGELKRATELLDLMLGRGFVPHYATSNELLVHLCKANM 463

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELV 1222
                   +F LV+MGF+P  DSW  ++EL+CR+R+LL  FELLDELV
Sbjct: 464  ADDAAIAMFGLVEMGFKPEPDSWAHVLELVCRERRLLSAFELLDELV 510


>ref|XP_023892775.1| pentatricopeptide repeat-containing protein At5g18475-like [Quercus
            suber]
 ref|XP_023913576.1| pentatricopeptide repeat-containing protein At5g18475-like [Quercus
            suber]
          Length = 514

 Score =  613 bits (1580), Expect = 0.0
 Identities = 296/407 (72%), Positives = 348/407 (85%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LA+ KKF+AID VLHQMTYETCKFHEG+F+NLMKH+SKSSLHE+VL  F++IQP VREKP
Sbjct: 107  LARSKKFEAIDAVLHQMTYETCKFHEGVFLNLMKHFSKSSLHERVLKMFYAIQPYVREKP 166

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            S KA STCLNLL++SNQ++LAR+ LLH K+SL  +PN CIFNILVKHHCK+ DLESAFEV
Sbjct: 167  SLKAISTCLNLLVESNQINLAREFLLHTKKSLNLRPNTCIFNILVKHHCKSRDLESAFEV 226

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            V EM+ SK SYPN+ITYSTLMDGLC +GRLKEA +LFEEMVS+D+I+PD LTYN+LINGF
Sbjct: 227  VNEMKKSKISYPNLITYSTLMDGLCESGRLKEAIDLFEEMVSKDQILPDALTYNILINGF 286

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+ DRAR ++EFMK+NGC PNVFNYSALM+G CK G++Q+AK +F EMKSSGLK DT
Sbjct: 287  CRGGKVDRARKIMEFMKSNGCSPNVFNYSALMNGFCKEGRVQEAKELFYEMKSSGLKADT 346

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            I YT+LIN FCR G+IDEA ELLKEMKE EC+ADTVTFNVIL GLC EG+FEEALDM+EK
Sbjct: 347  IGYTTLINCFCRAGKIDEATELLKEMKERECRADTVTFNVILKGLCGEGKFEEALDMLEK 406

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP+ GVYLNK SYRIVLN L QK +L+RA +LL LMLGRGF+PHYATSNELL+ LCK  M
Sbjct: 407  LPYDGVYLNKASYRIVLNFLCQKGELKRATELLDLMLGRGFVPHYATSNELLVHLCKANM 466

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELV 1222
                   +F LV+MGF+P  DSW  ++EL+CR+R+LL  FELLDELV
Sbjct: 467  ADDAAIAMFGLVEMGFKPEPDSWAHVLELVCRERRLLSAFELLDELV 513


>gb|OMO77960.1| hypothetical protein COLO4_24918 [Corchorus olitorius]
          Length = 515

 Score =  607 bits (1564), Expect = 0.0
 Identities = 294/411 (71%), Positives = 346/411 (84%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQ KKFQAID +LHQMTYETCKFHEGIFINLM+H+SK SLH++VL  F++I+PIVR+KP
Sbjct: 105  LAQSKKFQAIDSILHQMTYETCKFHEGIFINLMRHFSKISLHDRVLEMFYTIEPIVRDKP 164

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            S KA STCLNLLI+SNQV+LAR  LL++K+SL  +PN CIFNILVKHHCKNGDLESAFEV
Sbjct: 165  SLKAISTCLNLLIESNQVELARDFLLNSKKSLKLRPNTCIFNILVKHHCKNGDLESAFEV 224

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEM+ S+ SYPN ITYSTLM GLC +GRLKEA +LFEEMVS+D+I+PD LTYNVLINGF
Sbjct: 225  VKEMKKSRVSYPNQITYSTLMGGLCESGRLKEAIDLFEEMVSKDQILPDVLTYNVLINGF 284

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+ DRAR ++EFM NNGC PN+FNYSALM+G CK G+ Q+AK VF EMKS+GL+PDT
Sbjct: 285  CRGGKVDRARKIMEFMTNNGCNPNLFNYSALMNGFCKEGRWQEAKEVFIEMKSAGLRPDT 344

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            I YT+LIN  CR G+IDE +ELLKEMKE ECQAD VT NV+LGGLCREGRF++AL M+EK
Sbjct: 345  IGYTTLINCLCRAGRIDEGMELLKEMKEKECQADVVTLNVLLGGLCREGRFQDALQMLEK 404

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP++GVYLNKGSYRIVLNSL Q   +++A KLL LML RGF PHYATSNE+L+ LCK GM
Sbjct: 405  LPYEGVYLNKGSYRIVLNSLCQNDKMEKATKLLVLMLERGFWPHYATSNEILVRLCKAGM 464

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LF+L + GF+P    WE+LIEL CR+RKLL +FELLDELV+  S
Sbjct: 465  VDDAVTALFELAETGFKPEPHCWEILIELNCRERKLLSIFELLDELVIKES 515


>ref|XP_021808283.1| pentatricopeptide repeat-containing protein At5g18475 [Prunus avium]
          Length = 517

 Score =  600 bits (1548), Expect = 0.0
 Identities = 289/408 (70%), Positives = 347/408 (85%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQ KKF+AID +LHQMTYETCKFHEGIF+NLMKH+SKSS+HE+VL  F++IQPIVREKP
Sbjct: 106  LAQSKKFKAIDAILHQMTYETCKFHEGIFLNLMKHFSKSSMHERVLEMFYAIQPIVREKP 165

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            S K  STCLNLLI+SNQVDLA+Q L+H K++L  KPN CI NILVKHHCKNGDLESAFEV
Sbjct: 166  SLKCISTCLNLLIESNQVDLAQQFLMHLKKNLNFKPNTCIVNILVKHHCKNGDLESAFEV 225

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEM+ SK SYPN++TYSTL+ GLC + RL EA ELFEEM+S+D+I+PD LTYNVLINGF
Sbjct: 226  VKEMKKSKISYPNLVTYSTLLGGLCESDRLTEAMELFEEMISKDQILPDALTYNVLINGF 285

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+ DRAR ++EFMK+NGC+PNVFNY+ALM+G CK  +LQ+AK +F EM S G+KPDT
Sbjct: 286  CRGGKVDRARKILEFMKSNGCQPNVFNYTALMNGFCKEKRLQEAKEIFHEMMSFGIKPDT 345

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            + YT+LIN  CR G+++EAIELLKEMKE EC+ADTVTFNVILGGLCREGR E+AL+M+EK
Sbjct: 346  VGYTTLINCCCRTGKMNEAIELLKEMKERECKADTVTFNVILGGLCREGRIEDALEMLEK 405

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP++GVYLNK SYRIVLN L QK +L +A +LL LM+GRGF+PHYATSN+LL+ L + GM
Sbjct: 406  LPYEGVYLNKASYRIVLNFLCQKGELNKATQLLGLMMGRGFVPHYATSNDLLVRLSEAGM 465

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVL 1225
                   L  LV+MGF+P+ DSW LL+E ICR+RKLL  FELLDELV+
Sbjct: 466  AENAVMALSRLVEMGFKPQPDSWALLVESICRERKLLSAFELLDELVV 513


>gb|OMO98439.1| hypothetical protein CCACVL1_04225 [Corchorus capsularis]
          Length = 515

 Score =  598 bits (1542), Expect = 0.0
 Identities = 290/411 (70%), Positives = 345/411 (83%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQ KKFQAID +LHQMTYETCKFHEGIFINLM+H+SK SL+++VL  F++I+PIVREKP
Sbjct: 105  LAQSKKFQAIDSILHQMTYETCKFHEGIFINLMRHFSKISLYDRVLEMFYTIEPIVREKP 164

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            S KA STCLNL+I+SNQV+LAR  LL++K+SL  +PN CIFNILVKHHCKNGDLESAFEV
Sbjct: 165  SLKAISTCLNLMIESNQVELARDFLLNSKKSLKLRPNTCIFNILVKHHCKNGDLESAFEV 224

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEM+ S+ SYPN ITYSTLM GLC +GRLKEA +LFEEMVS+D+I+PD LTYNVLINGF
Sbjct: 225  VKEMKKSRVSYPNQITYSTLMGGLCESGRLKEAIDLFEEMVSKDQILPDVLTYNVLINGF 284

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            CR G+ DRAR ++EFM NNGC PN+FNYSALM+G CK G+ Q+AK VF EMKS+GLKPDT
Sbjct: 285  CRGGKVDRARKIMEFMTNNGCNPNLFNYSALMNGFCKEGRWQEAKEVFIEMKSAGLKPDT 344

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            I YT+LIN  CR G+IDE +ELLKEMK+ ECQAD VT NV+LGGLCREGRF++AL M+EK
Sbjct: 345  IGYTTLINCLCRAGRIDEGMELLKEMKKKECQADVVTLNVLLGGLCREGRFQDALQMLEK 404

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP++GV LNKGSYRIVLNSL Q  ++++  KLL LML RGF PHYATSNE+L+ LCK GM
Sbjct: 405  LPYEGVCLNKGSYRIVLNSLCQNDEMEKVTKLLVLMLERGFWPHYATSNEILVRLCKAGM 464

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVLTNS 1234
                   LF+L + GF+P    WE+LIEL CR+RKLL +F+LLDELV+  S
Sbjct: 465  VDDAVTALFELAETGFKPEPHCWEILIELNCRERKLLYIFKLLDELVIKES 515


>ref|XP_008219391.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475
            [Prunus mume]
          Length = 517

 Score =  597 bits (1540), Expect = 0.0
 Identities = 286/408 (70%), Positives = 346/408 (84%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LAQ KKF+AID +LHQMTYETCKFHEGIF+NLMKH+SKSS+HE+VL  F++IQP+VREKP
Sbjct: 106  LAQSKKFKAIDAILHQMTYETCKFHEGIFLNLMKHFSKSSMHERVLEMFYAIQPVVREKP 165

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            S K  STCLNLLI+SNQVDLA+Q L+H K++L  KPN CI NILVKHHCKNGDLESAFEV
Sbjct: 166  SLKCISTCLNLLIESNQVDLAQQFLMHLKKNLNFKPNTCIVNILVKHHCKNGDLESAFEV 225

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
            VKEM+ SK SYPN++TYSTL+ GLC++ RL EA ELFEEM+S+D+I+PD LTYNVLINGF
Sbjct: 226  VKEMKKSKISYPNLVTYSTLLGGLCKSDRLTEAMELFEEMISKDQILPDALTYNVLINGF 285

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            C  G+ DRAR ++EFMK+NGC+PNVFNY+ALM+G CK  +LQ+AK +F EM S G+KPDT
Sbjct: 286  CHGGKVDRARKILEFMKSNGCQPNVFNYTALMNGFCKEKRLQEAKEIFHEMTSFGIKPDT 345

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            + YT+LIN  CR G+++EAIELLKEMKE EC+ADTVTFNVILGGLCREGR E+AL+M+EK
Sbjct: 346  VGYTALINCCCRTGKMNEAIELLKEMKERECKADTVTFNVILGGLCREGRIEDALEMLEK 405

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP++GVYLNK SYRIVLN L QK +L +A +LL LM+GRGF+PHYATSN+LL+ L + GM
Sbjct: 406  LPYEGVYLNKASYRIVLNFLCQKGELNKATQLLGLMMGRGFVPHYATSNDLLVRLSEAGM 465

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVL 1225
                   L  L +MGF+P+ DSW LL+E ICR+RKLL  FELLDELV+
Sbjct: 466  AENAVMALSRLAEMGFKPQPDSWALLVESICRERKLLSAFELLDELVV 513


>ref|XP_018812757.1| PREDICTED: pentatricopeptide repeat-containing protein At5g18475
            isoform X1 [Juglans regia]
          Length = 524

 Score =  596 bits (1536), Expect = 0.0
 Identities = 291/408 (71%), Positives = 341/408 (83%)
 Frame = +2

Query: 2    LAQFKKFQAIDRVLHQMTYETCKFHEGIFINLMKHYSKSSLHEKVLGTFFSIQPIVREKP 181
            LA+ KKF A+D VL QMTYETCKFHEGIF+NLMKH+S+SSLHE+VL  F +IQP+VREKP
Sbjct: 114  LARSKKFPAVDAVLRQMTYETCKFHEGIFLNLMKHFSQSSLHERVLEMFHAIQPVVREKP 173

Query: 182  SPKATSTCLNLLIDSNQVDLARQLLLHAKRSLINKPNVCIFNILVKHHCKNGDLESAFEV 361
            S KA STCLNLL+ SNQ+DLAR+ LLH+++ L  KPN CIFNILVKHHCKN +L+SAFEV
Sbjct: 174  SLKAISTCLNLLVKSNQIDLAREFLLHSRKRLNLKPNSCIFNILVKHHCKNKNLKSAFEV 233

Query: 362  VKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAFELFEEMVSRDRIVPDPLTYNVLINGF 541
              EM+ SK SYPN+ITYSTLMDGLC +GRLKEA +LFEEMVS+++I+PD LTYNVLINGF
Sbjct: 234  FNEMKKSKISYPNVITYSTLMDGLCESGRLKEAVDLFEEMVSKEQILPDALTYNVLINGF 293

Query: 542  CRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDGLCKAGKLQDAKRVFAEMKSSGLKPDT 721
            C AG+ DRA+ ++EFM+ NGC PNVFNYS LM+G CK  KL +AK VF EMKS GLKPDT
Sbjct: 294  CHAGKVDRAKKMMEFMRKNGCNPNVFNYSTLMNGFCKEKKLLEAKEVFDEMKSFGLKPDT 353

Query: 722  ITYTSLINFFCRNGQIDEAIELLKEMKENECQADTVTFNVILGGLCREGRFEEALDMIEK 901
            I+Y++LIN FCR G+IDEA ELLKEMKE EC+ADTVTFNVILGGLCREGRFE ALDM+EK
Sbjct: 354  ISYSTLINCFCRAGKIDEATELLKEMKEKECKADTVTFNVILGGLCREGRFETALDMLEK 413

Query: 902  LPHQGVYLNKGSYRIVLNSLTQKLDLQRANKLLQLMLGRGFLPHYATSNELLISLCKEGM 1081
            LP  GVYLNK SYRIVLN L QK +L +A +LL LMLGRGFLPHYATSNELL+ LCK GM
Sbjct: 414  LPCDGVYLNKASYRIVLNFLCQKGELNKATELLDLMLGRGFLPHYATSNELLVRLCKAGM 473

Query: 1082 XXXXXXXLFDLVDMGFQPRLDSWELLIELICRDRKLLKVFELLDELVL 1225
                   +F+L++MGF+P  DSW   +ELIC++RKLL  FELLDEL +
Sbjct: 474  VDDAAVAMFELMEMGFKPEPDSWAHALELICKERKLLCTFELLDELTV 521



 Score = 94.7 bits (234), Expect = 6e-17
 Identities = 47/176 (26%), Positives = 94/176 (53%)
 Frame = +2

Query: 284 KPNVCIFNILVKHHCKNGDLESAFEVVKEMRISKSSYPNIITYSTLMDGLCRNGRLKEAF 463
           KP+   ++ L+   C+ G ++ A E++KEM+  K    + +T++ ++ GLCR GR + A 
Sbjct: 350 KPDTISYSTLINCFCRAGKIDEATELLKEMK-EKECKADTVTFNVILGGLCREGRFETAL 408

Query: 464 ELFEEMVSRDRIVPDPLTYNVLINGFCRAGQPDRARNVIEFMKNNGCRPNVFNYSALMDG 643
           ++ E++   D +  +  +Y +++N  C+ G+ ++A  +++ M   G  P+    + L+  
Sbjct: 409 DMLEKLPC-DGVYLNKASYRIVLNFLCQKGELNKATELLDLMLGRGFLPHYATSNELLVR 467

Query: 644 LCKAGKLQDAKRVFAEMKSSGLKPDTITYTSLINFFCRNGQIDEAIELLKEMKENE 811
           LCKAG + DA     E+   G KP+  ++   +   C+  ++    ELL E+   E
Sbjct: 468 LCKAGMVDDAAVAMFELMEMGFKPEPDSWAHALELICKERKLLCTFELLDELTVRE 523

