BLASTX nr result

ID: Astragalus22_contig00027358 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00027358
         (668 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_012573667.1| PREDICTED: pentatricopeptide repeat-containi...   293   2e-92
ref|XP_007155935.1| hypothetical protein PHAVU_003G244800g [Phas...   254   6e-78
ref|XP_014506479.1| pentatricopeptide repeat-containing protein ...   254   1e-77
ref|XP_017410302.1| PREDICTED: putative pentatricopeptide repeat...   249   5e-76
ref|XP_003551036.1| PREDICTED: pentatricopeptide repeat-containi...   248   3e-75
ref|XP_015954901.1| pentatricopeptide repeat-containing protein ...   244   1e-73
ref|XP_016206998.1| pentatricopeptide repeat-containing protein ...   240   2e-72
gb|ONH96536.1| hypothetical protein PRUPE_7G135300 [Prunus persica]   224   3e-66
gb|KYP42158.1| Pentatricopeptide repeat-containing protein At3g1...   222   4e-66
ref|XP_020424413.1| pentatricopeptide repeat-containing protein ...   224   9e-66
ref|XP_020239719.1| pentatricopeptide repeat-containing protein ...   222   1e-65
ref|XP_002277337.2| PREDICTED: pentatricopeptide repeat-containi...   223   2e-65
ref|XP_008344308.1| PREDICTED: pentatricopeptide repeat-containi...   219   8e-64
ref|XP_009366958.1| PREDICTED: pentatricopeptide repeat-containi...   219   1e-63
ref|XP_008340659.1| PREDICTED: pentatricopeptide repeat-containi...   219   5e-63
ref|XP_021274359.1| pentatricopeptide repeat-containing protein ...   213   7e-62
ref|XP_007047218.1| PREDICTED: pentatricopeptide repeat-containi...   212   1e-61
ref|XP_015890127.1| PREDICTED: pentatricopeptide repeat-containi...   211   4e-61
gb|KRH04632.1| hypothetical protein GLYMA_17G175800 [Glycine max]     207   4e-61
ref|XP_024183906.1| putative pentatricopeptide repeat-containing...   209   2e-60

>ref|XP_012573667.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g16610-like [Cicer arietinum]
          Length = 653

 Score =  293 bits (750), Expect = 2e-92
 Identities = 142/184 (77%), Positives = 161/184 (87%), Gaps = 1/184 (0%)
 Frame = -1

Query: 560 NFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLN 381
           N NS LQ CS+SK++NQ KQLHQRL+ L ++S+ NPFF TKLIQ Y+DC+D+RSA  LL+
Sbjct: 3   NLNSVLQACSTSKNLNQAKQLHQRLI-LFNASNPNPFFTTKLIQIYSDCNDIRSATFLLH 61

Query: 380 QLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLR-LNAIVPDGYVFPKVLKACAQSACLH 204
           QLS PNIF+FTSILSFHSRHS   QCIQTYAQLR LN +VPDGYVFPKV KACA SA  H
Sbjct: 62  QLSHPNIFSFTSILSFHSRHSLHSQCIQTYAQLRRLNGLVPDGYVFPKVFKACALSASFH 121

Query: 203 VGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMSCYV 24
           VG+VVHKDV+VFGW+P+LRVCNSVLDMYSKC DVGSA +VFDEMR+RDVFSWNSMMSCYV
Sbjct: 122 VGVVVHKDVIVFGWNPNLRVCNSVLDMYSKCGDVGSAVKVFDEMRKRDVFSWNSMMSCYV 181

Query: 23  CNGL 12
           CNGL
Sbjct: 182 CNGL 185


>ref|XP_007155935.1| hypothetical protein PHAVU_003G244800g [Phaseolus vulgaris]
 gb|ESW27929.1| hypothetical protein PHAVU_003G244800g [Phaseolus vulgaris]
          Length = 619

 Score =  254 bits (650), Expect = 6e-78
 Identities = 130/181 (71%), Positives = 146/181 (80%)
 Frame = -1

Query: 557 FNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLNQ 378
           FNS L  C   K++NQ KQLH    +L  +SH NPFFVTKLIQ YADC+DLRSA  LL+Q
Sbjct: 8   FNSLLGAC---KTLNQAKQLHN--CILQTASHRNPFFVTKLIQIYADCNDLRSALTLLHQ 62

Query: 377 LSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFPKVLKACAQSACLHVG 198
           LSQPN+FAFTSILSFHS+H H   CIQTYA+LR N +VPDGYVFPKVLKACAQ + L  G
Sbjct: 63  LSQPNVFAFTSILSFHSKHGHPHHCIQTYAKLRQNGVVPDGYVFPKVLKACAQLSRLGTG 122

Query: 197 IVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMSCYVCN 18
            VV+KDV+VFG + +L+V NSVLDMYSKC DV SATQVFDEM ERDVFSWNSMMS YVCN
Sbjct: 123 TVVYKDVIVFGAESNLQVRNSVLDMYSKCGDVWSATQVFDEMPERDVFSWNSMMSGYVCN 182

Query: 17  G 15
           G
Sbjct: 183 G 183


>ref|XP_014506479.1| pentatricopeptide repeat-containing protein At5g39350-like [Vigna
           radiata var. radiata]
          Length = 619

 Score =  254 bits (648), Expect = 1e-77
 Identities = 128/182 (70%), Positives = 147/182 (80%)
 Frame = -1

Query: 560 NFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLN 381
           +FNS ++ C   K++NQ KQLH    +L  +SH NPFFVTKLIQ YADC+DLRSA  LL+
Sbjct: 7   SFNSVIRAC---KTLNQAKQLHN--CILQTASHRNPFFVTKLIQIYADCNDLRSALALLH 61

Query: 380 QLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFPKVLKACAQSACLHV 201
           QLS PN+FAFTSILSF+S+H H   CIQTYA+LR N +VPDGYVFPKVLKACAQ + L  
Sbjct: 62  QLSHPNVFAFTSILSFYSKHGHPHHCIQTYAKLRQNGVVPDGYVFPKVLKACAQLSHLGT 121

Query: 200 GIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMSCYVC 21
           G VV+KDV+VFG D +L+V NSVLDMYSKC DV SA QVFDEMRERDVFSWNSMMS YVC
Sbjct: 122 GTVVYKDVIVFGVDSNLQVRNSVLDMYSKCGDVWSAIQVFDEMRERDVFSWNSMMSAYVC 181

Query: 20  NG 15
           NG
Sbjct: 182 NG 183


>ref|XP_017410302.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At1g17630 [Vigna angularis]
 gb|KOM29543.1| hypothetical protein LR48_Vigan727s000200 [Vigna angularis]
 dbj|BAT75575.1| hypothetical protein VIGAN_01345300 [Vigna angularis var.
           angularis]
          Length = 619

 Score =  249 bits (637), Expect = 5e-76
 Identities = 126/186 (67%), Positives = 147/186 (79%)
 Frame = -1

Query: 572 NPLFNFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAH 393
           N   +FNS ++ C   K++NQ KQLH    +L  +SH NPFFVTKLIQ YADC+DLRSA 
Sbjct: 3   NLTHSFNSVIRAC---KTLNQAKQLHN--CILQTASHRNPFFVTKLIQIYADCNDLRSAL 57

Query: 392 LLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFPKVLKACAQSA 213
            LL+QLS PN+FAFTSILSF+S+H H   CIQ YA+LR N ++PDGYVFPKVLKACAQ +
Sbjct: 58  ALLHQLSHPNVFAFTSILSFYSKHGHPHHCIQIYAKLRQNGVIPDGYVFPKVLKACAQLS 117

Query: 212 CLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMS 33
            L  G VV+KDV+VFG + +L+V NSVLDMYSKC DV SATQVFDEM ERDVFSWNSMMS
Sbjct: 118 HLGTGTVVYKDVIVFGAESNLQVRNSVLDMYSKCGDVWSATQVFDEMPERDVFSWNSMMS 177

Query: 32  CYVCNG 15
            YVCNG
Sbjct: 178 AYVCNG 183


>ref|XP_003551036.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g39350-like [Glycine max]
          Length = 619

 Score =  248 bits (632), Expect = 3e-75
 Identities = 129/183 (70%), Positives = 145/183 (79%)
 Frame = -1

Query: 560 NFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLN 381
           +FNS LQ C   K++NQ KQLH R  +LL  SH+N FFVTKLIQ YAD +DLRSA  LL+
Sbjct: 7   SFNSLLQAC---KTLNQAKQLHHR--ILLTGSHHNHFFVTKLIQIYADSNDLRSAVTLLH 61

Query: 380 QLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFPKVLKACAQSACLHV 201
           Q+S PN+FAFTSILSFHSRH    QCIQTYA+LR N +VPDGYVFPKVLKACAQ +    
Sbjct: 62  QISHPNVFAFTSILSFHSRHGLGHQCIQTYAELRRNGVVPDGYVFPKVLKACAQLSRFGS 121

Query: 200 GIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMSCYVC 21
           G  VHKDV+VFG + +L+V NSVLDMYSKC DVGSA QVFDEM ERDVFSWNSMMS YV 
Sbjct: 122 GRGVHKDVVVFGEESNLQVRNSVLDMYSKCGDVGSARQVFDEMSERDVFSWNSMMSGYVW 181

Query: 20  NGL 12
           NGL
Sbjct: 182 NGL 184


>ref|XP_015954901.1| pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Arachis duranensis]
 ref|XP_020993529.1| pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Arachis duranensis]
          Length = 641

 Score =  244 bits (622), Expect = 1e-73
 Identities = 116/180 (64%), Positives = 140/180 (77%)
 Frame = -1

Query: 545 LQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLNQLSQP 366
           LQ CSSSKS+ + KQ+HQ++++  + +H NPFF TKLIQ Y DCDD+ SA  LL+ L  P
Sbjct: 26  LQACSSSKSLTKAKQIHQQIII--NGTHRNPFFATKLIQLYIDCDDISSALFLLHHLHPP 83

Query: 365 NIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFPKVLKACAQSACLHVGIVVH 186
           N+FAFTSIL F+SRH   RQCI+TY  LR+  +VPDGYVFPKVLKACAQS     G+ VH
Sbjct: 84  NVFAFTSILRFYSRHGQMRQCIRTYVDLRVMGVVPDGYVFPKVLKACAQSQWFETGVAVH 143

Query: 185 KDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMSCYVCNGLFE 6
           KDV+ FG + +L+ CN+VLDMYSKC DV SA +VF+EM ERDVFSWNSMMS YV NGLFE
Sbjct: 144 KDVITFGSESNLQACNAVLDMYSKCADVQSAQKVFNEMSERDVFSWNSMMSGYVSNGLFE 203



 Score = 61.2 bits (147), Expect = 3e-07
 Identities = 40/183 (21%), Positives = 85/183 (46%)
 Frame = -1

Query: 560 NFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLN 381
           + +  L  C    S+    ++H   V ++    +       L+  YA+C  L  A  + +
Sbjct: 291 SLSGILVSCRFLGSLTSGNEVHCYGVKVISGDAFYKSAGAALLTLYANCGRLNDAENVFD 350

Query: 380 QLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFPKVLKACAQSACLHV 201
           ++ + ++  + +++        + + +Q + +++ + +  D      +L AC     L  
Sbjct: 351 RMDKSDVVTWNAMIYGLIDMGLANEAVQCFKEMQASNVKVDQTTVSTLLLACD----LRR 406

Query: 200 GIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMSCYVC 21
           G  +H  VL   ++  + VCN+++  YSKC  +  A  VF  M  RD+ SWN+++S +  
Sbjct: 407 GKEMHAYVLKHRYNWVIPVCNALIHTYSKCGCIAYAYSVFSTMAVRDLVSWNTIISGFGM 466

Query: 20  NGL 12
           +GL
Sbjct: 467 HGL 469


>ref|XP_016206998.1| pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Arachis ipaensis]
 ref|XP_020973997.1| pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Arachis ipaensis]
          Length = 609

 Score =  240 bits (613), Expect = 2e-72
 Identities = 115/180 (63%), Positives = 140/180 (77%)
 Frame = -1

Query: 545 LQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLNQLSQP 366
           LQ CSSSKS+ + KQ+HQ++++  + +H NPFF TKLIQ Y DCDD+ SA  LL+ L  P
Sbjct: 26  LQACSSSKSLTKAKQIHQQIII--NGTHRNPFFATKLIQLYIDCDDISSALFLLHHLHPP 83

Query: 365 NIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFPKVLKACAQSACLHVGIVVH 186
           N+FAFTSIL F+SRH   RQCI+TY  LR+  +VPDGYVFPKVLKACAQS     G+ VH
Sbjct: 84  NVFAFTSILRFYSRHGQMRQCIRTYVDLRVMGVVPDGYVFPKVLKACAQSRWFETGVAVH 143

Query: 185 KDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMSCYVCNGLFE 6
           KDV++FG + +L+  N+VLDMYSKC DV SA +VF+EM ERDVFSWNSMMS YV NGLFE
Sbjct: 144 KDVIIFGSESNLQARNAVLDMYSKCADVQSAQKVFNEMSERDVFSWNSMMSGYVSNGLFE 203



 Score = 61.2 bits (147), Expect = 3e-07
 Identities = 40/183 (21%), Positives = 85/183 (46%)
 Frame = -1

Query: 560 NFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLN 381
           + +  L  C    S+    ++H   V ++    +       L+  YA+C  L  A  + +
Sbjct: 291 SLSGILVSCRFLGSLTSGNEVHCYGVKVISGDAFYKSAGAALLTLYANCGRLNDAENVFD 350

Query: 380 QLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFPKVLKACAQSACLHV 201
           ++ + ++  + +++        + + +Q + +++ + +  D      +L AC     L  
Sbjct: 351 RMDKSDVVTWNAMIYGLIDMGLANEAVQCFKEMQASNVKVDQTTVSTLLLACD----LRR 406

Query: 200 GIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMSCYVC 21
           G  +H  VL   ++  + VCN+++  YSKC  +  A  VF  M  RD+ SWN+++S +  
Sbjct: 407 GKEMHAYVLKHRYNWVIPVCNALIHTYSKCGCIAYAYSVFSTMAVRDLVSWNTIISGFGM 466

Query: 20  NGL 12
           +GL
Sbjct: 467 HGL 469


>gb|ONH96536.1| hypothetical protein PRUPE_7G135300 [Prunus persica]
          Length = 646

 Score =  224 bits (572), Expect = 3e-66
 Identities = 114/216 (52%), Positives = 155/216 (71%), Gaps = 1/216 (0%)
 Frame = -1

Query: 656 MKPWLRSHTATLITQRHISSLSSSTNTQNPL-FNFNSFLQPCSSSKSINQVKQLHQRLVV 480
           M+ W R+H A         S+ S+TN++ P       +LQ CS+SKS+NQ K +HQ+++ 
Sbjct: 1   MRVW-RAHRAI--------SILSTTNSKAPSPSELGVYLQLCSNSKSLNQGKHVHQKIIQ 51

Query: 479 LLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCI 300
                  NPF VTKL+Q YADCDDL S+  L + L +PN+FA+T+IL F+SRH    +C+
Sbjct: 52  C--GLDQNPFIVTKLVQMYADCDDLVSSWKLFDNLLKPNVFAWTAILGFYSRHGMHEECV 109

Query: 299 QTYAQLRLNAIVPDGYVFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMY 120
           + Y ++ LN ++PDGYVFPKVL+ACAQ   L VGIVVHKDV++ G + +L+VCNS++DMY
Sbjct: 110 RAYVEMILNDVLPDGYVFPKVLRACAQLLRLKVGIVVHKDVIICGLNLNLQVCNSLIDMY 169

Query: 119 SKCRDVGSATQVFDEMRERDVFSWNSMMSCYVCNGL 12
           SKC D+GSA +VFDEM  RD++SWNSM+S YVCNGL
Sbjct: 170 SKCEDIGSAKRVFDEMVGRDLWSWNSMISGYVCNGL 205



 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 49/214 (22%), Positives = 93/214 (43%), Gaps = 6/214 (2%)
 Frame = -1

Query: 638 SHTATLITQRHISSLSSSTNTQNPLFNFNSFLQPCSSSKSINQVKQLH------QRLVVL 477
           SH A+L   R    +  S+     L + ++ L  C    S+   K++H      +  +  
Sbjct: 271 SHEASL---RIFRDMIGSSMVDPDLDSLSTVLVSCRHLGSLLNGKEIHGYGIKRESGIAF 327

Query: 476 LHSSHYNPFFVTKLIQTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQ 297
            HS+         L+  YA+C  +  A  +   ++  ++ ++ +++            + 
Sbjct: 328 YHSAG------PALLTMYANCRRIHDATNVFKLMNPAHVVSWNAMILGFIDLGLEDLALD 381

Query: 296 TYAQLRLNAIVPDGYVFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYS 117
           ++ +++   I  D      +L AC     L  G  +H  +    +D  + V N+++ MYS
Sbjct: 382 SFRRMQRARINVDQTTISTILPACN----LKFGKQIHAFIRKISFDLVVPVWNALIHMYS 437

Query: 116 KCRDVGSATQVFDEMRERDVFSWNSMMSCYVCNG 15
           KC  +GSA  VF  M  RD+ SWNSM+  +  +G
Sbjct: 438 KCGCIGSAYSVFSNMINRDLVSWNSMIGGFGMHG 471


>gb|KYP42158.1| Pentatricopeptide repeat-containing protein At3g12770 family
           [Cajanus cajan]
          Length = 549

 Score =  222 bits (566), Expect = 4e-66
 Identities = 111/183 (60%), Positives = 140/183 (76%)
 Frame = -1

Query: 560 NFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLN 381
           +F S LQ C   K++ Q K+LH   ++LL +SH+NPFF+TKL Q YADCDDLRSA  L  
Sbjct: 7   SFTSLLQSC---KTLKQAKKLHA--LILLTASHHNPFFITKLTQIYADCDDLRSALALT- 60

Query: 380 QLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFPKVLKACAQSACLHV 201
            L +PN+FAFT+IL FHSRH+H+ QCI TYA+LR NA+VPD YVFPKVLKACA+ +    
Sbjct: 61  -LLRPNVFAFTAILYFHSRHAHAHQCILTYARLRQNAVVPDNYVFPKVLKACAKLSLFVT 119

Query: 200 GIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMSCYVC 21
           G V+HKDV+ FG + ++ V NS+L MY+KC D+ SA +VF EM +RDVFSWNS+MS YVC
Sbjct: 120 GTVIHKDVVAFGSESNVHVRNSLLGMYAKCGDMASAERVFGEMPQRDVFSWNSVMSGYVC 179

Query: 20  NGL 12
           NGL
Sbjct: 180 NGL 182


>ref|XP_020424413.1| pentatricopeptide repeat-containing protein At5g39350 isoform X1
           [Prunus persica]
          Length = 699

 Score =  224 bits (572), Expect = 9e-66
 Identities = 114/216 (52%), Positives = 155/216 (71%), Gaps = 1/216 (0%)
 Frame = -1

Query: 656 MKPWLRSHTATLITQRHISSLSSSTNTQNPL-FNFNSFLQPCSSSKSINQVKQLHQRLVV 480
           M+ W R+H A         S+ S+TN++ P       +LQ CS+SKS+NQ K +HQ+++ 
Sbjct: 54  MRVW-RAHRAI--------SILSTTNSKAPSPSELGVYLQLCSNSKSLNQGKHVHQKIIQ 104

Query: 479 LLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCI 300
                  NPF VTKL+Q YADCDDL S+  L + L +PN+FA+T+IL F+SRH    +C+
Sbjct: 105 C--GLDQNPFIVTKLVQMYADCDDLVSSWKLFDNLLKPNVFAWTAILGFYSRHGMHEECV 162

Query: 299 QTYAQLRLNAIVPDGYVFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMY 120
           + Y ++ LN ++PDGYVFPKVL+ACAQ   L VGIVVHKDV++ G + +L+VCNS++DMY
Sbjct: 163 RAYVEMILNDVLPDGYVFPKVLRACAQLLRLKVGIVVHKDVIICGLNLNLQVCNSLIDMY 222

Query: 119 SKCRDVGSATQVFDEMRERDVFSWNSMMSCYVCNGL 12
           SKC D+GSA +VFDEM  RD++SWNSM+S YVCNGL
Sbjct: 223 SKCEDIGSAKRVFDEMVGRDLWSWNSMISGYVCNGL 258



 Score = 58.2 bits (139), Expect = 3e-06
 Identities = 49/214 (22%), Positives = 93/214 (43%), Gaps = 6/214 (2%)
 Frame = -1

Query: 638 SHTATLITQRHISSLSSSTNTQNPLFNFNSFLQPCSSSKSINQVKQLH------QRLVVL 477
           SH A+L   R    +  S+     L + ++ L  C    S+   K++H      +  +  
Sbjct: 324 SHEASL---RIFRDMIGSSMVDPDLDSLSTVLVSCRHLGSLLNGKEIHGYGIKRESGIAF 380

Query: 476 LHSSHYNPFFVTKLIQTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQ 297
            HS+         L+  YA+C  +  A  +   ++  ++ ++ +++            + 
Sbjct: 381 YHSAG------PALLTMYANCRRIHDATNVFKLMNPAHVVSWNAMILGFIDLGLEDLALD 434

Query: 296 TYAQLRLNAIVPDGYVFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYS 117
           ++ +++   I  D      +L AC     L  G  +H  +    +D  + V N+++ MYS
Sbjct: 435 SFRRMQRARINVDQTTISTILPACN----LKFGKQIHAFIRKISFDLVVPVWNALIHMYS 490

Query: 116 KCRDVGSATQVFDEMRERDVFSWNSMMSCYVCNG 15
           KC  +GSA  VF  M  RD+ SWNSM+  +  +G
Sbjct: 491 KCGCIGSAYSVFSNMINRDLVSWNSMIGGFGMHG 524


>ref|XP_020239719.1| pentatricopeptide repeat-containing protein At5g39350-like [Cajanus
           cajan]
          Length = 614

 Score =  222 bits (566), Expect = 1e-65
 Identities = 111/183 (60%), Positives = 140/183 (76%)
 Frame = -1

Query: 560 NFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTYADCDDLRSAHLLLN 381
           +F S LQ C   K++ Q K+LH   ++LL +SH+NPFF+TKL Q YADCDDLRSA  L  
Sbjct: 7   SFTSLLQSC---KTLKQAKKLHA--LILLTASHHNPFFITKLTQIYADCDDLRSALALT- 60

Query: 380 QLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFPKVLKACAQSACLHV 201
            L +PN+FAFT+IL FHSRH+H+ QCI TYA+LR NA+VPD YVFPKVLKACA+ +    
Sbjct: 61  -LLRPNVFAFTAILYFHSRHAHAHQCILTYARLRQNAVVPDNYVFPKVLKACAKLSLFVT 119

Query: 200 GIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRERDVFSWNSMMSCYVC 21
           G V+HKDV+ FG + ++ V NS+L MY+KC D+ SA +VF EM +RDVFSWNS+MS YVC
Sbjct: 120 GTVIHKDVVAFGSESNVHVRNSLLGMYAKCGDMASAERVFGEMPQRDVFSWNSVMSGYVC 179

Query: 20  NGL 12
           NGL
Sbjct: 180 NGL 182


>ref|XP_002277337.2| PREDICTED: pentatricopeptide repeat-containing protein
           At5g39350-like [Vitis vinifera]
          Length = 634

 Score =  223 bits (567), Expect = 2e-65
 Identities = 110/203 (54%), Positives = 147/203 (72%)
 Frame = -1

Query: 614 QRHISSLSSSTNTQNPLFNFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKL 435
           +R ISSL +S       F  N  LQ CS+SK+++Q KQLHQ ++  L    ++PF +TKL
Sbjct: 7   KRAISSLPTSNPNLLSSFQLNHLLQLCSNSKALHQGKQLHQHII--LCGLDHHPFMLTKL 64

Query: 434 IQTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDG 255
           +Q YADC DL SA  L ++LSQPN+FA+T+IL F+SR+  S +C++TY++++L  ++PD 
Sbjct: 65  VQMYADCGDLGSAQALFDKLSQPNVFAWTAILGFYSRNGLSDECVRTYSEMKLKGVLPDK 124

Query: 254 YVFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDE 75
           YVFPKV +AC Q   L VGI VHKDV++ G +  L+VCNS++DMYSK  DVGS  +VFDE
Sbjct: 125 YVFPKVFRACGQLLWLEVGIQVHKDVVICGCEFDLQVCNSLIDMYSKSGDVGSGRRVFDE 184

Query: 74  MRERDVFSWNSMMSCYVCNGLFE 6
           M ERDV SWNSM+S YVCNG  E
Sbjct: 185 MVERDVLSWNSMISGYVCNGFLE 207


>ref|XP_008344308.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like isoform X1 [Malus domestica]
          Length = 690

 Score =  219 bits (558), Expect = 8e-64
 Identities = 104/197 (52%), Positives = 148/197 (75%), Gaps = 1/197 (0%)
 Frame = -1

Query: 599 SLSSSTNTQNPL-FNFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTY 423
           S+ S+ N+++P     N +LQ C +SKS+N+ KQ HQ  +++ +    NPF VTKL+Q Y
Sbjct: 63  SILSTINSKSPSPSQLNLYLQLCCNSKSLNKGKQAHQ--MIIEYGFDXNPFLVTKLVQMY 120

Query: 422 ADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFP 243
           ADCDDL SA  L ++L  PN+FA+T+IL F+SRH    +C++ Y ++ L  ++PDGYVFP
Sbjct: 121 ADCDDLVSAWKLFDKLLNPNVFAWTAILGFYSRHGMYEKCVRAYGEMILKGVLPDGYVFP 180

Query: 242 KVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRER 63
           KVLKAC+Q + + VG +VHKDV++ G++ +++VCNS++DMYSKC+DV SA QVFDEM ER
Sbjct: 181 KVLKACSQLSSVKVGFLVHKDVIIRGFELNVQVCNSLIDMYSKCKDVRSAKQVFDEMVER 240

Query: 62  DVFSWNSMMSCYVCNGL 12
           D+ SWN M+S YVCNG+
Sbjct: 241 DJLSWNFMISGYVCNGM 257


>ref|XP_009366958.1| PREDICTED: pentatricopeptide repeat-containing protein At5g39350
           isoform X1 [Pyrus x bretschneideri]
          Length = 690

 Score =  219 bits (557), Expect = 1e-63
 Identities = 103/197 (52%), Positives = 150/197 (76%), Gaps = 1/197 (0%)
 Frame = -1

Query: 599 SLSSSTNTQNPL-FNFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTY 423
           S+ S+ N+++P     N +LQ CS+SKS+N+ KQ HQ  +++ +  + NPF VTKL+Q Y
Sbjct: 63  SILSTINSKSPSPSQLNLYLQLCSNSKSLNKGKQAHQ--MIIEYGLNKNPFLVTKLVQMY 120

Query: 422 ADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFP 243
           ADCDDL SA  L ++L  PN+FA+T+IL F+SRH    +C++ Y ++ L  ++PDGYVFP
Sbjct: 121 ADCDDLVSAWKLFDKLLNPNVFAWTAILGFYSRHGMYEECVRAYGEMILKGVLPDGYVFP 180

Query: 242 KVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRER 63
           KVLKAC+Q + + VG +VHK+V++ G++ +++VCNS++DMYSKC+DV SA QVFDEM ER
Sbjct: 181 KVLKACSQLSSVKVGFLVHKEVIIRGFELNVQVCNSLIDMYSKCKDVRSAKQVFDEMVER 240

Query: 62  DVFSWNSMMSCYVCNGL 12
           D+ SWN ++S YVCNG+
Sbjct: 241 DLLSWNFLISGYVCNGM 257


>ref|XP_008340659.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g39350-like [Malus domestica]
          Length = 817

 Score =  219 bits (558), Expect = 5e-63
 Identities = 104/197 (52%), Positives = 148/197 (75%), Gaps = 1/197 (0%)
 Frame = -1

Query: 599 SLSSSTNTQNPL-FNFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLIQTY 423
           S+ S+ N+++P     N +LQ C +SKS+N+ KQ HQ  +++ +    NPF VTKL+Q Y
Sbjct: 63  SILSTINSKSPSPSQLNLYLQLCCNSKSLNKGKQAHQ--MIIEYGFDENPFLVTKLVQMY 120

Query: 422 ADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGYVFP 243
           ADCDDL SA  L ++L  PN+FA+T+IL F+SRH    +C++ Y ++ L  ++PDGYVFP
Sbjct: 121 ADCDDLVSAWXLFDKLLNPNVFAWTAILGFYSRHGMYEKCVRAYGEMILKGVLPDGYVFP 180

Query: 242 KVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEMRER 63
           KVLKAC+Q + + VG +VHKDV++ G++ +++VCNS++DMYSKC+DV SA QVFDEM ER
Sbjct: 181 KVLKACSQLSSVKVGFLVHKDVIIRGFELNVQVCNSLIDMYSKCKDVRSAKQVFDEMVER 240

Query: 62  DVFSWNSMMSCYVCNGL 12
           D+ SWN M+S YVCNG+
Sbjct: 241 DJLSWNFMISGYVCNGM 257


>ref|XP_021274359.1| pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Herrania umbratica]
          Length = 635

 Score =  213 bits (542), Expect = 7e-62
 Identities = 100/202 (49%), Positives = 146/202 (72%)
 Frame = -1

Query: 611 RHISSLSSSTNTQNPLFNFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLI 432
           R   +LS+ +  +  L   N+ L  CS SKS+NQ KQ+H +++   + SH N F +TKL+
Sbjct: 5   RSQGALSTLSKPRISLSQLNNLLHLCSKSKSLNQGKQIHPQIIS--NGSHQNTFIITKLV 62

Query: 431 QTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGY 252
           Q YADCDDL SA+ L ++L QPN+F++T+IL+F+S+H   ++CI++Y +++++ ++PDGY
Sbjct: 63  QMYADCDDLVSANKLFDRLPQPNVFSWTAILAFYSKHGMYKKCIESYCEMKMSGVLPDGY 122

Query: 251 VFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEM 72
           VFPKVL+A  +  CL  GI VHKDV+V G +  L VCNS++DMY +C D+ SA QVF+EM
Sbjct: 123 VFPKVLRASVEGLCLETGICVHKDVIVCGCEFYLEVCNSLIDMYGRCGDLTSARQVFNEM 182

Query: 71  RERDVFSWNSMMSCYVCNGLFE 6
            ERD+ SWN M+S YV NG+ E
Sbjct: 183 VERDLLSWNLMISGYVGNGMLE 204


>ref|XP_007047218.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic [Theobroma cacao]
 gb|EOX91375.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
           cacao]
          Length = 635

 Score =  212 bits (540), Expect = 1e-61
 Identities = 101/202 (50%), Positives = 145/202 (71%)
 Frame = -1

Query: 611 RHISSLSSSTNTQNPLFNFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTKLI 432
           R   +LS+ +  +  L   N+ LQ CS SKS++Q KQ+H +++   + SH N F +TKL+
Sbjct: 5   RSQGALSTLSKPRISLSQLNNLLQLCSKSKSLSQGKQIHPQIIS--NGSHQNTFIITKLV 62

Query: 431 QTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPDGY 252
           Q YADCDDL SA+ L ++L QPN+F++T+IL  +SRH   R+CI++Y +++++ ++PDG+
Sbjct: 63  QMYADCDDLVSANKLFDRLPQPNVFSWTAILGLYSRHGMYRKCIESYCEMKMSGVLPDGF 122

Query: 251 VFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFDEM 72
           VFPKVL+A  Q  CL  GI VHKDV+V G +  L VCNS++DMY +C D+ SA +VFDEM
Sbjct: 123 VFPKVLRASVQGLCLETGICVHKDVIVCGCEFYLEVCNSLIDMYGRCGDLTSARRVFDEM 182

Query: 71  RERDVFSWNSMMSCYVCNGLFE 6
             RD+FSWN M+S YV NG+ E
Sbjct: 183 VGRDLFSWNLMISGYVGNGMLE 204


>ref|XP_015890127.1| PREDICTED: pentatricopeptide repeat-containing protein DOT4,
           chloroplastic-like [Ziziphus jujuba]
          Length = 635

 Score =  211 bits (537), Expect = 4e-61
 Identities = 104/206 (50%), Positives = 143/206 (69%)
 Frame = -1

Query: 623 LITQRHISSLSSSTNTQNPLFNFNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFV 444
           ++  R  SS S++       F  N  LQ C SSK++N  KQ+HQ+++        NPF V
Sbjct: 1   MVVVRATSSPSTTNFGGYTCFQLNRLLQICCSSKTLNHGKQVHQQIIQ--GGLGRNPFLV 58

Query: 443 TKLIQTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIV 264
           TKL+Q YADCD L SA +L +QLSQPN+FA+T+I+ F+SRH   ++C++TYA++ L  + 
Sbjct: 59  TKLVQMYADCDHLLSARILFDQLSQPNVFAWTAIIGFYSRHGMYQKCVRTYAEMSLMGVS 118

Query: 263 PDGYVFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQV 84
           PD YVFPKVLK CAQS+CL  G+ +HKDV+  G++ S  VCNS+++MYSKC DV +A +V
Sbjct: 119 PDEYVFPKVLKVCAQSSCLKAGMQIHKDVITSGFEFSSEVCNSLIEMYSKCMDVQNAKRV 178

Query: 83  FDEMRERDVFSWNSMMSCYVCNGLFE 6
           FD +  RD+ SWN M+S YV NGL E
Sbjct: 179 FDVIVGRDLLSWNLMISGYVYNGLLE 204


>gb|KRH04632.1| hypothetical protein GLYMA_17G175800 [Glycine max]
          Length = 456

 Score =  207 bits (526), Expect = 4e-61
 Identities = 105/142 (73%), Positives = 116/142 (81%)
 Frame = -1

Query: 437 LIQTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPD 258
           LIQ YAD +DLRSA  LL+Q+S PN+FAFTSILSFHSRH    QCIQTYA+LR N +VPD
Sbjct: 22  LIQIYADSNDLRSAVTLLHQISHPNVFAFTSILSFHSRHGLGHQCIQTYAELRRNGVVPD 81

Query: 257 GYVFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFD 78
           GYVFPKVLKACAQ +    G  VHKDV+VFG + +L+V NSVLDMYSKC DVGSA QVFD
Sbjct: 82  GYVFPKVLKACAQLSRFGSGRGVHKDVVVFGEESNLQVRNSVLDMYSKCGDVGSARQVFD 141

Query: 77  EMRERDVFSWNSMMSCYVCNGL 12
           EM ERDVFSWNSMMS YV NGL
Sbjct: 142 EMSERDVFSWNSMMSGYVWNGL 163


>ref|XP_024183906.1| putative pentatricopeptide repeat-containing protein At3g23330
           isoform X1 [Rosa chinensis]
          Length = 666

 Score =  209 bits (533), Expect = 2e-60
 Identities = 105/202 (51%), Positives = 146/202 (72%), Gaps = 1/202 (0%)
 Frame = -1

Query: 614 QRHISSLSSSTNTQNPLFN-FNSFLQPCSSSKSINQVKQLHQRLVVLLHSSHYNPFFVTK 438
           QR IS LS  TN++ P  +  +  LQ CS+SKS+NQ KQ HQ+++        NPF VTK
Sbjct: 37  QRGISILS--TNSKPPSSSKLHYCLQLCSNSKSLNQGKQTHQKIIQCQLGK--NPFLVTK 92

Query: 437 LIQTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPD 258
           L+Q YADCDDL SA  L ++L +PN+FA+T+IL F+SRH    +C+  Y ++ L  ++PD
Sbjct: 93  LVQMYADCDDLASARKLFDELLEPNVFAWTAILGFYSRHGMYEECVGAYGEMILRGVLPD 152

Query: 257 GYVFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFD 78
           GYVFPKVL+ACA  + L VGI VHKDV++ G++ +++VCNS+++MYSKC D+  A QVFD
Sbjct: 153 GYVFPKVLRACAHFSSLKVGIRVHKDVIISGFEINVQVCNSLIEMYSKCGDIMCAKQVFD 212

Query: 77  EMRERDVFSWNSMMSCYVCNGL 12
           EM  RD+ +WN ++S YVCNG+
Sbjct: 213 EMVGRDLLTWNLIISGYVCNGM 234



 Score = 60.1 bits (144), Expect = 7e-07
 Identities = 37/142 (26%), Positives = 66/142 (46%)
 Frame = -1

Query: 437 LIQTYADCDDLRSAHLLLNQLSQPNIFAFTSILSFHSRHSHSRQCIQTYAQLRLNAIVPD 258
           L+  YA+C  ++ A  +   +      ++ +++            ++ + ++++  I  D
Sbjct: 364 LLTMYANCRKIQDAENVFRFMDPAQAVSWNAMILGFIDLGLEDLALECFRKMQIAEIKLD 423

Query: 257 GYVFPKVLKACAQSACLHVGIVVHKDVLVFGWDPSLRVCNSVLDMYSKCRDVGSATQVFD 78
                 VL  C     L  G  +H  +    +D  + V N+++ MYSKC  +G+A  VF 
Sbjct: 424 QTTLSTVLPTCN----LKFGKQIHAFIRKSSFDLVVPVWNALIHMYSKCGCIGAAYSVFS 479

Query: 77  EMRERDVFSWNSMMSCYVCNGL 12
            M  RD+ SWNSMM  +  NGL
Sbjct: 480 NMLNRDLVSWNSMMGGFAMNGL 501


Top