BLASTX nr result

ID: Sinomenium22_contig00035027

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Sinomenium22_contig00035027
         (854 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006603878.1| PREDICTED: pentatricopeptide repeat-containi...   106   4e-36
ref|XP_004489421.1| PREDICTED: pentatricopeptide repeat-containi...   104   2e-35
ref|XP_002873720.1| pentatricopeptide repeat-containing protein ...    92   6e-32
ref|XP_006400058.1| hypothetical protein EUTSA_v10012971mg [Eutr...    89   7e-32
ref|NP_197038.1| pentatricopeptide repeat-containing protein [Ar...    90   2e-31
ref|XP_006290195.1| hypothetical protein CARUB_v10003880mg [Caps...    79   4e-28
ref|XP_002281711.1| PREDICTED: pentatricopeptide repeat-containi...    78   1e-26
emb|CBI28140.3| unnamed protein product [Vitis vinifera]               78   1e-26
gb|EYU19608.1| hypothetical protein MIMGU_mgv1a003193mg [Mimulus...    70   1e-26
ref|XP_003520419.1| PREDICTED: pentatricopeptide repeat-containi...    74   9e-25
ref|XP_007206274.1| hypothetical protein PRUPE_ppa016070mg [Prun...    71   1e-24
ref|XP_006601143.1| PREDICTED: pentatricopeptide repeat-containi...    79   2e-24
ref|XP_007161217.1| hypothetical protein PHAVU_001G0518001g, par...    77   2e-24
ref|XP_004971712.1| PREDICTED: putative pentatricopeptide repeat...    70   3e-24
ref|XP_004291465.1| PREDICTED: pentatricopeptide repeat-containi...    76   3e-24
ref|XP_002314675.1| pentatricopeptide repeat-containing family p...    74   2e-23
gb|EXB96783.1| hypothetical protein L484_001891 [Morus notabilis]      75   2e-23
ref|XP_007225539.1| hypothetical protein PRUPE_ppa026705mg [Prun...    75   4e-23
gb|AFW80179.1| hypothetical protein ZEAMMB73_142662 [Zea mays]         69   6e-23
ref|XP_007032614.1| Pentatricopeptide repeat (PPR) superfamily p...   113   8e-23

>ref|XP_006603878.1| PREDICTED: pentatricopeptide repeat-containing protein At5g15340,
           mitochondrial-like [Glycine max]
          Length = 706

 Score =  106 bits (265), Expect(2) = 4e-36
 Identities = 68/160 (42%), Positives = 91/160 (56%), Gaps = 6/160 (3%)
 Frame = +1

Query: 1   EIVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAG 180
           E+VLGSLL AC  HGKL+LGE ++REL+QM+                  +A+      A 
Sbjct: 494 EVVLGSLLGACYAHGKLRLGEKIMRELVQMDPLNTEYHILLSNMYALCGKAD-----KAN 548

Query: 181 SQASG-N*KGTRR---MSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVP 348
           S       +G R+   MSSI  +GQ+H+FIAGDKSHP+  D+Y+ LD+MI +LR AGYVP
Sbjct: 549 SLRKVLKNRGIRKVPGMSSIYVDGQLHRFIAGDKSHPRTADIYMKLDDMICKLRLAGYVP 608

Query: 349 NIVSKIFCF*KYVDDAHEV--EEEQMLFFHSDLVCPCTSL 462
           N   ++       DD  E   E EQ+LF HS+ +  C  L
Sbjct: 609 NTNCQVLFGCSNGDDCMEAFEEVEQVLFTHSEKLALCFGL 648



 Score = 72.4 bits (176), Expect(2) = 4e-36
 Identities = 33/47 (70%), Positives = 37/47 (78%)
 Frame = +3

Query: 447 PLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           PL IFKNLRICQDCHSA+K+ S IYKREIV+RD+ RFH F Q    C
Sbjct: 656 PLCIFKNLRICQDCHSAIKIASDIYKREIVVRDRYRFHSFKQGSCSC 702


>ref|XP_004489421.1| PREDICTED: pentatricopeptide repeat-containing protein At5g15340,
           mitochondrial-like [Cicer arietinum]
          Length = 635

 Score =  104 bits (260), Expect(2) = 2e-35
 Identities = 64/156 (41%), Positives = 86/156 (55%), Gaps = 2/156 (1%)
 Frame = +1

Query: 1   EIVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAG 180
           E+VLGSLL +C  HGKL+LGE ++REL++M+                  + +        
Sbjct: 423 EVVLGSLLGSCYAHGKLKLGEKIMRELVEMDPFNTEYHIVLSNMYALSGKVDKANSLRKV 482

Query: 181 SQASGN*KGTRRMSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPNIVS 360
            +  G  K    MSSI A+GQ+HQFIAGDKSH +  ++Y+ LDEMI RLR  GYVPN   
Sbjct: 483 LKKKGI-KKAPGMSSIYADGQLHQFIAGDKSHTRTSEIYMKLDEMICRLRFGGYVPNTSC 541

Query: 361 KIFCF*KYVDDAHEV--EEEQMLFFHSDLVCPCTSL 462
           ++       DD  E   E EQ+LF HS+ +  C  L
Sbjct: 542 QVLFGCSNRDDYSEALEEVEQVLFTHSEKLALCFGL 577



 Score = 72.4 bits (176), Expect(2) = 2e-35
 Identities = 31/49 (63%), Positives = 38/49 (77%)
 Frame = +3

Query: 441 GLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           G PL+IFKNLRICQDCHSA+K+ S +Y+REIV+RD+ RFH F      C
Sbjct: 583 GSPLYIFKNLRICQDCHSAIKIASDVYRREIVVRDRYRFHSFKNGSCSC 631


>ref|XP_002873720.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297319557|gb|EFH49979.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 623

 Score = 92.0 bits (227), Expect(2) = 6e-32
 Identities = 57/157 (36%), Positives = 86/157 (54%), Gaps = 3/157 (1%)
 Frame = +1

Query: 1   EIVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAG 180
           E+VLGSLL +C +HGKL++ E + RELIQM                   R++       G
Sbjct: 418 EVVLGSLLGSCSVHGKLEIAERIKRELIQMSPGHTEYQILMSNMYVAEGRSD----IADG 473

Query: 181 SQASGN*KGTRR---MSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPN 351
            + S   +G R+   +SSI  N  +H+F +GD+SHP+ ++VY+ L+E+I R+RSAGYVP+
Sbjct: 474 LRGSLRNRGIRKIPGLSSIYVNDSVHRFSSGDRSHPRTKEVYLKLNEVIERIRSAGYVPD 533

Query: 352 IVSKIFCF*KYVDDAHEVEEEQMLFFHSDLVCPCTSL 462
           I   +        +    E+EQ L  HS+ +  C  L
Sbjct: 534 ISGLV-----SPSEGDLEEKEQALCCHSEKLAVCFGL 565



 Score = 73.2 bits (178), Expect(2) = 6e-32
 Identities = 31/50 (62%), Positives = 39/50 (78%)
 Frame = +3

Query: 438 PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           P  PL +FKNLRIC+DCHSAMK++S +Y REI+IRD+NRFH+F      C
Sbjct: 570 PRTPLLVFKNLRICRDCHSAMKIVSKVYDREIIIRDRNRFHQFKGGSCSC 619


>ref|XP_006400058.1| hypothetical protein EUTSA_v10012971mg [Eutrema salsugineum]
           gi|557101148|gb|ESQ41511.1| hypothetical protein
           EUTSA_v10012971mg [Eutrema salsugineum]
          Length = 623

 Score = 89.4 bits (220), Expect(2) = 7e-32
 Identities = 55/157 (35%), Positives = 85/157 (54%), Gaps = 3/157 (1%)
 Frame = +1

Query: 1   EIVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAG 180
           E+VLGSLL +C +HGKL++ E + REL+QM                   R++       G
Sbjct: 418 EVVLGSLLGSCSVHGKLEIAERVKRELVQMSPDNMGYQILVSNMYAAEGRSD----VVDG 473

Query: 181 SQASGN*KGTRR---MSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPN 351
            + S   +G ++   MSSI  NG I++F +GD+SHP+ +++Y+ L+E+I R+RSAGYVP 
Sbjct: 474 LRGSMRNQGLKKIPGMSSIHLNGSIYRFSSGDRSHPRTKEIYLKLNEVIERIRSAGYVPG 533

Query: 352 IVSKIFCF*KYVDDAHEVEEEQMLFFHSDLVCPCTSL 462
           +   I        +    + EQ L  HS+ +  C  L
Sbjct: 534 VSGLI-----SPSEGDLEDNEQALSCHSEKLAVCFGL 565



 Score = 75.5 bits (184), Expect(2) = 7e-32
 Identities = 32/50 (64%), Positives = 40/50 (80%)
 Frame = +3

Query: 438 PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           P  PL++FKNLRIC+DCHSAMK++S IY REI+IRD+NRFH+F      C
Sbjct: 570 PRTPLYVFKNLRICRDCHSAMKIVSKIYDREIIIRDRNRFHQFKGGSCSC 619


>ref|NP_197038.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75180838|sp|Q9LXE8.1|PP386_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g15340, mitochondrial; Flags: Precursor
           gi|7671503|emb|CAB89344.1| putative protein [Arabidopsis
           thaliana] gi|332004768|gb|AED92151.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 623

 Score = 89.7 bits (221), Expect(2) = 2e-31
 Identities = 54/157 (34%), Positives = 86/157 (54%), Gaps = 3/157 (1%)
 Frame = +1

Query: 1   EIVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAG 180
           E+VLGSLL +C +HGK+++ E + RELIQM                   R++       G
Sbjct: 418 EVVLGSLLGSCSVHGKVEIAERIKRELIQMSPGNTEYQILMSNMYVAEGRSD----IADG 473

Query: 181 SQASGN*KGTRR---MSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPN 351
            + S   +G R+   +SSI  N  +H+F +GD+SHP+ +++Y+ L+E+I R+RSAGYVP+
Sbjct: 474 LRGSLRKRGIRKIPGLSSIYVNDSVHRFSSGDRSHPRTKEIYLKLNEVIERIRSAGYVPD 533

Query: 352 IVSKIFCF*KYVDDAHEVEEEQMLFFHSDLVCPCTSL 462
           +   +        +    E+EQ L  HS+ +  C  L
Sbjct: 534 VSGLV-----SHSEGDLEEKEQALCCHSEKLAVCFGL 565



 Score = 73.9 bits (180), Expect(2) = 2e-31
 Identities = 31/50 (62%), Positives = 39/50 (78%)
 Frame = +3

Query: 438 PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           P  PL +FKNLRIC+DCHSAMK++S +Y REI+IRD+NRFH+F      C
Sbjct: 570 PSTPLLVFKNLRICRDCHSAMKIVSKVYDREIIIRDRNRFHQFKGGSCSC 619


>ref|XP_006290195.1| hypothetical protein CARUB_v10003880mg [Capsella rubella]
           gi|482558901|gb|EOA23093.1| hypothetical protein
           CARUB_v10003880mg [Capsella rubella]
          Length = 624

 Score = 79.0 bits (193), Expect(2) = 4e-28
 Identities = 51/157 (32%), Positives = 82/157 (52%), Gaps = 3/157 (1%)
 Frame = +1

Query: 1   EIVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAG 180
           E+VLGSLL +C +H KL + E + +EL+QM                   R++       G
Sbjct: 419 EVVLGSLLGSCSVHRKLDIAERIKQELVQMNPGNTECQILMSNMYVAEGRSD----IADG 474

Query: 181 SQASGN*KGTRR---MSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPN 351
            + S   +G R+   +SSI  N  +H+F +GD+SHP+ + +Y+ L+E+I R+RSAGYV +
Sbjct: 475 LRRSLRNRGIRKIPGLSSIYVNDSVHRFSSGDRSHPRTKVIYLKLNEVIERIRSAGYVLD 534

Query: 352 IVSKIFCF*KYVDDAHEVEEEQMLFFHSDLVCPCTSL 462
           +   +        +    E+EQ L  HS+ +  C  L
Sbjct: 535 VSGLV-----SHSEGDLEEKEQALCCHSEKLAVCFGL 566



 Score = 73.2 bits (178), Expect(2) = 4e-28
 Identities = 31/50 (62%), Positives = 39/50 (78%)
 Frame = +3

Query: 438 PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           P  PL +FKNLRIC+DCHSAMK++S +Y REI+IRD+NRFH+F      C
Sbjct: 571 PRTPLLVFKNLRICRDCHSAMKIVSKVYDREIIIRDRNRFHQFKGGSCSC 620


>ref|XP_002281711.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
           chloroplastic [Vitis vinifera]
          Length = 711

 Score = 77.8 bits (190), Expect(2) = 1e-26
 Identities = 33/58 (56%), Positives = 42/58 (72%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           A  +LS  PG P+ + KNLR+C DCHSAMK IS +Y REI++RD+NRFH FT+    C
Sbjct: 650 AFGLLSTTPGTPIRVVKNLRVCSDCHSAMKFISEVYNREIIVRDRNRFHHFTKGSCSC 707



 Score = 69.3 bits (168), Expect(2) = 1e-26
 Identities = 47/146 (32%), Positives = 69/146 (47%), Gaps = 2/146 (1%)
 Frame = +1

Query: 7   VLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXF--GWRAE*G*FFPAG 180
           VL  LL+AC +HG L + E   ++LI+++                   W A         
Sbjct: 510 VLVGLLSACRIHGNLVVAERAAQQLIELDPKNGGTYVLLSNIYSSMKNWEAAKK---MRE 566

Query: 181 SQASGN*KGTRRMSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPNIVS 360
                N K     S+I   G +H+F+ GD SHPQ+ ++Y  LD+M+ RL+SAGYVP+   
Sbjct: 567 LMVERNIKKPPGCSAIEVGGVVHEFVKGDVSHPQSSEIYETLDDMMRRLKSAGYVPDKSE 626

Query: 361 KIFCF*KYVDDAHEVEEEQMLFFHSD 438
            +F       D  E E+E  L  HS+
Sbjct: 627 VLF-------DMDEKEKENELSLHSE 645


>emb|CBI28140.3| unnamed protein product [Vitis vinifera]
          Length = 580

 Score = 77.8 bits (190), Expect(2) = 1e-26
 Identities = 33/58 (56%), Positives = 42/58 (72%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           A  +LS  PG P+ + KNLR+C DCHSAMK IS +Y REI++RD+NRFH FT+    C
Sbjct: 519 AFGLLSTTPGTPIRVVKNLRVCSDCHSAMKFISEVYNREIIVRDRNRFHHFTKGSCSC 576



 Score = 69.3 bits (168), Expect(2) = 1e-26
 Identities = 47/146 (32%), Positives = 69/146 (47%), Gaps = 2/146 (1%)
 Frame = +1

Query: 7   VLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXF--GWRAE*G*FFPAG 180
           VL  LL+AC +HG L + E   ++LI+++                   W A         
Sbjct: 379 VLVGLLSACRIHGNLVVAERAAQQLIELDPKNGGTYVLLSNIYSSMKNWEAAKK---MRE 435

Query: 181 SQASGN*KGTRRMSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPNIVS 360
                N K     S+I   G +H+F+ GD SHPQ+ ++Y  LD+M+ RL+SAGYVP+   
Sbjct: 436 LMVERNIKKPPGCSAIEVGGVVHEFVKGDVSHPQSSEIYETLDDMMRRLKSAGYVPDKSE 495

Query: 361 KIFCF*KYVDDAHEVEEEQMLFFHSD 438
            +F       D  E E+E  L  HS+
Sbjct: 496 VLF-------DMDEKEKENELSLHSE 514


>gb|EYU19608.1| hypothetical protein MIMGU_mgv1a003193mg [Mimulus guttatus]
          Length = 601

 Score = 70.1 bits (170), Expect(3) = 1e-26
 Identities = 28/57 (49%), Positives = 40/57 (70%)
 Frame = +3

Query: 420 AVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYCG 590
           A ++  PG+P+ + KNLR+C+DCH A+K IS I +REI++RD NR+H F      CG
Sbjct: 542 ATINSGPGVPIRVTKNLRVCEDCHIAIKFISKITEREIIVRDTNRYHHFENGICSCG 598



 Score = 57.8 bits (138), Expect(3) = 1e-26
 Identities = 26/74 (35%), Positives = 44/74 (59%)
 Frame = +1

Query: 217 MSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPNIVSKIFCF*KYVDDA 396
           +S++   GQ+H F+AGD+SHPQ +++Y  LD +I +++  GYVP   + +        D 
Sbjct: 469 ISNVELRGQVHTFLAGDRSHPQCEEIYRELDVLIGKMKEIGYVPKTDTAL-------HDV 521

Query: 397 HEVEEEQMLFFHSD 438
            + E+E  L  HS+
Sbjct: 522 EDEEKENHLVVHSE 535



 Score = 39.3 bits (90), Expect(3) = 1e-26
 Identities = 16/44 (36%), Positives = 27/44 (61%)
 Frame = +2

Query: 83  FRWNPLNTEYHILLSYMYTLAGERNEANSFRQVLKRQGIRKVPG 214
           F+  P  + Y++LLS +Y  AG   +    R ++K +GI+K+PG
Sbjct: 425 FQLAPEKSGYYVLLSNIYAKAGRWKDVTLIRSIMKERGIKKIPG 468


>ref|XP_003520419.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g22690-like [Glycine max]
          Length = 801

 Score = 74.3 bits (181), Expect(2) = 9e-25
 Identities = 34/61 (55%), Positives = 45/61 (73%), Gaps = 1/61 (1%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQ-SFSYCG 590
           A  ++S  PG+P+ I KNLR+C DCH+A KL+S IY REI++RD+NRFH F + S S C 
Sbjct: 740 AYGLISTAPGVPIRIVKNLRVCDDCHNATKLLSKIYGREIIVRDRNRFHHFKEGSCSCCD 799

Query: 591 Y 593
           Y
Sbjct: 800 Y 800



 Score = 66.6 bits (161), Expect(2) = 9e-25
 Identities = 39/116 (33%), Positives = 58/116 (50%)
 Frame = +1

Query: 7   VLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAGSQ 186
           V GS LAAC LH  ++LGE+  ++ + +E                  R     +     +
Sbjct: 600 VFGSFLAACKLHKNIKLGEWAAKQFLSLEPHKSGYNVLMSNIYASANRWGDVAYIRRAMK 659

Query: 187 ASGN*KGTRRMSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPNI 354
             G  K    +SSI  NG +H+FI GD+ HP A+ VY ++DEM  +L  AGY P++
Sbjct: 660 DEGIVKEPG-VSSIEVNGLLHEFIMGDREHPDAKKVYEMIDEMREKLEDAGYTPDV 714


>ref|XP_007206274.1| hypothetical protein PRUPE_ppa016070mg [Prunus persica]
           gi|462401916|gb|EMJ07473.1| hypothetical protein
           PRUPE_ppa016070mg [Prunus persica]
          Length = 608

 Score = 70.9 bits (172), Expect(2) = 1e-24
 Identities = 48/148 (32%), Positives = 71/148 (47%), Gaps = 3/148 (2%)
 Frame = +1

Query: 4   IVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAGS 183
           IV G+LLAAC +H    L E   REL+++E                  R         G 
Sbjct: 406 IVWGALLAACKIHKNPNLAEVAARELLELEPQNCGYNILMSNIYAASNRWN----EVDGV 461

Query: 184 QASGN*KGTRR---MSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPNI 354
           +     +GT++   +SSI  NG +H FI GDK+HPQ + +Y +L EM  +L+ AGY PN 
Sbjct: 462 RKYMKDRGTKKEPGLSSIEVNGSVHDFIMGDKAHPQTRKIYEMLAEMTKKLKEAGYTPNT 521

Query: 355 VSKIFCF*KYVDDAHEVEEEQMLFFHSD 438
                     + +  E E+E  + +HS+
Sbjct: 522 S-------VVLQNIDEEEKETAVNYHSE 542



 Score = 69.7 bits (169), Expect(2) = 1e-24
 Identities = 29/59 (49%), Positives = 41/59 (69%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYCG 590
           A  ++S   G P+ I KNLR+C+DCH+A KL+S IY R +++RD+NRFH F   +  CG
Sbjct: 547 AFGLISTAAGTPIRIVKNLRVCEDCHTATKLLSKIYGRVMIVRDRNRFHHFRDGYCSCG 605


>ref|XP_006601143.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21065-like isoform X1 [Glycine max]
           gi|571538394|ref|XP_006601144.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g21065-like isoform X2 [Glycine max]
           gi|571538398|ref|XP_006601145.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g21065-like isoform X3 [Glycine max]
           gi|571538402|ref|XP_006601146.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g21065-like isoform X4 [Glycine max]
          Length = 615

 Score = 79.0 bits (193), Expect(2) = 2e-24
 Identities = 35/59 (59%), Positives = 42/59 (71%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYCG 590
           A A+LS  PG P+ I KNLR+C+DCHSA K IS +Y REIV+RD+NRFH F      CG
Sbjct: 554 AFALLSTPPGTPIRIVKNLRVCEDCHSATKFISKVYNREIVVRDRNRFHHFKNGLCSCG 612



 Score = 61.2 bits (147), Expect(2) = 2e-24
 Identities = 42/149 (28%), Positives = 74/149 (49%), Gaps = 3/149 (2%)
 Frame = +1

Query: 1   EIVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAG 180
           +++  S++ AC   G+L+LGE + +ELI+ E                  R E        
Sbjct: 412 QVIWRSIVTACHARGELKLGESVAKELIRREPSHESNYVLLSNIYAKLLRWE----KKTK 467

Query: 181 SQASGN*KGTRRM---SSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPN 351
            +   + KG R++   + I  N +I++F+AGDKSH Q +++Y +++EM   ++ AGYVP 
Sbjct: 468 VREMMDVKGMRKIPGSTMIEMNNEIYEFVAGDKSHDQYKEIYEMVEEMGREIKRAGYVPT 527

Query: 352 IVSKIFCF*KYVDDAHEVEEEQMLFFHSD 438
               +        D  E ++E  L+ HS+
Sbjct: 528 TSQVLL-------DIDEEDKEDALYRHSE 549


>ref|XP_007161217.1| hypothetical protein PHAVU_001G0518001g, partial [Phaseolus
           vulgaris] gi|561034681|gb|ESW33211.1| hypothetical
           protein PHAVU_001G0518001g, partial [Phaseolus vulgaris]
          Length = 380

 Score = 77.4 bits (189), Expect(2) = 2e-24
 Identities = 34/59 (57%), Positives = 41/59 (69%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYCG 590
           A  +LS  PG P+ I KNLR+C+DCHSA K IS +Y REIV+RD+NRFH F      CG
Sbjct: 319 AFGLLSTPPGTPIRIVKNLRVCEDCHSATKFISKVYSREIVVRDRNRFHHFKNGLCSCG 377



 Score = 62.4 bits (150), Expect(2) = 2e-24
 Identities = 41/149 (27%), Positives = 74/149 (49%), Gaps = 3/149 (2%)
 Frame = +1

Query: 1   EIVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAG 180
           +++  S++ AC   G+L+LGE + +EL++ E                  R E        
Sbjct: 177 QVIWRSIVTACNARGELRLGESMAKELVRSEPMHESNYVLLSNIYAKLMRWE----KKTK 232

Query: 181 SQASGN*KGTRRM---SSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPN 351
            +   + KG R++   + I  N +I++F+AGDKSH Q +++Y +++EM   ++ AGYVP 
Sbjct: 233 VREMMDVKGMRKIPGSTMIEMNNEIYEFVAGDKSHDQYKEIYEMVEEMGKEIKRAGYVPT 292

Query: 352 IVSKIFCF*KYVDDAHEVEEEQMLFFHSD 438
               +        D  E ++E  L+ HS+
Sbjct: 293 TSQVLL-------DIDEEDKEDALYRHSE 314


>ref|XP_004971712.1| PREDICTED: putative pentatricopeptide repeat-containing protein
           At3g15930-like [Setaria italica]
          Length = 667

 Score = 69.7 bits (169), Expect(2) = 3e-24
 Identities = 43/144 (29%), Positives = 75/144 (52%)
 Frame = +1

Query: 7   VLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAGSQ 186
           +LG+LLAAC +HG L +GE + + L++++                  R E          
Sbjct: 466 ILGTLLAACRVHGNLDIGELVAKRLLELDPENSTVYILLSNMYAKSNRWEDVRRLRQSIM 525

Query: 187 ASGN*KGTRRMSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPNIVSKI 366
             G  K     S I  NG IH+F+AGD+SHP + ++Y  L+ +I  L + GY P+I +++
Sbjct: 526 EKGI-KKEPGCSLIEMNGMIHEFVAGDRSHPMSNEIYSKLENIITDLENLGYSPDI-TEV 583

Query: 367 FCF*KYVDDAHEVEEEQMLFFHSD 438
           F       +  E E+++++++HS+
Sbjct: 584 FV------EVAEKEKQKIIYWHSE 601



 Score = 69.3 bits (168), Expect(2) = 3e-24
 Identities = 31/56 (55%), Positives = 39/56 (69%)
 Frame = +3

Query: 420 AVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           A+LS  P   + I KNLR+C DCHSA+KLIS +Y RE+V+RD+ RFH F   F  C
Sbjct: 608 ALLSSEPNTVIRIVKNLRMCLDCHSAIKLISRLYGREVVVRDRTRFHHFRHGFCSC 663


>ref|XP_004291465.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21065-like [Fragaria vesca subsp. vesca]
          Length = 588

 Score = 75.9 bits (185), Expect(2) = 3e-24
 Identities = 34/58 (58%), Positives = 41/58 (70%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           A A+L+  PG P+ I KNLR+C+DCHSA K IS IY REIV+RD+NRFH F      C
Sbjct: 527 AFALLNTPPGTPIRIVKNLRVCEDCHSATKFISKIYNREIVVRDRNRFHHFKNGLCSC 584



 Score = 63.2 bits (152), Expect(2) = 3e-24
 Identities = 43/150 (28%), Positives = 78/150 (52%), Gaps = 5/150 (3%)
 Frame = +1

Query: 4   IVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXF--GWRAE*G*FFPA 177
           IVL +L++AC  HG+L+LGE + +ELI+ E                   W  +       
Sbjct: 386 IVLRTLISACRAHGELKLGESITKELIRAEPMHESNYVLLSNIYAKMNHWEKK------T 439

Query: 178 GSQASGN*KGTRRM---SSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVP 348
            ++ + + KG +++   + I  + +I++F+AGDKSH Q++++Y ++DEM  +++ AGYV 
Sbjct: 440 KTREAMDKKGMKKIPGSTMIELDNEIYEFVAGDKSHKQSKEIYEMVDEMGRKMKRAGYVA 499

Query: 349 NIVSKIFCF*KYVDDAHEVEEEQMLFFHSD 438
                +        D  E ++E  L  HS+
Sbjct: 500 TTSEVLL-------DIDEEDKEDALNRHSE 522


>ref|XP_002314675.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|222863715|gb|EEF00846.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 845

 Score = 74.3 bits (181), Expect(2) = 2e-23
 Identities = 33/59 (55%), Positives = 41/59 (69%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYCG 590
           A A++S   G+P+ + KNLRIC DCHS  KL+S  Y REI++RD NRFH F Q F  CG
Sbjct: 784 AFALISTGQGMPIRVAKNLRICSDCHSFAKLVSKSYSREIIVRDNNRFHFFQQGFCSCG 842



 Score = 62.0 bits (149), Expect(2) = 2e-23
 Identities = 42/149 (28%), Positives = 70/149 (46%), Gaps = 3/149 (2%)
 Frame = +1

Query: 1    EIVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAG 180
            +++ GSLLAAC +H  + +  Y    + +++                  R +      A 
Sbjct: 642  DVIWGSLLAACRVHKNVDIAAYAAERISELDPERTGIHVLLSNIYASAGRWD----DVAK 697

Query: 181  SQASGN*KGTRRM---SSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPN 351
             +     KG  +M   SSI  NG+I +F  GD+SHP+   +  +L E+  RLR  GYVP+
Sbjct: 698  VRLHLKEKGAHKMPGSSSIEINGKIFEFTTGDESHPEMTHIEPMLKEICCRLRDIGYVPD 757

Query: 352  IVSKIFCF*KYVDDAHEVEEEQMLFFHSD 438
            + + +        D +E E+E +L  HS+
Sbjct: 758  LTNVLL-------DVNEKEKEYLLSRHSE 779


>gb|EXB96783.1| hypothetical protein L484_001891 [Morus notabilis]
          Length = 599

 Score = 75.1 bits (183), Expect(2) = 2e-23
 Identities = 34/58 (58%), Positives = 40/58 (68%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           A A+L+  PG P+ I KNLR+C DCHSA K IS IY REIV+RD+NRFH F      C
Sbjct: 538 AFALLNTPPGTPIRIVKNLRVCSDCHSATKFISKIYNREIVVRDRNRFHHFMDGLCSC 595



 Score = 61.2 bits (147), Expect(2) = 2e-23
 Identities = 44/145 (30%), Positives = 70/145 (48%)
 Frame = +1

Query: 4   IVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAGS 183
           ++L +L++AC  HGKL+LGE + + LI  E                  R E         
Sbjct: 397 VILRTLVSACRAHGKLRLGETISKSLISNEPTHESNYVLLSNIYAKMSRWENKTKIRKMM 456

Query: 184 QASGN*KGTRRMSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPNIVSK 363
              G  K T   + I  + +I++F+AGDKSH Q +++Y ++DEM   ++ AGYVP+    
Sbjct: 457 GKKGM-KKTPGSTMIELDNEIYEFVAGDKSHKQYKEIYDMVDEMGREMKRAGYVPSTSEV 515

Query: 364 IFCF*KYVDDAHEVEEEQMLFFHSD 438
           +        D  E ++E  L  HS+
Sbjct: 516 LL-------DIDEEDKEDALNRHSE 533


>ref|XP_007225539.1| hypothetical protein PRUPE_ppa026705mg [Prunus persica]
           gi|462422475|gb|EMJ26738.1| hypothetical protein
           PRUPE_ppa026705mg [Prunus persica]
          Length = 484

 Score = 75.1 bits (183), Expect(2) = 4e-23
 Identities = 34/58 (58%), Positives = 40/58 (68%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           A A+L+  PG P+ I KNLR+C DCHSA K IS IY REIV+RD+NRFH F      C
Sbjct: 423 AFALLNTPPGTPIRIVKNLRVCDDCHSATKFISKIYNREIVVRDRNRFHHFKDGMCSC 480



 Score = 60.5 bits (145), Expect(2) = 4e-23
 Identities = 43/150 (28%), Positives = 75/150 (50%), Gaps = 5/150 (3%)
 Frame = +1

Query: 4   IVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXX--FGWRAE*G*FFPA 177
           IVL +L++AC  HG+L+LGE + +ELI+ E                   W  +      A
Sbjct: 282 IVLRTLISACRAHGELKLGESITKELIRNEPMQESNYVLLSNIYAKMTHWEKK------A 335

Query: 178 GSQASGN*KGTRRM---SSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVP 348
             +   + +G +++   + I  + +I++F+AGDKSH Q + +Y ++DEM   ++ AGY+P
Sbjct: 336 KIREVMDKRGMKKIPGSTMIELDHEIYEFVAGDKSHKQYKQIYEMVDEMGREMKRAGYIP 395

Query: 349 NIVSKIFCF*KYVDDAHEVEEEQMLFFHSD 438
                +        D  E ++E  L  HS+
Sbjct: 396 TTSEVLL-------DIDEEDKEDALNRHSE 418


>gb|AFW80179.1| hypothetical protein ZEAMMB73_142662 [Zea mays]
          Length = 649

 Score = 68.6 bits (166), Expect(2) = 6e-23
 Identities = 31/58 (53%), Positives = 40/58 (68%)
 Frame = +3

Query: 414 ADAVLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           A A+LS  P   + I KNLR+C DCH+A+KLIS +Y RE+V+RD+ RFH F   F  C
Sbjct: 588 AFALLSSEPNTVIRIVKNLRMCLDCHNAIKLISRLYGREVVVRDRTRFHHFRHGFCSC 645



 Score = 66.2 bits (160), Expect(2) = 6e-23
 Identities = 45/144 (31%), Positives = 74/144 (51%)
 Frame = +1

Query: 7   VLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAGSQ 186
           + G+LLAAC +HG  ++GE +   L+QM+                  R E          
Sbjct: 448 IWGTLLAACRVHGNSEIGELVTERLLQMDPENSTVYTLLSNIYAKCNRWEDVRRLRHTIM 507

Query: 187 ASGN*KGTRRMSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSAGYVPNIVSKI 366
             G  K     S I  NG IH+F+AGD+SHP ++++Y  L+ +I  L + GY P+ V+++
Sbjct: 508 EKGI-KKEPGCSLIEMNGIIHEFVAGDQSHPMSKEIYCKLESIINDLNNVGYFPD-VTEV 565

Query: 367 FCF*KYVDDAHEVEEEQMLFFHSD 438
           F       +  E E++++LF+HS+
Sbjct: 566 FV------EVAEEEKQKVLFWHSE 583


>ref|XP_007032614.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma
           cacao] gi|508711643|gb|EOY03540.1| Pentatricopeptide
           repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 633

 Score =  113 bits (283), Expect = 8e-23
 Identities = 71/164 (43%), Positives = 93/164 (56%), Gaps = 10/164 (6%)
 Frame = +1

Query: 1   EIVLGSLLAACGLHGKLQLGEYLLRELIQMEXXXXXXXXXXXXXXXFGWRAE*G*FFPAG 180
           E+VLGSLL +C  HGKLQLGE+ L+ LI+M+                  + +        
Sbjct: 421 EVVLGSLLGSCSAHGKLQLGEHALQRLIEMDPHNTEYHILLSNMYALAGKRD-------- 472

Query: 181 SQASG-----N*KGTRR---MSSIRANGQIHQFIAGDKSHPQAQDVYVLLDEMIWRLRSA 336
            QA+        KG R+   MSSI  +GQ+HQF AGDKSH + QD+Y++LD MI RLRSA
Sbjct: 473 -QANALRTVLKTKGIRKVPGMSSIHVDGQVHQFSAGDKSHSKTQDIYLMLDNMIQRLRSA 531

Query: 337 GYVPNIVSKIFCF*KYVDD-AHEVEE-EQMLFFHSDLVCPCTSL 462
           GYVPN  S++F      +D A + EE EQ LF HS+ +  C  L
Sbjct: 532 GYVPNTASQVFSGSDGAEDNARDSEEKEQALFLHSEKLAVCFGL 575



 Score = 79.7 bits (195), Expect = 1e-12
 Identities = 34/55 (61%), Positives = 44/55 (80%)
 Frame = +3

Query: 423 VLSQ*PGLPLHIFKNLRICQDCHSAMKLISGIYKREIVIRDQNRFHRFTQSFSYC 587
           +LS  PG PL+IFKNLRICQDCH+A+K++S IY R++V+RD+NRFH F Q    C
Sbjct: 575 LLSTKPGTPLYIFKNLRICQDCHAALKIVSKIYNRKVVVRDRNRFHYFKQGSCSC 629

