BLASTX nr result

ID: Akebia24_contig00030141

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00030141
         (344 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI24452.3| unnamed protein product [Vitis vinifera]              154   9e-36
ref|XP_007212815.1| hypothetical protein PRUPE_ppa021315mg [Prun...   147   1e-33
ref|XP_004295549.1| PREDICTED: pentatricopeptide repeat-containi...   141   8e-32
ref|XP_004146207.1| PREDICTED: pentatricopeptide repeat-containi...   135   6e-30
ref|XP_004509407.1| PREDICTED: pentatricopeptide repeat-containi...   133   3e-29
gb|EXC35289.1| hypothetical protein L484_026611 [Morus notabilis]     129   4e-28
ref|XP_003629226.1| Pentatricopeptide repeat-containing protein ...   127   2e-27
gb|ACU21153.1| unknown [Glycine max]                                  125   5e-27
ref|XP_006588587.1| PREDICTED: pentatricopeptide repeat-containi...   120   1e-25
ref|XP_002307076.2| pentatricopeptide repeat-containing family p...   120   3e-25
ref|XP_007029178.1| Pentatricopeptide repeat (PPR) superfamily p...   120   3e-25
ref|XP_003610363.1| Pentatricopeptide repeat-containing protein ...   119   4e-25
ref|XP_006395522.1| hypothetical protein EUTSA_v10003772mg [Eutr...   119   6e-25
ref|XP_006485046.1| PREDICTED: pentatricopeptide repeat-containi...   118   1e-24
ref|XP_006437011.1| hypothetical protein CICLE_v10033882mg [Citr...   118   1e-24
ref|XP_003617675.1| Pentatricopeptide repeat-containing protein ...   118   1e-24
ref|XP_006472911.1| PREDICTED: pentatricopeptide repeat-containi...   117   1e-24
ref|XP_007050939.1| Pentatricopeptide repeat (PPR) superfamily p...   117   1e-24
ref|XP_004503027.1| PREDICTED: pentatricopeptide repeat-containi...   117   1e-24
ref|XP_002875341.1| binding protein [Arabidopsis lyrata subsp. l...   117   2e-24

>emb|CBI24452.3| unnamed protein product [Vitis vinifera]
          Length = 503

 Score =  154 bits (390), Expect = 9e-36
 Identities = 69/113 (61%), Positives = 90/113 (79%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           HSY+IKSG+++D  LGS LIAMYANCG L +A D+F R+ ++NIVVWNA++R +GMH HA
Sbjct: 235 HSYVIKSGIELDAALGSGLIAMYANCGLLNSARDVFDRIDDKNIVVWNAIIRCYGMHGHA 294

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVEKS 341
           +EAL+MFS +++SG+ PD + FLC LSA SH G V +G E+  KM DYGVEKS
Sbjct: 295 DEALKMFSGLIDSGLHPDGVIFLCLLSAFSHAGMVAEGMELFEKMGDYGVEKS 347



 Score = 74.7 bits (182), Expect = 1e-11
 Identities = 32/115 (27%), Positives = 68/115 (59%), Gaps = 4/115 (3%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H +++K G+ +D+ +G++L+A YA C  +  +  +F  + E++IV WN+M+ G+ ++  A
Sbjct: 130 HGHVVKHGLDLDLFVGNALVAFYAKCNEIGASRRVFDMISEKDIVTWNSMISGYAINGCA 189

Query: 183 NEALEMFSRMV----ESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVE 335
           ++AL +F  M+    ++   PD+ + +  L AC+    + +G  I + +   G+E
Sbjct: 190 DDALVLFHNMLQVQGDTVYAPDSATLVAILPACAQAAAIQEGLWIHSYVIKSGIE 244



 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 34/113 (30%), Positives = 57/113 (50%), Gaps = 2/113 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANC--GRLETAHDIFHRLPERNIVVWNAMVRGFGMHS 176
           H+ II  G + +  LG+ L+  YA C    +E A  +F  LP+R++ VWN +++G+    
Sbjct: 27  HAQIIIGGFEENPFLGAKLVGKYAQCYESNIEDARKVFDCLPDRDVFVWNTIIQGYANLG 86

Query: 177 HANEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVE 335
              EAL ++  M  SG+  +  +F   L AC       KG  I   +  +G++
Sbjct: 87  PFMEALNIYEYMRCSGVAANRYTFPFVLKACGAMKDGKKGQAIHGHVVKHGLD 139


>ref|XP_007212815.1| hypothetical protein PRUPE_ppa021315mg [Prunus persica]
           gi|462408680|gb|EMJ14014.1| hypothetical protein
           PRUPE_ppa021315mg [Prunus persica]
          Length = 534

 Score =  147 bits (371), Expect = 1e-33
 Identities = 65/114 (57%), Positives = 89/114 (78%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           HSY IKS +++D  LGS+LI+MYA+CGR+  A  IF ++ E+N+V+W+AM+R +GMH HA
Sbjct: 275 HSYTIKSSVEVDAALGSALISMYASCGRVTIARFIFDQISEKNVVLWSAMMRCYGMHGHA 334

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVEKSD 344
           +EAL+MFS+ VESG+ PD + FLC LS CSH G V KG E+  +M DYGVEK++
Sbjct: 335 DEALQMFSQFVESGLHPDGVVFLCLLSTCSHSGMVTKGLELFEEMGDYGVEKNE 388



 Score = 68.2 bits (165), Expect = 1e-09
 Identities = 30/103 (29%), Positives = 60/103 (58%), Gaps = 2/103 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H +++K G+  D+ +G++LIA+Y+ C  +E +  +F  +P ++ V WN+M+ G+  + + 
Sbjct: 172 HGHVVKCGLHSDLFVGNALIALYSKCEEIEISRRVFDEIPWKDSVSWNSMISGYTANGYP 231

Query: 183 NEALEMFSRMVESGIR--PDAISFLCALSACSHGGFVDKGWEI 305
           +EAL +F  M++      PD  + +  L AC     ++ G+ I
Sbjct: 232 HEALMLFRAMLQDHATSLPDHATLVSILPACVQASAIEVGFWI 274



 Score = 57.0 bits (136), Expect = 3e-06
 Identities = 29/91 (31%), Positives = 51/91 (56%), Gaps = 2/91 (2%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGR--LETAHDIFHRLPERNIVVWNAMVRGFGMHS 176
           H+ II  G + +  + + ++  Y  C    +ETA  +F RL ER++ VWN +++G+    
Sbjct: 69  HAQIIIGGFEQNPFVVAKIVGKYVECSEPSMETARKVFDRLLERDVFVWNMVIQGYANVE 128

Query: 177 HANEALEMFSRMVESGIRPDAISFLCALSAC 269
              EAL+M++RM  SG+  +  ++   L AC
Sbjct: 129 PFVEALKMYNRMRLSGVPANQYTYPFVLKAC 159


>ref|XP_004295549.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46790,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 499

 Score =  141 bits (356), Expect = 8e-32
 Identities = 61/114 (53%), Positives = 86/114 (75%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H Y++K G+K+D  LGS+LI MYANCGR+  +  IF R+ ++N+V+W+A++R +GMH HA
Sbjct: 240 HCYVVKYGVKVDSALGSALITMYANCGRVRASRVIFDRISDKNVVLWSAVMRCYGMHGHA 299

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVEKSD 344
            E L+MF +  ESG++PDA+  LC LS CSH G V KG EI  KM++YGVEK++
Sbjct: 300 EEVLQMFLQFEESGLQPDAVVLLCLLSTCSHAGMVAKGLEIFDKMEEYGVEKNE 353



 Score = 85.1 bits (209), Expect = 9e-15
 Identities = 36/113 (31%), Positives = 70/113 (61%), Gaps = 2/113 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H  I+K+G+++ V +G++L+A+Y+ CG +E +  +F  LP++++V WN+M+ G+  + + 
Sbjct: 137 HGQIVKAGLELQVFVGNALVALYSKCGEVEVSRRVFEELPKKDLVSWNSMISGYVANGYP 196

Query: 183 NEALEMFSRMV--ESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVE 335
           NE +E+F  M+  +    P+  + +C L AC     V+ G+ I   +  YGV+
Sbjct: 197 NEGVEVFRAMLQDDGACLPEHATLVCVLPACVEASSVEVGFWIHCYVVKYGVK 249


>ref|XP_004146207.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
           chloroplastic-like [Cucumis sativus]
          Length = 480

 Score =  135 bits (340), Expect = 6e-30
 Identities = 58/114 (50%), Positives = 83/114 (72%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           HSY+IK+G+++   LGS LI MY NCG +  A D+F R+ ++N++VW+A++R +GMH  A
Sbjct: 221 HSYVIKTGIEVGAPLGSCLICMYGNCGHVNIARDVFDRIDDKNVIVWSAIIRCYGMHGFA 280

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVEKSD 344
           +EA  MF R+ E+G++PD + FL  LSACSH G V KG EI  KM+ YG+E+ D
Sbjct: 281 DEAFNMFRRLEEAGVKPDGLIFLNLLSACSHAGLVAKGHEIYEKMEAYGLERKD 334



 Score = 69.7 bits (169), Expect = 4e-10
 Identities = 33/113 (29%), Positives = 66/113 (58%), Gaps = 2/113 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H +++K G+ +D+ +G++LIA Y+ C  +ETA  +F  +  R+IV WN+M+ G+ ++   
Sbjct: 118 HGHVVKCGLDLDLFVGNALIAFYSKCQDVETARKVFDDMSLRDIVSWNSMIVGYTLNGKE 177

Query: 183 NEALEMFSRMV--ESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVE 335
           +EA+  F  M+  ++   PD+ + +  L AC+       G+ + + +   G+E
Sbjct: 178 DEAIMFFHAMLHNQADCTPDSATLVAILPACATKSASQVGFWVHSYVIKTGIE 230


>ref|XP_004509407.1| PREDICTED: pentatricopeptide repeat-containing protein At3g46790,
           chloroplastic-like [Cicer arietinum]
          Length = 513

 Score =  133 bits (334), Expect = 3e-29
 Identities = 57/114 (50%), Positives = 80/114 (70%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H YI+K+GMK+D  +G  LI +Y+NCG +  A  +F ++ +RN++VWNA++R +GMH   
Sbjct: 249 HCYIVKTGMKLDPAVGCGLITLYSNCGYISMARAVFDQISDRNVIVWNAIIRCYGMHGFP 308

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVEKSD 344
            EAL MF  +VESG+ PD I FLC LSACSH G   +GW++   M+ YGV KS+
Sbjct: 309 QEALGMFRCLVESGLHPDGIVFLCLLSACSHAGMHAQGWQLFQTMETYGVVKSE 362



 Score = 60.5 bits (145), Expect = 2e-07
 Identities = 29/103 (28%), Positives = 56/103 (54%), Gaps = 2/103 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H + +K G+  D+ + ++ +A YA C  +E +  +F  +PER+IV WN+M+ G+  + + 
Sbjct: 146 HGHAVKCGLDFDLFVCNAFVAFYAKCQEVEVSRKLFDEMPERDIVSWNSMISGYIANGYV 205

Query: 183 NEALEMFSRMVESGI--RPDAISFLCALSACSHGGFVDKGWEI 305
           ++A+ +F  M+       PD  + +  L A S    +  G+ I
Sbjct: 206 DDAVIIFFNMLRDDDIGFPDNATLVTVLPAFSEKADIHAGYWI 248


>gb|EXC35289.1| hypothetical protein L484_026611 [Morus notabilis]
          Length = 508

 Score =  129 bits (324), Expect = 4e-28
 Identities = 58/114 (50%), Positives = 84/114 (73%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           HSY++KSGM+++  L S LI+MYA  GR+  A  +F    ++NI VW+AM+R +GM+ +A
Sbjct: 268 HSYVVKSGMEVNAALCSGLISMYAKFGRVSIAKRVFDGSRDKNIEVWSAMMRCYGMYGYA 327

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVEKSD 344
           +EAL++F R+++ G+ PD + FLC LSACSH G V+KG EI  KM D+GVEK +
Sbjct: 328 DEALKLFQRLLDFGLYPDGVVFLCLLSACSHSGMVEKGCEIFEKMGDFGVEKKE 381



 Score = 73.2 bits (178), Expect = 4e-11
 Identities = 33/113 (29%), Positives = 65/113 (57%), Gaps = 2/113 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H +++KSG+ +D+ +G++LIA Y+    +  +  +FH +P+++I+ WN+M+ G+    HA
Sbjct: 165 HGHVLKSGLDLDLFVGNALIAFYSKSQDMRASRKVFHEMPQKDIISWNSMISGYASKGHA 224

Query: 183 NEALEMFSRMVESGIR--PDAISFLCALSACSHGGFVDKGWEILAKMKDYGVE 335
            +AL++F  +V        D  + +  L AC H   +  G+ I + +   G+E
Sbjct: 225 EDALKLFCSVVRDHTTCFLDHATLVSTLPACVHTSGLQVGFWIHSYVVKSGME 277


>ref|XP_003629226.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355523248|gb|AET03702.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 510

 Score =  127 bits (319), Expect = 2e-27
 Identities = 54/114 (47%), Positives = 79/114 (69%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H YI+K+GMK+D  +G  LI +Y+NCG +  A  +F ++P+RN++VW+A++R +GMH  A
Sbjct: 246 HCYIVKTGMKLDPAVGCGLITLYSNCGYIRMAKAVFDQIPDRNVIVWSAIIRCYGMHGFA 305

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVEKSD 344
            EAL MF ++VE G+  D I FL  LSACSH G  ++GW +   M+ YGV K +
Sbjct: 306 QEALSMFRQLVELGLHLDGIVFLSLLSACSHAGMHEEGWHLFQTMETYGVVKGE 359



 Score = 63.5 bits (153), Expect = 3e-08
 Identities = 31/103 (30%), Positives = 60/103 (58%), Gaps = 2/103 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H  ++K G++ D+ +G++ +A YA C  +E +  +F  + ER+IV WN+M+ G+  + + 
Sbjct: 143 HGNVVKCGLEFDLFVGNAFVAFYAKCKEIEASRKVFDEMLERDIVSWNSMMSGYIANGYV 202

Query: 183 NEALEMFSRMV-ESGIR-PDAISFLCALSACSHGGFVDKGWEI 305
           +EA+ +F  M+ + GI  PD  + +  L A +    +  G+ I
Sbjct: 203 DEAVMLFCDMLRDDGIGFPDNATLVTVLPAFAEKADIHAGYWI 245


>gb|ACU21153.1| unknown [Glycine max]
          Length = 529

 Score =  125 bits (315), Expect = 5e-27
 Identities = 52/114 (45%), Positives = 82/114 (71%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H YI+K+ M +D  +G+ LI++Y+NCG +  A  IF R+ +R+++VW+A++R +G H  A
Sbjct: 244 HCYIVKTRMGLDSAVGTGLISLYSNCGYVRMARAIFDRISDRSVIVWSAIIRCYGTHGLA 303

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVEKSD 344
            EAL +F ++V +G+RPD + FLC LSACSH G +++GW +   M+ YGV KS+
Sbjct: 304 QEALALFRQLVGAGLRPDGVVFLCLLSACSHAGLLEQGWHLFNAMETYGVAKSE 357



 Score = 71.6 bits (174), Expect = 1e-10
 Identities = 34/103 (33%), Positives = 62/103 (60%), Gaps = 2/103 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H + +K GM +D+ +G++L+A YA C  +E +  +F  +P R+IV WN+MV G+ ++ + 
Sbjct: 141 HEHAVKCGMDLDLFVGNALVAFYAKCQDVEVSRKVFDEIPHRDIVSWNSMVSGYTVNGYV 200

Query: 183 NEALEMFSRMV--ESGIRPDAISFLCALSACSHGGFVDKGWEI 305
           ++A+ +F  M+  ES   PD  +F+  L A +    +  G+ I
Sbjct: 201 DDAILLFYDMLRDESVGGPDHATFVTVLPAFAQAADIHAGYWI 243


>ref|XP_006588587.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g62890-like [Glycine max]
          Length = 568

 Score =  120 bits (302), Expect = 1e-25
 Identities = 58/112 (51%), Positives = 79/112 (70%), Gaps = 2/112 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRL-PERNIVVWNAMVRGFGMHSH 179
           H+YI K+GMKIDV LG+SLI MYA CG +E A  IF  L PE++++ W+AM+  F MH  
Sbjct: 218 HAYIDKTGMKIDVVLGTSLIDMYAKCGSIERAKCIFDNLGPEKDVMAWSAMITAFSMHGL 277

Query: 180 ANEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAK-MKDYGV 332
           + E LE+F+RMV  G+RP+A++F+  L AC HGG V +G E   + M +YGV
Sbjct: 278 SEECLELFARMVNDGVRPNAVTFVAVLCACVHGGLVSEGNEYFKRMMNEYGV 329


>ref|XP_002307076.2| pentatricopeptide repeat-containing family protein, partial
           [Populus trichocarpa] gi|550338333|gb|EEE94072.2|
           pentatricopeptide repeat-containing family protein,
           partial [Populus trichocarpa]
          Length = 744

 Score =  120 bits (300), Expect = 3e-25
 Identities = 46/111 (41%), Positives = 84/111 (75%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H YI + G +++V+LG++L+ MYA CG+LE + ++F+ + E++++ WN M+ G+G+H  A
Sbjct: 525 HQYIKEGGFELNVSLGTALVDMYAKCGQLEQSRELFNSMKEKDVISWNVMISGYGLHGDA 584

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVE 335
           N A+E+F +M +S ++P+AI+FL  LSAC+H G+VD+G ++  +M+ Y ++
Sbjct: 585 NSAMEVFQQMEQSNVKPNAITFLSLLSACTHAGYVDEGKQLFDRMQYYSIK 635



 Score = 73.2 bits (178), Expect = 4e-11
 Identities = 37/111 (33%), Positives = 62/111 (55%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H YIIK+ +  DV++ +SLI MY   G L  A  +F R  +R++V WN ++  +    H 
Sbjct: 425 HCYIIKNSVDEDVSIANSLIDMYGKGGNLSIAWKMFCRT-QRDVVTWNTLISSYTHSGHY 483

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVE 335
            EA+ +F  M+   + P++ + +  LSAC H   ++KG  +   +K+ G E
Sbjct: 484 AEAITLFDEMISEKLNPNSATLVIVLSACCHLPSLEKGKMVHQYIKEGGFE 534



 Score = 56.6 bits (135), Expect = 3e-06
 Identities = 27/98 (27%), Positives = 49/98 (50%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H   +K+G+     + SSL++MY+ CG +E AH+ F ++ ++++  W +++         
Sbjct: 259 HGLAVKTGLGCSQVVQSSLLSMYSKCGNVEEAHNSFCQVVDKDVFSWTSVIGVCARFGFM 318

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKG 296
           NE L +F  M    + PD I   C L    +   V +G
Sbjct: 319 NECLNLFWDMQVDDVYPDGIVVSCILLGFGNSMMVREG 356


>ref|XP_007029178.1| Pentatricopeptide repeat (PPR) superfamily protein [Theobroma
           cacao] gi|508717783|gb|EOY09680.1| Pentatricopeptide
           repeat (PPR) superfamily protein [Theobroma cacao]
          Length = 626

 Score =  120 bits (300), Expect = 3e-25
 Identities = 54/112 (48%), Positives = 76/112 (67%), Gaps = 1/112 (0%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H YI ++ + ++V LG++L+ MYA CG +E A  +F  LPER+++ W A++ G  MH +A
Sbjct: 277 HEYIFRNNLSLNVILGTALVDMYARCGSIEKAIGVFEELPERDVLSWTALIAGLAMHGYA 336

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMK-DYGVE 335
             AL  FS MV+SG++P  ISF   LSACSHGG V KG E+   MK D+G+E
Sbjct: 337 ERALWFFSEMVKSGLKPRDISFTAVLSACSHGGLVGKGLELFGSMKRDFGIE 388



 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 34/129 (26%), Positives = 64/129 (49%), Gaps = 31/129 (24%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRL--------------------- 119
           H  IIK G + +V + +SL+ MY+ CG ++ A+ IF R+                     
Sbjct: 145 HGQIIKHGFESNVYVQNSLVHMYSTCGDIKAANAIFQRMTFLNVVSWTSMIAGLNKVGDV 204

Query: 120 ----------PERNIVVWNAMVRGFGMHSHANEALEMFSRMVESGIRPDAISFLCALSAC 269
                     PE+N+V W+ M+ G+  +S+  +A+E+F  + E G++ +    +  +S+C
Sbjct: 205 EMARKLFDTMPEKNLVTWSIMISGYAKNSYFEKAVELFQVLQEEGVQANETVMVSVISSC 264

Query: 270 SHGGFVDKG 296
           +H G ++ G
Sbjct: 265 AHLGAIELG 273


>ref|XP_003610363.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355511418|gb|AES92560.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 734

 Score =  119 bits (298), Expect = 4e-25
 Identities = 52/107 (48%), Positives = 74/107 (69%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H+ IIK G K++V +GS+L AMY  CG L+  + IF R+P R+++ WNAM+ G   + H 
Sbjct: 444 HARIIKYGFKLEVPIGSALSAMYTKCGSLDDGYLIFWRMPSRDVISWNAMISGLSQNGHG 503

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKD 323
           N+ALE+F +M+  GI+PD ++F+  LSACSH G VD+GWE    M D
Sbjct: 504 NKALELFEKMLLEGIKPDPVTFVNLLSACSHMGLVDRGWEYFKMMFD 550



 Score = 70.5 bits (171), Expect = 2e-10
 Identities = 31/90 (34%), Positives = 56/90 (62%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           HS  IK+G+   V++ ++L+ MYA CG L+ A   F    ++N + W+AMV G+     +
Sbjct: 242 HSLAIKNGLLAIVSVANALVTMYAKCGSLDDAVRTFEFSGDKNSITWSAMVTGYAQGGDS 301

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACS 272
           ++AL++F++M  SG+ P   + +  ++ACS
Sbjct: 302 DKALKLFNKMHSSGVLPSEFTLVGVINACS 331



 Score = 63.5 bits (153), Expect = 3e-08
 Identities = 35/101 (34%), Positives = 53/101 (52%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           HS  +K+G   DV +GSSL+ MY   G +  A  +F R+PERN V W  M+ G+     A
Sbjct: 141 HSVAVKTGCSGDVYVGSSLLNMYCKTGFVFDARKLFDRMPERNTVSWATMISGYASSDIA 200

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEI 305
           ++A+E+F  M       +  +    LSA +   FV  G ++
Sbjct: 201 DKAVEVFELMRREEEIQNEFALTSVLSALTSDVFVYTGRQV 241



 Score = 60.1 bits (144), Expect = 3e-07
 Identities = 28/109 (25%), Positives = 58/109 (53%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           HS+  K G  + + + S+++ MYA CG L  A   F  + + ++V+W +++ G+  +   
Sbjct: 343 HSFAFKLGFGLQLYVLSAVVDMYAKCGSLADARKGFECVQQPDVVLWTSIITGYVQNGDY 402

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYG 329
              L ++ +M    + P+ ++    L ACS    +D+G ++ A++  YG
Sbjct: 403 EGGLNLYGKMQMERVIPNELTMASVLRACSSLAALDQGKQMHARIIKYG 451


>ref|XP_006395522.1| hypothetical protein EUTSA_v10003772mg [Eutrema salsugineum]
           gi|557092161|gb|ESQ32808.1| hypothetical protein
           EUTSA_v10003772mg [Eutrema salsugineum]
          Length = 664

 Score =  119 bits (297), Expect = 6e-25
 Identities = 53/112 (47%), Positives = 78/112 (69%), Gaps = 1/112 (0%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H  +I+ G++ DV +G+SLI MY  CGR+ETA   F R+  +N+  W AM+ G+GMH HA
Sbjct: 315 HDLVIRMGLEDDVIVGTSLIDMYCKCGRVETARKAFDRMKNKNVRTWTAMIAGYGMHGHA 374

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKD-YGVE 335
           ++ALE+F  M++SG+RP+ I+F+  L+ACSH G   +GW    +MK  +GVE
Sbjct: 375 DKALELFPVMIDSGVRPNHITFVSVLAACSHAGLHVEGWRWFNEMKGRFGVE 426



 Score = 70.9 bits (172), Expect = 2e-10
 Identities = 37/109 (33%), Positives = 60/109 (55%), Gaps = 11/109 (10%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H      G + D+ + S+LI MY+ CG+LE A  +F  +P RNIV W +M+RG+ ++ +A
Sbjct: 99  HQQAFVFGFQSDIFVSSALIVMYSTCGQLEDARKVFDEIPNRNIVSWTSMIRGYDLNGNA 158

Query: 183 NEALEMFSRMVESG-----------IRPDAISFLCALSACSHGGFVDKG 296
            EA+ +F  ++ SG           +  D++  +  +SACS     DKG
Sbjct: 159 LEAVSLFKDLLVSGACGDYDDDDASMFLDSMGMVSVISACSR--VSDKG 205


>ref|XP_006485046.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g33990-like [Citrus sinensis]
          Length = 685

 Score =  118 bits (295), Expect = 1e-24
 Identities = 54/113 (47%), Positives = 77/113 (68%), Gaps = 2/113 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRL--PERNIVVWNAMVRGFGMHS 176
           H YII S MKID TL ++++ MYA CG L+TA ++F+ +   ERN+  WN ++ G+GMH 
Sbjct: 335 HGYIINSNMKIDATLRNAVMDMYAKCGDLDTAENMFNDIHPSERNVSSWNVLIAGYGMHG 394

Query: 177 HANEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVE 335
           H  +ALE FS+M+E G++PD I+F   LSACSH G +D+G +  A M    V+
Sbjct: 395 HGRKALEFFSQMLEEGVKPDHITFTSILSACSHAGLIDEGRKCFADMTKLSVK 447



 Score = 73.2 bits (178), Expect = 4e-11
 Identities = 32/93 (34%), Positives = 58/93 (62%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H Y I +    D+ + +S++AMYA CG +E A  +F  + +R+++ WN+M+ G+  +  A
Sbjct: 234 HGYAICNAFLEDLCIQNSIVAMYARCGNVEKARLVFDMMEKRDLISWNSMLTGYIQNGQA 293

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGG 281
           +EAL +F  M  S  +P+ ++ L  +SAC++ G
Sbjct: 294 SEALLLFDEMQNSDCKPNPVTALILVSACTYLG 326


>ref|XP_006437011.1| hypothetical protein CICLE_v10033882mg [Citrus clementina]
           gi|557539207|gb|ESR50251.1| hypothetical protein
           CICLE_v10033882mg [Citrus clementina]
          Length = 685

 Score =  118 bits (295), Expect = 1e-24
 Identities = 54/113 (47%), Positives = 77/113 (68%), Gaps = 2/113 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRL--PERNIVVWNAMVRGFGMHS 176
           H YII S MKID TL ++++ MYA CG L+TA ++F+ +   ERN+  WN ++ G+GMH 
Sbjct: 335 HGYIINSNMKIDATLRNAVMDMYAKCGDLDTAENMFNDIHPSERNVSSWNVLIAGYGMHG 394

Query: 177 HANEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVE 335
           H  +ALE FS+M+E G++PD I+F   LSACSH G +D+G +  A M    V+
Sbjct: 395 HGRKALEFFSQMLEEGVKPDHITFTSILSACSHAGLIDEGRKCFADMTKLSVK 447



 Score = 73.2 bits (178), Expect = 4e-11
 Identities = 32/93 (34%), Positives = 58/93 (62%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H Y I +    D+ + +S++AMYA CG +E A  +F  + +R+++ WN+M+ G+  +  A
Sbjct: 234 HGYAICNAFLEDLCIQNSIVAMYARCGNVEKARLVFDMMEKRDLISWNSMLSGYIQNGQA 293

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGG 281
           +EAL +F  M  S  +P+ ++ L  +SAC++ G
Sbjct: 294 SEALLLFDEMQNSDCKPNPVTALILVSACAYLG 326



 Score = 56.6 bits (135), Expect = 3e-06
 Identities = 31/101 (30%), Positives = 52/101 (51%), Gaps = 3/101 (2%)
 Frame = +3

Query: 3   HSYIIKSGM-KIDVTLGSSLIAMYANCGRLETAHDIFHRL--PERNIVVWNAMVRGFGMH 173
           HS +  SG+    + LG+ +I  Y   G   TA  +F+ +   + N  +WN M+R +  +
Sbjct: 29  HSSLTTSGLINQALHLGAKIIIKYTTYGEPNTARSLFNSIHNDKSNSFLWNTMIRAYANN 88

Query: 174 SHANEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKG 296
            H  E LE++S M  SGI  ++ +F   L AC+    + +G
Sbjct: 89  GHCVETLELYSTMRRSGISSNSYTFPFVLKACASNSLILEG 129


>ref|XP_003617675.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355519010|gb|AET00634.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 758

 Score =  118 bits (295), Expect = 1e-24
 Identities = 48/111 (43%), Positives = 81/111 (72%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H YI + G K+++ LG++L+ MYA CG+LE + ++F  + E++++ WNAM+ G+GM+ +A
Sbjct: 538 HRYINEKGFKLNLPLGTALVDMYAKCGQLEKSREVFDSMMEKDVICWNAMISGYGMNGYA 597

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGVE 335
             A+E+F+ M ES ++P+ I+FL  LSAC+H G V++G  + AKM+ Y V+
Sbjct: 598 ESAIEIFNLMEESNVKPNEITFLSLLSACAHAGLVEEGKNVFAKMQSYSVK 648



 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 32/98 (32%), Positives = 56/98 (57%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H  +IK  +   +++ +SLI MY  C ++  +  IF+R  ER++++WNA++       H 
Sbjct: 438 HCNVIKGFVDETISVTNSLIEMYGKCDKMNVSWRIFNR-SERDVILWNALISAHIHVKHY 496

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKG 296
            EA+ +F  M+     P+  + +  LSACSH  F++KG
Sbjct: 497 EEAISLFDIMIMEDQNPNTATLVVVLSACSHLAFLEKG 534


>ref|XP_006472911.1| PREDICTED: pentatricopeptide repeat-containing protein
           At3g21470-like [Citrus sinensis]
          Length = 562

 Score =  117 bits (294), Expect = 1e-24
 Identities = 55/110 (50%), Positives = 74/110 (67%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           HS I K  MK++  + ++L+ MYA CG L  A  IF  +  RN+V WN+++ GF  H H 
Sbjct: 323 HSMIDKKMMKLNQFVLNALVDMYAKCGDLANARSIFEEMVHRNVVCWNSLISGFATHGHC 382

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKDYGV 332
            EALE FSRM  +   PD I+FL  LSAC+HGGFVD+G EI +KM++YG+
Sbjct: 383 KEALEFFSRMEITNEMPDKITFLSVLSACAHGGFVDEGLEIFSKMENYGL 432



 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 28/72 (38%), Positives = 49/72 (68%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H+  +KSG   +V +G+SL+ MYA CG +  + ++F  +P+RN+V WNAM+ G+  H + 
Sbjct: 96  HAEAVKSGADTEVMIGTSLVNMYAKCGDILASRNVFDEMPDRNVVTWNAMIGGYLKHGNT 155

Query: 183 NEALEMFSRMVE 218
           + A  +F++M+E
Sbjct: 156 DSAFGLFAQMLE 167



 Score = 71.2 bits (173), Expect = 1e-10
 Identities = 34/85 (40%), Positives = 53/85 (62%)
 Frame = +3

Query: 51  SSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHANEALEMFSRMVESGIR 230
           SS+I+ Y + G ++ A  +F+R+P RN+V WN+++ G   +    EALE F +M      
Sbjct: 238 SSMISGYFDRGDVKEAQAMFNRIPVRNLVNWNSLISGLAQNGFFEEALEAFWKMQGERFE 297

Query: 231 PDAISFLCALSACSHGGFVDKGWEI 305
           PD ++F   LSAC+H G++D G EI
Sbjct: 298 PDEVTFASILSACAHLGWLDTGKEI 322


>ref|XP_007050939.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 1
           [Theobroma cacao] gi|590718992|ref|XP_007050940.1|
           Pentatricopeptide repeat (PPR) superfamily protein
           isoform 1 [Theobroma cacao] gi|508703200|gb|EOX95096.1|
           Pentatricopeptide repeat (PPR) superfamily protein
           isoform 1 [Theobroma cacao] gi|508703201|gb|EOX95097.1|
           Pentatricopeptide repeat (PPR) superfamily protein
           isoform 1 [Theobroma cacao]
          Length = 525

 Score =  117 bits (294), Expect = 1e-24
 Identities = 53/112 (47%), Positives = 79/112 (70%), Gaps = 2/112 (1%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRL-PERNIVVWNAMVRGFGMHSH 179
           H+YI K G+KIDV LG+SLI MY  CG +E A D+F  L P+++++ W+AM+ G  MH H
Sbjct: 219 HAYIDKCGIKIDVVLGTSLIDMYGKCGSIEKARDVFSNLGPDKDVMAWSAMISGLAMHGH 278

Query: 180 ANEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKM-KDYGV 332
            +E L++FS M++  +RP+A++FL  L AC HGG V+ G E   +M K++G+
Sbjct: 279 GDECLKLFSEMIKRQVRPNAVTFLGVLCACVHGGLVNDGKEYFRRMSKEFGI 330



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 28/90 (31%), Positives = 47/90 (52%), Gaps = 3/90 (3%)
 Frame = +3

Query: 36  DVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHANEALEMFSRM- 212
           DV   +S+I  Y   G ++ A  +F ++PERN+  W++++ GF       EAL +F  M 
Sbjct: 126 DVASWNSIIHAYVKVGLIDLARGLFDKMPERNVRSWSSLINGFVRCGKYKEALALFREMQ 185

Query: 213 --VESGIRPDAISFLCALSACSHGGFVDKG 296
               + +RP+  +    LSAC   G ++ G
Sbjct: 186 MLAVNDVRPNEFTMSAVLSACGRLGALEHG 215


>ref|XP_004503027.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g21065-like [Cicer arietinum]
          Length = 561

 Score =  117 bits (294), Expect = 1e-24
 Identities = 50/112 (44%), Positives = 75/112 (66%), Gaps = 1/112 (0%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           HS+I++ G+ + V LGSSLI MY+ CG ++ +  +F  +P RN+V W A++ G  +H  +
Sbjct: 212 HSFIVRIGLPLTVPLGSSLINMYSRCGSIDRSVMVFDEMPHRNVVTWTALINGLAVHGCS 271

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKD-YGVE 335
            E LE F  M ESG++PD  +F+ AL ACSHGG V+ GW +   M+D +G+E
Sbjct: 272 REGLEAFYDMTESGLKPDRAAFIAALVACSHGGLVEDGWRVFRSMRDEFGIE 323


>ref|XP_002875341.1| binding protein [Arabidopsis lyrata subsp. lyrata]
           gi|297321179|gb|EFH51600.1| binding protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 659

 Score =  117 bits (293), Expect = 2e-24
 Identities = 53/112 (47%), Positives = 76/112 (67%), Gaps = 1/112 (0%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H  +I+ G++ DV +G+S+I MY  CGR+ETA   F R+  +N+  W AM+ G+GMH HA
Sbjct: 310 HDQVIRMGLEDDVIVGTSIIDMYCKCGRVETARLAFDRMKNKNVRSWTAMIAGYGMHGHA 369

Query: 183 NEALEMFSRMVESGIRPDAISFLCALSACSHGGFVDKGWEILAKMKD-YGVE 335
            +ALE+F  M++SG+RP+ I+F+  L+ACSH G  D GW     MK  +GVE
Sbjct: 370 AKALELFPAMIDSGVRPNYITFVSVLAACSHAGLHDVGWHWFNAMKGRFGVE 421



 Score = 69.7 bits (169), Expect = 4e-10
 Identities = 34/96 (35%), Positives = 55/96 (57%), Gaps = 6/96 (6%)
 Frame = +3

Query: 3   HSYIIKSGMKIDVTLGSSLIAMYANCGRLETAHDIFHRLPERNIVVWNAMVRGFGMHSHA 182
           H      G + D+ + S+LI MY+ CG+LE A  +F  +P+RNIV W +M+RG+ ++ +A
Sbjct: 99  HQQAFVFGYQSDIFVSSALIVMYSTCGKLEDARKVFDEIPKRNIVSWTSMIRGYDLNGNA 158

Query: 183 NEALEMFSRMVESGIRPDAISFL------CALSACS 272
            +A+ +F  ++      DA  FL        +SACS
Sbjct: 159 LDAVSLFKDLLIEENDDDATMFLDSMGMVSVISACS 194

