BLASTX nr result

ID: Akebia23_contig00040331 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00040331
         (778 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006440247.1| hypothetical protein CICLE_v10019985mg [Citr...   210   5e-52
ref|XP_006477135.1| PREDICTED: pentatricopeptide repeat-containi...   209   9e-52
ref|XP_007039757.1| Tetratricopeptide repeat-like superfamily pr...   208   2e-51
ref|XP_003631455.1| PREDICTED: pentatricopeptide repeat-containi...   201   3e-49
gb|ACP39958.1| pentatricopeptide repeat protein [Gossypium hirsu...   200   4e-49
emb|CBI25851.3| unnamed protein product [Vitis vinifera]              200   5e-49
ref|XP_002531466.1| pentatricopeptide repeat-containing protein,...   194   2e-47
ref|XP_007212439.1| hypothetical protein PRUPE_ppa016777mg, part...   190   4e-46
ref|XP_006368989.1| hypothetical protein POPTR_0001s15470g [Popu...   190   6e-46
ref|XP_004300367.1| PREDICTED: pentatricopeptide repeat-containi...   181   3e-43
gb|EXC02094.1| hypothetical protein L484_024059 [Morus notabilis]     180   4e-43
ref|XP_007131288.1| hypothetical protein PHAVU_011G001300g [Phas...   179   1e-42
ref|XP_004515635.1| PREDICTED: pentatricopeptide repeat-containi...   176   6e-42
ref|XP_004139002.1| PREDICTED: pentatricopeptide repeat-containi...   175   1e-41
ref|XP_003604902.1| Pentatricopeptide repeat-containing protein ...   162   1e-37
ref|XP_006359252.1| PREDICTED: pentatricopeptide repeat-containi...   159   1e-36
ref|XP_006359251.1| PREDICTED: pentatricopeptide repeat-containi...   159   1e-36
ref|XP_002863348.1| pentatricopeptide repeat-containing protein ...   157   4e-36
ref|NP_199547.1| pentatricopeptide repeat-containing protein [Ar...   157   5e-36
ref|XP_006282107.1| hypothetical protein CARUB_v10028355mg, part...   152   1e-34

>ref|XP_006440247.1| hypothetical protein CICLE_v10019985mg [Citrus clementina]
           gi|567895520|ref|XP_006440248.1| hypothetical protein
           CICLE_v10019985mg [Citrus clementina]
           gi|567895522|ref|XP_006440249.1| hypothetical protein
           CICLE_v10019985mg [Citrus clementina]
           gi|557542509|gb|ESR53487.1| hypothetical protein
           CICLE_v10019985mg [Citrus clementina]
           gi|557542510|gb|ESR53488.1| hypothetical protein
           CICLE_v10019985mg [Citrus clementina]
           gi|557542511|gb|ESR53489.1| hypothetical protein
           CICLE_v10019985mg [Citrus clementina]
          Length = 475

 Score =  210 bits (534), Expect = 5e-52
 Identities = 102/175 (58%), Positives = 130/175 (74%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC PNRVTI TLIKG CVEG +DE Y+LIDKVV  GS+ S  CYSSL+V L++ K L+EA
Sbjct: 301 GCAPNRVTISTLIKGFCVEGNLDEAYQLIDKVVAGGSVSSGGCYSSLVVELVRTKRLKEA 360

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EKLF KML SG+KPDGLAC  +I++LC  G+ L+GF  Y ++EK   L+SVDSDI+S++L
Sbjct: 361 EKLFSKMLASGVKPDGLACSVMIRELCLRGQVLEGFCLYEDIEKIGFLSSVDSDIHSVLL 420

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTSLVG 528
            GLCR+ H VEAAKL   M++++I L+ PY D IVE+L KS + EL   L  + G
Sbjct: 421 LGLCRKNHSVEAAKLARFMLKKRIWLQGPYVDKIVEHLKKSGDEELITNLPKIGG 475


>ref|XP_006477135.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g47360-like isoform X1 [Citrus sinensis]
           gi|568846596|ref|XP_006477136.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g47360-like isoform X2 [Citrus sinensis]
           gi|568846598|ref|XP_006477137.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g47360-like isoform X3 [Citrus sinensis]
          Length = 475

 Score =  209 bits (532), Expect = 9e-52
 Identities = 102/175 (58%), Positives = 130/175 (74%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC PNRVTI TLIKG CVEG +DE Y+LIDKVV  GS+ S  CYSSL+V L++ K L+EA
Sbjct: 301 GCAPNRVTISTLIKGFCVEGNLDEAYQLIDKVVAGGSVSSGGCYSSLVVELVRTKRLKEA 360

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EKLF KML SG+KPDGLAC  +I++LC  G+ L+GF  Y ++EK   L+SVDSDI+S++L
Sbjct: 361 EKLFSKMLASGVKPDGLACSVMIRELCLGGQVLEGFCLYEDIEKIGFLSSVDSDIHSVLL 420

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTSLVG 528
            GLCR+ H VEAAKL   M++++I L+ PY D IVE+L KS + EL   L  + G
Sbjct: 421 LGLCRKNHSVEAAKLARFMLKKRIWLQGPYVDKIVEHLKKSGDEELITNLPKIGG 475


>ref|XP_007039757.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
           1 [Theobroma cacao] gi|590676515|ref|XP_007039758.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|590676519|ref|XP_007039759.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|590676523|ref|XP_007039760.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508777002|gb|EOY24258.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508777003|gb|EOY24259.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508777004|gb|EOY24260.1| Tetratricopeptide
           repeat-like superfamily protein, putative isoform 1
           [Theobroma cacao] gi|508777005|gb|EOY24261.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
          Length = 483

 Score =  208 bits (529), Expect = 2e-51
 Identities = 97/170 (57%), Positives = 127/170 (74%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC PNRVT+ TLIK LC EG ++E YKLIDKVV  G +    CYSSL+VSL+++K L+EA
Sbjct: 301 GCAPNRVTVSTLIKRLCAEGHVEEAYKLIDKVVPGGGVSDGDCYSSLVVSLIRIKRLDEA 360

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EKLF KML +G KPD +AC  +I+++C EGR LDGF  Y E+E+   L+S+D+DIYSI+L
Sbjct: 361 EKLFRKMLATGAKPDSIACSIMIREICQEGRVLDGFYLYEEIERMRYLSSIDADIYSILL 420

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRL 513
            GLCRQ H VEAAKL   M+E++IRLK+PY D I+E+L    +++L   L
Sbjct: 421 VGLCRQSHSVEAAKLARSMLEKRIRLKAPYVDKIIEHLKNCGDKQLVTEL 470



 Score = 58.9 bits (141), Expect = 2e-06
 Identities = 37/139 (26%), Positives = 69/139 (49%), Gaps = 2/139 (1%)
 Frame = +1

Query: 13  PNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEAEKL 192
           P+ +T + +IKG C  GR+++   L   +   G  P+   YS+L+  + +  ++E+A +L
Sbjct: 197 PDMITYLAMIKGFCNAGRLEDACGLFQVMREHGCFPNAVAYSALLEGICRYGSVEKALEL 256

Query: 193 FGKMLV--SGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIILA 366
            G+M     G  P+ +    +I+  C +G+          M    C  + +    S ++ 
Sbjct: 257 LGEMEKEGDGCSPNVITYTSVIQSFCEKGQTTKALRVLDRM--GTCGCAPNRVTVSTLIK 314

Query: 367 GLCRQGHLVEAAKLINIMV 423
            LC +GH+ EA KLI+ +V
Sbjct: 315 RLCAEGHVEEAYKLIDKVV 333


>ref|XP_003631455.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g47360-like [Vitis vinifera]
          Length = 638

 Score =  201 bits (510), Expect = 3e-49
 Identities = 98/171 (57%), Positives = 128/171 (74%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC PNRVT+  L+KG C EGR++E +KLIDKVV  G++    CYSSLIVSL+  KNL+EA
Sbjct: 297 GCAPNRVTVSILMKGFCAEGRVEEAFKLIDKVVAGGNVSYGECYSSLIVSLVGNKNLQEA 356

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EKLF +ML + +KPDGLACG LIK LC EGR LDGF  + E E  + L+ +DSDIYSI+L
Sbjct: 357 EKLFRRMLANAVKPDGLACGTLIKALCLEGRVLDGFHLFDEFENMEGLSYLDSDIYSILL 416

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLT 516
            GL ++ H VEA KL  +MV+R I+LK+PY D IVE+L +S ++E+ + L+
Sbjct: 417 VGLSQKRHSVEAVKLARLMVDRGIQLKTPYFDSIVEHLKESGDKEIVMYLS 467



 Score = 63.2 bits (152), Expect = 1e-07
 Identities = 38/139 (27%), Positives = 71/139 (51%), Gaps = 2/139 (1%)
 Frame = +1

Query: 13  PNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEAEKL 192
           PN +T +T+IKG C  GR+++  KL   +   G  P+   Y+ ++  + +  +LE A +L
Sbjct: 193 PNMITYVTMIKGFCNVGRLEDACKLFKVMKGHGCSPNVVVYTVILDGVCRFGSLERALEL 252

Query: 193 FGKMLVSG--MKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIILA 366
            G+M        P+ +    +I+  C +G+ ++       M  + C  + +    SI++ 
Sbjct: 253 LGEMEKESGDCSPNVVTYTSMIQSCCEKGKLMEALEILDRM--RACGCAPNRVTVSILMK 310

Query: 367 GLCRQGHLVEAAKLINIMV 423
           G C +G + EA KLI+ +V
Sbjct: 311 GFCAEGRVEEAFKLIDKVV 329


>gb|ACP39958.1| pentatricopeptide repeat protein [Gossypium hirsutum]
           gi|227463014|gb|ACP39959.1| pentatricopeptide repeat
           protein [Gossypium hirsutum]
          Length = 288

 Score =  200 bits (509), Expect = 4e-49
 Identities = 92/172 (53%), Positives = 132/172 (76%)
 Frame = +1

Query: 7   CLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEAE 186
           C+PNR+T+ITLI GLC +G ++E YKLID+V   G   SD CYSSL+++L+++  L EAE
Sbjct: 116 CVPNRITVITLITGLCTKGHVEEAYKLIDRVAGRGVSNSD-CYSSLVLALIRINRLNEAE 174

Query: 187 KLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIILA 366
           KLF KMLVSG KP G+AC  +I+++C EGR LDGF  Y+E+E+   ++S+D+DIYSI+L 
Sbjct: 175 KLFRKMLVSGAKPSGIACSTMIREICHEGRVLDGFCLYNEIERMQYISSIDTDIYSILLV 234

Query: 367 GLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTSL 522
           GLCRQ H VEA KL  +M+ R+IRL++PY D I+++L  S+++EL  +L+ +
Sbjct: 235 GLCRQSHSVEAVKLARLMLRRRIRLEAPYVDEIIKHLKNSTDKELVTQLSRI 286



 Score = 69.3 bits (168), Expect = 1e-09
 Identities = 42/150 (28%), Positives = 77/150 (51%), Gaps = 2/150 (1%)
 Frame = +1

Query: 13  PNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEAEKL 192
           P+ +T   +IKG C  GR++E  +L   +   G  P+   YS L+  + + ++ E+A +L
Sbjct: 11  PDMMTYFAMIKGFCNAGRLEEACELFQAMKGQGFSPNAVTYSVLLEGICKYRSTEKALEL 70

Query: 193 FGKMLVSG--MKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIILA 366
            G+M  +G    P+ +    +IK  C +G+ ++       ME   C+ +  + I   ++ 
Sbjct: 71  LGEMEKAGGNCSPNVITYTSMIKSFCEKGQTIEALRILDRMEACQCVPNRITVI--TLIT 128

Query: 367 GLCRQGHLVEAAKLINIMVERKIRLKSPYA 456
           GLC +GH+ EA KLI+ +  R +     Y+
Sbjct: 129 GLCTKGHVEEAYKLIDRVAGRGVSNSDCYS 158


>emb|CBI25851.3| unnamed protein product [Vitis vinifera]
          Length = 528

 Score =  200 bits (508), Expect = 5e-49
 Identities = 98/173 (56%), Positives = 127/173 (73%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC PNRVT+  L+KG C EGR++E +KLIDKVV  G++    CYSSLIVSL+  KNL+EA
Sbjct: 303 GCAPNRVTVSILMKGFCAEGRVEEAFKLIDKVVAGGNVSYGECYSSLIVSLVGNKNLQEA 362

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EKLF +ML + +KPDGLACG LIK LC EGR LDGF  + E E  + L+ +DSDIYSI+L
Sbjct: 363 EKLFRRMLANAVKPDGLACGTLIKALCLEGRVLDGFHLFDEFENMEGLSYLDSDIYSILL 422

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTSL 522
            GL ++ H VEA KL  +MV+R I+LK+PY D IVE+L +S ++E+     +L
Sbjct: 423 VGLSQKRHSVEAVKLARLMVDRGIQLKTPYFDSIVEHLKESGDKEICTHFCTL 475



 Score = 63.2 bits (152), Expect = 1e-07
 Identities = 38/139 (27%), Positives = 71/139 (51%), Gaps = 2/139 (1%)
 Frame = +1

Query: 13  PNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEAEKL 192
           PN +T +T+IKG C  GR+++  KL   +   G  P+   Y+ ++  + +  +LE A +L
Sbjct: 199 PNMITYVTMIKGFCNVGRLEDACKLFKVMKGHGCSPNVVVYTVILDGVCRFGSLERALEL 258

Query: 193 FGKMLVSG--MKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIILA 366
            G+M        P+ +    +I+  C +G+ ++       M  + C  + +    SI++ 
Sbjct: 259 LGEMEKESGDCSPNVVTYTSMIQSCCEKGKLMEALEILDRM--RACGCAPNRVTVSILMK 316

Query: 367 GLCRQGHLVEAAKLINIMV 423
           G C +G + EA KLI+ +V
Sbjct: 317 GFCAEGRVEEAFKLIDKVV 335


>ref|XP_002531466.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223528920|gb|EEF30916.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 518

 Score =  194 bits (494), Expect = 2e-47
 Identities = 89/173 (51%), Positives = 127/173 (73%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC PNRVT+ TL+K LC++G ++E YKLID+VV  GS+ S  CYS ++V L+++K +EEA
Sbjct: 300 GCAPNRVTVSTLLKRLCMDGHLEEAYKLIDRVVAGGSVSSCDCYSPIVVCLIRIKKVEEA 359

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EKLF + +VSG+KPDGLAC  +IK+LC   R LDG+  + E+EK   L+++DSD YS++L
Sbjct: 360 EKLFRRAVVSGVKPDGLACSLMIKELCFVNRVLDGYCLHDEIEKIGSLSTIDSDTYSVLL 419

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTSL 522
            GLC+QG+ +EAAKL   ++E++I LK PY D +VEY+ K    +L   L S+
Sbjct: 420 VGLCQQGYSLEAAKLARSLIEKRIHLKHPYVDKVVEYMKKFGVTDLVTELASI 472



 Score = 79.0 bits (193), Expect = 2e-12
 Identities = 48/139 (34%), Positives = 77/139 (55%), Gaps = 2/139 (1%)
 Frame = +1

Query: 13  PNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEAEKL 192
           P+ VT +++IKG C  GR++E  +L+ ++   G +P+   YS+L+  + +  ++E A +L
Sbjct: 196 PDMVTYVSIIKGFCDIGRLEEACRLVKEMRAHGCVPNVVVYSTLVDGICRFGSVERALEL 255

Query: 193 FGKMLVSG--MKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIILA 366
            G M   G    P+ L    +I+ LC +GR +D FA    ME   C  + +    S +L 
Sbjct: 256 LGGMEKEGGDCNPNVLTYTSVIQGLCEKGRTMDAFAVLDRMEA--CGCAPNRVTVSTLLK 313

Query: 367 GLCRQGHLVEAAKLINIMV 423
            LC  GHL EA KLI+ +V
Sbjct: 314 RLCMDGHLEEAYKLIDRVV 332


>ref|XP_007212439.1| hypothetical protein PRUPE_ppa016777mg, partial [Prunus persica]
           gi|462408304|gb|EMJ13638.1| hypothetical protein
           PRUPE_ppa016777mg, partial [Prunus persica]
          Length = 394

 Score =  190 bits (483), Expect = 4e-46
 Identities = 90/166 (54%), Positives = 121/166 (72%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC P+RVT+  LIK  CVE +++E YKLID+VVV  S+    CYSSL+VSL + +  EEA
Sbjct: 226 GCAPSRVTVSILIKSFCVEDQVEEAYKLIDRVVVGRSVTYSDCYSSLVVSLARGRKPEEA 285

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EK+   ML SG+KP+ LAC  ++K++C EGR +DGF  + E+EK +CL+S+DSD YSI+L
Sbjct: 286 EKVLRMMLDSGLKPNSLACSIMLKKVCLEGRVIDGFCLFDELEKMECLSSIDSDTYSILL 345

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSEREL 501
            GLC Q HL+EAAKL  +M+ + I+LK+PY D I E L KS + EL
Sbjct: 346 VGLCEQRHLLEAAKLARLMLNKGIKLKAPYVDSIAEILKKSGDEEL 391



 Score = 65.5 bits (158), Expect = 2e-08
 Identities = 48/173 (27%), Positives = 82/173 (47%), Gaps = 5/173 (2%)
 Frame = +1

Query: 10  LPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEAEK 189
           LP+ +T + +I G C  GR+D+   L   +   G +P+   YS+L+    + +N+E A +
Sbjct: 121 LPDLITYVVMINGFCKVGRLDDACGLFKVMKGHGCLPNAVVYSALLDGFCRSENMERALE 180

Query: 190 LFGKMLVSG--MKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           L  +M   G    P+ +    +I++LC +GR  +       ME   C  S      SI++
Sbjct: 181 LLTEMEKEGGDCSPNVVTYTSVIQKLCDKGRSKEALVILDRMEACGCAPS--RVTVSILI 238

Query: 364 AGLCRQGHLVEAAKLIN-IMVERKIRLKSPYADGIVEYL--MKSSERELALRL 513
              C +  + EA KLI+ ++V R +     Y+  +V      K  E E  LR+
Sbjct: 239 KSFCVEDQVEEAYKLIDRVVVGRSVTYSDCYSSLVVSLARGRKPEEAEKVLRM 291


>ref|XP_006368989.1| hypothetical protein POPTR_0001s15470g [Populus trichocarpa]
           gi|550347348|gb|ERP65558.1| hypothetical protein
           POPTR_0001s15470g [Populus trichocarpa]
          Length = 476

 Score =  190 bits (482), Expect = 6e-46
 Identities = 88/175 (50%), Positives = 123/175 (70%)
 Frame = +1

Query: 1   QGCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEE 180
           +GC PNRVT    I G+C  G++ + Y  I+++V  GS+    CYSSL+V L+++K +EE
Sbjct: 301 RGCAPNRVTASAWINGICTNGQLQDVYNFIERIVAGGSVSIGDCYSSLVVCLIKIKKVEE 360

Query: 181 AEKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSII 360
           AEK F + L SGMKPD LAC  +I+++CSE R LDGF  Y E+EK  CL+S+D DIYSI+
Sbjct: 361 AEKTFRRALSSGMKPDSLACSMMIREICSEKRVLDGFCLYEEVEKTGCLSSIDIDIYSIL 420

Query: 361 LAGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTSLV 525
           LAGLC+QGH  EAA+L   M+E++I L++P+ + IVE+L     +EL   L S+V
Sbjct: 421 LAGLCQQGHSAEAARLARSMLEKRIPLRAPHVEKIVEHLKNFGGKELVAELVSMV 475



 Score = 57.4 bits (137), Expect = 6e-06
 Identities = 37/139 (26%), Positives = 67/139 (48%), Gaps = 2/139 (1%)
 Frame = +1

Query: 13  PNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEAEKL 192
           P+ +T +++IKG C  GR++E + L   + V G  P+   YS+L+  + +   +E A +L
Sbjct: 198 PDMITYVSMIKGFCDVGRLEEAFALFPVMSVHGCYPNVVAYSALLDGICRFGIVERAFEL 257

Query: 193 FGKM--LVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIILA 366
             +M     G  P+ +    +I+  C +GR  D  +    ME + C  + +    S  + 
Sbjct: 258 LAEMEKQGEGCCPNVITYTSVIQSFCEQGRTKDALSVLELMEVRGC--APNRVTASAWIN 315

Query: 367 GLCRQGHLVEAAKLINIMV 423
           G+C  G L +    I  +V
Sbjct: 316 GICTNGQLQDVYNFIERIV 334


>ref|XP_004300367.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g47360-like isoform 1 [Fragaria vesca subsp. vesca]
           gi|470128894|ref|XP_004300368.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g47360-like isoform 2 [Fragaria vesca subsp. vesca]
          Length = 421

 Score =  181 bits (459), Expect = 3e-43
 Identities = 86/172 (50%), Positives = 122/172 (70%)
 Frame = +1

Query: 1   QGCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEE 180
           +GCLPNRVT+ TLI GL  E +++  YKL+D+VV  GS+    CYS+ +VSL ++   EE
Sbjct: 248 RGCLPNRVTVSTLITGLVKEDQVEHAYKLVDRVVKSGSVTKTDCYSTFVVSLERVGRPEE 307

Query: 181 AEKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSII 360
           AEK+   ML SG+KP+ L C  ++K+ C EGR +D +  + E+EK +CL+S++SD YSI+
Sbjct: 308 AEKVLRMMLNSGVKPNSLVCTIMLKKCCLEGRMVDAYCLFGELEKMECLSSIESDTYSIL 367

Query: 361 LAGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLT 516
           L GLC+Q HLVEAA+L  +M+ + I+LK PY D I E L+KS + EL  +LT
Sbjct: 368 LLGLCQQRHLVEAAELARVMLSKGIKLKGPYVDIISEVLVKSGDEELVKQLT 419


>gb|EXC02094.1| hypothetical protein L484_024059 [Morus notabilis]
          Length = 474

 Score =  180 bits (457), Expect = 4e-43
 Identities = 89/173 (51%), Positives = 122/173 (70%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC PNRVT+  LI+  C EGR++E  KLID+VV  G +  D C SS +VSL +    EEA
Sbjct: 301 GCFPNRVTVSCLIERFCAEGRVEEVSKLIDRVV-KGGVSYDECCSSFVVSLKRTGQFEEA 359

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EK+F KM+ +G+KPD LAC  +IK+LC  GR LDG+    E+EK    +S+DSD+YS+++
Sbjct: 360 EKVFRKMINNGLKPDSLACTIVIKELCLIGRVLDGYQLCDEIEKIGFWSSIDSDVYSLLI 419

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTSL 522
            GLC+QGHLVEAA L+++M+++ I+L +PY D IVE L KS + EL   LT +
Sbjct: 420 VGLCQQGHLVEAANLVSLMLKKGIQLSAPYVDRIVEILKKSGDEELIHHLTRI 472


>ref|XP_007131288.1| hypothetical protein PHAVU_011G001300g [Phaseolus vulgaris]
           gi|561004288|gb|ESW03282.1| hypothetical protein
           PHAVU_011G001300g [Phaseolus vulgaris]
          Length = 474

 Score =  179 bits (454), Expect = 1e-42
 Identities = 84/173 (48%), Positives = 121/173 (69%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC  N VT+ TL+  LCVEGR+ E YKLIDK VV+  +    C SSL++SL+++K L+EA
Sbjct: 298 GCHANHVTVFTLVDRLCVEGRVGEAYKLIDKFVVEHGVSYGNCCSSLVISLIRIKKLDEA 357

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EKLF +ML   ++PD LA   L+K+LC + + LDGF     ME K CL+++D+ IYSI+L
Sbjct: 358 EKLFMEMLSGDVRPDSLASSLLLKELCMKDQVLDGFHLLEAMENKGCLSTIDNGIYSILL 417

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTSL 522
            GLC++ HL EA KL  IM+++ + L+ PY DG ++ L+KS E++L  +LT +
Sbjct: 418 VGLCQRNHLTEATKLAKIMLKKSVPLQPPYKDGAIDILIKSGEKDLVNQLTCI 470


>ref|XP_004515635.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g47360-like [Cicer arietinum]
          Length = 477

 Score =  176 bits (447), Expect = 6e-42
 Identities = 84/173 (48%), Positives = 119/173 (68%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC  N VT+ TLI+ LC+EGR++E YKL+DK VV+  +     YSSL++SL+++K LEEA
Sbjct: 301 GCFANHVTVFTLIESLCIEGRVEEAYKLVDKFVVEHGVSRGDSYSSLVISLIRIKKLEEA 360

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EKLF +ML   +KPD LA   L+K+ C + R LDGF     +E K  L+S+DSDIYSI+L
Sbjct: 361 EKLFKEMLDGEIKPDTLASSLLLKEFCLKDRVLDGFYLLDAIENKGFLSSIDSDIYSILL 420

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTSL 522
            GLCR+ HL+EA KL  IM+++ + L+ PY D  ++ L K  E+ +  +LT +
Sbjct: 421 VGLCRENHLMEATKLATIMLKKGVSLRPPYRDSAIDVLNKYGEKGIVNQLTGI 473


>ref|XP_004139002.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g47360-like [Cucumis sativus]
           gi|449505643|ref|XP_004162530.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g47360-like [Cucumis sativus]
          Length = 475

 Score =  175 bits (444), Expect = 1e-41
 Identities = 82/170 (48%), Positives = 119/170 (70%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           G  PNRV +  L+K  C +G ++E YKLID+VV  G +    CYSSL+V+L+++K + EA
Sbjct: 301 GYAPNRVAVSFLVKEFCKDGHVEEAYKLIDRVVARGGVSYGDCYSSLVVTLVKMKKIAEA 360

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           EKLF  ML +G+KPDG+AC  +I++LC E R LDGF    E+++   L S+D+DIYS++L
Sbjct: 361 EKLFRNMLANGVKPDGVACSLMIRELCLEERVLDGFNLCYEVDRNGYLCSIDADIYSLLL 420

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRL 513
            GLC   H V+AAKL  +M+++ IRLK  YA+ I+++L K  +REL + L
Sbjct: 421 VGLCEHDHSVDAAKLARLMLKKGIRLKPHYAESIIKHLKKFEDRELVMHL 470



 Score = 62.8 bits (151), Expect = 1e-07
 Identities = 39/141 (27%), Positives = 71/141 (50%), Gaps = 2/141 (1%)
 Frame = +1

Query: 13  PNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEAEKL 192
           PN +T I+++KG C  GR ++ Y L   +  +G  P+   YS L+   ++L+ ++   ++
Sbjct: 197 PNMITYISMLKGFCDVGRWEDAYGLFKDMKENGCAPNTVVYSVLVNGAIRLRIMDRLMEM 256

Query: 193 FGKMLVSG--MKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIILA 366
             +M   G    P+ +    +I+ LC EG  L+       ME+     + +    S ++ 
Sbjct: 257 LKEMEKQGGTCSPNTVTYTSIIQSLCEEGHPLEALKVLDRMEEYG--YAPNRVAVSFLVK 314

Query: 367 GLCRQGHLVEAAKLINIMVER 429
             C+ GH+ EA KLI+ +V R
Sbjct: 315 EFCKDGHVEEAYKLIDRVVAR 335


>ref|XP_003604902.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355505957|gb|AES87099.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 449

 Score =  162 bits (410), Expect = 1e-37
 Identities = 78/164 (47%), Positives = 111/164 (67%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC  N VT+ TLI+ LC EGR+DE YK++DK+VV+  +    CY+SL++S +++K LE A
Sbjct: 276 GCFANHVTVFTLIESLCTEGRVDEAYKVVDKLVVEHCVSRGDCYNSLVISFIRVKKLEGA 335

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           E LF +ML + +KPD LA   L+K+LC + R LDGF     +E    L+S+DSDIYSI+L
Sbjct: 336 ENLFKEMLAAEIKPDTLASSLLLKELCLKDRVLDGFYLLDTIENMGFLSSIDSDIYSIML 395

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSER 495
            GL ++ HL EA KL  IM+++ I L+ PY D  ++ L K  E+
Sbjct: 396 IGLWQKNHLTEATKLAKIMLKKAIPLRPPYKDRAIDILRKYGEK 439



 Score = 64.3 bits (155), Expect = 5e-08
 Identities = 41/160 (25%), Positives = 84/160 (52%), Gaps = 2/160 (1%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           G  P+ +T +T+I+GLC  GR++E Y+++  +  +G  P+    S+++  L +L ++E A
Sbjct: 170 GICPDLITYMTMIEGLCSAGRLEEAYEMVKVMRGNGCSPNSVVLSAVLDGLCRLDSMERA 229

Query: 184 EKLFGKMLVSG-MKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSII 360
            +L  +M  SG   P+ +    LI+  C  G + +       M    C A+    ++++I
Sbjct: 230 LELLDEMEKSGDCCPNVVTYTSLIQSFCKRGEWTEALNILDRMRAFGCFAN-HVTVFTLI 288

Query: 361 LAGLCRQGHLVEAAKLIN-IMVERKIRLKSPYADGIVEYL 477
              LC +G + EA K+++ ++VE  +     Y   ++ ++
Sbjct: 289 -ESLCTEGRVDEAYKVVDKLVVEHCVSRGDCYNSLVISFI 327


>ref|XP_006359252.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g47360-like isoform X2 [Solanum tuberosum]
          Length = 487

 Score =  159 bits (401), Expect = 1e-36
 Identities = 80/167 (47%), Positives = 110/167 (65%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC PNRV I TLI GLC EG ++E +K+ID+V   G I  D CYSSL++SL ++  +EEA
Sbjct: 307 GCKPNRVLISTLIHGLCKEGHVEEAHKVIDRVAKSG-ISYDSCYSSLVLSLFRIGKVEEA 365

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           E  F +ML  G+KPD      +I+ LC + R LDG   Y  +E+   ++S+DSDIYSI++
Sbjct: 366 EMFFRRMLTGGLKPDSFTSSTIIRWLCQQNRILDG---YHLIEQSASVSSIDSDIYSILM 422

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELA 504
           AGLC   HL EAAKL ++MVE++I+LK P    + E L    + +LA
Sbjct: 423 AGLCEANHLAEAAKLAHLMVEKRIQLKGPCVKNVTECLRHCGKEDLA 469


>ref|XP_006359251.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g47360-like isoform X1 [Solanum tuberosum]
          Length = 488

 Score =  159 bits (401), Expect = 1e-36
 Identities = 80/167 (47%), Positives = 110/167 (65%)
 Frame = +1

Query: 4   GCLPNRVTIITLIKGLCVEGRIDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLEEA 183
           GC PNRV I TLI GLC EG ++E +K+ID+V   G I  D CYSSL++SL ++  +EEA
Sbjct: 307 GCKPNRVLISTLIHGLCKEGHVEEAHKVIDRVAKSG-ISYDSCYSSLVLSLFRIGKVEEA 365

Query: 184 EKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSIIL 363
           E  F +ML  G+KPD      +I+ LC + R LDG   Y  +E+   ++S+DSDIYSI++
Sbjct: 366 EMFFRRMLTGGLKPDSFTSSTIIRWLCQQNRILDG---YHLIEQSASVSSIDSDIYSILM 422

Query: 364 AGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELA 504
           AGLC   HL EAAKL ++MVE++I+LK P    + E L    + +LA
Sbjct: 423 AGLCEANHLAEAAKLAHLMVEKRIQLKGPCVKNVTECLRHCGKEDLA 469


>ref|XP_002863348.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297309183|gb|EFH39607.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 477

 Score =  157 bits (397), Expect = 4e-36
 Identities = 78/174 (44%), Positives = 119/174 (68%), Gaps = 1/174 (0%)
 Frame = +1

Query: 1   QGCLPNRVTIITLIKGLCVEGR-IDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLE 177
           +GC PNRVT   LI+G+      + +  KLIDK+V  G +    C+SS  VSL+++K  E
Sbjct: 303 RGCTPNRVTASVLIQGVLENDEDVKDLSKLIDKLVKLGGVSLSECFSSATVSLIRMKRWE 362

Query: 178 EAEKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSI 357
           EAEK+F  MLV G++PDGLAC ++ ++LC   R+LD F  Y E+EK+D  +++DSDIY++
Sbjct: 363 EAEKIFRLMLVRGIRPDGLACTHVFRELCLSERYLDCFVLYQEIEKEDVKSTMDSDIYAV 422

Query: 358 ILAGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTS 519
           +L GLC+QG+  EAAKL   M+++K+RLK  + + I+E L K+ + +L  R ++
Sbjct: 423 LLLGLCQQGNSWEAAKLAKSMLDKKMRLKVSHVEKIIEALKKTGDEDLMSRFST 476


>ref|NP_199547.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75180684|sp|Q9LVS3.1|PP422_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g47360 gi|8809619|dbj|BAA97170.1| unnamed protein
           product [Arabidopsis thaliana]
           gi|332008119|gb|AED95502.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 477

 Score =  157 bits (396), Expect = 5e-36
 Identities = 78/173 (45%), Positives = 118/173 (68%), Gaps = 1/173 (0%)
 Frame = +1

Query: 1   QGCLPNRVTIITLIKGLCVEGR-IDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLE 177
           +GC+PNRVT   LI+G+      +    KLIDK+V  G +    C+SS  VSL+++K  E
Sbjct: 303 RGCMPNRVTACVLIQGVLENDEDVKALSKLIDKLVKLGGVSLSECFSSATVSLIRMKRWE 362

Query: 178 EAEKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSI 357
           EAEK+F  MLV G++PDGLAC ++ ++LC   R+LD F  Y E+EKKD  +++DSDI+++
Sbjct: 363 EAEKIFRLMLVRGVRPDGLACSHVFRELCLLERYLDCFLLYQEIEKKDVKSTIDSDIHAV 422

Query: 358 ILAGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLT 516
           +L GLC+QG+  EAAKL   M+++K+RLK  + + I+E L K+ + +L  R +
Sbjct: 423 LLLGLCQQGNSWEAAKLAKSMLDKKMRLKVSHVEKIIEALKKTGDEDLMSRFS 475


>ref|XP_006282107.1| hypothetical protein CARUB_v10028355mg, partial [Capsella rubella]
           gi|482550811|gb|EOA15005.1| hypothetical protein
           CARUB_v10028355mg, partial [Capsella rubella]
          Length = 493

 Score =  152 bits (384), Expect = 1e-34
 Identities = 76/174 (43%), Positives = 117/174 (67%), Gaps = 1/174 (0%)
 Frame = +1

Query: 1   QGCLPNRVTIITLIKGLCVEGR-IDETYKLIDKVVVDGSIPSDRCYSSLIVSLLQLKNLE 177
           +GC PNRVT   LI+G+      + +  K+IDK+V  G +    C+SS  VSL+++K  E
Sbjct: 319 RGCTPNRVTASVLIQGVLENNEDVKDLTKVIDKLVKLGGVSLSECFSSATVSLIRMKRWE 378

Query: 178 EAEKLFGKMLVSGMKPDGLACGYLIKQLCSEGRFLDGFARYSEMEKKDCLASVDSDIYSI 357
           EA+K+F  MLV G++PDGLAC  ++++LC   R+LD F  Y E+EK D  +++DSDI++I
Sbjct: 379 EADKIFRLMLVRGIRPDGLACSLVLRELCLLERYLDCFLLYQEIEKADVKSTIDSDIHAI 438

Query: 358 ILAGLCRQGHLVEAAKLINIMVERKIRLKSPYADGIVEYLMKSSERELALRLTS 519
           +L GLC+QG   EAAKL   M+++K+RLK  + + I+E L K+ + +L  R ++
Sbjct: 439 LLLGLCKQGSSWEAAKLAKSMLDKKMRLKVSHVEKIIEALKKTGDEDLMRRFST 492

