BLASTX nr result

ID: Rehmannia22_contig00010512 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia22_contig00010512
         (990 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlise...   277   4e-72
ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containi...   272   1e-70
ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containi...   260   7e-67
ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citr...   248   3e-63
ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containi...   246   7e-63
gb|EOX98954.1| Pentatricopeptide repeat superfamily protein, put...   245   2e-62
emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]   242   2e-61
ref|XP_002868835.1| pentatricopeptide repeat-containing protein ...   241   4e-61
ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containi...   240   5e-61
gb|AFK36371.1| unknown [Lotus japonicus]                              240   7e-61
gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis]     239   9e-61
ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containi...   238   2e-60
ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containi...   238   2e-60
ref|NP_195528.1| pentatricopeptide repeat-containing protein [Ar...   235   2e-59
ref|XP_006394459.1| hypothetical protein EUTSA_v10005467mg [Eutr...   234   4e-59
gb|AAL77701.1| AT4g38150/F20D10_270 [Arabidopsis thaliana] gi|23...   234   5e-59
ref|XP_002514391.1| pentatricopeptide repeat-containing protein,...   233   1e-58
ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containi...   231   3e-58
ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containi...   231   3e-58
ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containi...   231   3e-58

>gb|EPS62438.1| hypothetical protein M569_12351, partial [Genlisea aurea]
          Length = 272

 Score =  277 bits (709), Expect = 4e-72
 Identities = 142/210 (67%), Positives = 167/210 (79%), Gaps = 3/210 (1%)
 Frame = +3

Query: 369 PPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRG-ETDADFLERFKLGFDSKVEN 545
           PPEPIPNRPLR +S    S   PK +R R   N  +    E+D+DFLERFKLGFD K   
Sbjct: 1   PPEPIPNRPLRGRSV--ASRITPKSDRIRGSGNPRAAAAAESDSDFLERFKLGFDRKTTT 58

Query: 546 P--KIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQE 719
           P  ++  ++K+   E+ E  +PLSPPE+ADEIF+KMKETGLIPNAVAMLDGLCKDGLVQ+
Sbjct: 59  PPGRVVESEKAGGEEEKEEQQPLSPPENADEIFRKMKETGLIPNAVAMLDGLCKDGLVQD 118

Query: 720 AMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVR 899
           A+KLFG MREKG+IP+VVVYTAVVEGFCKA K DDA+RIFKKM+SNG+ PN FSYQ+L+R
Sbjct: 119 ALKLFGTMREKGSIPDVVVYTAVVEGFCKAQKHDDAIRIFKKMKSNGIAPNAFSYQILIR 178

Query: 900 GLFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           GL  GKRLEDA GFT EMLE G+SPN+ATF
Sbjct: 179 GLCDGKRLEDASGFTAEMLETGYSPNLATF 208


>ref|XP_004253081.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like isoform 1 [Solanum lycopersicum]
           gi|460415472|ref|XP_004253082.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g38150-like isoform 2 [Solanum lycopersicum]
          Length = 340

 Score =  272 bits (696), Expect = 1e-70
 Identities = 143/234 (61%), Positives = 167/234 (71%), Gaps = 14/234 (5%)
 Frame = +3

Query: 330 FSSIDDGLGRSDYPP--EPIPNRPLRRQSY-PYGSPRIPKPN-----------RGREIEN 467
           FS   D    S+YPP  EPIPNRPLR  S  P+   +   P+           R     N
Sbjct: 44  FSDYSDESAESNYPPPPEPIPNRPLRADSRRPFNPSQRQHPSNRSSPNHSTTFRRSSENN 103

Query: 468 QNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKM 647
           ++  + +   DFL+RF+LGFD K ENP  +   +S     +E   P +PPEDADEIFKKM
Sbjct: 104 ESQMKSQDSEDFLKRFQLGFDRKEENPNTNPKAESRDCPVSE--APPAPPEDADEIFKKM 161

Query: 648 KETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDA 827
           KETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YTAVV+GFCKA KFDDA
Sbjct: 162 KETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKFDDA 221

Query: 828 VRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           VRIF+KMQ NG+IPN FSY +++RGL  GKRL+DA  F +EMLEAGHSPN+ TF
Sbjct: 222 VRIFRKMQGNGIIPNAFSYGIIIRGLSQGKRLDDALEFCLEMLEAGHSPNVVTF 275


>ref|XP_006342489.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Solanum tuberosum]
          Length = 354

 Score =  260 bits (664), Expect = 7e-67
 Identities = 140/245 (57%), Positives = 164/245 (66%), Gaps = 26/245 (10%)
 Frame = +3

Query: 333 SSIDDGLGRSDYPP--EPIPNRPLRRQSY-----------------PYGSPRIPKPNRGR 455
           S+  D   +S+YPP  +PIPNRPLR  S                  P  +     P    
Sbjct: 45  SNYSDEFTQSNYPPPPDPIPNRPLRGDSKRPLRDDSRRPLRDDFRRPLRADSSNNPTHST 104

Query: 456 EIE-----NQNSFRGETDADFLERFKLGFDSKVENPKIDSA--DKSIQSEKAENMEPLSP 614
            +      N    + +   DFL+RF+LGFD K ENP  + A   K   S+   +  P +P
Sbjct: 105 TLRRSGENNGGQMKSQDSEDFLKRFQLGFDRKEENPNTNPALHPKGESSDSPVSEAPPAP 164

Query: 615 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVE 794
           PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YTAVV+
Sbjct: 165 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVD 224

Query: 795 GFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSP 974
           GF KA KFDDAVRIF+KMQ NG+IPN FSY +L+RGL  G RL+DA+ F +EMLEAGHSP
Sbjct: 225 GFFKAQKFDDAVRIFRKMQGNGIIPNAFSYGILIRGLSQGNRLDDAFEFCLEMLEAGHSP 284

Query: 975 NIATF 989
           N+ TF
Sbjct: 285 NVVTF 289



 Score = 57.4 bits (137), Expect = 8e-06
 Identities = 30/86 (34%), Positives = 48/86 (55%), Gaps = 3/86 (3%)
 Frame = +3

Query: 618 EDADEIFKKMKETGLIPNAVA---MLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAV 788
           +DA  IF+KM+  G+IPNA +   ++ GL +   + +A +    M E G  P VV +  +
Sbjct: 233 DDAVRIFRKMQGNGIIPNAFSYGILIRGLSQGNRLDDAFEFCLEMLEAGHSPNVVTFVTL 292

Query: 789 VEGFCKAHKFDDAVRIFKKMQSNGVI 866
           V+GFCK    +DA  + K ++  G I
Sbjct: 293 VDGFCKEKSLEDAQNMIKTVRQKGFI 318


>ref|XP_006422375.1| hypothetical protein CICLE_v10028759mg [Citrus clementina]
           gi|557524309|gb|ESR35615.1| hypothetical protein
           CICLE_v10028759mg [Citrus clementina]
          Length = 344

 Score =  248 bits (632), Expect = 3e-63
 Identities = 132/230 (57%), Positives = 163/230 (70%), Gaps = 23/230 (10%)
 Frame = +3

Query: 369 PPEPIPNRPLR-----------RQSYP-----YGSPRIPK------PNRGREIENQNSFR 482
           PPEPIP+RPLR           R+S+      Y   + P+      PNR R         
Sbjct: 55  PPEPIPDRPLRGERPFTNQNQNRRSFQPRFNNYQQQQRPQQQSFQSPNRPRPKSPDGV-- 112

Query: 483 GETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLS-PPEDADEIFKKMKETG 659
            ++D +FL++FKL  D K +NP+ + +    Q +K    EP+S PP++ADEIFKKMKETG
Sbjct: 113 -QSDENFLDQFKLAIDKKPDNPQQNESLGERQEQKPNRNEPISEPPQEADEIFKKMKETG 171

Query: 660 LIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIF 839
           LIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVV+YTAVV+GFCKA KFDDA RIF
Sbjct: 172 LIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKFDDAKRIF 231

Query: 840 KKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           +KMQSNG+ PN FSY +L++GL+   +LE+A  + IEMLEAGHSPN+ TF
Sbjct: 232 RKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTF 281


>ref|XP_006486551.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Citrus sinensis]
          Length = 387

 Score =  246 bits (629), Expect = 7e-63
 Identities = 133/237 (56%), Positives = 165/237 (69%), Gaps = 22/237 (9%)
 Frame = +3

Query: 345 DGLGRSDY-PPEPIPNRPLR--------RQSYPYGSPRIPKPNRGREIENQNSFRG---- 485
           D   R+D  PPEPIP+RPLR         Q+     PR     + ++   Q SF+     
Sbjct: 89  DNDNRNDQNPPEPIPDRPLRGERPFTNQNQNRRSFQPRFNNYQQ-QQRPQQQSFQSPNGP 147

Query: 486 --------ETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLS-PPEDADEIF 638
                   ++D +FL++FKL  D K  NP+ + +    Q +K    EP+S PP++ADEIF
Sbjct: 148 RPKSPDGVQSDENFLDQFKLAIDKKPGNPQQNESLGQRQEQKPNRNEPISEPPQEADEIF 207

Query: 639 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKF 818
           KKMKETGLIPNAVAMLDGLCKDGL+QEAMKLFGLMREKGTIPEVV+YTAVV+GFCKA KF
Sbjct: 208 KKMKETGLIPNAVAMLDGLCKDGLIQEAMKLFGLMREKGTIPEVVIYTAVVDGFCKAQKF 267

Query: 819 DDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           DDA RIF+KMQSNG+ PN FSY +L++GL+   +LE+A  + IEMLEAGHSPN+ TF
Sbjct: 268 DDAKRIFRKMQSNGIAPNAFSYNLLIQGLYKCNKLEEAVEYCIEMLEAGHSPNVTTF 324


>gb|EOX98954.1| Pentatricopeptide repeat superfamily protein, putative [Theobroma
           cacao]
          Length = 345

 Score =  245 bits (626), Expect = 2e-62
 Identities = 132/228 (57%), Positives = 158/228 (69%), Gaps = 16/228 (7%)
 Frame = +3

Query: 354 GRSDYPPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRG---------------- 485
           G  D PPEPIPNR L  Q  P+ +P   +        N +SF+                 
Sbjct: 58  GDGDKPPEPIPNRSLEGQR-PF-NPSFRETKGATLNSNGSSFQSFNTKFASDPNRKREDS 115

Query: 486 ETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLI 665
           ++D +FLE+FKLG D+K      DS   ++   K +  +P SPP+DADEIFKKMKETGLI
Sbjct: 116 QSDENFLEKFKLGLDNKRGKQPSDSEAAALLRRKEQEEKP-SPPQDADEIFKKMKETGLI 174

Query: 666 PNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKK 845
           PNAVAMLDGLCKDGL+QEAMKLFG MREKGTIPEVV+YTAVV+GFCKAHK DDA RIF+K
Sbjct: 175 PNAVAMLDGLCKDGLIQEAMKLFGSMREKGTIPEVVIYTAVVDGFCKAHKLDDAKRIFRK 234

Query: 846 MQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           MQS GV PN FSY VL++GL+   +L+DA  F +EMLEAGHSPN+ TF
Sbjct: 235 MQSKGVTPNSFSYIVLIQGLYRCNKLDDAIEFCLEMLEAGHSPNVTTF 282


>emb|CAN78573.1| hypothetical protein VITISV_020581 [Vitis vinifera]
          Length = 381

 Score =  242 bits (617), Expect = 2e-61
 Identities = 134/235 (57%), Positives = 157/235 (66%), Gaps = 17/235 (7%)
 Frame = +3

Query: 333 SSIDDGLGRSDYPPEPIPNRPLRRQS--------YPYGSPRIPKPNRGREIENQNSFRGE 488
           SS   G G S  PP PIPNRPLR +          P     +PK          + F   
Sbjct: 85  SSSCGGGGSSSNPPNPIPNRPLRGEQRMNRPPPHIPQRKLGLPKDEGVDRASQASPFNQP 144

Query: 489 TDAD---------FLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFK 641
           + A+         FLERFKLG   K E P+ +SA      E+  N     PP++ADEIF+
Sbjct: 145 SPAEKVGATLEDGFLERFKLGVQKK-ERPQ-ESAAAQPSREQDANHGKEQPPQNADEIFR 202

Query: 642 KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFD 821
           KMKE+GLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YTAVVEGFCKA + D
Sbjct: 203 KMKESGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLD 262

Query: 822 DAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIAT 986
           DAVRIF+KMQ+NG+ PN FSY VL+RG++ G RL+ A  F +EMLEAGHSPN+AT
Sbjct: 263 DAVRIFRKMQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVAT 317


>ref|XP_002868835.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297314671|gb|EFH45094.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 301

 Score =  241 bits (614), Expect = 4e-61
 Identities = 132/222 (59%), Positives = 157/222 (70%), Gaps = 3/222 (1%)
 Frame = +3

Query: 333 SSIDDGLGRSDYPPEPIPNRPLR--RQSYPYGSPRIPKPNRGREIENQNSFRGETDADFL 506
           S+ D G  +   PPEP+PNRPLR  R S  +  P   + +   +I+N  S     D  FL
Sbjct: 32  STGDKGQEKQQNPPEPLPNRPLRGERSSNSHREPPARQAHDLGKIDNTLS-----DDGFL 86

Query: 507 ERFKLGFDS-KVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAM 683
           E+FKLG +    E PK +   +          +PL PPED+DEIFKKMKE GLIPNAVAM
Sbjct: 87  EQFKLGVNQDSQETPKPEQYPQ----------DPLLPPEDSDEIFKKMKEGGLIPNAVAM 136

Query: 684 LDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGV 863
           LDGLCKDGLVQEAMKLFGLMR+KGTIPEVV+YTAVVEGFCKAHK +DA RIF+KMQ+NG+
Sbjct: 137 LDGLCKDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIEDAKRIFRKMQTNGI 196

Query: 864 IPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
            PN FSY VLV+GL++   L+DA  F  EMLE+GHSPNI TF
Sbjct: 197 TPNAFSYGVLVQGLYNCNMLDDAVTFCCEMLESGHSPNIPTF 238


>ref|XP_002283311.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like [Vitis vinifera]
          Length = 380

 Score =  240 bits (613), Expect = 5e-61
 Identities = 133/235 (56%), Positives = 157/235 (66%), Gaps = 17/235 (7%)
 Frame = +3

Query: 333 SSIDDGLGRSDYPPEPIPNRPLRRQS--------YPYGSPRIPKPNRGREIENQNSFRGE 488
           SS   G G S  PP PIPNRPLR +          P     +PK          + F   
Sbjct: 84  SSSSCGGGSSSNPPNPIPNRPLRGEQRMNRPPPHIPQRKLGLPKDEGVDRASQASPFNQP 143

Query: 489 TDAD---------FLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFK 641
           + A+         FLERFKLG   K E P+ +SA      E+  N     PP++ADEIF+
Sbjct: 144 SPAEKVGATLEDGFLERFKLGVQKK-ERPQ-ESAAAQPSREQDANHGKEQPPQNADEIFR 201

Query: 642 KMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFD 821
           KMKE+GLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVV+YTAVVEGFCKA + +
Sbjct: 202 KMKESGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVIYTAVVEGFCKARQLN 261

Query: 822 DAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIAT 986
           DAVRIF+KMQ+NG+ PN FSY VL+RG++ G RL+ A  F +EMLEAGHSPN+AT
Sbjct: 262 DAVRIFRKMQNNGISPNAFSYTVLIRGMYKGNRLDIAVDFCVEMLEAGHSPNVAT 316


>gb|AFK36371.1| unknown [Lotus japonicus]
          Length = 372

 Score =  240 bits (612), Expect = 7e-61
 Identities = 137/237 (57%), Positives = 162/237 (68%), Gaps = 32/237 (13%)
 Frame = +3

Query: 375 EPIPNRPLR--------RQSYPYGSPRIPKP----NRGR----EIENQNS-----FRGET 491
           EPIPNR LR         + Y  GS R  +P    NRGR    E+ N++S     F+G  
Sbjct: 74  EPIPNRALRGTQPVNPHSREYNRGS-RSSRPRFDGNRGRPDDVEMTNKSSQTDIGFQGRN 132

Query: 492 DAD-----------FLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIF 638
            +D           FL++FKLGFD+K  N    +A    +  K+ N    + PEDADEIF
Sbjct: 133 MSDTNKVVNKLGDSFLDKFKLGFDNKAGNSSEVAASNLSEEAKSANSNQPAMPEDADEIF 192

Query: 639 KKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKF 818
           KKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREKGTIPE+V+YTAVVEG+ KAHK 
Sbjct: 193 KKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREKGTIPEIVIYTAVVEGYTKAHKA 252

Query: 819 DDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           DDA RIF+KMQSNG+ PN FSY VLV+GL    RL+DA+ F +EMLEAGHSPN+ TF
Sbjct: 253 DDAKRIFRKMQSNGISPNAFSYTVLVQGLCKCSRLQDAFEFCVEMLEAGHSPNMTTF 309


>gb|EXB40453.1| hypothetical protein L484_013756 [Morus notabilis]
          Length = 306

 Score =  239 bits (611), Expect = 9e-61
 Identities = 132/214 (61%), Positives = 151/214 (70%)
 Frame = +3

Query: 348 GLGRSDYPPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRGETDADFLERFKLGF 527
           G G SD    P  ++  R +S P       +P RGR          E D+ FLE+FKLG 
Sbjct: 43  GNGESDETTGPSFSQNPRERSRPN------RPPRGR-----GPLTSEDDS-FLEKFKLGL 90

Query: 528 DSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAMLDGLCKDG 707
           DS  +  + +   +     K    +P  PPEDADEIFKKMKETGLIPNAVAMLDGLCKDG
Sbjct: 91  DSSKDGMQ-EKPRREAARPKPPLPQPPPPPEDADEIFKKMKETGLIPNAVAMLDGLCKDG 149

Query: 708 LVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQ 887
           LVQEAMKLFGLM+EKGTIPEVV+YTAVV+GFCKA K DDAVRIF+KMQSNG+ PN FSY 
Sbjct: 150 LVQEAMKLFGLMKEKGTIPEVVIYTAVVDGFCKAQKLDDAVRIFRKMQSNGIEPNAFSYS 209

Query: 888 VLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           VLV+GL  GKRLED   F +EMLEAGHSPN+ATF
Sbjct: 210 VLVQGLCGGKRLEDGLEFCVEMLEAGHSPNVATF 243


>ref|XP_006590678.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like isoform X2 [Glycine max]
          Length = 395

 Score =  238 bits (608), Expect = 2e-60
 Identities = 140/259 (54%), Positives = 164/259 (63%), Gaps = 40/259 (15%)
 Frame = +3

Query: 333 SSIDDGLGRSDYP-PEPIPNRPLRR-----------QSYPYGSPRIP------------- 437
           SS  D  G SD    EPIP+RPLR            Q Y  GS   P             
Sbjct: 77  SSFKDN-GESDQSLSEPIPSRPLRSRKPVNQPPPRFQEYDRGSHSFPPRFYDNHGGPDEL 135

Query: 438 -KPNRGREIE---------NQNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEK 587
            + N+  +I+           N   G++   FL +FKLGFD K  N    +A K  QSE+
Sbjct: 136 DQTNKSSKIDLAFQNTNVAKTNRDAGQSGDSFLNKFKLGFDDKTVNLSEVAASK--QSEE 193

Query: 588 AENMEPLSP-----PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREK 752
           A+   P  P     P+DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREK
Sbjct: 194 AKRSNPNQPAQESMPQDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREK 253

Query: 753 GTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDA 932
           GTIPE+V+YTAVVEG+ KAHK DDA RIF+KMQS+GV PN FSY VL++GL+   RL DA
Sbjct: 254 GTIPEIVIYTAVVEGYTKAHKADDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDA 313

Query: 933 YGFTIEMLEAGHSPNIATF 989
           + F +EMLEAGHSPN+ TF
Sbjct: 314 FEFCVEMLEAGHSPNVTTF 332


>ref|XP_003537572.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like isoform X1 [Glycine max]
          Length = 388

 Score =  238 bits (608), Expect = 2e-60
 Identities = 140/259 (54%), Positives = 164/259 (63%), Gaps = 40/259 (15%)
 Frame = +3

Query: 333 SSIDDGLGRSDYP-PEPIPNRPLRR-----------QSYPYGSPRIP------------- 437
           SS  D  G SD    EPIP+RPLR            Q Y  GS   P             
Sbjct: 70  SSFKDN-GESDQSLSEPIPSRPLRSRKPVNQPPPRFQEYDRGSHSFPPRFYDNHGGPDEL 128

Query: 438 -KPNRGREIE---------NQNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEK 587
            + N+  +I+           N   G++   FL +FKLGFD K  N    +A K  QSE+
Sbjct: 129 DQTNKSSKIDLAFQNTNVAKTNRDAGQSGDSFLNKFKLGFDDKTVNLSEVAASK--QSEE 186

Query: 588 AENMEPLSP-----PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREK 752
           A+   P  P     P+DADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGLMREK
Sbjct: 187 AKRSNPNQPAQESMPQDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLMREK 246

Query: 753 GTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDA 932
           GTIPE+V+YTAVVEG+ KAHK DDA RIF+KMQS+GV PN FSY VL++GL+   RL DA
Sbjct: 247 GTIPEIVIYTAVVEGYTKAHKADDAKRIFRKMQSSGVSPNAFSYMVLIQGLYKCSRLHDA 306

Query: 933 YGFTIEMLEAGHSPNIATF 989
           + F +EMLEAGHSPN+ TF
Sbjct: 307 FEFCVEMLEAGHSPNVTTF 325


>ref|NP_195528.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|79326453|ref|NP_001031806.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|75266764|sp|Q9SZL5.1|PP356_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g38150 gi|4467121|emb|CAB37555.1| putative protein
           [Arabidopsis thaliana] gi|7270799|emb|CAB80480.1|
           putative protein [Arabidopsis thaliana]
           gi|26453272|dbj|BAC43709.1| unknown protein [Arabidopsis
           thaliana] gi|332661484|gb|AEE86884.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
           gi|332661485|gb|AEE86885.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 302

 Score =  235 bits (599), Expect = 2e-59
 Identities = 127/209 (60%), Positives = 151/209 (72%), Gaps = 2/209 (0%)
 Frame = +3

Query: 369 PPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRGETDADFLERFKLGF--DSKVE 542
           PPEP+PNRPLR +     S R P   +   +   ++    +D  FLE+FKLG   DS+ E
Sbjct: 45  PPEPLPNRPLRGERSS-NSHREPPARQAHNLGKSDTTL--SDDGFLEQFKLGVNQDSR-E 100

Query: 543 NPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEA 722
            PK +   +          EPL PPED+DEIFKKMKE GLIPNAVAMLDGLCKDGLVQEA
Sbjct: 101 TPKPEQYPQ----------EPLPPPEDSDEIFKKMKEGGLIPNAVAMLDGLCKDGLVQEA 150

Query: 723 MKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRG 902
           MKLFGLMR+KGTIPEVV+YTAVVE FCKAHK +DA RIF+KMQ+NG+ PN FSY VLV+G
Sbjct: 151 MKLFGLMRDKGTIPEVVIYTAVVEAFCKAHKIEDAKRIFRKMQNNGIAPNAFSYGVLVQG 210

Query: 903 LFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           L++   L+DA  F  EMLE+GHSPN+ TF
Sbjct: 211 LYNCNMLDDAVAFCSEMLESGHSPNVPTF 239


>ref|XP_006394459.1| hypothetical protein EUTSA_v10005467mg [Eutrema salsugineum]
           gi|557091098|gb|ESQ31745.1| hypothetical protein
           EUTSA_v10005467mg [Eutrema salsugineum]
          Length = 295

 Score =  234 bits (597), Expect = 4e-59
 Identities = 126/217 (58%), Positives = 151/217 (69%), Gaps = 1/217 (0%)
 Frame = +3

Query: 342 DDGLGRSDYPPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRGETDADFLERFKL 521
           D+   +   PPEP+PNRPLR +             RG      +     +D DFLE+FKL
Sbjct: 36  DNSQQQQQNPPEPLPNRPLRGE-------------RGSNSARPSQPAKLSDHDFLEQFKL 82

Query: 522 GFDSKVENPKIDSADKSIQSEKAENM-EPLSPPEDADEIFKKMKETGLIPNAVAMLDGLC 698
           G        K D + K+ Q  + E   EPL  PED++EIFK MKE GLIPNAVAMLDGLC
Sbjct: 83  GV-------KQDDSRKTEQKPQQETSPEPLPAPEDSEEIFKNMKEGGLIPNAVAMLDGLC 135

Query: 699 KDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVF 878
           KDGLVQEAMKLFGLMR+KGTIPEVV+YTAVVEGFCKAHK +DA RIF+KMQ+NG++PN F
Sbjct: 136 KDGLVQEAMKLFGLMRDKGTIPEVVIYTAVVEGFCKAHKIEDAKRIFRKMQTNGIVPNAF 195

Query: 879 SYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           SY VLV+GL +   L+DA  F  EMLE+GHSPN++TF
Sbjct: 196 SYGVLVQGLCNCNMLDDAVDFCGEMLESGHSPNVSTF 232


>gb|AAL77701.1| AT4g38150/F20D10_270 [Arabidopsis thaliana]
           gi|23505863|gb|AAN28791.1| At4g38150/F20D10_270
           [Arabidopsis thaliana]
          Length = 302

 Score =  234 bits (596), Expect = 5e-59
 Identities = 126/209 (60%), Positives = 151/209 (72%), Gaps = 2/209 (0%)
 Frame = +3

Query: 369 PPEPIPNRPLRRQSYPYGSPRIPKPNRGREIENQNSFRGETDADFLERFKLGF--DSKVE 542
           PPEP+PNRPLR +     S R P   +   +   ++    +D  FLE+FKLG   DS+ E
Sbjct: 45  PPEPLPNRPLRGERSS-NSHREPPARQAHNLGKSDTTL--SDDGFLEQFKLGVNQDSR-E 100

Query: 543 NPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEA 722
            PK +   +          EPL PPED+DEIFKKMKE GLIPNAVAMLDGLCKDGLVQEA
Sbjct: 101 TPKPEQYPQ----------EPLPPPEDSDEIFKKMKEGGLIPNAVAMLDGLCKDGLVQEA 150

Query: 723 MKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRG 902
           MKLFGLMR+KGTIPEVV+YTAVVE FCKAHK +DA RIF+KMQ+NG+ PN FSY VLV+G
Sbjct: 151 MKLFGLMRDKGTIPEVVIYTAVVEAFCKAHKIEDAKRIFRKMQNNGIAPNAFSYGVLVQG 210

Query: 903 LFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           L++   L+DA  F  +MLE+GHSPN+ TF
Sbjct: 211 LYNCNMLDDAVAFCSDMLESGHSPNVPTF 239


>ref|XP_002514391.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223546488|gb|EEF47987.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 313

 Score =  233 bits (593), Expect = 1e-58
 Identities = 125/220 (56%), Positives = 147/220 (66%), Gaps = 4/220 (1%)
 Frame = +3

Query: 342 DDGLGRSDYPPEPIPNRPLRRQSY----PYGSPRIPKPNRGREIENQNSFRGETDADFLE 509
           DD     + PP PIPNRPLR Q+        SPRIP+ N      NQN    +   DFLE
Sbjct: 39  DDASNVDNSPPHPIPNRPLRGQTSFNQSQSQSPRIPRRNT-----NQNHLSSD---DFLE 90

Query: 510 RFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSPPEDADEIFKKMKETGLIPNAVAMLD 689
           +FKL   +  +       + + + E      P  PP DA++IF KMKETGLIPNAVAMLD
Sbjct: 91  KFKLNKRNHKDEIPHQINNHTSKDENINKSSPPPPPPDANDIFNKMKETGLIPNAVAMLD 150

Query: 690 GLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVEGFCKAHKFDDAVRIFKKMQSNGVIP 869
           GLCKDGLVQEAMKLFGLMR+KGTIPEVVVYTAVV+GFCKAHK DDA RIFKKM  NG+ P
Sbjct: 151 GLCKDGLVQEAMKLFGLMRQKGTIPEVVVYTAVVDGFCKAHKTDDAKRIFKKMIDNGITP 210

Query: 870 NVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSPNIATF 989
           N FSY V ++GL     ++DA  F  +ML+AGHSPN+ TF
Sbjct: 211 NAFSYTVTIQGLCKCNAVDDAVDFCFQMLDAGHSPNVTTF 250


>ref|XP_006573589.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like isoform X3 [Glycine max]
           gi|571435834|ref|XP_006573590.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g38150-like isoform X4 [Glycine max]
          Length = 403

 Score =  231 bits (589), Expect = 3e-58
 Identities = 130/245 (53%), Positives = 161/245 (65%), Gaps = 40/245 (16%)
 Frame = +3

Query: 375 EPIPNRPLRRQ------------------SYP------YGSP-RIPKPNRGREIE----- 464
           EPIP+RPLR +                  S+P      +G P  + K N+  +I+     
Sbjct: 98  EPIPSRPLRGKKPINQPPPRFREYDRGSHSFPPRFDDNHGGPDELDKINKSSQIDLAFQG 157

Query: 465 -----NQNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSP----- 614
                  N   G++   FL++FKLGFD K  N    +A K  QSE+A+   P  P     
Sbjct: 158 TTNVAETNRDVGKSGGSFLDKFKLGFDDKTVNLSEVAASK--QSEEAKRSNPNQPAQESM 215

Query: 615 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVE 794
           P+DA+EIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+V+YTAVVE
Sbjct: 216 PQDANEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVE 275

Query: 795 GFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSP 974
           G+ KAHK DDA RIF+KMQS+G+ PN FSY VL++GL+   RL DA+ F +EMLEAGHSP
Sbjct: 276 GYTKAHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSP 335

Query: 975 NIATF 989
           N+  F
Sbjct: 336 NVTAF 340


>ref|XP_006573588.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like isoform X2 [Glycine max]
          Length = 431

 Score =  231 bits (589), Expect = 3e-58
 Identities = 130/245 (53%), Positives = 161/245 (65%), Gaps = 40/245 (16%)
 Frame = +3

Query: 375 EPIPNRPLRRQ------------------SYP------YGSP-RIPKPNRGREIE----- 464
           EPIP+RPLR +                  S+P      +G P  + K N+  +I+     
Sbjct: 126 EPIPSRPLRGKKPINQPPPRFREYDRGSHSFPPRFDDNHGGPDELDKINKSSQIDLAFQG 185

Query: 465 -----NQNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSP----- 614
                  N   G++   FL++FKLGFD K  N    +A K  QSE+A+   P  P     
Sbjct: 186 TTNVAETNRDVGKSGGSFLDKFKLGFDDKTVNLSEVAASK--QSEEAKRSNPNQPAQESM 243

Query: 615 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVE 794
           P+DA+EIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+V+YTAVVE
Sbjct: 244 PQDANEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVE 303

Query: 795 GFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSP 974
           G+ KAHK DDA RIF+KMQS+G+ PN FSY VL++GL+   RL DA+ F +EMLEAGHSP
Sbjct: 304 GYTKAHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSP 363

Query: 975 NIATF 989
           N+  F
Sbjct: 364 NVTAF 368


>ref|XP_003516576.2| PREDICTED: pentatricopeptide repeat-containing protein
           At4g38150-like isoform X1 [Glycine max]
          Length = 457

 Score =  231 bits (589), Expect = 3e-58
 Identities = 130/245 (53%), Positives = 161/245 (65%), Gaps = 40/245 (16%)
 Frame = +3

Query: 375 EPIPNRPLRRQ------------------SYP------YGSP-RIPKPNRGREIE----- 464
           EPIP+RPLR +                  S+P      +G P  + K N+  +I+     
Sbjct: 152 EPIPSRPLRGKKPINQPPPRFREYDRGSHSFPPRFDDNHGGPDELDKINKSSQIDLAFQG 211

Query: 465 -----NQNSFRGETDADFLERFKLGFDSKVENPKIDSADKSIQSEKAENMEPLSP----- 614
                  N   G++   FL++FKLGFD K  N    +A K  QSE+A+   P  P     
Sbjct: 212 TTNVAETNRDVGKSGGSFLDKFKLGFDDKTVNLSEVAASK--QSEEAKRSNPNQPAQESM 269

Query: 615 PEDADEIFKKMKETGLIPNAVAMLDGLCKDGLVQEAMKLFGLMREKGTIPEVVVYTAVVE 794
           P+DA+EIFKKMKETGLIPNAVAMLDGLCKDGLVQEA+KLFGL+REKGTIPE+V+YTAVVE
Sbjct: 270 PQDANEIFKKMKETGLIPNAVAMLDGLCKDGLVQEALKLFGLIREKGTIPEIVIYTAVVE 329

Query: 795 GFCKAHKFDDAVRIFKKMQSNGVIPNVFSYQVLVRGLFSGKRLEDAYGFTIEMLEAGHSP 974
           G+ KAHK DDA RIF+KMQS+G+ PN FSY VL++GL+   RL DA+ F +EMLEAGHSP
Sbjct: 330 GYTKAHKADDAKRIFRKMQSSGISPNAFSYTVLIQGLYKCNRLHDAFEFCVEMLEAGHSP 389

Query: 975 NIATF 989
           N+  F
Sbjct: 390 NVTAF 394

