BLASTX nr result

ID: Catharanthus23_contig00029520 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00029520
         (481 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN65709.1| hypothetical protein VITISV_020733 [Vitis vinifera]   183   2e-44
ref|XP_002276456.1| PREDICTED: pentatricopeptide repeat-containi...   182   4e-44
gb|EXB80834.1| hypothetical protein L484_020091 [Morus notabilis]     181   9e-44
gb|EOY14282.1| Tetratricopeptide repeat (TPR)-like superfamily p...   175   5e-42
ref|XP_006493865.1| PREDICTED: pentatricopeptide repeat-containi...   168   8e-40
ref|XP_006428065.1| hypothetical protein CICLE_v10025107mg [Citr...   168   8e-40
ref|XP_006493880.1| PREDICTED: pentatricopeptide repeat-containi...   167   1e-39
ref|XP_006428067.1| hypothetical protein CICLE_v10027134mg [Citr...   167   1e-39
gb|EMJ23246.1| hypothetical protein PRUPE_ppa003088mg [Prunus pe...   167   1e-39
ref|XP_004304848.1| PREDICTED: pentatricopeptide repeat-containi...   166   2e-39
ref|XP_004143574.1| PREDICTED: pentatricopeptide repeat-containi...   161   7e-38
ref|XP_002532043.1| pentatricopeptide repeat-containing protein,...   160   2e-37
ref|XP_003532847.1| PREDICTED: pentatricopeptide repeat-containi...   159   5e-37
gb|ESW31745.1| hypothetical protein PHAVU_002G264000g [Phaseolus...   156   2e-36
gb|EPS59657.1| hypothetical protein M569_15147, partial [Genlise...   149   3e-34
ref|NP_001119013.1| pentatricopeptide repeat-containing protein ...   147   2e-33
emb|CAA16708.1| putatative protein [Arabidopsis thaliana] gi|726...   147   2e-33
ref|XP_006285645.1| hypothetical protein CARUB_v10007100mg [Caps...   146   3e-33
ref|XP_002869996.1| putative protein [Arabidopsis lyrata subsp. ...   142   4e-32
ref|XP_006413996.1| hypothetical protein EUTSA_v10024633mg [Eutr...   140   2e-31

>emb|CAN65709.1| hypothetical protein VITISV_020733 [Vitis vinifera]
          Length = 609

 Score =  183 bits (464), Expect = 2e-44
 Identities = 90/160 (56%), Positives = 117/160 (73%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           DSV+V+ LT    S+KD+ +L S+H  GIK G++ +VSV+NTWIAAYAKCG    AE VF
Sbjct: 154 DSVTVIGLTHSALSLKDLKMLESIHSFGIKIGIDTDVSVSNTWIAAYAKCGEFGLAETVF 213

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           DGID    T VSWNSMI+G A+  +  KA+  +++ML  GF  DL TIL+LL S V+PE 
Sbjct: 214 DGIDKGLKTXVSWNSMIAGYAHFEQCSKAVGFFKKMLXGGFRADLSTILSLLSSCVQPEV 273

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L+ GKLIH HGI +GC+SDI V+NTLI MYS+C +I S++
Sbjct: 274 LFHGKLIHAHGIQVGCDSDIQVINTLISMYSKCGDIGSAR 313



 Score = 86.3 bits (212), Expect = 4e-15
 Identities = 51/155 (32%), Positives = 87/155 (56%), Gaps = 2/155 (1%)
 Frame = +2

Query: 23  LTQLTSSVKDVMLLSS--VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDL 196
           L+ L+S V+  +L     +H  GI+ G + ++ V NT I+ Y+KCG+  +A  +FD  ++
Sbjct: 262 LSLLSSCVQPEVLFHGKLIHAHGIQVGCDSDIQVINTLISMYSKCGDIGSARYLFD--NM 319

Query: 197 CYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGK 376
              T VSW +MI+G A  G+  +AM ++  M   G  PDL TI++L+    +  AL  GK
Sbjct: 320 LGKTRVSWTAMIAGXAEKGDLDEAMTLFSAMEAVGEKPDLVTIISLMSGCGQTGALELGK 379

Query: 377 LIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
            I  +    G   ++ V N LI +Y++C  + +++
Sbjct: 380 WIDTYATANGLKDNLMVCNALIDVYAKCGSMDNAR 414



 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 40/132 (30%), Positives = 65/132 (49%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           VH   +K+  + ++ V  + +  Y KC     A  +F  +      V SWNSMI G A  
Sbjct: 76  VHTHVVKSRFQADLFVQTSVVDMYVKCSQLGFAYNLFSRMPX--RDVASWNSMIXGFAQL 133

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G   + + ++  M + G   D  T++ L  S +  + L   + IH  GI +G ++D+SV 
Sbjct: 134 GFVDRVVSLFCEMGIEGIRADSVTVIGLTHSALSLKDLKMLESIHSFGIKIGIDTDVSVS 193

Query: 431 NTLIYMYSRCAE 466
           NT I  Y++C E
Sbjct: 194 NTWIAAYAKCGE 205


>ref|XP_002276456.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19191,
           mitochondrial [Vitis vinifera]
           gi|296082485|emb|CBI21490.3| unnamed protein product
           [Vitis vinifera]
          Length = 637

 Score =  182 bits (462), Expect = 4e-44
 Identities = 90/160 (56%), Positives = 117/160 (73%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           DSV+V+ LT    S+KD+ +L S+H  GIK G++ +VSV+NTWIAAYAKCG    AE VF
Sbjct: 154 DSVTVIGLTHSALSLKDLKMLESIHSFGIKIGIDTDVSVSNTWIAAYAKCGEFGLAETVF 213

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           DGID    T VSWNSMI+G A+  +  KA+  +++ML  GF  DL TIL+LL S V+PE 
Sbjct: 214 DGIDKGLKTGVSWNSMIAGYAHFEQCSKAVGFFKKMLCGGFRADLSTILSLLSSCVQPEV 273

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L+ GKLIH HGI +GC+SDI V+NTLI MYS+C +I S++
Sbjct: 274 LFHGKLIHAHGIQVGCDSDIQVINTLISMYSKCGDIGSAR 313



 Score = 85.9 bits (211), Expect = 5e-15
 Identities = 51/155 (32%), Positives = 87/155 (56%), Gaps = 2/155 (1%)
 Frame = +2

Query: 23  LTQLTSSVKDVMLLSS--VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDL 196
           L+ L+S V+  +L     +H  GI+ G + ++ V NT I+ Y+KCG+  +A  +FD  ++
Sbjct: 262 LSLLSSCVQPEVLFHGKLIHAHGIQVGCDSDIQVINTLISMYSKCGDIGSARYLFD--NM 319

Query: 197 CYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGK 376
              T VSW +MI+G A  G+  +AM ++  M   G  PDL TI++L+    +  AL  GK
Sbjct: 320 LGKTRVSWTAMIAGYAEKGDLDEAMTLFSAMEAVGEKPDLVTIISLMSGCGQTGALELGK 379

Query: 377 LIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
            I  +    G   ++ V N LI +Y++C  + +++
Sbjct: 380 WIDTYATANGLKDNLMVCNALIDVYAKCGSMDNAR 414



 Score = 63.2 bits (152), Expect = 4e-08
 Identities = 40/132 (30%), Positives = 65/132 (49%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           VH   +K+  + ++ V  + +  Y KC     A  +F  +      V SWNSMI G A  
Sbjct: 76  VHTHVVKSRFQADLFVQTSVVDMYVKCSQLGFAYNLFSRMPK--RDVASWNSMILGFAQL 133

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G   + + ++  M + G   D  T++ L  S +  + L   + IH  GI +G ++D+SV 
Sbjct: 134 GFVDRVVSLFCEMGIEGIRADSVTVIGLTHSALSLKDLKMLESIHSFGIKIGIDTDVSVS 193

Query: 431 NTLIYMYSRCAE 466
           NT I  Y++C E
Sbjct: 194 NTWIAAYAKCGE 205


>gb|EXB80834.1| hypothetical protein L484_020091 [Morus notabilis]
          Length = 609

 Score =  181 bits (459), Expect = 9e-44
 Identities = 90/161 (55%), Positives = 120/161 (74%), Gaps = 1/161 (0%)
 Frame = +2

Query: 2   DSVSVMWLTQ-LTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMV 178
           DS+++M LTQ +TS  K+V LL ++H LGI+ GLE ++SVANTWI+AYAKC +  +A++V
Sbjct: 110 DSITIMGLTQAITSHSKNVELLKAIHSLGIRIGLEADISVANTWISAYAKCSDLDSAKVV 169

Query: 179 FDGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPE 358
           FDGIDL   TVVSWNSMI+  +   +   A+  Y+RMLV G+ PD  TIL LL S  +P+
Sbjct: 170 FDGIDLDVRTVVSWNSMIAAYSNFEKSIDALNCYKRMLVDGYRPDSSTILGLLSSCAQPK 229

Query: 359 ALYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           AL QG  +HCHGI LGC+SDI+V NTLI MYSRC +I +++
Sbjct: 230 ALLQGASVHCHGIQLGCDSDIAVANTLISMYSRCGDIFAAR 270



 Score = 94.0 bits (232), Expect = 2e-17
 Identities = 52/160 (32%), Positives = 94/160 (58%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           DS +++ L    +  K ++  +SVH  GI+ G + +++VANT I+ Y++CG+  AA ++F
Sbjct: 214 DSSTILGLLSSCAQPKALLQGASVHCHGIQLGCDSDIAVANTLISMYSRCGDIFAARLLF 273

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           D +     T V+WN MISG A  G+  +A+E++  M  +G  PDL T+L+++    +  +
Sbjct: 274 DCVSC--RTCVTWNVMISGYADKGDLDEALELFDAMETAGEKPDLVTMLSVISGCSQTGS 331

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L  GK I+ +    G   +  + N LI MY++C  ++ ++
Sbjct: 332 LEVGKWINGYAFSNGLRDNTIICNALIDMYAKCGSMNDAR 371



 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 45/161 (27%), Positives = 80/161 (49%), Gaps = 1/161 (0%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D+ +  ++ +  S + ++     +H   +K+ +  +V V    +  Y KC   + A  VF
Sbjct: 9   DNFTFPFVAKACSKLSNLRFSQIIHTHVVKSPVCSDVFVQTAVVDMYLKCDRLSDAHGVF 68

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVK-PE 358
           + +      V +WNSMI G A  G   +   ++ RM ++G   D  TI+ L  +     +
Sbjct: 69  EKMPA--RDVAAWNSMIVGLAQLGFTNRVFCLFHRMRLAGIQLDSITIMGLTQAITSHSK 126

Query: 359 ALYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
            +   K IH  GI +G  +DISV NT I  Y++C+++ S+K
Sbjct: 127 NVELLKAIHSLGIRIGLEADISVANTWISAYAKCSDLDSAK 167



 Score = 63.5 bits (153), Expect = 3e-08
 Identities = 37/101 (36%), Positives = 53/101 (52%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           ++G     GL  N  + N  I  YAKCG+   A  +F  +     TVVSW ++ISGCA  
Sbjct: 338 INGYAFSNGLRDNTIICNALIDMYAKCGSMNDARKLFYALP--EKTVVSWTTIISGCALN 395

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQG 373
           GE  +A+ ++ RML SG  P+  T L +L +      L +G
Sbjct: 396 GEVKEALNLFNRMLESGLKPNHVTFLAVLQACAHAGLLEKG 436


>gb|EOY14282.1| Tetratricopeptide repeat (TPR)-like superfamily protein, putative
           isoform 1 [Theobroma cacao] gi|508722386|gb|EOY14283.1|
           Tetratricopeptide repeat (TPR)-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
           gi|508722387|gb|EOY14284.1| Tetratricopeptide repeat
           (TPR)-like superfamily protein, putative isoform 1
           [Theobroma cacao]
          Length = 653

 Score =  175 bits (444), Expect = 5e-42
 Identities = 85/160 (53%), Positives = 118/160 (73%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           DSV+V+ L+Q  S  K + L+  +H  GI+ G+  NV+VANTWIA YAKCG+ A+AE VF
Sbjct: 154 DSVTVVGLSQGVSVAKSLELVEGLHSFGIRIGVAPNVTVANTWIAVYAKCGDLASAEKVF 213

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           D ID+   TV+SWNSMI+G A       A ++YQ+MLV G  PD  +I++L+ S V+PEA
Sbjct: 214 DEIDVAVRTVISWNSMIAGYAIFENFLAAFDLYQQMLVDGIRPDASSIVSLISSCVQPEA 273

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L+QGKLIH HG+ LGC+ ++SV+NTLI MYS+C +I+S++
Sbjct: 274 LFQGKLIHSHGMQLGCDLNLSVINTLISMYSKCGDINSAR 313



 Score = 89.0 bits (219), Expect = 6e-16
 Identities = 50/137 (36%), Positives = 78/137 (56%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           +H  G++ G + N+SV NT I+ Y+KCG+  +A  +FD +     T VSW  MISG A  
Sbjct: 280 IHSHGMQLGCDLNLSVINTLISMYSKCGDINSARFLFDCMS--DRTCVSWTVMISGYAEK 337

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G+  +AM ++  M  +G +PDL T+L+L+    +  +L  GK I  +    G   D+ + 
Sbjct: 338 GDMDEAMTLFHSMEKAGETPDLVTVLSLISGCGQTGSLELGKWIDSYAKSRGFKEDVMIC 397

Query: 431 NTLIYMYSRCAEISSSK 481
           N LI MYS+C  I  ++
Sbjct: 398 NALIDMYSKCGGICEAQ 414



 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 35/154 (22%), Positives = 76/154 (49%)
 Frame = +2

Query: 20  WLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLC 199
           ++ +  + + ++    S+H   +K+    ++ +    +  Y KC +   A  VF+ +   
Sbjct: 59  FIAKACAKLSNIKYSQSIHTQIVKSPFGNDIFIQTAMVNVYVKCDHVDYAYKVFERMP-- 116

Query: 200 YLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKL 379
              V +WN+M+ G A  G   K   ++  M  +G  PD  T++ L       ++L   + 
Sbjct: 117 QRDVAAWNAMLIGFARLGFLDKVFSLFGEMRFAGIHPDSVTVVGLSQGVSVAKSLELVEG 176

Query: 380 IHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           +H  GI +G   +++V NT I +Y++C +++S++
Sbjct: 177 LHSFGIRIGVAPNVTVANTWIAVYAKCGDLASAE 210


>ref|XP_006493865.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19191,
           mitochondrial-like isoform X1 [Citrus sinensis]
          Length = 672

 Score =  168 bits (425), Expect = 8e-40
 Identities = 82/160 (51%), Positives = 110/160 (68%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D V+VM LTQ     K + LL SVH  GI  G++ +VSV NTWI+AYAKC +   AE+VF
Sbjct: 154 DFVTVMGLTQAAIHAKHLSLLKSVHSFGIHIGVDADVSVCNTWISAYAKCNDLKMAELVF 213

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
            GI+    TVVSWNS+I GC Y  +   ++  Y+ M+  GF PD+ T+++LL S V PEA
Sbjct: 214 RGIEEGLRTVVSWNSIIGGCTYGDKFDDSLNFYRHMIYDGFRPDVTTVVSLLSSCVCPEA 273

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L QG+L+H HGIH G + D+SV+NTLI MYS+C +I S++
Sbjct: 274 LVQGRLVHSHGIHYGFDLDVSVINTLISMYSKCGDIDSAR 313



 Score = 92.0 bits (227), Expect = 7e-17
 Identities = 52/137 (37%), Positives = 78/137 (56%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           VH  GI  G + +VSV NT I+ Y+KCG+  +A  +FDG  +C  T VSW +MISG A  
Sbjct: 280 VHSHGIHYGFDLDVSVINTLISMYSKCGDIDSARFLFDG--MCDRTRVSWTAMISGYAQK 337

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G+  +A+ ++  M  +G  PDL T+L+++    +  AL  GK    +    G   ++ V 
Sbjct: 338 GDLDEALRLFFVMEAAGEIPDLVTVLSMISGCGQSGALELGKWFDNYACSGGLKDNVMVC 397

Query: 431 NTLIYMYSRCAEISSSK 481
           N LI MYS+C  I  ++
Sbjct: 398 NALIDMYSKCGSIGDAR 414



 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 43/160 (26%), Positives = 82/160 (51%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           ++++  ++ +  + + D +    +HG  +K+    ++ V  T +  YAKC     A  +F
Sbjct: 53  NNLTFPFIAKACAKLSDFLYSQMIHGHIVKSPFWSDIFVQTTMVDMYAKCDRLDCAYKLF 112

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           D +      V SWN+MI G A  G   K + ++  M + G   D  T++ L  + +  + 
Sbjct: 113 DKMP--DRDVASWNAMIVGFAQMGFLEKVLCLFYNMRLVGIQADFVTVMGLTQAAIHAKH 170

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L   K +H  GIH+G ++D+SV NT I  Y++C ++  ++
Sbjct: 171 LSLLKSVHSFGIHIGVDADVSVCNTWISAYAKCNDLKMAE 210



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 31/81 (38%), Positives = 47/81 (58%)
 Frame = +2

Query: 95  GLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYAGEGCKAME 274
           GL+ NV V N  I  Y+KCG+   A  +F  +     TVVSW +MI+GCA  GE  +A++
Sbjct: 389 GLKDNVMVCNALIDMYSKCGSIGDARELFYALP--EKTVVSWTTMIAGCALNGEFVEALD 446

Query: 275 IYQRMLVSGFSPDLGTILNLL 337
           ++ +M+     P+  T L +L
Sbjct: 447 LFHQMMELDLRPNRVTFLAVL 467


>ref|XP_006428065.1| hypothetical protein CICLE_v10025107mg [Citrus clementina]
           gi|568882073|ref|XP_006493866.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19191,
           mitochondrial-like isoform X2 [Citrus sinensis]
           gi|557530055|gb|ESR41305.1| hypothetical protein
           CICLE_v10025107mg [Citrus clementina]
          Length = 653

 Score =  168 bits (425), Expect = 8e-40
 Identities = 82/160 (51%), Positives = 110/160 (68%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D V+VM LTQ     K + LL SVH  GI  G++ +VSV NTWI+AYAKC +   AE+VF
Sbjct: 154 DFVTVMGLTQAAIHAKHLSLLKSVHSFGIHIGVDADVSVCNTWISAYAKCNDLKMAELVF 213

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
            GI+    TVVSWNS+I GC Y  +   ++  Y+ M+  GF PD+ T+++LL S V PEA
Sbjct: 214 RGIEEGLRTVVSWNSIIGGCTYGDKFDDSLNFYRHMIYDGFRPDVTTVVSLLSSCVCPEA 273

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L QG+L+H HGIH G + D+SV+NTLI MYS+C +I S++
Sbjct: 274 LVQGRLVHSHGIHYGFDLDVSVINTLISMYSKCGDIDSAR 313



 Score = 92.0 bits (227), Expect = 7e-17
 Identities = 52/137 (37%), Positives = 78/137 (56%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           VH  GI  G + +VSV NT I+ Y+KCG+  +A  +FDG  +C  T VSW +MISG A  
Sbjct: 280 VHSHGIHYGFDLDVSVINTLISMYSKCGDIDSARFLFDG--MCDRTRVSWTAMISGYAQK 337

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G+  +A+ ++  M  +G  PDL T+L+++    +  AL  GK    +    G   ++ V 
Sbjct: 338 GDLDEALRLFFVMEAAGEIPDLVTVLSMISGCGQSGALELGKWFDNYACSGGLKDNVMVC 397

Query: 431 NTLIYMYSRCAEISSSK 481
           N LI MYS+C  I  ++
Sbjct: 398 NALIDMYSKCGSIGDAR 414



 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 43/160 (26%), Positives = 82/160 (51%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           ++++  ++ +  + + D +    +HG  +K+    ++ V  T +  YAKC     A  +F
Sbjct: 53  NNLTFPFIAKACAKLSDFLYSQMIHGHIVKSPFWSDIFVQTTMVDMYAKCDRLDCAYKLF 112

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           D +      V SWN+MI G A  G   K + ++  M + G   D  T++ L  + +  + 
Sbjct: 113 DKMP--DRDVASWNAMIVGFAQMGFLEKVLCLFYNMRLVGIQADFVTVMGLTQAAIHAKH 170

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L   K +H  GIH+G ++D+SV NT I  Y++C ++  ++
Sbjct: 171 LSLLKSVHSFGIHIGVDADVSVCNTWISAYAKCNDLKMAE 210



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 31/81 (38%), Positives = 47/81 (58%)
 Frame = +2

Query: 95  GLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYAGEGCKAME 274
           GL+ NV V N  I  Y+KCG+   A  +F  +     TVVSW +MI+GCA  GE  +A++
Sbjct: 389 GLKDNVMVCNALIDMYSKCGSIGDARELFYALP--EKTVVSWTTMIAGCALNGEFVEALD 446

Query: 275 IYQRMLVSGFSPDLGTILNLL 337
           ++ +M+     P+  T L +L
Sbjct: 447 LFHQMMELDLRPNRVTFLAVL 467


>ref|XP_006493880.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19191,
           mitochondrial-like, partial [Citrus sinensis]
          Length = 471

 Score =  167 bits (423), Expect = 1e-39
 Identities = 81/160 (50%), Positives = 112/160 (70%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D V+VM LTQ     K + LL SVH  GI  G++ +VSV NTWI++YAKC +   AE+VF
Sbjct: 154 DFVTVMGLTQAAIHAKHLSLLKSVHSFGIHIGVDADVSVCNTWISSYAKCDDLKMAELVF 213

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
            GI+    TVVSWNSM++GC Y  +   ++  Y+ M+ +GF  D+ T+++LL SFV PEA
Sbjct: 214 CGIEERLRTVVSWNSMVAGCTYGDKFDDSLNFYRHMMYNGFRLDVTTVVSLLSSFVCPEA 273

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L QG+L+H HGIH G + D+SV+NTLI MYS+C +I S++
Sbjct: 274 LVQGRLVHSHGIHYGFDLDVSVINTLISMYSKCGDIDSAR 313



 Score = 93.2 bits (230), Expect = 3e-17
 Identities = 53/137 (38%), Positives = 79/137 (57%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           VH  GI  G + +VSV NT I+ Y+KCG+  +A ++FDGI  C  T VSW +MISG A  
Sbjct: 280 VHSHGIHYGFDLDVSVINTLISMYSKCGDIDSARVLFDGI--CDRTRVSWTAMISGYAQK 337

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G+  +A+ ++  M  +G  PDL T+L+++    +  AL  GK    +    G   ++ V 
Sbjct: 338 GDLDEALRLFFVMEAAGEIPDLVTVLSMISGCGQSGALELGKWFDNYACSGGLKDNVMVC 397

Query: 431 NTLIYMYSRCAEISSSK 481
           N LI MYS+C  I  ++
Sbjct: 398 NALIDMYSKCGSIGDAR 414



 Score = 75.9 bits (185), Expect = 5e-12
 Identities = 43/160 (26%), Positives = 82/160 (51%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           ++++  ++ +  + + D +    +HG  +K+    ++ V  T +  YAKC     A  +F
Sbjct: 53  NNLTFPFIAKACAKLSDFLYSQMIHGHIVKSPFWSDIFVQTTMVDMYAKCDRLDCAYKLF 112

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           D +      V SWN+MI G A  G   K + ++  M + G   D  T++ L  + +  + 
Sbjct: 113 DKMP--DRDVASWNAMIVGFAQMGFLEKVLCLFYNMRLVGIQADFVTVMGLTQAAIHAKH 170

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L   K +H  GIH+G ++D+SV NT I  Y++C ++  ++
Sbjct: 171 LSLLKSVHSFGIHIGVDADVSVCNTWISSYAKCDDLKMAE 210



 Score = 55.8 bits (133), Expect = 6e-06
 Identities = 31/81 (38%), Positives = 47/81 (58%)
 Frame = +2

Query: 95  GLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYAGEGCKAME 274
           GL+ NV V N  I  Y+KCG+   A  +F  +     TVVSW +MI+GCA  GE  +A++
Sbjct: 389 GLKDNVMVCNALIDMYSKCGSIGDARELFYALP--EKTVVSWTTMIAGCALNGEFVEALD 446

Query: 275 IYQRMLVSGFSPDLGTILNLL 337
           ++ +M+     P+  T L +L
Sbjct: 447 LFHQMMELDLRPNRVTFLAVL 467


>ref|XP_006428067.1| hypothetical protein CICLE_v10027134mg [Citrus clementina]
           gi|557530057|gb|ESR41307.1| hypothetical protein
           CICLE_v10027134mg [Citrus clementina]
          Length = 641

 Score =  167 bits (423), Expect = 1e-39
 Identities = 81/160 (50%), Positives = 112/160 (70%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D V+VM LTQ     K + LL SVH  GI  G++ +VSV NTWI++YAKC +   AE+VF
Sbjct: 140 DFVTVMGLTQAAIHAKHLSLLKSVHSFGIHIGVDADVSVCNTWISSYAKCDDLKMAELVF 199

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
            GI+    TVVSWNSM++GC Y  +   ++  Y+ M+ +GF  D+ T+++LL SFV PEA
Sbjct: 200 CGIEERLRTVVSWNSMVAGCTYGDKFDDSLNFYRHMMYNGFRLDVTTVVSLLSSFVCPEA 259

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L QG+L+H HGIH G + D+SV+NTLI MYS+C +I S++
Sbjct: 260 LVQGRLVHSHGIHYGFDLDVSVINTLISMYSKCGDIDSAR 299



 Score = 89.0 bits (219), Expect = 6e-16
 Identities = 52/137 (37%), Positives = 77/137 (56%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           VH  GI  G + +VSV NT I+ Y+KCG+  +A ++FDGI  C  T VSW +MISG A  
Sbjct: 266 VHSHGIHYGFDLDVSVINTLISMYSKCGDIDSARVLFDGI--CDRTRVSWTAMISGYAQK 323

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
               +A+ ++  M  +G  PDL T+L+++    +  AL  GK    +    G   ++ V 
Sbjct: 324 RYLDEALRLFFAMEAAGELPDLVTVLSMISGCGQSGALELGKWFDNYACSGGLKDNVLVC 383

Query: 431 NTLIYMYSRCAEISSSK 481
           N LI MYS+C  I  ++
Sbjct: 384 NALIDMYSKCGSIGDAR 400



 Score = 65.1 bits (157), Expect = 9e-09
 Identities = 37/160 (23%), Positives = 77/160 (48%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           ++++  ++ +  + + D++    +HG  +K+               + KC     A  +F
Sbjct: 53  NNLTFPFIAKACAKLSDLIYSQMIHGHIVKS--------------PFVKCDRLDCAYKIF 98

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           D  ++    V SWN+M+ G A  G     + ++  M + G   D  T++ L  + +  + 
Sbjct: 99  D--EMAVRDVASWNAMLVGFAQMGFLENVLRLFYNMRLVGIQADFVTVMGLTQAAIHAKH 156

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L   K +H  GIH+G ++D+SV NT I  Y++C ++  ++
Sbjct: 157 LSLLKSVHSFGIHIGVDADVSVCNTWISSYAKCDDLKMAE 196



 Score = 55.5 bits (132), Expect = 7e-06
 Identities = 31/81 (38%), Positives = 47/81 (58%)
 Frame = +2

Query: 95  GLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYAGEGCKAME 274
           GL+ NV V N  I  Y+KCG+   A  +F  +     TVVSW +MI+GCA  GE  +A++
Sbjct: 375 GLKDNVLVCNALIDMYSKCGSIGDARELFYALP--EKTVVSWTTMIAGCALNGEFVEALD 432

Query: 275 IYQRMLVSGFSPDLGTILNLL 337
           ++ +M+     P+  T L +L
Sbjct: 433 LFHQMMELDLRPNRVTFLAVL 453


>gb|EMJ23246.1| hypothetical protein PRUPE_ppa003088mg [Prunus persica]
          Length = 605

 Score =  167 bits (423), Expect = 1e-39
 Identities = 79/160 (49%), Positives = 117/160 (73%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D+++VM LTQ +   K++ L+ ++H  GI+ G++ +VS+ANTWI+AY+KC + ++A  VF
Sbjct: 110 DTITVMGLTQASLETKNLALVKAIHAFGIQIGIDGDVSMANTWISAYSKCDDLSSAMAVF 169

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           +GI++   TVVSWNSMI+G A   +   A+  Y+ ML  G+ PD+ TI++LL S ++P+ 
Sbjct: 170 NGIEIGARTVVSWNSMIAGYANLEKFLDALNFYKWMLCDGYRPDISTIVSLLSSCIQPDK 229

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L QG LIHCHGI +GC+SDI V+N LI MYSRC +I SS+
Sbjct: 230 LLQGVLIHCHGIQMGCDSDIFVVNALISMYSRCGDILSSR 269



 Score = 89.4 bits (220), Expect = 5e-16
 Identities = 53/152 (34%), Positives = 86/152 (56%), Gaps = 2/152 (1%)
 Frame = +2

Query: 32  LTSSVKDVMLLSSV--HGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYL 205
           L+S ++   LL  V  H  GI+ G + ++ V N  I+ Y++CG+  ++  +FDG+     
Sbjct: 221 LSSCIQPDKLLQGVLIHCHGIQMGCDSDIFVVNALISMYSRCGDILSSRFLFDGMS--NR 278

Query: 206 TVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIH 385
           T VSW +MISG A  G+  +A+ ++  M  +G  PDL T+L+L+    +  AL  GK IH
Sbjct: 279 TCVSWTAMISGYADKGDLNEALRLFHAMEAAGEKPDLVTVLSLVSGCGQTGALELGKWIH 338

Query: 386 CHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
            +    G    I V N LI M+++C  I+S++
Sbjct: 339 NYAFSNGLRDSIVVCNALIDMHAKCGNINSAR 370



 Score = 70.9 bits (172), Expect = 2e-10
 Identities = 41/153 (26%), Positives = 76/153 (49%)
 Frame = +2

Query: 20  WLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLC 199
           +L +  + + ++     +H   +K+    ++ V    +  Y KC   A A ++F+ I + 
Sbjct: 15  FLAKACAKLSNLKFSQIIHTNVLKSPFRSDIFVQTAMLDMYVKCDRLADAYILFERIPM- 73

Query: 200 YLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKL 379
              V  WN M+ G A  G   K + ++  M  +   PD  T++ L  + ++ + L   K 
Sbjct: 74  -RDVACWNVMLMGFAQLGFLDKVLRLFHEMRFARILPDTITVMGLTQASLETKNLALVKA 132

Query: 380 IHCHGIHLGCNSDISVLNTLIYMYSRCAEISSS 478
           IH  GI +G + D+S+ NT I  YS+C ++SS+
Sbjct: 133 IHAFGIQIGIDGDVSMANTWISAYSKCDDLSSA 165


>ref|XP_004304848.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19191,
           mitochondrial-like [Fragaria vesca subsp. vesca]
          Length = 1042

 Score =  166 bits (421), Expect = 2e-39
 Identities = 77/160 (48%), Positives = 116/160 (72%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D+++VM L Q + + K++ L+ S+H  GI+ G+E +VSV NTWI+AY+KC +  +A+ VF
Sbjct: 145 DTITVMGLIQASLTTKNLDLVKSIHAFGIQIGIECDVSVTNTWISAYSKCNDLGSAKAVF 204

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           D ID+   TVVSWNSMI+G + + +    + +Y++ML  G+ PD+ TIL++L S  +P  
Sbjct: 205 DRIDMGMRTVVSWNSMIAGYSNSDKFVDVLNVYKQMLSDGYRPDISTILSILSSCNQPNK 264

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L+QG LIHCHGI LGC+SDI V+N LI MYS+C ++ S++
Sbjct: 265 LFQGVLIHCHGIQLGCDSDIYVINDLISMYSKCGDLPSAR 304



 Score = 95.1 bits (235), Expect = 8e-18
 Identities = 49/137 (35%), Positives = 82/137 (59%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           +H  GI+ G + ++ V N  I+ Y+KCG+  +A  +FDG  +C  T VSW +MISG A  
Sbjct: 271 IHCHGIQLGCDSDIYVINDLISMYSKCGDLPSARFLFDG--MCDRTCVSWTAMISGYAEN 328

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G   +A+E++  M  +G  PDL T+L+L+        L  GK IH + + +G   ++ V 
Sbjct: 329 GNMNEALELFHFMEAAGEKPDLVTVLSLVSGCGHTGTLELGKWIHNYALSIGLRDNVVVC 388

Query: 431 NTLIYMYSRCAEISSSK 481
           N LI MY++C ++++++
Sbjct: 389 NALIDMYAKCGDLNNAR 405



 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 41/154 (26%), Positives = 74/154 (48%)
 Frame = +2

Query: 20  WLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLC 199
           +L +  + + ++ L  ++H   +K+  + ++ V    +  Y KC     A  +F+ I + 
Sbjct: 50  FLAKACAKLSNLKLSQTIHTDVVKSPFQSDIFVQTAILDMYVKCDRLGDAYNLFERIAM- 108

Query: 200 YLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKL 379
                S N M+ G   +G   K   +   M  +G  PD  T++ L+ + +  + L   K 
Sbjct: 109 -KDTASCNVMLMGFVQSGFLDKVFCLVHDMRFAGIQPDTITVMGLIQASLTTKNLDLVKS 167

Query: 380 IHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           IH  GI +G   D+SV NT I  YS+C ++ S+K
Sbjct: 168 IHAFGIQIGIECDVSVTNTWISAYSKCNDLGSAK 201



 Score = 62.8 bits (151), Expect = 5e-08
 Identities = 41/124 (33%), Positives = 62/124 (50%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D V+V+ L         + L   +H   +  GL  NV V N  I  YAKCG+   A  +F
Sbjct: 349 DLVTVLSLVSGCGHTGTLELGKWIHNYALSIGLRDNVVVCNALIDMYAKCGDLNNARELF 408

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
             + +   TVVSW S+IS CA  G+  +A++++  ML +G  P+  T L +L +      
Sbjct: 409 YALPV--RTVVSWTSIISACALNGQSKQALDLFCLMLETGMKPNHLTFLAILQACTHAGL 466

Query: 362 LYQG 373
           L +G
Sbjct: 467 LEEG 470


>ref|XP_004143574.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19191,
           mitochondrial-like [Cucumis sativus]
           gi|449516723|ref|XP_004165396.1| PREDICTED:
           pentatricopeptide repeat-containing protein At4g19191,
           mitochondrial-like [Cucumis sativus]
          Length = 651

 Score =  161 bits (408), Expect = 7e-38
 Identities = 81/159 (50%), Positives = 111/159 (69%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D+ +V+ LT+   S K +  L +VH +GI+TGL+ + SV+NTWIAAY+KCG    A+MVF
Sbjct: 152 DAATVIGLTRAVISAKSLRFLKAVHAIGIETGLDADTSVSNTWIAAYSKCGELQLAKMVF 211

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
            GI     + VSWNS+I+  A+ G+   A++ Y+ +L  GF PD  TI++LL S  +PEA
Sbjct: 212 HGIQKTARSSVSWNSLIACYAHFGKYVDAVKSYKGLLCDGFKPDASTIISLLSSCQQPEA 271

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSS 478
           L  G LIH HG  LGC+SDIS++NTLI MYSRC +ISS+
Sbjct: 272 LIYGFLIHGHGFQLGCDSDISLINTLISMYSRCGDISSA 310



 Score = 85.1 bits (209), Expect = 9e-15
 Identities = 45/137 (32%), Positives = 79/137 (57%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           +HG G + G + ++S+ NT I+ Y++CG+ ++A ++FDG+ +   T VSW +MISG +  
Sbjct: 278 IHGHGFQLGCDSDISLINTLISMYSRCGDISSATILFDGMSI--RTCVSWTAMISGYSEV 335

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G    A+ ++  M  +G  PD+ T+L+L+    K  AL  G  I  +        D+ V 
Sbjct: 336 GRVDDALVLFNAMEETGEKPDIVTVLSLISGCGKTGALGLGHWIDNYASLHELKKDVVVC 395

Query: 431 NTLIYMYSRCAEISSSK 481
           N LI MY++C  ++ ++
Sbjct: 396 NALIDMYAKCGSLNDAR 412



 Score = 72.4 bits (176), Expect = 6e-11
 Identities = 40/137 (29%), Positives = 68/137 (49%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           +H   +K+    ++ V    +  Y KCG    A  +FD + +    + SWN+MI G +  
Sbjct: 74  IHTHVVKSPFYSDIYVQTAMVDMYVKCGKVDDAYNLFDKMPV--RNIASWNAMIIGFSQI 131

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G   +   ++  M + G  PD  T++ L  + +  ++L   K +H  GI  G ++D SV 
Sbjct: 132 GSLDRVFNLFMGMRLVGTRPDAATVIGLTRAVISAKSLRFLKAVHAIGIETGLDADTSVS 191

Query: 431 NTLIYMYSRCAEISSSK 481
           NT I  YS+C E+  +K
Sbjct: 192 NTWIAAYSKCGELQLAK 208


>ref|XP_002532043.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223528286|gb|EEF30333.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 468

 Score =  160 bits (404), Expect = 2e-37
 Identities = 80/160 (50%), Positives = 112/160 (70%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           DSV++M ++   S +KD+ L  SVH  GI+ G+  +VSVANTWI+ YAKC + A AE VF
Sbjct: 138 DSVTLMGVSGAISCMKDLELAKSVHSFGIRIGIHNDVSVANTWISLYAKCYDLAMAESVF 197

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           +GI++   +VVSWNSMI+G AY  +   A+  Y+RML  GF PD+ TI+ LL S ++PEA
Sbjct: 198 NGIEVGLRSVVSWNSMIAGYAYLEKRIDALNSYKRMLHDGFMPDISTIVTLLSSCLQPEA 257

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           + QG  +H HGI  GC+S+I V NTLI MYS+  ++ S++
Sbjct: 258 VRQGMQVHSHGIRFGCDSEIHVANTLISMYSKFGDVYSAR 297



 Score = 83.6 bits (205), Expect = 3e-14
 Identities = 47/136 (34%), Positives = 75/136 (55%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           VH  GI+ G +  + VANT I+ Y+K G+  +A  +FD   +C  + V+W SMISG A  
Sbjct: 264 VHSHGIRFGCDSEIHVANTLISMYSKFGDVYSARCLFDS--MCNRSCVTWTSMISGYAEK 321

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G   +A++++  M  +G  PDL T+L+++    +   L  GK IH +        ++ V 
Sbjct: 322 GNMDEALKLFNAMEAAGEKPDLVTVLSVISGCGQTGILEVGKWIHVYADSNCLKHNVVVC 381

Query: 431 NTLIYMYSRCAEISSS 478
           N LI MY++C  I  +
Sbjct: 382 NALIDMYAKCGSIDDA 397



 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 39/137 (28%), Positives = 69/137 (50%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           +H   IK+    NV V    +    KC     A  VF  + +    V SWN+M+ G A  
Sbjct: 60  IHTHVIKSPFYSNVFVQTALLDMCVKCHQLDIAYNVF--VKMPKRDVTSWNAMLLGFAQL 117

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G   +   +++ M  +G  PD  T++ + G+    + L   K +H  GI +G ++D+SV 
Sbjct: 118 GFSERVFCMFREMRFAGVFPDSVTLMGVSGAISCMKDLELAKSVHSFGIRIGIHNDVSVA 177

Query: 431 NTLIYMYSRCAEISSSK 481
           NT I +Y++C +++ ++
Sbjct: 178 NTWISLYAKCYDLAMAE 194


>ref|XP_003532847.1| PREDICTED: pentatricopeptide repeat-containing protein At4g19191,
           mitochondrial-like [Glycine max]
          Length = 651

 Score =  159 bits (401), Expect = 5e-37
 Identities = 79/160 (49%), Positives = 112/160 (70%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D+V+V+ L      VK +  L +V+  GI+ G+  +VSVANT IAAY+KCGN  +AE +F
Sbjct: 153 DAVTVLLLIDSILRVKSLTSLGAVYSFGIRIGVHMDVSVANTLIAAYSKCGNLCSAETLF 212

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           D I+    +VVSWNSMI+  A   +  KA+  Y+ ML  GFSPD+ TILNLL S ++P+A
Sbjct: 213 DEINSGLRSVVSWNSMIAAYANFEKHVKAVNCYKGMLDGGFSPDISTILNLLSSCMQPKA 272

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L+ G L+H HG+ LGC+SD+ V+NTLI MYS+C ++ S++
Sbjct: 273 LFHGLLVHSHGVKLGCDSDVCVVNTLICMYSKCGDVHSAR 312



 Score = 87.8 bits (216), Expect = 1e-15
 Identities = 51/137 (37%), Positives = 75/137 (54%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           VH  G+K G + +V V NT I  Y+KCG+  +A  +F+G+     T VSW  MIS  A  
Sbjct: 279 VHSHGVKLGCDSDVCVVNTLICMYSKCGDVHSARFLFNGMS--DKTCVSWTVMISAYAEK 336

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G   +AM ++  M  +G  PDL T+L L+    +  AL  GK I  + I+ G   ++ V 
Sbjct: 337 GYMSEAMTLFNAMEAAGEKPDLVTVLALISGCGQTGALELGKWIDNYSINNGLKDNVVVC 396

Query: 431 NTLIYMYSRCAEISSSK 481
           N LI MY++C   + +K
Sbjct: 397 NALIDMYAKCGGFNDAK 413



 Score = 77.4 bits (189), Expect = 2e-12
 Identities = 42/137 (30%), Positives = 74/137 (54%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           +H   +K+  + N+ V    +  Y KCG    A  VF  +++    + SWN+M+ G A +
Sbjct: 75  IHAHVLKSCFQSNIFVQTATVDMYVKCGRLEDAHNVF--VEMPVRDIASWNAMLLGFAQS 132

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G   +   + + M +SG  PD  T+L L+ S ++ ++L     ++  GI +G + D+SV 
Sbjct: 133 GFLDRLSCLLRHMRLSGIRPDAVTVLLLIDSILRVKSLTSLGAVYSFGIRIGVHMDVSVA 192

Query: 431 NTLIYMYSRCAEISSSK 481
           NTLI  YS+C  + S++
Sbjct: 193 NTLIAAYSKCGNLCSAE 209



 Score = 56.6 bits (135), Expect = 3e-06
 Identities = 39/112 (34%), Positives = 55/112 (49%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D V+V+ L         + L   +    I  GL+ NV V N  I  YAKCG    A+ +F
Sbjct: 357 DLVTVLALISGCGQTGALELGKWIDNYSINNGLKDNVVVCNALIDMYAKCGGFNDAKELF 416

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLL 337
               +   TVVSW +MI+ CA  G+   A+E++  ML  G  P+  T L +L
Sbjct: 417 --YTMANRTVVSWTTMITACALNGDVKDALELFFMMLEMGMKPNHITFLAVL 466


>gb|ESW31745.1| hypothetical protein PHAVU_002G264000g [Phaseolus vulgaris]
          Length = 670

 Score =  156 bits (395), Expect = 2e-36
 Identities = 79/160 (49%), Positives = 110/160 (68%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           DSV+V+ L      V+ +  L +VH  GI+ G+  ++SVANT IA YAKCG+  +AEMVF
Sbjct: 154 DSVTVLLLMDAILRVRSLTFLGAVHSFGIRIGIHDDLSVANTLIAGYAKCGDLGSAEMVF 213

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           D ID    +VVSWNSMI+  A   +  KA++ Y+ ML   F PD+ TILNLL S V+P+A
Sbjct: 214 DEIDTGLRSVVSWNSMIAAYAKFEKYVKAVDCYKGMLDGAFCPDISTILNLLSSCVQPKA 273

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L+ G L+H HG+ LGC+SD+ V+NTLI MYS+  ++ S++
Sbjct: 274 LFHGLLVHSHGVKLGCDSDVCVVNTLICMYSKGGDVHSAR 313



 Score = 82.0 bits (201), Expect = 7e-14
 Identities = 52/155 (33%), Positives = 81/155 (52%), Gaps = 2/155 (1%)
 Frame = +2

Query: 23  LTQLTSSVKDVMLLSS--VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDL 196
           L  L+S V+   L     VH  G+K G + +V V NT I  Y+K G+  +A  +FD +  
Sbjct: 262 LNLLSSCVQPKALFHGLLVHSHGVKLGCDSDVCVVNTLICMYSKGGDVHSARFLFDCMS- 320

Query: 197 CYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGK 376
              T VSW  MISG A  G   +A+ ++ +M  +G  PD  T+L L+    +  AL  GK
Sbjct: 321 -DQTCVSWTVMISGYAEKGFMSEALTLFNKMEAAGEKPDSVTVLALISGCGQTGALELGK 379

Query: 377 LIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
            +  + ++ G   ++ V N LI MY++C   + +K
Sbjct: 380 WVDNYSVNKGLKDNVVVCNALIDMYAKCGSFNDAK 414



 Score = 79.0 bits (193), Expect = 6e-13
 Identities = 42/137 (30%), Positives = 76/137 (55%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           +H   +K+  + N+ V    +  Y KCG    A  VF  +++    + SWN+M+ G A++
Sbjct: 76  IHAHVLKSRFQSNIYVQTAMVDMYVKCGQLEDAHNVF--VEMPVRDIASWNAMLLGFAHS 133

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
           G   +   I ++M +SG  PD  T+L L+ + ++  +L     +H  GI +G + D+SV 
Sbjct: 134 GFLDRLSCILRQMRLSGIRPDSVTVLLLMDAILRVRSLTFLGAVHSFGIRIGIHDDLSVA 193

Query: 431 NTLIYMYSRCAEISSSK 481
           NTLI  Y++C ++ S++
Sbjct: 194 NTLIAGYAKCGDLGSAE 210



 Score = 60.5 bits (145), Expect = 2e-07
 Identities = 41/112 (36%), Positives = 58/112 (51%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           DSV+V+ L         + L   V    +  GL+ NV V N  I  YAKCG+   A+ VF
Sbjct: 358 DSVTVLALISGCGQTGALELGKWVDNYSVNKGLKDNVVVCNALIDMYAKCGSFNDAKEVF 417

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLL 337
            G  +   TVVSW +MI+ CA  G+   A++++  ML  G  P+  T L +L
Sbjct: 418 YG--MANRTVVSWTTMITACALNGDVKDALDLFFMMLEIGMKPNHITFLAVL 467


>gb|EPS59657.1| hypothetical protein M569_15147, partial [Genlisea aurea]
          Length = 626

 Score =  149 bits (377), Expect = 3e-34
 Identities = 74/160 (46%), Positives = 108/160 (67%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D+V+V+ L QL + ++   LL++VH  G K G   + SV NT+I+ YAKCG+  +AE VF
Sbjct: 139 DAVTVIALAQLAAVLRSGRLLAAVHCFGTKCGHATDASVVNTYISGYAKCGDLHSAERVF 198

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
               L  L VVSWN+ ISG A +G+  KA+E Y+RML  G+ PDL T++NL  SF +P+ 
Sbjct: 199 HETGLDSLNVVSWNAAISGSASSGQPSKAVEFYRRMLWEGYRPDLSTVINLASSFQQPDY 258

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L  G+ +H  G+ LGC++DI+ LNTLI MYS+C  ++ ++
Sbjct: 259 LPLGRSVHAQGVKLGCDADIAFLNTLISMYSKCGHVAPAR 298



 Score = 74.7 bits (182), Expect = 1e-11
 Identities = 43/142 (30%), Positives = 78/142 (54%), Gaps = 4/142 (2%)
 Frame = +2

Query: 68  SVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAY 247
           SVH  G+K G + +++  NT I+ Y+KCG+ A A  +F+   +   + V+W +MI G + 
Sbjct: 264 SVHAQGVKLGCDADIAFLNTLISMYSKCGHVAPARSIFN--RMVEKSAVTWTAMIGGYSE 321

Query: 248 AGEGCKAMEIYQRMLVS-GFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSD-- 418
           AG+  +A+ ++  M  S G+SPD   +L+L+ +  +  +L  G+ +       G   D  
Sbjct: 322 AGDLDEALSLFDEMAGSGGWSPDPVAVLHLIAACGRAGSLEVGRRLDAFAASTGLKGDGR 381

Query: 419 -ISVLNTLIYMYSRCAEISSSK 481
             +V N L+ MYS+C  +  ++
Sbjct: 382 PPAVCNALMDMYSKCGSVDDAE 403



 Score = 63.2 bits (152), Expect = 4e-08
 Identities = 45/164 (27%), Positives = 79/164 (48%), Gaps = 4/164 (2%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           D V+ ++L +  +          +HG  I+T    +  +  + I AYAKCG P +A  VF
Sbjct: 36  DRVTFLFLAKACAGRTSESAAVVLHGHVIRTPHRSDAYLQTSVIDAYAKCGRPESARKVF 95

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAM--EIYQRMLVSGFSPDLGTILNL--LGSFV 349
           D  ++    + SWN+++   A +G    A    +   M     +PD  T++ L  L + +
Sbjct: 96  D--EMPVRDIASWNAILLCHARSGSSSAAAFPSLLAEMRSERIAPDAVTVIALAQLAAVL 153

Query: 350 KPEALYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           +   L     +HC G   G  +D SV+NT I  Y++C ++ S++
Sbjct: 154 RSGRLLAA--VHCFGTKCGHATDASVVNTYISGYAKCGDLHSAE 195


>ref|NP_001119013.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|223635616|sp|P0C8Q2.1|PP323_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g19191, mitochondrial; Flags: Precursor
           gi|332658758|gb|AEE84158.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 654

 Score =  147 bits (370), Expect = 2e-33
 Identities = 75/160 (46%), Positives = 100/160 (62%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           DSV+VM L Q  S  K + LL ++H +GI+ G++  V+VANTWI+ Y KCG+  +A++VF
Sbjct: 152 DSVTVMTLIQSASFEKSLKLLEAMHAVGIRLGVDVQVTVANTWISTYGKCGDLDSAKLVF 211

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           + ID    TVVSWNSM    +  GE   A  +Y  ML   F PDL T +NL  S   PE 
Sbjct: 212 EAIDRGDRTVVSWNSMFKAYSVFGEAFDAFGLYCLMLREEFKPDLSTFINLAASCQNPET 271

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L QG+LIH H IHLG + DI  +NT I MYS+  +  S++
Sbjct: 272 LTQGRLIHSHAIHLGTDQDIEAINTFISMYSKSEDTCSAR 311



 Score = 80.1 bits (196), Expect = 3e-13
 Identities = 47/138 (34%), Positives = 74/138 (53%), Gaps = 1/138 (0%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           +H   I  G + ++   NT+I+ Y+K  +  +A ++FD   +   T VSW  MISG A  
Sbjct: 278 IHSHAIHLGTDQDIEAINTFISMYSKSEDTCSARLLFD--IMTSRTCVSWTVMISGYAEK 335

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSD-ISV 427
           G+  +A+ ++  M+ SG  PDL T+L+L+    K  +L  GK I       GC  D + +
Sbjct: 336 GDMDEALALFHAMIKSGEKPDLVTLLSLISGCGKFGSLETGKWIDARADIYGCKRDNVMI 395

Query: 428 LNTLIYMYSRCAEISSSK 481
            N LI MYS+C  I  ++
Sbjct: 396 CNALIDMYSKCGSIHEAR 413



 Score = 68.9 bits (167), Expect = 6e-10
 Identities = 42/154 (27%), Positives = 78/154 (50%)
 Frame = +2

Query: 20  WLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLC 199
           ++ +  + + DV     VH   IK+    +V V    +  + KC +   A  VF+ +   
Sbjct: 57  FVAKACARLADVGCCEMVHAHLIKSPFWSDVFVGTATVDMFVKCNSVDYAAKVFERMPER 116

Query: 200 YLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKL 379
             T  +WN+M+SG   +G   KA  +++ M ++  +PD  T++ L+ S    ++L   + 
Sbjct: 117 DAT--TWNAMLSGFCQSGHTDKAFSLFREMRLNEITPDSVTVMTLIQSASFEKSLKLLEA 174

Query: 380 IHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           +H  GI LG +  ++V NT I  Y +C ++ S+K
Sbjct: 175 MHAVGIRLGVDVQVTVANTWISTYGKCGDLDSAK 208


>emb|CAA16708.1| putatative protein [Arabidopsis thaliana] gi|7268714|emb|CAB78921.1|
            putatative protein [Arabidopsis thaliana]
          Length = 1260

 Score =  147 bits (370), Expect = 2e-33
 Identities = 75/160 (46%), Positives = 100/160 (62%)
 Frame = +2

Query: 2    DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
            DSV+VM L Q  S  K + LL ++H +GI+ G++  V+VANTWI+ Y KCG+  +A++VF
Sbjct: 758  DSVTVMTLIQSASFEKSLKLLEAMHAVGIRLGVDVQVTVANTWISTYGKCGDLDSAKLVF 817

Query: 182  DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
            + ID    TVVSWNSM    +  GE   A  +Y  ML   F PDL T +NL  S   PE 
Sbjct: 818  EAIDRGDRTVVSWNSMFKAYSVFGEAFDAFGLYCLMLREEFKPDLSTFINLAASCQNPET 877

Query: 362  LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
            L QG+LIH H IHLG + DI  +NT I MYS+  +  S++
Sbjct: 878  LTQGRLIHSHAIHLGTDQDIEAINTFISMYSKSEDTCSAR 917



 Score = 80.1 bits (196), Expect = 3e-13
 Identities = 47/138 (34%), Positives = 74/138 (53%), Gaps = 1/138 (0%)
 Frame = +2

Query: 71   VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
            +H   I  G + ++   NT+I+ Y+K  +  +A ++FD   +   T VSW  MISG A  
Sbjct: 884  IHSHAIHLGTDQDIEAINTFISMYSKSEDTCSARLLFD--IMTSRTCVSWTVMISGYAEK 941

Query: 251  GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSD-ISV 427
            G+  +A+ ++  M+ SG  PDL T+L+L+    K  +L  GK I       GC  D + +
Sbjct: 942  GDMDEALALFHAMIKSGEKPDLVTLLSLISGCGKFGSLETGKWIDARADIYGCKRDNVMI 1001

Query: 428  LNTLIYMYSRCAEISSSK 481
             N LI MYS+C  I  ++
Sbjct: 1002 CNALIDMYSKCGSIHEAR 1019



 Score = 68.9 bits (167), Expect = 6e-10
 Identities = 42/154 (27%), Positives = 78/154 (50%)
 Frame = +2

Query: 20   WLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLC 199
            ++ +  + + DV     VH   IK+    +V V    +  + KC +   A  VF+ +   
Sbjct: 663  FVAKACARLADVGCCEMVHAHLIKSPFWSDVFVGTATVDMFVKCNSVDYAAKVFERMPER 722

Query: 200  YLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKL 379
              T  +WN+M+SG   +G   KA  +++ M ++  +PD  T++ L+ S    ++L   + 
Sbjct: 723  DAT--TWNAMLSGFCQSGHTDKAFSLFREMRLNEITPDSVTVMTLIQSASFEKSLKLLEA 780

Query: 380  IHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
            +H  GI LG +  ++V NT I  Y +C ++ S+K
Sbjct: 781  MHAVGIRLGVDVQVTVANTWISTYGKCGDLDSAK 814


>ref|XP_006285645.1| hypothetical protein CARUB_v10007100mg [Capsella rubella]
           gi|482554350|gb|EOA18543.1| hypothetical protein
           CARUB_v10007100mg [Capsella rubella]
          Length = 657

 Score =  146 bits (368), Expect = 3e-33
 Identities = 76/160 (47%), Positives = 99/160 (61%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           DSV+V+ L Q  S  K + LL ++H LGI  G+   V+VANTWI+ YAKCG+  +A+ VF
Sbjct: 152 DSVTVLTLIQSASFGKSLKLLKAIHALGIHLGVSVQVTVANTWISTYAKCGDLDSAKSVF 211

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           + ID    TVVSWNSM    +  GE   A  +Y+ ML   F PDLGT +NL  S   P  
Sbjct: 212 EAIDRGDRTVVSWNSMFKAYSVFGEVYDAFGLYRLMLREEFKPDLGTFINLAASGQNPAT 271

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L QG+LIH H IHLG + DI  +NT I MYS+  +  S++
Sbjct: 272 LTQGRLIHSHAIHLGTDQDIEAINTFISMYSKSGDTCSAR 311



 Score = 75.5 bits (184), Expect = 7e-12
 Identities = 45/138 (32%), Positives = 72/138 (52%), Gaps = 1/138 (0%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           +H   I  G + ++   NT+I+ Y+K G+  +A ++FD +   + T VSW  MIS  A  
Sbjct: 278 IHSHAIHLGTDQDIEAINTFISMYSKSGDTCSARLLFDIMP--FRTCVSWTVMISAYAEK 335

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSD-ISV 427
           G   +A+ ++     +G  PDL T+L+L+    K  +L  GK I       GC  D + +
Sbjct: 336 GYMDEALALFHANTKTGEKPDLVTLLSLISGCGKFGSLEIGKWIDARADMYGCKKDNVMI 395

Query: 428 LNTLIYMYSRCAEISSSK 481
            N LI MYS+C  I  ++
Sbjct: 396 CNALIDMYSKCGSIPEAR 413



 Score = 73.2 bits (178), Expect = 3e-11
 Identities = 43/154 (27%), Positives = 79/154 (51%)
 Frame = +2

Query: 20  WLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLC 199
           ++ +  + + D+     V+   +K+    +V V    +  + KC +   A  VF+ +   
Sbjct: 57  FVAKACARLSDISYCEMVNAHVLKSPFWSDVFVGTATVDMFVKCNSLDHAAKVFERMP-- 114

Query: 200 YLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKL 379
              V +WN+M+SG  ++G   KA  +++ M +    PD  T+L L+ S    ++L   K 
Sbjct: 115 ERDVTTWNAMLSGFCHSGHIDKAFSLFREMRLHKIPPDSVTVLTLIQSASFGKSLKLLKA 174

Query: 380 IHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           IH  GIHLG +  ++V NT I  Y++C ++ S+K
Sbjct: 175 IHALGIHLGVSVQVTVANTWISTYAKCGDLDSAK 208


>ref|XP_002869996.1| putative protein [Arabidopsis lyrata subsp. lyrata]
            gi|297315832|gb|EFH46255.1| putative protein [Arabidopsis
            lyrata subsp. lyrata]
          Length = 1251

 Score =  142 bits (359), Expect = 4e-32
 Identities = 73/160 (45%), Positives = 98/160 (61%)
 Frame = +2

Query: 2    DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
            DSV+VM L Q  S  K + LL  +H  GI+ G++   +V+NTWI+AY KCG+  +A++VF
Sbjct: 748  DSVTVMTLIQSASFEKSLKLLKVMHAFGIRLGVDLQATVSNTWISAYGKCGDLDSAKLVF 807

Query: 182  DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
            + ID    TVVSWNS+    A  GE   A   Y+ ML   F PDL T +NL  S   P+ 
Sbjct: 808  EAIDRGDRTVVSWNSVFKAFAVFGEAFDAFGHYRLMLRDEFKPDLSTFINLAASCQNPQT 867

Query: 362  LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
            L QG+LIH H IHLG + DI  +NT I MYS+  +  S++
Sbjct: 868  LTQGRLIHSHAIHLGTDQDIEAINTFISMYSKSGDSCSAR 907



 Score = 80.9 bits (198), Expect = 2e-13
 Identities = 48/138 (34%), Positives = 75/138 (54%), Gaps = 1/138 (0%)
 Frame = +2

Query: 71   VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
            +H   I  G + ++   NT+I+ Y+K G+  +A ++FD +     T VSW  MISG A  
Sbjct: 874  IHSHAIHLGTDQDIEAINTFISMYSKSGDSCSARLLFDIMPS--RTCVSWTVMISGYAEK 931

Query: 251  GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSD-ISV 427
            G+  +A+ ++  M  +G +PDL T+L+L+    K  +L  GK I       GC  D + V
Sbjct: 932  GDMDEALALFHAMAKTGVNPDLVTLLSLISGCGKFGSLEIGKWIDGRADMYGCKKDNVMV 991

Query: 428  LNTLIYMYSRCAEISSSK 481
             N LI MYS+C  I  ++
Sbjct: 992  CNALIDMYSKCGSIDEAR 1009



 Score = 65.5 bits (158), Expect = 7e-09
 Identities = 40/137 (29%), Positives = 69/137 (50%)
 Frame = +2

Query: 71   VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
            VH   IK+    +V V    +  + KC +   A  VF+ + +   T  +WN+M+SG   +
Sbjct: 670  VHTHLIKSPFWSDVFVGTATVDMFVKCDSLDYAAKVFERMPVRDAT--TWNAMLSGFCQS 727

Query: 251  GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSDISVL 430
            G   K   +++ M +    PD  T++ L+ S    ++L   K++H  GI LG +   +V 
Sbjct: 728  GHTDKVFSLFREMRLDEIPPDSVTVMTLIQSASFEKSLKLLKVMHAFGIRLGVDLQATVS 787

Query: 431  NTLIYMYSRCAEISSSK 481
            NT I  Y +C ++ S+K
Sbjct: 788  NTWISAYGKCGDLDSAK 804


>ref|XP_006413996.1| hypothetical protein EUTSA_v10024633mg [Eutrema salsugineum]
           gi|557115166|gb|ESQ55449.1| hypothetical protein
           EUTSA_v10024633mg [Eutrema salsugineum]
          Length = 655

 Score =  140 bits (352), Expect = 2e-31
 Identities = 74/160 (46%), Positives = 97/160 (60%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           DSV+VM L Q  S  K + LL +VH  GI+ G++  V+VANTWI+ YAKC +  +A+ VF
Sbjct: 152 DSVTVMALIQAASFEKSLKLLKAVHAFGIRLGVDVQVTVANTWISTYAKCSDLDSAKSVF 211

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           + I+    TVVSWNSM    A  GE   A   Y+ ML  GF PDL T +NL  S   P+ 
Sbjct: 212 EAIERSDRTVVSWNSMFKAYAVFGEAFVAFGFYRWMLREGFKPDLSTFINLAASCQNPDT 271

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L  G+LIH   I LG + DI V+NT I MYS+  +  S++
Sbjct: 272 LTPGRLIHSQAICLGTDQDIEVINTFISMYSKSGDTYSAR 311



 Score = 84.0 bits (206), Expect = 2e-14
 Identities = 50/138 (36%), Positives = 76/138 (55%), Gaps = 1/138 (0%)
 Frame = +2

Query: 71  VHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVFDGIDLCYLTVVSWNSMISGCAYA 250
           +H   I  G + ++ V NT+I+ Y+K G+  +A ++FDG+     T VSW  MISG A  
Sbjct: 278 IHSQAICLGTDQDIEVINTFISMYSKSGDTYSARLLFDGMPS--RTRVSWTVMISGFAEK 335

Query: 251 GEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEALYQGKLIHCHGIHLGCNSD-ISV 427
           G+  +A+ ++  M  +G  PDL T+L+L+    K   L  GK I       GC  D + V
Sbjct: 336 GDMDEALALFNAMTEAGVKPDLVTLLSLISGCGKFGLLEIGKWIDAQADTYGCKRDNVIV 395

Query: 428 LNTLIYMYSRCAEISSSK 481
            N LI MYS+C  I+ ++
Sbjct: 396 CNALIDMYSKCGSITEAR 413



 Score = 69.7 bits (169), Expect = 4e-10
 Identities = 42/160 (26%), Positives = 82/160 (51%)
 Frame = +2

Query: 2   DSVSVMWLTQLTSSVKDVMLLSSVHGLGIKTGLEFNVSVANTWIAAYAKCGNPAAAEMVF 181
           +S +  ++ +  + + D+     VH   +K+    +V V    +  + KC     A  VF
Sbjct: 51  NSFTFPFVAKACARLADIGYCEMVHAHVLKSPFWSDVFVGTATVDMFVKCDCLDYAAKVF 110

Query: 182 DGIDLCYLTVVSWNSMISGCAYAGEGCKAMEIYQRMLVSGFSPDLGTILNLLGSFVKPEA 361
           + +     T  +WN+M+SG + +G   KA  +++ M +   SPD  T++ L+ +    ++
Sbjct: 111 ERMPERDAT--TWNAMLSGFSQSGHTDKAFSLFREMRLGEISPDSVTVMALIQAASFEKS 168

Query: 362 LYQGKLIHCHGIHLGCNSDISVLNTLIYMYSRCAEISSSK 481
           L   K +H  GI LG +  ++V NT I  Y++C+++ S+K
Sbjct: 169 LKLLKAVHAFGIRLGVDVQVTVANTWISTYAKCSDLDSAK 208

