BLASTX nr result

ID: Mentha22_contig00026724 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00026724
         (626 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU39754.1| hypothetical protein MIMGU_mgv1a001778mg [Mimulus...   305   6e-81
ref|XP_004250507.1| PREDICTED: pentatricopeptide repeat-containi...   279   6e-73
ref|XP_006353247.1| PREDICTED: pentatricopeptide repeat-containi...   275   6e-72
gb|EPS65453.1| hypothetical protein M569_09325, partial [Genlise...   273   3e-71
ref|XP_004140525.1| PREDICTED: pentatricopeptide repeat-containi...   239   6e-61
ref|XP_007051367.1| Pentatricopeptide repeat-containing protein,...   235   9e-60
ref|XP_002272556.1| PREDICTED: pentatricopeptide repeat-containi...   234   2e-59
ref|XP_006289934.1| hypothetical protein CARUB_v10003556mg [Caps...   233   4e-59
ref|XP_004308750.1| PREDICTED: pentatricopeptide repeat-containi...   233   5e-59
ref|XP_006396354.1| hypothetical protein EUTSA_v10028437mg [Eutr...   228   1e-57
ref|XP_002302689.2| hypothetical protein POPTR_0002s18390g [Popu...   227   2e-57
ref|XP_006386676.1| pentatricopeptide repeat-containing family p...   227   2e-57
gb|AAC62783.1| F11O4.7 [Arabidopsis thaliana]                         227   3e-57
ref|NP_192066.2| pentatricopeptide repeat-containing protein [Ar...   227   3e-57
ref|XP_002874971.1| pentatricopeptide repeat-containing protein ...   226   3e-57
ref|XP_006444679.1| hypothetical protein CICLE_v10023806mg [Citr...   224   1e-56
ref|XP_006491416.1| PREDICTED: pentatricopeptide repeat-containi...   224   2e-56
gb|EXC13626.1| hypothetical protein L484_019583 [Morus notabilis]     222   6e-56
ref|XP_002515124.1| pentatricopeptide repeat-containing protein,...   212   6e-53
ref|XP_007132032.1| hypothetical protein PHAVU_011G061000g [Phas...   200   3e-49

>gb|EYU39754.1| hypothetical protein MIMGU_mgv1a001778mg [Mimulus guttatus]
          Length = 760

 Score =  305 bits (782), Expect = 6e-81
 Identities = 153/209 (73%), Positives = 177/209 (84%), Gaps = 1/209 (0%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           D  ALDS TLK+IL+ FI++GKYDSALEVLD  ER+LI T+  S D+YSPV+VAL+ KNQ
Sbjct: 109 DAAALDSPTLKLILNSFIRSGKYDSALEVLDCVERDLIQTTSLSPDIYSPVIVALIRKNQ 168

Query: 446 ISIALSVFLKLLDSALFAKNGENVVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKL 267
           ISIALS+FLKLLDS+       +  IPDAIACNE+LV LKK+DM+DEF+Q+++ LRKTKL
Sbjct: 169 ISIALSIFLKLLDSS-------SSEIPDAIACNELLVALKKSDMKDEFKQVFAKLRKTKL 221

Query: 266 YPMDRWGYNICIHALGCWGDLNTALALFKEMK-ERNGPFEPDLCTYNSLIHVLCLLGKVK 90
           YP+DR GYNICIH LGCWGDL+T+L LFKEMK E N    PDLCTYNSLIHVLCLLGKVK
Sbjct: 222 YPLDRCGYNICIHTLGCWGDLSTSLNLFKEMKRETNIRLNPDLCTYNSLIHVLCLLGKVK 281

Query: 89  DALIVWEELKASSGYEPDEFTYRIMIQGC 3
           DALIVWEELKASSG+EPD FTYRI+IQGC
Sbjct: 282 DALIVWEELKASSGHEPDAFTYRILIQGC 310


>ref|XP_004250507.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g01570-like [Solanum lycopersicum]
          Length = 819

 Score =  279 bits (713), Expect = 6e-73
 Identities = 128/208 (61%), Positives = 166/208 (79%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           D + L+S+T K++LD F + G +DSALE+L++ E +L  +SC S DVY+ VL+ALV KNQ
Sbjct: 137 DEVLLNSATFKLLLDSFTRTGNFDSALEILEFVEGDLANSSCLSPDVYNSVLIALVQKNQ 196

Query: 446 ISIALSVFLKLLDSALFAKNGENVVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKL 267
           +++ALS+FLKLL++     +G ++ +  AIACNE+LVGLK+ +MR EF+Q++  LR   +
Sbjct: 197 VNLALSIFLKLLET----NDGNSIGVSSAIACNELLVGLKRGNMRAEFKQVFDKLRGGNV 252

Query: 266 YPMDRWGYNICIHALGCWGDLNTALALFKEMKERNGPFEPDLCTYNSLIHVLCLLGKVKD 87
           +P DRWGYNICIHA GCWGDL+ +L+LFKEMKER   F PDLCTYNSLIHVLCLLGKVKD
Sbjct: 253 FPFDRWGYNICIHAFGCWGDLSRSLSLFKEMKERGSCFSPDLCTYNSLIHVLCLLGKVKD 312

Query: 86  ALIVWEELKASSGYEPDEFTYRIMIQGC 3
           A +VWEELK SSG EPD +TYRI+IQGC
Sbjct: 313 AFVVWEELKGSSGLEPDAYTYRIVIQGC 340


>ref|XP_006353247.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g01570-like isoform X1 [Solanum tuberosum]
          Length = 816

 Score =  275 bits (704), Expect = 6e-72
 Identities = 125/208 (60%), Positives = 166/208 (79%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           D + L+++T K++LD F + G +DSALE+L++ E +L  +SC S DVY+ VL+ALV KNQ
Sbjct: 134 DKVLLNAATFKLLLDSFTRTGNFDSALEILEFVEGDLDNSSCLSPDVYNSVLIALVQKNQ 193

Query: 446 ISIALSVFLKLLDSALFAKNGENVVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKL 267
           +++ALS+FLKLL++     +G ++ +  A+ACNE+LVGLK+ +MR EF+Q++  LR   +
Sbjct: 194 VNLALSIFLKLLET----NDGNSIGVSSAVACNELLVGLKRGNMRAEFKQVFDKLRGGNV 249

Query: 266 YPMDRWGYNICIHALGCWGDLNTALALFKEMKERNGPFEPDLCTYNSLIHVLCLLGKVKD 87
           +P DRWGYNICIH  GCWGDL+++L+LFKEMKER   F PDLCTYNSLIHVLCLLGKVKD
Sbjct: 250 FPFDRWGYNICIHTFGCWGDLSSSLSLFKEMKERGSWFSPDLCTYNSLIHVLCLLGKVKD 309

Query: 86  ALIVWEELKASSGYEPDEFTYRIMIQGC 3
           A +VWEELK SSG EPD +TYRI+IQGC
Sbjct: 310 AFVVWEELKGSSGLEPDAYTYRIVIQGC 337


>gb|EPS65453.1| hypothetical protein M569_09325, partial [Genlisea aurea]
          Length = 770

 Score =  273 bits (698), Expect = 3e-71
 Identities = 133/208 (63%), Positives = 160/208 (76%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ LDS TLK IL+G I+A K+D AL+VLDY E++ +     S DVYSPVLVALV K+Q
Sbjct: 97  DGVILDSDTLKRILNGLIRAQKFDYALDVLDYIEKDSVIAGNLSPDVYSPVLVALVRKDQ 156

Query: 446 ISIALSVFLKLLDSALFAKNGENVVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKL 267
           ISIAL VF KLL S           IPDA ACNE+L GLKK  M++EFR++++ LR+T  
Sbjct: 157 ISIALPVFFKLLHSQF------EDYIPDAFACNELLAGLKKKKMKNEFREVFAKLRETAR 210

Query: 266 YPMDRWGYNICIHALGCWGDLNTALALFKEMKERNGPFEPDLCTYNSLIHVLCLLGKVKD 87
           YP DRWGYNICIH+ GCWGDL+TAL+LFKEMK+R G   PDLCTYNSLI V C LG++ D
Sbjct: 211 YPSDRWGYNICIHSFGCWGDLSTALSLFKEMKDRGGSVYPDLCTYNSLIQVFCSLGRLND 270

Query: 86  ALIVWEELKASSGYEPDEFTYRIMIQGC 3
           AL++W+ELK SSGYEPD FTYRI+IQGC
Sbjct: 271 ALVIWKELKNSSGYEPDRFTYRILIQGC 298


>ref|XP_004140525.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g01570-like [Cucumis sativus]
           gi|449523383|ref|XP_004168703.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At4g01570-like [Cucumis sativus]
          Length = 803

 Score =  239 bits (609), Expect = 6e-61
 Identities = 124/219 (56%), Positives = 163/219 (74%), Gaps = 11/219 (5%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+++DS T K++LD FI++GKYD+ALE+LD+ E   +GTS    + Y+ VLVAL+ KNQ
Sbjct: 119 DGVSVDSHTFKVLLDAFIRSGKYDAALEILDHMED--LGTS-LELNTYNSVLVALLRKNQ 175

Query: 446 ISIALSVFLKLLDSALFAKNGENV--------VIPDAIACNEVLVGLKKADMRDEFRQLY 291
           + +ALS+F KLLD      NG  V         +P+++ACNE+LV L+K DMR EF++++
Sbjct: 176 VGLALSIFFKLLDGF---NNGGQVDSAATTFHFLPNSLACNELLVALRKLDMRVEFKKVF 232

Query: 290 SNLRKTKLYPMDRWGYNICIHALGCWGDLNTALALFKEMKER---NGPFEPDLCTYNSLI 120
             LR  + +    +GYNICI+A GCWG L+TAL+LFKEMKE+   +  F PDLCTYNS+I
Sbjct: 233 DKLRAIESFEFSVYGYNICIYAFGCWGYLDTALSLFKEMKEKSLVSESFSPDLCTYNSII 292

Query: 119 HVLCLLGKVKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           HVLCL+GKVKDALIVWEELK  SG+EPD FTYRI+IQGC
Sbjct: 293 HVLCLVGKVKDALIVWEELK-GSGHEPDAFTYRIIIQGC 330


>ref|XP_007051367.1| Pentatricopeptide repeat-containing protein, putative [Theobroma
           cacao] gi|508703628|gb|EOX95524.1| Pentatricopeptide
           repeat-containing protein, putative [Theobroma cacao]
          Length = 807

 Score =  235 bits (599), Expect = 9e-60
 Identities = 121/213 (56%), Positives = 156/213 (73%), Gaps = 5/213 (2%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ +DS T K +LD FI++GK+DSALE+LD+ E    G    +  VY  VLVAL+ K+Q
Sbjct: 117 DGVLVDSDTFKFLLDAFIRSGKFDSALEILDFMEELGAG---LNLRVYDSVLVALIRKDQ 173

Query: 446 ISIALSVFLKLLDSALFAKNGENV--VIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKT 273
           + +ALS+F KLL++     +G +V   +P +IA NE+LV L+KA MR EF+Q++  LR+ 
Sbjct: 174 VGLALSLFFKLLEACNGNDDGNSVDSSLPGSIAINELLVALRKAHMRREFKQVFDILREK 233

Query: 272 KLYPMDRWGYNICIHALGCWGDLNTALALFKEMKERN---GPFEPDLCTYNSLIHVLCLL 102
           + +  D  GYNICIH+ GCWGDL  +L LFKEMKE+    G F PDLCTYNSLI VLCL+
Sbjct: 234 REFEFDTCGYNICIHSFGCWGDLGASLKLFKEMKEKEKSFGSFGPDLCTYNSLIDVLCLV 293

Query: 101 GKVKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           GKVKDAL+VWEELK  SG+EPD FTYRI+IQGC
Sbjct: 294 GKVKDALVVWEELKV-SGHEPDAFTYRILIQGC 325


>ref|XP_002272556.1| PREDICTED: pentatricopeptide repeat-containing protein At4g01570
           [Vitis vinifera]
          Length = 792

 Score =  234 bits (596), Expect = 2e-59
 Identities = 121/211 (57%), Positives = 154/211 (72%), Gaps = 3/211 (1%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ +   T K++LD  I+AGK+DSALE+LD+ E   +GT   ++ VY  VLVAL+ KNQ
Sbjct: 114 DGVVVGQETFKLLLDSLIRAGKFDSALEILDHIEE--LGTG-LNSYVYDSVLVALIRKNQ 170

Query: 446 ISIALSVFLKLLDSALFAKNGENVVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKL 267
           + +AL +F KLL      +    V +P++ ACN++LV L+KADM+ EFR ++  LR  K 
Sbjct: 171 LGLALPLFFKLLGGD---EGQGGVPVPESNACNQLLVALRKADMKIEFRNVFEKLRAKKD 227

Query: 266 YPMDRWGYNICIHALGCWGDLNTALALFKEMKER---NGPFEPDLCTYNSLIHVLCLLGK 96
           + +D  GYNICIHA GCWGDL TAL LFKEMK++   +  F PDLCTYNSLI VLCL+GK
Sbjct: 228 FDLDTQGYNICIHAFGCWGDLGTALNLFKEMKDKSLNSSSFGPDLCTYNSLIRVLCLVGK 287

Query: 95  VKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           VKDALIVWEELK  SG+EPD FTYRI+IQGC
Sbjct: 288 VKDALIVWEELK-GSGHEPDAFTYRILIQGC 317



 Score = 57.0 bits (136), Expect = 5e-06
 Identities = 54/199 (27%), Positives = 90/199 (45%), Gaps = 3/199 (1%)
 Frame = -1

Query: 614  LDSSTLKMI---LDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQI 444
            +DS  + M+   L  F+  GK   A ++ +      +    ++   Y+ ++ A V K   
Sbjct: 580  IDSFDIDMVNTYLSIFLAKGKLSLACKLFEIFSNMGVDPVIYT---YNSMMTAFVKKGYF 636

Query: 443  SIALSVFLKLLDSALFAKNGENVVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKLY 264
            + A  VF ++         GE V  PD    N ++ GL K    D    +   L K   Y
Sbjct: 637  NEAWGVFHEM---------GEKVCPPDIATYNVIIQGLGKMGRADLASAVLDMLMKQGGY 687

Query: 263  PMDRWGYNICIHALGCWGDLNTALALFKEMKERNGPFEPDLCTYNSLIHVLCLLGKVKDA 84
             +D   YN  I+ALG  G ++ A  LF++M  R+    PD+ T+N+LI +    G++K A
Sbjct: 688  -LDIVMYNTLINALGKAGRIDEATKLFEQM--RSSGINPDVVTFNTLIEIHAKAGQLK-A 743

Query: 83   LIVWEELKASSGYEPDEFT 27
               + +L   +G  P+  T
Sbjct: 744  AYKFLKLMLDAGCSPNHVT 762


>ref|XP_006289934.1| hypothetical protein CARUB_v10003556mg [Capsella rubella]
           gi|482558640|gb|EOA22832.1| hypothetical protein
           CARUB_v10003556mg [Capsella rubella]
          Length = 802

 Score =  233 bits (594), Expect = 4e-59
 Identities = 119/216 (55%), Positives = 155/216 (71%), Gaps = 8/216 (3%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ LD +  K++LD  I++GK+DSAL VLDY E   +G  C +  +Y  VLVALV KN+
Sbjct: 121 DGVNLDQTMAKVLLDSLIRSGKFDSALGVLDYMEE--LG-DCLNPGLYDSVLVALVKKNE 177

Query: 446 ISIALSVFLKLLDSALFAKNGENVVI----PDAIACNEVLVGLKKADMRDEFRQLYSNLR 279
           + +ALS+F KLL+++    +G   VI    P  +A NE+LVGL++A MR EF++++  LR
Sbjct: 178 MRLALSIFFKLLEASDNHSDGTGGVIVSYLPGTVAVNELLVGLRRAGMRSEFKRVFEKLR 237

Query: 278 KTKLYPMDRWGYNICIHALGCWGDLNTALALFKEMKERN----GPFEPDLCTYNSLIHVL 111
           + K +  D WGYNICIH  GCWGDL+ AL+LFKEMK ++      F PD+CTYNSLIHVL
Sbjct: 238 EVKRFKFDTWGYNICIHGFGCWGDLDAALSLFKEMKVQSSVSGSSFGPDICTYNSLIHVL 297

Query: 110 CLLGKVKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           CL GK KDALIVW+ELK  SG+EPD  TYRI+IQGC
Sbjct: 298 CLFGKAKDALIVWDELKV-SGHEPDNSTYRILIQGC 332


>ref|XP_004308750.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g01570-like [Fragaria vesca subsp. vesca]
          Length = 789

 Score =  233 bits (593), Expect = 5e-59
 Identities = 118/209 (56%), Positives = 150/209 (71%), Gaps = 1/209 (0%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           D LA+DS T K +LD FI+ GK+D A+E+LD  +      +  +AD+Y+ VLVALV K Q
Sbjct: 111 DSLAVDSGTFKSLLDAFIREGKFDMAIEILDTMQEV---NAELNADMYNSVLVALVRKGQ 167

Query: 446 ISIALSVFLKLLDSALFAKNGENVVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKL 267
           + +A+S+ ++LL+       G    +P  IACNE+LVGL+K DMR EF+Q+Y  LR  + 
Sbjct: 168 LRLAMSILVRLLEG------GSCDQVPSCIACNELLVGLRKGDMRVEFKQVYDKLRGNEW 221

Query: 266 YPMDRWGYNICIHALGCWGDLNTALALFKEMKERNG-PFEPDLCTYNSLIHVLCLLGKVK 90
           + MD WGYNICIHA GCWGDL T+L+LFKEMK+ N     PDL TYNSLIHVLCL+GKV 
Sbjct: 222 FEMDTWGYNICIHAFGCWGDLGTSLSLFKEMKDLNSDSVFPDLSTYNSLIHVLCLVGKVD 281

Query: 89  DALIVWEELKASSGYEPDEFTYRIMIQGC 3
           DA+ VWEELK  SG+EPD  TYRI+IQGC
Sbjct: 282 DAITVWEELKC-SGHEPDAITYRILIQGC 309


>ref|XP_006396354.1| hypothetical protein EUTSA_v10028437mg [Eutrema salsugineum]
           gi|557097371|gb|ESQ37807.1| hypothetical protein
           EUTSA_v10028437mg [Eutrema salsugineum]
          Length = 801

 Score =  228 bits (580), Expect = 1e-57
 Identities = 118/215 (54%), Positives = 155/215 (72%), Gaps = 7/215 (3%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ LD +T K++LD  I++GKYDSAL VLDY E EL G  C +  +Y  VL+ALV KN+
Sbjct: 121 DGVNLDQTTSKLLLDSLIRSGKYDSALGVLDYME-ELGG--CLNPRLYDSVLIALVKKNE 177

Query: 446 ISIALSVFLKLLDSALFAKNGENVVI---PDAIACNEVLVGLKKADMRDEFRQLYSNLRK 276
           + +ALS+F KLL+++        V +   P  +A NE+LVGL+KA+M+ EF+ ++  L+ 
Sbjct: 178 LRLALSIFFKLLEASDNPSETGGVSVSYLPGTVAVNELLVGLRKANMKLEFKGVFDKLKG 237

Query: 275 TKLYPMDRWGYNICIHALGCWGDLNTALALFKEMKERN----GPFEPDLCTYNSLIHVLC 108
            + +  D WGYNICIH  GCWGDL+ AL+LFKEMKE++        PD+CTYNSLIHVLC
Sbjct: 238 MERFKFDTWGYNICIHGFGCWGDLDAALSLFKEMKEQSSISGSCAGPDICTYNSLIHVLC 297

Query: 107 LLGKVKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           L+GK KDALIVW+ELK  SG+EPD  TYRI+IQGC
Sbjct: 298 LVGKAKDALIVWDELKV-SGHEPDNSTYRILIQGC 331


>ref|XP_002302689.2| hypothetical protein POPTR_0002s18390g [Populus trichocarpa]
           gi|550345304|gb|EEE81962.2| hypothetical protein
           POPTR_0002s18390g [Populus trichocarpa]
          Length = 776

 Score =  227 bits (579), Expect = 2e-57
 Identities = 114/214 (53%), Positives = 159/214 (74%), Gaps = 6/214 (2%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ + S T K++LD FI++GK+DSAL++LD+ E   +G++  +  +Y  ++VAL  KNQ
Sbjct: 94  DGVVVGSETFKLLLDAFIRSGKFDSALDILDHMEE--LGSNP-NPHMYDSIIVALAKKNQ 150

Query: 446 ISIALSVFLKLLDSALFAKNGENVV---IPDAIACNEVLVGLKKADMRDEFRQLYSNLRK 276
           + +ALS+  KLL+++    N EN V   +P ++ACN +LV L+  +M+ EF+ +++ LR 
Sbjct: 151 VGLALSIMFKLLEAS--DGNEENAVGVSLPGSVACNALLVALRNGEMKVEFKTVFAKLRG 208

Query: 275 TKLYPMDRWGYNICIHALGCWGDLNTALALFKEMKER---NGPFEPDLCTYNSLIHVLCL 105
              + ++ WGYNICIHA GCWGDL T+L LFKEMKE+   +G  +PDLCTYNSLIHVLCL
Sbjct: 209 KGGFELNTWGYNICIHAFGCWGDLTTSLRLFKEMKEKSLASGSLDPDLCTYNSLIHVLCL 268

Query: 104 LGKVKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
            GKVKDA+IV+EELK  SG+EPD FTYRI+IQGC
Sbjct: 269 AGKVKDAVIVYEELKV-SGHEPDAFTYRILIQGC 301


>ref|XP_006386676.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|550345301|gb|ERP64473.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 776

 Score =  227 bits (579), Expect = 2e-57
 Identities = 114/214 (53%), Positives = 159/214 (74%), Gaps = 6/214 (2%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ + S T K++LD FI++GK+DSAL++LD+ E   +G++  +  +Y  ++VAL  KNQ
Sbjct: 94  DGVVVGSETFKLLLDAFIRSGKFDSALDILDHMEE--LGSNP-NPHMYDSIIVALAKKNQ 150

Query: 446 ISIALSVFLKLLDSALFAKNGENVV---IPDAIACNEVLVGLKKADMRDEFRQLYSNLRK 276
           + +ALS+  KLL+++    N EN V   +P ++ACN +LV L+  +M+ EF+ +++ LR 
Sbjct: 151 VGLALSIMFKLLEAS--DGNEENAVRVSLPGSVACNALLVALRNGEMKVEFKTVFAKLRG 208

Query: 275 TKLYPMDRWGYNICIHALGCWGDLNTALALFKEMKER---NGPFEPDLCTYNSLIHVLCL 105
              + ++ WGYNICIHA GCWGDL T+L LFKEMKE+   +G  +PDLCTYNSLIHVLCL
Sbjct: 209 KVGFKLNTWGYNICIHAFGCWGDLTTSLRLFKEMKEKSLASGSLDPDLCTYNSLIHVLCL 268

Query: 104 LGKVKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
            GKVKDA+IV+EELK  SG+EPD FTYRI+IQGC
Sbjct: 269 AGKVKDAVIVYEELKV-SGHEPDAFTYRILIQGC 301


>gb|AAC62783.1| F11O4.7 [Arabidopsis thaliana]
          Length = 508

 Score =  227 bits (578), Expect = 3e-57
 Identities = 116/218 (53%), Positives = 153/218 (70%), Gaps = 10/218 (4%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ LD +  K++LD  I++GK++SAL VLDY E   +G  C +  VY  VL+ALV K++
Sbjct: 121 DGVNLDQTMAKILLDSLIRSGKFESALGVLDYMEE--LG-DCLNPSVYDSVLIALVKKHE 177

Query: 446 ISIALSVFLKLL---DSALFAKNGENVVI---PDAIACNEVLVGLKKADMRDEFRQLYSN 285
           + +ALS+  KLL   D+      G  +++   P  +A NE+LVGL++ADMR EF++++  
Sbjct: 178 LRLALSILFKLLEASDNHSDDDTGRVIIVSYLPGTVAVNELLVGLRRADMRSEFKRVFEK 237

Query: 284 LRKTKLYPMDRWGYNICIHALGCWGDLNTALALFKEMKERN----GPFEPDLCTYNSLIH 117
           L+  K +  D W YNICIH  GCWGDL+ AL+LFKEMKER+      F PD+CTYNSLIH
Sbjct: 238 LKGMKRFKFDTWSYNICIHGFGCWGDLDAALSLFKEMKERSSVYGSSFGPDICTYNSLIH 297

Query: 116 VLCLLGKVKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           VLCL GK KDALIVW+ELK  SG+EPD  TYRI+IQGC
Sbjct: 298 VLCLFGKAKDALIVWDELKV-SGHEPDNSTYRILIQGC 334


>ref|NP_192066.2| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|75161629|sp|Q8VZE4.1|PP299_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At4g01570 gi|18086402|gb|AAL57659.1| AT4g01570/T15B16_21
           [Arabidopsis thaliana] gi|24797024|gb|AAN64524.1|
           At4g01570/T15B16_21 [Arabidopsis thaliana]
           gi|332656643|gb|AEE82043.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 805

 Score =  227 bits (578), Expect = 3e-57
 Identities = 116/218 (53%), Positives = 153/218 (70%), Gaps = 10/218 (4%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ LD +  K++LD  I++GK++SAL VLDY E   +G  C +  VY  VL+ALV K++
Sbjct: 121 DGVNLDQTMAKILLDSLIRSGKFESALGVLDYMEE--LG-DCLNPSVYDSVLIALVKKHE 177

Query: 446 ISIALSVFLKLL---DSALFAKNGENVVI---PDAIACNEVLVGLKKADMRDEFRQLYSN 285
           + +ALS+  KLL   D+      G  +++   P  +A NE+LVGL++ADMR EF++++  
Sbjct: 178 LRLALSILFKLLEASDNHSDDDTGRVIIVSYLPGTVAVNELLVGLRRADMRSEFKRVFEK 237

Query: 284 LRKTKLYPMDRWGYNICIHALGCWGDLNTALALFKEMKERN----GPFEPDLCTYNSLIH 117
           L+  K +  D W YNICIH  GCWGDL+ AL+LFKEMKER+      F PD+CTYNSLIH
Sbjct: 238 LKGMKRFKFDTWSYNICIHGFGCWGDLDAALSLFKEMKERSSVYGSSFGPDICTYNSLIH 297

Query: 116 VLCLLGKVKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           VLCL GK KDALIVW+ELK  SG+EPD  TYRI+IQGC
Sbjct: 298 VLCLFGKAKDALIVWDELKV-SGHEPDNSTYRILIQGC 334


>ref|XP_002874971.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297320808|gb|EFH51230.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 802

 Score =  226 bits (577), Expect = 3e-57
 Identities = 114/218 (52%), Positives = 153/218 (70%), Gaps = 10/218 (4%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ LD +  K++LD  I++GK++SAL VLDY E   +G  C +  +Y  VL+AL  KN+
Sbjct: 121 DGVNLDQTMAKILLDSLIRSGKFESALGVLDYMEE--LG-DCLNPSLYDSVLIALAKKNE 177

Query: 446 ISIALSVFLKLLDSALFAKNGENV------VIPDAIACNEVLVGLKKADMRDEFRQLYSN 285
           + +ALS+F KLL+++    +G++        +P  +A NE+LVGL++ADMR EF+ ++  
Sbjct: 178 LRLALSIFFKLLEAS--DNHGDDTSGVTVSYLPGRVAVNELLVGLRRADMRSEFKTVFEK 235

Query: 284 LRKTKLYPMDRWGYNICIHALGCWGDLNTALALFKEMKERN----GPFEPDLCTYNSLIH 117
           L+    +  D W YNICIH  GCWGDL+ AL+LFKEMKER+      F PD+CTYNSLIH
Sbjct: 236 LKGMNRFKFDTWSYNICIHGFGCWGDLDAALSLFKEMKERSSVSGSSFAPDICTYNSLIH 295

Query: 116 VLCLLGKVKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           VLCL GK KDALIVW+ELK  SG+EPD  TYRI+IQGC
Sbjct: 296 VLCLFGKAKDALIVWDELKV-SGHEPDNSTYRILIQGC 332


>ref|XP_006444679.1| hypothetical protein CICLE_v10023806mg [Citrus clementina]
           gi|557546941|gb|ESR57919.1| hypothetical protein
           CICLE_v10023806mg [Citrus clementina]
          Length = 619

 Score =  224 bits (572), Expect = 1e-56
 Identities = 116/210 (55%), Positives = 155/210 (73%), Gaps = 2/210 (0%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           D + +DS T K++L+  IK+GK D A+E+LDY E   +GTS  S +VY  VLV+LV K Q
Sbjct: 113 DDVVVDSETFKLLLEACIKSGKIDFAIEILDYMEE--LGTS-LSPNVYDSVLVSLVRKKQ 169

Query: 446 ISIALSVFLKLLDSALFAKNGENVV--IPDAIACNEVLVGLKKADMRDEFRQLYSNLRKT 273
           + +A+S+  KLL++        +VV  +P  +ACNE+LV L+K+D R EF+Q++  L++ 
Sbjct: 170 LGLAMSILFKLLEACNDNTADNSVVESLPGCVACNELLVALRKSDRRSEFKQVFERLKEQ 229

Query: 272 KLYPMDRWGYNICIHALGCWGDLNTALALFKEMKERNGPFEPDLCTYNSLIHVLCLLGKV 93
           K +  D +GYNICIHA GCWGDL+T+L LFKEMKE+     PDL TYNSLI VLC++GKV
Sbjct: 230 KEFEFDIYGYNICIHAFGCWGDLHTSLRLFKEMKEKG--LVPDLHTYNSLIQVLCVVGKV 287

Query: 92  KDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           KDALIVWEELK  SG+EP+EFT+RI+IQGC
Sbjct: 288 KDALIVWEELK-GSGHEPNEFTHRIIIQGC 316


>ref|XP_006491416.1| PREDICTED: pentatricopeptide repeat-containing protein
           At4g01570-like [Citrus sinensis]
          Length = 790

 Score =  224 bits (570), Expect = 2e-56
 Identities = 116/210 (55%), Positives = 155/210 (73%), Gaps = 2/210 (0%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           D + +DS T K++L+  IK+GK D A+E+LDY E   +GTS  S +VY  VLV+LV K Q
Sbjct: 113 DDVVVDSETFKLLLEPCIKSGKIDFAIEILDYMEE--LGTS-LSPNVYDSVLVSLVRKKQ 169

Query: 446 ISIALSVFLKLLDSALFAKNGENVV--IPDAIACNEVLVGLKKADMRDEFRQLYSNLRKT 273
           + +A+S+  KLL++        +VV  +P  +ACNE+LV L+K+D R EF+Q++  L++ 
Sbjct: 170 LGLAMSILFKLLEACNDNTADNSVVESLPGCVACNELLVALRKSDRRSEFKQVFERLKEQ 229

Query: 272 KLYPMDRWGYNICIHALGCWGDLNTALALFKEMKERNGPFEPDLCTYNSLIHVLCLLGKV 93
           K +  D +GYNICIHA GCWGDL+T+L LFKEMKE+     PDL TYNSLI VLC++GKV
Sbjct: 230 KEFEFDIYGYNICIHAFGCWGDLHTSLRLFKEMKEKG--LVPDLHTYNSLIQVLCVVGKV 287

Query: 92  KDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           KDALIVWEELK  SG+EP+EFT+RI+IQGC
Sbjct: 288 KDALIVWEELK-GSGHEPNEFTHRIIIQGC 316


>gb|EXC13626.1| hypothetical protein L484_019583 [Morus notabilis]
          Length = 788

 Score =  222 bits (566), Expect = 6e-56
 Identities = 113/208 (54%), Positives = 153/208 (73%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           +G+ +DS T K +LD FI++GK+D ALE+LD  E   +G +  ++ +Y  VL+ALV K+Q
Sbjct: 114 NGVIIDSWTFKTLLDTFIRSGKFDFALEILDTMEE--LGVT-LNSHMYDSVLIALVRKDQ 170

Query: 446 ISIALSVFLKLLDSALFAKNGENVVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKL 267
           +S ALS+F K+L+ +          +P +I CNE+LV LKK+DMR EF+Q++  +R+ K 
Sbjct: 171 LSFALSIFFKILEDSSH--------VPSSIGCNELLVALKKSDMRVEFKQVFDGIREKKG 222

Query: 266 YPMDRWGYNICIHALGCWGDLNTALALFKEMKERNGPFEPDLCTYNSLIHVLCLLGKVKD 87
           + M+ WGYNICIHA G WGDL T+L+L++EMK   G   PDLCTYNSLIHVLC  GKVKD
Sbjct: 223 FGMNVWGYNICIHAFGFWGDLGTSLSLYREMKVSVG---PDLCTYNSLIHVLCFFGKVKD 279

Query: 86  ALIVWEELKASSGYEPDEFTYRIMIQGC 3
           AL+V+EELK  SG++PD FTYRI+IQGC
Sbjct: 280 ALVVYEELK-GSGHQPDRFTYRILIQGC 306


>ref|XP_002515124.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223545604|gb|EEF47108.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 898

 Score =  212 bits (540), Expect = 6e-53
 Identities = 115/211 (54%), Positives = 148/211 (70%), Gaps = 3/211 (1%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           D   + + T K +LD FI  G +D ALE+LD  E   +GT+  +  +Y  VLVAL  KNQ
Sbjct: 142 DCAIVGTGTFKFLLDTFINLGNFDFALELLDVMEE--LGTN-LNPHMYDSVLVALTRKNQ 198

Query: 446 ISIALSVFLKLLDSALFAKNGENVVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKL 267
           I +ALS+F KLL+++     G  V +P ++ACN +LV L+KADMR EF++++  L K   
Sbjct: 199 IGLALSIFFKLLETSNDIDIG--VSVPGSVACNTLLVALRKADMRVEFKKVFDKL-KGMG 255

Query: 266 YPMDRWGYNICIHALGCWGDLNTALALFKEMKERNGPFE---PDLCTYNSLIHVLCLLGK 96
           + +D WGYNICIHA GCW DL TAL LFKEMKE++  F    PDLCTYNSLI +LC  GK
Sbjct: 256 FELDTWGYNICIHAFGCWSDLGTALRLFKEMKEKSKGFGSCCPDLCTYNSLIRLLCFSGK 315

Query: 95  VKDALIVWEELKASSGYEPDEFTYRIMIQGC 3
           VKDAL+V+EELK  SG+EPD FTYRI+I+GC
Sbjct: 316 VKDALVVYEELKI-SGHEPDAFTYRIIIEGC 345


>ref|XP_007132032.1| hypothetical protein PHAVU_011G061000g [Phaseolus vulgaris]
           gi|561005032|gb|ESW04026.1| hypothetical protein
           PHAVU_011G061000g [Phaseolus vulgaris]
          Length = 718

 Score =  200 bits (509), Expect = 3e-49
 Identities = 102/209 (48%), Positives = 140/209 (66%), Gaps = 1/209 (0%)
 Frame = -1

Query: 626 DGLALDSSTLKMILDGFIKAGKYDSALEVLDYAERELIGTSCFSADVYSPVLVALVMKNQ 447
           DG+ LD  +L ++L  FI +  ++ A ++LDY +   +  +     +Y+ ++ AL+ KNQ
Sbjct: 101 DGVVLDPHSLNLLLHSFIFSSNFNLAFQLLDYVQHLQLDATT----IYNSLIAALLTKNQ 156

Query: 446 ISIALSVFLKLLDSALFAKNGENVVIPDAIACNEVLVGLKKADMRDEFRQLYSNLRKTKL 267
           +S+ALS+F KLLD+          V   +IACN++LV L+KADMR EF+Q+++ LR+ K 
Sbjct: 157 LSLALSIFFKLLDT----------VDSKSIACNQLLVALRKADMRVEFKQVFTKLREKKG 206

Query: 266 YPMDRWGYNICIHALGCWGDLNTALALFKEMK-ERNGPFEPDLCTYNSLIHVLCLLGKVK 90
           +  D WGYN+CIHA GCWGDL +  ALFKEMK +  G   PDLCTYNSLI  LC LGKV 
Sbjct: 207 FSFDTWGYNVCIHAFGCWGDLASCFALFKEMKDDGKGLVSPDLCTYNSLITALCRLGKVD 266

Query: 89  DALIVWEELKASSGYEPDEFTYRIMIQGC 3
           DAL+VWEEL AS  ++PD FTY  +I  C
Sbjct: 267 DALVVWEELNASV-HQPDRFTYTNLIHAC 294


Top