BLASTX nr result

ID: Mentha26_contig00037766 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha26_contig00037766
         (479 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU45383.1| hypothetical protein MIMGU_mgv1a023657mg [Mimulus...   253   3e-65
gb|EPS69071.1| hypothetical protein M569_05691, partial [Genlise...   213   2e-53
ref|XP_007206426.1| hypothetical protein PRUPE_ppa001946mg [Prun...   209   3e-52
gb|EXC01449.1| hypothetical protein L484_022020 [Morus notabilis]     207   2e-51
ref|XP_006350917.1| PREDICTED: pentatricopeptide repeat-containi...   201   9e-50
ref|XP_004145320.1| PREDICTED: pentatricopeptide repeat-containi...   199   3e-49
emb|CBI15973.3| unnamed protein product [Vitis vinifera]              199   3e-49
ref|XP_002279360.1| PREDICTED: pentatricopeptide repeat-containi...   199   3e-49
emb|CAN72716.1| hypothetical protein VITISV_032470 [Vitis vinifera]   199   3e-49
ref|XP_004241167.1| PREDICTED: pentatricopeptide repeat-containi...   198   7e-49
ref|XP_004295750.1| PREDICTED: pentatricopeptide repeat-containi...   190   2e-46
ref|XP_006480615.1| PREDICTED: pentatricopeptide repeat-containi...   189   3e-46
ref|XP_006428806.1| hypothetical protein CICLE_v10011151mg [Citr...   189   3e-46
ref|XP_007016122.1| Tetratricopeptide repeat (TPR)-like superfam...   186   2e-45
ref|XP_002525630.1| pentatricopeptide repeat-containing protein,...   183   2e-44
gb|AFN53666.1| hypothetical protein [Linum usitatissimum]             181   1e-43
ref|XP_002314110.1| pentatricopeptide repeat-containing family p...   179   3e-43
ref|XP_006293749.1| hypothetical protein CARUB_v10022711mg [Caps...   174   9e-42
ref|XP_006410036.1| hypothetical protein EUTSA_v10016305mg [Eutr...   172   4e-41
ref|XP_002879234.1| predicted protein [Arabidopsis lyrata subsp....   169   4e-40

>gb|EYU45383.1| hypothetical protein MIMGU_mgv1a023657mg [Mimulus guttatus]
          Length = 701

 Score =  253 bits (645), Expect = 3e-65
 Identities = 122/159 (76%), Positives = 136/159 (85%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           HPTV LIEKCSN +QLKQIHAQMLR GLFFDPFSASKL+QS ALSE SS+ YA+KVFDQI
Sbjct: 23  HPTVTLIEKCSNSRQLKQIHAQMLRCGLFFDPFSASKLVQSYALSELSSLHYAYKVFDQI 82

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           PQPNLYSWN LIRA A+  +P++CL +F RLL+   EKPNKFTYPFVIKA  KL D +LG
Sbjct: 83  PQPNLYSWNILIRASASSPQPINCLLMFIRLLHVGGEKPNKFTYPFVIKASAKLDDFQLG 142

Query: 361 KGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           KGIHGM IKEGFSSDL+V NCLI+FY+ECGCLDMA R F
Sbjct: 143 KGIHGMVIKEGFSSDLFVSNCLIYFYSECGCLDMARRVF 181



 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 37/143 (25%), Positives = 64/143 (44%)
 Frame = +1

Query: 49  KQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYA 228
           K IH  +++ G   D F ++ LI     SE   +D A +VF  + + ++ SWN ++   A
Sbjct: 143 KGIHGMVIKEGFSSDLFVSNCLIYF--YSECGCLDMARRVFSSMSERDVVSWNTMVNGLA 200

Query: 229 TRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDL 408
                   +  F R+  +   KPN  T   V+ A  K  D++ G+ +H           L
Sbjct: 201 QNGYVDEAVECFHRMEEEGL-KPNDVTMVGVLSACGKKSDVKFGRWVHSYIETNRIRLSL 259

Query: 409 YVGNCLIHFYAECGCLDMASRAF 477
            + N ++  Y +CG +  A + F
Sbjct: 260 ILCNAILDMYTKCGSMKDAKKLF 282



 Score = 58.9 bits (141), Expect = 7e-07
 Identities = 34/113 (30%), Positives = 57/113 (50%), Gaps = 2/113 (1%)
 Frame = +1

Query: 145 SVDYAHKVFDQI--PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPF 318
           S+  A K+FD++   + ++ +WNALI AY     P   +++F+ L      KP++ T   
Sbjct: 274 SMKDAKKLFDKMLPSKEDIATWNALISAYEQSGNPNEAIAIFNELQLSKASKPDEVTLVS 333

Query: 319 VIKALTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
            + A ++L   E G  IH    KEG   + ++   L+  Y++CG L  A   F
Sbjct: 334 TLSACSQLGATEFGSWIHVYMKKEGMRLNRHLVTALVDMYSKCGDLHKALEIF 386


>gb|EPS69071.1| hypothetical protein M569_05691, partial [Genlisea aurea]
          Length = 726

 Score =  213 bits (542), Expect = 2e-53
 Identities = 102/159 (64%), Positives = 125/159 (78%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           HPTV LI++C++ KQLKQIH QMLR+GL  DPF+ASKLI  SALS+FSS+ YA KVFDQ+
Sbjct: 11  HPTVTLIDRCTSQKQLKQIHCQMLRSGLLDDPFAASKLISLSALSDFSSLAYAQKVFDQM 70

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           P+PNL+SWN L+RAYA+ SRPLH LS+F RLL+   + P+KFTYPF IKA   L DL LG
Sbjct: 71  PRPNLFSWNILVRAYASASRPLHSLSLFIRLLHHSPDPPDKFTYPFAIKACADLSDLRLG 130

Query: 361 KGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +GIHGMA+K   +SD++V N LI FY+EC CL  A R F
Sbjct: 131 RGIHGMAVKGNHASDVFVSNSLIRFYSECRCLVAAYRIF 169



 Score = 66.6 bits (161), Expect = 3e-09
 Identities = 38/109 (34%), Positives = 60/109 (55%), Gaps = 2/109 (1%)
 Frame = +1

Query: 157 AHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRLL--NDCTEKPNKFTYPFVIKA 330
           A  +FD +P  ++ SWNALI AY  R      +++F+ L   N+ TE P+  T    + A
Sbjct: 303 ARDLFDALPTKDITSWNALISAYEQRGNAKEAIAIFNELQQSNNDTE-PDGVTLVSTLSA 361

Query: 331 LTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
            ++L  +ELG  IH    K G S + ++   LI  Y++CG L+ A++ F
Sbjct: 362 CSQLGAIELGTRIHNYVKKRGMSLNCHLVTSLIDMYSKCGDLEKAAQVF 410



 Score = 56.2 bits (134), Expect = 5e-06
 Identities = 41/161 (25%), Positives = 73/161 (45%), Gaps = 8/161 (4%)
 Frame = +1

Query: 19  IEKCSNPKQLKQ---IHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQP 189
           I+ C++   L+    IH   ++     D F ++ LI+    SE   +  A+++F+ +P+ 
Sbjct: 118 IKACADLSDLRLGRGIHGMAVKGNHASDVFVSNSLIRF--YSECRCLVAAYRIFETMPRT 175

Query: 190 --NLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEK---PNKFTYPFVIKALTKLKDLE 354
             ++ SWN++I            + +F R++ +  E+   PN  T   V+       DLE
Sbjct: 176 RRDVVSWNSMINGLVQNKWHDDAMELFHRMVAEEEEEGVEPNGVTMLSVLGICGTKSDLE 235

Query: 355 LGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           LGK  H    K G    L + N ++  Y +CG +  A   F
Sbjct: 236 LGKWAHSYVNKNGMEGSLILDNAILDMYTKCGGMKEAREVF 276


>ref|XP_007206426.1| hypothetical protein PRUPE_ppa001946mg [Prunus persica]
           gi|462402068|gb|EMJ07625.1| hypothetical protein
           PRUPE_ppa001946mg [Prunus persica]
          Length = 738

 Score =  209 bits (532), Expect = 3e-52
 Identities = 95/159 (59%), Positives = 126/159 (79%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           HP ++LI++C++ KQLKQ+HAQMLR G+ FDP+SASKLI +SALS FSS+DYA +VFDQI
Sbjct: 31  HPALSLIDQCTSIKQLKQVHAQMLRTGVLFDPYSASKLITASALSSFSSLDYARQVFDQI 90

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           PQPN+Y+WN LIRAYA+ S P   + VF  +L+ C+E P+K+TYPF IKA ++L+ L++G
Sbjct: 91  PQPNVYTWNTLIRAYASSSDPAESILVFLDMLDHCSECPDKYTYPFAIKAASELRALQVG 150

Query: 361 KGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +G HGMAIK    SD+Y+ N L+HFY  CG LD+A R F
Sbjct: 151 RGFHGMAIKASLGSDIYILNSLVHFYGSCGDLDLARRVF 189



 Score = 65.5 bits (158), Expect = 8e-09
 Identities = 38/140 (27%), Positives = 66/140 (47%)
 Frame = +1

Query: 58  HAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRS 237
           H   ++A L  D +  + L+          +D A +VF + P+ ++ SWN++I  +A  +
Sbjct: 154 HGMAIKASLGSDIYILNSLVHF--YGSCGDLDLARRVFMKTPKKDVVSWNSMITVFAQGN 211

Query: 238 RPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVG 417
            P   L +F  +  +   KPN  T   V+ A  K  DLE G+ +     +     +L + 
Sbjct: 212 CPQEALELFKEMEAE-NVKPNDVTMVSVLSACAKKVDLEFGRWVCSHIQRNEIKENLTLN 270

Query: 418 NCLIHFYAECGCLDMASRAF 477
           N ++  Y +CG +D A R F
Sbjct: 271 NAMLDMYVKCGSVDDAKRLF 290



 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 34/115 (29%), Positives = 59/115 (51%)
 Frame = +1

Query: 133 SEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTY 312
           ++  + + A +VF  +P  ++ +WN LI +Y    +P   L+VF+ L    + KP++ T 
Sbjct: 309 AQLGNYEEAWRVFAAMPSQDIAAWNVLISSYEQSGKPKEALAVFNELQKSKSPKPDEVTL 368

Query: 313 PFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
              + A  +L  ++LG  IH    K+    + ++   LI  YA+CG LD A   F
Sbjct: 369 VSTLAACAQLGAIDLGGWIHVYIKKQVMKLNCHLTTSLIDMYAKCGDLDKALEVF 423


>gb|EXC01449.1| hypothetical protein L484_022020 [Morus notabilis]
          Length = 739

 Score =  207 bits (526), Expect = 2e-51
 Identities = 94/159 (59%), Positives = 124/159 (77%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           +P ++LIE+C++ K+LKQIHAQMLR GLFFDPFSASKLI   A+S FSS+DYAH+VFDQI
Sbjct: 32  YPLLSLIEQCTSLKELKQIHAQMLRTGLFFDPFSASKLITVCAMSSFSSLDYAHQVFDQI 91

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           P+PNLY+WN +IRAYA+ S P+  + VF R+L+ C E PNK+TYPFV+KA ++LK   +G
Sbjct: 92  PKPNLYTWNTIIRAYASSSDPIQSIVVFLRMLDQCCESPNKYTYPFVLKAASELKASRVG 151

Query: 361 KGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +G HGM +K   +SD+++ N L+HFY  C  LD A R F
Sbjct: 152 RGFHGMVMKSSLASDVFILNSLVHFYGSCDDLDSAYRVF 190



 Score = 65.5 bits (158), Expect = 8e-09
 Identities = 34/109 (31%), Positives = 58/109 (53%)
 Frame = +1

Query: 151 DYAHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKA 330
           D A +VF+ +P  ++ +WN LI +Y     P   LSVF +L    + KP++ T    + A
Sbjct: 316 DEALRVFEAMPNQDIAAWNVLISSYEQNGMPKEALSVFHKLQVSKSAKPDEVTLVSSLSA 375

Query: 331 LTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
            ++L  ++ G+ IH    ++G   + ++   LI  YA+CG L+ A   F
Sbjct: 376 CSQLGSIDPGRWIHIYIKRQGIKLNCHLTTSLIDMYAKCGDLEKALEVF 424



 Score = 62.8 bits (151), Expect = 5e-08
 Identities = 36/140 (25%), Positives = 68/140 (48%)
 Frame = +1

Query: 58  HAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRS 237
           H  ++++ L  D F  + L+      +   +D A++VF  IP  ++ SWN++I+A+    
Sbjct: 155 HGMVMKSSLASDVFILNSLVHFYGSCD--DLDSAYRVFLNIPSKDVVSWNSMIKAFVEGD 212

Query: 238 RPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVG 417
            P     +F  +  +   KPN  T   V+ A  K  D+E G+ +     + G + +L + 
Sbjct: 213 CPDEAFQLFREMEME-NLKPNDITMVGVLCACGKKADIEFGRWLCSYIQRNGIAVNLTLN 271

Query: 418 NCLIHFYAECGCLDMASRAF 477
           N ++  Y +CG ++ A   F
Sbjct: 272 NAMLDMYVKCGSVEDAKELF 291


>ref|XP_006350917.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
           chloroplastic-like [Solanum tuberosum]
          Length = 744

 Score =  201 bits (511), Expect = 9e-50
 Identities = 97/160 (60%), Positives = 121/160 (75%), Gaps = 1/160 (0%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           HP V LI+KC + KQLKQIHA MLR GLF DPFSASKLI++S+LS FSS+DYAHKVFD+I
Sbjct: 36  HPLVLLIDKCQSIKQLKQIHAYMLRIGLFSDPFSASKLIEASSLSHFSSLDYAHKVFDEI 95

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           PQPNL+SWNALIRAY++   P+  + +F  ++ +  E P+KFTYPFV KA  K+K L  G
Sbjct: 96  PQPNLFSWNALIRAYSSSQDPIQSILMFVNMICEGREFPSKFTYPFVFKASAKMKALRFG 155

Query: 361 KGIHGMAIK-EGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +G+HGM +K      D++V N LIHFYA+CGCLD A   F
Sbjct: 156 RGLHGMVVKGRDVGLDIFVLNSLIHFYADCGCLDEAYLVF 195



 Score = 61.2 bits (147), Expect = 1e-07
 Identities = 32/107 (29%), Positives = 55/107 (51%)
 Frame = +1

Query: 157 AHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALT 336
           A  + + +P  ++ +WNALI AY    +P   LSVF+ L      +P++ T    + A  
Sbjct: 323 ARSILNTMPSQDIAAWNALISAYEQSGKPKEALSVFNELQLIKKAEPDEVTLVCALSACA 382

Query: 337 KLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +L  ++LG  IH    K+G   + ++   LI  Y++CG ++ A   F
Sbjct: 383 QLGAIDLGGWIHVYIKKQGIKLNCHLTTALIDMYSKCGDVEKALEMF 429



 Score = 59.7 bits (143), Expect = 4e-07
 Identities = 37/129 (28%), Positives = 61/129 (47%)
 Frame = +1

Query: 91  DPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSR 270
           D F  + LI   A  +   +D A+ VF+ +   ++ SWN +I  +A        L +F R
Sbjct: 171 DIFVLNSLIHFYA--DCGCLDEAYLVFENMQTRDVVSWNTMILGFAEGGYADEALKMFHR 228

Query: 271 LLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECG 450
           +  +   +PN  T   V+ A  K  DLE G+ +H    + G    L + N ++  Y +CG
Sbjct: 229 MGEE-NVRPNGVTMMAVLSACGKKLDLEFGRWVHVFIKRNGIRESLILDNAILDMYMKCG 287

Query: 451 CLDMASRAF 477
            ++ A R F
Sbjct: 288 SIEDAERLF 296


>ref|XP_004145320.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
           chloroplastic-like [Cucumis sativus]
           gi|449470513|ref|XP_004152961.1| PREDICTED:
           pentatricopeptide repeat-containing protein At2g29760,
           chloroplastic-like [Cucumis sativus]
           gi|449523079|ref|XP_004168552.1| PREDICTED:
           pentatricopeptide repeat-containing protein At2g29760,
           chloroplastic-like [Cucumis sativus]
          Length = 733

 Score =  199 bits (506), Expect = 3e-49
 Identities = 94/159 (59%), Positives = 117/159 (73%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           H  ++ I+KCS+ KQLK++HA+MLR GLFFDPFSASKL  +SALS FS++DYA  +FDQI
Sbjct: 26  HQILSTIDKCSSSKQLKEVHARMLRTGLFFDPFSASKLFTASALSSFSTLDYARNLFDQI 85

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           PQPNLY+WN LIRAYA+ S P     +F  LL+ C + PNKFT+PFVIKA ++LK   +G
Sbjct: 86  PQPNLYTWNTLIRAYASSSDPFQSFVIFLDLLDKCEDLPNKFTFPFVIKAASELKASRVG 145

Query: 361 KGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
             +HGMAIK  F  DLY+ N L+ FY  CG L MA R F
Sbjct: 146 TAVHGMAIKLSFGMDLYILNSLVRFYGACGDLSMAERLF 184



 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 40/151 (26%), Positives = 70/151 (46%)
 Frame = +1

Query: 25  KCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSW 204
           KC +    +++  +M    +F      S  I     ++    D A  VF+ +P   + +W
Sbjct: 274 KCGSVDDAQKLFDEMPERDVF------SWTIMLDGYAKMGDYDAARLVFNAMPVKEIAAW 327

Query: 205 NALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAI 384
           N LI AY    +P   L++F+ L      KP++ T    + A  +L  ++LG  IH    
Sbjct: 328 NVLISAYEQNGKPKEALAIFNELQLSKIAKPDEVTLVSTLSACAQLGAIDLGGWIHVYIK 387

Query: 385 KEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +EG   + ++ + L+  YA+CG L+ A   F
Sbjct: 388 REGIVLNCHLISSLVDMYAKCGSLEKALEVF 418



 Score = 59.3 bits (142), Expect = 5e-07
 Identities = 35/141 (24%), Positives = 66/141 (46%)
 Frame = +1

Query: 55  IHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATR 234
           +H   ++     D +  + L++         +  A ++F  I   ++ SWN++I A+A  
Sbjct: 148 VHGMAIKLSFGMDLYILNSLVRFYGAC--GDLSMAERLFKGISCKDVVSWNSMISAFAQG 205

Query: 235 SRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYV 414
           + P   L +F ++  +    PN  T   V+ A  K  DLE G+ +     ++G   DL +
Sbjct: 206 NCPEDALELFLKMERE-NVMPNSVTMVGVLSACAKKLDLEFGRWVCSYIERKGIKVDLTL 264

Query: 415 GNCLIHFYAECGCLDMASRAF 477
            N ++  Y +CG +D A + F
Sbjct: 265 CNAMLDMYTKCGSVDDAQKLF 285


>emb|CBI15973.3| unnamed protein product [Vitis vinifera]
          Length = 652

 Score =  199 bits (506), Expect = 3e-49
 Identities = 95/159 (59%), Positives = 122/159 (76%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           HPT++LI++CS  KQLKQIHAQMLR GLFFDPFSAS+LI ++ALS F S+DYA +VFDQI
Sbjct: 36  HPTLSLIDQCSETKQLKQIHAQMLRTGLFFDPFSASRLITAAALSPFPSLDYAQQVFDQI 95

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           P PNLY+WN LIRAYA+ S P   L +F R+L+   + P+KFT+PF+IKA ++L++L  G
Sbjct: 96  PHPNLYTWNTLIRAYASSSNPHQSLLIFLRMLHQSPDFPDKFTFPFLIKAASELEELFTG 155

Query: 361 KGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           K  HGM IK    SD+++ N LIHFYA+CG L +  R F
Sbjct: 156 KAFHGMVIKVLLGSDVFILNSLIHFYAKCGELGLGYRVF 194


>ref|XP_002279360.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
           chloroplastic-like [Vitis vinifera]
          Length = 743

 Score =  199 bits (506), Expect = 3e-49
 Identities = 95/159 (59%), Positives = 122/159 (76%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           HPT++LI++CS  KQLKQIHAQMLR GLFFDPFSAS+LI ++ALS F S+DYA +VFDQI
Sbjct: 36  HPTLSLIDQCSETKQLKQIHAQMLRTGLFFDPFSASRLITAAALSPFPSLDYAQQVFDQI 95

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           P PNLY+WN LIRAYA+ S P   L +F R+L+   + P+KFT+PF+IKA ++L++L  G
Sbjct: 96  PHPNLYTWNTLIRAYASSSNPHQSLLIFLRMLHQSPDFPDKFTFPFLIKAASELEELFTG 155

Query: 361 KGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           K  HGM IK    SD+++ N LIHFYA+CG L +  R F
Sbjct: 156 KAFHGMVIKVLLGSDVFILNSLIHFYAKCGELGLGYRVF 194



 Score = 67.4 bits (163), Expect = 2e-09
 Identities = 43/157 (27%), Positives = 72/157 (45%), Gaps = 3/157 (1%)
 Frame = +1

Query: 16  LIEKCSNPKQL---KQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQ 186
           LI+  S  ++L   K  H  +++  L  D F  + LI   A  +   +   ++VF  IP+
Sbjct: 142 LIKAASELEELFTGKAFHGMVIKVLLGSDVFILNSLIHFYA--KCGELGLGYRVFVNIPR 199

Query: 187 PNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKG 366
            ++ SWN++I A+     P   L +F  +      KPN  T   V+ A  K  D E G+ 
Sbjct: 200 RDVVSWNSMITAFVQGGCPEEALELFQEMETQ-NVKPNGITMVGVLSACAKKSDFEFGRW 258

Query: 367 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +H    +      L + N ++  Y +CG ++ A R F
Sbjct: 259 VHSYIERNRIGESLTLSNAMLDMYTKCGSVEDAKRLF 295



 Score = 67.0 bits (162), Expect = 3e-09
 Identities = 36/115 (31%), Positives = 56/115 (48%)
 Frame = +1

Query: 133 SEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTY 312
           ++    D A  +FD +P  ++ +WNALI AY    +P   L +F  L    T KP++ T 
Sbjct: 314 AKIGEYDAAQGIFDAMPNQDIAAWNALISAYEQCGKPKEALELFHELQLSKTAKPDEVTL 373

Query: 313 PFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
              + A  +L  ++LG  IH    K+G   + ++   LI  Y +CG L  A   F
Sbjct: 374 VSTLSACAQLGAMDLGGWIHVYIKKQGMKLNCHLTTSLIDMYCKCGDLQKALMVF 428


>emb|CAN72716.1| hypothetical protein VITISV_032470 [Vitis vinifera]
          Length = 694

 Score =  199 bits (506), Expect = 3e-49
 Identities = 95/159 (59%), Positives = 122/159 (76%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           HPT++LI++CS  KQLKQIHAQMLR GLFFDPFSAS+LI ++ALS F S+DYA +VFDQI
Sbjct: 36  HPTLSLIDQCSETKQLKQIHAQMLRTGLFFDPFSASRLITAAALSPFPSLDYAQQVFDQI 95

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           P PNLY+WN LIRAYA+ S P   L +F R+L+   + P+KFT+PF+IKA ++L++L  G
Sbjct: 96  PHPNLYTWNTLIRAYASSSNPHQSLLIFLRMLHQSPDFPDKFTFPFLIKAASELEELFTG 155

Query: 361 KGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           K  HGM IK    SD+++ N LIHFYA+CG L +  R F
Sbjct: 156 KAFHGMVIKVLLGSDVFILNSLIHFYAKCGELGLGYRVF 194



 Score = 65.1 bits (157), Expect = 1e-08
 Identities = 42/157 (26%), Positives = 71/157 (45%), Gaps = 3/157 (1%)
 Frame = +1

Query: 16  LIEKCSNPKQL---KQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQ 186
           LI+  S  ++L   K  H  +++  L  D F  + LI   A  +   +   ++VF   P+
Sbjct: 142 LIKAASELEELFTGKAFHGMVIKVLLGSDVFILNSLIHFYA--KCGELGLGYRVFVNXPR 199

Query: 187 PNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKG 366
            ++ SWN++I A+     P   L +F  +      KPN  T   V+ A  K  D E G+ 
Sbjct: 200 RDVVSWNSMITAFVQGGCPEEALELFQEMETQ-NVKPNGITMVGVLSACAKKSDFEFGRW 258

Query: 367 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +H    +      L + N ++  Y +CG ++ A R F
Sbjct: 259 VHSYIERNRIXESLTLSNAMLDMYTKCGSVEDAKRLF 295


>ref|XP_004241167.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
           chloroplastic-like [Solanum lycopersicum]
          Length = 744

 Score =  198 bits (503), Expect = 7e-49
 Identities = 96/160 (60%), Positives = 120/160 (75%), Gaps = 1/160 (0%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           HP V LI+K  +  QLKQIHA MLR GLFFDPFSASKLI++S+LS FSS+DYAHKVFD+I
Sbjct: 36  HPLVLLIDKSQSINQLKQIHAYMLRIGLFFDPFSASKLIEASSLSHFSSLDYAHKVFDEI 95

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           PQPNL+SWNALIRAY++   P+  + +F  +L +  E P+KFTYPFV KA  K+K +  G
Sbjct: 96  PQPNLFSWNALIRAYSSSQDPIQSILMFVNMLCEGREFPSKFTYPFVFKASAKMKAIRFG 155

Query: 361 KGIHGMAIK-EGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +G+HGM +K      D++V N LIHFYA+CGCLD A   F
Sbjct: 156 RGLHGMVVKGRDVGLDIFVLNSLIHFYADCGCLDEAYLIF 195



 Score = 62.4 bits (150), Expect = 6e-08
 Identities = 36/129 (27%), Positives = 61/129 (47%)
 Frame = +1

Query: 91  DPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSR 270
           D F  + LI   A  +   +D A+ +F+ +   ++ SWN +I  +A        L +F R
Sbjct: 171 DIFVLNSLIHFYA--DCGCLDEAYLIFENMQTRDVVSWNTMILGFAEGGYADEALKIFHR 228

Query: 271 LLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECG 450
           +  +   +PN  T   V+ A  K  DLE G+ +H    + G    L + N ++  Y +CG
Sbjct: 229 MGEE-NVRPNDVTMMAVLSACAKKLDLEFGRWVHAFIKRNGIRESLILDNAILDMYMKCG 287

Query: 451 CLDMASRAF 477
            ++ A R F
Sbjct: 288 SIEDAERLF 296



 Score = 61.6 bits (148), Expect = 1e-07
 Identities = 32/107 (29%), Positives = 55/107 (51%)
 Frame = +1

Query: 157 AHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALT 336
           A  + + +P  ++ +WNALI AY    +P   LSVF+ L      +P++ T    + A  
Sbjct: 323 ARSILNTMPSQDIVAWNALISAYEQSGKPKEALSVFNELQLIKKAEPDEVTLVCALSACA 382

Query: 337 KLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +L  ++LG  IH    K+G   + ++   LI  Y++CG ++ A   F
Sbjct: 383 QLGAIDLGGWIHVYIKKQGIKFNCHLTTALIDMYSKCGDVEKALEMF 429


>ref|XP_004295750.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
           chloroplastic-like [Fragaria vesca subsp. vesca]
          Length = 1049

 Score =  190 bits (482), Expect = 2e-46
 Identities = 84/154 (54%), Positives = 119/154 (77%)
 Frame = +1

Query: 16  LIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNL 195
           LI++C+    LKQ+HAQML+  LFFDP+SASKLI ++ALS FSS+DYA +VFD+IP+PNL
Sbjct: 92  LIDQCTTLNHLKQVHAQMLKTSLFFDPYSASKLITAAALSPFSSLDYARQVFDEIPEPNL 151

Query: 196 YSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHG 375
           ++WNALIRAYA+   P+  + +F ++L++C E PNKFT+PF++KA ++L+  ++G+G HG
Sbjct: 152 FTWNALIRAYASSPDPVESIRIFLQMLDECNECPNKFTFPFLLKAASELRASKIGRGFHG 211

Query: 376 MAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           M +K    SD+Y+ N LIHFY  CG LD+A   F
Sbjct: 212 MVVKAELGSDVYIVNSLIHFYGSCGELDLARLVF 245



 Score = 68.2 bits (165), Expect = 1e-09
 Identities = 35/115 (30%), Positives = 59/115 (51%)
 Frame = +1

Query: 133 SEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTY 312
           +   + D A +VF  +P  ++ +WN LI +Y    +P   L+VF  L  +   KP++ T 
Sbjct: 365 ARMGNYDEARRVFGTMPSQDIATWNVLISSYEQNGKPKEALAVFHELQKNKGPKPDEVTL 424

Query: 313 PFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
              + A ++L  ++LG  IH    K+G   + ++   LI  YA+CG L+ A   F
Sbjct: 425 VSTLAACSQLGAIDLGGWIHVYVKKQGMKLNCHLTTSLIDMYAKCGNLEKALEVF 479



 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 40/140 (28%), Positives = 69/140 (49%)
 Frame = +1

Query: 58  HAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRS 237
           H  +++A L  D +  + LI          +D A  VF +  + ++ SWN++I A+A  +
Sbjct: 210 HGMVVKAELGSDVYIVNSLIHF--YGSCGELDLARLVFLKSYKKDVVSWNSVITAFAQGN 267

Query: 238 RPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVG 417
            P   L +F  +  +   KPN  T   V+ A  K+ DLE G+ +     + G   +L + 
Sbjct: 268 CPEVALELFKEMEAE-NMKPNDVTLVSVLSACAKMADLEFGRWVCSHVERHGVEENLTLN 326

Query: 418 NCLIHFYAECGCLDMASRAF 477
           N ++  YA+CG ++ A R F
Sbjct: 327 NAMLDMYAKCGSVEDAERLF 346


>ref|XP_006480615.1| PREDICTED: pentatricopeptide repeat-containing protein At2g29760,
           chloroplastic-like [Citrus sinensis]
          Length = 746

 Score =  189 bits (480), Expect = 3e-46
 Identities = 89/159 (55%), Positives = 113/159 (71%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           HP  +LI++C N KQLKQIH QMLR GLFFDP+SASKL    AL  FSS++YA ++FDQI
Sbjct: 40  HPVFSLIKQCKNIKQLKQIHTQMLRTGLFFDPYSASKLFTPCALGTFSSLEYAREMFDQI 99

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           PQPNLY+WN LIRAY++ + P+    +F +L+ +    PN+FT+PFVIKA  +L    +G
Sbjct: 100 PQPNLYTWNTLIRAYSSSAEPIQSFMIFLQLVYNSPYFPNEFTFPFVIKAAARLVQFRVG 159

Query: 361 KGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           + IHGM IK  F  DL++ N LIHFYA CG L MA   F
Sbjct: 160 QAIHGMVIKSSFEDDLFISNSLIHFYAICGDLAMAYCVF 198



 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 48/152 (31%), Positives = 72/152 (47%), Gaps = 1/152 (0%)
 Frame = +1

Query: 25  KCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSA-LSEFSSVDYAHKVFDQIPQPNLYS 201
           KC + +  K +  +M       D  S + +I   A L EF   D A  V   +P   + +
Sbjct: 288 KCGSLEDAKSLFDKMEEK----DIVSWTTMIDGYAKLGEF---DAAMSVLAAVPIQQIAT 340

Query: 202 WNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMA 381
           WNALI AY    +P   LS+F   L+     P++FT+  V+ A  +L  +++G  IH   
Sbjct: 341 WNALISAYEQNGKPNEALSIFHEQLSK-NVNPDEFTFVSVLSACAQLGAMDIGVQIHAKM 399

Query: 382 IKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
            K+G   + Y+   LI  Y +CG LD A   F
Sbjct: 400 KKQGIKLNCYLTTSLIDMYTKCGNLDKALEVF 431



 Score = 57.8 bits (138), Expect = 2e-06
 Identities = 36/141 (25%), Positives = 67/141 (47%)
 Frame = +1

Query: 55  IHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATR 234
           IH  ++++    D F ++ LI   A+     +  A+ VF  I + ++ SWN++I  +   
Sbjct: 162 IHGMVIKSSFEDDLFISNSLIHFYAIC--GDLAMAYCVFVMIGKKDVVSWNSMISGFVQG 219

Query: 235 SRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYV 414
                 + ++  +  +   KP++ T   V+ A  K +DLE G+ +     K G   DL +
Sbjct: 220 GFFEKAIELYREMEME-NVKPDEVTMVAVLSACAKKRDLEFGRWVCSYIEKNGIKMDLTL 278

Query: 415 GNCLIHFYAECGCLDMASRAF 477
            N ++  Y +CG L+ A   F
Sbjct: 279 SNAMLDMYVKCGSLEDAKSLF 299


>ref|XP_006428806.1| hypothetical protein CICLE_v10011151mg [Citrus clementina]
           gi|557530863|gb|ESR42046.1| hypothetical protein
           CICLE_v10011151mg [Citrus clementina]
          Length = 737

 Score =  189 bits (480), Expect = 3e-46
 Identities = 89/159 (55%), Positives = 113/159 (71%)
 Frame = +1

Query: 1   HPTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQI 180
           HP  +LI++C N KQLKQIH QMLR GLFFDP+SASKL    AL  FSS++YA ++FDQI
Sbjct: 31  HPVFSLIKQCKNIKQLKQIHTQMLRTGLFFDPYSASKLFTPCALGTFSSLEYAREMFDQI 90

Query: 181 PQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELG 360
           PQPNLY+WN LIRAY++ + P+    +F +L+ +    PN+FT+PFVIKA  +L    +G
Sbjct: 91  PQPNLYTWNTLIRAYSSSAEPIQSFMIFLQLVYNSPYFPNEFTFPFVIKAAARLVQFRVG 150

Query: 361 KGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           + IHGM IK  F  DL++ N LIHFYA CG L MA   F
Sbjct: 151 QAIHGMVIKSSFEDDLFISNSLIHFYAICGDLAMAYCVF 189



 Score = 69.3 bits (168), Expect = 5e-10
 Identities = 48/152 (31%), Positives = 72/152 (47%), Gaps = 1/152 (0%)
 Frame = +1

Query: 25  KCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSA-LSEFSSVDYAHKVFDQIPQPNLYS 201
           KC + +  K +  +M       D  S + +I   A L EF   D A  V   +P   + +
Sbjct: 279 KCGSLEDAKSLFDKMEEK----DIVSWTTMIDGYAKLGEF---DAAMSVLAAVPIQQIAT 331

Query: 202 WNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMA 381
           WNALI AY    +P   LS+F   L+     P++FT+  V+ A  +L  +++G  IH   
Sbjct: 332 WNALISAYEQNGKPNEALSIFHEQLSK-NVNPDEFTFVSVLSACAQLGAMDIGVQIHAKM 390

Query: 382 IKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
            K+G   + Y+   LI  Y +CG LD A   F
Sbjct: 391 KKQGIKLNCYLTTSLIDMYTKCGNLDKALEVF 422



 Score = 57.8 bits (138), Expect = 2e-06
 Identities = 36/141 (25%), Positives = 67/141 (47%)
 Frame = +1

Query: 55  IHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATR 234
           IH  ++++    D F ++ LI   A+     +  A+ VF  I + ++ SWN++I  +   
Sbjct: 153 IHGMVIKSSFEDDLFISNSLIHFYAIC--GDLAMAYCVFVMIGKKDVVSWNSMISGFVQG 210

Query: 235 SRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYV 414
                 + ++  +  +   KP++ T   V+ A  K +DLE G+ +     K G   DL +
Sbjct: 211 GFFEKAIELYREMEME-NVKPDEVTMVAVLSACAKKRDLEFGRWVCSYIEKNGIKMDLTL 269

Query: 415 GNCLIHFYAECGCLDMASRAF 477
            N ++  Y +CG L+ A   F
Sbjct: 270 SNAMLDMYVKCGSLEDAKSLF 290


>ref|XP_007016122.1| Tetratricopeptide repeat (TPR)-like superfamily protein [Theobroma
           cacao] gi|508786485|gb|EOY33741.1| Tetratricopeptide
           repeat (TPR)-like superfamily protein [Theobroma cacao]
          Length = 733

 Score =  186 bits (473), Expect = 2e-45
 Identities = 87/158 (55%), Positives = 115/158 (72%)
 Frame = +1

Query: 4   PTVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIP 183
           P ++ I +C+N  QLKQIHAQMLR GLFF+P+SASKL  +SALS FSS+DYA KVFDQIP
Sbjct: 27  PVLSRINQCTNLNQLKQIHAQMLRTGLFFNPYSASKLFAASALSPFSSLDYARKVFDQIP 86

Query: 184 QPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGK 363
           +PNLY+WN LIR YA+   PL  + +F R++++    PNKFT+PFVIKA  ++  + +G+
Sbjct: 87  KPNLYTWNTLIRVYASGPEPLQGILIFLRMVDESPYYPNKFTFPFVIKAAAEIVSVCVGQ 146

Query: 364 GIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
            +HGM IK    +D+++ N LIH Y  CG LD A R F
Sbjct: 147 ALHGMVIKASLGADVFISNSLIHLYLSCGDLDSAYRVF 184



 Score = 73.2 bits (178), Expect = 4e-11
 Identities = 45/141 (31%), Positives = 72/141 (51%)
 Frame = +1

Query: 55  IHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATR 234
           +H  +++A L  D F ++ LI          +D A++VF  I + ++ SWN+LI   A +
Sbjct: 148 LHGMVIKASLGADVFISNSLIH--LYLSCGDLDSAYRVFMMIGEKDVVSWNSLITGLAQK 205

Query: 235 SRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYV 414
                 L +F R+  +   KPN  T   V+ A TK  DLE G+ +     + G S +L +
Sbjct: 206 GCAEKALELFRRMDAESV-KPNDVTMVGVLSACTKKLDLEFGRWVCSYIERNGISVNLTL 264

Query: 415 GNCLIHFYAECGCLDMASRAF 477
            N ++  YA+CG L+ A R F
Sbjct: 265 SNAMLDMYAKCGSLEDAKRLF 285



 Score = 66.2 bits (160), Expect = 4e-09
 Identities = 30/115 (26%), Positives = 60/115 (52%)
 Frame = +1

Query: 133 SEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTY 312
           ++    + A +V D +P+ ++ +WNALI  Y    +P   L+++  L      KP++ T 
Sbjct: 304 AKLGEYEAARRVLDIMPRQDIAAWNALISGYEQNGKPKEALAIYHELKLSKIAKPDEITL 363

Query: 313 PFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
              + A  +L  +++G+GIH    ++G   + ++   LI  Y++CG ++ A   F
Sbjct: 364 VSTLSACAQLGAMDIGRGIHAYVKEQGIQLNCHLTTSLIDMYSKCGDVNKALEVF 418


>ref|XP_002525630.1| pentatricopeptide repeat-containing protein, putative [Ricinus
           communis] gi|223535066|gb|EEF36748.1| pentatricopeptide
           repeat-containing protein, putative [Ricinus communis]
          Length = 765

 Score =  183 bits (464), Expect = 2e-44
 Identities = 87/157 (55%), Positives = 114/157 (72%)
 Frame = +1

Query: 7   TVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQ 186
           T++LI++C+N K LK++HA +LR+GLFF P++ASKL   +ALS FSS+DYA KVF++I Q
Sbjct: 34  TLSLIDQCTNLKHLKELHATILRSGLFFHPYNASKLFSVAALSSFSSLDYARKVFEEISQ 93

Query: 187 PNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKG 366
           PNLY+WN LIRA+A+   P+H L +F R+L D  + PNKFT+PFVIKA   +  L   + 
Sbjct: 94  PNLYTWNTLIRAFASSPEPIHSLLIFIRMLYDSPDFPNKFTFPFVIKAAAGVASLPFSQA 153

Query: 367 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           IHGMAIK    SDL++ N LIH YA CG LD A   F
Sbjct: 154 IHGMAIKASLGSDLFILNSLIHCYASCGDLDSAYSVF 190



 Score = 64.7 bits (156), Expect = 1e-08
 Identities = 42/141 (29%), Positives = 69/141 (48%)
 Frame = +1

Query: 55  IHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATR 234
           IH   ++A L  D F  + LI   A      +D A+ VF +I + ++ SWN++I+ +   
Sbjct: 154 IHGMAIKASLGSDLFILNSLIHCYA--SCGDLDSAYSVFVKIEEKDVVSWNSMIKGFVLG 211

Query: 235 SRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYV 414
             P   L +F +L+     +PN  T   V+ A  K  DLE G+ +     + G + +L V
Sbjct: 212 GCPDKALELF-QLMKAENVRPNDVTMVGVLSACAKKMDLEFGRRVCHYIERNGINVNLTV 270

Query: 415 GNCLIHFYAECGCLDMASRAF 477
            N ++  Y + G L+ A R F
Sbjct: 271 SNAMLDMYVKNGSLEDARRLF 291



 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 38/129 (29%), Positives = 63/129 (48%)
 Frame = +1

Query: 91  DPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSR 270
           D FS + +I   A       D A  VFD +P+ ++ +WN LI AY    +P   L++F  
Sbjct: 298 DIFSWTTMIDGYAKRR--DFDAARSVFDAMPRQDISAWNVLISAYEQDGKPKEALAIFHE 355

Query: 271 LLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECG 450
           L    T KP++ T    + A  +L  +++G  IH    K+    + ++   LI  Y++CG
Sbjct: 356 LQLSKTAKPDEVTLVSTLSACAQLGAIDIGGWIHVYIKKQDIKLNCHLTTSLIDMYSKCG 415

Query: 451 CLDMASRAF 477
            ++ A   F
Sbjct: 416 EVEKALDIF 424


>gb|AFN53666.1| hypothetical protein [Linum usitatissimum]
          Length = 850

 Score =  181 bits (458), Expect = 1e-43
 Identities = 88/155 (56%), Positives = 113/155 (72%)
 Frame = +1

Query: 13  ALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPN 192
           AL ++C++ KQLKQIHAQMLR     DP++AS+L  ++A S FS++DYA KVFDQIPQPN
Sbjct: 144 ALFQQCTSFKQLKQIHAQMLRTNKLHDPYAASELFTAAAFSSFSALDYARKVFDQIPQPN 203

Query: 193 LYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIH 372
           LYSWN LIRA AT S P+  + VF R+L+D    PNKFT+P +IKA+ + +   +GK +H
Sbjct: 204 LYSWNILIRALATSSDPIQSVLVFIRMLHDSPFGPNKFTFPVLIKAVAERRCFLVGKAVH 263

Query: 373 GMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           GMAIK  F  D++V N LIHFYA CG LD+A   F
Sbjct: 264 GMAIKTSFGDDVFVLNSLIHFYASCGHLDLAYLVF 298



 Score = 58.5 bits (140), Expect = 9e-07
 Identities = 32/116 (27%), Positives = 57/116 (49%), Gaps = 1/116 (0%)
 Frame = +1

Query: 133 SEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRL-LNDCTEKPNKFT 309
           ++ S    A  +FD +P+ ++ +WN LI  Y    RP   L++F  L L     +P++ T
Sbjct: 420 AKMSEHGIARDIFDSMPRKDIPAWNVLISGYEQSGRPKEALAIFRELQLTKSGARPDQVT 479

Query: 310 YPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
               + A  +L  +++G+ IHG   KE    +  +   LI  Y++ G ++ A   F
Sbjct: 480 LLSTLSACAQLGAMDIGEWIHGYIKKERIQLNRNLATSLIDMYSKSGDVEKAIEVF 535


>ref|XP_002314110.1| pentatricopeptide repeat-containing family protein [Populus
           trichocarpa] gi|222850518|gb|EEE88065.1|
           pentatricopeptide repeat-containing family protein
           [Populus trichocarpa]
          Length = 738

 Score =  179 bits (455), Expect = 3e-43
 Identities = 87/154 (56%), Positives = 112/154 (72%)
 Frame = +1

Query: 16  LIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNL 195
           LI+KC+N K LKQ+HA MLR GLFFDP SA+KL  + ALS  SS+DYA KVFDQIP+PNL
Sbjct: 36  LIDKCANKKHLKQLHAHMLRTGLFFDPPSATKLFTACALSSPSSLDYACKVFDQIPRPNL 95

Query: 196 YSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHG 375
           Y+WN LIRA+A+  +P+  L VF ++L++    PN +T+PFVIKA T++  L  G+ IHG
Sbjct: 96  YTWNTLIRAFASSPKPIQGLLVFIQMLHESQRFPNSYTFPFVIKAATEVSSLLAGQAIHG 155

Query: 376 MAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           M +K  F SDL++ N LIHFY+  G LD A   F
Sbjct: 156 MVMKASFGSDLFISNSLIHFYSSLGDLDSAYLVF 189



 Score = 72.8 bits (177), Expect = 5e-11
 Identities = 36/109 (33%), Positives = 59/109 (54%)
 Frame = +1

Query: 151 DYAHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKA 330
           D A +VFD +P+ ++ +WNALI +Y    +P   L++F  L  +   KPN+ T    + A
Sbjct: 315 DAARRVFDVMPREDITAWNALISSYQQNGKPKEALAIFRELQLNKNTKPNEVTLASTLAA 374

Query: 331 LTKLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
             +L  ++LG  IH    K+G   + ++   LI  Y++CG L+ A   F
Sbjct: 375 CAQLGAMDLGGWIHVYIKKQGIKLNFHITTSLIDMYSKCGHLEKALEVF 423



 Score = 70.9 bits (172), Expect = 2e-10
 Identities = 41/141 (29%), Positives = 69/141 (48%)
 Frame = +1

Query: 55  IHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYATR 234
           IH  +++A    D F ++ LI     S    +D A+ VF +I + ++ SWN++I  +   
Sbjct: 153 IHGMVMKASFGSDLFISNSLIHF--YSSLGDLDSAYLVFSKIVEKDIVSWNSMISGFVQG 210

Query: 235 SRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDLYV 414
             P   L +F R+  +   +PN+ T   V+ A  K  DLE G+       + G   +L +
Sbjct: 211 GSPEEALQLFKRMKME-NARPNRVTMVGVLSACAKRIDLEFGRWACDYIERNGIDINLIL 269

Query: 415 GNCLIHFYAECGCLDMASRAF 477
            N ++  Y +CG L+ A R F
Sbjct: 270 SNAMLDMYVKCGSLEDARRLF 290


>ref|XP_006293749.1| hypothetical protein CARUB_v10022711mg [Capsella rubella]
           gi|482562457|gb|EOA26647.1| hypothetical protein
           CARUB_v10022711mg [Capsella rubella]
          Length = 739

 Score =  174 bits (442), Expect = 9e-42
 Identities = 80/157 (50%), Positives = 112/157 (71%)
 Frame = +1

Query: 7   TVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQ 186
           T++LI++CSN +QLKQ HA M+R G F DP+SASKL   +ALS F+S++YA KVFD+IPQ
Sbjct: 34  TISLIDRCSNLRQLKQTHAHMIRTGTFSDPYSASKLFAIAALSSFASLEYARKVFDEIPQ 93

Query: 187 PNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKG 366
           PN ++WN LIRAYA+   P+  + +F  ++++    PNK+T+PF++KA  ++  L LG+ 
Sbjct: 94  PNSFTWNTLIRAYASGPDPVRSIWIFLDMVSESQCYPNKYTFPFLVKAAAEVSSLSLGQS 153

Query: 367 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +HGMAIK     DL+V N LIH Y  CG LD A + F
Sbjct: 154 LHGMAIKSAVGCDLFVANSLIHCYFSCGDLDSACKVF 190



 Score = 75.9 bits (185), Expect = 6e-12
 Identities = 45/151 (29%), Positives = 75/151 (49%)
 Frame = +1

Query: 25  KCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSW 204
           KC + ++ K++   M       D  + + ++   A+SE    + A +V + +P+ ++ +W
Sbjct: 280 KCGSIEEAKRLFDTMEEK----DNVTFTTMLDGYAISE--DYEAAREVLNSMPKKDIVAW 333

Query: 205 NALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAI 384
           NALI AY    +P   L VF  L      K N+ T    + A  ++  LELG+ IH    
Sbjct: 334 NALISAYEQNGKPNEALLVFHELQLQKNIKLNQITLVSTLSACAQVGALELGRWIHSYIK 393

Query: 385 KEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           K G   + Y+ + LIH Y++CG L+ A   F
Sbjct: 394 KHGIRMNFYITSALIHMYSKCGDLEKAREVF 424



 Score = 64.3 bits (155), Expect = 2e-08
 Identities = 36/143 (25%), Positives = 72/143 (50%)
 Frame = +1

Query: 49  KQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYA 228
           + +H   +++ +  D F A+ LI          +D A KVF  I + ++ SWN++I  + 
Sbjct: 152 QSLHGMAIKSAVGCDLFVANSLIH--CYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFV 209

Query: 229 TRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDL 408
            +  P   L +F ++ ++   K +  T   V+ A TKL++LE G+ +     +   + ++
Sbjct: 210 QKGSPDKALELFKKMESEDV-KASHVTMVGVLSACTKLRNLEFGRQVCSFIEENRVNVNM 268

Query: 409 YVGNCLIHFYAECGCLDMASRAF 477
            + N ++  Y +CG ++ A R F
Sbjct: 269 TLANAMLDMYTKCGSIEEAKRLF 291


>ref|XP_006410036.1| hypothetical protein EUTSA_v10016305mg [Eutrema salsugineum]
           gi|557111205|gb|ESQ51489.1| hypothetical protein
           EUTSA_v10016305mg [Eutrema salsugineum]
          Length = 739

 Score =  172 bits (436), Expect = 4e-41
 Identities = 82/157 (52%), Positives = 113/157 (71%)
 Frame = +1

Query: 7   TVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQ 186
           T++LI++C++ +QLKQIHAQM+R GLF D +SASKL   +ALS F+S+DYA KVFDQIPQ
Sbjct: 34  TLSLIDRCADLRQLKQIHAQMVRTGLFNDHYSASKLFAIAALSPFASLDYACKVFDQIPQ 93

Query: 187 PNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKG 366
           PN ++WN LIRAYA+   PL  + VF  ++++    PN +T+PF+IKA  ++  L LG+ 
Sbjct: 94  PNSFTWNTLIRAYASGPDPLRSICVFLDMVSESQCYPNTYTFPFLIKAAAEVSSLSLGQS 153

Query: 367 IHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           +HGMA+K     D++V N LIH Y  CG LD A + F
Sbjct: 154 LHGMAVKSSVGCDVFVANSLIHCYFSCGDLDSACKVF 190



 Score = 72.0 bits (175), Expect = 8e-11
 Identities = 39/107 (36%), Positives = 55/107 (51%)
 Frame = +1

Query: 157 AHKVFDQIPQPNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALT 336
           A  V + +P+ ++ +WNALI AY    +P   L VF  L      K N+ T    + A  
Sbjct: 318 ARDVLNSMPKKDIVAWNALISAYEQNGKPNEALLVFHELQLQKNIKLNQITLVSTLSACA 377

Query: 337 KLKDLELGKGIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           ++  LELG+ IH    K G  S+ YV + LIH Y++CG L  A   F
Sbjct: 378 QVGALELGRWIHSYIKKHGIRSNFYVTSALIHMYSKCGDLVKAREVF 424



 Score = 65.1 bits (157), Expect = 1e-08
 Identities = 37/143 (25%), Positives = 71/143 (49%)
 Frame = +1

Query: 49  KQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYA 228
           + +H   +++ +  D F A+ LI          +D A KVF  I + ++ SWN++I  + 
Sbjct: 152 QSLHGMAVKSSVGCDVFVANSLIH--CYFSCGDLDSACKVFTTIQEKDVVSWNSMITGFV 209

Query: 229 TRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDL 408
            +  P   L +F ++ ++   K +  T   V+ A  KL++LE G+ +     + G   +L
Sbjct: 210 QKGSPDKALELFKKMESE-EVKASHVTMVGVLSACAKLRNLEFGRQVCSYIEENGVKMNL 268

Query: 409 YVGNCLIHFYAECGCLDMASRAF 477
            + N ++  Y +CG ++ A R F
Sbjct: 269 TLANAMLDMYTKCGSIEDAKRLF 291


>ref|XP_002879234.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
           gi|297325073|gb|EFH55493.1| predicted protein
           [Arabidopsis lyrata subsp. lyrata]
          Length = 740

 Score =  169 bits (428), Expect = 4e-40
 Identities = 80/158 (50%), Positives = 114/158 (72%), Gaps = 1/158 (0%)
 Frame = +1

Query: 7   TVALIEKCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQ 186
           T++LI++CS+ +QLKQ HA M+R G+F DP+SASKL   +ALS F+S++YA KVFD+IPQ
Sbjct: 34  TISLIDRCSSLRQLKQTHAHMIRTGMFSDPYSASKLFAIAALSSFASLEYARKVFDEIPQ 93

Query: 187 PNLYSWNALIRAYATRSRPLHCLSVFSRLLNDCTE-KPNKFTYPFVIKALTKLKDLELGK 363
           PN ++WN LIRAYA+   P+  +  F  +++  ++  PNK+T+PF+IKA  ++  L LG+
Sbjct: 94  PNSFTWNTLIRAYASGPDPVCSIWAFLDMVSSESQCYPNKYTFPFLIKAAAEVSSLSLGQ 153

Query: 364 GIHGMAIKEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
            +HGMAIK    SD++V N LIH Y  CG LD A + F
Sbjct: 154 SLHGMAIKSAVGSDVFVANSLIHCYFSCGDLDSACKVF 191



 Score = 75.1 bits (183), Expect = 1e-11
 Identities = 46/151 (30%), Positives = 74/151 (49%)
 Frame = +1

Query: 25  KCSNPKQLKQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSW 204
           KC + +  K++   M       D  + + ++   A+SE    + A +V + +P+ ++ +W
Sbjct: 281 KCGSIEDAKRLFDAMEEK----DNVTWTTMLDGYAISE--DYEAAREVLNAMPKKDIVAW 334

Query: 205 NALIRAYATRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAI 384
           NALI AY    +P   L VF  L      K N+ T    + A  ++  LELG+ IH    
Sbjct: 335 NALISAYEQNGKPNEALLVFHELQLQKNIKLNQITLVSTLSACAQVGALELGRWIHSYIK 394

Query: 385 KEGFSSDLYVGNCLIHFYAECGCLDMASRAF 477
           K G   + YV + LIH Y++CG L+ A   F
Sbjct: 395 KNGIKMNFYVTSALIHMYSKCGDLEKAREVF 425



 Score = 63.9 bits (154), Expect = 2e-08
 Identities = 36/143 (25%), Positives = 71/143 (49%)
 Frame = +1

Query: 49  KQIHAQMLRAGLFFDPFSASKLIQSSALSEFSSVDYAHKVFDQIPQPNLYSWNALIRAYA 228
           + +H   +++ +  D F A+ LI          +D A KVF  I + ++ SWN++I  + 
Sbjct: 153 QSLHGMAIKSAVGSDVFVANSLIH--CYFSCGDLDSACKVFTTIKEKDVVSWNSMINGFV 210

Query: 229 TRSRPLHCLSVFSRLLNDCTEKPNKFTYPFVIKALTKLKDLELGKGIHGMAIKEGFSSDL 408
            +  P   L +F ++ ++   K +  T   V+ A  K++DLE G+ +     +   + +L
Sbjct: 211 QKGSPDKALELFKKMESEDV-KASHVTMVGVLSACAKIRDLEFGRRVCSYIEENRVNVNL 269

Query: 409 YVGNCLIHFYAECGCLDMASRAF 477
            + N ++  Y +CG ++ A R F
Sbjct: 270 TLANAMLDMYTKCGSIEDAKRLF 292

