BLASTX nr result

ID: Rheum21_contig00034697 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rheum21_contig00034697
         (694 letters)

Database: nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006339424.1| PREDICTED: pentatricopeptide repeat-containi...   179   1e-42
emb|CBI26162.3| unnamed protein product [Vitis vinifera]              178   2e-42
ref|XP_004297017.1| PREDICTED: pentatricopeptide repeat-containi...   177   3e-42
ref|XP_006489230.1| PREDICTED: pentatricopeptide repeat-containi...   176   8e-42
gb|EOY06281.1| Tetratricopeptide repeat-like superfamily protein...   172   9e-41
ref|XP_004229464.1| PREDICTED: pentatricopeptide repeat-containi...   169   6e-40
ref|XP_006419767.1| hypothetical protein CICLE_v10007051mg, part...   168   2e-39
ref|XP_004155892.1| PREDICTED: pentatricopeptide repeat-containi...   162   1e-37
ref|XP_004134313.1| PREDICTED: pentatricopeptide repeat-containi...   162   1e-37
ref|XP_003617724.1| Pentatricopeptide repeat-containing protein ...   159   8e-37
ref|XP_004491488.1| PREDICTED: pentatricopeptide repeat-containi...   155   9e-36
ref|XP_003530332.1| PREDICTED: pentatricopeptide repeat-containi...   150   4e-34
ref|XP_006289665.1| hypothetical protein CARUB_v10003224mg [Caps...   141   2e-31
gb|ESW25446.1| hypothetical protein PHAVU_003G036500g [Phaseolus...   140   3e-31
gb|ABD96889.1| hypothetical protein [Cleome spinosa]                  138   1e-30
ref|XP_002873920.1| pentatricopeptide repeat-containing protein ...   137   3e-30
ref|NP_197396.1| pentatricopeptide repeat-containing protein [Ar...   134   3e-29
gb|EMJ18641.1| hypothetical protein PRUPE_ppb015972mg [Prunus pe...   127   3e-27
gb|EOY06282.1| Tetratricopeptide repeat-like superfamily protein...   107   4e-21
gb|EPS61555.1| hypothetical protein M569_13242, partial [Genlise...    99   1e-18

>ref|XP_006339424.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Solanum tuberosum]
          Length = 601

 Score =  179 bits (453), Expect = 1e-42
 Identities = 94/216 (43%), Positives = 131/216 (60%)
 Frame = +2

Query: 29  ERPSFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFNW 208
           E+ SF +    VC++IRTR +WE ILL+  P+   +D +F +EVLK QKNV LSLRF  W
Sbjct: 53  EQQSFAEIAKDVCKVIRTRPRWEQILLSDFPTVNFTDPRFYTEVLKAQKNVMLSLRFHFW 112

Query: 209 LNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSID 388
           L+S   F  D  S +++F              F +N+NFVP     +  I+ LC  G I+
Sbjct: 113 LSSQNGFSRDQFSDEVIFSGLVQAKAASAAKCFRQNMNFVPQPSCLEAYIQCLCENGLIE 172

Query: 389 VAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGVSTDAETVECLV 568
            A+D+F++ +  G CPSL  WNS  S +IR GRTD+ W LYE+M  SGV  D +T+  L+
Sbjct: 173 DALDVFTELRGVGHCPSLRIWNSALSDSIRAGRTDIVWKLYEDMTESGVVADVDTIGHLI 232

Query: 569 WAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLI 676
            AFC++++   G++LLR VL  G  PS S  F+KLI
Sbjct: 233 QAFCMENKFPEGHQLLRQVLEAGHAPS-SVAFNKLI 267


>emb|CBI26162.3| unnamed protein product [Vitis vinifera]
          Length = 636

 Score =  178 bits (451), Expect = 2e-42
 Identities = 91/222 (40%), Positives = 130/222 (58%)
 Frame = +2

Query: 26  EERPSFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFN 205
           +++   E+ V +V  + RTR +WE  LL+  PS    D  FLS  ++ QKN  +SLRFF+
Sbjct: 89  QQQQHLEEIVKRVSDITRTRPRWEQTLLSDFPSFNFLDPTFLSHFVEHQKNALISLRFFH 148

Query: 206 WLNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSI 385
           WL+S   F PD  SC ++F             SF+++ NF P   + +  IR LC+ G +
Sbjct: 149 WLSSQSGFSPDSSSCNVLFDALVEAGACNAAKSFLDSTNFNPKPASLEAYIRCLCKGGLV 208

Query: 386 DVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGVSTDAETVECL 565
           + A+ +F + K  G+C S+ATWNS+  G++R GR D  W LY EM+ S V  D  TV  L
Sbjct: 209 EEAISVFGQLKGIGVCASIATWNSVLRGSVRAGRIDFVWELYGEMVESSVVADVHTVGYL 268

Query: 566 VWAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVFGK 691
           V AFC ++ +  G+ LLR VL DGV P  ++ F+KLIS F K
Sbjct: 269 VQAFCDENRISDGHNLLRRVLEDGVVPR-NAAFNKLISGFCK 309


>ref|XP_004297017.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Fragaria vesca subsp. vesca]
          Length = 382

 Score =  177 bits (449), Expect = 3e-42
 Identities = 89/207 (42%), Positives = 124/207 (59%)
 Frame = +2

Query: 59  QVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFNWLNSHFEFKPD 238
           Q+C +IRT+ +WE+ L +  PS   SD  F+ EV+K+Q NV LS+RFF WL +   F PD
Sbjct: 45  QICHVIRTKPRWENTLSSEYPSSNFSDPLFIREVVKQQSNVFLSVRFFLWLGTREGFSPD 104

Query: 239 DLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSIDVAVDLFSKCK 418
            +SC  +F             SFI++  F P+    +   R L   G +  A  +F + K
Sbjct: 105 PISCNAVFGALVEGNACSAAKSFIKHTGFSPEPVLLESYARCLWEAGRVKEASSVFKRLK 164

Query: 419 DYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGVSTDAETVECLVWAFCVDDEVE 598
           + G+CP + TWN+  SG I+  RTD+ W LY+EM+  GV+ D ETVECLV  +C D+EV 
Sbjct: 165 EAGVCPGIGTWNAALSGCIKARRTDMVWKLYQEMMEYGVAADVETVECLVRGYCDDNEVL 224

Query: 599 RGYELLRHVLIDGVKPSCSSIFDKLIS 679
           +GY LL  VL DGV P   ++FD+LIS
Sbjct: 225 KGYGLLSQVLGDGVVPG-KAVFDRLIS 250


>ref|XP_006489230.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Citrus sinensis]
          Length = 589

 Score =  176 bits (445), Expect = 8e-42
 Identities = 92/222 (41%), Positives = 131/222 (59%), Gaps = 2/222 (0%)
 Frame = +2

Query: 26  EERPSFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFN 205
           E +  + +   QVC++ RT+ +WE  LL+  PS   +D  F  E LK+Q N+ LS+RFF 
Sbjct: 35  ESQQLYTEIAKQVCKITRTKPRWEQTLLSDFPSFNFNDPLFFREFLKQQNNMLLSIRFFQ 94

Query: 206 WLNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSI 385
           WL+SH+ F PD  SC ++F              F++   F P+ ++ +  I+ LC  G I
Sbjct: 95  WLHSHYGFSPDLDSCNVLFDSLVEARAFKVAMDFLDITGFSPNPNSLELYIQCLCESGMI 154

Query: 386 DVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGV--STDAETVE 559
           + A  +FSK K+ G+  S+ TWNS   G I+  RTD+ W LY +MI SG+    DAET+ 
Sbjct: 155 EEAFRVFSKLKEMGVFGSIKTWNSALLGCIKVDRTDLLWKLYHDMIESGIVADVDAETIG 214

Query: 560 CLVWAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVF 685
            L+ AFC D +V  GYELLR VL DG+ P  ++ F+KLIS F
Sbjct: 215 YLIQAFCNDGKVSEGYELLRQVLEDGLVPE-NTAFNKLISRF 255



 Score = 61.6 bits (148), Expect = 2e-07
 Identities = 35/122 (28%), Positives = 62/122 (50%), Gaps = 3/122 (2%)
 Frame = +2

Query: 326 VPDSDAFQRLIRLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWG 505
           +P+   +  +I   CR  +++ A  L  +  D G   +  T+N++ +G    GRTD  + 
Sbjct: 347 LPNEYTYNSMIHGYCRIDNLEEAKRLHKEMLDKGYGETTVTYNTLIAGLCLHGRTDKAYH 406

Query: 506 LYEEMIGSGVSTDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSS---IFDKLI 676
           L+EEM   G+  D  T   L+  +C + ++    +LL  +L  G++PS SS   + +KL 
Sbjct: 407 LFEEMAQKGIFRDVITYNTLIQGYCEEGKIVNSKKLLEELLALGLQPSASSYTHLIEKLC 466

Query: 677 SV 682
            V
Sbjct: 467 QV 468


>gb|EOY06281.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
           1 [Theobroma cacao]
          Length = 610

 Score =  172 bits (436), Expect = 9e-41
 Identities = 92/217 (42%), Positives = 128/217 (58%), Gaps = 2/217 (0%)
 Frame = +2

Query: 47  DAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFNWLNSHFE 226
           D V QVC++ RT  +WE  LL++ PS   SD  F  E+L++Q+NV LSL FF+WL S ++
Sbjct: 63  DIVKQVCKITRTIPRWEENLLSKFPSFNFSDPVFFRELLRQQENVFLSLCFFHWLRSKYD 122

Query: 227 FKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSIDVAVDLF 406
           F PD  SC ++F             +F+E   F P+  A +  +R LC  G ++ AV++F
Sbjct: 123 FSPDLDSCNVLFDKLVEANACKAARNFLEQTGFSPEPRALELYLRRLCEVGLVEEAVEMF 182

Query: 407 SKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGVSTDAE--TVECLVWAFC 580
           S     G  PS+ATWN      ++ GR D  W LY++MI SGV  D +  TV CL+ AFC
Sbjct: 183 SMLNKIGYRPSVATWNLALLAFLKVGRNDFVWKLYQDMIDSGVVVDIDVATVGCLIQAFC 242

Query: 581 VDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVFGK 691
            D    +GYELLR VL DG+ P  + +F+KLI+ F K
Sbjct: 243 NDGNASKGYELLRQVLEDGLVPD-NVVFNKLIAGFCK 278



 Score = 58.9 bits (141), Expect = 1e-06
 Identities = 31/115 (26%), Positives = 58/115 (50%)
 Frame = +2

Query: 314 NVNFVPDSDAFQRLIRLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTD 493
           N   +P+   +  L+  L +   ++ A  LF +  D G   +  ++N++ +G    GR D
Sbjct: 364 NKGMLPNEYTYNALLHGLYKVHDLEEAEKLFKEMLDRGYGETTVSYNTMIAGFCWHGRMD 423

Query: 494 VFWGLYEEMIGSGVSTDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSS 658
             + L+EEM   G+  D  T   L+  FC++ ++     LL+ +L+ G++PS  S
Sbjct: 424 EAYRLFEEMPEKGIVRDLITFNTLIKGFCMEGKIVESLNLLKELLVQGLQPSTPS 478


>ref|XP_004229464.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Solanum lycopersicum]
          Length = 601

 Score =  169 bits (429), Expect = 6e-40
 Identities = 91/216 (42%), Positives = 127/216 (58%)
 Frame = +2

Query: 29  ERPSFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFNW 208
           E+ SF +    VC++IRTR +WE ILL+  P+   +D +F +EVLK QKN+ LSLRF  W
Sbjct: 53  EQLSFAEIAKDVCKVIRTRPRWEQILLSDFPTVNFTDPRFYTEVLKAQKNIMLSLRFHFW 112

Query: 209 LNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSID 388
           L+S   F  D  S +++F              F +N+ FVP  +  +  I+ LC  G I+
Sbjct: 113 LSSQNGFSRDQFSDEVIFSGLVQAKAASAAKCFRQNMIFVPQPNCLEAYIQCLCENGLIE 172

Query: 389 VAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGVSTDAETVECLV 568
            A+D+F++ +  G CPSL  WNS  S +IR GRTD  W LYE+M  SGV  D  T+  L+
Sbjct: 173 DALDVFTELRSVGHCPSLRIWNSALSDSIRAGRTDTVWKLYEDMTESGVVADVGTIGHLI 232

Query: 569 WAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLI 676
            AFC+++    G++LLR  L  G  PS S  F+KLI
Sbjct: 233 QAFCMENNFPDGHQLLRQALEAGHAPS-SVAFNKLI 267


>ref|XP_006419767.1| hypothetical protein CICLE_v10007051mg, partial [Citrus clementina]
           gi|557521640|gb|ESR33007.1| hypothetical protein
           CICLE_v10007051mg, partial [Citrus clementina]
          Length = 540

 Score =  168 bits (425), Expect = 2e-39
 Identities = 86/208 (41%), Positives = 121/208 (58%), Gaps = 2/208 (0%)
 Frame = +2

Query: 26  EERPSFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFN 205
           E +  + +   QVC++ RT+ +WE  LL+  PS   +D  F  E LK+Q N+ LS+RFF 
Sbjct: 3   ESQQLYTEIAKQVCKITRTKPRWEQTLLSDFPSFNFNDPLFFREFLKQQNNMLLSIRFFQ 62

Query: 206 WLNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSI 385
           WL+SH+ F PD  SC ++F              F+    F P+ ++ +  I+ LC  G I
Sbjct: 63  WLHSHYGFSPDLDSCNVLFDSLVEARAFKVAKEFLAITGFSPNPNSLELYIQCLCESGMI 122

Query: 386 DVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGV--STDAETVE 559
           + A  +FSK K+ G+  S+ TWNS   G I+  RTD+ W LY +MI SG+    DAET+ 
Sbjct: 123 EEAFRVFSKLKEMGVFGSIKTWNSALLGCIKVDRTDLLWKLYHDMIESGIVADVDAETIG 182

Query: 560 CLVWAFCVDDEVERGYELLRHVLIDGVK 643
            L+ AFC D +V  GYELLR VL DG+K
Sbjct: 183 YLIQAFCNDGKVAEGYELLRQVLEDGLK 210



 Score = 61.6 bits (148), Expect = 2e-07
 Identities = 35/122 (28%), Positives = 62/122 (50%), Gaps = 3/122 (2%)
 Frame = +2

Query: 326 VPDSDAFQRLIRLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWG 505
           +P+   +  +I   CR  +++ A  L  +  D G   +  T+N++ +G    GRTD  + 
Sbjct: 298 LPNEYTYNSMIHGYCRIDNLEEAKRLHKEMLDKGYGETTVTYNTLIAGLCLHGRTDKAYH 357

Query: 506 LYEEMIGSGVSTDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSS---IFDKLI 676
           L+EEM   G+  D  T   L+  +C + ++    +LL  +L  G++PS SS   + +KL 
Sbjct: 358 LFEEMAQKGIFRDVITYNTLIQGYCKEGKIVNSKKLLEELLALGLQPSASSYTHLIEKLC 417

Query: 677 SV 682
            V
Sbjct: 418 QV 419


>ref|XP_004155892.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Cucumis sativus]
          Length = 638

 Score =  162 bits (409), Expect = 1e-37
 Identities = 90/225 (40%), Positives = 126/225 (56%), Gaps = 2/225 (0%)
 Frame = +2

Query: 23  LEERPSFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFF 202
           L +R    +   +V ++IR++ +WE  LL+  PS    D  F SE+LK+  NV LSLRFF
Sbjct: 87  LTQRKDVSEIAAEVGKVIRSKPRWEQSLLSDYPSFNFHDPSFFSELLKQLNNVFLSLRFF 146

Query: 203 NWLNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGS 382
            WL+S  EF P  +SC  +F             SF+ +  F P+  + +  IR +C  G 
Sbjct: 147 LWLSSQPEFLPHPVSCNKLFDALLEAKACVPAKSFLYSFEFSPEPASLENYIRCVCEGGL 206

Query: 383 IDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGV--STDAETV 556
           ++ AV  F   K+ G  P + TWN  F   ++ GRTD+ W LYE M+ +GV    D ETV
Sbjct: 207 VEEAVYTFDMLKEAGYRPYVETWNFAFQSCLKFGRTDLIWKLYEGMMETGVQKDVDIETV 266

Query: 557 ECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVFGK 691
             L+ AFC D++V R YE+LR  L DG+ P C+  F+KLIS F K
Sbjct: 267 GYLIQAFCNDNKVSRAYEILRQSLEDGLTP-CNDAFNKLISGFCK 310



 Score = 61.2 bits (147), Expect = 3e-07
 Identities = 33/112 (29%), Positives = 58/112 (51%)
 Frame = +2

Query: 323 FVPDSDAFQRLIRLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFW 502
           F+P+  ++  LI   C+ G++D A+ L+ K  D G   +  + N++  G    GRTD  +
Sbjct: 399 FLPNEYSYNTLIYGFCKIGNLDEAMKLYKKMLDSGYKETTLSCNTLILGLCLHGRTDEAY 458

Query: 503 GLYEEMIGSGVSTDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSS 658
             + EM    +  D  T   L+  FC + +V +  +LL+ +   G++PS SS
Sbjct: 459 DFFREMPCKNIVCDVITYNTLIQGFCREGKVLQSTDLLKELQAKGLQPSTSS 510


>ref|XP_004134313.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Cucumis sativus]
          Length = 602

 Score =  162 bits (409), Expect = 1e-37
 Identities = 90/225 (40%), Positives = 126/225 (56%), Gaps = 2/225 (0%)
 Frame = +2

Query: 23  LEERPSFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFF 202
           L +R    +   +V ++IR++ +WE  LL+  PS    D  F SE+LK+  NV LSLRFF
Sbjct: 51  LTQRKDVSEIAAEVGKVIRSKPRWEQSLLSDYPSFNFHDPSFFSELLKQLNNVFLSLRFF 110

Query: 203 NWLNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGS 382
            WL+S  EF P  +SC  +F             SF+ +  F P+  + +  IR +C  G 
Sbjct: 111 LWLSSQPEFLPHPVSCNKLFDALLEAKACVPAKSFLYSFEFSPEPASLENYIRCVCEGGL 170

Query: 383 IDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGV--STDAETV 556
           ++ AV  F   K+ G  P + TWN  F   ++ GRTD+ W LYE M+ +GV    D ETV
Sbjct: 171 VEEAVYTFDMLKEAGYRPYVETWNFAFQSCLKFGRTDLIWKLYEGMMETGVQKDVDIETV 230

Query: 557 ECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVFGK 691
             L+ AFC D++V R YE+LR  L DG+ P C+  F+KLIS F K
Sbjct: 231 GYLIQAFCNDNKVSRAYEILRQSLEDGLTP-CNDAFNKLISGFCK 274



 Score = 61.2 bits (147), Expect = 3e-07
 Identities = 33/112 (29%), Positives = 58/112 (51%)
 Frame = +2

Query: 323 FVPDSDAFQRLIRLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFW 502
           F+P+  ++  LI   C+ G++D A+ L+ K  D G   +  + N++  G    GRTD  +
Sbjct: 363 FLPNEYSYNTLIYGFCKIGNLDEAMKLYKKMLDSGYKETTLSCNTLILGLCLHGRTDEAY 422

Query: 503 GLYEEMIGSGVSTDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSS 658
             + EM    +  D  T   L+  FC + +V +  +LL+ +   G++PS SS
Sbjct: 423 DFFREMPCKNIVCDVITYNTLIQGFCREGKVLQSTDLLKELQAKGLQPSTSS 474


>ref|XP_003617724.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355519059|gb|AET00683.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 861

 Score =  159 bits (402), Expect = 8e-37
 Identities = 85/221 (38%), Positives = 127/221 (57%), Gaps = 2/221 (0%)
 Frame = +2

Query: 38  SFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFNWLNS 217
           +F   + ++C + R++ +WE+ L+++ PS   S+ KF    LK Q N  LSLRF +WL S
Sbjct: 30  NFTQTLNEICTITRSKPRWENTLISQYPSFNFSNPKFFLSYLKHQNNTFLSLRFLHWLTS 89

Query: 218 HFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSIDVAV 397
           H  FKPD  SC  +F             S +E  +FVP +D+ +  +RLL   G ++   
Sbjct: 90  HCGFKPDQSSCNALFDALVDAGAVKAAKSLLEYPDFVPKNDSLEGYVRLLGENGMVEEVF 149

Query: 398 DLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMI--GSGVSTDAETVECLVW 571
           D+F   K  G  PS +++N      ++ GRTD+ W LYE MI  G GV+ D ETV CL+ 
Sbjct: 150 DVFVSLKKVGFLPSASSFNVCLLACLKVGRTDLVWKLYELMIESGVGVNIDVETVGCLIK 209

Query: 572 AFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVFGKR 694
           AFC +++V  GYELLR VL  G+    +++F+ LI+ F K+
Sbjct: 210 AFCAENKVFNGYELLRQVLEKGLCVD-NTVFNALINGFCKQ 249


>ref|XP_004491488.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like isoform X1 [Cicer arietinum]
           gi|502099479|ref|XP_004491489.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g18950-like isoform X2 [Cicer arietinum]
          Length = 598

 Score =  155 bits (393), Expect = 9e-36
 Identities = 84/217 (38%), Positives = 124/217 (57%), Gaps = 2/217 (0%)
 Frame = +2

Query: 47  DAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFNWLNSHFE 226
           D V ++C++ RT+ +WE+ LL++ PS   SD  F    L  Q N  LSLRF +WL+SH  
Sbjct: 55  DIVDEICKITRTKPRWENTLLSQYPSFNFSDPNFFLLYLNHQNNSFLSLRFLHWLSSHCS 114

Query: 227 FKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSIDVAVDLF 406
           F PD  SC ++F             S ++   F P   + +  IR L   G ++ A+D+F
Sbjct: 115 FSPDQSSCNVLFDALVDAEACKAAKSLLDYPGFTPKPASLESYIRCLINGGMVEDALDVF 174

Query: 407 SKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGV--STDAETVECLVWAFC 580
              K  G  PS++T+N+     ++ GRTD+ W LYE M+ SG+  S D ETV  L+ AFC
Sbjct: 175 VTLKKVGFLPSVSTFNASLLACLKVGRTDLVWTLYERMLESGIVASIDVETVGYLIKAFC 234

Query: 581 VDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVFGK 691
            +++V  GYELLR VL  G+ P  +++F+ LI+ F K
Sbjct: 235 AENKVFNGYELLRQVLDKGLCPD-NTVFNSLIAGFCK 270


>ref|XP_003530332.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like isoform X1 [Glycine max]
           gi|571466579|ref|XP_006583703.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g18950-like isoform X2 [Glycine max]
          Length = 577

 Score =  150 bits (379), Expect = 4e-34
 Identities = 84/215 (39%), Positives = 123/215 (57%), Gaps = 2/215 (0%)
 Frame = +2

Query: 53  VVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFNWLNSHFEFK 232
           V ++CR+ RT+ +WE  LL++ PS    D  F    LK Q N  LSLRFF+WL S   F 
Sbjct: 51  VYEICRITRTKPRWEDTLLSQYPSFNFKDPSFFLLYLKHQNNAFLSLRFFHWLCSSCGFS 110

Query: 233 PDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSIDVAVDLFSK 412
           PD  SC ++F             S +++  F P+  + +  I+ L   G ++ AVD+   
Sbjct: 111 PDQSSCNVLFQVLVDAGAGKLAKSLLDSPGFTPEPASLEGYIQCLSGAGMVEDAVDML-- 168

Query: 413 CKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGV--STDAETVECLVWAFCVD 586
            K    CPS+ATWN+   G +R  RTD+ W LYE+M+ SGV  S + ETV  L+ AFC +
Sbjct: 169 -KRVVFCPSVATWNASLLGCLRARRTDLVWTLYEQMMESGVVASINVETVGYLIMAFCAE 227

Query: 587 DEVERGYELLRHVLIDGVKPSCSSIFDKLISVFGK 691
            +V +GYELL+ +L +G+ P  + +F++LI  F K
Sbjct: 228 YKVLKGYELLKELLENGLCPD-NVVFNELIRGFCK 261



 Score = 57.8 bits (138), Expect = 3e-06
 Identities = 34/118 (28%), Positives = 58/118 (49%)
 Frame = +2

Query: 323 FVPDSDAFQRLIRLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFW 502
           F P+   +  ++   C+ G +  A  +F   +D G   +  ++ ++ SG    GRTD   
Sbjct: 349 FQPNEYTYNVMMHGYCKIGDLAEARKIFEDMRDRGYAETTVSYGTMISGLCLHGRTDEAQ 408

Query: 503 GLYEEMIGSGVSTDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLI 676
            L+EEM   G+  D  T  CL+ A C + ++ +  +LL  +L  G++ S  S F  LI
Sbjct: 409 SLFEEMFQKGIVPDLITYNCLIKALCKEVKIVKARKLLNLLLAQGLELSVFS-FSPLI 465


>ref|XP_006289665.1| hypothetical protein CARUB_v10003224mg [Capsella rubella]
           gi|482558371|gb|EOA22563.1| hypothetical protein
           CARUB_v10003224mg [Capsella rubella]
          Length = 486

 Score =  141 bits (356), Expect = 2e-31
 Identities = 77/234 (32%), Positives = 125/234 (53%), Gaps = 5/234 (2%)
 Frame = +2

Query: 5   DASNAPLEERPS-----FEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKR 169
           D+ + P E++ +     + +    V  ++R R +W+  L++  PS   +D  F  E+LK 
Sbjct: 33  DSESKPDEQKSAGGGTTYTEMAKTVSTVMRERQRWQQTLVSDFPSFNFADPLFFRELLKS 92

Query: 170 QKNVKLSLRFFNWLNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQ 349
           Q NV  SL FF WL S++++ PD  S  L+F             SF++   F P+    +
Sbjct: 93  QNNVLFSLWFFRWLCSNYDYAPDPASLSLLFGALLDAKAVKAAKSFLDTTGFKPEPTLLE 152

Query: 350 RLIRLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGS 529
           + ++ L   G ++ A+D+++  K+ GI PS+ T NS+  G ++  + D FW L+++M+ S
Sbjct: 153 QYVKCLSEDGLVEEAIDVYNVLKEMGISPSIVTCNSVLLGCVKARKLDCFWELHQKMMES 212

Query: 530 GVSTDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVFGK 691
            V  D E + CL+ A C   EV  GYELLR  L  G+ P    ++ KLIS F K
Sbjct: 213 EV--DLERIRCLILALCDAGEVSEGYELLRQGLKQGLDPG-HDVYGKLISGFCK 263


>gb|ESW25446.1| hypothetical protein PHAVU_003G036500g [Phaseolus vulgaris]
          Length = 593

 Score =  140 bits (354), Expect = 3e-31
 Identities = 79/213 (37%), Positives = 116/213 (54%), Gaps = 2/213 (0%)
 Frame = +2

Query: 59  QVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFNWLNSHFEFKPD 238
           ++CR+ R++ +WE  LL+  PS   SD  F    L  Q N  LSLRFF+WL S   F PD
Sbjct: 54  EICRITRSKPRWEDNLLSLYPSFNFSDPSFFLLYLNHQNNALLSLRFFHWLCSSCGFSPD 113

Query: 239 DLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSIDVAVDLFSKCK 418
             S   +F             + ++     P+  + +  I+ L R G ++ AVD+    K
Sbjct: 114 QASYNALFCALVDAGACKAAKALLDCPGLTPEPASLEGYIQCLSRTGMVEDAVDML---K 170

Query: 419 DYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGV--STDAETVECLVWAFCVDDE 592
             G CPS+ TWN+     +R GRT++ W LYE+M+ SGV  S + ETV  L+  FC +++
Sbjct: 171 QVGFCPSVTTWNASLLSCLRAGRTNLVWTLYEQMMESGVVASINVETVGYLIMTFCAENK 230

Query: 593 VERGYELLRHVLIDGVKPSCSSIFDKLISVFGK 691
           V +GYELLR +L +G+ P  + +F  LI  F K
Sbjct: 231 VLKGYELLRELLENGLHPD-NVVFTALIRGFCK 262


>gb|ABD96889.1| hypothetical protein [Cleome spinosa]
          Length = 719

 Score =  138 bits (348), Expect = 1e-30
 Identities = 75/220 (34%), Positives = 124/220 (56%), Gaps = 2/220 (0%)
 Frame = +2

Query: 32  RPSFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKNVKLSLRFFNWL 211
           R ++ +    V  + R + +WE  L++  PS   +D  F  E++  Q NV LSLRFF WL
Sbjct: 55  RRNYTEMAKIVATITREKPRWEQTLVSDFPSFNFADPLFFRELVATQNNVLLSLRFFQWL 114

Query: 212 NSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLIRLLCRRGSIDV 391
            ++ +  PD +S  ++F                +   F+PDS + ++ ++ LC  G I+ 
Sbjct: 115 CTNHDCTPDPISSNMLFEALLDAKAVRAAKMVRDIAGFIPDSASLEQYVKCLCGVGFIEE 174

Query: 392 AVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGVSTDA--ETVECL 565
           A++++ + K+ GI  S+   NSI SG ++ G+T++ +  Y+EMI +G ++DA  ETV CL
Sbjct: 175 AIEVYFQLKEAGIRISIVACNSILSGCLKAGKTELLFEFYQEMIKAGTASDANTETVGCL 234

Query: 566 VWAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVF 685
           + AFC   +V RGYELL   L  G+ P  +  ++KLI+ F
Sbjct: 235 IQAFCDSGQVARGYELLNQFLKTGLDPG-NPTYNKLIAGF 273


>ref|XP_002873920.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297319757|gb|EFH50179.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 483

 Score =  137 bits (345), Expect = 3e-30
 Identities = 74/229 (32%), Positives = 123/229 (53%), Gaps = 2/229 (0%)
 Frame = +2

Query: 5   DASNAPLEERP--SFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKN 178
           D+ + P E++   S+ +    V  ++R R +W+  L++  PS   +D  F  ++LK Q N
Sbjct: 33  DSESKPDEQKSAVSYTEMAKTVSTIMRQRQRWQQTLVSDFPSFDFADPLFFRQLLKSQNN 92

Query: 179 VKLSLRFFNWLNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLI 358
           V  SL FF WL S++++ PD +S  L+F             SF++   F P+    ++ +
Sbjct: 93  VMFSLWFFRWLCSNYDYTPDSVSLNLLFGALLDGKAVKAAKSFLDTTGFKPEPTLLEQYV 152

Query: 359 RLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGVS 538
           + L   G ++ A+++++  K+ GI  S+ T NS+  G ++  + D FW L++EMI S   
Sbjct: 153 KCLSEEGLVEEAIEVYNVLKEMGISSSVVTCNSVLLGCLKARKLDRFWELHKEMIES--E 210

Query: 539 TDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVF 685
            D E + CL+ A C   EV  GYELL+  L  G+ P    ++ KLIS F
Sbjct: 211 FDLERIRCLIQALCDGGEVSEGYELLKQGLKQGLDPG-HDVYAKLISGF 258


>ref|NP_197396.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|223635758|sp|Q8GYM2.2|PP393_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g18950 gi|332005249|gb|AED92632.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 483

 Score =  134 bits (337), Expect = 3e-29
 Identities = 72/229 (31%), Positives = 122/229 (53%), Gaps = 2/229 (0%)
 Frame = +2

Query: 5   DASNAPLEERP--SFEDAVVQVCRMIRTRSQWESILLTRIPSHYLSDTKFLSEVLKRQKN 178
           D  + P E++   S+ +    V  ++R R +W+  L++  PS   +D  F  E+LK Q N
Sbjct: 33  DCESKPDEQKSAVSYTEMAKTVSTIMRERQRWQQTLVSDFPSFDFADPLFFGELLKSQNN 92

Query: 179 VKLSLRFFNWLNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLI 358
           V  SL FF WL S++++ P  +S  ++F             SF++   F P+    ++ +
Sbjct: 93  VLFSLWFFRWLCSNYDYTPGPVSLNILFGALLDGKAVKAAKSFLDTTGFKPEPTLLEQYV 152

Query: 359 RLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGVS 538
           + L   G ++ A+++++  KD GI  S+ T NS+  G ++  + D FW L++EM+ S   
Sbjct: 153 KCLSEEGLVEEAIEVYNVLKDMGISSSVVTCNSVLLGCLKARKLDRFWELHKEMVES--E 210

Query: 539 TDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVF 685
            D+E + CL+ A C   +V  GYELL+  L  G+ P    ++ KLIS F
Sbjct: 211 FDSERIRCLIRALCDGGDVSEGYELLKQGLKQGLDPG-QYVYAKLISGF 258


>gb|EMJ18641.1| hypothetical protein PRUPE_ppb015972mg [Prunus persica]
          Length = 221

 Score =  127 bits (319), Expect = 3e-27
 Identities = 72/174 (41%), Positives = 99/174 (56%), Gaps = 2/174 (1%)
 Frame = +2

Query: 176 NVKLSLRFFNWLNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRL 355
           NV LSLR F WL+SH EF PD +SC  +              SF+E+ +F P+  +F++L
Sbjct: 2   NVFLSLRCFFWLSSHNEFSPDPISCNALVSAFVETKVCNPAKSFLEHTSFSPELASFRKL 61

Query: 356 IRLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGV 535
             +              S  K+ G+CP++ TW +  SG ++ GRTD+ W LY+EMI  GV
Sbjct: 62  YSV--------------SLLKEAGVCPAIMTWKAALSGCLKVGRTDIIWKLYQEMIECGV 107

Query: 536 STDAE--TVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSSIFDKLISVFGK 691
             D E   +  L+ AFC D+ V  GYELLR VLIDG+ P  ++ F+KLIS F K
Sbjct: 108 VADVELRLLGYLIQAFCADNRVLEGYELLRQVLIDGLVPE-NAAFNKLISGFCK 160


>gb|EOY06282.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
           2, partial [Theobroma cacao]
          Length = 535

 Score =  107 bits (267), Expect = 4e-21
 Identities = 57/132 (43%), Positives = 79/132 (59%), Gaps = 2/132 (1%)
 Frame = +2

Query: 302 SFIENVNFVPDSDAFQRLIRLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRR 481
           +F+E   F P+  A +  +R LC  G ++ AV++FS     G  PS+ATWN      ++ 
Sbjct: 73  NFLEQTGFSPEPRALELYLRRLCEVGLVEEAVEMFSMLNKIGYRPSVATWNLALLAFLKV 132

Query: 482 GRTDVFWGLYEEMIGSGVSTDAE--TVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCS 655
           GR D  W LY++MI SGV  D +  TV CL+ AFC D    +GYELLR VL DG+ P  +
Sbjct: 133 GRNDFVWKLYQDMIDSGVVVDIDVATVGCLIQAFCNDGNASKGYELLRQVLEDGLVPD-N 191

Query: 656 SIFDKLISVFGK 691
            +F+KLI+ F K
Sbjct: 192 VVFNKLIAGFCK 203



 Score = 58.9 bits (141), Expect = 1e-06
 Identities = 31/115 (26%), Positives = 58/115 (50%)
 Frame = +2

Query: 314 NVNFVPDSDAFQRLIRLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTD 493
           N   +P+   +  L+  L +   ++ A  LF +  D G   +  ++N++ +G    GR D
Sbjct: 289 NKGMLPNEYTYNALLHGLYKVHDLEEAEKLFKEMLDRGYGETTVSYNTMIAGFCWHGRMD 348

Query: 494 VFWGLYEEMIGSGVSTDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKPSCSS 658
             + L+EEM   G+  D  T   L+  FC++ ++     LL+ +L+ G++PS  S
Sbjct: 349 EAYRLFEEMPEKGIVRDLITFNTLIKGFCMEGKIVESLNLLKELLVQGLQPSTPS 403


>gb|EPS61555.1| hypothetical protein M569_13242, partial [Genlisea aurea]
          Length = 360

 Score = 99.0 bits (245), Expect = 1e-18
 Identities = 53/156 (33%), Positives = 75/156 (48%)
 Frame = +2

Query: 179 VKLSLRFFNWLNSHFEFKPDDLSCKLMFXXXXXXXXXXXXXSFIENVNFVPDSDAFQRLI 358
           +  S RF+ WL S      D    KLMF             +F++   F P+    +  I
Sbjct: 3   ISSSFRFYKWLESRNGHSSDPTLRKLMFSRLAKSNGIDSARAFLKETEFEPEPRDLELYI 62

Query: 359 RLLCRRGSIDVAVDLFSKCKDYGICPSLATWNSIFSGNIRRGRTDVFWGLYEEMIGSGVS 538
           R LCR G +D AV +    +  G C SL TWN     ++   R +V W L+ EMI SG  
Sbjct: 63  RSLCRNGFVDEAVGIIKTLRTAGYCVSLQTWNLALGSSVTARRVNVTWTLHSEMIESGAE 122

Query: 539 TDAETVECLVWAFCVDDEVERGYELLRHVLIDGVKP 646
           T+ ET+  L+ AFC++  + + Y LLR +L  G  P
Sbjct: 123 TNVETIGHLIRAFCLEKNLAKAYGLLRQLLDAGHFP 158


Top