BLASTX nr result

ID: Cocculus23_contig00027391 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00027391
         (823 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI26162.3| unnamed protein product [Vitis vinifera]              186   9e-45
ref|XP_006419767.1| hypothetical protein CICLE_v10007051mg, part...   181   3e-43
ref|XP_006489230.1| PREDICTED: pentatricopeptide repeat-containi...   179   8e-43
ref|XP_004491488.1| PREDICTED: pentatricopeptide repeat-containi...   170   7e-40
ref|XP_006339424.1| PREDICTED: pentatricopeptide repeat-containi...   166   1e-38
ref|XP_004297017.1| PREDICTED: pentatricopeptide repeat-containi...   160   5e-37
ref|XP_003617724.1| Pentatricopeptide repeat-containing protein ...   159   9e-37
ref|XP_004229464.1| PREDICTED: pentatricopeptide repeat-containi...   157   3e-36
ref|XP_003530332.1| PREDICTED: pentatricopeptide repeat-containi...   153   7e-35
ref|XP_004155892.1| PREDICTED: pentatricopeptide repeat-containi...   151   3e-34
ref|XP_004134313.1| PREDICTED: pentatricopeptide repeat-containi...   151   3e-34
ref|XP_007035355.1| Tetratricopeptide repeat-like superfamily pr...   147   4e-33
ref|XP_007153452.1| hypothetical protein PHAVU_003G036500g [Phas...   145   2e-32
gb|ABD96889.1| hypothetical protein [Cleome spinosa]                  140   4e-31
ref|XP_006289665.1| hypothetical protein CARUB_v10003224mg [Caps...   139   2e-30
ref|XP_002873920.1| pentatricopeptide repeat-containing protein ...   135   2e-29
ref|NP_197396.1| pentatricopeptide repeat-containing protein [Ar...   131   3e-28
ref|XP_006844767.1| hypothetical protein AMTR_s00016p00258280 [A...   105   2e-20
ref|XP_007035356.1| Tetratricopeptide repeat-like superfamily pr...    82   3e-13
ref|XP_007217442.1| hypothetical protein PRUPE_ppb015972mg [Prun...    75   2e-11

>emb|CBI26162.3| unnamed protein product [Vitis vinifera]
          Length = 636

 Score =  186 bits (472), Expect = 9e-45
 Identities = 102/245 (41%), Positives = 145/245 (59%)
 Frame = -1

Query: 736 LKTQ*LIMARTHQSSIFNLIGKNVHKRHSFDRFDQSRHFTVGGGEIKCKHEETMIIVEDG 557
           +KTQ  +   +  SS+ + + +N + R           +    G+++  H+ T    +  
Sbjct: 40  VKTQTSMARPSSSSSVISFLRQNPNSRIRNLGVVSGNQY----GDVEGAHQHT----QQQ 91

Query: 556 QQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLC 377
           Q   +I ++V +I R+RPRWEQTLLS+FPS    +P  +   ++HQ NAL+S+RFF+WL 
Sbjct: 92  QHLEEIVKRVSDITRTRPRWEQTLLSDFPSFNFLDPTFLSHFVEHQKNALISLRFFHWLS 151

Query: 376 THQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCK 197
           +     GFSP+  S SCN LFD LV A A   A S L S +  F P+  SLE+ IRCLCK
Sbjct: 152 SQS---GFSPD--SSSCNVLFDALVEAGACNAAKSFLDSTN--FNPKPASLEAYIRCLCK 204

Query: 196 DGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDT 17
            G VEEAI VF ++K IGV  +++ WNS L  S+R  + D  W+LYGEM+ S +V D  T
Sbjct: 205 GGLVEEAISVFGQLKGIGVCASIATWNSVLRGSVRAGRIDFVWELYGEMVESSVVADVHT 264

Query: 16  AGYLI 2
            GYL+
Sbjct: 265 VGYLV 269


>ref|XP_006419767.1| hypothetical protein CICLE_v10007051mg, partial [Citrus clementina]
           gi|557521640|gb|ESR33007.1| hypothetical protein
           CICLE_v10007051mg, partial [Citrus clementina]
          Length = 540

 Score =  181 bits (459), Expect = 3e-43
 Identities = 97/192 (50%), Positives = 126/192 (65%), Gaps = 2/192 (1%)
 Frame = -1

Query: 571 IVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRF 392
           I E  Q +T+IA++VC+I R++PRWEQTLLS+FPS    +P    + +K QNN LLSIRF
Sbjct: 1   IKESQQLYTEIAKQVCKITRTKPRWEQTLLSDFPSFNFNDPLFFREFLKQQNNMLLSIRF 60

Query: 391 FNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLI 212
           F WL +H    GFSP+    SCN LFD LV A+A KVA   L  A   F P   SLE  I
Sbjct: 61  FQWLHSHY---GFSPD--LDSCNVLFDSLVEARAFKVAKEFL--AITGFSPNPNSLELYI 113

Query: 211 RCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLV 32
           +CLC+ G +EEA  VFS++K +GV  ++  WNSAL   ++V +TD+ W LY +M+ SG+V
Sbjct: 114 QCLCESGMIEEAFRVFSKLKEMGVFGSIKTWNSALLGCIKVDRTDLLWKLYHDMIESGIV 173

Query: 31  GDGD--TAGYLI 2
            D D  T GYLI
Sbjct: 174 ADVDAETIGYLI 185


>ref|XP_006489230.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Citrus sinensis]
          Length = 589

 Score =  179 bits (455), Expect = 8e-43
 Identities = 95/190 (50%), Positives = 124/190 (65%), Gaps = 2/190 (1%)
 Frame = -1

Query: 565 EDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFN 386
           E  Q +T+IA++VC+I R++PRWEQTLLS+FPS    +P    + +K QNN LLSIRFF 
Sbjct: 35  ESQQLYTEIAKQVCKITRTKPRWEQTLLSDFPSFNFNDPLFFREFLKQQNNMLLSIRFFQ 94

Query: 385 WLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRC 206
           WL +H    GFSP+    SCN LFD LV A+A KVA   L      F P   SLE  I+C
Sbjct: 95  WLHSHY---GFSPD--LDSCNVLFDSLVEARAFKVAMDFLDITG--FSPNPNSLELYIQC 147

Query: 205 LCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGD 26
           LC+ G +EEA  VFS++K +GV  ++  WNSAL   ++V +TD+ W LY +M+ SG+V D
Sbjct: 148 LCESGMIEEAFRVFSKLKEMGVFGSIKTWNSALLGCIKVDRTDLLWKLYHDMIESGIVAD 207

Query: 25  GD--TAGYLI 2
            D  T GYLI
Sbjct: 208 VDAETIGYLI 217


>ref|XP_004491488.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like isoform X1 [Cicer arietinum]
           gi|502099479|ref|XP_004491489.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g18950-like isoform X2 [Cicer arietinum]
          Length = 598

 Score =  170 bits (430), Expect = 7e-40
 Identities = 89/187 (47%), Positives = 122/187 (65%), Gaps = 2/187 (1%)
 Frame = -1

Query: 556 QQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLC 377
           Q+ TDI  ++C+I R++PRWE TLLS++PS    +P      + HQNN+ LS+RF +WL 
Sbjct: 51  QKLTDIVDEICKITRTKPRWENTLLSQYPSFNFSDPNFFLLYLNHQNNSFLSLRFLHWLS 110

Query: 376 THQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCK 197
           +H     FSP+    SCN LFD LV A+A K A SLL   +  F P+  SLES IRCL  
Sbjct: 111 SH---CSFSPDQ--SSCNVLFDALVDAEACKAAKSLLD--YPGFTPKPASLESYIRCLIN 163

Query: 196 DGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVG--DG 23
            G VE+A+DVF  +K +G +P+VS +N++L A L+V +TD+ W LY  M+ SG+V   D 
Sbjct: 164 GGMVEDALDVFVTLKKVGFLPSVSTFNASLLACLKVGRTDLVWTLYERMLESGIVASIDV 223

Query: 22  DTAGYLI 2
           +T GYLI
Sbjct: 224 ETVGYLI 230


>ref|XP_006339424.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Solanum tuberosum]
          Length = 601

 Score =  166 bits (420), Expect = 1e-38
 Identities = 81/185 (43%), Positives = 123/185 (66%)
 Frame = -1

Query: 556 QQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLC 377
           Q F +IA+ VC+++R+RPRWEQ LLS+FP+    +P+   +++K Q N +LS+RF  WL 
Sbjct: 55  QSFAEIAKDVCKVIRTRPRWEQILLSDFPTVNFTDPRFYTEVLKAQKNVMLSLRFHFWLS 114

Query: 376 THQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCK 197
           +     GFS + +S     +F  LV AKA+  A    Q+ +  FVP+ + LE+ I+CLC+
Sbjct: 115 SQN---GFSRDQFSDE--VIFSGLVQAKAASAAKCFRQNMN--FVPQPSCLEAYIQCLCE 167

Query: 196 DGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDT 17
           +G +E+A+DVF+ ++ +G  P++ IWNSAL  S+R  +TDI W LY +M  SG+V D DT
Sbjct: 168 NGLIEDALDVFTELRGVGHCPSLRIWNSALSDSIRAGRTDIVWKLYEDMTESGVVADVDT 227

Query: 16  AGYLI 2
            G+LI
Sbjct: 228 IGHLI 232


>ref|XP_004297017.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Fragaria vesca subsp. vesca]
          Length = 382

 Score =  160 bits (405), Expect = 5e-37
 Identities = 94/238 (39%), Positives = 131/238 (55%)
 Frame = -1

Query: 715 MARTHQSSIFNLIGKNVHKRHSFDRFDQSRHFTVGGGEIKCKHEETMIIVEDGQQFTDIA 536
           MAR   SSI   + +N H R   D   Q RH +V          ET    +D    T +A
Sbjct: 1   MARA-SSSILTFLRQNPHARQKPDT--QIRHLSV----------ET----KDCSDLTQVA 43

Query: 535 RKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMG 356
           +++C ++R++PRWE TL SE+PS    +P  + +++K Q+N  LS+RFF WL T +   G
Sbjct: 44  QQICHVIRTKPRWENTLSSEYPSSNFSDPLFIREVVKQQSNVFLSVRFFLWLGTRE---G 100

Query: 355 FSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEA 176
           FSP+  S  CNA+F  LV   A   A S ++  H  F PE   LES  RCL + G V+EA
Sbjct: 101 FSPDPIS--CNAVFGALVEGNACSAAKSFIK--HTGFSPEPVLLESYARCLWEAGRVKEA 156

Query: 175 IDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDTAGYLI 2
             VF R+K  GV P +  WN+AL   ++  +TD+ W LY EMM  G+  D +T   L+
Sbjct: 157 SSVFKRLKEAGVCPGIGTWNAALSGCIKARRTDMVWKLYQEMMEYGVAADVETVECLV 214


>ref|XP_003617724.1| Pentatricopeptide repeat-containing protein [Medicago truncatula]
           gi|355519059|gb|AET00683.1| Pentatricopeptide
           repeat-containing protein [Medicago truncatula]
          Length = 861

 Score =  159 bits (403), Expect = 9e-37
 Identities = 89/188 (47%), Positives = 117/188 (62%), Gaps = 3/188 (1%)
 Frame = -1

Query: 556 QQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLC 377
           Q FT    ++C I RS+PRWE TL+S++PS    NPK     +KHQNN  LS+RF +WL 
Sbjct: 29  QNFTQTLNEICTITRSKPRWENTLISQYPSFNFSNPKFFLSYLKHQNNTFLSLRFLHWLT 88

Query: 376 THQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCK 197
           +H    GF P+    SCNALFD LV A A K A SLL+  +  FVP++ SLE  +R L +
Sbjct: 89  SH---CGFKPD--QSSCNALFDALVDAGAVKAAKSLLE--YPDFVPKNDSLEGYVRLLGE 141

Query: 196 DGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVG---D 26
           +G VEE  DVF  +K +G +P+ S +N  L A L+V +TD+ W LY  M+ SG VG   D
Sbjct: 142 NGMVEEVFDVFVSLKKVGFLPSASSFNVCLLACLKVGRTDLVWKLYELMIESG-VGVNID 200

Query: 25  GDTAGYLI 2
            +T G LI
Sbjct: 201 VETVGCLI 208


>ref|XP_004229464.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Solanum lycopersicum]
          Length = 601

 Score =  157 bits (398), Expect = 3e-36
 Identities = 78/183 (42%), Positives = 119/183 (65%)
 Frame = -1

Query: 550 FTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTH 371
           F +IA+ VC+++R+RPRWEQ LLS+FP+    +P+   +++K Q N +LS+RF  WL + 
Sbjct: 57  FAEIAKDVCKVIRTRPRWEQILLSDFPTVNFTDPRFYTEVLKAQKNIMLSLRFHFWLSSQ 116

Query: 370 QDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDG 191
               GFS + +S     +F  LV AKA+  A    Q+    FVP+   LE+ I+CLC++G
Sbjct: 117 N---GFSRDQFSDE--VIFSGLVQAKAASAAKCFRQNMI--FVPQPNCLEAYIQCLCENG 169

Query: 190 FVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDTAG 11
            +E+A+DVF+ ++++G  P++ IWNSAL  S+R  +TD  W LY +M  SG+V D  T G
Sbjct: 170 LIEDALDVFTELRSVGHCPSLRIWNSALSDSIRAGRTDTVWKLYEDMTESGVVADVGTIG 229

Query: 10  YLI 2
           +LI
Sbjct: 230 HLI 232


>ref|XP_003530332.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like isoform X1 [Glycine max]
           gi|571466579|ref|XP_006583703.1| PREDICTED:
           pentatricopeptide repeat-containing protein
           At5g18950-like isoform X2 [Glycine max]
          Length = 577

 Score =  153 bits (387), Expect = 7e-35
 Identities = 84/179 (46%), Positives = 113/179 (63%), Gaps = 2/179 (1%)
 Frame = -1

Query: 532 KVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGF 353
           ++C I R++PRWE TLLS++PS    +P      +KHQNNA LS+RFF+WLC+     GF
Sbjct: 53  EICRITRTKPRWEDTLLSQYPSFNFKDPSFFLLYLKHQNNAFLSLRFFHWLCS---SCGF 109

Query: 352 SPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAI 173
           SP+    SCN LF  LV A A K+A SLL S    F PE  SLE  I+CL   G VE+A+
Sbjct: 110 SPD--QSSCNVLFQVLVDAGAGKLAKSLLDSP--GFTPEPASLEGYIQCLSGAGMVEDAV 165

Query: 172 DVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVG--DGDTAGYLI 2
           D+   +K +   P+V+ WN++L   LR  +TD+ W LY +MM SG+V   + +T GYLI
Sbjct: 166 DM---LKRVVFCPSVATWNASLLGCLRARRTDLVWTLYEQMMESGVVASINVETVGYLI 221


>ref|XP_004155892.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Cucumis sativus]
          Length = 638

 Score =  151 bits (381), Expect = 3e-34
 Identities = 84/210 (40%), Positives = 122/210 (58%), Gaps = 4/210 (1%)
 Frame = -1

Query: 619 TVGG--GEIKCKHEETMIIVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPK 446
           T GG  G  + +  E ++ +   +  ++IA +V +++RS+PRWEQ+LLS++PS    +P 
Sbjct: 68  TNGGNNGREEIESSEKLLNLTQRKDVSEIAAEVGKVIRSKPRWEQSLLSDYPSFNFHDPS 127

Query: 445 CVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLL 266
              +++K  NN  LS+RFF WL +  + +      +  SCN LFD L+ AKA   A S L
Sbjct: 128 FFSELLKQLNNVFLSLRFFLWLSSQPEFL-----PHPVSCNKLFDALLEAKACVPAKSFL 182

Query: 265 QSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVP 86
            S   +F PE  SLE+ IRC+C+ G VEEA+  F  +K  G  P V  WN A  + L+  
Sbjct: 183 YS--FEFSPEPASLENYIRCVCEGGLVEEAVYTFDMLKEAGYRPYVETWNFAFQSCLKFG 240

Query: 85  KTDIFWDLYGEMMASGLVGDGD--TAGYLI 2
           +TD+ W LY  MM +G+  D D  T GYLI
Sbjct: 241 RTDLIWKLYEGMMETGVQKDVDIETVGYLI 270


>ref|XP_004134313.1| PREDICTED: pentatricopeptide repeat-containing protein
           At5g18950-like [Cucumis sativus]
          Length = 602

 Score =  151 bits (381), Expect = 3e-34
 Identities = 84/210 (40%), Positives = 122/210 (58%), Gaps = 4/210 (1%)
 Frame = -1

Query: 619 TVGG--GEIKCKHEETMIIVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPK 446
           T GG  G  + +  E ++ +   +  ++IA +V +++RS+PRWEQ+LLS++PS    +P 
Sbjct: 32  TNGGNNGREEIESSEKLLNLTQRKDVSEIAAEVGKVIRSKPRWEQSLLSDYPSFNFHDPS 91

Query: 445 CVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLL 266
              +++K  NN  LS+RFF WL +  + +      +  SCN LFD L+ AKA   A S L
Sbjct: 92  FFSELLKQLNNVFLSLRFFLWLSSQPEFL-----PHPVSCNKLFDALLEAKACVPAKSFL 146

Query: 265 QSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVP 86
            S   +F PE  SLE+ IRC+C+ G VEEA+  F  +K  G  P V  WN A  + L+  
Sbjct: 147 YS--FEFSPEPASLENYIRCVCEGGLVEEAVYTFDMLKEAGYRPYVETWNFAFQSCLKFG 204

Query: 85  KTDIFWDLYGEMMASGLVGDGD--TAGYLI 2
           +TD+ W LY  MM +G+  D D  T GYLI
Sbjct: 205 RTDLIWKLYEGMMETGVQKDVDIETVGYLI 234


>ref|XP_007035355.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
           1 [Theobroma cacao] gi|508714384|gb|EOY06281.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 1 [Theobroma cacao]
          Length = 610

 Score =  147 bits (372), Expect = 4e-33
 Identities = 79/177 (44%), Positives = 111/177 (62%)
 Frame = -1

Query: 544 DIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQD 365
           DI ++VC+I R+ PRWE+ LLS+FPS    +P    ++++ Q N  LS+ FF+WL +  D
Sbjct: 63  DIVKQVCKITRTIPRWEENLLSKFPSFNFSDPVFFRELLRQQENVFLSLCFFHWLRSKYD 122

Query: 364 MMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFV 185
              FSP+    SCN LFD+LV A A K A + L+     F PE  +LE  +R LC+ G V
Sbjct: 123 ---FSPD--LDSCNVLFDKLVEANACKAARNFLEQTG--FSPEPRALELYLRRLCEVGLV 175

Query: 184 EEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDTA 14
           EEA+++FS +  IG  P+V+ WN AL A L+V + D  W LY +M+ SG+V D D A
Sbjct: 176 EEAVEMFSMLNKIGYRPSVATWNLALLAFLKVGRNDFVWKLYQDMIDSGVVVDIDVA 232


>ref|XP_007153452.1| hypothetical protein PHAVU_003G036500g [Phaseolus vulgaris]
           gi|561026806|gb|ESW25446.1| hypothetical protein
           PHAVU_003G036500g [Phaseolus vulgaris]
          Length = 593

 Score =  145 bits (365), Expect = 2e-32
 Identities = 81/179 (45%), Positives = 111/179 (62%), Gaps = 2/179 (1%)
 Frame = -1

Query: 532 KVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGF 353
           ++C I RS+PRWE  LLS +PS    +P      + HQNNALLS+RFF+WLC+     GF
Sbjct: 54  EICRITRSKPRWEDNLLSLYPSFNFSDPSFFLLYLNHQNNALLSLRFFHWLCS---SCGF 110

Query: 352 SPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAI 173
           SP+    S NALF  LV A A K A +LL        PE  SLE  I+CL + G VE+A+
Sbjct: 111 SPD--QASYNALFCALVDAGACKAAKALLDCP--GLTPEPASLEGYIQCLSRTGMVEDAV 166

Query: 172 DVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVG--DGDTAGYLI 2
           D+   +K +G  P+V+ WN++L + LR  +T++ W LY +MM SG+V   + +T GYLI
Sbjct: 167 DM---LKQVGFCPSVTTWNASLLSCLRAGRTNLVWTLYEQMMESGVVASINVETVGYLI 222


>gb|ABD96889.1| hypothetical protein [Cleome spinosa]
          Length = 719

 Score =  140 bits (354), Expect = 4e-31
 Identities = 72/180 (40%), Positives = 112/180 (62%)
 Frame = -1

Query: 556 QQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLC 377
           + +T++A+ V  I R +PRWEQTL+S+FPS    +P    +++  QNN LLS+RFF WLC
Sbjct: 56  RNYTEMAKIVATITREKPRWEQTLVSDFPSFNFADPLFFRELVATQNNVLLSLRFFQWLC 115

Query: 376 THQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCK 197
           T+ D    +P+  S   N LF+ L+ AKA + A  +   A   F+P+S SLE  ++CLC 
Sbjct: 116 TNHDC---TPDPISS--NMLFEALLDAKAVRAAKMVRDIA--GFIPDSASLEQYVKCLCG 168

Query: 196 DGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDT 17
            GF+EEAI+V+ ++K  G+  ++   NS L   L+  KT++ ++ Y EM+ +G   D +T
Sbjct: 169 VGFIEEAIEVYFQLKEAGIRISIVACNSILSGCLKAGKTELLFEFYQEMIKAGTASDANT 228


>ref|XP_006289665.1| hypothetical protein CARUB_v10003224mg [Capsella rubella]
           gi|482558371|gb|EOA22563.1| hypothetical protein
           CARUB_v10003224mg [Capsella rubella]
          Length = 486

 Score =  139 bits (349), Expect = 2e-30
 Identities = 80/220 (36%), Positives = 126/220 (57%)
 Frame = -1

Query: 700 QSSIFNLIGKNVHKRHSFDRFDQSRHFTVGGGEIKCKHEETMIIVEDGQQFTDIARKVCE 521
           QS + +L+ K + +  +     Q R   +   + + K +E       G  +T++A+ V  
Sbjct: 5   QSYLISLVRKRIRQNPNA----QMRSLALESRDSESKPDEQKS-AGGGTTYTEMAKTVST 59

Query: 520 IVRSRPRWEQTLLSEFPSQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPED 341
           ++R R RW+QTL+S+FPS    +P    +++K QNN L S+ FF WLC++ D   ++P+ 
Sbjct: 60  VMRERQRWQQTLVSDFPSFNFADPLFFRELLKSQNNVLFSLWFFRWLCSNYD---YAPDP 116

Query: 340 YSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFS 161
            S S   LF  L+ AKA K A S L +    F PE T LE  ++CL +DG VEEAIDV++
Sbjct: 117 ASLSL--LFGALLDAKAVKAAKSFLDTTG--FKPEPTLLEQYVKCLSEDGLVEEAIDVYN 172

Query: 160 RIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMAS 41
            +K +G+ P++   NS L   ++  K D FW+L+ +MM S
Sbjct: 173 VLKEMGISPSIVTCNSVLLGCVKARKLDCFWELHQKMMES 212


>ref|XP_002873920.1| pentatricopeptide repeat-containing protein [Arabidopsis lyrata
           subsp. lyrata] gi|297319757|gb|EFH50179.1|
           pentatricopeptide repeat-containing protein [Arabidopsis
           lyrata subsp. lyrata]
          Length = 483

 Score =  135 bits (340), Expect = 2e-29
 Identities = 78/198 (39%), Positives = 116/198 (58%)
 Frame = -1

Query: 634 QSRHFTVGGGEIKCKHEETMIIVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLF 455
           Q R  TV   + + K +E    V     +T++A+ V  I+R R RW+QTL+S+FPS    
Sbjct: 23  QIRSLTVESRDSESKPDEQKSAVS----YTEMAKTVSTIMRQRQRWQQTLVSDFPSFDFA 78

Query: 454 NPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAH 275
           +P    Q++K QNN + S+ FF WLC++ D   ++P+  S S N LF  L+  KA K A 
Sbjct: 79  DPLFFRQLLKSQNNVMFSLWFFRWLCSNYD---YTPD--SVSLNLLFGALLDGKAVKAAK 133

Query: 274 SLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASL 95
           S L +    F PE T LE  ++CL ++G VEEAI+V++ +K +G+  +V   NS L   L
Sbjct: 134 SFLDTT--GFKPEPTLLEQYVKCLSEEGLVEEAIEVYNVLKEMGISSSVVTCNSVLLGCL 191

Query: 94  RVPKTDIFWDLYGEMMAS 41
           +  K D FW+L+ EM+ S
Sbjct: 192 KARKLDRFWELHKEMIES 209


>ref|NP_197396.1| pentatricopeptide repeat-containing protein [Arabidopsis thaliana]
           gi|223635758|sp|Q8GYM2.2|PP393_ARATH RecName:
           Full=Pentatricopeptide repeat-containing protein
           At5g18950 gi|332005249|gb|AED92632.1| pentatricopeptide
           repeat-containing protein [Arabidopsis thaliana]
          Length = 483

 Score =  131 bits (329), Expect = 3e-28
 Identities = 77/198 (38%), Positives = 114/198 (57%)
 Frame = -1

Query: 634 QSRHFTVGGGEIKCKHEETMIIVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQTLF 455
           Q R  TV   + + K +E    V     +T++A+ V  I+R R RW+QTL+S+FPS    
Sbjct: 23  QIRSLTVESRDCESKPDEQKSAVS----YTEMAKTVSTIMRERQRWQQTLVSDFPSFDFA 78

Query: 454 NPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAH 275
           +P    +++K QNN L S+ FF WLC++ D   ++P     S N LF  L+  KA K A 
Sbjct: 79  DPLFFGELLKSQNNVLFSLWFFRWLCSNYD---YTPGPV--SLNILFGALLDGKAVKAAK 133

Query: 274 SLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASL 95
           S L +    F PE T LE  ++CL ++G VEEAI+V++ +K +G+  +V   NS L   L
Sbjct: 134 SFLDTT--GFKPEPTLLEQYVKCLSEEGLVEEAIEVYNVLKDMGISSSVVTCNSVLLGCL 191

Query: 94  RVPKTDIFWDLYGEMMAS 41
           +  K D FW+L+ EM+ S
Sbjct: 192 KARKLDRFWELHKEMVES 209


>ref|XP_006844767.1| hypothetical protein AMTR_s00016p00258280 [Amborella trichopoda]
           gi|548847238|gb|ERN06442.1| hypothetical protein
           AMTR_s00016p00258280 [Amborella trichopoda]
          Length = 696

 Score =  105 bits (263), Expect = 2e-20
 Identities = 60/156 (38%), Positives = 88/156 (56%)
 Frame = -1

Query: 469 SQTLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKA 290
           S     P CV +++   ++ +LS RFF  LC    + G+ P+D  +SCN + + LV A+ 
Sbjct: 177 SHDFLKPNCVHKVLGILDDTILSFRFFKRLCY---LEGYKPDD--ESCNMVLETLVSAEE 231

Query: 289 SKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSA 110
            K A ++L  A  +F P+   LE+LI    +   + EA  VF  +K  G  P++ +WNS 
Sbjct: 232 YKTAKTVLSLA--EFRPQPQLLEALILGFSRTTKISEAFGVFLEMKRHGFSPSLLVWNSM 289

Query: 109 LYASLRVPKTDIFWDLYGEMMASGLVGDGDTAGYLI 2
           L   LR  +TD  W++YGEMM SG+ GD  T GYLI
Sbjct: 290 LLGFLRDLRTDRLWEIYGEMMESGVAGDATTYGYLI 325



 Score = 89.0 bits (219), Expect = 2e-15
 Identities = 49/150 (32%), Positives = 85/150 (56%)
 Frame = -1

Query: 643 RFDQSRHFTVGGGEIKCKHEETMIIVEDGQQFTDIARKVCEIVRSRPRWEQTLLSEFPSQ 464
           R +   H    G   K    E      +  + +++A KVC+ +R+   WE++L+S+F S 
Sbjct: 31  RSNNQCHEKEEGEASKSSFSEGQEKANEEAELSEVAEKVCQRLRTHRLWEKSLVSDF-SP 89

Query: 463 TLFNPKCVEQIIKHQNNALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASK 284
            +F   C+ +++++QNN  LS RFF+WLC  +   GF P+  ++SC+ +F  LV+AKA K
Sbjct: 90  YIFETTCIHKVLRNQNNPFLSFRFFSWLCLQE---GFKPD--TQSCDTIFGVLVNAKAWK 144

Query: 283 VAHSLLQSAHHQFVPESTSLESLIRCLCKD 194
            A ++L      F P+ + LE+++   C+D
Sbjct: 145 AAITVLNLV--DFRPQPSVLEAMVAGFCRD 172


>ref|XP_007035356.1| Tetratricopeptide repeat-like superfamily protein, putative isoform
           2, partial [Theobroma cacao] gi|508714385|gb|EOY06282.1|
           Tetratricopeptide repeat-like superfamily protein,
           putative isoform 2, partial [Theobroma cacao]
          Length = 535

 Score = 81.6 bits (200), Expect = 3e-13
 Identities = 48/108 (44%), Positives = 65/108 (60%)
 Frame = -1

Query: 337 SKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPESTSLESLIRCLCKDGFVEEAIDVFSR 158
           SKS N+L    V A A K A + L+     F PE  +LE  +R LC+ G VEEA+++FS 
Sbjct: 56  SKSNNSL----VEANACKAARNFLEQTG--FSPEPRALELYLRRLCEVGLVEEAVEMFSM 109

Query: 157 IKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYGEMMASGLVGDGDTA 14
           +  IG  P+V+ WN AL A L+V + D  W LY +M+ SG+V D D A
Sbjct: 110 LNKIGYRPSVATWNLALLAFLKVGRNDFVWKLYQDMIDSGVVVDIDVA 157


>ref|XP_007217442.1| hypothetical protein PRUPE_ppb015972mg [Prunus persica]
           gi|462413592|gb|EMJ18641.1| hypothetical protein
           PRUPE_ppb015972mg [Prunus persica]
          Length = 221

 Score = 75.5 bits (184), Expect = 2e-11
 Identities = 52/140 (37%), Positives = 65/140 (46%), Gaps = 2/140 (1%)
 Frame = -1

Query: 415 NALLSIRFFNWLCTHQDMMGFSPEDYSKSCNALFDRLVHAKASKVAHSLLQSAHHQFVPE 236
           N  LS+R F WL +H +   FSP+  S  CNAL    V  K    A S L+  H  F PE
Sbjct: 2   NVFLSLRCFFWLSSHNE---FSPDPIS--CNALVSAFVETKVCNPAKSFLE--HTSFSPE 54

Query: 235 STSLESLIRCLCKDGFVEEAIDVFSRIKAIGVIPTVSIWNSALYASLRVPKTDIFWDLYG 56
             S   L                 S +K  GV P +  W +AL   L+V +TDI W LY 
Sbjct: 55  LASFRKLYSV--------------SLLKEAGVCPAIMTWKAALSGCLKVGRTDIIWKLYQ 100

Query: 55  EMMASGLVGDGD--TAGYLI 2
           EM+  G+V D +    GYLI
Sbjct: 101 EMIECGVVADVELRLLGYLI 120

