BLASTX nr result

ID: Catharanthus22_contig00004056 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus22_contig00004056
         (1663 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002269582.1| PREDICTED: uncharacterized protein LOC100242...   171   7e-40
ref|XP_006349877.1| PREDICTED: uncharacterized protein DDB_G0271...   158   8e-36
ref|XP_004252959.1| PREDICTED: uncharacterized protein LOC101251...   156   3e-35
gb|EMJ24418.1| hypothetical protein PRUPE_ppa007163mg [Prunus pe...   155   6e-35
ref|XP_002314335.2| hypothetical protein POPTR_0010s00560g [Popu...   154   1e-34
ref|XP_006840357.1| hypothetical protein AMTR_s00045p00115420 [A...   142   3e-31
gb|AFK37850.1| unknown [Lotus japonicus]                              140   2e-30
ref|XP_006393443.1| hypothetical protein EUTSA_v10011577mg [Eutr...   139   4e-30
ref|XP_002891423.1| predicted protein [Arabidopsis lyrata subsp....   138   6e-30
ref|XP_004297065.1| PREDICTED: uncharacterized protein LOC101305...   137   2e-29
ref|XP_002519339.1| soluble diacylglycerol acyltransferase [Rici...   133   2e-28
gb|AAD49767.1|AC007932_15 ESTs gb|N97074, gb|T13943 and gb|R8996...   132   4e-28
ref|XP_006444611.1| hypothetical protein CICLE_v10020833mg [Citr...   132   6e-28
gb|EOX95446.1| Uncharacterized protein TCM_004941 [Theobroma cacao]   125   5e-26
ref|NP_175264.2| uncharacterized protein [Arabidopsis thaliana] ...   125   7e-26
ref|XP_002301551.1| hypothetical protein POPTR_0002s18840g [Popu...   124   9e-26
gb|EXB75649.1| hypothetical protein L484_026126 [Morus notabilis]     123   3e-25
ref|XP_006304457.1| hypothetical protein CARUB_v10011099mg [Caps...   123   3e-25
ref|XP_003609890.1| hypothetical protein MTR_4g124080 [Medicago ...   122   4e-25
gb|ACJ86204.1| unknown [Medicago truncatula] gi|388508412|gb|AFK...   121   1e-24

>ref|XP_002269582.1| PREDICTED: uncharacterized protein LOC100242564 [Vitis vinifera]
            gi|147865786|emb|CAN81152.1| hypothetical protein
            VITISV_020818 [Vitis vinifera]
          Length = 362

 Score =  171 bits (434), Expect = 7e-40
 Identities = 140/392 (35%), Positives = 193/392 (49%), Gaps = 19/392 (4%)
 Frame = -1

Query: 1429 VVSRIIPTISAGTGVGSQPPNASFSGLFSSAAVRSERIS-------RASRKSLKLARPSN 1271
            VV R +P  S G G+ +Q   +SFSG+   +  R    S       R SR +++  +PS 
Sbjct: 6    VVFRQVPFFS-GAGIDTQSSKSSFSGVSVDSGNRISAFSELRLLGSRDSRVAVRPRKPSG 64

Query: 1270 FVFSDESYLDYYSSGRKVRCGTCXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGS--DPD 1097
            F   DES+L YY      RCG                 +LK LS+DL+L +++G   D D
Sbjct: 65   F--RDESHLKYYYESP--RCGA--KKDKDKVTTKKKSKLLKALSKDLSLFSDLGFGVDSD 118

Query: 1096 NALVDPVKAKMISEAAEFLQVELQKLWSEGKELKPR-KEDQSRHKETLMQNMPGCETXXX 920
              L   VK KMISEAAE L  +LQ++ +E KELK R KE++++ K T M+    CE+   
Sbjct: 119  EGLFGEVKGKMISEAAEVLLKQLQQMRAEEKELKRRRKEEKAKLKATRMETGVVCESSSS 178

Query: 919  XXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELLACIQSPQVKS 740
                        C  V+DM  L      + I D SQ +                      
Sbjct: 179  ESSDSE------CGEVVDMTHLRSGAVVEPIKDESQPV---------------------- 210

Query: 739  VHQRSADAEEIKFATGMEIPCYPAGVPDPAQG--ISQKRSIDTISLNGSKTTPA----TK 578
                      I+ A G+E P     V    +G   +   +  +++++ ++ T       K
Sbjct: 211  ----------IQEAKGLEEPSLLQPVTTTLKGECCTAVNTATSVAVDQNEKTQVMGAGAK 260

Query: 577  KIEVCIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISNNF---DEE 407
            +IEVC+GGKCKKSGA AL+E F R + G E AV GCKCMGKCR GPNVR+ N+    + E
Sbjct: 261  RIEVCMGGKCKKSGAEALLEEFERVV-GVEGAVVGCKCMGKCRVGPNVRVLNSIEGVEAE 319

Query: 406  GMHDHAKLPAPAANQLCIGVGLEDVDLIVDNF 311
            GM D  + P   AN LC+GVGL+DV +IV NF
Sbjct: 320  GMDDSVRTP---ANPLCVGVGLQDVGIIVSNF 348


>ref|XP_006349877.1| PREDICTED: uncharacterized protein DDB_G0271670-like [Solanum
            tuberosum]
          Length = 413

 Score =  158 bits (399), Expect = 8e-36
 Identities = 140/423 (33%), Positives = 197/423 (46%), Gaps = 34/423 (8%)
 Frame = -1

Query: 1447 MDAFGVVVSRIIPTISAGTGVGSQPPNASFSGLFSSAAVRSERISRASRKSLKLARPSNF 1268
            M+A   V+ R      AG G+ + P N            RS R S  S KS+K       
Sbjct: 1    MEASAGVLRRFPSVTGAGAGISNNPLN------------RSCRFSCFSLKSVK-KELVGC 47

Query: 1267 VFSDESYLDYYSSGRK--VRCGTCXXXXXXXXXXXXXXXM--LKGLSRDLALLAEMGSD- 1103
             F DE  L+YYSSGR   +RCG                 M  LKGL+++L+ L EMG   
Sbjct: 48   EFYDEGCLEYYSSGRGGIIRCGKKKKEKDMGELKKTKKKMKLLKGLTKNLSNLNEMGLGF 107

Query: 1102 -PDNALVDPVKAKMISEAAEFLQVELQKLWSEGKELK-PRKEDQSRHK-----ETLMQNM 944
              D  LVD V+ K ISEAA+ L  +LQ+L +E K LK  RKE++++ K       +  N 
Sbjct: 108  GCDVDLVDQVQGKTISEAADLLLGQLQQLKAEEKALKRKRKEEKAQMKMKVAASQVQSNT 167

Query: 943  PGC--ETXXXXXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENE------AS 788
              C   +              +C+ ++DM  L        IP+A     EN       ++
Sbjct: 168  RSCAMSSSSSSSSESSESSDDECQNLVDMKSLKIGTLALTIPEACDRALENATLNPFLST 227

Query: 787  TRQELLACIQSPQVKSVHQRSADAEEIKFATGME--------IPCYPAGVPDPAQGISQK 632
               +    I S  + S  + +     ++F+   +          C+   V       S +
Sbjct: 228  PEVDTTVEISSMPLTSTEESTEKTTSLEFSVPEQKGECCLEASDCHIGNVGSSITLGSSR 287

Query: 631  RSIDTISLNGSKTTPATKKIEVCIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKC 452
             ++ +I+   + T   TK+IEVC+GGKCK+ GA AL+E F R +AG EAAVSGCKCMGKC
Sbjct: 288  SNVSSIAAATTTTAEGTKRIEVCMGGKCKRLGAGALLEEFQR-IAGIEAAVSGCKCMGKC 346

Query: 451  RDGPNVRI------SNNFDEEGMHDHAKLPAPAANQLCIGVGLEDVDLIVDNFVRSQTKF 290
            + GPNVR+      S++  + G           +N LCIGVGLEDV LI  N +    + 
Sbjct: 347  KVGPNVRVSGCSSSSSDAFQAGDSVAVSSTPTTSNSLCIGVGLEDVSLIAANLLGRYQEV 406

Query: 289  GLA 281
            GLA
Sbjct: 407  GLA 409


>ref|XP_004252959.1| PREDICTED: uncharacterized protein LOC101251763 [Solanum
            lycopersicum]
          Length = 409

 Score =  156 bits (394), Expect = 3e-35
 Identities = 140/424 (33%), Positives = 197/424 (46%), Gaps = 35/424 (8%)
 Frame = -1

Query: 1447 MDAFGVVVSRIIPTISAGTGVGSQPPNASFSGLFSSAAVRSERISRASRKSLKLARPSNF 1268
            M+A   V+ R      AG GV S P N S S            +SR S KS+        
Sbjct: 1    MEASAGVLRRFPSVTGAGVGVSSNPLNRSCS------------VSRVSLKSMGCG----- 43

Query: 1267 VFSDESYLDYYSSGRK--VRCG---TCXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGSD 1103
             F DE  L+YYSSGR   +RCG                    +LKGL+++L+ L  MG  
Sbjct: 44   -FYDEGCLEYYSSGRGGIIRCGKKKNKEKDMAELKKTKKKMKLLKGLTKNLSNLNGMGLG 102

Query: 1102 --PDNALVDPVKAKMISEAAEFLQVELQKLWSEGKELK-PRKEDQSRHK-----ETLMQN 947
               D  LVD V+ K ISEAAE L  +LQ+L +E KELK  RKE++++ K       +  N
Sbjct: 103  FGCDVDLVDQVQGKTISEAAELLLGQLQQLKAEEKELKRKRKEEKAQMKMKVAASEVQSN 162

Query: 946  MPGC--ETXXXXXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPEN-------- 797
               C   +              +C+ ++DM  L      + IP+A     +N        
Sbjct: 163  TRSCAMSSSSSSSSESSDSSDDECQNLVDMKSLKIGTLAQTIPEACDRALDNATLNPSLS 222

Query: 796  --EASTRQELLACIQSPQVKSVHQRSA---DAEEIKFATGMEIPCYPAGVPDPAQGISQK 632
              E  T  E+ +   +   +S  + ++      E K    +E      G    +     +
Sbjct: 223  TPEVDTTVEISSMPSTTTEESTGKTTSFEFSVPEQKGECCLEASDCHIGNVGSSITPGTR 282

Query: 631  RSIDTISLNGSKTTPATKKIEVCIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKC 452
             ++ +I+   + T   TK+IEVC+GGKCK+ GA AL+E F R +AG EAAVSGCKCMGKC
Sbjct: 283  SNVSSIAAATTTTAEGTKRIEVCMGGKCKRLGAGALLEEFQR-VAGIEAAVSGCKCMGKC 341

Query: 451  RDGPNVRIS-------NNFDEEGMHDHAKLPAPAANQLCIGVGLEDVDLIVDNFVRSQTK 293
            + GPNV++S       ++  + G           +N LCIGVGLEDV LI  N +    +
Sbjct: 342  KVGPNVKVSGCSSSSISDAFQAGDSVAVSSAPTTSNSLCIGVGLEDVSLIAANLLGRYQE 401

Query: 292  FGLA 281
             GLA
Sbjct: 402  VGLA 405


>gb|EMJ24418.1| hypothetical protein PRUPE_ppa007163mg [Prunus persica]
          Length = 379

 Score =  155 bits (391), Expect = 6e-35
 Identities = 132/386 (34%), Positives = 182/386 (47%), Gaps = 12/386 (3%)
 Frame = -1

Query: 1429 VVSRIIPTISAGTGVGSQPPNASFSGLFSSAAVRSERISRASRKSLKLARPSNFVFSDES 1250
            VVSR +P  S      ++   ASF G F    V+  R    S ++ +     N  F D+ 
Sbjct: 6    VVSRQVPCFSGAEIDTTRSSTASFHGEFR---VQGRRNFGVSLRTRRFGGQLNSGFCDDG 62

Query: 1249 YLDYYSSGRKVRCGTCXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGSDP---DNALVDP 1079
            ++ YY  G   RCG                 +LKGLS+DL+  ++MG  P      LV  
Sbjct: 63   HVQYYHVGP--RCG-----FKKEKEIKKKLKLLKGLSKDLSASSQMGFGPLDYQKGLVAQ 115

Query: 1078 VKAKMISEAAEFLQV-ELQKLWSEGKELK-PRKEDQSRHKETLMQNMPGCETXXXXXXXX 905
             + K+ISE AE L + +L++L +E KELK  RKE+++R K   M+NM   E+        
Sbjct: 116  FQEKLISEDAEALLLKQLEQLRAEEKELKRKRKEEKARLKAERMKNMVDSESSSSSSSES 175

Query: 904  XXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELLACIQSPQVKSVHQRS 725
                   C  ++DMN L      K I D+ Q     E +     +  + S    + HQ +
Sbjct: 176  SESE---CGELVDMNRLRSEAPAKPILDSLQPFNHQEGA-----VLTLPSSLAIATHQEN 227

Query: 724  ADAE---EIKFATGMEIPCYPAGVPDPAQGISQKRSID-TISLNGSKTTPATKKIEVCIG 557
               E   E   +   E  C           +S   SI    +L+ S    +  KIEVC+G
Sbjct: 228  TTVEHVTEFGISQNQEAECCSG---TSTSCVSSSGSIGHNDALSSSVMGASALKIEVCMG 284

Query: 556  GKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISN---NFDEEGMHDHAK 386
             KCKKSG  AL+E F R + G E  V GCKCMGKC++GPN+R+SN       EG  D  +
Sbjct: 285  NKCKKSGGGALLEEFERVM-GVEGTVVGCKCMGKCKNGPNIRVSNTVGGIQSEGTDDSVR 343

Query: 385  LPAPAANQLCIGVGLEDVDLIVDNFV 308
            +P    N L IGVGLEDV LIV N +
Sbjct: 344  VP---TNPLYIGVGLEDVSLIVANLI 366


>ref|XP_002314335.2| hypothetical protein POPTR_0010s00560g [Populus trichocarpa]
            gi|550328807|gb|EEF00506.2| hypothetical protein
            POPTR_0010s00560g [Populus trichocarpa]
          Length = 351

 Score =  154 bits (388), Expect = 1e-34
 Identities = 123/384 (32%), Positives = 175/384 (45%), Gaps = 15/384 (3%)
 Frame = -1

Query: 1387 VGSQPPNASFSGLFSSAAVRSERISRASR--KSLKLARPSNFVFSDESYLDYY--SSGRK 1220
            +  Q P  S    F +   R E  SR+ R      +A   +  F D+ +L YY  S G  
Sbjct: 7    ISRQSPCFSSDSGFGNHQTRCEAFSRSRRIVNGFGVASKVHTGFRDKGHLKYYYGSEGLV 66

Query: 1219 VRCGTCXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGSDPDNALVDPVKAKMISEAAEFL 1040
            VRCG                       + L LL E+   P N  VD V+A +I+EA + L
Sbjct: 67   VRCG------------GKKKDKETSTKKKLKLLKELSVVPHNDSVDDVQANLIAEATQLL 114

Query: 1039 QVELQKLWSEGKELK-PRKEDQSRHKETLMQNMPGCETXXXXXXXXXXXXXXDCEAVMDM 863
              +L +L +E KELK  +KE++++ K   M+ M  CE+               C  V+DM
Sbjct: 115  MKQLGQLRAEEKELKRKKKEEKAKLKAVKMKTMLDCESSSSSESSDSE-----CGEVIDM 169

Query: 862  NLLDCRPKTKQIPDASQAIPENEASTRQELLACIQS--PQVKSVH--------QRSADAE 713
              L      + I    Q++ + E ++    L   +S   ++   H        +    A 
Sbjct: 170  KRLRNEAVAEPIIGELQSVAQEEPTSILPALLTQESNVTEINGYHDHGLGIHGEECGGAR 229

Query: 712  EIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTPATKKIEVCIGGKCKKSGA 533
                +  + + C P                     +   +  + K+IEVC+G KCKKSG 
Sbjct: 230  STSCSNAIRVSCNPTS-------------------SSMMSGTSDKRIEVCMGNKCKKSGG 270

Query: 532  LALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISNNFDEEGMHDHAKLPAPAANQLCI 353
            +AL+E F +A+ G   AV GCKCMGKCRDGPNVRI  +   EG+ D  ++  PAAN LCI
Sbjct: 271  VALLEEFEKAV-GIGGAVVGCKCMGKCRDGPNVRILKS-GNEGVDDSVRI--PAANPLCI 326

Query: 352  GVGLEDVDLIVDNFVRSQTKFGLA 281
            GVGLEDVD+IV NF   +    LA
Sbjct: 327  GVGLEDVDVIVANFFGKELSVALA 350


>ref|XP_006840357.1| hypothetical protein AMTR_s00045p00115420 [Amborella trichopoda]
            gi|548842075|gb|ERN02032.1| hypothetical protein
            AMTR_s00045p00115420 [Amborella trichopoda]
          Length = 398

 Score =  142 bits (359), Expect = 3e-31
 Identities = 112/348 (32%), Positives = 160/348 (45%), Gaps = 18/348 (5%)
 Frame = -1

Query: 1264 FSDESYLDYYSSGRKVRCGTCXXXXXXXXXXXXXXXMLKGLSRDLALLAEM--GSDPDNA 1091
            FSD S++ YY  G   R                   +LKGL +DL   + M  G++ +  
Sbjct: 76   FSDASHVMYYCEGATCRSNV--------KEEKKRKKLLKGLEKDLFSFSGMFNGAEIEGN 127

Query: 1090 LVDPVKAKMISEAAEFLQVELQKLWSEGKELKPRKEDQSRHKETLMQNMPGCETXXXXXX 911
            L   VK KMISEA   L  +LQ+L +E KE+K R++     ++ +       ++      
Sbjct: 128  LAGEVKGKMISEATNVLLTQLQQLKAEQKEIKKRRKAMKAAQKEMRSPREAVDSSSSSSE 187

Query: 910  XXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQE--------------- 776
                     CE V+ MN L     TK  PD    + E   S  +                
Sbjct: 188  SSDSE----CE-VVTMNRLRNSLLTKPSPDPQAPVEEATLSLTESEPVEETTLSLTESEP 242

Query: 775  -LLACIQSPQVKSVHQRSADAEEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGS 599
             LL  +Q   + +  ++SA +  +            +G P    G S   +    S+ G 
Sbjct: 243  ILLYPVQELSIPAQLEKSAVSSAMNQIRNKPAETCCSGSPLHCSG-SPLPTAAVESIGGE 301

Query: 598  KTTPATKKIEVCIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISNN 419
            +     ++I+VC+GGKCKKSGA  LME F R+L G++ +V GCKCMGKC+DGPNVR+S+N
Sbjct: 302  RMEEKGERIQVCMGGKCKKSGAAELMEAFQRSL-GSQGSVVGCKCMGKCKDGPNVRVSSN 360

Query: 418  FDEEGMHDHAKLPAPAANQLCIGVGLEDVDLIVDNFVRSQTKFGLAAS 275
              EEG+             LCIGVGLEDV  IV N++      GL A+
Sbjct: 361  NGEEGV----------LKALCIGVGLEDVGSIVANYLGGNKDVGLVAA 398


>gb|AFK37850.1| unknown [Lotus japonicus]
          Length = 334

 Score =  140 bits (352), Expect = 2e-30
 Identities = 118/388 (30%), Positives = 174/388 (44%), Gaps = 8/388 (2%)
 Frame = -1

Query: 1447 MDAFGVVVSRIIPTISAGTGVGSQPPNASFSGLFSSAAVRSERISRASRKSLKLARPSNF 1268
            MD  G ++ ++     AGT   S                R  R+  A+R + ++   S F
Sbjct: 1    MDVSGTILRQLTYVTGAGTNAHS----------------RGARVW-AARPTARVVMGSGF 43

Query: 1267 VFSDESYLDYYSSGRKVRCGTCXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGS--DPDN 1094
              SDE +L YY   +K +                   +LK +S+ ++L  E+G   DP+ 
Sbjct: 44   --SDEGHLQYYQDKKKGK--------PVVLTAKNKVKLLKRVSKGMSLFDELGFALDPNQ 93

Query: 1093 -ALVDPVKAKMISEAAEFLQVELQKLWSEGKELKPRKEDQSRHK--ETLMQNMPGCETXX 923
             AL++ ++  + S++ E L  EL+KL +E KELK +++D+ + K   + M+  P CE+  
Sbjct: 94   RALLNDLQTNLTSDSGEGLLKELEKLRAEEKELKRKRKDEKKAKLKASKMKTGPDCESSS 153

Query: 922  XXXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELLACIQSPQVK 743
                        +C+ V+DMN    R      P    A+P +  +               
Sbjct: 154  SSSSESSESSASECDEVVDMNTF--RGGVAVAPAPPPALPPSGPA--------------- 196

Query: 742  SVHQRSADAEEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTPATKKIEVC 563
                  A   E      + I     G+ +   G+S               T   K+IEVC
Sbjct: 197  ------ALLPETFVGGDVSIGSVSVGLKNENHGVS---------------TAPQKRIEVC 235

Query: 562  IGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISNNFDE---EGMHDH 392
            +G KCKKSGA ALM+ F   +     AV GCKCMGKC+  PNVRI N  D    EG++D 
Sbjct: 236  MGNKCKKSGAAALMQQFESVVGVEGGAVVGCKCMGKCKSAPNVRIQNAVDHELAEGLNDS 295

Query: 391  AKLPAPAANQLCIGVGLEDVDLIVDNFV 308
             K+P   AN LCIGVGLEDVD +V  F+
Sbjct: 296  VKVP---ANPLCIGVGLEDVDAVVARFL 320


>ref|XP_006393443.1| hypothetical protein EUTSA_v10011577mg [Eutrema salsugineum]
            gi|557090021|gb|ESQ30729.1| hypothetical protein
            EUTSA_v10011577mg [Eutrema salsugineum]
          Length = 377

 Score =  139 bits (350), Expect = 4e-30
 Identities = 119/389 (30%), Positives = 186/389 (47%), Gaps = 10/389 (2%)
 Frame = -1

Query: 1447 MDAFGVVVSRIIPTISAGTGVGSQPPNASFSGL--FSSAAVRSERISRASRKSLKLARPS 1274
            M+  GVV+ R IP++S     G       +SGL   S  +  S  +S  +RK   +    
Sbjct: 1    MEVSGVVL-RQIPSVSGAVADGR------YSGLRSVSKFSGNSRTVSFQTRKFHGVM--C 51

Query: 1273 NFVFSDESYLDYYSSGRKVRCG----TCXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGS 1106
            N  F+D+ +++YY   +  RCG                     +LK LS++L + + +G 
Sbjct: 52   NNEFADKEHMNYYF--KPTRCGGEKEKVKLMEKEKKALKKKAKVLKSLSKNLNMFSSIGF 109

Query: 1105 --DPDNALVDPVKAKMISEAAEFLQVELQKLWSEGKELKP-RKEDQSRHKETLMQNMPGC 935
              +P+  LV  ++ K ISEA E L  +L++L +E K LK  RKE++++ K   M      
Sbjct: 110  GLNPEAGLVSEIQNKSISEATEILVKQLEQLKAEEKLLKKQRKEEKAKAKAMKMTTDMDS 169

Query: 934  ETXXXXXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELL-ACIQ 758
            E+                  V+DMN L  R K K I +  Q  PE+  +T   +    I 
Sbjct: 170  ESSSSSSESSDSDCGKG--KVVDMNTL--RNKEKPILEPLQ--PESTLATLPRIQETLIS 223

Query: 757  SPQVKSVHQRSADAEEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTPATK 578
            + +++   Q + +A ++              + +P Q +    ++  + L         K
Sbjct: 224  NKKLEEDAQITGEALQLALLQSAAASTVFPLMANPGQRLKTVEAVSVVGL-------PLK 276

Query: 577  KIEVCIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISNNFDEEGMH 398
            ++EVC+GGKCKKSG   L+  F RA+ G E +   CKCMGKCRDGPNVR+ N   +  M 
Sbjct: 277  RVEVCMGGKCKKSGGAVLLNEFQRAMTGMEGSAVACKCMGKCRDGPNVRVVNETYDSMMT 336

Query: 397  DHAKLPAPAANQLCIGVGLEDVDLIVDNF 311
            D  K P   +  +C+GVGL+DV+ IV +F
Sbjct: 337  DSVKTP---SKTVCVGVGLQDVETIVTSF 362


>ref|XP_002891423.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297337265|gb|EFH67682.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 360

 Score =  138 bits (348), Expect = 6e-30
 Identities = 116/385 (30%), Positives = 186/385 (48%), Gaps = 6/385 (1%)
 Frame = -1

Query: 1447 MDAFGVVVSRIIPTISAGTGVGSQPPNASFSGLFSSAAVRSERISRASRKSLKLARPSNF 1268
            M+  GVV+ R IP +S+G+ V      + FSG   +   R+++               N 
Sbjct: 1    MEVSGVVL-RQIPCVSSGS-VACLRLVSEFSGNTRTVGFRTKKFRGIV---------CNN 49

Query: 1267 VFSDESYLDYYSSGRKVRCGT----CXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGS-- 1106
             F+D+ ++ YY      RCG                     +LK LS++L + + +G   
Sbjct: 50   EFADKGHMSYYIE--PTRCGEEKEKVKVMEKEKKALKKKEKVLKSLSKNLNMFSSIGFGL 107

Query: 1105 DPDNALVDPVKAKMISEAAEFLQVELQKLWSEGKELKPRKEDQSRHKETLMQNMPGCETX 926
            DP+  LV  ++ K ISEA E L  +L++L +E K LK +++++ + K   M+ M   ++ 
Sbjct: 108  DPEAGLVGEIQTKTISEATEILVKQLEQLKAEEKLLKKQRKEE-KAKAKAMKKMTEMDSE 166

Query: 925  XXXXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELLACIQSPQV 746
                           + V+DM+ L  R KTK + +  Q  PE   +T   +    QS + 
Sbjct: 167  SSSSSESSDSDRDKGK-VVDMSSL--RNKTKPVLEPLQ--PEATVATLPRIQEDAQSCK- 220

Query: 745  KSVHQRSADAEEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTPATKKIEV 566
                  +++A +I   T    P     +P+P Q +    ++  + L         K++EV
Sbjct: 221  -----NTSEALQIALQTSTVFP----SMPNPVQTLKTVEAVSVVGL-------PLKRVEV 264

Query: 565  CIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISNNFDEEGMHDHAK 386
            C+GGKCKKSG   L++ F RA+ G + +   CKCMGKCRDGPNVR+ N  D   M D  +
Sbjct: 265  CMGGKCKKSGGALLLDEFQRAMTGFQGSAVACKCMGKCRDGPNVRVVNETDSV-MTDSVR 323

Query: 385  LPAPAANQLCIGVGLEDVDLIVDNF 311
             P   +  +C+GVGL+DV+ IV +F
Sbjct: 324  TP---SKTVCVGVGLQDVETIVTSF 345


>ref|XP_004297065.1| PREDICTED: uncharacterized protein LOC101305747 [Fragaria vesca
            subsp. vesca]
          Length = 354

 Score =  137 bits (344), Expect = 2e-29
 Identities = 121/385 (31%), Positives = 178/385 (46%), Gaps = 11/385 (2%)
 Frame = -1

Query: 1429 VVSRIIPTISAGTGVGSQPPNASFSGLFSSAAVRSERISRASRKSLKLARPSNFVFSDES 1250
            +VSR IP  S G G+ ++  + SFSG            S  SR+        N  F D  
Sbjct: 6    LVSRQIPCFS-GAGIDTRS-SGSFSGEVHFPGRNEYGASLRSRRV-----DMNSGFCDNG 58

Query: 1249 YLDYYSSGRKVRCGTCXXXXXXXXXXXXXXXMLKGLSR-DLALLAEMGSDPDNALVDPVK 1073
            +++YY SG +     C               +LKGL+    +     G D +      V+
Sbjct: 59   HVEYYYSGPR-----CGGGNKKEKEIKKKLKLLKGLAELSSSTQISTGLDSEEGSAAQVQ 113

Query: 1072 AKMISEAAEFLQVELQKLWSEGKELKPRK-EDQSRHKETLMQNMPGCETXXXXXXXXXXX 896
             K+ISE AE L  +L+++ +E KE+K +K E+++R K   M+ M  CE+           
Sbjct: 114  RKLISEGAEALLQQLEQVRAEEKEMKKKKKEEKARLKAERMKTMKDCESSSSSESSESE- 172

Query: 895  XXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELLACIQSPQVKSVHQRSADA 716
                C  V+DMN +       ++P   QA+ E      QE +A +  P + +  Q     
Sbjct: 173  ----CGEVIDMNRV-----RSEVP--KQAMLEGVKPLAQERVALLTQPSLVTTLQEDKCC 221

Query: 715  EEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTPATK-----KIEVCIGGK 551
                 + G                     S D+I LN + ++ AT+     K+EVC+G K
Sbjct: 222  NGTSTSFG---------------------SSDSIDLNNASSSSATEASIVAKVEVCMGKK 260

Query: 550  CKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISNNFD----EEGMHDHAKL 383
            CK SG + L+E F R L G + +V GCKC+GKC++GPNVR+ N+ +    EEG  D  + 
Sbjct: 261  CKTSGGVELLEEFER-LMGVQGSVVGCKCLGKCKNGPNVRVVNSVNGIKSEEGTEDSVRN 319

Query: 382  PAPAANQLCIGVGLEDVDLIVDNFV 308
            P    N L IGVGLEDV LIV N +
Sbjct: 320  P---TNPLYIGVGLEDVSLIVANLM 341


>ref|XP_002519339.1| soluble diacylglycerol acyltransferase [Ricinus communis]
            gi|223541654|gb|EEF43203.1| soluble diacylglycerol
            acyltransferase [Ricinus communis]
          Length = 332

 Score =  133 bits (335), Expect = 2e-28
 Identities = 114/373 (30%), Positives = 158/373 (42%), Gaps = 10/373 (2%)
 Frame = -1

Query: 1399 AGTGVGSQPPNASFSGLFSSAAVRSERISRASRKSLKLARPSNFVFSDESYLDYYSSGRK 1220
            +G G  S     S  G   S  V S R     RK+      S   F D  +L YY  G  
Sbjct: 4    SGLGCFSSAATPSLCGAVDSGGVSSLR----PRKAFHRVSDSCLGFRDNGHLQYYCQGGF 59

Query: 1219 VRCGTCXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGSDPDNALVDPVKAKMISEAAEFL 1040
            VRCG                 ++K LS D ++        +NAL+   ++  + EAA+ L
Sbjct: 60   VRCG-----GGNKKSIKKKLKLVKSLSEDFSMFPH-----NNALLHQPQSISLQEAAQGL 109

Query: 1039 QVELQKLWSEGKELKPRKEDQSRHK-ETLMQNMPGCETXXXXXXXXXXXXXXDCEAVMDM 863
              +LQ+L ++ KELK +K+ + + K ++   +    E+                +  +  
Sbjct: 110  MKQLQELRAKEKELKRQKKQEKKAKLKSESSSSSSSESSSDSERGEVIHMSRFRDETIPA 169

Query: 862  NLLDCRPKTKQIPDASQAI-------PENEASTRQELLACIQSPQVKSVHQRSADAEEIK 704
             L    P T   P ++  +       P +  ST  E   C+                   
Sbjct: 170  ALPQLHPLTHHHPTSTLPVSPTQECNPMDYTSTHHEKRCCVG------------------ 211

Query: 703  FATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTP--ATKKIEVCIGGKCKKSGAL 530
                      P+   D A G       D  +   S  T   +  +IEVC+G KCKKSG  
Sbjct: 212  ----------PSTGADNAVG-------DCCNDRNSSMTEELSANRIEVCMGNKCKKSGGA 254

Query: 529  ALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISNNFDEEGMHDHAKLPAPAANQLCIG 350
            AL+E F R L G EAAV GCKCMG CRDGPNVR+ N+  +    D  + P   +N LCIG
Sbjct: 255  ALLEEFQRVL-GVEAAVVGCKCMGNCRDGPNVRVRNSVQDRNTDDSVRTP---SNPLCIG 310

Query: 349  VGLEDVDLIVDNF 311
            VGLEDVD+IV NF
Sbjct: 311  VGLEDVDVIVANF 323


>gb|AAD49767.1|AC007932_15 ESTs gb|N97074, gb|T13943 and gb|R89965 come from this gene
            [Arabidopsis thaliana]
          Length = 360

 Score =  132 bits (332), Expect = 4e-28
 Identities = 112/385 (29%), Positives = 182/385 (47%), Gaps = 6/385 (1%)
 Frame = -1

Query: 1447 MDAFGVVVSRIIPTISAGTGVGSQPPNASFSGLFSSAAVRSERISRASRKSLKLARPSNF 1268
            M+  GVV+ R IP +S+G+  G +  +  FSG   +   R+ R               N 
Sbjct: 1    MEVSGVVL-RQIPCVSSGSVAGLRLVS-EFSGNTRTVGFRTRRFRGIV---------CNN 49

Query: 1267 VFSDESYLDYYSSGRKVRCGT----CXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGS-- 1106
             F+D+ +++YY      RCG                     +LK LS++L + + +G   
Sbjct: 50   EFADKGHVNYYIE--PTRCGEEKEKVKVMEKEKKALKKKAKVLKSLSKNLDMFSSIGFGL 107

Query: 1105 DPDNALVDPVKAKMISEAAEFLQVELQKLWSEGKELKPRKEDQSRHKETLMQNMPGCETX 926
            DP+  LV  ++ K ISEA E L  +L++L +E K LK +++++ + K   M+ M   ++ 
Sbjct: 108  DPEAGLVGEIQTKTISEATEILVKQLEQLKAEEKILKKQRKEE-KAKAKAMKKMTEMDSE 166

Query: 925  XXXXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELLACIQSPQV 746
                          C+    +++   R K K + +  Q  PE   +T    L  IQ   +
Sbjct: 167  SSSSSESSDSD---CDKGKVVDMSSLRNKAKPVLEPLQ--PEATVAT----LPRIQEDAI 217

Query: 745  KSVHQRSADAEEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTPATKKIEV 566
                + +++A +I   T    P     + +P Q +    ++  + L          ++EV
Sbjct: 218  SC--KNTSEALQIALQTSTIFP----SMANPGQTLKTVEAVSVVGL-------PLNRVEV 264

Query: 565  CIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISNNFDEEGMHDHAK 386
            C+GGKCK+SG   L++ F RA+ G E +   CKCMGKCRDGPNVR+    D   M D  +
Sbjct: 265  CMGGKCKRSGGALLLDEFQRAMTGFEGSAVACKCMGKCRDGPNVRVVKETDAV-MTDSVR 323

Query: 385  LPAPAANQLCIGVGLEDVDLIVDNF 311
             P   +  LC+GVGL+DV+ IV +F
Sbjct: 324  TP---SKTLCVGVGLQDVETIVTSF 345


>ref|XP_006444611.1| hypothetical protein CICLE_v10020833mg [Citrus clementina]
            gi|568878887|ref|XP_006492415.1| PREDICTED: RNA
            polymerase-associated protein RTF1 homolog [Citrus
            sinensis] gi|557546873|gb|ESR57851.1| hypothetical
            protein CICLE_v10020833mg [Citrus clementina]
          Length = 355

 Score =  132 bits (331), Expect = 6e-28
 Identities = 111/338 (32%), Positives = 157/338 (46%)
 Frame = -1

Query: 1321 RISRASRKSLKLARPSNFVFSDESYLDYYSSGRKVRCGTCXXXXXXXXXXXXXXXMLKGL 1142
            R+S  +R S    R  +  FSD  +  YY S RK                      +K L
Sbjct: 34   RVSERTRNSF---RRLSSRFSDCGHQQYYVSPRKETKKKEKEKSSEIKSVRKKLKFMKRL 90

Query: 1141 SRDLALLAEMGSDPDNALVDPVKAKMISEAAEFLQVELQKLWSEGKELKPRKEDQSRHKE 962
            S DL  L   G++  ++L +  +   ISEA + LQ +L +L SE KELK   +++    +
Sbjct: 91   SSDLLPLEAFGNEDASSLRNEER---ISEAVQVLQAQLLQLRSEQKELKKMMKEKKAQIK 147

Query: 961  TLMQNMPGCETXXXXXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTR 782
            T M+   G                 D     + +++D  P    I   S+ I       +
Sbjct: 148  TTMKERKGNRKSGSSSSSSSESSDSD-----NGDVIDTIPLRSNILKLSENI-----EVQ 197

Query: 781  QELLACIQSPQVKSVHQRSADAEEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNG 602
            +E++    +    S+ Q+    E+  F +G E         D   GI ++          
Sbjct: 198  KEIIVPALAKSATSLIQQDK-LEDCCFRSGGECRNLNGRSNDQGYGIVERT--------- 247

Query: 601  SKTTPATKKIEVCIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISN 422
              TT  TK+IEVC+GGKCKK GA AL+E F R  AG E  VS CKCMGKCRDGPN+R+SN
Sbjct: 248  --TTTRTKRIEVCMGGKCKKLGAGALLEEFERK-AGAECDVSMCKCMGKCRDGPNLRVSN 304

Query: 421  NFDEEGMHDHAKLPAPAANQLCIGVGLEDVDLIVDNFV 308
            + D +          P+ N LCIGVGLEDVD+I+ N +
Sbjct: 305  SHDGDSAIRIQGYAKPSINSLCIGVGLEDVDMILANIL 342


>gb|EOX95446.1| Uncharacterized protein TCM_004941 [Theobroma cacao]
          Length = 401

 Score =  125 bits (314), Expect = 5e-26
 Identities = 127/420 (30%), Positives = 186/420 (44%), Gaps = 40/420 (9%)
 Frame = -1

Query: 1447 MDAFGVVVSRIIPTISAGTGVGSQPPNASFSGLFSSAAVRSERISRASRKSLKLARPSNF 1268
            M+  G+V  R+ P + AG  V        F G FS         SR S +        + 
Sbjct: 1    METSGIVYRRV-PRL-AGIRV-------DFGGRFSRELNLGVGDSRVSVRPRNSCGKLSC 51

Query: 1267 VFSDESYLDYYSSGR------KVRCGTCXXXXXXXXXXXXXXXMLKGLSRDLALLAEM-- 1112
             FSD  ++ YY S R      K +  +C                +K LS+DL++L  M  
Sbjct: 52   QFSDSGHIQYYVSPRAGAAKKKEKEKSCEIKRVKTKLKF-----IKRLSKDLSMLPRMAD 106

Query: 1111 GSDPDNALVDPVKAKMISEAAEFLQVELQKLWSEGKELKPR-KEDQSRHKETLMQNMPGC 935
            G D    L+  VK  MISEA++ L  +LQ+L SE KELK + KE+++R K TL ++    
Sbjct: 107  GEDIGIGLMGEVKTTMISEASDVLLAQLQQLRSEQKELKGKLKEERARLKATLEKSESSS 166

Query: 934  ETXXXXXXXXXXXXXXDCEAVMDMNLL---------------DCRPKTKQIPDASQAIPE 800
             +               C  V+DM  L               D   K  +  +A+  + E
Sbjct: 167  SSESSDSE---------CGKVVDMKRLRSNALKPLQDLEAPSDNALKRTEDMEAAPTVTE 217

Query: 799  N--------EASTRQELLACIQSPQVKSVHQRSADAEEIKFATG-----MEIPCYPAGVP 659
                     E   +   LA +++P V      ++   E++ +       ++ PC   G  
Sbjct: 218  EGTLANSVMELGNKHRALADMEAPMVTEEATLASSLMELENSDSSPQIRIQEPCSGFG-- 275

Query: 658  DPAQGISQKRSIDTISLNGSKTTPATKKIEVCIGGKCKKSGALALMEGFCRALAGNEAAV 479
              ++  S     D IS N      +TKKIE+C+GGKCKK GA AL+E F R + G E  V
Sbjct: 276  --SECCSSNGFKDDIS-NRIVEGASTKKIEICMGGKCKKLGAAALLEEFERKV-GAEGTV 331

Query: 478  SGCKCMGKCRDGPNVRISNN---FDEEGMHDHAKLPAPAANQLCIGVGLEDVDLIVDNFV 308
             GCKCMGKC+  PNVR+ ++    +   + D  ++     N  C  VGL+DVDLIV N +
Sbjct: 332  VGCKCMGKCKTAPNVRVCDSPSGIEARSIQDSIRI---GINPTCTSVGLQDVDLIVANLL 388


>ref|NP_175264.2| uncharacterized protein [Arabidopsis thaliana]
            gi|12744987|gb|AAK06873.1|AF344322_1 unknown protein
            [Arabidopsis thaliana] gi|332194151|gb|AEE32272.1|
            uncharacterized protein AT1G48300 [Arabidopsis thaliana]
          Length = 285

 Score =  125 bits (313), Expect = 7e-26
 Identities = 89/283 (31%), Positives = 144/283 (50%), Gaps = 2/283 (0%)
 Frame = -1

Query: 1153 LKGLSRDLALLAEMGS--DPDNALVDPVKAKMISEAAEFLQVELQKLWSEGKELKPRKED 980
            LK LS++L + + +G   DP+  LV  ++ K ISEA E L  +L++L +E K LK ++++
Sbjct: 15   LKSLSKNLDMFSSIGFGLDPEAGLVGEIQTKTISEATEILVKQLEQLKAEEKILKKQRKE 74

Query: 979  QSRHKETLMQNMPGCETXXXXXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPE 800
            + + K   M+ M   ++               C+    +++   R K K + +  Q  PE
Sbjct: 75   E-KAKAKAMKKMTEMDSESSSSSESSDSD---CDKGKVVDMSSLRNKAKPVLEPLQ--PE 128

Query: 799  NEASTRQELLACIQSPQVKSVHQRSADAEEIKFATGMEIPCYPAGVPDPAQGISQKRSID 620
               +T    L  IQ   +    + +++A +I   T    P     + +P Q +    ++ 
Sbjct: 129  ATVAT----LPRIQEDAISC--KNTSEALQIALQTSTIFP----SMANPGQTLKTVEAVS 178

Query: 619  TISLNGSKTTPATKKIEVCIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGP 440
             + L          ++EVC+GGKCK+SG   L++ F RA+ G E +   CKCMGKCRDGP
Sbjct: 179  VVGL-------PLNRVEVCMGGKCKRSGGALLLDEFQRAMTGFEGSAVACKCMGKCRDGP 231

Query: 439  NVRISNNFDEEGMHDHAKLPAPAANQLCIGVGLEDVDLIVDNF 311
            NVR+    D   M D  + P   +  LC+GVGL+DV+ IV +F
Sbjct: 232  NVRVVKETDAV-MTDSVRTP---SKTLCVGVGLQDVETIVTSF 270


>ref|XP_002301551.1| hypothetical protein POPTR_0002s18840g [Populus trichocarpa]
            gi|222843277|gb|EEE80824.1| hypothetical protein
            POPTR_0002s18840g [Populus trichocarpa]
          Length = 350

 Score =  124 bits (312), Expect = 9e-26
 Identities = 103/317 (32%), Positives = 147/317 (46%), Gaps = 3/317 (0%)
 Frame = -1

Query: 1264 FSDESYLDYYSSGRKVRCGTCXXXXXXXXXXXXXXXMLKGLSRDLALL--AEMGSDPDNA 1091
            FSD  +L YY S    RC                  +L+ LSRDL +   A  G + + +
Sbjct: 55   FSDSGHLKYYVS--PARCS-----GKKEKSKKKQLKLLRRLSRDLPIFSYAVCGEEGNGS 107

Query: 1090 LVDPVKAKMISEAAEFLQVELQKLWSEGKELKPRKEDQSRHKETLMQNMPGCETXXXXXX 911
            L+  VK KMISEA E L  ELQ    E KE K ++ D+S    TL++N P C++      
Sbjct: 108  LIGEVKEKMISEATEILLAELQNRRLERKEQKRKRRDESA---TLIKNRPRCDSG----- 159

Query: 910  XXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELLACIQSPQVKSVHQ 731
                            +           P++S +    E  + +++ +   +P ++    
Sbjct: 160  --------------SSSSSSSSSSGSSSPESSDSDCSREVVSMKQMRSKALNPFIEI--- 202

Query: 730  RSADAEEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTPATKKIEVCIGGK 551
                A+ IK AT  +         D   G     S      +G +   + +KIE+C+GGK
Sbjct: 203  --ESAKAIKEATQEDQH------RDTVSGAKSNDSSPQNLSDGVQIGASGRKIEICMGGK 254

Query: 550  CKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISN-NFDEEGMHDHAKLPAP 374
            C+K GA AL+E F R + G E+AV GCKCMGKC  GPNVR+ N   + E M     +  P
Sbjct: 255  CRKLGAAALLEEFERKI-GMESAVVGCKCMGKCMKGPNVRVFNCTVENEDMRVEDSI-KP 312

Query: 373  AANQLCIGVGLEDVDLI 323
              N LCIGVGL+DV +I
Sbjct: 313  PLNLLCIGVGLKDVGII 329


>gb|EXB75649.1| hypothetical protein L484_026126 [Morus notabilis]
          Length = 358

 Score =  123 bits (308), Expect = 3e-25
 Identities = 97/296 (32%), Positives = 147/296 (49%), Gaps = 3/296 (1%)
 Frame = -1

Query: 1153 LKGLSRDLALLAEMGSDPDNALVDPVKAKMISEAAEFLQVELQKLWSEGKELKP-RKEDQ 977
            LK LS++L++ +++    D+            E+A+ L   L+KL +E KELK  +K+D+
Sbjct: 96   LKALSQNLSVFSDVSQLQDH-----------QESADVLLKHLEKLRAEEKELKVMKKQDK 144

Query: 976  SRHKETLMQNM-PGCETXXXXXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPE 800
            +  K   M  M     +               C  V+DMN L  +       +A+Q   +
Sbjct: 145  ANLKAEQMAIMIDNDSSSSSSSSESSESSDSKCGEVIDMNRLRSQ-------NAAQHYYD 197

Query: 799  NEASTRQELLACIQSPQVKSVHQRSADAEEIKFATGMEIPCYPAGVPDPAQGISQKRSID 620
            +  S   ++ A + +  ++S+   +A  EE              G  D    + QK   +
Sbjct: 198  DSLSAAAQV-AAVLAASLRSLPTANATEEE-------------RGSEDVTSLLLQKHEKE 243

Query: 619  TISLNGSKTTPATKKIEVCIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGP 440
            +     + +   +K+++VC+G KCKKSG++ALME F R + G E AV GCKCMGKCR  P
Sbjct: 244  S-ECRPTTSGCCSKRVDVCMGNKCKKSGSVALMEEFARQMGG-EGAVVGCKCMGKCRSAP 301

Query: 439  NVRISNNFDEEGMHDHAKLPAPAANQLCIGVGLEDVDLIVDNFVRSQTK-FGLAAS 275
            NVR+ N+   EG  D     A   N LC+GVGLEDV  IV + V   T+ FGLAA+
Sbjct: 302  NVRVVNSTRVEGATDVCVRTAKNGNPLCLGVGLEDVSAIVASLVGEDTRDFGLAAA 357


>ref|XP_006304457.1| hypothetical protein CARUB_v10011099mg [Capsella rubella]
            gi|482573168|gb|EOA37355.1| hypothetical protein
            CARUB_v10011099mg [Capsella rubella]
          Length = 372

 Score =  123 bits (308), Expect = 3e-25
 Identities = 97/329 (29%), Positives = 156/329 (47%), Gaps = 8/329 (2%)
 Frame = -1

Query: 1273 NFVFSDESYLDYYSSGRKVRCGT------CXXXXXXXXXXXXXXXMLKGLSRDLALLAEM 1112
            N  F+D+ ++ YY      RCG                       +LK LS++L + + +
Sbjct: 52   NSEFADKGHVSYYIE--PTRCGEEKEKEKMKVMEKEKKALKKKAKVLKSLSKNLDMFSSL 109

Query: 1111 GS--DPDNALVDPVKAKMISEAAEFLQVELQKLWSEGKELKPRKEDQSRHKETLMQNMPG 938
            G   DP+  LV  ++ K ISEA E L  +L++L +E K LK +++++ + K   M+ M  
Sbjct: 110  GFGLDPEAGLVGEIQTKTISEATEILVKQLEQLKAEEKLLKKQRKEE-KAKAKAMKKMTE 168

Query: 937  CETXXXXXXXXXXXXXXDCEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELLACIQ 758
             ++                + V+DM+    + K    P   +A       T++ +L    
Sbjct: 169  MDSESSSSSESSDSDCDKGK-VVDMSSFRNKAKPSLEPLQPEATVATLLKTQETVLPNKL 227

Query: 757  SPQVKSVHQRSADAEEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTPATK 578
                 S +    +A +I   T    P     V +P Q +   ++++ + L         K
Sbjct: 228  EEDATSCNN-IREALQIALQTSTVFP----SVANPGQTL---KAVEAVGL-------PLK 272

Query: 577  KIEVCIGGKCKKSGALALMEGFCRALAGNEAAVSGCKCMGKCRDGPNVRISNNFDEEGMH 398
            ++EVC+GGKCKK G   L++ F RA+ G E +   CKCMGKCRDGPNVR+ N  D   M 
Sbjct: 273  RVEVCMGGKCKKLGGALLLDEFQRAMTGFEGSAVACKCMGKCRDGPNVRVVNETDAV-MT 331

Query: 397  DHAKLPAPAANQLCIGVGLEDVDLIVDNF 311
            D  + P   +  +C+GVGL+DV+ IV +F
Sbjct: 332  DSVRTP---SKTVCVGVGLQDVETIVTSF 357


>ref|XP_003609890.1| hypothetical protein MTR_4g124080 [Medicago truncatula]
            gi|355510945|gb|AES92087.1| hypothetical protein
            MTR_4g124080 [Medicago truncatula]
          Length = 341

 Score =  122 bits (307), Expect = 4e-25
 Identities = 101/341 (29%), Positives = 152/341 (44%), Gaps = 10/341 (2%)
 Frame = -1

Query: 1264 FSDESYLDYYSSGRKVRCGTCXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGSDPDNALV 1085
            F DE ++ YY   +K                     +LK  S++++ L ++G   D  L+
Sbjct: 39   FHDEGHVQYYQDVKK-------NTEPVIISNKKKIKLLKRFSKNVSQLPQLGFAQDPNLL 91

Query: 1084 DPVKAKMISEAAEFLQVELQKLWSEGKELKPRKEDQSRHKETLMQNMPGCETXXXXXXXX 905
            D +   +I+E  E L  EL+K+ +E KELK + + + +  +     M  C          
Sbjct: 92   DQLHQNLITEGGEELLRELEKVRAEEKELKKKMKQEKKKAKLKPSKMKTCNKSESSSSSS 151

Query: 904  XXXXXXD--CEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELLACIQSPQVKSVHQ 731
                  D  C  V+DMN          +  A++ + E E   +Q +L+    P+  + H 
Sbjct: 152  SESESSDSDCGEVVDMNTFR---GAGVVDVATKPVDELELKLKQPMLSI---PEDSTSHH 205

Query: 730  RSADA---EEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTPATKKIEVCI 560
               D          TG                   K+  + +         A K+IEVC+
Sbjct: 206  HVMDVCTTNNASLVTGF------------------KKETNVV------IPTAQKRIEVCM 241

Query: 559  GGKCKKSGALALMEGFCRALA-GNEAAVSGCKCMGKCRDGPNVRISNNFD---EEGMHDH 392
            G KCKKSGA AL++ F + +    E  V GCKCMGKC+  PNVRI N+ D    +G+ D 
Sbjct: 242  GNKCKKSGAAALLQEFEKVVGVEGEGVVVGCKCMGKCKTAPNVRIQNSVDLNMVQGIDDS 301

Query: 391  AKLPAPAANQLCIGVGLEDVDLIVDNFVRSQTK-FGLAASS 272
             K+P   +N LCIGVGLEDVD IV  F+    K  G+ A++
Sbjct: 302  VKIP---SNPLCIGVGLEDVDTIVARFLGEDYKDVGMVAAA 339


>gb|ACJ86204.1| unknown [Medicago truncatula] gi|388508412|gb|AFK42272.1| unknown
            [Medicago truncatula] gi|388510858|gb|AFK43495.1| unknown
            [Medicago truncatula]
          Length = 341

 Score =  121 bits (303), Expect = 1e-24
 Identities = 100/341 (29%), Positives = 152/341 (44%), Gaps = 10/341 (2%)
 Frame = -1

Query: 1264 FSDESYLDYYSSGRKVRCGTCXXXXXXXXXXXXXXXMLKGLSRDLALLAEMGSDPDNALV 1085
            F DE ++ YY   +K                     +LK  S++++ L ++G   D  L+
Sbjct: 39   FHDEGHVQYYQDVKK-------NTEPVIISNKKKIKLLKRFSKNVSQLPQLGFAQDPNLL 91

Query: 1084 DPVKAKMISEAAEFLQVELQKLWSEGKELKPRKEDQSRHKETLMQNMPGCETXXXXXXXX 905
            D +   +I+E  E L  EL+K+ +E KELK + + + +  +     M  C          
Sbjct: 92   DQLHQNLITEGGEELLRELEKVRAEEKELKKKMKQEKKKAKLKPSKMKTCNKSESSSSSS 151

Query: 904  XXXXXXD--CEAVMDMNLLDCRPKTKQIPDASQAIPENEASTRQELLACIQSPQVKSVHQ 731
                  D  C  V+DMN          +  A++ + E E   ++ +L+    P+  + H 
Sbjct: 152  SESESSDSDCGEVVDMNTFR---GAGVVDVATKPVDELELKLKRPMLSI---PEDSTSHH 205

Query: 730  RSADA---EEIKFATGMEIPCYPAGVPDPAQGISQKRSIDTISLNGSKTTPATKKIEVCI 560
               D          TG                   K+  + +         A K+IEVC+
Sbjct: 206  HVMDVCTTNNASLVTGF------------------KKETNVV------IPTAQKRIEVCM 241

Query: 559  GGKCKKSGALALMEGFCRALA-GNEAAVSGCKCMGKCRDGPNVRISNNFD---EEGMHDH 392
            G KCKKSGA AL++ F + +    E  V GCKCMGKC+  PNVRI N+ D    +G+ D 
Sbjct: 242  GNKCKKSGAAALLQEFEKVVGVEGEGVVVGCKCMGKCKTAPNVRIQNSVDLNMVQGIDDS 301

Query: 391  AKLPAPAANQLCIGVGLEDVDLIVDNFVRSQTK-FGLAASS 272
             K+P   +N LCIGVGLEDVD IV  F+    K  G+ A++
Sbjct: 302  VKIP---SNPLCIGVGLEDVDTIVARFLGEDYKDVGMVAAA 339

