BLASTX nr result

ID: Ephedra28_contig00016437 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra28_contig00016437
         (1769 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [A...   341   7e-91
ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593...   318   6e-84
ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247...   318   6e-84
ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781...   287   1e-74
gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao]    285   3e-74
ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766...   285   4e-74
gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]     284   1e-73
gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus...   282   4e-73
ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Popu...   281   5e-73
ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629...   279   3e-72
ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citr...   278   5e-72
ref|XP_002519384.1| conserved hypothetical protein [Ricinus comm...   276   2e-71
gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi...   273   1e-70
gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo...   268   4e-69
gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao]    256   2e-65
dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Grou...   252   4e-64
gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao]    249   3e-63
gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao]   244   1e-61
ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629...   238   6e-60
gb|EOY25841.1| Uncharacterized protein isoform 4 [Theobroma cacao]    225   5e-56

>ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda]
            gi|548856677|gb|ERN14505.1| hypothetical protein
            AMTR_s00038p00020700 [Amborella trichopoda]
          Length = 458

 Score =  341 bits (874), Expect = 7e-91
 Identities = 195/454 (42%), Positives = 254/454 (55%), Gaps = 30/454 (6%)
 Frame = +3

Query: 63   GGECT-LRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXX 239
            G E T L + V  SF+LE+AVCS+GFFMM+PN W S+ +TL RPLRL D           
Sbjct: 4    GAERTVLTLPVNESFELEKAVCSHGFFMMAPNLWFSSSQTLQRPLRLTDRSSVPVRITQL 63

Query: 240  XXXXXXR----VFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFG 407
                       V G S+L   D+ ++ AQV RMLR+SE +D  ++ FH ++  AK  GFG
Sbjct: 64   SLSSQKSLQILVLGASKLYQHDQQYLLAQVARMLRISEEDDLKVNKFHEMYPVAKETGFG 123

Query: 408  RVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXF 587
            RVFRSPTLFED+VK+ LLCNC+W RTLSMA +LC+LQ EL G  L               
Sbjct: 124  RVFRSPTLFEDMVKSILLCNCQWTRTLSMARALCELQLELNGNSLRQSNKDTDFSKSVNL 183

Query: 588  YPKTPTKTRLKRRE-----------------------CSEILRPAKLR--FDETSQCKVM 692
             P TP +   K+R                          E LRP  L   F + S     
Sbjct: 184  SPVTPMQLEHKKRRKNPNQNIIMNLMTKFSENETHLAADESLRPIDLAKDFSKNSPTMFS 243

Query: 693  SGKISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELA 872
            S +   G     ++S  K         LG  + L +  ++   L   L AGNFP P+ELA
Sbjct: 244  SEEGRNGKLNYDQVSEEK---------LGDGAILDNQLLENKTLSFFLEAGNFPCPEELA 294

Query: 873  SLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDG 1052
            +L E  L KRC VG+R++RI+ LA+ I  G++DL  +E       + L+ L   L  + G
Sbjct: 295  NLDEKILEKRCKVGFRSKRIVKLAQSIVEGALDLGKIEVLSQQDPIHLDGLMRQLLSIYG 354

Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232
            VG + C+ VLM MGIYQ +P DTET+RHLKQ   R  CTI ++  D+EE+Y K+ PFQFL
Sbjct: 355  VGPYVCNNVLMSMGIYQRIPADTETLRHLKQFHARKQCTIGTIQKDIEEIYGKHEPFQFL 414

Query: 1233 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1334
             YW E+W+ YEK+FG+LS MPPSDY LI+ HNMK
Sbjct: 415  VYWSEMWEFYEKRFGKLSQMPPSDYELITAHNMK 448


>ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593287 isoform X1 [Solanum
            tuberosum] gi|565385158|ref|XP_006358485.1| PREDICTED:
            uncharacterized protein LOC102593287 isoform X2 [Solanum
            tuberosum]
          Length = 485

 Score =  318 bits (814), Expect = 6e-84
 Identities = 185/441 (41%), Positives = 246/441 (55%), Gaps = 25/441 (5%)
 Frame = +3

Query: 96   SSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXX------ 257
            ++FDLE+AVCS+G FMM+PNRW S  KTL RPL L +                       
Sbjct: 29   ATFDLEKAVCSHGLFMMAPNRWDSLSKTLERPLHLSENINDDDHEQSVLVQINQPSDSPH 88

Query: 258  ----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSP 425
                RVFG + L+   +  +  QV RM+RLS  E+  +  F  +  +AK +G GRVFRSP
Sbjct: 89   SLLLRVFGTASLSTIHQRSLLGQVRRMVRLSVEENKRVKQFQEICGEAKDRGLGRVFRSP 148

Query: 426  TLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK--------GKPLGWXXXXXXXXXXX 581
            TLFED+VK  LLCNC+W RTLSMA +LC+LQ EL           P              
Sbjct: 149  TLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAASFPDPDNQNQLKGVTFKSE 208

Query: 582  XFYPKTPTKTRLKRRE----CSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEI--SIT 743
             F P+TP     ++R     CS  L       +E          ++   S+  E+     
Sbjct: 209  HFTPRTPAGKESRKRAGAYGCSRKLLERLTEVEEIIDIGKPGVTVTPAFSVGEEVLKKSN 268

Query: 744  KCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRA 923
             C     V ++G+S+    +  +  +L S    GNFP+PKELASL E++L KRCG+GYRA
Sbjct: 269  LCRDTTEVCDVGTSAPFNLDPSEDRKLSSFNQLGNFPSPKELASLDESFLAKRCGLGYRA 328

Query: 924  RRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKA-YLQKLDGVGKFTCDVVLMCMGIY 1100
             RI+ LAK I  GSI L  LE    +  +   D  A  L+++DG G FTC  VLMC+G Y
Sbjct: 329  GRIIKLAKGIVEGSIQLKELEEACSNPSLSDYDKMAEQLREIDGFGPFTCANVLMCLGYY 388

Query: 1101 QCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGR 1280
              +PTD+ET+RHLKQV  R+  TI++V  DVE +Y KY PFQFLAYW E+W  YE++FG+
Sbjct: 389  HVIPTDSETIRHLKQVHARTS-TIQNVQRDVENIYGKYAPFQFLAYWSEVWHFYEERFGK 447

Query: 1281 LSHMPPSDYGLISGHNMKEER 1343
            LS MP S+Y LI+  NM+ +R
Sbjct: 448  LSEMPHSEYKLITAANMRRKR 468


>ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247758 [Solanum
            lycopersicum]
          Length = 483

 Score =  318 bits (814), Expect = 6e-84
 Identities = 189/459 (41%), Positives = 251/459 (54%), Gaps = 25/459 (5%)
 Frame = +3

Query: 42   ERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXX 221
            E  +E+  G C       +SFDLE+AVCS+G FMM+PNRW +  KTL RPLRL +     
Sbjct: 16   ELPLEDGNGYC-------ASFDLEKAVCSHGLFMMAPNRWDTLSKTLERPLRLSENINDD 68

Query: 222  XXXXXXXXXXXX----------RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 371
                                  RV     L+   +  +  QV RM+RLS  E+  +  F 
Sbjct: 69   DHEQSVLVQITQPSDYPHSLLLRVLDTDSLSTIHQRSLLGQVRRMVRLSVEENKRVKLFQ 128

Query: 372  RVHSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK------- 530
             +  +AK +GFGRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ EL        
Sbjct: 129  EICGEAKERGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAAS 188

Query: 531  -GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE----CSEILRPAKLRFDETSQCKVMS 695
               P               F P+TP    L++R     CS  L       +E        
Sbjct: 189  FPDPDNQNQLKGVTSKSEHFTPRTPAGKELRKRAGAYGCSRNLLERLNEVEEIVDIDKPG 248

Query: 696  GKISEGCSIVSEI--SITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKEL 869
              ++   S+  E+      C     V  +  S+ L  +  +  +L S    GNFP+PK+L
Sbjct: 249  VTVTPAFSVGEEVLQKSNLCQDTTEVWEVSVSAPLNPDPSEDRKLSSFNQLGNFPSPKQL 308

Query: 870  ASLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKA-YLQKL 1046
            ASL E++L KRCG+GYRA RI+ LAK I  GSI L+ LE    +  +   D  A  L+++
Sbjct: 309  ASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLNELEEACSNPSLSNYDKMAEQLREI 368

Query: 1047 DGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQ 1226
            DG G FTC  VLMC+G Y  +PTD+ET+RHLKQV  R+  TI++V  DVE +Y KY PFQ
Sbjct: 369  DGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTS-TIQNVQRDVENIYGKYAPFQ 427

Query: 1227 FLAYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMKEER 1343
            FLAYW E+W  YE++FG+LS MP S+Y LI+  NM+ +R
Sbjct: 428  FLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMRPKR 466


>ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781827 [Glycine max]
          Length = 443

 Score =  287 bits (734), Expect = 1e-74
 Identities = 179/429 (41%), Positives = 233/429 (54%), Gaps = 12/429 (2%)
 Frame = +3

Query: 84   VSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXXRV 263
            + + S F LE+AVCS+G FMM PN W    KTL RPLR                    RV
Sbjct: 18   MELPSPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLR-SSPSSFLVSLSQHSQSLAVRV 76

Query: 264  FGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVH--SQAKSQGFGRVFRSPTLFE 437
                 L+ Q +NHI AQV RMLR SE E+ A+  F  +H          GRVFRSPTLFE
Sbjct: 77   HATHALSPQQQNHITAQVSRMLRFSEAEEKAVREFRSLHVVDHPNRSFSGRVFRSPTLFE 136

Query: 438  DIVKAFLLCNCRWQRTLSMAASLCDLQSELK-GKPLGWXXXXXXXXXXXXFYPKTPTKTR 614
            D+VK  LLCNC+W RTLSMA +LC+LQ EL+ G P               F PKTP    
Sbjct: 137  DMVKCILLCNCQWPRTLSMAQALCELQLELQNGSPCTIAVSGNSKGESEGFIPKTPASKE 196

Query: 615  LKRRECSE--ILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSS 788
             +R + S   +    KL  D   Q   +    S   ++             +  + G S 
Sbjct: 197  TRRNKVSTKGMFCKKKLELDGNLQIDHVVASSSTATTL-------------LTTDNGDSE 243

Query: 789  KLTSNTMDTSELPSN-----LMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQI 953
            +L S+  D+    SN        GNFP+P ELA+L E++L KRCG+GYRA  I+ LA+ I
Sbjct: 244  ELRSH--DSCHEFSNGNEYFSRTGNFPSPSELANLDESFLAKRCGLGYRAGYIIELARAI 301

Query: 954  CNGSIDLDSLE--NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTET 1127
              G I L  LE  + D S+    + L   L+++ G G FT   VLMC+G Y  +PTD+ET
Sbjct: 302  VEGKIQLGQLEELSKDASLS-NYKQLDDQLKQIRGYGPFTRANVLMCLGYYHVIPTDSET 360

Query: 1128 VRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDY 1307
            VRHLKQV  R   T K++  ++EE+Y KY P+QFLA+W E+WD YE +FG+L+ M  SDY
Sbjct: 361  VRHLKQVHSRY-TTSKTIERELEEIYGKYEPYQFLAFWSEVWDFYETRFGKLNEMHSSDY 419

Query: 1308 GLISGHNMK 1334
             LI+  NM+
Sbjct: 420  KLITACNMR 428


>gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 467

 Score =  285 bits (730), Expect = 3e-74
 Identities = 182/454 (40%), Positives = 246/454 (54%), Gaps = 18/454 (3%)
 Frame = +3

Query: 27   AASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCD 206
            ++S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D
Sbjct: 41   SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 99

Query: 207  EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 371
                                  RV+G   L+ Q  + +  QV RMLRLSE E+  +  F 
Sbjct: 100  HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 159

Query: 372  RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 521
            ++    H + ++         GRVFRSPTLFED+VK  LLCNC++ RTLSMA +LC+LQ 
Sbjct: 160  KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQF 219

Query: 522  ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 701
            E +    G             F PKTP    LKR          KLR             
Sbjct: 220  ETQRPFSG------VRAAEDDFIPKTPAGNELKR----------KLR------------- 250

Query: 702  ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 881
                   VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L 
Sbjct: 251  -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 302

Query: 882  ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDG 1052
            E++L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L   L+++DG
Sbjct: 303  ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 360

Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232
             G FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFL
Sbjct: 361  FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFL 419

Query: 1233 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1334
            AYW E+W  YE++FG+LS MP   Y LI+  NMK
Sbjct: 420  AYWAELWHYYEQRFGKLSEMPFCGYKLITASNMK 453


>ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766322 [Setaria italica]
          Length = 461

 Score =  285 bits (729), Expect = 4e-74
 Identities = 171/446 (38%), Positives = 237/446 (53%), Gaps = 32/446 (7%)
 Frame = +3

Query: 102  FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXX------RV 263
            FDL  AVCS+G FMM+PNRW  A + L RPLRL  +                       V
Sbjct: 36   FDLAAAVCSHGLFMMAPNRWDPAARALVRPLRLASDRSASLLARVSAHPARPGTALLVAV 95

Query: 264  FGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLFEDI 443
             G   L+  D ++I  QV RMLRLSE +  A+  F  +H+ A+ +GFGR+FRSPTLFED+
Sbjct: 96   EGADALSSLDRDYILEQVRRMLRLSEEDGAAVAEFQAMHAAAREEGFGRIFRSPTLFEDM 155

Query: 444  VKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTP----TKT 611
            VK  LLCNC+W RTLSMA +LC++Q ELK                  F  +TP     K 
Sbjct: 156  VKCILLCNCQWTRTLSMATALCEIQLELK-----------CSSSVEDFQSRTPPIRERKR 204

Query: 612  RLKRRECSEILRPAKLRFDETSQCKVMSGK------------ISEGCSIVSEISITKCDG 755
            +  +R+   I    +   D+     + SG             +S   S+ SE   + CD 
Sbjct: 205  KRSKRQSVRIKLETRFAEDKLEGPTIASGTSNDLTHPETNEYLSSLASVASETG-SACDS 263

Query: 756  EYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRIL 935
               +P+L +S    +N     +       G+FPTP+ELA+L E +L KRC +GYRA+RI+
Sbjct: 264  ---LPSLDNSELSLNNAPGLED-----CIGDFPTPEELANLDEGFLAKRCNLGYRAKRIV 315

Query: 936  NLAKQICNGSIDLDSLE----------NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLM 1085
             LA+ +  G + L  LE              +++   E L   L  + G G FT   VLM
Sbjct: 316  MLARGVVEGKVCLQKLEEMCRISVPAAEEVSTIESACERLNKELSAISGFGPFTRANVLM 375

Query: 1086 CMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYE 1265
            CMG    +P DTET+RHLKQV  R+  TI SV  +++++Y KY PFQFLAYW+E+W  Y 
Sbjct: 376  CMGFNHTIPADTETIRHLKQVHKRAS-TISSVHQELDKIYGKYAPFQFLAYWFELWGFYN 434

Query: 1266 KQFGRLSHMPPSDYGLISGHNMKEER 1343
            KQFG++  M PS+Y L +  ++K+ +
Sbjct: 435  KQFGKICEMEPSNYRLFTASHLKKAK 460


>gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]
          Length = 472

 Score =  284 bits (726), Expect = 1e-73
 Identities = 186/447 (41%), Positives = 240/447 (53%), Gaps = 34/447 (7%)
 Frame = +3

Query: 96   SSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLC------------DEXXXXXXXXXX 239
            ++F LE AVCS+G FMM+PN+W    KTL RPLRL             D+          
Sbjct: 14   ATFRLETAVCSHGLFMMAPNQWDPLSKTLLRPLRLTLHHHHWNPQQQQDDSVMARISQPH 73

Query: 240  XXXXXXRVF---GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGR 410
                  RV    G   LT  ++  + AQV RMLRLS+ E+     F  V+      G GR
Sbjct: 74   DRLHCLRVLVHAGTRSLTSDNKQALLAQVSRMLRLSQTEERICREFSEVYGCGS--GLGR 131

Query: 411  VFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFY 590
            VFRSPTLFED+VK  LLCNC+W RTLSMA +LCDLQ EL+ + +              F 
Sbjct: 132  VFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCDLQRELQLQSV--------PSKTVDFV 183

Query: 591  PKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVP 770
            PKTP     KR+     +   K     TSQ    S +  E  S  +++SI          
Sbjct: 184  PKTPAGKEPKRK-----VEKLKASTCLTSQFDAQSNEGLESHS--NDLSIDISQPTPSAQ 236

Query: 771  NLGSSSKLT----------SNTMDTSEL--PSNLM------AGNFPTPKELASLSENYLT 896
            NL  SS L+          S  +D++ L  P  L        G+FPTP ELA L E +L 
Sbjct: 237  NLSPSSLLSVPMENVTCEESYGVDSASLCNPQILRDREFEGTGDFPTPTELAKLDEKFLA 296

Query: 897  KRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQM-KLEDLKAYLQKLDGVGKFTCD 1073
            KRC +GYRA RIL LA+ I  G I L  LE       +     L   L+++DG G FTC 
Sbjct: 297  KRCKLGYRAGRILKLARGIVEGRIQLRELEETCMERSLCSYSKLAVQLRQIDGFGPFTCA 356

Query: 1074 VVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIW 1253
             VLMCMG Y  +P+D+ET+RHL+QV GR+  T++++  DV+++YAKY PFQFLAYW E+W
Sbjct: 357  NVLMCMGFYHVIPSDSETIRHLQQVHGRNS-TVRTIERDVQQIYAKYEPFQFLAYWSELW 415

Query: 1254 DSYEKQFGRLSHMPPSDYGLISGHNMK 1334
              YEK+FG++S MP S Y L +  NMK
Sbjct: 416  HFYEKKFGKISEMPCSAYKLFTASNMK 442


>gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris]
          Length = 474

 Score =  282 bits (721), Expect = 4e-73
 Identities = 180/445 (40%), Positives = 243/445 (54%), Gaps = 19/445 (4%)
 Frame = +3

Query: 102  FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRL-----CDEXXXXXXXXXXXXXXXXRVF 266
            F L++AVCS+GFFMM+PN W    KTL RPL L                        RV 
Sbjct: 46   FQLDQAVCSHGFFMMAPNHWDPLSKTLTRPLLLHNPSSSSSSSLLVSLSQRPQSLAVRVH 105

Query: 267  GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHS-QAKSQGFG-RVFRSPTLFED 440
             +  ++ Q + HIKAQ+ RMLRLSE E+ A+  F  VH+    ++ FG RVFRSPTLFED
Sbjct: 106  SVHFISPQQQRHIKAQITRMLRLSEAEEKAVREFRSVHAADHPNRSFGGRVFRSPTLFED 165

Query: 441  IVKAFLLCNCRWQRTLSMAASLCDLQSELK-GKPLGWXXXXXXXXXXXXFYPKTPT--KT 611
            +VK  LLCNC+W RTLSMA +LC+LQS L+ G P               F PKTP   + 
Sbjct: 166  MVKCILLCNCQWPRTLSMAQALCELQSGLQNGLPCAVEGSGNPKVEAEEFVPKTPASKEN 225

Query: 612  RLKRRECSEILRPAKLRF------DETSQCKVMSGKISEGCSIVSEISITKCDGEYI-VP 770
            R K+     +L   KL        D   Q   M    S+  +++ ++ + + D      P
Sbjct: 226  RRKKAPTKGVLLKKKLELELEMEVDGNLQMDHMFASSSD-TTLLGDLEVLRSDDSCCQFP 284

Query: 771  NLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 950
            N G     T               GNFP+P ELA+LSE++L KRC +GYRA  IL LA+ 
Sbjct: 285  NEGEYFDHT---------------GNFPSPIELANLSESFLAKRCKLGYRAGYILELAQG 329

Query: 951  ICNGSIDLDSLE--NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTE 1124
            I  G I L+ LE  + D S+    + L   L+ + G G FT   VLMC+G Y  +P D+E
Sbjct: 330  IVEGKIQLEQLEELSKDASLSC-YKQLGDQLKPIKGFGPFTRANVLMCLGYYHVIPWDSE 388

Query: 1125 TVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSD 1304
            TVRHLKQV  ++  + K++  D+EE+Y KY P+QFLA+W EIWD YE +FG+++ M  S+
Sbjct: 389  TVRHLKQVHSKNTSS-KTIERDLEEIYGKYEPYQFLAFWSEIWDFYETRFGKMNEMHSSE 447

Query: 1305 YGLISGHNMKEERAVTSIDPDKSQE 1379
            Y  I+  NM+  R  T+     SQ+
Sbjct: 448  YKRITASNMRSTRKATNKRKRPSQK 472


>ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa]
            gi|550342350|gb|EEE79091.2| hypothetical protein
            POPTR_0003s03710g [Populus trichocarpa]
          Length = 489

 Score =  281 bits (720), Expect = 5e-73
 Identities = 182/456 (39%), Positives = 238/456 (52%), Gaps = 42/456 (9%)
 Frame = +3

Query: 99   SFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLC-------------DEXXXXXXXXXX 239
            +F+LE+AVCS+G FMMSPN W     T  RPLRL                          
Sbjct: 29   TFNLEKAVCSHGLFMMSPNHWDPLSLTFSRPLRLSLSDSDPQVSTPTTSLFVSISHPPHL 88

Query: 240  XXXXXXRVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQ------- 398
                  RV+G   L+ + +  + AQVVRMLRLSE ++     F ++   A ++       
Sbjct: 89   PRSLSVRVYGTRCLSPKHQESLVAQVVRMLRLSETDERNAREFRKIAEAAAAEENNSWLT 148

Query: 399  GFG-RVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXX 575
            GFG RVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ EL+ K  G          
Sbjct: 149  GFGGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARALCELQCELQCKSSGVFVAQAVNAT 208

Query: 576  XXX--------FYPKTPTKTRLKRR-ECSEILRPAKLRFDET-------SQCKVMSGKIS 707
                       F P T      KR    S++ +    +  ET       +  K  S  I 
Sbjct: 209  VKNKCNDTAHNFIPNTSAGKESKRNIRASKVTKNLASKIVETETLLEADANLKTDSAHIG 268

Query: 708  -EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPS--NLMAGNFPTPKELASL 878
             E    V   S  +C   +     GS S    +      +    N M  NFP+P+ELA+L
Sbjct: 269  RETLESVENDSCARCSSRH-----GSDSWAPDSLQSQHGIQPGVNKMICNFPSPRELANL 323

Query: 879  SENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENP--DGSVQMKLEDLKAYLQKLDG 1052
             E++L KRC +GYRA RI+ LA+ I  G I L  +E    +G+       L    +++DG
Sbjct: 324  DESFLAKRCNLGYRAIRIIKLAQSIVEGRIPLREVEEDCANGASSSCYNKLADQFRQIDG 383

Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232
             G FTC  VLMCMG Y  +PTD+ETVRHLKQV  +   TI++V  DVEE+Y KY PFQFL
Sbjct: 384  FGPFTCANVLMCMGFYHIIPTDSETVRHLKQVHAKKS-TIQTVQRDVEEIYGKYAPFQFL 442

Query: 1233 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMKEE 1340
            AYW E+W  YEK+FG+LS +P SDY LI+  NM+ +
Sbjct: 443  AYWAELWHFYEKRFGKLSEIPTSDYKLITASNMRSK 478


>ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629917 isoform X1 [Citrus
            sinensis]
          Length = 454

 Score =  279 bits (713), Expect = 3e-72
 Identities = 177/451 (39%), Positives = 240/451 (53%), Gaps = 30/451 (6%)
 Frame = +3

Query: 69   ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXX 248
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESLLKLPLAETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 249  XXXRVF----------------GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVH 380
               +                      L+ + ++ + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIV 124

Query: 381  SQ-AKSQG---------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 530
             Q A+ +G          GRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ    
Sbjct: 125  RQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQ---- 180

Query: 531  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE-CSEILRPAKLRFDETSQCKVMSGKIS 707
                 W            F P+TP     KRR+  S++      R  E+         + 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235

Query: 708  EGCSIVSEISITKCDGEYIVPNLGSSSKLTS-NTMDTSELPSNL-MAGNFPTPKELASLS 881
              C+ V E ++         P     S L   N + T++ PS     GNFP+P+ELA+L 
Sbjct: 236  LDCAGVLEENVQPS-----FPQNDIESDLHGLNELSTTDPPSARDRIGNFPSPRELANLD 290

Query: 882  ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLE-DLKAYLQKLDGVG 1058
            E++L KRC +GYRA RIL LA+ I +G I L  LE+      +     L   L +++G G
Sbjct: 291  ESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAEQLSQINGFG 350

Query: 1059 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1238
             FT + VL+C+G Y  +PTD+ET+RHLKQV  R+ CT K+V M  E +Y KY PFQFLAY
Sbjct: 351  PFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN-CTSKTVQMIAESIYGKYAPFQFLAY 409

Query: 1239 WWEIWDSYEKQFGRLSHMPPSDYGLISGHNM 1331
            W E+W  YEK+FG+LS MP SDY LI+  NM
Sbjct: 410  WSELWHFYEKRFGKLSEMPYSDYKLITASNM 440


>ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citrus clementina]
            gi|557533482|gb|ESR44600.1| hypothetical protein
            CICLE_v10001110mg [Citrus clementina]
          Length = 454

 Score =  278 bits (711), Expect = 5e-72
 Identities = 173/453 (38%), Positives = 243/453 (53%), Gaps = 32/453 (7%)
 Frame = +3

Query: 69   ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXX 248
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESVLKLPLAETFNLEAAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 249  XXXRVF----------------GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRV- 377
               +                      L+ + ++ + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVRDFKRIV 124

Query: 378  ---------HSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 530
                      SQ  +   GRVFRSPTLFED+VK  LLCNC+W RTL+MA +LC+LQ    
Sbjct: 125  RQVAQEEGEESQYMTDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLNMARALCELQ---- 180

Query: 531  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISE 710
                 W            F P+TP     KRR+     + +K+    TS  ++   K S 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQ-----KVSKVASKLTS--RIAESKASS 228

Query: 711  GCSIVSEISITKCDGEYIVPNLGSSSKLTS----NTMDTSELPSNL-MAGNFPTPKELAS 875
               +  ++  T    E + P+   +   +     N + T++ PS     GNFP+P+ELA+
Sbjct: 229  EDDMNLKLDCTGALEENVQPSFPRNDIESDLHGLNELSTTDPPSACDRIGNFPSPRELAN 288

Query: 876  LSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED-LKAYLQKLDG 1052
            L E++L KRC +GYRA RIL LA+ I +G I L  LE+      +   + L   L +++G
Sbjct: 289  LDESFLAKRCNLGYRAGRILKLAQGIVDGQIQLRELEDTCNEASLTTYNKLAEQLSQING 348

Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232
             G FT + VL+C+G Y  +PTD+ET+RHLKQV  R+ CT K+V +  E +Y KY PFQFL
Sbjct: 349  FGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN-CTSKTVQIIAESIYGKYSPFQFL 407

Query: 1233 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNM 1331
            AYW E+W  YEK+FG+LS MP SDY LI+  NM
Sbjct: 408  AYWSELWHFYEKRFGKLSEMPYSDYKLITASNM 440


>ref|XP_002519384.1| conserved hypothetical protein [Ricinus communis]
            gi|223541451|gb|EEF43001.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 458

 Score =  276 bits (706), Expect = 2e-71
 Identities = 167/436 (38%), Positives = 235/436 (53%), Gaps = 22/436 (5%)
 Frame = +3

Query: 99   SFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXX---RVFG 269
            +FDLE+ VCS+G FM+SPN W    +T  RPLRL D+                   RV+G
Sbjct: 21   TFDLEKTVCSHGLFMLSPNHWDPLSRTFSRPLRLNDDTDNSLMVSISQHLSKSLLVRVYG 80

Query: 270  ISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGF-------GRVFRSPT 428
               L+ + +  +  Q+VRMLRLS+ ++     F ++ S  + +         GRV RSPT
Sbjct: 81   NRSLSPKHQESLLVQIVRMLRLSDMDEFNAREFRKIVSAFEGEECPLIGDFGGRVLRSPT 140

Query: 429  LFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTK 608
            LFED+VK  LLCNC+W RTLSMA +LC  Q EL  +                F P TP K
Sbjct: 141  LFEDMVKCILLCNCQWSRTLSMADALCKFQIELHSQS------PQQKHAFNHFIPNTPVK 194

Query: 609  TRLKRRECSEILRPAKLRFDETSQCKVMSG---KISEGCSIVSEISITKCDGEYIVPNLG 779
               KR+     +    +  +    C        KIS   + V + S      + +    G
Sbjct: 195  KEPKRKIRLSKVPTESMDLEAADTCLTTDDSQMKISNSLNCVDDGSF-----DNLKSCQG 249

Query: 780  SSSKLTSNTMDTSELPSNLMA--------GNFPTPKELASLSENYLTKRCGVGYRARRIL 935
            S++  ++    TS++ S+L+         GNFP+P+ELA+L E +L KRCG+GYRA RI+
Sbjct: 250  SNTFYSTGPYATSDIQSHLVTQHCAKKTTGNFPSPRELANLDERFLAKRCGLGYRAGRII 309

Query: 936  NLAKQICNGSIDLDSLEN-PDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVP 1112
             LA+ I  G I L   E   +G        L   L++++G G FT   VLMCMG Y  +P
Sbjct: 310  KLAQGIVEGRIPLREFEQVSNGGSLSTYSKLTDQLREIEGFGPFTRANVLMCMGFYHVIP 369

Query: 1113 TDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHM 1292
            TD+ETVRH KQV  ++  TIK+V  + EE+Y K+ PFQFL YW E+W  YE++FG+LS M
Sbjct: 370  TDSETVRHFKQVHAKNS-TIKTVQSEAEEIYRKFAPFQFLVYWAELWHFYEQRFGKLSEM 428

Query: 1293 PPSDYGLISGHNMKEE 1340
            P S+Y LI+  N++ +
Sbjct: 429  PCSNYKLITASNLRNK 444


>gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group]
          Length = 463

 Score =  273 bits (699), Expect = 1e-70
 Identities = 169/439 (38%), Positives = 230/439 (52%), Gaps = 27/439 (6%)
 Frame = +3

Query: 102  FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXXR------V 263
            FDLE AVCS+G FMM+PNRW  A + L RPLRL  +                       V
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 264  FGI--SQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLFE 437
             G     L+  D+  I  QV RMLRL E +  A   F  +H+ A+  GFGR+FRSPTLFE
Sbjct: 97   LGAPGDALSPPDQTSILEQVRRMLRLDEEDGRAAAEFQAMHAVAREAGFGRIFRSPTLFE 156

Query: 438  DIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRL 617
            D+VK  LLCNC+W RTLSM+ +LC+LQ EL+                  F  +TP     
Sbjct: 157  DMVKCILLCNCQWTRTLSMSTALCELQLELRSS-----------SSTENFQSRTPPIREC 205

Query: 618  KRRECSEILRPAKL--RFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSSK 791
            KR+  ++     KL  +F+E     +    ++         ++   +  + +P+  S + 
Sbjct: 206  KRKRSNKRNVRVKLETKFNEDKLVCLEDPNLA-----TDTANLQTYENSFNLPSAASGTG 260

Query: 792  LTSN-TMDTSELPSNL------MAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 950
             TS  ++D SEL            G+FPTP+ELA+L E++L KRC +GYRARRI+ LA+ 
Sbjct: 261  NTSEVSLDHSELKLRNEPCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARS 320

Query: 951  ICNGSIDLDSLENPD----------GSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIY 1100
            I  G I L  LE              +     + L   L  + G G FT   VLMCMG +
Sbjct: 321  IVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFF 380

Query: 1101 QCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGR 1280
              +P DTET+RHLKQ   R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG+
Sbjct: 381  HMIPADTETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGK 439

Query: 1281 LSHMPPSDYGLISGHNMKE 1337
            +S M P +Y L +   +K+
Sbjct: 440  ISDMEPINYRLFTASKLKK 458


>gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group]
          Length = 442

 Score =  268 bits (686), Expect = 4e-69
 Identities = 169/433 (39%), Positives = 229/433 (52%), Gaps = 21/433 (4%)
 Frame = +3

Query: 102  FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDE------XXXXXXXXXXXXXXXXRV 263
            FDLE AVCS+G FMM+PNRW  A + L RPLRL  +                       V
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 264  FGI---SQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLF 434
             G      L+  D+  I  QV RMLRL E +  A+  F  +H+ A+  GFGR+FRSPTLF
Sbjct: 97   LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 435  EDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTR 614
            ED++K  LLCNC+W RTLSM+ +LC+LQ EL+                  F  +TP    
Sbjct: 157  EDMIKCILLCNCQWTRTLSMSTALCELQLELRSS-----------SSTENFQSRTPPIRE 205

Query: 615  LKRRECSEILRPAKL--RFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSS 788
             KR+  ++     KL  +F+E     +    ++           T    E +  +L SS+
Sbjct: 206  CKRKRSNKRNVRVKLETKFNEDKMVCLEDPNLA-----------TNTANENLF-SLPSSA 253

Query: 789  KLTSNTMDTS----------ELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILN 938
              T NT + S          EL      G+FPTP+ELA+L E++L KRC +GYRARRI+ 
Sbjct: 254  NETGNTSEVSLDHSELKLRYELCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVM 313

Query: 939  LAKQICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTD 1118
            LA+ I  G I L  LE      ++ +E+L      + G+  F    VLMCMG +  +P D
Sbjct: 314  LARSIVEGKICLQKLEE---IRKILIEELST----ISGIWPFHSCNVLMCMGFFHMIPAD 366

Query: 1119 TETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPP 1298
            TET+RHLKQ   R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG +S M P
Sbjct: 367  TETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEP 425

Query: 1299 SDYGLISGHNMKE 1337
             +Y L +   +K+
Sbjct: 426  INYRLFTASKLKK 438


>gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 426

 Score =  256 bits (654), Expect = 2e-65
 Identities = 170/454 (37%), Positives = 230/454 (50%), Gaps = 18/454 (3%)
 Frame = +3

Query: 27   AASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCD 206
            ++S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D
Sbjct: 26   SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 84

Query: 207  EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 371
                                  RV+G   L+ Q  + +  QV RMLRLSE E+  +  F 
Sbjct: 85   HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 144

Query: 372  RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 521
            ++    H + ++         GRVFRSPTLFED+VK  LLCNC+                
Sbjct: 145  KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQ---------------- 188

Query: 522  ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 701
                                 F PKTP    LKR          KLR             
Sbjct: 189  ----------------AAEDDFIPKTPAGNELKR----------KLR------------- 209

Query: 702  ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 881
                   VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L 
Sbjct: 210  -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 261

Query: 882  ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDG 1052
            E++L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L   L+++DG
Sbjct: 262  ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 319

Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232
             G FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFL
Sbjct: 320  FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFL 378

Query: 1233 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1334
            AYW E+W  YE++FG+LS MP   Y LI+  NMK
Sbjct: 379  AYWAELWHYYEQRFGKLSEMPFCGYKLITASNMK 412


>dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Group]
            gi|50510134|dbj|BAD31099.1| hypothetical protein [Oryza
            sativa Japonica Group]
          Length = 501

 Score =  252 bits (643), Expect = 4e-64
 Identities = 169/481 (35%), Positives = 230/481 (47%), Gaps = 69/481 (14%)
 Frame = +3

Query: 102  FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXXR------V 263
            FDLE AVCS+G FMM+PNRW  A + L RPLRL  +                       V
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 264  FGISQ---LTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLF 434
             G      L+  D+  I  QV RMLRL E +  A+  F  +H+ A+  GFGR+FRSPTLF
Sbjct: 97   LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 435  EDIVKAFLLCNC------------------------------------------RWQRTL 488
            ED++K  LLCNC                                          RW RTL
Sbjct: 157  EDMIKCILLCNCQFSLPLPLPSLASTSMRNSDTNMSRYLGIAIFHLHSTVLFNCRWTRTL 216

Query: 489  SMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKL--R 662
            SM+ +LC+LQ EL+                  F  +TP     KR+  ++     KL  +
Sbjct: 217  SMSTALCELQLELRSSS-----------STENFQSRTPPIRECKRKRSNKRNVRVKLETK 265

Query: 663  FDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNL-- 836
            F+E     +    ++   +  +  S+     E      G++S+++   +D SEL      
Sbjct: 266  FNEDKMVCLEDPNLATNTANENLFSLPSSANE-----TGNTSEVS---LDHSELKLRYEL 317

Query: 837  ----MAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPD--- 995
                  G+FPTP+ELA+L E++L KRC +GYRARRI+ LA+ I  G I L  LE      
Sbjct: 318  CLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEIRKMS 377

Query: 996  -------GSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVG 1154
                    +     + L   L  + G G FT   VLMCMG +  +P DTET+RHLKQ   
Sbjct: 378  VPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQFHK 437

Query: 1155 RSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1334
            R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG +S M P +Y L +   +K
Sbjct: 438  RAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEPINYRLFTASKLK 496

Query: 1335 E 1337
            +
Sbjct: 497  K 497


>gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 421

 Score =  249 bits (636), Expect = 3e-63
 Identities = 165/422 (39%), Positives = 225/422 (53%), Gaps = 18/422 (4%)
 Frame = +3

Query: 27   AASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCD 206
            ++S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D
Sbjct: 41   SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 99

Query: 207  EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 371
                                  RV+G   L+ Q  + +  QV RMLRLSE E+  +  F 
Sbjct: 100  HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 159

Query: 372  RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 521
            ++    H + ++         GRVFRSPTLFED+VK  LLCNC++ RTLSMA +LC+LQ 
Sbjct: 160  KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQF 219

Query: 522  ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 701
            E + +P               F PKTP    LKR          KLR             
Sbjct: 220  ETQ-RPFS-----GVRAAEDDFIPKTPAGNELKR----------KLR------------- 250

Query: 702  ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 881
                   VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L 
Sbjct: 251  -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 302

Query: 882  ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDG 1052
            E++L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L   L+++DG
Sbjct: 303  ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 360

Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1232
             G FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFL
Sbjct: 361  FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFL 419

Query: 1233 AY 1238
            AY
Sbjct: 420  AY 421


>gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao]
          Length = 374

 Score =  244 bits (622), Expect = 1e-61
 Identities = 153/424 (36%), Positives = 215/424 (50%), Gaps = 5/424 (1%)
 Frame = +3

Query: 66   GECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXX 245
            GEC+      SSF++E+AVC++G FMMSPN W+ + K+L RPLRL D             
Sbjct: 12   GECS------SSFNMEKAVCNHGLFMMSPNVWIPSTKSLRRPLRLADSSGSVYVTISHPA 65

Query: 246  XXXX----RVFGI-SQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGR 410
                    +V G+ + ++  D+  I  QV RMLR+S  ++  +  F  +H  AK +GFGR
Sbjct: 66   PNHPFLVIQVNGLQNSISSADKAVIMEQVARMLRISSKDERDVREFQTLHGSAKDRGFGR 125

Query: 411  VFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFY 590
            +FRSP+ FED VK+ LLCNC                        GW              
Sbjct: 126  IFRSPSFFEDAVKSILLCNC------------------------GWK------------- 148

Query: 591  PKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVP 770
             +T T   + R  C+  L+ A             + KIS           TK        
Sbjct: 149  -RTLT---MARALCALQLQLASAHLQHKRVASNSNVKIS-----------TKRLKHKKYT 193

Query: 771  NLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 950
               S+S+L+ +  D S        GNFPT  ELA L E YL +RC +GYRAR IL LA++
Sbjct: 194  KASSTSELSMSGFDQS-------IGNFPTSTELACLDEKYLNERCNLGYRARCILQLARK 246

Query: 951  ICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTETV 1130
            + NG ++L+ LE  + S     E     L K+ G G F C  ++MC+G Y+ +P D+ET+
Sbjct: 247  VENGELELNKLE--ESSDTTSYERFYQKLMKIKGFGPFVCSNIMMCIGFYERIPFDSETI 304

Query: 1131 RHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDYG 1310
            RHLK V G+  C+ K++  D+EE+Y KY PFQ +AYW E+ D YE +FG+LS +  S Y 
Sbjct: 305  RHLKMVHGKGKCSRKTIEKDIEEIYGKYAPFQCMAYWLELLDEYENKFGKLSELESSSYH 364

Query: 1311 LISG 1322
            L +G
Sbjct: 365  LATG 368


>ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629917 isoform X2 [Citrus
            sinensis]
          Length = 409

 Score =  238 bits (607), Expect = 6e-60
 Identities = 158/420 (37%), Positives = 218/420 (51%), Gaps = 30/420 (7%)
 Frame = +3

Query: 69   ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXX 248
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESLLKLPLAETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 249  XXXRVF----------------GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVH 380
               +                      L+ + ++ + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIV 124

Query: 381  SQ-AKSQG---------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 530
             Q A+ +G          GRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ    
Sbjct: 125  RQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQ---- 180

Query: 531  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE-CSEILRPAKLRFDETSQCKVMSGKIS 707
                 W            F P+TP     KRR+  S++      R  E+         + 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235

Query: 708  EGCSIVSEISITKCDGEYIVPNLGSSSKLTS-NTMDTSELPS-NLMAGNFPTPKELASLS 881
              C+ V E ++     +   P     S L   N + T++ PS     GNFP+P+ELA+L 
Sbjct: 236  LDCAGVLEENV-----QPSFPQNDIESDLHGLNELSTTDPPSARDRIGNFPSPRELANLD 290

Query: 882  ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQM-KLEDLKAYLQKLDGVG 1058
            E++L KRC +GYRA RIL LA+ I +G I L  LE+      +     L   L +++G G
Sbjct: 291  ESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAEQLSQINGFG 350

Query: 1059 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1238
             FT + VL+C+G Y  +PTD+ET+RHLKQV  R +CT K+V M  E +Y KY PFQFLAY
Sbjct: 351  PFTRNNVLVCIGFYHVIPTDSETIRHLKQVHAR-NCTSKTVQMIAESIYGKYAPFQFLAY 409


>gb|EOY25841.1| Uncharacterized protein isoform 4 [Theobroma cacao]
          Length = 406

 Score =  225 bits (573), Expect = 5e-56
 Identities = 154/408 (37%), Positives = 213/408 (52%), Gaps = 18/408 (4%)
 Frame = +3

Query: 27   AASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCD 206
            ++S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D
Sbjct: 26   SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 84

Query: 207  EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 371
                                  RV+G   L+ Q  + +  QV RMLRLSE E+  +  F 
Sbjct: 85   HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 144

Query: 372  RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 521
            ++    H + ++         GRVFRSPTLFED+VK  LLCNC++ RTLSMA +LC+LQ 
Sbjct: 145  KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQF 204

Query: 522  ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 701
            E + +P               F PKTP    LKR          KLR             
Sbjct: 205  ETQ-RPFS-----GVRAAEDDFIPKTPAGNELKR----------KLR------------- 235

Query: 702  ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 881
                   VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L 
Sbjct: 236  -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 287

Query: 882  ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDG 1052
            E++L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L   L+++DG
Sbjct: 288  ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 345

Query: 1053 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVE 1196
             G FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE
Sbjct: 346  FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVE 392


Top