BLASTX nr result

ID: Ephedra27_contig00020983 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra27_contig00020983
         (1968 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [A...   341   8e-91
ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593...   318   7e-84
ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247...   318   7e-84
ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781...   287   1e-74
ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766...   285   5e-74
gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao]    285   7e-74
gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]     284   1e-73
gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus...   282   4e-73
ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Popu...   281   6e-73
ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629...   279   4e-72
ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citr...   278   6e-72
ref|XP_002519384.1| conserved hypothetical protein [Ricinus comm...   276   2e-71
gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi...   273   2e-70
gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo...   268   5e-69
gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao]    255   4e-65
dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Grou...   252   5e-64
gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao]    248   5e-63
gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao]   244   1e-61
ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629...   238   7e-60
gb|EMT03969.1| hypothetical protein F775_22747 [Aegilops tauschii]    225   6e-56

>ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda]
            gi|548856677|gb|ERN14505.1| hypothetical protein
            AMTR_s00038p00020700 [Amborella trichopoda]
          Length = 458

 Score =  341 bits (874), Expect = 8e-91
 Identities = 195/454 (42%), Positives = 254/454 (55%), Gaps = 30/454 (6%)
 Frame = +1

Query: 118  GGECT-LRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXX 294
            G E T L + V  SF+LE+AVCS+GFFMM+PN W S+ +TL RPLRL D           
Sbjct: 4    GAERTVLTLPVNESFELEKAVCSHGFFMMAPNLWFSSSQTLQRPLRLTDRSSVPVRITQL 63

Query: 295  XXXXXXR----VFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFG 462
                       V G S+L   D+ ++ AQV RMLR+SE +D  ++ FH ++  AK  GFG
Sbjct: 64   SLSSQKSLQILVLGASKLYQHDQQYLLAQVARMLRISEEDDLKVNKFHEMYPVAKETGFG 123

Query: 463  RVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXF 642
            RVFRSPTLFED+VK+ LLCNC+W RTLSMA +LC+LQ EL G  L               
Sbjct: 124  RVFRSPTLFEDMVKSILLCNCQWTRTLSMARALCELQLELNGNSLRQSNKDTDFSKSVNL 183

Query: 643  YPKTPTKTRLKRRE-----------------------CSEILRPAKLR--FDETSQCKVM 747
             P TP +   K+R                          E LRP  L   F + S     
Sbjct: 184  SPVTPMQLEHKKRRKNPNQNIIMNLMTKFSENETHLAADESLRPIDLAKDFSKNSPTMFS 243

Query: 748  SGKISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELA 927
            S +   G     ++S  K         LG  + L +  ++   L   L AGNFP P+ELA
Sbjct: 244  SEEGRNGKLNYDQVSEEK---------LGDGAILDNQLLENKTLSFFLEAGNFPCPEELA 294

Query: 928  SLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDG 1107
            +L E  L KRC VG+R++RI+ LA+ I  G++DL  +E       + L+ L   L  + G
Sbjct: 295  NLDEKILEKRCKVGFRSKRIVKLAQSIVEGALDLGKIEVLSQQDPIHLDGLMRQLLSIYG 354

Query: 1108 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1287
            VG + C+ VLM MGIYQ +P DTET+RHLKQ   R  CTI ++  D+EE+Y K+ PFQFL
Sbjct: 355  VGPYVCNNVLMSMGIYQRIPADTETLRHLKQFHARKQCTIGTIQKDIEEIYGKHEPFQFL 414

Query: 1288 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1389
             YW E+W+ YEK+FG+LS MPPSDY LI+ HNMK
Sbjct: 415  VYWSEMWEFYEKRFGKLSQMPPSDYELITAHNMK 448


>ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593287 isoform X1 [Solanum
            tuberosum] gi|565385158|ref|XP_006358485.1| PREDICTED:
            uncharacterized protein LOC102593287 isoform X2 [Solanum
            tuberosum]
          Length = 485

 Score =  318 bits (814), Expect = 7e-84
 Identities = 185/441 (41%), Positives = 246/441 (55%), Gaps = 25/441 (5%)
 Frame = +1

Query: 151  SSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXX------ 312
            ++FDLE+AVCS+G FMM+PNRW S  KTL RPL L +                       
Sbjct: 29   ATFDLEKAVCSHGLFMMAPNRWDSLSKTLERPLHLSENINDDDHEQSVLVQINQPSDSPH 88

Query: 313  ----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSP 480
                RVFG + L+   +  +  QV RM+RLS  E+  +  F  +  +AK +G GRVFRSP
Sbjct: 89   SLLLRVFGTASLSTIHQRSLLGQVRRMVRLSVEENKRVKQFQEICGEAKDRGLGRVFRSP 148

Query: 481  TLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK--------GKPLGWXXXXXXXXXXX 636
            TLFED+VK  LLCNC+W RTLSMA +LC+LQ EL           P              
Sbjct: 149  TLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAASFPDPDNQNQLKGVTFKSE 208

Query: 637  XFYPKTPTKTRLKRRE----CSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEI--SIT 798
             F P+TP     ++R     CS  L       +E          ++   S+  E+     
Sbjct: 209  HFTPRTPAGKESRKRAGAYGCSRKLLERLTEVEEIIDIGKPGVTVTPAFSVGEEVLKKSN 268

Query: 799  KCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRA 978
             C     V ++G+S+    +  +  +L S    GNFP+PKELASL E++L KRCG+GYRA
Sbjct: 269  LCRDTTEVCDVGTSAPFNLDPSEDRKLSSFNQLGNFPSPKELASLDESFLAKRCGLGYRA 328

Query: 979  RRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKA-YLQKLDGVGKFTCDVVLMCMGIY 1155
             RI+ LAK I  GSI L  LE    +  +   D  A  L+++DG G FTC  VLMC+G Y
Sbjct: 329  GRIIKLAKGIVEGSIQLKELEEACSNPSLSDYDKMAEQLREIDGFGPFTCANVLMCLGYY 388

Query: 1156 QCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGR 1335
              +PTD+ET+RHLKQV  R+  TI++V  DVE +Y KY PFQFLAYW E+W  YE++FG+
Sbjct: 389  HVIPTDSETIRHLKQVHARTS-TIQNVQRDVENIYGKYAPFQFLAYWSEVWHFYEERFGK 447

Query: 1336 LSHMPPSDYGLISGHNMKEER 1398
            LS MP S+Y LI+  NM+ +R
Sbjct: 448  LSEMPHSEYKLITAANMRRKR 468


>ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247758 [Solanum
            lycopersicum]
          Length = 483

 Score =  318 bits (814), Expect = 7e-84
 Identities = 189/459 (41%), Positives = 251/459 (54%), Gaps = 25/459 (5%)
 Frame = +1

Query: 97   ERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXX 276
            E  +E+  G C       +SFDLE+AVCS+G FMM+PNRW +  KTL RPLRL +     
Sbjct: 16   ELPLEDGNGYC-------ASFDLEKAVCSHGLFMMAPNRWDTLSKTLERPLRLSENINDD 68

Query: 277  XXXXXXXXXXXX----------RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFH 426
                                  RV     L+   +  +  QV RM+RLS  E+  +  F 
Sbjct: 69   DHEQSVLVQITQPSDYPHSLLLRVLDTDSLSTIHQRSLLGQVRRMVRLSVEENKRVKLFQ 128

Query: 427  RVHSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK------- 585
             +  +AK +GFGRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ EL        
Sbjct: 129  EICGEAKERGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAAS 188

Query: 586  -GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE----CSEILRPAKLRFDETSQCKVMS 750
               P               F P+TP    L++R     CS  L       +E        
Sbjct: 189  FPDPDNQNQLKGVTSKSEHFTPRTPAGKELRKRAGAYGCSRNLLERLNEVEEIVDIDKPG 248

Query: 751  GKISEGCSIVSEI--SITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKEL 924
              ++   S+  E+      C     V  +  S+ L  +  +  +L S    GNFP+PK+L
Sbjct: 249  VTVTPAFSVGEEVLQKSNLCQDTTEVWEVSVSAPLNPDPSEDRKLSSFNQLGNFPSPKQL 308

Query: 925  ASLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKA-YLQKL 1101
            ASL E++L KRCG+GYRA RI+ LAK I  GSI L+ LE    +  +   D  A  L+++
Sbjct: 309  ASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLNELEEACSNPSLSNYDKMAEQLREI 368

Query: 1102 DGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQ 1281
            DG G FTC  VLMC+G Y  +PTD+ET+RHLKQV  R+  TI++V  DVE +Y KY PFQ
Sbjct: 369  DGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTS-TIQNVQRDVENIYGKYAPFQ 427

Query: 1282 FLAYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMKEER 1398
            FLAYW E+W  YE++FG+LS MP S+Y LI+  NM+ +R
Sbjct: 428  FLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMRPKR 466


>ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781827 [Glycine max]
          Length = 443

 Score =  287 bits (734), Expect = 1e-74
 Identities = 179/429 (41%), Positives = 233/429 (54%), Gaps = 12/429 (2%)
 Frame = +1

Query: 139  VSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXXRV 318
            + + S F LE+AVCS+G FMM PN W    KTL RPLR                    RV
Sbjct: 18   MELPSPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLR-SSPSSFLVSLSQHSQSLAVRV 76

Query: 319  FGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVH--SQAKSQGFGRVFRSPTLFE 492
                 L+ Q +NHI AQV RMLR SE E+ A+  F  +H          GRVFRSPTLFE
Sbjct: 77   HATHALSPQQQNHITAQVSRMLRFSEAEEKAVREFRSLHVVDHPNRSFSGRVFRSPTLFE 136

Query: 493  DIVKAFLLCNCRWQRTLSMAASLCDLQSELK-GKPLGWXXXXXXXXXXXXFYPKTPTKTR 669
            D+VK  LLCNC+W RTLSMA +LC+LQ EL+ G P               F PKTP    
Sbjct: 137  DMVKCILLCNCQWPRTLSMAQALCELQLELQNGSPCTIAVSGNSKGESEGFIPKTPASKE 196

Query: 670  LKRRECSE--ILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSS 843
             +R + S   +    KL  D   Q   +    S   ++             +  + G S 
Sbjct: 197  TRRNKVSTKGMFCKKKLELDGNLQIDHVVASSSTATTL-------------LTTDNGDSE 243

Query: 844  KLTSNTMDTSELPSN-----LMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQI 1008
            +L S+  D+    SN        GNFP+P ELA+L E++L KRCG+GYRA  I+ LA+ I
Sbjct: 244  ELRSH--DSCHEFSNGNEYFSRTGNFPSPSELANLDESFLAKRCGLGYRAGYIIELARAI 301

Query: 1009 CNGSIDLDSLE--NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTET 1182
              G I L  LE  + D S+    + L   L+++ G G FT   VLMC+G Y  +PTD+ET
Sbjct: 302  VEGKIQLGQLEELSKDASLS-NYKQLDDQLKQIRGYGPFTRANVLMCLGYYHVIPTDSET 360

Query: 1183 VRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDY 1362
            VRHLKQV  R   T K++  ++EE+Y KY P+QFLA+W E+WD YE +FG+L+ M  SDY
Sbjct: 361  VRHLKQVHSRY-TTSKTIERELEEIYGKYEPYQFLAFWSEVWDFYETRFGKLNEMHSSDY 419

Query: 1363 GLISGHNMK 1389
             LI+  NM+
Sbjct: 420  KLITACNMR 428


>ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766322 [Setaria italica]
          Length = 461

 Score =  285 bits (729), Expect = 5e-74
 Identities = 171/446 (38%), Positives = 237/446 (53%), Gaps = 32/446 (7%)
 Frame = +1

Query: 157  FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXX------RV 318
            FDL  AVCS+G FMM+PNRW  A + L RPLRL  +                       V
Sbjct: 36   FDLAAAVCSHGLFMMAPNRWDPAARALVRPLRLASDRSASLLARVSAHPARPGTALLVAV 95

Query: 319  FGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLFEDI 498
             G   L+  D ++I  QV RMLRLSE +  A+  F  +H+ A+ +GFGR+FRSPTLFED+
Sbjct: 96   EGADALSSLDRDYILEQVRRMLRLSEEDGAAVAEFQAMHAAAREEGFGRIFRSPTLFEDM 155

Query: 499  VKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTP----TKT 666
            VK  LLCNC+W RTLSMA +LC++Q ELK                  F  +TP     K 
Sbjct: 156  VKCILLCNCQWTRTLSMATALCEIQLELK-----------CSSSVEDFQSRTPPIRERKR 204

Query: 667  RLKRRECSEILRPAKLRFDETSQCKVMSGK------------ISEGCSIVSEISITKCDG 810
            +  +R+   I    +   D+     + SG             +S   S+ SE   + CD 
Sbjct: 205  KRSKRQSVRIKLETRFAEDKLEGPTIASGTSNDLTHPETNEYLSSLASVASETG-SACDS 263

Query: 811  EYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRIL 990
               +P+L +S    +N     +       G+FPTP+ELA+L E +L KRC +GYRA+RI+
Sbjct: 264  ---LPSLDNSELSLNNAPGLED-----CIGDFPTPEELANLDEGFLAKRCNLGYRAKRIV 315

Query: 991  NLAKQICNGSIDLDSLE----------NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLM 1140
             LA+ +  G + L  LE              +++   E L   L  + G G FT   VLM
Sbjct: 316  MLARGVVEGKVCLQKLEEMCRISVPAAEEVSTIESACERLNKELSAISGFGPFTRANVLM 375

Query: 1141 CMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYE 1320
            CMG    +P DTET+RHLKQV  R+  TI SV  +++++Y KY PFQFLAYW+E+W  Y 
Sbjct: 376  CMGFNHTIPADTETIRHLKQVHKRAS-TISSVHQELDKIYGKYAPFQFLAYWFELWGFYN 434

Query: 1321 KQFGRLSHMPPSDYGLISGHNMKEER 1398
            KQFG++  M PS+Y L +  ++K+ +
Sbjct: 435  KQFGKICEMEPSNYRLFTASHLKKAK 460


>gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 467

 Score =  285 bits (728), Expect = 7e-74
 Identities = 182/452 (40%), Positives = 244/452 (53%), Gaps = 18/452 (3%)
 Frame = +1

Query: 88   SLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEX 267
            S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D  
Sbjct: 43   SCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLDHH 101

Query: 268  XXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRV 432
                                RV+G   L+ Q  + +  QV RMLRLSE E+  +  F ++
Sbjct: 102  SPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFRKI 161

Query: 433  ----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSEL 582
                H + ++         GRVFRSPTLFED+VK  LLCNC++ RTLSMA +LC+LQ E 
Sbjct: 162  VEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQFET 221

Query: 583  KGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKIS 762
            +    G             F PKTP    LKR          KLR               
Sbjct: 222  QRPFSG------VRAAEDDFIPKTPAGNELKR----------KLR--------------- 250

Query: 763  EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSEN 942
                 VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L E+
Sbjct: 251  -----VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLDES 304

Query: 943  YLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDGVG 1113
            +L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L   L+++DG G
Sbjct: 305  FLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDGFG 362

Query: 1114 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1293
             FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFLAY
Sbjct: 363  PFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFLAY 421

Query: 1294 WWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1389
            W E+W  YE++FG+LS MP   Y LI+  NMK
Sbjct: 422  WAELWHYYEQRFGKLSEMPFCGYKLITASNMK 453


>gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]
          Length = 472

 Score =  284 bits (726), Expect = 1e-73
 Identities = 186/447 (41%), Positives = 240/447 (53%), Gaps = 34/447 (7%)
 Frame = +1

Query: 151  SSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLC------------DEXXXXXXXXXX 294
            ++F LE AVCS+G FMM+PN+W    KTL RPLRL             D+          
Sbjct: 14   ATFRLETAVCSHGLFMMAPNQWDPLSKTLLRPLRLTLHHHHWNPQQQQDDSVMARISQPH 73

Query: 295  XXXXXXRVF---GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGR 465
                  RV    G   LT  ++  + AQV RMLRLS+ E+     F  V+      G GR
Sbjct: 74   DRLHCLRVLVHAGTRSLTSDNKQALLAQVSRMLRLSQTEERICREFSEVYGCGS--GLGR 131

Query: 466  VFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFY 645
            VFRSPTLFED+VK  LLCNC+W RTLSMA +LCDLQ EL+ + +              F 
Sbjct: 132  VFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCDLQRELQLQSV--------PSKTVDFV 183

Query: 646  PKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVP 825
            PKTP     KR+     +   K     TSQ    S +  E  S  +++SI          
Sbjct: 184  PKTPAGKEPKRK-----VEKLKASTCLTSQFDAQSNEGLESHS--NDLSIDISQPTPSAQ 236

Query: 826  NLGSSSKLT----------SNTMDTSEL--PSNLM------AGNFPTPKELASLSENYLT 951
            NL  SS L+          S  +D++ L  P  L        G+FPTP ELA L E +L 
Sbjct: 237  NLSPSSLLSVPMENVTCEESYGVDSASLCNPQILRDREFEGTGDFPTPTELAKLDEKFLA 296

Query: 952  KRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQM-KLEDLKAYLQKLDGVGKFTCD 1128
            KRC +GYRA RIL LA+ I  G I L  LE       +     L   L+++DG G FTC 
Sbjct: 297  KRCKLGYRAGRILKLARGIVEGRIQLRELEETCMERSLCSYSKLAVQLRQIDGFGPFTCA 356

Query: 1129 VVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIW 1308
             VLMCMG Y  +P+D+ET+RHL+QV GR+  T++++  DV+++YAKY PFQFLAYW E+W
Sbjct: 357  NVLMCMGFYHVIPSDSETIRHLQQVHGRNS-TVRTIERDVQQIYAKYEPFQFLAYWSELW 415

Query: 1309 DSYEKQFGRLSHMPPSDYGLISGHNMK 1389
              YEK+FG++S MP S Y L +  NMK
Sbjct: 416  HFYEKKFGKISEMPCSAYKLFTASNMK 442


>gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris]
          Length = 474

 Score =  282 bits (721), Expect = 4e-73
 Identities = 180/445 (40%), Positives = 243/445 (54%), Gaps = 19/445 (4%)
 Frame = +1

Query: 157  FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRL-----CDEXXXXXXXXXXXXXXXXRVF 321
            F L++AVCS+GFFMM+PN W    KTL RPL L                        RV 
Sbjct: 46   FQLDQAVCSHGFFMMAPNHWDPLSKTLTRPLLLHNPSSSSSSSLLVSLSQRPQSLAVRVH 105

Query: 322  GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHS-QAKSQGFG-RVFRSPTLFED 495
             +  ++ Q + HIKAQ+ RMLRLSE E+ A+  F  VH+    ++ FG RVFRSPTLFED
Sbjct: 106  SVHFISPQQQRHIKAQITRMLRLSEAEEKAVREFRSVHAADHPNRSFGGRVFRSPTLFED 165

Query: 496  IVKAFLLCNCRWQRTLSMAASLCDLQSELK-GKPLGWXXXXXXXXXXXXFYPKTPT--KT 666
            +VK  LLCNC+W RTLSMA +LC+LQS L+ G P               F PKTP   + 
Sbjct: 166  MVKCILLCNCQWPRTLSMAQALCELQSGLQNGLPCAVEGSGNPKVEAEEFVPKTPASKEN 225

Query: 667  RLKRRECSEILRPAKLRF------DETSQCKVMSGKISEGCSIVSEISITKCDGEYI-VP 825
            R K+     +L   KL        D   Q   M    S+  +++ ++ + + D      P
Sbjct: 226  RRKKAPTKGVLLKKKLELELEMEVDGNLQMDHMFASSSD-TTLLGDLEVLRSDDSCCQFP 284

Query: 826  NLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 1005
            N G     T               GNFP+P ELA+LSE++L KRC +GYRA  IL LA+ 
Sbjct: 285  NEGEYFDHT---------------GNFPSPIELANLSESFLAKRCKLGYRAGYILELAQG 329

Query: 1006 ICNGSIDLDSLE--NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTE 1179
            I  G I L+ LE  + D S+    + L   L+ + G G FT   VLMC+G Y  +P D+E
Sbjct: 330  IVEGKIQLEQLEELSKDASLSC-YKQLGDQLKPIKGFGPFTRANVLMCLGYYHVIPWDSE 388

Query: 1180 TVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSD 1359
            TVRHLKQV  ++  + K++  D+EE+Y KY P+QFLA+W EIWD YE +FG+++ M  S+
Sbjct: 389  TVRHLKQVHSKNTSS-KTIERDLEEIYGKYEPYQFLAFWSEIWDFYETRFGKMNEMHSSE 447

Query: 1360 YGLISGHNMKEERAVTSIDPDKSQE 1434
            Y  I+  NM+  R  T+     SQ+
Sbjct: 448  YKRITASNMRSTRKATNKRKRPSQK 472


>ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa]
            gi|550342350|gb|EEE79091.2| hypothetical protein
            POPTR_0003s03710g [Populus trichocarpa]
          Length = 489

 Score =  281 bits (720), Expect = 6e-73
 Identities = 182/456 (39%), Positives = 238/456 (52%), Gaps = 42/456 (9%)
 Frame = +1

Query: 154  SFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLC-------------DEXXXXXXXXXX 294
            +F+LE+AVCS+G FMMSPN W     T  RPLRL                          
Sbjct: 29   TFNLEKAVCSHGLFMMSPNHWDPLSLTFSRPLRLSLSDSDPQVSTPTTSLFVSISHPPHL 88

Query: 295  XXXXXXRVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQ------- 453
                  RV+G   L+ + +  + AQVVRMLRLSE ++     F ++   A ++       
Sbjct: 89   PRSLSVRVYGTRCLSPKHQESLVAQVVRMLRLSETDERNAREFRKIAEAAAAEENNSWLT 148

Query: 454  GFG-RVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXX 630
            GFG RVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ EL+ K  G          
Sbjct: 149  GFGGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARALCELQCELQCKSSGVFVAQAVNAT 208

Query: 631  XXX--------FYPKTPTKTRLKRR-ECSEILRPAKLRFDET-------SQCKVMSGKIS 762
                       F P T      KR    S++ +    +  ET       +  K  S  I 
Sbjct: 209  VKNKCNDTAHNFIPNTSAGKESKRNIRASKVTKNLASKIVETETLLEADANLKTDSAHIG 268

Query: 763  -EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPS--NLMAGNFPTPKELASL 933
             E    V   S  +C   +     GS S    +      +    N M  NFP+P+ELA+L
Sbjct: 269  RETLESVENDSCARCSSRH-----GSDSWAPDSLQSQHGIQPGVNKMICNFPSPRELANL 323

Query: 934  SENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENP--DGSVQMKLEDLKAYLQKLDG 1107
             E++L KRC +GYRA RI+ LA+ I  G I L  +E    +G+       L    +++DG
Sbjct: 324  DESFLAKRCNLGYRAIRIIKLAQSIVEGRIPLREVEEDCANGASSSCYNKLADQFRQIDG 383

Query: 1108 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1287
             G FTC  VLMCMG Y  +PTD+ETVRHLKQV  +   TI++V  DVEE+Y KY PFQFL
Sbjct: 384  FGPFTCANVLMCMGFYHIIPTDSETVRHLKQVHAKKS-TIQTVQRDVEEIYGKYAPFQFL 442

Query: 1288 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMKEE 1395
            AYW E+W  YEK+FG+LS +P SDY LI+  NM+ +
Sbjct: 443  AYWAELWHFYEKRFGKLSEIPTSDYKLITASNMRSK 478


>ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629917 isoform X1 [Citrus
            sinensis]
          Length = 454

 Score =  279 bits (713), Expect = 4e-72
 Identities = 177/451 (39%), Positives = 240/451 (53%), Gaps = 30/451 (6%)
 Frame = +1

Query: 124  ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXX 303
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESLLKLPLAETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 304  XXXRVF----------------GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVH 435
               +                      L+ + ++ + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIV 124

Query: 436  SQ-AKSQG---------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 585
             Q A+ +G          GRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ    
Sbjct: 125  RQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQ---- 180

Query: 586  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE-CSEILRPAKLRFDETSQCKVMSGKIS 762
                 W            F P+TP     KRR+  S++      R  E+         + 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235

Query: 763  EGCSIVSEISITKCDGEYIVPNLGSSSKLTS-NTMDTSELPSNL-MAGNFPTPKELASLS 936
              C+ V E ++         P     S L   N + T++ PS     GNFP+P+ELA+L 
Sbjct: 236  LDCAGVLEENVQPS-----FPQNDIESDLHGLNELSTTDPPSARDRIGNFPSPRELANLD 290

Query: 937  ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLE-DLKAYLQKLDGVG 1113
            E++L KRC +GYRA RIL LA+ I +G I L  LE+      +     L   L +++G G
Sbjct: 291  ESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAEQLSQINGFG 350

Query: 1114 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1293
             FT + VL+C+G Y  +PTD+ET+RHLKQV  R+ CT K+V M  E +Y KY PFQFLAY
Sbjct: 351  PFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN-CTSKTVQMIAESIYGKYAPFQFLAY 409

Query: 1294 WWEIWDSYEKQFGRLSHMPPSDYGLISGHNM 1386
            W E+W  YEK+FG+LS MP SDY LI+  NM
Sbjct: 410  WSELWHFYEKRFGKLSEMPYSDYKLITASNM 440


>ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citrus clementina]
            gi|557533482|gb|ESR44600.1| hypothetical protein
            CICLE_v10001110mg [Citrus clementina]
          Length = 454

 Score =  278 bits (711), Expect = 6e-72
 Identities = 173/453 (38%), Positives = 243/453 (53%), Gaps = 32/453 (7%)
 Frame = +1

Query: 124  ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXX 303
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESVLKLPLAETFNLEAAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 304  XXXRVF----------------GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRV- 432
               +                      L+ + ++ + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVRDFKRIV 124

Query: 433  ---------HSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 585
                      SQ  +   GRVFRSPTLFED+VK  LLCNC+W RTL+MA +LC+LQ    
Sbjct: 125  RQVAQEEGEESQYMTDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLNMARALCELQ---- 180

Query: 586  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISE 765
                 W            F P+TP     KRR+     + +K+    TS  ++   K S 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQ-----KVSKVASKLTS--RIAESKASS 228

Query: 766  GCSIVSEISITKCDGEYIVPNLGSSSKLTS----NTMDTSELPSNL-MAGNFPTPKELAS 930
               +  ++  T    E + P+   +   +     N + T++ PS     GNFP+P+ELA+
Sbjct: 229  EDDMNLKLDCTGALEENVQPSFPRNDIESDLHGLNELSTTDPPSACDRIGNFPSPRELAN 288

Query: 931  LSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED-LKAYLQKLDG 1107
            L E++L KRC +GYRA RIL LA+ I +G I L  LE+      +   + L   L +++G
Sbjct: 289  LDESFLAKRCNLGYRAGRILKLAQGIVDGQIQLRELEDTCNEASLTTYNKLAEQLSQING 348

Query: 1108 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1287
             G FT + VL+C+G Y  +PTD+ET+RHLKQV  R+ CT K+V +  E +Y KY PFQFL
Sbjct: 349  FGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN-CTSKTVQIIAESIYGKYSPFQFL 407

Query: 1288 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNM 1386
            AYW E+W  YEK+FG+LS MP SDY LI+  NM
Sbjct: 408  AYWSELWHFYEKRFGKLSEMPYSDYKLITASNM 440


>ref|XP_002519384.1| conserved hypothetical protein [Ricinus communis]
            gi|223541451|gb|EEF43001.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 458

 Score =  276 bits (706), Expect = 2e-71
 Identities = 167/436 (38%), Positives = 235/436 (53%), Gaps = 22/436 (5%)
 Frame = +1

Query: 154  SFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXX---RVFG 324
            +FDLE+ VCS+G FM+SPN W    +T  RPLRL D+                   RV+G
Sbjct: 21   TFDLEKTVCSHGLFMLSPNHWDPLSRTFSRPLRLNDDTDNSLMVSISQHLSKSLLVRVYG 80

Query: 325  ISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGF-------GRVFRSPT 483
               L+ + +  +  Q+VRMLRLS+ ++     F ++ S  + +         GRV RSPT
Sbjct: 81   NRSLSPKHQESLLVQIVRMLRLSDMDEFNAREFRKIVSAFEGEECPLIGDFGGRVLRSPT 140

Query: 484  LFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTK 663
            LFED+VK  LLCNC+W RTLSMA +LC  Q EL  +                F P TP K
Sbjct: 141  LFEDMVKCILLCNCQWSRTLSMADALCKFQIELHSQS------PQQKHAFNHFIPNTPVK 194

Query: 664  TRLKRRECSEILRPAKLRFDETSQCKVMSG---KISEGCSIVSEISITKCDGEYIVPNLG 834
               KR+     +    +  +    C        KIS   + V + S      + +    G
Sbjct: 195  KEPKRKIRLSKVPTESMDLEAADTCLTTDDSQMKISNSLNCVDDGSF-----DNLKSCQG 249

Query: 835  SSSKLTSNTMDTSELPSNLMA--------GNFPTPKELASLSENYLTKRCGVGYRARRIL 990
            S++  ++    TS++ S+L+         GNFP+P+ELA+L E +L KRCG+GYRA RI+
Sbjct: 250  SNTFYSTGPYATSDIQSHLVTQHCAKKTTGNFPSPRELANLDERFLAKRCGLGYRAGRII 309

Query: 991  NLAKQICNGSIDLDSLEN-PDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVP 1167
             LA+ I  G I L   E   +G        L   L++++G G FT   VLMCMG Y  +P
Sbjct: 310  KLAQGIVEGRIPLREFEQVSNGGSLSTYSKLTDQLREIEGFGPFTRANVLMCMGFYHVIP 369

Query: 1168 TDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHM 1347
            TD+ETVRH KQV  ++  TIK+V  + EE+Y K+ PFQFL YW E+W  YE++FG+LS M
Sbjct: 370  TDSETVRHFKQVHAKNS-TIKTVQSEAEEIYRKFAPFQFLVYWAELWHFYEQRFGKLSEM 428

Query: 1348 PPSDYGLISGHNMKEE 1395
            P S+Y LI+  N++ +
Sbjct: 429  PCSNYKLITASNLRNK 444


>gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group]
          Length = 463

 Score =  273 bits (699), Expect = 2e-70
 Identities = 169/439 (38%), Positives = 230/439 (52%), Gaps = 27/439 (6%)
 Frame = +1

Query: 157  FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXXR------V 318
            FDLE AVCS+G FMM+PNRW  A + L RPLRL  +                       V
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 319  FGI--SQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLFE 492
             G     L+  D+  I  QV RMLRL E +  A   F  +H+ A+  GFGR+FRSPTLFE
Sbjct: 97   LGAPGDALSPPDQTSILEQVRRMLRLDEEDGRAAAEFQAMHAVAREAGFGRIFRSPTLFE 156

Query: 493  DIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRL 672
            D+VK  LLCNC+W RTLSM+ +LC+LQ EL+                  F  +TP     
Sbjct: 157  DMVKCILLCNCQWTRTLSMSTALCELQLELRSS-----------SSTENFQSRTPPIREC 205

Query: 673  KRRECSEILRPAKL--RFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSSK 846
            KR+  ++     KL  +F+E     +    ++         ++   +  + +P+  S + 
Sbjct: 206  KRKRSNKRNVRVKLETKFNEDKLVCLEDPNLA-----TDTANLQTYENSFNLPSAASGTG 260

Query: 847  LTSN-TMDTSELPSNL------MAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 1005
             TS  ++D SEL            G+FPTP+ELA+L E++L KRC +GYRARRI+ LA+ 
Sbjct: 261  NTSEVSLDHSELKLRNEPCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARS 320

Query: 1006 ICNGSIDLDSLENPD----------GSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIY 1155
            I  G I L  LE              +     + L   L  + G G FT   VLMCMG +
Sbjct: 321  IVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFF 380

Query: 1156 QCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGR 1335
              +P DTET+RHLKQ   R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG+
Sbjct: 381  HMIPADTETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGK 439

Query: 1336 LSHMPPSDYGLISGHNMKE 1392
            +S M P +Y L +   +K+
Sbjct: 440  ISDMEPINYRLFTASKLKK 458


>gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group]
          Length = 442

 Score =  268 bits (686), Expect = 5e-69
 Identities = 169/433 (39%), Positives = 229/433 (52%), Gaps = 21/433 (4%)
 Frame = +1

Query: 157  FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDE------XXXXXXXXXXXXXXXXRV 318
            FDLE AVCS+G FMM+PNRW  A + L RPLRL  +                       V
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 319  FGI---SQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLF 489
             G      L+  D+  I  QV RMLRL E +  A+  F  +H+ A+  GFGR+FRSPTLF
Sbjct: 97   LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 490  EDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTR 669
            ED++K  LLCNC+W RTLSM+ +LC+LQ EL+                  F  +TP    
Sbjct: 157  EDMIKCILLCNCQWTRTLSMSTALCELQLELRSS-----------SSTENFQSRTPPIRE 205

Query: 670  LKRRECSEILRPAKL--RFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSS 843
             KR+  ++     KL  +F+E     +    ++           T    E +  +L SS+
Sbjct: 206  CKRKRSNKRNVRVKLETKFNEDKMVCLEDPNLA-----------TNTANENLF-SLPSSA 253

Query: 844  KLTSNTMDTS----------ELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILN 993
              T NT + S          EL      G+FPTP+ELA+L E++L KRC +GYRARRI+ 
Sbjct: 254  NETGNTSEVSLDHSELKLRYELCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVM 313

Query: 994  LAKQICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTD 1173
            LA+ I  G I L  LE      ++ +E+L      + G+  F    VLMCMG +  +P D
Sbjct: 314  LARSIVEGKICLQKLEE---IRKILIEELST----ISGIWPFHSCNVLMCMGFFHMIPAD 366

Query: 1174 TETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPP 1353
            TET+RHLKQ   R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG +S M P
Sbjct: 367  TETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEP 425

Query: 1354 SDYGLISGHNMKE 1392
             +Y L +   +K+
Sbjct: 426  INYRLFTASKLKK 438


>gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 426

 Score =  255 bits (652), Expect = 4e-65
 Identities = 170/452 (37%), Positives = 228/452 (50%), Gaps = 18/452 (3%)
 Frame = +1

Query: 88   SLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEX 267
            S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D  
Sbjct: 28   SCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLDHH 86

Query: 268  XXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRV 432
                                RV+G   L+ Q  + +  QV RMLRLSE E+  +  F ++
Sbjct: 87   SPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFRKI 146

Query: 433  ----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSEL 582
                H + ++         GRVFRSPTLFED+VK  LLCNC+                  
Sbjct: 147  VEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQ------------------ 188

Query: 583  KGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKIS 762
                               F PKTP    LKR          KLR               
Sbjct: 189  --------------AAEDDFIPKTPAGNELKR----------KLR--------------- 209

Query: 763  EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSEN 942
                 VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L E+
Sbjct: 210  -----VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLDES 263

Query: 943  YLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDGVG 1113
            +L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L   L+++DG G
Sbjct: 264  FLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDGFG 321

Query: 1114 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1293
             FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFLAY
Sbjct: 322  PFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFLAY 380

Query: 1294 WWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1389
            W E+W  YE++FG+LS MP   Y LI+  NMK
Sbjct: 381  WAELWHYYEQRFGKLSEMPFCGYKLITASNMK 412


>dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Group]
            gi|50510134|dbj|BAD31099.1| hypothetical protein [Oryza
            sativa Japonica Group]
          Length = 501

 Score =  252 bits (643), Expect = 5e-64
 Identities = 169/481 (35%), Positives = 230/481 (47%), Gaps = 69/481 (14%)
 Frame = +1

Query: 157  FDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXXXXXR------V 318
            FDLE AVCS+G FMM+PNRW  A + L RPLRL  +                       V
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 319  FGISQ---LTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLF 489
             G      L+  D+  I  QV RMLRL E +  A+  F  +H+ A+  GFGR+FRSPTLF
Sbjct: 97   LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 490  EDIVKAFLLCNC------------------------------------------RWQRTL 543
            ED++K  LLCNC                                          RW RTL
Sbjct: 157  EDMIKCILLCNCQFSLPLPLPSLASTSMRNSDTNMSRYLGIAIFHLHSTVLFNCRWTRTL 216

Query: 544  SMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKL--R 717
            SM+ +LC+LQ EL+                  F  +TP     KR+  ++     KL  +
Sbjct: 217  SMSTALCELQLELRSSS-----------STENFQSRTPPIRECKRKRSNKRNVRVKLETK 265

Query: 718  FDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNL-- 891
            F+E     +    ++   +  +  S+     E      G++S+++   +D SEL      
Sbjct: 266  FNEDKMVCLEDPNLATNTANENLFSLPSSANE-----TGNTSEVS---LDHSELKLRYEL 317

Query: 892  ----MAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPD--- 1050
                  G+FPTP+ELA+L E++L KRC +GYRARRI+ LA+ I  G I L  LE      
Sbjct: 318  CLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEIRKMS 377

Query: 1051 -------GSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVG 1209
                    +     + L   L  + G G FT   VLMCMG +  +P DTET+RHLKQ   
Sbjct: 378  VPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQFHK 437

Query: 1210 RSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1389
            R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG +S M P +Y L +   +K
Sbjct: 438  RAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEPINYRLFTASKLK 496

Query: 1390 E 1392
            +
Sbjct: 497  K 497


>gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 421

 Score =  248 bits (634), Expect = 5e-63
 Identities = 165/420 (39%), Positives = 223/420 (53%), Gaps = 18/420 (4%)
 Frame = +1

Query: 88   SLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEX 267
            S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D  
Sbjct: 43   SCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLDHH 101

Query: 268  XXXXXXXXXXXXXXX-----RVFGISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRV 432
                                RV+G   L+ Q  + +  QV RMLRLSE E+  +  F ++
Sbjct: 102  SPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFRKI 161

Query: 433  ----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSEL 582
                H + ++         GRVFRSPTLFED+VK  LLCNC++ RTLSMA +LC+LQ E 
Sbjct: 162  VEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQFET 221

Query: 583  KGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKIS 762
            + +P               F PKTP    LKR          KLR               
Sbjct: 222  Q-RPFS-----GVRAAEDDFIPKTPAGNELKR----------KLR--------------- 250

Query: 763  EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSEN 942
                 VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L E+
Sbjct: 251  -----VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLDES 304

Query: 943  YLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDGVG 1113
            +L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L   L+++DG G
Sbjct: 305  FLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDGFG 362

Query: 1114 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1293
             FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFLAY
Sbjct: 363  PFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFLAY 421


>gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao]
          Length = 374

 Score =  244 bits (622), Expect = 1e-61
 Identities = 153/424 (36%), Positives = 215/424 (50%), Gaps = 5/424 (1%)
 Frame = +1

Query: 121  GECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXX 300
            GEC+      SSF++E+AVC++G FMMSPN W+ + K+L RPLRL D             
Sbjct: 12   GECS------SSFNMEKAVCNHGLFMMSPNVWIPSTKSLRRPLRLADSSGSVYVTISHPA 65

Query: 301  XXXX----RVFGI-SQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGR 465
                    +V G+ + ++  D+  I  QV RMLR+S  ++  +  F  +H  AK +GFGR
Sbjct: 66   PNHPFLVIQVNGLQNSISSADKAVIMEQVARMLRISSKDERDVREFQTLHGSAKDRGFGR 125

Query: 466  VFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFY 645
            +FRSP+ FED VK+ LLCNC                        GW              
Sbjct: 126  IFRSPSFFEDAVKSILLCNC------------------------GWK------------- 148

Query: 646  PKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVP 825
             +T T   + R  C+  L+ A             + KIS           TK        
Sbjct: 149  -RTLT---MARALCALQLQLASAHLQHKRVASNSNVKIS-----------TKRLKHKKYT 193

Query: 826  NLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 1005
               S+S+L+ +  D S        GNFPT  ELA L E YL +RC +GYRAR IL LA++
Sbjct: 194  KASSTSELSMSGFDQS-------IGNFPTSTELACLDEKYLNERCNLGYRARCILQLARK 246

Query: 1006 ICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTETV 1185
            + NG ++L+ LE  + S     E     L K+ G G F C  ++MC+G Y+ +P D+ET+
Sbjct: 247  VENGELELNKLE--ESSDTTSYERFYQKLMKIKGFGPFVCSNIMMCIGFYERIPFDSETI 304

Query: 1186 RHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDYG 1365
            RHLK V G+  C+ K++  D+EE+Y KY PFQ +AYW E+ D YE +FG+LS +  S Y 
Sbjct: 305  RHLKMVHGKGKCSRKTIEKDIEEIYGKYAPFQCMAYWLELLDEYENKFGKLSELESSSYH 364

Query: 1366 LISG 1377
            L +G
Sbjct: 365  LATG 368


>ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629917 isoform X2 [Citrus
            sinensis]
          Length = 409

 Score =  238 bits (607), Expect = 7e-60
 Identities = 158/420 (37%), Positives = 218/420 (51%), Gaps = 30/420 (7%)
 Frame = +1

Query: 124  ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSAEKTLYRPLRLCDEXXXXXXXXXXXXX 303
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESLLKLPLAETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 304  XXXRVF----------------GISQLTHQDENHIKAQVVRMLRLSEHEDDAIDGFHRVH 435
               +                      L+ + ++ + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIV 124

Query: 436  SQ-AKSQG---------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 585
             Q A+ +G          GRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ    
Sbjct: 125  RQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQ---- 180

Query: 586  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE-CSEILRPAKLRFDETSQCKVMSGKIS 762
                 W            F P+TP     KRR+  S++      R  E+         + 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235

Query: 763  EGCSIVSEISITKCDGEYIVPNLGSSSKLTS-NTMDTSELPS-NLMAGNFPTPKELASLS 936
              C+ V E ++     +   P     S L   N + T++ PS     GNFP+P+ELA+L 
Sbjct: 236  LDCAGVLEENV-----QPSFPQNDIESDLHGLNELSTTDPPSARDRIGNFPSPRELANLD 290

Query: 937  ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQM-KLEDLKAYLQKLDGVG 1113
            E++L KRC +GYRA RIL LA+ I +G I L  LE+      +     L   L +++G G
Sbjct: 291  ESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAEQLSQINGFG 350

Query: 1114 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1293
             FT + VL+C+G Y  +PTD+ET+RHLKQV  R +CT K+V M  E +Y KY PFQFLAY
Sbjct: 351  PFTRNNVLVCIGFYHVIPTDSETIRHLKQVHAR-NCTSKTVQMIAESIYGKYAPFQFLAY 409


>gb|EMT03969.1| hypothetical protein F775_22747 [Aegilops tauschii]
          Length = 333

 Score =  225 bits (573), Expect = 6e-56
 Identities = 133/341 (39%), Positives = 188/341 (55%), Gaps = 19/341 (5%)
 Frame = +1

Query: 430  VHSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXX 609
            +H+ A+  GFGR+FRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ ELK        
Sbjct: 1    MHAAAREAGFGRIFRSPTLFEDMVKCILLCNCQWTRTLSMATALCELQLELK-------- 52

Query: 610  XXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETS-QC-------KVMSGKISE 765
                         +TP     KR+         KL    T  +C            +++ 
Sbjct: 53   ---CSAGTEDLQLRTPPIREHKRKRSKNQNVRVKLEKKFTELECLEDPRVETAQDTRVAT 109

Query: 766  GCSIVSEISITKCDGEYI-VPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSEN 942
            G S V  I+  + D +   +P +   +     + D+SEL      G+FPTP+ELA+L E+
Sbjct: 110  GTSDV--ITHLEADEKLASLPQVAPETGSVCQSFDSSELSLEGCIGDFPTPEELANLDED 167

Query: 943  YLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPD----------GSVQMKLEDLKAYL 1092
            +L KRCG+GYRA RI+ LA+ I  G +   +LE              ++    E L   L
Sbjct: 168  FLAKRCGLGYRAERIVLLARSIVEGKVCPQNLEEMQKMSLPATEELSTIPSTYERLNNEL 227

Query: 1093 QKLDGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYV 1272
              + G G FT   VLMCMG +  +P DTET+RHLKQ    +  TIKSV M+++++Y +Y 
Sbjct: 228  TTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQCHEIAS-TIKSVHMELDKIYGEYA 286

Query: 1273 PFQFLAYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMKEE 1395
            PFQFLAYW+E+W  Y+KQFG+++ M PS Y L +   +K++
Sbjct: 287  PFQFLAYWFELWGFYDKQFGKITEMDPSTYRLFTASALKKQ 327

