BLASTX nr result

ID: Ephedra25_contig00022251 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra25_contig00022251
         (1359 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [A...   340   6e-91
ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247...   314   6e-83
ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593...   313   8e-83
ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766...   284   5e-74
ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781...   284   7e-74
gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao]    283   1e-73
gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]     283   2e-73
ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Popu...   281   6e-73
ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629...   280   1e-72
ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citr...   279   2e-72
gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus...   276   1e-71
ref|XP_002519384.1| conserved hypothetical protein [Ricinus comm...   275   2e-71
gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi...   271   6e-70
gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo...   266   1e-68
gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao]    254   8e-65
dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Grou...   249   1e-63
gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao]    247   9e-63
gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao]   243   1e-61
ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629...   239   3e-60
gb|EMT03969.1| hypothetical protein F775_22747 [Aegilops tauschii]    224   7e-56

>ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda]
            gi|548856677|gb|ERN14505.1| hypothetical protein
            AMTR_s00038p00020700 [Amborella trichopoda]
          Length = 458

 Score =  340 bits (873), Expect = 6e-91
 Identities = 195/454 (42%), Positives = 253/454 (55%), Gaps = 30/454 (6%)
 Frame = +3

Query: 84   GGECT-LRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXX 260
            G E T L + V  SF+LE+AVCS+GFFMM+PN W S  +TL RPLRL D           
Sbjct: 4    GAERTVLTLPVNESFELEKAVCSHGFFMMAPNLWFSSSQTLQRPLRLTDRSSVPVRITQL 63

Query: 261  XXXXXXR----VFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFG 428
                       V G S+L   D+ ++ AQV RMLR+SE +D  ++ FH ++  AK  GFG
Sbjct: 64   SLSSQKSLQILVLGASKLYQHDQQYLLAQVARMLRISEEDDLKVNKFHEMYPVAKETGFG 123

Query: 429  RVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXF 608
            RVFRSPTLFED+VK+ LLCNC+W RTLSMA +LC+LQ EL G  L               
Sbjct: 124  RVFRSPTLFEDMVKSILLCNCQWTRTLSMARALCELQLELNGNSLRQSNKDTDFSKSVNL 183

Query: 609  YPKTPTKTRLKRRE-----------------------CSEILRPAKLR--FDETSQCKVM 713
             P TP +   K+R                          E LRP  L   F + S     
Sbjct: 184  SPVTPMQLEHKKRRKNPNQNIIMNLMTKFSENETHLAADESLRPIDLAKDFSKNSPTMFS 243

Query: 714  SGKISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELA 893
            S +   G     ++S  K         LG  + L +  ++   L   L AGNFP P+ELA
Sbjct: 244  SEEGRNGKLNYDQVSEEK---------LGDGAILDNQLLENKTLSFFLEAGNFPCPEELA 294

Query: 894  SLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDG 1073
            +L E  L KRC VG+R++RI+ LA+ I  G++DL  +E       + L+ L   L  + G
Sbjct: 295  NLDEKILEKRCKVGFRSKRIVKLAQSIVEGALDLGKIEVLSQQDPIHLDGLMRQLLSIYG 354

Query: 1074 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1253
            VG + C+ VLM MGIYQ +P DTET+RHLKQ   R  CTI ++  D+EE+Y K+ PFQFL
Sbjct: 355  VGPYVCNNVLMSMGIYQRIPADTETLRHLKQFHARKQCTIGTIQKDIEEIYGKHEPFQFL 414

Query: 1254 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1355
             YW E+W+ YEK+FG+LS MPPSDY LI+ HNMK
Sbjct: 415  VYWSEMWEFYEKRFGKLSQMPPSDYELITAHNMK 448


>ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247758 [Solanum
            lycopersicum]
          Length = 483

 Score =  314 bits (804), Expect = 6e-83
 Identities = 188/456 (41%), Positives = 249/456 (54%), Gaps = 25/456 (5%)
 Frame = +3

Query: 63   ERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXX 242
            E  +E+  G C       +SFDLE+AVCS+G FMM+PNRW +  KTL RPLRL +     
Sbjct: 16   ELPLEDGNGYC-------ASFDLEKAVCSHGLFMMAPNRWDTLSKTLERPLRLSENINDD 68

Query: 243  XXXXXXXXXXXX----------RVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFH 392
                                  RV     L+   +  +  QV RM+RLS  E+  +  F 
Sbjct: 69   DHEQSVLVQITQPSDYPHSLLLRVLDTDSLSTIHQRSLLGQVRRMVRLSVEENKRVKLFQ 128

Query: 393  RVHSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK------- 551
             +  +AK +GFGRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ EL        
Sbjct: 129  EICGEAKERGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAAS 188

Query: 552  -GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE----CSEILRPAKLRFDETSQCKVMS 716
               P               F P+TP    L++R     CS  L       +E        
Sbjct: 189  FPDPDNQNQLKGVTSKSEHFTPRTPAGKELRKRAGAYGCSRNLLERLNEVEEIVDIDKPG 248

Query: 717  GKISEGCSIVSEI--SITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKEL 890
              ++   S+  E+      C     V  +  S+ L  +  +  +L S    GNFP+PK+L
Sbjct: 249  VTVTPAFSVGEEVLQKSNLCQDTTEVWEVSVSAPLNPDPSEDRKLSSFNQLGNFPSPKQL 308

Query: 891  ASLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKA-YLQKL 1067
            ASL E++L KRCG+GYRA RI+ LAK I  GSI L+ LE    +  +   D  A  L+++
Sbjct: 309  ASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLNELEEACSNPSLSNYDKMAEQLREI 368

Query: 1068 DGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQ 1247
            DG G FTC  VLMC+G Y  +PTD+ET+RHLKQV  R+  TI++V  DVE +Y KY PFQ
Sbjct: 369  DGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTS-TIQNVQRDVENIYGKYAPFQ 427

Query: 1248 FLAYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1355
            FLAYW E+W  YE++FG+LS MP S+Y LI+  NM+
Sbjct: 428  FLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMR 463


>ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593287 isoform X1 [Solanum
            tuberosum] gi|565385158|ref|XP_006358485.1| PREDICTED:
            uncharacterized protein LOC102593287 isoform X2 [Solanum
            tuberosum]
          Length = 485

 Score =  313 bits (803), Expect = 8e-83
 Identities = 184/438 (42%), Positives = 244/438 (55%), Gaps = 25/438 (5%)
 Frame = +3

Query: 117  SSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXXXXX------ 278
            ++FDLE+AVCS+G FMM+PNRW S  KTL RPL L +                       
Sbjct: 29   ATFDLEKAVCSHGLFMMAPNRWDSLSKTLERPLHLSENINDDDHEQSVLVQINQPSDSPH 88

Query: 279  ----RVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSP 446
                RVFG + L+   +  +  QV RM+RLS  E+  +  F  +  +AK +G GRVFRSP
Sbjct: 89   SLLLRVFGTASLSTIHQRSLLGQVRRMVRLSVEENKRVKQFQEICGEAKDRGLGRVFRSP 148

Query: 447  TLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK--------GKPLGWXXXXXXXXXXX 602
            TLFED+VK  LLCNC+W RTLSMA +LC+LQ EL           P              
Sbjct: 149  TLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAASFPDPDNQNQLKGVTFKSE 208

Query: 603  XFYPKTPTKTRLKRRE----CSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEI--SIT 764
             F P+TP     ++R     CS  L       +E          ++   S+  E+     
Sbjct: 209  HFTPRTPAGKESRKRAGAYGCSRKLLERLTEVEEIIDIGKPGVTVTPAFSVGEEVLKKSN 268

Query: 765  KCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRA 944
             C     V ++G+S+    +  +  +L S    GNFP+PKELASL E++L KRCG+GYRA
Sbjct: 269  LCRDTTEVCDVGTSAPFNLDPSEDRKLSSFNQLGNFPSPKELASLDESFLAKRCGLGYRA 328

Query: 945  RRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKA-YLQKLDGVGKFTCDVVLMCMGIY 1121
             RI+ LAK I  GSI L  LE    +  +   D  A  L+++DG G FTC  VLMC+G Y
Sbjct: 329  GRIIKLAKGIVEGSIQLKELEEACSNPSLSDYDKMAEQLREIDGFGPFTCANVLMCLGYY 388

Query: 1122 QCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGR 1301
              +PTD+ET+RHLKQV  R+  TI++V  DVE +Y KY PFQFLAYW E+W  YE++FG+
Sbjct: 389  HVIPTDSETIRHLKQVHARTS-TIQNVQRDVENIYGKYAPFQFLAYWSEVWHFYEERFGK 447

Query: 1302 LSHMPPSDYGLISGHNMK 1355
            LS MP S+Y LI+  NM+
Sbjct: 448  LSEMPHSEYKLITAANMR 465


>ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766322 [Setaria italica]
          Length = 461

 Score =  284 bits (727), Expect = 5e-74
 Identities = 171/444 (38%), Positives = 235/444 (52%), Gaps = 32/444 (7%)
 Frame = +3

Query: 123  FDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXXXXX------RV 284
            FDL  AVCS+G FMM+PNRW    + L RPLRL  +                       V
Sbjct: 36   FDLAAAVCSHGLFMMAPNRWDPAARALVRPLRLASDRSASLLARVSAHPARPGTALLVAV 95

Query: 285  FGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLFEDI 464
             G   L+  D D+I  QV RMLRLSE +  A+  F  +H+ A+ +GFGR+FRSPTLFED+
Sbjct: 96   EGADALSSLDRDYILEQVRRMLRLSEEDGAAVAEFQAMHAAAREEGFGRIFRSPTLFEDM 155

Query: 465  VKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTP----TKT 632
            VK  LLCNC+W RTLSMA +LC++Q ELK                  F  +TP     K 
Sbjct: 156  VKCILLCNCQWTRTLSMATALCEIQLELK-----------CSSSVEDFQSRTPPIRERKR 204

Query: 633  RLKRRECSEILRPAKLRFDETSQCKVMSGK------------ISEGCSIVSEISITKCDG 776
            +  +R+   I    +   D+     + SG             +S   S+ SE   + CD 
Sbjct: 205  KRSKRQSVRIKLETRFAEDKLEGPTIASGTSNDLTHPETNEYLSSLASVASETG-SACDS 263

Query: 777  EYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRIL 956
               +P+L +S    +N     +       G+FPTP+ELA+L E +L KRC +GYRA+RI+
Sbjct: 264  ---LPSLDNSELSLNNAPGLED-----CIGDFPTPEELANLDEGFLAKRCNLGYRAKRIV 315

Query: 957  NLAKQICNGSIDLDSLE----------NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLM 1106
             LA+ +  G + L  LE              +++   E L   L  + G G FT   VLM
Sbjct: 316  MLARGVVEGKVCLQKLEEMCRISVPAAEEVSTIESACERLNKELSAISGFGPFTRANVLM 375

Query: 1107 CMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYE 1286
            CMG    +P DTET+RHLKQV  R+  TI SV  +++++Y KY PFQFLAYW+E+W  Y 
Sbjct: 376  CMGFNHTIPADTETIRHLKQVHKRAS-TISSVHQELDKIYGKYAPFQFLAYWFELWGFYN 434

Query: 1287 KQFGRLSHMPPSDYGLISGHNMKE 1358
            KQFG++  M PS+Y L +  ++K+
Sbjct: 435  KQFGKICEMEPSNYRLFTASHLKK 458


>ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781827 [Glycine max]
          Length = 443

 Score =  284 bits (726), Expect = 7e-74
 Identities = 178/429 (41%), Positives = 233/429 (54%), Gaps = 12/429 (2%)
 Frame = +3

Query: 105  VSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXXXXXRV 284
            + + S F LE+AVCS+G FMM PN W    KTL RPLR                    RV
Sbjct: 18   MELPSPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLR-SSPSSFLVSLSQHSQSLAVRV 76

Query: 285  FGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVH--SQAKSQGFGRVFRSPTLFE 458
                 L+ Q ++HI AQV RMLR SE E+ A+  F  +H          GRVFRSPTLFE
Sbjct: 77   HATHALSPQQQNHITAQVSRMLRFSEAEEKAVREFRSLHVVDHPNRSFSGRVFRSPTLFE 136

Query: 459  DIVKAFLLCNCRWQRTLSMAASLCDLQSELK-GKPLGWXXXXXXXXXXXXFYPKTPTKTR 635
            D+VK  LLCNC+W RTLSMA +LC+LQ EL+ G P               F PKTP    
Sbjct: 137  DMVKCILLCNCQWPRTLSMAQALCELQLELQNGSPCTIAVSGNSKGESEGFIPKTPASKE 196

Query: 636  LKRRECSE--ILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSS 809
             +R + S   +    KL  D   Q   +    S   ++             +  + G S 
Sbjct: 197  TRRNKVSTKGMFCKKKLELDGNLQIDHVVASSSTATTL-------------LTTDNGDSE 243

Query: 810  KLTSNTMDTSELPSN-----LMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQI 974
            +L S+  D+    SN        GNFP+P ELA+L E++L KRCG+GYRA  I+ LA+ I
Sbjct: 244  ELRSH--DSCHEFSNGNEYFSRTGNFPSPSELANLDESFLAKRCGLGYRAGYIIELARAI 301

Query: 975  CNGSIDLDSLE--NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTET 1148
              G I L  LE  + D S+    + L   L+++ G G FT   VLMC+G Y  +PTD+ET
Sbjct: 302  VEGKIQLGQLEELSKDASLS-NYKQLDDQLKQIRGYGPFTRANVLMCLGYYHVIPTDSET 360

Query: 1149 VRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDY 1328
            VRHLKQV  R   T K++  ++EE+Y KY P+QFLA+W E+WD YE +FG+L+ M  SDY
Sbjct: 361  VRHLKQVHSRY-TTSKTIERELEEIYGKYEPYQFLAFWSEVWDFYETRFGKLNEMHSSDY 419

Query: 1329 GLISGHNMK 1355
             LI+  NM+
Sbjct: 420  KLITACNMR 428


>gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 467

 Score =  283 bits (724), Expect = 1e-73
 Identities = 182/452 (40%), Positives = 243/452 (53%), Gaps = 18/452 (3%)
 Frame = +3

Query: 54   SLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEX 233
            S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D  
Sbjct: 43   SCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLDHH 101

Query: 234  XXXXXXXXXXXXXXX-----RVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRV 398
                                RV+G   L+ Q    +  QV RMLRLSE E+  +  F ++
Sbjct: 102  SPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFRKI 161

Query: 399  ----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSEL 548
                H + ++         GRVFRSPTLFED+VK  LLCNC++ RTLSMA +LC+LQ E 
Sbjct: 162  VEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQFET 221

Query: 549  KGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKIS 728
            +    G             F PKTP    LKR          KLR               
Sbjct: 222  QRPFSG------VRAAEDDFIPKTPAGNELKR----------KLR--------------- 250

Query: 729  EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSEN 908
                 VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L E+
Sbjct: 251  -----VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLDES 304

Query: 909  YLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDGVG 1079
            +L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L   L+++DG G
Sbjct: 305  FLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDGFG 362

Query: 1080 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1259
             FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFLAY
Sbjct: 363  PFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFLAY 421

Query: 1260 WWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1355
            W E+W  YE++FG+LS MP   Y LI+  NMK
Sbjct: 422  WAELWHYYEQRFGKLSEMPFCGYKLITASNMK 453


>gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]
          Length = 472

 Score =  283 bits (723), Expect = 2e-73
 Identities = 186/447 (41%), Positives = 240/447 (53%), Gaps = 34/447 (7%)
 Frame = +3

Query: 117  SSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLC------------DEXXXXXXXXXX 260
            ++F LE AVCS+G FMM+PN+W    KTL RPLRL             D+          
Sbjct: 14   ATFRLETAVCSHGLFMMAPNQWDPLSKTLLRPLRLTLHHHHWNPQQQQDDSVMARISQPH 73

Query: 261  XXXXXXRVF---GISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGR 431
                  RV    G   LT  ++  + AQV RMLRLS+ E+     F  V+      G GR
Sbjct: 74   DRLHCLRVLVHAGTRSLTSDNKQALLAQVSRMLRLSQTEERICREFSEVYGCGS--GLGR 131

Query: 432  VFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFY 611
            VFRSPTLFED+VK  LLCNC+W RTLSMA +LCDLQ EL+ + +              F 
Sbjct: 132  VFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCDLQRELQLQSV--------PSKTVDFV 183

Query: 612  PKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVP 791
            PKTP     KR+     +   K     TSQ    S +  E  S  +++SI          
Sbjct: 184  PKTPAGKEPKRK-----VEKLKASTCLTSQFDAQSNEGLESHS--NDLSIDISQPTPSAQ 236

Query: 792  NLGSSSKLT----------SNTMDTSEL--PSNLM------AGNFPTPKELASLSENYLT 917
            NL  SS L+          S  +D++ L  P  L        G+FPTP ELA L E +L 
Sbjct: 237  NLSPSSLLSVPMENVTCEESYGVDSASLCNPQILRDREFEGTGDFPTPTELAKLDEKFLA 296

Query: 918  KRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQM-KLEDLKAYLQKLDGVGKFTCD 1094
            KRC +GYRA RIL LA+ I  G I L  LE       +     L   L+++DG G FTC 
Sbjct: 297  KRCKLGYRAGRILKLARGIVEGRIQLRELEETCMERSLCSYSKLAVQLRQIDGFGPFTCA 356

Query: 1095 VVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIW 1274
             VLMCMG Y  +P+D+ET+RHL+QV GR+  T++++  DV+++YAKY PFQFLAYW E+W
Sbjct: 357  NVLMCMGFYHVIPSDSETIRHLQQVHGRNS-TVRTIERDVQQIYAKYEPFQFLAYWSELW 415

Query: 1275 DSYEKQFGRLSHMPPSDYGLISGHNMK 1355
              YEK+FG++S MP S Y L +  NMK
Sbjct: 416  HFYEKKFGKISEMPCSAYKLFTASNMK 442


>ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa]
            gi|550342350|gb|EEE79091.2| hypothetical protein
            POPTR_0003s03710g [Populus trichocarpa]
          Length = 489

 Score =  281 bits (718), Expect = 6e-73
 Identities = 182/454 (40%), Positives = 238/454 (52%), Gaps = 42/454 (9%)
 Frame = +3

Query: 120  SFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLC-------------DEXXXXXXXXXX 260
            +F+LE+AVCS+G FMMSPN W     T  RPLRL                          
Sbjct: 29   TFNLEKAVCSHGLFMMSPNHWDPLSLTFSRPLRLSLSDSDPQVSTPTTSLFVSISHPPHL 88

Query: 261  XXXXXXRVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQ------- 419
                  RV+G   L+ + ++ + AQVVRMLRLSE ++     F ++   A ++       
Sbjct: 89   PRSLSVRVYGTRCLSPKHQESLVAQVVRMLRLSETDERNAREFRKIAEAAAAEENNSWLT 148

Query: 420  GFG-RVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXX 596
            GFG RVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ EL+ K  G          
Sbjct: 149  GFGGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARALCELQCELQCKSSGVFVAQAVNAT 208

Query: 597  XXX--------FYPKTPTKTRLKRR-ECSEILRPAKLRFDET-------SQCKVMSGKIS 728
                       F P T      KR    S++ +    +  ET       +  K  S  I 
Sbjct: 209  VKNKCNDTAHNFIPNTSAGKESKRNIRASKVTKNLASKIVETETLLEADANLKTDSAHIG 268

Query: 729  -EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPS--NLMAGNFPTPKELASL 899
             E    V   S  +C   +     GS S    +      +    N M  NFP+P+ELA+L
Sbjct: 269  RETLESVENDSCARCSSRH-----GSDSWAPDSLQSQHGIQPGVNKMICNFPSPRELANL 323

Query: 900  SENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENP--DGSVQMKLEDLKAYLQKLDG 1073
             E++L KRC +GYRA RI+ LA+ I  G I L  +E    +G+       L    +++DG
Sbjct: 324  DESFLAKRCNLGYRAIRIIKLAQSIVEGRIPLREVEEDCANGASSSCYNKLADQFRQIDG 383

Query: 1074 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1253
             G FTC  VLMCMG Y  +PTD+ETVRHLKQV  +   TI++V  DVEE+Y KY PFQFL
Sbjct: 384  FGPFTCANVLMCMGFYHIIPTDSETVRHLKQVHAKKS-TIQTVQRDVEEIYGKYAPFQFL 442

Query: 1254 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1355
            AYW E+W  YEK+FG+LS +P SDY LI+  NM+
Sbjct: 443  AYWAELWHFYEKRFGKLSEIPTSDYKLITASNMR 476


>ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629917 isoform X1 [Citrus
            sinensis]
          Length = 454

 Score =  280 bits (715), Expect = 1e-72
 Identities = 178/451 (39%), Positives = 240/451 (53%), Gaps = 30/451 (6%)
 Frame = +3

Query: 90   ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXX 269
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESLLKLPLAETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 270  XXXRVF----------------GISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVH 401
               +                      L+ + +D + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIV 124

Query: 402  SQ-AKSQG---------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 551
             Q A+ +G          GRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ    
Sbjct: 125  RQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQ---- 180

Query: 552  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE-CSEILRPAKLRFDETSQCKVMSGKIS 728
                 W            F P+TP     KRR+  S++      R  E+         + 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235

Query: 729  EGCSIVSEISITKCDGEYIVPNLGSSSKLTS-NTMDTSELPSNL-MAGNFPTPKELASLS 902
              C+ V E ++         P     S L   N + T++ PS     GNFP+P+ELA+L 
Sbjct: 236  LDCAGVLEENVQPS-----FPQNDIESDLHGLNELSTTDPPSARDRIGNFPSPRELANLD 290

Query: 903  ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLE-DLKAYLQKLDGVG 1079
            E++L KRC +GYRA RIL LA+ I +G I L  LE+      +     L   L +++G G
Sbjct: 291  ESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAEQLSQINGFG 350

Query: 1080 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1259
             FT + VL+C+G Y  +PTD+ET+RHLKQV  R+ CT K+V M  E +Y KY PFQFLAY
Sbjct: 351  PFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN-CTSKTVQMIAESIYGKYAPFQFLAY 409

Query: 1260 WWEIWDSYEKQFGRLSHMPPSDYGLISGHNM 1352
            W E+W  YEK+FG+LS MP SDY LI+  NM
Sbjct: 410  WSELWHFYEKRFGKLSEMPYSDYKLITASNM 440


>ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citrus clementina]
            gi|557533482|gb|ESR44600.1| hypothetical protein
            CICLE_v10001110mg [Citrus clementina]
          Length = 454

 Score =  279 bits (713), Expect = 2e-72
 Identities = 174/453 (38%), Positives = 243/453 (53%), Gaps = 32/453 (7%)
 Frame = +3

Query: 90   ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXX 269
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESVLKLPLAETFNLEAAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 270  XXXRVF----------------GISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRV- 398
               +                      L+ + +D + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVRDFKRIV 124

Query: 399  ---------HSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 551
                      SQ  +   GRVFRSPTLFED+VK  LLCNC+W RTL+MA +LC+LQ    
Sbjct: 125  RQVAQEEGEESQYMTDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLNMARALCELQ---- 180

Query: 552  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISE 731
                 W            F P+TP     KRR+     + +K+    TS  ++   K S 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQ-----KVSKVASKLTS--RIAESKASS 228

Query: 732  GCSIVSEISITKCDGEYIVPNLGSSSKLTS----NTMDTSELPSNL-MAGNFPTPKELAS 896
               +  ++  T    E + P+   +   +     N + T++ PS     GNFP+P+ELA+
Sbjct: 229  EDDMNLKLDCTGALEENVQPSFPRNDIESDLHGLNELSTTDPPSACDRIGNFPSPRELAN 288

Query: 897  LSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED-LKAYLQKLDG 1073
            L E++L KRC +GYRA RIL LA+ I +G I L  LE+      +   + L   L +++G
Sbjct: 289  LDESFLAKRCNLGYRAGRILKLAQGIVDGQIQLRELEDTCNEASLTTYNKLAEQLSQING 348

Query: 1074 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1253
             G FT + VL+C+G Y  +PTD+ET+RHLKQV  R+ CT K+V +  E +Y KY PFQFL
Sbjct: 349  FGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN-CTSKTVQIIAESIYGKYSPFQFL 407

Query: 1254 AYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNM 1352
            AYW E+W  YEK+FG+LS MP SDY LI+  NM
Sbjct: 408  AYWSELWHFYEKRFGKLSEMPYSDYKLITASNM 440


>gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris]
          Length = 474

 Score =  276 bits (706), Expect = 1e-71
 Identities = 176/430 (40%), Positives = 237/430 (55%), Gaps = 19/430 (4%)
 Frame = +3

Query: 123  FDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRL-----CDEXXXXXXXXXXXXXXXXRVF 287
            F L++AVCS+GFFMM+PN W    KTL RPL L                        RV 
Sbjct: 46   FQLDQAVCSHGFFMMAPNHWDPLSKTLTRPLLLHNPSSSSSSSLLVSLSQRPQSLAVRVH 105

Query: 288  GISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHS-QAKSQGFG-RVFRSPTLFED 461
             +  ++ Q + HIKAQ+ RMLRLSE E+ A+  F  VH+    ++ FG RVFRSPTLFED
Sbjct: 106  SVHFISPQQQRHIKAQITRMLRLSEAEEKAVREFRSVHAADHPNRSFGGRVFRSPTLFED 165

Query: 462  IVKAFLLCNCRWQRTLSMAASLCDLQSELK-GKPLGWXXXXXXXXXXXXFYPKTPT--KT 632
            +VK  LLCNC+W RTLSMA +LC+LQS L+ G P               F PKTP   + 
Sbjct: 166  MVKCILLCNCQWPRTLSMAQALCELQSGLQNGLPCAVEGSGNPKVEAEEFVPKTPASKEN 225

Query: 633  RLKRRECSEILRPAKLRF------DETSQCKVMSGKISEGCSIVSEISITKCDGEYI-VP 791
            R K+     +L   KL        D   Q   M    S+  +++ ++ + + D      P
Sbjct: 226  RRKKAPTKGVLLKKKLELELEMEVDGNLQMDHMFASSSD-TTLLGDLEVLRSDDSCCQFP 284

Query: 792  NLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 971
            N G     T               GNFP+P ELA+LSE++L KRC +GYRA  IL LA+ 
Sbjct: 285  NEGEYFDHT---------------GNFPSPIELANLSESFLAKRCKLGYRAGYILELAQG 329

Query: 972  ICNGSIDLDSLE--NPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTE 1145
            I  G I L+ LE  + D S+    + L   L+ + G G FT   VLMC+G Y  +P D+E
Sbjct: 330  IVEGKIQLEQLEELSKDASLSC-YKQLGDQLKPIKGFGPFTRANVLMCLGYYHVIPWDSE 388

Query: 1146 TVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSD 1325
            TVRHLKQV  ++  + K++  D+EE+Y KY P+QFLA+W EIWD YE +FG+++ M  S+
Sbjct: 389  TVRHLKQVHSKNTSS-KTIERDLEEIYGKYEPYQFLAFWSEIWDFYETRFGKMNEMHSSE 447

Query: 1326 YGLISGHNMK 1355
            Y  I+  NM+
Sbjct: 448  YKRITASNMR 457


>ref|XP_002519384.1| conserved hypothetical protein [Ricinus communis]
            gi|223541451|gb|EEF43001.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 458

 Score =  275 bits (704), Expect = 2e-71
 Identities = 167/434 (38%), Positives = 235/434 (54%), Gaps = 22/434 (5%)
 Frame = +3

Query: 120  SFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXXXXX---RVFG 290
            +FDLE+ VCS+G FM+SPN W    +T  RPLRL D+                   RV+G
Sbjct: 21   TFDLEKTVCSHGLFMLSPNHWDPLSRTFSRPLRLNDDTDNSLMVSISQHLSKSLLVRVYG 80

Query: 291  ISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGF-------GRVFRSPT 449
               L+ + ++ +  Q+VRMLRLS+ ++     F ++ S  + +         GRV RSPT
Sbjct: 81   NRSLSPKHQESLLVQIVRMLRLSDMDEFNAREFRKIVSAFEGEECPLIGDFGGRVLRSPT 140

Query: 450  LFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTK 629
            LFED+VK  LLCNC+W RTLSMA +LC  Q EL  +                F P TP K
Sbjct: 141  LFEDMVKCILLCNCQWSRTLSMADALCKFQIELHSQS------PQQKHAFNHFIPNTPVK 194

Query: 630  TRLKRRECSEILRPAKLRFDETSQCKVMSG---KISEGCSIVSEISITKCDGEYIVPNLG 800
               KR+     +    +  +    C        KIS   + V + S      + +    G
Sbjct: 195  KEPKRKIRLSKVPTESMDLEAADTCLTTDDSQMKISNSLNCVDDGSF-----DNLKSCQG 249

Query: 801  SSSKLTSNTMDTSELPSNLMA--------GNFPTPKELASLSENYLTKRCGVGYRARRIL 956
            S++  ++    TS++ S+L+         GNFP+P+ELA+L E +L KRCG+GYRA RI+
Sbjct: 250  SNTFYSTGPYATSDIQSHLVTQHCAKKTTGNFPSPRELANLDERFLAKRCGLGYRAGRII 309

Query: 957  NLAKQICNGSIDLDSLEN-PDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVP 1133
             LA+ I  G I L   E   +G        L   L++++G G FT   VLMCMG Y  +P
Sbjct: 310  KLAQGIVEGRIPLREFEQVSNGGSLSTYSKLTDQLREIEGFGPFTRANVLMCMGFYHVIP 369

Query: 1134 TDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHM 1313
            TD+ETVRH KQV  ++  TIK+V  + EE+Y K+ PFQFL YW E+W  YE++FG+LS M
Sbjct: 370  TDSETVRHFKQVHAKNS-TIKTVQSEAEEIYRKFAPFQFLVYWAELWHFYEQRFGKLSEM 428

Query: 1314 PPSDYGLISGHNMK 1355
            P S+Y LI+  N++
Sbjct: 429  PCSNYKLITASNLR 442


>gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group]
          Length = 463

 Score =  271 bits (692), Expect = 6e-70
 Identities = 168/439 (38%), Positives = 229/439 (52%), Gaps = 27/439 (6%)
 Frame = +3

Query: 123  FDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXXXXXR------V 284
            FDLE AVCS+G FMM+PNRW    + L RPLRL  +                       V
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 285  FGI--SQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLFE 458
             G     L+  D+  I  QV RMLRL E +  A   F  +H+ A+  GFGR+FRSPTLFE
Sbjct: 97   LGAPGDALSPPDQTSILEQVRRMLRLDEEDGRAAAEFQAMHAVAREAGFGRIFRSPTLFE 156

Query: 459  DIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRL 638
            D+VK  LLCNC+W RTLSM+ +LC+LQ EL+                  F  +TP     
Sbjct: 157  DMVKCILLCNCQWTRTLSMSTALCELQLELRSS-----------SSTENFQSRTPPIREC 205

Query: 639  KRRECSEILRPAKL--RFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSSK 812
            KR+  ++     KL  +F+E     +    ++         ++   +  + +P+  S + 
Sbjct: 206  KRKRSNKRNVRVKLETKFNEDKLVCLEDPNLA-----TDTANLQTYENSFNLPSAASGTG 260

Query: 813  LTSN-TMDTSELPSNL------MAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 971
             TS  ++D SEL            G+FPTP+ELA+L E++L KRC +GYRARRI+ LA+ 
Sbjct: 261  NTSEVSLDHSELKLRNEPCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARS 320

Query: 972  ICNGSIDLDSLENPD----------GSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIY 1121
            I  G I L  LE              +     + L   L  + G G FT   VLMCMG +
Sbjct: 321  IVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFF 380

Query: 1122 QCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGR 1301
              +P DTET+RHLKQ   R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG+
Sbjct: 381  HMIPADTETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGK 439

Query: 1302 LSHMPPSDYGLISGHNMKE 1358
            +S M P +Y L +   +K+
Sbjct: 440  ISDMEPINYRLFTASKLKK 458


>gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group]
          Length = 442

 Score =  266 bits (680), Expect = 1e-68
 Identities = 169/433 (39%), Positives = 228/433 (52%), Gaps = 21/433 (4%)
 Frame = +3

Query: 123  FDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLC-DEXXXXXXXXXXXXXXXXRVFGISQ 299
            FDLE AVCS+G FMM+PNRW    + L RPLRL  D                     +S 
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 300  LTHQDED--------HIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLF 455
            L   D+D         I  QV RMLRL E +  A+  F  +H+ A+  GFGR+FRSPTLF
Sbjct: 97   LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 456  EDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTR 635
            ED++K  LLCNC+W RTLSM+ +LC+LQ EL+                  F  +TP    
Sbjct: 157  EDMIKCILLCNCQWTRTLSMSTALCELQLELRSS-----------SSTENFQSRTPPIRE 205

Query: 636  LKRRECSEILRPAKL--RFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSS 809
             KR+  ++     KL  +F+E     +    ++           T    E +  +L SS+
Sbjct: 206  CKRKRSNKRNVRVKLETKFNEDKMVCLEDPNLA-----------TNTANENLF-SLPSSA 253

Query: 810  KLTSNTMDTS----------ELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILN 959
              T NT + S          EL      G+FPTP+ELA+L E++L KRC +GYRARRI+ 
Sbjct: 254  NETGNTSEVSLDHSELKLRYELCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVM 313

Query: 960  LAKQICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTD 1139
            LA+ I  G I L  LE      ++ +E+L      + G+  F    VLMCMG +  +P D
Sbjct: 314  LARSIVEGKICLQKLEE---IRKILIEELST----ISGIWPFHSCNVLMCMGFFHMIPAD 366

Query: 1140 TETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPP 1319
            TET+RHLKQ   R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG +S M P
Sbjct: 367  TETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEP 425

Query: 1320 SDYGLISGHNMKE 1358
             +Y L +   +K+
Sbjct: 426  INYRLFTASKLKK 438


>gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 426

 Score =  254 bits (648), Expect = 8e-65
 Identities = 170/452 (37%), Positives = 227/452 (50%), Gaps = 18/452 (3%)
 Frame = +3

Query: 54   SLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEX 233
            S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D  
Sbjct: 28   SCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLDHH 86

Query: 234  XXXXXXXXXXXXXXX-----RVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRV 398
                                RV+G   L+ Q    +  QV RMLRLSE E+  +  F ++
Sbjct: 87   SPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFRKI 146

Query: 399  ----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSEL 548
                H + ++         GRVFRSPTLFED+VK  LLCNC+                  
Sbjct: 147  VEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQ------------------ 188

Query: 549  KGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKIS 728
                               F PKTP    LKR          KLR               
Sbjct: 189  --------------AAEDDFIPKTPAGNELKR----------KLR--------------- 209

Query: 729  EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSEN 908
                 VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L E+
Sbjct: 210  -----VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLDES 263

Query: 909  YLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDGVG 1079
            +L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L   L+++DG G
Sbjct: 264  FLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDGFG 321

Query: 1080 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1259
             FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFLAY
Sbjct: 322  PFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFLAY 380

Query: 1260 WWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1355
            W E+W  YE++FG+LS MP   Y LI+  NMK
Sbjct: 381  WAELWHYYEQRFGKLSEMPFCGYKLITASNMK 412


>dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Group]
            gi|50510134|dbj|BAD31099.1| hypothetical protein [Oryza
            sativa Japonica Group]
          Length = 501

 Score =  249 bits (637), Expect = 1e-63
 Identities = 169/481 (35%), Positives = 229/481 (47%), Gaps = 69/481 (14%)
 Frame = +3

Query: 123  FDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLC-DEXXXXXXXXXXXXXXXXRVFGISQ 299
            FDLE AVCS+G FMM+PNRW    + L RPLRL  D                     +S 
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 300  LTHQDED--------HIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLF 455
            L   D+D         I  QV RMLRL E +  A+  F  +H+ A+  GFGR+FRSPTLF
Sbjct: 97   LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 456  EDIVKAFLLCNC------------------------------------------RWQRTL 509
            ED++K  LLCNC                                          RW RTL
Sbjct: 157  EDMIKCILLCNCQFSLPLPLPSLASTSMRNSDTNMSRYLGIAIFHLHSTVLFNCRWTRTL 216

Query: 510  SMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKL--R 683
            SM+ +LC+LQ EL+                  F  +TP     KR+  ++     KL  +
Sbjct: 217  SMSTALCELQLELRSSS-----------STENFQSRTPPIRECKRKRSNKRNVRVKLETK 265

Query: 684  FDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNL-- 857
            F+E     +    ++   +  +  S+     E      G++S+++   +D SEL      
Sbjct: 266  FNEDKMVCLEDPNLATNTANENLFSLPSSANE-----TGNTSEVS---LDHSELKLRYEL 317

Query: 858  ----MAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPD--- 1016
                  G+FPTP+ELA+L E++L KRC +GYRARRI+ LA+ I  G I L  LE      
Sbjct: 318  CLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEIRKMS 377

Query: 1017 -------GSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVG 1175
                    +     + L   L  + G G FT   VLMCMG +  +P DTET+RHLKQ   
Sbjct: 378  VPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQFHK 437

Query: 1176 RSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMK 1355
            R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG +S M P +Y L +   +K
Sbjct: 438  RAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEPINYRLFTASKLK 496

Query: 1356 E 1358
            +
Sbjct: 497  K 497


>gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 421

 Score =  247 bits (630), Expect = 9e-63
 Identities = 165/420 (39%), Positives = 222/420 (52%), Gaps = 18/420 (4%)
 Frame = +3

Query: 54   SLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEX 233
            S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D  
Sbjct: 43   SCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLDHH 101

Query: 234  XXXXXXXXXXXXXXX-----RVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRV 398
                                RV+G   L+ Q    +  QV RMLRLSE E+  +  F ++
Sbjct: 102  SPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFRKI 161

Query: 399  ----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSEL 548
                H + ++         GRVFRSPTLFED+VK  LLCNC++ RTLSMA +LC+LQ E 
Sbjct: 162  VEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQFET 221

Query: 549  KGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKIS 728
            + +P               F PKTP    LKR          KLR               
Sbjct: 222  Q-RPFS-----GVRAAEDDFIPKTPAGNELKR----------KLR--------------- 250

Query: 729  EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSEN 908
                 VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L E+
Sbjct: 251  -----VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLDES 304

Query: 909  YLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKAYLQKLDGVG 1079
            +L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L   L+++DG G
Sbjct: 305  FLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDGFG 362

Query: 1080 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1259
             FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFLAY
Sbjct: 363  PFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFLAY 421


>gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao]
          Length = 374

 Score =  243 bits (621), Expect = 1e-61
 Identities = 153/424 (36%), Positives = 214/424 (50%), Gaps = 5/424 (1%)
 Frame = +3

Query: 87   GECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXX 266
            GEC+      SSF++E+AVC++G FMMSPN W+   K+L RPLRL D             
Sbjct: 12   GECS------SSFNMEKAVCNHGLFMMSPNVWIPSTKSLRRPLRLADSSGSVYVTISHPA 65

Query: 267  XXXX----RVFGI-SQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGR 431
                    +V G+ + ++  D+  I  QV RMLR+S  ++  +  F  +H  AK +GFGR
Sbjct: 66   PNHPFLVIQVNGLQNSISSADKAVIMEQVARMLRISSKDERDVREFQTLHGSAKDRGFGR 125

Query: 432  VFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFY 611
            +FRSP+ FED VK+ LLCNC                        GW              
Sbjct: 126  IFRSPSFFEDAVKSILLCNC------------------------GWK------------- 148

Query: 612  PKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVP 791
             +T T   + R  C+  L+ A             + KIS           TK        
Sbjct: 149  -RTLT---MARALCALQLQLASAHLQHKRVASNSNVKIS-----------TKRLKHKKYT 193

Query: 792  NLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 971
               S+S+L+ +  D S        GNFPT  ELA L E YL +RC +GYRAR IL LA++
Sbjct: 194  KASSTSELSMSGFDQS-------IGNFPTSTELACLDEKYLNERCNLGYRARCILQLARK 246

Query: 972  ICNGSIDLDSLENPDGSVQMKLEDLKAYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTETV 1151
            + NG ++L+ LE  + S     E     L K+ G G F C  ++MC+G Y+ +P D+ET+
Sbjct: 247  VENGELELNKLE--ESSDTTSYERFYQKLMKIKGFGPFVCSNIMMCIGFYERIPFDSETI 304

Query: 1152 RHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSHMPPSDYG 1331
            RHLK V G+  C+ K++  D+EE+Y KY PFQ +AYW E+ D YE +FG+LS +  S Y 
Sbjct: 305  RHLKMVHGKGKCSRKTIEKDIEEIYGKYAPFQCMAYWLELLDEYENKFGKLSELESSSYH 364

Query: 1332 LISG 1343
            L +G
Sbjct: 365  LATG 368


>ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629917 isoform X2 [Citrus
            sinensis]
          Length = 409

 Score =  239 bits (609), Expect = 3e-60
 Identities = 159/420 (37%), Positives = 218/420 (51%), Gaps = 30/420 (7%)
 Frame = +3

Query: 90   ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXX 269
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESLLKLPLAETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 270  XXXRVF----------------GISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVH 401
               +                      L+ + +D + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIV 124

Query: 402  SQ-AKSQG---------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 551
             Q A+ +G          GRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ    
Sbjct: 125  RQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQ---- 180

Query: 552  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE-CSEILRPAKLRFDETSQCKVMSGKIS 728
                 W            F P+TP     KRR+  S++      R  E+         + 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235

Query: 729  EGCSIVSEISITKCDGEYIVPNLGSSSKLTS-NTMDTSELPS-NLMAGNFPTPKELASLS 902
              C+ V E ++     +   P     S L   N + T++ PS     GNFP+P+ELA+L 
Sbjct: 236  LDCAGVLEENV-----QPSFPQNDIESDLHGLNELSTTDPPSARDRIGNFPSPRELANLD 290

Query: 903  ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQM-KLEDLKAYLQKLDGVG 1079
            E++L KRC +GYRA RIL LA+ I +G I L  LE+      +     L   L +++G G
Sbjct: 291  ESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAEQLSQINGFG 350

Query: 1080 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1259
             FT + VL+C+G Y  +PTD+ET+RHLKQV  R +CT K+V M  E +Y KY PFQFLAY
Sbjct: 351  PFTRNNVLVCIGFYHVIPTDSETIRHLKQVHAR-NCTSKTVQMIAESIYGKYAPFQFLAY 409


>gb|EMT03969.1| hypothetical protein F775_22747 [Aegilops tauschii]
          Length = 333

 Score =  224 bits (571), Expect = 7e-56
 Identities = 133/340 (39%), Positives = 187/340 (55%), Gaps = 19/340 (5%)
 Frame = +3

Query: 396  VHSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXX 575
            +H+ A+  GFGR+FRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ ELK        
Sbjct: 1    MHAAAREAGFGRIFRSPTLFEDMVKCILLCNCQWTRTLSMATALCELQLELK-------- 52

Query: 576  XXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETS-QC-------KVMSGKISE 731
                         +TP     KR+         KL    T  +C            +++ 
Sbjct: 53   ---CSAGTEDLQLRTPPIREHKRKRSKNQNVRVKLEKKFTELECLEDPRVETAQDTRVAT 109

Query: 732  GCSIVSEISITKCDGEYI-VPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSEN 908
            G S V  I+  + D +   +P +   +     + D+SEL      G+FPTP+ELA+L E+
Sbjct: 110  GTSDV--ITHLEADEKLASLPQVAPETGSVCQSFDSSELSLEGCIGDFPTPEELANLDED 167

Query: 909  YLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPD----------GSVQMKLEDLKAYL 1058
            +L KRCG+GYRA RI+ LA+ I  G +   +LE              ++    E L   L
Sbjct: 168  FLAKRCGLGYRAERIVLLARSIVEGKVCPQNLEEMQKMSLPATEELSTIPSTYERLNNEL 227

Query: 1059 QKLDGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYV 1238
              + G G FT   VLMCMG +  +P DTET+RHLKQ    +  TIKSV M+++++Y +Y 
Sbjct: 228  TTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQCHEIAS-TIKSVHMELDKIYGEYA 286

Query: 1239 PFQFLAYWWEIWDSYEKQFGRLSHMPPSDYGLISGHNMKE 1358
            PFQFLAYW+E+W  Y+KQFG+++ M PS Y L +   +K+
Sbjct: 287  PFQFLAYWFELWGFYDKQFGKITEMDPSTYRLFTASALKK 326

