BLASTX nr result

ID: Ephedra26_contig00009632 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Ephedra26_contig00009632
         (2099 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [A...   342   4e-91
ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593...   317   1e-83
ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247...   317   1e-83
ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781...   288   8e-75
gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao]    286   3e-74
ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766...   285   4e-74
ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Popu...   285   5e-74
gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus...   284   1e-73
gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]     282   4e-73
ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629...   281   6e-73
ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citr...   281   1e-72
ref|XP_002519384.1| conserved hypothetical protein [Ricinus comm...   280   2e-72
gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi...   272   4e-70
gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo...   266   2e-68
gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao]    256   2e-65
dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Grou...   251   9e-64
gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao]    249   4e-63
gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao]   244   8e-62
ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629...   240   2e-60
gb|EMT03969.1| hypothetical protein F775_22747 [Aegilops tauschii]    227   2e-56

>ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda]
            gi|548856677|gb|ERN14505.1| hypothetical protein
            AMTR_s00038p00020700 [Amborella trichopoda]
          Length = 458

 Score =  342 bits (877), Expect = 4e-91
 Identities = 196/454 (43%), Positives = 254/454 (55%), Gaps = 30/454 (6%)
 Frame = +1

Query: 238  GGECT-LRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXX 414
            G E T L + V  SF+LE+AVCS+GFFMM+PN W S  +TL RPLRL D           
Sbjct: 4    GAERTVLTLPVNESFELEKAVCSHGFFMMAPNLWFSSSQTLQRPLRLTDRSSVPVRITQL 63

Query: 415  XXXXXXR----VFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFG 582
                       V G S+L   D+ ++ AQV RMLR+SE +D  ++ FH ++  AK  GFG
Sbjct: 64   SLSSQKSLQILVLGASKLYQHDQQYLLAQVARMLRISEEDDLKVNKFHEMYPVAKETGFG 123

Query: 583  RVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXF 762
            RVFRSPTLFED+VK+ LLCNC+W RTLSMA +LC+LQ EL G  L               
Sbjct: 124  RVFRSPTLFEDMVKSILLCNCQWTRTLSMARALCELQLELNGNSLRQSNKDTDFSKSVNL 183

Query: 763  YPKTPTKTRLKRRE-----------------------CSEILRPAKLR--FDETSQCKVM 867
             P TP +   K+R                          E LRP  L   F + S     
Sbjct: 184  SPVTPMQLEHKKRRKNPNQNIIMNLMTKFSENETHLAADESLRPIDLAKDFSKNSPTMFS 243

Query: 868  SGKISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELA 1047
            S +   G     ++S  K         LG  + L +  ++   L   L AGNFP P+ELA
Sbjct: 244  SEEGRNGKLNYDQVSEEK---------LGDGAILDNQLLENKTLSFFLEAGNFPCPEELA 294

Query: 1048 SLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLEDLKDYLQKLDG 1227
            +L E  L KRC VG+R++RI+ LA+ I  G++DL  +E       + L+ L   L  + G
Sbjct: 295  NLDEKILEKRCKVGFRSKRIVKLAQSIVEGALDLGKIEVLSQQDPIHLDGLMRQLLSIYG 354

Query: 1228 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1407
            VG + C+ VLM MGIYQ +P DTET+RHLKQ   R  CTI ++  D+EE+Y K+ PFQFL
Sbjct: 355  VGPYVCNNVLMSMGIYQRIPADTETLRHLKQFHARKQCTIGTIQKDIEEIYGKHEPFQFL 414

Query: 1408 AYWWEIWDSYEKQFGRLSQMPPSDYGLISGHNMK 1509
             YW E+W+ YEK+FG+LSQMPPSDY LI+ HNMK
Sbjct: 415  VYWSEMWEFYEKRFGKLSQMPPSDYELITAHNMK 448


>ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593287 isoform X1 [Solanum
            tuberosum] gi|565385158|ref|XP_006358485.1| PREDICTED:
            uncharacterized protein LOC102593287 isoform X2 [Solanum
            tuberosum]
          Length = 485

 Score =  317 bits (813), Expect = 1e-83
 Identities = 183/441 (41%), Positives = 248/441 (56%), Gaps = 25/441 (5%)
 Frame = +1

Query: 271  SSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXXXXX------ 432
            ++FDLE+AVCS+G FMM+PNRW S  KTL RPL L +                       
Sbjct: 29   ATFDLEKAVCSHGLFMMAPNRWDSLSKTLERPLHLSENINDDDHEQSVLVQINQPSDSPH 88

Query: 433  ----RVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSP 600
                RVFG + L+   +  +  QV RM+RLS  E+  +  F  +  +AK +G GRVFRSP
Sbjct: 89   SLLLRVFGTASLSTIHQRSLLGQVRRMVRLSVEENKRVKQFQEICGEAKDRGLGRVFRSP 148

Query: 601  TLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK--------GKPLGWXXXXXXXXXXX 756
            TLFED+VK  LLCNC+W RTLSMA +LC+LQ EL           P              
Sbjct: 149  TLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAASFPDPDNQNQLKGVTFKSE 208

Query: 757  XFYPKTPTKTRLKRRE----CSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEI--SIT 918
             F P+TP     ++R     CS  L       +E          ++   S+  E+     
Sbjct: 209  HFTPRTPAGKESRKRAGAYGCSRKLLERLTEVEEIIDIGKPGVTVTPAFSVGEEVLKKSN 268

Query: 919  KCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRA 1098
             C     V ++G+S+    +  +  +L S    GNFP+PKELASL E++L KRCG+GYRA
Sbjct: 269  LCRDTTEVCDVGTSAPFNLDPSEDRKLSSFNQLGNFPSPKELASLDESFLAKRCGLGYRA 328

Query: 1099 RRILNLAKQICNGSIDLDSLENPDGSVQMK-LEDLKDYLQKLDGVGKFTCDVVLMCMGIY 1275
             RI+ LAK I  GSI L  LE    +  +   + + + L+++DG G FTC  VLMC+G Y
Sbjct: 329  GRIIKLAKGIVEGSIQLKELEEACSNPSLSDYDKMAEQLREIDGFGPFTCANVLMCLGYY 388

Query: 1276 QCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGR 1455
              +PTD+ET+RHLKQV  R+  TI++V  DVE +Y KY PFQFLAYW E+W  YE++FG+
Sbjct: 389  HVIPTDSETIRHLKQVHARTS-TIQNVQRDVENIYGKYAPFQFLAYWSEVWHFYEERFGK 447

Query: 1456 LSQMPPSDYGLISGHNMKEER 1518
            LS+MP S+Y LI+  NM+ +R
Sbjct: 448  LSEMPHSEYKLITAANMRRKR 468


>ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247758 [Solanum
            lycopersicum]
          Length = 483

 Score =  317 bits (813), Expect = 1e-83
 Identities = 188/459 (40%), Positives = 253/459 (55%), Gaps = 25/459 (5%)
 Frame = +1

Query: 217  ERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXX 396
            E  +E+  G C       +SFDLE+AVCS+G FMM+PNRW +  KTL RPLRL +     
Sbjct: 16   ELPLEDGNGYC-------ASFDLEKAVCSHGLFMMAPNRWDTLSKTLERPLRLSENINDD 68

Query: 397  XXXXXXXXXXXX----------RVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFH 546
                                  RV     L+   +  +  QV RM+RLS  E+  +  F 
Sbjct: 69   DHEQSVLVQITQPSDYPHSLLLRVLDTDSLSTIHQRSLLGQVRRMVRLSVEENKRVKLFQ 128

Query: 547  RVHSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK------- 705
             +  +AK +GFGRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ EL        
Sbjct: 129  EICGEAKERGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPSSAAS 188

Query: 706  -GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE----CSEILRPAKLRFDETSQCKVMS 870
               P               F P+TP    L++R     CS  L       +E        
Sbjct: 189  FPDPDNQNQLKGVTSKSEHFTPRTPAGKELRKRAGAYGCSRNLLERLNEVEEIVDIDKPG 248

Query: 871  GKISEGCSIVSEI--SITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKEL 1044
              ++   S+  E+      C     V  +  S+ L  +  +  +L S    GNFP+PK+L
Sbjct: 249  VTVTPAFSVGEEVLQKSNLCQDTTEVWEVSVSAPLNPDPSEDRKLSSFNQLGNFPSPKQL 308

Query: 1045 ASLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED-LKDYLQKL 1221
            ASL E++L KRCG+GYRA RI+ LAK I  GSI L+ LE    +  +   D + + L+++
Sbjct: 309  ASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLNELEEACSNPSLSNYDKMAEQLREI 368

Query: 1222 DGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQ 1401
            DG G FTC  VLMC+G Y  +PTD+ET+RHLKQV  R+  TI++V  DVE +Y KY PFQ
Sbjct: 369  DGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTS-TIQNVQRDVENIYGKYAPFQ 427

Query: 1402 FLAYWWEIWDSYEKQFGRLSQMPPSDYGLISGHNMKEER 1518
            FLAYW E+W  YE++FG+LS+MP S+Y LI+  NM+ +R
Sbjct: 428  FLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMRPKR 466


>ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781827 [Glycine max]
          Length = 443

 Score =  288 bits (736), Expect = 8e-75
 Identities = 179/429 (41%), Positives = 235/429 (54%), Gaps = 12/429 (2%)
 Frame = +1

Query: 259  VSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXXXXXRV 438
            + + S F LE+AVCS+G FMM PN W    KTL RPLR                    RV
Sbjct: 18   MELPSPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLR-SSPSSFLVSLSQHSQSLAVRV 76

Query: 439  FGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVH--SQAKSQGFGRVFRSPTLFE 612
                 L+ Q ++HI AQV RMLR SE E+ A+  F  +H          GRVFRSPTLFE
Sbjct: 77   HATHALSPQQQNHITAQVSRMLRFSEAEEKAVREFRSLHVVDHPNRSFSGRVFRSPTLFE 136

Query: 613  DIVKAFLLCNCRWQRTLSMAASLCDLQSELK-GKPLGWXXXXXXXXXXXXFYPKTPTKTR 789
            D+VK  LLCNC+W RTLSMA +LC+LQ EL+ G P               F PKTP    
Sbjct: 137  DMVKCILLCNCQWPRTLSMAQALCELQLELQNGSPCTIAVSGNSKGESEGFIPKTPASKE 196

Query: 790  LKRRECSE--ILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSS 963
             +R + S   +    KL  D   Q   +    S   ++             +  + G S 
Sbjct: 197  TRRNKVSTKGMFCKKKLELDGNLQIDHVVASSSTATTL-------------LTTDNGDSE 243

Query: 964  KLTSNTMDTSELPSN-----LMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQI 1128
            +L S+  D+    SN        GNFP+P ELA+L E++L KRCG+GYRA  I+ LA+ I
Sbjct: 244  ELRSH--DSCHEFSNGNEYFSRTGNFPSPSELANLDESFLAKRCGLGYRAGYIIELARAI 301

Query: 1129 CNGSIDLDSLE--NPDGSVQMKLEDLKDYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTET 1302
              G I L  LE  + D S+    + L D L+++ G G FT   VLMC+G Y  +PTD+ET
Sbjct: 302  VEGKIQLGQLEELSKDASLS-NYKQLDDQLKQIRGYGPFTRANVLMCLGYYHVIPTDSET 360

Query: 1303 VRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSQMPPSDY 1482
            VRHLKQV  R   T K++  ++EE+Y KY P+QFLA+W E+WD YE +FG+L++M  SDY
Sbjct: 361  VRHLKQVHSRY-TTSKTIERELEEIYGKYEPYQFLAFWSEVWDFYETRFGKLNEMHSSDY 419

Query: 1483 GLISGHNMK 1509
             LI+  NM+
Sbjct: 420  KLITACNMR 428


>gb|EOY25838.1| Uncharacterized protein isoform 1 [Theobroma cacao]
          Length = 467

 Score =  286 bits (731), Expect = 3e-74
 Identities = 182/454 (40%), Positives = 247/454 (54%), Gaps = 18/454 (3%)
 Frame = +1

Query: 202  TASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCD 381
            ++S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D
Sbjct: 41   SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 99

Query: 382  EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFH 546
                                  RV+G   L+ Q    +  QV RMLRLSE E+  +  F 
Sbjct: 100  HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 159

Query: 547  RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 696
            ++    H + ++         GRVFRSPTLFED+VK  LLCNC++ RTLSMA +LC+LQ 
Sbjct: 160  KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQF 219

Query: 697  ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 876
            E +    G             F PKTP    LKR          KLR             
Sbjct: 220  ETQRPFSG------VRAAEDDFIPKTPAGNELKR----------KLR------------- 250

Query: 877  ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 1056
                   VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L 
Sbjct: 251  -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 302

Query: 1057 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKDYLQKLDG 1227
            E++L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L + L+++DG
Sbjct: 303  ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 360

Query: 1228 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1407
             G FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFL
Sbjct: 361  FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFL 419

Query: 1408 AYWWEIWDSYEKQFGRLSQMPPSDYGLISGHNMK 1509
            AYW E+W  YE++FG+LS+MP   Y LI+  NMK
Sbjct: 420  AYWAELWHYYEQRFGKLSEMPFCGYKLITASNMK 453


>ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766322 [Setaria italica]
          Length = 461

 Score =  285 bits (730), Expect = 4e-74
 Identities = 171/446 (38%), Positives = 237/446 (53%), Gaps = 32/446 (7%)
 Frame = +1

Query: 277  FDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXXXXX------RV 438
            FDL  AVCS+G FMM+PNRW    + L RPLRL  +                       V
Sbjct: 36   FDLAAAVCSHGLFMMAPNRWDPAARALVRPLRLASDRSASLLARVSAHPARPGTALLVAV 95

Query: 439  FGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLFEDI 618
             G   L+  D D+I  QV RMLRLSE +  A+  F  +H+ A+ +GFGR+FRSPTLFED+
Sbjct: 96   EGADALSSLDRDYILEQVRRMLRLSEEDGAAVAEFQAMHAAAREEGFGRIFRSPTLFEDM 155

Query: 619  VKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTP----TKT 786
            VK  LLCNC+W RTLSMA +LC++Q ELK                  F  +TP     K 
Sbjct: 156  VKCILLCNCQWTRTLSMATALCEIQLELK-----------CSSSVEDFQSRTPPIRERKR 204

Query: 787  RLKRRECSEILRPAKLRFDETSQCKVMSGK------------ISEGCSIVSEISITKCDG 930
            +  +R+   I    +   D+     + SG             +S   S+ SE   + CD 
Sbjct: 205  KRSKRQSVRIKLETRFAEDKLEGPTIASGTSNDLTHPETNEYLSSLASVASETG-SACDS 263

Query: 931  EYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRIL 1110
               +P+L +S    +N     +       G+FPTP+ELA+L E +L KRC +GYRA+RI+
Sbjct: 264  ---LPSLDNSELSLNNAPGLED-----CIGDFPTPEELANLDEGFLAKRCNLGYRAKRIV 315

Query: 1111 NLAKQICNGSIDLDSLE----------NPDGSVQMKLEDLKDYLQKLDGVGKFTCDVVLM 1260
             LA+ +  G + L  LE              +++   E L   L  + G G FT   VLM
Sbjct: 316  MLARGVVEGKVCLQKLEEMCRISVPAAEEVSTIESACERLNKELSAISGFGPFTRANVLM 375

Query: 1261 CMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYE 1440
            CMG    +P DTET+RHLKQV  R+  TI SV  +++++Y KY PFQFLAYW+E+W  Y 
Sbjct: 376  CMGFNHTIPADTETIRHLKQVHKRAS-TISSVHQELDKIYGKYAPFQFLAYWFELWGFYN 434

Query: 1441 KQFGRLSQMPPSDYGLISGHNMKEER 1518
            KQFG++ +M PS+Y L +  ++K+ +
Sbjct: 435  KQFGKICEMEPSNYRLFTASHLKKAK 460


>ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa]
            gi|550342350|gb|EEE79091.2| hypothetical protein
            POPTR_0003s03710g [Populus trichocarpa]
          Length = 489

 Score =  285 bits (729), Expect = 5e-74
 Identities = 183/456 (40%), Positives = 241/456 (52%), Gaps = 42/456 (9%)
 Frame = +1

Query: 274  SFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLC-------------DEXXXXXXXXXX 414
            +F+LE+AVCS+G FMMSPN W     T  RPLRL                          
Sbjct: 29   TFNLEKAVCSHGLFMMSPNHWDPLSLTFSRPLRLSLSDSDPQVSTPTTSLFVSISHPPHL 88

Query: 415  XXXXXXRVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQ------- 573
                  RV+G   L+ + ++ + AQVVRMLRLSE ++     F ++   A ++       
Sbjct: 89   PRSLSVRVYGTRCLSPKHQESLVAQVVRMLRLSETDERNAREFRKIAEAAAAEENNSWLT 148

Query: 574  GFG-RVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXX 750
            GFG RVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ EL+ K  G          
Sbjct: 149  GFGGRVFRSPTLFEDMVKCILLCNCQWPRTLSMARALCELQCELQCKSSGVFVAQAVNAT 208

Query: 751  XXX--------FYPKTPTKTRLKRR-ECSEILRPAKLRFDET-------SQCKVMSGKIS 882
                       F P T      KR    S++ +    +  ET       +  K  S  I 
Sbjct: 209  VKNKCNDTAHNFIPNTSAGKESKRNIRASKVTKNLASKIVETETLLEADANLKTDSAHIG 268

Query: 883  -EGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPS--NLMAGNFPTPKELASL 1053
             E    V   S  +C   +     GS S    +      +    N M  NFP+P+ELA+L
Sbjct: 269  RETLESVENDSCARCSSRH-----GSDSWAPDSLQSQHGIQPGVNKMICNFPSPRELANL 323

Query: 1054 SENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENP--DGSVQMKLEDLKDYLQKLDG 1227
             E++L KRC +GYRA RI+ LA+ I  G I L  +E    +G+       L D  +++DG
Sbjct: 324  DESFLAKRCNLGYRAIRIIKLAQSIVEGRIPLREVEEDCANGASSSCYNKLADQFRQIDG 383

Query: 1228 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1407
             G FTC  VLMCMG Y  +PTD+ETVRHLKQV  +   TI++V  DVEE+Y KY PFQFL
Sbjct: 384  FGPFTCANVLMCMGFYHIIPTDSETVRHLKQVHAKKS-TIQTVQRDVEEIYGKYAPFQFL 442

Query: 1408 AYWWEIWDSYEKQFGRLSQMPPSDYGLISGHNMKEE 1515
            AYW E+W  YEK+FG+LS++P SDY LI+  NM+ +
Sbjct: 443  AYWAELWHFYEKRFGKLSEIPTSDYKLITASNMRSK 478


>gb|ESW19537.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris]
          Length = 474

 Score =  284 bits (726), Expect = 1e-73
 Identities = 179/444 (40%), Positives = 242/444 (54%), Gaps = 18/444 (4%)
 Frame = +1

Query: 277  FDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRL-----CDEXXXXXXXXXXXXXXXXRVF 441
            F L++AVCS+GFFMM+PN W    KTL RPL L                        RV 
Sbjct: 46   FQLDQAVCSHGFFMMAPNHWDPLSKTLTRPLLLHNPSSSSSSSLLVSLSQRPQSLAVRVH 105

Query: 442  GISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHS-QAKSQGFG-RVFRSPTLFED 615
             +  ++ Q + HIKAQ+ RMLRLSE E+ A+  F  VH+    ++ FG RVFRSPTLFED
Sbjct: 106  SVHFISPQQQRHIKAQITRMLRLSEAEEKAVREFRSVHAADHPNRSFGGRVFRSPTLFED 165

Query: 616  IVKAFLLCNCRWQRTLSMAASLCDLQSELK-GKPLGWXXXXXXXXXXXXFYPKTPT--KT 786
            +VK  LLCNC+W RTLSMA +LC+LQS L+ G P               F PKTP   + 
Sbjct: 166  MVKCILLCNCQWPRTLSMAQALCELQSGLQNGLPCAVEGSGNPKVEAEEFVPKTPASKEN 225

Query: 787  RLKRRECSEILRPAKLRF------DETSQCKVMSGKISEGCSIVSEISITKCDGEYI-VP 945
            R K+     +L   KL        D   Q   M    S+  +++ ++ + + D      P
Sbjct: 226  RRKKAPTKGVLLKKKLELELEMEVDGNLQMDHMFASSSD-TTLLGDLEVLRSDDSCCQFP 284

Query: 946  NLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 1125
            N G     T               GNFP+P ELA+LSE++L KRC +GYRA  IL LA+ 
Sbjct: 285  NEGEYFDHT---------------GNFPSPIELANLSESFLAKRCKLGYRAGYILELAQG 329

Query: 1126 ICNGSIDLDSLENPDGSVQMKL-EDLKDYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTET 1302
            I  G I L+ LE       +   + L D L+ + G G FT   VLMC+G Y  +P D+ET
Sbjct: 330  IVEGKIQLEQLEELSKDASLSCYKQLGDQLKPIKGFGPFTRANVLMCLGYYHVIPWDSET 389

Query: 1303 VRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSQMPPSDY 1482
            VRHLKQV  ++  + K++  D+EE+Y KY P+QFLA+W EIWD YE +FG++++M  S+Y
Sbjct: 390  VRHLKQVHSKNTSS-KTIERDLEEIYGKYEPYQFLAFWSEIWDFYETRFGKMNEMHSSEY 448

Query: 1483 GLISGHNMKEERAVTSIDPDKSQE 1554
              I+  NM+  R  T+     SQ+
Sbjct: 449  KRITASNMRSTRKATNKRKRPSQK 472


>gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]
          Length = 472

 Score =  282 bits (722), Expect = 4e-73
 Identities = 186/447 (41%), Positives = 241/447 (53%), Gaps = 34/447 (7%)
 Frame = +1

Query: 271  SSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLC------------DEXXXXXXXXXX 414
            ++F LE AVCS+G FMM+PN+W    KTL RPLRL             D+          
Sbjct: 14   ATFRLETAVCSHGLFMMAPNQWDPLSKTLLRPLRLTLHHHHWNPQQQQDDSVMARISQPH 73

Query: 415  XXXXXXRVF---GISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGR 585
                  RV    G   LT  ++  + AQV RMLRLS+ E+     F  V+      G GR
Sbjct: 74   DRLHCLRVLVHAGTRSLTSDNKQALLAQVSRMLRLSQTEERICREFSEVYGCGS--GLGR 131

Query: 586  VFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFY 765
            VFRSPTLFED+VK  LLCNC+W RTLSMA +LCDLQ EL+ + +              F 
Sbjct: 132  VFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCDLQRELQLQSV--------PSKTVDFV 183

Query: 766  PKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVP 945
            PKTP     KR+     +   K     TSQ    S +  E  S  +++SI          
Sbjct: 184  PKTPAGKEPKRK-----VEKLKASTCLTSQFDAQSNEGLESHS--NDLSIDISQPTPSAQ 236

Query: 946  NLGSSSKLT----------SNTMDTSEL--PSNLM------AGNFPTPKELASLSENYLT 1071
            NL  SS L+          S  +D++ L  P  L        G+FPTP ELA L E +L 
Sbjct: 237  NLSPSSLLSVPMENVTCEESYGVDSASLCNPQILRDREFEGTGDFPTPTELAKLDEKFLA 296

Query: 1072 KRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQM-KLEDLKDYLQKLDGVGKFTCD 1248
            KRC +GYRA RIL LA+ I  G I L  LE       +     L   L+++DG G FTC 
Sbjct: 297  KRCKLGYRAGRILKLARGIVEGRIQLRELEETCMERSLCSYSKLAVQLRQIDGFGPFTCA 356

Query: 1249 VVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIW 1428
             VLMCMG Y  +P+D+ET+RHL+QV GR+  T++++  DV+++YAKY PFQFLAYW E+W
Sbjct: 357  NVLMCMGFYHVIPSDSETIRHLQQVHGRNS-TVRTIERDVQQIYAKYEPFQFLAYWSELW 415

Query: 1429 DSYEKQFGRLSQMPPSDYGLISGHNMK 1509
              YEK+FG++S+MP S Y L +  NMK
Sbjct: 416  HFYEKKFGKISEMPCSAYKLFTASNMK 442


>ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629917 isoform X1 [Citrus
            sinensis]
          Length = 454

 Score =  281 bits (720), Expect = 6e-73
 Identities = 178/451 (39%), Positives = 242/451 (53%), Gaps = 30/451 (6%)
 Frame = +1

Query: 244  ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXX 423
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESLLKLPLAETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 424  XXXRVF----------------GISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVH 555
               +                      L+ + +D + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIV 124

Query: 556  SQ-AKSQG---------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 705
             Q A+ +G          GRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ    
Sbjct: 125  RQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQ---- 180

Query: 706  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE-CSEILRPAKLRFDETSQCKVMSGKIS 882
                 W            F P+TP     KRR+  S++      R  E+         + 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235

Query: 883  EGCSIVSEISITKCDGEYIVPNLGSSSKLTS-NTMDTSELPSNL-MAGNFPTPKELASLS 1056
              C+ V E ++         P     S L   N + T++ PS     GNFP+P+ELA+L 
Sbjct: 236  LDCAGVLEENVQPS-----FPQNDIESDLHGLNELSTTDPPSARDRIGNFPSPRELANLD 290

Query: 1057 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLE-DLKDYLQKLDGVG 1233
            E++L KRC +GYRA RIL LA+ I +G I L  LE+      +     L + L +++G G
Sbjct: 291  ESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAEQLSQINGFG 350

Query: 1234 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1413
             FT + VL+C+G Y  +PTD+ET+RHLKQV  R+ CT K+V M  E +Y KY PFQFLAY
Sbjct: 351  PFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN-CTSKTVQMIAESIYGKYAPFQFLAY 409

Query: 1414 WWEIWDSYEKQFGRLSQMPPSDYGLISGHNM 1506
            W E+W  YEK+FG+LS+MP SDY LI+  NM
Sbjct: 410  WSELWHFYEKRFGKLSEMPYSDYKLITASNM 440


>ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citrus clementina]
            gi|557533482|gb|ESR44600.1| hypothetical protein
            CICLE_v10001110mg [Citrus clementina]
          Length = 454

 Score =  281 bits (718), Expect = 1e-72
 Identities = 174/453 (38%), Positives = 245/453 (54%), Gaps = 32/453 (7%)
 Frame = +1

Query: 244  ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXX 423
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESVLKLPLAETFNLEAAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 424  XXXRVF----------------GISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRV- 552
               +                      L+ + +D + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVRDFKRIV 124

Query: 553  ---------HSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 705
                      SQ  +   GRVFRSPTLFED+VK  LLCNC+W RTL+MA +LC+LQ    
Sbjct: 125  RQVAQEEGEESQYMTDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLNMARALCELQ---- 180

Query: 706  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISE 885
                 W            F P+TP     KRR+     + +K+    TS  ++   K S 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQ-----KVSKVASKLTS--RIAESKASS 228

Query: 886  GCSIVSEISITKCDGEYIVPNLGSSSKLTS----NTMDTSELPSNL-MAGNFPTPKELAS 1050
               +  ++  T    E + P+   +   +     N + T++ PS     GNFP+P+ELA+
Sbjct: 229  EDDMNLKLDCTGALEENVQPSFPRNDIESDLHGLNELSTTDPPSACDRIGNFPSPRELAN 288

Query: 1051 LSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED-LKDYLQKLDG 1227
            L E++L KRC +GYRA RIL LA+ I +G I L  LE+      +   + L + L +++G
Sbjct: 289  LDESFLAKRCNLGYRAGRILKLAQGIVDGQIQLRELEDTCNEASLTTYNKLAEQLSQING 348

Query: 1228 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1407
             G FT + VL+C+G Y  +PTD+ET+RHLKQV  R+ CT K+V +  E +Y KY PFQFL
Sbjct: 349  FGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARN-CTSKTVQIIAESIYGKYSPFQFL 407

Query: 1408 AYWWEIWDSYEKQFGRLSQMPPSDYGLISGHNM 1506
            AYW E+W  YEK+FG+LS+MP SDY LI+  NM
Sbjct: 408  AYWSELWHFYEKRFGKLSEMPYSDYKLITASNM 440


>ref|XP_002519384.1| conserved hypothetical protein [Ricinus communis]
            gi|223541451|gb|EEF43001.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 458

 Score =  280 bits (715), Expect = 2e-72
 Identities = 168/436 (38%), Positives = 238/436 (54%), Gaps = 22/436 (5%)
 Frame = +1

Query: 274  SFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXXXXX---RVFG 444
            +FDLE+ VCS+G FM+SPN W    +T  RPLRL D+                   RV+G
Sbjct: 21   TFDLEKTVCSHGLFMLSPNHWDPLSRTFSRPLRLNDDTDNSLMVSISQHLSKSLLVRVYG 80

Query: 445  ISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGF-------GRVFRSPT 603
               L+ + ++ +  Q+VRMLRLS+ ++     F ++ S  + +         GRV RSPT
Sbjct: 81   NRSLSPKHQESLLVQIVRMLRLSDMDEFNAREFRKIVSAFEGEECPLIGDFGGRVLRSPT 140

Query: 604  LFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTK 783
            LFED+VK  LLCNC+W RTLSMA +LC  Q EL  +                F P TP K
Sbjct: 141  LFEDMVKCILLCNCQWSRTLSMADALCKFQIELHSQS------PQQKHAFNHFIPNTPVK 194

Query: 784  TRLKRRECSEILRPAKLRFDETSQCKVMSG---KISEGCSIVSEISITKCDGEYIVPNLG 954
               KR+     +    +  +    C        KIS   + V + S      + +    G
Sbjct: 195  KEPKRKIRLSKVPTESMDLEAADTCLTTDDSQMKISNSLNCVDDGSF-----DNLKSCQG 249

Query: 955  SSSKLTSNTMDTSELPSNLMA--------GNFPTPKELASLSENYLTKRCGVGYRARRIL 1110
            S++  ++    TS++ S+L+         GNFP+P+ELA+L E +L KRCG+GYRA RI+
Sbjct: 250  SNTFYSTGPYATSDIQSHLVTQHCAKKTTGNFPSPRELANLDERFLAKRCGLGYRAGRII 309

Query: 1111 NLAKQICNGSIDLDSLEN-PDGSVQMKLEDLKDYLQKLDGVGKFTCDVVLMCMGIYQCVP 1287
             LA+ I  G I L   E   +G        L D L++++G G FT   VLMCMG Y  +P
Sbjct: 310  KLAQGIVEGRIPLREFEQVSNGGSLSTYSKLTDQLREIEGFGPFTRANVLMCMGFYHVIP 369

Query: 1288 TDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSQM 1467
            TD+ETVRH KQV  ++  TIK+V  + EE+Y K+ PFQFL YW E+W  YE++FG+LS+M
Sbjct: 370  TDSETVRHFKQVHAKNS-TIKTVQSEAEEIYRKFAPFQFLVYWAELWHFYEQRFGKLSEM 428

Query: 1468 PPSDYGLISGHNMKEE 1515
            P S+Y LI+  N++ +
Sbjct: 429  PCSNYKLITASNLRNK 444


>gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group]
          Length = 463

 Score =  272 bits (696), Expect = 4e-70
 Identities = 168/439 (38%), Positives = 230/439 (52%), Gaps = 27/439 (6%)
 Frame = +1

Query: 277  FDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXXXXXR------V 438
            FDLE AVCS+G FMM+PNRW    + L RPLRL  +                       V
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 439  FGI--SQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLFE 612
             G     L+  D+  I  QV RMLRL E +  A   F  +H+ A+  GFGR+FRSPTLFE
Sbjct: 97   LGAPGDALSPPDQTSILEQVRRMLRLDEEDGRAAAEFQAMHAVAREAGFGRIFRSPTLFE 156

Query: 613  DIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRL 792
            D+VK  LLCNC+W RTLSM+ +LC+LQ EL+                  F  +TP     
Sbjct: 157  DMVKCILLCNCQWTRTLSMSTALCELQLELRSS-----------SSTENFQSRTPPIREC 205

Query: 793  KRRECSEILRPAKL--RFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSSK 966
            KR+  ++     KL  +F+E     +    ++         ++   +  + +P+  S + 
Sbjct: 206  KRKRSNKRNVRVKLETKFNEDKLVCLEDPNLA-----TDTANLQTYENSFNLPSAASGTG 260

Query: 967  LTSN-TMDTSELPSNL------MAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 1125
             TS  ++D SEL            G+FPTP+ELA+L E++L KRC +GYRARRI+ LA+ 
Sbjct: 261  NTSEVSLDHSELKLRNEPCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARS 320

Query: 1126 ICNGSIDLDSLENPD----------GSVQMKLEDLKDYLQKLDGVGKFTCDVVLMCMGIY 1275
            I  G I L  LE              +     + L + L  + G G FT   VLMCMG +
Sbjct: 321  IVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFF 380

Query: 1276 QCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGR 1455
              +P DTET+RHLKQ   R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG+
Sbjct: 381  HMIPADTETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGK 439

Query: 1456 LSQMPPSDYGLISGHNMKE 1512
            +S M P +Y L +   +K+
Sbjct: 440  ISDMEPINYRLFTASKLKK 458


>gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group]
          Length = 442

 Score =  266 bits (681), Expect = 2e-68
 Identities = 169/433 (39%), Positives = 226/433 (52%), Gaps = 21/433 (4%)
 Frame = +1

Query: 277  FDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLC-DEXXXXXXXXXXXXXXXXRVFGISQ 453
            FDLE AVCS+G FMM+PNRW    + L RPLRL  D                     +S 
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 454  LTHQDED--------HIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLF 609
            L   D+D         I  QV RMLRL E +  A+  F  +H+ A+  GFGR+FRSPTLF
Sbjct: 97   LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 610  EDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTR 789
            ED++K  LLCNC+W RTLSM+ +LC+LQ EL+                  F  +TP    
Sbjct: 157  EDMIKCILLCNCQWTRTLSMSTALCELQLELRSS-----------SSTENFQSRTPPIRE 205

Query: 790  LKRRECSEILRPAKL--RFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSS 963
             KR+  ++     KL  +F+E     +    ++           T    E +  +L SS+
Sbjct: 206  CKRKRSNKRNVRVKLETKFNEDKMVCLEDPNLA-----------TNTANENLF-SLPSSA 253

Query: 964  KLTSNTMDTS----------ELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILN 1113
              T NT + S          EL      G+FPTP+ELA+L E++L KRC +GYRARRI+ 
Sbjct: 254  NETGNTSEVSLDHSELKLRYELCLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVM 313

Query: 1114 LAKQICNGSIDLDSLENPDGSVQMKLEDLKDYLQKLDGVGKFTCDVVLMCMGIYQCVPTD 1293
            LA+ I  G I L  LE          + L + L  + G+  F    VLMCMG +  +P D
Sbjct: 314  LARSIVEGKICLQKLEE-------IRKILIEELSTISGIWPFHSCNVLMCMGFFHMIPAD 366

Query: 1294 TETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSQMPP 1473
            TET+RHLKQ   R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG +S M P
Sbjct: 367  TETIRHLKQFHKRAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEP 425

Query: 1474 SDYGLISGHNMKE 1512
             +Y L +   +K+
Sbjct: 426  INYRLFTASKLKK 438


>gb|EOY25839.1| Uncharacterized protein isoform 2 [Theobroma cacao]
          Length = 426

 Score =  256 bits (655), Expect = 2e-65
 Identities = 170/454 (37%), Positives = 231/454 (50%), Gaps = 18/454 (3%)
 Frame = +1

Query: 202  TASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCD 381
            ++S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D
Sbjct: 26   SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 84

Query: 382  EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFH 546
                                  RV+G   L+ Q    +  QV RMLRLSE E+  +  F 
Sbjct: 85   HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 144

Query: 547  RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 696
            ++    H + ++         GRVFRSPTLFED+VK  LLCNC+                
Sbjct: 145  KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQ---------------- 188

Query: 697  ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 876
                                 F PKTP    LKR          KLR             
Sbjct: 189  ----------------AAEDDFIPKTPAGNELKR----------KLR------------- 209

Query: 877  ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 1056
                   VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L 
Sbjct: 210  -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 261

Query: 1057 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKDYLQKLDG 1227
            E++L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L + L+++DG
Sbjct: 262  ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 319

Query: 1228 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1407
             G FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFL
Sbjct: 320  FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFL 378

Query: 1408 AYWWEIWDSYEKQFGRLSQMPPSDYGLISGHNMK 1509
            AYW E+W  YE++FG+LS+MP   Y LI+  NMK
Sbjct: 379  AYWAELWHYYEQRFGKLSEMPFCGYKLITASNMK 412


>dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Group]
            gi|50510134|dbj|BAD31099.1| hypothetical protein [Oryza
            sativa Japonica Group]
          Length = 501

 Score =  251 bits (641), Expect = 9e-64
 Identities = 169/481 (35%), Positives = 230/481 (47%), Gaps = 69/481 (14%)
 Frame = +1

Query: 277  FDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLC-DEXXXXXXXXXXXXXXXXRVFGISQ 453
            FDLE AVCS+G FMM+PNRW    + L RPLRL  D                     +S 
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 454  LTHQDED--------HIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGRVFRSPTLF 609
            L   D+D         I  QV RMLRL E +  A+  F  +H+ A+  GFGR+FRSPTLF
Sbjct: 97   LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 610  EDIVKAFLLCNC------------------------------------------RWQRTL 663
            ED++K  LLCNC                                          RW RTL
Sbjct: 157  EDMIKCILLCNCQFSLPLPLPSLASTSMRNSDTNMSRYLGIAIFHLHSTVLFNCRWTRTL 216

Query: 664  SMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKL--R 837
            SM+ +LC+LQ EL+                  F  +TP     KR+  ++     KL  +
Sbjct: 217  SMSTALCELQLELRSSS-----------STENFQSRTPPIRECKRKRSNKRNVRVKLETK 265

Query: 838  FDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNL-- 1011
            F+E     +    ++   +  +  S+     E      G++S+++   +D SEL      
Sbjct: 266  FNEDKMVCLEDPNLATNTANENLFSLPSSANE-----TGNTSEVS---LDHSELKLRYEL 317

Query: 1012 ----MAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPD--- 1170
                  G+FPTP+ELA+L E++L KRC +GYRARRI+ LA+ I  G I L  LE      
Sbjct: 318  CLEDCGGDFPTPEELANLDEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEIRKMS 377

Query: 1171 -------GSVQMKLEDLKDYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVG 1329
                    +     + L + L  + G G FT   VLMCMG +  +P DTET+RHLKQ   
Sbjct: 378  VPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQFHK 437

Query: 1330 RSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSQMPPSDYGLISGHNMK 1509
            R+  TI SV  +++ +Y KY PFQFLAYW E+W  Y KQFG +S M P +Y L +   +K
Sbjct: 438  RAS-TISSVQKELDNIYGKYAPFQFLAYWCELWGFYNKQFGIISDMEPINYRLFTASKLK 496

Query: 1510 E 1512
            +
Sbjct: 497  K 497


>gb|EOY25840.1| Uncharacterized protein isoform 3 [Theobroma cacao]
          Length = 421

 Score =  249 bits (635), Expect = 4e-63
 Identities = 165/422 (39%), Positives = 225/422 (53%), Gaps = 18/422 (4%)
 Frame = +1

Query: 202  TASLCERMMEENGGECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCD 381
            ++S C  ++E   GE          F+LE+AVCS+G FMM+PN+W    ++L RPLRL D
Sbjct: 41   SSSCCSVLIELPVGEAAAAEGA-GPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPLRLLD 99

Query: 382  EXXXXXXXXXXXXXXXX-----RVFGISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFH 546
                                  RV+G   L+ Q    +  QV RMLRLSE E+  +  F 
Sbjct: 100  HHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKVREFR 159

Query: 547  RV----HSQAKSQG------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQS 696
            ++    H + ++         GRVFRSPTLFED+VK  LLCNC++ RTLSMA +LC+LQ 
Sbjct: 160  KIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALCELQF 219

Query: 697  ELKGKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGK 876
            E + +P               F PKTP    LKR          KLR             
Sbjct: 220  ETQ-RPFS-----GVRAAEDDFIPKTPAGNELKR----------KLR------------- 250

Query: 877  ISEGCSIVSEISITKCDGEYIVPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLS 1056
                   VS++S+ + +G++  P    S      + +  E  +    G+FP+P+ELA+L 
Sbjct: 251  -------VSKVSM-RLEGKFAEPRADHSKSDLQPSQELDEPHAYKGMGSFPSPEELANLD 302

Query: 1057 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQMKLED---LKDYLQKLDG 1227
            E++L KRC +GYRA RIL LAK I  G I L  LE  +G  ++ L     L + L+++DG
Sbjct: 303  ESFLAKRCNLGYRASRILKLAKGIVQGIIQLMQLE--EGCKEISLSSYNKLAEQLRQIDG 360

Query: 1228 VGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFL 1407
             G FTC  VLMCMG Y  +P D+ET+RHLKQV  +S  T+++V  DVE +YAKY PFQFL
Sbjct: 361  FGPFTCANVLMCMGFYHVIPADSETIRHLKQVHSKSS-TMQTVGRDVEGIYAKYAPFQFL 419

Query: 1408 AY 1413
            AY
Sbjct: 420  AY 421


>gb|EOY14232.1| Uncharacterized protein TCM_033523 [Theobroma cacao]
          Length = 374

 Score =  244 bits (624), Expect = 8e-62
 Identities = 153/424 (36%), Positives = 215/424 (50%), Gaps = 5/424 (1%)
 Frame = +1

Query: 241  GECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXX 420
            GEC+      SSF++E+AVC++G FMMSPN W+   K+L RPLRL D             
Sbjct: 12   GECS------SSFNMEKAVCNHGLFMMSPNVWIPSTKSLRRPLRLADSSGSVYVTISHPA 65

Query: 421  XXXX----RVFGI-SQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVHSQAKSQGFGR 585
                    +V G+ + ++  D+  I  QV RMLR+S  ++  +  F  +H  AK +GFGR
Sbjct: 66   PNHPFLVIQVNGLQNSISSADKAVIMEQVARMLRISSKDERDVREFQTLHGSAKDRGFGR 125

Query: 586  VFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXXXXXXXXXXXXFY 765
            +FRSP+ FED VK+ LLCNC                        GW              
Sbjct: 126  IFRSPSFFEDAVKSILLCNC------------------------GWK------------- 148

Query: 766  PKTPTKTRLKRRECSEILRPAKLRFDETSQCKVMSGKISEGCSIVSEISITKCDGEYIVP 945
             +T T   + R  C+  L+ A             + KIS           TK        
Sbjct: 149  -RTLT---MARALCALQLQLASAHLQHKRVASNSNVKIS-----------TKRLKHKKYT 193

Query: 946  NLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSENYLTKRCGVGYRARRILNLAKQ 1125
               S+S+L+ +  D S        GNFPT  ELA L E YL +RC +GYRAR IL LA++
Sbjct: 194  KASSTSELSMSGFDQS-------IGNFPTSTELACLDEKYLNERCNLGYRARCILQLARK 246

Query: 1126 ICNGSIDLDSLENPDGSVQMKLEDLKDYLQKLDGVGKFTCDVVLMCMGIYQCVPTDTETV 1305
            + NG ++L+ LE  + S     E     L K+ G G F C  ++MC+G Y+ +P D+ET+
Sbjct: 247  VENGELELNKLE--ESSDTTSYERFYQKLMKIKGFGPFVCSNIMMCIGFYERIPFDSETI 304

Query: 1306 RHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAYWWEIWDSYEKQFGRLSQMPPSDYG 1485
            RHLK V G+  C+ K++  D+EE+Y KY PFQ +AYW E+ D YE +FG+LS++  S Y 
Sbjct: 305  RHLKMVHGKGKCSRKTIEKDIEEIYGKYAPFQCMAYWLELLDEYENKFGKLSELESSSYH 364

Query: 1486 LISG 1497
            L +G
Sbjct: 365  LATG 368


>ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629917 isoform X2 [Citrus
            sinensis]
          Length = 409

 Score =  240 bits (612), Expect = 2e-60
 Identities = 159/420 (37%), Positives = 219/420 (52%), Gaps = 30/420 (7%)
 Frame = +1

Query: 244  ECTLRVSVKSSFDLERAVCSYGFFMMSPNRWLSDEKTLYRPLRLCDEXXXXXXXXXXXXX 423
            E  L++ +  +F+LE AVCS+G FMMSPNRW    ++L RPL L +              
Sbjct: 5    ESLLKLPLAETFNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVDV 64

Query: 424  XXXRVF----------------GISQLTHQDEDHIKAQVVRMLRLSEHEDDAIDGFHRVH 555
               +                      L+ + +D + AQV RMLRLSE ++  +  F R+ 
Sbjct: 65   TICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRIV 124

Query: 556  SQ-AKSQG---------FGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELK 705
             Q A+ +G          GRVFRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ    
Sbjct: 125  RQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQ---- 180

Query: 706  GKPLGWXXXXXXXXXXXXFYPKTPTKTRLKRRE-CSEILRPAKLRFDETSQCKVMSGKIS 882
                 W            F P+TP     KRR+  S++      R  E+         + 
Sbjct: 181  -----WELQHCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKASSEDYMNLK 235

Query: 883  EGCSIVSEISITKCDGEYIVPNLGSSSKLTS-NTMDTSELPS-NLMAGNFPTPKELASLS 1056
              C+ V E ++     +   P     S L   N + T++ PS     GNFP+P+ELA+L 
Sbjct: 236  LDCAGVLEENV-----QPSFPQNDIESDLHGLNELSTTDPPSARDRIGNFPSPRELANLD 290

Query: 1057 ENYLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPDGSVQM-KLEDLKDYLQKLDGVG 1233
            E++L KRC +GYRA RIL LA+ I +G I L  LE+      +     L + L +++G G
Sbjct: 291  ESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELEDMCNEASLTAYVKLAEQLSQINGFG 350

Query: 1234 KFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYVPFQFLAY 1413
             FT + VL+C+G Y  +PTD+ET+RHLKQV  R +CT K+V M  E +Y KY PFQFLAY
Sbjct: 351  PFTRNNVLVCIGFYHVIPTDSETIRHLKQVHAR-NCTSKTVQMIAESIYGKYAPFQFLAY 409


>gb|EMT03969.1| hypothetical protein F775_22747 [Aegilops tauschii]
          Length = 333

 Score =  227 bits (578), Expect = 2e-56
 Identities = 133/341 (39%), Positives = 190/341 (55%), Gaps = 19/341 (5%)
 Frame = +1

Query: 550  VHSQAKSQGFGRVFRSPTLFEDIVKAFLLCNCRWQRTLSMAASLCDLQSELKGKPLGWXX 729
            +H+ A+  GFGR+FRSPTLFED+VK  LLCNC+W RTLSMA +LC+LQ ELK        
Sbjct: 1    MHAAAREAGFGRIFRSPTLFEDMVKCILLCNCQWTRTLSMATALCELQLELK-------- 52

Query: 730  XXXXXXXXXXFYPKTPTKTRLKRRECSEILRPAKLRFDETS-QC-------KVMSGKISE 885
                         +TP     KR+         KL    T  +C            +++ 
Sbjct: 53   ---CSAGTEDLQLRTPPIREHKRKRSKNQNVRVKLEKKFTELECLEDPRVETAQDTRVAT 109

Query: 886  GCSIVSEISITKCDGEYI-VPNLGSSSKLTSNTMDTSELPSNLMAGNFPTPKELASLSEN 1062
            G S V  I+  + D +   +P +   +     + D+SEL      G+FPTP+ELA+L E+
Sbjct: 110  GTSDV--ITHLEADEKLASLPQVAPETGSVCQSFDSSELSLEGCIGDFPTPEELANLDED 167

Query: 1063 YLTKRCGVGYRARRILNLAKQICNGSIDLDSLENPD----------GSVQMKLEDLKDYL 1212
            +L KRCG+GYRA RI+ LA+ I  G +   +LE              ++    E L + L
Sbjct: 168  FLAKRCGLGYRAERIVLLARSIVEGKVCPQNLEEMQKMSLPATEELSTIPSTYERLNNEL 227

Query: 1213 QKLDGVGKFTCDVVLMCMGIYQCVPTDTETVRHLKQVVGRSDCTIKSVVMDVEEVYAKYV 1392
              + G G FT   VLMCMG +  +P DTET+RHLKQ    +  TIKSV M+++++Y +Y 
Sbjct: 228  TTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQCHEIAS-TIKSVHMELDKIYGEYA 286

Query: 1393 PFQFLAYWWEIWDSYEKQFGRLSQMPPSDYGLISGHNMKEE 1515
            PFQFLAYW+E+W  Y+KQFG++++M PS Y L +   +K++
Sbjct: 287  PFQFLAYWFELWGFYDKQFGKITEMDPSTYRLFTASALKKQ 327

