BLASTX nr result

ID: Akebia27_contig00020716 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00020716
         (934 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280809.1| PREDICTED: uncharacterized protein LOC100244...   327   3e-87
ref|XP_006476303.1| PREDICTED: uncharacterized protein LOC102612...   320   5e-85
ref|XP_002518898.1| conserved hypothetical protein [Ricinus comm...   314   3e-83
ref|XP_002297952.1| hypothetical protein POPTR_0001s11300g [Popu...   311   3e-82
ref|XP_006439240.1| hypothetical protein CICLE_v10021179mg [Citr...   308   2e-81
ref|XP_006343794.1| PREDICTED: uncharacterized protein LOC102593...   307   3e-81
ref|XP_007039415.1| Uncharacterized protein isoform 1 [Theobroma...   306   6e-81
ref|XP_007209367.1| hypothetical protein PRUPE_ppa008762mg [Prun...   303   6e-80
ref|XP_004245467.1| PREDICTED: uncharacterized protein LOC101267...   301   2e-79
gb|ADN33930.1| hypothetical protein [Cucumis melo subsp. melo]        298   2e-78
ref|XP_004146131.1| PREDICTED: uncharacterized protein LOC101213...   297   4e-78
ref|XP_004309583.1| PREDICTED: uncharacterized protein LOC101293...   290   7e-76
ref|NP_001241068.1| uncharacterized protein LOC100803482 [Glycin...   288   2e-75
gb|EXB39941.1| hypothetical protein L484_001700 [Morus notabilis]     285   2e-74
ref|XP_007160578.1| hypothetical protein PHAVU_002G333700g [Phas...   276   6e-72
ref|XP_003598013.1| hypothetical protein MTR_3g005290 [Medicago ...   273   5e-71
gb|AAO42239.1| unknown protein [Arabidopsis thaliana]                 267   4e-69
ref|NP_194140.2| uncharacterized protein [Arabidopsis thaliana] ...   267   4e-69
ref|XP_002869739.1| hypothetical protein ARALYDRAFT_492454 [Arab...   267   5e-69
gb|EXB81854.1| hypothetical protein L484_015328 [Morus notabilis]     266   1e-68

>ref|XP_002280809.1| PREDICTED: uncharacterized protein LOC100244479 [Vitis vinifera]
           gi|147827009|emb|CAN62286.1| hypothetical protein
           VITISV_034703 [Vitis vinifera]
           gi|297742426|emb|CBI34575.3| unnamed protein product
           [Vitis vinifera]
          Length = 315

 Score =  327 bits (839), Expect = 3e-87
 Identities = 172/267 (64%), Positives = 204/267 (76%), Gaps = 2/267 (0%)
 Frame = +3

Query: 3   KCSSEQRDNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQ 182
           KCSSE+ +N G K+ LSGMV K               +GLEKAS+RVE A++EL +IE+Q
Sbjct: 47  KCSSEESENGGFKEALSGMVGKQVEELLNREENRVLLEGLEKASQRVEKARRELAEIEKQ 106

Query: 183 EIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNL--DGFGSGNVLQES 356
           EIEA QLRNYI  LE R+SEIAECQ+EILEAR+ VEEAE SL++N+  DG  +  +  ES
Sbjct: 107 EIEAKQLRNYIEQLEGRSSEIAECQREILEARAKVEEAERSLSINMGEDGDRASFLETES 166

Query: 357 EQSDIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFR 536
           ++ + +EER ES+KAA VSAIVGT AGLP   +Q TS  QL+LPLAI F+SCALFG+TFR
Sbjct: 167 KEINKEEERLESVKAALVSAIVGTFAGLPIFLTQVTSSSQLLLPLAINFVSCALFGITFR 226

Query: 537 YTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIF 716
           YTIRRDLDN+QLKTGTSAAFGF+KGL+ LG G PLELD  SFLSHA DG +YVSEN FIF
Sbjct: 227 YTIRRDLDNIQLKTGTSAAFGFVKGLATLGGGSPLELDAGSFLSHAFDGVLYVSENLFIF 286

Query: 717 FFAAVGLDYCFKTRLLSPFPIKKQFDK 797
            FAAVGLD+CFK R LSPFPIK    +
Sbjct: 287 VFAAVGLDFCFKMRFLSPFPIKSSVSR 313


>ref|XP_006476303.1| PREDICTED: uncharacterized protein LOC102612869 [Citrus sinensis]
          Length = 322

 Score =  320 bits (820), Expect = 5e-85
 Identities = 167/263 (63%), Positives = 205/263 (77%), Gaps = 4/263 (1%)
 Frame = +3

Query: 9   SSEQRDNE--GLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQ 182
           SSE+R+     LKD L+GMVD+               DGLEKAS RVE+AKKEL +IE+Q
Sbjct: 52  SSEERETSDGNLKDALTGMVDQRVEELLNKEENRALLDGLEKASLRVEMAKKELAEIEKQ 111

Query: 183 EIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQESEQ 362
           E+EA Q+R+Y+N LE RA EIAECQ+EI++AR++++EAE SL+ N + F     L E E 
Sbjct: 112 ELEAKQMRDYVNKLESRAFEIAECQREIVDARAILDEAERSLSQNGNEFRDERSLAEKES 171

Query: 363 SDI--DEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFR 536
             I  D ERWESIKAA++SA+VG+LAGLP SF+Q TS  QL+LPLA+TFISCALFGVTFR
Sbjct: 172 EGINKDIERWESIKAAAISALVGSLAGLPISFTQVTSSSQLLLPLAVTFISCALFGVTFR 231

Query: 537 YTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIF 716
           YTIRRDLDN+QLKTGTS+AFGF+KGL+ L  GPPLEL+  SFLSHA DGA+YVS+N  IF
Sbjct: 232 YTIRRDLDNIQLKTGTSSAFGFVKGLATLSGGPPLELNTESFLSHAFDGALYVSQNLLIF 291

Query: 717 FFAAVGLDYCFKTRLLSPFPIKK 785
            FAAV LD+CFKTR+LSP+P+KK
Sbjct: 292 IFAAVTLDFCFKTRVLSPYPMKK 314


>ref|XP_002518898.1| conserved hypothetical protein [Ricinus communis]
           gi|223541885|gb|EEF43431.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 308

 Score =  314 bits (805), Expect = 3e-83
 Identities = 170/262 (64%), Positives = 196/262 (74%), Gaps = 3/262 (1%)
 Frame = +3

Query: 9   SSEQRDNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQEI 188
           + + +++  LKD LSGMVDK               DGLEKAS+R+E AK++L +IERQE+
Sbjct: 52  NDDSQESNSLKDALSGMVDKQVEALFSREENRVLIDGLEKASQRLERAKRDLAEIERQEL 111

Query: 189 EATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLD---GFGSGNVLQESE 359
           EA Q+R YIN LE RASEIAECQ+EI+EAR+ VEEAE SL+LN D   G G        E
Sbjct: 112 EAEQMRVYINQLESRASEIAECQQEIIEARAKVEEAERSLSLNKDPEDGLGG-------E 164

Query: 360 QSDIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFRY 539
             + DEERWESIKAAS+SA VGTLAGLP S +Q TS  QLILP A TFISCALFGVTFRY
Sbjct: 165 TINKDEERWESIKAASISAFVGTLAGLPISLTQVTSIPQLILPSATTFISCALFGVTFRY 224

Query: 540 TIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIFF 719
            +RRDLDN QLKTGTSAAFGF+KGL  L  GPP EL+ ASFLSHA++GAVYVSEN  IF 
Sbjct: 225 AVRRDLDNFQLKTGTSAAFGFVKGLGTLAGGPPFELNPASFLSHALNGAVYVSENLLIFA 284

Query: 720 FAAVGLDYCFKTRLLSPFPIKK 785
           FAAV LD+C K RLLSPFP+KK
Sbjct: 285 FAAVSLDFCIKMRLLSPFPMKK 306


>ref|XP_002297952.1| hypothetical protein POPTR_0001s11300g [Populus trichocarpa]
           gi|222845210|gb|EEE82757.1| hypothetical protein
           POPTR_0001s11300g [Populus trichocarpa]
          Length = 318

 Score =  311 bits (796), Expect = 3e-82
 Identities = 171/267 (64%), Positives = 196/267 (73%), Gaps = 6/267 (2%)
 Frame = +3

Query: 3   KCSSEQRDNEG------LKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKEL 164
           KCSS   +  G      LKD LSGMVDK               DGLEKAS+RVE+A++EL
Sbjct: 49  KCSSNSSEEIGPSNGNNLKDALSGMVDKQVEELLNRQENRVLLDGLEKASQRVEMARREL 108

Query: 165 EDIERQEIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNV 344
            +IERQE+EA QLR+YIN LE RASEIAECQ+EILEAR+MVEEAE SL+LN DG      
Sbjct: 109 AEIERQELEAKQLRDYINQLESRASEIAECQQEILEARAMVEEAERSLSLNNDGDAL--- 165

Query: 345 LQESEQSDIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFG 524
             ES++   D+ER ESIKA  VSA+VGTLAGLP S +Q TS  QLILP  ITFISCALFG
Sbjct: 166 --ESKEISRDQERLESIKAGFVSALVGTLAGLPISLTQVTSNAQLILPSTITFISCALFG 223

Query: 525 VTFRYTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSEN 704
           +TFRY +RRDLDN QLKTGT+AAFG +KGL+ L  G PLELD  SFLSHA +GA YVSEN
Sbjct: 224 LTFRYAVRRDLDNFQLKTGTAAAFGIVKGLATLAGGQPLELDPESFLSHAFNGAKYVSEN 283

Query: 705 FFIFFFAAVGLDYCFKTRLLSPFPIKK 785
             IF FAAV LD+CFK  LLSPFP+K+
Sbjct: 284 LLIFAFAAVSLDFCFKMGLLSPFPMKR 310


>ref|XP_006439240.1| hypothetical protein CICLE_v10021179mg [Citrus clementina]
           gi|557541502|gb|ESR52480.1| hypothetical protein
           CICLE_v10021179mg [Citrus clementina]
          Length = 322

 Score =  308 bits (789), Expect = 2e-81
 Identities = 162/263 (61%), Positives = 201/263 (76%), Gaps = 4/263 (1%)
 Frame = +3

Query: 9   SSEQRDNE--GLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQ 182
           SSE+R+     LKD L+ MVD+               DGLEKAS RVE+AKKEL +IE+Q
Sbjct: 52  SSEERETSDGNLKDALTRMVDQRVEELLNKEENRALLDGLEKASLRVEMAKKELAEIEKQ 111

Query: 183 EIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQESEQ 362
           E+EA Q+R+Y+N LE RA EIAECQ+EI+EAR++++EA  SL+ N + F     L E E 
Sbjct: 112 ELEAKQMRDYVNKLESRAFEIAECQREIVEARAILDEAARSLSQNGNEFRDERNLAEKES 171

Query: 363 SDI--DEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFR 536
             I  D ERWESIKAA++SA+VG+LAGLP SF+Q TS  QL+LPLA+TFISCALFGVT+R
Sbjct: 172 EGINKDIERWESIKAAAISALVGSLAGLPISFTQVTSSSQLLLPLAVTFISCALFGVTYR 231

Query: 537 YTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIF 716
           YTIRRDLDN+QLKTGTS+AFGF+KGL+ L  GPPLE +  SFLSHA DGA+YVS+N  IF
Sbjct: 232 YTIRRDLDNIQLKTGTSSAFGFVKGLATLSGGPPLEPNTESFLSHAFDGALYVSQNLLIF 291

Query: 717 FFAAVGLDYCFKTRLLSPFPIKK 785
            FAAV LD+CFKT++LSP+P+ K
Sbjct: 292 IFAAVTLDFCFKTQVLSPYPMNK 314


>ref|XP_006343794.1| PREDICTED: uncharacterized protein LOC102593433 [Solanum tuberosum]
          Length = 311

 Score =  307 bits (787), Expect = 3e-81
 Identities = 166/263 (63%), Positives = 194/263 (73%)
 Frame = +3

Query: 9   SSEQRDNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQEI 188
           SSE  D   LKD L+G+VD+               DGLEKA+ RVE+AKKEL +IERQE+
Sbjct: 47  SSENGDMGNLKDALTGIVDERVEELLKREENRVLLDGLEKATLRVEMAKKELAEIERQEL 106

Query: 189 EATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQESEQSD 368
           EA  L++YI  LE R SEIAECQK+ILEAR+M+EEAE SL  N+ G        + +  +
Sbjct: 107 EAKLLKDYITQLETRTSEIAECQKDILEARAMIEEAERSL--NVSGDARRRDATDGDVVN 164

Query: 369 IDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFRYTIR 548
            DEER ESIKAAS+SAIVGTLAGLP   S+ TS  +LILPL+ITFISCALFGVTFRY +R
Sbjct: 165 RDEERVESIKAASLSAIVGTLAGLPIFLSRITSSSELILPLSITFISCALFGVTFRYAVR 224

Query: 549 RDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIFFFAA 728
           RDLDN QLK+GTSAAFG +KGL+ LG GPPLELD ASF SHA+DGAVYVSEN  IF FA 
Sbjct: 225 RDLDNFQLKSGTSAAFGVVKGLATLGGGPPLELDAASFWSHALDGAVYVSENLLIFLFAG 284

Query: 729 VGLDYCFKTRLLSPFPIKKQFDK 797
           VGLD CFK R+LSPFPI +   +
Sbjct: 285 VGLDLCFKLRILSPFPIDRSISE 307


>ref|XP_007039415.1| Uncharacterized protein isoform 1 [Theobroma cacao]
           gi|508776660|gb|EOY23916.1| Uncharacterized protein
           isoform 1 [Theobroma cacao]
          Length = 322

 Score =  306 bits (785), Expect = 6e-81
 Identities = 166/266 (62%), Positives = 196/266 (73%), Gaps = 5/266 (1%)
 Frame = +3

Query: 3   KCSSEQRDNE---GLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDI 173
           KCSS   + E    LKD LSG+V K               DGLEKAS+RVE+AK++L +I
Sbjct: 48  KCSSNSEEKETSNDLKDALSGIVGKQVEELLNREENKGLLDGLEKASQRVEMAKRQLVEI 107

Query: 174 ERQEIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQ- 350
           E+QE+EA  L NYIN LE RASEIAECQ+EI +AR+MVEEAE SL+LN D     +  + 
Sbjct: 108 EKQELEAQLLGNYINQLEARASEIAECQQEISQARAMVEEAELSLSLNADNVEDRDAFRS 167

Query: 351 -ESEQSDIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGV 527
            + E  D D+ER ESIKAA +SA+VGTLAGLP S +Q +S  QL+LPL+ TFISCALFGV
Sbjct: 168 KDGEGIDNDKERLESIKAALISAVVGTLAGLPISLTQVSSRTQLLLPLSATFISCALFGV 227

Query: 528 TFRYTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENF 707
           TFRY +RRDLDN QLKTGTSAAFGF+KGL  L  G PLELD  SFLS+A DGAVYVS+N 
Sbjct: 228 TFRYAVRRDLDNFQLKTGTSAAFGFVKGLGTLVGGSPLELDPGSFLSYAFDGAVYVSQNL 287

Query: 708 FIFFFAAVGLDYCFKTRLLSPFPIKK 785
            IF FAAVGLD CFK R+LSPFP+K+
Sbjct: 288 IIFLFAAVGLDLCFKMRILSPFPMKR 313


>ref|XP_007209367.1| hypothetical protein PRUPE_ppa008762mg [Prunus persica]
           gi|462405102|gb|EMJ10566.1| hypothetical protein
           PRUPE_ppa008762mg [Prunus persica]
          Length = 320

 Score =  303 bits (776), Expect = 6e-80
 Identities = 158/263 (60%), Positives = 197/263 (74%), Gaps = 3/263 (1%)
 Frame = +3

Query: 3   KCSSEQRDNEG-LKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIER 179
           KC+       G LKD LSG+V +               DGLEKAS+RVE+AK+EL +IE+
Sbjct: 49  KCTRSTSGESGSLKDALSGIVGEQVEELLKREENRDLLDGLEKASQRVEMAKRELAEIEK 108

Query: 180 QEIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQESE 359
           QE+EA ++R+YIN LE RASEIAECQKEILEA++MVEEAE +L+ + D  G G    E+ 
Sbjct: 109 QELEAKRVRDYINQLESRASEIAECQKEILEAKAMVEEAERALSQDGDQLGDGYGFSETG 168

Query: 360 QSDIDE--ERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTF 533
             +ID+  ERW+SIKAAS+SA+VGT+AGLPFSF+Q +S  +LILPLAITF+SCALFGVTF
Sbjct: 169 NGEIDKDKERWQSIKAASISALVGTVAGLPFSFTQVSSSSELILPLAITFVSCALFGVTF 228

Query: 534 RYTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFI 713
           RY +RRDLD+V LKTG  AAFG +KGL+ L  G PL+L+  SFL HA  GA+YVSE+ F+
Sbjct: 229 RYAVRRDLDDVHLKTGAPAAFGVVKGLATLDGGQPLQLNAGSFLLHAFHGALYVSESLFV 288

Query: 714 FFFAAVGLDYCFKTRLLSPFPIK 782
           F  AA+ LDYCFK RLLSPFP+K
Sbjct: 289 FLSAAIALDYCFKARLLSPFPVK 311


>ref|XP_004245467.1| PREDICTED: uncharacterized protein LOC101267149 [Solanum
           lycopersicum]
          Length = 311

 Score =  301 bits (772), Expect = 2e-79
 Identities = 162/263 (61%), Positives = 193/263 (73%)
 Frame = +3

Query: 9   SSEQRDNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQEI 188
           SSE  +   LKD L+G+VD+               DGLEKA+ RVE+AKKEL +IERQE+
Sbjct: 47  SSENGEMGNLKDALTGIVDERVEELLKREENRVLLDGLEKATLRVEMAKKELAEIERQEL 106

Query: 189 EATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQESEQSD 368
           EA  L++YI  LE R SEIAECQK+ILEAR+M+EEAE SL  N+ G        + +  +
Sbjct: 107 EAKLLKDYITQLETRTSEIAECQKDILEARAMIEEAERSL--NVSGDARKRDPTDGDVVN 164

Query: 369 IDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFRYTIR 548
            DEER ES+KAAS+SAIVGTLAGLP   S+ +S  +LILPL+ITFISCALFGVTFRY +R
Sbjct: 165 RDEERVESVKAASLSAIVGTLAGLPIFLSRISSSSELILPLSITFISCALFGVTFRYAVR 224

Query: 549 RDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIFFFAA 728
           RD DN QLK+GTSAAFG +KGL+ LG GPPLELD ASF SHA+DGAVYVSEN  IF FA 
Sbjct: 225 RDFDNFQLKSGTSAAFGVVKGLATLGGGPPLELDAASFWSHALDGAVYVSENLLIFLFAG 284

Query: 729 VGLDYCFKTRLLSPFPIKKQFDK 797
           VGLD CFK R+LSPFPI +   +
Sbjct: 285 VGLDLCFKLRILSPFPIDRSISE 307


>gb|ADN33930.1| hypothetical protein [Cucumis melo subsp. melo]
          Length = 319

 Score =  298 bits (763), Expect = 2e-78
 Identities = 162/271 (59%), Positives = 191/271 (70%), Gaps = 6/271 (2%)
 Frame = +3

Query: 3   KCSSEQRDNE----GLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELED 170
           KC+SE  D       LK+ LS MV +               DGLEKAS RVEIAKK+L +
Sbjct: 51  KCTSESNDQAQNDFNLKNALSSMVGEQVEELLNREENRSLLDGLEKASMRVEIAKKQLAE 110

Query: 171 IERQEIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQ 350
           IE+QE+E  + ++Y++ LE RASEI ECQKEILEAR M+EEAE SL  +      GN ++
Sbjct: 111 IEKQELELKRFKDYVSQLENRASEIEECQKEILEARGMIEEAERSLAQS----EGGNAIR 166

Query: 351 ESEQS--DIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFG 524
           + E    D DEER+ES+K AS+SAIVGTLAGLP   +Q  S  QL+LP AITFISCALFG
Sbjct: 167 DGEDGGLDRDEERFESVKVASISAIVGTLAGLPIFLNQVNSTSQLLLPTAITFISCALFG 226

Query: 525 VTFRYTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSEN 704
           VTFRYTIRRDLDN+QLKTGTSAAFGF+KGL+ L  G PLEL   SF SH +D AVYVSEN
Sbjct: 227 VTFRYTIRRDLDNIQLKTGTSAAFGFVKGLATLDGGVPLELSAESFSSHVIDAAVYVSEN 286

Query: 705 FFIFFFAAVGLDYCFKTRLLSPFPIKKQFDK 797
            +IF  AAV LDYCFK  LLSPFPI+K   +
Sbjct: 287 LYIFICAAVALDYCFKMSLLSPFPIRKSISR 317


>ref|XP_004146131.1| PREDICTED: uncharacterized protein LOC101213449 [Cucumis sativus]
           gi|449495026|ref|XP_004159713.1| PREDICTED:
           uncharacterized protein LOC101224994 [Cucumis sativus]
          Length = 319

 Score =  297 bits (761), Expect = 4e-78
 Identities = 162/271 (59%), Positives = 191/271 (70%), Gaps = 6/271 (2%)
 Frame = +3

Query: 3   KCSSEQRDNE----GLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELED 170
           KC+SE  D       LK+ LS MV +               DGLEKAS RVEIAKK+L +
Sbjct: 51  KCTSESNDQAQNDFSLKNALSSMVGEQVEELLNREENRSLLDGLEKASMRVEIAKKQLAE 110

Query: 171 IERQEIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQ 350
           IE+QE+E  + ++Y++ LE RASEI ECQKEILEAR M+EEAE SL  +      GN ++
Sbjct: 111 IEKQELELKRFKDYVSQLENRASEIEECQKEILEARGMIEEAERSLAQS----EGGNAIR 166

Query: 351 ESEQS--DIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFG 524
           + E    D DEER+ES+KAAS+SAIVGTLAGLP   +Q  S  QL+LP AITFISCALFG
Sbjct: 167 DGEDGGLDRDEERFESVKAASISAIVGTLAGLPIFLNQVNSTSQLLLPTAITFISCALFG 226

Query: 525 VTFRYTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSEN 704
           VTFRYTIRRDLDN+QLKTGT AAFGF+KGL+ L  G PLEL   SF SH +D AVYVSEN
Sbjct: 227 VTFRYTIRRDLDNIQLKTGTFAAFGFVKGLATLDGGVPLELSAESFSSHVIDAAVYVSEN 286

Query: 705 FFIFFFAAVGLDYCFKTRLLSPFPIKKQFDK 797
            +IF  AAV LDYCFK  LLSPFPI+K   +
Sbjct: 287 LYIFICAAVALDYCFKMSLLSPFPIRKSISR 317


>ref|XP_004309583.1| PREDICTED: uncharacterized protein LOC101293946 [Fragaria vesca
           subsp. vesca]
          Length = 315

 Score =  290 bits (741), Expect = 7e-76
 Identities = 161/268 (60%), Positives = 189/268 (70%), Gaps = 3/268 (1%)
 Frame = +3

Query: 3   KCSSEQRDNEG-LKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIER 179
           KC +   +N G  KD L G+V +               DGL KAS RVE AK+EL  IER
Sbjct: 44  KCRASSSENSGSFKDSLGGIVGEQVEELLNKEENKVLLDGLVKASGRVENAKRELAAIER 103

Query: 180 QEIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQESE 359
           QE EA   R YI  LE RASEI ECQKEI EA++MVEEAE +LT   D F +G    E+E
Sbjct: 104 QEREAKLAREYIKELETRASEIEECQKEISEAKAMVEEAERALTQTGDEFKNGYASAETE 163

Query: 360 QSDID--EERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTF 533
             +ID  +E+ ESIKAASV+++VGT+AGLPFSF+Q ++  +LILPLAITFISCALFGVTF
Sbjct: 164 NGEIDKDQEKLESIKAASVASLVGTVAGLPFSFTQVSNSSELILPLAITFISCALFGVTF 223

Query: 534 RYTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFI 713
           RY IRRDLDN  LKTG  AAFG +KGL++L  G PLEL+  S +SHA DGAVYVSEN F+
Sbjct: 224 RYAIRRDLDNGHLKTGAPAAFGVVKGLAMLEAGKPLELNTDSLISHAFDGAVYVSENLFV 283

Query: 714 FFFAAVGLDYCFKTRLLSPFPIKKQFDK 797
           F  AAV LDY FKTRLLSPFPIKK   +
Sbjct: 284 FLSAAVALDYLFKTRLLSPFPIKKSVSR 311


>ref|NP_001241068.1| uncharacterized protein LOC100803482 [Glycine max]
           gi|255647391|gb|ACU24161.1| unknown [Glycine max]
          Length = 293

 Score =  288 bits (737), Expect = 2e-75
 Identities = 150/259 (57%), Positives = 191/259 (73%), Gaps = 2/259 (0%)
 Frame = +3

Query: 9   SSEQRDNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQEI 188
           SS+++ +E +KD LSG+VD+               DGLEKAS+RVE+AK+EL  I +QE 
Sbjct: 34  SSDEKGSEDVKDTLSGVVDEQVEEFLSRKENKVLLDGLEKASQRVEMAKRELALIRKQEF 93

Query: 189 EATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDG--FGSGNVLQESEQ 362
              QL++Y+N LE +  EI ECQ++I EA+++VEEAE SL +N+ G   GS ++  +SE 
Sbjct: 94  AVKQLKDYVNQLESKVFEIGECQRDISEAKALVEEAERSLLVNVGGPENGSTSMGMKSED 153

Query: 363 SDIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFRYT 542
            D DEERWES+KAAS+SA+VGT +GLP  F+Q T+  QL+LPLAI FI CALFGVTFRYT
Sbjct: 154 IDRDEERWESVKAASISALVGTFSGLPICFTQVTNTTQLLLPLAINFICCALFGVTFRYT 213

Query: 543 IRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIFFF 722
           IRR+LD+VQLKTG +AAFG +KGL+ LG GP  E +  SFLSHA +G +Y+SEN  IF  
Sbjct: 214 IRRNLDDVQLKTGVAAAFGVVKGLATLGGGPLPEPNIESFLSHAQEGTIYLSENLLIFVS 273

Query: 723 AAVGLDYCFKTRLLSPFPI 779
           AAV LDYC KTRLLSPFPI
Sbjct: 274 AAVALDYCLKTRLLSPFPI 292


>gb|EXB39941.1| hypothetical protein L484_001700 [Morus notabilis]
          Length = 320

 Score =  285 bits (728), Expect = 2e-74
 Identities = 157/263 (59%), Positives = 192/263 (73%), Gaps = 3/263 (1%)
 Frame = +3

Query: 3   KCSSE---QRDNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDI 173
           KCS+    +  +  LKD L+ +V +               DGLE+AS RVE AK+EL +I
Sbjct: 52  KCSNNGGLREQSFNLKDSLNDVVGEQVQELLNREENKALLDGLERASLRVEKAKRELAEI 111

Query: 174 ERQEIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQE 353
           ERQE+EA Q+R Y++ LE RASEIAECQKEI EAR+MVEEAE SL+ + +G  +G   + 
Sbjct: 112 ERQELEANQMREYVDQLERRASEIAECQKEISEARAMVEEAERSLSQSEEGSYAG---KG 168

Query: 354 SEQSDIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTF 533
           +E+ D DEER ESIKAAS+SA VGT+AGLP S +Q ++  QLILPLAITF SCALFGVTF
Sbjct: 169 NEEIDKDEERLESIKAASISAFVGTIAGLPISLTQVSTTSQLILPLAITFASCALFGVTF 228

Query: 534 RYTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFI 713
           RYTIRRDLD+V LKTG  AAFG +KGL+ L  G PLEL+  S  SHA DGA++VS++ F+
Sbjct: 229 RYTIRRDLDDVHLKTGACAAFGVVKGLATLSGGQPLELNTESISSHAFDGAIHVSQDLFL 288

Query: 714 FFFAAVGLDYCFKTRLLSPFPIK 782
           F  AAVGLDYCFK  LLSPFPIK
Sbjct: 289 FVSAAVGLDYCFKMGLLSPFPIK 311


>ref|XP_007160578.1| hypothetical protein PHAVU_002G333700g [Phaseolus vulgaris]
           gi|561033993|gb|ESW32572.1| hypothetical protein
           PHAVU_002G333700g [Phaseolus vulgaris]
          Length = 293

 Score =  276 bits (707), Expect = 6e-72
 Identities = 145/259 (55%), Positives = 186/259 (71%), Gaps = 2/259 (0%)
 Frame = +3

Query: 9   SSEQRDNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQEI 188
           SS+++  + +KD L+G+VD+               DGLEKAS+RVE+AK+ELE I++QE+
Sbjct: 34  SSDEKGRDDVKDALNGLVDEQVQELLSRKENKILLDGLEKASQRVEMAKRELELIQKQEL 93

Query: 189 EATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQ--ESEQ 362
              QL++Y++ LE    EI ECQ++I EA++MVE+AE SL +N+ G   G      +SE+
Sbjct: 94  AVKQLKDYVSQLEGEIIEIEECQRDISEAKAMVEKAEHSLLVNVGGSEGGGTSMGMKSEE 153

Query: 363 SDIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFRYT 542
            D DEERWESIKAAS+SA+VGT++GLP  F+Q T   QL+LPL I FISCALFGVTFRYT
Sbjct: 154 IDRDEERWESIKAASISALVGTISGLPICFTQVTDTTQLLLPLTINFISCALFGVTFRYT 213

Query: 543 IRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIFFF 722
           IRR+LD+VQLKTG +AAFG +KGL  LG G  LE ++ SFLS   +G +YV EN  IF  
Sbjct: 214 IRRNLDDVQLKTGVAAAFGVVKGLGTLGGGLLLEPNFQSFLSLVQEGTLYVCENLLIFIS 273

Query: 723 AAVGLDYCFKTRLLSPFPI 779
            AV LDYCFKTRLLS FPI
Sbjct: 274 VAVALDYCFKTRLLSAFPI 292


>ref|XP_003598013.1| hypothetical protein MTR_3g005290 [Medicago truncatula]
           gi|355487061|gb|AES68264.1| hypothetical protein
           MTR_3g005290 [Medicago truncatula]
          Length = 300

 Score =  273 bits (699), Expect = 5e-71
 Identities = 145/266 (54%), Positives = 186/266 (69%), Gaps = 5/266 (1%)
 Frame = +3

Query: 3   KCSSEQR---DNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDI 173
           +CSS  +   +N  LKD LSGM+                 D L+KAS+RVEIAK +L  I
Sbjct: 33  RCSSSSKNSEENNDLKDALSGMMGDQVEELLNREENKVLFDNLQKASQRVEIAKTQLAFI 92

Query: 174 ERQEIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQ- 350
           E+QE+   Q ++Y   L+  A +IAE Q+EI EA++M+EEAE SL LN+ G   G     
Sbjct: 93  EKQELALKQYKDYTQQLQGNAFQIAESQREISEAKAMLEEAERSLLLNVGGAEEGGAFMG 152

Query: 351 -ESEQSDIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGV 527
            +SE+ D DEER ES+KAAS+SA+VGTL+GLP  F+Q T+  QL+L +AI F+ CALFGV
Sbjct: 153 MKSEEVDRDEERLESVKAASISALVGTLSGLPICFTQATNTTQLLLSVAINFVCCALFGV 212

Query: 528 TFRYTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENF 707
           TFRYT+RR+LD+ QLKTG +AAFG +KGL++L  GP LEL++ S L++A DG +YVSEN 
Sbjct: 213 TFRYTVRRNLDDAQLKTGVAAAFGVVKGLAILSAGPLLELNFESLLAYAWDGTIYVSENL 272

Query: 708 FIFFFAAVGLDYCFKTRLLSPFPIKK 785
            IF FAAV LDYC KTRLLSPFPI K
Sbjct: 273 IIFVFAAVSLDYCLKTRLLSPFPIDK 298


>gb|AAO42239.1| unknown protein [Arabidopsis thaliana]
          Length = 308

 Score =  267 bits (683), Expect = 4e-69
 Identities = 147/254 (57%), Positives = 177/254 (69%), Gaps = 3/254 (1%)
 Frame = +3

Query: 24  DNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQEIEATQL 203
           D   LK+ LSG+V                 DGLEKAS RVEIAK+ELEDIERQEIEA  L
Sbjct: 57  DGGDLKNSLSGIVGNQVEELLSGEENKGLLDGLEKASLRVEIAKRELEDIERQEIEAKLL 116

Query: 204 RNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQESEQS---DID 374
           ++YIN LE RA+EIAECQ+EI  ARSMVEEAE SL+L        + +  SE+    D D
Sbjct: 117 QDYINQLESRAAEIAECQQEIDAARSMVEEAERSLSL-----ADNSTIGSSEKGYSIDKD 171

Query: 375 EERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFRYTIRRD 554
           +ER ES KAA ++A VGT+A LPF+ SQ  S  QL+LPL I F SCALFGVTFRY +RRD
Sbjct: 172 KERLESAKAAVIAAAVGTIAELPFALSQVASMEQLVLPLGIAFASCALFGVTFRYAVRRD 231

Query: 555 LDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIFFFAAVG 734
           LD+  LK+G  AAFGF+KGL +L  GPPLEL + S  SH +DGAV VS++  IF FA++G
Sbjct: 232 LDDNHLKSGAVAAFGFVKGLGMLSRGPPLELSWESLFSHGIDGAVLVSQSVLIFAFASIG 291

Query: 735 LDYCFKTRLLSPFP 776
           LD+CFK +LL PFP
Sbjct: 292 LDFCFKMKLLRPFP 305


>ref|NP_194140.2| uncharacterized protein [Arabidopsis thaliana]
           gi|53828627|gb|AAU94423.1| At4g24090 [Arabidopsis
           thaliana] gi|332659449|gb|AEE84849.1| uncharacterized
           protein AT4G24090 [Arabidopsis thaliana]
          Length = 308

 Score =  267 bits (683), Expect = 4e-69
 Identities = 147/254 (57%), Positives = 177/254 (69%), Gaps = 3/254 (1%)
 Frame = +3

Query: 24  DNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQEIEATQL 203
           D   LK+ LSG+V                 DGLEKAS RVEIAK+ELEDIERQEIEA  L
Sbjct: 57  DGGDLKNSLSGIVGNQVEELLSREENKGLLDGLEKASLRVEIAKRELEDIERQEIEAKLL 116

Query: 204 RNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQESEQS---DID 374
           ++YIN LE RA+EIAECQ+EI  ARSMVEEAE SL+L        + +  SE+    D D
Sbjct: 117 QDYINQLESRAAEIAECQQEIDAARSMVEEAERSLSL-----ADNSTIGSSEKGYSIDKD 171

Query: 375 EERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFRYTIRRD 554
           +ER ES KAA ++A VGT+A LPF+ SQ  S  QL+LPL I F SCALFGVTFRY +RRD
Sbjct: 172 KERLESAKAAVIAAAVGTIAELPFALSQVASMEQLVLPLGIAFASCALFGVTFRYAVRRD 231

Query: 555 LDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIFFFAAVG 734
           LD+  LK+G  AAFGF+KGL +L  GPPLEL + S  SH +DGAV VS++  IF FA++G
Sbjct: 232 LDDNHLKSGAVAAFGFVKGLGMLSRGPPLELSWESLFSHGIDGAVLVSQSVLIFAFASIG 291

Query: 735 LDYCFKTRLLSPFP 776
           LD+CFK +LL PFP
Sbjct: 292 LDFCFKMKLLRPFP 305


>ref|XP_002869739.1| hypothetical protein ARALYDRAFT_492454 [Arabidopsis lyrata subsp.
           lyrata] gi|297315575|gb|EFH45998.1| hypothetical protein
           ARALYDRAFT_492454 [Arabidopsis lyrata subsp. lyrata]
          Length = 308

 Score =  267 bits (682), Expect = 5e-69
 Identities = 143/252 (56%), Positives = 179/252 (71%), Gaps = 1/252 (0%)
 Frame = +3

Query: 24  DNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDIERQEIEATQL 203
           D   LK  LSG+V                 DGLEKAS RVEIAK+ELE+IERQE EA  L
Sbjct: 57  DGGDLKKSLSGIVGNQVEELFSREENKNLLDGLEKASLRVEIAKRELEEIERQESEAKLL 116

Query: 204 RNYINHLEIRASEIAECQKEILEARSMVEEAESSLTL-NLDGFGSGNVLQESEQSDIDEE 380
           ++Y+N LE RA+EIAECQ+EI+ ARSMVEEAE +L+L + +  GS    ++    D D+E
Sbjct: 117 QDYVNQLESRAAEIAECQQEIVAARSMVEEAERALSLADTEAIGSS---EKGYSIDKDKE 173

Query: 381 RWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTFRYTIRRDLD 560
           R ES KAA ++A VGT+A LPF+ SQ +S  QL+LPL I F SCALFGVTFRY +RRDLD
Sbjct: 174 RLESAKAAVIAAAVGTIAELPFALSQVSSIEQLVLPLGIAFASCALFGVTFRYAVRRDLD 233

Query: 561 NVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFIFFFAAVGLD 740
           +  LK+G  AAFGF+KGL +L  GPPLEL + S  SH +DGA+ VS++  IF FA++GLD
Sbjct: 234 DSHLKSGAVAAFGFVKGLGMLSRGPPLELSWESLFSHGIDGAILVSQSVLIFAFASIGLD 293

Query: 741 YCFKTRLLSPFP 776
           +CFKT+LL PFP
Sbjct: 294 FCFKTKLLRPFP 305


>gb|EXB81854.1| hypothetical protein L484_015328 [Morus notabilis]
          Length = 311

 Score =  266 bits (679), Expect = 1e-68
 Identities = 149/264 (56%), Positives = 187/264 (70%), Gaps = 3/264 (1%)
 Frame = +3

Query: 3   KCSSE---QRDNEGLKDVLSGMVDKXXXXXXXXXXXXXXXDGLEKASRRVEIAKKELEDI 173
           KCS+    +  +  LKD L+ +V +               DGLE+AS RVE AK+EL +I
Sbjct: 52  KCSNNGGLREQSFNLKDSLNDVVGEQVQELLNREENKALLDGLERASLRVEKAKRELAEI 111

Query: 174 ERQEIEATQLRNYINHLEIRASEIAECQKEILEARSMVEEAESSLTLNLDGFGSGNVLQE 353
           ERQE+EA Q+R Y++ LE RASEIAECQKEI EAR+MVEEAE SL+ + +G  +G   + 
Sbjct: 112 ERQELEANQMREYVDQLERRASEIAECQKEISEARAMVEEAERSLSQSEEGSYAG---KG 168

Query: 354 SEQSDIDEERWESIKAASVSAIVGTLAGLPFSFSQGTSGVQLILPLAITFISCALFGVTF 533
           +E+ D DEER ESIKAAS+SA VGT+AGLP S +Q ++  QLILPLAITF SCALFG+TF
Sbjct: 169 NEEIDKDEERLESIKAASISAFVGTIAGLPISLTQVSTTSQLILPLAITFASCALFGITF 228

Query: 534 RYTIRRDLDNVQLKTGTSAAFGFIKGLSVLGEGPPLELDYASFLSHAVDGAVYVSENFFI 713
           RYTIRRDLD+V LKTG  AA G +KGL+ L  G PLEL+  S  SHA DGA++VS++ F+
Sbjct: 229 RYTIRRDLDDVHLKTGACAASGVVKGLATLSGGQPLELNTESISSHAFDGAIHVSQDLFV 288

Query: 714 FFFAAVGLDYCFKTRLLSPFPIKK 785
           F  AAVGLD+C       PFPIK+
Sbjct: 289 FVSAAVGLDHC-------PFPIKR 305

