BLASTX nr result

ID: Akebia24_contig00010755

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia24_contig00010755
         (1556 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002303644.2| hypothetical protein POPTR_0003s13920g [Popu...   240   1e-60
ref|XP_002270995.1| PREDICTED: uncharacterized protein LOC100254...   237   1e-59
ref|XP_002299479.1| hypothetical protein POPTR_0001s10550g [Popu...   236   2e-59
ref|XP_006476393.1| PREDICTED: uncharacterized protein LOC102612...   231   9e-58
ref|XP_007040503.1| Uncharacterized protein isoform 1 [Theobroma...   228   6e-57
ref|XP_004149749.1| PREDICTED: uncharacterized protein LOC101203...   227   1e-56
ref|XP_006439359.1| hypothetical protein CICLE_v10020669mg [Citr...   223   2e-55
ref|XP_002509953.1| conserved hypothetical protein [Ricinus comm...   210   1e-51
ref|XP_004298896.1| PREDICTED: uncharacterized protein LOC101312...   199   3e-48
gb|EXB66083.1| hypothetical protein L484_003884 [Morus notabilis...   181   8e-43
gb|EYU23365.1| hypothetical protein MIMGU_mgv1a009979mg [Mimulus...   177   9e-42
ref|XP_006343865.1| PREDICTED: uncharacterized protein LOC102589...   172   4e-40
ref|XP_004245519.1| PREDICTED: uncharacterized protein LOC101257...   166   3e-38
ref|XP_006850554.1| hypothetical protein AMTR_s00159p00083590 [A...   159   2e-36
ref|XP_002887891.1| hypothetical protein ARALYDRAFT_474911 [Arab...   158   7e-36
ref|XP_006302456.1| hypothetical protein CARUB_v10020548mg [Caps...   157   2e-35
ref|XP_007209413.1| hypothetical protein PRUPE_ppa009291mg [Prun...   156   3e-35
ref|XP_006391622.1| hypothetical protein EUTSA_v10023582mg [Eutr...   155   5e-35
ref|NP_683468.1| uncharacterized protein [Arabidopsis thaliana] ...   149   3e-33
ref|XP_007040504.1| Uncharacterized protein isoform 2 [Theobroma...   147   1e-32

>ref|XP_002303644.2| hypothetical protein POPTR_0003s13920g [Populus trichocarpa]
            gi|550343126|gb|EEE78623.2| hypothetical protein
            POPTR_0003s13920g [Populus trichocarpa]
          Length = 373

 Score =  240 bits (612), Expect = 1e-60
 Identities = 156/376 (41%), Positives = 200/376 (53%), Gaps = 5/376 (1%)
 Frame = +1

Query: 301  LTAFLVLSLVVIFSFGAELEVKKTDLTKGID-QNNSSRLKDNSIVKPSEPKSVPKVDRVK 477
            L   LVL  VVI S     E   T L   +D   NSS+    S ++ +  +     D+ K
Sbjct: 9    LGLILVLLAVVICSLADSKESASTGLNPKVDVTTNSSKGAGGSNLETNSTED----DKGK 64

Query: 478  KGSGDQGGIKNDPLVXXXXXXXXXXXXXKVGTNKED---DSSKKSQVNKEDKTKLIDXXX 648
            +  G     K                  K   N ++   +SS++SQ  K D +K  D   
Sbjct: 65   EKGGQDDKSKESIADDVNKNKMNSQSGSKDNDNAKEGKHNSSEESQAKKGDHSKKEDSSS 124

Query: 649  XXXXXXXXXXXXXXXDSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPD 828
                           D+      +++EG   EEC  SNKCTDE+N+L+ACLRVPGNESPD
Sbjct: 125  GVESEDLSKEKNDKGDT-----QSRKEGPRVEECDQSNKCTDEENKLVACLRVPGNESPD 179

Query: 829  LSLLIQNKGTSPLTVKISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKN 1008
            LSLLIQNKG   L+V ISAPDFVHLEKT + L+EKEDKK+KVSI    ++N LIVL A N
Sbjct: 180  LSLLIQNKGKGSLSVTISAPDFVHLEKTKIQLKEKEDKKVKVSITSRGSEN-LIVLRAGN 238

Query: 1009 GDCSIDFGDLLLHNSGQKKIDYTAKST-YTNFLKKSPSIAYMXXXXXXXXXXXXMCARLW 1185
            G C +D  D + H  G K+ D + KST   NF+ ++ +I  +            MC    
Sbjct: 239  GQCKLDIKDTIAHYFG-KEFDKSHKSTDIINFMSRTSTIVVLSFAALLILASGWMCISFR 297

Query: 1186 RKHHCSDSTKYQKLETELPICGGGKTEPEVTXXXXXXXXXXXXXEEAPRTPSKPVTPSLS 1365
            RKH  ++++KYQ+LE ELP+ G GKTE E               EEAP+ PS PVTPSLS
Sbjct: 298  RKHPSNNTSKYQRLEMELPVSGEGKTESETNDGWDNSWGDDWDDEEAPKAPSLPVTPSLS 357

Query: 1366 SKGLASRRFPKDGWKD 1413
            SKGLASRR  K+ WKD
Sbjct: 358  SKGLASRRLSKEAWKD 373


>ref|XP_002270995.1| PREDICTED: uncharacterized protein LOC100254757 [Vitis vinifera]
            gi|297742326|emb|CBI34475.3| unnamed protein product
            [Vitis vinifera]
          Length = 381

 Score =  237 bits (604), Expect = 1e-59
 Identities = 157/391 (40%), Positives = 213/391 (54%), Gaps = 16/391 (4%)
 Frame = +1

Query: 289  NYSVLTAFLVLSLVVIFSFGAELEVKKTDLTKGIDQNNSSRLKDNSIVKPSEPKSVPKVD 468
            N+ +L  FL++ L V  S+GA+ EVKK     G+D   +      +I   +   S   +D
Sbjct: 4    NFVLLLGFLLVLLAVDSSYGADSEVKKLP-NSGLDPKKTVVSTHTNIPNETLSGSDSGLD 62

Query: 469  RVK----KGSGDQGGIKNDPLVXXXXXXXXXXXXXKVGTNKEDDSSK-------KSQVNK 615
             +K    K   DQ G+  + +              K+ + K+ DS +       K  ++K
Sbjct: 63   SLKAEQAKKDEDQVGVPKEGV---------ESTKEKISSIKQLDSKEADNEHTGKGSLSK 113

Query: 616  EDKTKLID---XXXXXXXXXXXXXXXXXXDSVL--SKPMNKEEGAWGEECHSSNKCTDEK 780
            E +T+  D                     + VL  SKP  K+E   GEEC  SN+C D+ 
Sbjct: 114  ELETEGGDNKKEKPGDGSKSKQASKEGGNEGVLESSKP-GKKESLQGEECDPSNQCVDDI 172

Query: 781  NELIACLRVPGNESPDLSLLIQNKGTSPLTVKISAPDFVHLEKTSVGLQEKEDKKLKVSI 960
            N+L+ACLRVPGN+SPDLSLLIQNKG + LTV ISAPDFV LE T + LQEKEDKK+KVSI
Sbjct: 173  NKLVACLRVPGNDSPDLSLLIQNKGKTALTVTISAPDFVKLESTKIELQEKEDKKVKVSI 232

Query: 961  GKDNNKNTLIVLTAKNGDCSIDFGDLLLHNSGQKKIDYTAKSTYTNFLKKSPSIAYMXXX 1140
                + N+ IVLTA  G CS+DF DL+     QK  D   +ST  NFL ++ S+A++   
Sbjct: 233  RNGGSDNS-IVLTAGKGRCSLDFKDLIA-QIAQKGTDNIPESTDGNFLTRTSSLAFLFLV 290

Query: 1141 XXXXXXXXXMCARLWRKHHCSDSTKYQKLETELPICGGGKTEPEVTXXXXXXXXXXXXXE 1320
                     +C    RK+  S  +KYQKL+ ELP+ GGGK E ++              E
Sbjct: 291  ALVAAASAWICISFKRKYFPSSGSKYQKLDMELPVSGGGKVEADINDGWDNSWGDTWDDE 350

Query: 1321 EAPRTPSKPVTPSLSSKGLASRRFPKDGWKD 1413
            EAP+TPS P+TPSLS++GLA+RR  K+GWKD
Sbjct: 351  EAPKTPSMPLTPSLSARGLAARRLSKEGWKD 381


>ref|XP_002299479.1| hypothetical protein POPTR_0001s10550g [Populus trichocarpa]
            gi|222846737|gb|EEE84284.1| hypothetical protein
            POPTR_0001s10550g [Populus trichocarpa]
          Length = 373

 Score =  236 bits (603), Expect = 2e-59
 Identities = 154/382 (40%), Positives = 195/382 (51%), Gaps = 11/382 (2%)
 Frame = +1

Query: 301  LTAFLVLSLVVIFSF-------GAELEVKKTDLTKGIDQNNSSRLKDNSIVKPSEPKSVP 459
            L   LVL  VV+ S        G  L+ K    T    +   S LK NS           
Sbjct: 9    LGLILVLLAVVVCSLADSKESAGTGLDPKSDATTNASKEAGGSNLKSNSTEDDKGKGKGG 68

Query: 460  KVDRVKKGSGDQ-GGIKNDPLVXXXXXXXXXXXXXKVGTNKEDD---SSKKSQVNKEDKT 627
            +VD+ K+   D    IK D                K   N ++D   SS++ Q  + D  
Sbjct: 69   QVDKSKEDKADDLNNIKMDS-----------QSGSKDNENAKEDKGNSSEEFQAKEGDHN 117

Query: 628  KLIDXXXXXXXXXXXXXXXXXXDSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLRV 807
            K                     D+      +++EG   EEC  SNKCTDE+N+L+ACLRV
Sbjct: 118  KKKGLSGGEESKDFPEEKNDERDT-----QSRKEGPHVEECDPSNKCTDEENKLVACLRV 172

Query: 808  PGNESPDLSLLIQNKGTSPLTVKISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTL 987
            PGNESPDLSLLIQNKG  PL V ISAPDFVHLEKT + LQEK++KK+KVSI    ++N L
Sbjct: 173  PGNESPDLSLLIQNKGKGPLNVTISAPDFVHLEKTKIQLQEKDNKKVKVSITGGGSEN-L 231

Query: 988  IVLTAKNGDCSIDFGDLLLHNSGQKKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXXXX 1167
            IVLTA  G C +D  D + H  G++       +   N + ++ +IA +            
Sbjct: 232  IVLTAGKGQCKLDIKDTIAHYLGKELHKSHESADIINSMSRTSTIAVLSFAALLILASGW 291

Query: 1168 MCARLWRKHHCSDSTKYQKLETELPICGGGKTEPEVTXXXXXXXXXXXXXEEAPRTPSKP 1347
            MC    RKH   ++ +YQ+LE ELP+ GGGKTE +               EEAP+TPS P
Sbjct: 292  MCISFRRKHLSYNNPRYQRLEMELPVSGGGKTESKTNDGWDNNWGDDWDDEEAPKTPSLP 351

Query: 1348 VTPSLSSKGLASRRFPKDGWKD 1413
            VTPSLSSKGLASRR  KDGWKD
Sbjct: 352  VTPSLSSKGLASRRLSKDGWKD 373


>ref|XP_006476393.1| PREDICTED: uncharacterized protein LOC102612566 isoform X1 [Citrus
            sinensis]
          Length = 372

 Score =  231 bits (588), Expect = 9e-58
 Identities = 149/391 (38%), Positives = 202/391 (51%), Gaps = 24/391 (6%)
 Frame = +1

Query: 313  LVLSLVVIFSFGAELEVKKTDLTKGIDQN-------NSSRLKDNSIVKPSEPKSV----- 456
            L   LV++ +  +  +      + G+D N       N +    N +   S+ K+V     
Sbjct: 9    LAFFLVLLVNGCSAADKTNFSASSGLDPNLIGSRSSNDTTGGSNLVTNSSQTKNVNGNRG 68

Query: 457  PKVDRVKKGSGDQGGIKNDPLVXXXXXXXXXXXXXKVGTNKEDDSSKKSQVNKEDKTKLI 636
             +V++  KG+ D+ GI  +                    +K  D+ +K  V  + K +L 
Sbjct: 69   DQVNKSVKGADDKNGINKNNTFHPLG-------------SKNADNVQKGNVVPKGKKELS 115

Query: 637  DXXXXXXXXXXXXXXXXXXDSVLSKPMNKE------------EGAWGEECHSSNKCTDEK 780
            D                  D V SK ++KE            EG   EECHSSNKC DEK
Sbjct: 116  DRKDNLS------------DEVKSKDVSKEGGPDEDSGKSRKEGTRVEECHSSNKCMDEK 163

Query: 781  NELIACLRVPGNESPDLSLLIQNKGTSPLTVKISAPDFVHLEKTSVGLQEKEDKKLKVSI 960
             + +ACLRVPGN+SPDLSLLIQNK   PLTV+ISAPD+V LEKT V L+E E  +L+VSI
Sbjct: 164  MQFVACLRVPGNDSPDLSLLIQNKVKGPLTVRISAPDYVRLEKTKVQLRENEGNELRVSI 223

Query: 961  GKDNNKNTLIVLTAKNGDCSIDFGDLLLHNSGQKKIDYTAKSTYTNFLKKSPSIAYMXXX 1140
             +    N LI + A NG+CS+DF DL+ HNSG+   D + KSTY  FL K P++ ++   
Sbjct: 224  RRKGTVN-LITIKAGNGNCSLDFKDLMAHNSGE-DFDNSLKSTYFKFLSKKPTVPFISFA 281

Query: 1141 XXXXXXXXXMCARLWRKHHCSDSTKYQKLETELPICGGGKTEPEVTXXXXXXXXXXXXXE 1320
                     +C  L  K   S  +KYQ+L+ E+P+   G +E +               E
Sbjct: 282  ALLILASGCLCVSLRCKQLSSGKSKYQRLDMEVPVASLGNSESDNNHGWDNSWDDNWDDE 341

Query: 1321 EAPRTPSKPVTPSLSSKGLASRRFPKDGWKD 1413
            EAP+TPS PVTPSLSSKGLASRR  K+GWKD
Sbjct: 342  EAPKTPSLPVTPSLSSKGLASRRLSKEGWKD 372


>ref|XP_007040503.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508777748|gb|EOY25004.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 443

 Score =  228 bits (581), Expect = 6e-57
 Identities = 122/240 (50%), Positives = 146/240 (60%)
 Frame = +1

Query: 694  DSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTV 873
            DSV   P  + +G  GEEC  SN C D+     ACLRVPGNESPDLSLLIQNKG  PLT+
Sbjct: 214  DSVPPPPPTRTDGFRGEECDPSNMCMDKNERFAACLRVPGNESPDLSLLIQNKGKGPLTI 273

Query: 874  KISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDLLLHNS 1053
            KISAP FV LE+T V LQEK+DKK+KVSI KD+    LIVL    G+CS+DF DL++HNS
Sbjct: 274  KISAPAFVQLEETDVELQEKQDKKVKVSI-KDSGTGNLIVLKDGRGECSLDFKDLIVHNS 332

Query: 1054 GQKKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXXXXMCARLWRKHHCSDSTKYQKLET 1233
             +         +Y NFL ++P+   +            MC    R+       KYQ+L+ 
Sbjct: 333  AE---------SYVNFLSQTPTTTLIFVAAILILASGWMCMSFKRRQLARSGLKYQRLDM 383

Query: 1234 ELPICGGGKTEPEVTXXXXXXXXXXXXXEEAPRTPSKPVTPSLSSKGLASRRFPKDGWKD 1413
            ELP+  G KTEP+V              EEAP TP  PVTPSLSSKGLASRR  K+GWKD
Sbjct: 384  ELPVSAGAKTEPDVNDGWDNSWGNNWEDEEAPMTPLMPVTPSLSSKGLASRRLSKEGWKD 443


>ref|XP_004149749.1| PREDICTED: uncharacterized protein LOC101203513 [Cucumis sativus]
          Length = 376

 Score =  227 bits (578), Expect = 1e-56
 Identities = 149/351 (42%), Positives = 191/351 (54%), Gaps = 5/351 (1%)
 Frame = +1

Query: 376  LTKGIDQNNS-SRLKD-NSIVKPSEPKSVPKVDRVKKG-SGDQGGIKNDPLVXXXXXXXX 546
            + KG D N      KD NS+    E KS  +V   K+G    +  IK DP          
Sbjct: 40   VNKGNDANKDPGPNKDLNSVSAGKEKKSEQQVSVSKEGVKNREDKIKKDP---------E 90

Query: 547  XXXXXKVGTNK--EDDSSKKSQVNKEDKTKLIDXXXXXXXXXXXXXXXXXXDSVLSKPMN 720
                 K G +K  +DD   +   NK DK K                     +S +S    
Sbjct: 91   SETVSKEGADKVKKDDGLGEEGRNKGDKVK--GKPVDNSVSKDGSKSSGKGESTVSSASK 148

Query: 721  KEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTVKISAPDFVH 900
            + +G+ GE+C SSNKCTDE  +L+ACLRVPGN+SP L LLIQNKG  PLT KISAPDFVH
Sbjct: 149  RNDGSSGEDCDSSNKCTDEAKKLVACLRVPGNDSPQLLLLIQNKGKGPLTAKISAPDFVH 208

Query: 901  LEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDLLLHNSGQKKIDYTA 1080
            LEK+ V LQE+E+KK+KVSIG   + NT IVLT+  G CS+DF DL+ H++  K  D   
Sbjct: 209  LEKSEVQLQERENKKVKVSIGDGGDGNT-IVLTSGGGRCSLDFRDLVAHHNA-KDSDNVP 266

Query: 1081 KSTYTNFLKKSPSIAYMXXXXXXXXXXXXMCARLWRKHHCSDSTKYQKLETELPICGGGK 1260
            KS++ ++L K   IA +            +   + RK+  S ++KYQ+L+ ELP+  GGK
Sbjct: 267  KSSWFSYLTKPHVIAILAFGVILTIAAVSVIISIRRKNFVSSNSKYQRLDMELPVSLGGK 326

Query: 1261 TEPEVTXXXXXXXXXXXXXEEAPRTPSKPVTPSLSSKGLASRRFPKDGWKD 1413
               +               +E P TPS PVTPSLSSKGLASRR  KDGWKD
Sbjct: 327  AVAD-NNDGWENSWDDNWDDETPHTPSLPVTPSLSSKGLASRRLNKDGWKD 376


>ref|XP_006439359.1| hypothetical protein CICLE_v10020669mg [Citrus clementina]
            gi|567893744|ref|XP_006439360.1| hypothetical protein
            CICLE_v10020669mg [Citrus clementina]
            gi|557541621|gb|ESR52599.1| hypothetical protein
            CICLE_v10020669mg [Citrus clementina]
            gi|557541622|gb|ESR52600.1| hypothetical protein
            CICLE_v10020669mg [Citrus clementina]
          Length = 372

 Score =  223 bits (568), Expect = 2e-55
 Identities = 144/385 (37%), Positives = 201/385 (52%), Gaps = 15/385 (3%)
 Frame = +1

Query: 304  TAFLVLSLVVIFSFGAELEVKKTDLT--KGIDQN-NSSRLKD------NSIVKPSEPKSV 456
            T+  +L+  ++          KT+ +   G+D N N SR  +      N +   S+ K+V
Sbjct: 4    TSIFLLAFFLVLLVNGSSAADKTNFSASSGLDPNLNGSRSSNDTTGGSNLVTNSSQTKNV 63

Query: 457  -----PKVDRVKKGSGDQGGI-KNDPLVXXXXXXXXXXXXXKVGTNKEDDSSKKSQVNKE 618
                  +V++  +G+ D+  + KN+                 +G+    +  K + V K 
Sbjct: 64   NGNRGDQVNKSVEGTDDKNRVDKNNTF-------------HPLGSKNAKNVQKGNSVPKG 110

Query: 619  DKTKLIDXXXXXXXXXXXXXXXXXXDSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIAC 798
             K +L D                  D       +++EG   EECHSSNKC DEK + +AC
Sbjct: 111  QK-ELSDRKDNLSDEVKSKDASKEGDPDEDSGKSRKEGTRVEECHSSNKCMDEKMQFVAC 169

Query: 799  LRVPGNESPDLSLLIQNKGTSPLTVKISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNK 978
            LRVPGN+SPDLSLLIQNK   PLTV+ISAPD+V LEKT V L+E E  +L+VSI +    
Sbjct: 170  LRVPGNDSPDLSLLIQNKVKGPLTVRISAPDYVRLEKTKVQLRENEGNELRVSIRRKGTV 229

Query: 979  NTLIVLTAKNGDCSIDFGDLLLHNSGQKKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXX 1158
            N LI + A NG+C +DF DL+ HNSG+   D + KSTY  FL K P++  +         
Sbjct: 230  N-LITIKAGNGNCRLDFKDLMAHNSGE-DFDNSLKSTYFKFLSKKPTVPVITFAALLILA 287

Query: 1159 XXXMCARLWRKHHCSDSTKYQKLETELPICGGGKTEPEVTXXXXXXXXXXXXXEEAPRTP 1338
               +C  L  +   S  +KYQ+L+ E+P+   G +E +               EEAP+TP
Sbjct: 288  SGCLCVSLRCRQLSSGKSKYQRLDMEVPVASLGNSESDNNHGWDNSWDDNWDDEEAPKTP 347

Query: 1339 SKPVTPSLSSKGLASRRFPKDGWKD 1413
            S PVTPSLSSKGLASRR  K+GWKD
Sbjct: 348  SLPVTPSLSSKGLASRRLSKEGWKD 372


>ref|XP_002509953.1| conserved hypothetical protein [Ricinus communis]
            gi|223549852|gb|EEF51340.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 372

 Score =  210 bits (535), Expect = 1e-51
 Identities = 139/383 (36%), Positives = 190/383 (49%), Gaps = 8/383 (2%)
 Frame = +1

Query: 289  NYSVLTAFLVLSLVV---IFSFGAELEVKKTDLTKGIDQNNS-----SRLKDNSIVKPSE 444
            N ++    +VL LVV   I  F   +  K    +     +N      S   D++ V   +
Sbjct: 4    NRALYLGLIVLLLVVDCSILDFKVNVSAKTDSQSNSTKDSNDQGGELSSFSDSNGVNKEK 63

Query: 445  PKSVPKVDRVKKGSGDQGGIKNDPLVXXXXXXXXXXXXXKVGTNKEDDSSKKSQVNKEDK 624
             +   +VD +K+  G  G +KN+                    N  D +S+      ++ 
Sbjct: 64   KRKENQVDDLKEKIG--GDMKNNKNNLSSQSGSKKDDMKTNNINGNDLNSQSESKKTDNS 121

Query: 625  TKLIDXXXXXXXXXXXXXXXXXXDSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLR 804
             + ++                  DS L+      + +  EEC  SNKCTDE+N+L+ACLR
Sbjct: 122  ERKVEDDDSKKKTIPKENNINQGDSGLAS-----KDSHVEECDPSNKCTDEENQLVACLR 176

Query: 805  VPGNESPDLSLLIQNKGTSPLTVKISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNT 984
            VPGN+    SLL+QNKG +PLTV ISAPD+VH+EKT + LQ KEDKK+ VSI    N N 
Sbjct: 177  VPGNDQ--YSLLVQNKGKNPLTVTISAPDYVHIEKTEIQLQSKEDKKVPVSIRHGGNDN- 233

Query: 985  LIVLTAKNGDCSIDFGDLLLHNSGQKKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXXX 1164
            LIVL   NG C++D   L+  N     +D + KS Y N++ ++P IA +           
Sbjct: 234  LIVLRTGNGRCNLDIKHLVTENF----LDISQKSGYINYMSRTPVIAVLAFAALLILAAG 289

Query: 1165 XMCARLWRKHHCSDSTKYQKLETELPICGGGKTEPEVTXXXXXXXXXXXXXEEAPRTPSK 1344
              C    RK   S  +KYQ+L+ ELP+  G K E E               EEAP+TPS 
Sbjct: 290  WTCISFRRKQLSSSGSKYQRLDMELPVSTGEKAESEQNDGWDDKWGDDWDDEEAPKTPSL 349

Query: 1345 PVTPSLSSKGLASRRFPKDGWKD 1413
            PVTPSLSSKGLASRR  K+GWKD
Sbjct: 350  PVTPSLSSKGLASRRLSKEGWKD 372


>ref|XP_004298896.1| PREDICTED: uncharacterized protein LOC101312440 [Fragaria vesca
            subsp. vesca]
          Length = 372

 Score =  199 bits (506), Expect = 3e-48
 Identities = 142/385 (36%), Positives = 201/385 (52%), Gaps = 9/385 (2%)
 Frame = +1

Query: 286  KNYSVLTAFLVLSLVVIFSFGAELEVKKTDLTKGIDQNNSSRLKDNSIVKPSEPKSVP-- 459
            K +++L   L+L L++  S GA+L+V++   T  +D   SS  + ++     + K V   
Sbjct: 3    KRFALLVGVLLLQLMIHCS-GADLKVEEGAKTV-VDPKVSSTSEGSNSSDDKKQKVVTNL 60

Query: 460  -----KVDRVKKGSGDQGGIKNDPLVXXXXXXXXXXXXXKVGTNKEDDSSKKSQVNKEDK 624
                 +V  VKK   DQGG  N+ +              K G++ E  S++   V K +K
Sbjct: 61   VSDGNEVQEVKKDK-DQGGGSNNGV---------GKSKEKTGSDGEVGSTETHSVAKGEK 110

Query: 625  TKLIDXXXXXXXXXXXXXXXXXXDSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLR 804
                                   ++    P+ +E+G   EEC S+N CT ++N+L+ACLR
Sbjct: 111  GSNDGKNGKSSEESKAMAREEVGNAGNVNPV-REDGTPREECGSANMCTVKENKLVACLR 169

Query: 805  VPGNE-SPDLSLLIQNKGTSPLTVKISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKN 981
            VPG++ SP LSLLIQNKG  PL V ISAP+FV L+KT V L+EK++ K+ VS+G      
Sbjct: 170  VPGDDDSPHLSLLIQNKGKDPLVVTISAPEFVRLDKTKVQLKEKDNAKVDVSVG-SGGAT 228

Query: 982  TLIVLTAKNGDCSIDFGDLLLHNSGQKKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXX 1161
            ++IVL A NG+CS+DF DL+ H+S QK+ D ++ +TY       P+I  +          
Sbjct: 229  SIIVLKAGNGNCSLDFKDLITHSS-QKEPDNSSNTTYLFLWTHRPAIGILLVALLMILVF 287

Query: 1162 XXMCARLWRKHHCSDSTKYQKL-ETELPICGGGKTEPEVTXXXXXXXXXXXXXEEAPRTP 1338
              M  R  +K   S   KYQKL +  LP+    K E  +              EEAP TP
Sbjct: 288  AGMYVRFMKKRVSSSGFKYQKLDDVHLPVLSSEKPELHINDGWDDTWDDKWDDEEAPHTP 347

Query: 1339 SKPVTPSLSSKGLASRRFPKDGWKD 1413
            S PVTPSLS KGLASRR  K+GWKD
Sbjct: 348  SMPVTPSLSGKGLASRRLNKEGWKD 372


>gb|EXB66083.1| hypothetical protein L484_003884 [Morus notabilis]
            gi|587991190|gb|EXC75508.1| hypothetical protein
            L484_000430 [Morus notabilis]
          Length = 474

 Score =  181 bits (459), Expect = 8e-43
 Identities = 108/240 (45%), Positives = 144/240 (60%)
 Frame = +1

Query: 694  DSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTV 873
            D V S P  K+EG+ G+EC+SS +CTD++ ++IACLRVPGNESP LSLLIQNKG   +TV
Sbjct: 245  DGVTSDP-EKKEGSSGDECYSSIRCTDQEKKMIACLRVPGNESPHLSLLIQNKGNDSITV 303

Query: 874  KISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDLLLHNS 1053
             ISAPDFVHL+ T+V + +KE+KK++VSIG +   ++LI LT+ N  C +DF DL+  +S
Sbjct: 304  NISAPDFVHLDTTTVRIGKKENKKVEVSIG-NGGTDSLINLTSGNRVCILDFKDLITQSS 362

Query: 1054 GQKKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXXXXMCARLWRKHHCSDSTKYQKLET 1233
                   +    Y N   + P+IA++            M     RK   S+   YQK++ 
Sbjct: 363  -------SPNFKYLNLPARRPTIAFLSFSALLIMVSAWMFLSFRRKKLLSNGYAYQKVDM 415

Query: 1234 ELPICGGGKTEPEVTXXXXXXXXXXXXXEEAPRTPSKPVTPSLSSKGLASRRFPKDGWKD 1413
             L +  G K   +               EEAPRTPSKP +PSLSSK LASRR  K+ WKD
Sbjct: 416  GLLVSSGIKQRLKDNDGWDENWGDDWNDEEAPRTPSKP-SPSLSSKRLASRRLSKETWKD 474


>gb|EYU23365.1| hypothetical protein MIMGU_mgv1a009979mg [Mimulus guttatus]
          Length = 325

 Score =  177 bits (450), Expect = 9e-42
 Identities = 100/227 (44%), Positives = 137/227 (60%), Gaps = 3/227 (1%)
 Frame = +1

Query: 742  EECHSS-NKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTVKISAPDFVHLEKTSV 918
            E+C SS N+CTD+    +ACLRVPGNESP LSLLIQN G   L++ ISAPD V LEK  +
Sbjct: 103  EKCDSSSNRCTDDDKTFVACLRVPGNESPALSLLIQNMGKGSLSINISAPDLVQLEKNQI 162

Query: 919  GLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDLLLHNSGQKKIDYTAKSTYTN 1098
             L+EK+D ++KVSI    N + +I+LTA +G+CS++  D LL   G+ KID++ +    N
Sbjct: 163  ELEEKKDTEVKVSITGIENGH-IIILTAGHGNCSLNIRDQLL---GKNKIDHSNEPPKPN 218

Query: 1099 FLKKSPSIAYM--XXXXXXXXXXXXMCARLWRKHHCSDSTKYQKLETELPICGGGKTEPE 1272
                + S A++              +C +L  K+      KYQKL+ +LP+  G + EP 
Sbjct: 219  IFNPTLSTAFLLIVAALLIVALSVFVCTKLGIKYFARKVPKYQKLDMDLPVSHGSRIEPG 278

Query: 1273 VTXXXXXXXXXXXXXEEAPRTPSKPVTPSLSSKGLASRRFPKDGWKD 1413
                           EEAP+TPS P+TPSLSSKG+ASR+F K+ WKD
Sbjct: 279  EIKGWDDSWDDSWDDEEAPKTPSLPLTPSLSSKGMASRKFSKESWKD 325


>ref|XP_006343865.1| PREDICTED: uncharacterized protein LOC102589846 [Solanum tuberosum]
          Length = 395

 Score =  172 bits (436), Expect = 4e-40
 Identities = 102/231 (44%), Positives = 131/231 (56%)
 Frame = +1

Query: 721  KEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTVKISAPDFVH 900
            ++E   GEEC SS  CT E+  L+ACLRVPGNESPDLSLL+QNKG    ++ I AP FV 
Sbjct: 179  RKESFHGEECDSSYSCTIEEKALVACLRVPGNESPDLSLLVQNKGKDTASISIMAPKFVK 238

Query: 901  LEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDLLLHNSGQKKIDYTA 1080
            LE   + LQ KE+KK+KVSIG   N N +I+L A +G CS+DF  L         ID   
Sbjct: 239  LEHNEIELQGKENKKMKVSIGNGGNDN-IIILKAGDGQCSLDFRGL---------IDNAD 288

Query: 1081 KSTYTNFLKKSPSIAYMXXXXXXXXXXXXMCARLWRKHHCSDSTKYQKLETELPICGGGK 1260
            K++  N++   PS   M            +  +  R+   S+   YQKL+  LP+  GGK
Sbjct: 289  KTSQFNYV--LPSFGIMCLVAIALVATILLYIK--RRLLVSNGHTYQKLDNALPVSSGGK 344

Query: 1261 TEPEVTXXXXXXXXXXXXXEEAPRTPSKPVTPSLSSKGLASRRFPKDGWKD 1413
             E   T             EEAP+ PS PVTPSLSSK +++RR  K+GWKD
Sbjct: 345  VETLSTDGWDNNWDDNWDDEEAPKAPSLPVTPSLSSKIISARRSSKEGWKD 395


>ref|XP_004245519.1| PREDICTED: uncharacterized protein LOC101257691 [Solanum
            lycopersicum]
          Length = 391

 Score =  166 bits (420), Expect = 3e-38
 Identities = 100/231 (43%), Positives = 127/231 (54%)
 Frame = +1

Query: 721  KEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTVKISAPDFVH 900
            ++E   GEEC SS  CT E+  L+ACLRVPGNESPDLSLL+QNKG    ++ I AP FV 
Sbjct: 175  RKESFHGEECDSSYSCTIEEKALVACLRVPGNESPDLSLLVQNKGKDTASISIKAPKFVT 234

Query: 901  LEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDLLLHNSGQKKIDYTA 1080
            LE   + LQ KE+KK+KVSIG   N N +I L   +G CS+DF  L         ID   
Sbjct: 235  LEHNEIELQGKENKKMKVSIGNGGNDN-IITLKVGDGQCSLDFRGL---------IDSAE 284

Query: 1081 KSTYTNFLKKSPSIAYMXXXXXXXXXXXXMCARLWRKHHCSDSTKYQKLETELPICGGGK 1260
            K++  N+    PS   M            +  +  R+   S+   YQKL+  LP+  GGK
Sbjct: 285  KTSQFNY--ALPSFGIMCLVAIALVATILLYIK--RRLLVSNGHMYQKLDNALPVSSGGK 340

Query: 1261 TEPEVTXXXXXXXXXXXXXEEAPRTPSKPVTPSLSSKGLASRRFPKDGWKD 1413
             E   T             EEAP+ PS PVTPSLSSK +++R   K+GWKD
Sbjct: 341  VETLSTDGWDNNWDDNWDDEEAPKAPSLPVTPSLSSKIISARWSSKEGWKD 391


>ref|XP_006850554.1| hypothetical protein AMTR_s00159p00083590 [Amborella trichopoda]
            gi|548854205|gb|ERN12135.1| hypothetical protein
            AMTR_s00159p00083590 [Amborella trichopoda]
          Length = 417

 Score =  159 bits (403), Expect = 2e-36
 Identities = 118/378 (31%), Positives = 181/378 (47%), Gaps = 12/378 (3%)
 Frame = +1

Query: 304  TAFLVLSLVVIFSFGAELEVKKTDLTKGIDQNNSSR--------LKDNSIVKPSEPKSVP 459
            T  L  S  ++ S GAEL +  +   +G+ + +           +K+ ++ K S   +  
Sbjct: 65   TEVLSHSSALLNSSGAELVMNHSSQGEGVKEKDDKESKQEIHFHVKNGTLEKESSEMAYG 124

Query: 460  KVDRVKKGSGDQGGIKNDPLVXXXXXXXXXXXXXKVGTNKEDDSSKKSQVNKEDKTKLID 639
                VK    D+  + N+                +   N++++ S+K +V K   +    
Sbjct: 125  HTSHVKNEKLDKTNVPNEESNPENMTVEGSKGNPQKEGNEKENLSEKPKVQKGVPSS--- 181

Query: 640  XXXXXXXXXXXXXXXXXXDSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLRVPGNE 819
                                  SKP  K++    EEC +SN+C DEK +L+ACLRVPGNE
Sbjct: 182  ----------------------SKPARKDKYG-AEECDASNQCMDEKKKLVACLRVPGNE 218

Query: 820  SPDLSLLIQNKGTSPLTVKISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLT 999
            SP+LSLLIQN G   LT+ I AP+FV LE+  V L++++D+++KVSIG  NN N+ IVLT
Sbjct: 219  SPELSLLIQNIGNETLTINIMAPNFVRLEQNIVQLKKQDDREVKVSIGISNNDNSAIVLT 278

Query: 1000 AKNGDCSIDFGDLLLHNSGQ----KKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXXXX 1167
               G C +D   ++L  S +    +++ Y    T T  +     ++ +            
Sbjct: 279  TGKGRCILDLRGVVLPESSKPTLFQRLTYRTIGTRTTVI----YLSVLSSMLLFIGGTWF 334

Query: 1168 MCARLWRKHHCSDSTKYQKLETELPICGGGKTEPEVTXXXXXXXXXXXXXEEAPRTPSKP 1347
             C +L          KYQ++ET+LPI G GK  P++              EEAPRTPS+P
Sbjct: 335  CCNKL-----RPGGVKYQEVETDLPISGPGK--PDLEVGWDEGWGDGWEDEEAPRTPSRP 387

Query: 1348 VTPSLSSKGLASRRFPKD 1401
            +  SLS+ GL +RR  KD
Sbjct: 388  L-QSLSASGLITRRAGKD 404


>ref|XP_002887891.1| hypothetical protein ARALYDRAFT_474911 [Arabidopsis lyrata subsp.
            lyrata] gi|297333732|gb|EFH64150.1| hypothetical protein
            ARALYDRAFT_474911 [Arabidopsis lyrata subsp. lyrata]
          Length = 342

 Score =  158 bits (399), Expect = 7e-36
 Identities = 95/251 (37%), Positives = 132/251 (52%), Gaps = 11/251 (4%)
 Frame = +1

Query: 694  DSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTV 873
            +  +S    K+EG  GEEC  SN CTD+++E  ACLRVPGN++P LSLLIQNKG  PL V
Sbjct: 93   EDAMSDSSRKKEGFHGEECDPSNMCTDDQHEFAACLRVPGNDAPHLSLLIQNKGKRPLIV 152

Query: 874  KISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDL-LLHN 1050
             I+AP FV LEK  V L + ED K+KVSI K  + ++ IVL +  G CS++  DL   H 
Sbjct: 153  TITAPGFVRLEKDKVQLLQNEDTKVKVSIKKGGSNDSAIVLASSKGRCSLELKDLAAAHE 212

Query: 1051 SGQKKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXXXXMCARLWRKHHCSDSTKYQKLE 1230
            +        ++ +      ++  +  M            +   ++ K+    + KYQ+L+
Sbjct: 213  TESDDTVSVSRPSILYISSRTLIVIIMISFLVLSLVIIPVIIHVY-KNKSRGNNKYQRLD 271

Query: 1231 TELPICGGG---KTEPEV-------TXXXXXXXXXXXXXEEAPRTPSKPVTPSLSSKGLA 1380
             ELP+       K++ E                      EE P TP  P+TPSLSS+GLA
Sbjct: 272  MELPVSNPALVTKSDQESGDDGWNNNWGDDWDDENGGGDEEQPNTPVLPLTPSLSSRGLA 331

Query: 1381 SRRFPKDGWKD 1413
             RR  K+GWKD
Sbjct: 332  PRRLSKEGWKD 342


>ref|XP_006302456.1| hypothetical protein CARUB_v10020548mg [Capsella rubella]
            gi|482571166|gb|EOA35354.1| hypothetical protein
            CARUB_v10020548mg [Capsella rubella]
          Length = 354

 Score =  157 bits (396), Expect = 2e-35
 Identities = 95/248 (38%), Positives = 132/248 (53%), Gaps = 12/248 (4%)
 Frame = +1

Query: 706  SKPMNKEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTVKISA 885
            S    K++G  GEEC  SN CTD+++E +ACLRVPGN++P LSLLIQNKG   L V I+A
Sbjct: 109  SNSSRKKQGFHGEECDPSNMCTDQEDEFVACLRVPGNDAPHLSLLIQNKGKRALLVTITA 168

Query: 886  PDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDLLLHNSGQKK 1065
            P FV LEK  V L + ED K+KVSI K  + ++ IVLT+  G CS++  DL    + + +
Sbjct: 169  PGFVRLEKNKVQLLQNEDTKVKVSIKKGGSNDSAIVLTSSKGRCSLELKDLAA--AQETE 226

Query: 1066 IDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXXXXMCARLWR--KHHCSDSTKYQKLETEL 1239
             D T   +  + L   P    +            +   ++   K+    + KYQ+L+ EL
Sbjct: 227  SDDTVSVSRPSILNIHPRTLIVILMISFLVLSLVIIPVIYHVYKNKSRGNNKYQRLDMEL 286

Query: 1240 PICGG---GKTEPEV-------TXXXXXXXXXXXXXEEAPRTPSKPVTPSLSSKGLASRR 1389
            P+       K++ E                      EE P TP  P+TPS+SS+GLA RR
Sbjct: 287  PVSNPALVAKSDKESGDEGWNNNWGDDWDDENGDGDEEQPNTPVLPLTPSVSSRGLAPRR 346

Query: 1390 FPKDGWKD 1413
              K+GWKD
Sbjct: 347  LSKEGWKD 354


>ref|XP_007209413.1| hypothetical protein PRUPE_ppa009291mg [Prunus persica]
            gi|462405148|gb|EMJ10612.1| hypothetical protein
            PRUPE_ppa009291mg [Prunus persica]
          Length = 298

 Score =  156 bits (394), Expect = 3e-35
 Identities = 89/195 (45%), Positives = 120/195 (61%), Gaps = 2/195 (1%)
 Frame = +1

Query: 700  VLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTVKI 879
            V+  P+ KE G   EEC   N+CT E+++L+ACLRVPGN+SP LSLLIQNKG  PL V I
Sbjct: 30   VIVNPVRKE-GPGTEECDPVNRCTAEESKLVACLRVPGNDSPHLSLLIQNKGKGPLLVTI 88

Query: 880  SAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDLLLHNSGQ 1059
             APDFV LE+T + L+EKE+KK+KVS+G +    + IVL A  G C +D  DL+ H+S +
Sbjct: 89   VAPDFVALEETKIQLEEKENKKVKVSVG-NGGTGSSIVLKAGKGHCDLDLKDLITHSS-R 146

Query: 1060 KKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXXXXMCARLWRKHHCSDSTKYQKLETEL 1239
            K+ + ++  TYTNFL + P+I  +            MC     +   S+  KYQKL+ +L
Sbjct: 147  KEPENSSNLTYTNFLTQRPTIVIVFFASLLILAAAWMCISFRHRRLSSNGFKYQKLDEDL 206

Query: 1240 PICGG--GKTEPEVT 1278
            P   G  G   P VT
Sbjct: 207  PKNRGCYGHVRPSVT 221


>ref|XP_006391622.1| hypothetical protein EUTSA_v10023582mg [Eutrema salsugineum]
            gi|567126687|ref|XP_006391623.1| hypothetical protein
            EUTSA_v10023582mg [Eutrema salsugineum]
            gi|557088128|gb|ESQ28908.1| hypothetical protein
            EUTSA_v10023582mg [Eutrema salsugineum]
            gi|557088129|gb|ESQ28909.1| hypothetical protein
            EUTSA_v10023582mg [Eutrema salsugineum]
          Length = 336

 Score =  155 bits (392), Expect = 5e-35
 Identities = 93/252 (36%), Positives = 130/252 (51%), Gaps = 12/252 (4%)
 Frame = +1

Query: 694  DSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTV 873
            D  +S    K++G  GEEC  S  CTDE++  +ACLRVPGN++P LSLLIQN G   L V
Sbjct: 85   DEAMSNSSRKKQGFHGEECDPSYMCTDEEDHFVACLRVPGNDAPHLSLLIQNIGKDALLV 144

Query: 874  KISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDL-LLHN 1050
             I+AP FV LEK  V L E ED K+KVSI K  + ++ I+L +  G CS++  DL   H 
Sbjct: 145  TITAPGFVGLEKNKVELLENEDTKVKVSIKKGGSNDSAIILASFKGHCSLELKDLAAAHE 204

Query: 1051 SGQKKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXXXXMCARLWRKHHCSDSTKYQKLE 1230
            +G +     ++ +  N   ++  I  +                   ++    + KYQ+L+
Sbjct: 205  TGNEDTAVVSRPSILNIRPRTLIIIIIIISFLVVSLVIIPMIIHVYRNKAKGNNKYQRLD 264

Query: 1231 TELPICG----GGKTEPEV-------TXXXXXXXXXXXXXEEAPRTPSKPVTPSLSSKGL 1377
             ELP+        K++ E                      EE P TP  P+TPS+SS+GL
Sbjct: 265  MELPVSNNTDLASKSDLEAGDDGWNNNWGDDWDEENGDGDEEQPNTPVLPLTPSVSSRGL 324

Query: 1378 ASRRFPKDGWKD 1413
            ASRR  K+GWKD
Sbjct: 325  ASRRLSKEGWKD 336


>ref|NP_683468.1| uncharacterized protein [Arabidopsis thaliana]
            gi|27311781|gb|AAO00856.1| Unknown protein [Arabidopsis
            thaliana] gi|30984576|gb|AAP42751.1| At1g64385
            [Arabidopsis thaliana] gi|110742365|dbj|BAE99105.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|332196114|gb|AEE34235.1| uncharacterized protein
            AT1G64385 [Arabidopsis thaliana]
          Length = 351

 Score =  149 bits (377), Expect = 3e-33
 Identities = 92/252 (36%), Positives = 130/252 (51%), Gaps = 12/252 (4%)
 Frame = +1

Query: 694  DSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTV 873
            ++V      K++G  GEEC  SN C D+++E  ACLRVPGN++P LSLLIQNKG   L V
Sbjct: 101  EAVSKNSSRKKQGFHGEECDPSNMCIDDEHEFSACLRVPGNDAPHLSLLIQNKGKRALIV 160

Query: 874  KISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDL--LLH 1047
             I+AP FV LEK  V L + ED K+KVSI K  + ++ IVL +  G C ++  DL    H
Sbjct: 161  TITAPVFVRLEKDKVQLLQNEDIKVKVSIKKGGSNDSAIVLASSKGRCRLELKDLAAAAH 220

Query: 1048 NSGQKKIDYTAKSTYTNFLKKSPSIAYMXXXXXXXXXXXXMCARLWRKHHCSDSTKYQKL 1227
             +        ++ +  N   ++  +  M            +   ++ K+    + KYQ+L
Sbjct: 221  ETESDDTVSVSRPSILNISSRTLIVIIMISFLVLSLVIIPVIIHVY-KNKSRGNNKYQRL 279

Query: 1228 ETELPICGGG---KTEPEV-------TXXXXXXXXXXXXXEEAPRTPSKPVTPSLSSKGL 1377
            + ELP+       K++ E                      EE P TP  P+TPSLSS+GL
Sbjct: 280  DMELPVSNPALVTKSDQESGDDGWNNNWGDDWDDENGGGDEEQPNTPVLPLTPSLSSRGL 339

Query: 1378 ASRRFPKDGWKD 1413
            A RR  K+GWKD
Sbjct: 340  APRRLSKEGWKD 351


>ref|XP_007040504.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508777749|gb|EOY25005.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 340

 Score =  147 bits (372), Expect = 1e-32
 Identities = 75/125 (60%), Positives = 89/125 (71%)
 Frame = +1

Query: 694  DSVLSKPMNKEEGAWGEECHSSNKCTDEKNELIACLRVPGNESPDLSLLIQNKGTSPLTV 873
            DSV   P  + +G  GEEC  SN C D+     ACLRVPGNESPDLSLLIQNKG  PLT+
Sbjct: 214  DSVPPPPPTRTDGFRGEECDPSNMCMDKNERFAACLRVPGNESPDLSLLIQNKGKGPLTI 273

Query: 874  KISAPDFVHLEKTSVGLQEKEDKKLKVSIGKDNNKNTLIVLTAKNGDCSIDFGDLLLHNS 1053
            KISAP FV LE+T V LQEK+DKK+KVSI KD+    LIVL    G+CS+DF DL++HNS
Sbjct: 274  KISAPAFVQLEETDVELQEKQDKKVKVSI-KDSGTGNLIVLKDGRGECSLDFKDLIVHNS 332

Query: 1054 GQKKI 1068
             +  +
Sbjct: 333  AESYV 337

