BLASTX nr result

ID: Mentha24_contig00037754 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Mentha24_contig00037754
         (1297 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU45904.1| hypothetical protein MIMGU_mgv1a000216mg [Mimulus...   468   e-129
gb|EYU21289.1| hypothetical protein MIMGU_mgv1a000325mg [Mimulus...   461   e-127
gb|EPS68902.1| hypothetical protein M569_05867, partial [Genlise...   427   e-117
gb|EXC01337.1| ABC transporter B family member 19 [Morus notabilis]   406   e-110
ref|XP_002271999.1| PREDICTED: uncharacterized protein LOC100251...   405   e-110
ref|XP_007217656.1| hypothetical protein PRUPE_ppa000250mg [Prun...   404   e-110
ref|XP_007024720.1| Uncharacterized protein isoform 6 [Theobroma...   404   e-110
ref|XP_007024719.1| Uncharacterized protein isoform 5 [Theobroma...   404   e-110
ref|XP_007024718.1| Uncharacterized protein isoform 4 [Theobroma...   404   e-110
ref|XP_007024717.1| Uncharacterized protein isoform 3 [Theobroma...   404   e-110
ref|XP_007024715.1| Uncharacterized protein isoform 1 [Theobroma...   404   e-110
ref|XP_004305768.1| PREDICTED: uncharacterized protein LOC101291...   404   e-110
emb|CBI35826.3| unnamed protein product [Vitis vinifera]              386   e-104
ref|XP_004141819.1| PREDICTED: uncharacterized protein LOC101213...   382   e-103
ref|XP_006362089.1| PREDICTED: uncharacterized protein LOC102584...   380   e-103
ref|XP_006465839.1| PREDICTED: uncharacterized protein LOC102629...   379   e-102
ref|XP_006465838.1| PREDICTED: uncharacterized protein LOC102629...   379   e-102
ref|XP_007135400.1| hypothetical protein PHAVU_010G126300g [Phas...   369   2e-99
ref|XP_006426753.1| hypothetical protein CICLE_v10024713mg [Citr...   367   8e-99
ref|XP_006583177.1| PREDICTED: dentin sialophosphoprotein-like i...   366   1e-98

>gb|EYU45904.1| hypothetical protein MIMGU_mgv1a000216mg [Mimulus guttatus]
          Length = 1420

 Score =  468 bits (1205), Expect = e-129
 Identities = 255/411 (62%), Positives = 298/411 (72%), Gaps = 21/411 (5%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD+ LD+A FQLSP+HSRCEL+VSSGGSTEKLASGLLKPFV HL++ EE+VAS + S+
Sbjct: 1    MKSDSTLDYAEFQLSPKHSRCELFVSSGGSTEKLASGLLKPFVAHLQIAEERVASASLSV 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+ G++KNAE WFTKGTL+RFVRFVSTPEVLELV+T DAEMSQLEAARRIYSQG GDQ 
Sbjct: 61   KLEVGKNKNAETWFTKGTLERFVRFVSTPEVLELVSTLDAEMSQLEAARRIYSQGAGDQL 120

Query: 853  SGGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMFADR 674
            SGG GSG TAA+DATKKELLRAIDVRLVA +QDLS            ADTV ELQMFADR
Sbjct: 121  SGGGGSGATAADDATKKELLRAIDVRLVAVRQDLSTACARAAAAGFNADTVSELQMFADR 180

Query: 673  FGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI-------XXXXXXX 515
            FGAHRLNEAC K+ISL ER P LI+  KSG +DRA+RSS  SDMSI              
Sbjct: 181  FGAHRLNEACSKFISLSERGPELIHPRKSGHEDRAVRSSYGSDMSIDDDPTSPPPDPETA 240

Query: 514  XXXXXXXXPTTFPLRRSFTMVSSVEREGENKPENSTGESDKKEETSPPEQTSSIQASQPG 335
                    P TFPLRR+F+  SSV+RE  NK  ++  E D+K+E+S P+Q+  I ASQP 
Sbjct: 241  TYQQPNPPPVTFPLRRTFSRESSVDREDGNKTNDTVPEKDRKDESSSPDQSVPISASQPA 300

Query: 334  RRLSVQDRVKLFENKQKENSGEKPVVVKPVELRRLSSDVSMMGAAAEKAVLRRWSGVSDM 155
            RRLSVQDR+ +FENKQK+ SG KPVVVK VELRR+SSD+S      EK VLRRWSG SDM
Sbjct: 301  RRLSVQDRISMFENKQKDTSGGKPVVVKAVELRRMSSDLSSSSTVVEKGVLRRWSGASDM 360

Query: 154  SIDLTAEKKESVN--------------NVMESNSDTTKSSSMIKPDSNATP 44
            SIDL+AEKK++ +               V+  N D  + SS+ KP+    P
Sbjct: 361  SIDLSAEKKDTESPSCTPTSAVVSQDKKVLRLNDDNAEISSVSKPEIKVIP 411


>gb|EYU21289.1| hypothetical protein MIMGU_mgv1a000325mg [Mimulus guttatus]
          Length = 1255

 Score =  461 bits (1185), Expect = e-127
 Identities = 261/435 (60%), Positives = 295/435 (67%), Gaps = 45/435 (10%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MK DAPLDFAVFQLSP+ SRCEL+VS GGSTEKLASGL+KPF+ HLKV EEQVASNAQS+
Sbjct: 1    MKMDAPLDFAVFQLSPKRSRCELFVSRGGSTEKLASGLVKPFIAHLKVAEEQVASNAQSV 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+ GR +N E WFTKGTL+RFVRFVSTPE+LELVNTFDAEMSQLEAARRIYSQG GDQ 
Sbjct: 61   KLEIGRRRNGEAWFTKGTLERFVRFVSTPEILELVNTFDAEMSQLEAARRIYSQGAGDQL 120

Query: 853  S----------------------------GGNGSGLTAAEDATKKELLRAIDVRLVAAQQ 758
            S                            GG+GSG  AA+DATKKELLRAID+RL A QQ
Sbjct: 121  SGMSSDYYLFNLFLTKLNYNYNAYNSIGPGGSGSGAKAADDATKKELLRAIDLRLAAVQQ 180

Query: 757  DLSNXXXXXXXXXXXADTVPELQMFADRFGAHRLNEACGKYISLCERRPNLINQWKSGSD 578
            DLS             DTV ELQMFADRFGAHRLNEACGK+ISL ERRPNLINQWK G +
Sbjct: 181  DLSATCARADAAGFNVDTVSELQMFADRFGAHRLNEACGKFISLSERRPNLINQWKPGPE 240

Query: 577  DRALRSSCTSDMSI------XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVER--EGENK 422
            DRALRSSC SDMSI                      TTFP RR F+  SSVE   +G+NK
Sbjct: 241  DRALRSSCGSDMSIDDDSLPTRHDSATCQPSDPPPATTFPSRRPFSRESSVEEKDDGDNK 300

Query: 421  PENSTGESDKKEETSPPEQTSSIQASQPGRRLSVQDRVKLFENKQKENSGEKPVV--VKP 248
              ++ GE + K++       + +QAS   RRLSVQDR+ LFENKQKENSG KPVV   KP
Sbjct: 301  WNDAFGEKETKDD-------APVQASHHARRLSVQDRISLFENKQKENSGGKPVVPPAKP 353

Query: 247  VELRRLSSDVSMMGAAAEKAVLRRWSGVSDMSIDLTAEKKES-------VNNVMESNSDT 89
            VELRRLSSDVS MG+AA   VLRRWSG SDMS+DL  EKK++        N  +  N   
Sbjct: 354  VELRRLSSDVSAMGSAAAAVVLRRWSGASDMSLDLGVEKKDAEIPAVSQENKGLNLNDGI 413

Query: 88   TKSSSMIKPDSNATP 44
             K+SS++K +    P
Sbjct: 414  VKNSSVVKTEIKVIP 428


>gb|EPS68902.1| hypothetical protein M569_05867, partial [Genlisea aurea]
          Length = 406

 Score =  427 bits (1098), Expect = e-117
 Identities = 231/410 (56%), Positives = 292/410 (71%), Gaps = 23/410 (5%)
 Frame = -1

Query: 1216 KMKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQS 1037
            +M +DAPLD+AV QL+PR SRC+L+VSSGGSTEK+ASGLLKPFV HLK  EEQ+AS AQS
Sbjct: 2    RMNADAPLDYAVLQLTPRRSRCDLFVSSGGSTEKIASGLLKPFVAHLKFAEEQIASTAQS 61

Query: 1036 INLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQ 857
            + L+ GR KN E WFTKGTL+RFV FVSTPE+LELVNT+DAEM+QLEAAR+IYSQG GD 
Sbjct: 62   VKLEVGRRKNDEEWFTKGTLERFVHFVSTPEILELVNTYDAEMTQLEAARKIYSQGAGDH 121

Query: 856  HSGGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMFAD 677
             S       TAA+DATKKELLRAIDVRLVA QQDLS             ++V EL+MFAD
Sbjct: 122  PSES-----TAADDATKKELLRAIDVRLVAVQQDLSAATARSAAAGFNLESVSELRMFAD 176

Query: 676  RFGAHRLNEACGKYISLCERRPNLINQWKS-GSDDRALRSSCTSDMSI--------XXXX 524
            +FGAHRLN+ACGK++SL ERRP+LI QW+S G+++RA+RSS  SDMSI            
Sbjct: 177  KFGAHRLNDACGKFLSLSERRPHLIGQWRSCGNEERAVRSSYGSDMSIDSEPPSSPALQE 236

Query: 523  XXXXXXXXXXXPTTFPLRRSFTMVSSVEREGENKPENSTGESDKKEETSPPEQTSSIQAS 344
                       P  FPLRR+ +  S V  +   + ++++GE DKK+ ++  +QT SIQ S
Sbjct: 237  SVSVQHPSTSQPLLFPLRRTVSGSSCVVSDVAVRQDSASGEEDKKDGSATSDQTESIQVS 296

Query: 343  QPGRRLSVQDRVKLFENKQKENSGEKPVVVKPVELRRLSSDVSMMGAAAEKAVLRRWSGV 164
            QP RRLSVQDR+ +FE+KQKENSG KP++ K VELRR+SSDVS +G   EK VLRRWSG 
Sbjct: 297  QPTRRLSVQDRINMFESKQKENSGGKPILTKSVELRRMSSDVSTVGLPPEKGVLRRWSGA 356

Query: 163  SDMSIDLTAEKKESVN--------------NVMESNSDTTKSSSMIKPDS 56
            SDMSIDL++EK+++ +               ++  N D  ++ S +KP++
Sbjct: 357  SDMSIDLSSEKRDAESPLCTPSSVAVSQEAKIVSQNDDALENLSDLKPET 406


>gb|EXC01337.1| ABC transporter B family member 19 [Morus notabilis]
          Length = 2625

 Score =  406 bits (1043), Expect = e-110
 Identities = 242/429 (56%), Positives = 275/429 (64%), Gaps = 31/429 (7%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD  LD+AVFQLSP+ SRCEL VSSGG TEKLASG +KPF+THLKV EEQVA   QSI
Sbjct: 1    MKSDTLLDYAVFQLSPKRSRCELLVSSGGYTEKLASGSVKPFLTHLKVAEEQVALAVQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  + KNAE WFTKGTL+RFVRFVSTPEVLELVNTFDAE+SQLEAAR+IYSQ   +  
Sbjct: 61   KLESEKSKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAELSQLEAARKIYSQNNNEIF 120

Query: 853  ----SGGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQM 686
                SGGNG+G+TAA DATKKELLRAIDVRL A +QDL+             DT+ +LQ+
Sbjct: 121  ICFTSGGNGAGITAAADATKKELLRAIDVRLTAVRQDLTTAYARASAAGFNPDTISDLQV 180

Query: 685  FADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI---------- 536
            FADRFGAHRLNE C K+ SLC+RRP+LINQWK   DD A+RSS  SDMSI          
Sbjct: 181  FADRFGAHRLNEVCAKFTSLCQRRPDLINQWKPSVDDGAVRSSYGSDMSIDDPTEDPSGP 240

Query: 535  ------------XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVEREGENKPENSTGESDK 392
                                       PT+FP  R+    +  E E  N+      E +K
Sbjct: 241  HHRPQNKREQQPEQSRLSTCQQPNSLIPTSFPTLRNVNGKNDAEEESPNE----ASEKEK 296

Query: 391  KEETSPPEQTSSIQASQPGRRLSVQDRVKLFENKQKE----NSGEKPVVVKPVELRRLSS 224
            KEE+    ++SS  A  P RRLSVQDR+ LFENKQKE     SG KPVV K VELRRLSS
Sbjct: 297  KEESQTESRSSSTLAGPPARRLSVQDRINLFENKQKEQSSAGSGGKPVVGKSVELRRLSS 356

Query: 223  DVSMMGAAAEKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKS-SSMIKPDSNAT 47
            DVS      EKAVLRRWSGVSDMSIDL+AEK        ES   T  S SS+    SN  
Sbjct: 357  DVSSAAVGVEKAVLRRWSGVSDMSIDLSAEKD------TESPLCTPSSVSSVSHAKSNNV 410

Query: 46   PSESSKVKD 20
                S+ KD
Sbjct: 411  TGGGSEGKD 419


>ref|XP_002271999.1| PREDICTED: uncharacterized protein LOC100251482 [Vitis vinifera]
          Length = 1409

 Score =  405 bits (1042), Expect = e-110
 Identities = 241/411 (58%), Positives = 285/411 (69%), Gaps = 18/411 (4%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD  LD+AVFQLSP+ SRCEL+VS  G+TEKLASGL+KPFVTHLKVVEEQVA   QSI
Sbjct: 1    MKSDGALDYAVFQLSPKRSRCELFVSRDGNTEKLASGLVKPFVTHLKVVEEQVALAVQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGD-- 860
             L+  + KNA++WFTKGTL+RFVRFVSTPEVLELVNTFDAE+SQLEAAR IYSQG+GD  
Sbjct: 61   KLEVEKYKNADLWFTKGTLERFVRFVSTPEVLELVNTFDAEVSQLEAARTIYSQGVGDPV 120

Query: 859  -QHSGGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
               SGG+ +G  AA DATKKELLRAIDVRLVA +QDL+             +TV ELQ+F
Sbjct: 121  SSASGGDVTGSVAAADATKKELLRAIDVRLVAVRQDLTMACSRASAAGFNPETVAELQIF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLIN--QWKSGSDDRALRSSCTSDMSI---XXXXXX 518
            +DRFGAHRL+EAC K+ SLC+RRP+LI+   WK G+DDRA+RSS  SDMSI         
Sbjct: 181  SDRFGAHRLSEACSKFFSLCQRRPDLISTATWKGGADDRAVRSSSGSDMSIDEPPENKQP 240

Query: 517  XXXXXXXXXPTTFPLRRSFTMVSSVEREGENKPENSTGESDKKEETSPPEQT---SSIQA 347
                     P+T    +S T+     R    K +   G+   ++ET  P +T   SSIQ 
Sbjct: 241  AAQEPDVPKPSTCQPTKSTTLNFPGRRSLGEKEKEKEGDGGPEKETPTPTETSSASSIQG 300

Query: 346  SQPGRRLSVQDRVKLFENKQKEN----SGEKPVVVKPVELRRLSSDVSMMGAAAEKAVLR 179
            SQP RRLSVQDR+ LFENKQKE+    SG K VV K VELRRLSSDVS   A  EKAVLR
Sbjct: 301  SQPARRLSVQDRINLFENKQKESSTSGSGGKVVVGKSVELRRLSSDVSSAPAVVEKAVLR 360

Query: 178  RWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDS---NATPSES 35
            RWSG SDMSIDL+ EKK++     ES   T  +SS+ +  S    ATP+ +
Sbjct: 361  RWSGASDMSIDLSFEKKDT-----ESPLCTPSTSSLPQTKSLTDTATPNSA 406


>ref|XP_007217656.1| hypothetical protein PRUPE_ppa000250mg [Prunus persica]
            gi|462413806|gb|EMJ18855.1| hypothetical protein
            PRUPE_ppa000250mg [Prunus persica]
          Length = 1402

 Score =  404 bits (1039), Expect = e-110
 Identities = 235/413 (56%), Positives = 283/413 (68%), Gaps = 34/413 (8%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD PLD+AVFQLSP+HSRCEL+VSS G+TEKLASG +KPFVTHLKV EEQVA   QSI
Sbjct: 1    MKSDTPLDYAVFQLSPKHSRCELFVSSNGNTEKLASGSVKPFVTHLKVAEEQVALAVQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  + K AE WFTKGTL+RFVRFVSTPEVLELVNTFDAEMSQLEAA RIYSQGMG QH
Sbjct: 61   KLEVEKRKYAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAWRIYSQGMGGQH 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            +   GG G+G+TAA DATKKELLRAIDVRLVA +QDL+             DTV +L++F
Sbjct: 121  AGALGGGGTGITAAADATKKELLRAIDVRLVAVRQDLTTACARASAAGFNPDTVSQLKLF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSIXXXXXXXXXXX 503
            AD+FGAH LNEAC K+ISLC+RR ++IN WK   DDRA+RSSC SDMSI           
Sbjct: 181  ADQFGAHCLNEACTKFISLCQRRSDVINPWKPSVDDRAVRSSCESDMSI----------D 230

Query: 502  XXXXPTTFPLRRSFTMVSSVEREGENKPENST-------------------GESDKKEET 380
                 T+ P  +  +   + + + E+   +ST                    E D+ E+ 
Sbjct: 231  DPTEDTSGPHVKPHSQPQNKQEKLEDPSRHSTCQHPTSLNTNFPTQQCKNVTEKDRDEDK 290

Query: 379  S-------PPEQTSSIQASQPGRRLSVQDRVKLFENKQKE----NSGEKPVVV-KPVELR 236
            +       P  +++ +  SQP RRLSVQDR+ LFENKQKE    +SG KPVVV KPVELR
Sbjct: 291  ARVEKKDEPQTESTPLGVSQPARRLSVQDRISLFENKQKESSSSSSGGKPVVVAKPVELR 350

Query: 235  RLSSDVSMMGAAAEKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSS 77
            RLSSDVS     +  AVLRRWSG SDMSIDL+AEKKE+ +++   +S ++ SS
Sbjct: 351  RLSSDVS-----SAPAVLRRWSGASDMSIDLSAEKKETESSLCTPSSVSSVSS 398


>ref|XP_007024720.1| Uncharacterized protein isoform 6 [Theobroma cacao]
            gi|508780086|gb|EOY27342.1| Uncharacterized protein
            isoform 6 [Theobroma cacao]
          Length = 1415

 Score =  404 bits (1038), Expect = e-110
 Identities = 240/433 (55%), Positives = 286/433 (66%), Gaps = 30/433 (6%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD  LD+AVFQLSP+ SRCEL+VSS G+TEKLASGL+KPFVTHLKV EEQVA + QSI
Sbjct: 1    MKSDTLLDYAVFQLSPKRSRCELFVSSNGNTEKLASGLVKPFVTHLKVAEEQVALSIQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  + KNAE WFTKGTL+RFVRFVSTPEVLELVNTFDAEMSQLEAA+RIYSQG+GDQ 
Sbjct: 61   KLEIEKRKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAQRIYSQGVGDQP 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            S   GG+G+G+TAA DATKKELLRAIDVRL+  QQDL+            +DTV ELQ F
Sbjct: 121  SGALGGDGAGMTAAADATKKELLRAIDVRLITVQQDLATAFARASAAGFNSDTVSELQQF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI----------- 536
            ADRFGAHRL+EAC K+ISLC+RRP LI+ WK G DD+ +R+S  SDMSI           
Sbjct: 181  ADRFGAHRLHEACTKFISLCQRRPELISPWKPGVDDQVVRASWGSDMSIDDPNEDQIGSH 240

Query: 535  --------XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVEREG---ENKPENSTGESDKK 389
                                    T   + +S   +S   +     + + +N   E +KK
Sbjct: 241  VNSRSHQPPQNKHQEQQLQPNATQTQHHIDQSKPAISQQPKPSITTQQRSQNENKEEEKK 300

Query: 388  EETSPPEQTSSIQASQPGRRLSVQDRVKLFENKQKE--NSGEKPVVV-KPVELRRLSSDV 218
            +E     ++S  Q SQP RRLSVQDR+ LFENKQKE  +SG KP+ V K VELRRLSS+V
Sbjct: 301  DE--GVTESSPSQVSQPARRLSVQDRINLFENKQKESSSSGGKPIAVGKSVELRRLSSEV 358

Query: 217  SMMGAAAEKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDSNATP-- 44
            S   A  EKAVLRRWSG SDMSIDL  +KK+      +S   T  SSS  +  SN     
Sbjct: 359  SSAPAVVEKAVLRRWSGASDMSIDLGNDKKDGST---DSPLCTPSSSSASQGKSNVFQGL 415

Query: 43   SESSKVKDGSSMA 5
            SE  + KD   ++
Sbjct: 416  SEDKEQKDEKGLS 428


>ref|XP_007024719.1| Uncharacterized protein isoform 5 [Theobroma cacao]
            gi|508780085|gb|EOY27341.1| Uncharacterized protein
            isoform 5 [Theobroma cacao]
          Length = 1444

 Score =  404 bits (1038), Expect = e-110
 Identities = 240/433 (55%), Positives = 286/433 (66%), Gaps = 30/433 (6%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD  LD+AVFQLSP+ SRCEL+VSS G+TEKLASGL+KPFVTHLKV EEQVA + QSI
Sbjct: 1    MKSDTLLDYAVFQLSPKRSRCELFVSSNGNTEKLASGLVKPFVTHLKVAEEQVALSIQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  + KNAE WFTKGTL+RFVRFVSTPEVLELVNTFDAEMSQLEAA+RIYSQG+GDQ 
Sbjct: 61   KLEIEKRKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAQRIYSQGVGDQP 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            S   GG+G+G+TAA DATKKELLRAIDVRL+  QQDL+            +DTV ELQ F
Sbjct: 121  SGALGGDGAGMTAAADATKKELLRAIDVRLITVQQDLATAFARASAAGFNSDTVSELQQF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI----------- 536
            ADRFGAHRL+EAC K+ISLC+RRP LI+ WK G DD+ +R+S  SDMSI           
Sbjct: 181  ADRFGAHRLHEACTKFISLCQRRPELISPWKPGVDDQVVRASWGSDMSIDDPNEDQIGSH 240

Query: 535  --------XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVEREG---ENKPENSTGESDKK 389
                                    T   + +S   +S   +     + + +N   E +KK
Sbjct: 241  VNSRSHQPPQNKHQEQQLQPNATQTQHHIDQSKPAISQQPKPSITTQQRSQNENKEEEKK 300

Query: 388  EETSPPEQTSSIQASQPGRRLSVQDRVKLFENKQKE--NSGEKPVVV-KPVELRRLSSDV 218
            +E     ++S  Q SQP RRLSVQDR+ LFENKQKE  +SG KP+ V K VELRRLSS+V
Sbjct: 301  DE--GVTESSPSQVSQPARRLSVQDRINLFENKQKESSSSGGKPIAVGKSVELRRLSSEV 358

Query: 217  SMMGAAAEKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDSNATP-- 44
            S   A  EKAVLRRWSG SDMSIDL  +KK+      +S   T  SSS  +  SN     
Sbjct: 359  SSAPAVVEKAVLRRWSGASDMSIDLGNDKKDGST---DSPLCTPSSSSASQGKSNVFQGL 415

Query: 43   SESSKVKDGSSMA 5
            SE  + KD   ++
Sbjct: 416  SEDKEQKDEKGLS 428


>ref|XP_007024718.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508780084|gb|EOY27340.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 1400

 Score =  404 bits (1038), Expect = e-110
 Identities = 240/433 (55%), Positives = 286/433 (66%), Gaps = 30/433 (6%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD  LD+AVFQLSP+ SRCEL+VSS G+TEKLASGL+KPFVTHLKV EEQVA + QSI
Sbjct: 1    MKSDTLLDYAVFQLSPKRSRCELFVSSNGNTEKLASGLVKPFVTHLKVAEEQVALSIQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  + KNAE WFTKGTL+RFVRFVSTPEVLELVNTFDAEMSQLEAA+RIYSQG+GDQ 
Sbjct: 61   KLEIEKRKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAQRIYSQGVGDQP 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            S   GG+G+G+TAA DATKKELLRAIDVRL+  QQDL+            +DTV ELQ F
Sbjct: 121  SGALGGDGAGMTAAADATKKELLRAIDVRLITVQQDLATAFARASAAGFNSDTVSELQQF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI----------- 536
            ADRFGAHRL+EAC K+ISLC+RRP LI+ WK G DD+ +R+S  SDMSI           
Sbjct: 181  ADRFGAHRLHEACTKFISLCQRRPELISPWKPGVDDQVVRASWGSDMSIDDPNEDQIGSH 240

Query: 535  --------XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVEREG---ENKPENSTGESDKK 389
                                    T   + +S   +S   +     + + +N   E +KK
Sbjct: 241  VNSRSHQPPQNKHQEQQLQPNATQTQHHIDQSKPAISQQPKPSITTQQRSQNENKEEEKK 300

Query: 388  EETSPPEQTSSIQASQPGRRLSVQDRVKLFENKQKE--NSGEKPVVV-KPVELRRLSSDV 218
            +E     ++S  Q SQP RRLSVQDR+ LFENKQKE  +SG KP+ V K VELRRLSS+V
Sbjct: 301  DE--GVTESSPSQVSQPARRLSVQDRINLFENKQKESSSSGGKPIAVGKSVELRRLSSEV 358

Query: 217  SMMGAAAEKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDSNATP-- 44
            S   A  EKAVLRRWSG SDMSIDL  +KK+      +S   T  SSS  +  SN     
Sbjct: 359  SSAPAVVEKAVLRRWSGASDMSIDLGNDKKDGST---DSPLCTPSSSSASQGKSNVFQGL 415

Query: 43   SESSKVKDGSSMA 5
            SE  + KD   ++
Sbjct: 416  SEDKEQKDEKGLS 428


>ref|XP_007024717.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508780083|gb|EOY27339.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 1431

 Score =  404 bits (1038), Expect = e-110
 Identities = 240/433 (55%), Positives = 286/433 (66%), Gaps = 30/433 (6%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD  LD+AVFQLSP+ SRCEL+VSS G+TEKLASGL+KPFVTHLKV EEQVA + QSI
Sbjct: 1    MKSDTLLDYAVFQLSPKRSRCELFVSSNGNTEKLASGLVKPFVTHLKVAEEQVALSIQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  + KNAE WFTKGTL+RFVRFVSTPEVLELVNTFDAEMSQLEAA+RIYSQG+GDQ 
Sbjct: 61   KLEIEKRKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAQRIYSQGVGDQP 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            S   GG+G+G+TAA DATKKELLRAIDVRL+  QQDL+            +DTV ELQ F
Sbjct: 121  SGALGGDGAGMTAAADATKKELLRAIDVRLITVQQDLATAFARASAAGFNSDTVSELQQF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI----------- 536
            ADRFGAHRL+EAC K+ISLC+RRP LI+ WK G DD+ +R+S  SDMSI           
Sbjct: 181  ADRFGAHRLHEACTKFISLCQRRPELISPWKPGVDDQVVRASWGSDMSIDDPNEDQIGSH 240

Query: 535  --------XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVEREG---ENKPENSTGESDKK 389
                                    T   + +S   +S   +     + + +N   E +KK
Sbjct: 241  VNSRSHQPPQNKHQEQQLQPNATQTQHHIDQSKPAISQQPKPSITTQQRSQNENKEEEKK 300

Query: 388  EETSPPEQTSSIQASQPGRRLSVQDRVKLFENKQKE--NSGEKPVVV-KPVELRRLSSDV 218
            +E     ++S  Q SQP RRLSVQDR+ LFENKQKE  +SG KP+ V K VELRRLSS+V
Sbjct: 301  DE--GVTESSPSQVSQPARRLSVQDRINLFENKQKESSSSGGKPIAVGKSVELRRLSSEV 358

Query: 217  SMMGAAAEKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDSNATP-- 44
            S   A  EKAVLRRWSG SDMSIDL  +KK+      +S   T  SSS  +  SN     
Sbjct: 359  SSAPAVVEKAVLRRWSGASDMSIDLGNDKKDGST---DSPLCTPSSSSASQGKSNVFQGL 415

Query: 43   SESSKVKDGSSMA 5
            SE  + KD   ++
Sbjct: 416  SEDKEQKDEKGLS 428


>ref|XP_007024715.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|590621133|ref|XP_007024716.1| Uncharacterized protein
            isoform 1 [Theobroma cacao] gi|508780081|gb|EOY27337.1|
            Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508780082|gb|EOY27338.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 1428

 Score =  404 bits (1038), Expect = e-110
 Identities = 240/433 (55%), Positives = 286/433 (66%), Gaps = 30/433 (6%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD  LD+AVFQLSP+ SRCEL+VSS G+TEKLASGL+KPFVTHLKV EEQVA + QSI
Sbjct: 1    MKSDTLLDYAVFQLSPKRSRCELFVSSNGNTEKLASGLVKPFVTHLKVAEEQVALSIQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  + KNAE WFTKGTL+RFVRFVSTPEVLELVNTFDAEMSQLEAA+RIYSQG+GDQ 
Sbjct: 61   KLEIEKRKNAETWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLEAAQRIYSQGVGDQP 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            S   GG+G+G+TAA DATKKELLRAIDVRL+  QQDL+            +DTV ELQ F
Sbjct: 121  SGALGGDGAGMTAAADATKKELLRAIDVRLITVQQDLATAFARASAAGFNSDTVSELQQF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI----------- 536
            ADRFGAHRL+EAC K+ISLC+RRP LI+ WK G DD+ +R+S  SDMSI           
Sbjct: 181  ADRFGAHRLHEACTKFISLCQRRPELISPWKPGVDDQVVRASWGSDMSIDDPNEDQIGSH 240

Query: 535  --------XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVEREG---ENKPENSTGESDKK 389
                                    T   + +S   +S   +     + + +N   E +KK
Sbjct: 241  VNSRSHQPPQNKHQEQQLQPNATQTQHHIDQSKPAISQQPKPSITTQQRSQNENKEEEKK 300

Query: 388  EETSPPEQTSSIQASQPGRRLSVQDRVKLFENKQKE--NSGEKPVVV-KPVELRRLSSDV 218
            +E     ++S  Q SQP RRLSVQDR+ LFENKQKE  +SG KP+ V K VELRRLSS+V
Sbjct: 301  DE--GVTESSPSQVSQPARRLSVQDRINLFENKQKESSSSGGKPIAVGKSVELRRLSSEV 358

Query: 217  SMMGAAAEKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDSNATP-- 44
            S   A  EKAVLRRWSG SDMSIDL  +KK+      +S   T  SSS  +  SN     
Sbjct: 359  SSAPAVVEKAVLRRWSGASDMSIDLGNDKKDGST---DSPLCTPSSSSASQGKSNVFQGL 415

Query: 43   SESSKVKDGSSMA 5
            SE  + KD   ++
Sbjct: 416  SEDKEQKDEKGLS 428


>ref|XP_004305768.1| PREDICTED: uncharacterized protein LOC101291165 [Fragaria vesca
            subsp. vesca]
          Length = 1344

 Score =  404 bits (1037), Expect = e-110
 Identities = 226/387 (58%), Positives = 269/387 (69%), Gaps = 7/387 (1%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKS+ PLD+AVFQLSP+HSRCELYVSS G+TEKLASG +KPFVTHLKV EEQVA   QSI
Sbjct: 1    MKSETPLDYAVFQLSPKHSRCELYVSSNGNTEKLASGSIKPFVTHLKVAEEQVALAVQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  + K+AE WFTKGTL+RFVRFVSTPEVLELVNTFDAEMSQLE+ARRIYSQGMG Q 
Sbjct: 61   KLEVEKRKHAEKWFTKGTLERFVRFVSTPEVLELVNTFDAEMSQLESARRIYSQGMGGQP 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            S   GG+G+G TAA DATKKELLRAIDVRLVA +QDLS             DTV ELQ+F
Sbjct: 121  SGARGGDGTGSTAAADATKKELLRAIDVRLVAVRQDLSTACARASAAGFNPDTVSELQLF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSIXXXXXXXXXXX 503
            AD+FGAHRL+EA  K+ISL ERR  LI+ WK   DDR +R+SC SDMSI           
Sbjct: 181  ADQFGAHRLHEASTKFISLWERRSELISPWKPAGDDRLVRASCESDMSIDDPTEDTTGFH 240

Query: 502  XXXXPTTFPLRRSFTMVSSVEREGE-NKPENSTGESDKKEETSPPEQTSSIQASQPGRRL 326
                      ++  ++ S+   +   N       + DK ++   P+   ++ + QP RRL
Sbjct: 241  PEDLSKPSTCQQQKSLASNFPTQQRCNNVTEEDKDGDKNKKVEEPQTEPTLASQQPARRL 300

Query: 325  SVQDRVKLFENKQKE---NSGEKPVVVKPVELRRLSSDVSMMGAAAEKAVLRRWSGVSDM 155
            SVQDR+KLFENKQ     +SG KPVV KP ELRRLSSDVS + A     VLRRWSG SDM
Sbjct: 301  SVQDRIKLFENKQDSPGGSSGGKPVVAKPAELRRLSSDVSSVPAG---TVLRRWSGASDM 357

Query: 154  SIDLTAEKKESVNNVMESNSDTTKSSS 74
            SIDL+AEKK+  + +   +S ++ S S
Sbjct: 358  SIDLSAEKKDGESPLCTPSSVSSVSLS 384


>emb|CBI35826.3| unnamed protein product [Vitis vinifera]
          Length = 1163

 Score =  386 bits (991), Expect = e-104
 Identities = 229/405 (56%), Positives = 271/405 (66%), Gaps = 12/405 (2%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD  LD+AVFQLSP+ SRCEL+VS  G+TEKLASGL+KPFVTHLKVVEEQVA   QSI
Sbjct: 1    MKSDGALDYAVFQLSPKRSRCELFVSRDGNTEKLASGLVKPFVTHLKVVEEQVALAVQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGD-- 860
             L+  + KNA++WFTKGTL+RFVRFVSTPEVLELVNTFDAE+SQLEAAR IYSQG+GD  
Sbjct: 61   KLEVEKYKNADLWFTKGTLERFVRFVSTPEVLELVNTFDAEVSQLEAARTIYSQGVGDPV 120

Query: 859  -QHSGGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
               SGG+ +G  AA DATKKELLRAIDVRLVA +QDL+             +TV ELQ+F
Sbjct: 121  SSASGGDVTGSVAAADATKKELLRAIDVRLVAVRQDLTMACSRASAAGFNPETVAELQIF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLIN--QWKSGSDDRALRSSCTSDMSIXXXXXXXXX 509
            +DRFGAHRL+EAC K+ SLC+RRP+LI+   WK G+DDRA+RSS  SDMSI         
Sbjct: 181  SDRFGAHRLSEACSKFFSLCQRRPDLISTATWKGGADDRAVRSSSGSDMSI--------- 231

Query: 508  XXXXXXPTTFPLRRSFTMVSSVEREGENKPENSTGESDKKEETSPPEQTSSIQASQPGRR 329
                                      +  PEN    + + +   P        ++QP RR
Sbjct: 232  --------------------------DEPPENKQPAAQEPDVPKP--------STQPARR 257

Query: 328  LSVQDRVKLFENKQKEN----SGEKPVVVKPVELRRLSSDVSMMGAAAEKAVLRRWSGVS 161
            LSVQDR+ LFENKQKE+    SG K VV K VELRRLSSDVS   A  EKAVLRRWSG S
Sbjct: 258  LSVQDRINLFENKQKESSTSGSGGKVVVGKSVELRRLSSDVSSAPAVVEKAVLRRWSGAS 317

Query: 160  DMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDS---NATPSES 35
            DMSIDL+ EKK++     ES   T  +SS+ +  S    ATP+ +
Sbjct: 318  DMSIDLSFEKKDT-----ESPLCTPSTSSLPQTKSLTDTATPNSA 357


>ref|XP_004141819.1| PREDICTED: uncharacterized protein LOC101213033 [Cucumis sativus]
            gi|449480667|ref|XP_004155962.1| PREDICTED:
            uncharacterized LOC101213033 [Cucumis sativus]
          Length = 1411

 Score =  382 bits (982), Expect = e-103
 Identities = 233/434 (53%), Positives = 267/434 (61%), Gaps = 39/434 (8%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MK + PLDFAVFQLSPR SRCEL+VSS G+TEKLASG +KPFVT LKV EEQ A   Q+I
Sbjct: 1    MKPETPLDFAVFQLSPRRSRCELFVSSHGNTEKLASGSVKPFVTQLKVAEEQFAHAVQAI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  R  N + WFTKGTL+RFVRFVSTPE+LELVNTFDAEMSQLEAARRIYSQG GD+H
Sbjct: 61   KLEVERGGNGDAWFTKGTLERFVRFVSTPEILELVNTFDAEMSQLEAARRIYSQGEGDRH 120

Query: 853  SGGNGSGLTAA--EDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMFA 680
            SG +G   T A   D TKKELL+AIDVRL+A +QDL               TV +LQ+FA
Sbjct: 121  SGTSGGDGTGAGSTDETKKELLKAIDVRLLAVRQDLVTAATRALAAGFNPSTVSDLQLFA 180

Query: 679  DRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI------------ 536
            D+FGAHRL EAC  ++SL  RRP L+N W  G DDRA+RSSC SDMSI            
Sbjct: 181  DQFGAHRLTEACSSFLSLSRRRPELVNTWTPGMDDRAVRSSCGSDMSIDDPTEDPIGRHN 240

Query: 535  -----------------XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVEREGENKPENST 407
                                             T  P + S T+ S    + E   EN  
Sbjct: 241  KPQYQTENKHDPQSGTTSRTEEQSSHVDESKPTTCQPAKSSATVPSRRNVKDETLLENL- 299

Query: 406  GESDKKEETSPPEQTSSIQASQPGRRLSVQDRVKLFENKQKENS----GEKPVVVKPVEL 239
             E +K  E +P E   S     P RRLSVQDR+ LFENKQKEN+    G KPV  KP+EL
Sbjct: 300  -EKEKNGEETPTE-LKSTPVGPPARRLSVQDRINLFENKQKENTGGSGGGKPVSGKPLEL 357

Query: 238  RRLSSDVSMMGAAAEKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPD 59
            RRLSSDVS   +A EKAVLRRWSGVSDMSID + EKK+     +ES   T  SSS+    
Sbjct: 358  RRLSSDVSSAPSAVEKAVLRRWSGVSDMSIDFSNEKKD-----IESPLCTPSSSSISDTK 412

Query: 58   SN----ATPSESSK 29
            SN    AT  ES K
Sbjct: 413  SNVFSSATEIESEK 426


>ref|XP_006362089.1| PREDICTED: uncharacterized protein LOC102584476 [Solanum tuberosum]
          Length = 1440

 Score =  380 bits (976), Expect = e-103
 Identities = 229/392 (58%), Positives = 262/392 (66%), Gaps = 31/392 (7%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            M+S   LD+AVFQLSP+ SRCEL+VSS G+TEKLASGLLKPFVTHLKV EEQVA   QSI
Sbjct: 1    MESSMLLDYAVFQLSPKRSRCELFVSSDGNTEKLASGLLKPFVTHLKVAEEQVALAVQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  R KN+E WFTKGTL+RFVRFVSTPEVLELVNT DAEMSQLEAARRIYSQG G Q 
Sbjct: 61   KLEVKRCKNSETWFTKGTLERFVRFVSTPEVLELVNTLDAEMSQLEAARRIYSQGEGYQF 120

Query: 853  S--GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMFA 680
            S  G  GSG+T   DATKKELLRAIDVRL A +QDLS             DTV ELQMFA
Sbjct: 121  SSTGSGGSGVTVVADATKKELLRAIDVRLTAVRQDLSTASSRAAAAGFNLDTVSELQMFA 180

Query: 679  DRFGAHRLNEACGKYISLCERRPNLINQWKS-GSDDRALRSSCTSDMSIXXXXXXXXXXX 503
            D+F AHRLNEAC K+ISL ERRP+LIN WK    DD+A+R S  SDMSI           
Sbjct: 181  DQFDAHRLNEACNKFISLSERRPDLINPWKGVPRDDQAVRCSYGSDMSIDEDPAISVQPS 240

Query: 502  XXXXPTTFPLRRSF-------------------------TMVSSVEREGENKPENSTGES 398
                 T+   R S+                         +  S+++ E ++K      E 
Sbjct: 241  TLSHSTS---RESYLKQHPHHLDQYMPSIGQQLTPLLQHSRESNIKSEEKSKEREVIAEK 297

Query: 397  DKKEETSPPEQTSSIQASQPGRRLSVQDRVKLFENKQK-ENSGE--KPVVVKPVELRRLS 227
            +K+E+TS  +Q  S + S+  RRLSVQDR+ LFENKQK ENSG   KPVV KPVEL+RLS
Sbjct: 298  EKEEDTS-SKQAESTELSRHKRRLSVQDRISLFENKQKEENSGSAGKPVVGKPVELQRLS 356

Query: 226  SDVSMMGAAAEKAVLRRWSGVSDMSIDLTAEK 131
            S VS +    EKAVLRRWSG SDMSIDLT +K
Sbjct: 357  SGVS-VPPVTEKAVLRRWSGASDMSIDLTGDK 387


>ref|XP_006465839.1| PREDICTED: uncharacterized protein LOC102629330 isoform X2 [Citrus
            sinensis]
          Length = 1374

 Score =  379 bits (973), Expect = e-102
 Identities = 231/430 (53%), Positives = 277/430 (64%), Gaps = 32/430 (7%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MK+D  LD+AVFQL+P+ SRCEL+VSS G TEKLASGL+KPFVTHLKV EEQVA   QSI
Sbjct: 1    MKADTRLDYAVFQLTPKRSRCELFVSSEGHTEKLASGLVKPFVTHLKVAEEQVARAVQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+ G+  NAE WFTKGT++RFVRFVSTPEVLELVNTFDAEMSQLEAAR+IYSQG  DQ 
Sbjct: 61   KLEVGKRDNAETWFTKGTIERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYSQGSRDQL 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            S   GG+G+G  AA DATKKELLRAIDVRLVA +QDL+             +TV ELQ F
Sbjct: 121  SGAIGGDGAGTMAAADATKKELLRAIDVRLVAVRQDLTTAYARAASAGFNPETVSELQNF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI----------- 536
            AD FGAHRLNEAC K+ S+C+RRP+LI+ WK   +++ +RSS  SDMSI           
Sbjct: 181  ADWFGAHRLNEACTKFTSVCDRRPDLISLWKPVVNEQVIRSSWGSDMSIDDSTEDQNRPH 240

Query: 535  ----XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVER-----EGENKPENSTGESDKKEE 383
                                T    + + +  S+ ++       + + +N     +KK+E
Sbjct: 241  QISQNKPHNPSSQETPQQQITAQTQQLNLSKPSTCQQPKSVFPAQQRNQNENSNDEKKKE 300

Query: 382  TSPPEQTSSIQASQPGRRLSVQDRVKLFENKQKEN---SGEKPVVV-KPVELRRLSSDVS 215
             +  E ++    SQP RRLSVQDR+KLFE+ QKEN   SG KP+VV K  ELRRLSSDVS
Sbjct: 301  EAVIESST----SQPARRLSVQDRIKLFESTQKENSSGSGGKPIVVGKSAELRRLSSDVS 356

Query: 214  MMGAAA-----EKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDSNA 50
               A       EKAVLRRWSGVSDMSIDL  ++KE  NN  ES   T  SS + +  SN 
Sbjct: 357  SSSATTPTGPIEKAVLRRWSGVSDMSIDLGNDRKE--NNNTESPLCTPSSSFVSQSKSNV 414

Query: 49   TPSESSKVKD 20
                S   KD
Sbjct: 415  FSGFSEDNKD 424


>ref|XP_006465838.1| PREDICTED: uncharacterized protein LOC102629330 isoform X1 [Citrus
            sinensis]
          Length = 1419

 Score =  379 bits (973), Expect = e-102
 Identities = 231/430 (53%), Positives = 277/430 (64%), Gaps = 32/430 (7%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MK+D  LD+AVFQL+P+ SRCEL+VSS G TEKLASGL+KPFVTHLKV EEQVA   QSI
Sbjct: 1    MKADTRLDYAVFQLTPKRSRCELFVSSEGHTEKLASGLVKPFVTHLKVAEEQVARAVQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+ G+  NAE WFTKGT++RFVRFVSTPEVLELVNTFDAEMSQLEAAR+IYSQG  DQ 
Sbjct: 61   KLEVGKRDNAETWFTKGTIERFVRFVSTPEVLELVNTFDAEMSQLEAARKIYSQGSRDQL 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            S   GG+G+G  AA DATKKELLRAIDVRLVA +QDL+             +TV ELQ F
Sbjct: 121  SGAIGGDGAGTMAAADATKKELLRAIDVRLVAVRQDLTTAYARAASAGFNPETVSELQNF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI----------- 536
            AD FGAHRLNEAC K+ S+C+RRP+LI+ WK   +++ +RSS  SDMSI           
Sbjct: 181  ADWFGAHRLNEACTKFTSVCDRRPDLISLWKPVVNEQVIRSSWGSDMSIDDSTEDQNRPH 240

Query: 535  ----XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVER-----EGENKPENSTGESDKKEE 383
                                T    + + +  S+ ++       + + +N     +KK+E
Sbjct: 241  QISQNKPHNPSSQETPQQQITAQTQQLNLSKPSTCQQPKSVFPAQQRNQNENSNDEKKKE 300

Query: 382  TSPPEQTSSIQASQPGRRLSVQDRVKLFENKQKEN---SGEKPVVV-KPVELRRLSSDVS 215
             +  E ++    SQP RRLSVQDR+KLFE+ QKEN   SG KP+VV K  ELRRLSSDVS
Sbjct: 301  EAVIESST----SQPARRLSVQDRIKLFESTQKENSSGSGGKPIVVGKSAELRRLSSDVS 356

Query: 214  MMGAAA-----EKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDSNA 50
               A       EKAVLRRWSGVSDMSIDL  ++KE  NN  ES   T  SS + +  SN 
Sbjct: 357  SSSATTPTGPIEKAVLRRWSGVSDMSIDLGNDRKE--NNNTESPLCTPSSSFVSQSKSNV 414

Query: 49   TPSESSKVKD 20
                S   KD
Sbjct: 415  FSGFSEDNKD 424


>ref|XP_007135400.1| hypothetical protein PHAVU_010G126300g [Phaseolus vulgaris]
            gi|561008445|gb|ESW07394.1| hypothetical protein
            PHAVU_010G126300g [Phaseolus vulgaris]
          Length = 1257

 Score =  369 bits (946), Expect = 2e-99
 Identities = 223/410 (54%), Positives = 260/410 (63%), Gaps = 10/410 (2%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD  LD+AVFQLSPR SRCEL VSS G+TEKLASGLLKPF+T+LKV EEQVA  A SI
Sbjct: 1    MKSDTLLDYAVFQLSPRRSRCELLVSSDGNTEKLASGLLKPFLTNLKVAEEQVALAASSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  R KNAE WFTKGT +RFVRFVSTPEVLE+VNT+DAEMSQLEAARRIYSQG GDQ 
Sbjct: 61   KLEIDRHKNAEAWFTKGTFERFVRFVSTPEVLEMVNTYDAEMSQLEAARRIYSQGAGDQR 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            S   GGNG+G     DAT KELLRAIDVRL A +QDL+              T+  L+ F
Sbjct: 121  SDPQGGNGAGAITVADATTKELLRAIDVRLSAVRQDLTTACARASASGFNPHTISHLKHF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI-----XXXXXX 518
            + RFGAHRLNEAC KY+SL ERRP+LI+ W  G DDR LRSS +SDMSI           
Sbjct: 181  SHRFGAHRLNEACTKYMSLYERRPDLISHW-PGGDDRELRSSVSSDMSIDNDDGPNQPQA 239

Query: 517  XXXXXXXXXPTTFPLRRSFTMVSSVEREGEN-KPENSTGESDKKEETSPPEQTSSIQASQ 341
                      +  P  +    ++S+ R   +    +   ++  KEET  P   +S   + 
Sbjct: 240  QAQAQAHDQLSDPPKPKPSANLASLRRSNTSVNSRDDNNDTPTKEETESPASATSASTAP 299

Query: 340  PGRRLSVQDRVKLFENKQKENSGEKPVVVKPVELRRLSSDVSMMGAAAEKAVLRRWSGVS 161
             GRRLSVQDR+ LFENKQKENS       KP ELRRLSSD           VLRRWS  S
Sbjct: 300  AGRRLSVQDRINLFENKQKENSSG-----KPPELRRLSSD-----------VLRRWSVAS 343

Query: 160  DMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDSNATPSES-SKVKDGS 14
            DMSID++ EKKES ++ + S+   TKS    + D N   SE   K   GS
Sbjct: 344  DMSIDVSGEKKES-DSPLSSSVSQTKSLVSEEKDRNDNISEKFGKTDQGS 392


>ref|XP_006426753.1| hypothetical protein CICLE_v10024713mg [Citrus clementina]
            gi|557528743|gb|ESR39993.1| hypothetical protein
            CICLE_v10024713mg [Citrus clementina]
          Length = 1409

 Score =  367 bits (941), Expect = 8e-99
 Identities = 225/427 (52%), Positives = 271/427 (63%), Gaps = 29/427 (6%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MK+D  LD+AVFQL+P+ SRCEL+VSS G TEKLASGL+KPFVTHLKV EEQVA   QSI
Sbjct: 1    MKADTRLDYAVFQLTPKRSRCELFVSSEGHTEKLASGLVKPFVTHLKVAEEQVARAVQSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+ G+  NAE WFTKGT++RFVRFVSTPEVLELVNTFDAEMSQLEAA +IYSQ      
Sbjct: 61   KLEVGKRDNAETWFTKGTIERFVRFVSTPEVLELVNTFDAEMSQLEAACKIYSQ------ 114

Query: 853  SGGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMFADR 674
             GG+G+G  AA DATKKELLRAIDVRLVA +QDL+             +TV ELQ FAD 
Sbjct: 115  -GGDGAGTMAAADATKKELLRAIDVRLVAVRQDLTTAYARAASAGFNPETVSELQNFADW 173

Query: 673  FGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSI-------------- 536
            FGAHRLNEAC K+ S+C+RRP+LI+ WK   +++ +RSS  SDMSI              
Sbjct: 174  FGAHRLNEACTKFTSVCDRRPDLISPWKPVVNEQVIRSSWGSDMSIDDSTEDQNRPHQIS 233

Query: 535  -XXXXXXXXXXXXXXXPTTFPLRRSFTMVSSVER-----EGENKPENSTGESDKKEETSP 374
                             T    + + +  S+ ++       + + +N     +KK+E + 
Sbjct: 234  QNKAHNPSSQETPQQQITAQTQQLNLSKPSTCQQPKSVFPAQQRNQNENSNDEKKKEEAV 293

Query: 373  PEQTSSIQASQPGRRLSVQDRVKLFENKQKEN---SGEKPVVV-KPVELRRLSSDVSMMG 206
             E ++    SQP RRLSVQDR+KLFE+ QKEN   SG KP+VV K  ELRRLSSDVS   
Sbjct: 294  TESST----SQPARRLSVQDRIKLFESTQKENSSGSGGKPIVVGKSAELRRLSSDVSSSS 349

Query: 205  AA-----AEKAVLRRWSGVSDMSIDLTAEKKESVNNVMESNSDTTKSSSMIKPDSNATPS 41
            A       EKAVLRRWSGVSDMSIDL   +KE+ N   ES   T  SS + +  SN    
Sbjct: 350  ATTPTGPVEKAVLRRWSGVSDMSIDLGNGRKENDNT--ESPLCTPSSSFVSQSKSNVFSG 407

Query: 40   ESSKVKD 20
             S   KD
Sbjct: 408  FSEDNKD 414


>ref|XP_006583177.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Glycine max]
          Length = 1009

 Score =  366 bits (939), Expect = 1e-98
 Identities = 216/399 (54%), Positives = 256/399 (64%), Gaps = 4/399 (1%)
 Frame = -1

Query: 1213 MKSDAPLDFAVFQLSPRHSRCELYVSSGGSTEKLASGLLKPFVTHLKVVEEQVASNAQSI 1034
            MKSD  LD+AVFQLSPRHSRCEL VSS G TEKLASGL+KPF+THLKV EEQVA  A SI
Sbjct: 1    MKSDTLLDYAVFQLSPRHSRCELLVSSDGHTEKLASGLVKPFLTHLKVAEEQVALAASSI 60

Query: 1033 NLDCGRDKNAEVWFTKGTLDRFVRFVSTPEVLELVNTFDAEMSQLEAARRIYSQGMGDQH 854
             L+  R KNAE WFTKGT +RFVR+VSTPEVLE+VNTFDAEMSQLEAARRIY+QG GDQ 
Sbjct: 61   KLEIDRHKNAETWFTKGTFERFVRYVSTPEVLEMVNTFDAEMSQLEAARRIYAQGAGDQR 120

Query: 853  S---GGNGSGLTAAEDATKKELLRAIDVRLVAAQQDLSNXXXXXXXXXXXADTVPELQMF 683
            S   GGNG+G     DAT KELLRAIDVRL A +QDL+              TV  L+ F
Sbjct: 121  SDPQGGNGAGAITVADATTKELLRAIDVRLSAVRQDLTTACARASASGFNPHTVSHLKHF 180

Query: 682  ADRFGAHRLNEACGKYISLCERRPNLINQWKSGSDDRALRSSCTSDMSIXXXXXXXXXXX 503
            ADRFGAHR NEAC KY+SL +RRP+LI+ W  G DDR LRSS +SDMSI           
Sbjct: 181  ADRFGAHRFNEACTKYMSLYKRRPDLISHW-PGGDDRELRSSVSSDMSIDNDDGPNQAQD 239

Query: 502  XXXXPTTFPLRRSFTMVSSVEREGEN-KPENSTGESDKKEETSPPEQTSSIQASQPGRRL 326
                    P  +  +  +S+ R   +   ++ T ++  KEET  P    +   S  GRRL
Sbjct: 240  QAQPIDP-PKPKPISNFASLRRSNTSVSSKDETSDTPTKEETESPAPAPTTAPS--GRRL 296

Query: 325  SVQDRVKLFENKQKENSGEKPVVVKPVELRRLSSDVSMMGAAAEKAVLRRWSGVSDMSID 146
            SVQDR+ LFENKQKENSG      +  ELRRLSSD           VLRRWSG SDMSID
Sbjct: 297  SVQDRINLFENKQKENSGG-----RAPELRRLSSD-----------VLRRWSGASDMSID 340

Query: 145  LTAEKKESVNNVMESNSDTTKSSSMIKPDSNATPSESSK 29
             + EKK+  + +    S  +++ S++  +      +S K
Sbjct: 341  GSGEKKDFDSPLPPPASSVSETKSVVVSEDKVRIDKSEK 379


Top