BLASTX nr result

ID: Panax24_contig00015548 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Panax24_contig00015548
         (1891 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp...   403   e-129
XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [...   403   e-129
KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus ...   355   e-118
XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [...   372   e-117
KZN09159.1 hypothetical protein DCAR_001815 [Daucus carota subsp...   372   e-117
XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [...   355   e-110
KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus ...   333   e-102
XP_017224884.1 PREDICTED: cell wall protein RBR3-like [Daucus ca...   330   e-102
XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [...   329   e-101
KZM81625.1 hypothetical protein DCAR_029238 [Daucus carota subsp...   330   7e-98
KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara car...   317   4e-97
XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [...   318   6e-97
XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [...   313   3e-94
XP_010245093.1 PREDICTED: uncharacterized protein LOC104588732 i...   308   9e-93
XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 i...   308   2e-92
CDO97516.1 unnamed protein product [Coffea canephora]                 305   6e-92
GAV67368.1 hypothetical protein CFOL_v3_10874 [Cephalotus follic...   305   1e-91
XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao]                  296   2e-88
EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobro...   293   4e-87
EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobro...   293   4e-87

>KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp. sativus]
          Length = 617

 Score =  403 bits (1036), Expect = e-129
 Identities = 216/302 (71%), Positives = 242/302 (80%), Gaps = 1/302 (0%)
 Frame = +1

Query: 16   KNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQSLPI 195
            K+N+NNGN+RLA SSP SSVHK +FDRDFPSLGA+ERQ DPEIGR+PSPGL TAIQ+LP 
Sbjct: 159  KSNHNNGNSRLAVSSPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPT 218

Query: 196  GSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAETLAQG 375
            GSSA I DGGWTSALAEVP MIG NGT                     TGLNMAETL QG
Sbjct: 219  GSSAGIADGGWTSALAEVPAMIGSNGT-TASSVPHSVSSSASVVPSMMTGLNMAETLVQG 277

Query: 376  PPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQHQNSS 555
            PPRVQA PQLSV TQRLEELAIKQSRQLIPVTPS+PKALVLN  +K K KVG QQ Q++S
Sbjct: 278  PPRVQADPQLSVETQRLEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQ-QSAS 336

Query: 556  THIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSGSKVANNPLA 732
            T++V+HSPRGAPTK++I KTSSLGKLQVLKPARERNGVS  +KD+LSPTS SK+ANNPLA
Sbjct: 337  TNLVHHSPRGAPTKNEIIKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLANNPLA 396

Query: 733  AAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNLVRKKSM 912
             A + VGSAPLRS +N+ IL S ERK  P   V P+LEKRP+ Q +SRNDFFN +RKKSM
Sbjct: 397  PALATVGSAPLRSSMNHSILVSAERKSAPPVMVTPMLEKRPSPQAKSRNDFFNSMRKKSM 456

Query: 913  TN 918
            TN
Sbjct: 457  TN 458



 Score =  124 bits (310), Expect = 5e-26
 Identities = 69/107 (64%), Positives = 79/107 (73%)
 Frame = +3

Query: 1029 ADGDQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESES 1208
            +D  +++E   G I NS    GPQ SLDNG NHSS+DVI+ SEEEEAAFLRSLGW+  E+
Sbjct: 500  SDEGKINECRDGSIQNSH---GPQNSLDNGVNHSSTDVILSSEEEEAAFLRSLGWE--EN 554

Query: 1209 AGEDEGLTEEEISAFYRDVSKYINSGPSSKTFLGTKPKFLVPLKLQM 1349
            AGEDEGLTEEEI+AFYRD   YINS P SKT LGTK K   P+  QM
Sbjct: 555  AGEDEGLTEEEINAFYRD---YINSAPPSKTLLGTKQKLFGPINFQM 598


>XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [Daucus carota subsp.
            sativus]
          Length = 620

 Score =  403 bits (1036), Expect = e-129
 Identities = 216/302 (71%), Positives = 242/302 (80%), Gaps = 1/302 (0%)
 Frame = +1

Query: 16   KNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQSLPI 195
            K+N+NNGN+RLA SSP SSVHK +FDRDFPSLGA+ERQ DPEIGR+PSPGL TAIQ+LP 
Sbjct: 159  KSNHNNGNSRLAVSSPISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPT 218

Query: 196  GSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAETLAQG 375
            GSSA I DGGWTSALAEVP MIG NGT                     TGLNMAETL QG
Sbjct: 219  GSSAGIADGGWTSALAEVPAMIGSNGT-TASSVPHSVSSSASVVPSMMTGLNMAETLVQG 277

Query: 376  PPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQHQNSS 555
            PPRVQA PQLSV TQRLEELAIKQSRQLIPVTPS+PKALVLN  +K K KVG QQ Q++S
Sbjct: 278  PPRVQADPQLSVETQRLEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQ-QSAS 336

Query: 556  THIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSGSKVANNPLA 732
            T++V+HSPRGAPTK++I KTSSLGKLQVLKPARERNGVS  +KD+LSPTS SK+ANNPLA
Sbjct: 337  TNLVHHSPRGAPTKNEIIKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLANNPLA 396

Query: 733  AAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNLVRKKSM 912
             A + VGSAPLRS +N+ IL S ERK  P   V P+LEKRP+ Q +SRNDFFN +RKKSM
Sbjct: 397  PALATVGSAPLRSSMNHSILVSAERKSAPPVMVTPMLEKRPSPQAKSRNDFFNSMRKKSM 456

Query: 913  TN 918
            TN
Sbjct: 457  TN 458



 Score =  134 bits (337), Expect = 2e-29
 Identities = 72/107 (67%), Positives = 82/107 (76%)
 Frame = +3

Query: 1029 ADGDQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESES 1208
            +D  +++E   G I NS    GPQ SLDNG NHSS+DVI+ SEEEEAAFLRSLGW+E+  
Sbjct: 500  SDEGKINECRDGSIQNSH---GPQNSLDNGVNHSSTDVILSSEEEEAAFLRSLGWEEN-- 554

Query: 1209 AGEDEGLTEEEISAFYRDVSKYINSGPSSKTFLGTKPKFLVPLKLQM 1349
            AGEDEGLTEEEI+AFYRDVSKYINS P SKT LGTK K   P+  QM
Sbjct: 555  AGEDEGLTEEEINAFYRDVSKYINSAPPSKTLLGTKQKLFGPINFQM 601


>KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus var. scolymus]
          Length = 636

 Score =  355 bits (910), Expect(2) = e-118
 Identities = 198/309 (64%), Positives = 232/309 (75%), Gaps = 3/309 (0%)
 Frame = +1

Query: 1    SSSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAI 180
            SSSA+KN++NNG+   +G+    SV KTAF+RDFPSLGA+E+Q DPEIGR+PSPGL+TAI
Sbjct: 194  SSSANKNSHNNGSALRSGAGAIGSV-KTAFERDFPSLGAEEKQIDPEIGRVPSPGLTTAI 252

Query: 181  QSLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAE 360
            QSLPIG+SAVIG  GWTSALAEVPV++G NG+                     TG NMAE
Sbjct: 253  QSLPIGNSAVIGGDGWTSALAEVPVIVGSNGSNTSVPPPLQSTSISATASM-ATGRNMAE 311

Query: 361  TLAQGPPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQ 540
            TLAQGPPR Q  PQLSVGTQRLEELA+KQSRQLIP+TPS+PKAL LN  +KPK KVGQ Q
Sbjct: 312  TLAQGPPRAQTAPQLSVGTQRLEELAVKQSRQLIPMTPSLPKALALNSSDKPKSKVGQLQ 371

Query: 541  HQNSSTHIVN--HSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSGSK 711
             Q  S+H+VN  HSPR   TK D+SKTSS+GKL VLKP+RERNG++  AKD+LSPT  SK
Sbjct: 372  LQ--SSHLVNHTHSPRPVSTKFDVSKTSSVGKLHVLKPSRERNGITPIAKDNLSPTGASK 429

Query: 712  VANNPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFN 891
            + N+PLA   S VGSAPLR+  NNP +A   +  V A      LEKRP+SQ QSRNDFFN
Sbjct: 430  LPNSPLAVT-SVVGSAPLRNLGNNPAVAVAVKPGVAAT-----LEKRPSSQAQSRNDFFN 483

Query: 892  LVRKKSMTN 918
            L+RKKSMTN
Sbjct: 484  LMRKKSMTN 492



 Score =  102 bits (253), Expect(2) = e-118
 Identities = 56/102 (54%), Positives = 71/102 (69%), Gaps = 1/102 (0%)
 Frame = +3

Query: 1032 DGDQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSD-VIIYSEEEEAAFLRSLGWDESES 1208
            DG    ++  G   +  SC+G      NG+N+SSSD +I+YSEEEEA FLRSLGW+E+  
Sbjct: 524  DGSGGVQVSSGNKVDLSSCNGEATERSNGKNNSSSDAIILYSEEEEARFLRSLGWEETGE 583

Query: 1209 AGEDEGLTEEEISAFYRDVSKYINSGPSSKTFLGTKPKFLVP 1334
              E+EGLTEEEIS+FYRDVSKY+N   +SK F   KPK L+P
Sbjct: 584  --EEEGLTEEEISSFYRDVSKYLNLQAASKIF---KPKLLMP 620


>XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [Daucus carota subsp.
            sativus]
          Length = 591

 Score =  372 bits (954), Expect = e-117
 Identities = 204/307 (66%), Positives = 231/307 (75%), Gaps = 2/307 (0%)
 Frame = +1

Query: 4    SSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQ 183
            +S  K+NYNNG++ LAGSS  S+V K AFDRDFPSLGADERQ D E+ R+PSPGLST +Q
Sbjct: 164  NSFDKSNYNNGSSLLAGSSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQ 223

Query: 184  SLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAET 363
            +LPIG SAV G+ GWTSALAEV V +G NG                     T+GLNMAET
Sbjct: 224  NLPIGYSAVTGEIGWTSALAEVQVKVGANGINKSSVAQAALPSSASVASSMTSGLNMAET 283

Query: 364  LAQGPPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQH 543
            LAQGPP V AT Q SVGTQRLEE+AIKQS+QLIPVTPSMPKALVLN  EK K K  QQQH
Sbjct: 284  LAQGPPHVHAT-QFSVGTQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAAQQQH 342

Query: 544  QNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVSY-AKDSLSPTSGSKVAN 720
            Q SSTH  NHSPRG P KSD+SKTSSLGKLQVLKPARERN +SY  KD+LSPT+ SKV N
Sbjct: 343  QTSSTHHFNHSPRGTPMKSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNASKVPN 402

Query: 721  NPLAAAPSAVGSAP-LRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNLV 897
            NPL AA S+VG  P LRSP+ NPI+AS          VP +LEK+P++Q +SRNDFFNLV
Sbjct: 403  NPLTAA-SSVGVPPSLRSPIKNPIVAS--------GVVPTVLEKKPSAQLRSRNDFFNLV 453

Query: 898  RKKSMTN 918
            RKKS+TN
Sbjct: 454  RKKSLTN 460



 Score =  115 bits (288), Expect = 2e-23
 Identities = 63/85 (74%), Positives = 66/85 (77%), Gaps = 3/85 (3%)
 Frame = +3

Query: 1062 GLICNSDSCDGPQKSLDNGEN---HSSSDVIIYSEEEEAAFLRSLGWDESESAGEDEGLT 1232
            GLI N D+CDG  KS DNGEN    SSSDVI+ SEEEEAAFLRSLGWDE+  AGEDEGLT
Sbjct: 509  GLISNRDACDGTPKSPDNGENGETRSSSDVILCSEEEEAAFLRSLGWDEN--AGEDEGLT 566

Query: 1233 EEEISAFYRDVSKYINSGPSSKTFL 1307
            EEEI  FYRD SKYI   PSSKT L
Sbjct: 567  EEEIREFYRDASKYIKPRPSSKTSL 591


>KZN09159.1 hypothetical protein DCAR_001815 [Daucus carota subsp. sativus]
          Length = 593

 Score =  372 bits (954), Expect = e-117
 Identities = 204/307 (66%), Positives = 231/307 (75%), Gaps = 2/307 (0%)
 Frame = +1

Query: 4    SSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQ 183
            +S  K+NYNNG++ LAGSS  S+V K AFDRDFPSLGADERQ D E+ R+PSPGLST +Q
Sbjct: 164  NSFDKSNYNNGSSLLAGSSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQ 223

Query: 184  SLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAET 363
            +LPIG SAV G+ GWTSALAEV V +G NG                     T+GLNMAET
Sbjct: 224  NLPIGYSAVTGEIGWTSALAEVQVKVGANGINKSSVAQAALPSSASVASSMTSGLNMAET 283

Query: 364  LAQGPPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQH 543
            LAQGPP V AT Q SVGTQRLEE+AIKQS+QLIPVTPSMPKALVLN  EK K K  QQQH
Sbjct: 284  LAQGPPHVHAT-QFSVGTQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAAQQQH 342

Query: 544  QNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVSY-AKDSLSPTSGSKVAN 720
            Q SSTH  NHSPRG P KSD+SKTSSLGKLQVLKPARERN +SY  KD+LSPT+ SKV N
Sbjct: 343  QTSSTHHFNHSPRGTPMKSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNASKVPN 402

Query: 721  NPLAAAPSAVGSAP-LRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNLV 897
            NPL AA S+VG  P LRSP+ NPI+AS          VP +LEK+P++Q +SRNDFFNLV
Sbjct: 403  NPLTAA-SSVGVPPSLRSPIKNPIVAS--------GVVPTVLEKKPSAQLRSRNDFFNLV 453

Query: 898  RKKSMTN 918
            RKKS+TN
Sbjct: 454  RKKSLTN 460



 Score =  102 bits (253), Expect = 5e-19
 Identities = 55/73 (75%), Positives = 58/73 (79%), Gaps = 3/73 (4%)
 Frame = +3

Query: 1062 GLICNSDSCDGPQKSLDNGEN---HSSSDVIIYSEEEEAAFLRSLGWDESESAGEDEGLT 1232
            GLI N D+CDG  KS DNGEN    SSSDVI+ SEEEEAAFLRSLGWD  E+AGEDEGLT
Sbjct: 509  GLISNRDACDGTPKSPDNGENGETRSSSDVILCSEEEEAAFLRSLGWD--ENAGEDEGLT 566

Query: 1233 EEEISAFYRDVSK 1271
            EEEI  FYRD SK
Sbjct: 567  EEEIREFYRDASK 579


>XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera]
          Length = 665

 Score =  355 bits (910), Expect = e-110
 Identities = 192/308 (62%), Positives = 229/308 (74%), Gaps = 3/308 (0%)
 Frame = +1

Query: 4    SSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQ 183
            S+ +K  ++NG+ +LA     SSV K AFDR+FPSLGA+++Q  P+IGR+ SPGL++AIQ
Sbjct: 162  STVNKTIHSNGDGQLASGIVTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQ 221

Query: 184  SLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAET 363
            SLPIG++ VIG  GWTSALAEVPV+IG N T                    T+GLNMAET
Sbjct: 222  SLPIGNTVVIGGDGWTSALAEVPVIIGSNTTGVSSVQQSVSASSVSVAPSTTSGLNMAET 281

Query: 364  LAQGPPRVQ--ATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQ 537
            L QGP R +  ATPQLSVGTQRLEELA+KQSRQLIP+TPSMPK LV +P +KPK K+G Q
Sbjct: 282  LVQGPARARANATPQLSVGTQRLEELALKQSRQLIPMTPSMPKTLVPSPSDKPKSKIGLQ 341

Query: 538  QHQNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSGSKV 714
                   H+VNHS RG P +SD++KTS++GKL VLKP+RERNGVS  AKDSLSPT GS+V
Sbjct: 342  -----PLHLVNHSQRGGPARSDVTKTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRV 396

Query: 715  ANNPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNL 894
            AN+PLA  PSA GSA LRSP NNP LAS ER+P   + V   +EKRPTSQ QSRNDFFNL
Sbjct: 397  ANSPLAVTPSAAGSASLRSPRNNPTLASAERRP---SVVLTSVEKRPTSQAQSRNDFFNL 453

Query: 895  VRKKSMTN 918
            +RKKS TN
Sbjct: 454  MRKKSSTN 461



 Score = 97.1 bits (240), Expect = 2e-17
 Identities = 56/104 (53%), Positives = 66/104 (63%)
 Frame = +3

Query: 1038 DQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGE 1217
            DQ  E+  G   N D+CD  QK LDNGE HSS D ++Y +EEEAAFLRSLGW+E+   GE
Sbjct: 552  DQGDEVHDG---NGDACDVSQKFLDNGEKHSSPDEVLYPDEEEAAFLRSLGWEEN---GE 605

Query: 1218 DEGLTEEEISAFYRDVSKYINSGPSSKTFLGTKPKFLVPLKLQM 1349
            DEGLTEEEI+AFY++  K     PSS       PK    L  QM
Sbjct: 606  DEGLTEEEINAFYKECMKL---KPSSNLLQRMLPKISPLLDSQM 646


>KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus var. scolymus]
          Length = 629

 Score =  333 bits (855), Expect = e-102
 Identities = 189/315 (60%), Positives = 227/315 (72%), Gaps = 12/315 (3%)
 Frame = +1

Query: 4    SSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQ 183
            S A+K++++NG   L+G S  S+V KT+F+RDFPSLGADE+QADP+IGR+PSPGLS+AIQ
Sbjct: 190  SIANKSSHSNGTALLSGGSSLSNV-KTSFERDFPSLGADEKQADPDIGRVPSPGLSSAIQ 248

Query: 184  SLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAET 363
            SLPIG+SAVIG  GWTSALAEVPV++G NG                     T G NMAET
Sbjct: 249  SLPIGNSAVIGGDGWTSALAEVPVIVGSNGN-STSVSQPVQPTSITATTSMTGGRNMAET 307

Query: 364  LAQGPPRVQATPQ-----------LSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLE 510
            LA GPPR Q  PQ           L+VGTQRLEELA+KQSRQLIP+TPSMPKAL L+  +
Sbjct: 308  LAHGPPRTQTAPQVAQMLLMGSTILTVGTQRLEELAVKQSRQLIPMTPSMPKALALSSSD 367

Query: 511  KPKPKVGQQQHQNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDS 687
            KPK K+GQ Q  N       H+PR    KSD+SKTS++GKL VLKP+RERNG+S  AK+S
Sbjct: 368  KPKLKIGQSQLVNHP-----HTPRPLSVKSDVSKTSTVGKLLVLKPSRERNGISPTAKES 422

Query: 688  LSPTSGSKVANNPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQT 867
            LSPT GSK+ N+PL A PSA+GSAPLR+  NNP + + ERKP  A      LEKRP+SQ 
Sbjct: 423  LSPTGGSKLPNSPL-AVPSAIGSAPLRNMGNNPGVTAVERKPSVAT-----LEKRPSSQA 476

Query: 868  QSRNDFFNLVRKKSM 912
            QSRN+FFNL+RKKSM
Sbjct: 477  QSRNNFFNLMRKKSM 491



 Score = 99.4 bits (246), Expect = 4e-18
 Identities = 51/91 (56%), Positives = 65/91 (71%)
 Frame = +3

Query: 1065 LICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGEDEGLTEEEI 1244
            L C  D+C    +S +NG+NHS  D ++ SEEEEA FLRSLGWD  E+A E+EGLTEEEI
Sbjct: 536  LTCKGDACVATVRSTNNGKNHSGPDAVLCSEEEEARFLRSLGWD--ETAEEEEGLTEEEI 593

Query: 1245 SAFYRDVSKYINSGPSSKTFLGTKPKFLVPL 1337
            S+FYR+   Y+N  P+SK   GTKPK L+ +
Sbjct: 594  SSFYRN---YLNLKPTSKILKGTKPKPLMEI 621


>XP_017224884.1 PREDICTED: cell wall protein RBR3-like [Daucus carota subsp. sativus]
          Length = 585

 Score =  330 bits (847), Expect = e-102
 Identities = 188/305 (61%), Positives = 213/305 (69%), Gaps = 1/305 (0%)
 Frame = +1

Query: 4    SSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQ 183
            +S++K+NYNN ++RL GSS  SSV KT+FDRDFPSLGADERQ D  I  IPSPGLST +Q
Sbjct: 156  NSSNKSNYNNSSSRLLGSSGISSVRKTSFDRDFPSLGADERQTDHGIRNIPSPGLSTNMQ 215

Query: 184  SLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAET 363
            SL  G S V  + GWTSALAEVPVM+G NG                        LNMAET
Sbjct: 216  SLSTGYSTVANEVGWTSALAEVPVMVGANGPITSSVLQAALPSSTSVPSSTAASLNMAET 275

Query: 364  LAQGPPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQH 543
            LAQGP RV   PQ+SV TQRLEELAIKQSRQLIP+TPSMPK+LVLN  EK K KV QQQH
Sbjct: 276  LAQGPLRVDTAPQVSVETQRLEELAIKQSRQLIPMTPSMPKSLVLNSSEKSKVKVSQQQH 335

Query: 544  QNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVSYAK-DSLSPTSGSKVAN 720
            Q SS     HS RG   KSD+ KT SLGKLQVLKPARERNGVSY + D+LS T+ S VAN
Sbjct: 336  QTSSI----HSLRGTLEKSDVPKTLSLGKLQVLKPARERNGVSYPEIDNLSLTNDSTVAN 391

Query: 721  NPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNLVR 900
            NPL   P AV   P R+ + NP   +  RKP  A  VP  LEK+P++Q QSRN+FFNLVR
Sbjct: 392  NPLTTLP-AVVPPPSRTQIKNPNPLNVNRKPA-AIMVPATLEKKPSAQLQSRNEFFNLVR 449

Query: 901  KKSMT 915
            KKS+T
Sbjct: 450  KKSLT 454



 Score =  105 bits (261), Expect = 5e-20
 Identities = 57/90 (63%), Positives = 65/90 (72%)
 Frame = +3

Query: 1038 DQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGE 1217
            D   E    LI N ++ +G Q+S  NGE  S SD+I+ SEEEEAAFLRSLGWD  E+AGE
Sbjct: 498  DHYKENVNALISNINNGNGHQQSCGNGETRSRSDMILCSEEEEAAFLRSLGWD--ENAGE 555

Query: 1218 DEGLTEEEISAFYRDVSKYINSGPSSKTFL 1307
            DEGLTEEEI+ FYRD SKYI  G SSKT L
Sbjct: 556  DEGLTEEEINEFYRDASKYIKPGSSSKTSL 585


>XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [Sesamum indicum]
          Length = 624

 Score =  329 bits (843), Expect = e-101
 Identities = 185/306 (60%), Positives = 210/306 (68%), Gaps = 1/306 (0%)
 Frame = +1

Query: 4    SSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQ 183
            SSAS  N N    R  GS       K  F++DFPSLGADER   PE+GR+PSPGLSTAIQ
Sbjct: 155  SSASGKNANGLLYR--GSPVGGRAKKATFEKDFPSLGADERAVVPEVGRVPSPGLSTAIQ 212

Query: 184  SLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAET 363
            SLP+G+S +I    WTSALAEVPV++G NGT                    TT LNMAE 
Sbjct: 213  SLPVGTSGLIVGEKWTSALAEVPVLVGSNGTALSSVQQAAPSSSASVALGSTTSLNMAEA 272

Query: 364  LAQGPPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQH 543
            +AQGP R Q TPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVL   +KPK KVGQQQH
Sbjct: 273  VAQGPSRAQTTPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLTSSDKPKGKVGQQQH 332

Query: 544  QNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSGSKVAN 720
              SS+  +NHSPRG   K D++K S++GKLQVLKP RE+NGV+   KD+LSPTS SKV  
Sbjct: 333  SISSSLPLNHSPRGGAVKGDVAKASNVGKLQVLKPVREKNGVTPVVKDNLSPTSSSKVVT 392

Query: 721  NPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNLVR 900
            + LA +PS  GSA  R   NN +    +RKP        +LEKRPTSQ QSRNDFFNLVR
Sbjct: 393  STLAVSPSVSGSAATRGLPNNGV---HDRKPSLT-----VLEKRPTSQAQSRNDFFNLVR 444

Query: 901  KKSMTN 918
            KKSM N
Sbjct: 445  KKSMPN 450



 Score = 99.8 bits (247), Expect = 3e-18
 Identities = 55/99 (55%), Positives = 70/99 (70%)
 Frame = +3

Query: 1038 DQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGE 1217
            D++SE  G L  N D+CD  Q  + NG+ + SSD II SEEEEAAFLRSLGWDE+   G 
Sbjct: 507  DRLSEEKGDLTSNGDACDA-QNYVRNGKKYPSSDPII-SEEEEAAFLRSLGWDENSDEG- 563

Query: 1218 DEGLTEEEISAFYRDVSKYINSGPSSKTFLGTKPKFLVP 1334
               LT+EEI+AFYRD++KYI+S PS +   G + KFL+P
Sbjct: 564  --ALTDEEINAFYRDLTKYIDSNPSFRILQGVQLKFLLP 600


>KZM81625.1 hypothetical protein DCAR_029238 [Daucus carota subsp. sativus]
          Length = 993

 Score =  330 bits (847), Expect = 7e-98
 Identities = 188/305 (61%), Positives = 213/305 (69%), Gaps = 1/305 (0%)
 Frame = +1

Query: 4    SSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQ 183
            +S++K+NYNN ++RL GSS  SSV KT+FDRDFPSLGADERQ D  I  IPSPGLST +Q
Sbjct: 156  NSSNKSNYNNSSSRLLGSSGISSVRKTSFDRDFPSLGADERQTDHGIRNIPSPGLSTNMQ 215

Query: 184  SLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAET 363
            SL  G S V  + GWTSALAEVPVM+G NG                        LNMAET
Sbjct: 216  SLSTGYSTVANEVGWTSALAEVPVMVGANGPITSSVLQAALPSSTSVPSSTAASLNMAET 275

Query: 364  LAQGPPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQH 543
            LAQGP RV   PQ+SV TQRLEELAIKQSRQLIP+TPSMPK+LVLN  EK K KV QQQH
Sbjct: 276  LAQGPLRVDTAPQVSVETQRLEELAIKQSRQLIPMTPSMPKSLVLNSSEKSKVKVSQQQH 335

Query: 544  QNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVSYAK-DSLSPTSGSKVAN 720
            Q SS     HS RG   KSD+ KT SLGKLQVLKPARERNGVSY + D+LS T+ S VAN
Sbjct: 336  QTSSI----HSLRGTLEKSDVPKTLSLGKLQVLKPARERNGVSYPEIDNLSLTNDSTVAN 391

Query: 721  NPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNLVR 900
            NPL   P AV   P R+ + NP   +  RKP  A  VP  LEK+P++Q QSRN+FFNLVR
Sbjct: 392  NPLTTLP-AVVPPPSRTQIKNPNPLNVNRKPA-AIMVPATLEKKPSAQLQSRNEFFNLVR 449

Query: 901  KKSMT 915
            KKS+T
Sbjct: 450  KKSLT 454



 Score = 95.5 bits (236), Expect = 1e-16
 Identities = 51/83 (61%), Positives = 60/83 (72%)
 Frame = +3

Query: 1038 DQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGE 1217
            D   E    LI N ++ +G Q+S  NGE  S SD+I+ SEEEEAAFLRSLGWD  E+AGE
Sbjct: 498  DHYKENVNALISNINNGNGHQQSCGNGETRSRSDMILCSEEEEAAFLRSLGWD--ENAGE 555

Query: 1218 DEGLTEEEISAFYRDVSKYINSG 1286
            DEGLTEEEI+ FYRD SKY + G
Sbjct: 556  DEGLTEEEINEFYRDASKYGDGG 578


>KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara cardunculus var.
            scolymus]
          Length = 551

 Score =  317 bits (812), Expect = 4e-97
 Identities = 184/305 (60%), Positives = 217/305 (71%), Gaps = 3/305 (0%)
 Frame = +1

Query: 7    SASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQS 186
            +  KN +NNG+  L     +SS  K AF+RDFPSLGA+E+QAD EIGR+PSPGL+TAIQS
Sbjct: 153  AGDKNGHNNGSA-LPSVGTSSSSGKAAFERDFPSLGAEEKQADTEIGRVPSPGLTTAIQS 211

Query: 187  LPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAETL 366
            LPIGSSAVI    WTSALAEVP+++G NG+                    TTG NMAETL
Sbjct: 212  LPIGSSAVICGDMWTSALAEVPMIVGSNGSNISVQQPIQPTSVSATTSM-TTGRNMAETL 270

Query: 367  AQGPPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQHQ 546
            AQGP R + TPQLSVGTQRLEELA+KQSRQLIP+TPSMPKAL LN  +KPK KVGQ Q Q
Sbjct: 271  AQGPSRARTTPQLSVGTQRLEELAVKQSRQLIPMTPSMPKALALNSSDKPKLKVGQSQLQ 330

Query: 547  NSSTHIVNHSP--RGAPTKSDISKTSSLGKLQVLKPARERNG-VSYAKDSLSPTSGSKVA 717
            NS  HIVNH P  R    KSD++K S++GKL +LK +RERNG  S AK+SLSPT GSK+ 
Sbjct: 331  NS--HIVNHPPSLRPVSVKSDVTKVSTVGKLHILKSSRERNGTTSTAKESLSPTGGSKLP 388

Query: 718  NNPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNLV 897
            N+PL A P  VGSA LR+   + I+A  +RK        P +EKRP+ Q QSRNDFFNL+
Sbjct: 389  NSPL-AVPVVVGSASLRNTGGSTIVA--DRK--------PCVEKRPSPQAQSRNDFFNLM 437

Query: 898  RKKSM 912
            RKKSM
Sbjct: 438  RKKSM 442



 Score = 78.2 bits (191), Expect = 2e-11
 Identities = 40/61 (65%), Positives = 47/61 (77%), Gaps = 1/61 (1%)
 Frame = +3

Query: 1083 SCDGPQKSLDNGE-NHSSSDVIIYSEEEEAAFLRSLGWDESESAGEDEGLTEEEISAFYR 1259
            SC+G      N E NHSSSD I+YSEEEEA FLRSLGW+E+    E+EGLTEEEI++FYR
Sbjct: 494  SCNGDATERSNNEKNHSSSDAILYSEEEEARFLRSLGWEETT---EEEGLTEEEINSFYR 550

Query: 1260 D 1262
            D
Sbjct: 551  D 551


>XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [Sesamum indicum]
          Length = 616

 Score =  318 bits (816), Expect = 6e-97
 Identities = 175/298 (58%), Positives = 207/298 (69%), Gaps = 1/298 (0%)
 Frame = +1

Query: 28   NNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQSLPIGSSA 207
            NN N   +  SP   V+KT F+RDFPSLGA+ER A PE+GR+PSPG+S+A+QSLPIG+  
Sbjct: 160  NNTNGLPSKGSPIGGVNKTTFERDFPSLGAEERAAIPEVGRVPSPGVSSALQSLPIGTPT 219

Query: 208  VIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAETLAQGPPRV 387
            +I    W SALAEVPV++G N T                    TT LNMAE +AQGP R 
Sbjct: 220  IIRGEKWRSALAEVPVLVGNNVTGISSVQQAAPSSSASVALGSTTSLNMAEAVAQGPSRA 279

Query: 388  QATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQHQNSSTHIV 567
            Q TPQLS+GTQRLEELAIKQSRQLIPVTPSMPK L     +K K KVGQQQH  +S+   
Sbjct: 280  QTTPQLSIGTQRLEELAIKQSRQLIPVTPSMPKPLAACSADKQKTKVGQQQHVVTSSLAA 339

Query: 568  NHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSGSKVANNPLAAAPS 744
            N SPRG P K+D+SKTS++GKL VLKP RE+NG +   K++LSPTSGSK+ ++PL AAPS
Sbjct: 340  NQSPRGGPVKADVSKTSNVGKLHVLKPVREKNGTTPVVKENLSPTSGSKLVSSPL-AAPS 398

Query: 745  AVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNLVRKKSMTN 918
              GSA  R   NNP+    +RKPV       +LEKRPTSQ QSRNDFFN VRKKSM N
Sbjct: 399  LSGSAATRVLPNNPV---ADRKPVWT-----VLEKRPTSQAQSRNDFFNSVRKKSMAN 448



 Score = 95.9 bits (237), Expect = 5e-17
 Identities = 51/100 (51%), Positives = 67/100 (67%)
 Frame = +3

Query: 1035 GDQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAG 1214
            G+ +S       CN D CD  Q  + NG+ + +SD I +SEEEEAAFLRSLGW+E+   G
Sbjct: 501  GENLSGTRSDTACNGDVCDA-QNYVSNGKKNHTSDPI-FSEEEEAAFLRSLGWEENADEG 558

Query: 1215 EDEGLTEEEISAFYRDVSKYINSGPSSKTFLGTKPKFLVP 1334
               GLT+EEISAF+RDV+KY++S PS K     +PK L+P
Sbjct: 559  ---GLTDEEISAFFRDVTKYVDSKPSLKILQAVQPKILLP 595


>XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [Nelumbo nucifera]
          Length = 655

 Score =  313 bits (801), Expect = 3e-94
 Identities = 179/315 (56%), Positives = 210/315 (66%), Gaps = 15/315 (4%)
 Frame = +1

Query: 19   NNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQSLPIG 198
            NN+NNGN  L G S  SS+ K AF+RDFPSLGA+E+Q   +IGR+ SPGLS+++QSLPIG
Sbjct: 176  NNHNNGNGLLVGGSIVSSIQKAAFERDFPSLGAEEKQGALDIGRVSSPGLSSSVQSLPIG 235

Query: 199  SSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAETLAQGP 378
            SSAVIG  GWTSALAEVPV+IG N                      +TGLNMAETLAQ P
Sbjct: 236  SSAVIGGDGWTSALAEVPVIIGNNSIGPSSVQQATPASSTSGAPNSSTGLNMAETLAQAP 295

Query: 379  PRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKV---------- 528
             R + +PQLSV TQRLEELAIKQSRQLIP+TPSMPK   LN  EK KPK           
Sbjct: 296  SRTRISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSALNSSEKAKPKAVVRTGEMGIS 355

Query: 529  --GQQQHQNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPT 699
                QQ Q  S+H+VNHS RG P +SD+ KTS  GKL VLK  RE+NG+S  AKD LSPT
Sbjct: 356  AKTSQQQQLPSSHLVNHSLRGGPVRSDVPKTSHGGKLLVLKAPREKNGISPSAKDGLSPT 415

Query: 700  SGSKVANNPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFV-PPILEKRP-TSQTQS 873
            + SKV NN L  AP A  + P+RSP N+ +    ERK V ++      +EKRP TSQ QS
Sbjct: 416  NASKVVNNSLVLAPLAAYAPPMRSPNNSKL--PNERKSVASSLTHGSAVEKRPTTSQVQS 473

Query: 874  RNDFFNLVRKKSMTN 918
            RNDFFNL+RKK+  N
Sbjct: 474  RNDFFNLMRKKTSGN 488



 Score = 99.4 bits (246), Expect = 4e-18
 Identities = 53/103 (51%), Positives = 70/103 (67%)
 Frame = +3

Query: 1038 DQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGE 1217
            D  +E GG L+ N D  +  Q+  +NGE  S++D  +Y +EEEAAFLRSLGWD  E+AGE
Sbjct: 536  DWSTENGGDLVSNGDVSEESQRFSNNGEKRSTADAFVYPDEEEAAFLRSLGWD--ENAGE 593

Query: 1218 DEGLTEEEISAFYRDVSKYINSGPSSKTFLGTKPKFLVPLKLQ 1346
            +EGLTEEEISAFYR+   Y+   PSS+   G + +  VPL L+
Sbjct: 594  EEGLTEEEISAFYRE---YMKVRPSSRLCQGAQQQTKVPLPLE 633


>XP_010245093.1 PREDICTED: uncharacterized protein LOC104588732 isoform X2 [Nelumbo
            nucifera]
          Length = 616

 Score =  308 bits (788), Expect = 9e-93
 Identities = 177/306 (57%), Positives = 212/306 (69%), Gaps = 7/306 (2%)
 Frame = +1

Query: 22   NYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQSLPIGS 201
            N N  N  L G S  SS+ K AF+RDFPSLGA+E+   P+IGR+ SPGLS+A+QSLP+GS
Sbjct: 148  NQNTSNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGS 207

Query: 202  SAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAETLAQGPP 381
            SA+IG  GWTSALAEVP++IG NGT                    +TGLNMAETLAQ P 
Sbjct: 208  SALIGGDGWTSALAEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQAPS 267

Query: 382  RVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQHQNSSTH 561
            R + +PQLSV TQRLEELAIKQSRQLIP+TPSMPK  VLN LEK KPK+  +  + ++T 
Sbjct: 268  RARISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNATK 327

Query: 562  IVNH----SPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSGSKVANNP 726
             +      S RGAP +SD+SKTS  GKL VLK  RE+NG+S  AKD  SPT+ SKVANNP
Sbjct: 328  TIQQQQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVANNP 387

Query: 727  LAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFV-PPILEKRP-TSQTQSRNDFFNLVR 900
            LA APSA    PL+SP N+ +  S ERK   A+ +    +EKRP TSQ QSRNDFFNL+R
Sbjct: 388  LALAPSA-AFTPLKSPNNSKL--SNERKSAAASLMHGSSVEKRPTTSQVQSRNDFFNLMR 444

Query: 901  KKSMTN 918
            KK+  N
Sbjct: 445  KKTSGN 450



 Score = 99.0 bits (245), Expect = 5e-18
 Identities = 52/103 (50%), Positives = 71/103 (68%)
 Frame = +3

Query: 1038 DQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGE 1217
            D  +E G   I N ++ +  Q+ L+NGE HSS D  +Y +EEEAAFLRSLGWD  E+AGE
Sbjct: 497  DWSTENGSETISNGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLGWD--ENAGE 554

Query: 1218 DEGLTEEEISAFYRDVSKYINSGPSSKTFLGTKPKFLVPLKLQ 1346
            +EGLTEEEISAFY++   Y+   PSSK   G++ +  +P+ L+
Sbjct: 555  EEGLTEEEISAFYKE---YMKLRPSSKLCRGSQQQVKLPMPLE 594


>XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 isoform X1 [Nelumbo
            nucifera]
          Length = 645

 Score =  308 bits (788), Expect = 2e-92
 Identities = 177/306 (57%), Positives = 212/306 (69%), Gaps = 7/306 (2%)
 Frame = +1

Query: 22   NYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQSLPIGS 201
            N N  N  L G S  SS+ K AF+RDFPSLGA+E+   P+IGR+ SPGLS+A+QSLP+GS
Sbjct: 177  NQNTSNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGS 236

Query: 202  SAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAETLAQGPP 381
            SA+IG  GWTSALAEVP++IG NGT                    +TGLNMAETLAQ P 
Sbjct: 237  SALIGGDGWTSALAEVPMIIGNNGTGISSVQQATLGSSASGATNSSTGLNMAETLAQAPS 296

Query: 382  RVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQHQNSSTH 561
            R + +PQLSV TQRLEELAIKQSRQLIP+TPSMPK  VLN LEK KPK+  +  + ++T 
Sbjct: 297  RARISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNSLEKAKPKISVRTGEMNATK 356

Query: 562  IVNH----SPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSGSKVANNP 726
             +      S RGAP +SD+SKTS  GKL VLK  RE+NG+S  AKD  SPT+ SKVANNP
Sbjct: 357  TIQQQQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNGISPIAKDGQSPTNVSKVANNP 416

Query: 727  LAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFV-PPILEKRP-TSQTQSRNDFFNLVR 900
            LA APSA    PL+SP N+ +  S ERK   A+ +    +EKRP TSQ QSRNDFFNL+R
Sbjct: 417  LALAPSA-AFTPLKSPNNSKL--SNERKSAAASLMHGSSVEKRPTTSQVQSRNDFFNLMR 473

Query: 901  KKSMTN 918
            KK+  N
Sbjct: 474  KKTSGN 479



 Score = 99.0 bits (245), Expect = 6e-18
 Identities = 52/103 (50%), Positives = 71/103 (68%)
 Frame = +3

Query: 1038 DQMSEIGGGLICNSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGE 1217
            D  +E G   I N ++ +  Q+ L+NGE HSS D  +Y +EEEAAFLRSLGWD  E+AGE
Sbjct: 526  DWSTENGSETISNGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLGWD--ENAGE 583

Query: 1218 DEGLTEEEISAFYRDVSKYINSGPSSKTFLGTKPKFLVPLKLQ 1346
            +EGLTEEEISAFY++   Y+   PSSK   G++ +  +P+ L+
Sbjct: 584  EEGLTEEEISAFYKE---YMKLRPSSKLCRGSQQQVKLPMPLE 623


>CDO97516.1 unnamed protein product [Coffea canephora]
          Length = 599

 Score =  305 bits (781), Expect = 6e-92
 Identities = 170/306 (55%), Positives = 209/306 (68%), Gaps = 2/306 (0%)
 Frame = +1

Query: 1    SSSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAI 180
            S+SAS+N   +GN+ L       +VHK  F+RDFPSLG++ERQA  E+GR+PSPGL+TAI
Sbjct: 158  SNSASRNKSTDGNSLLDKGDSVGTVHKVVFERDFPSLGSEERQATSEVGRVPSPGLNTAI 217

Query: 181  QSLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTT-GLNMA 357
              LPI +SA+I    WTSALAEVP ++GG GT                    T+ GLNMA
Sbjct: 218  HGLPISASAIIAGDKWTSALAEVPAIVGGGGTGLSPGRQASLPSSPASLPSSTSAGLNMA 277

Query: 358  ETLAQGPPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQ 537
            ET+AQG PRVQA P+++ GTQRLEELAI+QSRQLIP+TPSMPK  +LN  +K K K GQ 
Sbjct: 278  ETVAQG-PRVQAAPKITSGTQRLEELAIRQSRQLIPMTPSMPKPSILNSSDKGKAKAGQP 336

Query: 538  QHQNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVSYA-KDSLSPTSGSKV 714
            QH  SS  +++ S RG P K+D SKTS+ GKL VLKP RERNGVS A KD+LSPTS ++ 
Sbjct: 337  QHPVSSP-LLSPSLRGGPVKTDASKTSNAGKLLVLKPPRERNGVSTASKDTLSPTSSTRA 395

Query: 715  ANNPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNL 894
            A + +A A S  G A  R P  NP+    ERK        P+LEK+P+SQ QSRNDFFNL
Sbjct: 396  ATSGIAVATSVTGLATSRGPAINPVSPGAERK-----HALPMLEKKPSSQAQSRNDFFNL 450

Query: 895  VRKKSM 912
            +RKKSM
Sbjct: 451  MRKKSM 456



 Score = 82.8 bits (203), Expect = 7e-13
 Identities = 43/87 (49%), Positives = 58/87 (66%), Gaps = 8/87 (9%)
 Frame = +3

Query: 1089 DGPQKSLDNGENHSSSDVI--------IYSEEEEAAFLRSLGWDESESAGEDEGLTEEEI 1244
            D P     NG  H+ +D+         ++SEEEEAAFL  LGW E+    +++GLTEEEI
Sbjct: 493  DVPSLDRLNGCQHTENDLFGIQSRSLPLFSEEEEAAFLHQLGWQEN---ADEDGLTEEEI 549

Query: 1245 SAFYRDVSKYINSGPSSKTFLGTKPKF 1325
            +AF+RD+SKY+NS PSSK+  G +PKF
Sbjct: 550  NAFFRDLSKYMNSKPSSKSLQGVQPKF 576


>GAV67368.1 hypothetical protein CFOL_v3_10874 [Cephalotus follicularis]
          Length = 625

 Score =  305 bits (781), Expect = 1e-91
 Identities = 170/304 (55%), Positives = 207/304 (68%), Gaps = 1/304 (0%)
 Frame = +1

Query: 10   ASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAIQSL 189
            A K+N ++G TRLAG S  SSVH +AF+RDFPSLGA+E Q  PEI R+ SPGLST+IQS 
Sbjct: 166  AKKSNRSDGITRLAGVSAVSSVHNSAFERDFPSLGAEESQGGPEISRVSSPGLSTSIQSF 225

Query: 190  PIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAETLA 369
            P+G+S+VIG  GWTSALAEVPV++G + T                     +GLNMAETL 
Sbjct: 226  PVGTSSVIGSDGWTSALAEVPVVMGTSTTGVASAQQSVSASSAPLSPSVMSGLNMAETLV 285

Query: 370  QGPPRVQATPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLNPLEKPKPKVGQQQHQN 549
            QGP R +  P  +VGTQRLEELAI+QSRQLIP+TPSMPK LV++P EK KPK+G QQH  
Sbjct: 286  QGPSRARTPPLSTVGTQRLEELAIRQSRQLIPMTPSMPKPLVVSPSEKSKPKIGPQQH-- 343

Query: 550  SSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVSYA-KDSLSPTSGSKVANNP 726
                 VNH+ RG P + D  KTS+ G+LQ+LK +R+ NG S A KDS SPTSG+K  N+P
Sbjct: 344  -LLQTVNHT-RGGPARPDSPKTSNDGRLQILKSSRDLNGASSAPKDSSSPTSGNKAVNSP 401

Query: 727  LAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFNLVRKK 906
                 SA GS PLRS  N+P   S +R P P        EKRP SQ QSRNDFF+L++KK
Sbjct: 402  RVVTSSATGSTPLRSSSNSPNF-SIDRNPAPFRV---SAEKRPISQAQSRNDFFSLLKKK 457

Query: 907  SMTN 918
            S T+
Sbjct: 458  SSTS 461



 Score = 75.9 bits (185), Expect = 1e-10
 Identities = 44/75 (58%), Positives = 52/75 (69%)
 Frame = +3

Query: 1074 NSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGEDEGLTEEEISAF 1253
            N  + D  Q+ L NGE HSS  VI+Y +EEEA FLRSLGW+E+   GEDEGLTEEEISAF
Sbjct: 519  NGIAYDVSQECLSNGEKHSSPGVILYPDEEEA-FLRSLGWEEN--GGEDEGLTEEEISAF 575

Query: 1254 YRDVSKYINSGPSSK 1298
             ++   Y    PSSK
Sbjct: 576  LKE---YTKLKPSSK 587


>XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao]
          Length = 620

 Score =  296 bits (758), Expect = 2e-88
 Identities = 167/309 (54%), Positives = 212/309 (68%), Gaps = 3/309 (0%)
 Frame = +1

Query: 1    SSSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAI 180
            SS+++K+N+++GN  L+G S     +K+AF+R+FP LGA+ERQ   EIGR+ SPGLSTA 
Sbjct: 160  SSTSNKSNHSSGNGLLSGVSTTVG-NKSAFEREFPVLGAEERQVGSEIGRVSSPGLSTAG 218

Query: 181  QSLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAE 360
            QSLP+G+SA+ G  GWTSALA++P  +G +GT                     TGLNMAE
Sbjct: 219  QSLPVGTSAISGSDGWTSALADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAE 278

Query: 361  TLAQGPPRVQATPQLSVGTQRLEELAIKQSRQLIP-VTPSMPKALVLNPLEKPKPKVGQQ 537
            TL QGP R +  P L+VGTQRLEELAIKQSRQL+P VT S PK LV++P EK KPKVGQQ
Sbjct: 279  TLVQGPSRARTPPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQ 338

Query: 538  QHQNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSG-SK 711
            QH + S +      RG  ++SD  K S+ G+L++LKP+RE NGVS   KD+LSPT+G SK
Sbjct: 339  QHASLSLNYT----RGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSK 394

Query: 712  VANNPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFN 891
            + N+PL   PSA  SAP RS  N+P  A+ ER   P       +EKRPT+Q QSRNDFFN
Sbjct: 395  LVNSPLNVTPSASASAPFRSSGNSPSFATAERNQTPFRI---NIEKRPTAQAQSRNDFFN 451

Query: 892  LVRKKSMTN 918
            L++KKS TN
Sbjct: 452  LLKKKSTTN 460



 Score = 80.1 bits (196), Expect = 5e-12
 Identities = 43/88 (48%), Positives = 60/88 (68%)
 Frame = +3

Query: 1074 NSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGEDEGLTEEEISAF 1253
            N D+  G Q+   NG+ H+  D  +Y +EEEAAFLRSLGW+  E+AG+DEGLTEEEISAF
Sbjct: 520  NGDAYAGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWE--ENAGDDEGLTEEEISAF 577

Query: 1254 YRDVSKYINSGPSSKTFLGTKPKFLVPL 1337
            + +   ++   PS+K F   + + +VPL
Sbjct: 578  FEE---HMKLKPSAKLF--HRMQSIVPL 600


>EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobroma cacao]
          Length = 620

 Score =  293 bits (750), Expect = 4e-87
 Identities = 165/309 (53%), Positives = 211/309 (68%), Gaps = 3/309 (0%)
 Frame = +1

Query: 1    SSSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAI 180
            SS+++K+N+++ N  L+G S     +K+ F+R+FP LGA+ERQ   EIGR+ SPGLSTA 
Sbjct: 160  SSTSNKSNHSSSNGLLSGVSTTVG-NKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAG 218

Query: 181  QSLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAE 360
            QSLP+G+SA+ G  GWTSALA++P  +G +GT                     TGLNMAE
Sbjct: 219  QSLPVGTSAISGSDGWTSALADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAE 278

Query: 361  TLAQGPPRVQATPQLSVGTQRLEELAIKQSRQLIP-VTPSMPKALVLNPLEKPKPKVGQQ 537
            TL QGP R +  P L+VGTQRLEELAIKQSRQL+P VT S PK LV++P EK KPKVGQQ
Sbjct: 279  TLVQGPSRARTPPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQ 338

Query: 538  QHQNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSG-SK 711
            QH + S +      RG  ++SD  K S+ G+L++LKP+RE NGVS   KD+LSPT+G SK
Sbjct: 339  QHASLSLNYT----RGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSK 394

Query: 712  VANNPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFN 891
            + N+PL+  PSA  SAP RS  N+P  A+ ER   P       +EKRPT+Q QSRNDFFN
Sbjct: 395  LVNSPLSVTPSASASAPFRSSGNSPSFATAERNQTPFRI---NIEKRPTAQAQSRNDFFN 451

Query: 892  LVRKKSMTN 918
            L++KKS TN
Sbjct: 452  LLKKKSTTN 460



 Score = 80.9 bits (198), Expect = 3e-12
 Identities = 43/88 (48%), Positives = 60/88 (68%)
 Frame = +3

Query: 1074 NSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGEDEGLTEEEISAF 1253
            N D+  G Q+   NG+ H+  D  +Y +EEEAAFLRSLGW+  E+AG+DEGLTEEEISAF
Sbjct: 520  NGDAYSGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWE--ENAGDDEGLTEEEISAF 577

Query: 1254 YRDVSKYINSGPSSKTFLGTKPKFLVPL 1337
            + +   ++   PS+K F   + + +VPL
Sbjct: 578  FEE---HMKLKPSAKLF--HRMQSIVPL 600


>EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobroma cacao]
          Length = 625

 Score =  293 bits (750), Expect = 4e-87
 Identities = 165/309 (53%), Positives = 211/309 (68%), Gaps = 3/309 (0%)
 Frame = +1

Query: 1    SSSASKNNYNNGNTRLAGSSPASSVHKTAFDRDFPSLGADERQADPEIGRIPSPGLSTAI 180
            SS+++K+N+++ N  L+G S     +K+ F+R+FP LGA+ERQ   EIGR+ SPGLSTA 
Sbjct: 165  SSTSNKSNHSSSNGLLSGVSTTVG-NKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAG 223

Query: 181  QSLPIGSSAVIGDGGWTSALAEVPVMIGGNGTXXXXXXXXXXXXXXXXXXXXTTGLNMAE 360
            QSLP+G+SA+ G  GWTSALA++P  +G +GT                     TGLNMAE
Sbjct: 224  QSLPVGTSAISGSDGWTSALADMPAGVGSSGTGVAVASQNVSASSASMASTTMTGLNMAE 283

Query: 361  TLAQGPPRVQATPQLSVGTQRLEELAIKQSRQLIP-VTPSMPKALVLNPLEKPKPKVGQQ 537
            TL QGP R +  P L+VGTQRLEELAIKQSRQL+P VT S PK LV++P EK KPKVGQQ
Sbjct: 284  TLVQGPSRARTPPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQ 343

Query: 538  QHQNSSTHIVNHSPRGAPTKSDISKTSSLGKLQVLKPARERNGVS-YAKDSLSPTSG-SK 711
            QH + S +      RG  ++SD  K S+ G+L++LKP+RE NGVS   KD+LSPT+G SK
Sbjct: 344  QHASLSLNYT----RGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSK 399

Query: 712  VANNPLAAAPSAVGSAPLRSPVNNPILASTERKPVPAAFVPPILEKRPTSQTQSRNDFFN 891
            + N+PL+  PSA  SAP RS  N+P  A+ ER   P       +EKRPT+Q QSRNDFFN
Sbjct: 400  LVNSPLSVTPSASASAPFRSSGNSPSFATAERNQTPFRI---NIEKRPTAQAQSRNDFFN 456

Query: 892  LVRKKSMTN 918
            L++KKS TN
Sbjct: 457  LLKKKSTTN 465



 Score = 80.9 bits (198), Expect = 3e-12
 Identities = 43/88 (48%), Positives = 60/88 (68%)
 Frame = +3

Query: 1074 NSDSCDGPQKSLDNGENHSSSDVIIYSEEEEAAFLRSLGWDESESAGEDEGLTEEEISAF 1253
            N D+  G Q+   NG+ H+  D  +Y +EEEAAFLRSLGW+  E+AG+DEGLTEEEISAF
Sbjct: 525  NGDAYSGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWE--ENAGDDEGLTEEEISAF 582

Query: 1254 YRDVSKYINSGPSSKTFLGTKPKFLVPL 1337
            + +   ++   PS+K F   + + +VPL
Sbjct: 583  FEE---HMKLKPSAKLF--HRMQSIVPL 605

