BLASTX nr result

ID: Lithospermum23_contig00010586 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Lithospermum23_contig00010586
         (2719 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [...   409   e-128
CDO97516.1 unnamed protein product [Coffea canephora]                 394   e-123
XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [...   394   e-122
KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus ...   376   e-115
XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [...   375   e-115
KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus ...   364   e-111
XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [...   362   e-110
KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara car...   356   e-109
XP_012828376.1 PREDICTED: uncharacterized protein LOC105949617 i...   353   e-107
KZV20788.1 hypothetical protein F511_30430 [Dorcoceras hygrometr...   353   e-107
KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp...   352   e-107
XP_012828377.1 PREDICTED: uncharacterized protein LOC105949617 i...   327   3e-98
XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 i...   325   3e-96
XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [...   320   4e-95
XP_017224884.1 PREDICTED: cell wall protein RBR3-like [Daucus ca...   318   3e-94
XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [...   318   2e-93
EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobro...   315   8e-93
EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobro...   315   1e-92
XP_010245093.1 PREDICTED: uncharacterized protein LOC104588732 i...   314   2e-92
XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao]                  314   3e-92

>XP_011087795.1 PREDICTED: uncharacterized protein LOC105169167 [Sesamum indicum]
          Length = 624

 Score =  409 bits (1050), Expect = e-128
 Identities = 274/642 (42%), Positives = 374/642 (58%), Gaps = 31/642 (4%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104
            M+++EPTLVP+WL+                      DD+   +V R  S  ++N    GR
Sbjct: 1    MERSEPTLVPEWLKNTGNLTGAGSISHS--------DDHAASRVARNKSFVNSNGHEFGR 52

Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924
            SS+SER  S +FRRS S+N S N R+ + F ++  DRD    VY+S ++D+++L D    
Sbjct: 53   SSSSERTTSSYFRRSSSSNSSGNFRSYSSFGRSQRDRDWEKDVYDSRDQDKSVLADHWHW 112

Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNAN----RA 1756
            DFSD LGN+   + ER  LRRS S++S +R +T P+ V     SA   S  NAN    R 
Sbjct: 113  DFSDPLGNSLLSKYERDGLRRSQSMVSGKRGDTWPKKVVTDLSSA---SGKNANGLLYRG 169

Query: 1755 SD-NVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTS 1585
            S       KA+FE+DFPSL  +E+    E+ RVPSP L+ +I + PV  S +I  +KWTS
Sbjct: 170  SPVGGRAKKATFEKDFPSLGADERAVVPEVGRVPSPGLSTAIQSLPVGTSGLIVGEKWTS 229

Query: 1584 ALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMA 1405
            ALAEVP LV +NGT +S+   A               S+ + A GS       +T+L+MA
Sbjct: 230  ALAEVPVLVGSNGTALSSVQQAAPS------------SSASVALGS-------TTSLNMA 270

Query: 1404 ETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTGQP 1228
            E VA  P R +TTPQLS  +QR +ELAIKQSRQLIPVTPS PKALV    +KPK K GQ 
Sbjct: 271  EAVAQGPSRAQTTPQLSVGTQRLEELAIKQSRQLIPVTPSMPKALVLTSSDKPKGKVGQQ 330

Query: 1227 H--------LIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRT 1072
                     L HSPRGG+ K DV+K S++GKLQVLKPVRE+NG +    DNLSP   S+ 
Sbjct: 331  QHSISSSLPLNHSPRGGAVKGDVAKASNVGKLQVLKPVREKNGVTPVVKDNLSPTSSSKV 390

Query: 1071 VCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRK 892
            V S  A + SV+GS++ R   NN  H   +RK  L +LEKR  SQ+  QSRNDFF+L+RK
Sbjct: 391  VTSTLAVSPSVSGSAATRGLPNNGVH---DRKPSLTVLEKRPTSQA--QSRNDFFNLVRK 445

Query: 891  KSISNSSPAPESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLE--YTNAK 718
            KS+ NSS A     + + S   L  G   +   +D   ++ +L + ++ K+ +   +N+ 
Sbjct: 446  KSMPNSSSAVADSAMANCS-SVLDTGTAISPSFSDKDVEIDILPSSNTPKAADVPLSNSL 504

Query: 717  VSDGHSYNEG---------ESLNSKKNGS--TCNXXXXXXXXXXXFLRSLGWEENTEEGG 571
             +D  S  +G         ++ N  +NG     +           FLRSLGW+EN++EG 
Sbjct: 505  SADRLSEEKGDLTSNGDACDAQNYVRNGKKYPSSDPIISEEEEAAFLRSLGWDENSDEGA 564

Query: 570  LTDEEINAFYKDVTKYINSKPSWKILQGMP-RFLLALESQIG 448
            LTDEEINAFY+D+TKYI+S PS++ILQG+  +FLL   S++G
Sbjct: 565  LTDEEINAFYRDLTKYIDSNPSFRILQGVQLKFLLPFGSELG 606


>CDO97516.1 unnamed protein product [Coffea canephora]
          Length = 599

 Score =  394 bits (1012), Expect = e-123
 Identities = 262/628 (41%), Positives = 355/628 (56%), Gaps = 20/628 (3%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104
            M+++EP+LVP+WL+                      DD+   K+ R  SS + ND  +GR
Sbjct: 1    MERSEPSLVPEWLKSSGSATGSGTTSHPLSPS----DDHAVSKLARNKSSVNHNDHEIGR 56

Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924
            SS S+R  + +FRRS S+NGS  +++ + F +NH  RD +  +YE  ++D  ++   + R
Sbjct: 57   SSVSDRTSASYFRRSSSSNGSGQMQSYSSFGRNHRGRDWDKDLYEPRDRDNLVVGGHKHR 116

Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANRASDNV 1744
            D+ D   NN     E+  LRRS S++S +R+E  P+   A S SA +    + N   D  
Sbjct: 117  DYLDPPVNNFPGNFEKDGLRRSQSMVSRKRNEIWPKRSIADSNSASRNKSTDGNSLLDKG 176

Query: 1743 ----LLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSA 1582
                 +HK  FE+DFPSL  EE+ A SE+ RVPSP L  +IH  P+SASA+I  DKWTSA
Sbjct: 177  DSVGTVHKVVFERDFPSLGSEERQATSEVGRVPSPGLNTAIHGLPISASAIIAGDKWTSA 236

Query: 1581 LAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSA--KAHAPGSP-TMDSSISTALS 1411
            LAEVP +V   G G                  GTG+S   +A  P SP ++ SS S  L+
Sbjct: 237  LAEVPAIV---GGG------------------GTGLSPGRQASLPSSPASLPSSTSAGLN 275

Query: 1410 MAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG 1234
            MAETVA   PRV+  P++++ +QR +ELAI+QSRQLIP+TPS PK  + N  +K KAK G
Sbjct: 276  MAETVAQ-GPRVQAAPKITSGTQRLEELAIRQSRQLIPMTPSMPKPSILNSSDKGKAKAG 334

Query: 1233 QP-HLIHSP------RGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSR 1075
            QP H + SP      RGG  K D SKTS+ GKL VLKP RERNG S    D LSP   +R
Sbjct: 335  QPQHPVSSPLLSPSLRGGPVKTDASKTSNAGKLLVLKPPRERNGVSTASKDTLSPTSSTR 394

Query: 1074 TVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMR 895
               S  A   SV G ++ R    N   P AERKH LP+LEK+ +SQ+  QSRNDFF+LMR
Sbjct: 395  AATSGIAVATSVTGLATSRGPAINPVSPGAERKHALPMLEKKPSSQA--QSRNDFFNLMR 452

Query: 894  KKSISNSSPAPESGPVISASD-KKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTNAK 718
            KKS+ +SS   ++G  +SAS   + GE E    PV      V  L  ++  +        
Sbjct: 453  KKSMPSSSSVADAGSAVSASTLDEPGELEVIPAPVIHEDEDVPSLDRLNGCQ-------- 504

Query: 717  VSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEINAFYK 538
                H+ N+   + S+      +           FL  LGW+EN +E GLT+EEINAF++
Sbjct: 505  ----HTENDLFGIQSR------SLPLFSEEEEAAFLHQLGWQENADEDGLTEEEINAFFR 554

Query: 537  DVTKYINSKPSWKILQGM-PRFLLALES 457
            D++KY+NSKPS K LQG+ P+F L L S
Sbjct: 555  DLSKYMNSKPSSKSLQGVQPKFPLLLSS 582


>XP_011092382.1 PREDICTED: uncharacterized protein LOC105172576 [Sesamum indicum]
          Length = 616

 Score =  394 bits (1012), Expect = e-122
 Identities = 272/643 (42%), Positives = 364/643 (56%), Gaps = 32/643 (4%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104
            M+++EPTL+P+WLR                      D+  T K+ R  S  ++N  +  R
Sbjct: 1    MERSEPTLIPEWLRSAGSLNGGGSISHS--------DEQTTTKLARNKSLVNSNGHDSAR 52

Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924
            S +S+R  S +FRRS S+NGS +LR+ + F +NHHDRD      +S +KD+++L D   R
Sbjct: 53   SFSSDRTTSSYFRRSSSSNGSGHLRSHSSFGRNHHDRDWEKDACDSRDKDKSVLGDRWHR 112

Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNAN----RA 1756
            DFSD++GN    + ER  LRRS S+IS +R +T  + V      A   S NN N    + 
Sbjct: 113  DFSDAMGNTLLSKFERDGLRRSQSMISGKRGDTWHKKVGTDLNIA---SGNNTNGLPSKG 169

Query: 1755 SDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSA 1582
            S    ++K +FE+DFPSL  EE+ A  E+ RVPSP ++ ++ + P+    +I  +KW SA
Sbjct: 170  SPIGGVNKTTFERDFPSLGAEERAAIPEVGRVPSPGVSSALQSLPIGTPTIIRGEKWRSA 229

Query: 1581 LAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAE 1402
            LAEVP LV  N TG+S+   A               S+ + A GS       +T+L+MAE
Sbjct: 230  LAEVPVLVGNNVTGISSVQQAAPS------------SSASVALGS-------TTSLNMAE 270

Query: 1401 TVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG-QP 1228
             VA  P R +TTPQLS  +QR +ELAIKQSRQLIPVTPS PK L     +K K K G Q 
Sbjct: 271  AVAQGPSRAQTTPQLSIGTQRLEELAIKQSRQLIPVTPSMPKPLAACSADKQKTKVGQQQ 330

Query: 1227 HLI-------HSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTV 1069
            H++        SPRGG  K DVSKTS++GKL VLKPVRE+NG +    +NLSP   S+ V
Sbjct: 331  HVVTSSLAANQSPRGGPVKADVSKTSNVGKLHVLKPVREKNGTTPVVKENLSPTSGSKLV 390

Query: 1068 CSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKK 889
             S P A  S++GS++ R   NN   P A+RK +  +LEKR  SQ+  QSRNDFF+ +RKK
Sbjct: 391  SS-PLAAPSLSGSAATRVLPNN---PVADRKPVWTVLEKRPTSQA--QSRNDFFNSVRKK 444

Query: 888  S-----------ISNSSPAPESGPVISASDKKLGEGEDAALPVT-DHSGKVGVL---ANI 754
            S           I+NSSP   +     +   KL E E    P T D +   GV     N+
Sbjct: 445  SMANSTSVADAAIANSSPVDTAPAASPSFSDKLTETEIVVAPNTQDRNASSGVNLSGENL 504

Query: 753  HSIKSLEYTNAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEG 574
               +S    N  V D  +Y      N KKN +  +           FLRSLGWEEN +EG
Sbjct: 505  SGTRSDTACNGDVCDAQNYVS----NGKKNHT--SDPIFSEEEEAAFLRSLGWEENADEG 558

Query: 573  GLTDEEINAFYKDVTKYINSKPSWKILQGM-PRFLLALESQIG 448
            GLTDEEI+AF++DVTKY++SKPS KILQ + P+ LL  +S IG
Sbjct: 559  GLTDEEISAFFRDVTKYVDSKPSLKILQAVQPKILLPFDSHIG 601


>KVH93465.1 hypothetical protein Ccrd_004483 [Cynara cardunculus var. scolymus]
          Length = 636

 Score =  376 bits (966), Expect = e-115
 Identities = 262/586 (44%), Positives = 342/586 (58%), Gaps = 18/586 (3%)
 Frame = -2

Query: 2172 DDYGTPKVVRKNSS-STNDQNLGRSSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHD 1996
            D+ G  K  R  S  + +D  LGR S S+R  S +FRR+ S+NGS +LR+ + F +NH D
Sbjct: 73   DEQGVSKATRNKSFVNISDNELGRPSVSDRTTSSYFRRT-SSNGSSHLRSYSSFGRNHRD 131

Query: 1995 RDSNDSVYESGNKDRTILEDFRRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPR 1816
            RD +  ++E   K++    D R RD+SD LGN    R E++ LRRSHS +S +R E+ PR
Sbjct: 132  RDWDKDIHEFREKEKP---DGRLRDYSDPLGNILPSRFEKEGLRRSHSSVSAKRGESWPR 188

Query: 1815 NVEAGSKSAYKISHNNANRASDN---VLLHKASFEQDFPSL--EEKLAASEIRRVPSPCL 1651
             V   S SA K SHNN +        +   K +FE+DFPSL  EEK    EI RVPSP L
Sbjct: 189  KVVVDSSSANKNSHNNGSALRSGAGAIGSVKTAFERDFPSLGAEEKQIDPEIGRVPSPGL 248

Query: 1650 TPSIHTFPVSASAMIGSDKWTSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVS 1471
            T +I + P+  SA+IG D WTSALAEVP +V +NG+  S        VP  +++  T +S
Sbjct: 249  TTAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNGSNTS--------VPPPLQS--TSIS 298

Query: 1470 AKAHAPGSPTMDSSISTALSMAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVT 1294
            A A          S++T  +MAET+A  PPR +T PQLS  +QR +ELA+KQSRQLIP+T
Sbjct: 299  ATA----------SMATGRNMAETLAQGPPRAQTAPQLSVGTQRLEELAVKQSRQLIPMT 348

Query: 1293 PSAPKALVPNPPEKPKAKTGQ-----PHLI---HSPRGGSAKPDVSKTSSIGKLQVLKPV 1138
            PS PKAL  N  +KPK+K GQ      HL+   HSPR  S K DVSKTSS+GKL VLKP 
Sbjct: 349  PSLPKALALNSSDKPKSKVGQLQLQSSHLVNHTHSPRPVSTKFDVSKTSSVGKLHVLKPS 408

Query: 1137 RERNGDSLPPMDNLSPKRDSRTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLL 958
            RERNG +    DNLSP   S+   S P A  SV GS+ +R   NN     A +  +   L
Sbjct: 409  RERNGITPIAKDNLSPTGASKLPNS-PLAVTSVVGSAPLRNLGNNPAVAVAVKPGVAATL 467

Query: 957  EKRAASQSQTQSRNDFFSLMRKKSISNSSP--APESGPVISASDKKLGEGEDAALPVTDH 784
            EKR +SQ+  QSRNDFF+LMRKKS++N+S    P++G  ISA DK           V D 
Sbjct: 468  EKRPSSQA--QSRNDFFNLMRKKSMTNNSSPVTPDTGSSISAGDKPTATEGGIDPAVVDG 525

Query: 783  SGKVGVLANIHSIKSLEYTNAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRS 604
            SG  GV  +  +   L   N + +        E  N K N S+             FLRS
Sbjct: 526  SG--GVQVSSGNKVDLSSCNGEAT--------ERSNGKNNSSSDAIILYSEEEEARFLRS 575

Query: 603  LGWEE-NTEEGGLTDEEINAFYKDVTKYINSKPSWKILQGMPRFLL 469
            LGWEE   EE GLT+EEI++FY+DV+KY+N + + KI +  P+ L+
Sbjct: 576  LGWEETGEEEEGLTEEEISSFYRDVSKYLNLQAASKIFK--PKLLM 619


>XP_002265987.2 PREDICTED: uncharacterized protein LOC100241871 [Vitis vinifera]
          Length = 665

 Score =  375 bits (963), Expect = e-115
 Identities = 266/671 (39%), Positives = 362/671 (53%), Gaps = 60/671 (8%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101
            MDK EP LVP+WL+                     SDD    K  RK   ++ND + GRS
Sbjct: 1    MDKTEPALVPEWLKSSGSVTGGGSTNHHFAPSLLQSDDGAALKPARKLMVNSNDHDTGRS 60

Query: 2100 SASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRRD 1921
            S  ER  S +FRRS S+NGS + R+ + F + + +R+    +++  +KD+++L D R RD
Sbjct: 61   SNLERTTSSYFRRSSSSNGSGHPRSFSSFGRTNREREWEKDIHDYRDKDKSVLSDHRHRD 120

Query: 1920 FSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR--ASDN 1747
            +SD LGN    R+ER  LRRS S+I+ +R +  PR V A   +  K  H+N +   AS  
Sbjct: 121  YSDPLGNILPGRLERDMLRRSQSMITGKRGDMWPRKVAADVSTVNKTIHSNGDGQLASGI 180

Query: 1746 VL--LHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSAL 1579
            V   + KA+F+++FPSL  E+K  A +I RV SP LT +I + P+  + +IG D WTSAL
Sbjct: 181  VTSSVQKAAFDRNFPSLGAEDKQGAPDIGRVTSPGLTSAIQSLPIGNTVVIGGDGWTSAL 240

Query: 1578 AEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAET 1399
            AEVP ++ +N                   T G     ++ +  S ++  S ++ L+MAET
Sbjct: 241  AEVPVIIGSN-------------------TTGVSSVQQSVSASSVSVAPSTTSGLNMAET 281

Query: 1398 VAHLPPRVKT--TPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG-Q 1231
            +   P R +   TPQLS  +QR +ELA+KQSRQLIP+TPS PK LVP+P +KPK+K G Q
Sbjct: 282  LVQGPARARANATPQLSVGTQRLEELALKQSRQLIPMTPSMPKTLVPSPSDKPKSKIGLQ 341

Query: 1230 P-HLI-HSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVP 1057
            P HL+ HS RGG A+ DV+KTS++GKL VLKP RERNG S    D+LSP   SR   S  
Sbjct: 342  PLHLVNHSQRGGPARSDVTKTSNVGKLHVLKPSRERNGVSPTAKDSLSPTMGSRVANSPL 401

Query: 1056 AATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKKSISN 877
            A T S AGS+S+R+  NN    +AER+  + L        SQ QSRNDFF+LMRKKS +N
Sbjct: 402  AVTPSAAGSASLRSPRNNPTLASAERRPSVVLTSVEKRPTSQAQSRNDFFNLMRKKSSTN 461

Query: 876  -SSPAPESGPVISASDKKLGE---GEDAALPVT---------DHSG-------------- 778
              S  PESGP +S+S  +  +    E    PVT         D+SG              
Sbjct: 462  PPSAVPESGPAVSSSVSEKSDELITEVVTAPVTPKGRDILSSDNSGLDWSNENRGDKTEN 521

Query: 777  ----KVGVLANIHSIKSLEYTNAKVSDGHSYNEGESLNSKKNGSTC-------------- 652
                  GV  N      ++  N    D    ++G+ ++   NG  C              
Sbjct: 522  GNNEACGVSQNDRD-DEIDNVNGDACDVSQRDQGDEVHD-GNGDACDVSQKFLDNGEKHS 579

Query: 651  --NXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEINAFYKDVTKYINSKPSWKILQGM-P 481
              +           FLRSLGWEEN E+ GLT+EEINAFYK+  K    KPS  +LQ M P
Sbjct: 580  SPDEVLYPDEEEAAFLRSLGWEENGEDEGLTEEEINAFYKECMKL---KPSSNLLQRMLP 636

Query: 480  RFLLALESQIG 448
            +    L+SQ+G
Sbjct: 637  KISPLLDSQMG 647


>KVI10980.1 hypothetical protein Ccrd_010614 [Cynara cardunculus var. scolymus]
          Length = 629

 Score =  364 bits (934), Expect = e-111
 Identities = 260/599 (43%), Positives = 349/599 (58%), Gaps = 27/599 (4%)
 Frame = -2

Query: 2172 DDYGTPKVVRKNSS-STNDQNLGRSSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHD 1996
            D+ G+ K  R  SS +++D +LGR+S S+R  S +FRR+   NGS +LR+ + F +NH D
Sbjct: 67   DEQGSSKSGRNKSSVNSSDNDLGRTSVSDRTTSSYFRRTSLGNGSTHLRSYSSFGRNHRD 126

Query: 1995 RDSNDSVYESGNKDRTILEDFRRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPR 1816
            RD +  +YE  +K+++   D R RD+SD L N    R E+  LRRSHS +S +R E+ PR
Sbjct: 127  RDWDKDIYEFWSKEKS---DNRHRDYSDPLDNILPSRFEKDGLRRSHSSVSGKRGESWPR 183

Query: 1815 NVEAGSKSAYKISHNNANR---ASDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCL 1651
             V +    A K SH+N         ++   K SFE+DFPSL  +EK A  +I RVPSP L
Sbjct: 184  KVVSDLSIANKSSHSNGTALLSGGSSLSNVKTSFERDFPSLGADEKQADPDIGRVPSPGL 243

Query: 1650 TPSIHTFPVSASAMIGSDKWTSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVS 1471
            + +I + P+  SA+IG D WTSALAEVP +V +NG                   N T VS
Sbjct: 244  SSAIQSLPIGNSAVIGGDGWTSALAEVPVIVGSNG-------------------NSTSVS 284

Query: 1470 AKAHAPGSPTMDSSISTALSMAETVAHLPPRVKTTPQ-----------LSTESQR-QELA 1327
                 P S T  +S++   +MAET+AH PPR +T PQ           L+  +QR +ELA
Sbjct: 285  QPVQ-PTSITATTSMTGGRNMAETLAHGPPRTQTAPQVAQMLLMGSTILTVGTQRLEELA 343

Query: 1326 IKQSRQLIPVTPSAPKALVPNPPEKPKAKTGQPHLI---HSPRGGSAKPDVSKTSSIGKL 1156
            +KQSRQLIP+TPS PKAL  +  +KPK K GQ  L+   H+PR  S K DVSKTS++GKL
Sbjct: 344  VKQSRQLIPMTPSMPKALALSSSDKPKLKIGQSQLVNHPHTPRPLSVKSDVSKTSTVGKL 403

Query: 1155 QVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERK 976
             VLKP RERNG S    ++LSP   S+   S P A  S  GS+ +R   NN G  A ERK
Sbjct: 404  LVLKPSRERNGISPTAKESLSPTGGSKLPNS-PLAVPSAIGSAPLRNMGNNPGVTAVERK 462

Query: 975  HLLPLLEKRAASQSQTQSRNDFFSLMRKKS-ISNSSPAPESGPVISASDKKLGEGEDAAL 799
              +  LEKR +SQ+  QSRN+FF+LMRKKS ISNSS AP++G  +S+S+K    G   A 
Sbjct: 463  PSVATLEKRPSSQA--QSRNNFFNLMRKKSMISNSSVAPDTGSSVSSSEK---PGAPVAP 517

Query: 798  PVTDHSGKVGVLANIHSIKSLEYT---NAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXX 628
            P   H G  G  +N      ++ T   +A V+   S N G      KN S  +       
Sbjct: 518  PA--HLG--GSESNTTVETKVDLTCKGDACVATVRSTNNG------KNHSGPDAVLCSEE 567

Query: 627  XXXXFLRSLGWEENT-EEGGLTDEEINAFYKDVTKYINSKPSWKILQG-MPRFLLALES 457
                FLRSLGW+E   EE GLT+EEI++FY++   Y+N KP+ KIL+G  P+ L+ + S
Sbjct: 568  EEARFLRSLGWDETAEEEEGLTEEEISSFYRN---YLNLKPTSKILKGTKPKPLMEISS 623


>XP_017259125.1 PREDICTED: uncharacterized protein LOC108228142 [Daucus carota subsp.
            sativus]
          Length = 620

 Score =  362 bits (928), Expect = e-110
 Identities = 257/621 (41%), Positives = 344/621 (55%), Gaps = 23/621 (3%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101
            M+K+EPTLVP+WL+                      D+  T K  R  S      N+G  
Sbjct: 1    MEKSEPTLVPEWLKSSGSVTGGVSTNHLNPSLHQ--DNQATLKAARNKSLV----NIGDH 54

Query: 2100 SASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRRD 1921
                R  S +FRRS S+NG+ +LR+   F +N+ DRD +  +++  +K+++ L D + R 
Sbjct: 55   DIGHRTTSSYFRRS-SSNGTSHLRSYGSFGRNNRDRDWDRDIHDIRDKEKSNLGDRKYRQ 113

Query: 1920 FSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR----AS 1753
            FSDS  +NS  R E+  LRR+ S IS    E  PR V +  K+  K +HNN N     +S
Sbjct: 114  FSDSFESNSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSS 173

Query: 1752 DNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSAL 1579
                +HKASF++DFPSL  EE+    EI RVPSP L  +I   P  +SA I    WTSAL
Sbjct: 174  PISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSAL 233

Query: 1578 AEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHA-PGSPTMDSSISTALSMAE 1402
            AEVP +                     + +NGT  S+  H+   S ++  S+ T L+MAE
Sbjct: 234  AEVPAM---------------------IGSNGTTASSVPHSVSSSASVVPSMMTGLNMAE 272

Query: 1401 TVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG--- 1234
            T+   PPRV+  PQLS E+QR +ELAIKQSRQLIPVTPS PKALV N  +K K K G   
Sbjct: 273  TLVQGPPRVQADPQLSVETQRLEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQ 332

Query: 1233 ---QPHLIH-SPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVC 1066
                 +L+H SPRG   K ++ KTSS+GKLQVLKP RERNG S    D LSP   S+   
Sbjct: 333  QSASTNLVHHSPRGAPTKNEIIKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLAN 392

Query: 1065 SVPAATHSVAGSSSVRAQVNNSGHPAAERKH-----LLPLLEKRAASQSQTQSRNDFFSL 901
            +  A   +  GS+ +R+ +N+S   +AERK      + P+LEKR + Q+  +SRNDFF+ 
Sbjct: 393  NPLAPALATVGSAPLRSSMNHSILVSAERKSAPPVMVTPMLEKRPSPQA--KSRNDFFNS 450

Query: 900  MRKKSISNSSPAPESGPVISASDKKLGE-GEDAALPVTDHSGK-VGVLANIHSIKSLEYT 727
            MRKKS++NSS A  S  V + S   LG+  E  A    D  G+ V V+ +    K  E  
Sbjct: 451  MRKKSMTNSSSA-VSNTVSAVSPSDLGKNSEGEASASLDSQGRDVPVVESSDEGKINECR 509

Query: 726  NAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEIN 550
            +  + + H      SL++  N S+ +           FLRSLGWEEN  E+ GLT+EEIN
Sbjct: 510  DGSIQNSH--GPQNSLDNGVNHSSTDVILSSEEEEAAFLRSLGWEENAGEDEGLTEEEIN 567

Query: 549  AFYKDVTKYINSKPSWKILQG 487
            AFY+DV+KYINS P  K L G
Sbjct: 568  AFYRDVSKYINSAPPSKTLLG 588


>KVI01779.1 hypothetical protein Ccrd_019964, partial [Cynara cardunculus var.
            scolymus]
          Length = 551

 Score =  356 bits (914), Expect = e-109
 Identities = 250/603 (41%), Positives = 343/603 (56%), Gaps = 21/603 (3%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVR-KNSSSTNDQNLGR 2104
            M++ EPT VP+WL+                      DD G  K +R K+  ++ D +LGR
Sbjct: 1    MERTEPTFVPEWLKSSGSLSTISHQFTSSSLHP---DDQGVSKSLRTKSLVNSGDNDLGR 57

Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924
            +S S+R  S +FRR+ S+NG+ +LR+ N F +NH DRD +  +YE  +K+++   D R R
Sbjct: 58   TSVSDRTTSSYFRRTSSSNGAAHLRSYNSFSRNHRDRDWDKDIYEFRDKEKS---DNRHR 114

Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR---AS 1753
            D+SD L N    R E+  LRRSHS +S +R E+ PR V AG K+     HNN +      
Sbjct: 115  DYSDHLANILPSRFEKDGLRRSHSSLSAKRGESWPRKV-AGDKNG----HNNGSALPSVG 169

Query: 1752 DNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSAL 1579
             +    KA+FE+DFPSL  EEK A +EI RVPSP LT +I + P+ +SA+I  D WTSAL
Sbjct: 170  TSSSSGKAAFERDFPSLGAEEKQADTEIGRVPSPGLTTAIQSLPIGSSAVICGDMWTSAL 229

Query: 1578 AEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAET 1399
            AEVP +V +NG+ +S +                    +   P S +  +S++T  +MAET
Sbjct: 230  AEVPMIVGSNGSNISVQ--------------------QPIQPTSVSATTSMTTGRNMAET 269

Query: 1398 VAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTGQ--- 1231
            +A  P R +TTPQLS  +QR +ELA+KQSRQLIP+TPS PKAL  N  +KPK K GQ   
Sbjct: 270  LAQGPSRARTTPQLSVGTQRLEELAVKQSRQLIPMTPSMPKALALNSSDKPKLKVGQSQL 329

Query: 1230 --PHLIHSP---RGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVC 1066
               H+++ P   R  S K DV+K S++GKL +LK  RERNG +    ++LSP   S+   
Sbjct: 330  QNSHIVNHPPSLRPVSVKSDVTKVSTVGKLHILKSSRERNGTTSTAKESLSPTGGSKLPN 389

Query: 1065 SVPAATHSVAGSSSVRAQVNNSGHP-AAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKK 889
            S P A   V GS+S+R   N  G    A+RK   P +EKR + Q+  QSRNDFF+LMRKK
Sbjct: 390  S-PLAVPVVVGSASLR---NTGGSTIVADRK---PCVEKRPSPQA--QSRNDFFNLMRKK 440

Query: 888  SISNSSPAP---ESGPVISASDKKLGEGEDAALP--VTDHSGKVGVLANIHSIKSLEYTN 724
            S++ +S +P   E+G   S +DK  GE +       V D S  V  L           + 
Sbjct: 441  SMATNSSSPGASEAGSSESTNDKP-GEPQVGGYDPVVVDRSCGVQTL-----------SE 488

Query: 723  AKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEINAF 544
             KV    + +  E  N++KN S+ +           FLRSLGWEE TEE GLT+EEIN+F
Sbjct: 489  NKVDFSCNGDATERSNNEKNHSSSDAILYSEEEEARFLRSLGWEETTEEEGLTEEEINSF 548

Query: 543  YKD 535
            Y+D
Sbjct: 549  YRD 551


>XP_012828376.1 PREDICTED: uncharacterized protein LOC105949617 isoform X1
            [Erythranthe guttata]
          Length = 575

 Score =  353 bits (905), Expect = e-107
 Identities = 252/635 (39%), Positives = 347/635 (54%), Gaps = 24/635 (3%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104
            MD++EP+LVPQWL+                      D++   +V R  S  +TN  + GR
Sbjct: 1    MDRSEPSLVPQWLKNSGSSTGGG-------------DNHPASRVARNKSFVNTNGNDFGR 47

Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924
            +S S +  S +FRRS S+N S + ++ + F +N  DRD     Y S +K+R +L   R R
Sbjct: 48   ASGSAKTTSSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHR 107

Query: 1923 -DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNAN----R 1759
             + S+ LGN S  + ER  LRRSHS+IS +  ET P+ V   S S      NN N    +
Sbjct: 108  YESSELLGNPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGS--GKNNGNGFLAK 165

Query: 1758 ASDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTS 1585
             S   + +KA+FE+DFPSL  +++    E+ RV SP L+ ++ + P+ +SA IG ++WTS
Sbjct: 166  GSPVGVANKATFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTS 225

Query: 1584 ALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTG-VSAKAHAPGSPTMDSSIS--TAL 1414
            ALAEVP L                     V +NGT  +S +  AP S T    +S  T+L
Sbjct: 226  ALAEVPML---------------------VVSNGTASLSVQQAAPSSTTASVVVSSTTSL 264

Query: 1413 SMAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKT 1237
            +MAE VA  P R +T PQLS  +QR +ELAIKQSRQLIPVTP+ PK LV +  +K K+K 
Sbjct: 265  NMAEAVAQGPTRAQTAPQLSLGTQRLEELAIKQSRQLIPVTPTMPKTLVLSSSDKQKSKV 324

Query: 1236 G--QPH-------LIHSPRGGS-AKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPK 1087
            G  Q H       +  SPRG   +KPD SK S++GKL VLKPVRE+NG +    D LSP 
Sbjct: 325  GLIQQHPTPSSLPINQSPRGAPPSKPDFSKASNVGKLHVLKPVREKNGVTPSVKDKLSPT 384

Query: 1086 RDSRTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFF 907
               + V S   A+                  P+A +  L   LEKR  +Q+Q  SRNDFF
Sbjct: 385  GSGKAVNSTLPAS------------------PSAVKPLLTTALEKRPTTQAQ--SRNDFF 424

Query: 906  SLMRKKSISNSSPAPESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYT 727
              MR+KS+SNSS A E+G  IS           AA+     +G V  L    ++++    
Sbjct: 425  KRMREKSVSNSSSASETGTAISPEKHAKVAVVPAAI-----TGAVEPLPEEKAVRTTCNG 479

Query: 726  NAK-VSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEIN 550
              + +S+G  YN    ++ ++                 FLRS+GW+EN +EGGLT+EEI+
Sbjct: 480  GVQHISNGKKYNSEPIISEEEEAK--------------FLRSMGWDENDDEGGLTEEEIS 525

Query: 549  AFYKDVTKYINSKPSWKILQGMP-RFLLALESQIG 448
            AFY+D TKYINSKPS +ILQG+  +FLL  +SQIG
Sbjct: 526  AFYRDFTKYINSKPSLRILQGVRLKFLLPFDSQIG 560


>KZV20788.1 hypothetical protein F511_30430 [Dorcoceras hygrometricum]
          Length = 601

 Score =  353 bits (906), Expect = e-107
 Identities = 248/622 (39%), Positives = 335/622 (53%), Gaps = 19/622 (3%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNS-SSTNDQNLGR 2104
            MDK+EPTLVP+WL+                      DD   PK+ R NS  S+N  + GR
Sbjct: 1    MDKSEPTLVPEWLKNSGNQSGGGSTLHS--------DDKSAPKLSRNNSFMSSNGHDFGR 52

Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924
            SS+SE+  S +F RS S+NGS NLR+ N F +N  DRD     Y+S +KD+++  D   R
Sbjct: 53   SSSSEKTTSSYFHRSSSSNGSGNLRSYNSFGRNRRDRDWEKDRYDSQDKDKSVSGDRWHR 112

Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNAN----RA 1756
             FSDS GN+ S + E   LRRS S  S    +T  + V   S SA     NN N    + 
Sbjct: 113  VFSDSSGNSFSGKFEWDGLRRSQSATSGPHGDTWTKKVVTDSSSA---GGNNTNTLLTKG 169

Query: 1755 SDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSA 1582
            +    + K  FE++FPSL  EE+    E+ RVPSP L+ +I + P+  +A +G +KWTSA
Sbjct: 170  APGGGVTKTRFERNFPSLGSEERAVIPEVGRVPSPGLSSAIQSLPIGTAAAVGGEKWTSA 229

Query: 1581 LAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAE 1402
            LAEVP +V +NG GVS+   +                       S  + SS +T L+MAE
Sbjct: 230  LAEVPVIVGSNGIGVSSVTQS----------------------ASTQLASSTTTTLNMAE 267

Query: 1401 TVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG-QP 1228
             VA  P R    PQ+S  +QR +ELAIKQSRQLIPVTPS PK LV N  +K K K G Q 
Sbjct: 268  AVAQGPSRSPAMPQISVGTQRLEELAIKQSRQLIPVTPSMPKTLVSNSSDKQKTKLGQQQ 327

Query: 1227 HLIHSPRGGSAKPDVSKTSS-IGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVPAA 1051
            H I+S     +K D+SK+SS +GKL VLK  RE+NG +    DNLSP      V S    
Sbjct: 328  HSINSLPINHSKSDMSKSSSNVGKLHVLKLTREKNGVAPVVKDNLSPTTAVNAVSSTLLT 387

Query: 1050 THSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKKSISNSS 871
            + SV G+ + +   N    P   RK  L +LEKR  SQ+Q QSR +FF+L+RKKS++ S+
Sbjct: 388  SPSVTGAVASKGPPN---MPVLNRKPSLAVLEKRNTSQAQAQSRKEFFNLVRKKSMAIST 444

Query: 870  PAP--------ESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTNAKV 715
             A         +SG  +S    +  E ED   P T         A++      E  +   
Sbjct: 445  SATDAENFSSVDSGHAVSPPPSETSEKEDVPAPNTSQIDDAQSSASLSDDLFPEKRDDVT 504

Query: 714  SDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEINAFYKD 535
                + +  + L +  N S              FLRSLGWEEN++EGGLT+EEI++F+KD
Sbjct: 505  CPDDTCSMPKYLGNGMNASM--DPLFSEEEEAAFLRSLGWEENSDEGGLTEEEISSFFKD 562

Query: 534  VTKYINSKPSWKILQGM-PRFL 472
             TKY NSKP+ +IL+ + P+F+
Sbjct: 563  ATKY-NSKPALRILEVVQPKFI 583


>KZM90816.1 hypothetical protein DCAR_021819 [Daucus carota subsp. sativus]
          Length = 617

 Score =  352 bits (904), Expect = e-107
 Identities = 255/621 (41%), Positives = 341/621 (54%), Gaps = 23/621 (3%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101
            M+K+EPTLVP+WL+                      D+  T K  R  S      N+G  
Sbjct: 1    MEKSEPTLVPEWLKSSGSVTGGVSTNHLNPSLHQ--DNQATLKAARNKSLV----NIGDH 54

Query: 2100 SASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRRD 1921
                R  S +FRRS S+NG+ +LR+   F +N+ DRD +  +++  +K+++ L D + R 
Sbjct: 55   DIGHRTTSSYFRRS-SSNGTSHLRSYGSFGRNNRDRDWDRDIHDIRDKEKSNLGDRKYRQ 113

Query: 1920 FSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR----AS 1753
            FSDS  +NS  R E+  LRR+ S IS    E  PR V +  K+  K +HNN N     +S
Sbjct: 114  FSDSFESNSLSRFEKDGLRRTQSTISRTGVEPWPRRVPSDLKNIDKSNHNNGNSRLAVSS 173

Query: 1752 DNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSAL 1579
                +HKASF++DFPSL  EE+    EI RVPSP L  +I   P  +SA I    WTSAL
Sbjct: 174  PISSVHKASFDRDFPSLGAEERQKDPEIGRVPSPGLGTAIQNLPTGSSAGIADGGWTSAL 233

Query: 1578 AEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHA-PGSPTMDSSISTALSMAE 1402
            AEVP +                     + +NGT  S+  H+   S ++  S+ T L+MAE
Sbjct: 234  AEVPAM---------------------IGSNGTTASSVPHSVSSSASVVPSMMTGLNMAE 272

Query: 1401 TVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG--- 1234
            T+   PPRV+  PQLS E+QR +ELAIKQSRQLIPVTPS PKALV N  +K K K G   
Sbjct: 273  TLVQGPPRVQADPQLSVETQRLEELAIKQSRQLIPVTPSLPKALVLNSSDKAKGKVGLQQ 332

Query: 1233 ---QPHLIH-SPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVC 1066
                 +L+H SPRG   K ++ KTSS+GKLQVLKP RERNG S    D LSP   S+   
Sbjct: 333  QSASTNLVHHSPRGAPTKNEIIKTSSLGKLQVLKPARERNGVSNTSKDTLSPTSSSKLAN 392

Query: 1065 SVPAATHSVAGSSSVRAQVNNSGHPAAERKH-----LLPLLEKRAASQSQTQSRNDFFSL 901
            +  A   +  GS+ +R+ +N+S   +AERK      + P+LEKR + Q+  +SRNDFF+ 
Sbjct: 393  NPLAPALATVGSAPLRSSMNHSILVSAERKSAPPVMVTPMLEKRPSPQA--KSRNDFFNS 450

Query: 900  MRKKSISNSSPAPESGPVISASDKKLGE-GEDAALPVTDHSGK-VGVLANIHSIKSLEYT 727
            MRKKS++NSS A  S  V + S   LG+  E  A    D  G+ V V+ +    K  E  
Sbjct: 451  MRKKSMTNSSSA-VSNTVSAVSPSDLGKNSEGEASASLDSQGRDVPVVESSDEGKINECR 509

Query: 726  NAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEIN 550
            +  + + H      SL++  N S+ +           FLRSLGWEEN  E+ GLT+EEIN
Sbjct: 510  DGSIQNSH--GPQNSLDNGVNHSSTDVILSSEEEEAAFLRSLGWEENAGEDEGLTEEEIN 567

Query: 549  AFYKDVTKYINSKPSWKILQG 487
            AFY+D   YINS P  K L G
Sbjct: 568  AFYRD---YINSAPPSKTLLG 585


>XP_012828377.1 PREDICTED: uncharacterized protein LOC105949617 isoform X2
            [Erythranthe guttata]
          Length = 550

 Score =  327 bits (839), Expect = 3e-98
 Identities = 234/607 (38%), Positives = 326/607 (53%), Gaps = 22/607 (3%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104
            MD++EP+LVPQWL+                      D++   +V R  S  +TN  + GR
Sbjct: 1    MDRSEPSLVPQWLKNSGSSTGGG-------------DNHPASRVARNKSFVNTNGNDFGR 47

Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924
            +S S +  S +FRRS S+N S + ++ + F +N  DRD     Y S +K+R +L   R R
Sbjct: 48   ASGSAKTTSSYFRRSSSSNSSGSSKSYSSFGRNQRDRDWEKDTYNSRDKERLVLGGDRHR 107

Query: 1923 -DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNAN----R 1759
             + S+ LGN S  + ER  LRRSHS+IS +  ET P+ V   S S      NN N    +
Sbjct: 108  YESSELLGNPSLSKYERDGLRRSHSMISGKHGETWPKKVVTESSSGS--GKNNGNGFLAK 165

Query: 1758 ASDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTS 1585
             S   + +KA+FE+DFPSL  +++    E+ RV SP L+ ++ + P+ +SA IG ++WTS
Sbjct: 166  GSPVGVANKATFERDFPSLGTDDRAVVPEVGRVASPGLSSALQSLPIGSSASIGGERWTS 225

Query: 1584 ALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSIS--TALS 1411
            ALAEVP LV +NGT                      +S +  AP S T    +S  T+L+
Sbjct: 226  ALAEVPMLVVSNGT--------------------ASLSVQQAAPSSTTASVVVSSTTSLN 265

Query: 1410 MAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG 1234
            MAE VA  P R +T PQLS  +QR +ELAIKQSRQLIPVTP+ PK LV +  +K K+K G
Sbjct: 266  MAEAVAQGPTRAQTAPQLSLGTQRLEELAIKQSRQLIPVTPTMPKTLVLSSSDKQKSKVG 325

Query: 1233 --QPH-------LIHSPRGG-SAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKR 1084
              Q H       +  SPRG   +KPD SK S++GKL VLKPVRE+NG +    D LSP  
Sbjct: 326  LIQQHPTPSSLPINQSPRGAPPSKPDFSKASNVGKLHVLKPVREKNGVTPSVKDKLSPTG 385

Query: 1083 DSRTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFS 904
              + V S   A+                  P+A +  L   LEKR  +Q+  QSRNDFF 
Sbjct: 386  SGKAVNSTLPAS------------------PSAVKPLLTTALEKRPTTQA--QSRNDFFK 425

Query: 903  LMRKKSISNSSPAPESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTN 724
             MR+KS+SNSS A E+G  IS         + A +P    +G V  L    ++++     
Sbjct: 426  RMREKSVSNSSSASETGTAISPEK----HAKVAVVPAA-ITGAVEPLPEEKAVRTTCNGG 480

Query: 723  AK-VSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENTEEGGLTDEEINA 547
             + +S+G  YN    ++ ++                 FLRS+GW+EN +EGGLT+EEI+A
Sbjct: 481  VQHISNGKKYNSEPIISEEEEAK--------------FLRSMGWDENDDEGGLTEEEISA 526

Query: 546  FYKDVTK 526
            FY+D TK
Sbjct: 527  FYRDFTK 533


>XP_010245092.1 PREDICTED: uncharacterized protein LOC104588732 isoform X1 [Nelumbo
            nucifera]
          Length = 645

 Score =  325 bits (833), Expect = 3e-96
 Identities = 251/656 (38%), Positives = 347/656 (52%), Gaps = 45/656 (6%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSS-STNDQNLGR 2104
            M K+EPTLVP+WL+                     SDD       R  SS S  D +  R
Sbjct: 1    MAKSEPTLVPEWLKGTGGITGAGSTTHHFASSSLQSDDNAVALPTRNRSSLSIGDYDTPR 60

Query: 2103 SSA-SERVKSPHFRRSFSNNGSV--------NLRTSNIFCKNHHDRDSNDSVYESGNKDR 1951
            SSA S+R  S + RRS S+NGS+          R+ + F ++H DRD    + +  +K+R
Sbjct: 61   SSAFSDRTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKER 120

Query: 1950 TILEDFRRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHN 1771
            ++  D R  DFSD L +  + R+E+ +LRRS S++S +R E  PR V A       +++ 
Sbjct: 121  SVPGDHRDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKVAAD------LNNG 174

Query: 1770 NANRASDNVLL---------HKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPV 1624
            N N+ + N LL          KA+FE+DFPSL  EEK    +I RV SP L+ ++ + P+
Sbjct: 175  NINQNTSNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPM 234

Query: 1623 SASAMIGSDKWTSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSP 1444
             +SA+IG D WTSALAEVP ++  NGTG+S+   A         T G+  S   ++    
Sbjct: 235  GSSALIGGDGWTSALAEVPMIIGNNGTGISSVQQA---------TLGSSASGATNS---- 281

Query: 1443 TMDSSISTALSMAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVP 1267
                  ST L+MAET+A  P R + +PQLS E+QR +ELAIKQSRQLIP+TPS PK  V 
Sbjct: 282  ------STGLNMAETLAQAPSRARISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVL 335

Query: 1266 NPPEKPK------------AKTGQPHLIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNG 1123
            N  EK K             KT Q   + S RG   + DVSKTS  GKL VLK  RE+NG
Sbjct: 336  NSLEKAKPKISVRTGEMNATKTIQQQQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNG 395

Query: 1122 DSLPPMDNLSPKRDSRTVCSVPAATHSVAGS---SSVRAQVNNSGHPAAERKHLLPLLEK 952
             S    D  SP   S+   +  A   S A +   S   ++++N    AA        +EK
Sbjct: 396  ISPIAKDGQSPTNVSKVANNPLALAPSAAFTPLKSPNNSKLSNERKSAAASLMHGSSVEK 455

Query: 951  RAASQSQTQSRNDFFSLMRKKSISN-SSPAPESGPVISAS--DKKLGEGEDAALPVTDHS 781
            R  + SQ QSRNDFF+LMRKK+  N SS AP+  PV+S+S  DK   +    A PV+  S
Sbjct: 456  RPTT-SQVQSRNDFFNLMRKKTSGNLSSAAPDPSPVVSSSLLDKSTEQTALPAAPVSPQS 514

Query: 780  GKVGVLANIHSIKSLEYTNAKVSDGHSYNEGES-LNSKKNGSTCNXXXXXXXXXXXFLRS 604
                         S E  +  +S+G++  E +  LN+ +  S+ +           FLRS
Sbjct: 515  SDAPSPDPSCLDWSTENGSETISNGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRS 574

Query: 603  LGWEENT-EEGGLTDEEINAFYKDVTKYINSKPSWKILQG---MPRFLLALESQIG 448
            LGW+EN  EE GLT+EEI+AFYK+   Y+  +PS K+ +G     +  + LES++G
Sbjct: 575  LGWDENAGEEEGLTEEEISAFYKE---YMKLRPSSKLCRGSQQQVKLPMPLESRVG 627


>XP_017258903.1 PREDICTED: uncharacterized protein LOC108227982 [Daucus carota subsp.
            sativus]
          Length = 591

 Score =  320 bits (821), Expect = 4e-95
 Identities = 239/621 (38%), Positives = 326/621 (52%), Gaps = 27/621 (4%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXS-DDYGTPKVVRKNSSSTND---QN 2113
            M+KNEPT VP+WL+                       DD  T K  R N SS +D    N
Sbjct: 1    MEKNEPTFVPEWLKSSGSVTSAVSTNHHQIASSSLLSDDRATLKSTR-NKSSIDDISAHN 59

Query: 2112 LGRSSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDF 1933
             G S  S+R  S +FRRS ++NGS  LR+   F + + D+  +    E  + D+  + D 
Sbjct: 60   SGSSPVSDRTTSSYFRRSSTSNGS-QLRSYGSFGRTNRDKGWDKDTNEYHDSDKLRIGDH 118

Query: 1932 RRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR-- 1759
            R R+FSD LG+N S R E+  L+R+ S IS + +E   R V A   S  K ++NN +   
Sbjct: 119  RHRNFSDPLGSNFSNRFEKDGLKRTQSSISGKYNEPWSRKVSADMNSFDKSNYNNGSSLL 178

Query: 1758 --ASDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKW 1591
              +S    + KA+F++DFPSL  +E+    E+RRVPSP L+ ++   P+  SA+ G   W
Sbjct: 179  AGSSAISTVRKAAFDRDFPSLGADERQTDYELRRVPSPGLSTNMQNLPIGYSAVTGEIGW 238

Query: 1590 TSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALS 1411
            TSALAEV   V  NG   S+ A A                     P S ++ SS+++ L+
Sbjct: 239  TSALAEVQVKVGANGINKSSVAQAAL-------------------PSSASVASSMTSGLN 279

Query: 1410 MAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTG 1234
            MAET+A  PP V  T Q S  +QR +E+AIKQS+QLIPVTPS PKALV N  EK K K  
Sbjct: 280  MAETLAQGPPHVHAT-QFSVGTQRLEEIAIKQSKQLIPVTPSMPKALVLNSSEKSKTKAA 338

Query: 1233 QP--------HLIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDS 1078
            Q         H  HSPRG   K D+SKTSS+GKLQVLKP RERN  S    D LSP   S
Sbjct: 339  QQQHQTSSTHHFNHSPRGTPMKSDMSKTSSLGKLQVLKPARERNDISYQTKDTLSPTNAS 398

Query: 1077 RTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLM 898
            +   +   A  SV    S+R+ + N   P      +  +LEK+ +  +Q +SRNDFF+L+
Sbjct: 399  KVPNNPLTAASSVGVPPSLRSPIKN---PIVASGVVPTVLEKKPS--AQLRSRNDFFNLV 453

Query: 897  RKKSISN-SSPAPESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEY--- 730
            RKKS++N SSP  +S   +S S  +      A  P     G+  +LAN     +++Y   
Sbjct: 454  RKKSLTNHSSPVVDSVSTVSQSILEQPSEHKAGAP---PPGEDSLLAN--QSDTVQYKMN 508

Query: 729  ---TNAKVSDGHSYNEGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTD 562
               +N    DG   +     N +   S+ +           FLRSLGW+EN  E+ GLT+
Sbjct: 509  GLISNRDACDGTPKSPDNGENGETRSSS-DVILCSEEEEAAFLRSLGWDENAGEDEGLTE 567

Query: 561  EEINAFYKDVTKYINSKPSWK 499
            EEI  FY+D +KYI  +PS K
Sbjct: 568  EEIREFYRDASKYIKPRPSSK 588


>XP_017224884.1 PREDICTED: cell wall protein RBR3-like [Daucus carota subsp. sativus]
          Length = 585

 Score =  318 bits (815), Expect = 3e-94
 Identities = 234/613 (38%), Positives = 329/613 (53%), Gaps = 19/613 (3%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVR-KNSSSTNDQNLGR 2104
            M+K+EP+ VP+WL+                      +D+ T K  R K S+  +  + GR
Sbjct: 1    MEKSEPSFVPEWLKSSGSVTVAVSTNHRQ-------NDHMTLKPTRNKLSADVSAHDSGR 53

Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924
            S  S+R  S +FRR+ S+NGS N R+   F +N+ DR  +    E  + DR  L D R +
Sbjct: 54   SPVSDRTTSSYFRRTSSSNGSGNSRSYGSFGRNNRDRGWDRDKNEYRDHDRLRLGDRRHQ 113

Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANR----A 1756
            ++S SLG++ S R E+  LRR+ S ++ +  E L R V A   S+ K ++NN++     +
Sbjct: 114  NYSGSLGSDFSDRFEKNGLRRTQSSVAGKHSEPLSRRVSADLNSSNKSNYNNSSSRLLGS 173

Query: 1755 SDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSA 1582
            S    + K SF++DFPSL  +E+     IR +PSP L+ ++ +     S +     WTSA
Sbjct: 174  SGISSVRKTSFDRDFPSLGADERQTDHGIRNIPSPGLSTNMQSLSTGYSTVANEVGWTSA 233

Query: 1581 LAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAE 1402
            LAEVP +V  NG   S+                     +A  P S ++ SS + +L+MAE
Sbjct: 234  LAEVPVMVGANGPITSS-------------------VLQAALPSSTSVPSSTAASLNMAE 274

Query: 1401 TVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEKPKAKTGQPH 1225
            T+A  P RV T PQ+S E+QR +ELAIKQSRQLIP+TPS PK+LV N  EK K K  Q  
Sbjct: 275  TLAQGPLRVDTAPQVSVETQRLEELAIKQSRQLIPMTPSMPKSLVLNSSEKSKVKVSQQQ 334

Query: 1224 ----LIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVP 1057
                 IHS RG   K DV KT S+GKLQVLKP RERNG S P +DNLS   DS TV + P
Sbjct: 335  HQTSSIHSLRGTLEKSDVPKTLSLGKLQVLKPARERNGVSYPEIDNLSLTNDS-TVANNP 393

Query: 1056 AATHSVAGSSSVRAQVNNSGHPAAERKH---LLP-LLEKRAASQSQTQSRNDFFSLMRKK 889
              T         R Q+ N       RK    ++P  LEK+ +  +Q QSRN+FF+L+RKK
Sbjct: 394  LTTLPAVVPPPSRTQIKNPNPLNVNRKPAAIMVPATLEKKPS--AQLQSRNEFFNLVRKK 451

Query: 888  SISNSSPAPESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSL-EYTNAKVS 712
            S++ SS   +S   +S    +       A P++   GK  + AN  ++    E  NA +S
Sbjct: 452  SLTKSSSVADSVSTVSQFVVEQPSETQTASPLS--QGKDSLSANQSNMDHYKENVNALIS 509

Query: 711  DGHSYN-EGESLNSKKNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEINAFYK 538
            + ++ N   +S  + +  S  +           FLRSLGW+EN  E+ GLT+EEIN FY+
Sbjct: 510  NINNGNGHQQSCGNGETRSRSDMILCSEEEEAAFLRSLGWDENAGEDEGLTEEEINEFYR 569

Query: 537  DVTKYINSKPSWK 499
            D +KYI    S K
Sbjct: 570  DASKYIKPGSSSK 582


>XP_010277689.1 PREDICTED: uncharacterized protein YMR317W-like [Nelumbo nucifera]
          Length = 655

 Score =  318 bits (815), Expect = 2e-93
 Identities = 258/663 (38%), Positives = 342/663 (51%), Gaps = 52/663 (7%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKN-SSSTNDQNLGR 2104
            M K EPTLVP+WL+                     SDD+      R   + ST D +  R
Sbjct: 1    MAKGEPTLVPEWLKGTGSITGGGNTTHHFASSSTHSDDHAVALTTRNRLTMSTGDYDTPR 60

Query: 2103 SSAS-ERVKSPHFRRSFSNNGSVN--------LRTSNIFCKNHHDRDSNDSVYESGNKDR 1951
            SSA  +R  S +FRRS S+NGS+          R+ + F ++H DRD      +  +K++
Sbjct: 61   SSAFLDRTSSAYFRRSSSSNGSMMHDKETSTYSRSYSSFTRSHRDRDWEKDTLDYRDKEK 120

Query: 1950 TILEDFRRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHN 1771
            +IL D R RD+SD L +  + R E+ +LRRS S+IS +R E   R V A + +    +HN
Sbjct: 121  SILGDHRDRDYSDPLASILTSRXEKDTLRRSQSMISGKRGEGWSRRVAADTNNGNN-NHN 179

Query: 1770 NANR----ASDNVLLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAM 1609
            N N      S    + KA+FE+DFPSL  EEK  A +I RV SP L+ S+ + P+ +SA+
Sbjct: 180  NGNGLLVGGSIVSSIQKAAFERDFPSLGAEEKQGALDIGRVSSPGLSSSVQSLPIGSSAV 239

Query: 1608 IGSDKWTSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSS 1429
            IG D WTSALAEVP ++  N  G S+   A            T  S+ + AP S      
Sbjct: 240  IGGDGWTSALAEVPVIIGNNSIGPSSVQQA------------TPASSTSGAPNS------ 281

Query: 1428 ISTALSMAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNPPEK 1252
             ST L+MAET+A  P R + +PQLS E+QR +ELAIKQSRQLIP+TPS PK    N  EK
Sbjct: 282  -STGLNMAETLAQAPSRTRISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSALNSSEK 340

Query: 1251 PK-------------AKTGQ------PHLI-HSPRGGSAKPDVSKTSSIGKLQVLKPVRE 1132
             K             AKT Q       HL+ HS RGG  + DV KTS  GKL VLK  RE
Sbjct: 341  AKPKAVVRTGEMGISAKTSQQQQLPSSHLVNHSLRGGPVRSDVPKTSHGGKLLVLKAPRE 400

Query: 1131 RNGDSLPPMDNLSPKRDSRTVCSVPAATHSVAGSSSVRAQVNNSGHPAAERKHLLPLL-- 958
            +NG S    D LSP   S+ V +        A +  +R+  NNS  P  ERK +   L  
Sbjct: 401  KNGISPSAKDGLSPTNASKVVNNSLVLAPLAAYAPPMRSP-NNSKLP-NERKSVASSLTH 458

Query: 957  ----EKRAASQSQTQSRNDFFSLMRKKSISN-SSPAPESGPVISAS-DKKLGEGEDA--A 802
                EKR  + SQ QSRNDFF+LMRKK+  N +S  P+  P  S+S  +K  E  +    
Sbjct: 459  GSAVEKRPTT-SQVQSRNDFFNLMRKKTSGNLASAVPDPSPTASSSLLEKSSEPTEVVPT 517

Query: 801  LPVTDHSGKVGVLANIHSIKSLEYTNAKVSDGHSYNEGESL-NSKKNGSTCNXXXXXXXX 625
             PV+  S             S E     VS+G    E +   N+ +  ST +        
Sbjct: 518  APVSPQSSDAPSSEPSGLDWSTENGGDLVSNGDVSEESQRFSNNGEKRSTADAFVYPDEE 577

Query: 624  XXXFLRSLGWEENT-EEGGLTDEEINAFYKDVTKYINSKPSWKILQG---MPRFLLALES 457
               FLRSLGW+EN  EE GLT+EEI+AFY++   Y+  +PS ++ QG     +  L LES
Sbjct: 578  EAAFLRSLGWDENAGEEEGLTEEEISAFYRE---YMKVRPSSRLCQGAQQQTKVPLPLES 634

Query: 456  QIG 448
             +G
Sbjct: 635  HVG 637


>EOX97398.1 Uncharacterized protein TCM_006425 isoform 1 [Theobroma cacao]
          Length = 625

 Score =  315 bits (808), Expect = 8e-93
 Identities = 223/621 (35%), Positives = 339/621 (54%), Gaps = 21/621 (3%)
 Frame = -2

Query: 2283 MMDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGR 2104
            +M+++EP+LVP+WL+                     SD++   +  R   S   D ++G 
Sbjct: 5    VMERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGDHDVGG 64

Query: 2103 SSASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRR 1924
            +S  +R  S +FRRS S+NGS +LR+ + F K H DRD +  +    +++++++ D R R
Sbjct: 65   TSVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNR 124

Query: 1923 DFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANRASDNV 1744
            +FSDSL N      E+  L RS S I+ +R +T P+ V + S ++ K +H+++N     V
Sbjct: 125  NFSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSGV 183

Query: 1743 ---LLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSAL 1579
               + +K+ FE++FP L  EE+  ASEI RV SP L+ +  + PV  SA+ GSD WTSAL
Sbjct: 184  STTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSAL 243

Query: 1578 AEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAET 1399
            A++P  V ++GTGV+                   V+++  +  S +M S+  T L+MAET
Sbjct: 244  ADMPAGVGSSGTGVA-------------------VASQNVSASSASMASTTMTGLNMAET 284

Query: 1398 VAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIP-VTPSAPKALVPNPPEKPKAKTGQPH 1225
            +   P R +T P L+  +QR +ELAIKQSRQL+P VT S PK LV +P EK K K GQ  
Sbjct: 285  LVQGPSRARTPPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQ 344

Query: 1224 ----LIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVP 1057
                 ++  RGG+++ D  K S+ G+L++LKP RE NG SL   DNLSP   S  + + P
Sbjct: 345  HASLSLNYTRGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSP 404

Query: 1056 -AATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKKSIS 880
             + T S + S+  R+  N+     AER      +       +Q QSRNDFF+L++KKS +
Sbjct: 405  LSVTPSASASAPFRSSGNSPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTT 464

Query: 879  NSSPA-----PESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTNAK- 718
            NS  +     P + P +S    +LG  EDA+  VT   G V   ++  SI  L   N   
Sbjct: 465  NSPSSVADRGPAASPSVSEKSDELGT-EDASTSVTLQGGSVP--SSEISIADLPTDNRSE 521

Query: 717  -VSDGHSYNEGESLNSK-KNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEINA 547
               +G +Y+  +  +S     +  +           FLRSLGWEEN  ++ GLT+EEI+A
Sbjct: 522  ITHNGDAYSGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISA 581

Query: 546  FYKDVTKYINSKPSWKILQGM 484
            F+++   ++  KPS K+   M
Sbjct: 582  FFEE---HMKLKPSAKLFHRM 599


>EOX97399.1 Uncharacterized protein TCM_006425 isoform 2 [Theobroma cacao]
          Length = 620

 Score =  315 bits (807), Expect = 1e-92
 Identities = 223/620 (35%), Positives = 338/620 (54%), Gaps = 21/620 (3%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101
            M+++EP+LVP+WL+                     SD++   +  R   S   D ++G +
Sbjct: 1    MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPTRNKLSVAGDHDVGGT 60

Query: 2100 SASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRRD 1921
            S  +R  S +FRRS S+NGS +LR+ + F K H DRD +  +    +++++++ D R R+
Sbjct: 61   SVLDRTTSAYFRRSSSSNGSAHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNRN 120

Query: 1920 FSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANRASDNV- 1744
            FSDSL N      E+  L RS S I+ +R +T P+ V + S ++ K +H+++N     V 
Sbjct: 121  FSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSSNGLLSGVS 179

Query: 1743 --LLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSALA 1576
              + +K+ FE++FP L  EE+  ASEI RV SP L+ +  + PV  SA+ GSD WTSALA
Sbjct: 180  TTVGNKSVFEREFPVLGAEERQVASEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSALA 239

Query: 1575 EVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAETV 1396
            ++P  V ++GTGV+                   V+++  +  S +M S+  T L+MAET+
Sbjct: 240  DMPAGVGSSGTGVA-------------------VASQNVSASSASMASTTMTGLNMAETL 280

Query: 1395 AHLPPRVKTTPQLSTESQR-QELAIKQSRQLIP-VTPSAPKALVPNPPEKPKAKTGQPH- 1225
               P R +T P L+  +QR +ELAIKQSRQL+P VT S PK LV +P EK K K GQ   
Sbjct: 281  VQGPSRARTPPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQH 340

Query: 1224 ---LIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVP- 1057
                ++  RGG+++ D  K S+ G+L++LKP RE NG SL   DNLSP   S  + + P 
Sbjct: 341  ASLSLNYTRGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPL 400

Query: 1056 AATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKKSISN 877
            + T S + S+  R+  N+     AER      +       +Q QSRNDFF+L++KKS +N
Sbjct: 401  SVTPSASASAPFRSSGNSPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTTN 460

Query: 876  SSPA-----PESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTNAK-- 718
            S  +     P + P +S    +LG  EDA+  VT   G V   ++  SI  L   N    
Sbjct: 461  SPSSVADRGPAASPSVSEKSDELGT-EDASTSVTLQGGSVP--SSEISIADLPTDNRSEI 517

Query: 717  VSDGHSYNEGESLNSK-KNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEINAF 544
              +G +Y+  +  +S     +  +           FLRSLGWEEN  ++ GLT+EEI+AF
Sbjct: 518  THNGDAYSGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAF 577

Query: 543  YKDVTKYINSKPSWKILQGM 484
            +++   ++  KPS K+   M
Sbjct: 578  FEE---HMKLKPSAKLFHRM 594


>XP_010245093.1 PREDICTED: uncharacterized protein LOC104588732 isoform X2 [Nelumbo
            nucifera]
          Length = 616

 Score =  314 bits (804), Expect = 2e-92
 Identities = 244/654 (37%), Positives = 341/654 (52%), Gaps = 43/654 (6%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101
            M K+EPTLVP+WL+                         GT  +    + ST       S
Sbjct: 1    MAKSEPTLVPEWLK-------------------------GTGGIT--GAGSTTHHFASSS 33

Query: 2100 SASERVKSPHFRRSFSNNGSV--------NLRTSNIFCKNHHDRDSNDSVYESGNKDRTI 1945
              S+R  S + RRS S+NGS+          R+ + F ++H DRD    + +  +K+R++
Sbjct: 34   LQSDRTSSAYSRRSSSSNGSIVHDKEIPSYTRSYSAFARSHRDRDWEKDILDFRDKERSV 93

Query: 1944 LEDFRRRDFSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNA 1765
              D R  DFSD L +  + R+E+ +LRRS S++S +R E  PR V A       +++ N 
Sbjct: 94   PGDHRDLDFSDPLVSILTSRIEKDTLRRSQSMVSGKRGEVWPRKVAAD------LNNGNI 147

Query: 1764 NRASDNVLL---------HKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSA 1618
            N+ + N LL          KA+FE+DFPSL  EEK    +I RV SP L+ ++ + P+ +
Sbjct: 148  NQNTSNGLLVGGSIVSSIQKAAFERDFPSLGAEEKPGTPDIGRVSSPGLSSAVQSLPMGS 207

Query: 1617 SAMIGSDKWTSALAEVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTM 1438
            SA+IG D WTSALAEVP ++  NGTG+S+   A         T G+  S   ++      
Sbjct: 208  SALIGGDGWTSALAEVPMIIGNNGTGISSVQQA---------TLGSSASGATNS------ 252

Query: 1437 DSSISTALSMAETVAHLPPRVKTTPQLSTESQR-QELAIKQSRQLIPVTPSAPKALVPNP 1261
                ST L+MAET+A  P R + +PQLS E+QR +ELAIKQSRQLIP+TPS PK  V N 
Sbjct: 253  ----STGLNMAETLAQAPSRARISPQLSVETQRLEELAIKQSRQLIPMTPSMPKTSVLNS 308

Query: 1260 PEKPK------------AKTGQPHLIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDS 1117
             EK K             KT Q   + S RG   + DVSKTS  GKL VLK  RE+NG S
Sbjct: 309  LEKAKPKISVRTGEMNATKTIQQQQLSSLRGAPMRSDVSKTSHGGKLLVLKAPREKNGIS 368

Query: 1116 LPPMDNLSPKRDSRTVCSVPAATHSVAGS---SSVRAQVNNSGHPAAERKHLLPLLEKRA 946
                D  SP   S+   +  A   S A +   S   ++++N    AA        +EKR 
Sbjct: 369  PIAKDGQSPTNVSKVANNPLALAPSAAFTPLKSPNNSKLSNERKSAAASLMHGSSVEKRP 428

Query: 945  ASQSQTQSRNDFFSLMRKKSISN-SSPAPESGPVISAS--DKKLGEGEDAALPVTDHSGK 775
             + SQ QSRNDFF+LMRKK+  N SS AP+  PV+S+S  DK   +    A PV+  S  
Sbjct: 429  TT-SQVQSRNDFFNLMRKKTSGNLSSAAPDPSPVVSSSLLDKSTEQTALPAAPVSPQSSD 487

Query: 774  VGVLANIHSIKSLEYTNAKVSDGHSYNEGES-LNSKKNGSTCNXXXXXXXXXXXFLRSLG 598
                       S E  +  +S+G++  E +  LN+ +  S+ +           FLRSLG
Sbjct: 488  APSPDPSCLDWSTENGSETISNGNASEESQRFLNNGEKHSSPDAFVYPDEEEAAFLRSLG 547

Query: 597  WEENT-EEGGLTDEEINAFYKDVTKYINSKPSWKILQG---MPRFLLALESQIG 448
            W+EN  EE GLT+EEI+AFYK+   Y+  +PS K+ +G     +  + LES++G
Sbjct: 548  WDENAGEEEGLTEEEISAFYKE---YMKLRPSSKLCRGSQQQVKLPMPLESRVG 598


>XP_007041568.2 PREDICTED: mucin-17 [Theobroma cacao]
          Length = 620

 Score =  314 bits (804), Expect = 3e-92
 Identities = 223/620 (35%), Positives = 336/620 (54%), Gaps = 21/620 (3%)
 Frame = -2

Query: 2280 MDKNEPTLVPQWLRXXXXXXXXXXXXXXXXXXXXXSDDYGTPKVVRKNSSSTNDQNLGRS 2101
            M+++EP+LVP+WL+                     SD++   +  R   S   D ++G +
Sbjct: 1    MERSEPSLVPEWLKSGGSVTGSGNSNHQFTSSSLHSDNHSALRPARNKLSVAGDHDVGGT 60

Query: 2100 SASERVKSPHFRRSFSNNGSVNLRTSNIFCKNHHDRDSNDSVYESGNKDRTILEDFRRRD 1921
            S  +R  S +FRRS S+NGSV+LR+ + F K H DRD +  +    +++++++ D R R+
Sbjct: 61   SVLDRTTSAYFRRSSSSNGSVHLRSYSSFTKGHRDRDWDKDINGYHDREKSVISDHRNRN 120

Query: 1920 FSDSLGNNSSYRVERKSLRRSHSVISDQRDETLPRNVEAGSKSAYKISHNNANRASDNV- 1744
            FSDSL N      E+  L RS S I+ +R +T P+ V + S ++ K +H++ N     V 
Sbjct: 121  FSDSLDNMLPSVFEKDVLWRSQS-ITGKRSDTWPKKVTSDSSTSNKSNHSSGNGLLSGVS 179

Query: 1743 --LLHKASFEQDFPSL--EEKLAASEIRRVPSPCLTPSIHTFPVSASAMIGSDKWTSALA 1576
              + +K++FE++FP L  EE+   SEI RV SP L+ +  + PV  SA+ GSD WTSALA
Sbjct: 180  TTVGNKSAFEREFPVLGAEERQVGSEIGRVSSPGLSTAGQSLPVGTSAISGSDGWTSALA 239

Query: 1575 EVPGLVKTNGTGVSAKAHALAEVPGLVKTNGTGVSAKAHAPGSPTMDSSISTALSMAETV 1396
            ++P  V ++GTGV+                   V+++  +  S +M S+  T L+MAET+
Sbjct: 240  DMPAGVGSSGTGVA-------------------VASQNVSASSASMASTTMTGLNMAETL 280

Query: 1395 AHLPPRVKTTPQLSTESQR-QELAIKQSRQLIP-VTPSAPKALVPNPPEKPKAKTGQPH- 1225
               P R +T P L+  +QR +ELAIKQSRQL+P VT S PK LV +P EK K K GQ   
Sbjct: 281  VQGPSRARTPPLLNVGTQRLEELAIKQSRQLVPLVTTSTPKILVVSPSEKSKPKVGQQQH 340

Query: 1224 ---LIHSPRGGSAKPDVSKTSSIGKLQVLKPVRERNGDSLPPMDNLSPKRDSRTVCSVPA 1054
                ++  RGG+++ D  K S+ G+L++LKP RE NG SL   DNLSP   S  + + P 
Sbjct: 341  ASLSLNYTRGGTSRSDSLKVSNEGRLRILKPSRELNGVSLMTKDNLSPTNGSSKLVNSPL 400

Query: 1053 -ATHSVAGSSSVRAQVNNSGHPAAERKHLLPLLEKRAASQSQTQSRNDFFSLMRKKSISN 877
              T S + S+  R+  N+     AER      +       +Q QSRNDFF+L++KKS +N
Sbjct: 401  NVTPSASASAPFRSSGNSPSFATAERNQTPFRINIEKRPTAQAQSRNDFFNLLKKKSTTN 460

Query: 876  SSPA-----PESGPVISASDKKLGEGEDAALPVTDHSGKVGVLANIHSIKSLEYTNAK-- 718
            S  +     P + P +S    +LG  EDA+  VT   G V   ++  SI  L   N    
Sbjct: 461  SPSSVADRGPAASPSVSEKSDELGT-EDASTSVTLQGGSVP--SSEISIADLPTDNRSEI 517

Query: 717  VSDGHSYNEGESLNSK-KNGSTCNXXXXXXXXXXXFLRSLGWEENT-EEGGLTDEEINAF 544
              +G +Y   +  +S     +  +           FLRSLGWEEN  ++ GLT+EEI+AF
Sbjct: 518  THNGDAYAGSQQCSSNGDRHARPDAFLYPDEEEAAFLRSLGWEENAGDDEGLTEEEISAF 577

Query: 543  YKDVTKYINSKPSWKILQGM 484
            +++   ++  KPS K+   M
Sbjct: 578  FEE---HMKLKPSAKLFHRM 594

