BLASTX nr result
ID: Rehmannia31_contig00018138
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia31_contig00018138 (895 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_011072884.1| uncharacterized protein LOC105157994 [Sesamu...   398   e-134
gb|KZV31558.1| hypothetical protein F511_07409 [Dorcoceras hygro...   377   e-126
ref|XP_023875628.1| uncharacterized protein LOC111988098 isoform...   343   e-113
ref|XP_018852507.1| PREDICTED: uncharacterized protein LOC109014...   331   e-108
ref|XP_022849761.1| uncharacterized protein LOC111371818 [Olea e...   329   e-108
ref|XP_022873064.1| uncharacterized protein LOC111392007 [Olea e...   327   e-106
emb|CDP12040.1| unnamed protein product [Coffea canephora]            327   e-106
ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241...   327   e-106
gb|PON62903.1| hydroxyproline-rich glycoprotein family protein [...   323   e-105
gb|PON84555.1| hydroxyproline-rich glycoprotein family protein [...   322   e-104
ref|XP_015896111.1| PREDICTED: mucin-2 [Ziziphus jujuba]              321   e-104
ref|XP_011080935.1| uncharacterized protein LOC105164075 isoform...   321   e-104
ref|XP_011080936.1| uncharacterized protein LOC105164075 isoform...   317   e-103
ref|XP_012836270.1| PREDICTED: COPII coat assembly protein sec16...   315   e-102
ref|XP_020550755.1| uncharacterized protein LOC105164075 isoform...   317   e-102
ref|XP_018829358.1| PREDICTED: uncharacterized protein LOC108997...   316   e-102
ref|XP_019151582.1| PREDICTED: uncharacterized protein LOC109148...
313 e-101 dbj|GAV76096.1| hypothetical protein CFOL_v3_19571 [Cephalotus f... 312 e-101 ref|XP_012086872.1| uncharacterized protein LOC105645786 [Jatrop... 311 e-100 ref|XP_010089083.1| uncharacterized protein LOC21407002 [Morus n... 311 e-100 >ref|XP_011072884.1| uncharacterized protein LOC105157994 [Sesamum indicum] Length = 455 Score = 398 bits (1022), Expect = e-134 Identities = 205/298 (68%), Positives = 223/298 (74%) Frame = -2 Query: 894 GVNGTDXXXXXXXXXXXXXXXXARGPHDSVQKRRWGSCLSLYSCFGSNKTKRIGHAAVIP 715 GVNG D RG DSVQKRRWG SLY CFGSNKTKRIGHA ++P Sbjct: 5 GVNGADALETINAAATAIASVETRGLQDSVQKRRWGRWWSLYWCFGSNKTKRIGHAVLVP 64 Query: 714 ETTPTRADNAPTSEHPSQPPSVIXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSAN 535 ETT AD APT+EHP+QPPS++ SA QSPTGLLS+TSVSA+ Sbjct: 65 ETTAAGAD-APTAEHPAQPPSIVLPFVAPPSSPASFLPSEPPSAAQSPTGLLSLTSVSAS 123 Query: 534 MYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFA 355 MYSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPES+HLTTPSSPEVPFA Sbjct: 124 MYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFA 183 Query: 354 RLLEPNLRNGEAGQRYPLTQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAA 175 RLLEPNL+N EAGQR+ ++ YEFQSYQLQPGSPVSHL PFPDR+FAA Sbjct: 184 RLLEPNLQNSEAGQRFHISPYEFQSYQLQPGSPVSHLISPSSGISGSGTSSPFPDREFAA 243 Query: 174 GYPFFLEFRTGNPPKLLDLDKIVRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 G+PFFLEFRTGNPPKLLDLD IV REW S Q SGAVTPDA GPRS D+ LLNRQ+SDV Sbjct: 244 GHPFFLEFRTGNPPKLLDLDNIVVREWGSRQESGAVTPDAAGPRSCDSCLLNRQNSDV 301 >gb|KZV31558.1| hypothetical protein F511_07409 [Dorcoceras hygrometricum] Length = 452 Score = 377 bits (967), Expect = e-126 Identities = 197/299 (65%), Positives = 210/299 (70%), Gaps = 1/299 (0%) Frame = -2 Query: 894 GVNGTDXXXXXXXXXXXXXXXXARGPHDSVQKRRWGSCLSLYSCFGSNKTK-RIGHAAVI 718 GVN D R P S QKRRWGSC SLY CFGS+K RIGHA ++ Sbjct: 4 GVNRADAIETISAAATAVASLENRVPQASFQKRRWGSCWSLYWCFGSHKNSHRIGHAVLV 63 Query: 717 PETTPTRADNAPTSEHPSQPPSVIXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSA 538 PET D P HPSQPPS++ SATQSPTGLLS+TS SA Sbjct: 64 
PETATAGTDVPPG--HPSQPPSIVLPFVAPPSSPASFLPSEPPSATQSPTGLLSLTSASA 121 Query: 537 NMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPF 358 NMYSPGGP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPF Sbjct: 122 NMYSPGGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPF 181 Query: 357 ARLLEPNLRNGEAGQRYPLTQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFA 178 ARL EPNL NGE QRYPL+QYEFQSYQLQPGSPVSHL PFPD +FA Sbjct: 182 ARLFEPNLHNGEGAQRYPLSQYEFQSYQLQPGSPVSHLISPCSGISGSGTASPFPDHEFA 241 Query: 177 AGYPFFLEFRTGNPPKLLDLDKIVRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 +G+PFFLEFRTGN PKLLDLDKI+ R WES QGSGAVTPDA G S D RL+ RQDSDV Sbjct: 242 SGHPFFLEFRTGNAPKLLDLDKIILRGWESRQGSGAVTPDAAGAISCDPRLMKRQDSDV 300 >ref|XP_023875628.1| uncharacterized protein LOC111988098 isoform X1 [Quercus suber] Length = 459 Score = 343 bits (879), Expect = e-113 Identities = 174/276 (63%), Positives = 200/276 (72%), Gaps = 1/276 (0%) Frame = -2 Query: 825 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 649 RGP +VQKRRWGSC SLY CFGS K KRIGHA ++PETT RAD +E+P Q P+ Sbjct: 31 RGPQATVQKRRWGSCWSLYWCFGSPKHRKRIGHAVLVPETTAPRAD-VSAAENPIQAPAQ 89 Query: 648 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 469 + SA QSP G+LS+TS+SANMYSPGGP SIF+IGPYAHETQL Sbjct: 90 VLPFVAPPSSPASFLQSEPPSAAQSPVGILSLTSISANMYSPGGPASIFSIGPYAHETQL 149 Query: 468 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 289 VSPP FS FTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PNLRNGEAG R+PL+ YE Sbjct: 150 VSPPAFSAFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNLRNGEAGSRFPLSHYE 209 Query: 288 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 109 F SYQL PGSPV L PFPD +FA G P +EFRTG+PPKLL+ +K+ Sbjct: 210 FHSYQLHPGSPVGQLISPSSGISGSGTSSPFPDHEFAGGGPLIVEFRTGDPPKLLNFEKL 269 Query: 108 VRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 EW S QGSG++TPDAV P SRD+ LLNRQ S+V Sbjct: 270 SNHEWGSRQGSGSLTPDAVRPTSRDSFLLNRQMSEV 305 >ref|XP_018852507.1| PREDICTED: uncharacterized protein LOC109014483 [Juglans regia] Length = 457 Score = 331 bits (848), Expect = e-108 Identities = 
174/276 (63%), Positives = 199/276 (72%), Gaps = 1/276 (0%) Frame = -2 Query: 825 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 649 R P +VQKRRWGSC S Y CFGS+K TKRIGHA ++PETT +R D A SE+ Q P+V Sbjct: 31 RAPQATVQKRRWGSCWSTYWCFGSHKHTKRIGHAVLVPETTSSRTD-ASGSENLIQTPAV 89 Query: 648 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 469 + SATQSP G+LS +S++ANMYSPGGP SIFAIGPYAHETQL Sbjct: 90 VFPFVAPPSSPASFLQSEPPSATQSPVGILSRSSIAANMYSPGGPASIFAIGPYAHETQL 149 Query: 468 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 289 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PNLRNGEA Q++P + YE Sbjct: 150 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNLRNGEACQKFPFSYYE 209 Query: 288 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 109 FQSYQL PGSPV L PFPD +FAAG LEFRTG+PPKLL L K+ Sbjct: 210 FQSYQLHPGSPVGQLISPSSGISGSGTSSPFPDHEFAAGGHHILEFRTGDPPKLLSLYKL 269 Query: 108 VRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 R+ S QGSG++TPDAV P S D LLN Q S+V Sbjct: 270 STRDHRSHQGSGSLTPDAVRPTSHDGFLLNGQISEV 305 >ref|XP_022849761.1| uncharacterized protein LOC111371818 [Olea europaea var. 
sylvestris] Length = 437 Score = 329 bits (844), Expect = e-108 Identities = 180/300 (60%), Positives = 201/300 (67%), Gaps = 2/300 (0%) Frame = -2 Query: 894 GVNGTDXXXXXXXXXXXXXXXXARGPHDSVQKRRWGSCLSLYSCFGSNKT-KRIGHAAVI 718 GVNGTD R P S+QKRRWGSC SLY CFGS+K KRIGHA ++ Sbjct: 4 GVNGTDVLETINAAATAISLNENRFPQSSIQKRRWGSCWSLYWCFGSHKNNKRIGHAVLV 63 Query: 717 PETTPTRADNAPTSEHPSQPPSVIXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSA 538 P+T T AD A E+P+QPPS++ SATQSPTGL TS+SA Sbjct: 64 PQTPETGAD-ALRYENPTQPPSIVVPFIAPPSSPASFFLSEPPSATQSPTGLSLHTSISA 122 Query: 537 NMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPF 358 +MYSP GP SIFAIGPYA+ETQLVSPPVFST TTEPSTAPFTPPPE +HLTTPSSPEVPF Sbjct: 123 SMYSPSGPASIFAIGPYAYETQLVSPPVFSTITTEPSTAPFTPPPEYVHLTTPSSPEVPF 182 Query: 357 ARLLEPNLRNGEAGQRYPLTQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFA 178 ARLLEPNL+N EAG+RYP+ YEFQSYQLQPGSPVSHL Sbjct: 183 ARLLEPNLQNREAGERYPIFHYEFQSYQLQPGSPVSHLI----------------SPSSG 226 Query: 177 AGYPFF-LEFRTGNPPKLLDLDKIVRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 PF EF TG+P KL +LDK+V RE ES QGSG +TPDAVGP RD+ LLNRQDSDV Sbjct: 227 TSSPFLECEFGTGDPSKLFNLDKLVLREGESHQGSGTLTPDAVGPGPRDSLLLNRQDSDV 286 >ref|XP_022873064.1| uncharacterized protein LOC111392007 [Olea europaea var. 
sylvestris] Length = 454 Score = 327 bits (837), Expect = e-106 Identities = 169/271 (62%), Positives = 200/271 (73%), Gaps = 1/271 (0%) Frame = -2 Query: 810 SVQKRRWGSCLSLYSCFGSNKTKR-IGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXX 634 SVQKRRW S SL+ CFGS+K K+ I HA ++PET T AD APT+E+P Q S++ Sbjct: 34 SVQKRRWRSYWSLFWCFGSHKNKQQIAHAVLVPETPTTGAD-APTAENPPQLLSIVLPFV 92 Query: 633 XXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPV 454 SATQSPTGLLS+TS+SANM+SPGG SIFAIGPYA+ETQ+VSPPV Sbjct: 93 APPSSPASFLPSGPPSATQSPTGLLSLTSISANMHSPGGRPSIFAIGPYAYETQIVSPPV 152 Query: 453 FSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQ 274 FSTFTTEPSTAP+TPPPES+HLTTPSSPEVPFARLLEP+ ++GE GQRYPL+QYEFQSYQ Sbjct: 153 FSTFTTEPSTAPYTPPPESVHLTTPSSPEVPFARLLEPSFQSGEGGQRYPLSQYEFQSYQ 212 Query: 273 LQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREW 94 LQPGSPVSHL PFPD + A G+P F+EF+TGN K L+ DKIV R+W Sbjct: 213 LQPGSPVSHLISPSSGISGSGTSSPFPDLEGAPGHPLFVEFKTGNHAKFLNFDKIVLRKW 272 Query: 93 ESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 ES GSGA+T D P+S D+ LL+ Q SD+ Sbjct: 273 ESFHGSGALTSDIARPKSHDSFLLSHQGSDI 303 >emb|CDP12040.1| unnamed protein product [Coffea canephora] Length = 466 Score = 327 bits (838), Expect = e-106 Identities = 172/275 (62%), Positives = 194/275 (70%), Gaps = 1/275 (0%) Frame = -2 Query: 825 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 649 R P VQKRRW SC SLY CFGS K TKRIGHA ++PE RAD P E+ +Q SV Sbjct: 36 RVPQVGVQKRRWASCWSLYWCFGSYKHTKRIGHAVLVPEPIAPRAD-PPAVENQTQAASV 94 Query: 648 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 469 SATQSP GLLS+TS+SA+MYSPGGP S+FAIGPYAHETQL Sbjct: 95 ALPFIAPPSSPASFLQSEPPSATQSPPGLLSLTSMSASMYSPGGPASMFAIGPYAHETQL 154 Query: 468 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 289 V+PPVFSTFTTEPSTAPFTPPPES+H+TTPSSPEVPFARLL+P +N + GQRYPL QYE Sbjct: 155 VTPPVFSTFTTEPSTAPFTPPPESVHMTTPSSPEVPFARLLDPIDQNCQDGQRYPLPQYE 214 Query: 288 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 109 FQSYQLQPGSP SHL PFPD +F G P 
FLEFR+G+PPKLL+L+KI Sbjct: 215 FQSYQLQPGSPASHLISPSSGISGSGTSSPFPDGEFVYGRPHFLEFRSGDPPKLLNLEKI 274 Query: 108 VRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSD 4 EW S QGSG +TPD V PR R+ LL+ Q SD Sbjct: 275 APHEWGSRQGSGTITPDTVAPRYRNGFLLDNQKSD 309 >ref|XP_002270742.2| PREDICTED: uncharacterized protein LOC100241023 [Vitis vinifera] Length = 479 Score = 327 bits (838), Expect = e-106 Identities = 169/276 (61%), Positives = 195/276 (70%), Gaps = 1/276 (0%) Frame = -2 Query: 825 RGPHDSVQKRRWGSCLSLYSCFGSNKTKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVI 646 R P +VQKRRWGSC Y CF S K KRIGHA + PE+ P +E+ +Q P+++ Sbjct: 31 RVPQPTVQKRRWGSCWGEYWCFRSPKDKRIGHAVLAPESRAP-GSGVPAAENLTQAPTIV 89 Query: 645 XXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLV 466 SATQSP+GLLS+TS++AN+YSPGGP SIFAIGPYAHETQLV Sbjct: 90 LPFVAPPSSPASFLQSEPPSATQSPSGLLSLTSINANIYSPGGPASIFAIGPYAHETQLV 149 Query: 465 SPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEF 286 SPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+L +PN RNGEAG R+ L+QYEF Sbjct: 150 SPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLFDPNNRNGEAGHRFLLSQYEF 209 Query: 285 QSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDF-AAGYPFFLEFRTGNPPKLLDLDKI 109 QSYQL PGSPV HL PFPDRDF +G FLEFR G PPKLL LDK+ Sbjct: 210 QSYQLYPGSPVGHLISPSSGISGSGTSSPFPDRDFVCSGSSQFLEFRAGGPPKLLTLDKL 269 Query: 108 VRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 EW S GSG++TPDA+GP SRD +L+RQ SDV Sbjct: 270 SNHEWGSRIGSGSITPDALGPPSRDGSVLDRQVSDV 305 >gb|PON62903.1| hydroxyproline-rich glycoprotein family protein [Parasponia andersonii] Length = 460 Score = 323 bits (827), Expect = e-105 Identities = 165/276 (59%), Positives = 196/276 (71%), Gaps = 1/276 (0%) Frame = -2 Query: 825 RGPHDSVQKRRWGSCLSLYSCFGSNKT-KRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 649 R P SV+KRRWGSC S+Y CFGS K KRIGHA ++PET ++AP +E+P+QPP++ Sbjct: 34 RAPQASVRKRRWGSCWSIYRCFGSPKNRKRIGHAVLVPETAQP-GNSAPRAENPAQPPAI 92 Query: 648 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 469 + SATQSP GL+S VSA+MYSPGGP SIFAIGPYAHETQL Sbjct: 93 
VLPFIAPPSSPASFLQSEPPSATQSPAGLIS---VSASMYSPGGPTSIFAIGPYAHETQL 149 Query: 468 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 289 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+P +R+GE GQR+P+ E Sbjct: 150 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPKIRSGEPGQRFPIFHNE 209 Query: 288 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 109 FQSYQLQPGSPV HL PFPD +F + P FLEFRTG+PPKLL+LDK+ Sbjct: 210 FQSYQLQPGSPVGHLISPSSGISGSGTSSPFPDGEFVSSGPHFLEFRTGDPPKLLNLDKL 269 Query: 108 VRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 +W S QGSG++TPD V P D +L Q +V Sbjct: 270 SNFDWGSRQGSGSLTPDTVKPTPSDGFVLKTQTFEV 305 >gb|PON84555.1| hydroxyproline-rich glycoprotein family protein [Trema orientalis] Length = 460 Score = 322 bits (825), Expect = e-104 Identities = 165/276 (59%), Positives = 197/276 (71%), Gaps = 1/276 (0%) Frame = -2 Query: 825 RGPHDSVQKRRWGSCLSLYSCFGSNKT-KRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 649 R P SV+KRRWGSC S+Y CFGS K KRIGHA ++PET ++AP +E+P+Q P++ Sbjct: 34 RAPQASVRKRRWGSCWSIYRCFGSPKNRKRIGHAVLVPETAQP-GNSAPRAENPAQAPAI 92 Query: 648 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 469 + SATQSP GL+S VSA+MYSPGGP SIFAIGPYAHETQL Sbjct: 93 VLPFIAPPSSPASFLQSEPPSATQSPAGLIS---VSASMYSPGGPTSIFAIGPYAHETQL 149 Query: 468 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 289 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+P +R+GE GQR+P+ E Sbjct: 150 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPKIRSGEPGQRFPIFHNE 209 Query: 288 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 109 FQSYQLQPGSPV HL PFPD +FA+ P FLEFRTG+PPKLL+LDK+ Sbjct: 210 FQSYQLQPGSPVGHLISPSSGISGSGTSSPFPDGEFASSGPHFLEFRTGDPPKLLNLDKL 269 Query: 108 VRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 + +W S QGSG++TPD V P D +L Q +V Sbjct: 270 SKFDWGSRQGSGSLTPDTVKPTPSDGFVLKTQTFEV 305 >ref|XP_015896111.1| PREDICTED: mucin-2 [Ziziphus jujuba] Length = 440 Score = 321 bits (822), Expect = e-104 Identities = 167/267 (62%), Positives = 193/267 (72%), Gaps = 2/267 (0%) Frame = -2 Query: 813 
DSVQKRRWGSCLSLYSCFGSNKT-KRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXX 637 D +QKRRWGSCLSLY CFGS K+ KRIGHA ++PET D AP +E+P+Q P+ + Sbjct: 8 DFMQKRRWGSCLSLYWCFGSLKSRKRIGHAVLVPETIAAGPD-APRAENPTQAPANVLPF 66 Query: 636 XXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPP 457 S TQSP G+LS+TS+SANMYSPGGP SIFAIGPYAHETQLVSPP Sbjct: 67 VAPPSSPASFLQSEPPSVTQSPAGVLSLTSISANMYSPGGPASIFAIGPYAHETQLVSPP 126 Query: 456 VFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSY 277 VFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+P++RN EAGQR+PL+ YEFQSY Sbjct: 127 VFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPHIRNCEAGQRFPLSHYEFQSY 186 Query: 276 QLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRRE 97 QL PGSPV L PFPD +FA G FLEF+ GNPPKLLDLDK+ + Sbjct: 187 QLYPGSPVGQLISPRSGISGSGTSSPFPDGEFAPGGNHFLEFQPGNPPKLLDLDKLSNFD 246 Query: 96 WESCQGSGAVTPDAVG-PRSRDNRLLN 19 W S QGSG++TPD G + D LLN Sbjct: 247 WGSRQGSGSLTPDGAGKSTTSDGFLLN 273 >ref|XP_011080935.1| uncharacterized protein LOC105164075 isoform X2 [Sesamum indicum] Length = 444 Score = 321 bits (822), Expect = e-104 Identities = 179/301 (59%), Positives = 201/301 (66%), Gaps = 3/301 (0%) Frame = -2 Query: 894 GVNGTDXXXXXXXXXXXXXXXXARGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVI 718 G+NGTD R SVQKRRWGSC SLY CFGS K KRIG A ++ Sbjct: 5 GINGTDSLETINAAAAAIESRVRRA---SVQKRRWGSCWSLYRCFGSYKHNKRIGRAVIV 61 Query: 717 PETTPTRADNAPTSEHPSQPPSVIXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSA 538 PET+ + D PT+EHP +PP + S+ QSPTG+LS+TSVSA Sbjct: 62 PETSASVMD-VPTAEHPPRPPPLELPFVVPPSSPASFLPSDPPSSAQSPTGVLSLTSVSA 120 Query: 537 NMYSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPF 358 NMYSPGGP SIFAIGPYAHETQLVSPPVFSTF TEPSTAP+T PPES+HLTTPSSPEVPF Sbjct: 121 NMYSPGGPPSIFAIGPYAHETQLVSPPVFSTFATEPSTAPYT-PPESVHLTTPSSPEVPF 179 Query: 357 ARLLEPNLRNGEAGQRYPLTQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFA 178 +RLLEP L+NGEA QRY +QYEFQSYQLQPGSPVSHL P P+ +FA Sbjct: 180 SRLLEPTLQNGEACQRYGFSQYEFQSYQLQPGSPVSHL-----ISPSSGTSSPLPELEFA 234 Query: 177 AGYPFFLEFRTGNPPKLLDLDKIVRREWESCQGSGAVTPDAVG--PRSRDNRLLNRQDSD 4 G PF L 
F TG+PPKLLDLDKI R EWES + SG +PDA PRS + NRQ SD Sbjct: 235 TGIPFLLGFTTGHPPKLLDLDKIARGEWESREVSGEASPDATATEPRSSNCCHYNRQHSD 294 Query: 3 V 1 V Sbjct: 295 V 295 >ref|XP_011080936.1| uncharacterized protein LOC105164075 isoform X3 [Sesamum indicum] Length = 416 Score = 317 bits (811), Expect = e-103 Identities = 171/272 (62%), Positives = 193/272 (70%), Gaps = 3/272 (1%) Frame = -2 Query: 807 VQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXX 631 +QKRRWGSC SLY CFGS K KRIG A ++PET+ + D PT+EHP +PP + Sbjct: 3 IQKRRWGSCWSLYRCFGSYKHNKRIGRAVIVPETSASVMD-VPTAEHPPRPPPLELPFVV 61 Query: 630 XXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVF 451 S+ QSPTG+LS+TSVSANMYSPGGP SIFAIGPYAHETQLVSPPVF Sbjct: 62 PPSSPASFLPSDPPSSAQSPTGVLSLTSVSANMYSPGGPPSIFAIGPYAHETQLVSPPVF 121 Query: 450 STFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQL 271 STF TEPSTAP+T PPES+HLTTPSSPEVPF+RLLEP L+NGEA QRY +QYEFQSYQL Sbjct: 122 STFATEPSTAPYT-PPESVHLTTPSSPEVPFSRLLEPTLQNGEACQRYGFSQYEFQSYQL 180 Query: 270 QPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREWE 91 QPGSPVSHL P P+ +FA G PF L F TG+PPKLLDLDKI R EWE Sbjct: 181 QPGSPVSHL-----ISPSSGTSSPLPELEFATGIPFLLGFTTGHPPKLLDLDKIARGEWE 235 Query: 90 SCQGSGAVTPDAVG--PRSRDNRLLNRQDSDV 1 S + SG +PDA PRS + NRQ SDV Sbjct: 236 SREVSGEASPDATATEPRSSNCCHYNRQHSDV 267 >ref|XP_012836270.1| PREDICTED: COPII coat assembly protein sec16 [Erythranthe guttata] gb|EYU38335.1| hypothetical protein MIMGU_mgv1a007082mg [Erythranthe guttata] Length = 420 Score = 315 bits (808), Expect = e-102 Identities = 178/300 (59%), Positives = 198/300 (66%), Gaps = 4/300 (1%) Frame = -2 Query: 888 NGTDXXXXXXXXXXXXXXXXARGPH-DSVQKRRWGSCLSLYSCFGSNKTKRIGHAAVIPE 712 NGTD A G H S+QKRRW S SLY CF N KRIGHA ++ E Sbjct: 7 NGTDALETISAAASAIASAEAHGAHASSLQKRRWRSFWSLYWCFRPNNNKRIGHAVLVTE 66 Query: 711 TTPTRADNAPTSEHPSQPPSVIXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANM 532 T+ + PT+E P QPPS++ S+TQSPTGLLS++S S N+ Sbjct: 67 TSSSDTAYTPTAERPFQPPSIVLPFTAPPSSPASFIPSEPPSSTQSPTGLLSLSSPSGNI 126 Query: 531 
YSPGGPNSIFAIGPYAHETQLVSPPVFSTFTTEPSTAPFTPPPE-SIHLTTPSSPEVPFA 355 YSP GP SIFAIGPYAHETQLVSPPVFSTFTTEPSTAP+TPPPE S HLTTPSSPEVPFA Sbjct: 127 YSPSGPASIFAIGPYAHETQLVSPPVFSTFTTEPSTAPYTPPPEFSAHLTTPSSPEVPFA 186 Query: 354 RLLEPNLRNGEAGQRYPLTQYEFQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAA 175 RLLEPN QRYPL+QYEFQSYQLQPGSPVSHL PF DRDFAA Sbjct: 187 RLLEPN-------QRYPLSQYEFQSYQLQPGSPVSHLISPCSGISGSGASSPFLDRDFAA 239 Query: 174 GYPFFLEFRTGNPPKLLDLDKIVRREWESCQGSGAVTP-DAVGPRSRDN-RLLNRQDSDV 1 +PFFLEF GNPP+ R +WESCQ SG VTP DAVGPRSRD+ LLNRQ+SD+ Sbjct: 240 VHPFFLEFGGGNPPR--------RDQWESCQESGVVTPTDAVGPRSRDSCVLLNRQNSDI 291 >ref|XP_020550755.1| uncharacterized protein LOC105164075 isoform X1 [Sesamum indicum] Length = 460 Score = 317 bits (811), Expect = e-102 Identities = 171/272 (62%), Positives = 193/272 (70%), Gaps = 3/272 (1%) Frame = -2 Query: 807 VQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSVIXXXXX 631 +QKRRWGSC SLY CFGS K KRIG A ++PET+ + D PT+EHP +PP + Sbjct: 47 IQKRRWGSCWSLYRCFGSYKHNKRIGRAVIVPETSASVMD-VPTAEHPPRPPPLELPFVV 105 Query: 630 XXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPVF 451 S+ QSPTG+LS+TSVSANMYSPGGP SIFAIGPYAHETQLVSPPVF Sbjct: 106 PPSSPASFLPSDPPSSAQSPTGVLSLTSVSANMYSPGGPPSIFAIGPYAHETQLVSPPVF 165 Query: 450 STFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQL 271 STF TEPSTAP+T PPES+HLTTPSSPEVPF+RLLEP L+NGEA QRY +QYEFQSYQL Sbjct: 166 STFATEPSTAPYT-PPESVHLTTPSSPEVPFSRLLEPTLQNGEACQRYGFSQYEFQSYQL 224 Query: 270 QPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREWE 91 QPGSPVSHL P P+ +FA G PF L F TG+PPKLLDLDKI R EWE Sbjct: 225 QPGSPVSHL-----ISPSSGTSSPLPELEFATGIPFLLGFTTGHPPKLLDLDKIARGEWE 279 Query: 90 SCQGSGAVTPDAVG--PRSRDNRLLNRQDSDV 1 S + SG +PDA PRS + NRQ SDV Sbjct: 280 SREVSGEASPDATATEPRSSNCCHYNRQHSDV 311 >ref|XP_018829358.1| PREDICTED: uncharacterized protein LOC108997489 [Juglans regia] Length = 458 Score = 316 bits (810), Expect = e-102 Identities = 166/276 (60%), Positives = 195/276 (70%), Gaps = 1/276 (0%) Frame = -2 Query: 825 
RGPHDSVQKRRWGSCLSLYSCFGSNKTK-RIGHAAVIPETTPTRADNAPTSEHPSQPPSV 649 R +V+KRRWGSC S+Y CFG+ K + RIGHA + PETTP + D ++ Q ++ Sbjct: 31 RASQATVRKRRWGSCWSIYWCFGAYKHRTRIGHAVLHPETTPPQTD-VSVPQNLIQAHAI 89 Query: 648 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 469 + SATQSP GLLS S+SANMYSPGGP+SIFAIGPYAHETQL Sbjct: 90 VLPFVAPPSSPASFLQSEPPSATQSPVGLLSRISISANMYSPGGPSSIFAIGPYAHETQL 149 Query: 468 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 289 VSPPVFSTF TEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PNLRNGEA +R+P + E Sbjct: 150 VSPPVFSTFPTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNLRNGEAFRRFPPSHSE 209 Query: 288 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 109 FQSYQL PGSPV L PFPD +FA G+P FLEFRTG+PPKLL+L K+ Sbjct: 210 FQSYQLHPGSPVGQLVSPSSGISGSGTSSPFPDHEFAVGFPHFLEFRTGDPPKLLNLYKL 269 Query: 108 VRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 RE S QG G++TPDAV P SRD+ LLN Q S+V Sbjct: 270 STRESRSHQGDGSLTPDAVRPTSRDDYLLNGQISEV 305 >ref|XP_019151582.1| PREDICTED: uncharacterized protein LOC109148192 [Ipomoea nil] Length = 469 Score = 313 bits (803), Expect = e-101 Identities = 161/271 (59%), Positives = 190/271 (70%), Gaps = 2/271 (0%) Frame = -2 Query: 807 VQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPP-SVIXXXX 634 VQKRRW SC S+Y CFGS K KRI HA PET R D + ++P++ S++ Sbjct: 41 VQKRRWASCFSIYWCFGSQKQNKRIVHAVFDPETAAPREDRLHSVDNPTRTSTSIMLPFV 100 Query: 633 XXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQLVSPPV 454 SATQSP G++S +S+SAN+YSPGGPNSIFA GPYAHETQLVSPPV Sbjct: 101 APPSSPVSFLQSEPPSATQSPAGVISFSSMSANIYSPGGPNSIFATGPYAHETQLVSPPV 160 Query: 453 FSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYEFQSYQ 274 FSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PNL+N A +PL+ +EFQSYQ Sbjct: 161 FSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAKLLDPNLQNVGARNMFPLSPHEFQSYQ 220 Query: 273 LQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKIVRREW 94 LQPGSP+S L PFPDR+F G P+FLEFRTG+PPKLL+L+KI EW Sbjct: 221 LQPGSPISRLLSPGSAISGSGTSSPFPDREFGVGDPYFLEFRTGDPPKLLNLEKIAPHEW 280 Query: 93 ESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 S 
Q SG +TPD +G RS DN +N Q SDV Sbjct: 281 GSRQRSGTLTPDRMGTRSHDNFKINHQSSDV 311 >dbj|GAV76096.1| hypothetical protein CFOL_v3_19571 [Cephalotus follicularis] Length = 451 Score = 312 bits (800), Expect = e-101 Identities = 168/276 (60%), Positives = 192/276 (69%), Gaps = 1/276 (0%) Frame = -2 Query: 825 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 649 R P S+QKRRWGSC S+Y CFGS+K KRIGHA ++PETT D+A E+P+Q P+ Sbjct: 31 RVPRASIQKRRWGSCWSIYWCFGSHKHRKRIGHAILVPETTAPGTDSAQV-ENPTQAPAP 89 Query: 648 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 469 + SATQSP GLLS +SANMYSP GP SIFAIGPYAHETQL Sbjct: 90 VFPFVAPPSSPASFLQSEPPSATQSPVGLLS---ISANMYSPSGPASIFAIGPYAHETQL 146 Query: 468 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 289 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFARLL+PNL NG+A QR+PL+ YE Sbjct: 147 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFARLLDPNLPNGDAVQRFPLSHYE 206 Query: 288 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 109 FQSYQLQPGSPV L PFPD +FA P F EFR G PP+L +LDK Sbjct: 207 FQSYQLQPGSPVGQLISPSSGISGSGTSSPFPDGEFATCGPHFPEFRIGEPPRLFNLDK- 265 Query: 108 VRREWESCQGSGAVTPDAVGPRSRDNRLLNRQDSDV 1 W S QGSG++TPDAV R+ LL+RQ SD+ Sbjct: 266 ----WGSRQGSGSLTPDAVRSTPRNGFLLDRQTSDI 297 >ref|XP_012086872.1| uncharacterized protein LOC105645786 [Jatropha curcas] gb|KDP25415.1| hypothetical protein JCGZ_20571 [Jatropha curcas] Length = 455 Score = 311 bits (798), Expect = e-100 Identities = 162/263 (61%), Positives = 190/263 (72%), Gaps = 1/263 (0%) Frame = -2 Query: 825 RGPHDSVQKRRWGSCLSLYSCFGSNK-TKRIGHAAVIPETTPTRADNAPTSEHPSQPPSV 649 R P +VQKRRWGSC S+Y CFG N+ KRIGHA ++PET R D++ +E+ +Q P++ Sbjct: 31 RVPQATVQKRRWGSCFSVYWCFGYNRHRKRIGHAVLVPETPGPRNDSS-AAENSTQTPTI 89 Query: 648 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 469 SA+QSPTG+LS+TS+SANMYSP GP+SIFAIGPYAHETQL Sbjct: 90 TLPFVAPPSSPASFLQSEPPSASQSPTGVLSLTSISANMYSPSGPSSIFAIGPYAHETQL 149 Query: 468 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 289 
VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+P++RN EAG R+PL+ YE Sbjct: 150 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPSIRNVEAGLRFPLSNYE 209 Query: 288 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 109 FQSYQL PGSPV L PFPD +FAAG FLEFR G PPKLL+LDK+ Sbjct: 210 FQSYQLYPGSPVGQLISPSSGISGSGTSSPFPDGEFAAG---FLEFRMGEPPKLLNLDKL 266 Query: 108 VRREWESCQGSGAVTPDAVGPRS 40 EW S GSG +TPDAV P S Sbjct: 267 STHEWGSRCGSGTLTPDAVRPTS 289 >ref|XP_010089083.1| uncharacterized protein LOC21407002 [Morus notabilis] gb|EXB37330.1| hypothetical protein L484_024256 [Morus notabilis] Length = 455 Score = 311 bits (797), Expect = e-100 Identities = 159/263 (60%), Positives = 190/263 (72%), Gaps = 1/263 (0%) Frame = -2 Query: 825 RGPHDSVQKRRWGSCLSLYSCFGSNKTK-RIGHAAVIPETTPTRADNAPTSEHPSQPPSV 649 R P +V+KRRWG CLS+Y CFG+ K + RIGH ++PET ++AP +E+ +Q +V Sbjct: 33 RVPQATVRKRRWGGCLSIYWCFGTPKNRTRIGHGVLVPETAQP-GNSAPRAENSTQTHAV 91 Query: 648 IXXXXXXXXXXXXXXXXXXXSATQSPTGLLSMTSVSANMYSPGGPNSIFAIGPYAHETQL 469 I SATQSP GLLS+TSVSA+MYSPGGP SIFAIGPYAHETQL Sbjct: 92 ILPFIAPPSSPASFLQSEPPSATQSPAGLLSLTSVSASMYSPGGPASIFAIGPYAHETQL 151 Query: 468 VSPPVFSTFTTEPSTAPFTPPPESIHLTTPSSPEVPFARLLEPNLRNGEAGQRYPLTQYE 289 VSPPVFSTFTTEPSTAPFTPPPES+HLTTPSSPEVPFA+LL+PN+ NGE GQR+P+ E Sbjct: 152 VSPPVFSTFTTEPSTAPFTPPPESVHLTTPSSPEVPFAQLLDPNIHNGEPGQRFPIFHNE 211 Query: 288 FQSYQLQPGSPVSHLXXXXXXXXXXXXXXPFPDRDFAAGYPFFLEFRTGNPPKLLDLDKI 109 FQSY QPGSP+ L PFPD +FAA P FLEFRTG+PPKLL+LDK+ Sbjct: 212 FQSYYFQPGSPIGQLISPSSGISGSGTSSPFPDPEFAARGPHFLEFRTGDPPKLLNLDKL 271 Query: 108 VRREWESCQGSGAVTPDAVGPRS 40 + +W S QGSG++TPD+V P S Sbjct: 272 SKFDWGSRQGSGSLTPDSVKPIS 294
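
Each alignment record in this report carries a summary of the form "Score = 398 bits (1022), Expect = e-134 Identities = 205/298 (68%)". As a minimal sketch, assuming that layout holds for every record, the following Python pulls those fields out of the report text and recomputes percent identity. The names HIT_RE, parse_evalue and summarize_hits are illustrative only and are not part of BLAST; the legacy shorthand "e-134" is read as 1e-134.

```python
import re

# Assumed layout of the per-hit summary in legacy BLAST text output,
# taken from the records in this report (not an official grammar).
HIT_RE = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>\S+)\s+"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)"
)


def parse_evalue(text):
    """Convert a legacy E-value string such as 'e-134' to a float."""
    text = text.rstrip(",")
    if text.startswith("e"):
        # Legacy BLAST drops the leading mantissa: 'e-134' means 1e-134.
        text = "1" + text
    return float(text)


def summarize_hits(report):
    """Return one dict per alignment summary found in the report text."""
    hits = []
    for match in HIT_RE.finditer(report):
        identical = int(match.group("ident"))
        length = int(match.group("length"))
        hits.append(
            {
                "bits": float(match.group("bits")),
                "raw_score": int(match.group("raw")),
                "evalue": parse_evalue(match.group("evalue")),
                "identical": identical,
                "aligned_length": length,
                "pct_identity": 100.0 * identical / length,
            }
        )
    return hits


# Two summaries copied from the report above.
sample = (
    "Score = 398 bits (1022), Expect = e-134 "
    "Identities = 205/298 (68%), Positives = 223/298 (74%) Frame = -2 "
    "Score = 377 bits (967), Expect = e-126 "
    "Identities = 197/299 (65%), Positives = 210/299 (70%), Gaps = 1/299 (0%)"
)
for hit in summarize_hits(sample):
    print(hit["bits"], hit["evalue"], round(hit["pct_identity"], 1))
```

The regular expression works on the flattened text as well as on a report with its original line breaks, because it only relies on whitespace between fields, not on newlines.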