BLASTX nr result
ID: Astragalus22_contig00014912
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00014912
         (762 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_004497416.1| PREDICTED: uncharacterized protein C3orf26 h...  327   e-110
dbj|GAU12057.1| hypothetical protein TSUD_00220 [Trifolium subte...  314   e-105
gb|PNY06135.1| protein CMSS1-like [Trifolium pratense]               312   e-104
ref|XP_019452516.1| PREDICTED: uncharacterized protein C3orf26 h...  311   e-104
ref|XP_019441585.1| PREDICTED: uncharacterized protein C3orf26 h...  306   e-102
ref|XP_015942328.1| uncharacterized protein C3orf26 homolog isof...  301   e-100
ref|XP_016175514.1| protein CMS1 isoform X1 [Arachis ipaensis]       301   e-100
ref|XP_003592852.1| DEAD-box helicase family protein [Medicago t...  298   6e-99
gb|KHN27142.1| Hypothetical protein glysoja_032910 [Glycine soja]    297   1e-98
ref|XP_003556578.1| PREDICTED: protein CMSS1 [Glycine max] >gi|9...  297   2e-98
ref|XP_023909205.1| uncharacterized protein C3orf26 homolog [Que...  295   3e-97
ref|XP_018815173.1| PREDICTED: protein CMSS1 [Juglans regia]         293   9e-97
gb|KYP75518.1| Uncharacterized protein C3orf26 isogeny [Cajanus ...  291   2e-96
ref|XP_020207294.1| uncharacterized protein C3orf26 homolog [Caj...  291   3e-96
ref|XP_016699167.1| PREDICTED: protein CMSS1-like [Gossypium hir...  293   3e-96
ref|XP_012457024.1| PREDICTED: protein CMSS1 [Gossypium raimondi...  292   6e-96
ref|XP_017646838.1| PREDICTED: protein CMSS1 [Gossypium arboreum]    290   4e-95
ref|XP_016677780.1| PREDICTED: protein CMSS1-like [Gossypium hir...  290   4e-95
ref|XP_010107216.1| protein CMS1 [Morus notabilis] >gi|587927016...  289   1e-94
ref|XP_021290395.1| protein CMSS1 isoform X1 [Herrania umbratica]    288   3e-94

>ref|XP_004497416.1| PREDICTED: uncharacterized protein C3orf26 homolog [Cicer arietinum]
          Length = 257

 Score = 327 bits (839), Expect = e-110
 Identities = 175/242 (72%), Positives = 196/242 (80%), Gaps = 6/242 (2%)
 Frame = -2

Query: 710 EKVKRKGAGGGKNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSS 531
           EKVKRK + +N DDR VSS SKEGA EQLRFFVDQF+ A QLSS
Sbjct: 4   EKVKRKRSAT-ENNDDRSEKKKTTVSSS------SKEGAGEQLRFFVDQFQEANHCQLSS 56

Query: 530 LELESLKDTCIVEVSQ----DADSDVKKLGNNIKVAFGESWKESLCEGKV--GKIPAGSP 369
           +ELES KDT I+E+S D D+DVK LG++IK AFG+SWKE LCEG + GKIPAGSP
Sbjct: 57  IELESFKDTSILELSHSQDTDTDNDVKMLGDDIKAAFGKSWKEVLCEGDLLQGKIPAGSP 116

Query: 368 AVVVISSSALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIK 189
           +VV++SSSALRSIHLL+GF S TK+C VKLFSKHIK++EQ SLLKNRVNIASGTPSRIK
Sbjct: 117 SVVILSSSALRSIHLLKGFRSFTKQCSAVKLFSKHIKLQEQTSLLKNRVNIASGTPSRIK 176

Query: 188 KLIDIEALGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICL 9
           KLID+EALGLSRL+VLVLD+HPDVKGYSL TLPQVRDEFWDLF NYFYQPMIQG LRICL
Sbjct: 177 KLIDMEALGLSRLKVLVLDMHPDVKGYSLLTLPQVRDEFWDLFKNYFYQPMIQGDLRICL 236

Query: 8   YG 3
           YG
Sbjct: 237 YG 238

>dbj|GAU12057.1| hypothetical protein TSUD_00220 [Trifolium subterraneum]
          Length = 261

 Score = 314 bits (804), Expect = e-105
 Identities = 167/244 (68%), Positives = 195/244 (79%), Gaps = 8/244 (3%)
 Frame = -2

Query: 710 EKVKRKGAGGGKNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSS 531
           +K KRK G N+DDR ++ E+ S + A+EQLRFFVDQFEAA G+Q+SS
Sbjct: 7   KKAKRKRKEGNDNDDDR-------MNKKEKTETNSNK-AEEQLRFFVDQFEAANGLQVSS 58

Query: 530 LELESLKDTC-IVEV------SQDADSDVKKLGNNIKVAFGESWKESLCEGKV-GKIPAG 375
           +E ESLKDT I+E+ D D DVK LG NIK AFG SW+++LCE ++ G IP G
Sbjct: 59  IEFESLKDTSPILELPPQSLKDTDTDLDVKMLGINIKAAFGNSWRQALCESELDGAIPPG 118

Query: 374 SPAVVVISSSALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSR 195
           SP+V++IS SALRSIHLL+GF SITK+C VKLFSKHIK++EQISLLKNRVNIASGTPSR
Sbjct: 119 SPSVLIISYSALRSIHLLKGFRSITKQCSAVKLFSKHIKLQEQISLLKNRVNIASGTPSR 178

Query: 194 IKKLIDIEALGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRI 15
           IKKLIDIEALGLSRL+VL+LD+HPDVKGYSLFTLPQVRDEFWDLF NYFYQPMI+G LRI
Sbjct: 179 IKKLIDIEALGLSRLKVLLLDVHPDVKGYSLFTLPQVRDEFWDLFKNYFYQPMIKGDLRI 238

Query: 14  CLYG 3
           CLYG
Sbjct: 239 CLYG 242

>gb|PNY06135.1| protein CMSS1-like [Trifolium pratense]
          Length = 283

 Score = 312 bits (799), Expect = e-104
 Identities = 168/251 (66%), Positives = 195/251 (77%), Gaps = 16/251 (6%)
 Frame = -2

Query: 707 KVKRKGAGGGKNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSSL 528
           K KRK G N+DDR ++ SN+ + A+EQLRFF DQF+AA G+QLSS+
Sbjct: 15  KRKRKEEGNDDNDDDRMNKKEKTETNSNSNK-LEGSRAEEQLRFFFDQFQAANGLQLSSI 73

Query: 527 ELESLKD----TCIVEVSQ--------DADSDVKKLGNNIKVAFGESWKESLCEGKVG-- 390
           ELESLKD + I+E+ Q D D DVK LG NIK AFG SW+++L E ++G
Sbjct: 74  ELESLKDASSSSSILELPQSSSSLKNTDTDLDVKMLGINIKAAFGNSWRQALYESELGHG 133

Query: 389 --KIPAGSPAVVVISSSALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNI 216
           KIP GSP+V++IS SALRSIHLL+GF SITK+C VKLFSKHIK+EEQISLLKNRVNI
Sbjct: 134 DGKIPPGSPSVLIISYSALRSIHLLKGFRSITKQCSAVKLFSKHIKLEEQISLLKNRVNI 193

Query: 215 ASGTPSRIKKLIDIEALGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPM 36
           ASGTPSRIKKLIDIEALGLSRL+VL+LD+HPDVKGYSLFTLPQVRDEFWDLF NYFYQPM
Sbjct: 194 ASGTPSRIKKLIDIEALGLSRLKVLLLDVHPDVKGYSLFTLPQVRDEFWDLFKNYFYQPM 253

Query: 35  IQGHLRICLYG 3
           I+G LRICLYG
Sbjct: 254 IKGDLRICLYG 264

>ref|XP_019452516.1| PREDICTED: uncharacterized protein C3orf26 homolog [Lupinus angustifolius]
 gb|OIW06862.1| hypothetical protein TanjilG_18244 [Lupinus angustifolius]
          Length = 257

 Score = 311 bits (796), Expect = e-104
 Identities = 160/229 (69%), Positives = 184/229 (80%), Gaps = 4/229 (1%)
 Frame = -2

Query: 677 KNEDDRRXXXXXKVSSLPSNESV--SKEGAQEQLRFFVDQFEAAKGVQLSSLELESLKDT 504
           KN + R +++ S V + A EQL FFV+QF++A G+Q+SSLELESLKDT
Sbjct: 9   KNSNSNRKRKRKQIAPTKSLNKVPLASASASEQLGFFVEQFQSANGLQISSLELESLKDT 68

Query: 503 CIVEVSQDADSDVKKLGNNIKVAFGESWKESLCEGKV--GKIPAGSPAVVVISSSALRSI 330
           CI+E+ QD+ DV LG NI+ AFG SWKE LCEGK+ GKI AGSP+V++ISSSALR I
Sbjct: 69  CILELPQDSHLDVNMLGENIRPAFGTSWKEELCEGKLVEGKIDAGSPSVLIISSSALRCI 128

Query: 329 HLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKKLIDIEALGLSRL 150
           LLRGF S TKEC VKLFSKH+K+EEQI LLKNRVNIASGTPSRIKKLIDIEAL LSRL
Sbjct: 129 ELLRGFRSFTKECHAVKLFSKHMKVEEQIPLLKNRVNIASGTPSRIKKLIDIEALSLSRL 188

Query: 149 RVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           +VLVLD+HPDVKGYSL TLPQVRDEFWDLF NY+YQPMI+G LRICLYG
Sbjct: 189 KVLVLDMHPDVKGYSLLTLPQVRDEFWDLFKNYYYQPMIKGDLRICLYG 237

>ref|XP_019441585.1| PREDICTED: uncharacterized protein C3orf26 homolog [Lupinus angustifolius]
 gb|OIW12812.1| hypothetical protein TanjilG_24745 [Lupinus angustifolius]
          Length = 261

 Score = 306 bits (785), Expect = e-102
 Identities = 157/206 (76%), Positives = 175/206 (84%), Gaps = 2/206 (0%)
 Frame = -2

Query: 614 SVSKEGAQEQLRFFVDQFEAAKGVQLSSLELESLKDTCIVEVSQDADSDVKKLGNNIKVA 435
           S + A EQL FFV+QF++A +QLSSLELESLKDTCI+E+ QD+ DV LG +IK A
Sbjct: 43  SPTSASASEQLDFFVEQFQSANALQLSSLELESLKDTCILELPQDSHLDVNALGKDIKPA 102

Query: 434 FGESWKESLCEGKV--GKIPAGSPAVVVISSSALRSIHLLRGFHSITKECPPVKLFSKHI 261
           FG SWKE LCEGK+ G+I AGSPAV++ISSSALRSI LLRGF S TKEC VKLFSKH+
Sbjct: 103 FGASWKEVLCEGKLVQGEIDAGSPAVLIISSSALRSIELLRGFRSFTKECHAVKLFSKHM 162

Query: 260 KIEEQISLLKNRVNIASGTPSRIKKLIDIEALGLSRLRVLVLDIHPDVKGYSLFTLPQVR 81
           K+EEQ+SLLKNRVNIASGTPSRIKKLIDIEAL LSRL+VLVLD+ DVKGYSL TLPQVR
Sbjct: 163 KVEEQVSLLKNRVNIASGTPSRIKKLIDIEALSLSRLKVLVLDMQSDVKGYSLLTLPQVR 222

Query: 80  DEFWDLFHNYFYQPMIQGHLRICLYG 3
           DEFWDLF NYFYQPMIQG LRICLYG
Sbjct: 223 DEFWDLFKNYFYQPMIQGDLRICLYG 248

>ref|XP_015942328.1| uncharacterized protein C3orf26 homolog isoform X1 [Arachis duranensis]
          Length = 262

 Score = 301 bits (771), Expect = e-100
 Identities = 157/239 (65%), Positives = 189/239 (79%), Gaps = 8/239 (3%)
 Frame = -2

Query: 695 KGAGGGKNEDDRRXXXXXKVSSLPSNESVSKEGAQ--EQLRFFVDQFEAAKGVQLSSLEL 522
           K A + + R+ KV + S ++ + E A EQLRFFVD+F++AKGV+LSSLEL
Sbjct: 5   KEATQNEQQRKRKRKEKGKVKGVKSKKNSASESASASEQLRFFVDEFQSAKGVKLSSLEL 64

Query: 521 ESLKDTCIVEVSQ----DADSDVKKLGNNIKVAFGESWKESLCEGKV--GKIPAGSPAVV 360
           +SLKD+ I+ + D+DSDVK LGNNIK AFG SWK+ LCE + G++ GSPA++
Sbjct: 65  DSLKDSSILTLPHSSVSDSDSDVKILGNNIKAAFGASWKQVLCEPGLVPGEVLPGSPAIL 124

Query: 359 VISSSALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKKLI 180
           +++SSALRS+HLLRGF S TKEC KLFSKH+K+EEQ+S+L NRVNIASGTPSRIKKLI
Sbjct: 125 IVASSALRSVHLLRGFRSFTKECHAAKLFSKHMKLEEQVSVLNNRVNIASGTPSRIKKLI 184

Query: 179 DIEALGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           D EAL LSRL+VL+LDIHPDVKGYSLFTLPQVRDEFWDLF NYFYQ MIQGHLRICLYG
Sbjct: 185 DTEALSLSRLQVLLLDIHPDVKGYSLFTLPQVRDEFWDLFKNYFYQGMIQGHLRICLYG 243

>ref|XP_016175514.1| protein CMS1 isoform X1 [Arachis ipaensis]
          Length = 262

 Score = 301 bits (770), Expect = e-100
 Identities = 155/239 (64%), Positives = 186/239 (77%), Gaps = 8/239 (3%)
 Frame = -2

Query: 695 KGAGGGKNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSSLELES 516
           K A + + R+ KV + S ++ + A EQLRFFVD+F++AKGV+LSSLEL+S
Sbjct: 5   KEATQNEQQRKRKRKEKGKVEGVKSKKNSASASASEQLRFFVDEFQSAKGVKLSSLELDS 64

Query: 515 LKDTCIVEV------SQDADSDVKKLGNNIKVAFGESWKESLCEGKV--GKIPAGSPAVV 360
           LKD+ I+ + D+DSDVK LGNNIK AFG SWK+ LCE + G + GSPA++
Sbjct: 65  LKDSSILTLPHSSVSDSDSDSDVKILGNNIKAAFGASWKQVLCEPDLVPGNVLPGSPAIL 124

Query: 359 VISSSALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKKLI 180
           +++SSALRS+HLLRGF S TKEC KLFSKH+K+EEQ+S+L NRVNIASGTPSRIKKLI
Sbjct: 125 IVASSALRSLHLLRGFRSFTKECHAAKLFSKHMKLEEQVSVLNNRVNIASGTPSRIKKLI 184

Query: 179 DIEALGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           D EAL LSRL+VL+LDIHPDVKGYSL TLPQVRDEFWDLF NYFYQ MIQGHLRICLYG
Sbjct: 185 DTEALSLSRLQVLLLDIHPDVKGYSLLTLPQVRDEFWDLFKNYFYQGMIQGHLRICLYG 243

>ref|XP_003592852.1| DEAD-box helicase family protein [Medicago truncatula]
 gb|AES63103.1| DEAD-box helicase family protein [Medicago truncatula]
          Length = 245

 Score = 298 bits (762), Expect = 6e-99
 Identities = 159/241 (65%), Positives = 189/241 (78%), Gaps = 8/241 (3%)
 Frame = -2

Query: 701 KRKGAGGGKNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSSLEL 522
           KRKG +N DDRR + ++ EQLRFFV++F++A VQLSS+EL
Sbjct: 3   KRKGR---ENNDDRRK----------KKKKKTETTGSEQLRFFVNEFQSANDVQLSSIEL 49

Query: 521 ESLK-DTCIVEVS-----QDADSDVKKLGNNIKVAFGESWKESLCEGKV--GKIPAGSPA 366
           ESLK D+ I+E+S +D D DVK LG +IK AFG W++ LCE +V GKIP GSP+
Sbjct: 50  ESLKADSSILELSNSQNSRDTDLDVKLLGGDIKGAFGNCWRQVLCESEVVEGKIPPGSPS 109

Query: 365 VVVISSSALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKK 186
           V+++S SALRSIHLL+GF +TK+C VKLFSKHIK++EQISLLKNRVNIASGTPSRIKK
Sbjct: 110 VLIVSPSALRSIHLLKGFRFMTKQCSAVKLFSKHIKLQEQISLLKNRVNIASGTPSRIKK 169

Query: 185 LIDIEALGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLY 6
           LID+EALGLSRL+VLVLD+HPDVKGYSLFTLPQVRDEFWDLF NYFYQPMI+G LRICLY
Sbjct: 170 LIDVEALGLSRLQVLVLDMHPDVKGYSLFTLPQVRDEFWDLFKNYFYQPMIKGDLRICLY 229

Query: 5   G 3
           G
Sbjct: 230 G 230

>gb|KHN27142.1| Hypothetical protein glysoja_032910 [Glycine soja]
          Length = 238

 Score = 297 bits (760), Expect = 1e-98
 Identities = 157/228 (68%), Positives = 180/228 (78%), Gaps = 3/228 (1%)
 Frame = -2

Query: 677 KNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSSLELESLKDT-C 501
           +NE +R S + E A +QLRFF E+A G++LSSLELESLKD C
Sbjct: 4   ENEKRKRKESSDVRSKKKKKQRSEDEEATKQLRFF----ESAMGIELSSLELESLKDNKC 59

Query: 500 IVEVSQDADSDVKKLGNNIKVAFGESWKESLCEGKV--GKIPAGSPAVVVISSSALRSIH 327
           I+EVS+ ADSDV LG I+ AFG SWKE+LCEGK GK+ AGSPAV++I+SSALR I
Sbjct: 60  ILEVSEAADSDVTVLGKTIRAAFGASWKEALCEGKPVEGKVIAGSPAVLIITSSALRCID 119

Query: 326 LLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKKLIDIEALGLSRLR 147
           LLRGF S+T++C KLFSKH+K+EEQISLLKNRVNIASGTPSRIKKLID EAL LSRL+
Sbjct: 120 LLRGFRSMTEQCHAAKLFSKHMKLEEQISLLKNRVNIASGTPSRIKKLIDAEALDLSRLQ 179

Query: 146 VLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           VLVLD+HPDVKGYSL TLPQVRDEFWDLF NYFYQPM+QGHLRICLYG
Sbjct: 180 VLVLDLHPDVKGYSLLTLPQVRDEFWDLFKNYFYQPMLQGHLRICLYG 227

>ref|XP_003556578.1| PREDICTED: protein CMSS1 [Glycine max]
 gb|KRG93111.1| hypothetical protein GLYMA_20G248400 [Glycine max]
          Length = 261

 Score = 297 bits (760), Expect = 2e-98
 Identities = 157/228 (68%), Positives = 180/228 (78%), Gaps = 3/228 (1%)
 Frame = -2

Query: 677 KNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSSLELESLKDT-C 501
           +NE +R S + E A +QLRFF E+A G++LSSLELESLKD C
Sbjct: 16  ENEKRKRKESSDVRSKKKKKQRSEDEEATKQLRFF----ESAMGIELSSLELESLKDNKC 71

Query: 500 IVEVSQDADSDVKKLGNNIKVAFGESWKESLCEGKV--GKIPAGSPAVVVISSSALRSIH 327
           I+EVS+ ADSDV LG I+ AFG SWKE+LCEGK GK+ AGSPAV++I+SSALR I
Sbjct: 72  ILEVSEAADSDVTVLGKTIRAAFGASWKEALCEGKPVEGKVIAGSPAVLIITSSALRCID 131

Query: 326 LLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKKLIDIEALGLSRLR 147
           LLRGF S+T++C KLFSKH+K+EEQISLLKNRVNIASGTPSRIKKLID EAL LSRL+
Sbjct: 132 LLRGFRSMTEQCHAAKLFSKHMKLEEQISLLKNRVNIASGTPSRIKKLIDAEALDLSRLQ 191

Query: 146 VLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           VLVLD+HPDVKGYSL TLPQVRDEFWDLF NYFYQPM+QGHLRICLYG
Sbjct: 192 VLVLDLHPDVKGYSLLTLPQVRDEFWDLFKNYFYQPMLQGHLRICLYG 239

>ref|XP_023909205.1| uncharacterized protein C3orf26 homolog [Quercus suber]
 gb|POF14757.1| protein cmss1 [Quercus suber]
          Length = 296

 Score = 295 bits (756), Expect = 3e-97
 Identities = 155/248 (62%), Positives = 192/248 (77%), Gaps = 11/248 (4%)
 Frame = -2

Query: 713 NEKVKRKGAGGGKNEDDRRXXXXXKV-----SSLPSNESVSK----EGAQEQLRFFVDQF 561
           N K KRK N ++++ ++ ++ +NE + K A EQL F++QF
Sbjct: 35  NPKNKRKKPLAPTNLNNKKKSKKPRIQKPETNNYNNNEIIQKVQQPATASEQLSCFLNQF 94

Query: 560 EAAKGVQLSSLELESLKDTCIVEVSQDADSDVKKLGNNIKVAFGESWKESLCEGKV--GK 387
           ++A GVQLSSLELESLKDTCIVE+SQ D DV+ LG ++K AFG SWKE LC +V GK
Sbjct: 95  QSANGVQLSSLELESLKDTCIVELSQGTDQDVENLGKHMKAAFGASWKEVLCGKQVLEGK 154

Query: 386 IPAGSPAVVVISSSALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASG 207
           + GSPA++V+SSSALR+I LLRGF ++TK+C VKLFSKH+K+EEQ+SLLKNRVNIASG
Sbjct: 155 VDPGSPALLVVSSSALRAIELLRGFRTLTKDCHAVKLFSKHMKVEEQVSLLKNRVNIASG 214

Query: 206 TPSRIKKLIDIEALGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQG 27
           TPSRIKKLIDIEAL LSRL V++LDIHPDVKGYSLFTLPQVRDEFWDL+ NYF+Q +++G
Sbjct: 215 TPSRIKKLIDIEALALSRLAVILLDIHPDVKGYSLFTLPQVRDEFWDLYKNYFHQRLLEG 274

Query: 26  HLRICLYG 3
           LRICLYG
Sbjct: 275 DLRICLYG 282

>ref|XP_018815173.1| PREDICTED: protein CMSS1 [Juglans regia]
          Length = 278

 Score = 293 bits (751), Expect = 9e-97
 Identities = 146/235 (62%), Positives = 184/235 (78%)
 Frame = -2

Query: 707 KVKRKGAGGGKNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSSL 528
           K K K +++++ + S + E+ A+ QL FF+DQF++A GVQLSSL
Sbjct: 28  KTKLKSKKKKRDKNEAELIQNIQSSHENNEEAYPASAAKRQLCFFLDQFQSANGVQLSSL 87

Query: 527 ELESLKDTCIVEVSQDADSDVKKLGNNIKVAFGESWKESLCEGKVGKIPAGSPAVVVISS 348
           ELES+KDT IVE+SQDA DV+ LG ++K AFG WKE+LC + GK+ G PA++++SS
Sbjct: 88  ELESIKDTSIVELSQDATQDVQNLGKHMKAAFGSPWKEALCGKQEGKVDPGCPALLIVSS 147

Query: 347 SALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKKLIDIEA 168
           SALRSI LLRGF S+TK+C VKLFSKH+K+EEQ+SLLK+RVNIA GTPSR+KKLIDIEA
Sbjct: 148 SALRSIELLRGFRSLTKDCHAVKLFSKHMKVEEQVSLLKSRVNIACGTPSRVKKLIDIEA 207

Query: 167 LGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           LGLSRL V++LDIHPDVKGYSLFTLPQVRDEFWDL+ YF+ ++QG+LRICLYG
Sbjct: 208 LGLSRLAVILLDIHPDVKGYSLFTLPQVRDEFWDLYKCYFHPRLLQGNLRICLYG 262

>gb|KYP75518.1| Uncharacterized protein C3orf26 isogeny [Cajanus cajan]
          Length = 247

 Score = 291 bits (746), Expect = 2e-96
 Identities = 156/236 (66%), Positives = 179/236 (75%)
 Frame = -2

Query: 710 EKVKRKGAGGGKNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSS 531
           EK KRK N + R K + E QL F ++A G+QLSS
Sbjct: 4   EKAKRKS---NSNSEGVRREKKKKKTKKHEEEEAIDASESAQLNFL----QSAMGIQLSS 56

Query: 530 LELESLKDTCIVEVSQDADSDVKKLGNNIKVAFGESWKESLCEGKVGKIPAGSPAVVVIS 351
           LELESLKD CI+E ADSDV+ LG NIK AFG SWKE LCEG+ GK+ AGSPAV++++
Sbjct: 57  LELESLKDRCILE----ADSDVEMLGKNIKAAFGASWKEVLCEGQ-GKVDAGSPAVLILT 111

Query: 350 SSALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKKLIDIE 171
           SSALR IHLLRGF S+TK+C KLFSKH+K++EQI+LLKNRVNIASGTPSRIKKL+D E
Sbjct: 112 SSALRCIHLLRGFRSMTKQCHAAKLFSKHMKLQEQIALLKNRVNIASGTPSRIKKLMDAE 171

Query: 170 ALGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           ALGLSRL+VLVLD+HPDVKGYSLFTLPQVRDEFW+LF NYFYQPMIQG LRICLYG
Sbjct: 172 ALGLSRLQVLVLDLHPDVKGYSLFTLPQVRDEFWELFKNYFYQPMIQGDLRICLYG 227

>ref|XP_020207294.1| uncharacterized protein C3orf26 homolog [Cajanus cajan]
          Length = 258

 Score = 291 bits (746), Expect = 3e-96
 Identities = 156/236 (66%), Positives = 179/236 (75%)
 Frame = -2

Query: 710 EKVKRKGAGGGKNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSS 531
           EK KRK N + R K + E QL F ++A G+QLSS
Sbjct: 15  EKAKRKS---NSNSEGVRREKKKKKTKKHEEEEAIDASESAQLNFL----QSAMGIQLSS 67

Query: 530 LELESLKDTCIVEVSQDADSDVKKLGNNIKVAFGESWKESLCEGKVGKIPAGSPAVVVIS 351
           LELESLKD CI+E ADSDV+ LG NIK AFG SWKE LCEG+ GK+ AGSPAV++++
Sbjct: 68  LELESLKDRCILE----ADSDVEMLGKNIKAAFGASWKEVLCEGQ-GKVDAGSPAVLILT 122

Query: 350 SSALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKKLIDIE 171
           SSALR IHLLRGF S+TK+C KLFSKH+K++EQI+LLKNRVNIASGTPSRIKKL+D E
Sbjct: 123 SSALRCIHLLRGFRSMTKQCHAAKLFSKHMKLQEQIALLKNRVNIASGTPSRIKKLMDAE 182

Query: 170 ALGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           ALGLSRL+VLVLD+HPDVKGYSLFTLPQVRDEFW+LF NYFYQPMIQG LRICLYG
Sbjct: 183 ALGLSRLQVLVLDLHPDVKGYSLFTLPQVRDEFWELFKNYFYQPMIQGDLRICLYG 238

>ref|XP_016699167.1| PREDICTED: protein CMSS1-like [Gossypium hirsutum]
          Length = 290

 Score = 293 bits (749), Expect = 3e-96
 Identities = 147/209 (70%), Positives = 172/209 (82%), Gaps = 2/209 (0%)
 Frame = -2

Query: 623 SNESVSKEGAQEQLRFFVDQFEAAKGVQLSSLELESLKDTCIVEVSQDADSDVKKLGNNI 444
           +N+ +QLR+F+ QFE+A GVQLSSLELES+KD+CI++VSQ+ DV KL I
Sbjct: 57  NNQPSPPASPSQQLRYFLSQFESANGVQLSSLELESIKDSCILDVSQELGQDVMKLEKRI 116

Query: 443 KVAFGESWKESLCEGK--VGKIPAGSPAVVVISSSALRSIHLLRGFHSITKECPPVKLFS 270
           K AFG WKE LCEGK VGKI AGSPAV+V++ SALRSI LLRG ++TKEC VKLFS
Sbjct: 117 KEAFGAKWKEELCEGKHIVGKIEAGSPAVLVVAPSALRSIELLRGMRTLTKECHAVKLFS 176

Query: 269 KHIKIEEQISLLKNRVNIASGTPSRIKKLIDIEALGLSRLRVLVLDIHPDVKGYSLFTLP 90
           KH+KI+EQ+SLL NRVNIASGTPSRIKKLIDIEALGLSRL VL+LDIH DVKGYSL TLP
Sbjct: 177 KHMKIDEQVSLLMNRVNIASGTPSRIKKLIDIEALGLSRLSVLLLDIHTDVKGYSLLTLP 236

Query: 89  QVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           QVRDEFWDL+ NYF+Q +++G LRICLYG
Sbjct: 237 QVRDEFWDLYKNYFHQQLVKGDLRICLYG 265

>ref|XP_012457024.1| PREDICTED: protein CMSS1 [Gossypium raimondii]
 gb|KJB71843.1| hypothetical protein B456_011G144300 [Gossypium raimondii]
          Length = 290

 Score = 292 bits (747), Expect = 6e-96
 Identities = 146/209 (69%), Positives = 173/209 (82%), Gaps = 2/209 (0%)
 Frame = -2

Query: 623 SNESVSKEGAQEQLRFFVDQFEAAKGVQLSSLELESLKDTCIVEVSQDADSDVKKLGNNI 444
           +N+ +QLR+F+ QFE+A GVQLSSLELES+KD+CI++VSQ+ DV KL +I
Sbjct: 57  NNQPSPPASPSQQLRYFLSQFESANGVQLSSLELESIKDSCILDVSQELGQDVMKLEKHI 116

Query: 443 KVAFGESWKESLCEGK--VGKIPAGSPAVVVISSSALRSIHLLRGFHSITKECPPVKLFS 270
           K AFG WKE LCEGK VGKI AGSPAV+V++ SALRSI LLRG ++TKEC VKLFS
Sbjct: 117 KEAFGAKWKEELCEGKHIVGKIEAGSPAVLVVAPSALRSIELLRGMRTLTKECHAVKLFS 176

Query: 269 KHIKIEEQISLLKNRVNIASGTPSRIKKLIDIEALGLSRLRVLVLDIHPDVKGYSLFTLP 90
           KH+KI+EQ+SLL NRVNIASGTPSRIKKLIDIEALGLSRL VL++DIH DVKGYSL TLP
Sbjct: 177 KHMKIDEQVSLLMNRVNIASGTPSRIKKLIDIEALGLSRLSVLLVDIHTDVKGYSLLTLP 236

Query: 89  QVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           QVRDEFWDL+ NYF+Q +++G LRICLYG
Sbjct: 237 QVRDEFWDLYKNYFHQQLVKGDLRICLYG 265

>ref|XP_017646838.1| PREDICTED: protein CMSS1 [Gossypium arboreum]
          Length = 290

 Score = 290 bits (741), Expect = 4e-95
 Identities = 145/209 (69%), Positives = 171/209 (81%), Gaps = 2/209 (0%)
 Frame = -2

Query: 623 SNESVSKEGAQEQLRFFVDQFEAAKGVQLSSLELESLKDTCIVEVSQDADSDVKKLGNNI 444
           +N+ +QLR+F+ QFE+A GVQLSSLELES+KD+C ++VSQ+ DV KL +I
Sbjct: 57  NNQPSQPASPSQQLRYFLSQFESANGVQLSSLELESIKDSCFLDVSQELGQDVMKLEKHI 116

Query: 443 KVAFGESWKESLCEGK--VGKIPAGSPAVVVISSSALRSIHLLRGFHSITKECPPVKLFS 270
           K AFG WKE LCEGK GKI AGSPAV+V++ SALRSI LLRG ++TKEC VKLFS
Sbjct: 117 KEAFGAKWKEELCEGKHIEGKIEAGSPAVLVVAPSALRSIELLRGMRTLTKECHAVKLFS 176

Query: 269 KHIKIEEQISLLKNRVNIASGTPSRIKKLIDIEALGLSRLRVLVLDIHPDVKGYSLFTLP 90
           KH+KI+EQ+SLL NRVNIASGTPSRIKKLIDIEALGLSRL VL+LDIH DVKGYSL TLP
Sbjct: 177 KHMKIDEQVSLLMNRVNIASGTPSRIKKLIDIEALGLSRLSVLLLDIHTDVKGYSLLTLP 236

Query: 89  QVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           QVRDEFWDL+ NYF+Q +++G LRICLYG
Sbjct: 237 QVRDEFWDLYKNYFHQQLVKGDLRICLYG 265

>ref|XP_016677780.1| PREDICTED: protein CMSS1-like [Gossypium hirsutum]
          Length = 290

 Score = 290 bits (741), Expect = 4e-95
 Identities = 145/209 (69%), Positives = 171/209 (81%), Gaps = 2/209 (0%)
 Frame = -2

Query: 623 SNESVSKEGAQEQLRFFVDQFEAAKGVQLSSLELESLKDTCIVEVSQDADSDVKKLGNNI 444
           +N+ +QLR+F+ QFE+A GVQLSSLELES+KD+C ++VSQ+ DV KL +I
Sbjct: 57  NNQPSQPASPSQQLRYFLSQFESANGVQLSSLELESIKDSCFLDVSQELGQDVMKLEKHI 116

Query: 443 KVAFGESWKESLCEGK--VGKIPAGSPAVVVISSSALRSIHLLRGFHSITKECPPVKLFS 270
           K AFG WKE LCEGK GKI AGSPAV+V++ SALRSI LLRG ++TKEC VKLFS
Sbjct: 117 KEAFGAKWKEELCEGKHIEGKIEAGSPAVLVVAPSALRSIELLRGMRTLTKECHAVKLFS 176

Query: 269 KHIKIEEQISLLKNRVNIASGTPSRIKKLIDIEALGLSRLRVLVLDIHPDVKGYSLFTLP 90
           KH+KI+EQ+SLL NRVNIASGTPSRIKKLIDIEALGLSRL VL+LDIH DVKGYSL TLP
Sbjct: 177 KHMKIDEQVSLLMNRVNIASGTPSRIKKLIDIEALGLSRLSVLLLDIHTDVKGYSLLTLP 236

Query: 89  QVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           QVRDEFWDL+ NYF+Q +++G LRICLYG
Sbjct: 237 QVRDEFWDLYKNYFHQQLVKGDLRICLYG 265

>ref|XP_010107216.1| protein CMS1 [Morus notabilis]
 gb|EXC14240.1| hypothetical protein L484_021737 [Morus notabilis]
          Length = 296

 Score = 289 bits (739), Expect = 1e-94
 Identities = 146/227 (64%), Positives = 179/227 (78%), Gaps = 2/227 (0%)
 Frame = -2

Query: 677 KNEDDRRXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQLSSLELESLKDTCI 498
           KN++D R +NE V + A QLR F+DQF++ G+QLSSLELE++KDTCI
Sbjct: 56  KNDNDDRVEEQNDNGESGNNEVV-QTSASTQLRVFLDQFQSGNGIQLSSLELEAIKDTCI 114

Query: 497 VEVSQDADSDVKKLGNNIKVAFGESWKESLCEGKV--GKIPAGSPAVVVISSSALRSIHL 324
           +E+SQ D D K LG ++KVAFG SWKE LCE ++ GK+ G PA+++ISSSA+RSI L
Sbjct: 115 LELSQCTDEDAKSLGKDMKVAFGPSWKEVLCEKQLLEGKVDPGCPAILIISSSAMRSIEL 174

Query: 323 LRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKKLIDIEALGLSRLRV 144
           LRG S+TKECP VKLFSKH+K+E+Q+SLLKNRVNIASGTPSRIKKLIDIEALGLSRL
Sbjct: 175 LRGLQSLTKECPAVKLFSKHMKVEDQVSLLKNRVNIASGTPSRIKKLIDIEALGLSRLAA 234

Query: 143 LVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLYG 3
           +VLD+ PDVKGYSLFTLPQVRDEFWDL+ N F+Q + +G LR+ LYG
Sbjct: 235 IVLDVRPDVKGYSLFTLPQVRDEFWDLYKNCFHQRIAEGSLRLGLYG 281

>ref|XP_021290395.1| protein CMSS1 isoform X1 [Herrania umbratica]
          Length = 300

 Score = 288 bits (737), Expect = 3e-94
 Identities = 149/241 (61%), Positives = 184/241 (76%), Gaps = 4/241 (1%)
 Frame = -2

Query: 713 NEKVKRKGAGGGKNEDDR--RXXXXXKVSSLPSNESVSKEGAQEQLRFFVDQFEAAKGVQ 540
           ++K K K +N+ D+ V+ +N ++QL +F+ QF++A GVQ
Sbjct: 42  SKKKKTKNNHNHQNDTDQIETKTKTTVVTGNNNNPPPPPASPRQQLSYFLSQFQSANGVQ 101

Query: 539 LSSLELESLKDTCIVEVSQDADSDVKKLGNNIKVAFGESWKESLCEGKV--GKIPAGSPA 366
           LSSLELES+KD+CI++VSQ++ DV +L IK AFG WKE LCEGK+ GKI AGSP
Sbjct: 102 LSSLELESVKDSCILDVSQESGQDVMRLERYIKEAFGAKWKEELCEGKLIEGKIEAGSPT 161

Query: 365 VVVISSSALRSIHLLRGFHSITKECPPVKLFSKHIKIEEQISLLKNRVNIASGTPSRIKK 186
           V+V+++SALRSI LLRG S TKEC VKLFSKH+KI+EQ+SLLKNRVNIASGTPSRIKK
Sbjct: 162 VLVVAASALRSIELLRGMRSFTKECHAVKLFSKHMKIDEQVSLLKNRVNIASGTPSRIKK 221

Query: 185 LIDIEALGLSRLRVLVLDIHPDVKGYSLFTLPQVRDEFWDLFHNYFYQPMIQGHLRICLY 6
           LIDIEALGLSRL +++LDIH DVKGYSL TLPQVRDEFWDL+ NYF+Q ++QG LRICLY
Sbjct: 222 LIDIEALGLSRLSLILLDIHTDVKGYSLLTLPQVRDEFWDLYKNYFHQQVVQGDLRICLY 281

Query: 5   G 3
           G
Sbjct: 282 G 282