BLASTX nr result
ID: Catharanthus22_contig00012634
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus22_contig00012634 (1140 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EOY07220.1| Uncharacterized protein isoform 2 [Theobroma cacao] 350 5e-94 gb|EOY07219.1| Uncharacterized protein isoform 1 [Theobroma cacao] 350 5e-94 ref|XP_003530281.2| PREDICTED: UPF0420 protein C16orf58 homolog ... 342 2e-91 ref|XP_006363594.1| PREDICTED: UPF0420 protein C16orf58 homolog ... 342 2e-91 gb|ESW12345.1| hypothetical protein PHAVU_008G104800g [Phaseolus... 339 1e-90 ref|XP_002309136.2| hypothetical protein POPTR_0006s10060g [Popu... 338 2e-90 ref|XP_004148619.1| PREDICTED: UPF0420 protein C16orf58 homolog ... 337 4e-90 ref|XP_002528102.1| conserved hypothetical protein [Ricinus comm... 336 9e-90 ref|XP_004305128.1| PREDICTED: UPF0420 protein-like [Fragaria ve... 335 3e-89 ref|XP_006481025.1| PREDICTED: UPF0420 protein C16orf58 homolog ... 333 6e-89 ref|XP_004492435.1| PREDICTED: UPF0420 protein C16orf58-like iso... 331 3e-88 gb|EMJ08984.1| hypothetical protein PRUPE_ppa025851mg, partial [... 328 2e-87 ref|XP_003623296.1| hypothetical protein MTR_7g068310 [Medicago ... 325 2e-86 ref|XP_002274737.1| PREDICTED: UPF0420 protein-like [Vitis vinif... 321 3e-85 ref|XP_006602667.1| PREDICTED: UPF0420 protein C16orf58 homolog ... 318 2e-84 ref|XP_006602666.1| PREDICTED: UPF0420 protein C16orf58 homolog ... 318 2e-84 ref|XP_006602663.1| PREDICTED: UPF0420 protein C16orf58 homolog ... 318 2e-84 ref|XP_006398608.1| hypothetical protein EUTSA_v10013293mg [Eutr... 313 6e-83 ref|NP_195771.2| uncharacterized protein [Arabidopsis thaliana] ... 308 2e-81 ref|XP_002873007.1| hypothetical protein ARALYDRAFT_907999 [Arab... 307 5e-81 >gb|EOY07220.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 420 Score = 350 bits (899), Expect = 5e-94 Identities = 189/277 (68%), Positives = 206/277 (74%), Gaps = 1/277 (0%) Frame = -3 Query: 925 FEFLRKSLPFPVKPHRHFHTLCIPPNSESHGDGPDASRRSEATRKDTEGEGPVILVERYG 746 F RKS+ K RH L +S+ PD R SE+ + VIL+ERYG Sbjct: 16 FPSRRKSIE---KRLRHLQNL---HSSKEGQQEPDGDRNSES-------QDQVILLERYG 62 Query: 745 NGTTKRYEIRKDSRMSTFLEKHVPKSNGSLDSQTSEFDLSWLPQIVKDLFLPTGYPGSVS 566 NGT KRY + D ++ FL KH SN DS S +LSWLP I+KD LP G+PGSVS Sbjct: 63 NGTIKRYMLGDDLQIRAFLGKHDSTSNEFQDSHLSNPNLSWLPGILKDFILPAGFPGSVS 122 Query: 565 DDYLAYMLLQFPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXXXAIRWVSKDGIGAV 386 DDYL YMLLQFPTNVTGWIC TLVTSSLLKAVGVGSFSGT+ AIRWVSKDGIGAV Sbjct: 123 DDYLQYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFSGTSAAASAAAIRWVSKDGIGAV 182 Query: 385 GRLFIGGRFGNLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVAR 206 GRLFIGGRFGNLFDDDPKQWRMYADF+GSAGSIFDLTTQ+YPAYFLPLASLGNL KAVAR Sbjct: 183 GRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLTTQVYPAYFLPLASLGNLAKAVAR 242 Query: 205 GLKDPSFRVIQNHFAISGNLGEVAAK-VYWNIFFHIL 98 GLKDPSFRVIQNHFAISGNLGEVAAK W + +L Sbjct: 243 GLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVTAQLL 279 >gb|EOY07219.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 492 Score = 350 bits (899), Expect = 5e-94 Identities = 189/277 (68%), Positives = 206/277 (74%), Gaps = 1/277 (0%) Frame = -3 Query: 925 FEFLRKSLPFPVKPHRHFHTLCIPPNSESHGDGPDASRRSEATRKDTEGEGPVILVERYG 746 F RKS+ K RH L +S+ PD R SE+ + VIL+ERYG Sbjct: 16 FPSRRKSIE---KRLRHLQNL---HSSKEGQQEPDGDRNSES-------QDQVILLERYG 62 Query: 745 NGTTKRYEIRKDSRMSTFLEKHVPKSNGSLDSQTSEFDLSWLPQIVKDLFLPTGYPGSVS 566 NGT KRY + D ++ FL KH SN DS S +LSWLP I+KD LP G+PGSVS Sbjct: 63 NGTIKRYMLGDDLQIRAFLGKHDSTSNEFQDSHLSNPNLSWLPGILKDFILPAGFPGSVS 122 Query: 565 DDYLAYMLLQFPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXXXAIRWVSKDGIGAV 386 DDYL YMLLQFPTNVTGWIC TLVTSSLLKAVGVGSFSGT+ AIRWVSKDGIGAV Sbjct: 123 DDYLQYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFSGTSAAASAAAIRWVSKDGIGAV 182 Query: 385 GRLFIGGRFGNLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVAR 206 GRLFIGGRFGNLFDDDPKQWRMYADF+GSAGSIFDLTTQ+YPAYFLPLASLGNL KAVAR Sbjct: 183 GRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLTTQVYPAYFLPLASLGNLAKAVAR 242 Query: 205 GLKDPSFRVIQNHFAISGNLGEVAAK-VYWNIFFHIL 98 GLKDPSFRVIQNHFAISGNLGEVAAK W + +L Sbjct: 243 GLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVTAQLL 279 >ref|XP_003530281.2| PREDICTED: UPF0420 protein C16orf58 homolog [Glycine max] Length = 499 Score = 342 bits (877), Expect = 2e-91 Identities = 182/257 (70%), Positives = 199/257 (77%), Gaps = 2/257 (0%) Frame = -3 Query: 892 VKPHRHFHTLCIPPNSE-SHGDGPDASRRSEATRKDTEGEGPVILVERYGNGTTKRYEIR 716 V+P R F LC +S DG D ++R VILVERY NGT KRY + Sbjct: 26 VRP-RGFQFLCSSEHSSFKDEDGADNGGGQVSSR--------VILVERYSNGTAKRYVLG 76 Query: 715 KDSRMSTFL-EKHVPKSNGSLDSQTSEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLL 539 DS++ FL E+ N D +S+ LSWLP+I+KD LP G+PGSVSDDYL YMLL Sbjct: 77 DDSQLQAFLVEEDRSTPNRFQDLHSSDESLSWLPEIIKDFVLPAGFPGSVSDDYLDYMLL 136 Query: 538 QFPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXXXAIRWVSKDGIGAVGRLFIGGRF 359 QFPTNVTGWIC TLVTSSLLKAVG+GSF+GTT AIRWVSKDGIGAVGRLFIGGRF Sbjct: 137 QFPTNVTGWICHTLVTSSLLKAVGIGSFTGTTAAASAAAIRWVSKDGIGAVGRLFIGGRF 196 Query: 358 GNLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRV 179 G+LFDDDPKQWRMYADF+GSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRV Sbjct: 197 GSLFDDDPKQWRMYADFIGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRV 256 Query: 178 IQNHFAISGNLGEVAAK 128 IQNHFAISGNLGEVAAK Sbjct: 257 IQNHFAISGNLGEVAAK 273 >ref|XP_006363594.1| PREDICTED: UPF0420 protein C16orf58 homolog [Solanum tuberosum] Length = 504 Score = 342 bits (877), Expect = 2e-91 Identities = 175/250 (70%), Positives = 198/250 (79%), Gaps = 8/250 (3%) Frame = -3 Query: 823 DASRRSEATRKDTEGE---GPVILVERYGNGTTKRYEIRKDSRMSTFLEKHVPKSNGSLD 653 D+S+ E +D+ GE G VILVE+Y NGT KRY I DS M FLE+HVP ++ S D Sbjct: 41 DSSKSKEEAIQDSSGENDKGYVILVEKYRNGTLKRYVIDNDSEMKMFLEEHVPTTSRSQD 100 Query: 652 SQTSEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLLKA 473 S +LSWLP+++KD LP+G+P +VSDDYL YMLLQFPTNVTGWIC TLVTSSLLKA Sbjct: 101 LDISGMELSWLPKVIKDFVLPSGFPDTVSDDYLDYMLLQFPTNVTGWICHTLVTSSLLKA 160 Query: 472 VGVGSFSGTTXXXXXXA----IRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFV 305 VGVGSFSGT+ A IRWVSKDGIGA+GR FIGGRFGNLFDDDPKQWRMYADF+ Sbjct: 161 VGVGSFSGTSAAASAAASAAAIRWVSKDGIGALGRFFIGGRFGNLFDDDPKQWRMYADFI 220 Query: 304 GSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK- 128 GSAGSIFDL T LYP+YFLPLASLGNL KAVARGLKDPSFRVIQNHFAI+GNLG+VAAK Sbjct: 221 GSAGSIFDLCTPLYPSYFLPLASLGNLAKAVARGLKDPSFRVIQNHFAIAGNLGDVAAKE 280 Query: 127 VYWNIFFHIL 98 W + +L Sbjct: 281 EVWEVAAELL 290 >gb|ESW12345.1| hypothetical protein PHAVU_008G104800g [Phaseolus vulgaris] Length = 500 Score = 339 bits (870), Expect = 1e-90 Identities = 171/216 (79%), Positives = 182/216 (84%), Gaps = 2/216 (0%) Frame = -3 Query: 769 VILVERYGNGTTKRYEIRKDSRMSTFL--EKHVPKSNGSLDSQTSEFDLSWLPQIVKDLF 596 VILVERY NGT KRY + DS++ TFL E+ N DS + + LSWLP +KD Sbjct: 59 VILVERYSNGTAKRYVLGDDSKLQTFLVEEESSTTPNRFQDSHSPDERLSWLPDTIKDFI 118 Query: 595 LPTGYPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXXXAIR 416 LP+G+PGSVSDDYL YMLLQFPTNVTGWIC TLVTSSLLKAVGVGSFSG+T AIR Sbjct: 119 LPSGFPGSVSDDYLHYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFSGSTAAASAAAIR 178 Query: 415 WVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLAS 236 WVSKDGIGA GRLFIGGRFGNLFDDDPKQWRMYADF+GSAGSIFDLTTQLYP YFLPLAS Sbjct: 179 WVSKDGIGATGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLTTQLYPGYFLPLAS 238 Query: 235 LGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK 128 LGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK Sbjct: 239 LGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK 274 >ref|XP_002309136.2| hypothetical protein POPTR_0006s10060g [Populus trichocarpa] gi|550335903|gb|EEE92659.2| hypothetical protein POPTR_0006s10060g [Populus trichocarpa] Length = 500 Score = 338 bits (868), Expect = 2e-90 Identities = 182/297 (61%), Positives = 211/297 (71%), Gaps = 1/297 (0%) Frame = -3 Query: 964 LSSSLQPHTPHFRFEFLRKSLPFPVKPHRHFHTLCIPPNSESHGDGPDASRRSEATRKDT 785 +S LQ P FE S K HF TLC +S + + ++ Sbjct: 1 MSYPLQLSFPGLAFE---SSKTRTRKKAHHFQTLCC------------SSLQHPSLQEKP 45 Query: 784 EGEGPVILVERYGNGTTKRYEIRKDSRMSTFLEKHVPKSNGSLDSQTSEFDLSWLPQIVK 605 + E VIL+ERYGNGT KRY + ++ FLEK+ ++ +S+ SE LSWLP I+K Sbjct: 46 DNE--VILLERYGNGTAKRYTLDDAVQLQGFLEKNGSENRSFEESRLSEAGLSWLPDILK 103 Query: 604 DLFLPTGYPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXXX 425 D LP G+PGSVSDDYL YM+LQFPTN+TGWIC TLVTSSLLKAVG GSF+GT Sbjct: 104 DFILPAGFPGSVSDDYLQYMVLQFPTNITGWICHTLVTSSLLKAVGAGSFTGTDAAASAA 163 Query: 424 AIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLP 245 AIRWVSKDGIGA+GRLFIGGRFG+LFDDDPKQWRMYADF+GSAGSIFDLTTQ+YPAYFLP Sbjct: 164 AIRWVSKDGIGALGRLFIGGRFGDLFDDDPKQWRMYADFIGSAGSIFDLTTQVYPAYFLP 223 Query: 244 LASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK-VYWNIFFHILVICTSAL 77 LASLGNLTKAVARGLKDPSFRVIQNHFA+SGNLGEVAAK W + +L + L Sbjct: 224 LASLGNLTKAVARGLKDPSFRVIQNHFAVSGNLGEVAAKEEVWEVGAQLLGLALGIL 280 >ref|XP_004148619.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus] gi|449518467|ref|XP_004166263.1| PREDICTED: UPF0420 protein C16orf58 homolog [Cucumis sativus] Length = 495 Score = 337 bits (865), Expect = 4e-90 Identities = 173/263 (65%), Positives = 197/263 (74%), Gaps = 2/263 (0%) Frame = -3 Query: 859 IPPNSESHGDGPDASRRSEATRKDTEGEGPVILVERYGNGTTKRYEIRKDSRMSTFLEKH 680 +P + +G D SR R VILVE+YGN K+Y + + R+ FL++ Sbjct: 38 LPHREDDDKNGVDCSREQIQRR--------VILVEKYGNSALKKYFLDDNQRLQFFLDEQ 89 Query: 679 V-PKSNGSLDSQTSEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLLQFPTNVTGWICR 503 P SNG +S+ SE LSWLP ++KD LPTG+P SVSDDYL YM+ QFPTNVTGWIC Sbjct: 90 TSPTSNGFKESRFSETKLSWLPGLIKDFILPTGFPESVSDDYLQYMIRQFPTNVTGWICH 149 Query: 502 TLVTSSLLKAVGVGSFSGTTXXXXXXAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWR 323 TLVTSSLLKAVG+GSFSGTT AIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWR Sbjct: 150 TLVTSSLLKAVGIGSFSGTTTAASAVAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWR 209 Query: 322 MYADFVGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLG 143 MYADF+GSAGSIFDL T LYP+YFLPLASLGNLTKAVARGLKDPSFRVIQNHFA+SGNLG Sbjct: 210 MYADFIGSAGSIFDLATPLYPSYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAVSGNLG 269 Query: 142 EVAAK-VYWNIFFHILVICTSAL 77 E+AAK W + +L + L Sbjct: 270 EIAAKEEVWEVVAQLLGLAIGIL 292 >ref|XP_002528102.1| conserved hypothetical protein [Ricinus communis] gi|223532491|gb|EEF34281.1| conserved hypothetical protein [Ricinus communis] Length = 485 Score = 336 bits (862), Expect = 9e-90 Identities = 170/246 (69%), Positives = 197/246 (80%), Gaps = 3/246 (1%) Frame = -3 Query: 805 EATRKDTEGEGPVILVERYGNGTTKRYEIRKDSRMSTFLEKHVPKSNGSLDSQTSEFDLS 626 +A + +T VILVERY NGT++RY + D+++ FLE+ K++ +S +S+ +LS Sbjct: 33 QAGKDETNNCRNVILVERYANGTSRRYVLDDDAQLKPFLEEQGAKNSALQESYSSDINLS 92 Query: 625 WLPQIVKDLFLPTGYPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLLKAVGVGSFSGT 446 WLP I+KD LP G+PGSVSDDY YMLLQFPTNVTGWIC TLVTSSLLKAVGVGSF+G+ Sbjct: 93 WLPYIIKDFILPAGFPGSVSDDYFQYMLLQFPTNVTGWICHTLVTSSLLKAVGVGSFTGS 152 Query: 445 TXXXXXXA--IRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFVGSAGSIFDLTT 272 T A IRWVSKDGIGA+GRLFIGGRFG+LFDDDPKQWRMYADF+GSAGSIFDL T Sbjct: 153 TAAAAASAAAIRWVSKDGIGALGRLFIGGRFGSLFDDDPKQWRMYADFIGSAGSIFDLIT 212 Query: 271 QLYPAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK-VYWNIFFHILV 95 Q+YPAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK W + +L Sbjct: 213 QVYPAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVGAQLLG 272 Query: 94 ICTSAL 77 + L Sbjct: 273 LALGIL 278 >ref|XP_004305128.1| PREDICTED: UPF0420 protein-like [Fragaria vesca subsp. vesca] Length = 489 Score = 335 bits (858), Expect = 3e-89 Identities = 174/259 (67%), Positives = 198/259 (76%), Gaps = 2/259 (0%) Frame = -3 Query: 868 TLCIPPNSESHGDGPDASRRSEATRKDTEGEGPVILVERYGNGTTKRYEIRKDSRMSTFL 689 T C +S+S +G + G+ VILVERYG+GT KRY + + ++ TF+ Sbjct: 29 TCCSSSSSQSDDNGGN------------RGQPHVILVERYGDGTAKRYLVDDELQVQTFV 76 Query: 688 EKHVPKSNG-SLDSQTSEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLLQFPTNVTGW 512 E+ PK + S S S +LSWLP IVKD P G+PGSVSDDYL YMLLQFPTNVT W Sbjct: 77 EEPSPKPDTTSHSSHFSNTELSWLPDIVKDFIFPAGFPGSVSDDYLLYMLLQFPTNVTAW 136 Query: 511 ICRTLVTSSLLKAVGVGSFSGTTXXXXXXAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPK 332 IC+TLVTSSLLKAVGVGSFSG+T AIRWVSKDGIGAVGR FIGGRFGNLFDDDPK Sbjct: 137 ICQTLVTSSLLKAVGVGSFSGSTAAASAAAIRWVSKDGIGAVGRFFIGGRFGNLFDDDPK 196 Query: 331 QWRMYADFVGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISG 152 QWR+YADF+GSAGSIFDLTT LYPAYFLPLASLGNLTKAVARGLKDPSFRVIQ+HFAISG Sbjct: 197 QWRLYADFIGSAGSIFDLTTPLYPAYFLPLASLGNLTKAVARGLKDPSFRVIQSHFAISG 256 Query: 151 NLGEVAAK-VYWNIFFHIL 98 NLG++AAK W + +L Sbjct: 257 NLGDIAAKEEVWEVTAQLL 275 >ref|XP_006481025.1| PREDICTED: UPF0420 protein C16orf58 homolog [Citrus sinensis] Length = 497 Score = 333 bits (855), Expect = 6e-89 Identities = 165/232 (71%), Positives = 188/232 (81%), Gaps = 1/232 (0%) Frame = -3 Query: 820 ASRRSEATRKDTEGEGPVILVERYGNGTTKRYEIRKDSRMSTFLEKHVPKSNGSLD-SQT 644 + EA + + V+LVERYGNGT +R+ + + ++ TF H P ++ L SQ Sbjct: 40 SEEEDEAGNGRAQSQQHVVLVERYGNGTARRFILDDEWQVQTFDADHDPTTDTRLQGSQF 99 Query: 643 SEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLLKAVGV 464 S+ +LSWLP +VKD LP G+PGSVSDDYL YMLLQFPTNVTGWIC +VTSSLLKAVG+ Sbjct: 100 SDTNLSWLPSVVKDFLLPAGFPGSVSDDYLGYMLLQFPTNVTGWICHAIVTSSLLKAVGI 159 Query: 463 GSFSGTTXXXXXXAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFVGSAGSIF 284 SFSGTT AI+W+SKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADF+GSAGSIF Sbjct: 160 DSFSGTTAAASAAAIKWISKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIF 219 Query: 283 DLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK 128 DL TQ+YPAYFLPLASLGNL+KAVARGLKDPSFRVIQNHFAISGNLGEVAAK Sbjct: 220 DLATQVYPAYFLPLASLGNLSKAVARGLKDPSFRVIQNHFAISGNLGEVAAK 271 >ref|XP_004492435.1| PREDICTED: UPF0420 protein C16orf58-like isoform X1 [Cicer arietinum] Length = 493 Score = 331 bits (849), Expect = 3e-88 Identities = 167/220 (75%), Positives = 183/220 (83%), Gaps = 1/220 (0%) Frame = -3 Query: 784 EGEGPVILVERYGNGTTKRYEIRKDSRMSTFL-EKHVPKSNGSLDSQTSEFDLSWLPQIV 608 EG VILVERY NGT KRY + D ++ T L E+ +N S + + LSWLP+++ Sbjct: 49 EGLSRVILVERYSNGTAKRYVLGDDLQLRTILIEEDRSMANRFGVSHSPDKRLSWLPKMI 108 Query: 607 KDLFLPTGYPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXX 428 KD LP G+P SVSDDYL YMLLQFPTNVTGWIC T+VTSSLLKAVG+GSFSGTT Sbjct: 109 KDFILPAGFPASVSDDYLQYMLLQFPTNVTGWICHTIVTSSLLKAVGIGSFSGTTAAASA 168 Query: 427 XAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFL 248 AIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADF+GSAGSIFDLTTQLYPAYFL Sbjct: 169 AAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGSIFDLTTQLYPAYFL 228 Query: 247 PLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK 128 PLASLGNLTKA+ARGLKDPSFRVIQNHFAIS N+GEVAAK Sbjct: 229 PLASLGNLTKAIARGLKDPSFRVIQNHFAISSNVGEVAAK 268 >gb|EMJ08984.1| hypothetical protein PRUPE_ppa025851mg, partial [Prunus persica] Length = 443 Score = 328 bits (842), Expect = 2e-87 Identities = 166/226 (73%), Positives = 181/226 (80%), Gaps = 2/226 (0%) Frame = -3 Query: 769 VILVERYGNGTTKRYEIRKDSRMSTFLEKHVPK-SNGSLDSQTSEFDLSWLPQIVKDLFL 593 VILVERYGNGT KRY + D ++ F+E+ SN S S S LSWLP IVKD Sbjct: 4 VILVERYGNGTAKRYVVDDDLKVQNFVEEERSLLSNNSESSHFSNSTLSWLPDIVKDFIF 63 Query: 592 PTGYPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXXXAIRW 413 P G+PGSVSDDYL YMLLQFPTNVT WIC TLVTSSLLKAVGVGSFSG+T AIRW Sbjct: 64 PAGFPGSVSDDYLLYMLLQFPTNVTAWICHTLVTSSLLKAVGVGSFSGSTAAASAAAIRW 123 Query: 412 VSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLASL 233 VSKDGIGAVGRLF+GGRFGN+FDDDPKQWR+YADF+GSAGSIFDLTT LYPAYFLPLASL Sbjct: 124 VSKDGIGAVGRLFVGGRFGNVFDDDPKQWRLYADFIGSAGSIFDLTTPLYPAYFLPLASL 183 Query: 232 GNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK-VYWNIFFHIL 98 GNL KAVARGLKDPS RVIQNHFA+ GNLGE+AAK W + +L Sbjct: 184 GNLAKAVARGLKDPSNRVIQNHFAVEGNLGEIAAKEEVWEVAAQLL 229 >ref|XP_003623296.1| hypothetical protein MTR_7g068310 [Medicago truncatula] gi|355498311|gb|AES79514.1| hypothetical protein MTR_7g068310 [Medicago truncatula] Length = 492 Score = 325 bits (833), Expect = 2e-86 Identities = 169/234 (72%), Positives = 182/234 (77%), Gaps = 3/234 (1%) Frame = -3 Query: 820 ASRRSEATRKDTEGEGPVILVERYGNGTTKRYEIRKDSRMSTFL---EKHVPKSNGSLDS 650 +S + + + EG VILVERY NGT KRY I DSR+ T L ++ G L S Sbjct: 36 SSFKDDDVNEGGEGLSRVILVERYSNGTAKRYIIGDDSRLRTILIEEDRSTQNRFGVLHS 95 Query: 649 QTSEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLLKAV 470 LSWLP VK LP G+PGSVSDDYL YMLLQFPTNVTGWIC T+VTSSLLKAV Sbjct: 96 PDKR--LSWLPDTVKAFILPAGFPGSVSDDYLQYMLLQFPTNVTGWICHTIVTSSLLKAV 153 Query: 469 GVGSFSGTTXXXXXXAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFVGSAGS 290 GVGSFSGTT AIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADF+GSAGS Sbjct: 154 GVGSFSGTTAAASAAAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFIGSAGS 213 Query: 289 IFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK 128 IFDLTT LYP YFLPLASLGNLTKA+ARGLKDPS RVIQ+HFAIS NLGE+AAK Sbjct: 214 IFDLTTPLYPGYFLPLASLGNLTKAIARGLKDPSSRVIQSHFAISANLGEIAAK 267 >ref|XP_002274737.1| PREDICTED: UPF0420 protein-like [Vitis vinifera] Length = 503 Score = 321 bits (823), Expect = 3e-85 Identities = 179/287 (62%), Positives = 202/287 (70%), Gaps = 10/287 (3%) Frame = -3 Query: 928 RFEFLRKSLPFPVKPHRH----FHTLCIPPNSESHGDGPDASRRSEATRKDTEGEGPVIL 761 +F F P+K R F +C S+S +G + +A K + VIL Sbjct: 6 QFSFPVSGFQTPLKIRRRKFGDFGIVCSSTLSDSPEEGQEIG---DAGNKRGQCPQHVIL 62 Query: 760 VERYGNGTTK-RYEIRKDSRMSTFLEKHVPKSNGSLDSQTSEFDLSWLPQIVKDLFLPTG 584 +E+Y NGT K R+ + D ++ TFLE+ K+ S S+ LSWLP IVKD LP G Sbjct: 63 LEKYNNGTAKSRFILDDDIQIQTFLEEEGSKTERVQGSSFSDTQLSWLPIIVKDFILPAG 122 Query: 583 YPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXXXA----IR 416 +PGSVSDDYL YMLLQFPTNVT WIC TLVTSSLLKAVGVGSFS TT A IR Sbjct: 123 FPGSVSDDYLEYMLLQFPTNVTAWICHTLVTSSLLKAVGVGSFSATTAAASAAASAAAIR 182 Query: 415 WVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLAS 236 WVSKDGIGAVGRLFIGG+FGNLFDDDPKQWRMYAD +GSAGSIFDL+TQLYPAYFL LAS Sbjct: 183 WVSKDGIGAVGRLFIGGQFGNLFDDDPKQWRMYADLIGSAGSIFDLSTQLYPAYFLQLAS 242 Query: 235 LGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK-VYWNIFFHIL 98 LGNL KAVARGLKDPSFRVIQNHFAISGNLGEVAAK W + +L Sbjct: 243 LGNLAKAVARGLKDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLL 289 >ref|XP_006602667.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform X5 [Glycine max] Length = 325 Score = 318 bits (816), Expect = 2e-84 Identities = 173/274 (63%), Positives = 194/274 (70%), Gaps = 2/274 (0%) Frame = -3 Query: 892 VKPHRHFHTLCIPPNSESHGDGPDASRRSEATRKDTEGEGPVILVERYGNGTTKRYEIRK 713 V+P R F LC S H D +A + VI VERY NGT KR + Sbjct: 9 VRP-RGFQILC----SSEHSSFKD---EDDAENGGGQVSSRVIQVERYSNGTAKRCVLGD 60 Query: 712 DSRMSTFL-EKHVPKSNGSLDSQTSEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLLQ 536 D ++ TFL E+ DS + + LSWLP +KD LP G+PGSVSDDYL YMLLQ Sbjct: 61 DLQLQTFLVEEDTSTPKRFQDSYSPDESLSWLPDTIKDFILPAGFPGSVSDDYLDYMLLQ 120 Query: 535 FPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXXXAIRWVSKDGIGAVGRLFIGGRFG 356 FPTNVTGWIC TLVTSSLLKAVG+GSFSGT+ AIRWVSKDGIGAVGRL +GGRFG Sbjct: 121 FPTNVTGWICHTLVTSSLLKAVGIGSFSGTSATASASAIRWVSKDGIGAVGRLCLGGRFG 180 Query: 355 NLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVI 176 +LFDDDPKQWRMYADF+GSAGSIF LTTQ+YP YFLPLASLGNLTKAVARGLKDPSF VI Sbjct: 181 SLFDDDPKQWRMYADFIGSAGSIFYLTTQVYPDYFLPLASLGNLTKAVARGLKDPSFCVI 240 Query: 175 QNHFAISGNLGEVAAK-VYWNIFFHILVICTSAL 77 QNHFAISGNLGEVAAK W + ++ + L Sbjct: 241 QNHFAISGNLGEVAAKEEIWEVVAQLIGLALGIL 274 >ref|XP_006602666.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform X4 [Glycine max] Length = 367 Score = 318 bits (816), Expect = 2e-84 Identities = 173/274 (63%), Positives = 194/274 (70%), Gaps = 2/274 (0%) Frame = -3 Query: 892 VKPHRHFHTLCIPPNSESHGDGPDASRRSEATRKDTEGEGPVILVERYGNGTTKRYEIRK 713 V+P R F LC S H D +A + VI VERY NGT KR + Sbjct: 9 VRP-RGFQILC----SSEHSSFKD---EDDAENGGGQVSSRVIQVERYSNGTAKRCVLGD 60 Query: 712 DSRMSTFL-EKHVPKSNGSLDSQTSEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLLQ 536 D ++ TFL E+ DS + + LSWLP +KD LP G+PGSVSDDYL YMLLQ Sbjct: 61 DLQLQTFLVEEDTSTPKRFQDSYSPDESLSWLPDTIKDFILPAGFPGSVSDDYLDYMLLQ 120 Query: 535 FPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXXXAIRWVSKDGIGAVGRLFIGGRFG 356 FPTNVTGWIC TLVTSSLLKAVG+GSFSGT+ AIRWVSKDGIGAVGRL +GGRFG Sbjct: 121 FPTNVTGWICHTLVTSSLLKAVGIGSFSGTSATASASAIRWVSKDGIGAVGRLCLGGRFG 180 Query: 355 NLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVI 176 +LFDDDPKQWRMYADF+GSAGSIF LTTQ+YP YFLPLASLGNLTKAVARGLKDPSF VI Sbjct: 181 SLFDDDPKQWRMYADFIGSAGSIFYLTTQVYPDYFLPLASLGNLTKAVARGLKDPSFCVI 240 Query: 175 QNHFAISGNLGEVAAK-VYWNIFFHILVICTSAL 77 QNHFAISGNLGEVAAK W + ++ + L Sbjct: 241 QNHFAISGNLGEVAAKEEIWEVVAQLIGLALGIL 274 >ref|XP_006602663.1| PREDICTED: UPF0420 protein C16orf58 homolog isoform X1 [Glycine max] Length = 415 Score = 318 bits (816), Expect = 2e-84 Identities = 173/274 (63%), Positives = 194/274 (70%), Gaps = 2/274 (0%) Frame = -3 Query: 892 VKPHRHFHTLCIPPNSESHGDGPDASRRSEATRKDTEGEGPVILVERYGNGTTKRYEIRK 713 V+P R F LC S H D +A + VI VERY NGT KR + Sbjct: 9 VRP-RGFQILC----SSEHSSFKD---EDDAENGGGQVSSRVIQVERYSNGTAKRCVLGD 60 Query: 712 DSRMSTFL-EKHVPKSNGSLDSQTSEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLLQ 536 D ++ TFL E+ DS + + LSWLP +KD LP G+PGSVSDDYL YMLLQ Sbjct: 61 DLQLQTFLVEEDTSTPKRFQDSYSPDESLSWLPDTIKDFILPAGFPGSVSDDYLDYMLLQ 120 Query: 535 FPTNVTGWICRTLVTSSLLKAVGVGSFSGTTXXXXXXAIRWVSKDGIGAVGRLFIGGRFG 356 FPTNVTGWIC TLVTSSLLKAVG+GSFSGT+ AIRWVSKDGIGAVGRL +GGRFG Sbjct: 121 FPTNVTGWICHTLVTSSLLKAVGIGSFSGTSATASASAIRWVSKDGIGAVGRLCLGGRFG 180 Query: 355 NLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVI 176 +LFDDDPKQWRMYADF+GSAGSIF LTTQ+YP YFLPLASLGNLTKAVARGLKDPSF VI Sbjct: 181 SLFDDDPKQWRMYADFIGSAGSIFYLTTQVYPDYFLPLASLGNLTKAVARGLKDPSFCVI 240 Query: 175 QNHFAISGNLGEVAAK-VYWNIFFHILVICTSAL 77 QNHFAISGNLGEVAAK W + ++ + L Sbjct: 241 QNHFAISGNLGEVAAKEEIWEVVAQLIGLALGIL 274 >ref|XP_006398608.1| hypothetical protein EUTSA_v10013293mg [Eutrema salsugineum] gi|557099698|gb|ESQ40061.1| hypothetical protein EUTSA_v10013293mg [Eutrema salsugineum] Length = 509 Score = 313 bits (803), Expect = 6e-83 Identities = 165/241 (68%), Positives = 186/241 (77%), Gaps = 6/241 (2%) Frame = -3 Query: 832 DGPDASRRSEATRKDTEGEGPVILVERYGNGTTKRYEIRKD-SRMSTFLEKHVPKSNG-S 659 +G + EA K +G ++ VERYGNGT+KRY + D S + FLE+ PK + S Sbjct: 42 EGEEDEGEEEANDKRVQGLVSIV-VERYGNGTSKRYLLDDDDSPLRGFLEEREPKPDDKS 100 Query: 658 LDSQTSEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLL 479 +S +SE ++ WLP +VKD PTG+PGSVSDDYL YML QFPTN+TGWIC LVTSSLL Sbjct: 101 QESNSSETNMLWLPDVVKDFVFPTGFPGSVSDDYLDYMLWQFPTNITGWICNVLVTSSLL 160 Query: 478 KAVGVGSFSGT----TXXXXXXAIRWVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYAD 311 KAVGVGSFSGT T AIRWVSKDGIGA+GRL IGGRFG+LFDDDPKQWRMYAD Sbjct: 161 KAVGVGSFSGTSAAATAAASAAAIRWVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYAD 220 Query: 310 FVGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAA 131 F+GSAGS FDL TQLYPA FL LAS GNL KAVARGL+DPSFRVIQNHFAISGNLGEVAA Sbjct: 221 FIGSAGSFFDLATQLYPAQFLLLASTGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAA 280 Query: 130 K 128 K Sbjct: 281 K 281 >ref|NP_195771.2| uncharacterized protein [Arabidopsis thaliana] gi|209863158|gb|ACI88737.1| At5g01510 [Arabidopsis thaliana] gi|332002971|gb|AED90354.1| uncharacterized protein AT5G01510 [Arabidopsis thaliana] Length = 509 Score = 308 bits (790), Expect = 2e-81 Identities = 173/285 (60%), Positives = 194/285 (68%), Gaps = 14/285 (4%) Frame = -3 Query: 889 KPHRHFHTLCIPPNSESHGDGPDASRRSEATRKDTEGEGPVILVERYGNGTTKRYEIRKD 710 K R H C S D DA R + I+VERYGNGT+KRY + D Sbjct: 25 KRRRVEHLRCSAQPSSIREDDEDADDRRVGVERRIS-----IVVERYGNGTSKRYFLDDD 79 Query: 709 -SRMSTFLEKHVPK-SNGSLDSQTSEFDLSWLPQIVKDLFLPTGYPGSVSDDYLAYMLLQ 536 S + LE+ K N S S +SE ++ WLP +V+D P+G+PGSVSDDYL YML Q Sbjct: 80 DSPLQGILEERETKPDNNSQSSNSSETNILWLPDVVRDFVFPSGFPGSVSDDYLDYMLWQ 139 Query: 535 FPTNVTGWICRTLVTSSLLKAVGVGSFSGT----TXXXXXXAIRWVSKDGIGAVGRLFIG 368 FPTN+TGWIC LVTSSLLKAVGVGSFSGT T AIRWVSKDGIGA+GRL IG Sbjct: 140 FPTNITGWICNVLVTSSLLKAVGVGSFSGTSAAATAAASAAAIRWVSKDGIGALGRLLIG 199 Query: 367 GRFGNLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLASLGNLTKAVARGLKDPS 188 GRFG+LFDDDPKQWRMYADF+GSAGS FDL TQLYP+ FL LAS GNL KAVARGL+DPS Sbjct: 200 GRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFLLLASTGNLAKAVARGLRDPS 259 Query: 187 FRVIQNHFAISGNLGEVAAK-VYWNIF-------FHILVICTSAL 77 FRVIQNHFAISGNLGEVAAK W + F IL+I T L Sbjct: 260 FRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGILIIDTPGL 304 >ref|XP_002873007.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp. lyrata] gi|297318844|gb|EFH49266.1| hypothetical protein ARALYDRAFT_907999 [Arabidopsis lyrata subsp. lyrata] Length = 510 Score = 307 bits (787), Expect = 5e-81 Identities = 179/301 (59%), Positives = 198/301 (65%), Gaps = 21/301 (6%) Frame = -3 Query: 916 LRKSLPFPVKPHRHFHTLCIPPNS--ESHGDGPDASRRSEATRKDTEGEGPV----ILVE 755 LR LP + R + C P E S R + D G I+VE Sbjct: 5 LRFPLPLHIPQTRTMSSSCQPKRRRLEHLRCSAQPSLREDDEEADDRSVGVARRISIVVE 64 Query: 754 RYGNGTTKRYEIRKD--SRMSTFLEKHVPK-SNGSLDSQTSEFDLSWLPQIVKDLFLPTG 584 RYGNGT+KRY + D S + FLE+ K N S S +SE + WLP +VKD PTG Sbjct: 65 RYGNGTSKRYFLDDDDDSPLQGFLEERELKPDNDSQSSDSSETNTLWLPDVVKDFVFPTG 124 Query: 583 YPGSVSDDYLAYMLLQFPTNVTGWICRTLVTSSLLKAVGVGSFSGT----TXXXXXXAIR 416 +P SVSDDYL YML QFPTNVTGWIC LVTSSLLKAVGVGSFSGT T AIR Sbjct: 125 FPASVSDDYLDYMLWQFPTNVTGWICNVLVTSSLLKAVGVGSFSGTSAAATAAASAAAIR 184 Query: 415 WVSKDGIGAVGRLFIGGRFGNLFDDDPKQWRMYADFVGSAGSIFDLTTQLYPAYFLPLAS 236 WVSKDGIGA+GRL IGGRFG+LFDDDPKQWRMYADF+GSAGS FDL TQLYP+ FL LAS Sbjct: 185 WVSKDGIGALGRLLIGGRFGSLFDDDPKQWRMYADFIGSAGSFFDLATQLYPSQFLLLAS 244 Query: 235 LGNLTKAVARGLKDPSFRVIQNHFAISGNLGEVAAK-VYWNIF-------FHILVICTSA 80 GNL KAVARGL+DPSFRVIQNHFAISGNLGEVAAK W + F IL+I T Sbjct: 245 TGNLAKAVARGLRDPSFRVIQNHFAISGNLGEVAAKEEVWEVAAQLIGLGFGILIIDTPG 304 Query: 79 L 77 L Sbjct: 305 L 305