BLASTX nr result
ID: Angelica23_contig00030754
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00030754
         (789 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                  Score    E
Sequences producing significant alignments:                       (bits) Value

emb|CBI37806.3| unnamed protein product [Vitis vinifera]             292   6e-77
ref|XP_002278562.1| PREDICTED: uncharacterized protein LOC100258...  292   6e-77
ref|XP_002312932.1| predicted protein [Populus trichocarpa] gi|2...  274   2e-71
ref|XP_002528430.1| conserved hypothetical protein [Ricinus comm...  267   2e-69
ref|NP_195557.2| uncharacterized protein [Arabidopsis thaliana] ...  258   1e-66

>emb|CBI37806.3| unnamed protein product [Vitis vinifera]
          Length = 1505

 Score = 292 bits (747), Expect = 6e-77
 Identities = 148/226 (65%), Positives = 179/226 (79%), Gaps = 5/226 (2%)
 Frame = +2

Query: 26 GEMNSLELLRFHSDVHESCSTFIETLIEQFAAVSYGDMIYGRQVAIYLHRCVEAPVRLAA 205
       GE NS+E LRF SD+HES STFIETL+EQFAA+SYGD+IYGRQVAIYLHR VEAPVRLAA
Sbjct: 1287 GEKNSIEFLRFQSDIHESYSTFIETLVEQFAAISYGDLIYGRQVAIYLHRSVEAPVRLAA 1346

Query: 206 WNSLSNARVLELLPPIEKCIANAEGYLEPAEDNAKILEAYVKSWTSGALDRAVGRGSVAF 385
       WN+LSNARVLELLPP+EKC A+AEGYLEP E+N ILEAYVKSW +GALDRA RGSV F
Sbjct: 1347 WNALSNARVLELLPPLEKCSADAEGYLEPVENNEGILEAYVKSWVTGALDRAATRGSVTF 1406

Query: 386 TIVLHHLASLIFGNPLGDRVTVRNKLVKSLLRDYSGKKQHQSMMMDLIRYKRPSDDQKHG 565
       T+VLHHL+S+IF + ++++RNKL KSLLRDYS K+QH+ +M+ L+RY
Sbjct: 1407 TLVLHHLSSVIFEDDADVKLSLRNKLAKSLLRDYSRKRQHEGLMLQLLRY---------N 1457

Query: 566 KELVVPQMD-----EFDKRFDLLKQACEGSTSLLSEVDKLESSFRK 688
       K+ PQ + E +KRF L +ACEG+ SLL EV+KL+SSFR+
Sbjct: 1458 KQFASPQPEWMKEGETEKRFRFLTEACEGNASLLKEVEKLKSSFRQ 1503

>ref|XP_002278562.1| PREDICTED: uncharacterized protein LOC100258889 [Vitis vinifera]
          Length = 1602

 Score = 292 bits (747), Expect = 6e-77
 Identities = 148/226 (65%), Positives = 179/226 (79%), Gaps = 5/226 (2%)
 Frame = +2

Query: 26 GEMNSLELLRFHSDVHESCSTFIETLIEQFAAVSYGDMIYGRQVAIYLHRCVEAPVRLAA 205
       GE NS+E LRF SD+HES STFIETL+EQFAA+SYGD+IYGRQVAIYLHR VEAPVRLAA
Sbjct: 1384 GEKNSIEFLRFQSDIHESYSTFIETLVEQFAAISYGDLIYGRQVAIYLHRSVEAPVRLAA 1443

Query: 206 WNSLSNARVLELLPPIEKCIANAEGYLEPAEDNAKILEAYVKSWTSGALDRAVGRGSVAF 385
       WN+LSNARVLELLPP+EKC A+AEGYLEP E+N ILEAYVKSW +GALDRA RGSV F
Sbjct: 1444 WNALSNARVLELLPPLEKCSADAEGYLEPVENNEGILEAYVKSWVTGALDRAATRGSVTF 1503

Query: 386 TIVLHHLASLIFGNPLGDRVTVRNKLVKSLLRDYSGKKQHQSMMMDLIRYKRPSDDQKHG 565
       T+VLHHL+S+IF + ++++RNKL KSLLRDYS K+QH+ +M+ L+RY
Sbjct: 1504 TLVLHHLSSVIFEDDADVKLSLRNKLAKSLLRDYSRKRQHEGLMLQLLRY---------N 1554

Query: 566 KELVVPQMD-----EFDKRFDLLKQACEGSTSLLSEVDKLESSFRK 688
       K+ PQ + E +KRF L +ACEG+ SLL EV+KL+SSFR+
Sbjct: 1555 KQFASPQPEWMKEGETEKRFRFLTEACEGNASLLKEVEKLKSSFRQ 1600

>ref|XP_002312932.1| predicted protein [Populus trichocarpa]
 gi|222849340|gb|EEE86887.1| predicted protein [Populus trichocarpa]
          Length = 1530

 Score = 274 bits (700), Expect = 2e-71
 Identities = 132/217 (60%), Positives = 171/217 (78%)
 Frame = +2

Query: 38 SLELLRFHSDVHESCSTFIETLIEQFAAVSYGDMIYGRQVAIYLHRCVEAPVRLAAWNSL 217
       S LRF S++HES STF+ETL+EQFA++SYGD+I+GRQVA+YLHRC E PVRLAAWN L
Sbjct: 1307 SRSFLRFQSEIHESYSTFLETLVEQFASISYGDIIFGRQVAVYLHRCTETPVRLAAWNGL 1366

Query: 218 SNARVLELLPPIEKCIANAEGYLEPAEDNAKILEAYVKSWTSGALDRAVGRGSVAFTIVL 397
       +NA VLE+LPP+EKC A AEGYLEP EDN ILEAYVK+W SGALDRA RGS+AFT+VL
Sbjct: 1367 ANAHVLEILPPLEKCFAEAEGYLEPVEDNEGILEAYVKAWVSGALDRAATRGSMAFTLVL 1426

Query: 398 HHLASLIFGNPLGDRVTVRNKLVKSLLRDYSGKKQHQSMMMDLIRYKRPSDDQKHGKELV 577
       HHL+S IF D++T+RNKL KSLLRDYS K++H+ +M++L+ Y + S +E +
Sbjct: 1427 HHLSSFIFLFHANDKITLRNKLAKSLLRDYSKKQRHEGIMLELVCYYKLSSRLPEKQEGL 1486

Query: 578 VPQMDEFDKRFDLLKQACEGSTSLLSEVDKLESSFRK 688
       Q + +KRF++L +AC+ +SLL EV+KL+S+F K
Sbjct: 1487 PLQASDIEKRFEVLVEACDRDSSLLIEVEKLKSAFVK 1523

>ref|XP_002528430.1| conserved hypothetical protein [Ricinus communis]
 gi|223532166|gb|EEF33972.1| conserved hypothetical protein [Ricinus communis]
          Length = 1552

 Score = 267 bits (682), Expect = 2e-69
 Identities = 131/219 (59%), Positives = 173/219 (78%)
 Frame = +2

Query: 41 LELLRFHSDVHESCSTFIETLIEQFAAVSYGDMIYGRQVAIYLHRCVEAPVRLAAWNSLS 220
       +ELLRF S++HES STF+ETL+EQFAAVSYGD+I+GRQV++YLHRC EA +RL AWN+LS
Sbjct: 1334 VELLRFQSEIHESYSTFLETLVEQFAAVSYGDLIFGRQVSLYLHRCNEAAMRLYAWNALS 1393

Query: 221 NARVLELLPPIEKCIANAEGYLEPAEDNAKILEAYVKSWTSGALDRAVGRGSVAFTIVLH 400
       NARV E+LPP++KCIA A+GYLEP EDN ILEAYVKSW SGALD++ RGS+A +VLH
Sbjct: 1394 NARVFEILPPLDKCIAEADGYLEPIEDNEDILEAYVKSWISGALDKSAARGSMALHLVLH 1453

Query: 401 HLASLIFGNPLGDRVTVRNKLVKSLLRDYSGKKQHQSMMMDLIRYKRPSDDQKHGKELVV 580
       HL+S IF D++++RNKLVKSLL D S K++H+ MM++LI+Y +PS Q + L +
Sbjct: 1454 HLSSFIFLIHSHDKISLRNKLVKSLLLDCSQKQKHRVMMLELIQYSKPSTSQSPVEGLSL 1513

Query: 581 PQMDEFDKRFDLLKQACEGSTSLLSEVDKLESSFRKVAN 697
       + +KRF++L +ACE +SLL+EV+ L S+F K N
Sbjct: 1514 RNNNSTEKRFEVLVEACERDSSLLAEVENLRSAFVKKLN 1552

>ref|NP_195557.2| uncharacterized protein [Arabidopsis thaliana]
 gi|26449867|dbj|BAC42056.1| unknown protein [Arabidopsis thaliana]
 gi|28973069|gb|AAO63859.1| unknown protein [Arabidopsis thaliana]
 gi|332661529|gb|AEE86929.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 1465

 Score = 258 bits (659), Expect = 1e-66
 Identities = 124/226 (54%), Positives = 175/226 (77%)
 Frame = +2

Query: 2 DESWLLHKGEMNSLELLRFHSDVHESCSTFIETLIEQFAAVSYGDMIYGRQVAIYLHRCV 181
       DE+ L H+ ELLRF SD+HE+ STF+E ++EQ+AAVSYGD++YGRQV++YLH+CV
Sbjct: 1242 DEARLNHR----DTELLRFKSDIHENYSTFLEMVVEQYAAVSYGDVVYGRQVSVYLHQCV 1297

Query: 182 EAPVRLAAWNSLSNARVLELLPPIEKCIANAEGYLEPAEDNAKILEAYVKSWTSGALDRA 361
       E VRL+AW LSNARVLELLP ++KC+ A+GYLEP E+N +LEAY+KSWT GALDRA
Sbjct: 1298 EHSVRLSAWTVLSNARVLELLPSLDKCLGEADGYLEPVEENEAVLEAYLKSWTCGALDRA 1357

Query: 362 VGRGSVAFTIVLHHLASLIFGNPLGDRVTVRNKLVKSLLRDYSGKKQHQSMMMDLIRYKR 541
       RGSVA+T+V+HH +SL+F N D+V++RNK+VK+L+RD S K+ + MM+DL+RYK+
Sbjct: 1358 ATRGSVAYTLVVHHFSSLVFCNQAKDKVSLRNKIVKTLVRDLSRKRHREGMMLDLLRYKK 1417

Query: 542 PSDDQKHGKELVVPQMDEFDKRFDLLKQACEGSTSLLSEVDKLESS 679
       S + + + E +KR ++LK+ CEG+++LL E++KL+S+
Sbjct: 1418 GSANAMEEEVIAA----ETEKRMEVLKEGCEGNSTLLLELEKLKSA 1459