BLASTX nr result
ID: Angelica23_contig00019418
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00019418
         (659 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

emb|CAN59936.1| hypothetical protein VITISV_001878 [Vitis vinifera]     96   5e-18
gb|AER13167.1| putative retrovirus-like polyprotein [Phaseolus v...     94   3e-17
dbj|BAF00783.1| hypothetical protein [Arabidopsis thaliana]             91   2e-16
ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817...     89   1e-15
ref|XP_004141415.1| PREDICTED: uncharacterized protein LOC101212...     88   2e-15

>emb|CAN59936.1| hypothetical protein VITISV_001878 [Vitis vinifera]
          Length = 1031

 Score = 96.3 bits (238), Expect = 5e-18
 Identities = 70/240 (29%), Positives = 113/240 (47%), Gaps = 22/240 (9%)
 Frame = +2

Query: 5   MLMQEENHKKVNQSSVTAIDDGMACIANRRNFSDKFRPQNFDKRGGNSSFDNKTKNTFFC 184
           +++QEE + + S+ A + S +F+ + NSS K + C
Sbjct: 207 LVVQEERQRSLTTSNSPAFTAPV---------SSRFQAASRASSPTNSSRSRKDRP--LC 255

Query: 185 DHCKMPGHSIQRCYKLHGYP*NFRNDRDKR---------VVAMVQTDEEESHDNET---- 325
           HC + GH++ +CYK+HGYP FRN + R + + T++ D T
Sbjct: 256 THCNILGHTVDQCYKIHGYPPGFRNRPNFRPNGSHPNQMLPNSLHTNQLTLTDGSTASAS 315

Query: 326 -MTFTATQYQKLLQLIAKDSSTEDSQIQVDHPKAAYAVGKY-----CFTSSSGFN---WI 478
           + T Q+ +LL L++ SS+ S D ++ + SSS N WI
Sbjct: 316 PLPLTHDQHNQLLALLSLHSSSGSSTSFGDSNPLQQSISNFTGILSLSPSSSTLNPSIWI 375

Query: 479 VDSGAKDHMCYDLSLFKSYAEIKENGTYITIPDGNKVLIKYVGTVHLNNDLILKDVLYVP 658
           +DSGA H+C + S+F S N +T+P G K+ I +GT+HL+ L+L+ VLY+P
Sbjct: 376 LDSGATHHVCTNSSMFHSIHSFSSN--TVTLPTGTKIPITGIGTIHLSPHLVLEHVLYIP 433

>gb|AER13167.1| putative retrovirus-like polyprotein [Phaseolus vulgaris]
          Length = 1009

 Score = 93.6 bits (231), Expect = 3e-17
 Identities = 77/245 (31%), Positives = 110/245 (44%), Gaps = 26/245 (10%)
 Frame = +2

Query: 2   RMLMQEENHKKVNQSSVTAIDDGMACIANRRNFSDK---FRPQNFDKRGGNSSFDNKTKN 172
           R L + +N ++ I + C +N + FR F + N SF N N
Sbjct: 441 RQLDNNFSVSNINSANSNRISSSVICTFCGKNGHTENVCFRKVGFPNQE-NKSFKNNG-N 498

Query: 173 TFFCDHCKMPGHSIQRCYKLHGYP*NFR--NDRDKRVVAMVQTDE------EESHDNETM 328
           C HC GH+I+ CYK HGYP ++ + +V +V +D ++ +
Sbjct: 499 KKMCTHCGRNGHTIETCYKKHGYPPGYKFYGSKTNQVNNIVISDYVLSEPCQKEQEKGEY 558

Query: 329 TFTATQYQKLLQLIAKDSSTED-SQIQVDHPKAAYAVGKYCFTSSS-------------G 466
           TA QYQ L L + ++ + IQV+ + A + S G
Sbjct: 559 HLTAQQYQVLSDLFRQSTNNNTAANIQVNRVGSFTADTNHKMAKRSPTGNILTLNNLQHG 618

Query: 467 FN-WIVDSGAKDHMCYDLSLFKSYAEIKENGTYITIPDGNKVLIKYVGTVHLNNDLILKD 643
           N WI+DSGA DH+C LS F SY IK I++P+G+ V KY GTV N+ L D
Sbjct: 619 RNYWILDSGATDHVCSSLSEFTSYKSIKP--ISISLPNGHHVFAKYSGTVIFNHKFYLID 676

Query: 644 VLYVP 658
           VLYVP
Sbjct: 677 VLYVP 681

>dbj|BAF00783.1| hypothetical protein [Arabidopsis thaliana]
          Length = 556

 Score = 90.9 bits (224), Expect = 2e-16
 Identities = 61/204 (29%), Positives = 93/204 (45%), Gaps = 32/204 (15%)
 Frame = +2

Query: 143 NSSFDNKTKNTFFCDHCKMPGHSIQRCYKLHGYP*NFRNDRDKRVVAMVQTDEEES---- 310
           N++F+N C HC GH++ RCYK+HGYP F++ + V ++ S
Sbjct: 248 NATFNNAKPQKVICSHCGYTGHTVDRCYKIHGYPLGFKHKNKNQSDKSVSLEKSVSTVKP 307

Query: 311 ---HDNETMTFTATQYQKLLQLIAKD----------SSTEDSQIQVDHPKAAYAVGKYCF 451
           H T + T L +++ KD S ++S I A+ F
Sbjct: 308 VVAHMALTDSTTNDLINGLTKVLTKDQINGVVAYFNSQMQNSSIASSSGATITALPGIAF 367

Query: 452 TSSS-GF--------------NWIVDSGAKDHMCYDLSLFKSYAEIKENGTYITIPDGNK 586
           +SS+ GF WI+DSGA H+C+D +L +E + +T+P G
Sbjct: 368 SSSTLGFIGVLKATVNVLSSETWIIDSGATHHVCHDKNLLMRLSETMNSS--VTLPTGFG 425

Query: 587 VLIKYVGTVHLNNDLILKDVLYVP 658
           V I +GTV LN L+L +VLY+P
Sbjct: 426 VKITCIGTVKLNEFLVLNNVLYIP 449

>ref|XP_003555650.1| PREDICTED: uncharacterized protein LOC100817175 [Glycine max]
          Length = 2045

 Score = 88.6 bits (218), Expect = 1e-15
 Identities = 62/199 (31%), Positives = 97/199 (48%), Gaps = 21/199 (10%)
 Frame = +2

Query: 125 FDKRGGNSSFD--NKTKNTFFCDHCKMPGHSIQRCYKLHGYP*NFRNDRDKRVVAMV--- 289
           + K G S++D NK+ C HC GH++ CY+ HGYP ++ + V V
Sbjct: 609 YKKHGVPSNYDARNKSNGRKACTHCGKIGHTVDVCYRKHGYPPGYKPYSGRTTVNNVVAV 668

Query: 290 ---QTDEEESH--DNETMTFTATQYQKLLQLIAKDSSTEDSQIQVDHPKAAYAV------ 436
           TD++ H +E + F+ QY+ LL LI + S+ + Q PK ++
Sbjct: 669 ESKATDDQAQHHESHEFVRFSPEQYKALLALIQEPSAGNTALTQ---PKQVASISSCTVN 725

Query: 437 -----GKYCFTSSSGFNWIVDSGAKDHMCYDLSLFKSYAEIKENGTYITIPDGNKVLIKY 601
           G S+S +WI+DSGA DH+ L S+ I N + +P+G V +
Sbjct: 726 NPTNPGMSLSLSASLTSWILDSGATDHVTCSLHNLHSHKRI--NPITVKLPNGQYVHATH 783

Query: 602 VGTVHLNNDLILKDVLYVP 658
           GTV L++++ L DVLY+P
Sbjct: 784 SGTVQLSSNITLHDVLYIP 802

>ref|XP_004141415.1| PREDICTED: uncharacterized protein LOC101212632 [Cucumis sativus]
 gi|449449869|ref|XP_004142687.1| PREDICTED: uncharacterized protein LOC101213831 [Cucumis sativus]
          Length = 440

 Score = 87.8 bits (216), Expect = 2e-15
 Identities = 63/233 (27%), Positives = 107/233 (45%), Gaps = 15/233 (6%)
 Frame = +2

Query: 5   MLMQEENHKKVNQSSVTAIDDGMACIAN--RRNFSDKFRPQNFDKRGGNSSFDNKTKNTF 178
           +++QEE + + S + +A +AN RR SDK +K K+T
Sbjct: 168 LIIQEERQRSIGSSPSI---ESIALMANSERRFSSDK----------------SKKKDTR 208

Query: 179 -FCDHCKMPGHSIQRCYKLHGYP*NFRNDRDKRVVAMVQTDE---------EESHDNETM 328
           C +C GH+ +CYKLHGYP R V Q + E S N++
Sbjct: 209 PICSNCGYKGHTADKCYKLHGYPPGHRLANSNNSVHQRQDNTIQDGNDKVIEVSKRNQSA 268

Query: 329 TFTAT---QYQKLLQLIAKDSSTEDSQIQVDHPKAAYAVGKYCFTSSSGFNWIVDSGAKD 499
           F + QY +LL ++ +T + + A + + WI+DSGA
Sbjct: 269 FFASLNSDQYTQLLDMLQTHLNTPQNGENFKNETTHIAGTCLSNSLNDPLTWIIDSGASS 328

Query: 500 HMCYDLSLFKSYAEIKENGTYITIPDGNKVLIKYVGTVHLNNDLILKDVLYVP 658
           H+C+D +F + + ++ +P ++ ++++G V ++NDL+LKDVLY+P
Sbjct: 329 HICHDKFMFTNLYSAQN--MFVILPTKTRLKVEHIGDVFISNDLVLKDVLYIP 379