BLASTX nr result
ID: Angelica23_contig00035222
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00035222
         (1632 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_002876031.1| hypothetical protein ARALYDRAFT_485395 [Arab...   254   5e-65
ref|NP_190612.3| uncharacterized protein [Arabidopsis thaliana] ...   251   4e-64
emb|CAB62472.1| hypothetical protein [Arabidopsis thaliana]           239   2e-60
ref|XP_003519917.1| PREDICTED: uncharacterized protein LOC100812...   221   6e-55
ref|XP_002522353.1| conserved hypothetical protein [Ricinus comm...   211   4e-52

>ref|XP_002876031.1| hypothetical protein ARALYDRAFT_485395 [Arabidopsis lyrata subsp. lyrata]
 gi|297321869|gb|EFH52290.1| hypothetical protein ARALYDRAFT_485395 [Arabidopsis lyrata subsp. lyrata]
          Length = 648

 Score = 254 bits (649), Expect = 5e-65
 Identities = 165/442 (37%), Positives = 228/442 (51%), Gaps = 7/442 (1%)
 Frame = +2

Query: 89   LFSGNLVQFFCSLAAHGYASDASAVCKDEPVACIISDAM---PKILAWCLSKQEDRNKTS 259
            +F GN VQF CS+ H + S EP I+ + P ++ WC K + ++ +
Sbjct: 260  VFLGNFVQFLCSMVQHVRVVEDSD--DSEPSHLILQKTIKLVPDLIRWCQPKLKSQSGSC 317

Query: 260  TSQYXXXXXXXXXXXXTYQMHLQCQVLVSWLNIIGKYFQDLLAEPLTRVENNLDDSLEGS 439
            S+Y T + +++C +L+SWL + + Q L LT+ + D+ LEGS
Sbjct: 318  MSRYLGHKLLVLMIRLTDKSNIKCTILLSWLQYLQRDSQGFLQHTLTKFKPVQDNCLEGS 377

Query: 440  PFLSSFS-KESKGISDRHLQRLAVFLFLRCSLSLVCQRDGIHEHCICAKINPSLLSEQMG 616
            PF S S +E HLQRL+VFLFLRCS +L+ + C
Sbjct: 378  PFFVSLSDREINETHSNHLQRLSVFLFLRCSFTLIYSSRHNGKQC--------------- 422

Query: 617  NHSCCNRKQGLLGLYQWLHGHFTRDISVDNDIYIQKCASFSLSFLRLYIHEDDILFKVLL 796
            C RK+G+ +++W+ I D+ IY +K FS SF+RL++HEDD+LFKVLL
Sbjct: 423  EFDC--RKKGMAEMFKWIVRQIPGIICSDHRIYSKKSVEFSASFVRLFMHEDDLLFKVLL 480

Query: 797  QLFTIPLSVK--PVCEGSKTPQKVEDEENM-LHLISDLLNPICLFHLFLAELLYDHKVLL 967
            QL ++PL + P EG +EDEE + L S L NP+ LF +FL+EL YDH+VLL
Sbjct: 481  QLLSVPLHRQELPNVEGGS----LEDEEQITLFRFSTLFNPVTLFCIFLSELHYDHQVLL 536

Query: 968  DYLMSKDTGASSAEYLLRCLRAVCDSWTFFVEFSWDEEVTNSRNSKKRKVSVDVLDFEGG 1147
            DYL+SKD G S AEYLLRCLRAVCDSWT FVEF + E TN+ + K+RKV + + E
Sbjct: 537  DYLISKDIGDSCAEYLLRCLRAVCDSWTLFVEFPF-EGSTNASSPKRRKVLPETSEVE-- 593

Query: 1148 EISVLRENDDSPLSLLKRCMTEDVYTSRQHTTTRLSFEDAMECXXXXXXXXXXXXXXXXF 1327
            R H +FEDA +C F
Sbjct: 594  ------------------------QNWRLHPQ---AFEDAKDCLLSLQNSVVKLHQKKLF 626

Query: 1328 PYNPDVLLRRLTKFEELCFKEK 1393
            PYNP+ LLRRL++F+ELC +
Sbjct: 627  PYNPEALLRRLSRFQELCLSHE 648

>ref|NP_190612.3| uncharacterized protein [Arabidopsis thaliana]
 gi|28973649|gb|AAO64145.1| unknown protein [Arabidopsis thaliana]
 gi|110737253|dbj|BAF00574.1| hypothetical protein [Arabidopsis thaliana]
 gi|332645145|gb|AEE78666.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 642

 Score = 251 bits (641), Expect = 4e-64
 Identities = 163/442 (36%), Positives = 233/442 (52%), Gaps = 7/442 (1%)
 Frame = +2

Query: 89   LFSGNLVQFFCSLAAHGYASDASAVCKDEPVACIISDAM---PKILAWCLSKQEDRNKTS 259
            +F G+ VQF CS+ + + S EP I+ + P +L WC K + ++ +
Sbjct: 254  VFLGSFVQFLCSMVQQVHVVEDSD--DFEPSYLILQKTIKLIPDLLRWCQPKLKSQSGSC 311

Query: 260  TSQYXXXXXXXXXXXXTYQMHLQCQVLVSWLNIIGKYFQDLLAEPLTRVENNLDDSLEGS 439
            S+Y T + ++C +L+SWL + + Q L LT+ + D+ LEGS
Sbjct: 312  MSRYLGHKLLVLMIRLTDKSKIKCTILLSWLQYLQRDSQGFLQHTLTKFKPVQDNCLEGS 371

Query: 440  PFLSSFS-KESKGISDRHLQRLAVFLFLRCSLSLVCQRDGIHEHCICAKINPSLLSEQMG 616
            PF S S +E + HLQRL+VFLFLRCS +L+ S ++++
Sbjct: 372  PFFVSLSDREVNEMHSNHLQRLSVFLFLRCSFTLIYS---------------SRHNDKLC 416

Query: 617  NHSCCNRKQGLLGLYQWLHGHFTRDISVDNDIYIQKCASFSLSFLRLYIHEDDILFKVLL 796
            C RK+G+ +++W+ ++ D+ IY +K FS SF+RL++HEDD+LFKVLL
Sbjct: 417  EFDC--RKKGMAEMFKWIERQIPGNMFSDHRIYSKKNVEFSASFVRLFMHEDDLLFKVLL 474

Query: 797  QLFTIPLSVK--PVCEGSKTPQKVEDEENM-LHLISDLLNPICLFHLFLAELLYDHKVLL 967
            QL ++PL + P EG +EDEE + L +S L NP+ LF +FL+EL YDH+VLL
Sbjct: 475  QLLSVPLHRQELPNVEGGS----LEDEEQITLFRLSTLFNPVRLFCIFLSELHYDHQVLL 530

Query: 968  DYLMSKDTGASSAEYLLRCLRAVCDSWTFFVEFSWDEEVTNSRNSKKRKVSVDVLDFEGG 1147
            DYL+SKD GAS AEYLLRCLRAVCDSWT FVEF + E T++ + K+RKV + + E
Sbjct: 531  DYLISKDIGASCAEYLLRCLRAVCDSWTLFVEFPF-EGSTDAPSPKRRKVLPETSEVE-- 587

Query: 1148 EISVLRENDDSPLSLLKRCMTEDVYTSRQHTTTRLSFEDAMECXXXXXXXXXXXXXXXXF 1327
            R H +FEDA +C F
Sbjct: 588  ------------------------QNWRLHAQ---AFEDAKDCLLSLQNSVVKLHQKKLF 620

Query: 1328 PYNPDVLLRRLTKFEELCFKEK 1393
            PYNP+ LLRRL++F ELC +
Sbjct: 621  PYNPEALLRRLSRFHELCLSHE 642

>emb|CAB62472.1| hypothetical protein [Arabidopsis thaliana]
          Length = 730

 Score = 239 bits (609), Expect = 2e-60
 Identities = 158/430 (36%), Positives = 225/430 (52%), Gaps = 7/430 (1%)
 Frame = +2

Query: 89   LFSGNLVQFFCSLAAHGYASDASAVCKDEPVACIISDAM---PKILAWCLSKQEDRNKTS 259
            +F G+ VQF CS+ + + S EP I+ + P +L WC K + ++ +
Sbjct: 247  VFLGSFVQFLCSMVQQVHVVEDSD--DFEPSYLILQKTIKLIPDLLRWCQPKLKSQSGSC 304

Query: 260  TSQYXXXXXXXXXXXXTYQMHLQCQVLVSWLNIIGKYFQDLLAEPLTRVENNLDDSLEGS 439
            S+Y T + ++C +L+SWL + + Q L LT+ + D+ LEGS
Sbjct: 305  MSRYLGHKLLVLMIRLTDKSKIKCTILLSWLQYLQRDSQGFLQHTLTKFKPVQDNCLEGS 364

Query: 440  PFLSSFS-KESKGISDRHLQRLAVFLFLRCSLSLVCQRDGIHEHCICAKINPSLLSEQMG 616
            PF S S +E + HLQRL+VFLFLRCS +L+ S ++++
Sbjct: 365  PFFVSLSDREVNEMHSNHLQRLSVFLFLRCSFTLIYS---------------SRHNDKLC 409

Query: 617  NHSCCNRKQGLLGLYQWLHGHFTRDISVDNDIYIQKCASFSLSFLRLYIHEDDILFKVLL 796
            C RK+G+ +++W+ ++ D+ IY +K FS SF+RL++HEDD+LFKVLL
Sbjct: 410  EFDC--RKKGMAEMFKWIERQIPGNMFSDHRIYSKKNVEFSASFVRLFMHEDDLLFKVLL 467

Query: 797  QLFTIPLSVK--PVCEGSKTPQKVEDEENM-LHLISDLLNPICLFHLFLAELLYDHKVLL 967
            QL ++PL + P EG +EDEE + L +S L NP+ LF +FL+EL YDH+VLL
Sbjct: 468  QLLSVPLHRQELPNVEGGS----LEDEEQITLFRLSTLFNPVRLFCIFLSELHYDHQVLL 523

Query: 968  DYLMSKDTGASSAEYLLRCLRAVCDSWTFFVEFSWDEEVTNSRNSKKRKVSVDVLDFEGG 1147
            DYL+SKD GAS AEYLLRCLRAVCDSWT FVEF + E T++ + K+RKV + + E
Sbjct: 524  DYLISKDIGASCAEYLLRCLRAVCDSWTLFVEFPF-EGSTDAPSPKRRKVLPETSEVE-- 580

Query: 1148 EISVLRENDDSPLSLLKRCMTEDVYTSRQHTTTRLSFEDAMECXXXXXXXXXXXXXXXXF 1327
            R H +FEDA +C F
Sbjct: 581  ------------------------QNWRLHAQ---AFEDAKDCLLSLQNSVVKLHQKKLF 613

Query: 1328 PYNPDVLLRR 1357
            PYNP+ LLRR
Sbjct: 614  PYNPEALLRR 623

>ref|XP_003519917.1| PREDICTED: uncharacterized protein LOC100812484 [Glycine max]
          Length = 639

 Score = 221 bits (562), Expect = 6e-55
 Identities = 133/384 (34%), Positives = 204/384 (53%), Gaps = 5/384 (1%)
 Frame = +2

Query: 62   NLQNFT---PLMLFSGNLVQFFCSLAAHGYASDASAVCKDE-PVACIISDAMPKILAWCL 229
            +L+NF+ P+M F G +Q CSL + + D+ P+ + + +P++ WCL
Sbjct: 243  HLKNFSVMDPVMNFLGTFLQLLCSLVYRNDSVETGCDSVDKHPLFLTVVNLIPRLAKWCL 302

Query: 230  SKQEDRNKTSTSQYXXXXXXXXXXXXTYQMHLQCQVLVSWLNIIGKYFQDLLAEPLTRVE 409
            S+QE+ + Y L C++ +SWL ++ YFQ+LL +PLT+
Sbjct: 303  SEQENNAEMHAIHYLKHKLLILMIRLGSLTGLDCRIRLSWLELLHNYFQELLQQPLTQFL 362

Query: 410  NNLDDSLEGSPFLSSFSK-ESKGISDRHLQRLAVFLFLRCSLSLVCQRDGIHEHCICAKI 586
            ++ D LE SPFL S E+ HL+R AV+L L CS SL+C+R I HC + +
Sbjct: 363  SDQIDCLEDSPFLWSLCDGEACMKRSDHLRRQAVYLLLACSFSLICKRGEIANHCNNSTL 422

Query: 587  NPSLLSEQMGNHSCCNRKQGLLGLYQWLHGHFTRDISVDNDIYIQKCASFSLSFLRLYIH 766
            S + H RK+G L L++W+ GH IS++++ Y+Q C +F SFL+LY+
Sbjct: 423  CSSFTTNPDSEHDYFCRKKGSLELFKWILGHLPTAISINHEKYMQMCMNFISSFLQLYLR 482

Query: 767  EDDILFKVLLQLFTIPLSVKPVCEGSKTPQKVEDEENMLHLISDLLNPICLFHLFLAELL 946
            EDD+LF+VLL LF+I S++ ++ E ++ H ++
Sbjct: 483  EDDLLFEVLLLLFSISSSLQ---------EQSESKDAAYH-----------------DIH 516

Query: 947  YDHKVLLDYLMSKDTGASSAEYLLRCLRAVCDSWTFFVEFSWDEEVTNSRNSKKRKVSVD 1126
            YDH+VLLDYL+SKDTG S A+YLLRCL +C+SW FVEF E + + K+RK+ D
Sbjct: 517  YDHQVLLDYLISKDTGISCAKYLLRCLHLICNSWKLFVEFPLFGEFLDQSSCKRRKIVGD 576

Query: 1127 VLDFEGGEISVLRENDDSPLSLLK 1198
            L F + +N S + +K
Sbjct: 577  GLHFLADGMPTSIDNSGSIILHIK 600

>ref|XP_002522353.1| conserved hypothetical protein [Ricinus communis]
 gi|223538431|gb|EEF40037.1| conserved hypothetical protein [Ricinus communis]
          Length = 535

 Score = 211 bits (537), Expect = 4e-52
 Identities = 117/301 (38%), Positives = 168/301 (55%), Gaps = 2/301 (0%)
 Frame = +2

Query: 44   DALVHNNLQNFTPLMLFSGNLVQFFCSLAAHGYASDASAVCKDE-PVACIISDAMPKILA 220
            DAL N+ ++F GN +Q CSL + +D PV C+I+ +PK+++
Sbjct: 231  DALFLKNVGEQQKKIVFLGNFIQLLCSLVEQSCDVEVKVGSQDHHPVLCLITSFVPKVVS 290

Query: 221  WCLSKQEDRNKTSTSQYXXXXXXXXXXXXTYQMHLQCQVLVSWLNIIGKYFQDLLAEPLT 400
            CL Q + S SQY +YQ L L+SWL ++ YF+ LL +P+
Sbjct: 291  CCLGGQGNCVSASVSQYFRHKLLMLMLRLSYQTCLDYFTLISWLQLLHDYFEVLLWKPII 350

Query: 401  RVENNLDDSLEGSPFLSSFSK-ESKGISDRHLQRLAVFLFLRCSLSLVCQRDGIHEHCIC 577
            ++E D+SLE SPFLSS S + GI+ HLQR A+ LFLRC L+ + C C
Sbjct: 351  KLEFPQDESLEDSPFLSSLSDGDIHGINSHHLQRWAILLFLRCCFGLISLTRDKSKKCTC 410

Query: 578  AKINPSLLSEQMGNHSCCNRKQGLLGLYQWLHGHFTRDISVDNDIYIQKCASFSLSFLRL 757
            +N + + CC RK+G L +Y+WL GHF D+SV ++Y +KC F+ SFL+L
Sbjct: 411  GTLN-CCSGYSISDMDCCGRKKGFLEIYKWLQGHFPIDMSVGQEMYFEKCIGFTFSFLQL 469

Query: 758  YIHEDDILFKVLLQLFTIPLSVKPVCEGSKTPQKVEDEENMLHLISDLLNPICLFHLFLA 937
            Y+HEDD+LFKVLLQL +I ++ + K + E+ + H IS + NP+ LFHLFLA
Sbjct: 470  YMHEDDVLFKVLLQLLSINSCLEQLLNRVKWTSEDVKEDILFH-ISHIFNPVYLFHLFLA 528

Query: 938  E 940
            E
Sbjct: 529  E 529