BLASTX nr result
ID: Bupleurum21_contig00022108
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Bupleurum21_contig00022108
         (1023 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_002522374.1| hypothetical protein RCOM_0603630 [Ricinus c...   158   2e-36
ref|XP_002310176.1| predicted protein [Populus trichocarpa] gi|2...   147   4e-33
ref|XP_004142023.1| PREDICTED: uncharacterized protein LOC101222...   121   3e-25
ref|XP_003548909.1| PREDICTED: uncharacterized protein LOC100818...   113   8e-23
ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana] ...   105   2e-20

>ref|XP_002522374.1| hypothetical protein RCOM_0603630 [Ricinus communis]
 gi|223538452|gb|EEF40058.1| hypothetical protein RCOM_0603630 [Ricinus communis]
          Length = 1720

 Score = 158 bits (400), Expect = 2e-36
 Identities = 109/342 (31%), Positives = 163/342 (47%), Gaps = 22/342 (6%)
 Frame = +3

Query: 63  KTLNLLVVFWSFI----CSIYHSILQAVLPRNH--FINHLNNGGHGVRSNKACLEHCFSV 224
           K L+LLV  W+ I     S  H+ L  V  R    F  HL     G+ S   C ++CF +
Sbjct: 419 KVLSLLVFTWNVIHRVVLSNIHAFLSIVFSRQEPKFDGHL-----GIISEDHCPQYCFLL 473

Query: 225 EIGIISITIYPEKALEHPVPGRPASDTGISFSNLLSFCFSIDAFFLLYKDSIFEHFLSFS 404
             G + IT      + H V  +  S  GIS  ++ SFC S+DA  L+Y D IFE   S S
Sbjct: 474 NFGKVLITFCSGNTI-HNVIKKLESHIGISLPDIHSFCLSLDALLLVYVDDIFEQSFSLS 532

Query: 405 CGNLXXXXXXXXXXXXXXXXXXF-----KRQNKKFSDPVRTLWVQPAQVF---------D 542
           CG L                         R+    +D    L  +PAQ+F
Sbjct: 533 CGKLKVKTSSVTGDTATEGSSKHHTVKGNRERMTANDSKTVLQGEPAQIFLPLQNSQKNA 592

Query: 543 YAETNSTH--FVGSSVKEMWSRWKSSCTEFEDGKVLFTKHPFLLCEIKNFLADQGFGSKN 716
             +  S H  F+ + + EMW  W+ +C +++D ++ ++++P+LLCEIKN L   G    N
Sbjct: 593 EGQDESAHGPFLKTFLGEMWLTWRRACKKYDDNEIEYSENPWLLCEIKNCLLHPGLKGPN 652

Query: 717 FGFKNCCLVVGELNFILDYASMVSTVLILKQIQCALKWGDQSLEVSVSLHNPVTSEDPPL 896
            G   C L VG+LN  L Y SM+S  ++L+Q+Q ALKW + +  VSV      T +D
Sbjct: 653 SGLWKCNLTVGKLNITLGYLSMISMAILLEQMQHALKWTNDNGRVSVRSIPTPTFQDQSE 712

Query: 897 RSWEKKHDSHASELEIELYKLLPHQQIRLAVFFAGAQFRISL 1022
              E K+D +  +++  L + LP + I+L V  AG   ++S+
Sbjct: 713 IVLEGKYDDYVGKMKKTLLRRLPEKCIQLGVLIAGPHIQMSV 754

>ref|XP_002310176.1| predicted protein [Populus trichocarpa]
 gi|222853079|gb|EEE90626.1| predicted protein [Populus trichocarpa]
          Length = 868

 Score = 147 bits (371), Expect = 4e-33
 Identities = 103/350 (29%), Positives = 165/350 (47%), Gaps = 18/350 (5%)
 Frame = +3

Query: 6   DSKDSSVVSPKNTDASFILKTLNLLVVFWSFICSIYHSILQAVLPRNHFINHLN-NGGHG 182
           + K+S   S  +   +   K L++ +V W+ +  I  SIL        F      +   G
Sbjct: 395 NGKNSFKESSMDKQVNVFSKILSVFIVIWNVMYKILLSILHCFFFIILFFQRPKLDWNPG 454

Query: 183 VRSNKACLEHCFSVEIGIISITIYPEKALEHPVPGRPASDTGISFSNLLSFCFSIDAFFL 362
             S      +CF +  G I +T +   +    V  R  S TGIS+S++ SF  SI    L
Sbjct: 455 NNSEDYSSRYCFLLNFGKILVT-FSSTSKHKNVDERIESHTGISYSDIHSFSLSIHMLLL 513

Query: 363 LYKDSIFEHFLSFSCGNLXXXXXXXXXXXXXXXXXXFKRQNKKFS-----DPVRTLWV-Q 524
            Y D +FE  LS SCG L                      +KK       D ++T+ + +
Sbjct: 514 AYVDEVFEQSLSLSCGKLKVKSSSVMETAIVDRSVKNPFSSKKVRRKGSVDKLKTILMGK 573

Query: 525 PAQVFDYAETNSTH-----------FVGSSVKEMWSRWKSSCTEFEDGKVLFTKHPFLLC 671
           PAQVF  ++T+ T            ++ + + EMW  W+ S   ++D ++ +++ P+LLC
Sbjct: 574 PAQVFLPSQTSETSVANPAEGTCNPYLQTLMGEMWLAWQKSSAGYKDNEIAYSETPWLLC 633

Query: 672 EIKNFLADQGFGSKNFGFKNCCLVVGELNFILDYASMVSTVLILKQIQCALKWGDQSLEV 851
           EIKN L D        GF  C L  G+LN  L Y+S++S  ++L QIQ AL   + +
Sbjct: 634 EIKNCLMDPNLKRPVSGFWKCSLTAGKLNLALGYSSVLSLAILLGQIQHALNLNESTGRA 693

Query: 852 SVSLHNPVTSEDPPLRSWEKKHDSHASELEIELYKLLPHQQIRLAVFFAG 1001
           +V L+ P T E+    SWE K++ +++ L++   ++LP + I L VF  G
Sbjct: 694 TVPLNFPPTIENQEEISWEDKYELYSNRLKLTFLRMLPEKHIELGVFVTG 743

>ref|XP_004142023.1| PREDICTED: uncharacterized protein LOC101222087 [Cucumis sativus]
          Length = 3608

 Score = 121 bits (303), Expect = 3e-25
 Identities = 91/354 (25%), Positives = 157/354 (44%), Gaps = 15/354 (4%)
 Frame = +3

Query: 6   DSKD-SSVVSPKNTDASFILKTLNLLVVFWSFICSIYHSILQAVLPRNHFINHLNNGGHG 182
           D K+ SS+V  K     F  +  +LL   W  +C I+  I + ++ +     H  +G
Sbjct: 395 DKKEVSSIVQLK-----FFYQVFSLLSCIWKMLCGIFCFIERCIV-KTLTQPHKLDGCVK 448

Query: 183 VRSNKACLEHCFSVEIGIISITIYPEKALEHPVPGRPASDTGISFSNLLSFCFSIDAFFL 362
           +    +  + CF +  G + ++IYP   ++ P      S  GI  S  LSFCFS D+  +
Sbjct: 449 IVRRDSNSQFCFMLNTGKLLVSIYPPDDIQPPTFENLKSSFGIPSSFSLSFCFSFDSLVV 508

Query: 363 LYKDSIFEHFLSFSCGNLXXXXXXXXXXXXXXXXXX----------FKRQN--KKF--SD 500
           +Y   + E  L  SC                                +R N  K F   +
Sbjct: 509 MYMVDLCEQSLLMSCDQFNVTPLPSVEASNGGGCSVDLLGSLEGCEMERANSLKSFIRGE 568

Query: 501 PVRTLWVQPAQVFDYAETNSTHFVGSSVKEMWSRWKSSCTEFEDGKVLFTKHPFLLCEIK 680
           P ++ +    +  D   T    F+   ++ MW RWKS C   E+G + ++ +P+ LCEI
Sbjct: 569 PAQSFFPSNGREID---TGCNQFIVKYLEGMWLRWKSVCRNLEEGMIPYSDNPWFLCEIS 625

Query: 681 NFLADQGFGSKNFGFKNCCLVVGELNFILDYASMVSTVLILKQIQCALKWGDQSLEVSVS 860
           + +      + +     C L +G+LNF L Y+S++S  L+L   Q A  W +      VS
Sbjct: 626 SSMTKSVLENSSTSIWKCNLALGKLNFALQYSSVLSAALLL---QLASSWTEDEQSPEVS 682

Query: 861 LHNPVTSEDPPLRSWEKKHDSHASELEIELYKLLPHQQIRLAVFFAGAQFRISL 1022
           LH P  + D        K+++ AS++   L + L  + I++A+  AG++ +++L
Sbjct: 683 LHPPTVAGDNREACLNNKYENCASQMMTPLLEKLSLKDIQVAMHIAGSKIKMAL 736

>ref|XP_003548909.1| PREDICTED: uncharacterized protein LOC100818143 [Glycine max]
          Length = 3602

 Score = 113 bits (282), Expect = 8e-23
 Identities = 89/339 (26%), Positives = 144/339 (42%), Gaps = 13/339 (3%)
 Frame = +3

Query: 42  TDASFILKTLNLLVVFWSFICSIYHSILQAVLPRNHFINHLNNGGHGVRS--NKACLEHC 215
           T   F    + LL   W  I +I H ++  +  R   +   +  G  + S     C   C
Sbjct: 408 TTNKFFRPFIFLLSFMWKLISTIIHCLVN-IFSREKIVQDPDIDGCCLESLIEDPCQSCC 466

Query: 216 FSVEIGIISITIYPEKALEHPVPGRPASDTGISFSNLLSFCFSIDAFFLLYKDSIFEHFL 395
           F +  G I IT+     ++  V  +  S  GI+ S  LS CF IDA  L+    IFE  +
Sbjct: 467 FVLNFGKIIITVSQINEIDPSVYEKLQSLAGIACSAFLSICFCIDALLLISVKDIFEQRI 526

Query: 396 SFSCGNLXXXXXXXXXXXXXXXXXXFKRQNKKFSDPVR----TLWVQPAQVF-------D 542
             SCG +                           + +      +WV+PA++F
Sbjct: 527 FLSCGQMKVESAPLTMSEEACTMDPLSSAKGNEKEGINHMESIMWVEPAKIFLLSEIDGG 586

Query: 543 YAETNSTHFVGSSVKEMWSRWKSSCTEFEDGKVLFTKHPFLLCEIKNFLADQGFGSKNFG 722
            AE      +   +K+    WK  C +  + ++ F+++P +L +I+    +    + +FG
Sbjct: 587 QAEDCCDSHIEIFMKKFSVNWKRICRKLNENEIEFSENPCILSKIEISSTNPDPKNPDFG 646

Query: 723 FKNCCLVVGELNFILDYASMVSTVLILKQIQCALKWGDQSLEVSVSLHNPVTSEDPPLRS 902
           F  C L++G+LN +L ++S+ S  LIL QIQ AL W D+  E S++ +      D
Sbjct: 647 FCECGLMLGKLNLVLTHSSVSSLSLILSQIQHALYWEDRR-EASIASN----FVDKAEMD 701

Query: 903 WEKKHDSHASELEIELYKLLPHQQIRLAVFFAGAQFRIS 1019
           W  K+D +  EL + L + LP + I   V   G   R S
Sbjct: 702 WVNKYDCYCKELIMTLLQKLPEKHIHFGVLVDGPAARFS 740

>ref|NP_190607.2| uncharacterized protein [Arabidopsis thaliana]
 gi|332645140|gb|AEE78661.1| uncharacterized protein [Arabidopsis thaliana]
          Length = 3072

 Score = 105 bits (261), Expect = 2e-20
 Identities = 84/337 (24%), Positives = 150/337 (44%), Gaps = 16/337 (4%)
 Frame = +3

Query: 54  FILKTLNLLVVFWSFICSIYHSILQAVLPRNHFINHLNNGGHGVRSNKAC--LEHCFSVE 227
           ++ KT  +L   W  I   + SI    L  N  +            +  C  LE    V
Sbjct: 415 YLSKTTWVLAYIWRLISRTFWSIA-CFLWLNKLLTQELQTDRNNEDDSECVSLEFHAVVN 473

Query: 228 IGIISITIYPEKALEHPVPGRPASDTGISFSNLLSFCFSIDAFFLLYKDSIFEHFLSFSC 407
           +G +S+T YPEK +   +  + +  TG   SN++  C S+D F +LY       +LS SC
Sbjct: 474 LGKLSVTCYPEKIISSFMTSKDS--TGHVDSNIVMLCLSVDEFLVLYTVGCLTQYLSASC 531

Query: 408 GNLXXXXXXXXXXXXXXXXXX-----FKRQNKKFSDPVRTLW-VQPAQVFDYAETN--ST 563
           G L                        +   K   + V+T+  + PAQ       N  S
Sbjct: 532 GKLKVESSSFKNTSRFMKSTKDPSSSSEGNKKHMREDVKTILDMDPAQQISKTVNNHGSD 591

Query: 564 HFVG-----SSVKEMWSRWKSSCTEFEDGKVLFTKHPFLLCEIKNFLADQGFGSKNFGFK 728
              G     + ++EMW  W S+C + +      +  P LL +IK+ +A +  G+++  F
Sbjct: 592 QHEGMLHLQNLLREMWLNWNSNCMKLDKSTFTISDKPCLLVDIKSCMAYEVVGNQDSEFW 651

Query: 729 NCCLVVGELNFILDYASMVSTVLILKQIQCALKWGDQSLEVSVSLHNPVT-SEDPPLRSW 905
            C +V+G+L+ + +Y+S+ S  L++ QI+ A K         V   + VT   DP + S+
Sbjct: 652 KCSMVLGKLDIVFEYSSLFSLALLIWQIEWAQKLLVDDYTGEVHSSSLVTGGVDPEMASY 711

Query: 906 EKKHDSHASELEIELYKLLPHQQIRLAVFFAGAQFRI 1016
           + ++  +   +E+ L+++ P +QI++ +   G Q ++
Sbjct: 712 D-EYGIYRRSIELSLHRVHPERQIQVGILLGGPQIKL 747