BLASTX nr result
ID: Glycyrrhiza23_contig00005171
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00005171
         (1581 letters)

Database: ./nr
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003538768.1| PREDICTED: uncharacterized protein LOC100784...   590   e-166
ref|XP_003611437.1| hypothetical protein MTR_5g014010 [Medicago ...   590   e-166
ref|XP_003516643.1| PREDICTED: uncharacterized protein LOC100779...   578   e-162
ref|XP_002326500.1| predicted protein [Populus trichocarpa] gi|2...   418   e-114
ref|XP_002528762.1| conserved hypothetical protein [Ricinus comm...   410   e-112

>ref|XP_003538768.1| PREDICTED: uncharacterized protein LOC100784375 [Glycine max]
          Length = 381

 Score = 590 bits (1522), Expect = e-166
 Identities = 280/380 (73%), Positives = 313/380 (82%), Gaps = 1/380 (0%)
 Frame = -1

Query: 1401 RRSESTRSSIAKTSSGSVKSLIEKGEPPHSLFPSKHEFPRLIXXXXXXXXXSWACNLLFT 1222
            ++++ R ++ ++S + EPP +L PSKH+FPRL+ +W CN LFT
Sbjct: 2    KQNDRPRECLSNSNSEGCSLSLMGREPPQNLLPSKHDFPRLVLVIALASLVAWTCNFLFT 61

Query: 1221 SLVHPPTKPFCDINLDPSDYFPDSCEPCPSNGECNGGKLECFQGYQKHGNLCVEDGDINE 1042
            SL HPP+KPFCD NL DYF D C+PCPSNGECN GKLEC QGYQ+HGNLC EDGDINE
Sbjct: 62   SLFHPPSKPFCDTNLHSPDYFLDICQPCPSNGECNDGKLECHQGYQRHGNLCAEDGDINE 121

Query: 1041 SARKIVERVEHHLCEEYAQYLCSGTGSIWVHEDDLQNYFEPMGNVKVDNALYNYTKQKAF 862
            SARK++ERVEHHLCE+YAQ+LC+GTG IWVHEDDL NYFEP+GNVKVDNALYNYTKQ+A
Sbjct: 122  SARKLLERVEHHLCEKYAQFLCTGTGIIWVHEDDLWNYFEPVGNVKVDNALYNYTKQRAV 181

Query: 861  DTMGKLLEMRLNS-HGTKEFKCPDLLAEHYKPYACRFHQWISQHILVVLPVFATLVGCTT 685
            +TMGKLLE RLNS HG KEFKCPD LAEHYKPY C QWISQHILVVLP+ A LVGCT
Sbjct: 182  ETMGKLLETRLNSSHGMKEFKCPDQLAEHYKPYTCCIRQWISQHILVVLPICAMLVGCTA 241

Query: 684  LFRNARRKLRISRRVEELYNKVCEILEENALTSKSANGECEPWVVASRLRDHLLLPRERK 505
            L N R+KL +SRRVEELY+KVCEILE+NALTSKSANGECEPWVVASRLRDHLLLPRERK
Sbjct: 242  LCWNVRQKLSMSRRVEELYDKVCEILEDNALTSKSANGECEPWVVASRLRDHLLLPRERK 301

Query: 504  DPLLWKKVEELVQEDSRIDRYPKLVKGESKVVWEWQVEGSLSASKMMIKRDTSKTRVNEN 325
            +PLLWKK+EELVQEDSRIDRYPKLVKGESKVVWEWQVEGSLSASKM +RD SKT VNE+
Sbjct: 302  NPLLWKKLEELVQEDSRIDRYPKLVKGESKVVWEWQVEGSLSASKMKKRRDASKTMVNES 361

Query: 324  MDLNSQQRPTMKAVPMEPLF 265
            DLN QQ P MK P PLF
Sbjct: 362  TDLNHQQHPAMKTEPTVPLF 381

>ref|XP_003611437.1| hypothetical protein MTR_5g014010 [Medicago truncatula]
 gi|355512772|gb|AES94395.1| hypothetical protein MTR_5g014010 [Medicago truncatula]
          Length = 374

 Score = 590 bits (1521), Expect = e-166
 Identities = 287/381 (75%), Positives = 314/381 (82%)
 Frame = -1

Query: 1407 MSRRSESTRSSIAKTSSGSVKSLIEKGEPPHSLFPSKHEFPRLIXXXXXXXXXSWACNLL 1228
            M+RRS + + SS +I+K EPP +L PSKHEFP+L+ +W+ NLL
Sbjct: 1    MNRRSGKSSREVKLKSS-----IIDK-EPPPNLLPSKHEFPKLLLVLTVASLVAWSSNLL 54

Query: 1227 FTSLVHPPTKPFCDINLDPSDYFPDSCEPCPSNGECNGGKLECFQGYQKHGNLCVEDGDI 1048
            FTS +HP TKPFCD N ++FPDSCEPCPSNGECN GKLEC +GYQKHGNLCVEDGDI
Sbjct: 55   FTSFLHPSTKPFCDTN-SLHNHFPDSCEPCPSNGECNDGKLECLRGYQKHGNLCVEDGDI 113

Query: 1047 NESARKIVERVEHHLCEEYAQYLCSGTGSIWVHEDDLQNYFEPMGNVKVDNALYNYTKQK 868
            N+SARKI + VE HLC EYAQ+LCSGTGSIWVH+DDL NY EP+ NVK NALYNYTKQK
Sbjct: 114  NDSARKIADTVERHLCGEYAQFLCSGTGSIWVHDDDLWNYIEPVENVKEGNALYNYTKQK 173

Query: 867  AFDTMGKLLEMRLNSHGTKEFKCPDLLAEHYKPYACRFHQWISQHILVVLPVFATLVGCT 688
            AFD M KLLEMRL +HG KEFKCPD L E YKPYACR QWI+QHILVVLP+ A LVGC
Sbjct: 174  AFDMMDKLLEMRLTTHGMKEFKCPDSLVEQYKPYACRLRQWITQHILVVLPICAMLVGCM 233

Query: 687  TLFRNARRKLRISRRVEELYNKVCEILEENALTSKSANGECEPWVVASRLRDHLLLPRER 508
            LF N RRKLR+SRRVEELYNKVCEILEENALTSKS NGECEPWVVASRLRDHLLLPRER
Sbjct: 234  ILFWNVRRKLRVSRRVEELYNKVCEILEENALTSKSVNGECEPWVVASRLRDHLLLPRER 293

Query: 507  KDPLLWKKVEELVQEDSRIDRYPKLVKGESKVVWEWQVEGSLSASKMMIKRDTSKTRVNE 328
            KDPLLWKKVEELVQEDSR+DRYPKLVKGESKVVWEWQVEGSLSA+KM+ KRD SKT VN
Sbjct: 294  KDPLLWKKVEELVQEDSRVDRYPKLVKGESKVVWEWQVEGSLSATKMLTKRDASKTMVNR 353

Query: 327  NMDLNSQQRPTMKAVPMEPLF 265
            N +LNSQQRPTMKA PMEP F
Sbjct: 354  NTELNSQQRPTMKAEPMEPHF 374

>ref|XP_003516643.1| PREDICTED: uncharacterized protein LOC100779650 [Glycine max]
          Length = 377

 Score = 578 bits (1491), Expect = e-162
 Identities = 278/380 (73%), Positives = 314/380 (82%), Gaps = 1/380 (0%)
 Frame = -1

Query: 1401 RRSESTRSSIAKTSSGSVKSLIEKGEPPHSLFPSKHEFPRLIXXXXXXXXXSWACNLLFT 1222
            ++++ R ++ ++S S + EPP +L PSKH+FPRL+ +W CN LFT
Sbjct: 2    KQNDRRRECLSNSNSESCSFSLMGREPPQNLLPSKHDFPRLVLVVALASLVAWTCNFLFT 61

Query: 1221 SLVHPPTKPFCDINLDPSDYFPDSCEPCPSNGECNGGKLECFQGYQKHGNLCVEDGDINE 1042
            P+KPFCD NL DYF D CEPCPSNGECN GKL+C QGYQ+HGNLCVEDGDINE
Sbjct: 62   -----PSKPFCDPNLHSPDYFSDICEPCPSNGECNDGKLKCLQGYQRHGNLCVEDGDINE 116

Query: 1041 SARKIVERVEHHLCEEYAQYLCSGTGSIWVHEDDLQNYFEPMGNVKVDNALYNYTKQKAF 862
            SARK++ERVEHHLCEEYAQ+LC+GTG+IWV EDDL NYFEP+GNVKVDNALY YTKQKAF
Sbjct: 117  SARKLLERVEHHLCEEYAQFLCTGTGTIWVREDDLWNYFEPVGNVKVDNALYKYTKQKAF 176

Query: 861  DTMGKLLEMRLNS-HGTKEFKCPDLLAEHYKPYACRFHQWISQHILVVLPVFATLVGCTT 685
            +TMGKLL+ RLNS HG KEFKCPD LAEHYK YAC QWISQHILVVLP+ A LVGCT
Sbjct: 177  ETMGKLLDTRLNSSHGMKEFKCPDQLAEHYKSYACCIRQWISQHILVVLPICAMLVGCTA 236

Query: 684  LFRNARRKLRISRRVEELYNKVCEILEENALTSKSANGECEPWVVASRLRDHLLLPRERK 505
            LF + R+KL +SRR+EELYNKVCEILEENALTSKSANGECEPWVV+SRLRDHLLLPRERK
Sbjct: 237  LFWSVRQKLCMSRRIEELYNKVCEILEENALTSKSANGECEPWVVSSRLRDHLLLPRERK 296

Query: 504  DPLLWKKVEELVQEDSRIDRYPKLVKGESKVVWEWQVEGSLSASKMMIKRDTSKTRVNEN 325
            +PLLWKKVE++VQEDSRIDRYPKLVKGESKVVWEWQVEGSLS SKM +RD SKTRVNE+
Sbjct: 297  NPLLWKKVEKMVQEDSRIDRYPKLVKGESKVVWEWQVEGSLSFSKMK-RRDASKTRVNES 355

Query: 324  MDLNSQQRPTMKAVPMEPLF 265
            DLN Q RP M+ PMEPLF
Sbjct: 356  TDLNHQHRPAMRTEPMEPLF 375

>ref|XP_002326500.1| predicted protein [Populus trichocarpa]
 gi|222833822|gb|EEE72299.1| predicted protein [Populus trichocarpa]
          Length = 382

 Score = 418 bits (1074), Expect = e-114
 Identities = 203/354 (57%), Positives = 253/354 (71%), Gaps = 3/354 (0%)
 Frame = -1

Query: 1365 TSSGSVKSLIEKGEPPHSLFPSKHEFPRLIXXXXXXXXXSWACNLLFTSLVHPPTKPFCD 1186
            +SS ++ K EPPH+LFPSK EF RLI + CN + + H TKPFCD
Sbjct: 15   SSSSHPYTISSKIEPPHNLFPSKQEFLRLIAVLAIASSVALTCNFIANYIDHS-TKPFCD 73

Query: 1185 INLDPSDYFPDSCEPCPSNGECNGGKLECFQGYQKHGNLCVEDGDINESARKIVERVEHH 1006
            +LD SD +SCEPCP NGECN GKLEC +GY+KH N C+EDGD+ E A+K++E VE+H
Sbjct: 74   TSLDSSDSLSNSCEPCPRNGECNQGKLECARGYRKHRNTCIEDGDVYERAKKLLEGVENH 133

Query: 1005 LCEEYAQYLCSGTGSIWVHEDDLQNYFEP---MGNVKVDNALYNYTKQKAFDTMGKLLEM 835
            LCE YA +LC GTG +WV EDD+ N + + N DN +Y YTK KA +T+ + L+
Sbjct: 134  LCEAYADFLCYGTGIMWVQEDDILNDLDGHQLLENYSSDNPVYVYTKMKAMETISEELQT 193

Query: 834  RLNSHGTKEFKCPDLLAEHYKPYACRFHQWISQHILVVLPVFATLVGCTTLFRNARRKLR 655
            R N +G KEFKCPDLL EHYKP+ C QWIS+H LV++PV A +VG L RR+
Sbjct: 194  RTNPNGKKEFKCPDLLVEHYKPFTCHLRQWISEHALVIVPVCALVVGFAFLVWKIRRRWY 253

Query: 654  ISRRVEELYNKVCEILEENALTSKSANGECEPWVVASRLRDHLLLPRERKDPLLWKKVEE 475
            +S R EELY++VC+ILEE AL SK N ECEPWVVASRLRDHLL P+ERKD +LWKKVE+
Sbjct: 254  LSTRGEELYHQVCDILEERALMSKRVNAECEPWVVASRLRDHLLSPKERKDFVLWKKVED 313

Query: 474  LVQEDSRIDRYPKLVKGESKVVWEWQVEGSLSASKMMIKRDTSKTRVNENMDLN 313
            LV+EDSR+DRYPKLVKGESKVVWEWQVEGSLS+ +M K ++SK + N+ + N
Sbjct: 314  LVREDSRVDRYPKLVKGESKVVWEWQVEGSLSSGRMRKKVESSKLKSNDGVKEN 367

>ref|XP_002528762.1| conserved hypothetical protein [Ricinus communis]
 gi|223531765|gb|EEF33584.1| conserved hypothetical protein [Ricinus communis]
          Length = 373

 Score = 410 bits (1053), Expect = e-112
 Identities = 195/336 (58%), Positives = 251/336 (74%), Gaps = 3/336 (0%)
 Frame = -1

Query: 1323 PPHSLFPSKHEFPRLIXXXXXXXXXSWACNLLFTSLVHPPTKPFCDINLDPSDYFPDSCE 1144
            PP++LFPSK EF RLI ++ CNL+ T ++P TKPFCD N +D F + C
Sbjct: 26   PPNNLFPSKEEFVRLIAVLAIASSVAFTCNLIAT-YINPSTKPFCDSN---TDSFSEFCV 81

Query: 1143 PCPSNGECNGGKLECFQGYQKHGNLCVEDGDINESARKIVERVEHHLCEEYAQYLCSGTG 964
            PCP NGEC GKLEC +GY+KH N+C+EDGDINE A+K+ E VE+HLCE YAQYLC G G
Sbjct: 82   PCPENGECTQGKLECAEGYRKHRNICIEDGDINERAKKLSEWVENHLCEAYAQYLCDGIG 141

Query: 963  SIWVHEDDLQ---NYFEPMGNVKVDNALYNYTKQKAFDTMGKLLEMRLNSHGTKEFKCPD 793
            +IW ++D+ + + M N + DNA Y Y K+KA + + +LLE+R NSHG KE KCPD
Sbjct: 142  TIWFQDNDIWYDLDGHQLMENFQPDNATYIYAKRKAMEMIVRLLEIRTNSHGNKELKCPD 201

Query: 792  LLAEHYKPYACRFHQWISQHILVVLPVFATLVGCTTLFRNARRKLRISRRVEELYNKVCE 613
            L+AEHYKP+ CRF QWIS H V+ + + +VG L R +R+ +S R EELY++VCE
Sbjct: 202  LVAEHYKPFTCRFRQWISNHAFVIASLCSLVVGAVLLLRKLQRRWYLSARGEELYHQVCE 261

Query: 612  ILEENALTSKSANGECEPWVVASRLRDHLLLPRERKDPLLWKKVEELVQEDSRIDRYPKL 433
            +LEENAL SK +NGEC+ WVVAS+LRDHLLLP+ERKDP+LWK+VE+LVQEDSR+DRYPKL
Sbjct: 262  VLEENALMSKQSNGECDSWVVASQLRDHLLLPKERKDPVLWKRVEQLVQEDSRVDRYPKL 321

Query: 432  VKGESKVVWEWQVEGSLSASKMMIKRDTSKTRVNEN 325
            VKGESKVVWEWQVEGS S+ ++ K++ SK + +E+
Sbjct: 322  VKGESKVVWEWQVEGSWSSGRIR-KKEASKLKSSES 356