BLASTX nr result
ID: Glycyrrhiza23_contig00020468
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza23_contig00020468 (2059 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_003522941.1| PREDICTED: uncharacterized protein LOC100807...   806   0.0
ref|XP_003525940.1| PREDICTED: uncharacterized protein LOC100792...   790   0.0
ref|XP_002528397.1| conserved hypothetical protein [Ricinus comm...   636   e-180
ref|XP_003518228.1| PREDICTED: uncharacterized protein LOC100794...   632   e-178
ref|XP_002324519.1| predicted protein [Populus trichocarpa] gi|2...   615   e-173

>ref|XP_003522941.1| PREDICTED: uncharacterized protein LOC100807349 [Glycine max]
 Length = 533

 Score = 806 bits (2081), Expect = 0.0
 Identities = 426/545 (78%), Positives = 453/545 (83%), Gaps = 15/545 (2%)
 Frame = -1

Query: 1879 MVQLMKNSGKPEKATAPVKLEIVEDPLEDEHGPLNKRHKPXXXXXXXXXXXXXXS---VG 1709
            MVQLMKNS KPE A++ VKLEIVED LE+EH PLNKR KP V
Sbjct: 1    MVQLMKNSRKPETASS-VKLEIVEDSLEEEHAPLNKRCKPSASPLPQPQQWNASDDGSVS 59

Query: 1708 PSSQISILDEPSPLGLRLRKSPSLLDLIQMKLSQGSVFVTNNTEQNENSVSEVKKESR-- 1535
            S ++ILDEPSPLGLRLRKSPSLLDLIQMKLSQGS +Q E+ SE K+ESR
Sbjct: 60   SPSHLNILDEPSPLGLRLRKSPSLLDLIQMKLSQGSA------QQKEDLSSEAKRESRCA 113

Query: 1534 ---------GTADKLKASNFPGSLLRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGGL 1382
            G ADKLKASNFP SLLRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGGL
Sbjct: 114  SAAAASSGAGAADKLKASNFPASLLRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGGL 173

Query: 1381 KSKIEIQWSDIMALKAHCPDNGPSTLTVVLARQPLFFRETNPQPRKHTLWQATADFTDGQ 1202
            KSKIEIQWSDIMALKAHCPDNGPSTLTVVLARQPL+FRETNPQPRKHTLWQATADFTDGQ
Sbjct: 174  KSKIEIQWSDIMALKAHCPDNGPSTLTVVLARQPLYFRETNPQPRKHTLWQATADFTDGQ 233

Query: 1201 CSKHRLHFLQCPQGLLAKHFEKLIQCDMRLNFLSQQPEIILDSPYFDTQPAAFEDPDNPK 1022
            SKHRLHFLQCPQGLLAKHFEKLIQCDMRLNFLSQQPEIILDSP+FDTQP+AFEDPDNPK
Sbjct: 234  SSKHRLHFLQCPQGLLAKHFEKLIQCDMRLNFLSQQPEIILDSPHFDTQPSAFEDPDNPK 293

Query: 1021 DHDLHQVXXXXXXXXSYQDRGSPQASLSSSFKIECNDPPGTTLESLPRDAPSPSSVMDCP 842
            DHDL QV YQD GSPQASLSSSFKIE NDPPG L+SLPRDAPSPSSVM+C
Sbjct: 294  DHDLLQVSGKGSSTSCYQDSGSPQASLSSSFKIEHNDPPGMMLDSLPRDAPSPSSVMECT 353

Query: 841 SVDGSTCTSSETESKAPRSLDQIKVPGLKPSMSMSDFLGQIELHLSEQMTSGNPPLSAAV 662
           S++GS TSSET+SK PR+ DQIK+PGL+PSMS+SDF+GQIEL LSEQ+TSGNPP S
Sbjct: 354 SIEGS--TSSETDSKGPRNGDQIKLPGLRPSMSVSDFIGQIELCLSEQITSGNPPFSD-- 409

Query: 661 GSEGYQEILEDIAQHLLNDNQVAAASDEKSLMSRVNSLCCLLQKDPVAVQKSHCAEDSNT 482
           G Y+EILEDIAQHLLNDNQVAA SDEKSLMSRVNSLCCLLQKDPV VQ SH ED NT
Sbjct: 410 GGSEYKEILEDIAQHLLNDNQVAATSDEKSLMSRVNSLCCLLQKDPVTVQNSHFTED-NT 468

Query: 481 GEGPGDGKDVKPA-EDSRDASSGKQALGMSRKDSFGDLLLHLPRIASLPKFLFNISEDDG 305
           EGP DGKDVKPA E+S+DAS GKQALGMSRKDSF DLLLHLPRIASLPKFLFNISE+DG
Sbjct: 469 VEGPDDGKDVKPAVEESKDASGGKQALGMSRKDSFSDLLLHLPRIASLPKFLFNISEEDG 528

Query: 304 DSQAR 290
           +S AR
Sbjct: 529 NSHAR 533

>ref|XP_003525940.1| PREDICTED: uncharacterized protein LOC100792008 [Glycine max]
 Length = 533

 Score = 790 bits (2041), Expect = 0.0
 Identities = 415/543 (76%), Positives = 447/543 (82%), Gaps = 13/543 (2%)
 Frame = -1

Query: 1879 MVQLMKNSGKPEKATA--PVKLEIVEDPLEDEHGPLNKRHKPXXXXXXXXXXXXXXS--- 1715
            MVQLMKNS KPE ++ PVKLEIVEDPLE+EHGP NKR KP
Sbjct: 1    MVQLMKNSRKPETSSCGDPVKLEIVEDPLEEEHGPHNKRCKPSPSPSPQPQQWSASDDGS 60

Query: 1714 VGPSSQISILDEPSPLGLRLRKSPSLLDLIQMKLSQGSVFVTNNTEQNENSVSEVKKESR 1535
            V S ++ILDEPSPLGLRLRKSPSLLDLIQMKLSQGS + +E+ S K+ESR
Sbjct: 61   VSSPSHLNILDEPSPLGLRLRKSPSLLDLIQMKLSQGSA------QPSEDLSSGAKRESR 114

Query: 1534 -------GTADKLKASNFPGSLLRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGGLKS 1376
            G ADKLKASNFP SLLRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGGLKS
Sbjct: 115  AAAVSGAGGADKLKASNFPASLLRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGGLKS 174

Query: 1375 KIEIQWSDIMALKAHCPDNGPSTLTVVLARQPLFFRETNPQPRKHTLWQATADFTDGQCS 1196
            KIEIQWSDIMALKAHCPD+GPSTLTVVLARQPL+FRETNPQPRKHTLWQATADFTDGQ S
Sbjct: 175  KIEIQWSDIMALKAHCPDDGPSTLTVVLARQPLYFRETNPQPRKHTLWQATADFTDGQSS 234

Query: 1195 KHRLHFLQCPQGLLAKHFEKLIQCDMRLNFLSQQPEIILDSPYFDTQPAAFEDPDNPKDH 1016
            KHRLHFLQCPQGLLAKHFEKLIQCDMRLNFLSQQPEIILDSP+FDTQP+AFEDPDNPKD
Sbjct: 235  KHRLHFLQCPQGLLAKHFEKLIQCDMRLNFLSQQPEIILDSPHFDTQPSAFEDPDNPKDR 294

Query: 1015 DLHQVXXXXXXXXSYQDRGSPQASLSSSFKIECNDPPGTTLESLPRDAPSPSSVMDCPSV 836
            DL QV +QD GSPQASL SSFK E NDPPG L+SLPRDAPSPSSVM+C S+
Sbjct: 295  DLLQVSGKGSSTSCFQDSGSPQASLLSSFKTEHNDPPGMMLDSLPRDAPSPSSVMECTSI 354

Query: 835 DGSTCTSSETESKAPRSLDQIKVPGLKPSMSMSDFLGQIELHLSEQMTSGNPPLSAAVGS 656
           +GS TSSET+SK PR+ DQIK+PGL+PSMS+SDF+GQIEL L+EQ+TSGNPP S G
Sbjct: 355 EGS--TSSETDSKGPRNGDQIKLPGLRPSMSVSDFIGQIELCLTEQITSGNPPFSD--GG 410

Query: 655 EGYQEILEDIAQHLLNDNQVAAASDEKSLMSRVNSLCCLLQKDPVAVQKSHCAEDSNTGE 476
           Y+EILEDIAQHLLNDNQVAA SDEKSLMSRVNSLCCLLQKDPV VQ SH EDS+T E
Sbjct: 411 SEYKEILEDIAQHLLNDNQVAATSDEKSLMSRVNSLCCLLQKDPVTVQNSHFTEDSSTVE 470

Query: 475 GPGDGKDVKP-AEDSRDASSGKQALGMSRKDSFGDLLLHLPRIASLPKFLFNISEDDGDS 299
           GP DGKDVKP AE+ +D S GKQALGMSRKDSF DLLLHLPRI SLPKFLFNISE+D DS
Sbjct: 471 GPHDGKDVKPGAEEPKDTSGGKQALGMSRKDSFSDLLLHLPRITSLPKFLFNISEEDDDS 530

Query: 298 QAR 290
           A+
Sbjct: 531 HAK 533

>ref|XP_002528397.1| conserved hypothetical protein [Ricinus communis]
 gi|223532185|gb|EEF33990.1| conserved hypothetical protein [Ricinus communis]
 Length = 560

 Score = 636 bits (1641), Expect = e-180
 Identities = 356/567 (62%), Positives = 405/567 (71%), Gaps = 37/567 (6%)
 Frame = -1

Query: 1879 MVQLMKNSGKPEKAT------APVKLEIVEDPLEDEHGPLNKRHKPXXXXXXXXXXXXXX 1718
            MVQLM + P + APVK+EIVEDPLE+EHGPLNKR K
Sbjct: 1    MVQLMISGNNPVETETTSSKGAPVKVEIVEDPLEEEHGPLNKRSKQSQTVQQWGAGANAY 60

Query: 1717 SVGPSSQISILDEPSPLGLRLRKSPSLLDLIQMKLSQGSVFVTNNTE--QNENSVSEVKK 1544
            V P Q + LDEPSPLGLRLRKSPSLLDLIQM+LSQG + N N+ S +KK
Sbjct: 61   PV-PPVQYNPLDEPSPLGLRLRKSPSLLDLIQMRLSQGGASAPGTIQGTDNTNNNSVIKK 119

Query: 1543 ESRG-------TADKLKASNFPGSLLRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGG 1385
            ES + DKLKASNFP S+LRIGSWEYKSRYEG+LVAKCYFAKHKLVWEVLEGG
Sbjct: 120  ESSNKTTTASSSTDKLKASNFPASILRIGSWEYKSRYEGELVAKCYFAKHKLVWEVLEGG 179

Query: 1384 LKSKIEIQWSDIMALKAHCPDNGPSTLTVVLARQPLFFRETNPQPRKHTLWQATADFTDG 1205
            LKSKIEIQWSDIMALKA+CPDN P TLTVVLARQPLFFRETNPQPRKHTLWQATADFT+G
Sbjct: 180  LKSKIEIQWSDIMALKANCPDNEPGTLTVVLARQPLFFRETNPQPRKHTLWQATADFTNG 239

Query: 1204 QCSKHRLHFLQCPQGLLAKHFEKLIQCDMRLNFLSQQPEIILDSPYFDTQPAAFEDPDNP 1025
            Q S HR HFLQCPQGLL KHFEKLIQCDMRLNFLS+QPEIILDSPYF+ + + FEDPD+P
Sbjct: 240  QASIHRQHFLQCPQGLLNKHFEKLIQCDMRLNFLSRQPEIILDSPYFEQRTSVFEDPDDP 299

Query: 1024 KDHDLHQV--XXXXXXXXSYQDR-GSPQASLSSSFKIECNDPPGTTLESLPRDAPSPSSV 854
            K D ++V +QD SP A+ SSS +E D GTT E + R+APSPSSV
Sbjct: 300  KSQDFNKVEMGIASSSASGFQDLVASPSAAHSSS--LEIGDHAGTTSEQMSREAPSPSSV 357

Query: 853 MDCPSVDGSTCTSSETESKAPRSLDQIKVPGLKPSMSMSDFLGQIELHLSEQMTSGNPPL 674
           MD +++G+ +SK R+ DQIKVPGL PSMSMSD + IE +SEQ+TSGNP L
Sbjct: 358 MDTRAIEGNG-NCEAVDSKGLRNWDQIKVPGLHPSMSMSDLMNHIENCISEQITSGNPSL 416

Query: 673 SAAVGSEGYQEILEDIAQHLLNDNQVAAASDEKSLMSRVNSLCCLLQKDPVAVQKSHCAE 494
           SA GSEG Q ILEDIAQ+LL+DNQ+ +SDEK LM+RVNSLCCLLQKDP + Q S E
Sbjct: 417 SAD-GSEG-QNILEDIAQYLLSDNQLTTSSDEKRLMARVNSLCCLLQKDPASSQNSQANE 474

Query: 493 DSNTGEGPGDGKDVK-------------------PAEDSRDASSGKQALGMSRKDSFGDL 371
           + GE +GK V+ P +++D S KQ GMSRKDSFGDL
Sbjct: 475 EIYVGES-DNGKGVQLNNTYESLNENKNKGGIKDPELNTKDVSGSKQVPGMSRKDSFGDL 533

Query: 370 LLHLPRIASLPKFLFNISEDDGDSQAR 290
           LLHLPRIASLPKFLFNISE+DG+SQAR
Sbjct: 534 LLHLPRIASLPKFLFNISEEDGESQAR 560

>ref|XP_003518228.1| PREDICTED: uncharacterized protein LOC100794550 [Glycine max]
 Length = 531

 Score = 632 bits (1629), Expect = e-178
 Identities = 347/534 (64%), Positives = 398/534 (74%), Gaps = 25/534 (4%)
 Frame = -1

Query: 1837 TAPVKLEIVEDPLEDEHGPLNKRHKPXXXXXXXXXXXXXXSVGPSSQISILDEPSPLGLR 1658
            T P KLEI E+PL++EH PLNKR K S + +ILDEPSPLGLR
Sbjct: 27   TVPFKLEI-EEPLQEEHAPLNKRFKSS---------------STSQEYNILDEPSPLGLR 70

Query: 1657 LRKSPSLLDLIQMKLSQGSVFVTNNTEQNENSVSE-VKKESRGTA-----DKLKASNFPG 1496
            LRKSPSLLDLI+MKLSQG+V + N QNEN +S +KKESRG A +KLKASNFP
Sbjct: 71   LRKSPSLLDLIEMKLSQGNVIIANT--QNENFLSSGLKKESRGAAASDSVEKLKASNFPA 128

Query: 1495 SLLRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGGLKSKIEIQWSDIMALKAHCPDNG 1316
            SLLRIGSWEYKS++EGDLVAKCYFAKHKLVWEVLEG LK+K+EIQWSDIMALKA+CPD G
Sbjct: 129  SLLRIGSWEYKSKHEGDLVAKCYFAKHKLVWEVLEGELKNKMEIQWSDIMALKANCPDTG 188

Query: 1315 PSTLTVVLARQPLFFRETNPQPRKHTLWQATADFTDGQCSKHRLHFLQCPQGLLAKHFEK 1136
            PS+LTVVLARQPLFF+ETNPQPRKHT+WQAT+DFT+G+ KHR HFL+ PQGLLAKHFEK
Sbjct: 189  PSSLTVVLARQPLFFKETNPQPRKHTIWQATSDFTEGEACKHRQHFLEFPQGLLAKHFEK 248

Query: 1135 LIQCDMRLNFLSQQPEIILDSPYFDTQPAAFEDPDNPKDHDLHQVXXXXXXXXSYQDRGS 956
            LIQCD LNFLSQQPEIILDSP+FDT+PAAFE+ DNP+D DLH V QD GS
Sbjct: 249  LIQCDTHLNFLSQQPEIILDSPHFDTRPAAFENLDNPEDLDLHLVNCKGSTTSCLQDIGS 308

Query: 955 PQASLSSSFKIECNDPPGTTLESLPRDAPSPSSVMDCPSVDGSTCTSSETESKAPRSLDQ 776
           P +SLS SFKIE ND G ++LP +AP PSS GS TSSET+ K PR+ DQ
Sbjct: 309 PHSSLSPSFKIEHNDLLGIASDNLPCEAPFPSS--------GS--TSSETDFKGPRNWDQ 358

Query: 775 IKVPGLKPSMSMSDFLGQIELHLSEQMTSGNPPLSAAVGSEGYQEILEDIAQHLLNDNQV 596
           IK+PGL+PSM++SDF+G IE LSEQ+TSGNP S G +QE+LE+IAQHLLNDN+V
Sbjct: 359 IKLPGLRPSMAVSDFIGHIEHCLSEQITSGNP--SFCGGRPEFQEMLEEIAQHLLNDNKV 416

Query: 595 AAASDEKSLMSRVNSLCCLLQKDPVAVQKSHCAEDSNTGEGPGDGK-------------- 458
           SDEKSLM+RVNSLCCLLQKDP A+Q SH E ++ EGP DGK
Sbjct: 417 ITTSDEKSLMTRVNSLCCLLQKDPAALQSSHDKESAD--EGPADGKSIQLSHDLESMQNN 474

Query: 457 ----DVKPAEDS-RDASSGKQALGMSRKDSFGDLLLHLPRIASLPKFLFNISED 311
           DVK +E+ RD S GKQ LGM RKDS G+LLL LPRIASL KFLF+ISED
Sbjct: 475 KIKMDVKASEEEFRDVSRGKQTLGMPRKDSLGELLLQLPRIASLSKFLFDISED 528

>ref|XP_002324519.1| predicted protein [Populus trichocarpa]
 gi|222865953|gb|EEF03084.1| predicted protein [Populus trichocarpa]
 Length = 555

 Score = 615 bits (1585), Expect = e-173
 Identities = 335/535 (62%), Positives = 391/535 (73%), Gaps = 26/535 (4%)
 Frame = -1

Query: 1834 APVKLEIVEDPLEDEHGPLNKRHKPXXXXXXXXXXXXXXSVGPSSQISILDEPSPLGLRL 1655
            A VKLEIVEDPLE+++GPL+KR K V +++ + LDEPSPLGL+L
Sbjct: 26   AAVKLEIVEDPLEEKYGPLHKRSKASQTIQQWGAGANAVPVS-AAEYNPLDEPSPLGLQL 84

Query: 1654 RKSPSLLDLIQMKLSQGSVFVTNNTEQNENSVSEVKKESR-----GTADKLKASNFPGSL 1490
            RKSPSLLDLIQM+L+QG+ T+Q E VKKES+ G+ DKLKASNFP S+
Sbjct: 85   RKSPSLLDLIQMRLTQGNNASALGTQQTEKQNLGVKKESKSAPASGSMDKLKASNFPASI 144

Query: 1489 LRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGGLKSKIEIQWSDIMALKAHCPDNGPS 1310
            LRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGGLKSKIEIQWSDIMALKA+CPDN P+
Sbjct: 145  LRIGSWEYKSRYEGDLVAKCYFAKHKLVWEVLEGGLKSKIEIQWSDIMALKANCPDNEPA 204

Query: 1309 TLTVVLARQPLFFRETNPQPRKHTLWQATADFTDGQCSKHRLHFLQCPQGLLAKHFEKLI 1130
            TL VVLARQPLFFRETNPQPRKHTLWQATADFTDGQ S H+ HFLQCP+GLL KHFEKLI
Sbjct: 205  TLNVVLARQPLFFRETNPQPRKHTLWQATADFTDGQASIHKQHFLQCPEGLLNKHFEKLI 264

Query: 1129 QCDMRLNFLSQQPEIILDSPYFDTQPAAFEDPDNPKDHDLHQV-XXXXXXXXSYQDRGSP 953
            QCDMRLNFLSQQPEIILDSPYF+ +P+ FED D+ K D +QV +QD SP
Sbjct: 265  QCDMRLNFLSQQPEIILDSPYFEQRPSVFEDLDDSKSQDFNQVESAKVSVVSGFQDLASP 324

Query: 952 QASLSSSFKIECNDPPGTTLESLPRDAPSPSSVMDCPSVDGSTCTSSETESKAPRSLDQI 773
           A+ SSS +IE DP +T + + R+APSPSSVMD +++G + +SKAPR+ DQI
Sbjct: 325 SAAQSSSLEIEKGDPTASTSDPMSREAPSPSSVMDSRAIEGRGICEA-VDSKAPRNWDQI 383

Query: 772 KVPGLKPSMSMSDFLGQIELHLSEQMTSGNPPLSAAVGSEGYQEILEDIAQHLLNDNQVA 593
           KVPGL+PSMSM+D + I +SEQMTSGN P SA GSE Q+ILEDIAQ+LL+D Q
Sbjct: 384 KVPGLRPSMSMTDLMNHIGNCISEQMTSGNQPFSAD-GSE-CQDILEDIAQYLLSDTQQT 441

Query: 592 AASDEKSLMSRVNSLCCLLQKDPVAVQKSHCAEDSNTGEGPGDGK--------------- 458
           +SDEK +M+RVNSLCCLLQKDP + Q + E P +GK
Sbjct: 442 TSSDEKGIMARVNSLCCLLQKDPASTQNLQ-GNGESFFEEPNNGKGVLLNHTNESFHENK 500

Query: 457 ---DVKPAE--DSRDASSGKQALGMSRKDSFGDLLLHLPRIASLPKFLFNISEDD 308
           D++ +E S+D S K A GMSRKDSFGDLLLHLPRIASLPKFLFNISE+D
Sbjct: 501 VRGDIRGSEGSSSKDISGSKPAPGMSRKDSFGDLLLHLPRIASLPKFLFNISEED 555