BLASTX nr result
ID: Glycyrrhiza24_contig00004157
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza24_contig00004157 (1430 letters)

Database: ./nr
23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done

                                                                    Score     E
Sequences producing significant alignments:                         (bits)  Value

ref|XP_003629374.1| Microspherule protein [Medicago truncatula] ...   305   2e-80
ref|XP_002310284.1| predicted protein [Populus trichocarpa] gi|2...   166   2e-38
ref|XP_002510473.1| conserved hypothetical protein [Ricinus comm...   127   7e-27
ref|XP_003548864.1| PREDICTED: uncharacterized protein LOC100779...   115   3e-23
ref|XP_003519881.1| PREDICTED: uncharacterized protein LOC100788...   114   5e-23

>ref|XP_003629374.1| Microspherule protein [Medicago truncatula] gi|355523396|gb|AET03850.1| Microspherule protein [Medicago truncatula]
Length = 747

Score = 305 bits (781), Expect = 2e-80
Identities = 185/378 (48%), Positives = 221/378 (58%), Gaps = 15/378 (3%)
Frame = +2

Query: 254 PPWNPEDDFLLKNXXXXXXXXXXXXKGAVPFSRRYSVTELRDRWRSLLYDPDVSAEASAS 433
PPWN +DDF+LK+ KG V FS+RYS EL +RW SLLYD D+S EAS +
Sbjct: 12 PPWNSDDDFVLKSAVEGGASLESLAKGVVSFSKRYSTAELTERWHSLLYDYDISDEASVA 71

Query: 434 MVNLELGKSNGNGIRE-----------ASGDGGGGKRKAQSIRKQYYAMRKKLCTEV-FD 577
M NLE+ K N +GI+E AS D KRK Q++RK+YYAMRK+L TEV F+
Sbjct: 72 MNNLEVAKPNSDGIKEAVSVDLGIKEAASVDVTARKRKTQTLRKKYYAMRKRLRTEVFFN 131

Query: 578 SFDMALRDEMCVENHAVETD---GXXXXXXXXXXXXXXXXXXXXXGYGNYSGLEEAGVGS 748
+FDMALRDEMC+EN+ E + YGN GL A GS
Sbjct: 132 TFDMALRDEMCIENNTTEKEIVGSGNINDCLNKDVNNNLLVNSIVNYGNQLGLVGARAGS 191

Query: 749 SHSMSEAVPLWKTIEDVSAPAMPVQVSLENEGGGGSGAREMIPHGLKGKCRNGVLNSDDD 928
SHSMSE PLWKT+EDVSAP MP+ SLEN GGS ++E IP
Sbjct: 192 SHSMSED-PLWKTMEDVSAPNMPIHASLEN---GGSESKETIP----------------- 230

Query: 929 PADALLGDVESHISDSLLDLTNADELLFGDIDGKDETVVDKQCYDNVDSLLLSSPCDIQA 1108
H+SD+L +L N DEL+F +ID KDET V+KQ NVDS+LL SPCDIQ
Sbjct: 231 -----------HVSDALFNLPNEDELMFVNIDEKDETAVNKQSDANVDSILLRSPCDIQG 279

Query: 1109 NDVSDVRQSHKLDTETKLAVPGGSSAGLEVVAKPLASSHGDLGPVPDPGNKVQLSAAAQS 1288
D+S V +S KL ET+LA+ G SA LEVVA ASSHGD G V D N+VQ SAAA
Sbjct: 280 EDMSVVGESQKLVAETRLAMANGPSAELEVVADSPASSHGDSGFVADCRNEVQSSAAAHG 339

Query: 1289 SHCEAGEEFMYCVLNTED 1342
SH + EF C LNTED
Sbjct: 340 SHPKPANEFRVCSLNTED 357

>ref|XP_002310284.1| predicted protein [Populus trichocarpa] gi|222853187|gb|EEE90734.1| predicted protein [Populus trichocarpa]
Length = 720

Score = 166 bits (419), Expect = 2e-38
Identities = 143/424 (33%), Positives = 192/424 (45%), Gaps = 58/424 (13%)
Frame = +2

Query: 245 AVLPPWNPEDDFLLKNXXXXXXXXXXXXKGAVPFSRRYSVTELRDRWRSLLYDPDVSAEA 424
++ P W PEDD LLKN KGAV FSR++SV ELRDRW SLLYD +VS EA
Sbjct: 17 SIPPSWIPEDDLLLKNAIEAGASLEALAKGAVRFSRKFSVRELRDRWHSLLYDNEVSTEA 76

Query: 425 SASMVNLELGKSNGNGIREASGDGGGGK------------RKAQSIRKQYYA----MRKK 556
S+ MV LEL SN + + +S G K RK + +R+ YYA MRK+
Sbjct: 77 SSRMVELEL--SNFSYTKVSSSSNGNSKFGFVVKESDPVKRKFECVRQLYYAMRKKMRKR 134

Query: 557 --------------------LCTEVFDSFDMALRDEMCVENHAVETDGXXXXXXXXXXXX 676
+ F + DE V + E +
Sbjct: 135 GGGFGFLGSLDGGGCEGNGGFGEDDRVHFGFSGEDEGGVGDVRFERENVRKDVQDIGDGL 194

Query: 677 XXXXXXXXXGYGNYSGLEEAGV--GSSHSMSEAVPLWKTIEDVSAPAMPVQVSLENEGGG 850
G+ E V + S+ VPLWKT+EDVSAP MPV S+E +G
Sbjct: 195 VELRDSERGEEAGPCGVPERDVLIQAESSLVTRVPLWKTMEDVSAPEMPVSASVEGKGNS 254

Query: 851 GSG--AREMIPHGLKGKC------RNGVLNSDDDPADALLGDVE------SHISDSLLDL 988
G G + G K +GV ++ DAL ISDSLL+
Sbjct: 255 GEGMLVDNDVVDGNKVSLAGVDVNHSGVTFQEEPTVDALDRSTAISESDFPDISDSLLNF 314

Query: 989 TNADELLFGDIDGKDETVVDKQCYDNVDSLLLSSPCDIQANDVSDVRQSHKLDTETKLAV 1168
N D LF D+DGKD +DK CYD+V +LL+SSP D+Q DV +V+ L ++T L +
Sbjct: 315 PNEDAPLFMDVDGKD--AIDKSCYDSVTTLLVSSPIDVQ-GDVPNVKAPEILASDTSLGI 371

Query: 1169 PGGS-SAGLEVVAKPLASSHGDLGPVPDPGNKVQLSAAAQSS-----HCEAGEEFMYCVL 1330
P + A LEV+ + S G+ D +++SA + +S E + M CVL
Sbjct: 372 PDSACPAELEVIPEESYSVGGN----QDSNFVLEMSAPSSTSASNILSAEENDGEMECVL 427

Query: 1331 NTED 1342
N ED
Sbjct: 428 NMED 431

>ref|XP_002510473.1| conserved hypothetical protein [Ricinus communis] gi|223551174|gb|EEF52660.1| conserved hypothetical protein [Ricinus communis]
Length = 776

Score = 127 bits (319), Expect = 7e-27
Identities = 122/410 (29%), Positives = 169/410 (41%), Gaps = 39/410 (9%)
Frame = +2

Query: 233 MEPLAVLPPWNPEDDFLLKNXXXXXXXXXXXXKGAVPFSRRYSVTELRDRWRSLLYDPDV 412
M LA L W PEDD LLKN KGAV FSR+++V EL++RW SLLYDP V
Sbjct: 1 MGALAPLSSWIPEDDLLLKNAVEAGASLESLAKGAVQFSRKFTVRELQERWHSLLYDPIV 60

Query: 413 SAEASASMVNLELGKSNGNGIREASGDGG-----GGKRKAQSIRKQYYAMRKKLCTEVFD 577
SAEA+ M+ E S SG+ GKRKA+SIR YYA+RK++ E F+
Sbjct: 61 SAEAAFHMIEFERSASTLPSKFSKSGNSKESKSVSGKRKAESIRNCYYALRKRIRNEPFN 120

Query: 578 SFDMALRDEMCVENHAVETDGXXXXXXXXXXXXXXXXXXXXXGYGNYSGLEEAGVGSSHS 757
+ D++ N D + GL+ + H
Sbjct: 121 TMDLSFLIAPTDSNFIGNED-----------EPFSGNCILEDPVSTHFGLQGTNLDIMHH 169

Query: 758 MSEAVPLWKTIEDVSAPAMPVQVSL---ENEGGGGSGAREMIPHGLKGK--CRNGVLNSD 922
+ +D SA A+ Q E+ E IPH + G+ L +
Sbjct: 170 SFPEIG-----DDASAHALHAQFQNTIGEDYPVEQDIVHEEIPH-IHGENIWDTFSLPCN 223

Query: 923 DDPADALLGDVESHISDS--------------------LLDLTNA-------DELLFGDI 1021
DD + L + + H S L +L+N+ +ELLF D+
Sbjct: 224 DDTKNTCLSEYDVHGESSLKLEIPSEEMKNVNASTEGYLAELSNSLLNFTNEEELLFTDV 283

Query: 1022 DGKDETVVDKQCYDNVDSLLLSSPCDIQANDVSDVRQSHKLDTETKLAVPGGSSAGLEVV 1201
DGKD +DK YD + SLLL+SP DI + D+ + SS + +
Sbjct: 284 DGKD--AIDKSYYDGLSSLLLNSPNDISQERMPDITEP-------------DSSLTPDYI 328

Query: 1202 AKPLASSHGDLGP--VPDPGNKVQLSAAAQSSHCEAGEEFMYCVLNTEDP 1345
+SHG+L D G+ + S C E + C LNTEDP
Sbjct: 329 VNQCGASHGELDEDRGSDTGDVIGHSEVQLPELC---VEVIICTLNTEDP 375

>ref|XP_003548864.1| PREDICTED: uncharacterized protein LOC100779823 [Glycine max]
Length = 612

Score = 115 bits (288), Expect = 3e-23
Identities = 61/104 (58%), Positives = 73/104 (70%), Gaps = 1/104 (0%)
Frame = +2

Query: 251 LPPWNPEDDFLLKNXXXXXXXXXXXXKGAVPFSRRYSVTELRDRWRSLLYDPDVSAEASA 430
+ PW P+DDFLLKN KGAV FSRR+SVTELRDRW++LLYDPDVSA A A
Sbjct: 1 MEPWLPQDDFLLKNAIEGGASLESLAKGAVRFSRRFSVTELRDRWQALLYDPDVSAAARA 60

Query: 431 SMVNLELGKSNGNGIREASGDGGGG-KRKAQSIRKQYYAMRKKL 559
+M NLEL K G +G+GGGG KR ++SIRK Y AM+K+L
Sbjct: 61 AMANLELTKYGGG---TGTGEGGGGKKRNSESIRKHYSAMQKRL 101

Score = 86.7 bits (213), Expect = 1e-14
Identities = 76/226 (33%), Positives = 99/226 (43%), Gaps = 14/226 (6%)
Frame = +2

Query: 707 YGNYSGLEEAGVGSSHSMSEAVPLWKTIEDVSAPAMPVQVSLENEGGGGSGAREMIP--- 877
YG +G E G G + SE++ + AM ++ G GS A+
Sbjct: 70 YGGGTGTGEGGGGKKRN-SESIRKHYS-------AMQKRLRRCRHGVAGSDAKNATEGCD 121

Query: 878 ---HGL------KGKCRNGVLNSDDDPADALLGDVESHISDSLLDLTNADELLFGDIDGK 1030
HGL KG+C NGVL + +A+L D + L +L N DE++F D+ K
Sbjct: 122 GKSHGLEGRVNLKGECGNGVLRLN--APNAVLRDDAKNDLKCLFNLANEDEVVFMDLVRK 179

Query: 1031 DETVVDKQ--CYDNVDSLLLSSPCDIQANDVSDVRQSHKLDTETKLAVPGGSSAGLEVVA 1204
+ DK YDNVDSLLLSSPCD+Q +D D R+
Sbjct: 180 EVLAADKDKPSYDNVDSLLLSSPCDVQGDD--DGRE------------------------ 213

Query: 1205 KPLASSHGDLGPVPDPGNKVQLSAAAQSSHCEAGEEFMYCVLNTED 1342
PL D V + GN S A Q+ H E E FM CVLNTED
Sbjct: 214 -PLGGGCADQHCVSESGNNAGSSGAVQTPHAEQSEGFMICVLNTED 258

>ref|XP_003519881.1| PREDICTED: uncharacterized protein LOC100788061 [Glycine max]
Length = 610

Score = 114 bits (286), Expect = 5e-23
Identities = 58/103 (56%), Positives = 72/103 (69%)
Frame = +2

Query: 251 LPPWNPEDDFLLKNXXXXXXXXXXXXKGAVPFSRRYSVTELRDRWRSLLYDPDVSAEASA 430
+ PW P+DDFLLKN KGAV FSRR+SVTELRDRW++LLYDPDVSA A A
Sbjct: 1 MEPWLPQDDFLLKNAIEGGASLESLAKGAVRFSRRFSVTELRDRWQALLYDPDVSAAALA 60

Query: 431 SMVNLELGKSNGNGIREASGDGGGGKRKAQSIRKQYYAMRKKL 559
+M +LE K G + G GGG KRK++SIRK Y+AM+++L
Sbjct: 61 AMAHLEAAK-YGGAAGTSEGGGGGKKRKSESIRKHYFAMQRRL 102

Score = 87.8 bits (216), Expect = 6e-15
Identities = 72/215 (33%), Positives = 93/215 (43%), Gaps = 3/215 (1%)
Frame = +2

Query: 707 YGNYSGLEEAGVGSSHSMSEAVPL-WKTIEDVSAPAMPVQVSLENEGGGGSGAREMIPHG 883
YG +G E G G SE++ + ++ EG G+G +
Sbjct: 70 YGGAAGTSEGGGGGKKRKSESIRKHYFAMQRRLRGCRHSDAKNATEGCEGNGRGLEVRVN 129

Query: 884 LKGKCRNGVLNSDDDPADALLGDVESHISDSLLDLTNADELLFGDIDGKDETVVDKQ--C 1057
KG+C NG L + P DA+L D + + LL+ N D L+F D+D K+ T VDK
Sbjct: 130 SKGECGNGGLRINA-PDDAVLRDDAKNDLECLLNSANEDGLVFMDVDRKEVTAVDKDKPS 188

Query: 1058 YDNVDSLLLSSPCDIQANDVSDVRQSHKLDTETKLAVPGGSSAGLEVVAKPLASSHGDLG 1237
YDN D +L SSPCD+Q G S G E PL D
Sbjct: 189 YDNFDLILSSSPCDVQ-----------------------GDSDGRE----PLGGGCADQH 221

Query: 1238 PVPDPGNKVQLSAAAQSSHCEAGEEFMYCVLNTED 1342
V + GN S A QS E GE +M CVLNTED
Sbjct: 222 CVSESGNDAGSSGAVQSPLPERGEGYMICVLNTED 256