BLASTX nr result
ID: Glycyrrhiza35_contig00033954
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza35_contig00033954 (651 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value GAU33749.1 hypothetical protein TSUD_52820 [Trifolium subterraneum] 82 1e-17 GAU41870.1 hypothetical protein TSUD_366180 [Trifolium subterran... 82 2e-16 GAU10291.1 hypothetical protein TSUD_418880 [Trifolium subterran... 85 2e-15 GAU49672.1 hypothetical protein TSUD_91100 [Trifolium subterraneum] 84 3e-15 GAU29466.1 hypothetical protein TSUD_65010 [Trifolium subterraneum] 84 5e-15 GAU28846.1 hypothetical protein TSUD_21830 [Trifolium subterraneum] 78 4e-13 GAU26774.1 hypothetical protein TSUD_317720 [Trifolium subterran... 73 3e-11 GAU30708.1 hypothetical protein TSUD_39320 [Trifolium subterraneum] 72 5e-11 GAU26016.1 hypothetical protein TSUD_64040 [Trifolium subterraneum] 72 6e-11 GAU29238.1 hypothetical protein TSUD_362280 [Trifolium subterran... 72 7e-11 GAU22178.1 hypothetical protein TSUD_252060 [Trifolium subterran... 70 2e-10 GAU19074.1 hypothetical protein TSUD_99360 [Trifolium subterraneum] 70 3e-10 GAU48593.1 hypothetical protein TSUD_179750 [Trifolium subterran... 69 4e-10 GAU48324.1 hypothetical protein TSUD_351640 [Trifolium subterran... 67 8e-10 KHN36156.1 Retrovirus-related Pol polyprotein from transposon TN... 68 1e-09 KHN22040.1 Retrovirus-related Pol polyprotein from transposon TN... 68 1e-09 XP_019416970.1 PREDICTED: uncharacterized protein LOC109328125 [... 68 1e-09 XP_019464137.1 PREDICTED: uncharacterized protein LOC109362641 [... 68 1e-09 GAU39478.1 hypothetical protein TSUD_159100 [Trifolium subterran... 61 2e-09 GAU50392.1 hypothetical protein TSUD_409430 [Trifolium subterran... 63 6e-09 >GAU33749.1 hypothetical protein TSUD_52820 [Trifolium subterraneum] Length = 730 Score = 81.6 bits (200), Expect(2) = 1e-17 Identities = 49/129 (37%), Positives = 57/129 (44%), Gaps = 7/129 (5%) Frame = +2 Query: 59 VQCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRPATPNHQF---L 229 +QCQIC++ HDAS CYYR G SN W Q P + F L Sbjct: 219 IQCQICYKPGHDASYCYYRFDGPSSYGYGVYGAPNGYGAPSNVWMQNLPHSSPPTFQARL 278 Query: 230 XXXXXXXXXXXXXXXXXXXDN----SQYSQQAWYPDSGASHHVTADATNVFESTSLQGSD 397 N S Q AWYPDS A+HHVT DA N+ SL G+D Sbjct: 279 TFTSQFGNPRPQTPQAYLTGNESTASSSFQNAWYPDSRATHHVTPDANNLMNVVSLSGTD 338 Query: 398 QVMMGNGQG 424 QV +GNGQG Sbjct: 339 QVHIGNGQG 347 Score = 35.8 bits (81), Expect(2) = 1e-17 Identities = 15/36 (41%), Positives = 26/36 (72%) Frame = +3 Query: 423 GTSRVLMRGNVGEDGLYKFDNIQSLNNHSKNPAILS 530 G+S++L+RG++G+DGLY+FD+ N+ + A S Sbjct: 386 GSSKILLRGSLGDDGLYQFDSPFQHNSEASASASTS 421 >GAU41870.1 hypothetical protein TSUD_366180 [Trifolium subterraneum] Length = 1276 Score = 82.0 bits (201), Expect(2) = 2e-16 Identities = 47/122 (38%), Positives = 62/122 (50%), Gaps = 1/122 (0%) Frame = +2 Query: 59 VQCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQ-QRPATPNHQFLXX 235 V CQIC++ NHDAS+C YR GFR + Q P+ + + Sbjct: 258 VTCQICNKPNHDASICRYRHAPAMPNY----------GFRPFQYQPPQYPSFFPNSYGYG 307 Query: 236 XXXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGN 415 N+ ++ Q WYPDSGASHHVT DA+N+ ++ SL GSDQV+MGN Sbjct: 308 FAPRSQRPPAPQALLTSGNTNFNNQWWYPDSGASHHVTPDASNLSDAASLPGSDQVLMGN 367 Query: 416 GQ 421 GQ Sbjct: 368 GQ 369 Score = 31.2 bits (69), Expect(2) = 2e-16 Identities = 15/34 (44%), Positives = 24/34 (70%), Gaps = 4/34 (11%) Frame = +3 Query: 420 KGTSRVLMRGNVGEDGLYKF----DNIQSLNNHS 509 + TS VL++G VG+DGLY+F ++I ++N S Sbjct: 403 QATSEVLLQGIVGKDGLYRFASPLNSISAINKSS 436 >GAU10291.1 hypothetical protein TSUD_418880 [Trifolium subterraneum] Length = 483 Score = 84.7 bits (208), Expect = 2e-15 Identities = 46/129 (35%), Positives = 60/129 (46%), Gaps = 7/129 (5%) Frame = +2 Query: 59 VQCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQ--RPATPNHQFLX 232 +QCQIC++ HDAS CYYR G SN W Q RP+ P Sbjct: 252 IQCQICYKTGHDASYCYYRFDGPNSYGYGGYGAPNGYGAPSNVWMQNLPRPSQPTFNARP 311 Query: 233 XXXXXXXXXXXXXXXXXXDNSQYSQQA-----WYPDSGASHHVTADATNVFESTSLQGSD 397 ++ + + WYPDSGA+HHVT DA N+ ++ SL G+D Sbjct: 312 AFPPQFGNPKPQAPQAYLTGNESTASSSFSNGWYPDSGATHHVTPDANNLMDAVSLSGTD 371 Query: 398 QVMMGNGQG 424 QV +GNGQG Sbjct: 372 QVHIGNGQG 380 >GAU49672.1 hypothetical protein TSUD_91100 [Trifolium subterraneum] Length = 1380 Score = 84.3 bits (207), Expect = 3e-15 Identities = 47/127 (37%), Positives = 62/127 (48%), Gaps = 5/127 (3%) Frame = +2 Query: 59 VQCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRPATPNHQF---- 226 + CQICH+ NHDAS+C YR T G+R + + + + F Sbjct: 287 ITCQICHKPNHDASICRYRHTGNTGF-----------GYRPSQYPPAQYPPSQYHFPYGY 335 Query: 227 -LXXXXXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQV 403 N ++ Q WYPDSGASHHVT DA+N+ ++ SL GSDQV Sbjct: 336 GYGYQPRPPPRPSTPQALLTSGNIGFNNQWWYPDSGASHHVTPDASNLSDAASLSGSDQV 395 Query: 404 MMGNGQG 424 +MGNGQG Sbjct: 396 LMGNGQG 402 >GAU29466.1 hypothetical protein TSUD_65010 [Trifolium subterraneum] Length = 1362 Score = 83.6 bits (205), Expect = 5e-15 Identities = 47/130 (36%), Positives = 59/130 (45%), Gaps = 8/130 (6%) Frame = +2 Query: 59 VQCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRP--------ATP 214 +QCQIC++ HDAS C+YR G SN W Q P A P Sbjct: 254 IQCQICYKPGHDASYCHYRFDGPGPYGYGGYGAPIGYGAPSNVWMQNMPRASQPTFQARP 313 Query: 215 NHQFLXXXXXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGS 394 +S + WYPDSGA+HHVT DATN+ ++ SL G+ Sbjct: 314 TFPSQFGNPRPQTPQAYLTGNESTASSSF-HNGWYPDSGATHHVTPDATNLMDAVSLSGT 372 Query: 395 DQVMMGNGQG 424 DQV +GNGQG Sbjct: 373 DQVHIGNGQG 382 >GAU28846.1 hypothetical protein TSUD_21830 [Trifolium subterraneum] Length = 1496 Score = 78.2 bits (191), Expect = 4e-13 Identities = 48/125 (38%), Positives = 61/125 (48%), Gaps = 5/125 (4%) Frame = +2 Query: 65 CQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRPATP--NHQF---L 229 CQICH+ NHDAS C +R G+R Q P TP + QF Sbjct: 273 CQICHKNNHDASFCRFRYNTPSQGYGYGYGF----GYRP-----QAPQTPQASSQFPMSY 323 Query: 230 XXXXXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMM 409 + ++ Q WYPDSGASHHVT D +N+ ++TSL GSDQV++ Sbjct: 324 GYGFPRPSRPPAPQAMLTGGDPNFNNQWWYPDSGASHHVTPDPSNLSDTTSLPGSDQVLI 383 Query: 410 GNGQG 424 GNGQG Sbjct: 384 GNGQG 388 >GAU26774.1 hypothetical protein TSUD_317720 [Trifolium subterraneum] Length = 1262 Score = 72.8 bits (177), Expect = 3e-11 Identities = 31/45 (68%), Positives = 38/45 (84%) Frame = +2 Query: 290 NSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNGQG 424 NS ++ Q WYPDSGASHHVT D +N+ ++TSL GSDQV+MGNGQG Sbjct: 309 NSSFNNQWWYPDSGASHHVTPDVSNLSDATSLPGSDQVLMGNGQG 353 >GAU30708.1 hypothetical protein TSUD_39320 [Trifolium subterraneum] Length = 1432 Score = 72.0 bits (175), Expect = 5e-11 Identities = 44/122 (36%), Positives = 53/122 (43%) Frame = +2 Query: 59 VQCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRPATPNHQFLXXX 238 VQCQIC + NHDA+ C+YR G N + RP N Sbjct: 296 VQCQICSKSNHDAANCWYRYEPPSSRT---------NGRGYNAGNTSRPPLYN----PYP 342 Query: 239 XXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNG 418 + YS +WYPDSGASHH+T + N+ T QG DQV MGNG Sbjct: 343 RPSAHLALPQYYNPTAEFDTYSNASWYPDSGASHHLTFNPNNMAYRTPYQGQDQVTMGNG 402 Query: 419 QG 424 QG Sbjct: 403 QG 404 >GAU26016.1 hypothetical protein TSUD_64040 [Trifolium subterraneum] Length = 942 Score = 71.6 bits (174), Expect = 6e-11 Identities = 42/122 (34%), Positives = 55/122 (45%) Frame = +2 Query: 59 VQCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRPATPNHQFLXXX 238 VQCQIC + NHDA+ C+YR G+ + + +Q P + Sbjct: 296 VQCQICAKHNHDAANCWYRYDPPSSRYNAR-------GYNAGSTSRQPQYNPYPR----- 343 Query: 239 XXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNG 418 D +S +WYPDSGASHH+T + N+ T QG DQV MGNG Sbjct: 344 -PSAHLALPQHYNPIADMDTFSNASWYPDSGASHHLTFNPNNLTYRTPYQGQDQVTMGNG 402 Query: 419 QG 424 QG Sbjct: 403 QG 404 >GAU29238.1 hypothetical protein TSUD_362280 [Trifolium subterraneum] Length = 1433 Score = 71.6 bits (174), Expect = 7e-11 Identities = 43/123 (34%), Positives = 54/123 (43%), Gaps = 1/123 (0%) Frame = +2 Query: 59 VQCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTW-HQQRPATPNHQFLXX 235 VQCQIC + NHDA++C+YR RSN H ++ + Sbjct: 296 VQCQICGKANHDAAICWYRYEPPSS--------------RSNACGHNAGSSSRPPPYNPY 341 Query: 236 XXXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGN 415 D S +WYPDSGASHH+T + N+ T QG DQV MGN Sbjct: 342 PRPSAHLALPQYYNPIADMDSVSNASWYPDSGASHHLTFNPNNLTYRTPYQGQDQVTMGN 401 Query: 416 GQG 424 GQG Sbjct: 402 GQG 404 >GAU22178.1 hypothetical protein TSUD_252060 [Trifolium subterraneum] Length = 420 Score = 69.7 bits (169), Expect = 2e-10 Identities = 40/119 (33%), Positives = 56/119 (47%) Frame = +2 Query: 65 CQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRPATPNHQFLXXXXX 244 C+ICH+ NHDAS C +R + ++ Q P + + F Sbjct: 274 CEICHKNNHDASYCRFRYSTPSQGYGYGYGYGFGYRPQAPQASSQFPMSYGYGF-----P 328 Query: 245 XXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNGQ 421 + +S Q WYPDSGA HHVT D +N+ ++T L GSDQV++GNGQ Sbjct: 329 RPSIPPAPQAMLTGGDPNFSNQWWYPDSGAFHHVTPDPSNLSDTTYLLGSDQVLIGNGQ 387 >GAU19074.1 hypothetical protein TSUD_99360 [Trifolium subterraneum] Length = 1329 Score = 69.7 bits (169), Expect = 3e-10 Identities = 40/112 (35%), Positives = 56/112 (50%), Gaps = 1/112 (0%) Frame = +2 Query: 92 DASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQR-PATPNHQFLXXXXXXXXXXXXX 268 DAS+C YR GFR + + R P+ + ++ Sbjct: 239 DASICRYRHATTMPNY----------GFRPSQYQPPRYPSFFPNSYVYGFAPRAQRPLAP 288 Query: 269 XXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNGQG 424 ++ ++ Q WYPDSGASHHVT DA+N+ ++ SL GSDQV+MGNGQG Sbjct: 289 QALLTNGSTNFNNQWWYPDSGASHHVTPDASNLSDAASLPGSDQVLMGNGQG 340 >GAU48593.1 hypothetical protein TSUD_179750 [Trifolium subterraneum] Length = 1364 Score = 69.3 bits (168), Expect = 4e-10 Identities = 29/45 (64%), Positives = 38/45 (84%) Frame = +2 Query: 290 NSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNGQG 424 ++ ++ Q WYPDSGASHHVT DA+N+ ++ SL GSDQV+MGNGQG Sbjct: 301 STNFNNQWWYPDSGASHHVTPDASNLSDAASLPGSDQVLMGNGQG 345 >GAU48324.1 hypothetical protein TSUD_351640 [Trifolium subterraneum] Length = 301 Score = 67.4 bits (163), Expect = 8e-10 Identities = 40/122 (32%), Positives = 51/122 (41%) Frame = +2 Query: 62 QCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRPATPNHQFLXXXX 241 +CQIC + NHDA+ C+YR G+ + + P P Sbjct: 193 KCQICTKSNHDATNCWYRYEPPSSRANAR-------GYNAGNTSRAPPYNP------YPC 239 Query: 242 XXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNGQ 421 D S +WYPDSGASHH+T + N+ QG DQV MGNGQ Sbjct: 240 PAAHLALPQYYHPIPDMDTVSTSSWYPDSGASHHLTFNPNNLAYRMPYQGQDQVTMGNGQ 299 Query: 422 GY 427 GY Sbjct: 300 GY 301 >KHN36156.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 1417 Score = 68.2 bits (165), Expect = 1e-09 Identities = 41/122 (33%), Positives = 57/122 (46%) Frame = +2 Query: 59 VQCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRPATPNHQFLXXX 238 V+CQIC + NHDA C+YR GG++ ++ + P + Sbjct: 259 VKCQICAKPNHDAINCWYRYDPQAMNQNSR------GGYQVGPSNRPQNFNPYMR----- 307 Query: 239 XXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNG 418 + Q+S AWYPDSGASHH+T + N+ S+ G DQV+MGNG Sbjct: 308 --PTAHLAMPQPYAMPNMDQFSNGAWYPDSGASHHLTYNPNNLSYSSPYTGQDQVVMGNG 365 Query: 419 QG 424 QG Sbjct: 366 QG 367 >KHN22040.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94, partial [Glycine soja] Length = 1417 Score = 68.2 bits (165), Expect = 1e-09 Identities = 41/122 (33%), Positives = 57/122 (46%) Frame = +2 Query: 59 VQCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRPATPNHQFLXXX 238 V+CQIC + NHDA C+YR GG++ ++ + P + Sbjct: 259 VKCQICAKPNHDAINCWYRYDPQAMNQNSR------GGYQVGPSNRPQNFNPYMR----- 307 Query: 239 XXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNG 418 + Q+S AWYPDSGASHH+T + N+ S+ G DQV+MGNG Sbjct: 308 --PTAHLAMPQPYAMPNMDQFSNGAWYPDSGASHHLTYNPNNLSYSSPYTGQDQVVMGNG 365 Query: 419 QG 424 QG Sbjct: 366 QG 367 >XP_019416970.1 PREDICTED: uncharacterized protein LOC109328125 [Lupinus angustifolius] XP_019449377.1 PREDICTED: uncharacterized protein LOC109352047 [Lupinus angustifolius] Length = 482 Score = 67.8 bits (164), Expect = 1e-09 Identities = 30/85 (35%), Positives = 42/85 (49%) Frame = +2 Query: 170 GFRSNTWHQQRPATPNHQFLXXXXXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVT 349 GF N+ P +P H + + Y+ Q WYPDSGA+HH+T Sbjct: 393 GFHQNSVLGPAPMSPAHSYADSFHPTQGQPSPHHLPVTVEPQAYNSQLWYPDSGATHHIT 452 Query: 350 ADATNVFESTSLQGSDQVMMGNGQG 424 +D++N+ S L GSD + MGNG G Sbjct: 453 SDSSNLMHSAGLPGSDSIFMGNGSG 477 >XP_019464137.1 PREDICTED: uncharacterized protein LOC109362641 [Lupinus angustifolius] Length = 482 Score = 67.8 bits (164), Expect = 1e-09 Identities = 30/85 (35%), Positives = 42/85 (49%) Frame = +2 Query: 170 GFRSNTWHQQRPATPNHQFLXXXXXXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVT 349 GF N+ P +P H + + Y+ Q WYPDSGA+HH+T Sbjct: 393 GFHQNSVLGPAPMSPAHSYADSFHPTQGQPSPHHLPVTVEPQAYNSQLWYPDSGATHHIT 452 Query: 350 ADATNVFESTSLQGSDQVMMGNGQG 424 +D++N+ S L GSD + MGNG G Sbjct: 453 SDSSNLMHSAGLPGSDSIFMGNGSG 477 >GAU39478.1 hypothetical protein TSUD_159100 [Trifolium subterraneum] Length = 1143 Score = 61.2 bits (147), Expect(2) = 2e-09 Identities = 37/121 (30%), Positives = 50/121 (41%) Frame = +2 Query: 62 QCQICHRWNHDASVCYYRMTXXXXXXXXXXXXXXXGGFRSNTWHQQRPATPNHQFLXXXX 241 +C IC + NHDA+ C+YR G+ + + P P + Sbjct: 203 KCHICAKSNHDATNCWYRYEPPSSRANAR-------GYNAGNTSRAPPYNPYPR------ 249 Query: 242 XXXXXXXXXXXXXXXDNSQYSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNGQ 421 D S +WYPDSGASHH+T + N+ QG DQV +GNGQ Sbjct: 250 PSAHLALPQYYHPIPDMDTVSTSSWYPDSGASHHLTFNPNNLAYRMPYQGQDQVTVGNGQ 309 Query: 422 G 424 G Sbjct: 310 G 310 Score = 28.9 bits (63), Expect(2) = 2e-09 Identities = 17/41 (41%), Positives = 22/41 (53%), Gaps = 3/41 (7%) Frame = +3 Query: 426 TSRVLMRGNVGEDGLYKFDNIQ---SLNNHSKNPAILSLLL 539 T + L+ G VG DGLYKF + + NN +I SL L Sbjct: 340 TKQTLLEGTVGSDGLYKFQPFEFTPTKNNTLCTQSISSLPL 380 >GAU50392.1 hypothetical protein TSUD_409430 [Trifolium subterraneum] Length = 1069 Score = 62.8 bits (151), Expect(2) = 6e-09 Identities = 26/41 (63%), Positives = 35/41 (85%) Frame = +2 Query: 299 YSQQAWYPDSGASHHVTADATNVFESTSLQGSDQVMMGNGQ 421 ++ Q W+PDSGASHHVT D +N+ ++TSL GSDQV++GNGQ Sbjct: 287 FNNQWWHPDSGASHHVTPDPSNLSDTTSLPGSDQVLIGNGQ 327 Score = 25.4 bits (54), Expect(2) = 6e-09 Identities = 11/20 (55%), Positives = 14/20 (70%) Frame = +3 Query: 420 KGTSRVLMRGNVGEDGLYKF 479 + +S VL+ G V DGLYKF Sbjct: 361 QASSEVLLHGVVRADGLYKF 380