BLASTX nr result
ID: Glycyrrhiza34_contig00020511
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza34_contig00020511 (375 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value KYP39878.1 hypothetical protein KK1_038796 [Cajanus cajan] 81 9e-16 XP_014632447.1 PREDICTED: uncharacterized protein LOC100798029 [... 78 8e-15 KYP78202.1 Retrovirus-related Pol polyprotein from transposon TN... 77 3e-14 XP_003597128.2 Myb/SANT-like DNA-binding domain protein [Medicag... 77 6e-14 XP_014623011.1 PREDICTED: uncharacterized protein LOC102665640 [... 77 7e-14 XP_003538716.1 PREDICTED: uncharacterized protein LOC100798851 [... 76 1e-13 XP_006589869.1 PREDICTED: uncharacterized protein LOC102663555 [... 72 2e-13 KYP35038.1 hypothetical protein KK1_043951 [Cajanus cajan] 74 5e-13 XP_006573751.1 PREDICTED: uncharacterized protein LOC100807274 i... 74 5e-13 KHN25746.1 hypothetical protein glysoja_018320 [Glycine soja] 74 5e-13 XP_003516682.1 PREDICTED: uncharacterized protein LOC100807274 i... 74 5e-13 XP_014619055.1 PREDICTED: uncharacterized protein LOC106795037 [... 70 7e-13 XP_013441863.1 Ulp1 protease family, carboxy-terminal domain pro... 73 8e-13 XP_006582171.1 PREDICTED: uncharacterized protein LOC102668599 [... 69 2e-12 XP_006589878.2 PREDICTED: uncharacterized protein LOC102665245 [... 72 3e-12 XP_014622322.1 PREDICTED: uncharacterized protein LOC100818274 [... 71 6e-12 KYP40410.1 hypothetical protein KK1_038259 [Cajanus cajan] 71 8e-12 XP_003627090.1 Ulp1 protease family, carboxy-terminal domain pro... 70 1e-11 XP_014630124.1 PREDICTED: uncharacterized protein LOC102667062 [... 70 1e-11 KHM99138.1 hypothetical protein glysoja_032029 [Glycine soja] 70 1e-11 >KYP39878.1 hypothetical protein KK1_038796 [Cajanus cajan] Length = 300 Score = 80.9 bits (198), Expect = 9e-16 Identities = 41/105 (39%), Positives = 58/105 (55%), Gaps = 1/105 (0%) Frame = +3 Query: 24 VVVWMCSLDHKPENSMKNTVNVALEGYWRTKG-KRKKNLIWINATVPIQVSN*ECGHYVM 200 + V +CSL P + ++ A+EGY K K KK + W++ Q N ECG+YV+ Sbjct: 194 IAVCICSLHKLPPMDFRQLLDRAMEGYHILKSLKLKKKMSWVSPKSHKQKGNYECGYYVL 253 Query: 201 RYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLEHY 335 + M IV +IV W E F D+SP E I RE WA F+++HY Sbjct: 254 KIMHTIVDSKIVSGWTEIFIDRSPLPLEDINTIREQWATFFIDHY 298 >XP_014632447.1 PREDICTED: uncharacterized protein LOC100798029 [Glycine max] Length = 265 Score = 77.8 bits (190), Expect = 8e-15 Identities = 40/104 (38%), Positives = 63/104 (60%), Gaps = 2/104 (1%) Frame = +3 Query: 24 VVVWMCSLDHKPENSMKNTVNVALEGYWRT-KGKRKKN-LIWINATVPIQVSN*ECGHYV 197 VVVW CSL +P+ ++K VN A++ +T +GK ++ WI A +Q N ECG+YV Sbjct: 159 VVVWFCSLRKRPDAAIKGAVNSAMKSVTKTAEGKPPQHGPQWIEAKSHVQTGNYECGYYV 218 Query: 198 MRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 M ++ IV+G + DW F+D+S +EE I R WA ++++ Sbjct: 219 MHWIWCIVTGGLKDDWIHWFSDRSAVTEETITTLRHKWAAYFIQ 262 >KYP78202.1 Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus cajan] Length = 294 Score = 76.6 bits (187), Expect = 3e-14 Identities = 41/105 (39%), Positives = 59/105 (56%), Gaps = 1/105 (0%) Frame = +3 Query: 24 VVVWMCSLDHKPENSMKNTVNVALEGYWRTKG-KRKKNLIWINATVPIQVSN*ECGHYVM 200 + V +CSL P + ++ A+EGY KG K KK + W++ Q N EC +YVM Sbjct: 188 IAVCICSLHKPPPMDFRQLLDRAMEGYHILKGSKLKKKMSWVSPKSHKQKGNYECEYYVM 247 Query: 201 RYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLEHY 335 + M IV +IV W + F D+SP S E I RE WA F++++Y Sbjct: 248 KTMHTIVDLQIVSRWTKIFIDQSPLSLEDINTIREQWATFFIDYY 292 >XP_003597128.2 Myb/SANT-like DNA-binding domain protein [Medicago truncatula] AES67379.2 Myb/SANT-like DNA-binding domain protein [Medicago truncatula] Length = 1223 Score = 77.0 bits (188), Expect = 6e-14 Identities = 38/101 (37%), Positives = 60/101 (59%) Frame = +3 Query: 27 VVWMCSLDHKPENSMKNTVNVALEGYWRTKGKRKKNLIWINATVPIQVSN*ECGHYVMRY 206 VV++CSL+ KP+ ++ TV+ AL+ Y + +G +KK WI Q + ECG+Y+M + Sbjct: 1115 VVFLCSLERKPDKNIIQTVDSALDEYHKLQGVQKKKPTWIVPVCQRQPESYECGYYIMIH 1174 Query: 207 MLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 ML IVS I+ W++ F + PF E+ + R+ A LE Sbjct: 1175 MLKIVSDGIIDSWKKIFGNPEPFDEDELINVRQRCASLILE 1215 >XP_014623011.1 PREDICTED: uncharacterized protein LOC102665640 [Glycine max] Length = 530 Score = 76.6 bits (187), Expect = 7e-14 Identities = 39/104 (37%), Positives = 64/104 (61%), Gaps = 2/104 (1%) Frame = +3 Query: 24 VVVWMCSLDHKPENSMKNTVNVALEGYWRT-KGKRKKN-LIWINATVPIQVSN*ECGHYV 197 VVVW SL KP+ ++K VN A++ +T +GK ++ WI A +Q N ECG+YV Sbjct: 415 VVVWFFSLKKKPDAAIKGAVNSAMKSVTKTAEGKPPQHGPQWIEAKSHVQTGNYECGYYV 474 Query: 198 MRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 M+++ I+SG + DW F+++SP +EE + R WA ++++ Sbjct: 475 MQWIWCIISGGLKDDWIHWFSNRSPLTEETMTTLRHKWAAYFIQ 518 >XP_003538716.1 PREDICTED: uncharacterized protein LOC100798851 [Glycine max] KHN34985.1 hypothetical protein glysoja_004751 [Glycine soja] Length = 736 Score = 76.3 bits (186), Expect = 1e-13 Identities = 44/109 (40%), Positives = 59/109 (54%), Gaps = 5/109 (4%) Frame = +3 Query: 18 QPVVVWMCSLDHKPEN-SMKNTVNVALEGYWRTKGK----RKKNLIWINATVPIQVSN*E 182 Q VVV +CSL K N MK TV++A++ Y R G R+K WI Q E Sbjct: 624 QNVVVLLCSLHKKTINREMKTTVDLAMDEYQRLVGSQSRSRRKKPTWILPRCQTQTEGYE 683 Query: 183 CGHYVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 CG+YVM+ ML +V+ IV W++ F PF EE I ++ WA F L+ Sbjct: 684 CGYYVMKQMLTVVTVDIVDSWKKIFNSSGPFPEEDIADIQQRWAAFLLQ 732 >XP_006589869.1 PREDICTED: uncharacterized protein LOC102663555 [Glycine max] Length = 137 Score = 71.6 bits (174), Expect = 2e-13 Identities = 38/103 (36%), Positives = 55/103 (53%), Gaps = 1/103 (0%) Frame = +3 Query: 24 VVVWMCSLDHKPENSMKNTVNVALEGYWRT-KGKRKKNLIWINATVPIQVSN*ECGHYVM 200 +VVW CSL ++ +N +K +N AL+G T + K K WI Q + ECG+YVM Sbjct: 30 LVVWFCSLHNRLDNYLKGIINSALKGLDDTPQPKSKVGARWIVVKCNRQKGSIECGYYVM 89 Query: 201 RYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 +M I+ G +WE F D P E + R WA++YL+ Sbjct: 90 HWMSTIILGSFKNNWEMYFNDVRPLEAERLNALRIQWAKYYLK 132 >KYP35038.1 hypothetical protein KK1_043951 [Cajanus cajan] Length = 564 Score = 74.3 bits (181), Expect = 5e-13 Identities = 38/100 (38%), Positives = 51/100 (51%), Gaps = 3/100 (3%) Frame = +3 Query: 24 VVVWMCSLDHKPENSMKNTVNVALEGYWRTKGK---RKKNLIWINATVPIQVSN*ECGHY 194 V VW CS HKP +KN + + Y G+ + KNL WI Q ECG+Y Sbjct: 402 VAVWFCSFYHKPNVQIKNLIKSVMVVYNVMGGRSTAQPKNLDWIYPMSNQQQGGYECGYY 461 Query: 195 VMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWA 314 VM +ML+I+ G + DW E F D +P E ++ R WA Sbjct: 462 VMNWMLDIIEGEVTNDWIELFDDVAPLPETKLEDIRSQWA 501 >XP_006573751.1 PREDICTED: uncharacterized protein LOC100807274 isoform X2 [Glycine max] Length = 647 Score = 74.3 bits (181), Expect = 5e-13 Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 5/109 (4%) Frame = +3 Query: 18 QPVVVWMCSLDHKPENS-MKNTVNVALEGYWRTKGK----RKKNLIWINATVPIQVSN*E 182 Q VVV +CSL K N MK VN+A++ Y R G R+K WI Q E Sbjct: 535 QNVVVLLCSLHKKTINKEMKTIVNLAMDEYQRLVGSQSRSRRKKPTWILPRCQTQSKGYE 594 Query: 183 CGHYVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 CG+YVM+ M +V+ IV W++ F + PF EE I ++ WA F L+ Sbjct: 595 CGYYVMKQMFTVVTVDIVDSWKQLFNNSGPFPEEDIADIQQRWAAFLLQ 643 >KHN25746.1 hypothetical protein glysoja_018320 [Glycine soja] Length = 736 Score = 74.3 bits (181), Expect = 5e-13 Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 5/109 (4%) Frame = +3 Query: 18 QPVVVWMCSLDHKPENS-MKNTVNVALEGYWRTKGK----RKKNLIWINATVPIQVSN*E 182 Q VVV +CSL K N MK VN+A++ Y R G R+K WI Q E Sbjct: 624 QNVVVLLCSLHKKTINKEMKTIVNLAMDEYQRLVGSQSRSRRKKPTWILPRCQTQSKGYE 683 Query: 183 CGHYVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 CG+YVM+ M +V+ IV W++ F + PF EE I ++ WA F L+ Sbjct: 684 CGYYVMKQMFTVVTVDIVDSWKQLFNNSGPFPEEDIADIQQRWAAFLLQ 732 >XP_003516682.1 PREDICTED: uncharacterized protein LOC100807274 isoform X1 [Glycine max] Length = 736 Score = 74.3 bits (181), Expect = 5e-13 Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 5/109 (4%) Frame = +3 Query: 18 QPVVVWMCSLDHKPENS-MKNTVNVALEGYWRTKGK----RKKNLIWINATVPIQVSN*E 182 Q VVV +CSL K N MK VN+A++ Y R G R+K WI Q E Sbjct: 624 QNVVVLLCSLHKKTINKEMKTIVNLAMDEYQRLVGSQSRSRRKKPTWILPRCQTQSKGYE 683 Query: 183 CGHYVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 CG+YVM+ M +V+ IV W++ F + PF EE I ++ WA F L+ Sbjct: 684 CGYYVMKQMFTVVTVDIVDSWKQLFNNSGPFPEEDIADIQQRWAAFLLQ 732 >XP_014619055.1 PREDICTED: uncharacterized protein LOC106795037 [Glycine max] Length = 139 Score = 70.1 bits (170), Expect = 7e-13 Identities = 35/105 (33%), Positives = 56/105 (53%), Gaps = 3/105 (2%) Frame = +3 Query: 24 VVVWMCSLDHKPENSMKNTVNVALEGYWRTKGKRKKNLI---WINATVPIQVSN*ECGHY 194 +VVW SL H+P+N +K +N AL+G T + K + WI Q + +CG+Y Sbjct: 30 LVVWFYSLHHRPDNYLKGIINSALKGVDGTPQPKSKAAVGARWIVVKCNKQKGSTKCGYY 89 Query: 195 VMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 V+ ++ I+SG +WE F D P + ++ R WA +YL+ Sbjct: 90 VIHWLSTIISGSFRNNWELYFNDVRPLELDTLKAFRIQWANYYLK 134 >XP_013441863.1 Ulp1 protease family, carboxy-terminal domain protein [Medicago truncatula] KEH15888.1 Ulp1 protease family, carboxy-terminal domain protein [Medicago truncatula] Length = 296 Score = 72.8 bits (177), Expect = 8e-13 Identities = 40/102 (39%), Positives = 58/102 (56%), Gaps = 1/102 (0%) Frame = +3 Query: 27 VVWMCSLDHKPENSMKNTVNVALEGYWRTKGKRK-KNLIWINATVPIQVSN*ECGHYVMR 203 VV++CS+ KP+ + VN A+EGY G RK + IW Q N ECG+++M Sbjct: 190 VVFLCSMGWKPDKILVQIVNSAIEGYNMLSGFRKARKPIWEIPACQRQPFNYECGYFIMI 249 Query: 204 YMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 +MLNIVS I W F D++PF+++ + +E A F LE Sbjct: 250 HMLNIVSAGITDSWNMIFGDETPFTDDEMTKVQERCANFILE 291 >XP_006582171.1 PREDICTED: uncharacterized protein LOC102668599 [Glycine max] XP_014632186.1 PREDICTED: uncharacterized protein LOC102668599 [Glycine max] Length = 127 Score = 68.9 bits (167), Expect = 2e-12 Identities = 35/104 (33%), Positives = 52/104 (50%), Gaps = 2/104 (1%) Frame = +3 Query: 24 VVVWMCSLDHKPENSMKNTVNVALEGYWRTKG--KRKKNLIWINATVPIQVSN*ECGHYV 197 VV W CSL KP+ +K +N A++ T + WI +Q ECG+YV Sbjct: 15 VVAWFCSLRKKPDTHIKTAINNAMKTANTTANGTNNQGTPKWIEVKSHVQSGGYECGYYV 74 Query: 198 MRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 M +M NI+SG + DW F D + +E I + WA ++L+ Sbjct: 75 MHWMWNIISGGLKNDWTMWFLDGTTLDKETITTIHQKWASYFLK 118 >XP_006589878.2 PREDICTED: uncharacterized protein LOC102665245 [Glycine max] Length = 339 Score = 71.6 bits (174), Expect = 3e-12 Identities = 36/106 (33%), Positives = 55/106 (51%), Gaps = 2/106 (1%) Frame = +3 Query: 18 QPVVVWMCSLDHKPENSMKNTVNVALEGYWRTKGKRKKNLI--WINATVPIQVSN*ECGH 191 Q VVVW CSL KP+ +K T+N + +T K + WI +Q ECG+ Sbjct: 227 QHVVVWFCSLRRKPDMHIKATINSVMTKLKKTLSPETKAVAPKWIEVKSHVQTGCYECGY 286 Query: 192 YVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 Y+M ++ NI++ I DW F + +P +I R+ WA F+L+ Sbjct: 287 YIMHWIWNIIASDIKSDWSMWFANDTPLDIGIITTIRKKWATFFLK 332 >XP_014622322.1 PREDICTED: uncharacterized protein LOC100818274 [Glycine max] Length = 857 Score = 71.2 bits (173), Expect = 6e-12 Identities = 38/104 (36%), Positives = 60/104 (57%), Gaps = 2/104 (1%) Frame = +3 Query: 24 VVVWMCSLDHKPENSMKNTVNVALEGYWRT-KGKRKKN-LIWINATVPIQVSN*ECGHYV 197 V VW CSL K + ++K VN A++ +T +GK ++ WI A +Q N EC +YV Sbjct: 745 VAVWFCSLRKKLDAAIKGAVNSAMKSVTKTAEGKPPQHGPQWIEAKSHVQTGNYECEYYV 804 Query: 198 MRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 M ++ IVSG + W + F+D+SP EE + R WA ++++ Sbjct: 805 MHWIWCIVSGGLKDGWIDWFSDRSPIPEETMTTLRHKWAAYFIQ 848 >KYP40410.1 hypothetical protein KK1_038259 [Cajanus cajan] Length = 571 Score = 70.9 bits (172), Expect = 8e-12 Identities = 39/98 (39%), Positives = 52/98 (53%), Gaps = 1/98 (1%) Frame = +3 Query: 39 CSLDHKPENSMKNTVNVALEGYWRTKG-KRKKNLIWINATVPIQVSN*ECGHYVMRYMLN 215 CS+ P K ++ +EGY KG K KK + W+ Q N ECG+YVM+ M Sbjct: 470 CSMYKPPPTEFKQLLDKTMEGYHILKGSKSKKKMQWLFVKSHKQNGNYECGYYVMKAMHT 529 Query: 216 IVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 IV+ +IV W E F D+S E I RE WA F++E Sbjct: 530 IVNSQIVSGWTEIFIDRSSLPLEDINIIREQWATFFIE 567 >XP_003627090.1 Ulp1 protease family, carboxy-terminal domain protein [Medicago truncatula] AET01566.1 Ulp1 protease family, carboxy-terminal domain protein [Medicago truncatula] Length = 694 Score = 70.5 bits (171), Expect = 1e-11 Identities = 37/103 (35%), Positives = 62/103 (60%), Gaps = 2/103 (1%) Frame = +3 Query: 27 VVWMCSLDHKP-ENSMKNTVNVALEGYWRTKGKRKKN-LIWINATVPIQVSN*ECGHYVM 200 V ++CSL KP + ++ V+ ALEGY++ +G RK + ++W T Q + E G++VM Sbjct: 583 VTFLCSLGKKPSDKNLPVIVDSALEGYYKLQGVRKHSKVVWFYPTSRRQSVSYESGYFVM 642 Query: 201 RYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 +MLNI+S +V W + F D +PF ++ ++ +E A LE Sbjct: 643 LHMLNIISSGVVDSWMQIFADSTPFQKDEVKNVQERCANLILE 685 >XP_014630124.1 PREDICTED: uncharacterized protein LOC102667062 [Glycine max] XP_014630126.1 PREDICTED: uncharacterized protein LOC102667062 [Glycine max] Length = 767 Score = 70.5 bits (171), Expect = 1e-11 Identities = 36/106 (33%), Positives = 54/106 (50%), Gaps = 2/106 (1%) Frame = +3 Query: 18 QPVVVWMCSLDHKPENSMKNTVNVALEGYWRTKGKRKKNLI--WINATVPIQVSN*ECGH 191 Q VVVW CSL KP+ +K T+N + +T K + WI +Q ECG+ Sbjct: 655 QHVVVWFCSLRRKPDMHIKATINSVMTKLKKTLSPETKAVAPKWIEVKSHVQTGCYECGY 714 Query: 192 YVMRYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLE 329 Y+M ++ NI+ I DW F + +P +I R+ WA F+L+ Sbjct: 715 YIMHWIWNIIVSDIKSDWSMWFANDTPLDIGIITTIRKKWATFFLK 760 >KHM99138.1 hypothetical protein glysoja_032029 [Glycine soja] Length = 315 Score = 69.7 bits (169), Expect = 1e-11 Identities = 38/112 (33%), Positives = 63/112 (56%), Gaps = 2/112 (1%) Frame = +3 Query: 27 VVWMCSLDHKPENSMKNTVNVALEGYWR-TKGKRKKNLI-WINATVPIQVSN*ECGHYVM 200 VVW CSL K + S+K TVN A++ + KG + + WI + +Q EC +YVM Sbjct: 204 VVWFCSLRKKLDASIKATVNSAMKTLSKGDKGNTDQPMPQWIEPMIHVQTGAYECVYYVM 263 Query: 201 RYMLNIVSGRIVGDWEETFTDKSPFSEEVIQGNRELWAEFYLEHYERSIKKT 356 ++ NIVSG + +W F+D +P ++E + R WA ++L+ ++K+ Sbjct: 264 HWIWNIVSGGLKDEWITWFSDGTPLTKETMTTLRHKWAAYFLQIKNLEVRKS 315