BLASTX nr result
ID: Glycyrrhiza31_contig00014111
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza31_contig00014111
         (346 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

GAU49781.1 hypothetical protein TSUD_188300 [Trifolium subterran...    69   5e-16
GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterran...    72   9e-16
GAU30135.1 hypothetical protein TSUD_360280 [Trifolium subterran...    77   9e-16
GAU48983.1 hypothetical protein TSUD_245740 [Trifolium subterran...    73   3e-15
KYP61721.1 Putative ribonuclease H protein At1g65750 family [Caj...    64   1e-14
GAU32945.1 hypothetical protein TSUD_153620 [Trifolium subterran...    71   2e-14
GAU24479.1 hypothetical protein TSUD_319560 [Trifolium subterran...    65   2e-14
GAU42016.1 hypothetical protein TSUD_236830 [Trifolium subterran...    73   2e-14
GAU30014.1 hypothetical protein TSUD_160990 [Trifolium subterran...    72   2e-13
GAU44045.1 hypothetical protein TSUD_300150 [Trifolium subterran...    74   5e-13
GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterran...    69   5e-13
KYP73361.1 Putative ribonuclease H protein At1g65750 family, par...    68   3e-12
GAU36275.1 hypothetical protein TSUD_255290 [Trifolium subterran...    63   3e-11
GAU27562.1 hypothetical protein TSUD_29940 [Trifolium subterraneum]    68   5e-11
GAU39987.1 hypothetical protein TSUD_211080 [Trifolium subterran...    66   6e-11
GAU10807.1 hypothetical protein TSUD_424460, partial [Trifolium ...    52   1e-10
KHN12872.1 Putative ribonuclease H protein, partial [Glycine soja]     57   4e-10
KHN31021.1 Putative ribonuclease H protein [Glycine soja]              55   8e-10
KYP38635.1 hypothetical protein KK1_040110 [Cajanus cajan]             62   8e-10
GAU48278.1 hypothetical protein TSUD_405240 [Trifolium subterran...    55   1e-09

>GAU49781.1 hypothetical protein TSUD_188300 [Trifolium subterraneum]
          Length = 221

 Score = 69.3 bits (168), Expect(2) = 5e-16
 Identities = 35/74 (47%), Positives = 45/74 (60%), Gaps = 3/74 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L +CW+ G+R + C SDSL VNL G P H FAN+I IR +R W + + H L E
Sbjct: 117 LTICWENGYRKINCLSDSLQVVNLIRSGVSPHHRFANEILSIRQLITRDWEVVLSHTLRE 176

Query: 272 VNSCADLLAKTRAL 313
           N CAD+LAK A+
Sbjct: 177 GNLCADVLAKMGAV 190

 Score = 42.0 bits (97), Expect(2) = 5e-16
 Identities = 20/33 (60%), Positives = 24/33 (72%)
 Frame = +3

Query: 3  GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
          G+GG++RNH GEFIL FYG SIL AE+ A
Sbjct: 80 GYGGLLRNHNGEFILGFYGTTSLKSILFAEIMA 112

>GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterraneum]
          Length = 1147

 Score = 72.0 bits (175), Expect(2) = 9e-16
 Identities = 35/70 (50%), Positives = 44/70 (62%), Gaps = 3/70 (4%)
 Frame = +2

Query: 101  LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
            L LCW G+R +VC+SDSL AV+L G H FAN+I I R W + + HIL E
Sbjct: 1043 LHLCWNNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANEIYTIHQLLRRDWTIVIEHILRE 1102

Query: 272  VNSCADLLAK 301
            N+CAD+LAK
Sbjct: 1103 GNACADILAK 1112

 Score = 38.5 bits (88), Expect(2) = 9e-16
 Identities = 16/33 (48%), Positives = 23/33 (69%)
 Frame = +3

Query: 3    GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
            GFGG++RN G F+ FYG ++S+L AE+ A
Sbjct: 1006 GFGGLIRNSFGAFLKGFYGTASQSSVLYAEIMA 1038

>GAU30135.1 hypothetical protein TSUD_360280 [Trifolium subterraneum]
          Length = 479

 Score = 76.6 bits (187), Expect(2) = 9e-16
 Identities = 38/73 (52%), Positives = 49/73 (67%), Gaps = 3/73 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L+LCW+ GFR V+C+SDSLL+VNL G P H FAN+I I+ +R W + + H L E
Sbjct: 321 LELCWERGFRKVLCYSDSLLSVNLIKEGVTPHHRFANEIHRIKKLLARDWEVTISHTLRE 380

Query: 272 VNSCADLLAKTRA 310
           N CAD+LAK A
Sbjct: 381 GNVCADVLAKLGA 393

 Score = 33.9 bits (76), Expect(2) = 9e-16
 Identities = 16/33 (48%), Positives = 22/33 (66%)
 Frame = +3

Query: 3   GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
           G+GG++RN G+FI FYG +IL AE+ A
Sbjct: 284 GYGGLLRNKDGDFICGFYGVAAIPNILFAEIMA 316

>GAU48983.1 hypothetical protein TSUD_245740 [Trifolium subterraneum]
          Length = 1103

 Score = 72.8 bits (177), Expect(2) = 3e-15
 Identities = 36/70 (51%), Positives = 45/70 (64%), Gaps = 3/70 (4%)
 Frame = +2

Query: 101  LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
            L LCW G+R +VC+SDSL AV+L G H FAN+I IR R W + + HIL E
Sbjct: 999  LHLCWNNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANEIHPIRQLLRRDWTIVIEHILRE 1058

Query: 272  VNSCADLLAK 301
            N+CAD+LAK
Sbjct: 1059 GNACADVLAK 1068

 Score = 36.2 bits (82), Expect(2) = 3e-15
 Identities = 15/33 (45%), Positives = 22/33 (66%)
 Frame = +3

Query: 3   GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
           GFGG++RN F+ FYG ++S+L AE+ A
Sbjct: 962 GFGGLIRNSFSAFLKGFYGTASQSSVLYAEIMA 994

>KYP61721.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 219

 Score = 63.5 bits (153), Expect(2) = 1e-14
 Identities = 33/74 (44%), Positives = 41/74 (55%), Gaps = 3/74 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNLGTQPL---HFFANDIALIRNFFSRIWILEVCHILLE 271
           L LCW GFR +VC+SDS L V+L P+ H + N + I + W V H L E
Sbjct: 115 LHLCWDKGFRKIVCYSDSTLVVSLLQGPILMFHRYGNQLMEIHQLLNCDWTCTVVHTLCE 174

Query: 272 VNSCADLLAKTRAL 313
           NSCAD LA+ AL
Sbjct: 175 GNSCADALARMGAL 188

 Score = 43.1 bits (100), Expect(2) = 1e-14
 Identities = 18/33 (54%), Positives = 25/33 (75%)
 Frame = +3

Query: 3  GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
          G+GG+ +NH G+F+ FYG LGEAS+L E+ A
Sbjct: 78 GYGGLCQNHEGQFLFGFYGFLGEASVLQTEILA 110

>GAU32945.1 hypothetical protein TSUD_153620 [Trifolium subterraneum]
          Length = 292

 Score = 70.9 bits (172), Expect(2) = 2e-14
 Identities = 37/73 (50%), Positives = 47/73 (64%), Gaps = 3/73 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L+LCW+ GFR V+C SDSLL+VN+ G H FAN+I IR S W + + H L E
Sbjct: 188 LKLCWERGFRKVLCCSDSLLSVNVIKEGVTTHHGFANEILCIRKLLSNDWEVILTHTLRE 247

Query: 272 VNSCADLLAKTRA 310
           N+CAD+LAK A
Sbjct: 248 GNACADVLAKLGA 260

 Score = 35.4 bits (80), Expect(2) = 2e-14
 Identities = 17/33 (51%), Positives = 22/33 (66%)
 Frame = +3

Query: 3   GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
           G+GG++RN GEFI FYG +IL AE+ A
Sbjct: 151 GYGGLLRNRDGEFIWGFYGAAAIQNILYAEIMA 183

>GAU24479.1 hypothetical protein TSUD_319560 [Trifolium subterraneum]
          Length = 227

 Score = 64.7 bits (156), Expect(2) = 2e-14
 Identities = 33/70 (47%), Positives = 43/70 (61%), Gaps = 3/70 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L +CW+ G+R + C S+SL VNL G H FAN+I IR +R W + + H L E
Sbjct: 138 LTICWENGYRKINCLSNSLQLVNLIRSGVSLHHRFANEILSIRRLITRDWEVVLSHTLRE 197

Query: 272 VNSCADLLAK 301
           NSCAD+LAK
Sbjct: 198 GNSCADVLAK 207

 Score = 41.2 bits (95), Expect(2) = 2e-14
 Identities = 19/31 (61%), Positives = 23/31 (74%)
 Frame = +3

Query: 3   GFGGIVRNHLGEFILCFYGCLGEASILGAEL 95
           G+GG++RNH GEFIL FYG SIL AE+
Sbjct: 101 GYGGLLRNHNGEFILGFYGTTSLKSILFAEI 131

>GAU42016.1 hypothetical protein TSUD_236830 [Trifolium subterraneum]
          Length = 111

 Score = 72.8 bits (177), Expect = 2e-14
 Identities = 39/74 (52%), Positives = 47/74 (63%), Gaps = 3/74 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           LQ+CW+ GFR +VCFSDSL AV+L G H AN+I IR R W + V H L E
Sbjct: 7   LQICWEKGFRRIVCFSDSLQAVSLIREGVSAHHRSANEICSIRQLVGRDWDVIVEHTLRE 66

Query: 272 VNSCADLLAKTRAL 313
           N+CAD+LAK AL
Sbjct: 67  GNACADVLAKMGAL 80

>GAU30014.1 hypothetical protein TSUD_160990 [Trifolium subterraneum]
          Length = 168

 Score = 72.0 bits (175), Expect = 2e-13
 Identities = 36/70 (51%), Positives = 45/70 (64%), Gaps = 3/70 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L LCW G+R + C+SDSL AV L G P H +AN+I IR+ R WI+ V H L E
Sbjct: 64  LDLCWVNGYRKIECYSDSLQAVALIRDGVSPHHQYANEIQSIRHLLRRDWIVAVIHTLRE 123

Query: 272 VNSCADLLAK 301
           N+CAD+LAK
Sbjct: 124 GNACADVLAK 133

>GAU44045.1 hypothetical protein TSUD_300150 [Trifolium subterraneum]
          Length = 961

 Score = 73.9 bits (180), Expect = 5e-13
 Identities = 40/74 (54%), Positives = 47/74 (63%), Gaps = 3/74 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           LQ+CW+ GFR +VCFSDSL AV L G H FAN+I IR R W + V H L E
Sbjct: 7   LQICWEKGFRRIVCFSDSLQAVGLIREGVSAHHRFANEIYSIRQLRGRDWDVIVEHTLRE 66

Query: 272 VNSCADLLAKTRAL 313
           N+CAD+LAK AL
Sbjct: 67  GNACADVLAKMGAL 80

>GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterraneum]
          Length = 298

 Score = 69.3 bits (168), Expect(2) = 5e-13
 Identities = 32/73 (43%), Positives = 46/73 (63%), Gaps = 3/73 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           LQ+CW++GFR + CFSDSL VNL G H F+NE+ +I W + + H E
Sbjct: 194 LQICWESGFRRITCFSDSLQIVNLIRDGVSAHHRFSNEVFIIHQLLAKDWEVVIGHTFRE 253

Query: 272 VNSCADLLAKTRA 310
           N+CAD+LAK A
Sbjct: 254 GNACADVLAKMGA 266

 Score = 32.0 bits (71), Expect(2) = 5e-13
 Identities = 16/33 (48%), Positives = 21/33 (63%)
 Frame = +3

Query: 3   GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
           G+GG++R+ G F+ FYG SIL AEL A
Sbjct: 157 GYGGLIRDSNGVFLSGFYGTATVQSILFAELMA 189

>KYP73361.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan]
          Length = 132

 Score = 68.2 bits (165), Expect = 3e-12
 Identities = 35/78 (44%), Positives = 44/78 (56%), Gaps = 3/78 (3%)
 Frame = +2

Query: 92  VTCLQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHI 262
           V L+LCW+ F HV C DSL V L GT P H + N+I LIR R W +
Sbjct: 25  VNGLELCWERRFSHVQCQLDSLYLVQLVQEGTNPQHRYTNEITLIREILGRSWSCSLTYV 84

Query: 263 LLEVNSCADLLAKTRALL 316
           E NSCAD LAK ++L
Sbjct: 85  FREENSCADWLAKKGSML 102

>GAU36275.1 hypothetical protein TSUD_255290 [Trifolium subterraneum]
          Length = 258

 Score = 62.8 bits (151), Expect(2) = 3e-11
 Identities = 32/70 (45%), Positives = 43/70 (61%), Gaps = 3/70 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L+LCW+ GFR V C SD LL+V++ G H FAN+I IR + W + + H L E
Sbjct: 154 LKLCWERGFRKVFCCSDYLLSVDVTKEGVTTHHRFANEILCIRKLLANDWEVILTHTLRE 213

Query: 272 VNSCADLLAK 301
           N+CAD+L K
Sbjct: 214 GNACADVLGK 223

 Score = 32.3 bits (72), Expect(2) = 3e-11
 Identities = 16/33 (48%), Positives = 21/33 (63%)
 Frame = +3

Query: 3   GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
           G+ G++RN GEFI FYG +IL AE+ A
Sbjct: 117 GYDGLLRNRDGEFIWGFYGVAAIQNILYAEIMA 149

>GAU27562.1 hypothetical protein TSUD_29940 [Trifolium subterraneum]
          Length = 584

 Score = 68.2 bits (165), Expect = 5e-11
 Identities = 34/73 (46%), Positives = 46/73 (63%), Gaps = 3/73 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L+LCW+ GF V+C SDSLL++NL G H FAN+I I+ + W + + H L E
Sbjct: 480 LKLCWERGFHKVLCCSDSLLSINLIKGGVTAHHRFANEILCIQKLLANDWEVTLSHTLCE 539

Query: 272 VNSCADLLAKTRA 310
           N+CAD+LAK A
Sbjct: 540 GNACADVLAKLGA 552

>GAU39987.1 hypothetical protein TSUD_211080 [Trifolium subterraneum]
          Length = 192

 Score = 65.9 bits (159), Expect = 6e-11
 Identities = 31/73 (42%), Positives = 44/73 (60%), Gaps = 3/73 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           LQ+CW++GFR + CFSDSL VNL G H +NE+ +I W + + H E
Sbjct: 94  LQICWESGFRRITCFSDSLQTVNLIRDGVSTHHRSSNEVFIIHQLLANDWEVVIDHTFRE 153

Query: 272 VNSCADLLAKTRA 310
           N+CAD+LAK A
Sbjct: 154 GNACADVLAKMGA 166

>GAU10807.1 hypothetical protein TSUD_424460, partial [Trifolium subterraneum]
          Length = 170

 Score = 51.6 bits (122), Expect(2) = 1e-10
 Identities = 32/73 (43%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L L GF +V+ SDS +A+ L GT PLH +A I IR F + W + H L E
Sbjct: 94  LCLAMDQGFNNVIIESDSTIAIGLVEHGTSPLHPYAPLIKNIRQFQNMDWTIAFHHTLRE 153

Query: 272 VNSCADLLAKTRA 310
           N CAD LAK A
Sbjct: 154 GNECADWLAKKGA 166

 Score = 41.6 bits (96), Expect(2) = 1e-10
 Identities = 19/33 (57%), Positives = 26/33 (78%)
 Frame = +3

Query: 3  GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
          GFGGI+RN++G ++L F G +G A+ L AELHA
Sbjct: 57 GFGGIMRNNMGNWLLGFSGFIGIATSLQAELHA 89

>KHN12872.1 Putative ribonuclease H protein, partial [Glycine soja]
          Length = 151

 Score = 56.6 bits (135), Expect(2) = 4e-10
 Identities = 36/85 (42%), Positives = 46/85 (54%), Gaps = 4/85 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L L W GFR++ C SDS LA+ L G LH +A+ I I++F W L H E
Sbjct: 54  LNLSWDKGFRNIQCESDSKLALQLISEGRNSLHPYASIIQKIQDFKLLHWDLHFNHTFRE 113

Query: 272 VNSCADLLAKTRALLFCFVS-YLGC 343
           N CAD LAKT + L C + + GC
Sbjct: 114 GNMCADELAKTGSSLQCNLQVFNGC 138

 Score = 35.0 bits (79), Expect(2) = 4e-10
 Identities = 17/33 (51%), Positives = 22/33 (66%)
 Frame = +3

Query: 3  GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
          GFGGI+R+ G++ FYG G A+ L AEL A
Sbjct: 17 GFGGIIRDSFGDWHAGFYGSCGTATSLQAELLA 49

>KHN31021.1 Putative ribonuclease H protein [Glycine soja]
          Length = 172

 Score = 54.7 bits (130), Expect(2) = 8e-10
 Identities = 27/82 (32%), Positives = 41/82 (50%), Gaps = 3/82 (3%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L+LCW+ ++ +VC+SDSL VNL G + HF+ N I R W + L +
Sbjct: 68  LELCWEARYKKLVCYSDSLTMVNLVNNGVRSYHFYNNMIMKFHQLLGRDWPCSLNQTLRK 127

Query: 272 VNSCADLLAKTRALLFCFVSYL 337
           N CA++L K C + +
Sbjct: 128 GNQCANVLVKMGVASSCLLEVI 149

 Score = 35.8 bits (81), Expect(2) = 8e-10
 Identities = 16/32 (50%), Positives = 22/32 (68%)
 Frame = +3

Query: 6  FGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
          FGG+ RNH G F+L FYG + + +L AE+ A
Sbjct: 32 FGGLCRNHNGAFLLGFYGAVEISEVLYAEILA 63

>KYP38635.1 hypothetical protein KK1_040110 [Cajanus cajan]
          Length = 128

 Score = 61.6 bits (148), Expect = 8e-10
 Identities = 34/70 (48%), Positives = 42/70 (60%), Gaps = 3/70 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNLGTQPL---HFFANDIALIRNFFSRIWILEVCHILLE 271
           L+L WQ F +++C SDSL V+L PL H +A IA IR FF R W L + H L E
Sbjct: 24  LELAWQQNFSNIMCVSDSLNVVHLILGPLDPFHRYAVTIAKIREFFHRDWRLSLRHSLRE 83

Query: 272 VNSCADLLAK 301
           N CAD L+K
Sbjct: 84  GNQCADFLSK 93

>GAU48278.1 hypothetical protein TSUD_405240 [Trifolium subterraneum]
          Length = 161

 Score = 55.5 bits (132), Expect(2) = 1e-09
 Identities = 31/70 (44%), Positives = 41/70 (58%), Gaps = 3/70 (4%)
 Frame = +2

Query: 101 LQLCWQTGFRHVVCFSDSLLAVNL---GTQPLHFFANDIALIRNFFSRIWILEVCHILLE 271
           L+L W+ GFR V+ SDSLL++NL G H AN+I IR W + + H E
Sbjct: 69  LKLGWKRGFRKVLRCSDSLLSINLIKDGVTIHHCLANEIHCIRKLLENDWEVILTHTFRE 128

Query: 272 VNSCADLLAK 301
           N+CAD+LAK
Sbjct: 129 GNACADVLAK 138

 Score = 34.3 bits (77), Expect(2) = 1e-09
 Identities = 18/33 (54%), Positives = 22/33 (66%)
 Frame = +3

Query: 3  GFGGIVRNHLGEFILCFYGCLGEASILGAELHA 101
          G+GG++RN GEFI FYG SIL AE+ A
Sbjct: 32 GYGGLLRNIDGEFIWGFYGDAAIQSILFAEIMA 64