BLASTX nr result
ID: Glycyrrhiza30_contig00030333
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza30_contig00030333
         (319 letters)

Database: ./nr
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

GAU10423.1 hypothetical protein TSUD_421820 [Trifolium subterran...    80   2e-16
GAU21633.1 hypothetical protein TSUD_251160 [Trifolium subterran...    79   4e-15
GAU36816.1 hypothetical protein TSUD_219190 [Trifolium subterran...    76   7e-14
GAU38761.1 hypothetical protein TSUD_64920 [Trifolium subterraneum]    76   7e-14
XP_016206284.1 PREDICTED: uncharacterized protein LOC107646622 [...    75   2e-13
XP_019094473.1 PREDICTED: uncharacterized protein LOC109129898 [...    73   7e-13
XP_016164673.1 PREDICTED: uncharacterized protein LOC107607211 [...    73   9e-13
KYP56435.1 Putative ribonuclease H protein At1g65750 family [Caj...    70   2e-12
GAU10752.1 hypothetical protein TSUD_425200, partial [Trifolium ...    67   2e-12
GAU26239.1 hypothetical protein TSUD_224300 [Trifolium subterran...    72   2e-12
XP_016168765.1 PREDICTED: uncharacterized protein LOC107611342 [...    71   3e-12
KYP69874.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus ca...    70   8e-12
XP_015935830.1 PREDICTED: uncharacterized protein LOC107461787 [...    70   8e-12
AAC63844.1 putative non-LTR retroelement reverse transcriptase [...    70   1e-11
JAU21877.1 Putative ribonuclease H protein, partial [Noccaea cae...    67   1e-11
XP_019091097.1 PREDICTED: uncharacterized protein LOC109128707 [...    69   1e-11
KYP65494.1 Putative ribonuclease H protein At1g65750 family [Caj...    66   2e-11
P0C2F6.1 RecName: Full=Putative ribonuclease H protein At1g65750       69   3e-11
KYP45496.1 Putative ribonuclease H protein At1g65750 family, par...    64   3e-11
GAU28660.1 hypothetical protein TSUD_159420 [Trifolium subterran...    68   4e-11

>GAU10423.1 hypothetical protein TSUD_421820 [Trifolium subterraneum]
          Length = 196

 Score = 80.1 bits (196), Expect = 2e-16
 Identities = 32/55 (58%), Positives = 45/55 (81%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           IWGST   R+ HL+SW+KIC PKE+GGLGF+NLR++N AY+ KLAW ++++ + L
Sbjct: 134 IWGSTVNQRKCHLVSWEKICRPKEEGGLGFKNLRMLNQAYIHKLAWQMVAEPNKL 188

 Score = 66.2 bits (160), Expect = 4e-11
 Identities = 30/47 (63%), Positives = 37/47 (78%)
 Frame = +1

Query: 163 LSQAGKISLAQSCIFSLPSYVM*TCKLPASICDEIECLCRDFIWGST 303
           LS AG+I+LAQS +  +P YVM T  +PAS+CDE E +CRDFIWGST
Sbjct: 92  LSFAGRITLAQSSLPCIPGYVMQTANIPASVCDEAEKICRDFIWGST 138

>GAU21633.1 hypothetical protein TSUD_251160 [Trifolium subterraneum]
          Length = 378

 Score = 79.0 bits (193), Expect = 4e-15
 Identities = 34/55 (61%), Positives = 44/55 (80%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           IWGST + R+ HLI W++IC PKE+GGLGF+NL  +NSAY+MKL+W LI+  D L
Sbjct: 257 IWGSTTDHRKTHLILWEQICRPKEEGGLGFKNLEWLNSAYMMKLSWQLITCPDKL 311

 Score = 60.5 bits (145), Expect = 2e-08
 Identities = 25/51 (49%), Positives = 40/51 (78%)
 Frame = +1

Query: 157 DALSQAGKISLAQSCIFSLPSYVM*TCKLPASICDEIECLCRDFIWGSTPE 309
           ++LS AG+++LAQS + ++P YV+ + ++P S+C E E +CRDFIWGST +
Sbjct: 213 NSLSFAGRVTLAQSSLTNIPGYVIQSSRIPISVCVEAEKICRDFIWGSTTD 263

>GAU36816.1 hypothetical protein TSUD_219190 [Trifolium subterraneum]
          Length = 521

 Score = 75.9 bits (185), Expect = 7e-14
 Identities = 30/55 (54%), Positives = 44/55 (80%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           IW +T EAR+ HLI+W K+C PK +GGLGFRNL+++N A++MKL+W +++Q   L
Sbjct: 165 IWETTAEARKVHLIAWDKLCHPKSEGGLGFRNLKMLNKAHMMKLSWQILTQPSKL 219

>GAU38761.1 hypothetical protein TSUD_64920 [Trifolium subterraneum]
          Length = 533

 Score = 75.9 bits (185), Expect = 7e-14
 Identities = 30/55 (54%), Positives = 44/55 (80%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           IWG+T EAR+ HLI+W K+C PK +GGL FRNL+++N A++MKL+W +++Q   L
Sbjct: 173 IWGTTTEARKVHLIAWDKLCHPKSEGGLRFRNLKMLNKAHMMKLSWQILTQPSKL 227

 Score = 60.1 bits (144), Expect = 2e-08
 Identities = 24/50 (48%), Positives = 38/50 (76%)
 Frame = +1

Query: 163 LSQAGKISLAQSCIFSLPSYVM*TCKLPASICDEIECLCRDFIWGSTPEA 312
           LS AG+++LAQ+ +  +P YV+ +  +P S+C E+E +CR+FIWG+T EA
Sbjct: 131 LSFAGRVTLAQTSLVHIPGYVLQSIPIPVSVCQEVEQICRNFIWGTTTEA 180

>XP_016206284.1 PREDICTED: uncharacterized protein LOC107646622 [Arachis ipaensis]
          Length = 1460

 Score = 74.7 bits (182), Expect = 2e-13
 Identities = 32/55 (58%), Positives = 40/55 (72%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           +WGS    R+ HL+SW+K+C PK  GGLG R  R++N+A LMKLAW LI  KDAL
Sbjct: 578 LWGSVSSGRKPHLMSWEKVCLPKSQGGLGLRPARVLNNANLMKLAWKLIHNKDAL 632

 Score = 64.3 bits (155), Expect = 9e-10
 Identities = 28/46 (60%), Positives = 37/46 (80%)
 Frame = +1

Query: 163 LSQAGKISLAQSCIFSLPSYVM*TCKLPASICDEIECLCRDFIWGS 300
           LS AG+++L QS + S+PSYVM T KLP SICD I+ +CR+F+WGS
Sbjct: 536 LSLAGRVTLTQSALASIPSYVMQTMKLPLSICDSIDKICRNFLWGS 581

>XP_019094473.1 PREDICTED: uncharacterized protein LOC109129898 [Camelina sativa]
          Length = 1738

 Score = 73.2 bits (178), Expect = 7e-13
 Identities = 28/55 (50%), Positives = 42/55 (76%)
 Frame = +1

Query: 1    IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
            +WGST E +RQHL++WK++C PK++GGLG R  R +N A + K+ W L+ +KD+L
Sbjct: 1192 VWGSTAEKKRQHLLAWKRVCVPKQEGGLGLRPTRDMNRALIAKVGWRLLHEKDSL 1246

>XP_016164673.1 PREDICTED: uncharacterized protein LOC107607211 [Arachis ipaensis]
          Length = 1901

 Score = 72.8 bits (177), Expect = 9e-13
 Identities = 29/55 (52%), Positives = 41/55 (74%)
 Frame = +1

Query: 1    IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
            IWG T + ++ HL++WKKIC PK+ GGLG R+   +N A++MK  WGLI++KD L
Sbjct: 1365 IWGDTDQNKKVHLLNWKKICEPKQSGGLGIRHAGQMNQAFMMKAGWGLIARKDDL 1419

>KYP56435.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 228

 Score = 70.1 bits (170), Expect = 2e-12
 Identities = 27/55 (49%), Positives = 41/55 (74%)
 Frame = +1

Query: 1  IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
          +WG +P +R+ H ISWK IC PK+ GGLG R++R VN++++MK  W LI++ + L
Sbjct: 22 LWGDSPTSRKIHAISWKTICMPKDQGGLGLRSMRTVNNSFMMKNGWSLITEPNKL 76

>GAU10752.1 hypothetical protein TSUD_425200, partial [Trifolium subterraneum]
          Length = 93

 Score = 67.0 bits (162), Expect = 2e-12
 Identities = 26/55 (47%), Positives = 38/55 (69%)
 Frame = +1

Query: 1  IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
          +WG T + R+ HL+SW   C PK+DGGLG ++   +N A+LMK+ W LI++ D L
Sbjct: 22 LWGDTDQVRKPHLVSWNVCCLPKKDGGLGIKSPHQMNEAFLMKMLWNLINRPDDL 76

>GAU26239.1 hypothetical protein TSUD_224300 [Trifolium subterraneum]
          Length = 1250

 Score = 71.6 bits (174), Expect = 2e-12
 Identities = 30/55 (54%), Positives = 42/55 (76%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           IWG T E+R+ HLISW KIC+PK++GGLG R+   VNSA++M+  W L S+ ++L
Sbjct: 710 IWGDTEESRKIHLISWDKICSPKKEGGLGMRHAYNVNSAFMMRAGWNLCSKPNSL 764

 Score = 58.5 bits (140), Expect = 9e-08
 Identities = 28/50 (56%), Positives = 36/50 (72%)
 Frame = +1

Query: 163 LSQAGKISLAQSCIFSLPSYVM*TCKLPASICDEIECLCRDFIWGSTPEA 312
           LS AG+++LA+S I +LP Y M +  LP SICDEI+  CR FIWG T E+
Sbjct: 668 LSFAGRVTLAKSVIQALPVYTMQSTLLPKSICDEIDKKCRSFIWGDTEES 717

>XP_016168765.1 PREDICTED: uncharacterized protein LOC107611342 [Arachis ipaensis]
          Length = 917

 Score = 71.2 bits (173), Expect = 3e-12
 Identities = 27/55 (49%), Positives = 41/55 (74%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           +WG+T + ++ HL+SWK++C PK  GGLG R+   +N A++MK  WGLI +K+AL
Sbjct: 545 LWGNTEQTKKIHLLSWKRVCEPKSCGGLGIRHASQMNQAFMMKAGWGLIERKEAL 599

>KYP69874.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 956

 Score = 70.1 bits (170), Expect = 8e-12
 Identities = 27/55 (49%), Positives = 41/55 (74%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           +WG +P +R+ H ISWK IC PK+ GGLG R++R VN++++MK  W LI++ + L
Sbjct: 902 LWGDSPTSRKIHAISWKTICMPKDQGGLGLRSMRTVNNSFMMKNGWSLITEPNKL 956

>XP_015935830.1 PREDICTED: uncharacterized protein LOC107461787 [Arachis duranensis]
          Length = 1370

 Score = 70.1 bits (170), Expect = 8e-12
 Identities = 29/55 (52%), Positives = 39/55 (70%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           IWG T + ++ HL++WK IC PK  GGLG R+  L N A++MK  WGLI++KD L
Sbjct: 741 IWGDTDQNKKIHLLNWKMICEPKHTGGLGIRHASLGNKAFMMKAGWGLIAKKDDL 795

 Score = 52.8 bits (125), Expect = 1e-05
 Identities = 24/54 (44%), Positives = 37/54 (68%)
 Frame = +1

Query: 148 SQKDALSQAGKISLAQSCIFSLPSYVM*TCKLPASICDEIECLCRDFIWGSTPE 309
           S+  +LS AG+ +L +S + S+P Y M +  LPAS C+ I+ +CR+FIWG T +
Sbjct: 694 SKASSLSLAGRTTLVKSVLSSMPLYNMQSAILPASTCNTIDRICRNFIWGDTDQ 747

>AAC63844.1 putative non-LTR retroelement reverse transcriptase [Arabidopsis thaliana]
          Length = 1231

 Score = 69.7 bits (169), Expect = 1e-11
 Identities = 27/55 (49%), Positives = 41/55 (74%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           +WGST E ++QHL+SW+KIC PK +GG+G R+ R +N A + K+ W L+  K++L
Sbjct: 689 LWGSTMEKKKQHLLSWRKICKPKAEGGIGLRSARDMNKALVAKVGWRLLQDKESL 743

>JAU21877.1 Putative ribonuclease H protein, partial [Noccaea caerulescens]
          Length = 197

 Score = 67.4 bits (163), Expect = 1e-11
 Identities = 27/55 (49%), Positives = 40/55 (72%)
 Frame = +1

Query: 1  IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
          +WGST E R+QHL+SWKK+CT K +GGLG R+   +N A + K+ W L++ + +L
Sbjct: 94 LWGSTVEKRKQHLVSWKKVCTTKGEGGLGIRSAVKMNKALIAKVGWRLLNDEKSL 148

>XP_019091097.1 PREDICTED: uncharacterized protein LOC109128707 [Camelina sativa]
          Length = 850

 Score = 69.3 bits (168), Expect = 1e-11
 Identities = 27/55 (49%), Positives = 40/55 (72%)
 Frame = +1

Query: 1   IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
           +WGSTP+ R+QHL++W K+C PK +GGLG R  + +N A + K+ W LI+ +D L
Sbjct: 436 LWGSTPDKRKQHLVAWDKVCLPKCEGGLGIRAAKDMNKALIAKMGWRLINDQDKL 490

>KYP65494.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 135

 Score = 65.9 bits (159), Expect = 2e-11
 Identities = 28/55 (50%), Positives = 38/55 (69%)
 Frame = +1

Query: 1  IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
          +WG  P  R+ H ISW KIC PKE  GLG R+LR VN++++MK  W LI++ + L
Sbjct: 22 LWGDRPTHRKIHTISWDKICRPKERDGLGLRSLREVNNSFMMKNCWSLITEPNKL 76

>P0C2F6.1 RecName: Full=Putative ribonuclease H protein At1g65750
          Length = 620

 Score = 68.6 bits (166), Expect = 3e-11
 Identities = 25/55 (45%), Positives = 41/55 (74%)
 Frame = +1

Query: 1  IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
          +WGST E ++QHL+ W K+C+PK++GGLG R  + +N A + K+ W L+ +K++L
Sbjct: 74 LWGSTAEKKKQHLVKWSKVCSPKKEGGLGVRAAKSMNRALISKVGWRLLQEKNSL 128

>KYP45496.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan]
          Length = 108

 Score = 64.3 bits (155), Expect = 3e-11
 Identities = 25/55 (45%), Positives = 39/55 (70%)
 Frame = +1

Query: 1  IWGSTPEARRQHLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKDAL 165
          +WG +  +R+ H ISW  IC PK+ GGLG R++R VN++++MK  W LI++ + L
Sbjct: 33 LWGDSLTSRKIHAISWNTICMPKDQGGLGLRSMRTVNNSFMMKNCWSLITEPNKL 87

>GAU28660.1 hypothetical protein TSUD_159420 [Trifolium subterraneum]
          Length = 1424

 Score = 68.2 bits (165), Expect = 4e-11
 Identities = 38/110 (34%), Positives = 55/110 (50%), Gaps = 22/110 (20%)
 Frame = +1

Query: 34   HLISWKKICTPKEDGGLGFRNLRLVNSAYLMKLAWGLISQKD------------------ 159
            H ++W ++  PK  GGLGFRN    N A + K AW +I  +
Sbjct: 1034 HWLAWDRLTCPKAKGGLGFRNFEAFNMAMVAKQAWNIIXGRSKKAIFSYIKDRIWNKMNS 1093

Query: 160  ----ALSQAGKISLAQSCIFSLPSYVM*TCKLPASICDEIECLCRDFIWG 297
                ALS+AGK  + +S + ++PSYVM    LP+S+ D+IE +   F WG
Sbjct: 1094 WRGRALSKAGKEVMIKSVLQAIPSYVMSLFILPSSLIDDIEKMLNSFWWG 1143