BLASTX nr result
ID: Glycyrrhiza32_contig00015664
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza32_contig00015664 (1006 letters)

Database: ./nr
115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done

                                                                    Score   E
Sequences producing significant alignments:                         (bits)  Value

GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterran...  144   2e-56
GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterran...  148   4e-56
GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]  154   2e-54
GAU49954.1 hypothetical protein TSUD_180180 [Trifolium subterran...  118   6e-52
GAU22350.1 hypothetical protein TSUD_106780 [Trifolium subterran...  121   2e-51
GAU47648.1 hypothetical protein TSUD_27720 [Trifolium subterraneum]  150   3e-51
GAU20604.1 hypothetical protein TSUD_33400 [Trifolium subterraneum]  117   8e-51
GAU18899.1 hypothetical protein TSUD_228890 [Trifolium subterran...  115   2e-50
GAU11845.1 hypothetical protein TSUD_75960 [Trifolium subterraneum]  118   1e-49
GAU10400.1 hypothetical protein TSUD_423410, partial [Trifolium ...  171   8e-48
GAU48622.1 hypothetical protein TSUD_133530 [Trifolium subterran...  125   2e-47
ABD28627.2 RNA-directed DNA polymerase (Reverse transcriptase); ...  108   1e-46
GAU41508.1 hypothetical protein TSUD_302460 [Trifolium subterran...  118   3e-46
ABO80459.1 RNA-directed DNA polymerase (Reverse transcriptase); ...  109   9e-46
GAU12283.1 hypothetical protein TSUD_141910 [Trifolium subterran...  153   2e-45
GAU23316.1 hypothetical protein TSUD_237700 [Trifolium subterran...  109   6e-44
GAU50246.1 hypothetical protein TSUD_188980 [Trifolium subterran...  112   8e-44
GAU35137.1 hypothetical protein TSUD_394630 [Trifolium subterran... 
103 1e-42 GAU34195.1 hypothetical protein TSUD_162960 [Trifolium subterran... 153 4e-42 GAU33259.1 hypothetical protein TSUD_333820 [Trifolium subterran... 155 9e-42 >GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterraneum] Length = 440 Score = 144 bits (363), Expect(2) = 2e-56 Identities = 73/178 (41%), Positives = 105/178 (58%) Frame = +1 Query: 367 TETQCLLLKRQETWHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGV 546 ++ + L R W+ +LNVDGSS GN G GGL+R+ G WI+GF G++G Sbjct: 263 SQQKSTLANRLVRWNAHDGTDMILNVDGSSIGNPEIYGFGGLIRNSHGAWIRGFAGNIGF 322 Query: 547 ADSLKAELAAIWQGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLL 726 ++ L AEL A++ GL LAW +DL+CY+DS A+ L+ ++ H +AAI+ + +D+L Sbjct: 323 SNILHAELLAVYHGLVLAWDMDIKDLICYSDSKTAIKLIGDPINEWHHFAAILQNIKDIL 382 Query: 727 RRPWNVKMEHSLREGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 R W V + H+LREGN+CAD LA+ GA + PP + LLADA F R Sbjct: 383 ARDWRVTVAHTLREGNACADYLAKFGAQNIKVFSTMTTPPDGMNLLLLADASGTWFTR 440 Score = 104 bits (260), Expect(2) = 2e-56 Identities = 46/129 (35%), Positives = 68/129 (52%) Frame = +3 Query: 9 IQQKGSSIITRDWRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNT 188 I+ +SI W W+W + APE IK LW H++ PT LSHR + +C R + Sbjct: 112 IESTNNSIEDISWSWLWHIEAPEKIKFFLWTALHNALPTRAMLSHRRLLSVHVCPRSDIA 171 Query: 189 AETFVHSVRDCTLPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTD 368 ET +H +RDC + +W ++GF D F+ W+ KG S + +FLA +W W Sbjct: 172 EETIMHCLRDCEFVKHLWKTIGFTDQTFFHGDNLYAWLRKGCDSPSMFMFLAALWWIWRA 231 Query: 369 RNAMSIAQE 395 RN + +A E Sbjct: 232 RNKLCLANE 240 >GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterraneum] Length = 545 Score = 148 bits (373), Expect(2) = 4e-56 Identities = 76/165 (46%), Positives = 100/165 (60%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVADSLKAELAAIWQ 585 W+ G +LNVDGSS GN G +G GGL+R+ DG W+ GF G++G + L+AEL AI+ Sbjct: 381 WNAHGGIGMILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHLNILQAELLAIYH 440 Query: 586 GLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKMEHSLR 765 GL LAW +DL CY+DS 
AL L+ ++ H YAAII + +D L R W V++ H LR Sbjct: 441 GLVLAWELDIKDLCCYSDSKTALKLIYDHVNEWHQYAAIIYNIKDFLSRNWRVRLVHMLR 500 Query: 766 EGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 EGN+CAD L + GA + PP + LLADA F R Sbjct: 501 EGNNCADILDKFGARNPKAYCSIAVPPDGMSLLLLADASGTIFSR 545 Score = 99.8 bits (247), Expect(2) = 4e-56 Identities = 46/141 (32%), Positives = 74/141 (52%), Gaps = 2/141 (1%) Frame = +3 Query: 3 WIIQQKGSSIITRD--WRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKR 176 W+ + S T D W +W +PAPE IK +W H++ PT LSHRG+ Q+++C R Sbjct: 213 WLNRFSFSDTATDDISWNSVWHIPAPEKIKFFIWSALHNALPTKSMLSHRGLLQANLCPR 272 Query: 177 CNNTAETFVHSVRDCTLPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWT 356 CN E+ +H +R+C ++ W ++GF F+ W+ + + LF+A VW Sbjct: 273 CNIEEESTLHCLRNCEFIKRFWKAIGFLGQTFFQGDNLNDWLRNSIDGPSSFLFMAAVWW 332 Query: 357 AWTDRNAMSIAQETRNLASLR 419 W RN + + E + +LR Sbjct: 333 IWCARNQLCMDNEAISYFTLR 353 >GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum] Length = 724 Score = 154 bits (388), Expect(2) = 2e-54 Identities = 76/175 (43%), Positives = 106/175 (60%) Frame = +1 Query: 376 QCLLLKRQETWHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVADS 555 Q L + W+ G +LNVDGS+ GN G +G GGL+R+ DG WI GF+G++GV + Sbjct: 550 QVTTLPKIVRWNALGGTSMILNVDGSTIGNPGISGFGGLIRNADGAWIHGFFGNLGVTNI 609 Query: 556 LKAELAAIWQGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRP 735 L AEL AI +GL LAW +DL+CY+DS+ A+ L+ + H YAAI+ + +D+L R Sbjct: 610 LHAELMAILKGLLLAWELNIKDLLCYSDSATAIKLITEPVDVWHHYAAILNNIKDILNRD 669 Query: 736 WNVKMEHSLREGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 W V + H+ REGN+CAD LA+ GA + PP + LLAD + F R Sbjct: 670 WQVSIFHTFREGNACADYLAKHGAHNNIVFTTIAIPPAGLNLHLLADVSGIIFSR 724 Score = 88.6 bits (218), Expect(2) = 2e-54 Identities = 41/117 (35%), Positives = 56/117 (47%) Frame = +3 Query: 45 WRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVRDCT 224 W W+W LPAPE IK +W H+S T L+HRGI + DC Sbjct: 427 WSWLWHLPAPEKIKFFIWTLLHNSLATRDMLTHRGI-------------------IHDCN 467 Query: 225 
LPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMSIAQE 395 +W SLGF D F+ E+ S W+ G+ + LF+A +W W RNA+ + E Sbjct: 468 FVYTIWKSLGFTDRNFFQEVDSSSWLRNGLSCSSMFLFMAAIWWIWRTRNALCLDNE 524 >GAU49954.1 hypothetical protein TSUD_180180 [Trifolium subterraneum] Length = 968 Score = 118 bits (296), Expect(2) = 6e-52 Identities = 50/127 (39%), Positives = 73/127 (57%) Frame = +3 Query: 36 TRDWRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVR 215 ++ W WIW+L PE IK LWL H+S PT+ L+HR ++ S C RC E+F+H +R Sbjct: 647 SQSWTWIWKLHLPEKIKFFLWLACHNSVPTLSLLNHRKMNPSTTCVRCGLQDESFLHCIR 706 Query: 216 DCTLPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMSIAQE 395 DC + +W +GF +P F++ + W+ G +F AGVW +W RN MS+ E Sbjct: 707 DCDFSRSLWHHIGFTNPNFFSNMDVYDWLKMGATGTQSLIFSAGVWWSWRHRNLMSLNNE 766 Query: 396 TRNLASL 416 T L+ L Sbjct: 767 TWTLSRL 773 Score = 115 bits (288), Expect(2) = 6e-52 Identities = 66/166 (39%), Positives = 97/166 (58%), Gaps = 1/166 (0%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSV-GVADSLKAELAAIW 582 W + + +LNVDGS G+ RAG GG++R+ G ++ GF G + G +D L AEL AI+ Sbjct: 803 WKNNNFSCTILNVDGSCLGSPARAGFGGIIRNTFGYYLAGFSGYIQGSSDILYAELYAIY 862 Query: 583 QGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKMEHSL 762 +GL LA G +LVCY+DS ++L++ + H +A +I ++L+ NV + H+L Sbjct: 863 KGLLLAKNMGIDELVCYSDSLHCINLIKGPQVKYHIHAVLIQDIKELISLN-NVSLCHTL 921 Query: 763 REGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 REGN CAD A++GA AD PP + QL DA+ F R Sbjct: 922 REGNQCADFFAKLGATSDADFSSHGSPPEGVREQLRIDALGTLFLR 967 >GAU22350.1 hypothetical protein TSUD_106780 [Trifolium subterraneum] Length = 1200 Score = 121 bits (304), Expect(2) = 2e-51 Identities = 50/124 (40%), Positives = 72/124 (58%) Frame = +3 Query: 24 SSIITRDWRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFV 203 ++I W W+W + APE +K W H+S PT L+HRGI ++C RC+N AET + Sbjct: 932 TTISVASWSWLWHVSAPEKLKFFFWTMLHNSLPTRDMLAHRGIITRNLCPRCSNHAETTI 991 Query: 204 HSVRDCTLPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMS 383 H +RDC ++W S+GF D F+ 
+ W+H G+ S LF+AG+W W RNAM Sbjct: 992 HCLRDCDFVNRIWKSIGFLDQNFFQGVDVYAWLHNGLNSPTMMLFIAGIWWIWRARNAMC 1051 Query: 384 IAQE 395 + E Sbjct: 1052 LDSE 1055 Score = 110 bits (275), Expect(2) = 2e-51 Identities = 52/109 (47%), Positives = 72/109 (66%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVADSLKAELAAIWQ 585 W+ G +LNVDGSS GN G +G GGL+R+ DG WI GF+G++GV + L EL AI++ Sbjct: 1091 WNALGGTGLILNVDGSSIGNPGISGFGGLIRNADGAWIHGFFGNLGVTNILHPELMAIYK 1150 Query: 586 GLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRR 732 GL LAW ++L CY+DS MA+ L+ H YAAI+ + +D+L R Sbjct: 1151 GLLLAWELNIKELWCYSDSKMAIKLITDPTDVWHHYAAILNNIKDILDR 1199 >GAU47648.1 hypothetical protein TSUD_27720 [Trifolium subterraneum] Length = 521 Score = 150 bits (378), Expect(2) = 3e-51 Identities = 72/165 (43%), Positives = 101/165 (61%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVADSLKAELAAIWQ 585 W+ G +LNVDGSS GN G +G GGL+ + G W GF G++G ++ L AEL A++ Sbjct: 357 WNALGSPDMILNVDGSSIGNPGVSGFGGLIHNSKGAWAHGFVGNIGFSNILHAELMALYH 416 Query: 586 GLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKMEHSLR 765 GL LAW ++L CY+DS A+ L+ + + H YAAI+ + +D+L R W V + H+ R Sbjct: 417 GLLLAWQLNIKELWCYSDSETAIKLITEPVDEWHHYAAILLNIKDILAREWRVNIAHTFR 476 Query: 766 EGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 EGN+CAD LA++GA L V+ PP + LLADA F R Sbjct: 477 EGNACADYLAKLGACNNEALSVMTNPPASLNLLLLADASGTWFPR 521 Score = 81.6 bits (200), Expect(2) = 3e-51 Identities = 43/125 (34%), Positives = 55/125 (44%) Frame = +3 Query: 45 WRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVRDCT 224 W W+W +PAPE IK LW H + PT LSHRGI C RCN Sbjct: 231 WSWLWHIPAPEKIKFFLWTALHKALPTKAMLSHRGILHDSSCPRCNK------------- 277 Query: 225 LPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMSIAQETRN 404 S+ F D FY W+ G+ S + LF A +W W RN + + E+ + Sbjct: 278 -------SIFFEDDEFYV------WLWNGLDSPSKLLFTAAIWWIWCTRNNLCMNNESIS 324 Query: 405 LASLR 419 SLR Sbjct: 325 QVSLR 329 >GAU20604.1 hypothetical protein TSUD_33400 [Trifolium subterraneum] 
Length = 1174 Score = 117 bits (293), Expect(2) = 8e-51 Identities = 51/133 (38%), Positives = 71/133 (53%), Gaps = 1/133 (0%) Frame = +3 Query: 3 WIIQQKGS-SIITRDWRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRC 179 W++ Q +I W WIWQ+ PE +K L WL HD+ PT+ L HR I+ IC RC Sbjct: 841 WLLAQTDQVTIPLNSWSWIWQIAGPEKLKFLFWLSCHDAVPTLSMLHHRNIASCPICTRC 900 Query: 180 NNTAETFVHSVRDCTLPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTA 359 ETF+H VRDC + +W+ LGF +F+ W+ + +F+AGVW Sbjct: 901 GQQIETFLHCVRDCIFSRPVWIRLGFTSRIFFDITSVHDWLKSAYFRPHRFVFMAGVWWL 960 Query: 360 WTDRNAMSIAQET 398 W RN M ++ ET Sbjct: 961 WRHRNLMCLSNET 973 Score = 112 bits (281), Expect(2) = 8e-51 Identities = 64/170 (37%), Positives = 98/170 (57%), Gaps = 1/170 (0%) Frame = +1 Query: 394 RQETWHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSV-GVADSLKAEL 570 R W+ + +LN+DGS G R G G ++R+ G +I G G + G +D L AEL Sbjct: 1005 RTVKWNSTDFTGFILNMDGSCSGTPIRCGFGCIIRNNVGSYIAGASGHIIGSSDILLAEL 1064 Query: 571 AAIWQGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKM 750 + I+ GL LA G DL+CYTDS ++ +L++ S H Y +I + +D +++ N+ + Sbjct: 1065 SGIFHGLKLASSLGITDLICYTDSLLSCNLIQGPYSHYHIYGVLIQNIKDYMQQS-NINI 1123 Query: 751 EHSLREGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 H+LREGN CAD LA++GA+ T LV+ D P ++ + DA F R Sbjct: 1124 CHTLREGNQCADYLAKLGASSTEALVIHDTPTADLCNLMDLDARGTIFTR 1173 >GAU18899.1 hypothetical protein TSUD_228890 [Trifolium subterraneum] Length = 1098 Score = 115 bits (288), Expect(2) = 2e-50 Identities = 51/118 (43%), Positives = 67/118 (56%) Frame = +3 Query: 45 WRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVRDCT 224 W WIW++ PE +K L WL H++ PT+ L HR I+ S IC RC+N ETF+H VRDC Sbjct: 781 WSWIWKITGPEKLKILFWLACHEAVPTLAMLHHRNIASSPICPRCSNHNETFLHCVRDCI 840 Query: 225 LPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMSIAQET 398 + +W LGF + F+ + W+ S LFLAGVW W RN M + ET Sbjct: 841 HSKTVWDQLGFTNSSFFDSTMAHEWLKHSYFSPRRLLFLAGVWWIWRHRNNMCLGDET 898 Score = 113 bits (283), Expect(2) = 2e-50 Identities = 62/166 (37%), Positives = 96/166 (57%), Gaps = 
1/166 (0%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSV-GVADSLKAELAAIW 582 W+ + V+N DGS G R G G ++R+ DG +I G G + +D L AEL+ I+ Sbjct: 933 WNFTNFTGVVINTDGSCSGTPARTGFGCIIRNNDGRYITGASGHITNSSDILLAELSGIY 992 Query: 583 QGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKMEHSL 762 GL LA G D +CYTDS ++ +L++ +S H Y +I + +D +++ N+ + H+L Sbjct: 993 HGLQLAISLGITDFICYTDSLISCNLIQGVSSPYHIYGVLIQNIKDSMQQS-NIIICHTL 1051 Query: 763 REGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 REGN CAD LA++GA+ L + D PPP++ + DA F R Sbjct: 1052 REGNQCADYLAKLGASSHDALWIHDTPPPDLRSLMDIDARGTLFTR 1097 >GAU11845.1 hypothetical protein TSUD_75960 [Trifolium subterraneum] Length = 386 Score = 118 bits (296), Expect(2) = 1e-49 Identities = 50/117 (42%), Positives = 68/117 (58%) Frame = +3 Query: 45 WRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVRDCT 224 W W+W LPAPE IK +W H+S PT L+HRGI ++C RCN ET +H +RDC Sbjct: 124 WSWLWHLPAPEKIKFFIWTLLHNSLPTRDMLTHRGIIHGNMCPRCNIHVETDLHCLRDCD 183 Query: 225 LPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMSIAQE 395 +W SLGF D F+ E+ S W+ G+ + LF+A +W W RNA+ + E Sbjct: 184 FVYTIWKSLGFTDHNFFQEVDSSSWLRNGLSCSSMFLFMAAIWWIWRTRNALCLDNE 240 Score = 107 bits (268), Expect(2) = 1e-49 Identities = 53/121 (43%), Positives = 74/121 (61%) Frame = +1 Query: 376 QCLLLKRQETWHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVADS 555 Q L + W+ G +LNVD SS GN G +G GGL+ + G WI GF+G++GV + Sbjct: 266 QVTTLPKIVRWNALGGTSMILNVDRSSIGNPGISGFGGLICNAYGAWIHGFFGNLGVTNI 325 Query: 556 LKAELAAIWQGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRP 735 L AEL AI +GL LAW +DL CY+DS+ A+ L+ + H YAAI+ + +D+L R Sbjct: 326 LHAELMAILKGLLLAWELNIKDLSCYSDSATAIKLITEPVDVWHHYAAILNNIKDILNRD 385 Query: 736 W 738 W Sbjct: 386 W 386 >GAU10400.1 hypothetical protein TSUD_423410, partial [Trifolium subterraneum] Length = 284 Score = 171 bits (434), Expect = 8e-48 Identities = 88/189 (46%), Positives = 118/189 (62%) Frame = +1 Query: 337 SLLVYGLPGLTETQCLLLKRQETWHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEW 516 
+L+V+ P T +R +WHP VLNVDGS G+ GRAG GGL R GDGEW Sbjct: 98 ALIVFCFPARVHTDTP--RRWISWHPCKTDCVVLNVDGSCLGDPGRAGFGGLFRKGDGEW 155 Query: 517 IQGFYGSVGVADSLKAELAAIWQGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYA 696 I+GF G +GV + + AEL A++ GL +A G L CY+DS L LL + + H YA Sbjct: 156 IRGFSGYLGVTNIMLAELMAVYHGLKIAREAGYNRLFCYSDSKTVLDLLSKERNSFHCYA 215 Query: 697 AIIGSAQDLLRRPWNVKMEHSLREGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLAD 876 AII + QDLL W+V ++HSLREGN CAD LA++G+A + + PPP++ LL+D Sbjct: 216 AIIANIQDLLVLEWDVSLKHSLREGNFCADFLAKLGSANDEKFFIWESPPPDMQDLLLSD 275 Query: 877 AMRVRFQRA 903 A+RV + RA Sbjct: 276 ALRVPYPRA 284 Score = 66.6 bits (161), Expect = 7e-09 Identities = 29/79 (36%), Positives = 45/79 (56%) Frame = +3 Query: 150 ISQSDICKRCNNTAETFVHSVRDCTLPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNP 329 +S SDIC RC++ ET +H +RDC + +++W SLGF++ F++ W+ N Sbjct: 1 LSSSDICTRCSSGEETILHCLRDCPISRRIWNSLGFQNSSFFSCSDLELWLRNNSIGLNA 60 Query: 330 ALFLAGVWTAWTDRNAMSI 386 FLAG+W W RN + Sbjct: 61 PTFLAGLWWNWRARNIFCV 79 >GAU48622.1 hypothetical protein TSUD_133530 [Trifolium subterraneum] Length = 350 Score = 125 bits (314), Expect(2) = 2e-47 Identities = 69/185 (37%), Positives = 99/185 (53%) Frame = +1 Query: 346 VYGLPGLTETQCLLLKRQETWHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQG 525 VY +T L+K W+ G + +LNVDGSS GN G + GGL+R+ +G W G Sbjct: 187 VYSSDLITTPNTKLVK----WNALGSSGMILNVDGSSIGNPGVSRYGGLIRNSEGAWAHG 242 Query: 526 FYGSVGVADSLKAELAAIWQGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAII 705 F G++G ++ L EL A+++GL LAW ++L CY+DS A+ L++ Sbjct: 243 FAGNIGFSNILHPELMALYRGLLLAWQLNIKELWCYSDSEAAIKLIK------------- 289 Query: 706 GSAQDLLRRPWNVKMEHSLREGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMR 885 D+L R W V + H+ REGN+C D LA++GA V+ PP + LLADA Sbjct: 290 ----DILAREWRVNIAHTFREGNACVDYLAKLGARNNEAFYVMASPPAGLNLLLLADASG 345 Query: 886 VRFQR 900 F R Sbjct: 346 TWFPR 350 Score = 93.2 bits (230), Expect(2) = 2e-47 Identities = 37/84 (44%), Positives = 48/84 (57%) Frame = +3 Query: 45 WRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVRDCT 224 W W+W +PAPE IK LW H + PT 
LSHRGI C RCNN ET +H +RDC Sbjct: 97 WSWLWHIPAPEKIKFFLWTALHKALPTKAMLSHRGILHDSSCPRCNNNVETTIHCLRDCD 156 Query: 225 LPQQMWLSLGFRDPLFYAELPSLG 296 + +W S+GF +F+ + G Sbjct: 157 FAKNIWKSIGFTKSIFFEDAEWFG 180 >ABD28627.2 RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H [Medicago truncatula] Length = 1296 Score = 108 bits (269), Expect(2) = 1e-46 Identities = 47/118 (39%), Positives = 64/118 (54%) Frame = +3 Query: 45 WRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVRDCT 224 W WIWQL APE K L+WL H++APT+ L HR ++ + C RC E+F+H VRDC Sbjct: 977 WSWIWQLTAPEKYKLLIWLACHNAAPTLSLLHHRKMAPAATCSRCGENDESFLHCVRDCK 1036 Query: 225 LPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMSIAQET 398 +W LGF F++ W+ FLAG+W +W RN M ++ ET Sbjct: 1037 HSAAIWHKLGFVTAAFFSVSSVQDWIRNFSSGSRAITFLAGLWWSWRHRNLMCLSNET 1094 Score = 108 bits (269), Expect(2) = 1e-46 Identities = 62/170 (36%), Positives = 92/170 (54%), Gaps = 1/170 (0%) Frame = +1 Query: 394 RQETWHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVA-DSLKAEL 570 R W+ +LNVDGS G RAG GG+ R+ G ++ G+ G + + D L AEL Sbjct: 1127 RMVKWNQGNHQCHILNVDGSCLGTPIRAGFGGIFRNNVGGYLSGYSGFISESTDILLAEL 1186 Query: 571 AAIWQGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKM 750 A+ QGL +A G +L CY+DS + ++L+ + S+ H YA +I +DLL N + Sbjct: 1187 TALHQGLIMAAEMGIEELACYSDSLLTINLITRTTSKYHTYAVLIQDIKDLL-SAHNFSV 1245 Query: 751 EHSLREGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 H REGN CAD LA++GA+ + +V E+ + DA+ F R Sbjct: 1246 YHCFREGNQCADYLAKLGASSNEECLVHASACQELLVLIQMDAIGTLFPR 1295 >GAU41508.1 hypothetical protein TSUD_302460 [Trifolium subterraneum] Length = 1075 Score = 118 bits (296), Expect(2) = 3e-46 Identities = 66/166 (39%), Positives = 101/166 (60%), Gaps = 1/166 (0%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVA-DSLKAELAAIW 582 W+ + +LNVDGS G+ RAG GG++R+ G ++ GF G + + D L AEL AI+ Sbjct: 910 WNNDNFSCVILNVDGSCLGSPVRAGYGGIIRNDSGFYLSGFSGFIRESSDILLAELYAIY 969 Query: 583 QGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKMEHSL 762 
QGL+LA +LVCY+DS + ++L++ + H YA +I ++L+ + NV + H+ Sbjct: 970 QGLTLAKDLVIDELVCYSDSLLCINLIKGPIVKYHVYAVLIQDIKELISQS-NVTLCHTF 1028 Query: 763 REGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 REGN CAD LA++GA+ ADL++ PP I L +D+ F R Sbjct: 1029 REGNQCADFLAKLGASSDADLIIHASPPDGIFDLLKSDSFGTFFLR 1074 Score = 96.3 bits (238), Expect(2) = 3e-46 Identities = 46/118 (38%), Positives = 61/118 (51%) Frame = +3 Query: 63 LPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVRDCTLPQQMW 242 L P I WL H+S PT+ L+HR +S S IC RC ETF+H VRDC + +W Sbjct: 763 LKLPSKIIFFFWLVCHNSVPTLSLLNHRKMSNSFICARCGLQEETFLHCVRDCDFSRNIW 822 Query: 243 LSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMSIAQETRNLASL 416 +GF DP F++ + W+ G F A VW AW RN M + E+ +L L Sbjct: 823 HHIGFNDPTFFSFTDAREWLKVGSTGSQAYTFSASVWWAWRHRNLMCLNNESWSLNRL 880 >ABO80459.1 RNA-directed DNA polymerase (Reverse transcriptase); Ribonuclease H [Medicago truncatula] Length = 869 Score = 109 bits (273), Expect(2) = 9e-46 Identities = 64/171 (37%), Positives = 98/171 (57%), Gaps = 1/171 (0%) Frame = +1 Query: 394 RQETWHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSV-GVADSLKAEL 570 R W+ S +LNVDGS G+ RAG GGL+R+ G ++ GF G + +D L AEL Sbjct: 700 RYVKWNNSNFNCTILNVDGSCIGSPIRAGFGGLIRNSVGFYLSGFLGFLPSSSDILLAEL 759 Query: 571 AAIWQGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKM 750 AI+ G++ A G D+ Y+DS ++++L+ +S+ H +AA+I +D L N + Sbjct: 760 TAIYDGINTAIDMGITDMAVYSDSLLSINLITTTSSKFHIHAALIQDIRDKLSLR-NFSL 818 Query: 751 EHSLREGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQRA 903 H+LREGN AD LA++GA ++++ PP E+ L DA F R+ Sbjct: 819 NHTLREGNQSADYLAKLGAMSDVNVLIHQSPPDELCPLLKNDAAGTLFLRS 869 Score = 103 bits (257), Expect(2) = 9e-46 Identities = 49/138 (35%), Positives = 68/138 (49%) Frame = +3 Query: 3 WIIQQKGSSIITRDWRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCN 182 W++ G+ T W WI + E K L+WL HDS PT L HR I S C RC Sbjct: 537 WLLSLSGNDNNTHSWSWILKKKISEKYKFLIWLACHDSLPTAALLHHRQIIASATCARCG 596 Query: 183 NTAETFVHSVRDCTLPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAW 362 + E+ H +RDC + +W +GF 
+P F+A W G+ LF AG+W W Sbjct: 597 VSDESVFHCIRDCPFSKIIWHHIGFSEPYFFAVTDIEIWCKSGLIGSKAILFAAGLWWIW 656 Query: 363 TDRNAMSIAQETRNLASL 416 RNA +++E+ L L Sbjct: 657 RSRNARCMSEESMLLQRL 674 >GAU12283.1 hypothetical protein TSUD_141910 [Trifolium subterraneum] Length = 1049 Score = 153 bits (386), Expect(2) = 2e-45 Identities = 77/165 (46%), Positives = 104/165 (63%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVADSLKAELAAIWQ 585 W+ G +LNVDGSS GN G +G GGL+ + DG W+ GF+G++GV + L AEL AI++ Sbjct: 885 WNALGGTGMILNVDGSSLGNPGISGFGGLIHNADGAWVLGFFGNLGVNNILHAELRAIYK 944 Query: 586 GLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKMEHSLR 765 GL LAW +DL CY+DS MA+ L+ ++ Q H YAAI+ + QD+LRR W V + H+ R Sbjct: 945 GLLLAWDLNIKDLWCYSDSEMAIKLISESVDQWHHYAAILNNIQDILRRDWQVLILHTFR 1004 Query: 766 EGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 EGN+ AD LA+ GA + PP + LLADA F R Sbjct: 1005 EGNAYADYLAKHGANNNKVFSSIATPPAGLNLSLLADASGTWFSR 1049 Score = 58.9 bits (141), Expect(2) = 2e-45 Identities = 27/81 (33%), Positives = 42/81 (51%) Frame = +3 Query: 177 CNNTAETFVHSVRDCTLPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWT 356 CN ET +H +RDC Q +W S+GF + F+ W+ G+ + LF+A +W Sbjct: 777 CNTHLETTLHCLRDCDFAQSIWKSIGFSNLNFFQGDDPYVWIRNGLHCSSMFLFMATIWW 836 Query: 357 AWTDRNAMSIAQETRNLASLR 419 W RNA+ + E+ SL+ Sbjct: 837 IWRARNALCLNSESILFYSLK 857 >GAU23316.1 hypothetical protein TSUD_237700 [Trifolium subterraneum] Length = 418 Score = 109 bits (273), Expect(2) = 6e-44 Identities = 47/118 (39%), Positives = 65/118 (55%) Frame = +3 Query: 45 WRWIWQLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVRDCT 224 W WIW+L PE IK WL H+S PT+ L HR ++ S C RC ETF+H VRDC Sbjct: 127 WSWIWKLQLPEKIKFFFWLVCHNSVPTLSLLDHRKMNLSATCARCGLREETFLHCVRDCD 186 Query: 225 LPQQMWLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMSIAQET 398 +W +GF +P F++ + + W+ G +F AGVW +W + N M + ET Sbjct: 187 FSISIWHHIGFDNPDFFSSMDAHDWLKWGSTGSKAFIFSAGVWWSWRNHNLMCLNNET 244 Score = 97.4 bits (241), Expect(2) = 6e-44 Identities = 60/166 (36%), Positives 
= 90/166 (54%), Gaps = 1/166 (0%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSV-GVADSLKAELAAIW 582 W+ + + +LNVD S G+ R+G GG+ R+ G ++ GF G + G +D + AE AI+ Sbjct: 253 WNNNNFSGVILNVDESCLGSPIRSGFGGIFRNDSGFYLSGFSGFIQGSSDIMLAEPYAIY 312 Query: 583 QGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKMEHSL 762 GLSLA + VCY+DS ++L+ + H +A +I ++ L NV + H+L Sbjct: 313 HGLSLAEDMEINEFVCYSDSLHRINLITGLTLKYHVHAVLIQDIKEFLSNR-NVSLCHTL 371 Query: 763 REGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 EGN CAD A+ GA+ DL + PP I L +DA F R Sbjct: 372 GEGNQCADFFAKHGASSDVDLFIHASPPECILDLLRSDAAGTFFLR 417 >GAU50246.1 hypothetical protein TSUD_188980 [Trifolium subterraneum] Length = 458 Score = 112 bits (280), Expect(2) = 8e-44 Identities = 63/166 (37%), Positives = 100/166 (60%), Gaps = 1/166 (0%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSV-GVADSLKAELAAIW 582 W+ + +LNVDGS G+ RAG GG++R+ G ++ GF G + G +D L AEL I+ Sbjct: 293 WNNNNFPGVILNVDGSCLGSPVRAGFGGVIRNESGFYLSGFSGFIQGSSDILLAELFVIY 352 Query: 583 QGLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKMEHSL 762 + L+LA +LVCY+DS ++L++ + + H YA +I ++L+ + N+ + H+L Sbjct: 353 KSLTLAKNMAIDELVCYSDSLHCINLIKGPSIKYHVYAVLIQDIKELMSQS-NITLCHTL 411 Query: 763 REGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 REGN+CAD LA++GA+ +DL + PP L +DA F R Sbjct: 412 REGNNCADFLAKLGASSDSDLTIHASPPEGFLDILRSDATGTFFLR 457 Score = 94.4 bits (233), Expect(2) = 8e-44 Identities = 46/119 (38%), Positives = 62/119 (52%) Frame = +3 Query: 60 QLPAPEMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVRDCTLPQQM 239 +L PE IK L WL H+S PT+ L HR ++ S C RC ETF+H VRDC L + + Sbjct: 145 ELHLPEKIKFLFWLACHNSVPTLSLLHHRRMNPSSNCPRCCTHEETFLHCVRDCDLSRPI 204 Query: 240 WLSLGFRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMSIAQETRNLASL 416 W LGF P F++ + W+ G F VW AW +N M + ET ++ L Sbjct: 205 WHRLGFITPDFFSSSDAHEWLKFGSTGSQAFAFSTSVWWAWRHQNLMCLQNETWSINRL 263 >GAU35137.1 hypothetical protein TSUD_394630 [Trifolium subterraneum] Length = 547 Score = 103 bits (258), 
Expect(2) = 1e-42 Identities = 49/115 (42%), Positives = 67/115 (58%) Frame = +3 Query: 75 EMIKSLLWLEAHDSAPTMKTLSHRGISQSDICKRCNNTAETFVHSVRDCTLPQQMWLSLG 254 E +K LW H+S PT + LSHRGI Q ++C RCN AET +H +RDC Q +W S+G Sbjct: 297 EKLKFFLWTMIHNSLPTSEMLSHRGILQGNLCPRCNTLAETTLHCLRDCDFVQIIWKSIG 356 Query: 255 FRDPLFYAELPSLGWMHKGMKSHNPALFLAGVWTAWTDRNAMSIAQETRNLASLR 419 F D F+ E S W+ G+ + +FLA VW W RNA+ + E L +L+ Sbjct: 357 FSDLNFFQEDDSYDWLRNGLSFPSMFIFLAAVWWIWRARNALCLNSELIPLFALK 411 Score = 99.0 bits (245), Expect(2) = 1e-42 Identities = 47/107 (43%), Positives = 71/107 (66%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVADSLKAELAAIWQ 585 W+ G + +LNV+GSS GN G G GGL+R+ DG + GF+G++GV + L AEL AI++ Sbjct: 439 WNAGGTDM-ILNVEGSSIGNPGIYGFGGLIRNADGACVHGFFGNLGVTNILHAELMAIYK 497 Query: 586 GLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLL 726 GL LAW +DL CY+DS M + L+ + + H YA I+ + +++L Sbjct: 498 GLLLAWELNIKDLWCYSDSEMVIKLITEPVDEWHHYATILINIKEIL 544 >GAU34195.1 hypothetical protein TSUD_162960 [Trifolium subterraneum] Length = 168 Score = 153 bits (386), Expect = 4e-42 Identities = 78/165 (47%), Positives = 103/165 (62%) Frame = +1 Query: 406 WHPSGEAIAVLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVADSLKAELAAIWQ 585 W+ G +LNVDGSS GN G +G GGL+R+ DG W+ GF G++G ++ L AEL AI+ Sbjct: 4 WNAHGGIGMILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHSNILHAELLAIYH 63 Query: 586 GLSLAWLCGARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKMEHSLR 765 GL LAW +DL CY+DS AL L+ ++ H YAAII + +D L R W V++ H+LR Sbjct: 64 GLVLAWELDIKDLCCYSDSKTALKLIYDHVNEWHHYAAIIYNIKDFLSRNWRVRLVHTLR 123 Query: 766 EGNSCADSLARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQR 900 EGN+CAD LA+ GA + PP E+ LLADA F R Sbjct: 124 EGNNCADFLAKFGARNPEAYSSIAVPPDEMNLLLLADASGTIFSR 168 >GAU33259.1 hypothetical protein TSUD_333820 [Trifolium subterraneum] Length = 284 Score = 155 bits (393), Expect = 9e-42 Identities = 78/157 (49%), Positives = 103/157 (65%) Frame = +1 Query: 433 VLNVDGSSFGNTGRAGVGGLLRHGDGEWIQGFYGSVGVADSLKAELAAIWQGLSLAWLCG 612 VLNVDGS G+ GRAG GGL R 
GDGEWI+G G +GV + AEL A++ GL +A G Sbjct: 128 VLNVDGSCLGDPGRAGFGGLFRKGDGEWIRGSSGYLGVTNITLAELMAVYHGLKIAREAG 187 Query: 613 ARDLVCYTDSSMALSLLRQAASQSHAYAAIIGSAQDLLRRPWNVKMEHSLREGNSCADSL 792 L CY+DS L LL + + H YAAII + QDLL W+V ++HS+REGN CAD L Sbjct: 188 YNRLFCYSDSKTVLDLLSKERNSFHCYAAIIANIQDLLVLEWDVSLKHSVREGNFCADFL 247 Query: 793 ARMGAAQTADLVVLDRPPPEIGGQLLADAMRVRFQRA 903 A++G+A + + PPP++ LL+DA+RV + RA Sbjct: 248 AKLGSANDEKFSIWESPPPDMQDLLLSDALRVPYPRA 284 Score = 72.4 bits (176), Expect = 7e-11 Identities = 32/80 (40%), Positives = 47/80 (58%) Frame = +3 Query: 135 LSHRGISQSDICKRCNNTAETFVHSVRDCTLPQQMWLSLGFRDPLFYAELPSLGWMHKGM 314 L HR +S SDIC RC++ ET +H +RDC + +++W SLGF++ F++ W+ Sbjct: 2 LHHRNLSSSDICTRCSSGEETILHCLRDCPISRRIWNSLGFQNSSFFSCSDLELWLRNNS 61 Query: 315 KSHNPALFLAGVWTAWTDRN 374 N FLAG+W W RN Sbjct: 62 IGLNAPTFLAGLWWNWRARN 81