BLASTX nr result
ID: Astragalus24_contig00002805
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus24_contig00002805 (1093 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PNX81608.1| ribonuclease H [Trifolium pratense] 232 4e-67 gb|PNX94788.1| ribonuclease H [Trifolium pratense] 222 3e-64 gb|PNY12420.1| ribonuclease H [Trifolium pratense] 215 2e-58 dbj|GAU19541.1| hypothetical protein TSUD_303540 [Trifolium subt... 205 1e-56 dbj|GAU13938.1| hypothetical protein TSUD_262650 [Trifolium subt... 207 2e-56 dbj|GAU51253.1| hypothetical protein TSUD_412460, partial [Trifo... 203 3e-56 gb|PNY15498.1| ribonuclease H [Trifolium pratense] 192 3e-54 dbj|GAU10071.1| hypothetical protein TSUD_422240 [Trifolium subt... 189 9e-54 gb|PNY03121.1| ribonuclease H, partial [Trifolium pratense] 200 1e-53 dbj|GAU50328.1| hypothetical protein TSUD_290640 [Trifolium subt... 184 3e-50 dbj|GAU25543.1| hypothetical protein TSUD_259770 [Trifolium subt... 184 5e-49 dbj|GAU12817.1| hypothetical protein TSUD_73040 [Trifolium subte... 171 3e-46 gb|PNY12120.1| 3-ketoacyl-CoA synthase, partial [Trifolium prate... 167 6e-43 gb|PNY05394.1| ribonuclease H [Trifolium pratense] 167 2e-42 gb|PNX97917.1| ribonuclease H [Trifolium pratense] 165 8e-42 gb|PNY01766.1| ribonuclease H [Trifolium pratense] 154 2e-40 dbj|GAU43930.1| hypothetical protein TSUD_28720 [Trifolium subte... 154 2e-39 dbj|GAU46467.1| hypothetical protein TSUD_402340 [Trifolium subt... 148 3e-38 gb|PNX97265.1| ribonuclease H, partial [Trifolium pratense] 155 8e-38 gb|PNX86941.1| hypothetical protein L195_g043024, partial [Trifo... 147 1e-37 >gb|PNX81608.1| ribonuclease H [Trifolium pratense] Length = 630 Score = 232 bits (592), Expect = 4e-67 Identities = 122/334 (36%), Positives = 187/334 (55%), Gaps = 24/334 (7%) Frame = +2 Query: 20 NWAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCP 199 +W +IW++ ER+R F+WII HDRLLTN RK++MH+ E +HC V TLHV+RDCP Sbjct: 298 DWLQIWRLRVPERIRTFIWIIRHDRLLTNYRKSKMHISEPWCNHCVDVVEDTLHVLRDCP 357 Query: 200 QALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNK 379 A ++W +L++ K +D F L EWI LNMK LG D+ L+W++VWA +CH LW+WRN+ Sbjct: 358 LAKSVWCNLLNGKDKDWFFTAALDEWIILNMKKQLGRDNNLSWASVWATSCHFLWLWRNR 417 Query: 380 FTHDKDYIMPWEPWTIPIKETRDYKLLNQMQIIRQHTEKVTKLIGLIPHCP------L*V 541 TH + P +PW + +K Y + I+ + +KV I + CP L Sbjct: 418 ETHGDSRLRPLQPWKLILKWVMQYFEADVSGIVIANRQKVE--ITVTWQCPEDGWLSLNT 475 Query: 542 DG*N------------------WMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLI 667 DG + W+ GFS+ LG C+A++AELWGV+ GL LA+ K +L Sbjct: 476 DGASRGHTSAGCGGLLRNSEGQWLGGFSRNLGRCNAYIAELWGVHDGLCLARDKGAKKLK 535 Query: 668 VEIDSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALAYIG 847 V +DS +V +++ G W L+ +I++ L +++ H ++EAN A ALA +G Sbjct: 536 VYVDSSVVVHTLNSTTGGSVVGWRLIQEIRR-LLALDWEIKVCHSYREANACADALANMG 594 Query: 848 NTQVQDVAFYDILPDVIASIVYSDCRGSVYPRAV 949 + Y+ P ++ ++ +D G PR + Sbjct: 595 CDHGPGIRVYEQCPSRLSLLLLADTMGITTPRVI 628 >gb|PNX94788.1| ribonuclease H [Trifolium pratense] Length = 509 Score = 222 bits (565), Expect = 3e-64 Identities = 113/336 (33%), Positives = 189/336 (56%), Gaps = 24/336 (7%) Frame = +2 Query: 17 ENWAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDC 196 + W +IWKI + ER++ F+W ++HDRL+T +R AR +G + C + +T+HV+RDC Sbjct: 174 KKWTQIWKIDSTERIKVFIWQLAHDRLMTKARLARWQIGNSYCDSCTQFEESTIHVVRDC 233 Query: 197 PQALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRN 376 P+A+++W HL+S++ R F EWI LN+ G + +W A+WA TC+LLW+WRN Sbjct: 234 PRAVHLWRHLISNQERGYFFVIEFDEWIHLNLNNKFGQNYGNDWKAIWATTCYLLWLWRN 293 Query: 377 KFTHDKDYIMPWEPWTIPIKETRDYK--LLNQMQIIRQHTEKVTKLIGLIP---HCPL*V 541 K HD ++++P PW + + YK +L + Q+ ++ + L P L Sbjct: 294 KSIHDDEFVIPERPWQVVMDYVAAYKHSMLTEEQVGHGRVQQQVDITWLAPPPGWFALNS 353 Query: 542 DG-------------------*NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQL 664 DG NW+ GF+K LG +A++AELWG+Y+GL LA+ +D +L Sbjct: 354 DGAAKLSESKAGCGGVLRNENGNWIEGFTKALGDTTAYMAELWGIYEGLRLAQRRDVMKL 413 Query: 665 IVEIDSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALAYI 844 + DS+ I ++ ++++G + +L+ KI + L + +V+ +H F+EAN+ A LA + Sbjct: 414 ELRTDSQVIAQSLQDRKRGSNMGCALLKKI-RSLLDGPWEVKIIHVFREANRCADMLANM 472 Query: 845 GNTQVQDVAFYDILPDVIASIVYSDCRGSVYPRAVA 952 G+ F+ P + IV D RG +PR ++ Sbjct: 473 GSEGPIGFEFFANPPPRVMQIVDDDIRGVSFPRLIS 508 >gb|PNY12420.1| ribonuclease H [Trifolium pratense] Length = 1341 Score = 215 bits (547), Expect = 2e-58 Identities = 121/339 (35%), Positives = 181/339 (53%), Gaps = 23/339 (6%) Frame = +2 Query: 2 GNPSNENWAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLH 181 GN S+ W IWK+ ERVR FVW++ HDRLLTNS K+ M LG A ++C V T +H Sbjct: 1003 GNESDSTWNMIWKLQVPERVRAFVWLLMHDRLLTNSTKSSMGLGHAMCNYCGDVVETAIH 1062 Query: 182 VMRDCPQALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLL 361 VMRDCP+A+ IW+ +V + R FL N+ W+ N++ ++ D W WA CH L Sbjct: 1063 VMRDCPKAMQIWVTVVPANDRGSFLMGNVKNWVCFNLQNSVTWDRRGQWREYWAQACHCL 1122 Query: 362 WMWRNKFTHDKDYIMPWEPWTIPIKETRDYK-LLNQMQIIRQHTEKVTKLIGLIP----H 526 W WRNK HD+D++ P P +K DY N ++ + T + + IG P Sbjct: 1123 WFWRNKDIHDEDFVRPTRPVQQVMKLLGDYMHAFNNNNVVLERTRSI-RWIGWSPPKMNF 1181 Query: 527 CPL*VDG*------------------NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKD 652 L DG W+ G++K +G CSA VAELWGV +GL Sbjct: 1182 VKLNTDGAYKENRAAGCGGVIRGCEGEWLGGYAKGVGLCSAFVAELWGVLEGLRYVHHIG 1241 Query: 653 YNQLIVEIDSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHA 832 + + + IDS+ +V + ++ G S+ +L+ +I + L + + +V+ H ++EANK A A Sbjct: 1242 FTMVELNIDSEAVVKVVKARQLGSSSGAALVKQIWRML-DMNWKVEISHTYREANKCADA 1300 Query: 833 LAYIGNTQVQDVAFYDILPDVIASIVYSDCRGSVYPRAV 949 LA +G+T +++ F+D P I I +D G PR + Sbjct: 1301 LANLGSTLDKELIFFDDCPSHIREICTADRLGITNPRLI 1339 >dbj|GAU19541.1| hypothetical protein TSUD_303540 [Trifolium subterraneum] Length = 642 Score = 205 bits (521), Expect = 1e-56 Identities = 111/328 (33%), Positives = 170/328 (51%), Gaps = 18/328 (5%) Frame = +2 Query: 23 WAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCPQ 202 W IW++ ERVR FVW++ HDRLLTN RK++M L E +HC + T+HV+RDCP Sbjct: 315 WLHIWRLKVPERVRSFVWLVRHDRLLTNYRKSKMQLCEPWCNHCIDIVEDTMHVLRDCPL 374 Query: 203 ALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNKF 382 A +W +L++ R F NL +WI +N+ LG ++ + WS VWA CH+LW+WRN+ Sbjct: 375 AKVVWSNLLNDTARYSFYHTNLEDWICMNLHKELGKETNVRWSCVWAVGCHVLWLWRNRE 434 Query: 383 THDKDYIMPWEPWTIPIKETRDYKLLNQMQIIRQHTEKVTKLIG---------------- 514 H + P + W + + YK + I +KV IG Sbjct: 435 CHGDMRVRPTQLWQTILYMVQQYKQADIKSIALPVHQKVEVPIGWNKPAGDWIKLNTDGA 494 Query: 515 LIPHCP--L*VDG*NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLIVEIDSKQ 688 P C L W+ GFS+ LG C+A++AELWGV GL + + ++ + IDS Sbjct: 495 SRPRCGGLLRNSNGQWLGGFSRHLGRCNAYLAELWGVLDGLNFTYERGHKKIELHIDSNV 554 Query: 689 IVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALAYIGNTQVQDV 868 +V +H+ G W ++ +I++ L V+ H ++EAN A ALA +G + Sbjct: 555 VVQTLHSARDGGVVGWRIIQEIRR-LLALDWDVKICHSYREANACADALANLGCDHGPGL 613 Query: 869 AFYDILPDVIASIVYSDCRGSVYPRAVA 952 Y+ P ++S++ +D G PR V+ Sbjct: 614 RVYEQCPPKVSSLLLADAMGITTPRVVS 641 >dbj|GAU13938.1| hypothetical protein TSUD_262650 [Trifolium subterraneum] Length = 875 Score = 207 bits (528), Expect = 2e-56 Identities = 117/338 (34%), Positives = 172/338 (50%), Gaps = 22/338 (6%) Frame = +2 Query: 5 NPSNENWAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHV 184 N + NW K+WK+ ERVR FVW++ HDRLLTN RK+RM LG A ++C ++ TTLH Sbjct: 538 NVESNNWIKVWKLNVPERVRCFVWLLLHDRLLTNYRKSRMGLGHAMCNYCGDLEETTLHA 597 Query: 185 MRDCPQALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLW 364 +RDC + WL +V + R F + +WI N+ +W WA TCH LW Sbjct: 598 IRDCALIIPFWLQVVPMEDRSSFFMEDTQDWISRNLMKGRTRRRGSDWCDFWATTCHSLW 657 Query: 365 MWRNKFTHDKDYIMPWEPWTIPIKETRDYKLLNQMQIIRQHTEKVTKLIGLIP----HCP 532 MWRNK HD++++ P +P K +Y+ Q + E IG P Sbjct: 658 MWRNKEAHDEEFVRPMQPVNYVQKRVEEYQHAKQASDLLDGREYTLVDIGWKPPSGSFVK 717 Query: 533 L*VDG------------------*NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYN 658 L DG W+ GF+K +G CSA +AELWGV++GL LAK + Sbjct: 718 LNTDGARKDNNKAGCGGIIRGNHGEWLGGFAKGVGECSAFIAELWGVFEGLSLAKRMCFR 777 Query: 659 QLIVEIDSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALA 838 ++ + IDS +V I + WSL+ I ++L + +V H ++E NK A ALA Sbjct: 778 KVELHIDSVAVVQVISTGKLKSKLGWSLVLNI-RKLLDLDWEVTITHAYRETNKCADALA 836 Query: 839 YIGNTQVQDVAFYDILPDVIASIVYSDCRGSVYPRAVA 952 IG +++ F++ P + +V +D G PR ++ Sbjct: 837 NIGCQLGREIIFFEDCPPHMKDLVLADVMGITTPRMIS 874 >dbj|GAU51253.1| hypothetical protein TSUD_412460, partial [Trifolium subterraneum] Length = 609 Score = 203 bits (516), Expect = 3e-56 Identities = 108/326 (33%), Positives = 170/326 (52%), Gaps = 23/326 (7%) Frame = +2 Query: 17 ENWAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDC 196 + W KIW++ ER+R F+W + HDR+LTN R A+ +L + +C ++ TTLHV+RDC Sbjct: 285 KKWFKIWRLETTERIRVFMWQVLHDRILTNWRTAKWNLTDPYCSYCEHMEETTLHVLRDC 344 Query: 197 PQALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRN 376 P A+ +W HL+ + R F L +WI+LN+ ++G L+W AVW TC LW WRN Sbjct: 345 PLAVEVWQHLLEEEHRGRFFIGQLHQWIDLNLSTSIGIRRDLDWDAVWVTTCFWLWKWRN 404 Query: 377 KFTHDKDYIMPWEPWTIPIKETRDYKLLNQMQIIRQHTEKVTKLIGLIPHCP-------- 532 K H+ ++ W+PW+ + +YK Q + + +K K I I Sbjct: 405 KRVHEPNHTSQWKPWSFILNLVNEYKYTKQARETEKPCQKELKDIKWIYPAKGWVCLNTD 464 Query: 533 ---------------L*VDG*NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLI 667 L D W+CGFSK LG SA++AE+WG+Y+GL +A++ +L Sbjct: 465 GAAKSDTGIAGCGGILRNDNGIWICGFSKFLGNTSAYMAEVWGLYEGLSMARNLGIERLE 524 Query: 668 VEIDSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALAYIG 847 V++DS+ +V G + W++M +I + L + + +V+ H F E N+ A LA +G Sbjct: 525 VQVDSEVLVMATKKDGTGCTMSWNIMRRI-RALLDLNWEVRIKHIFCEGNRCADVLANMG 583 Query: 848 NTQVQDVAFYDILPDVIASIVYSDCR 925 Q Y P + ++ D R Sbjct: 584 CNQDAVWMPYQESPAELLQVLSDDFR 609 >gb|PNY15498.1| ribonuclease H [Trifolium pratense] Length = 378 Score = 192 bits (488), Expect = 3e-54 Identities = 114/333 (34%), Positives = 172/333 (51%), Gaps = 23/333 (6%) Frame = +2 Query: 23 WAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCPQ 202 W+ I K+ ERVR FVW++S+ RLLTN K +MHLG C T HVMRDCP Sbjct: 47 WSHILKLEVNERVRCFVWLMSYGRLLTNHYKHKMHLGYPYCRRCSLQVETISHVMRDCPI 106 Query: 203 ALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNKF 382 A +W+H+V K+R F + WI N+ N + + W+A+WA CH LW+WRNK Sbjct: 107 ARVVWMHIVPHKLRRNFFNTGKDVWIAQNVLYNWENNDGIKWNALWAIACHYLWLWRNKE 166 Query: 383 THDKDYIMPWEPWTIPIKETRDYKLL------------NQMQI---------IRQHTEKV 499 THD + P E W K +DYK+ Q+Q+ ++ +T+ Sbjct: 167 THDSNVSRPPEAWVWIRKIAKDYKIATAMAEELQKGVRTQIQVKWQPPSIGWVKVNTDGA 226 Query: 500 TKLIGLIPHCPL*VDG*N--WMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLIVE 673 +K G C + G + W+ GFSK +G C++ VAELWGV +GL LA+ + + ++ + Sbjct: 227 SKSDG-SASCGGLIRGSDCEWLGGFSKHIGRCTSVVAELWGVSEGLKLARDRGFQRIELC 285 Query: 674 IDSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALAYIGNT 853 +DS ++S I N G ++ IQQ L + + +V+ H ++E N A LA + Sbjct: 286 VDSVSVISIIQNGGGGNVMAHRIVQSIQQML-SLNWEVKLKHIYREENHCADGLANLAFI 344 Query: 854 QVQDVAFYDILPDVIASIVYSDCRGSVYPRAVA 952 + + +D+ PD I +D G PR V+ Sbjct: 345 LPKGIVLFDVCPDGIREHFDADVIGVSTPRLVS 377 >dbj|GAU10071.1| hypothetical protein TSUD_422240 [Trifolium subterraneum] Length = 306 Score = 189 bits (479), Expect = 9e-54 Identities = 107/306 (34%), Positives = 156/306 (50%), Gaps = 22/306 (7%) Frame = +2 Query: 23 WAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCPQ 202 W+KIW++ ERVR FVWI+++DRLLTN RK+RM LG A +HC +V T++HVMRDC Sbjct: 2 WSKIWRLNVTERVRCFVWILAYDRLLTNYRKSRMGLGHAMCNHCGNVVETSMHVMRDCSL 61 Query: 203 ALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNKF 382 WL +V + R F +L +WI N+ ++ W +WA CH LW WRNK Sbjct: 62 VTPFWLQVVPMRERSTFFTEDLQQWIITNINTGRRGNNGSAWCDIWATACHCLWTWRNKE 121 Query: 383 THDKDYIMPWEPWTIPIKETRDYKLLNQMQIIRQHTEKVTKLIGLIPHC----PL*VDG- 547 HD +Y+ P K DY ++ + +++E+ IG P C L DG Sbjct: 122 MHDNNYVRPSHMVQHVYKIVEDYSQARRVNELMRNSERFFTQIGWQPPCGHFVKLNTDGA 181 Query: 548 -----------------*NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLIVEI 676 W+ GF+K +G CSA AELWGV +GL A+ + + + I Sbjct: 182 CKDNNIAGCGGIIRGAHGEWLGGFAKGVGVCSAFAAELWGVLEGLQYARRLGFTAVELNI 241 Query: 677 DSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALAYIGNTQ 856 DS +V I L+ +I++ L ++ H ++E+NK A ALA IG T Sbjct: 242 DSITVVQVIKTGRLSSPIGLPLVKQIRRYL-ELDWEIMIAHAYRESNKCADALASIGCTL 300 Query: 857 VQDVAF 874 +++ F Sbjct: 301 DREIIF 306 >gb|PNY03121.1| ribonuclease H, partial [Trifolium pratense] Length = 952 Score = 200 bits (508), Expect = 1e-53 Identities = 112/312 (35%), Positives = 169/312 (54%), Gaps = 23/312 (7%) Frame = +2 Query: 23 WAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCPQ 202 W IWK+ ERVR FVW+++HDRLLTNS K M L A H+C V+ TTLHVMRDCP+ Sbjct: 642 WTSIWKLKVPERVRAFVWLLTHDRLLTNSLKRNMGLSHAMCHYCGDVEETTLHVMRDCPK 701 Query: 203 ALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNKF 382 A+ IW ++ + + ++ +L +W+ N++ ++ +W WA CH LW WRNK Sbjct: 702 AMEIWAVVIPVQEKGKYMIGDLKQWVCYNLQNSITWSGKGSWRDYWAQACHCLWFWRNKE 761 Query: 383 THDKDYIMPWEPWTIPIKETRDY-KLLNQMQIIRQHTEKVTKLIGLIPHCP----L*VDG 547 HD+DY+ P P +K+ +Y ++ + T++V + IG P P L DG Sbjct: 762 LHDEDYMRPINPAHQVLKQRWEYIAAATNNSVVLERTKEV-RWIGWRPPKPNFVKLNTDG 820 Query: 548 ------------------*NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLIVE 673 WM GF+K +G CSA VAELWGV +GL ++ + Sbjct: 821 AYKENKAAGCGGVIRGSEGEWMGGFAKGVGLCSAFVAELWGVLEGLRYVHRMGLVKVELN 880 Query: 674 IDSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALAYIGNT 853 IDS+ +V + N+ S +L+ +I + L + + +V+ H ++EANK A ALA +G + Sbjct: 881 IDSEAVVQVVKNRXMSSSLGVALVKQIWR-LLDMNWEVEVSHTYREANKCADALANLGCS 939 Query: 854 QVQDVAFYDILP 889 ++ FYD P Sbjct: 940 LANEIVFYDSCP 951 >dbj|GAU50328.1| hypothetical protein TSUD_290640 [Trifolium subterraneum] Length = 474 Score = 184 bits (467), Expect = 3e-50 Identities = 92/262 (35%), Positives = 144/262 (54%), Gaps = 23/262 (8%) Frame = +2 Query: 5 NPSNENWAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHV 184 N W +IWK+ ER+ F+W++ HDRLLTN RK++MH+GE HC + TLHV Sbjct: 192 NSREMEWLRIWKLKVPERITSFIWLVRHDRLLTNYRKSKMHIGEPWCTHCVDIVEDTLHV 251 Query: 185 MRDCPQALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLW 364 +RDCP A ++W +L+++ R+ L +WI +N++ +LG + + W VWA +CH LW Sbjct: 252 LRDCPLAKSVWCNLLNNAARENSFAAELKDWIHMNLQQDLGRNMHMEWYGVWATSCHSLW 311 Query: 365 MWRNKFTHDKDYIMPWEPWTIPIKETRDYKLLN----------QMQI-----------IR 481 WRN+ THD+ + P PW + Y N QM++ + Sbjct: 312 TWRNRETHDETRLRPIHPWRYILDCHLQYMAANVNNIALSTRQQMEVDIAWQQPEAGWVV 371 Query: 482 QHTEKVTKLIGLIPHCP--L*VDG*NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDY 655 +T+ +K+ + C L W+ GFS+ LG CSA++AELWGV GL LA+ + Sbjct: 372 LNTDGASKM-DVAAGCGGLLRNSHGQWIGGFSRHLGICSAYLAELWGVLDGLRLARERGI 430 Query: 656 NQLIVEIDSKQIVSNIHNKEKG 721 +L V++DS+ +V +++ G Sbjct: 431 TKLKVQVDSRVVVQTLNSSNIG 452 >dbj|GAU25543.1| hypothetical protein TSUD_259770 [Trifolium subterraneum] Length = 633 Score = 184 bits (467), Expect = 5e-49 Identities = 111/331 (33%), Positives = 166/331 (50%), Gaps = 22/331 (6%) Frame = +2 Query: 23 WAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCPQ 202 W ++WK+ A ERVR F+W++ H++LLTNS K+ M L A +CR V+ TTLHV+RDC Sbjct: 307 WGRVWKLKAPERVRTFIWLVMHNKLLTNSLKSVMGLSHAMCSYCRVVEETTLHVLRDCTL 366 Query: 203 ALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNKF 382 A IW H+V R L EWI N+ + W+ WA C+LLW WRNK Sbjct: 367 AKKIWSHVVPLASRS-----GLQEWICFNLNNFVMGICEGTWNVFWAMACYLLWNWRNKE 421 Query: 383 THDKDYIMPWEPWTIPIKETRDYKLLNQMQIIRQHTEKVTKLIGLIP----HCPL*VDG* 550 H + ++ P P DY Q I + +V IG +P + L DG Sbjct: 422 LHVEGFLRPNRPVQHVRNMAADYIHAMQNSSIMVNRSQVASRIGWVPPRADYIKLNTDGA 481 Query: 551 N------------------WMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLIVEI 676 + W+ G++KC+G C+A VAELWGV +GL ++ + ++ + I Sbjct: 482 SKKMQLAGCGGVVRGSQGEWIGGYAKCVGMCNAFVAELWGVLEGLRFVRNMGFRKVELCI 541 Query: 677 DSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALAYIGNTQ 856 DS+ +V I N S SL+ +I + L + V+ H ++EAN A ALA +G + Sbjct: 542 DSQFVVQVIKNGCVQSSMGVSLLKQIWR-LLDLDWNVEVSHTYREANNCADALAMLGCSL 600 Query: 857 VQDVAFYDILPDVIASIVYSDCRGSVYPRAV 949 ++ ++ P I + +DC G PR + Sbjct: 601 GYEITTFEACPSHIRELYDADCMGITTPRLI 631 >dbj|GAU12817.1| hypothetical protein TSUD_73040 [Trifolium subterraneum] Length = 375 Score = 171 bits (433), Expect = 3e-46 Identities = 90/245 (36%), Positives = 130/245 (53%) Frame = +2 Query: 20 NWAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCP 199 NW K+WK+ ER FVW++ HDRLLT RK+RM LG A ++C ++ TTLH +RDC Sbjct: 117 NWIKVWKLNVPERDHCFVWLLLHDRLLTYYRKSRMGLGHAMCNYCGDLEETTLHAIRDCA 176 Query: 200 QALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNK 379 + WL +V + R F + WI N+ +W WA CH LWMWRNK Sbjct: 177 LIIPFWLQVVPMEDRSSFFMEDTQAWISTNLTKGRTQRRGSDWCDFWATACHSLWMWRNK 236 Query: 380 FTHDKDYIMPWEPWTIPIKETRDYKLLNQMQIIRQHTEKVTKLIGLIPHCPL*VDG*NWM 559 HD++++ P +P K +Y+ Q R+ K G+I + W+ Sbjct: 237 EAHDEEFVRPMQPVNYVQKRVEEYQHAKQANGARKDNNK-AGCGGIIRG-----NKGEWL 290 Query: 560 CGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLIVEIDSKQIVSNIHNKEKGRSNCWS 739 GF+K +G CSA +AELWGV++GL LAK + ++ + IDS +V I ++ WS Sbjct: 291 GGFAKGVGECSAFIAELWGVFEGLSLAKRMCFRKVELHIDSVAVVQVISTRKLKSKLGWS 350 Query: 740 LMNKI 754 L+ I Sbjct: 351 LVLNI 355 >gb|PNY12120.1| 3-ketoacyl-CoA synthase, partial [Trifolium pratense] Length = 609 Score = 167 bits (423), Expect = 6e-43 Identities = 85/232 (36%), Positives = 132/232 (56%), Gaps = 22/232 (9%) Frame = +2 Query: 98 LTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCPQALNIWLHLVSSKIRDIFLDCNLVEW 277 LT R+++MH+G HHC+S TTLHV+RDCP A+ IW++ V +I++ F D +L +W Sbjct: 155 LTEGRRSQMHIGTPYCHHCQSTIETTLHVLRDCPLAMIIWVNSVDVQIQNQFFDTDLNDW 214 Query: 278 IELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNKFTHDKDYIMPWEPWTIPIKETRDYKL 457 IELN+ + W WA CH LW WRNK HD+ +IMP +PW + T+ Y++ Sbjct: 215 IELNI-------NKPRWVQFWATGCHALWTWRNKLIHDESFIMPLQPWKEIHRSTQLYEM 267 Query: 458 LNQMQIIRQHTEKVTKLIGLIPHCP----L*VDG------------------*NWMCGFS 571 + +Q E+V + +P P + DG +W+CGF+ Sbjct: 268 HSSVQSTVNLVERVVTNVRWLPLEPGWVRINTDGASKGDEVAGCGGMIKGEDGSWICGFT 327 Query: 572 KCLGYCSAHVAELWGVYQGLMLAKSKDYNQLIVEIDSKQIVSNIHNKEKGRS 727 K +G CSA+VAELWGV + L +A+++ + Q+ + +DS +VSN+ + GR+ Sbjct: 328 KGVGVCSAYVAELWGVLEALQIARARGFRQVELHVDSLGVVSNLQAQHGGRA 379 >gb|PNY05394.1| ribonuclease H [Trifolium pratense] Length = 675 Score = 167 bits (422), Expect = 2e-42 Identities = 92/314 (29%), Positives = 152/314 (48%), Gaps = 4/314 (1%) Frame = +2 Query: 23 WAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCPQ 202 W +IW++ ERVR F+W++ HDRL+TN RK++MH+ HC T+HV+RD P Sbjct: 378 WRRIWRLQVLERVRSFIWLVRHDRLITNYRKSKMHICAPWCKHCVKAIEDTMHVLRDFPL 437 Query: 203 ALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNKF 382 A +W +L++S + DIF NL +WI LN+ +LG + WS VWA CH LW WRNK Sbjct: 438 AKVVWCNLMNSAVGDIFYAANLEDWITLNLNQDLGKEKEGTWSCVWAVGCHFLWFWRNKE 497 Query: 383 THDKDYIMPWEPWTIPIKETRDYKLLNQMQIIRQHTEKVTKLIGLI----PHCPL*VDG* 550 H + +PW + + + + Y+ N + + K IG L DG Sbjct: 498 AHGDESRRLSQPWQLIMSQFQHYQQANILHVASHVRHKAVVQIGWTRPEEDWIMLNTDGA 557 Query: 551 NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLIVEIDSKQIVSNIHNKEKGRSN 730 + + C G +L S + ++ + IDS +V + + + Sbjct: 558 SRPSSSAGC----------------GGLLRNSNGFKKIALHIDSYVVVHTLQSDKDDSVV 601 Query: 731 CWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALAYIGNTQVQDVAFYDILPDVIASIV 910 W ++ +I++ L +V+ H ++E+N LA +G + YD P ++S++ Sbjct: 602 GWRIIQEIRR-LLAMDWEVKISHSYRESNACGDTLANLGCDNEPGMQVYDHCPASLSSLL 660 Query: 911 YSDCRGSVYPRAVA 952 +D G PR ++ Sbjct: 661 LADVMGIATPRVIS 674 >gb|PNX97917.1| ribonuclease H [Trifolium pratense] Length = 712 Score = 165 bits (418), Expect = 8e-42 Identities = 107/333 (32%), Positives = 157/333 (47%), Gaps = 22/333 (6%) Frame = +2 Query: 23 WAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCPQ 202 W IWK+ ERVR FVW + +RLLTNS K RM L A C D T LH +RDC Sbjct: 381 WRNIWKLQVPERVRSFVWRVKWERLLTNSLKHRMGLTSAVCCFCGMADETILHALRDCSI 440 Query: 203 ALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNKF 382 W +V ++R F +L W+ +N+ W WA C W WRNK Sbjct: 441 VQQFWQQIVPQEVRGAFFMSSLQNWLHINVNYAGKLAIGGRWCDFWALACSCFWTWRNKE 500 Query: 383 THDKDYIMPWEPWTIPIKETRDY-KLLNQMQIIRQHTEKVTKLIGLIPH---CPL*VDG* 550 H + + P K +Y K L+ M+++ QH V + P L DG Sbjct: 501 LHGEKIVWPSNIIQHVRKLGENYRKALHTMEVVEQHESIVAHIHWKPPEGVFVKLNTDGA 560 Query: 551 N------------------WMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLIVEI 676 + W+ GF+K +G CSA VAELWGV +GL+L + + + + I Sbjct: 561 SKAGNRAGCGGVIRGNQGEWLGGFAKGVGNCSAFVAELWGVLEGLLLVQRMGFENVELSI 620 Query: 677 DSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNSSCQVQFVHCFKEANKVAHALAYIGNTQ 856 DSK +V I + ++ ++++ KI++ L V+ +H ++EANK A ALA G Sbjct: 621 DSKAVVHVITAGKATSADGYAIVRKIRR-LLLMDWNVKVLHEYREANKCADALANTGCIL 679 Query: 857 VQDVAFYDILPDVIASIVYSDCRGSVYPRAVAS 955 ++ FY P I +I+ +D G PR + + Sbjct: 680 DLELIFYQECPMEIRNILLADELGISTPRIIVA 712 >gb|PNY01766.1| ribonuclease H [Trifolium pratense] Length = 300 Score = 154 bits (388), Expect = 2e-40 Identities = 89/268 (33%), Positives = 134/268 (50%), Gaps = 23/268 (8%) Frame = +2 Query: 20 NWAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCP 199 +W K+WKI ER+ FVW+I HDRLLT R +H+GE + C +V TTL V+RDCP Sbjct: 4 DWDKVWKIEVPERIPSFVWLIKHDRLLTRYRLNYVHIGEPYCYRCGNVMETTLRVLRDCP 63 Query: 200 QALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNK 379 A+ IWL+ V+ + + F +L +WI++N + W WA CH LW WRNK Sbjct: 64 LAMVIWLNAVNLQQCEAFFTVDLSDWIKINF-------NQQKWVNFWAKACHALWTWRNK 116 Query: 380 FTHDKDYIMPWEPWTIPIKETRDY-KLLNQMQIIRQHTEKVTKLIGLIPHCP----L*VD 544 HD D+IMP PW + + Y +N + + +T + P P + + Sbjct: 117 LIHDDDFIMPNRPWLEISRRVQHYANQINVSSTVDLRNKVMTDI--RWPADPGWVKINTN 174 Query: 545 G*NWMCGFSKCLG------------------YCSAHVAELWGVYQGLMLAKSKDYNQLIV 670 G + G + C G + +VAELWGV +GL L +++ Y + Sbjct: 175 GASKGGGVAGCGGVIRGEMVVGCVALLKVWVFVVPYVAELWGVLEGLQLVRARGYLHFEL 234 Query: 671 EIDSKQIVSNIHNKEKGRSNCWSLMNKI 754 ID +VS + +K+ G + W L+ +I Sbjct: 235 NIDYLAVVSVLASKQGGAAAGWCLVQRI 262 >dbj|GAU43930.1| hypothetical protein TSUD_28720 [Trifolium subterraneum] Length = 432 Score = 154 bits (390), Expect = 2e-39 Identities = 91/254 (35%), Positives = 129/254 (50%), Gaps = 28/254 (11%) Frame = +2 Query: 23 WAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRDCPQ 202 W KIWK+ ERVR VW+++HDRLLTN K RM LG A CR ++ T LHV RDCP+ Sbjct: 175 WIKIWKLGVPERVRTLVWLLTHDRLLTNYNKHRMGLGTALCSFCRDIE-TALHVFRDCPK 233 Query: 203 ALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWRNKF 382 A+ +WL++V + R F +L WI N+ ++ + WS WA CH LW WRN+ Sbjct: 234 AMQVWLNVVLQEARQKFFHEDLSGWINYNLDYCWSGNNGVRWSNFWAMGCHRLWQWRNQE 293 Query: 383 THDKDYIMPWEP---WTIPIKETRDYKLLNQMQIIRQHTEKVTKLIGLIPH----CPL*V 541 H D+ P P + I E K+ + + H +LIG P L Sbjct: 294 VHSDDFQWPQRPVQHISSAIYEYSQAKMAGSDIVNKVHG---VRLIGWKPPDIGVVKLNT 350 Query: 542 DG------------------*NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLI 667 DG W+ GF+K LG C+++VAELW V +GL A+ Y + Sbjct: 351 DGACKDDRTAGCGGIIRNSDGRWIDGFAKSLGKCNSYVAELWEVLEGLKYARRLGYQAIN 410 Query: 668 VEIDS---KQIVSN 700 + +DS KQ++++ Sbjct: 411 LNVDSLAVKQVLTS 424 >dbj|GAU46467.1| hypothetical protein TSUD_402340 [Trifolium subterraneum] Length = 299 Score = 148 bits (373), Expect = 3e-38 Identities = 85/298 (28%), Positives = 145/298 (48%), Gaps = 22/298 (7%) Frame = +2 Query: 122 MHLGEATFHHCRSVDGTTLHVMRDCPQALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVN 301 M + HCR + T+LHV+RDC A IW+ +V +R F +L W +N+ Sbjct: 1 MRIAHMMCDHCRVFEETSLHVLRDCDVAKEIWMVVVPRSVRSAFFGGDLSHWFSINLDGE 60 Query: 302 LGTDSTLNWSAVWANTCHLLWMWRNKFTHDKDYIMPWEPWTIPIKETRDYKLLNQMQIIR 481 L + +NW WA C+ LW WRN+ HD + P +P + ++ R+YKL + + Sbjct: 61 LVGINDINWPEFWATVCYFLWNWRNREYHDNSFTRPVQPVQVIMQRCREYKLAARASRVV 120 Query: 482 QHTEKVTKLIGLIP----HCPL*VDG------------------*NWMCGFSKCLGYCSA 595 ++ +IG P L DG +W+ GF+K +G CSA Sbjct: 121 TSVPRINVMIGWEPPSQGWVKLNTDGARKNERVAGCGGIIRNNIGDWIGGFAKHVGSCSA 180 Query: 596 HVAELWGVYQGLMLAKSKDYNQLIVEIDSKQIVSNIHNKEKGRSNCWSLMNKIQQELNNS 775 VAELWGV +GL A + ++ +EIDS +V +++ E + +L+ I++ + Sbjct: 181 FVAELWGVLEGLNYAWKLGFKKVELEIDSAIVVDAVNSGETNSAMGIALIRSIRR-IIAL 239 Query: 776 SCQVQFVHCFKEANKVAHALAYIGNTQVQDVAFYDILPDVIASIVYSDCRGSVYPRAV 949 + V+ H ++E+N A A A +G +++ F+D I +++++D G R + Sbjct: 240 NWNVKVYHSYRESNLCADAFANLGCALDENIVFFDTCSSQIRNLLFADISGHTTLRLI 297 >gb|PNX97265.1| ribonuclease H, partial [Trifolium pratense] Length = 1220 Score = 155 bits (391), Expect = 8e-38 Identities = 61/140 (43%), Positives = 91/140 (65%) Frame = +2 Query: 2 GNPSNENWAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLH 181 G+ W +IWK+ ER+R F+W++ HDRL+TN RK +MH+GE HC + TLH Sbjct: 1063 GDNREMEWLRIWKLKVPERIRNFIWLVRHDRLITNYRKNKMHIGEPWCTHCVDIXEDTLH 1122 Query: 182 VMRDCPQALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLL 361 V+RDCP A ++W +L+++ R+ F L WI +N++ +LG + + WS+VWA +CH L Sbjct: 1123 VLRDCPLAKSVWCNLLNNAAREKFFAAELKTWIHMNLQQDLGRNMHMEWSSVWATSCHSL 1182 Query: 362 WMWRNKFTHDKDYIMPWEPW 421 W WRN+ THD+ + P PW Sbjct: 1183 WTWRNRETHDETRLRPIHPW 1202 >gb|PNX86941.1| hypothetical protein L195_g043024, partial [Trifolium pratense] Length = 309 Score = 147 bits (370), Expect = 1e-37 Identities = 77/240 (32%), Positives = 121/240 (50%), Gaps = 4/240 (1%) Frame = +2 Query: 14 NENWAKIWKIAAKERVRQFVWIISHDRLLTNSRKARMHLGEATFHHCRSVDGTTLHVMRD 193 N W +IW + ERV+ F+W+ HDRL+TN +K RM LG A ++C + T LHV+RD Sbjct: 47 NAPWNRIWNLHVTERVKSFIWLALHDRLITNHKKNRMGLGHAMCNYCGDISETELHVLRD 106 Query: 194 CPQALNIWLHLVSSKIRDIFLDCNLVEWIELNMKVNLGTDSTLNWSAVWANTCHLLWMWR 373 CP + +WL++V +R F ++ +WI LN+ + + W WA +CH++WMWR Sbjct: 107 CPLVMPLWLNVVDQSMRSDFSLGDIQQWISLNLCSSANGKEDIMWRNFWATSCHVIWMWR 166 Query: 374 NKFTHDKDYIMPWEPWTIPIKETRDYKLLNQMQIIRQHTEKVTKLI----GLIPHCPL*V 541 N HD + P I K+ Y+ M + + +I L L Sbjct: 167 NMAEHDDHFQRPSTLELIIAKQINQYQQKVMMNTVSNKVARTEVMIYWKTPLEGWVKLNT 226 Query: 542 DG*NWMCGFSKCLGYCSAHVAELWGVYQGLMLAKSKDYNQLIVEIDSKQIVSNIHNKEKG 721 DG + G + +VAELWGV +GL A+ + ++ + +DS + S + + KG Sbjct: 227 DG-------AYKEGSVAGYVAELWGVLEGLRYARKLGFTRIELNVDSSVVDSVLRLEGKG 279