BLASTX nr result
ID: Astragalus22_contig00004346
seq
BLASTX 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00004346 (1019 letters)

Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects
149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

dbj|GAU46467.1| hypothetical protein TSUD_402340 [Trifolium subt...  209   6e-62
gb|PNX66369.1| ribonuclease H, partial [Trifolium pratense]          205   4e-61
gb|PNY06182.1| ribonuclease H [Trifolium pratense]                   214   3e-60
gb|PNX79929.1| ribonuclease H, partial [Trifolium pratense]          212   3e-59
gb|PNX81608.1| ribonuclease H [Trifolium pratense]                   206   1e-57
gb|PNX94788.1| ribonuclease H [Trifolium pratense]                   201   1e-56
dbj|GAU43502.1| hypothetical protein TSUD_398950 [Trifolium subt...  206   7e-56
gb|PNX97917.1| ribonuclease H [Trifolium pratense]                   201   3e-55
dbj|GAU19541.1| hypothetical protein TSUD_303540 [Trifolium subt...  198   1e-54
dbj|GAU13938.1| hypothetical protein TSUD_262650 [Trifolium subt...  199   8e-54
gb|PNY17666.1| ribonuclease H [Trifolium pratense]                   198   1e-53
dbj|GAU31120.1| hypothetical protein TSUD_212270 [Trifolium subt...  188   2e-53
gb|PNY12420.1| ribonuclease H [Trifolium pratense]                   196   3e-52
gb|PNX81975.1| ribonuclease H, partial [Trifolium pratense]          182   3e-52
dbj|GAU38338.1| hypothetical protein TSUD_61990 [Trifolium subte...  194   5e-52
gb|PNY00696.1| ribonuclease H [Trifolium pratense]                   181   2e-51
dbj|BAE71259.1| hypothetical protein [Trifolium pratense]            188   2e-51
dbj|GAU31911.1| hypothetical protein TSUD_270960 [Trifolium subt...  191   8e-51
gb|PNY15602.1| ribonuclease H [Trifolium pratense]                   181   7e-50
dbj|GAU10071.1| hypothetical protein TSUD_422240 [Trifolium subt...  177   9e-50

>dbj|GAU46467.1| hypothetical protein TSUD_402340 [Trifolium subterraneum] Length = 299 Score = 209 bits (531), Expect = 6e-62 Identities = 111/291 (38%), Positives = 165/291 (56%), Gaps = 1/291 (0%) Frame = +3 Query: 129 TNIHLFLTCRQYWHAKQLYISLWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQW 308 T++H+ C AK++ W +P VRS F+G +L W + N+ + W Sbjct: 17 TSLHVLRDCDV---AKEI----WMVVVPRSVRSAFFGGDLSHWFSINLDGELVGINDINW 69 Query: 309 CNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNYESSKHLSVVHERTNQVT-M 485 WAT CY+LW WRN+ H+ ++ P P ++I Q Y+ + S V ++ M Sbjct: 70 PEFWATVCYFLWNWRNREYHDNSFTRPVQPVQVIMQRCREYKLAARASRVVTSVPRINVM 129 Query: 486 VHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGV 665 + W P + GW +NTDGA K+ VAGCG ++RN+ G WIGG A+ + SA++AELWGV Sbjct: 130 IGWEPPSQGWVKLNTDGARKN-ERVAGCGGIIRNNIGDWIGGFAKHVGSCSAFVAELWGV 188 Query: 666 IDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHI 845 ++G+N A G+ +EL++DS+IVV+ + S G L+R IR ++A NV++ H Sbjct: 189 LEGLNYAWKLGFKKVELEIDSAIVVDAVNSGETNSAMGIALIRSIRRIIALNWNVKVYHS 248 Query: 846 YREANKVADALANLGCTFLEGFSIIDTPPPSISLLVDADVVGVSTPRVIPV 998 YRE+N ADA ANLGC E DT I L+ AD+ G +T R+IP+ Sbjct: 249 YRESNLCADAFANLGCALDENIVFFDTCSSQIRNLLFADISGHTTLRLIPM 299 >gb|PNX66369.1| ribonuclease H, partial [Trifolium pratense] Length = 252 Score = 205 bits (521), Expect = 4e-61 Identities = 105/249 (42%), Positives = 148/249 (59%), Gaps = 1/249 (0%) Frame = +3 Query: 255 WLNGNMLNSTKLEVVLQWCNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNYE 434 W++ N+L+S K + W + WAT C+ LW WRNK V +E + P +T Y Sbjct: 5 WIDFNLLSSRKWREEIAWKDFWATACHCLWPWRNKEVRDEQFQRPQHVVTAVTDWVKQYN 64 Query: 435 SSKHLS-VVHERTNQVTMVHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGG 611 + L V+H V M++W+P + GW +NTDGA+K S VAGCG V+R+S+G W GG Sbjct: 65 QAMGLQQVLHNVEKNVVMINWKPPSEGWVKLNTDGAYKEGS-VAGCGGVIRDSNGVWRGG 123 Query: 612 IARRLFVDSAYLAELWGVIDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLV 
791 A+ L + SAY+AELWGV++G+ A S G+ +EL +DSS+V++ L G G LV Sbjct: 124 FAKNLGICSAYVAELWGVLEGLRYANSLGFNRVELNVDSSVVIHVLRRPGYGRPLGGALV 183 Query: 792 RRIRHLLACFSNVRILHIYREANKVADALANLGCTFLEGFSIIDTPPPSISLLVDADVVG 971 RI+ +L V I H YREANK AD LAN+GC +T P ++ ADV+G Sbjct: 184 MRIQRMLDLDWEVVINHSYREANKCADVLANIGCAIDTHMVYYETCPTECRNVMLADVMG 243 Query: 972 VSTPRVIPV 998 ++TPR+I V Sbjct: 244 IATPRIISV 252 >gb|PNY06182.1| ribonuclease H [Trifolium pratense] Length = 686 Score = 214 bits (545), Expect = 3e-60 Identities = 115/268 (42%), Positives = 160/268 (59%), Gaps = 1/268 (0%) Frame = +3 Query: 192 LWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQWCNVWATGCYYLWYWRNKYVHE 371 +WN+ +P + R F+ +W+ NM + K ++W VWAT CYY+W WRNK+ + Sbjct: 420 IWNNLVPLQGRLAFFTCNYQNWVVINMEVTVKTIEGVEWRVVWATACYYIWRWRNKFKFD 479 Query: 372 EGYSPPNCPSRIITQNAMNYESSKHL-SVVHERTNQVTMVHWRPAATGWTVINTDGAFKS 548 + P P + I Y +K + VV + + + WR +NTDGA Sbjct: 480 SNFVRPLHPHKEIMNYVDCYNKAKPVVDVVSASSRRRIDIRWRAPQRDCICLNTDGAL-- 537 Query: 549 TSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGVIDGINLAASKGYTDIELQLDS 728 VAGCG R+S+G W GG A+ + SAY+AELWGV +G+ LA K + +IELQ+DS Sbjct: 538 LGGVAGCGGAFRDSNGAWKGGFAKNIGSASAYVAELWGVYEGLCLARRKSFNNIELQVDS 597 Query: 729 SIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHIYREANKVADALANLGCTFLEG 908 +VV + G +GS +GR L+ RIR L+ NVRI H+YREANKVADA+A LGCT +G Sbjct: 598 LVVVRGIKGEEVGSASGRILLNRIRQLMNMDWNVRISHVYREANKVADAIAALGCT-TQG 656 Query: 909 FSIIDTPPPSISLLVDADVVGVSTPRVI 992 FS +TPP ++ L DV+GVSTPR+I Sbjct: 657 FSYFNTPPANLERLCLDDVMGVSTPRII 684 >gb|PNX79929.1| ribonuclease H, partial [Trifolium pratense] Length = 709 Score = 212 bits (539), Expect = 3e-59 Identities = 110/267 (41%), Positives = 156/267 (58%), Gaps = 1/267 (0%) Frame = +3 Query: 195 WNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQWCNVWATGCYYLWYWRNKYVHEE 374 W +P + R DF+ W NM + T LE ++W +WA+ C++LW WRNK + Sbjct: 444 WQHLVPIQNRLDFFTCNYHVWFQRNMESVTSLEGGIEWRVIWASTCFHLWLWRNKEKFDT 503 Query: 375 GYSPPNCPSRIITQNAMNY-ESSKHLSVVHERTNQVTMVHWRPAATGWTVINTDGAFKST 551 Y P SR I NY ++ + ++++ E V V W W INTDGA 
+ Sbjct: 504 EYVRPCLSSRYIHNYVENYYKAHRSIALIMEHPRVVINVRWEAPLVSWNSINTDGAVQH- 562 Query: 552 SNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGVIDGINLAASKGYTDIELQLDSS 731 +AGCG V+R+ G WIGG A+ + +A++AELWG +G+ LA +G ++ELQ+DS Sbjct: 563 -GIAGCGGVLRDHRGAWIGGFAKNIGTTNAFIAELWGAYEGLCLARRRGLINVELQIDSL 621 Query: 732 IVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHIYREANKVADALANLGCTFLEGF 911 VV + G ++GS GR L RRIR L+ NVRI H+YREANKVADALA++GC + G Sbjct: 622 AVVKTIGGESIGSNGGRSLTRRIRRLIQEEWNVRIRHVYREANKVADALASIGCQSV-GC 680 Query: 912 SIIDTPPPSISLLVDADVVGVSTPRVI 992 + D PP + L AD +GV+TPR + Sbjct: 681 ILFDDPPAGVDQLCFADRLGVTTPRSV 707 >gb|PNX81608.1| ribonuclease H [Trifolium pratense] Length = 630 Score = 206 bits (524), Expect = 1e-57 Identities = 108/271 (39%), Positives = 161/271 (59%), Gaps = 1/271 (0%) Frame = +3 Query: 189 SLWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQWCNVWATGCYYLWYWRNKYVH 368 S+W + + + + F+ L +W+ NM + L W +VWAT C++LW WRN+ H Sbjct: 361 SVWCNLLNGKDKDWFFTAALDEWIILNMKKQLGRDNNLSWASVWATSCHFLWLWRNRETH 420 Query: 369 EEGYSPPNCPSRIITQNAMNY-ESSKHLSVVHERTNQVTMVHWRPAATGWTVINTDGAFK 545 + P P ++I + M Y E+ V+ R V W+ GW +NTDGA + Sbjct: 421 GDSRLRPLQPWKLILKWVMQYFEADVSGIVIANRQKVEITVTWQCPEDGWLSLNTDGASR 480 Query: 546 STSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGVIDGINLAASKGYTDIELQLD 725 ++ AGCG ++RNS+G W+GG +R L +AY+AELWGV DG+ LA KG +++ +D Sbjct: 481 GHTS-AGCGGLLRNSEGQWLGGFSRNLGRCNAYIAELWGVHDGLCLARDKGAKKLKVYVD 539 Query: 726 SSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHIYREANKVADALANLGCTFLE 905 SS+VV+ L T GS+ G +L++ IR LLA +++ H YREAN ADALAN+GC Sbjct: 540 SSVVVHTLNSTTGGSVVGWRLIQEIRRLLALDWEIKVCHSYREANACADALANMGCDHGP 599 Query: 906 GFSIIDTPPPSISLLVDADVVGVSTPRVIPV 998 G + + P +SLL+ AD +G++TPRVI V Sbjct: 600 GIRVYEQCPSRLSLLLLADTMGITTPRVIVV 630 >gb|PNX94788.1| ribonuclease H [Trifolium pratense] Length = 509 Score = 201 bits (511), Expect = 1e-56 Identities = 106/297 (35%), Positives = 157/297 (52%), Gaps = 2/297 (0%) Frame = +3 Query: 114 TRLLSTNIHLFLTCRQYWHAKQLYISLWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLE 293 T+ + IH+ C + H LW I + R 
F+ +E +W++ N+ N Sbjct: 220 TQFEESTIHVVRDCPRAVH-------LWRHLISNQERGYFFVIEFDEWIHLNLNNKFGQN 272 Query: 294 VVLQWCNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNYESSK--HLSVVHER 467 W +WAT CY LW WRNK +H++ + P P +++ Y+ S V H R Sbjct: 273 YGNDWKAIWATTCYLLWLWRNKSIHDDEFVIPERPWQVVMDYVAAYKHSMLTEEQVGHGR 332 Query: 468 TNQVTMVHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYL 647 Q + W GW +N+DGA K + + AGCG V+RN +G WI G + L +AY+ Sbjct: 333 VQQQVDITWLAPPPGWFALNSDGAAKLSESKAGCGGVLRNENGNWIEGFTKALGDTTAYM 392 Query: 648 AELWGVIDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSN 827 AELWG+ +G+ LA + +EL+ DS ++ L GS G L+++IR LL Sbjct: 393 AELWGIYEGLRLAQRRDVMKLELRTDSQVIAQSLQDRKRGSNMGCALLKKIRSLLDGPWE 452 Query: 828 VRILHIYREANKVADALANLGCTFLEGFSIIDTPPPSISLLVDADVVGVSTPRVIPV 998 V+I+H++REAN+ AD LAN+G GF PPP + +VD D+ GVS PR+I V Sbjct: 453 VKIIHVFREANRCADMLANMGSEGPIGFEFFANPPPRVMQIVDDDIRGVSFPRLISV 509 >dbj|GAU43502.1| hypothetical protein TSUD_398950 [Trifolium subterraneum] Length = 1962 Score = 206 bits (525), Expect = 7e-56 Identities = 105/267 (39%), Positives = 158/267 (59%), Gaps = 1/267 (0%) Frame = +3 Query: 195 WNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQWCNVWATGCYYLWYWRNKYVHEE 374 W +P EVR F+ L +WL+ N+ + K+ + +WC+ WA C W WRNK +HE+ Sbjct: 1695 WQQIVPFEVRGAFFMSNLQNWLHINVNYAGKVAIGQRWCDFWALACSCFWTWRNKELHED 1754 Query: 375 GYSPPNCPSRIITQNAMNYESS-KHLSVVHERTNQVTMVHWRPAATGWTVINTDGAFKST 551 + P+ I + NY+++ + + V+ ++ + + W+P W +NTDGA K Sbjct: 1755 NFMRPSNIILHIRKLGENYKNATRAMEVMGQKEKIIAHIRWKPPEGVWVKLNTDGACKE- 1813 Query: 552 SNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGVIDGINLAASKGYTDIELQLDSS 731 N AGCG VVR + G W+GG A+ + SA++AELWGV++G+ L G+ ++EL +DS Sbjct: 1814 DNRAGCGGVVRGNQGEWLGGFAKGVGKCSAFVAELWGVLEGLLLVQRMGFANVELSIDSK 1873 Query: 732 IVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHIYREANKVADALANLGCTFLEGF 911 IVV+ LT S+ G +VR++R LL NVR+ H YREANK ADALAN+GCT Sbjct: 1874 IVVHALTSGTATSVDGYAIVRKVRRLLLLDWNVRVTHEYREANKCADALANIGCTLDMEC 1933 Query: 912 SIIDTPPPSISLLVDADVVGVSTPRVI 992 + P I ++ AD +G S+PR+I Sbjct: 1934 
TYFQECPAEIRHILLADELGTSSPRLI 1960 >gb|PNX97917.1| ribonuclease H [Trifolium pratense] Length = 712 Score = 201 bits (511), Expect = 3e-55 Identities = 106/267 (39%), Positives = 154/267 (57%), Gaps = 1/267 (0%) Frame = +3 Query: 195 WNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQWCNVWATGCYYLWYWRNKYVHEE 374 W +P EVR F+ L +WL+ N+ + KL + +WC+ WA C W WRNK +H E Sbjct: 445 WQQIVPQEVRGAFFMSSLQNWLHINVNYAGKLAIGGRWCDFWALACSCFWTWRNKELHGE 504 Query: 375 GYSPPNCPSRIITQNAMNYESSKH-LSVVHERTNQVTMVHWRPAATGWTVINTDGAFKST 551 P+ + + + NY + H + VV + + V +HW+P + +NTDGA K+ Sbjct: 505 KIVWPSNIIQHVRKLGENYRKALHTMEVVEQHESIVAHIHWKPPEGVFVKLNTDGASKA- 563 Query: 552 SNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGVIDGINLAASKGYTDIELQLDSS 731 N AGCG V+R + G W+GG A+ + SA++AELWGV++G+ L G+ ++EL +DS Sbjct: 564 GNRAGCGGVIRGNQGEWLGGFAKGVGNCSAFVAELWGVLEGLLLVQRMGFENVELSIDSK 623 Query: 732 IVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHIYREANKVADALANLGCTFLEGF 911 VV+ +T S G +VR+IR LL NV++LH YREANK ADALAN GC Sbjct: 624 AVVHVITAGKATSADGYAIVRKIRRLLLMDWNVKVLHEYREANKCADALANTGCILDLEL 683 Query: 912 SIIDTPPPSISLLVDADVVGVSTPRVI 992 P I ++ AD +G+STPR+I Sbjct: 684 IFYQECPMEIRNILLADELGISTPRII 710 >dbj|GAU19541.1| hypothetical protein TSUD_303540 [Trifolium subterraneum] Length = 642 Score = 198 bits (504), Expect = 1e-54 Identities = 103/270 (38%), Positives = 151/270 (55%), Gaps = 3/270 (1%) Frame = +3 Query: 192 LWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQWCNVWATGCYYLWYWRNKYVHE 371 +W++ + R FY L DW+ N+ E ++W VWA GC+ LW WRN+ H Sbjct: 378 VWSNLLNDTARYSFYHTNLEDWICMNLHKELGKETNVRWSCVWAVGCHVLWLWRNRECHG 437 Query: 372 EGYSPPNCPSRIITQNAMNYESSKHLSV---VHERTNQVTMVHWRPAATGWTVINTDGAF 542 + P + I Y+ + S+ VH++ + W A W +NTDGA Sbjct: 438 DMRVRPTQLWQTILYMVQQYKQADIKSIALPVHQKVE--VPIGWNKPAGDWIKLNTDGAS 495 Query: 543 KSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGVIDGINLAASKGYTDIELQL 722 + CG ++RNS+G W+GG +R L +AYLAELWGV+DG+N +G+ IEL + Sbjct: 496 RPR-----CGGLLRNSNGQWLGGFSRHLGRCNAYLAELWGVLDGLNFTYERGHKKIELHI 550 Query: 723 DSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHIYREANKVADALANLGCTFL 
902 DS++VV L G + G ++++ IR LLA +V+I H YREAN ADALANLGC Sbjct: 551 DSNVVVQTLHSARDGGVVGWRIIQEIRRLLALDWDVKICHSYREANACADALANLGCDHG 610 Query: 903 EGFSIIDTPPPSISLLVDADVVGVSTPRVI 992 G + + PP +S L+ AD +G++TPRV+ Sbjct: 611 PGLRVYEQCPPKVSSLLLADAMGITTPRVV 640 >dbj|GAU13938.1| hypothetical protein TSUD_262650 [Trifolium subterraneum] Length = 875 Score = 199 bits (506), Expect = 8e-54 Identities = 108/294 (36%), Positives = 157/294 (53%), Gaps = 1/294 (0%) Frame = +3 Query: 120 LLSTNIHLFLTCRQYWHAKQLYISLWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVV 299 L T +H C L I W +P E RS F+ + DW++ N++ Sbjct: 590 LEETTLHAIRDCA-------LIIPFWLQVVPMEDRSSFFMEDTQDWISRNLMKGRTRRRG 642 Query: 300 LQWCNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNYESSKHLS-VVHERTNQ 476 WC+ WAT C+ LW WRNK H+E + P P + + Y+ +K S ++ R Sbjct: 643 SDWCDFWATTCHSLWMWRNKEAHDEEFVRPMQPVNYVQKRVEEYQHAKQASDLLDGREYT 702 Query: 477 VTMVHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAEL 656 + + W+P + + +NTDGA K +N AGCG ++R + G W+GG A+ + SA++AEL Sbjct: 703 LVDIGWKPPSGSFVKLNTDGARKD-NNKAGCGGIIRGNHGEWLGGFAKGVGECSAFIAEL 761 Query: 657 WGVIDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRI 836 WGV +G++LA + +EL +DS VV ++ + S G LV IR LL V I Sbjct: 762 WGVFEGLSLAKRMCFRKVELHIDSVAVVQVISTGKLKSKLGWSLVLNIRKLLDLDWEVTI 821 Query: 837 LHIYREANKVADALANLGCTFLEGFSIIDTPPPSISLLVDADVVGVSTPRVIPV 998 H YRE NK ADALAN+GC + PP + LV ADV+G++TPR+I V Sbjct: 822 THAYRETNKCADALANIGCQLGREIIFFEDCPPHMKDLVLADVMGITTPRMISV 875 >gb|PNY17666.1| ribonuclease H [Trifolium pratense] Length = 856 Score = 198 bits (504), Expect = 1e-53 Identities = 109/289 (37%), Positives = 160/289 (55%), Gaps = 1/289 (0%) Frame = +3 Query: 129 TNIHLFLTCRQYWHAKQLYISLWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQW 308 T +H+ CR + +W IP + S F+G ++ W+ NM+ + + + W Sbjct: 575 TVLHVLRDCR-------VASFVWRYLIPATLWSVFFGGDMAQWVQFNMVQA-RHDGRKSW 626 Query: 309 CNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNYESS-KHLSVVHERTNQVTM 485 WAT CY LW WRN+ VH+ YS P II + +Y+ + + H R Q M Sbjct: 627 
PKTWATACYMLWRWRNREVHDAEYSRPTNTPVIIRRAVADYDMGLRSQNPEHYREMQRRM 686 Query: 486 VHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGV 665 V W+ A GW INTDGA K +AG G V+R+ +G W+ +R + +A+ AELWG+ Sbjct: 687 VQWQRPAEGWICINTDGAAKERGKIAGSGRVMRDHNGIWLCSFSRFIGEATAFDAELWGI 746 Query: 666 IDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHI 845 DG+ +A +GY +ELQ+ S VV+CLT NM LVRRIR ++ V I H+ Sbjct: 747 FDGLTIARRQGYQHVELQIHSHNVVSCLT-NNMERNGLCILVRRIRSIMQENWRVVIKHV 805 Query: 846 YREANKVADALANLGCTFLEGFSIIDTPPPSISLLVDADVVGVSTPRVI 992 YREANK+AD LA+L C S+ + PP ++ + D++GVSTPR++ Sbjct: 806 YREANKIADGLASLACVTKVSLSLYEQPPTEVAQVYHDDLIGVSTPRLV 854 >dbj|GAU31120.1| hypothetical protein TSUD_212270 [Trifolium subterraneum] Length = 347 Score = 188 bits (478), Expect = 2e-53 Identities = 100/235 (42%), Positives = 135/235 (57%), Gaps = 1/235 (0%) Frame = +3 Query: 192 LWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQWCNVWATGCYYLWYWRNKYVHE 371 +WN +P + R F+ W NMLN KLE +W VWA CY+LW WRNK + Sbjct: 114 VWNHLLPMQTRLGFFTCHYHSWFQHNMLNYEKLEGGNEWRVVWAVTCYHLWLWRNKETFD 173 Query: 372 EGYSPPNCPSRIITQNAMNYESSKHLS-VVHERTNQVTMVHWRPAATGWTVINTDGAFKS 548 + P S++I NY S+K S + ++ V W GW +NTDGA + Sbjct: 174 YEFVRPRYASQVIHHYVENYISAKSSSSFIMDKARININVRWEAPRNGWISLNTDGAVQH 233 Query: 549 TSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGVIDGINLAASKGYTDIELQLDS 728 VAGCG V+R+ G WI G ++ + S + AELWGV G+ LA +G +IELQ+DS Sbjct: 234 --GVAGCGGVLRDYQGNWITGFSKFIGTASVFNAELWGVYAGLCLARQRGINNIELQIDS 291 Query: 729 SIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHIYREANKVADALANLGC 893 VV L ++GS G+ LVRR+R+L NVRI H+YREAN+VADALA++GC Sbjct: 292 LAVVRNLGDDSLGSSEGKSLVRRVRNLFQEGLNVRIQHVYREANRVADALASMGC 346 >gb|PNY12420.1| ribonuclease H [Trifolium pratense] Length = 1341 Score = 196 bits (498), Expect = 3e-52 Identities = 105/294 (35%), Positives = 162/294 (55%), Gaps = 1/294 (0%) Frame = +3 Query: 120 LLSTNIHLFLTCRQYWHAKQLYISLWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVV 299 ++ T IH+ C + + +W + +P R F + +W+ N+ NS + Sbjct: 1056 
VVETAIHVMRDCPKA-------MQIWVTVVPANDRGSFLMGNVKNWVCFNLQNSVTWDRR 1108 Query: 300 LQWCNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNY-ESSKHLSVVHERTNQ 476 QW WA C+ LW+WRNK +H+E + P P + + + +Y + + +VV ERT Sbjct: 1109 GQWREYWAQACHCLWFWRNKDIHDEDFVRPTRPVQQVMKLLGDYMHAFNNNNVVLERTRS 1168 Query: 477 VTMVHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAEL 656 + + W P + +NTDGA+K + AGCG V+R +G W+GG A+ + + SA++AEL Sbjct: 1169 IRWIGWSPPKMNFVKLNTDGAYKE-NRAAGCGGVIRGCEGEWLGGYAKGVGLCSAFVAEL 1227 Query: 657 WGVIDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRI 836 WGV++G+ G+T +EL +DS VV + +GS +G LV++I +L V I Sbjct: 1228 WGVLEGLRYVHHIGFTMVELNIDSEAVVKVVKARQLGSSSGAALVKQIWRMLDMNWKVEI 1287 Query: 837 LHIYREANKVADALANLGCTFLEGFSIIDTPPPSISLLVDADVVGVSTPRVIPV 998 H YREANK ADALANLG T + D P I + AD +G++ PR+IPV Sbjct: 1288 SHTYREANKCADALANLGSTLDKELIFFDDCPSHIREICTADRLGITNPRLIPV 1341 >gb|PNX81975.1| ribonuclease H, partial [Trifolium pratense] Length = 240 Score = 182 bits (461), Expect = 3e-52 Identities = 89/231 (38%), Positives = 139/231 (60%), Gaps = 2/231 (0%) Frame = +3 Query: 306 WCNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNYESSKHLSVVHERTNQ--V 479 W N WA C+Y+W WRNK + + YS P + ++ ++ NY + + + + + V Sbjct: 9 WGNYWAYACHYIWSWRNKEKYNDNYSRPYNQAGVVLRSLKNYTMAMNATTTTQNVYRPCV 68 Query: 480 TMVHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELW 659 +V W+P GW +++ D + + ++ G G ++R SDG W+GG ++ + ++SAY+AELW Sbjct: 69 VLVRWKPPLVGWVLLDADESCREDGHI-GFGGIIRGSDGEWLGGSSKYIGIESAYVAELW 127 Query: 660 GVIDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRIL 839 G+++G+ A S + IE+ +DS V+ ++ GS GR LV +IR LLA V + Sbjct: 128 GLLEGLMYARSLQFKFIEVHVDSLAVMQVVSSHENGSWKGRTLVEKIRRLLALDWEVVVH 187 Query: 840 HIYREANKVADALANLGCTFLEGFSIIDTPPPSISLLVDADVVGVSTPRVI 992 H YREAN ADALAN GC+ GFS D P SI+ L+ D++G +TPRVI Sbjct: 188 HSYREANCCADALANYGCSLTSGFSFFDVCPSSINNLLFFDLLGYTTPRVI 238 >dbj|GAU38338.1| hypothetical protein TSUD_61990 [Trifolium subterraneum] Length = 813 Score = 194 bits (492), Expect = 5e-52 Identities = 97/268 (36%), 
Positives = 155/268 (57%), Gaps = 1/268 (0%) Frame = +3 Query: 192 LWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQWCNVWATGCYYLWYWRNKYVHE 371 +WN IP + ++ FY +L W++ N+ N + +WC+ WA CY LW WRNK +HE Sbjct: 545 IWNYVIPSKDKAIFYMGDLKQWISFNINNYMRWMSSGKWCDFWAYCCYCLWQWRNKEIHE 604 Query: 372 EGYSPPNCPSRIITQNAMNYE-SSKHLSVVHERTNQVTMVHWRPAATGWTVINTDGAFKS 548 E + P P + I Q +Y ++ ++++V + ++M+ W P + + +NTDGA+K Sbjct: 605 EQFVRPIRPVQYIMQLLSDYVCAASNINIVSGKNQVISMIRWNPPSDLFVKLNTDGAYKE 664 Query: 549 TSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGVIDGINLAASKGYTDIELQLDS 728 + + GCG V R + G W+GG A+ + + S ++AE WGV +G++ G+ + L +DS Sbjct: 665 NA-IVGCGGVTRGNHGEWLGGFAKCVGICSVFVAESWGVFEGLSYVHRLGFRKVVLHIDS 723 Query: 729 SIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHIYREANKVADALANLGCTFLEG 908 +VV + + S AG L+ +I LL V + H YREAN ADALANLGC+ Sbjct: 724 EVVVRVIKNGSSDSSAGSSLLTQIWRLLEMDWIVEVSHTYREANNCADALANLGCSLDYD 783 Query: 909 FSIIDTPPPSISLLVDADVVGVSTPRVI 992 I + PP I + D D++G+S+PR+I Sbjct: 784 TVIFNDFPPQIRNIFDTDLMGISSPRLI 811 >gb|PNY00696.1| ribonuclease H [Trifolium pratense] Length = 276 Score = 181 bits (459), Expect = 2e-51 Identities = 93/232 (40%), Positives = 139/232 (59%), Gaps = 1/232 (0%) Frame = +3 Query: 306 WCNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNYESSKHLSVVHERTNQVTM 485 WC+ WAT C+ LW WRNK +H++ + P + ++ +Y +K + V V++ Sbjct: 46 WCDFWATACHNLWIWRNKELHDDDFVRPMYAVQYVSNKVEDYMQAKKATEVLRCREVVSI 105 Query: 486 -VHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWG 662 + W P +NTDGA K +N+A CG ++R S G W+GG A+ + SA++AELWG Sbjct: 106 QIGWIPPTRDRVKLNTDGARKH-NNIAWCGGIIRGSQGEWLGGFAKGVGNCSAFVAELWG 164 Query: 663 VIDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILH 842 V +G++ A G+ +EL +DS VVN LT + S+A LVR IR L+A V I+H Sbjct: 165 VYEGLSYARRLGFMKVELNIDSVTVVNVLTKGTLQSLARAMLVRNIRSLIALDWEVSIVH 224 Query: 843 IYREANKVADALANLGCTFLEGFSIIDTPPPSISLLVDADVVGVSTPRVIPV 998 YRE+N+ ADAL N+GCT + + D P I L+ ADV+G++TPR++ V Sbjct: 225 AYRESNQCADALVNIGCTLDKEIIVYDDCPSEIKDLLLADVLGITTPRLLHV 276 >dbj|BAE71259.1| hypothetical protein [Trifolium 
pratense] Length = 553 Score = 188 bits (477), Expect = 2e-51 Identities = 98/285 (34%), Positives = 156/285 (54%), Gaps = 5/285 (1%) Frame = +3 Query: 153 CRQYWHAKQLYI----SLWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQWCNVW 320 C++ W +L+ + +S P R+ F+ L W+ N+ K + WC+ W Sbjct: 268 CKRMWQLFRLHSYVCQQIDDSCNVPLCRNTFFMENLQQWIYSNLSKGAKGRIDSAWCDYW 327 Query: 321 ATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNY-ESSKHLSVVHERTNQVTMVHWR 497 AT C+ W WRNK +H+ ++ P + + + +Y +++K ++ + + + + W+ Sbjct: 328 ATACHCFWTWRNKEIHDAEFTRPIYAMQHVYRRVNDYYQANKTNRLMRIKNDMLVYISWK 387 Query: 498 PAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGVIDGI 677 P + +NTDGA K N GCG ++R S G W+GG AR L SA +AELWGV +G+ Sbjct: 388 PPCGSYVKLNTDGACKD-QNRGGCGGIIRGSQGEWLGGFARGLGNCSAIIAELWGVAEGL 446 Query: 678 NLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHIYREA 857 + A G+T +EL +DS +VV + S G LV+ IR +L ++I H YRE+ Sbjct: 447 SYARRLGFTAVELNVDSVVVVQAIKTGRFSSSVGLPLVKHIRRMLDLDWEIKIEHAYRES 506 Query: 858 NKVADALANLGCTFLEGFSIIDTPPPSISLLVDADVVGVSTPRVI 992 NK ADA+AN+GC D+ P SI L+ AD +G++TPR+I Sbjct: 507 NKCADAMANIGCHLDRETIFYDSCPISIKELLLADELGITTPRII 551 >dbj|GAU31911.1| hypothetical protein TSUD_270960 [Trifolium subterraneum] Length = 853 Score = 191 bits (484), Expect = 8e-51 Identities = 99/258 (38%), Positives = 145/258 (56%), Gaps = 1/258 (0%) Frame = +3 Query: 129 TNIHLFLTCRQYWHAKQLYISLWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQW 308 T IH+ C + ++ WN IP R FY E+ W+N N+ NS K W Sbjct: 586 TIIHVMRDC-------PIAVNFWNQVIPVVDRGVFYMGEINQWMNFNLNNSIKWINNGNW 638 Query: 309 CNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNYESSKHLS-VVHERTNQVTM 485 C+ WA GC+ LW WRN+ +HE+ + P+ P + + AM Y + S + RT ++M Sbjct: 639 CSFWALGCFCLWKWRNQELHEDSFVRPSMPVHHVGRMAMEYRKAMSNSELALGRTRAISM 698 Query: 486 VHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAELWGV 665 + W P + +NTDGA K +AGCG +VR S+G WIGG A+ + + +A++AE+WGV Sbjct: 699 IRWSPPKANFVKLNTDGACKEHI-IAGCGGIVRGSEGEWIGGFAKCVGMCNAFIAEMWGV 757 Query: 666 IDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRILHI 845 ++G+ G+ 
+EL +DS+ VV + + S G L R+I L+A V + HI Sbjct: 758 LEGLRYVRRLGFRKVELNIDSAAVVQVIKTGRLQSSTGSALARQIWKLMAMDWEVEVNHI 817 Query: 846 YREANKVADALANLGCTF 899 YREANK ADALAN+G F Sbjct: 818 YREANKCADALANMGSNF 835 >gb|PNY15602.1| ribonuclease H [Trifolium pratense] Length = 437 Score = 181 bits (460), Expect = 7e-50 Identities = 104/292 (35%), Positives = 154/292 (52%), Gaps = 4/292 (1%) Frame = +3 Query: 129 TNIHLFLTCRQYWHAKQLYISLWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVVLQW 308 T+IH C + ++W S +P + R F+G L W+N N+ + ++W Sbjct: 155 TSIHALRDCT-------VVRNMWLSVVPCDSRGLFFGGGLESWINYNLSSDIDRINGIRW 207 Query: 309 CNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNYESS----KHLSVVHERTNQ 476 + WAT C++LW WRNK H E +S P I Q +Y ++ KH++ + + Sbjct: 208 EDFWATACHFLWSWRNKEQHVEEFSRPVQAIVHILQCCSHYYNACMELKHVNTIQKG--- 264 Query: 477 VTMVHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAEL 656 V + W+ GW +NTDGA K ++GCG ++R+ G W GG A+ + S +AEL Sbjct: 265 VCWIGWKVPEEGWVKLNTDGASKG-EGLSGCGGIIRDHQGNWCGGFAKFVGTGSVLIAEL 323 Query: 657 WGVIDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRI 836 WGV++G+ L KGY +E+ +DS VV + S G LV+ I LL V+I Sbjct: 324 WGVLEGLKLVWRKGYRKVEVNIDSISVVKMILNGMTSSALGFSLVKSIGRLLDEQWEVKI 383 Query: 837 LHIYREANKVADALANLGCTFLEGFSIIDTPPPSISLLVDADVVGVSTPRVI 992 H YRE NK ADALA++GC +T P SI + ADV+G +TPR+I Sbjct: 384 SHSYRETNKCADALASMGCILDCNIVFFETCPSSIKNVFSADVMGRNTPRLI 435 >dbj|GAU10071.1| hypothetical protein TSUD_422240 [Trifolium subterraneum] Length = 306 Score = 177 bits (450), Expect = 9e-50 Identities = 89/260 (34%), Positives = 144/260 (55%), Gaps = 1/260 (0%) Frame = +3 Query: 120 LLSTNIHLFLTCRQYWHAKQLYISLWNSFIPPEVRSDFYGVELLDWLNGNMLNSTKLEVV 299 ++ T++H+ C L W +P RS F+ +L W+ N+ + Sbjct: 48 VVETSMHVMRDC-------SLVTPFWLQVVPMRERSTFFTEDLQQWIITNINTGRRGNNG 100 Query: 300 LQWCNVWATGCYYLWYWRNKYVHEEGYSPPNCPSRIITQNAMNYESSKHLSVVHERTNQV 479 WC++WAT C+ LW WRNK +H+ Y P+ + + + +Y ++ ++ + + + Sbjct: 101 SAWCDIWATACHCLWTWRNKEMHDNNYVRPSHMVQHVYKIVEDYSQARRVNELMRNSERF 160 Query: 480 
-TMVHWRPAATGWTVINTDGAFKSTSNVAGCGAVVRNSDGFWIGGIARRLFVDSAYLAEL 656 T + W+P + +NTDGA K +N+AGCG ++R + G W+GG A+ + V SA+ AEL Sbjct: 161 FTQIGWQPPCGHFVKLNTDGACKD-NNIAGCGGIIRGAHGEWLGGFAKGVGVCSAFAAEL 219 Query: 657 WGVIDGINLAASKGYTDIELQLDSSIVVNCLTGTNMGSIAGRQLVRRIRHLLACFSNVRI 836 WGV++G+ A G+T +EL +DS VV + + S G LV++IR L + I Sbjct: 220 WGVLEGLQYARRLGFTAVELNIDSITVVQVIKTGRLSSPIGLPLVKQIRRYLELDWEIMI 279 Query: 837 LHIYREANKVADALANLGCT 896 H YRE+NK ADALA++GCT Sbjct: 280 AHAYRESNKCADALASIGCT 299