BLASTX nr result
ID: Astragalus22_contig00020811
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus22_contig00020811 (820 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PNX71533.1| ribonuclease H [Trifolium pratense] 207 6e-58 dbj|GAU38301.1| hypothetical protein TSUD_157860 [Trifolium subt... 193 1e-56 gb|PNY14301.1| ribonuclease H [Trifolium pratense] 203 1e-55 gb|PNY15111.1| ribonuclease H [Trifolium pratense] 201 4e-55 gb|PNX72264.1| ribonuclease H [Trifolium pratense] 193 1e-52 dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subte... 194 2e-52 gb|AFK38936.1| unknown [Medicago truncatula] 164 2e-45 dbj|GAU35964.1| hypothetical protein TSUD_207680 [Trifolium subt... 144 6e-39 dbj|GAU30026.1| hypothetical protein TSUD_161120 [Trifolium subt... 152 6e-38 dbj|GAU47092.1| hypothetical protein TSUD_369250 [Trifolium subt... 144 2e-37 dbj|GAU41525.1| hypothetical protein TSUD_140560 [Trifolium subt... 145 1e-35 dbj|GAU29911.1| hypothetical protein TSUD_148190 [Trifolium subt... 134 1e-32 dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subte... 131 1e-30 dbj|GAU39667.1| hypothetical protein TSUD_60340 [Trifolium subte... 129 7e-30 gb|PNX85413.1| ribonuclease H [Trifolium pratense] >gi|133524134... 117 2e-28 gb|PNX58626.1| ribonuclease H, partial [Trifolium pratense] 114 3e-27 gb|PNY04967.1| ribonuclease H [Trifolium pratense] 114 3e-27 dbj|GAU21787.1| hypothetical protein TSUD_329120, partial [Trifo... 119 9e-27 dbj|GAU32098.1| hypothetical protein TSUD_292220 [Trifolium subt... 112 3e-26 dbj|GAU37237.1| hypothetical protein TSUD_375390 [Trifolium subt... 112 6e-26 >gb|PNX71533.1| ribonuclease H [Trifolium pratense] Length = 798 Score = 207 bits (527), Expect = 6e-58 Identities = 113/280 (40%), Positives = 162/280 (57%), Gaps = 14/280 (5%) Frame = +2 Query: 2 SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181 SNL KK I LD+ CPLC++ E+ S+H+FL+C++ +L LFAS L H P+ D+ + Sbjct: 500 SNLHKKGITLDLLCPLCSSEEES-----SQHLFLKCDMFKLTLFASHLGSHIPIDIDLHD 554 Query: 182 WLLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSS 352 W+LKWL C+D G QLFC ++W+ +N +FN + DP +A A+ F+ E+ AN S Sbjct: 555 WILKWLVCQDPLGVQLFCTLLWKFWAGRNAVIFNGWQMDPTFLALDALSFVQEFNEANPS 614 Query: 353 RLHFSLQEQQAPIPANHLGT---CLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI 523 R +L Q P+ T ++VD G N G T WGL + N +G+ ++SACKR++I Sbjct: 615 RNRRALVSQSISEPSRSTCTSMNSMFVDAGCCNSGHTVWGLVLRNLNGETVFSACKREDI 674 Query: 524 --------AVDIRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLL 679 A+ +RW +Q + +IYSDA V CI+KR + A+I I QDCR+L+ Sbjct: 675 TAEPLLAEALGVRWALQVATDQGINSVSIYSDAANVVNCINKRSNFAAINLIAQDCRNLM 734 Query: 680 ELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPPV 799 + SV I R N AH LV LAK G+ WLG P+ Sbjct: 735 AGLGNVSVMFISRTQNCDAHNLVSLAKVVGSRTWLGVAPL 774 >dbj|GAU38301.1| hypothetical protein TSUD_157860 [Trifolium subterraneum] Length = 317 Score = 193 bits (491), Expect = 1e-56 Identities = 105/280 (37%), Positives = 152/280 (54%), Gaps = 12/280 (4%) Frame = +2 Query: 5 NLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEW 184 NL KK I D++CPLC+ E+ S H+F+ CN+ RL LFAS L H P+ D+ W Sbjct: 26 NLSKKGINFDLSCPLCHHGLES-----SNHLFMNCNLMRLTLFASNLGSHIPVSVDVSVW 80 Query: 185 LLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSR 355 +L WLTC+D G QLFC+++W+ +N+ +F DPI +A A ++ E+ AN R Sbjct: 81 ILSWLTCKDMIGTQLFCVLLWKFWYGRNQVIFKGVVLDPIALAAEAALYVHEFNEANPRR 140 Query: 356 LHFSLQEQQAPIPANHLG-TCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD 532 + +Q + + ++ D G FN+G T WG+ + N DG +SACKR+ I V+ Sbjct: 141 CSQVVLQQASVSRLDDANMQLMFTDAGCFNNGYTGWGIVLRNVDGTTSFSACKREEIEVE 200 Query: 533 --------IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELI 688 +RW +Q L+ + I SDA V CI+KR +ASI+ I QDCR LL Sbjct: 201 PAVAEALGVRWALQLSLDQHLDNFIILSDAANVVNCIAKRISLASIDLIAQDCRDLLCNF 260 Query: 689 LGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPPVNCN 808 S+ + R N+ AH + LAK G+ W+G P N Sbjct: 261 SNVSIKFVGRALNIDAHNVASLAKCVGSRTWVGSAPTVSN 300 >gb|PNY14301.1| ribonuclease H [Trifolium pratense] Length = 1196 Score = 203 bits (516), Expect = 1e-55 Identities = 113/279 (40%), Positives = 158/279 (56%), Gaps = 13/279 (4%) Frame = +2 Query: 5 NLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEW 184 NL KK I LD++CPLC+ E+ S H+F+ CN+ RL LFAS L H P D+ W Sbjct: 905 NLAKKGINLDLSCPLCHHVLES-----SNHLFMHCNIMRLTLFASNLGSHIPHSVDLSVW 959 Query: 185 LLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSR 355 +L WLTC+D G QLFC+++W+ +N+ +F + DPI +A A+E++ E+ AN R Sbjct: 960 ILSWLTCKDMIGTQLFCVLLWKFWYGRNQVIFKDAVFDPILLAADAIEYVHEFNEANPRR 1019 Query: 356 LH-FSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD 532 + LQ AP + ++ D G FN+G T WGL + N DG +SACKR+NI V+ Sbjct: 1020 CNQVVLQHISAPRLDDSNMQLMFTDAGCFNNGYTGWGLVLRNVDGTTSFSACKRENIEVE 1079 Query: 533 --------IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELI 688 +RW ++ L + I SDA V CI+KR +ASIE I QDCR LL Sbjct: 1080 PALAEALGVRWALEFALAQHLDNIIILSDAANVVNCIAKRTVLASIELIAQDCRDLLCNF 1139 Query: 689 LGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLG-PPPVN 802 S+ + R N+ AH + LAK G+ W+G PPV+ Sbjct: 1140 SNVSIKFVSRVSNVDAHNVASLAKFVGSRTWIGNAPPVS 1178 >gb|PNY15111.1| ribonuclease H [Trifolium pratense] Length = 1334 Score = 201 bits (512), Expect = 4e-55 Identities = 110/284 (38%), Positives = 163/284 (57%), Gaps = 14/284 (4%) Frame = +2 Query: 2 SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181 +NL +K +Q++ CP C++APET++HLF L C++T+L FAS+L P + Sbjct: 1045 ANLVRKGVQIENLCPQCHSAPETIDHLF-----LHCHLTQLTWFASQLGARVPQSVPVHI 1099 Query: 182 WLLKWLTCRDTEGAQLFCIM---IWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSS 352 WLL+ LTC DT GAQLFC++ IW A+N VFNNK DPI IA+ A+ F+ E + S Sbjct: 1100 WLLQGLTCDDTRGAQLFCVLMWKIWNARNNLVFNNKLVDPIAIAQEAMYFMQEL--SPSP 1157 Query: 353 RLHFSLQEQQAPIPANHLGTC---LYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI 523 H + Q A + A + + YVD G F+ T WG+ + N G ++ SAC+++ I Sbjct: 1158 HEHNATPMQDAVLAAQPMPSAPHVFYVDAGCFSGNATGWGMVIYNQSGRVVLSACRKELI 1217 Query: 524 --------AVDIRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLL 679 A+ +RW +Q+ +E+N I SDA V CI+ HVA I+ ++QDC L+ Sbjct: 1218 DVEPVLAEAIGVRWCLQKAIELNMTDIVIVSDAATVVSCINSNKHVAVIDLVIQDCNLLI 1277 Query: 680 ELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPPVNCNA 811 E + V H+RR N+VAH L G + G + W+G P + +A Sbjct: 1278 EQLDSVVVTHVRRHLNVVAHGLAGFSNVVGTKLWMGVVPNSISA 1321 >gb|PNX72264.1| ribonuclease H [Trifolium pratense] Length = 854 Score = 193 bits (490), Expect = 1e-52 Identities = 106/282 (37%), Positives = 157/282 (55%), Gaps = 15/282 (5%) Frame = +2 Query: 2 SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181 +NL K I LD+ CPLC E+ S+H+FL+C++ +L LFAS L H P+ D+ + Sbjct: 557 TNLHNKGITLDLQCPLCFREEES-----SQHLFLKCDIFKLTLFASHLGSHIPMNIDLHD 611 Query: 182 WLLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSS 352 W+L+WL C+D G QLFC+++W+ +N AVFN + DP R+A A+ F+ ++ AN Sbjct: 612 WILEWLLCQDPMGVQLFCVLLWKFWAGRNAAVFNGVQLDPGRLAIDAMSFVHDFNEANPP 671 Query: 353 RLHFSLQEQQAPIPANHLGT----CLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDN 520 R + A +P T L+VD G N G T WGL + N+DG+ + SACKR++ Sbjct: 672 RCR---RAPVAHVPIQPGMTNPIFSLFVDAGCSNSGHTVWGLVLRNSDGETVLSACKRED 728 Query: 521 IAVD--------IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSL 676 VD +RW +Q V++ +IYSDA V CI++ A+I I +DCR L Sbjct: 729 FYVDPLMAEALGVRWALQLVVDQGINSVSIYSDAANVVNCINRNSSFAAINLIAEDCRKL 788 Query: 677 LELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPPVN 802 + + V + R N AH L LA+ GN W+G P++ Sbjct: 789 MNRLTNVCVLFVSRTQNSDAHNLASLARIMGNRTWVGVVPLS 830 >dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subterraneum] Length = 1475 Score = 194 bits (492), Expect = 2e-52 Identities = 107/277 (38%), Positives = 148/277 (53%), Gaps = 12/277 (4%) Frame = +2 Query: 2 SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181 +NL KK I LD+ CPLC+ E+ HLF L+C++ +L LFAS L H PLQ D+ + Sbjct: 1178 ANLHKKGISLDLQCPLCHHEVESTNHLF-----LQCDLMKLTLFASHLGSHMPLQVDLYD 1232 Query: 182 WLLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSS 352 W+ WLTC DT QLFC ++W+ +N VF K DP+ + + + F+ E+ AN Sbjct: 1233 WIFSWLTCHDTLDTQLFCTLLWKFWATRNNVVFRGDKLDPVCLVDEVMSFVQEFNEANPP 1292 Query: 353 RL-HFSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAV 529 R SL + ++VD G +G T WGL + N D +SACK D+IAV Sbjct: 1293 RQGRVSLPLTTVTPSISRPSFSVFVDAGCNLNGPTVWGLVLKNHDRITTFSACKYDDIAV 1352 Query: 530 D--------IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLEL 685 + +RW +Q V E I+SDA V CI + + +IE + QDCR LL Sbjct: 1353 EPVMAEALGVRWAIQFVREQGLHSVCIFSDAANVVDCICNKVKLDAIEMVAQDCRELLSS 1412 Query: 686 ILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPP 796 + SV +RRD N+ AH L LA+ GN W+G P Sbjct: 1413 LPNVSVLFVRRDQNIDAHNLASLARLVGNRTWVGAAP 1449 >gb|AFK38936.1| unknown [Medicago truncatula] Length = 297 Score = 164 bits (415), Expect = 2e-45 Identities = 94/280 (33%), Positives = 145/280 (51%), Gaps = 15/280 (5%) Frame = +2 Query: 5 NLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEW 184 NLR++ + LD CPLC A E+ S H+F+ C +T FAS L PP QTD+ W Sbjct: 14 NLRRRGVVLDTVCPLCFDADES-----SNHLFMACPMTLQVWFASPLGFQPPPQTDLNAW 68 Query: 185 LLKWLTCRDTEGAQLFCIMIWRA---KNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSR 355 L WL+ ++ QLFC+ +W+ +N+A+FN +P +A +A +F++E+ AN +R Sbjct: 69 LQSWLSAKEPLAVQLFCVCLWKIWFFRNQAIFNQVVFEPRMVAASAHDFVSEFNLANPTR 128 Query: 356 ----LHFSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI 523 L Q AP P + L +D G G WGL + N + +++++A + +I Sbjct: 129 SVDRLQIPAQVWIAP-PTDFLKA--NIDAGRDKHGKVTWGLVIRNHESEVLFAATQSPDI 185 Query: 524 AVD--------IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLL 679 D +RWG+Q VLE+ DA VV C + +ASI + DC L Sbjct: 186 MADPLLVETLGLRWGIQTVLELQLSNVMFELDASVVVKCFNGLSTIASISPFISDCHDLF 245 Query: 680 ELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPPV 799 ++G SV+ + R N+ AH L +AK G+ W+G P+ Sbjct: 246 GSLVGSSVSFVNRSCNVAAHELAQVAKSIGSRTWVGNAPL 285 >dbj|GAU35964.1| hypothetical protein TSUD_207680 [Trifolium subterraneum] Length = 198 Score = 144 bits (363), Expect = 6e-39 Identities = 79/180 (43%), Positives = 108/180 (60%), Gaps = 8/180 (4%) Frame = +2 Query: 2 SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181 + L+ K I LD+ CPLC+ LE + H+FL+C++ + LFAS L H PL TD+ Sbjct: 13 AKLKNKGISLDLLCPLCH-----LEEESASHLFLQCDLMKFTLFASHLGFHVPLNTDLHY 67 Query: 182 WLLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAAN-- 346 W+LKWLTC+D G+QLFC ++W+ A N VFN + +P+RIAE A+ F+ EY AAN Sbjct: 68 WILKWLTCQDALGSQLFCTLLWKFWTAINNVVFNGIQLEPVRIAEEAMSFVQEYNAANPI 127 Query: 347 -SSRLHFSLQE--QQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRD 517 R+ SL AP P ++VDVG G T WGL + N D + ++SACKRD Sbjct: 128 KRGRISSSLPNILPAAPRPL----FSIFVDVGCCVLGPTTWGLVIKNQDCNCVFSACKRD 183 >dbj|GAU30026.1| hypothetical protein TSUD_161120 [Trifolium subterraneum] Length = 1957 Score = 152 bits (384), Expect = 6e-38 Identities = 97/283 (34%), Positives = 146/283 (51%), Gaps = 20/283 (7%) Frame = +2 Query: 8 LRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEWL 187 L++K + LD CPLC A E SEH+F++C + + F+S L +H P Q + W+ Sbjct: 1419 LQRKGVILDTICPLCFEAEEN-----SEHLFMKCRLAQQTWFSSCLGLHVPSQMSLKNWM 1473 Query: 188 LKWLTCRDTEGAQLFCIM---IWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRL 358 +WL ++ +QLF I IW+ +N+ VF N DP IA AA +F E+ AN Sbjct: 1474 CEWLISKNQSASQLFGITLSKIWKGRNQVVFQNALFDPCHIAIAAADFTLEFNCANPPN- 1532 Query: 359 HFSLQEQQAP-IPANHLGTC-------LYVDVGSFNDGTTCWGLCVVNADGDIIYSACKR 514 E P I A C L VD G FNDG +G+ V + G++ ++A K Sbjct: 1533 -----EAAVPVITATETWCCPPTGMSKLNVDAGCFNDGLLGFGMVVRDNLGNVCFAATKL 1587 Query: 515 DNI--------AVDIRWGMQQVLEMNFVPDAIY-SDAQVVTLCISKRFHVASIEHIMQDC 667 + A+ +RW + +L N V I +D++VV C+ ++ IE+I+ DC Sbjct: 1588 EKKQASPTLAEALALRWCLHWILSSNQVGHFIVETDSEVVVKCLQGVSSLSEIENIILDC 1647 Query: 668 RSLLELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPP 796 ++ + CSV IRR N+VAH LVG+AK G+ +W+G P Sbjct: 1648 SDIMSNLSNCSVVFIRRCKNIVAHSLVGVAKHVGSRSWVGYIP 1690 >dbj|GAU47092.1| hypothetical protein TSUD_369250 [Trifolium subterraneum] Length = 335 Score = 144 bits (363), Expect = 2e-37 Identities = 92/283 (32%), Positives = 142/283 (50%), Gaps = 14/283 (4%) Frame = +2 Query: 2 SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181 + L KK + LD CPLC E EHLF + C +L FAS L +H P D+ Sbjct: 51 ARLAKKGLTLDPWCPLCYQQVEDYEHLF-----MSCPFAKLTWFASPLDLHAPSNVDVNS 105 Query: 182 WLLKWLTCRDTEGAQLFCIM--IWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSR 355 W+L+ L+ EG Q+FC M IW +N+ +F + P +A +A F+ E+ Sbjct: 106 WVLQGLSNPLVEGVQIFCTMSKIWFHRNKLIFKQQAFVPHEVASSASSFVAEFSPTFLRE 165 Query: 356 LHFSLQEQ-QAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI--- 523 ++ + + +A + + + VD GSF++G+T WGL V + + +I SAC+ + I Sbjct: 166 IYMNTSDVLEASQVVSPVCNRICVDAGSFSNGSTGWGLIVKDHESSVILSACRFEEIYTC 225 Query: 524 -----AVDIRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELI 688 A+ IRW +Q +++N+ I SDA + I + A + I+QDC SL Sbjct: 226 PILAEALGIRWAIQTAIDLNYNQVTIVSDALTIVKGIEGKTCPAEVALIVQDCISLCSNF 285 Query: 689 LGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPP---VNCN 808 + +V +++R N AH LV L+K G W G P V CN Sbjct: 286 MHVAVVYVKRTLNTEAHNLVQLSKHVGCRTWSGIIPNLAVVCN 328 >dbj|GAU41525.1| hypothetical protein TSUD_140560 [Trifolium subterraneum] Length = 1610 Score = 145 bits (367), Expect = 1e-35 Identities = 78/210 (37%), Positives = 120/210 (57%), Gaps = 12/210 (5%) Frame = +2 Query: 5 NLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEW 184 NL+KK I LD +CPLC+ E HLF + CN+ +LALFAS L HPP+ D+ W Sbjct: 1406 NLKKKGISLDTSCPLCHNDSENAHHLF-----MHCNMLKLALFASPLGCHPPMNVDLNCW 1460 Query: 185 LLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANS-S 352 LL+WL C D GAQLFC ++W+ A+N+ VFN +P+R+A++A+ F+ E+ AN+ S Sbjct: 1461 LLEWLNCSDKLGAQLFCTILWKFWFARNQYVFNGYPIEPLRLAQSALLFVQEFNEANNLS 1520 Query: 353 RLHFSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI--- 523 R A+ ++VD G F++ T WGL + + G++ ++AC+R++I Sbjct: 1521 RSTHVATRVHNTNSASPCQFSMFVDAGCFSNARTGWGLVLKDQRGNVTWNACRREDIEVT 1580 Query: 524 -----AVDIRWGMQQVLEMNFVPDAIYSDA 598 A+++RW +Q L + DA Sbjct: 1581 PILAEALELRWAIQSALSQGIQSISFNCDA 1610 >dbj|GAU29911.1| hypothetical protein TSUD_148190 [Trifolium subterraneum] Length = 482 Score = 134 bits (338), Expect = 1e-32 Identities = 84/274 (30%), Positives = 137/274 (50%), Gaps = 14/274 (5%) Frame = +2 Query: 2 SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181 + L KK + LD PLC E EHLF + C ++L FAS L +H P D+ Sbjct: 195 ARLAKKGLTLDPYFPLCYQQAEDYEHLF-----MSCPFSKLTWFASPLGLHAPSNVDVNS 249 Query: 182 WLLKWLTCRDTEGAQLFCIMIWRA---KNEAVFNNKK--PDPIRIAEAAVEFITEYIAAN 346 W+L+ L+ EG Q+FC +W+ +N+ +F + P +A +A F E+ Sbjct: 250 WVLQGLSNPLVEGVQIFCTSLWKIWFHRNKLIFEQQAFVPHEYEVASSASSFGAEFSPTF 309 Query: 347 SSRLHFSLQEQ-QAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI 523 + + + +A + + + VD G F++G+T WGL V + +G +I+SAC+ + I Sbjct: 310 LREIDMNTSDVLEASQVVSPICNRICVDAGCFSNGSTGWGLIVKDHEGSVIFSACRFEEI 369 Query: 524 --------AVDIRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLL 679 A+ IRW ++ +++N+ I SDA + I + A +E I+QDC SL Sbjct: 370 HTSPILAEALAIRWAIRTAIDLNYNQVTIVSDALTIVKDIEGKTCPAKVELIVQDCISLC 429 Query: 680 ELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENW 781 + +V +++R N AH LV L+K G W Sbjct: 430 SNFMHVAVVYVKRTLNTEAHNLVQLSKHVGCRTW 463 >dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subterraneum] Length = 1626 Score = 131 bits (329), Expect = 1e-30 Identities = 84/279 (30%), Positives = 138/279 (49%), Gaps = 15/279 (5%) Frame = +2 Query: 8 LRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEWL 187 L KK I LD CPLC E +EH+F++C +++ F+S L +H P + W+ Sbjct: 1341 LEKKGITLDTTCPLCFNDIEC-----NEHLFMQCPLSKQVWFSSPLGLHAPNNFSLNSWM 1395 Query: 188 LKWLTCRDTEGAQLFCI---MIWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRL 358 WL+ D +QLF MIW+ +N+ +F N+K PI +A A+ +F+ E+ + S Sbjct: 1396 QLWLSNPDKLASQLFSTTLWMIWKGRNKLIFKNEKFCPIYVAAASSDFVAEFNSGTCSFE 1455 Query: 359 HFSLQEQQAPIPANHLGTC-LYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD- 532 + + G + +D G F++GTT WG+ + N G + ++A + I V Sbjct: 1456 NIPSCDNPGKWEHPEQGKLKVNIDAGCFSNGTTGWGMIMRNHLGMVEFAATHLEKIKVSS 1515 Query: 533 -------IRWGMQQVLEMNFVPDAIY-SDAQVVTLCISKRFHVASIEHIMQDCRSLLELI 688 +RW +Q + I SD++V C++ +E+I+QDCR+ L + Sbjct: 1516 TLAETMALRWCLQWIQASTHHEHIIIESDSEVSVKCLNGSICDVLVENIIQDCRNFLSSL 1575 Query: 689 LGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLG--PPPV 799 V +RR N+ H L LA+ G ++W+G P PV Sbjct: 1576 PNVIVVFVRRSKNVATHELASLARTVGAKSWVGCVPGPV 1614 >dbj|GAU39667.1| hypothetical protein TSUD_60340 [Trifolium subterraneum] Length = 1063 Score = 129 bits (323), Expect = 7e-30 Identities = 84/288 (29%), Positives = 137/288 (47%), Gaps = 23/288 (7%) Frame = +2 Query: 8 LRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEWL 187 L +K + LD CPLC ET EHLF + C V R F S L +H P ++ +W+ Sbjct: 773 LEQKGVALDPICPLCYDGEETQEHLF-----MHCQVIRRFWFLSPLGLHVPADVNLFKWM 827 Query: 188 LKWLTCRDTEGAQLFCIM---IWRAKNEAVFNNKKPDPIRIAE----------AAVEFIT 328 WL+ + QLF + IW+ +N++VFN K P+ + + + A I+ Sbjct: 828 EHWLSNSNFMATQLFSLSLWTIWKMRNDSVFNKKYPNCMIVVQNVSILAEEFNLACNLIS 887 Query: 329 EYIAANSSRLHFSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSAC 508 ++ ++ + PI + +D G F + TCWGL N G + ++A Sbjct: 888 NVVSEPIINSDVDVRWELPPIGFLKVN----IDAGCFKNNYTCWGLLDRNHKGIVQFAAT 943 Query: 509 KRDNI--------AVDIRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQD 664 KR+ I A+ ++W ++ + + N + DA+ V C+ +R ++ IE I+ D Sbjct: 944 KRERITCSPLLVEALSLKWCLRWIKDQNLQNVEVEMDAENVVNCLLRRINIVEIELIVVD 1003 Query: 665 CRSLLELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLG--PPPVN 802 C +L +L SV ++ N AH LVG+A G+ W G P PV+ Sbjct: 1004 CLYILLSLLNVSVLVVKSCKNKAAHGLVGVAMNLGSLLWFGNVPEPVS 1051 >gb|PNX85413.1| ribonuclease H [Trifolium pratense] gb|PNY00296.1| ribonuclease H [Trifolium pratense] Length = 207 Score = 117 bits (293), Expect = 2e-28 Identities = 70/182 (38%), Positives = 98/182 (53%), Gaps = 11/182 (6%) Frame = +2 Query: 287 DPIRIAEAAVEFITEYIAANSSRLHFSLQEQQAPIPANHLGTCL---YVDVGSFNDGTTC 457 DP +A A+ F+ E+ AN SR +L Q P+ T + +VD G N G T Sbjct: 2 DPTFLALDALSFVQEFNEANPSRNRRALVSQSISEPSRSTCTSMNSMFVDAGCCNSGHTV 61 Query: 458 WGLCVVNADGDIIYSACKRDNI--------AVDIRWGMQQVLEMNFVPDAIYSDAQVVTL 613 WGL + N +G+ ++SACKR++I A+ +RW +Q + +IYSDA V Sbjct: 62 WGLVLRNLNGETVFSACKREDITAEPLLAEALGVRWALQVATDQGINSVSIYSDAANVVN 121 Query: 614 CISKRFHVASIEHIMQDCRSLLELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPP 793 CI+KR + A+I I QDCR+L+ + SV I R N AH LV LAK G+ WLG Sbjct: 122 CINKRSNFAAINLIAQDCRNLMAGLGNVSVMFISRTQNCDAHNLVSLAKVVGSRTWLGVA 181 Query: 794 PV 799 P+ Sbjct: 182 PL 183 >gb|PNX58626.1| ribonuclease H, partial [Trifolium pratense] Length = 217 Score = 114 bits (286), Expect = 3e-27 Identities = 69/195 (35%), Positives = 101/195 (51%), Gaps = 12/195 (6%) Frame = +2 Query: 254 KNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRLHFSLQEQQAPIPANHLGT----CLY 421 +N AVFN + DP R+A A+ F+ ++ AN R + A +P T L+ Sbjct: 2 RNAAVFNGVQLDPGRLAIDAMSFVHDFNEANPPRCR---RAPVAHVPIQPGMTNPIFSLF 58 Query: 422 VDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD--------IRWGMQQVLEMNFVP 577 VD G N G T WGL + N+DG+ + SACKR++ VD +RW +Q V++ Sbjct: 59 VDAGCSNSGHTVWGLVLRNSDGETVLSACKREDFYVDPLMAEALGVRWALQLVVDQGINS 118 Query: 578 DAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELILGCSVNHIRRDGNLVAHRLVGLA 757 +IYSDA V CI++ A+I I +DCR L+ + V + R N AH L LA Sbjct: 119 VSIYSDAANVVNCINRNSSFAAINLIAEDCRKLMNRLTNVCVLFVSRTQNSDAHNLASLA 178 Query: 758 KRFGNENWLGPPPVN 802 + GN W+G P++ Sbjct: 179 RIMGNRTWVGVVPLS 193 >gb|PNY04967.1| ribonuclease H [Trifolium pratense] Length = 207 Score = 114 bits (285), Expect = 3e-27 Identities = 67/190 (35%), Positives = 99/190 (52%), Gaps = 8/190 (4%) Frame = +2 Query: 242 IWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRLHFSLQEQQAPIPANHLGTCLY 421 +W +N+ VF K P P IA AA++ + E+ A + Q + A + Sbjct: 1 MWFFRNQVVFQQKIPTPPDIAIAALDIVHEFNLAVPKKSKQRQQHAASEPAATLCSHLIQ 60 Query: 422 VDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD--------IRWGMQQVLEMNFVP 577 VD G F DG T +G + + G I +SAC+++N+ VD IRW +Q + N Sbjct: 61 VDAGCFPDGYTTFGCVIKDCSGMISFSACRKENLLVDPLLAEALAIRWCLQVAKDQNLKE 120 Query: 578 DAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELILGCSVNHIRRDGNLVAHRLVGLA 757 I SDA VV CI +A IE I+ DC+ L+ S+N++ RD N++AHRLVG A Sbjct: 121 VIIQSDALVVVECIRGSNSIACIELIVTDCKLLMSTFSSVSINYVCRDLNVLAHRLVGYA 180 Query: 758 KRFGNENWLG 787 + G ++WLG Sbjct: 181 MQVGCKSWLG 190 >dbj|GAU21787.1| hypothetical protein TSUD_329120, partial [Trifolium subterraneum] Length = 734 Score = 119 bits (299), Expect = 9e-27 Identities = 81/273 (29%), Positives = 120/273 (43%), Gaps = 8/273 (2%) Frame = +2 Query: 2 SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181 +NL K I+LD+ CPLC E+ S+H+FL+C++ +L LFAS L Sbjct: 329 TNLHNKGIKLDLQCPLCFREEES-----SQHLFLKCDIFKLTLFASHLG----------- 372 Query: 182 WLLKWLTCRDTEGAQLFCIMIWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRLH 361 +N +VFN K DP R+A F+ ++ AN + Sbjct: 373 ------------------------RNASVFNGIKLDPGRLALDVTSFVHDFNEANPPSM- 407 Query: 362 FSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD--- 532 G T WGL + N+DG+ I+SACKR+ I+VD Sbjct: 408 ---------------------------SGPTVWGLVLRNSDGETIFSACKREEISVDPLM 440 Query: 533 -----IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELILGC 697 +RW +Q V++ +I+SDA V CI+++ A+I I +DCR+L+ + Sbjct: 441 AEALGVRWALQLVVDQGINSVSIHSDAANVVNCINRKSSFAAINLIAEDCRNLMTCLANV 500 Query: 698 SVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPP 796 + R N AH L LA+ GN W G P Sbjct: 501 CDLFVSRTQNSDAHNLASLARIMGNRTWQGVAP 533 >dbj|GAU32098.1| hypothetical protein TSUD_292220 [Trifolium subterraneum] Length = 240 Score = 112 bits (280), Expect = 3e-26 Identities = 73/243 (30%), Positives = 115/243 (47%), Gaps = 11/243 (4%) Frame = +2 Query: 101 LECNVTRLALFASRLAIHPPLQTDILEWLLKWLTCRDTEGAQLFCIMIWRAKNEAVFNNK 280 + CN+ +L LFAS L HPPL D+ WL N+ VF Sbjct: 1 MHCNLLKLVLFASPLGCHPPLNVDLNCWL-----------------------NQTVFKGT 37 Query: 281 KPDPIRIAEAAVEFITEYIAAN-SSRLHFSLQE--QQAPIPANHLGTCLYVDVGSFNDGT 451 + + +A+ A+ F+ E+ AN SR + +P+P+ ++VD ++ Sbjct: 38 PFEAVSLAQPALLFVQEFNDANIQSRPSQAATRVRNSSPVPSRQFS--MFVDASCLSNAQ 95 Query: 452 TCWGLCVVNADGDIIYSACKRDNI--------AVDIRWGMQQVLEMNFVPDAIYSDAQVV 607 WG+ + +G +++SACKRDNI A+ +RW +Q + + DA V Sbjct: 96 IGWGIVFKDHNGAVLWSACKRDNIVVTPIIADALGLRWAIQTSISQGIQCLSFACDALEV 155 Query: 608 TLCISKRFHVASIEHIMQDCRSLLELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLG 787 CI+ + VASI+ +++DC +LLE I V H+ R N AH L LA+ G+ W+G Sbjct: 156 VNCINSKCVVASIDPVIKDCTNLLENIPYAMVYHVSRKLNREAHDLASLARYVGSRTWMG 215 Query: 788 PPP 796 P Sbjct: 216 NAP 218 >dbj|GAU37237.1| hypothetical protein TSUD_375390 [Trifolium subterraneum] Length = 246 Score = 112 bits (279), Expect = 6e-26 Identities = 69/199 (34%), Positives = 98/199 (49%), Gaps = 15/199 (7%) Frame = +2 Query: 245 WRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRL------HFSLQEQQA-PIPANH 403 W +N VFN K DP R+A F+ ++ AN R H S+Q PI + Sbjct: 5 WNGRNATVFNGIKLDPGRLALDVTSFVHDFNEANPPRCRRAPVAHVSIQPSLVTPIFS-- 62 Query: 404 LGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD--------IRWGMQQVL 559 L+VD G G WGL + N+DG+ I S CKR+ I+VD +RW +Q V+ Sbjct: 63 ----LFVDAGCSMSGPIVWGLVLRNSDGETILSVCKREEISVDPLMAETLGVRWALQLVI 118 Query: 560 EMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELILGCSVNHIRRDGNLVAH 739 + +I+SDA V CI+++ A+I I +DCR+L+ + V + R N AH Sbjct: 119 DQGINSVSIHSDAANVVNCINRKSSFAAINLIAEDCRNLMTCLANVCVLFVSRTQNSDAH 178 Query: 740 RLVGLAKRFGNENWLGPPP 796 L LA+ GN W G P Sbjct: 179 NLASLARIMGNRTWQGVSP 197