BLASTX nr result
ID: Astragalus24_contig00008852
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Astragalus24_contig00008852 (1278 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value dbj|GAU24087.1| hypothetical protein TSUD_388800 [Trifolium subt... 84 2e-17 gb|KHN30886.1| Putative ribonuclease H protein, partial [Glycine... 79 8e-17 gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine... 78 1e-16 gb|KHN28363.1| Putative ribonuclease H protein, partial [Glycine... 76 4e-16 gb|KHN20323.1| Putative ribonuclease H protein, partial [Glycine... 73 3e-15 gb|PNY01502.1| ribonuclease H [Trifolium pratense] 87 9e-15 dbj|GAU29820.1| hypothetical protein TSUD_223660 [Trifolium subt... 79 1e-14 gb|PNX95738.1| ribonuclease H, partial [Trifolium pratense] 75 4e-14 gb|PNX72376.1| ribonuclease H, partial [Trifolium pratense] 75 6e-14 dbj|GAU26446.1| hypothetical protein TSUD_294120 [Trifolium subt... 81 2e-13 dbj|GAU22230.1| hypothetical protein TSUD_227660 [Trifolium subt... 49 2e-12 dbj|GAU25119.1| hypothetical protein TSUD_274080 [Trifolium subt... 67 5e-11 gb|KYP45153.1| Putative ribonuclease H protein At1g65750 family ... 76 6e-11 gb|KYP45155.1| Putative ribonuclease H protein At1g65750 family ... 76 7e-11 dbj|GAU36827.1| hypothetical protein TSUD_320640 [Trifolium subt... 74 2e-10 dbj|GAU50636.1| hypothetical protein TSUD_134420 [Trifolium subt... 70 4e-10 gb|KYP50779.1| Transposon TX1 uncharacterized [Cajanus cajan] 68 4e-10 gb|KYP60814.1| Putative ribonuclease H protein At1g65750 family ... 73 4e-10 gb|KYP34281.1| Putative ribonuclease H protein At1g65750 family ... 72 7e-10 dbj|GAU43110.1| hypothetical protein TSUD_373050 [Trifolium subt... 64 1e-09 >dbj|GAU24087.1| hypothetical protein TSUD_388800 [Trifolium subterraneum] Length = 1985 Score = 84.0 bits (206), Expect(3) = 2e-17 Identities = 70/264 (26%), Positives = 111/264 (42%), Gaps = 28/264 (10%) Frame = -1 Query: 843 GMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQWQLAFSCEL-- 670 G G FW W G LE FP L +++ + + ++GSW G +W L + L Sbjct: 1694 GCGNSIKFWKEVWVGGQSLELQFPRLFGISVQQDDMVREVGSWVNGVWRWGLRWRRVLFV 1753 Query: 669 -SGEELEEVKLLHGCI----------------DGTTVKSFYNLFVDYFNHACEIQEFTLF 541 + + E++L+ I DG TVKS Y C + F F Sbjct: 1754 WEEDLVSELELVLNNISITEEEDRWVWRLNVGDGFTVKSLYEALDPLLTPRCLVSSFESF 1813 Query: 540 ILN---VEAVHSNKS--GWRFFLNRLPTRDILKARGILLGDFVSCCPMCFIEDETSTHLG 376 AV S S W+ FL+R+PT+ L RGIL D SC +C E ET+ HL Sbjct: 1814 AYRSIWKSAVPSKVSALAWQLFLDRIPTKVNLYKRGILRMDHASCV-LCGEEAETARHLF 1872 Query: 375 L-LNQCGKVSMIGSKWRSKIILL---VVLITGYIALWWKGKNLGGSTNLIWMAVLWCLWV 208 L + + +W +L V++ G + + K + ++WMA +W +W Sbjct: 1873 LHCDYAAGIWYAVCRWLGVFAVLPADVMMSYGLLVGCGRNKKIRKGFAIVWMAFIWVIWK 1932 Query: 207 SQSFDLFKGLVADVNSLLN*IMSL 136 ++ +FK +V ++ + L Sbjct: 1933 VRNERVFKNATVEVTDAVDMVQRL 1956 Score = 29.6 bits (65), Expect(3) = 2e-17 Identities = 10/29 (34%), Positives = 20/29 (68%) Frame = -3 Query: 142 VSWRWFISRTGRSSCLTVSDWWLSPLLCL 56 +SW+W++++ SSCL + +W +P C+ Sbjct: 1956 LSWQWYLNKMASSSCL-LYEWIWNPCECM 1983 Score = 25.4 bits (54), Expect(3) = 2e-17 Identities = 14/41 (34%), Positives = 19/41 (46%) Frame = -3 Query: 943 GIRIQFWWRDLMSIEDGLSQHKDLFQETYRCEVGDGSSILF 821 G WWRDL ++ G+ F R +G G+SI F Sbjct: 1665 GSMSSLWWRDLCRLDKGVG----WFNHFARKYLGCGNSIKF 1701 >gb|KHN30886.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 373 Score = 78.6 bits (192), Expect(2) = 8e-17 Identities = 67/273 (24%), Positives = 113/273 (41%), Gaps = 32/273 (11%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F++ + G G FW + W G + L+D FP L ++ + + GSW+ W Sbjct: 78 FFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAISSQQLVSVGNAGSWRRDQWTW 137 Query: 693 QLAFSCELSGEELEEVKLLHGCIDGT--------------------TVKSFYNLFVDYFN 574 L + +L+ E E + L + TV S Y+ + N Sbjct: 138 DLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWSLHNSKLFTVSSCYSFAMSLVN 197 Query: 573 HACEIQEFTLFILNV---EAVHSNKS--GWRFFLNRLPTRDILKARGILLGDFVSCCPMC 409 ++ L IL++ V S + WR L+RLPT+D L R +++ + S C +C Sbjct: 198 QT-QMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLPTKDNLIRRNVVINN--SRCSLC 254 Query: 408 FIEDETSTHLGLLNQCGKVSMIGSKWRSKIILLVVLITGYIALWWKGKNL---GGSTN-- 244 DE H L C I + S I ++ V+ G + +W+ L S N Sbjct: 255 DSCDENVVH--LFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKV 312 Query: 243 --LIWMAVLWCLWVSQSFDLFKGLVADVNSLLN 151 + W+A LW +W ++ +FK D+ +N Sbjct: 313 PFMFWLATLWIIWQVRNNSIFKEEEKDIPKTIN 345 Score = 38.5 bits (88), Expect(2) = 8e-17 Identities = 16/31 (51%), Positives = 21/31 (67%) Frame = -2 Query: 1019 LWSDLLSYRYGDLGTKIMANVGRKWGNTDSI 927 LW DLL++RYG+L K ++ R WG DSI Sbjct: 30 LWRDLLAFRYGNLIAKQTCSLDRSWGTKDSI 60 >gb|KHN41375.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 363 Score = 77.8 bits (190), Expect(2) = 1e-16 Identities = 67/273 (24%), Positives = 113/273 (41%), Gaps = 32/273 (11%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F++ + G G FW + W G + L+D FP L ++ + + GSW+ W Sbjct: 64 FFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAISSQQLVSVGNAGSWRRDQWTW 123 Query: 693 QLAFSCELSGEELEEVKLLHGCIDGT--------------------TVKSFYNLFVDYFN 574 L + +L+ E E + L + TV S Y+ + N Sbjct: 124 GLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWSLHNSKLFTVSSCYSFAMSLVN 183 Query: 573 HACEIQEFTLFILNV---EAVHSNKS--GWRFFLNRLPTRDILKARGILLGDFVSCCPMC 409 ++ L IL++ V S + WR L+RLPT+D L R +++ + S C +C Sbjct: 184 QT-QMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLPTKDNLIRRNVVINN--SRCSLC 240 Query: 408 FIEDETSTHLGLLNQCGKVSMIGSKWRSKIILLVVLITGYIALWWKGKNL---GGSTN-- 244 DE H L C I + S I ++ V+ G + +W+ L S N Sbjct: 241 DSCDENVVH--LFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKV 298 Query: 243 --LIWMAVLWCLWVSQSFDLFKGLVADVNSLLN 151 + W+A LW +W ++ +FK D+ +N Sbjct: 299 PFMFWLATLWIIWQVRNNSIFKEEEKDIPKTIN 331 Score = 38.5 bits (88), Expect(2) = 1e-16 Identities = 16/31 (51%), Positives = 21/31 (67%) Frame = -2 Query: 1019 LWSDLLSYRYGDLGTKIMANVGRKWGNTDSI 927 LW DLL++RYG+L K ++ R WG DSI Sbjct: 16 LWRDLLAFRYGNLIAKQTCSLDRSWGTKDSI 46 >gb|KHN28363.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 417 Score = 76.3 bits (186), Expect(2) = 4e-16 Identities = 66/268 (24%), Positives = 112/268 (41%), Gaps = 32/268 (11%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F++ + G G FW + W G + L+D FP L ++ + + GSW+ W Sbjct: 153 FFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAISSQQLVSVGNAGSWRRDQWTW 212 Query: 693 QLAFSCELSGEELEEVKLLHGCIDGT--------------------TVKSFYNLFVDYFN 574 L + +L+ E E + L + TV S Y+ + N Sbjct: 213 GLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWSLHNSKLFTVSSCYSFAMSLVN 272 Query: 573 HACEIQEFTLFILNV---EAVHSNKS--GWRFFLNRLPTRDILKARGILLGDFVSCCPMC 409 ++ L IL++ V S + WR L+RLPT+D L R +++ + S C +C Sbjct: 273 QT-QMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLPTKDNLIRRNVVINN--SRCSLC 329 Query: 408 FIEDETSTHLGLLNQCGKVSMIGSKWRSKIILLVVLITGYIALWWKGKNL---GGSTN-- 244 DE H L C + I + S I ++ V+ G + +W+ L S N Sbjct: 330 DSCDENVVH--LFFHCDFSNCIWKEVLSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKV 387 Query: 243 --LIWMAVLWCLWVSQSFDLFKGLVADV 166 + W+A LW +W ++ +FK D+ Sbjct: 388 PFMFWLATLWIIWQVRNNSIFKEEEKDI 415 Score = 38.5 bits (88), Expect(2) = 4e-16 Identities = 16/31 (51%), Positives = 21/31 (67%) Frame = -2 Query: 1019 LWSDLLSYRYGDLGTKIMANVGRKWGNTDSI 927 LW DLL++RYG+L K ++ R WG DSI Sbjct: 105 LWRDLLAFRYGNLIAKQTCSLDRSWGTKDSI 135 >gb|KHN20323.1| Putative ribonuclease H protein, partial [Glycine soja] Length = 417 Score = 73.2 bits (178), Expect(2) = 3e-15 Identities = 65/268 (24%), Positives = 110/268 (41%), Gaps = 32/268 (11%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F++ + G G FW + W G + L+D FP L ++ + + SW+ W Sbjct: 153 FFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAISSQQLESVGNASSWRRDQWTW 212 Query: 693 QLAFSCELSGEELEEVKLLHGCIDGT--------------------TVKSFYNLFVDYFN 574 L + +L+ E E + L + TV S Y+ + N Sbjct: 213 GLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWSLHNSKLFTVSSCYSFAMSLVN 272 Query: 573 HACEIQEFTLFILNV---EAVHSNKS--GWRFFLNRLPTRDILKARGILLGDFVSCCPMC 409 ++ L IL++ V S + WR L+RLPT+D L R +++ + S C +C Sbjct: 273 QT-QMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLPTKDNLIRRNVVINN--SRCSLC 329 Query: 408 FIEDETSTHLGLLNQCGKVSMIGSKWRSKIILLVVLITGYIALWWKGKNL---GGSTN-- 244 DE H L C I + S I ++ V+ G + +W+ L S N Sbjct: 330 DSCDENVVH--LFFHCDFSKCIWKEILSWIGIVDVIAVGGVQHFWEYDRLLKYNTSRNKV 387 Query: 243 --LIWMAVLWCLWVSQSFDLFKGLVADV 166 + W+A LW +W ++ +FK D+ Sbjct: 388 PFMFWLATLWIIWQVRNNSIFKEEEKDI 415 Score = 38.5 bits (88), Expect(2) = 3e-15 Identities = 16/31 (51%), Positives = 21/31 (67%) Frame = -2 Query: 1019 LWSDLLSYRYGDLGTKIMANVGRKWGNTDSI 927 LW DLL++RYG+L K ++ R WG DSI Sbjct: 105 LWRDLLAFRYGNLIAKQTCSLDRSWGTKDSI 135 >gb|PNY01502.1| ribonuclease H [Trifolium pratense] Length = 554 Score = 87.4 bits (215), Expect = 9e-15 Identities = 73/269 (27%), Positives = 110/269 (40%), Gaps = 28/269 (10%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F + G G FW+ WFG LFP L +N+FA +S+ +G + Sbjct: 250 WFSSNVSCHVGNGNNIEFWNFKWFGSQSFSSLFPDLYAKEVNKFAMVSERMGREGASSVC 309 Query: 693 QLAFSCELSGEELEEVKLLHGCIDGT--------------------TVKSFYNLFVDYFN 574 + ++ L E ++V L G + G +VKSFY+ + Sbjct: 310 RWRWNGPLDDIEQQQVLELQGLLAGFICSDSDPDRWWWLPDVNGMFSVKSFYSFLLSSRQ 369 Query: 573 ----HACEIQEFTLFILNVEAVHSNKSGWRFFLNRLPTRDILKARGILLGDFVSCCPMCF 406 E + + + N GWR LNRLPTR L RGIL F C CF Sbjct: 370 VTILETTEAEALSRLWKSDVPSKINVFGWRLLLNRLPTRMALHRRGILSNPFELSCVFCF 429 Query: 405 IEDETSTHLGLLNQCGKVSMIG-SKWRSKIILLVVLITGYIALW---WKGKNLGGSTNLI 238 E HL KV KW I + V + L+ +K K+ G +L+ Sbjct: 430 RHREDGAHLFFSCYFSKVVWRNVLKWLGLSIPMDVEGIDHFMLFGDLFKVKDKGRVRHLV 489 Query: 237 WMAVLWCLWVSQSFDLFKGLVADVNSLLN 151 W+A W LW ++ +FKG + + ++LL+ Sbjct: 490 WLATTWNLWKLRNKVIFKGDIPETSALLD 518 >dbj|GAU29820.1| hypothetical protein TSUD_223660 [Trifolium subterraneum] Length = 672 Score = 79.0 bits (193), Expect(2) = 1e-14 Identities = 67/274 (24%), Positives = 115/274 (41%), Gaps = 28/274 (10%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F +++ K GMG FW W G LE FP L +++ + + MGSW +W Sbjct: 371 WFNQVVLKKVGMGNSILFWKDVWAGDQSLEHRFPRLFGISIQQNEVVRNMGSWVNVEWRW 430 Query: 693 QLAFSCELSGEELEEVKLLHGCI-------------------DGTTVKSFYN-----LFV 586 +L + + E E V+ L + +G +VKS Y+ L Sbjct: 431 ELLWRRQFFVWENELVRELGEVLNIFPLSEEVDRWVWKPNEAEGFSVKSLYDWLDSTLVT 490 Query: 585 DYFNHACEIQEFTLFILNVEAVHSNKSGWRFFLNRLPTRDILKARGILLGDFVSCCPMCF 406 E F V + W+ FL+R+PT+D L R I+ + + C MC Sbjct: 491 RAILTPLEAFSFCSIWKCVVPSKVSALAWQLFLDRIPTKDNLCRRRIIRSED-AVCDMCG 549 Query: 405 IEDETSTHLGL-LNQCGKVSMIGSKWRSKIILL---VVLITGYIALWWKGKNLGGSTNLI 238 ETS H+ + + +V +W ++LL V+ + G + K + +++ Sbjct: 550 GVSETSRHVFMHCDFAAQVWYAICRWLGVVVLLPPDVMTMYGSLVGCGSNKKIKKGFSIV 609 Query: 237 WMAVLWCLWVSQSFDLFKGLVADVNSLLN*IMSL 136 W+A +W +W S++ +F + V LN I + Sbjct: 610 WLAFIWVMWRSRNDKVFNNVAGVVEDALNHIQRI 643 Score = 30.8 bits (68), Expect(2) = 1e-14 Identities = 11/29 (37%), Positives = 18/29 (62%) Frame = -3 Query: 142 VSWRWFISRTGRSSCLTVSDWWLSPLLCL 56 +SW+WF+S T + CL + +W P C+ Sbjct: 643 ISWQWFLSNTAKGPCL-LYEWSWDPGKCM 670 >gb|PNX95738.1| ribonuclease H, partial [Trifolium pratense] Length = 1375 Score = 75.1 bits (183), Expect(2) = 4e-14 Identities = 71/271 (26%), Positives = 112/271 (41%), Gaps = 30/271 (11%) Frame = -1 Query: 858 IDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQWQLAFS 679 I K G G FW W G L LFP L +++++ + +S +G GA QW L + Sbjct: 1078 ISRKVGNGETTSFWLDKWVGNSTLASLFPRLFSISIDKQSMVSDLGECVNGAWQWNLIWR 1137 Query: 678 CELSGEELEEV----KLLHGCI---------------DGTTVKSFYNLFVDYFNHA--CE 562 + E E V +LLH + +V+S Y + + + C+ Sbjct: 1138 RRIFEWEKELVEQLFQLLHTAVLSENSDCWVWKPGEGGSFSVRSAYLVLEEELSVQVNCD 1197 Query: 561 IQEF-TLFILNVEAVHSN--KSGWRFFLNRLPTRDILKARGILLGDFVSCCPMCFIEDET 391 +QE TL L S W+ LN +PTR L RG+L C +C DE+ Sbjct: 1198 VQETRTLHQLWSSPAPSKVIAFSWKLLLNSIPTRQNLAHRGVLQQTDSKLCAICVGVDES 1257 Query: 390 STHLGLLNQCGKVSMIGSKWRSKIILLVVLITGYIALWWK------GKNLGGSTNLIWMA 229 S HL L C S I + + L++VL + GK +IW Sbjct: 1258 SVHLFL--HCDFASCIWYEIFRWLGLVIVLPANLFQCFDSFIGAAVGKKCRKMFRMIWHT 1315 Query: 228 VLWCLWVSQSFDLFKGLVADVNSLLN*IMSL 136 ++W +W +++ +F +VN +++ I L Sbjct: 1316 IVWLIWKNRNDVIFSNSSKEVNEVVDDIKQL 1346 Score = 32.7 bits (73), Expect(2) = 4e-14 Identities = 12/35 (34%), Positives = 22/35 (62%) Frame = -3 Query: 154 ELDYVSWRWFISRTGRSSCLTVSDWWLSPLLCLKS 50 ++ +SWRW +SR+ + C+ +W + PL C +S Sbjct: 1342 DIKQLSWRWSLSRSKINPCM-FYEWCMEPLYCFRS 1375 >gb|PNX72376.1| ribonuclease H, partial [Trifolium pratense] Length = 852 Score = 74.7 bits (182), Expect(3) = 6e-14 Identities = 66/249 (26%), Positives = 105/249 (42%), Gaps = 25/249 (10%) Frame = -1 Query: 843 GMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQWQLAFSCE-LS 667 G G FW W G LE FP L +++ + + +G W G W+L + Sbjct: 588 GSGNNTKFWKDVWVGDQSLEIRFPRLFGISVQQDLVVRDVGRWVDGVWHWELLWRRNFFV 647 Query: 666 GEELEEVKLLH------------------GCIDGTTVKSFYNLFVD----YFNHACEIQE 553 EE+ +LLH +G +VKS Y +F+D + I+ Sbjct: 648 WEEILVQQLLHLISQASITEAEDGWSWRPASSEGFSVKSLY-VFLDSELLHHEPRSPIEY 706 Query: 552 FTLFILNVEAVHSNKS--GWRFFLNRLPTRDILKARGILLGDFVSCCPMCFIEDETSTHL 379 F + V S S W+ FL+R+PTR+ L+ RG++ + S CP+C +E ETS HL Sbjct: 707 FGFKHIWKSGVPSKVSAMAWQLFLDRIPTRNNLRLRGVISPEEES-CPVCTVEVETSQHL 765 Query: 378 GLLNQCGKVSMIGSKWRSKIILLVVLITGYIALWWKGKNLGGSTNLIWMAVLWCLWVSQS 199 L + VS +K R + +++W+A +W LW ++ Sbjct: 766 FLHCRFAAVSSGSNKKRRR-----------------------GFSIVWLAFVWVLWKIRN 802 Query: 198 FDLFKGLVA 172 +F +VA Sbjct: 803 DRVFNNIVA 811 Score = 28.9 bits (63), Expect(3) = 6e-14 Identities = 11/29 (37%), Positives = 18/29 (62%) Frame = -3 Query: 142 VSWRWFISRTGRSSCLTVSDWWLSPLLCL 56 +SW+WF+S +S CL + +W P C+ Sbjct: 823 LSWQWFMSNVAKSPCL-LYEWLWDPGDCM 850 Score = 23.5 bits (49), Expect(3) = 6e-14 Identities = 10/35 (28%), Positives = 18/35 (51%) Frame = -3 Query: 925 WWRDLMSIEDGLSQHKDLFQETYRCEVGDGSSILF 821 WWRD+ +++G+ F + VG G++ F Sbjct: 565 WWRDICRLDNGVG----WFSQVAIRNVGSGNNTKF 595 >dbj|GAU26446.1| hypothetical protein TSUD_294120 [Trifolium subterraneum] Length = 333 Score = 80.9 bits (198), Expect(2) = 2e-13 Identities = 63/276 (22%), Positives = 119/276 (43%), Gaps = 30/276 (10%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 + +++ + G G FW W G L+ FP L +++ + + ++G+W G +W Sbjct: 32 WLSQVVKRRMGGGSTISFWKDIWVGDQTLQHRFPRLYGISMQQNNSVREVGNWVEGGWRW 91 Query: 693 QLAFSCELSGEELEEVKLLHGCI-------------------DGTTVKSFYNLFVDYF-- 577 +L + E E V+ L I DG +VKS Y F Sbjct: 92 ELLWRRNFFAWEEELVRELEDVIRHMATTEEEDRWVWIPNEADGFSVKSLYVFLEGMFLP 151 Query: 576 -NHACEIQEFTLFILNVEAVHSN--KSGWRFFLNRLPTRDILKARGILLGDFVSCCPMCF 406 NH C+ + FT + V S W+ L+R+PT++ L +R I+ + + CP C Sbjct: 152 TNHLCDFERFTFKKIWKTPVPSKVCALAWQVCLDRIPTKENLVSRKIMRRE-DALCPTCG 210 Query: 405 IEDETSTHLGLLNQCGKVSMIG---SKWRSKIILL---VVLITGYIALWWKGKNLGGSTN 244 ET HL L C S + ++W ++++ +++ G + K + + Sbjct: 211 ETIETVRHLFL--HCRFASAVWYRVNRWLGTMVVIPHDIIMSHGLLVGCGGNKKVRKGYS 268 Query: 243 LIWMAVLWCLWVSQSFDLFKGLVADVNSLLN*IMSL 136 ++W+A +W +W ++ +F + +V ++ I L Sbjct: 269 IVWLAFVWVIWRFRNDRVFNNINGEVEDAMDSIQRL 304 Score = 25.0 bits (53), Expect(2) = 2e-13 Identities = 9/29 (31%), Positives = 19/29 (65%) Frame = -3 Query: 142 VSWRWFISRTGRSSCLTVSDWWLSPLLCL 56 +SW+W++ +T + S L + +W +P C+ Sbjct: 304 LSWQWYLLKTAKGSSL-LYEWVWNPGDCM 331 >dbj|GAU22230.1| hypothetical protein TSUD_227660 [Trifolium subterraneum] Length = 419 Score = 49.3 bits (116), Expect(4) = 2e-12 Identities = 48/180 (26%), Positives = 68/180 (37%), Gaps = 25/180 (13%) Frame = -1 Query: 843 GMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQWQLAFSCELSG 664 G G FW WFG +LFP L + I++ G + + LS Sbjct: 190 GNGNNIGFWKFKWFGNQPFNELFPNLFAKETHLNVMIAERLGGNGVSYMSSWQWVDRLST 249 Query: 663 EELEEVKLLHGCIDGTT--------------------VKSFYNLFVDYFNHACEIQEFTL 544 EE ++K L + G + VKS+YN+ ++ +H E+ L Sbjct: 250 EEANQMKELSEPLVGFSLHPFNSDRWRWIQESSGIFSVKSYYNVLIEN-SHMEELDINVL 308 Query: 543 FILNVEAVHSNKS-----GWRFFLNRLPTRDILKARGILLGDFVSCCPMCFIEDETSTHL 379 + + S GWR L R PTR L RGILL C C + E +HL Sbjct: 309 TAIRQLCRNDVPSKVLFFGWRLLLERPPTRVALNHRGILLNQQDLSCIFCSLIHEDCSHL 368 Score = 33.1 bits (74), Expect(4) = 2e-12 Identities = 16/41 (39%), Positives = 22/41 (53%) Frame = -3 Query: 943 GIRIQFWWRDLMSIEDGLSQHKDLFQETYRCEVGDGSSILF 821 G + WW+D+MS G D FQ R VG+G++I F Sbjct: 159 GAKYSTWWKDIMS--SGREAESDGFQTNVRAVVGNGNNIGF 197 Score = 30.0 bits (66), Expect(4) = 2e-12 Identities = 13/35 (37%), Positives = 19/35 (54%) Frame = -3 Query: 376 FAKSVWQGVNDWFEVEVQDNIVGGAHYRLYSSLVE 272 F+KSVW+ V W E G H+RL+ +V+ Sbjct: 374 FSKSVWEAVCTWVEKGYPTWAEGWNHFRLFGDMVK 408 Score = 28.5 bits (62), Expect(4) = 2e-12 Identities = 11/20 (55%), Positives = 15/20 (75%) Frame = -2 Query: 1025 DFLWSDLLSYRYGDLGTKIM 966 D LW+DLL +RYG L T ++ Sbjct: 132 DALWADLLRFRYGHLPTLVI 151 >dbj|GAU25119.1| hypothetical protein TSUD_274080 [Trifolium subterraneum] Length = 937 Score = 67.4 bits (163), Expect(2) = 5e-11 Identities = 74/277 (26%), Positives = 110/277 (39%), Gaps = 31/277 (11%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F K I + G G FW W G+ L + FP L ++L + A +S++ G W Sbjct: 635 WFAKNISRRVGRGDTTRFWKDCWVGQVPLCESFPRLFSISLQKEALVSEIRVGGEGVSWW 694 Query: 693 QLAFSCELSGEELEEVKLLH-----------------GCIDGT--TVKSFYNLFVDYFN- 574 + + L E E + L G DG TVKS Y L F Sbjct: 695 EWGWRRSLFVWEEELLLGLQDFISPMAFSTDDDVWYWGLEDGGVFTVKSAYLLLGRMFAS 754 Query: 573 ----HACEIQEFTLFILNVEAVHSNKSGWRFFLNRLPTRDILKARGILLGDFVSCCPMCF 406 + CE++ + W+ NR+PTRD L RGIL C C Sbjct: 755 FSMFNVCELRVLNSIWRSPAPSKVIAFSWKLLRNRIPTRDCLSRRGILAAGGSRECVHCQ 814 Query: 405 IEDETSTHLGLLNQCGKVSMIGS---KWRSKIIL----LVVLITGYIALWWKGKNLGGST 247 +ET+ HL L C + S +W +I+ L +L ++ K G Sbjct: 815 GREETALHLFLF--CDFAFRVWSAIFQWLGVVIVMPPNLFILFDCFVGAAGCNKRAKGFL 872 Query: 246 NLIWMAVLWCLWVSQSFDLFKGLVADVNSLLN*IMSL 136 LIW +W +W S++ LF V D +S+++ I L Sbjct: 873 -LIWHTTVWAIWRSRNEILFANGVLDPSSVIDEIKLL 908 Score = 30.0 bits (66), Expect(2) = 5e-11 Identities = 13/34 (38%), Positives = 20/34 (58%) Frame = -3 Query: 154 ELDYVSWRWFISRTGRSSCLTVSDWWLSPLLCLK 53 E+ +SWRW +SR CL + +W P +CL+ Sbjct: 904 EIKLLSWRWGLSRQKIPMCL-LYEWCWDPGICLR 936 >gb|KYP45153.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 689 Score = 75.9 bits (185), Expect = 6e-11 Identities = 63/267 (23%), Positives = 112/267 (41%), Gaps = 27/267 (10%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F + + GMG FWS +W + L + FP + +L + ++ MG W G QW Sbjct: 391 WFMDSLSWRLGMGDRIRFWSDAWAEAEPLANRFPRIFSNSLQKSNVVANMGHWSRGRWQW 450 Query: 693 QLAFSCELSGEELEEVK----------LLHGCIDGT----------TVKSFYNLFVDYFN 574 + + EL +V+ L+ G D +V+S Y +D Sbjct: 451 RFQWRRAWFTWELNDVQQFMNIVEARVLIEGVQDSRLWTLDSSGCFSVRSGYRALMDR-G 509 Query: 573 HACEIQEFTLFILNVEAVHSNKSG-WRFFLNRLPTRDILKARGILLGDFVSCCPMCFIED 397 + ++ +++ K WR F+ LPT++ L R +++ + CP C + Sbjct: 510 PSSQLPNVAAVAWDIKVPPKVKCFIWRLFMGALPTKENLLRRNVIVLRDQATCPFCNADI 569 Query: 396 ETSTHLGLLNQCGKVSMIGSKWRSKIILLVVLITGYIALWWK------GKNLGGSTNLIW 235 E+S H +L C I KW + L + + ++ K ++IW Sbjct: 570 ESSEH--ILLYCSSTDPIWKKWLLWLDSPTPLSSSFEGNFFAHPSILLSKKRVDQWHVIW 627 Query: 234 MAVLWCLWVSQSFDLFKGLVADVNSLL 154 A+LWC+W +++ +F+G D N LL Sbjct: 628 TAILWCIWRARNKYVFEGEHLDGNRLL 654 >gb|KYP45155.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 979 Score = 75.9 bits (185), Expect = 7e-11 Identities = 63/267 (23%), Positives = 112/267 (41%), Gaps = 27/267 (10%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F + + GMG FWS +W + L + FP + +L + ++ MG W G QW Sbjct: 681 WFMDSLSWRLGMGDRIRFWSDAWAEAEPLANRFPRIFSNSLQKSNVVANMGHWSRGRWQW 740 Query: 693 QLAFSCELSGEELEEVK----------LLHGCIDGT----------TVKSFYNLFVDYFN 574 + + EL +V+ L+ G D +V+S Y +D Sbjct: 741 RFQWRRAWFTWELNDVQQFMNIVEARVLIEGVQDSRLWTLDSSGCFSVRSGYRALMDR-G 799 Query: 573 HACEIQEFTLFILNVEAVHSNKSG-WRFFLNRLPTRDILKARGILLGDFVSCCPMCFIED 397 + ++ +++ K WR F+ LPT++ L R +++ + CP C + Sbjct: 800 PSSQLPNVAAVAWDIKVPPKVKCFIWRLFMGALPTKENLLRRNVIVLRDQATCPFCNADI 859 Query: 396 ETSTHLGLLNQCGKVSMIGSKWRSKIILLVVLITGYIALWWK------GKNLGGSTNLIW 235 E+S H +L C I KW + L + + ++ K ++IW Sbjct: 860 ESSEH--ILLYCSSTDPIWKKWLLWLDSPTPLSSSFEGNFFAHPSILLSKKRVDQWHVIW 917 Query: 234 MAVLWCLWVSQSFDLFKGLVADVNSLL 154 A+LWC+W +++ +F+G D N LL Sbjct: 918 TAILWCIWRARNKYVFEGEHLDGNRLL 944 >dbj|GAU36827.1| hypothetical protein TSUD_320640 [Trifolium subterraneum] Length = 795 Score = 74.3 bits (181), Expect = 2e-10 Identities = 62/263 (23%), Positives = 113/263 (42%), Gaps = 31/263 (11%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F +++ + G G FW W G L+ FP L +++ + + +G+W G +W Sbjct: 522 WFTQVVRKRMGDGNTINFWKDIWVGNQTLQQRFPRLYGISVQQDNSVRDVGNWVNGVWRW 581 Query: 693 QLAFS---C---ELSGEELEEVKLLHGCI--------------DGTTVKSFYNLFVDYFN 574 +L++ C E ELEE + H I DG +VKS Y F Sbjct: 582 ELSWRRNFCVWEEALVRELEEA-IRHTTITATEDRWVWVPNEADGFSVKSLYVFLQGMFG 640 Query: 573 HACEIQEFTLFILN---VEAVHSN--KSGWRFFLNRLPTRDILKARGILLGDFVSCCPMC 409 + +F F+ V S W+ L+R+PTRD L R I+ + + CP C Sbjct: 641 PQNNLNDFECFVFKNIWKSPVPSKVCALAWQLCLDRIPTRDNLVIRRIIRSE-DALCPAC 699 Query: 408 FIEDETSTHLGLLNQCGKVSMIG---SKWRSKIILL---VVLITGYIALWWKGKNLGGST 247 ET+ HL + C + + ++W K++++ + + G K + Sbjct: 700 GDVLETARHLFM--HCRFAAAVWYRVNRWLGKMVMIPPDIRMSYGLFVGCGGNKKIRKGY 757 Query: 246 NLIWMAVLWCLWVSQSFDLFKGL 178 +++W+A +W +W ++ +F + Sbjct: 758 SIVWLAFVWVIWRIRNDRIFNNI 780 >dbj|GAU50636.1| hypothetical protein TSUD_134420 [Trifolium subterraneum] Length = 208 Score = 70.1 bits (170), Expect = 4e-10 Identities = 43/148 (29%), Positives = 62/148 (41%) Frame = -1 Query: 822 FWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQWQLAFSCELSGEELEEVK 643 F W G L DLFP L ++ + H+S +G W G W+L +S L + E Sbjct: 40 FRKEKWLGMTPLRDLFPSLFNISTQQDGHVSDLGIWTNGNWAWKLEWSVMLDETKAETAC 99 Query: 642 LLHGCIDGTTVKSFYNLFVDYFNHACEIQEFTLFILNVEAVHSNKSGWRFFLNRLPTRDI 463 L ++ + VDY A L LN N GWR L +LPT++ Sbjct: 100 ELITLLEQVQQRQLTVAAVDYDTEA----TLKLLWLNNVPSKINIFGWRLLLQKLPTKEA 155 Query: 462 LKARGILLGDFVSCCPMCFIEDETSTHL 379 L +G++ C C+ E+E HL Sbjct: 156 LHRKGVITNTHDWACVFCYKEEEDLCHL 183 >gb|KYP50779.1| Transposon TX1 uncharacterized [Cajanus cajan] Length = 1102 Score = 67.8 bits (164), Expect(2) = 4e-10 Identities = 64/266 (24%), Positives = 115/266 (43%), Gaps = 35/266 (13%) Frame = -1 Query: 843 GMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQWQLAFSCELSG 664 G G FW W G +L D+FP L A + + + G+W+G WQ+ + E Sbjct: 812 GDGSSTRFWEDKWIGGLRLLDVFPRLYSFAFDPLSMVGHNGNWEGSTWLWQIKWRRETFV 871 Query: 663 EE----------LEEVKLLHGCIDG----------TTVKSFYNLFVDYFNHAC--EIQEF 550 E L+E+++ D +VKS Y+ + H+ E+ Sbjct: 872 HEEGSVNTLIEMLQEIQIFSSKQDQWRWICDKDGVFSVKSAYS----WLQHSMGGELSYS 927 Query: 549 TLFILNVEAVHSNKSG-------WRFFLNRLPTRDILKARGILLGDFVSCCPMCFIEDET 391 + FIL +++ K+ W+ F+N P + +L+ RG+ + + + C +C + E Sbjct: 928 SDFILVTKSLWKCKAPIKCLVFCWQVFMNAFPCKSLLQVRGVEVEN--NLCSLCSLFIED 985 Query: 390 STHLGLLNQCGKVSMIGSKWRSKIILLVVLITGYIALWWKGKNLG------GSTNLIWMA 229 HL LL C I + + + VVL +L+ NLG ++W++ Sbjct: 986 PIHLFLL--CPMAFNIWLSVANWLEVEVVLPNSLTSLYLYWTNLGIYKKSKQCFKVVWVS 1043 Query: 228 VLWCLWVSQSFDLFKGLVADVNSLLN 151 V+W LW+ ++ +F+ V D +L+ Sbjct: 1044 VIWSLWLHRNGIIFQQGVMDCKEVLD 1069 Score = 26.6 bits (57), Expect(2) = 4e-10 Identities = 11/28 (39%), Positives = 16/28 (57%) Frame = -3 Query: 139 SWRWFISRTGRSSCLTVSDWWLSPLLCL 56 SW+W S S + S+W+ SP LC+ Sbjct: 1075 SWKWIKSSVPGCS-FSYSNWYFSPRLCI 1101 >gb|KYP60814.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 681 Score = 73.2 bits (178), Expect = 4e-10 Identities = 61/247 (24%), Positives = 105/247 (42%), Gaps = 7/247 (2%) Frame = -1 Query: 873 YFKKLIDVKWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQW 694 +F + + GMG FWS +W + L + FP + +L + ++ MG W G L W Sbjct: 422 WFMDSLSWRLGMGDRIRFWSDAWAEAEPLANRFPRIFSNSLQKSNVVANMGHWSRGRL-W 480 Query: 693 QLAFSCELSGEELEEVKLLHGCIDGTTVKSFYNLFVDYFNHACEIQEFTLFILNVEAVHS 514 L S GC +V+S Y +D + ++ +++ Sbjct: 481 TLDSS---------------GCF---SVRSGYRALMDR-GPSSQLPNVAAVAWDIKVPPK 521 Query: 513 NKSG-WRFFLNRLPTRDILKARGILLGDFVSCCPMCFIEDETSTHLGLLNQCGKVSMIGS 337 K WR F+ LPT++ L R +++ + CP C + E+S H +L C I Sbjct: 522 VKCFIWRLFMGALPTKENLLRRNVIVLRDQATCPFCNADIESSEH--ILLYCSSTDPIWK 579 Query: 336 KWRSKIILLVVLITGYIALWWK------GKNLGGSTNLIWMAVLWCLWVSQSFDLFKGLV 175 KW + L + + ++ K ++IW A+LWC+W +++ +F+G Sbjct: 580 KWLLWLDSPTPLSSSFEGNFFAHPSILLSKKRVDQWHVIWTAILWCIWRARNKYVFEGEH 639 Query: 174 ADVNSLL 154 D N LL Sbjct: 640 LDGNRLL 646 >gb|KYP34281.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 374 Score = 71.6 bits (174), Expect = 7e-10 Identities = 61/251 (24%), Positives = 106/251 (42%), Gaps = 21/251 (8%) Frame = -1 Query: 843 GMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQWQLA-----FS 679 GMG FWS +W ++ L + FP + +L + ++ MG W+ G QW+ F+ Sbjct: 116 GMGDRIQFWSDAWVEEEPLANRFPRIFSNSLQKSNVVADMGHWRRGRGQWRFQWSRAWFT 175 Query: 678 CELSGEE-----LEEVKLLHGCIDGT----------TVKSFYNLFVDYFNHACEIQEFTL 544 EL+ + +EE L G D +V+S Y +D + ++ Sbjct: 176 WELNDVQQFMNIVEEGVLTEGVQDSRLWTLDSSGCFSVRSGYRALMDR-GSSSQLPNVAA 234 Query: 543 FILNVEAVHSNKSG-WRFFLNRLPTRDILKARGILLGDFVSCCPMCFIEDETSTHLGLLN 367 +++ K WR F+ LPT++ L R +++ P C ++ E++ H+ L Sbjct: 235 IAWDIKVPPKVKCFIWRLFMGALPTKENLLRRNVIVLRDQVTYPFCNVDIESAEHI--LL 292 Query: 366 QCGKVSMIGSKWRSKIILLVVLITGYIALWWKGKNLGGSTNLIWMAVLWCLWVSQSFDLF 187 C + I KW + W ++IW A+LWC+W ++ +F Sbjct: 293 YCNGIDPIWKKW--------------VEQW----------HVIWTAILWCIWRARYKCVF 328 Query: 186 KGLVADVNSLL 154 +G D N LL Sbjct: 329 EGEHLDGNRLL 339 >dbj|GAU43110.1| hypothetical protein TSUD_373050 [Trifolium subterraneum] Length = 1099 Score = 63.5 bits (153), Expect(2) = 1e-09 Identities = 64/251 (25%), Positives = 105/251 (41%), Gaps = 30/251 (11%) Frame = -1 Query: 849 KWGMGVVFYFWSHSWFGKDKLEDLFPVLA*VALNRFAHISKMGSWKGGALQWQLAFSCEL 670 K G G FWS W G L +FP L ++ ++ + G +W+ ++ EL Sbjct: 805 KVGNGNSTSFWSTKWIGDAPLSVIFPRLFSLSNHKDCMVRDFYEDDGDNERWRFSWRREL 864 Query: 669 SGEELEEVKLLHGCI------------------DGT-TVKSFYNLFVDYFNHACEIQEFT 547 E++ + L + DG +VKS YNL ++ E++E Sbjct: 865 FQWEVDRLTRLKELLVSFVFSSDDDSWIWRPDPDGVFSVKSAYNLLIEELRSGEELEEEA 924 Query: 546 LFILNV--EAVHSNKS---GWRFFLNRLPTRDILKARGILLGDFVSCCPMCFIEDETSTH 382 I E+ +K W+ +R+PTR L+ RG+L D C C ET+TH Sbjct: 925 ALIFEQIWESPAPSKVIAFSWQLLYDRIPTRRNLEVRGLLGLDSPWECVGCVGSVETTTH 984 Query: 381 LGLLNQCGKVSMIGSKWRSKIILLVVLITGYIALW--WKGKNLGGSTNL----IWMAVLW 220 L L C M+ + I +++V + L+ +G T L IW A +W Sbjct: 985 LFL--HCPSALMVWYEVFRWIGVIIVTPPSMMILFEVLRGSARNKKTRLGFLMIWHATIW 1042 Query: 219 CLWVSQSFDLF 187 C+W +++ +F Sbjct: 1043 CIWRARNNSIF 1053 Score = 29.3 bits (64), Expect(2) = 1e-09 Identities = 13/33 (39%), Positives = 19/33 (57%) Frame = -3 Query: 154 ELDYVSWRWFISRTGRSSCLTVSDWWLSPLLCL 56 E+ +SW+W +SRT S C+ +W P CL Sbjct: 1066 EIKVLSWKWCLSRTKISPCM-FYEWTWDPGECL 1097