BLASTX nr result
ID: Glycyrrhiza32_contig00031359
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza32_contig00031359 (877 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value GAU38148.1 hypothetical protein TSUD_395930 [Trifolium subterran... 201 1e-57 GAU50434.1 hypothetical protein TSUD_134890, partial [Trifolium ... 202 3e-56 KHN28363.1 Putative ribonuclease H protein, partial [Glycine soja] 194 8e-56 KHN20323.1 Putative ribonuclease H protein, partial [Glycine soja] 192 4e-55 KHN30886.1 Putative ribonuclease H protein, partial [Glycine soja] 188 6e-54 XP_019447203.1 PREDICTED: uncharacterized protein LOC109350421 [... 187 9e-53 XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [... 194 1e-52 GAU50085.1 hypothetical protein TSUD_371690 [Trifolium subterran... 184 2e-51 GAU29820.1 hypothetical protein TSUD_223660 [Trifolium subterran... 188 3e-51 GAU40143.1 hypothetical protein TSUD_163120 [Trifolium subterran... 182 5e-51 GAU43007.1 hypothetical protein TSUD_187280 [Trifolium subterran... 189 9e-51 KYP32706.1 Transposon TX1 uncharacterized [Cajanus cajan] 189 1e-50 GAU49581.1 hypothetical protein TSUD_139980 [Trifolium subterran... 180 2e-50 KYP69313.1 hypothetical protein KK1_008502 [Cajanus cajan] 176 3e-49 GAU29496.1 hypothetical protein TSUD_360410 [Trifolium subterran... 183 1e-48 GAU43110.1 hypothetical protein TSUD_373050 [Trifolium subterran... 182 2e-48 GAU35675.1 hypothetical protein TSUD_162470 [Trifolium subterran... 179 2e-48 KYP44529.1 Putative ribonuclease H protein At1g65750 family [Caj... 178 3e-48 GAU44350.1 hypothetical protein TSUD_129240 [Trifolium subterran... 174 4e-48 KYP44023.1 Putative ribonuclease H protein At1g65750 family [Caj... 181 5e-48 >GAU38148.1 hypothetical protein TSUD_395930 [Trifolium subterraneum] Length = 503 Score = 201 bits (512), Expect = 1e-57 Identities = 111/290 (38%), Positives = 161/290 (55%), Gaps = 3/290 (1%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYG-DGIAKCGTY-LQI 183 C P+ +GGLGV+DL+L N SLLAK RW+L T + +W+E++ RYG D I K + + Sbjct: 123 CKPKKEGGLGVRDLRLVNISLLAKWRWKLLTTECEVWKEVVGARYGRDVIGKVNLGDIDV 182 Query: 184 RQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMF 363 + GS W +DL LL+ D WFS+A+ +++G G S FW+E W G L+ FPR+F Sbjct: 183 TRTGSCWWRDLCLLDSDVR----WFSSAVGKRVGRGDSTMFWNEIWIGDQPLRQRFPRLF 238 Query: 364 SLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWS 543 +ST + + +MG +W W+L WRR+FF WE F + + QF+P+ +D W Sbjct: 239 GMSTQQNEVICNMGSLVNGLWHWELQWRRNFFTWEEDQYNHFLDIIVQFAPTVQ-QDRWL 297 Query: 544 WKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSL-DPRVSKALGVLWQTNVPSNIQIFSWR 720 W D AY ++ ++ S+ DP +LW+ PS + FSW+ Sbjct: 298 WLGD------GVQGYTANSAYSLVVNKLVTPSVCDPINDLVFKILWKCGAPSKVSAFSWQ 351 Query: 721 LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 L LDRL T++ LM+R II + HG +CVFC ES HLF C +W Sbjct: 352 LMLDRLQTKDNLMKRRIIQAHHG-NCVFCNLAQESASHLFLHCDRVAKVW 400 >GAU50434.1 hypothetical protein TSUD_134890, partial [Trifolium subterraneum] Length = 712 Score = 202 bits (513), Expect = 3e-56 Identities = 107/290 (36%), Positives = 154/290 (53%), Gaps = 3/290 (1%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYG---DGIAKCGTYLQ 180 CLP++KGGLG+K+L FN +LL K +WR D + +W +L+ RYG D + T Sbjct: 387 CLPKDKGGLGIKNLNCFNQALLCKWKWRGLCDHNTLWTKLLEHRYGSLADNFLR-DTTRD 445 Query: 181 IRQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360 ++ Q S+W +D+ ++ D WF + LGNG+ I+FW E W+GP CLKDLFP++ Sbjct: 446 VKGQ-SLWWRDIMMIGGIEN--DAWFRFNVRNVLGNGTCIRFWHETWHGPVCLKDLFPQL 502 Query: 361 FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540 + S E + D+G+ W W L W + + E E L+ PS D Sbjct: 503 YCKSPQAEAIIYDVGKWVNQQWVWNLQWSTNLTSTEHDAACELANLLTGIQPSLECADRR 562 Query: 541 SWKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWR 720 W L + Y+ L ++ +++ V KAL +LW +VPS + IF WR Sbjct: 563 RWGLTQTGMFSVKS------TYEFLQSREVVVAIEDNVVKALQLLWLNDVPSKVSIFGWR 616 Query: 721 LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 L L RLPTR L R+NII + H +SC+FC E E HL F+C FS +W Sbjct: 617 LLLSRLPTRMALARKNIIVNLHELSCIFCGEEQEELSHLLFNCPFSQELW 666 >KHN28363.1 Putative ribonuclease H protein, partial [Glycine soja] Length = 417 Score = 194 bits (494), Expect = 8e-56 Identities = 107/289 (37%), Positives = 150/289 (51%), Gaps = 2/289 (0%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAK--CGTYLQI 183 C P+ +GGLGVK+L++FN SLLAK +WR D +A+WR+L+ FRYG+ IAK C Sbjct: 70 CKPKKEGGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYGNLIAKQTCSLDRSW 129 Query: 184 RQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMF 363 + SIW +DL LLE D +F A+ +G+G SI FW +W G LKD FP +F Sbjct: 130 GTKDSIWWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELF 189 Query: 364 SLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWS 543 ++S+ + VG+ G R D W W L W+R E L L D W Sbjct: 190 AISSQQLVSVGNAGSWRRDQWTWGLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWK 249 Query: 544 WKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWRL 723 W L S Y + + + ++ + L ++W+ VPS + +F WRL Sbjct: 250 WSLHNSK------LFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRL 303 Query: 724 FLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 LDRLPT++ L+RRN++ + C C E+ HLFF C FS IW Sbjct: 304 LLDRLPTKDNLIRRNVVI--NNSRCSLCDSCDENVVHLFFHCDFSNCIW 350 >KHN20323.1 Putative ribonuclease H protein, partial [Glycine soja] Length = 417 Score = 192 bits (489), Expect = 4e-55 Identities = 106/289 (36%), Positives = 149/289 (51%), Gaps = 2/289 (0%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAK--CGTYLQI 183 C P+ +GGLGVK+L++FN SLLAK +WR D +A+WR+L+ FRYG+ IAK C Sbjct: 70 CKPKKEGGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYGNLIAKQTCSLDRSW 129 Query: 184 RQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMF 363 + SIW +DL LLE D +F A+ +G+G SI FW +W G LKD FP +F Sbjct: 130 GTKDSIWWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELF 189 Query: 364 SLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWS 543 ++S+ + VG+ R D W W L W+R E L L D W Sbjct: 190 AISSQQLESVGNASSWRRDQWTWGLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWK 249 Query: 544 WKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWRL 723 W L S Y + + + ++ + L ++W+ VPS + +F WRL Sbjct: 250 WSLHNSK------LFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRL 303 Query: 724 FLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 LDRLPT++ L+RRN++ + C C E+ HLFF C FS IW Sbjct: 304 LLDRLPTKDNLIRRNVVI--NNSRCSLCDSCDENVVHLFFHCDFSKCIW 350 >KHN30886.1 Putative ribonuclease H protein, partial [Glycine soja] Length = 373 Score = 188 bits (478), Expect = 6e-54 Identities = 105/283 (37%), Positives = 146/283 (51%), Gaps = 2/283 (0%) Frame = +1 Query: 28 GGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAK--CGTYLQIRQQGSI 201 GGLGVK+L++FN SLLAK +WR D +A+WR+L+ FRYG+ IAK C + SI Sbjct: 1 GGLGVKNLEVFNISLLAKWKWRCIHDHNALWRDLLAFRYGNLIAKQTCSLDRSWGTKDSI 60 Query: 202 WMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMFSLSTSK 381 W +DL LLE D +F A+ +G+G SI FW +W G LKD FP +F++S+ + Sbjct: 61 WWRDLMLLEKDLSQNQNFFQRAVSCDVGDGQSILFWYNKWLGSEPLKDAFPELFAISSQQ 120 Query: 382 EGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWSWKLDPS 561 VG+ G R D W W L W+R E L L D W W L S Sbjct: 121 LVSVGNAGSWRRDQWTWDLTWKRQLNPNEEESLHSLETILVDVHLVAESHDRWKWSLHNS 180 Query: 562 NTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWRLFLDRLP 741 Y + + + ++ + L ++W+ VPS + +F WRL LDRLP Sbjct: 181 K------LFTVSSCYSFAMSLVNQTQMNSDILDILSIVWKVPVPSKVALFCWRLLLDRLP 234 Query: 742 TRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 T++ L+RRN++ + C C E+ HLFF C FS IW Sbjct: 235 TKDNLIRRNVVI--NNSRCSLCDSCDENVVHLFFHCDFSKCIW 275 >XP_019447203.1 PREDICTED: uncharacterized protein LOC109350421 [Lupinus angustifolius] Length = 456 Score = 187 bits (476), Expect = 9e-53 Identities = 102/284 (35%), Positives = 149/284 (52%), Gaps = 4/284 (1%) Frame = +1 Query: 31 GLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIR--QQGSIW 204 G GVK+L LFN +LL K RWR+ + ++W +++ YG G + + ++GS W Sbjct: 12 GFGVKNLGLFNLALLGKWRWRMLSSSESLWVKVLRSIYGVEAVVRGGLVDVECFKKGSSW 71 Query: 205 MKDL-FLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMFSLSTSK 381 +DL + D G WF+ + R++G+G S FW + W G CLK+ F R+F ++ +K Sbjct: 72 WRDLGCVCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECLKNCFERLFQVTLNK 131 Query: 382 EGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWSWKLDPS 561 + C+ M E R VWCW L WRR F WE + + + + + ED W W D + Sbjct: 132 DACISSMDEWRNGVWCWLLNWRRSLFLWEQDEVNDLLNKVEEVRLVQGNEDGWLWVHDKN 191 Query: 562 NTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWRLFLDRL 738 T AYK L EV+ + L + LW + VPS ++ F+WRLF+ + Sbjct: 192 GT------YSVRNAYKVLQNEVRNDNYLHYK------RLWASKVPSKLKCFAWRLFVGGV 239 Query: 739 PTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 PTR L RR II S C FC ES +HLFF+C+ SY +W Sbjct: 240 PTRMNLARRGIIGSLPSTLCAFCGELEESSDHLFFTCSLSYSVW 283 >XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [Lupinus angustifolius] Length = 953 Score = 194 bits (493), Expect = 1e-52 Identities = 105/293 (35%), Positives = 154/293 (52%), Gaps = 4/293 (1%) Frame = +1 Query: 4 KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQI 183 + C + +GGLGVK+L LFN +LL K RW + + ++W +++ YG G + + Sbjct: 562 EVCRSKEEGGLGVKNLGLFNLALLGKWRWHMLSSSESLWVKVLRSIYGVEAVVRGGLVDV 621 Query: 184 R--QQGSIWMKDL-FLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFP 354 ++GS W +DL L D G WF+ + R++G+G S FW + W G CLK+ F Sbjct: 622 ECFKKGSSWWRDLGCLCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECLKNCFE 681 Query: 355 RMFSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIED 534 R+F ++ +K+ C+ MGE R VWCW L WRR F WE + + + + + ED Sbjct: 682 RLFQVTLNKDACISSMGEWRNGVWCWLLNWRRSLFLWEQDEVNDLLNKVEEVRLVQGNED 741 Query: 535 SWSWKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIF 711 W W D + T AYK L EV+ + L + LW + VPS ++ F Sbjct: 742 GWLWVHDKNGT------YSVRNAYKVLQNEVRNDNYLHYK------RLWASKVPSKLKCF 789 Query: 712 SWRLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 +WRLF+ +PT L RR II S C FC ES +HLFF+C+ SY +W Sbjct: 790 AWRLFVGGVPTWMNLARRGIIGSLPSTLCAFCGELEESSDHLFFTCSLSYSVW 842 >GAU50085.1 hypothetical protein TSUD_371690 [Trifolium subterraneum] Length = 438 Score = 184 bits (466), Expect = 2e-51 Identities = 106/291 (36%), Positives = 149/291 (51%), Gaps = 4/291 (1%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIR- 186 C ++KGGLGV+D++L N SLLAK RWRL + +W+E+++ +YG+ I + IR Sbjct: 54 CKEKSKGGLGVRDVRLANLSLLAKWRWRLLLPGNPLWKEVLVAKYGNHILNRVDWRDIRI 113 Query: 187 -QQGSIWMKDLFLLEHDRGVPDL-WFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360 S W KD+ L D+ V + W + ++ RK+GNG+S FW W G L FP + Sbjct: 114 PTLASKWWKDICTL--DKVVDNHNWLAESMIRKVGNGTSTSFWCSNWIGEAPLSVTFPLL 171 Query: 361 FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540 FSLS K G V + + G+ W W WRRD F WE ++ E L S +ED W Sbjct: 172 FSLSNHKNGMVRNFCDHVGENWRWSFSWRRDLFQWEEDLVVRLREILEPVVLSL-VEDFW 230 Query: 541 SWKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIFSW 717 SWKLDP AY L+ E+ L+ ++ +W + PS + FSW Sbjct: 231 SWKLDPEG------KFSVKSAYTFLVEELTRDDDLEEAMATVFDQIWDSPAPSKVIAFSW 284 Query: 718 RLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 +L DR+PTR L R ++ CV C VES HLF C + +W Sbjct: 285 QLLSDRIPTRRNLEIRGLLGLDMPWECVGCVGRVESTTHLFLHCPSAMMVW 335 >GAU29820.1 hypothetical protein TSUD_223660 [Trifolium subterraneum] Length = 672 Score = 188 bits (477), Expect = 3e-51 Identities = 104/292 (35%), Positives = 159/292 (54%), Gaps = 3/292 (1%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGD---GIAKCGTYLQ 180 C P+ +GGLG+++L+L N SLL K RWRL + + +W+++++ +YG+ G A+ + Sbjct: 292 CKPKKEGGLGIRNLRLVNLSLLTKWRWRLLSGEGEVWKDIIVAKYGERVMGNARLDNIVY 351 Query: 181 IRQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360 + Q GS W +DL L+ D G WF+ + +K+G G+SI FW + W G L+ FPR+ Sbjct: 352 L-QFGSAWWRDLCNLDKDEG----WFNQVVLKKVGMGNSILFWKDVWAGDQSLEHRFPRL 406 Query: 361 FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540 F +S + V +MG W W+L WRR FF WE ++RE E L+ F S+ + D W Sbjct: 407 FGISIQQNEVVRNMGSWVNVEWRWELLWRRQFFVWENELVRELGEVLNIFPLSEEV-DRW 465 Query: 541 SWKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWR 720 WK P+ TL+ I L P + + +W+ VPS + +W+ Sbjct: 466 VWK--PNEAEGFSVKSLYDWLDSTLVTRAI---LTPLEAFSFCSIWKCVVPSKVSALAWQ 520 Query: 721 LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIWSA 876 LFLDR+PT++ L RR II S + C C E+ H+F C F+ +W A Sbjct: 521 LFLDRIPTKDNLCRRRIIRSEDAV-CDMCGGVSETSRHVFMHCDFAAQVWYA 571 >GAU40143.1 hypothetical protein TSUD_163120 [Trifolium subterraneum] Length = 419 Score = 182 bits (462), Expect = 5e-51 Identities = 104/294 (35%), Positives = 155/294 (52%), Gaps = 3/294 (1%) Frame = +1 Query: 4 KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYL-- 177 + C PR++GGLGV+D+ N SLL K RWRL A W+E+++ +YG +A+ + Sbjct: 34 EVCRPRSEGGLGVRDVAKVNLSLLIKWRWRLLQSGYAFWKEVLVAKYGI-MARFKVHWIG 92 Query: 178 -QIRQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFP 354 + + S+W KD+ + D WF+ +CRKLGNG+S +FW +RW G L D FP Sbjct: 93 HALPNRVSLWWKDICGI--DIREDGSWFARNMCRKLGNGNSTRFWLDRWIGSLPLSDQFP 150 Query: 355 RMFSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIED 534 R+FSLS +++G V + + RG W + WRR F WE +L+ + L P ED Sbjct: 151 RLFSLSLNQQGMVREFRDVRGGEDGWVMRWRRRLFVWEEELLQRLQDLLPVDVPWSEAED 210 Query: 535 SWSWKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFS 714 WSW+L+ + + +SS + + G +W++ VPS + F+ Sbjct: 211 RWSWRLEEDGSFSVSSMYWYLGSV-----FSQASSFNAQELWVFGKIWKSPVPSKVIAFT 265 Query: 715 WRLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIWSA 876 W+L +R+PTR L R I G+ CV C ES HLF C F+ IW+A Sbjct: 266 WKLLRNRIPTRCNLASRG-IQLIGGLDCVHCVGREESGTHLFMFCDFAGQIWNA 318 >GAU43007.1 hypothetical protein TSUD_187280 [Trifolium subterraneum] Length = 1892 Score = 189 bits (481), Expect = 9e-51 Identities = 104/293 (35%), Positives = 151/293 (51%), Gaps = 5/293 (1%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTY--LQI 183 C P+ + GLG++DL++ N SLLAK RW+L + Q +W+E+++ +YG I G + I Sbjct: 1564 CKPKKEAGLGIRDLRVVNISLLAKWRWKLLSHQREVWKEVVIAKYGQYIIGNGNLGNVTI 1623 Query: 184 RQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMF 363 + S W KD+ L+ D WF+ A+ + +GNG FWS+ W G L+ FPRM+ Sbjct: 1624 PRVASTWWKDICSLDKDSN----WFAEAVEQSVGNGHLTSFWSDIWIGDQSLQQRFPRMY 1679 Query: 364 SLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWS 543 S+S K+ + +MG GD W W WRR+ FAWE + E + L+QF PS ED W Sbjct: 1680 SISNQKDSSIFNMGRWDGDRWRWDFNWRRNLFAWEEPMKLELMDVLNQFRPSDR-EDRWL 1738 Query: 544 W---KLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFS 714 W K D + + +LE P LW+ P+ + FS Sbjct: 1739 WSENKEDGFSVKTCYDRLQYMFCERRVLE--------PSEEFVFAKLWKCGAPTKVCAFS 1790 Query: 715 WRLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIWS 873 W+L DRL T+E L +R I+ M CV C VE+ HLF C F+ +W+ Sbjct: 1791 WQLLWDRLQTKENLYKRRILQQQQTM-CVLCNAAVETNRHLFLHCDFAAKVWN 1842 >KYP32706.1 Transposon TX1 uncharacterized [Cajanus cajan] Length = 1025 Score = 189 bits (480), Expect = 1e-50 Identities = 102/284 (35%), Positives = 149/284 (52%), Gaps = 1/284 (0%) Frame = +1 Query: 4 KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYG-DGIAKCGTYLQ 180 K P+ +GGLG+K++ FN +LLAK RW L + +MW ++L +YG D C +Y + Sbjct: 735 KVTRPKEEGGLGIKNIATFNVALLAKWRWNLFHNPDSMWARVLLSKYGVDRPNLCTSYNK 794 Query: 181 IRQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360 + SIW +D+ L D WF + K+G G FW +RW G CL L+PR+ Sbjct: 795 TK--ASIWWRDV-LKACGADNEDKWFDKSKDWKMGEGKQTLFWLDRWTGEECLAVLYPRL 851 Query: 361 FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540 F +S K+ V MG+ D W W+ WRR+ F WE + ++ L+ FS K D W Sbjct: 852 FLISEQKQDTVHKMGQWVDDTWVWEFRWRRERFDWEANQILTLHQILNTFSMKKLKNDYW 911 Query: 541 SWKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWR 720 WKL+PS AYK L + ++ K +W+ +VP + +F WR Sbjct: 912 YWKLEPSG------EFSVKSAYKFLTSQRSTNER----QKLFVCMWKLHVPLKVSLFVWR 961 Query: 721 LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCA 852 L ++ LPT+E L+RRNI P CVFC +E+ HLF +C+ Sbjct: 962 LLINALPTKENLLRRNIQLEPQNRLCVFCRASLETASHLFCTCS 1005 >GAU49581.1 hypothetical protein TSUD_139980 [Trifolium subterraneum] Length = 407 Score = 180 bits (457), Expect = 2e-50 Identities = 107/291 (36%), Positives = 148/291 (50%), Gaps = 3/291 (1%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIRQ 189 C P+ +GGLGVKDLK FN SLL K RWRL + ++W+ ++ +YG+ + + L Sbjct: 26 CRPKEEGGLGVKDLKWFNISLLTKWRWRLLLEHGSLWKLVLEAKYGN-VERVKLVLPRGN 84 Query: 190 QGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMFSL 369 + S+W KDL L GV D W + +KLG G S +FW +RW G L + F R+F + Sbjct: 85 KFSLWWKDLVGLGVTNGVEDDWNQHVFLKKLGCGGSTRFWLDRWVGLAPLCETFSRIFKV 144 Query: 370 STSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWSWK 549 S E + D+GE D W W+L WRR FF E E ++ +K EDSWS+ Sbjct: 145 SLHPECVIKDLGEWVNDTWVWRLAWRRSFFIREEESYNNLMEIITPVPITKE-EDSWSF- 202 Query: 550 LDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVL---WQTNVPSNIQIFSWR 720 + YK L P V ++GV+ W++ P + +FSW+ Sbjct: 203 ---IDRGMFTVRYMYSYLYKKFLPP------SPLVLCSVGVIARVWESWAPLKVIVFSWQ 253 Query: 721 LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIWS 873 L RLPTR L+RR II CVFC ES HLF SC ++ +WS Sbjct: 254 ALLGRLPTRGNLVRRRIIIDGEASFCVFCNGARESENHLFSSCGTAWLVWS 304 >KYP69313.1 hypothetical protein KK1_008502 [Cajanus cajan] Length = 375 Score = 176 bits (447), Expect = 3e-49 Identities = 97/286 (33%), Positives = 147/286 (51%), Gaps = 5/286 (1%) Frame = +1 Query: 28 GGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIRQQGSIWM 207 GGLGVK++ FN +LLAK RW L ++W ++ RYG G C R SIW Sbjct: 2 GGLGVKNITRFNMALLAKWRWSLFHQNDSLWARVLYSRYGGGTNLCAQSSSRRD--SIWW 59 Query: 208 KDLFL----LEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMFSLST 375 +DL + LE D WF + ++G+G+ +FW + W GP CL +FPR+F++S Sbjct: 60 RDLLMVCGGLEQDN-----WFERKIKWRIGSGARARFWLDNWTGPICLASVFPRLFTISE 114 Query: 376 SKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWSWKLD 555 + + DMG W W+L WRR+ F WE + ++ + L + SP DSW W + Sbjct: 115 QQNHFIQDMGSWTDSSWVWQLQWRRERFEWEIQLEQQLMQQLLECSPRAEQVDSWWWLGE 174 Query: 556 PSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWRLFLD 732 PS T AY + EV + S+ + A ++W P ++IF+WR+ Sbjct: 175 PSGT------YTVRSAYSAITSEVVVGSN-----NGAPNIIWSIPAPPKVKIFAWRMMSR 223 Query: 733 RLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 LPT + L R+I S + CVFC +++E+ HLF +C+ +W Sbjct: 224 GLPTVDNLASRSITISDNDALCVFCKQDIETDYHLFCTCSVVDKVW 269 >GAU29496.1 hypothetical protein TSUD_360410 [Trifolium subterraneum] Length = 1301 Score = 183 bits (465), Expect = 1e-48 Identities = 100/290 (34%), Positives = 152/290 (52%), Gaps = 3/290 (1%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIR- 186 C + KGGLGV+D++L N SLL+K RWRL +W+E+++ +YG+ I + +R Sbjct: 945 CKDKAKGGLGVRDIRLVNISLLSKWRWRLLQPGRPLWKEVLVAKYGEFILNKVDWSGVRI 1004 Query: 187 -QQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRMF 363 S+W +D+ ++ D WF+ ++ RK+GNG+S FWS W G L +FPR+F Sbjct: 1005 PSTASMWWRDISSIDKVVSSKD-WFAESIVRKVGNGNSTSFWSTIWIGDDPLSVVFPRLF 1063 Query: 364 SLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSWS 543 SLS + + V D GE R W W WRRD F WE ++ + E L S ED W Sbjct: 1064 SLSNNNDRMVKDFGEYREGRWIWSFSWRRDLFQWEEDLVAQLRELLDPVVLSLE-EDWWR 1122 Query: 544 WKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIFSWR 720 W+ + + +YK L+ E++ L+ G +W + PS + FSW+ Sbjct: 1123 WRPETNGV------FSVNSSYKLLVDELESEEVLEEAEITVFGQIWDSPAPSKVIAFSWQ 1176 Query: 721 LFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 L D++PTR+ L R+++ + CV C VES HLF C + +W Sbjct: 1177 LLYDQIPTRKNLEARDMVLADMPWECVGCVGNVESSLHLFLHCPSAMLVW 1226 >GAU43110.1 hypothetical protein TSUD_373050 [Trifolium subterraneum] Length = 1099 Score = 182 bits (463), Expect = 2e-48 Identities = 105/291 (36%), Positives = 148/291 (50%), Gaps = 4/291 (1%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIR- 186 C +NKGGLGV+D+++ N SLLAK RWRL +W+E+++ +YG+ I + R Sbjct: 715 CRAKNKGGLGVRDVRIVNLSLLAKWRWRLLLPGRPLWKEILVAKYGEHILHRVDWSDYRI 774 Query: 187 -QQGSIWMKDLFLLEHDRGVPDL-WFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360 S W KD+ + D+ V D W + RK+GNG+S FWS +W G L +FPR+ Sbjct: 775 PSSASKWWKDICSI--DKVVEDKNWLVEEVGRKVGNGNSTSFWSTKWIGDAPLSVIFPRL 832 Query: 361 FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540 FSLS K+ V D E GD W+ WRR+ F WE L E L F S +DSW Sbjct: 833 FSLSNHKDCMVRDFYEDDGDNERWRFSWRRELFQWEVDRLTRLKELLVSFVFSSD-DDSW 891 Query: 541 SWKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQIFSW 717 W+ DP AY L+ E++ L+ + +W++ PS + FSW Sbjct: 892 IWRPDPDGV------FSVKSAYNLLIEELRSGEELEEEAALIFEQIWESPAPSKVIAFSW 945 Query: 718 RLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 +L DR+PTR L R ++ CV C VE+ HLF C + +W Sbjct: 946 QLLYDRIPTRRNLEVRGLLGLDSPWECVGCVGSVETTTHLFLHCPSALMVW 996 >GAU35675.1 hypothetical protein TSUD_162470 [Trifolium subterraneum] Length = 587 Score = 179 bits (453), Expect = 2e-48 Identities = 99/288 (34%), Positives = 154/288 (53%), Gaps = 2/288 (0%) Frame = +1 Query: 4 KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAK--CGTYL 177 + CLP++KGGLGV+DL+LFN +LL K +WR TD+ A+W L+ +RYG K C + Sbjct: 282 QVCLPKDKGGLGVRDLELFNLALLCKCKWRCITDKHALWNALLQYRYGPLSFKLLCRETI 341 Query: 178 QIRQQGSIWMKDLFLLEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPR 357 R + S+W +D+ + +G D WF + LGNG+SI FW E+W+G L++LFP Sbjct: 342 VTRPKDSLWWRDVVGVG-GKG-EDCWFPTQVSSVLGNGNSISFWKEKWHGVVPLRELFPL 399 Query: 358 MFSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDS 537 ++ K+ V ++ ++ W W R + E A + L + + + D Sbjct: 400 LYEKEIHKDCVVSELFLPGSNLLNWNREWLRSLSSSELAEKADLEILLVGLTLNSDVADH 459 Query: 538 WSWKLDPSNTXXXXXXXXXXXAYKTLLEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSW 717 W W P N+ Y L + LD + AL LW+ ++PS + +F W Sbjct: 460 WRWV--PENSGLFSVKS----VYIFLQSSLELNPLDSDLLYALSKLWKNDIPSKVGVFGW 513 Query: 718 RLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSY 861 RL LD+LPTR L+ + I+S+ + +SC+FC +VE H+FFS A + Sbjct: 514 RLLLDKLPTRAALVSKGILSNSNDVSCIFCSMDVEDSNHIFFSDATKF 561 >KYP44529.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 567 Score = 178 bits (451), Expect = 3e-48 Identities = 99/294 (33%), Positives = 149/294 (50%), Gaps = 5/294 (1%) Frame = +1 Query: 4 KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQI 183 + LP+ GGLGVK++ FN +LLAK RW L ++W ++ RYG G C Sbjct: 189 RVTLPKKMGGLGVKNIIRFNMALLAKWRWSLFHQNDSLWARVLYSRYGGGTNLCAQSSSR 248 Query: 184 RQQGSIWMKDLFL----LEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLF 351 R S+W +DL + LE D WF + +G+GS ++FW ++W GP CL LF Sbjct: 249 RD--SLWWRDLVVVCGGLEQDN-----WFDRKVKWSIGSGSRVRFWLDKWIGPICLASLF 301 Query: 352 PRMFSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIE 531 PR+F++S + + DMG G W W+L WRR+ F WE + ++ + L + +P Sbjct: 302 PRLFTISEQQNQFIQDMGYWTGHRWAWQLHWRRERFEWEIPLEQQLMQRLLECNPRARQV 361 Query: 532 DSWSWKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQI 708 DSW W +PS T AY + E + S++ A +W P +I Sbjct: 362 DSWWWLGEPSGT------YTVRSAYSAITSEADVGSNIG-----APSSVWSIPAPPKAKI 410 Query: 709 FSWRLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 F+WR+ LPT + L R+I+ S + CVFC ++E+ HLF +C +W Sbjct: 411 FAWRMMSRGLPTVDNLASRSIVLSENDALCVFCKSDIETDYHLFCTCPVVDKVW 464 >GAU44350.1 hypothetical protein TSUD_129240 [Trifolium subterraneum] Length = 388 Score = 174 bits (440), Expect = 4e-48 Identities = 100/291 (34%), Positives = 143/291 (49%), Gaps = 4/291 (1%) Frame = +1 Query: 10 CLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQIR- 186 C + GGLGV+D+K+ N SLLAK RWRL + +W+++++ +YG+ I + IR Sbjct: 28 CKAKRMGGLGVRDIKIVNLSLLAKWRWRLLLPGNPLWKQVLVAKYGNHILNRVIWSDIRI 87 Query: 187 -QQGSIWMKDLFLLEHDRGVPDL-WFSNALCRKLGNGSSIKFWSERWNGPTCLKDLFPRM 360 S W KD+ L D+ V W ++ RK+GNG S FWS W G L ++FPR+ Sbjct: 88 PSLASKWWKDVCSL--DKVVESKNWLGESIVRKVGNGFSTYFWSSNWIGEAPLLEVFPRL 145 Query: 361 FSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIEDSW 540 +SLS K+ V D G W W WRR+ F WE ++ E + P ED W Sbjct: 146 YSLSIHKDSMVRDFYVQEGGGWRWSFSWRRNLFQWEEDLVTRLREMVEPV-PLSLEEDYW 204 Query: 541 SWKLDPSNTXXXXXXXXXXXAYKTL-LEVQISSSLDPRVSKALGVLWQTNVPSNIQIFSW 717 W DP AY L E+++ L+ V+ +W + PS + FSW Sbjct: 205 VWSPDPEG------KFSVKSAYNFLGDELRVGEDLEEEVALVFDNIWGSPAPSKVIAFSW 258 Query: 718 RLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 +L DR+P+R L R ++ CV C VES HLF C + +W Sbjct: 259 QLLYDRIPSRRNLEARGLLCLDMPWECVGCVGSVESTTHLFLHCPSAMKVW 309 >KYP44023.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 1034 Score = 181 bits (460), Expect = 5e-48 Identities = 99/294 (33%), Positives = 151/294 (51%), Gaps = 5/294 (1%) Frame = +1 Query: 4 KACLPRNKGGLGVKDLKLFNPSLLAK*RWRLCTDQSAMWRELMLFRYGDGIAKCGTYLQI 183 + LP+ GGLGVK++ FN +LLAK RW L ++W ++ RYG G C Sbjct: 653 RVTLPKKMGGLGVKNITRFNMALLAKWRWSLFHQNDSLWARVLYSRYGGGTNLCAQSSSR 712 Query: 184 RQQGSIWMKDLFL----LEHDRGVPDLWFSNALCRKLGNGSSIKFWSERWNGPTCLKDLF 351 R SIW +DL + LE D WF + ++G+G+ +FW + W GP CL +F Sbjct: 713 RD--SIWWRDLLMVCGGLEQDN-----WFERKIKWRIGSGARARFWLDNWTGPICLASVF 765 Query: 352 PRMFSLSTSKEGCVGDMGE*RGDVWCWKLGWRRDFFAWEGAVLREFNECLSQFSPSKHIE 531 PR+F++S + + DMG W W+L WRR+ F WE + ++ + L + SP Sbjct: 766 PRLFTISEQQNHFIQDMGSWTDSSWVWQLQWRRERFEWEIQLEQQLMQQLLECSPRAEQV 825 Query: 532 DSWSWKLDPSNTXXXXXXXXXXXAYKTLL-EVQISSSLDPRVSKALGVLWQTNVPSNIQI 708 DSW W +PS T AY + EV + S+ + A ++W P ++I Sbjct: 826 DSWWWLGEPSGT------YTVRSAYSAITSEVVVGSN-----NGAPNIIWSIPAPPKVKI 874 Query: 709 FSWRLFLDRLPTRELLMRRNIISSPHGMSCVFCFREVESREHLFFSCAFSYGIW 870 F+WR+ LPT + L R+I S + CVFC +++E+ HLF +C+ +W Sbjct: 875 FAWRMMSRGLPTVDNLASRSITISDNDALCVFCKQDIETDYHLFCTCSVVDKVW 928