BLASTX nr result
ID: Glycyrrhiza29_contig00041044
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza29_contig00041044 (994 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterran... 242 1e-68 GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterran... 204 2e-60 GAU47359.1 hypothetical protein TSUD_403620 [Trifolium subterran... 199 7e-58 GAU49781.1 hypothetical protein TSUD_188300 [Trifolium subterran... 191 3e-56 GAU48983.1 hypothetical protein TSUD_245740 [Trifolium subterran... 202 7e-55 KYP57109.1 Putative ribonuclease H protein At1g65750 family [Caj... 186 1e-52 GAU36275.1 hypothetical protein TSUD_255290 [Trifolium subterran... 181 9e-52 KYP61721.1 Putative ribonuclease H protein At1g65750 family [Caj... 177 9e-51 ABN09044.1 Ribonuclease H [Medicago truncatula] 173 3e-49 GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterran... 178 1e-48 AFK48593.1 unknown [Lotus japonicus] 166 8e-46 GAU32945.1 hypothetical protein TSUD_153620 [Trifolium subterran... 165 2e-45 KYP56001.1 Putative ribonuclease H protein At1g65750 family, par... 168 3e-45 GAU39987.1 hypothetical protein TSUD_211080 [Trifolium subterran... 161 5e-45 KYP32780.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] 170 1e-43 GAU24479.1 hypothetical protein TSUD_319560 [Trifolium subterran... 158 2e-43 GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterran... 165 4e-43 GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum] 163 2e-41 GAU34195.1 hypothetical protein TSUD_162960 [Trifolium subterran... 150 5e-41 GAU16646.1 hypothetical protein TSUD_325960 [Trifolium subterran... 148 1e-40 >GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterraneum] Length = 1147 Score = 242 bits (617), Expect = 1e-68 Identities = 120/282 (42%), Positives = 171/282 (60%) Frame = +2 Query: 5 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 184 CP C + ET++H F+C A +W L HV P S++ D+ W RD S+G I+ I Sbjct: 867 CPRCTAMPETIVHCLFACTDAIGIWRACGLEHVLPPSTD-VDLFCWCRDVGKSHGCIIFI 925 Query: 185 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGT 364 I+W +W +RN IF+N ++ +V + +L+ A++ S R V W Sbjct: 926 IMWFVWCSRNDAIFNNNKAIVHNLVAKVHYMLSFCTAAFENTTSGSGGNSEHRLVVWP-R 984 Query: 365 SSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLR 544 + V LNVDGS + +GFGGL+R+ G FL GFYG+A QSSVL AE++ ++HGL Sbjct: 985 PDEGTVCLNVDGSMLGSLQTAGFGGLIRNSFGAFLKGFYGTASQSSVLYAEIMAILHGLH 1044 Query: 545 LCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGN 724 LCW +GYR ++CYS+SL AV I +GVSH H ANEI I + +++DW ++ H LREGN Sbjct: 1045 LCWNNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANEIYTIHQLLRRDWTIVIEHILREGN 1104 Query: 725 QCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 CAD LAK G+ T+ P+++++ PPP L ADA G+ F R Sbjct: 1105 ACADILAKKGSSTNSPIVIVESPPPEPSNALSADARGIVFVR 1146 >GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterraneum] Length = 298 Score = 204 bits (519), Expect = 2e-60 Identities = 100/230 (43%), Positives = 147/230 (63%) Frame = +2 Query: 161 SNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMP 340 ++G + I+LW IW RN+ +F+N+R S +I+ + LL+ + + +S +ATT P Sbjct: 69 NHGPLFFIVLWVIWCVRNEFVFNNQRESTHIIMGKIYSLLHSCEAVFTPPHSSMATTAKP 128 Query: 341 REVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEV 520 R V W ++ V LNVDGS ++ +G+GGL+RD G FL GFYG+A S+L AE+ Sbjct: 129 RLVTWT-KPAEGTVCLNVDGSLLKATNTAGYGGLIRDSNGVFLSGFYGTATVQSILFAEL 187 Query: 521 LGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLL 700 + V+HGL++CWE G+R + C+S+SL V I +GVS H +NE+ II + + +DW+ ++ Sbjct: 188 MAVLHGLQICWESGFRRITCFSDSLQIVNLIRDGVSAHHRFSNEVFIIHQLLAKDWEVVI 247 Query: 701 VHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 HT REGN CAD LAK+GA +D L+ I PP + LLADA V F R Sbjct: 248 GHTFREGNACADVLAKMGAASDSTLVTISTPPCDLSMPLLADAHVVVFIR 297 >GAU47359.1 hypothetical protein TSUD_403620 [Trifolium subterraneum] Length = 330 Score = 199 bits (505), Expect = 7e-58 Identities = 107/282 (37%), Positives = 147/282 (52%) Frame = +2 Query: 5 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 184 CP C SE++ H F+CN A VW + L HV P SS+ D W + +G I I Sbjct: 74 CPRCAIASESIEHCLFTCNDAASVWRAYGL-HVIPNSSHGVDNFTWYKKQGMKHGRIFFI 132 Query: 185 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGT 364 I+W IW RN+ IFDN R S V + L A+ + + PR V W Sbjct: 133 IMWVIWCARNEFIFDNHRQSVVTSVIKIDSLQQACAAAFGSTQTIATQSSNPRLVTW-AR 191 Query: 365 SSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLR 544 + + LNVDGS + +G+GGLLR+H G F+ GFYG+ S+L AE++ V+HGL Sbjct: 192 PMEGTICLNVDGSLLGSLNSAGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMAVLHGLT 251 Query: 545 LCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGN 724 +CWE+GYR + C S+SL + +DW+ +L HTLREG+ Sbjct: 252 ICWENGYRKINCLSDSLQLI------------------------TRDWEVVLSHTLREGS 287 Query: 725 QCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 CAD LAK+GA+ + PL+ PP + L D +GV FTR Sbjct: 288 SCADVLAKMGAVANTPLVTTSTPPRTLAKPLFEDVNGVIFTR 329 >GAU49781.1 hypothetical protein TSUD_188300 [Trifolium subterraneum] Length = 221 Score = 191 bits (485), Expect = 3e-56 Identities = 96/221 (43%), Positives = 130/221 (58%) Frame = +2 Query: 188 LWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGTS 367 +W IW RN+ IFDN R S V + L A+ + + PR V W Sbjct: 1 MWVIWCARNEFIFDNHRQSVVTSVIKIDSLQQACAAAFGSTQTIATQSSNPRLVTW-ARP 59 Query: 368 SDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRL 547 + + LNVDGS + +G+GGLLR+H G F+ GFYG+ S+L AE++ V+HGL + Sbjct: 60 MEGTICLNVDGSLLGSLNSAGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMAVLHGLTI 119 Query: 548 CWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQ 727 CWE+GYR + C S+SL V I GVS H ANEI I++ + +DW+ +L HTLREGN Sbjct: 120 CWENGYRKINCLSDSLQVVNLIRSGVSPHHRFANEILSIRQLITRDWEVVLSHTLREGNL 179 Query: 728 CADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 CAD LAK+GA+ + PL+ PP + L DA+GV FTR Sbjct: 180 CADVLAKMGAVANTPLVTTSTPPRTLPKPLFEDANGVIFTR 220 >GAU48983.1 hypothetical protein TSUD_245740 [Trifolium subterraneum] Length = 1103 Score = 202 bits (515), Expect = 7e-55 Identities = 104/246 (42%), Positives = 150/246 (60%), Gaps = 2/246 (0%) Frame = +2 Query: 119 NRSDIAKWTRDAI--HSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDID 292 N S+ T+DA S+G I+ II+W +W +RN IF+N ++ +V + +L+ Sbjct: 858 NSSNGVYTTKDADVGKSHGCIIFIIMWFVWCSRNDAIFNNNKAIVHNLVAKVHSMLSFCI 917 Query: 293 KAYQEINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLF 472 A++ S R V W ++ V LNV GS + +GFGGL+R+ FL Sbjct: 918 AAFKNTTSGSGGNSEQRLVVWP-RPAEGTVCLNVHGSMLGSLQTAGFGGLIRNSFSAFLK 976 Query: 473 GFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANE 652 GFYG+A QSSVL AE++ ++HGL LCW +GYR ++CYS+SL AV I +GVSH H ANE Sbjct: 977 GFYGTASQSSVLYAEIMAILHGLHLCWNNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANE 1036 Query: 653 ISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADAS 832 I I++ +++DW ++ H LREGN CAD LAK G+ T+ P++++ PPP + L DA Sbjct: 1037 IHPIRQLLRRDWTIVIEHILREGNACADVLAKKGSSTNSPIVIVDSPPPELSNALSVDAR 1096 Query: 833 GVSFTR 850 GV F R Sbjct: 1097 GVVFVR 1102 >KYP57109.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 365 Score = 186 bits (472), Expect = 1e-52 Identities = 100/285 (35%), Positives = 150/285 (52%), Gaps = 2/285 (0%) Frame = +2 Query: 2 SCPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVA 181 +CP C ET+LH C +W+ L L ++ I +W + + G I+ Sbjct: 82 NCPMCNAQQETLLHCLLECPRIGALWNSLGLCQ-PHLPTDSEKIKEWLKCWVEEQGSIIP 140 Query: 182 IILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAW 355 ++LW IW +RN +IF + + S + I +AY + + R R V W Sbjct: 141 VLLWVIWRSRNNMIFKGKLDKVADLKVWVSTWCSAIIRAYGGEPATGSIWQQRSTRLVRW 200 Query: 356 MGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMH 535 D V +NVDGSA+ +PG G GGL+RD+TG F+ GFYGS S+ + AE++ + Sbjct: 201 TAKEGDW-VTINVDGSALTNPGAVGVGGLVRDNTGLFMVGFYGSIGISNNIHAELVAMWR 259 Query: 536 GLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLR 715 GL LCWE GY V C S+ L V+ + + SH H A + IK+ + + W C ++H LR Sbjct: 260 GLTLCWERGYSHVCCQSDCLYVVQLLQQESSHYHRYAVLLDKIKELISRHWTCQVIHILR 319 Query: 716 EGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 EGN CAD A+ GA++ E L+++++ P M LL D +G R Sbjct: 320 EGNFCADFFARKGAVSSEGLVILEEAPVEMEELLRKDITGTCVLR 364 >GAU36275.1 hypothetical protein TSUD_255290 [Trifolium subterraneum] Length = 258 Score = 181 bits (458), Expect = 9e-52 Identities = 100/252 (39%), Positives = 142/252 (56%), Gaps = 2/252 (0%) Frame = +2 Query: 101 VAPLSSNRSDIAKWTRDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLL 280 + P S D W + +G I+ + LW +W RN IF+N + S V ++ L+ Sbjct: 9 LVPSSVQGVDRLTWCKQLGKKHGNIIFVTLWMVWCVRNNFIFNNHQESTHTSVAKSHSLV 68 Query: 281 NDIDKAYQ--EINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDH 454 N KA+ + SPLA + R V W +D+ V LNVDGS + +G+ GLLR+ Sbjct: 69 NASAKAFSLPSVVSPLAGHQ--RSVRWF-RPADEFVCLNVDGSLLGSNNTAGYDGLLRNR 125 Query: 455 TGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHC 634 G F++GFYG A ++L AE++ + +GL+LCWE G+R V C S+ L +V EGV+ Sbjct: 126 DGEFIWGFYGVAAIQNILYAEIMAIWYGLKLCWERGFRKVFCCSDYLLSVDVTKEGVTTH 185 Query: 635 HPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPL 814 H ANEI I+K + DW+ +L HTLREGN CAD L KLG +D ++ I P + Sbjct: 186 HRFANEILCIRKLLANDWEVILTHTLREGNACADVLGKLGVNSDSSMVNIYAPSQDLVIP 245 Query: 815 LLADASGVSFTR 850 L DASG+ F R Sbjct: 246 LHDDASGIEFIR 257 >KYP61721.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 219 Score = 177 bits (448), Expect = 9e-51 Identities = 90/221 (40%), Positives = 134/221 (60%) Frame = +2 Query: 188 LWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGTS 367 +W IW RN+ IFD + I+ +A+ LL A+ I+ + +PR V W+ Sbjct: 1 MWFIWCHRNRHIFDQVDWNLTSILAQANALLQFSVSAFTSIDC--SHRPLPRLVHWIHPL 58 Query: 368 SDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRL 547 D V LNVDGS I PG G+GGL ++H G FLFGFYG ++SVL+ E+L ++HGL L Sbjct: 59 VDS-VALNVDGSRIGTPGRGGYGGLCQNHEGQFLFGFYGFLGEASVLQTEILALLHGLHL 117 Query: 548 CWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQ 727 CW+ G+R ++CYS+S V + + H N++ I + + DW C +VHTL EGN Sbjct: 118 CWDKGFRKIVCYSDSTLVVSLLQGPILMFHRYGNQLMEIHQLLNCDWTCTVVHTLCEGNS 177 Query: 728 CADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 CAD LA++GA+ ++ ++++Q+ P + LLLAD+ G F R Sbjct: 178 CADALARMGALGNDRVVILQEHPMTLSSLLLADSLGTVFQR 218 >ABN09044.1 Ribonuclease H [Medicago truncatula] Length = 235 Score = 173 bits (439), Expect = 3e-49 Identities = 95/235 (40%), Positives = 138/235 (58%) Frame = +2 Query: 146 RDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLA 325 +D I + G I II+W IW +RNK IF++ + S Q I + L+ I KA+ S + Sbjct: 2 KDFISNIGPIGPIIIWKIWCSRNKCIFEDIKHSIQEIGAQVLSSLHHILKAFAHPTSH-S 60 Query: 326 TTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSV 505 + R V+W S + V LNVDG+ FGGL+RDHT +FL GF+G + + Sbjct: 61 VQQPARIVSWQRPSMNS-VALNVDGNVFLDSNLGSFGGLIRDHTSSFLHGFFGKNSRPCI 119 Query: 506 LRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQD 685 L E+ G+ HGL+LCW+ G + V+C+S+S V + + ++ H N I IKK +++D Sbjct: 120 LHVEISGLYHGLKLCWDIGIKHVVCHSDSTTVVDLVQKDLNVHHKYGNLIMAIKKLLRRD 179 Query: 686 WDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 W L HTL EGN AD LAK GA++D L+++ + PP + +LLADA GV F R Sbjct: 180 WVVSLRHTLCEGNAAADFLAKKGALSDTSLVILNEAPPDIAFVLLADAVGVKFVR 234 >GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterraneum] Length = 440 Score = 178 bits (451), Expect = 1e-48 Identities = 109/287 (37%), Positives = 150/287 (52%), Gaps = 5/287 (1%) Frame = +2 Query: 5 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 184 CP ET++H C +W T + ++ W R S + + + Sbjct: 165 CPRSDIAEETIMHCLRDCEFVKHLWKTIGFTDQTFFHGD--NLYAWLRKGCDSPSMFMFL 222 Query: 185 I-LWTIWLTRNKLIFDNERSSPQLIVYRASRLLND----IDKAYQEINSPLATTRMPREV 349 LW IW RNKL NE SP + SR + D + K Y + S LA R+ R Sbjct: 223 AALWWIWRARNKLCLANELVSP----FTISRCIEDYALLVKKCYSQQKSTLAN-RLVRWN 277 Query: 350 AWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGV 529 A GT ++LNVDGS+I +P GFGGL+R+ G ++ GF G+ S++L AE+L V Sbjct: 278 AHDGTD----MILNVDGSSIGNPEIYGFGGLIRNSHGAWIRGFAGNIGFSNILHAELLAV 333 Query: 530 MHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHT 709 HGL L W+ +D+ICYS+S A+K I + ++ H A + IK + +DW + HT Sbjct: 334 YHGLVLAWDMDIKDLICYSDSKTAIKLIGDPINEWHHFAAILQNIKDILARDWRVTVAHT 393 Query: 710 LREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 LREGN CAD+LAK GA + + PP M LLLADASG FTR Sbjct: 394 LREGNACADYLAKFGAQNIKVFSTMTTPPDGMNLLLLADASGTWFTR 440 >AFK48593.1 unknown [Lotus japonicus] Length = 272 Score = 166 bits (419), Expect = 8e-46 Identities = 97/243 (39%), Positives = 132/243 (54%), Gaps = 1/243 (0%) Frame = +2 Query: 125 SDIAKWTRDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQ 304 +D W R+ + N +V LW +W RN + + Q++ + +DI + Y Sbjct: 33 ADSKAWLREVLKENSPLVMSTLWWVWRLRNVWCMEGKLIPWQVLRGDILAMFDDIARCYA 92 Query: 305 -EINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFY 481 ++++P+ T PR V W +D VVLNVDGS P GFGG R G +L GF+ Sbjct: 93 VDVDAPMHT---PRLVRWTVGLADC-VVLNVDGSVHGTPQRGGFGGCFRTIHGNWLRGFF 148 Query: 482 GSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISI 661 G D+ +L E+LG+ HGL L WE GYR V C S+S +AV + S CH A + Sbjct: 149 GYLDECCILHLELLGMFHGLSLAWEQGYRIVECQSDSQDAVTLVKSTPSSCHRYAALVWD 208 Query: 662 IKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVS 841 IK +DW L HTLREGN CAD L K GA ++ L++ + P +G LLLADA GVS Sbjct: 209 IKDLQSRDWIVELRHTLREGNACADLLVKHGADQNDDLVITENPIAGLGVLLLADARGVS 268 Query: 842 FTR 850 F R Sbjct: 269 FVR 271 >GAU32945.1 hypothetical protein TSUD_153620 [Trifolium subterraneum] Length = 292 Score = 165 bits (418), Expect = 2e-45 Identities = 87/179 (48%), Positives = 117/179 (65%) Frame = +2 Query: 314 SPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSAD 493 SPLA + R V W +D V LNVDGS + +G+GGLLR+ G F++GFYG+A Sbjct: 116 SPLAGHQ--RRVRW-SRPADGFVCLNVDGSLLGSNNTAGYGGLLRNRDGEFIWGFYGAAA 172 Query: 494 QSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKH 673 ++L AE++ + +GL+LCWE G+R V+C S+SL +V I EGV+ H ANEI I+K Sbjct: 173 IQNILYAEIMAIWYGLKLCWERGFRKVLCCSDSLLSVNVIKEGVTTHHGFANEILCIRKL 232 Query: 674 MQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 + DW+ +L HTLREGN CAD LAKLGA +D P++ I PP + L DASG+ F R Sbjct: 233 LSNDWEVILTHTLREGNACADVLAKLGANSDSPMVNISTPPRDLVIPLHHDASGIEFIR 291 >KYP56001.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 414 Score = 168 bits (426), Expect = 3e-45 Identities = 95/284 (33%), Positives = 142/284 (50%), Gaps = 2/284 (0%) Frame = +2 Query: 5 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 184 C C ET++H F C+ +VW + W I G + Sbjct: 139 CRQCDSQEETVMHCFRDCHEVQEVWKILQFVSCDTFYQI-DNFKMWVNHGIKLGGALFLS 197 Query: 185 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAWM 358 +W IWL N+L+F+ ++ + A + + E+NS L P+ V W Sbjct: 198 TIWEIWLGWNRLVFEGSKTKAWQVALAAKSFSEAMTNVFLNHEVNSNL-----PKWVGW- 251 Query: 359 GTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHG 538 S+ V+LN DGS ++ +GFGG+LR G ++ GFYG+ D S ++ E+LG++ G Sbjct: 252 SAPSENCVILNTDGSVMEDK--AGFGGVLRSSDGVWIHGFYGNVDGSDIIGVELLGILQG 309 Query: 539 LRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLRE 718 LR+ G V C ++SL AVK+I GVSH H +N + I K + +DW + H LRE Sbjct: 310 LRIAQRLGLSRVYCQTDSLVAVKWIQGGVSHMHHYSNLVQEIHKLLDKDWAVSISHVLRE 369 Query: 719 GNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 N+CAD+ AKLG + L +PP + P+L ADA+G F R Sbjct: 370 CNKCADYFAKLGLNCPDRLTNFMEPPLDVIPMLQADAAGERFLR 413 >GAU39987.1 hypothetical protein TSUD_211080 [Trifolium subterraneum] Length = 192 Score = 161 bits (407), Expect = 5e-45 Identities = 82/187 (43%), Positives = 115/187 (61%) Frame = +2 Query: 275 LLNDIDKAYQEINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDH 454 LL+ + + +S +ATT PR V W + V LNVDGS + + +GGL+RD Sbjct: 7 LLHFCEAMFTPPHSSVATTAKPRLVTWT-KPVEGTVCLNVDGSLLGATNTASYGGLIRDS 65 Query: 455 TGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHC 634 L GFYG+ S+L AE++ V+HGL++CWE G+R + C+S+SL V I +GVS Sbjct: 66 NRVILSGFYGTTSVQSILFAELMAVLHGLQICWESGFRRITCFSDSLQTVNLIRDGVSTH 125 Query: 635 HPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPL 814 H +NE+ II + + DW+ ++ HT REGN CAD LAK+GA +D PL+ I PP + Sbjct: 126 HRSSNEVFIIHQLLANDWEVVIDHTFREGNACADVLAKMGAASDSPLVKISTPPCDLSMP 185 Query: 815 LLADASG 835 LLADA G Sbjct: 186 LLADAQG 192 >KYP32780.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] Length = 1123 Score = 170 bits (431), Expect = 1e-43 Identities = 97/284 (34%), Positives = 142/284 (50%), Gaps = 2/284 (0%) Frame = +2 Query: 5 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 184 C C ET++H F C+ +VW + W I G + Sbjct: 848 CRQCDSQEETVMHCFRDCHEVQEVWRILQFVSCDTFYQI-DNFKMWVNHGIKLGGALFLS 906 Query: 185 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAWM 358 +W IWL RN+L+F+ ++ + A L + + E+NS L P+ V W Sbjct: 907 TIWEIWLGRNRLVFEGSKTKAWQVALAAKSLSEAMTNVFLNHEVNSNL-----PKWVGW- 960 Query: 359 GTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHG 538 S+ V+LN DGS ++ +GFGG+LR G ++ GF G+ D ++ E+LG++ G Sbjct: 961 SAPSENCVILNTDGSVMEDK--AGFGGVLRSSNGAWIHGFCGNVDGYEIIGVELLGILQG 1018 Query: 539 LRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLRE 718 LR+ G V C +NSL AVK+I GVSH H +N + I K + +DW + H LRE Sbjct: 1019 LRIAQRLGLSRVYCQTNSLAAVKWIQGGVSHMHHYSNLVQEIHKLLDKDWAVSISHVLRE 1078 Query: 719 GNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 N+CAD+ AKLG + L +PP + PLL ADA+G F R Sbjct: 1079 CNKCADYFAKLGLNCPDRLTNFMEPPLDVIPLLQADAAGERFLR 1122 >GAU24479.1 hypothetical protein TSUD_319560 [Trifolium subterraneum] Length = 227 Score = 158 bits (399), Expect = 2e-43 Identities = 81/200 (40%), Positives = 121/200 (60%), Gaps = 4/200 (2%) Frame = +2 Query: 185 ILWTIWLTRNKLIFDNE----RSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVA 352 ++W IW RN +IF+++ + Q I+ + + L + K ++ + + PR V Sbjct: 19 VVWAIWRVRNDVIFNSKVPVIEEAFQGIISLSWKWLRE--KKKKKKKGAVTQSSNPRLVT 76 Query: 353 WMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVM 532 W + + LNVDG+ + + G+GGLLR+H G F+ GFYG+ S+L AE++ V+ Sbjct: 77 W-ARPMEGTICLNVDGNLLGSLNYLGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMVVL 135 Query: 533 HGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTL 712 HGL +CWE+GYR + C SNSL V I GVS H ANEI I++ + +DW+ +L HTL Sbjct: 136 HGLTICWENGYRKINCLSNSLQLVNLIRSGVSLHHRFANEILSIRRLITRDWEVVLSHTL 195 Query: 713 REGNQCADHLAKLGAMTDEP 772 REGN CAD LAK+G + + P Sbjct: 196 REGNSCADVLAKMGVVANTP 215 >GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterraneum] Length = 545 Score = 165 bits (418), Expect = 4e-43 Identities = 102/283 (36%), Positives = 144/283 (50%), Gaps = 1/283 (0%) Frame = +2 Query: 5 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHS-NGIIVA 181 CP C E+ LH +C + W + ++ W R++I + + Sbjct: 270 CPRCNIEEESTLHCLRNCEFIKRFWKAIGF--LGQTFFQGDNLNDWLRNSIDGPSSFLFM 327 Query: 182 IILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMG 361 +W IW RN+L DNE S + L + + + N +T M R A G Sbjct: 328 AAVWWIWCARNQLCMDNEAISYFTLRTNTENLAQLLRMCFIKQNIS-STATMVRWNAHGG 386 Query: 362 TSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGL 541 ++LNVDGS+I +PG SGFGGL+R+ G ++ GF G+ ++L+AE+L + HGL Sbjct: 387 IG----MILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHLNILQAELLAIYHGL 442 Query: 542 RLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREG 721 L WE +D+ CYS+S A+K I + V+ H A I IK + ++W LVH LREG Sbjct: 443 VLAWELDIKDLCCYSDSKTALKLIYDHVNEWHQYAAIIYNIKDFLSRNWRVRLVHMLREG 502 Query: 722 NQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 N CAD L K GA + I PP M LLLADASG F+R Sbjct: 503 NNCADILDKFGARNPKAYCSIAVPPDGMSLLLLADASGTIFSR 545 >GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum] Length = 724 Score = 163 bits (412), Expect = 2e-41 Identities = 94/268 (35%), Positives = 141/268 (52%), Gaps = 3/268 (1%) Frame = +2 Query: 56 CNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAII-LWTIWLTRNKLIFDN 232 CN +W T D + W R+ + + + + + +W IW TRN L DN Sbjct: 466 CNFVYTIWKSLGFTDRNFFQE--VDSSSWLRNGLSCSSMFLFMAAIWWIWRTRNALCLDN 523 Query: 233 ERSSPQLIVYRASRLLNDIDKAYQEINSPL--ATTRMPREVAWMGTSSDQRVVLNVDGSA 406 E ++ + S + +D A N T +P+ V W ++LNVDGS Sbjct: 524 E------LIPQFSLKMRIVDYALLLKNCHFNHQVTTLPKIVRWNALGGTS-MILNVDGST 576 Query: 407 IQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYS 586 I +PG SGFGGL+R+ G ++ GF+G+ +++L AE++ ++ GL L WE +D++CYS Sbjct: 577 IGNPGISGFGGLIRNADGAWIHGFFGNLGVTNILHAELMAILKGLLLAWELNIKDLLCYS 636 Query: 587 NSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTD 766 +S A+K I E V H A ++ IK + +DW + HT REGN CAD+LAK GA + Sbjct: 637 DSATAIKLITEPVDVWHHYAAILNNIKDILNRDWQVSIFHTFREGNACADYLAKHGAHNN 696 Query: 767 EPLLVIQQPPPAMGPLLLADASGVSFTR 850 I PP + LLAD SG+ F+R Sbjct: 697 IVFTTIAIPPAGLNLHLLADVSGIIFSR 724 >GAU34195.1 hypothetical protein TSUD_162960 [Trifolium subterraneum] Length = 168 Score = 150 bits (378), Expect = 5e-41 Identities = 78/157 (49%), Positives = 101/157 (64%) Frame = +2 Query: 380 VVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEH 559 ++LNVDGS+I +PG SGFGGL+R+ G ++ GF G+ S++L AE+L + HGL L WE Sbjct: 12 MILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHSNILHAELLAIYHGLVLAWEL 71 Query: 560 GYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQCADH 739 +D+ CYS+S A+K I + V+ H A I IK + ++W LVHTLREGN CAD Sbjct: 72 DIKDLCCYSDSKTALKLIYDHVNEWHHYAAIIYNIKDFLSRNWRVRLVHTLREGNNCADF 131 Query: 740 LAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 LAK GA E I PP M LLLADASG F+R Sbjct: 132 LAKFGARNPEAYSSIAVPPDEMNLLLLADASGTIFSR 168 >GAU16646.1 hypothetical protein TSUD_325960 [Trifolium subterraneum] Length = 157 Score = 148 bits (374), Expect = 1e-40 Identities = 77/157 (49%), Positives = 101/157 (64%) Frame = +2 Query: 380 VVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEH 559 ++LNVDGS+I +PG SGFGGL+R+ G ++ GF G+ S++L AE+L + HGL L WE Sbjct: 1 MILNVDGSSIGNPGISGFGGLIRNSDGAWIHGFAGNIGHSNILHAELLAIYHGLVLAWEL 60 Query: 560 GYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQCADH 739 +D+ CYS+S A+K I + V+ H A I IK + ++W LVHTLREGN CAD Sbjct: 61 DIKDLCCYSDSKTALKLIYDHVNEWHHYAAIIYNIKDFLSRNWRVRLVHTLREGNNCADF 120 Query: 740 LAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850 LAK GA + E I PP M LLLAD SG F+R Sbjct: 121 LAKFGARSPEAYSSIVVPPDGMNLLLLADDSGTIFSR 157