BLASTX nr result
ID: Glycyrrhiza36_contig00036160
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Glycyrrhiza36_contig00036160 (994 letters) Database: ./nr 115,041,592 sequences; 42,171,959,267 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterran... 244 1e-69 GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterran... 204 2e-60 GAU47359.1 hypothetical protein TSUD_403620 [Trifolium subterran... 201 6e-59 GAU49781.1 hypothetical protein TSUD_188300 [Trifolium subterran... 194 2e-57 GAU48983.1 hypothetical protein TSUD_245740 [Trifolium subterran... 205 8e-56 GAU36275.1 hypothetical protein TSUD_255290 [Trifolium subterran... 180 1e-51 KYP57109.1 Putative ribonuclease H protein At1g65750 family [Caj... 182 4e-51 KYP61721.1 Putative ribonuclease H protein At1g65750 family [Caj... 173 3e-49 ABN09044.1 Ribonuclease H [Medicago truncatula] 173 4e-49 GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterran... 174 3e-47 KYP56001.1 Putative ribonuclease H protein At1g65750 family, par... 168 3e-45 GAU32945.1 hypothetical protein TSUD_153620 [Trifolium subterran... 165 3e-45 GAU39987.1 hypothetical protein TSUD_211080 [Trifolium subterran... 161 5e-45 GAU24479.1 hypothetical protein TSUD_319560 [Trifolium subterran... 160 2e-44 AFK48593.1 unknown [Lotus japonicus] 162 3e-44 KYP32780.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] 170 1e-43 GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterran... 161 1e-41 KYP78366.1 Putative ribonuclease H protein At1g65750 family [Caj... 160 3e-40 KYP64035.1 Putative ribonuclease H protein At1g65750 family [Caj... 148 4e-40 GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum] 159 4e-40 >GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterraneum] Length = 1147 Score = 244 bits (624), Expect = 1e-69 Identities = 121/282 (42%), Positives = 172/282 (60%) Frame = -2 Query: 990 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811 CP C + ET++H F+C A +W L HV P S++ D+ W RD S+G I+ I Sbjct: 867 CPRCTAMPETIVHCLFACTDAIGIWRACGLEHVLPPSTD-VDLFCWCRDVGKSHGCIIFI 925 Query: 810 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGT 631 I+W +W +RN IF+N ++ +V + +L+ A++ S R V W Sbjct: 926 IMWFVWCSRNDAIFNNNKAIVHNLVAKVHYMLSFCTAAFENTTSGSGGNSEHRLVVWP-R 984 Query: 630 SSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLR 451 + V LNVDGS + L +GFGGL+R+ G FL GFYG+A QSSVL AE++ ++HGL Sbjct: 985 PDEGTVCLNVDGSMLGSLQTAGFGGLIRNSFGAFLKGFYGTASQSSVLYAEIMAILHGLH 1044 Query: 450 LCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGN 271 LCW +GYR ++CYS+SL AV I +GVSH H ANEI I + +++DW ++ H LREGN Sbjct: 1045 LCWNNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANEIYTIHQLLRRDWTIVIEHILREGN 1104 Query: 270 QCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 CAD LAK G+ T+ P+++++ PPP L ADA G+ F R Sbjct: 1105 ACADILAKKGSSTNSPIVIVESPPPEPSNALSADARGIVFVR 1146 >GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterraneum] Length = 298 Score = 204 bits (519), Expect = 2e-60 Identities = 100/230 (43%), Positives = 147/230 (63%) Frame = -2 Query: 834 SNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMP 655 ++G + I+LW IW RN+ +F+N+R S +I+ + LL+ + + +S +ATT P Sbjct: 69 NHGPLFFIVLWVIWCVRNEFVFNNQRESTHIIMGKIYSLLHSCEAVFTPPHSSMATTAKP 128 Query: 654 REVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEV 475 R V W ++ V LNVDGS ++ +G+GGL+RD G FL GFYG+A S+L AE+ Sbjct: 129 RLVTWT-KPAEGTVCLNVDGSLLKATNTAGYGGLIRDSNGVFLSGFYGTATVQSILFAEL 187 Query: 474 LGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLL 295 + V+HGL++CWE G+R + C+S+SL V I +GVS H +NE+ II + + +DW+ ++ Sbjct: 188 MAVLHGLQICWESGFRRITCFSDSLQIVNLIRDGVSAHHRFSNEVFIIHQLLAKDWEVVI 247 Query: 294 VHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 HT REGN CAD LAK+GA +D L+ I PP + LLADA V F R Sbjct: 248 GHTFREGNACADVLAKMGAASDSTLVTISTPPCDLSMPLLADAHVVVFIR 297 >GAU47359.1 hypothetical protein TSUD_403620 [Trifolium subterraneum] Length = 330 Score = 201 bits (512), Expect = 6e-59 Identities = 108/282 (38%), Positives = 148/282 (52%) Frame = -2 Query: 990 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811 CP C SE++ H F+CN A VW + L HV P SS+ D W + +G I I Sbjct: 74 CPRCAIASESIEHCLFTCNDAASVWRAYGL-HVIPNSSHGVDNFTWYKKQGMKHGRIFFI 132 Query: 810 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGT 631 I+W IW RN+ IFDN R S V + L A+ + + PR V W Sbjct: 133 IMWVIWCARNEFIFDNHRQSVVTSVIKIDSLQQACAAAFGSTQTIATQSSNPRLVTW-AR 191 Query: 630 SSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLR 451 + + LNVDGS + L +G+GGLLR+H G F+ GFYG+ S+L AE++ V+HGL Sbjct: 192 PMEGTICLNVDGSLLGSLNSAGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMAVLHGLT 251 Query: 450 LCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGN 271 +CWE+GYR + C S+SL + +DW+ +L HTLREG+ Sbjct: 252 ICWENGYRKINCLSDSLQLI------------------------TRDWEVVLSHTLREGS 287 Query: 270 QCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 CAD LAK+GA+ + PL+ PP + L D +GV FTR Sbjct: 288 SCADVLAKMGAVANTPLVTTSTPPRTLAKPLFEDVNGVIFTR 329 >GAU49781.1 hypothetical protein TSUD_188300 [Trifolium subterraneum] Length = 221 Score = 194 bits (492), Expect = 2e-57 Identities = 97/221 (43%), Positives = 131/221 (59%) Frame = -2 Query: 807 LWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGTS 628 +W IW RN+ IFDN R S V + L A+ + + PR V W Sbjct: 1 MWVIWCARNEFIFDNHRQSVVTSVIKIDSLQQACAAAFGSTQTIATQSSNPRLVTW-ARP 59 Query: 627 SDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRL 448 + + LNVDGS + L +G+GGLLR+H G F+ GFYG+ S+L AE++ V+HGL + Sbjct: 60 MEGTICLNVDGSLLGSLNSAGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMAVLHGLTI 119 Query: 447 CWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQ 268 CWE+GYR + C S+SL V I GVS H ANEI I++ + +DW+ +L HTLREGN Sbjct: 120 CWENGYRKINCLSDSLQVVNLIRSGVSPHHRFANEILSIRQLITRDWEVVLSHTLREGNL 179 Query: 267 CADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 CAD LAK+GA+ + PL+ PP + L DA+GV FTR Sbjct: 180 CADVLAKMGAVANTPLVTTSTPPRTLPKPLFEDANGVIFTR 220 >GAU48983.1 hypothetical protein TSUD_245740 [Trifolium subterraneum] Length = 1103 Score = 205 bits (522), Expect = 8e-56 Identities = 105/246 (42%), Positives = 151/246 (61%), Gaps = 2/246 (0%) Frame = -2 Query: 876 NRSDIAKWTRDAI--HSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDID 703 N S+ T+DA S+G I+ II+W +W +RN IF+N ++ +V + +L+ Sbjct: 858 NSSNGVYTTKDADVGKSHGCIIFIIMWFVWCSRNDAIFNNNKAIVHNLVAKVHSMLSFCI 917 Query: 702 KAYQEINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLF 523 A++ S R V W ++ V LNV GS + L +GFGGL+R+ FL Sbjct: 918 AAFKNTTSGSGGNSEQRLVVWP-RPAEGTVCLNVHGSMLGSLQTAGFGGLIRNSFSAFLK 976 Query: 522 GFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANE 343 GFYG+A QSSVL AE++ ++HGL LCW +GYR ++CYS+SL AV I +GVSH H ANE Sbjct: 977 GFYGTASQSSVLYAEIMAILHGLHLCWNNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANE 1036 Query: 342 ISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADAS 163 I I++ +++DW ++ H LREGN CAD LAK G+ T+ P++++ PPP + L DA Sbjct: 1037 IHPIRQLLRRDWTIVIEHILREGNACADVLAKKGSSTNSPIVIVDSPPPELSNALSVDAR 1096 Query: 162 GVSFTR 145 GV F R Sbjct: 1097 GVVFVR 1102 >GAU36275.1 hypothetical protein TSUD_255290 [Trifolium subterraneum] Length = 258 Score = 180 bits (457), Expect = 1e-51 Identities = 100/252 (39%), Positives = 142/252 (56%), Gaps = 2/252 (0%) Frame = -2 Query: 894 VAPLSSNRSDIAKWTRDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLL 715 + P S D W + +G I+ + LW +W RN IF+N + S V ++ L+ Sbjct: 9 LVPSSVQGVDRLTWCKQLGKKHGNIIFVTLWMVWCVRNNFIFNNHQESTHTSVAKSHSLV 68 Query: 714 NDIDKAYQ--EINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDH 541 N KA+ + SPLA + R V W +D+ V LNVDGS + +G+ GLLR+ Sbjct: 69 NASAKAFSLPSVVSPLAGHQ--RSVRWF-RPADEFVCLNVDGSLLGSNNTAGYDGLLRNR 125 Query: 540 TGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHC 361 G F++GFYG A ++L AE++ + +GL+LCWE G+R V C S+ L +V EGV+ Sbjct: 126 DGEFIWGFYGVAAIQNILYAEIMAIWYGLKLCWERGFRKVFCCSDYLLSVDVTKEGVTTH 185 Query: 360 HPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPL 181 H ANEI I+K + DW+ +L HTLREGN CAD L KLG +D ++ I P + Sbjct: 186 HRFANEILCIRKLLANDWEVILTHTLREGNACADVLGKLGVNSDSSMVNIYAPSQDLVIP 245 Query: 180 LLADASGVSFTR 145 L DASG+ F R Sbjct: 246 LHDDASGIEFIR 257 >KYP57109.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 365 Score = 182 bits (462), Expect = 4e-51 Identities = 99/285 (34%), Positives = 149/285 (52%), Gaps = 2/285 (0%) Frame = -2 Query: 993 SCPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVA 814 +CP C ET+LH C +W+ L L ++ I +W + + G I+ Sbjct: 82 NCPMCNAQQETLLHCLLECPRIGALWNSLGLCQ-PHLPTDSEKIKEWLKCWVEEQGSIIP 140 Query: 813 IILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAW 640 ++LW IW +RN +IF + + S + I +AY + + R R V W Sbjct: 141 VLLWVIWRSRNNMIFKGKLDKVADLKVWVSTWCSAIIRAYGGEPATGSIWQQRSTRLVRW 200 Query: 639 MGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMH 460 D V +NVDGSA+ + G G GGL+RD+TG F+ GFYGS S+ + AE++ + Sbjct: 201 TAKEGDW-VTINVDGSALTNPGAVGVGGLVRDNTGLFMVGFYGSIGISNNIHAELVAMWR 259 Query: 459 GLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLR 280 GL LCWE GY V C S+ L V+ + + SH H A + IK+ + + W C ++H LR Sbjct: 260 GLTLCWERGYSHVCCQSDCLYVVQLLQQESSHYHRYAVLLDKIKELISRHWTCQVIHILR 319 Query: 279 EGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 EGN CAD A+ GA++ E L+++++ P M LL D +G R Sbjct: 320 EGNFCADFFARKGAVSSEGLVILEEAPVEMEELLRKDITGTCVLR 364 >KYP61721.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 219 Score = 173 bits (438), Expect = 3e-49 Identities = 89/221 (40%), Positives = 133/221 (60%) Frame = -2 Query: 807 LWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGTS 628 +W IW RN+ IFD + I+ +A+ LL A+ I+ + +PR V W+ Sbjct: 1 MWFIWCHRNRHIFDQVDWNLTSILAQANALLQFSVSAFTSIDC--SHRPLPRLVHWIHPL 58 Query: 627 SDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRL 448 D V LNVDGS I G G+GGL ++H G FLFGFYG ++SVL+ E+L ++HGL L Sbjct: 59 VDS-VALNVDGSRIGTPGRGGYGGLCQNHEGQFLFGFYGFLGEASVLQTEILALLHGLHL 117 Query: 447 CWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQ 268 CW+ G+R ++CYS+S V + + H N++ I + + DW C +VHTL EGN Sbjct: 118 CWDKGFRKIVCYSDSTLVVSLLQGPILMFHRYGNQLMEIHQLLNCDWTCTVVHTLCEGNS 177 Query: 267 CADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 CAD LA++GA+ ++ ++++Q+ P + LLLAD+ G F R Sbjct: 178 CADALARMGALGNDRVVILQEHPMTLSSLLLADSLGTVFQR 218 >ABN09044.1 Ribonuclease H [Medicago truncatula] Length = 235 Score = 173 bits (438), Expect = 4e-49 Identities = 95/235 (40%), Positives = 138/235 (58%) Frame = -2 Query: 849 RDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLA 670 +D I + G I II+W IW +RNK IF++ + S Q I + L+ I KA+ S + Sbjct: 2 KDFISNIGPIGPIIIWKIWCSRNKCIFEDIKHSIQEIGAQVLSSLHHILKAFAHPTSH-S 60 Query: 669 TTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSV 490 + R V+W S + V LNVDG+ FGGL+RDHT +FL GF+G + + Sbjct: 61 VQQPARIVSWQRPSMNS-VALNVDGNVFLDSNLGSFGGLIRDHTSSFLHGFFGKNSRPCI 119 Query: 489 LRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQD 310 L E+ G+ HGL+LCW+ G + V+C+S+S V + + ++ H N I IKK +++D Sbjct: 120 LHVEISGLYHGLKLCWDIGIKHVVCHSDSTTVVDLVQKDLNVHHKYGNLIMAIKKLLRRD 179 Query: 309 WDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 W L HTL EGN AD LAK GA++D L+++ + PP + +LLADA GV F R Sbjct: 180 WVVSLRHTLCEGNAAADFLAKKGALSDTSLVILNEAPPDIAFVLLADAVGVKFVR 234 >GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterraneum] Length = 440 Score = 174 bits (441), Expect = 3e-47 Identities = 108/287 (37%), Positives = 149/287 (51%), Gaps = 5/287 (1%) Frame = -2 Query: 990 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811 CP ET++H C +W T + ++ W R S + + + Sbjct: 165 CPRSDIAEETIMHCLRDCEFVKHLWKTIGFTDQTFFHGD--NLYAWLRKGCDSPSMFMFL 222 Query: 810 I-LWTIWLTRNKLIFDNERSSPQLIVYRASRLLND----IDKAYQEINSPLATTRMPREV 646 LW IW RNKL NE SP + SR + D + K Y + S LA R+ R Sbjct: 223 AALWWIWRARNKLCLANELVSP----FTISRCIEDYALLVKKCYSQQKSTLAN-RLVRWN 277 Query: 645 AWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGV 466 A GT ++LNVDGS+I + GFGGL+R+ G ++ GF G+ S++L AE+L V Sbjct: 278 AHDGTD----MILNVDGSSIGNPEIYGFGGLIRNSHGAWIRGFAGNIGFSNILHAELLAV 333 Query: 465 MHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHT 286 HGL L W+ +D+ICYS+S A+K I + ++ H A + IK + +DW + HT Sbjct: 334 YHGLVLAWDMDIKDLICYSDSKTAIKLIGDPINEWHHFAAILQNIKDILARDWRVTVAHT 393 Query: 285 LREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 LREGN CAD+LAK GA + + PP M LLLADASG FTR Sbjct: 394 LREGNACADYLAKFGAQNIKVFSTMTTPPDGMNLLLLADASGTWFTR 440 >KYP56001.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus cajan] Length = 414 Score = 168 bits (426), Expect = 3e-45 Identities = 95/284 (33%), Positives = 142/284 (50%), Gaps = 2/284 (0%) Frame = -2 Query: 990 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811 C C ET++H F C+ +VW + W I G + Sbjct: 139 CRQCDSQEETVMHCFRDCHEVQEVWKILQFVSCDTFYQI-DNFKMWVNHGIKLGGALFLS 197 Query: 810 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAWM 637 +W IWL N+L+F+ ++ + A + + E+NS L P+ V W Sbjct: 198 TIWEIWLGWNRLVFEGSKTKAWQVALAAKSFSEAMTNVFLNHEVNSNL-----PKWVGW- 251 Query: 636 GTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHG 457 S+ V+LN DGS ++ +GFGG+LR G ++ GFYG+ D S ++ E+LG++ G Sbjct: 252 SAPSENCVILNTDGSVMED--KAGFGGVLRSSDGVWIHGFYGNVDGSDIIGVELLGILQG 309 Query: 456 LRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLRE 277 LR+ G V C ++SL AVK+I GVSH H +N + I K + +DW + H LRE Sbjct: 310 LRIAQRLGLSRVYCQTDSLVAVKWIQGGVSHMHHYSNLVQEIHKLLDKDWAVSISHVLRE 369 Query: 276 GNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 N+CAD+ AKLG + L +PP + P+L ADA+G F R Sbjct: 370 CNKCADYFAKLGLNCPDRLTNFMEPPLDVIPMLQADAAGERFLR 413 >GAU32945.1 hypothetical protein TSUD_153620 [Trifolium subterraneum] Length = 292 Score = 165 bits (417), Expect = 3e-45 Identities = 87/179 (48%), Positives = 117/179 (65%) Frame = -2 Query: 681 SPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSAD 502 SPLA + R V W +D V LNVDGS + +G+GGLLR+ G F++GFYG+A Sbjct: 116 SPLAGHQ--RRVRW-SRPADGFVCLNVDGSLLGSNNTAGYGGLLRNRDGEFIWGFYGAAA 172 Query: 501 QSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKH 322 ++L AE++ + +GL+LCWE G+R V+C S+SL +V I EGV+ H ANEI I+K Sbjct: 173 IQNILYAEIMAIWYGLKLCWERGFRKVLCCSDSLLSVNVIKEGVTTHHGFANEILCIRKL 232 Query: 321 MQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 + DW+ +L HTLREGN CAD LAKLGA +D P++ I PP + L DASG+ F R Sbjct: 233 LSNDWEVILTHTLREGNACADVLAKLGANSDSPMVNISTPPRDLVIPLHHDASGIEFIR 291 >GAU39987.1 hypothetical protein TSUD_211080 [Trifolium subterraneum] Length = 192 Score = 161 bits (407), Expect = 5e-45 Identities = 82/187 (43%), Positives = 115/187 (61%) Frame = -2 Query: 720 LLNDIDKAYQEINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDH 541 LL+ + + +S +ATT PR V W + V LNVDGS + + +GGL+RD Sbjct: 7 LLHFCEAMFTPPHSSVATTAKPRLVTWT-KPVEGTVCLNVDGSLLGATNTASYGGLIRDS 65 Query: 540 TGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHC 361 L GFYG+ S+L AE++ V+HGL++CWE G+R + C+S+SL V I +GVS Sbjct: 66 NRVILSGFYGTTSVQSILFAELMAVLHGLQICWESGFRRITCFSDSLQTVNLIRDGVSTH 125 Query: 360 HPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPL 181 H +NE+ II + + DW+ ++ HT REGN CAD LAK+GA +D PL+ I PP + Sbjct: 126 HRSSNEVFIIHQLLANDWEVVIDHTFREGNACADVLAKMGAASDSPLVKISTPPCDLSMP 185 Query: 180 LLADASG 160 LLADA G Sbjct: 186 LLADAQG 192 >GAU24479.1 hypothetical protein TSUD_319560 [Trifolium subterraneum] Length = 227 Score = 160 bits (406), Expect = 2e-44 Identities = 82/200 (41%), Positives = 122/200 (61%), Gaps = 4/200 (2%) Frame = -2 Query: 810 ILWTIWLTRNKLIFDNE----RSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVA 643 ++W IW RN +IF+++ + Q I+ + + L + K ++ + + PR V Sbjct: 19 VVWAIWRVRNDVIFNSKVPVIEEAFQGIISLSWKWLRE--KKKKKKKGAVTQSSNPRLVT 76 Query: 642 WMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVM 463 W + + LNVDG+ + L + G+GGLLR+H G F+ GFYG+ S+L AE++ V+ Sbjct: 77 W-ARPMEGTICLNVDGNLLGSLNYLGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMVVL 135 Query: 462 HGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTL 283 HGL +CWE+GYR + C SNSL V I GVS H ANEI I++ + +DW+ +L HTL Sbjct: 136 HGLTICWENGYRKINCLSNSLQLVNLIRSGVSLHHRFANEILSIRRLITRDWEVVLSHTL 195 Query: 282 REGNQCADHLAKLGAMTDEP 223 REGN CAD LAK+G + + P Sbjct: 196 REGNSCADVLAKMGVVANTP 215 >AFK48593.1 unknown [Lotus japonicus] Length = 272 Score = 162 bits (409), Expect = 3e-44 Identities = 96/243 (39%), Positives = 131/243 (53%), Gaps = 1/243 (0%) Frame = -2 Query: 870 SDIAKWTRDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQ 691 +D W R+ + N +V LW +W RN + + Q++ + +DI + Y Sbjct: 33 ADSKAWLREVLKENSPLVMSTLWWVWRLRNVWCMEGKLIPWQVLRGDILAMFDDIARCYA 92 Query: 690 -EINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFY 514 ++++P+ T PR V W +D VVLNVDGS GFGG R G +L GF+ Sbjct: 93 VDVDAPMHT---PRLVRWTVGLADC-VVLNVDGSVHGTPQRGGFGGCFRTIHGNWLRGFF 148 Query: 513 GSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISI 334 G D+ +L E+LG+ HGL L WE GYR V C S+S +AV + S CH A + Sbjct: 149 GYLDECCILHLELLGMFHGLSLAWEQGYRIVECQSDSQDAVTLVKSTPSSCHRYAALVWD 208 Query: 333 IKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVS 154 IK +DW L HTLREGN CAD L K GA ++ L++ + P +G LLLADA GVS Sbjct: 209 IKDLQSRDWIVELRHTLREGNACADLLVKHGADQNDDLVITENPIAGLGVLLLADARGVS 268 Query: 153 FTR 145 F R Sbjct: 269 FVR 271 >KYP32780.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan] Length = 1123 Score = 170 bits (431), Expect = 1e-43 Identities = 97/284 (34%), Positives = 142/284 (50%), Gaps = 2/284 (0%) Frame = -2 Query: 990 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811 C C ET++H F C+ +VW + W I G + Sbjct: 848 CRQCDSQEETVMHCFRDCHEVQEVWRILQFVSCDTFYQI-DNFKMWVNHGIKLGGALFLS 906 Query: 810 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAWM 637 +W IWL RN+L+F+ ++ + A L + + E+NS L P+ V W Sbjct: 907 TIWEIWLGRNRLVFEGSKTKAWQVALAAKSLSEAMTNVFLNHEVNSNL-----PKWVGW- 960 Query: 636 GTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHG 457 S+ V+LN DGS ++ +GFGG+LR G ++ GF G+ D ++ E+LG++ G Sbjct: 961 SAPSENCVILNTDGSVMED--KAGFGGVLRSSNGAWIHGFCGNVDGYEIIGVELLGILQG 1018 Query: 456 LRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLRE 277 LR+ G V C +NSL AVK+I GVSH H +N + I K + +DW + H LRE Sbjct: 1019 LRIAQRLGLSRVYCQTNSLAAVKWIQGGVSHMHHYSNLVQEIHKLLDKDWAVSISHVLRE 1078 Query: 276 GNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 N+CAD+ AKLG + L +PP + PLL ADA+G F R Sbjct: 1079 CNKCADYFAKLGLNCPDRLTNFMEPPLDVIPLLQADAAGERFLR 1122 >GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterraneum] Length = 545 Score = 161 bits (408), Expect = 1e-41 Identities = 101/283 (35%), Positives = 143/283 (50%), Gaps = 1/283 (0%) Frame = -2 Query: 990 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHS-NGIIVA 814 CP C E+ LH +C + W + ++ W R++I + + Sbjct: 270 CPRCNIEEESTLHCLRNCEFIKRFWKAIGF--LGQTFFQGDNLNDWLRNSIDGPSSFLFM 327 Query: 813 IILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMG 634 +W IW RN+L DNE S + L + + + N +T M R A G Sbjct: 328 AAVWWIWCARNQLCMDNEAISYFTLRTNTENLAQLLRMCFIKQNIS-STATMVRWNAHGG 386 Query: 633 TSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGL 454 ++LNVDGS+I + G SGFGGL+R+ G ++ GF G+ ++L+AE+L + HGL Sbjct: 387 IG----MILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHLNILQAELLAIYHGL 442 Query: 453 RLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREG 274 L WE +D+ CYS+S A+K I + V+ H A I IK + ++W LVH LREG Sbjct: 443 VLAWELDIKDLCCYSDSKTALKLIYDHVNEWHQYAAIIYNIKDFLSRNWRVRLVHMLREG 502 Query: 273 NQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 N CAD L K GA + I PP M LLLADASG F+R Sbjct: 503 NNCADILDKFGARNPKAYCSIAVPPDGMSLLLLADASGTIFSR 545 >KYP78366.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 1090 Score = 160 bits (406), Expect = 3e-40 Identities = 95/284 (33%), Positives = 140/284 (49%), Gaps = 2/284 (0%) Frame = -2 Query: 990 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811 C C ET++H F + +VW S + W I G + Sbjct: 815 CRQCGSQEETVMHCFRDSHEVQEVWRILQFVSCDTFSQI-DNFKMWVIHGIKLGGALFLS 873 Query: 810 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAWM 637 +W IWL RN+L+F+ ++ + A L + + E+N L P+ V W+ Sbjct: 874 TIWEIWLGRNRLVFEGSKTKAWQVALAAKSLSEAMTNVFLNHEVNRNL-----PKWVGWL 928 Query: 636 GTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHG 457 S + V+LN DGS + +GFGG+LR G ++ GF G+ D S ++ E+LG++ G Sbjct: 929 DPS-ENCVILNTDGSVMDD--KAGFGGVLRSSDGVWIHGFCGNMDGSEIIGVELLGILQG 985 Query: 456 LRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLRE 277 LR+ G V C ++SL AVK+I G SH H +N + I K + +DW + H LRE Sbjct: 986 LRIAQILGLSRVYCQTDSLVAVKWIKGGESHMHHYSNLVQEIHKLLDKDWAVSISHVLRE 1045 Query: 276 GNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145 N+CAD+ AKLG + L +PP + PLL ADA G F R Sbjct: 1046 CNKCADYFAKLGLRCPDRLTNFMEPPLDVIPLLQADADGERFLR 1089 >KYP64035.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan] Length = 190 Score = 148 bits (374), Expect = 4e-40 Identities = 79/190 (41%), Positives = 109/190 (57%) Frame = -2 Query: 807 LWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGTS 628 +W IW RN+LIFD + I+ + + LL A+ I+ + +PR V W+ Sbjct: 1 MWFIWCHRNRLIFDQVDWNLTSILAQVNALLQISVSAFTSIDC--SHRPLPRLVHWIHPP 58 Query: 627 SDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRL 448 D V LNVDGS I LG GFGGL R+H G FLFGFYG + SVL+ +L +++GLRL Sbjct: 59 LDS-VALNVDGSRIGTLGRGGFGGLCRNHEGQFLFGFYGFLGEVSVLQTVILALLYGLRL 117 Query: 447 CWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQ 268 CW+ +R +ICYS+S V + + H N++ I + + DW C +VHTLREGN Sbjct: 118 CWDKWFRKIICYSDSTLVVSLLQGPIPMFHRYENQLMEIHQLLNCDWTCTVVHTLREGNS 177 Query: 267 CADHLAKLGA 238 CAD G+ Sbjct: 178 CADAFGSNGS 187 >GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum] Length = 724 Score = 159 bits (402), Expect = 4e-40 Identities = 93/268 (34%), Positives = 140/268 (52%), Gaps = 3/268 (1%) Frame = -2 Query: 939 CNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAII-LWTIWLTRNKLIFDN 763 CN +W T D + W R+ + + + + + +W IW TRN L DN Sbjct: 466 CNFVYTIWKSLGFTDRNFFQE--VDSSSWLRNGLSCSSMFLFMAAIWWIWRTRNALCLDN 523 Query: 762 ERSSPQLIVYRASRLLNDIDKAYQEINSPL--ATTRMPREVAWMGTSSDQRVVLNVDGSA 589 E ++ + S + +D A N T +P+ V W ++LNVDGS Sbjct: 524 E------LIPQFSLKMRIVDYALLLKNCHFNHQVTTLPKIVRWNALGGTS-MILNVDGST 576 Query: 588 IQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYS 409 I + G SGFGGL+R+ G ++ GF+G+ +++L AE++ ++ GL L WE +D++CYS Sbjct: 577 IGNPGISGFGGLIRNADGAWIHGFFGNLGVTNILHAELMAILKGLLLAWELNIKDLLCYS 636 Query: 408 NSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTD 229 +S A+K I E V H A ++ IK + +DW + HT REGN CAD+LAK GA + Sbjct: 637 DSATAIKLITEPVDVWHHYAAILNNIKDILNRDWQVSIFHTFREGNACADYLAKHGAHNN 696 Query: 228 EPLLVIQQPPPAMGPLLLADASGVSFTR 145 I PP + LLAD SG+ F+R Sbjct: 697 IVFTTIAIPPAGLNLHLLADVSGIIFSR 724