BLASTX nr result

ID: Glycyrrhiza36_contig00036160 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza36_contig00036160
         (994 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterran...   244   1e-69
GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterran...   204   2e-60
GAU47359.1 hypothetical protein TSUD_403620 [Trifolium subterran...   201   6e-59
GAU49781.1 hypothetical protein TSUD_188300 [Trifolium subterran...   194   2e-57
GAU48983.1 hypothetical protein TSUD_245740 [Trifolium subterran...   205   8e-56
GAU36275.1 hypothetical protein TSUD_255290 [Trifolium subterran...   180   1e-51
KYP57109.1 Putative ribonuclease H protein At1g65750 family [Caj...   182   4e-51
KYP61721.1 Putative ribonuclease H protein At1g65750 family [Caj...   173   3e-49
ABN09044.1 Ribonuclease H [Medicago truncatula]                       173   4e-49
GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterran...   174   3e-47
KYP56001.1 Putative ribonuclease H protein At1g65750 family, par...   168   3e-45
GAU32945.1 hypothetical protein TSUD_153620 [Trifolium subterran...   165   3e-45
GAU39987.1 hypothetical protein TSUD_211080 [Trifolium subterran...   161   5e-45
GAU24479.1 hypothetical protein TSUD_319560 [Trifolium subterran...   160   2e-44
AFK48593.1 unknown [Lotus japonicus]                                  162   3e-44
KYP32780.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]       170   1e-43
GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterran...   161   1e-41
KYP78366.1 Putative ribonuclease H protein At1g65750 family [Caj...   160   3e-40
KYP64035.1 Putative ribonuclease H protein At1g65750 family [Caj...   148   4e-40
GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]   159   4e-40

>GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterraneum]
          Length = 1147

 Score =  244 bits (624), Expect = 1e-69
 Identities = 121/282 (42%), Positives = 172/282 (60%)
 Frame = -2

Query: 990  CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811
            CP C  + ET++H  F+C  A  +W    L HV P S++  D+  W RD   S+G I+ I
Sbjct: 867  CPRCTAMPETIVHCLFACTDAIGIWRACGLEHVLPPSTD-VDLFCWCRDVGKSHGCIIFI 925

Query: 810  ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGT 631
            I+W +W +RN  IF+N ++    +V +   +L+    A++   S        R V W   
Sbjct: 926  IMWFVWCSRNDAIFNNNKAIVHNLVAKVHYMLSFCTAAFENTTSGSGGNSEHRLVVWP-R 984

Query: 630  SSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLR 451
              +  V LNVDGS +  L  +GFGGL+R+  G FL GFYG+A QSSVL AE++ ++HGL 
Sbjct: 985  PDEGTVCLNVDGSMLGSLQTAGFGGLIRNSFGAFLKGFYGTASQSSVLYAEIMAILHGLH 1044

Query: 450  LCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGN 271
            LCW +GYR ++CYS+SL AV  I +GVSH H  ANEI  I + +++DW  ++ H LREGN
Sbjct: 1045 LCWNNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANEIYTIHQLLRRDWTIVIEHILREGN 1104

Query: 270  QCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
             CAD LAK G+ T+ P+++++ PPP     L ADA G+ F R
Sbjct: 1105 ACADILAKKGSSTNSPIVIVESPPPEPSNALSADARGIVFVR 1146


>GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterraneum]
          Length = 298

 Score =  204 bits (519), Expect = 2e-60
 Identities = 100/230 (43%), Positives = 147/230 (63%)
 Frame = -2

Query: 834 SNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMP 655
           ++G +  I+LW IW  RN+ +F+N+R S  +I+ +   LL+  +  +   +S +ATT  P
Sbjct: 69  NHGPLFFIVLWVIWCVRNEFVFNNQRESTHIIMGKIYSLLHSCEAVFTPPHSSMATTAKP 128

Query: 654 REVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEV 475
           R V W    ++  V LNVDGS ++    +G+GGL+RD  G FL GFYG+A   S+L AE+
Sbjct: 129 RLVTWT-KPAEGTVCLNVDGSLLKATNTAGYGGLIRDSNGVFLSGFYGTATVQSILFAEL 187

Query: 474 LGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLL 295
           + V+HGL++CWE G+R + C+S+SL  V  I +GVS  H  +NE+ II + + +DW+ ++
Sbjct: 188 MAVLHGLQICWESGFRRITCFSDSLQIVNLIRDGVSAHHRFSNEVFIIHQLLAKDWEVVI 247

Query: 294 VHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
            HT REGN CAD LAK+GA +D  L+ I  PP  +   LLADA  V F R
Sbjct: 248 GHTFREGNACADVLAKMGAASDSTLVTISTPPCDLSMPLLADAHVVVFIR 297


>GAU47359.1 hypothetical protein TSUD_403620 [Trifolium subterraneum]
          Length = 330

 Score =  201 bits (512), Expect = 6e-59
 Identities = 108/282 (38%), Positives = 148/282 (52%)
 Frame = -2

Query: 990 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811
           CP C   SE++ H  F+CN A  VW  + L HV P SS+  D   W +     +G I  I
Sbjct: 74  CPRCAIASESIEHCLFTCNDAASVWRAYGL-HVIPNSSHGVDNFTWYKKQGMKHGRIFFI 132

Query: 810 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGT 631
           I+W IW  RN+ IFDN R S    V +   L      A+    +    +  PR V W   
Sbjct: 133 IMWVIWCARNEFIFDNHRQSVVTSVIKIDSLQQACAAAFGSTQTIATQSSNPRLVTW-AR 191

Query: 630 SSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLR 451
             +  + LNVDGS +  L  +G+GGLLR+H G F+ GFYG+    S+L AE++ V+HGL 
Sbjct: 192 PMEGTICLNVDGSLLGSLNSAGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMAVLHGLT 251

Query: 450 LCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGN 271
           +CWE+GYR + C S+SL  +                         +DW+ +L HTLREG+
Sbjct: 252 ICWENGYRKINCLSDSLQLI------------------------TRDWEVVLSHTLREGS 287

Query: 270 QCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
            CAD LAK+GA+ + PL+    PP  +   L  D +GV FTR
Sbjct: 288 SCADVLAKMGAVANTPLVTTSTPPRTLAKPLFEDVNGVIFTR 329


>GAU49781.1 hypothetical protein TSUD_188300 [Trifolium subterraneum]
          Length = 221

 Score =  194 bits (492), Expect = 2e-57
 Identities = 97/221 (43%), Positives = 131/221 (59%)
 Frame = -2

Query: 807 LWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGTS 628
           +W IW  RN+ IFDN R S    V +   L      A+    +    +  PR V W    
Sbjct: 1   MWVIWCARNEFIFDNHRQSVVTSVIKIDSLQQACAAAFGSTQTIATQSSNPRLVTW-ARP 59

Query: 627 SDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRL 448
            +  + LNVDGS +  L  +G+GGLLR+H G F+ GFYG+    S+L AE++ V+HGL +
Sbjct: 60  MEGTICLNVDGSLLGSLNSAGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMAVLHGLTI 119

Query: 447 CWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQ 268
           CWE+GYR + C S+SL  V  I  GVS  H  ANEI  I++ + +DW+ +L HTLREGN 
Sbjct: 120 CWENGYRKINCLSDSLQVVNLIRSGVSPHHRFANEILSIRQLITRDWEVVLSHTLREGNL 179

Query: 267 CADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
           CAD LAK+GA+ + PL+    PP  +   L  DA+GV FTR
Sbjct: 180 CADVLAKMGAVANTPLVTTSTPPRTLPKPLFEDANGVIFTR 220


>GAU48983.1 hypothetical protein TSUD_245740 [Trifolium subterraneum]
          Length = 1103

 Score =  205 bits (522), Expect = 8e-56
 Identities = 105/246 (42%), Positives = 151/246 (61%), Gaps = 2/246 (0%)
 Frame = -2

Query: 876  NRSDIAKWTRDAI--HSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDID 703
            N S+    T+DA    S+G I+ II+W +W +RN  IF+N ++    +V +   +L+   
Sbjct: 858  NSSNGVYTTKDADVGKSHGCIIFIIMWFVWCSRNDAIFNNNKAIVHNLVAKVHSMLSFCI 917

Query: 702  KAYQEINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLF 523
             A++   S        R V W    ++  V LNV GS +  L  +GFGGL+R+    FL 
Sbjct: 918  AAFKNTTSGSGGNSEQRLVVWP-RPAEGTVCLNVHGSMLGSLQTAGFGGLIRNSFSAFLK 976

Query: 522  GFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANE 343
            GFYG+A QSSVL AE++ ++HGL LCW +GYR ++CYS+SL AV  I +GVSH H  ANE
Sbjct: 977  GFYGTASQSSVLYAEIMAILHGLHLCWNNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANE 1036

Query: 342  ISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADAS 163
            I  I++ +++DW  ++ H LREGN CAD LAK G+ T+ P++++  PPP +   L  DA 
Sbjct: 1037 IHPIRQLLRRDWTIVIEHILREGNACADVLAKKGSSTNSPIVIVDSPPPELSNALSVDAR 1096

Query: 162  GVSFTR 145
            GV F R
Sbjct: 1097 GVVFVR 1102


>GAU36275.1 hypothetical protein TSUD_255290 [Trifolium subterraneum]
          Length = 258

 Score =  180 bits (457), Expect = 1e-51
 Identities = 100/252 (39%), Positives = 142/252 (56%), Gaps = 2/252 (0%)
 Frame = -2

Query: 894 VAPLSSNRSDIAKWTRDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLL 715
           + P S    D   W +     +G I+ + LW +W  RN  IF+N + S    V ++  L+
Sbjct: 9   LVPSSVQGVDRLTWCKQLGKKHGNIIFVTLWMVWCVRNNFIFNNHQESTHTSVAKSHSLV 68

Query: 714 NDIDKAYQ--EINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDH 541
           N   KA+    + SPLA  +  R V W    +D+ V LNVDGS +     +G+ GLLR+ 
Sbjct: 69  NASAKAFSLPSVVSPLAGHQ--RSVRWF-RPADEFVCLNVDGSLLGSNNTAGYDGLLRNR 125

Query: 540 TGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHC 361
            G F++GFYG A   ++L AE++ + +GL+LCWE G+R V C S+ L +V    EGV+  
Sbjct: 126 DGEFIWGFYGVAAIQNILYAEIMAIWYGLKLCWERGFRKVFCCSDYLLSVDVTKEGVTTH 185

Query: 360 HPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPL 181
           H  ANEI  I+K +  DW+ +L HTLREGN CAD L KLG  +D  ++ I  P   +   
Sbjct: 186 HRFANEILCIRKLLANDWEVILTHTLREGNACADVLGKLGVNSDSSMVNIYAPSQDLVIP 245

Query: 180 LLADASGVSFTR 145
           L  DASG+ F R
Sbjct: 246 LHDDASGIEFIR 257


>KYP57109.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 365

 Score =  182 bits (462), Expect = 4e-51
 Identities = 99/285 (34%), Positives = 149/285 (52%), Gaps = 2/285 (0%)
 Frame = -2

Query: 993 SCPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVA 814
           +CP C    ET+LH    C     +W+   L     L ++   I +W +  +   G I+ 
Sbjct: 82  NCPMCNAQQETLLHCLLECPRIGALWNSLGLCQ-PHLPTDSEKIKEWLKCWVEEQGSIIP 140

Query: 813 IILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAW 640
           ++LW IW +RN +IF  +      +    S   + I +AY  +     +   R  R V W
Sbjct: 141 VLLWVIWRSRNNMIFKGKLDKVADLKVWVSTWCSAIIRAYGGEPATGSIWQQRSTRLVRW 200

Query: 639 MGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMH 460
                D  V +NVDGSA+ + G  G GGL+RD+TG F+ GFYGS   S+ + AE++ +  
Sbjct: 201 TAKEGDW-VTINVDGSALTNPGAVGVGGLVRDNTGLFMVGFYGSIGISNNIHAELVAMWR 259

Query: 459 GLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLR 280
           GL LCWE GY  V C S+ L  V+ + +  SH H  A  +  IK+ + + W C ++H LR
Sbjct: 260 GLTLCWERGYSHVCCQSDCLYVVQLLQQESSHYHRYAVLLDKIKELISRHWTCQVIHILR 319

Query: 279 EGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
           EGN CAD  A+ GA++ E L+++++ P  M  LL  D +G    R
Sbjct: 320 EGNFCADFFARKGAVSSEGLVILEEAPVEMEELLRKDITGTCVLR 364


>KYP61721.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 219

 Score =  173 bits (438), Expect = 3e-49
 Identities = 89/221 (40%), Positives = 133/221 (60%)
 Frame = -2

Query: 807 LWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGTS 628
           +W IW  RN+ IFD    +   I+ +A+ LL     A+  I+   +   +PR V W+   
Sbjct: 1   MWFIWCHRNRHIFDQVDWNLTSILAQANALLQFSVSAFTSIDC--SHRPLPRLVHWIHPL 58

Query: 627 SDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRL 448
            D  V LNVDGS I   G  G+GGL ++H G FLFGFYG   ++SVL+ E+L ++HGL L
Sbjct: 59  VDS-VALNVDGSRIGTPGRGGYGGLCQNHEGQFLFGFYGFLGEASVLQTEILALLHGLHL 117

Query: 447 CWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQ 268
           CW+ G+R ++CYS+S   V  +   +   H   N++  I + +  DW C +VHTL EGN 
Sbjct: 118 CWDKGFRKIVCYSDSTLVVSLLQGPILMFHRYGNQLMEIHQLLNCDWTCTVVHTLCEGNS 177

Query: 267 CADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
           CAD LA++GA+ ++ ++++Q+ P  +  LLLAD+ G  F R
Sbjct: 178 CADALARMGALGNDRVVILQEHPMTLSSLLLADSLGTVFQR 218


>ABN09044.1 Ribonuclease H [Medicago truncatula]
          Length = 235

 Score =  173 bits (438), Expect = 4e-49
 Identities = 95/235 (40%), Positives = 138/235 (58%)
 Frame = -2

Query: 849 RDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLA 670
           +D I + G I  II+W IW +RNK IF++ + S Q I  +    L+ I KA+    S  +
Sbjct: 2   KDFISNIGPIGPIIIWKIWCSRNKCIFEDIKHSIQEIGAQVLSSLHHILKAFAHPTSH-S 60

Query: 669 TTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSV 490
             +  R V+W   S +  V LNVDG+         FGGL+RDHT +FL GF+G   +  +
Sbjct: 61  VQQPARIVSWQRPSMNS-VALNVDGNVFLDSNLGSFGGLIRDHTSSFLHGFFGKNSRPCI 119

Query: 489 LRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQD 310
           L  E+ G+ HGL+LCW+ G + V+C+S+S   V  + + ++  H   N I  IKK +++D
Sbjct: 120 LHVEISGLYHGLKLCWDIGIKHVVCHSDSTTVVDLVQKDLNVHHKYGNLIMAIKKLLRRD 179

Query: 309 WDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
           W   L HTL EGN  AD LAK GA++D  L+++ + PP +  +LLADA GV F R
Sbjct: 180 WVVSLRHTLCEGNAAADFLAKKGALSDTSLVILNEAPPDIAFVLLADAVGVKFVR 234


>GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterraneum]
          Length = 440

 Score =  174 bits (441), Expect = 3e-47
 Identities = 108/287 (37%), Positives = 149/287 (51%), Gaps = 5/287 (1%)
 Frame = -2

Query: 990 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811
           CP      ET++H    C     +W     T       +  ++  W R    S  + + +
Sbjct: 165 CPRSDIAEETIMHCLRDCEFVKHLWKTIGFTDQTFFHGD--NLYAWLRKGCDSPSMFMFL 222

Query: 810 I-LWTIWLTRNKLIFDNERSSPQLIVYRASRLLND----IDKAYQEINSPLATTRMPREV 646
             LW IW  RNKL   NE  SP    +  SR + D    + K Y +  S LA  R+ R  
Sbjct: 223 AALWWIWRARNKLCLANELVSP----FTISRCIEDYALLVKKCYSQQKSTLAN-RLVRWN 277

Query: 645 AWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGV 466
           A  GT     ++LNVDGS+I +    GFGGL+R+  G ++ GF G+   S++L AE+L V
Sbjct: 278 AHDGTD----MILNVDGSSIGNPEIYGFGGLIRNSHGAWIRGFAGNIGFSNILHAELLAV 333

Query: 465 MHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHT 286
            HGL L W+   +D+ICYS+S  A+K I + ++  H  A  +  IK  + +DW   + HT
Sbjct: 334 YHGLVLAWDMDIKDLICYSDSKTAIKLIGDPINEWHHFAAILQNIKDILARDWRVTVAHT 393

Query: 285 LREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
           LREGN CAD+LAK GA   +    +  PP  M  LLLADASG  FTR
Sbjct: 394 LREGNACADYLAKFGAQNIKVFSTMTTPPDGMNLLLLADASGTWFTR 440


>KYP56001.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 414

 Score =  168 bits (426), Expect = 3e-45
 Identities = 95/284 (33%), Positives = 142/284 (50%), Gaps = 2/284 (0%)
 Frame = -2

Query: 990 CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811
           C  C    ET++H F  C+   +VW                +   W    I   G +   
Sbjct: 139 CRQCDSQEETVMHCFRDCHEVQEVWKILQFVSCDTFYQI-DNFKMWVNHGIKLGGALFLS 197

Query: 810 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAWM 637
            +W IWL  N+L+F+  ++    +   A      +   +   E+NS L     P+ V W 
Sbjct: 198 TIWEIWLGWNRLVFEGSKTKAWQVALAAKSFSEAMTNVFLNHEVNSNL-----PKWVGW- 251

Query: 636 GTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHG 457
              S+  V+LN DGS ++    +GFGG+LR   G ++ GFYG+ D S ++  E+LG++ G
Sbjct: 252 SAPSENCVILNTDGSVMED--KAGFGGVLRSSDGVWIHGFYGNVDGSDIIGVELLGILQG 309

Query: 456 LRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLRE 277
           LR+    G   V C ++SL AVK+I  GVSH H  +N +  I K + +DW   + H LRE
Sbjct: 310 LRIAQRLGLSRVYCQTDSLVAVKWIQGGVSHMHHYSNLVQEIHKLLDKDWAVSISHVLRE 369

Query: 276 GNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
            N+CAD+ AKLG    + L    +PP  + P+L ADA+G  F R
Sbjct: 370 CNKCADYFAKLGLNCPDRLTNFMEPPLDVIPMLQADAAGERFLR 413


>GAU32945.1 hypothetical protein TSUD_153620 [Trifolium subterraneum]
          Length = 292

 Score =  165 bits (417), Expect = 3e-45
 Identities = 87/179 (48%), Positives = 117/179 (65%)
 Frame = -2

Query: 681 SPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSAD 502
           SPLA  +  R V W    +D  V LNVDGS +     +G+GGLLR+  G F++GFYG+A 
Sbjct: 116 SPLAGHQ--RRVRW-SRPADGFVCLNVDGSLLGSNNTAGYGGLLRNRDGEFIWGFYGAAA 172

Query: 501 QSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKH 322
             ++L AE++ + +GL+LCWE G+R V+C S+SL +V  I EGV+  H  ANEI  I+K 
Sbjct: 173 IQNILYAEIMAIWYGLKLCWERGFRKVLCCSDSLLSVNVIKEGVTTHHGFANEILCIRKL 232

Query: 321 MQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
           +  DW+ +L HTLREGN CAD LAKLGA +D P++ I  PP  +   L  DASG+ F R
Sbjct: 233 LSNDWEVILTHTLREGNACADVLAKLGANSDSPMVNISTPPRDLVIPLHHDASGIEFIR 291


>GAU39987.1 hypothetical protein TSUD_211080 [Trifolium subterraneum]
          Length = 192

 Score =  161 bits (407), Expect = 5e-45
 Identities = 82/187 (43%), Positives = 115/187 (61%)
 Frame = -2

Query: 720 LLNDIDKAYQEINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDH 541
           LL+  +  +   +S +ATT  PR V W     +  V LNVDGS +     + +GGL+RD 
Sbjct: 7   LLHFCEAMFTPPHSSVATTAKPRLVTWT-KPVEGTVCLNVDGSLLGATNTASYGGLIRDS 65

Query: 540 TGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHC 361
               L GFYG+    S+L AE++ V+HGL++CWE G+R + C+S+SL  V  I +GVS  
Sbjct: 66  NRVILSGFYGTTSVQSILFAELMAVLHGLQICWESGFRRITCFSDSLQTVNLIRDGVSTH 125

Query: 360 HPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPL 181
           H  +NE+ II + +  DW+ ++ HT REGN CAD LAK+GA +D PL+ I  PP  +   
Sbjct: 126 HRSSNEVFIIHQLLANDWEVVIDHTFREGNACADVLAKMGAASDSPLVKISTPPCDLSMP 185

Query: 180 LLADASG 160
           LLADA G
Sbjct: 186 LLADAQG 192


>GAU24479.1 hypothetical protein TSUD_319560 [Trifolium subterraneum]
          Length = 227

 Score =  160 bits (406), Expect = 2e-44
 Identities = 82/200 (41%), Positives = 122/200 (61%), Gaps = 4/200 (2%)
 Frame = -2

Query: 810 ILWTIWLTRNKLIFDNE----RSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVA 643
           ++W IW  RN +IF+++      + Q I+  + + L +  K  ++    +  +  PR V 
Sbjct: 19  VVWAIWRVRNDVIFNSKVPVIEEAFQGIISLSWKWLRE--KKKKKKKGAVTQSSNPRLVT 76

Query: 642 WMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVM 463
           W     +  + LNVDG+ +  L + G+GGLLR+H G F+ GFYG+    S+L AE++ V+
Sbjct: 77  W-ARPMEGTICLNVDGNLLGSLNYLGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMVVL 135

Query: 462 HGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTL 283
           HGL +CWE+GYR + C SNSL  V  I  GVS  H  ANEI  I++ + +DW+ +L HTL
Sbjct: 136 HGLTICWENGYRKINCLSNSLQLVNLIRSGVSLHHRFANEILSIRRLITRDWEVVLSHTL 195

Query: 282 REGNQCADHLAKLGAMTDEP 223
           REGN CAD LAK+G + + P
Sbjct: 196 REGNSCADVLAKMGVVANTP 215


>AFK48593.1 unknown [Lotus japonicus]
          Length = 272

 Score =  162 bits (409), Expect = 3e-44
 Identities = 96/243 (39%), Positives = 131/243 (53%), Gaps = 1/243 (0%)
 Frame = -2

Query: 870 SDIAKWTRDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQ 691
           +D   W R+ +  N  +V   LW +W  RN    + +    Q++      + +DI + Y 
Sbjct: 33  ADSKAWLREVLKENSPLVMSTLWWVWRLRNVWCMEGKLIPWQVLRGDILAMFDDIARCYA 92

Query: 690 -EINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFY 514
            ++++P+ T   PR V W    +D  VVLNVDGS        GFGG  R   G +L GF+
Sbjct: 93  VDVDAPMHT---PRLVRWTVGLADC-VVLNVDGSVHGTPQRGGFGGCFRTIHGNWLRGFF 148

Query: 513 GSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISI 334
           G  D+  +L  E+LG+ HGL L WE GYR V C S+S +AV  +    S CH  A  +  
Sbjct: 149 GYLDECCILHLELLGMFHGLSLAWEQGYRIVECQSDSQDAVTLVKSTPSSCHRYAALVWD 208

Query: 333 IKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVS 154
           IK    +DW   L HTLREGN CAD L K GA  ++ L++ + P   +G LLLADA GVS
Sbjct: 209 IKDLQSRDWIVELRHTLREGNACADLLVKHGADQNDDLVITENPIAGLGVLLLADARGVS 268

Query: 153 FTR 145
           F R
Sbjct: 269 FVR 271


>KYP32780.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]
          Length = 1123

 Score =  170 bits (431), Expect = 1e-43
 Identities = 97/284 (34%), Positives = 142/284 (50%), Gaps = 2/284 (0%)
 Frame = -2

Query: 990  CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811
            C  C    ET++H F  C+   +VW                +   W    I   G +   
Sbjct: 848  CRQCDSQEETVMHCFRDCHEVQEVWRILQFVSCDTFYQI-DNFKMWVNHGIKLGGALFLS 906

Query: 810  ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAWM 637
             +W IWL RN+L+F+  ++    +   A  L   +   +   E+NS L     P+ V W 
Sbjct: 907  TIWEIWLGRNRLVFEGSKTKAWQVALAAKSLSEAMTNVFLNHEVNSNL-----PKWVGW- 960

Query: 636  GTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHG 457
               S+  V+LN DGS ++    +GFGG+LR   G ++ GF G+ D   ++  E+LG++ G
Sbjct: 961  SAPSENCVILNTDGSVMED--KAGFGGVLRSSNGAWIHGFCGNVDGYEIIGVELLGILQG 1018

Query: 456  LRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLRE 277
            LR+    G   V C +NSL AVK+I  GVSH H  +N +  I K + +DW   + H LRE
Sbjct: 1019 LRIAQRLGLSRVYCQTNSLAAVKWIQGGVSHMHHYSNLVQEIHKLLDKDWAVSISHVLRE 1078

Query: 276  GNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
             N+CAD+ AKLG    + L    +PP  + PLL ADA+G  F R
Sbjct: 1079 CNKCADYFAKLGLNCPDRLTNFMEPPLDVIPLLQADAAGERFLR 1122


>GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterraneum]
          Length = 545

 Score =  161 bits (408), Expect = 1e-41
 Identities = 101/283 (35%), Positives = 143/283 (50%), Gaps = 1/283 (0%)
 Frame = -2

Query: 990  CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHS-NGIIVA 814
            CP C    E+ LH   +C    + W       +        ++  W R++I   +  +  
Sbjct: 270  CPRCNIEEESTLHCLRNCEFIKRFWKAIGF--LGQTFFQGDNLNDWLRNSIDGPSSFLFM 327

Query: 813  IILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMG 634
              +W IW  RN+L  DNE  S   +      L   +   + + N   +T  M R  A  G
Sbjct: 328  AAVWWIWCARNQLCMDNEAISYFTLRTNTENLAQLLRMCFIKQNIS-STATMVRWNAHGG 386

Query: 633  TSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGL 454
                  ++LNVDGS+I + G SGFGGL+R+  G ++ GF G+    ++L+AE+L + HGL
Sbjct: 387  IG----MILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHLNILQAELLAIYHGL 442

Query: 453  RLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREG 274
             L WE   +D+ CYS+S  A+K I + V+  H  A  I  IK  + ++W   LVH LREG
Sbjct: 443  VLAWELDIKDLCCYSDSKTALKLIYDHVNEWHQYAAIIYNIKDFLSRNWRVRLVHMLREG 502

Query: 273  NQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
            N CAD L K GA   +    I  PP  M  LLLADASG  F+R
Sbjct: 503  NNCADILDKFGARNPKAYCSIAVPPDGMSLLLLADASGTIFSR 545


>KYP78366.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 1090

 Score =  160 bits (406), Expect = 3e-40
 Identities = 95/284 (33%), Positives = 140/284 (49%), Gaps = 2/284 (0%)
 Frame = -2

Query: 990  CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 811
            C  C    ET++H F   +   +VW           S    +   W    I   G +   
Sbjct: 815  CRQCGSQEETVMHCFRDSHEVQEVWRILQFVSCDTFSQI-DNFKMWVIHGIKLGGALFLS 873

Query: 810  ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAWM 637
             +W IWL RN+L+F+  ++    +   A  L   +   +   E+N  L     P+ V W+
Sbjct: 874  TIWEIWLGRNRLVFEGSKTKAWQVALAAKSLSEAMTNVFLNHEVNRNL-----PKWVGWL 928

Query: 636  GTSSDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHG 457
              S +  V+LN DGS +     +GFGG+LR   G ++ GF G+ D S ++  E+LG++ G
Sbjct: 929  DPS-ENCVILNTDGSVMDD--KAGFGGVLRSSDGVWIHGFCGNMDGSEIIGVELLGILQG 985

Query: 456  LRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLRE 277
            LR+    G   V C ++SL AVK+I  G SH H  +N +  I K + +DW   + H LRE
Sbjct: 986  LRIAQILGLSRVYCQTDSLVAVKWIKGGESHMHHYSNLVQEIHKLLDKDWAVSISHVLRE 1045

Query: 276  GNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 145
             N+CAD+ AKLG    + L    +PP  + PLL ADA G  F R
Sbjct: 1046 CNKCADYFAKLGLRCPDRLTNFMEPPLDVIPLLQADADGERFLR 1089


>KYP64035.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 190

 Score =  148 bits (374), Expect = 4e-40
 Identities = 79/190 (41%), Positives = 109/190 (57%)
 Frame = -2

Query: 807 LWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGTS 628
           +W IW  RN+LIFD    +   I+ + + LL     A+  I+   +   +PR V W+   
Sbjct: 1   MWFIWCHRNRLIFDQVDWNLTSILAQVNALLQISVSAFTSIDC--SHRPLPRLVHWIHPP 58

Query: 627 SDQRVVLNVDGSAIQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRL 448
            D  V LNVDGS I  LG  GFGGL R+H G FLFGFYG   + SVL+  +L +++GLRL
Sbjct: 59  LDS-VALNVDGSRIGTLGRGGFGGLCRNHEGQFLFGFYGFLGEVSVLQTVILALLYGLRL 117

Query: 447 CWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQ 268
           CW+  +R +ICYS+S   V  +   +   H   N++  I + +  DW C +VHTLREGN 
Sbjct: 118 CWDKWFRKIICYSDSTLVVSLLQGPIPMFHRYENQLMEIHQLLNCDWTCTVVHTLREGNS 177

Query: 267 CADHLAKLGA 238
           CAD     G+
Sbjct: 178 CADAFGSNGS 187


>GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]
          Length = 724

 Score =  159 bits (402), Expect = 4e-40
 Identities = 93/268 (34%), Positives = 140/268 (52%), Gaps = 3/268 (1%)
 Frame = -2

Query: 939  CNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAII-LWTIWLTRNKLIFDN 763
            CN    +W     T          D + W R+ +  + + + +  +W IW TRN L  DN
Sbjct: 466  CNFVYTIWKSLGFTDRNFFQE--VDSSSWLRNGLSCSSMFLFMAAIWWIWRTRNALCLDN 523

Query: 762  ERSSPQLIVYRASRLLNDIDKAYQEINSPL--ATTRMPREVAWMGTSSDQRVVLNVDGSA 589
            E      ++ + S  +  +D A    N       T +P+ V W        ++LNVDGS 
Sbjct: 524  E------LIPQFSLKMRIVDYALLLKNCHFNHQVTTLPKIVRWNALGGTS-MILNVDGST 576

Query: 588  IQHLGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYS 409
            I + G SGFGGL+R+  G ++ GF+G+   +++L AE++ ++ GL L WE   +D++CYS
Sbjct: 577  IGNPGISGFGGLIRNADGAWIHGFFGNLGVTNILHAELMAILKGLLLAWELNIKDLLCYS 636

Query: 408  NSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTD 229
            +S  A+K I E V   H  A  ++ IK  + +DW   + HT REGN CAD+LAK GA  +
Sbjct: 637  DSATAIKLITEPVDVWHHYAAILNNIKDILNRDWQVSIFHTFREGNACADYLAKHGAHNN 696

Query: 228  EPLLVIQQPPPAMGPLLLADASGVSFTR 145
                 I  PP  +   LLAD SG+ F+R
Sbjct: 697  IVFTTIAIPPAGLNLHLLADVSGIIFSR 724


Top