BLASTX nr result

ID: Glycyrrhiza29_contig00041044 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza29_contig00041044
         (994 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterran...   242   1e-68
GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterran...   204   2e-60
GAU47359.1 hypothetical protein TSUD_403620 [Trifolium subterran...   199   7e-58
GAU49781.1 hypothetical protein TSUD_188300 [Trifolium subterran...   191   3e-56
GAU48983.1 hypothetical protein TSUD_245740 [Trifolium subterran...   202   7e-55
KYP57109.1 Putative ribonuclease H protein At1g65750 family [Caj...   186   1e-52
GAU36275.1 hypothetical protein TSUD_255290 [Trifolium subterran...   181   9e-52
KYP61721.1 Putative ribonuclease H protein At1g65750 family [Caj...   177   9e-51
ABN09044.1 Ribonuclease H [Medicago truncatula]                       173   3e-49
GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterran...   178   1e-48
AFK48593.1 unknown [Lotus japonicus]                                  166   8e-46
GAU32945.1 hypothetical protein TSUD_153620 [Trifolium subterran...   165   2e-45
KYP56001.1 Putative ribonuclease H protein At1g65750 family, par...   168   3e-45
GAU39987.1 hypothetical protein TSUD_211080 [Trifolium subterran...   161   5e-45
KYP32780.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]       170   1e-43
GAU24479.1 hypothetical protein TSUD_319560 [Trifolium subterran...   158   2e-43
GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterran...   165   4e-43
GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]   163   2e-41
GAU34195.1 hypothetical protein TSUD_162960 [Trifolium subterran...   150   5e-41
GAU16646.1 hypothetical protein TSUD_325960 [Trifolium subterran...   148   1e-40

>GAU24540.1 hypothetical protein TSUD_156530 [Trifolium subterraneum]
          Length = 1147

 Score =  242 bits (617), Expect = 1e-68
 Identities = 120/282 (42%), Positives = 171/282 (60%)
 Frame = +2

Query: 5    CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 184
            CP C  + ET++H  F+C  A  +W    L HV P S++  D+  W RD   S+G I+ I
Sbjct: 867  CPRCTAMPETIVHCLFACTDAIGIWRACGLEHVLPPSTD-VDLFCWCRDVGKSHGCIIFI 925

Query: 185  ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGT 364
            I+W +W +RN  IF+N ++    +V +   +L+    A++   S        R V W   
Sbjct: 926  IMWFVWCSRNDAIFNNNKAIVHNLVAKVHYMLSFCTAAFENTTSGSGGNSEHRLVVWP-R 984

Query: 365  SSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLR 544
              +  V LNVDGS +     +GFGGL+R+  G FL GFYG+A QSSVL AE++ ++HGL 
Sbjct: 985  PDEGTVCLNVDGSMLGSLQTAGFGGLIRNSFGAFLKGFYGTASQSSVLYAEIMAILHGLH 1044

Query: 545  LCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGN 724
            LCW +GYR ++CYS+SL AV  I +GVSH H  ANEI  I + +++DW  ++ H LREGN
Sbjct: 1045 LCWNNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANEIYTIHQLLRRDWTIVIEHILREGN 1104

Query: 725  QCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
             CAD LAK G+ T+ P+++++ PPP     L ADA G+ F R
Sbjct: 1105 ACADILAKKGSSTNSPIVIVESPPPEPSNALSADARGIVFVR 1146


>GAU48830.1 hypothetical protein TSUD_190600 [Trifolium subterraneum]
          Length = 298

 Score =  204 bits (519), Expect = 2e-60
 Identities = 100/230 (43%), Positives = 147/230 (63%)
 Frame = +2

Query: 161 SNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMP 340
           ++G +  I+LW IW  RN+ +F+N+R S  +I+ +   LL+  +  +   +S +ATT  P
Sbjct: 69  NHGPLFFIVLWVIWCVRNEFVFNNQRESTHIIMGKIYSLLHSCEAVFTPPHSSMATTAKP 128

Query: 341 REVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEV 520
           R V W    ++  V LNVDGS ++    +G+GGL+RD  G FL GFYG+A   S+L AE+
Sbjct: 129 RLVTWT-KPAEGTVCLNVDGSLLKATNTAGYGGLIRDSNGVFLSGFYGTATVQSILFAEL 187

Query: 521 LGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLL 700
           + V+HGL++CWE G+R + C+S+SL  V  I +GVS  H  +NE+ II + + +DW+ ++
Sbjct: 188 MAVLHGLQICWESGFRRITCFSDSLQIVNLIRDGVSAHHRFSNEVFIIHQLLAKDWEVVI 247

Query: 701 VHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
            HT REGN CAD LAK+GA +D  L+ I  PP  +   LLADA  V F R
Sbjct: 248 GHTFREGNACADVLAKMGAASDSTLVTISTPPCDLSMPLLADAHVVVFIR 297


>GAU47359.1 hypothetical protein TSUD_403620 [Trifolium subterraneum]
          Length = 330

 Score =  199 bits (505), Expect = 7e-58
 Identities = 107/282 (37%), Positives = 147/282 (52%)
 Frame = +2

Query: 5   CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 184
           CP C   SE++ H  F+CN A  VW  + L HV P SS+  D   W +     +G I  I
Sbjct: 74  CPRCAIASESIEHCLFTCNDAASVWRAYGL-HVIPNSSHGVDNFTWYKKQGMKHGRIFFI 132

Query: 185 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGT 364
           I+W IW  RN+ IFDN R S    V +   L      A+    +    +  PR V W   
Sbjct: 133 IMWVIWCARNEFIFDNHRQSVVTSVIKIDSLQQACAAAFGSTQTIATQSSNPRLVTW-AR 191

Query: 365 SSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLR 544
             +  + LNVDGS +     +G+GGLLR+H G F+ GFYG+    S+L AE++ V+HGL 
Sbjct: 192 PMEGTICLNVDGSLLGSLNSAGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMAVLHGLT 251

Query: 545 LCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGN 724
           +CWE+GYR + C S+SL  +                         +DW+ +L HTLREG+
Sbjct: 252 ICWENGYRKINCLSDSLQLI------------------------TRDWEVVLSHTLREGS 287

Query: 725 QCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
            CAD LAK+GA+ + PL+    PP  +   L  D +GV FTR
Sbjct: 288 SCADVLAKMGAVANTPLVTTSTPPRTLAKPLFEDVNGVIFTR 329


>GAU49781.1 hypothetical protein TSUD_188300 [Trifolium subterraneum]
          Length = 221

 Score =  191 bits (485), Expect = 3e-56
 Identities = 96/221 (43%), Positives = 130/221 (58%)
 Frame = +2

Query: 188 LWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGTS 367
           +W IW  RN+ IFDN R S    V +   L      A+    +    +  PR V W    
Sbjct: 1   MWVIWCARNEFIFDNHRQSVVTSVIKIDSLQQACAAAFGSTQTIATQSSNPRLVTW-ARP 59

Query: 368 SDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRL 547
            +  + LNVDGS +     +G+GGLLR+H G F+ GFYG+    S+L AE++ V+HGL +
Sbjct: 60  MEGTICLNVDGSLLGSLNSAGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMAVLHGLTI 119

Query: 548 CWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQ 727
           CWE+GYR + C S+SL  V  I  GVS  H  ANEI  I++ + +DW+ +L HTLREGN 
Sbjct: 120 CWENGYRKINCLSDSLQVVNLIRSGVSPHHRFANEILSIRQLITRDWEVVLSHTLREGNL 179

Query: 728 CADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
           CAD LAK+GA+ + PL+    PP  +   L  DA+GV FTR
Sbjct: 180 CADVLAKMGAVANTPLVTTSTPPRTLPKPLFEDANGVIFTR 220


>GAU48983.1 hypothetical protein TSUD_245740 [Trifolium subterraneum]
          Length = 1103

 Score =  202 bits (515), Expect = 7e-55
 Identities = 104/246 (42%), Positives = 150/246 (60%), Gaps = 2/246 (0%)
 Frame = +2

Query: 119  NRSDIAKWTRDAI--HSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDID 292
            N S+    T+DA    S+G I+ II+W +W +RN  IF+N ++    +V +   +L+   
Sbjct: 858  NSSNGVYTTKDADVGKSHGCIIFIIMWFVWCSRNDAIFNNNKAIVHNLVAKVHSMLSFCI 917

Query: 293  KAYQEINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLF 472
             A++   S        R V W    ++  V LNV GS +     +GFGGL+R+    FL 
Sbjct: 918  AAFKNTTSGSGGNSEQRLVVWP-RPAEGTVCLNVHGSMLGSLQTAGFGGLIRNSFSAFLK 976

Query: 473  GFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANE 652
            GFYG+A QSSVL AE++ ++HGL LCW +GYR ++CYS+SL AV  I +GVSH H  ANE
Sbjct: 977  GFYGTASQSSVLYAEIMAILHGLHLCWNNGYRSIVCYSDSLQAVSLIKDGVSHFHTFANE 1036

Query: 653  ISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADAS 832
            I  I++ +++DW  ++ H LREGN CAD LAK G+ T+ P++++  PPP +   L  DA 
Sbjct: 1037 IHPIRQLLRRDWTIVIEHILREGNACADVLAKKGSSTNSPIVIVDSPPPELSNALSVDAR 1096

Query: 833  GVSFTR 850
            GV F R
Sbjct: 1097 GVVFVR 1102


>KYP57109.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 365

 Score =  186 bits (472), Expect = 1e-52
 Identities = 100/285 (35%), Positives = 150/285 (52%), Gaps = 2/285 (0%)
 Frame = +2

Query: 2   SCPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVA 181
           +CP C    ET+LH    C     +W+   L     L ++   I +W +  +   G I+ 
Sbjct: 82  NCPMCNAQQETLLHCLLECPRIGALWNSLGLCQ-PHLPTDSEKIKEWLKCWVEEQGSIIP 140

Query: 182 IILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAW 355
           ++LW IW +RN +IF  +      +    S   + I +AY  +     +   R  R V W
Sbjct: 141 VLLWVIWRSRNNMIFKGKLDKVADLKVWVSTWCSAIIRAYGGEPATGSIWQQRSTRLVRW 200

Query: 356 MGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMH 535
                D  V +NVDGSA+ +PG  G GGL+RD+TG F+ GFYGS   S+ + AE++ +  
Sbjct: 201 TAKEGDW-VTINVDGSALTNPGAVGVGGLVRDNTGLFMVGFYGSIGISNNIHAELVAMWR 259

Query: 536 GLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLR 715
           GL LCWE GY  V C S+ L  V+ + +  SH H  A  +  IK+ + + W C ++H LR
Sbjct: 260 GLTLCWERGYSHVCCQSDCLYVVQLLQQESSHYHRYAVLLDKIKELISRHWTCQVIHILR 319

Query: 716 EGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
           EGN CAD  A+ GA++ E L+++++ P  M  LL  D +G    R
Sbjct: 320 EGNFCADFFARKGAVSSEGLVILEEAPVEMEELLRKDITGTCVLR 364


>GAU36275.1 hypothetical protein TSUD_255290 [Trifolium subterraneum]
          Length = 258

 Score =  181 bits (458), Expect = 9e-52
 Identities = 100/252 (39%), Positives = 142/252 (56%), Gaps = 2/252 (0%)
 Frame = +2

Query: 101 VAPLSSNRSDIAKWTRDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLL 280
           + P S    D   W +     +G I+ + LW +W  RN  IF+N + S    V ++  L+
Sbjct: 9   LVPSSVQGVDRLTWCKQLGKKHGNIIFVTLWMVWCVRNNFIFNNHQESTHTSVAKSHSLV 68

Query: 281 NDIDKAYQ--EINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDH 454
           N   KA+    + SPLA  +  R V W    +D+ V LNVDGS +     +G+ GLLR+ 
Sbjct: 69  NASAKAFSLPSVVSPLAGHQ--RSVRWF-RPADEFVCLNVDGSLLGSNNTAGYDGLLRNR 125

Query: 455 TGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHC 634
            G F++GFYG A   ++L AE++ + +GL+LCWE G+R V C S+ L +V    EGV+  
Sbjct: 126 DGEFIWGFYGVAAIQNILYAEIMAIWYGLKLCWERGFRKVFCCSDYLLSVDVTKEGVTTH 185

Query: 635 HPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPL 814
           H  ANEI  I+K +  DW+ +L HTLREGN CAD L KLG  +D  ++ I  P   +   
Sbjct: 186 HRFANEILCIRKLLANDWEVILTHTLREGNACADVLGKLGVNSDSSMVNIYAPSQDLVIP 245

Query: 815 LLADASGVSFTR 850
           L  DASG+ F R
Sbjct: 246 LHDDASGIEFIR 257


>KYP61721.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 219

 Score =  177 bits (448), Expect = 9e-51
 Identities = 90/221 (40%), Positives = 134/221 (60%)
 Frame = +2

Query: 188 LWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMGTS 367
           +W IW  RN+ IFD    +   I+ +A+ LL     A+  I+   +   +PR V W+   
Sbjct: 1   MWFIWCHRNRHIFDQVDWNLTSILAQANALLQFSVSAFTSIDC--SHRPLPRLVHWIHPL 58

Query: 368 SDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRL 547
            D  V LNVDGS I  PG  G+GGL ++H G FLFGFYG   ++SVL+ E+L ++HGL L
Sbjct: 59  VDS-VALNVDGSRIGTPGRGGYGGLCQNHEGQFLFGFYGFLGEASVLQTEILALLHGLHL 117

Query: 548 CWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQ 727
           CW+ G+R ++CYS+S   V  +   +   H   N++  I + +  DW C +VHTL EGN 
Sbjct: 118 CWDKGFRKIVCYSDSTLVVSLLQGPILMFHRYGNQLMEIHQLLNCDWTCTVVHTLCEGNS 177

Query: 728 CADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
           CAD LA++GA+ ++ ++++Q+ P  +  LLLAD+ G  F R
Sbjct: 178 CADALARMGALGNDRVVILQEHPMTLSSLLLADSLGTVFQR 218


>ABN09044.1 Ribonuclease H [Medicago truncatula]
          Length = 235

 Score =  173 bits (439), Expect = 3e-49
 Identities = 95/235 (40%), Positives = 138/235 (58%)
 Frame = +2

Query: 146 RDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLA 325
           +D I + G I  II+W IW +RNK IF++ + S Q I  +    L+ I KA+    S  +
Sbjct: 2   KDFISNIGPIGPIIIWKIWCSRNKCIFEDIKHSIQEIGAQVLSSLHHILKAFAHPTSH-S 60

Query: 326 TTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSV 505
             +  R V+W   S +  V LNVDG+         FGGL+RDHT +FL GF+G   +  +
Sbjct: 61  VQQPARIVSWQRPSMNS-VALNVDGNVFLDSNLGSFGGLIRDHTSSFLHGFFGKNSRPCI 119

Query: 506 LRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQD 685
           L  E+ G+ HGL+LCW+ G + V+C+S+S   V  + + ++  H   N I  IKK +++D
Sbjct: 120 LHVEISGLYHGLKLCWDIGIKHVVCHSDSTTVVDLVQKDLNVHHKYGNLIMAIKKLLRRD 179

Query: 686 WDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
           W   L HTL EGN  AD LAK GA++D  L+++ + PP +  +LLADA GV F R
Sbjct: 180 WVVSLRHTLCEGNAAADFLAKKGALSDTSLVILNEAPPDIAFVLLADAVGVKFVR 234


>GAU17063.1 hypothetical protein TSUD_105620 [Trifolium subterraneum]
          Length = 440

 Score =  178 bits (451), Expect = 1e-48
 Identities = 109/287 (37%), Positives = 150/287 (52%), Gaps = 5/287 (1%)
 Frame = +2

Query: 5   CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 184
           CP      ET++H    C     +W     T       +  ++  W R    S  + + +
Sbjct: 165 CPRSDIAEETIMHCLRDCEFVKHLWKTIGFTDQTFFHGD--NLYAWLRKGCDSPSMFMFL 222

Query: 185 I-LWTIWLTRNKLIFDNERSSPQLIVYRASRLLND----IDKAYQEINSPLATTRMPREV 349
             LW IW  RNKL   NE  SP    +  SR + D    + K Y +  S LA  R+ R  
Sbjct: 223 AALWWIWRARNKLCLANELVSP----FTISRCIEDYALLVKKCYSQQKSTLAN-RLVRWN 277

Query: 350 AWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGV 529
           A  GT     ++LNVDGS+I +P   GFGGL+R+  G ++ GF G+   S++L AE+L V
Sbjct: 278 AHDGTD----MILNVDGSSIGNPEIYGFGGLIRNSHGAWIRGFAGNIGFSNILHAELLAV 333

Query: 530 MHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHT 709
            HGL L W+   +D+ICYS+S  A+K I + ++  H  A  +  IK  + +DW   + HT
Sbjct: 334 YHGLVLAWDMDIKDLICYSDSKTAIKLIGDPINEWHHFAAILQNIKDILARDWRVTVAHT 393

Query: 710 LREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
           LREGN CAD+LAK GA   +    +  PP  M  LLLADASG  FTR
Sbjct: 394 LREGNACADYLAKFGAQNIKVFSTMTTPPDGMNLLLLADASGTWFTR 440


>AFK48593.1 unknown [Lotus japonicus]
          Length = 272

 Score =  166 bits (419), Expect = 8e-46
 Identities = 97/243 (39%), Positives = 132/243 (54%), Gaps = 1/243 (0%)
 Frame = +2

Query: 125 SDIAKWTRDAIHSNGIIVAIILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQ 304
           +D   W R+ +  N  +V   LW +W  RN    + +    Q++      + +DI + Y 
Sbjct: 33  ADSKAWLREVLKENSPLVMSTLWWVWRLRNVWCMEGKLIPWQVLRGDILAMFDDIARCYA 92

Query: 305 -EINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFY 481
            ++++P+ T   PR V W    +D  VVLNVDGS    P   GFGG  R   G +L GF+
Sbjct: 93  VDVDAPMHT---PRLVRWTVGLADC-VVLNVDGSVHGTPQRGGFGGCFRTIHGNWLRGFF 148

Query: 482 GSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISI 661
           G  D+  +L  E+LG+ HGL L WE GYR V C S+S +AV  +    S CH  A  +  
Sbjct: 149 GYLDECCILHLELLGMFHGLSLAWEQGYRIVECQSDSQDAVTLVKSTPSSCHRYAALVWD 208

Query: 662 IKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVS 841
           IK    +DW   L HTLREGN CAD L K GA  ++ L++ + P   +G LLLADA GVS
Sbjct: 209 IKDLQSRDWIVELRHTLREGNACADLLVKHGADQNDDLVITENPIAGLGVLLLADARGVS 268

Query: 842 FTR 850
           F R
Sbjct: 269 FVR 271


>GAU32945.1 hypothetical protein TSUD_153620 [Trifolium subterraneum]
          Length = 292

 Score =  165 bits (418), Expect = 2e-45
 Identities = 87/179 (48%), Positives = 117/179 (65%)
 Frame = +2

Query: 314 SPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSAD 493
           SPLA  +  R V W    +D  V LNVDGS +     +G+GGLLR+  G F++GFYG+A 
Sbjct: 116 SPLAGHQ--RRVRW-SRPADGFVCLNVDGSLLGSNNTAGYGGLLRNRDGEFIWGFYGAAA 172

Query: 494 QSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKH 673
             ++L AE++ + +GL+LCWE G+R V+C S+SL +V  I EGV+  H  ANEI  I+K 
Sbjct: 173 IQNILYAEIMAIWYGLKLCWERGFRKVLCCSDSLLSVNVIKEGVTTHHGFANEILCIRKL 232

Query: 674 MQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
           +  DW+ +L HTLREGN CAD LAKLGA +D P++ I  PP  +   L  DASG+ F R
Sbjct: 233 LSNDWEVILTHTLREGNACADVLAKLGANSDSPMVNISTPPRDLVIPLHHDASGIEFIR 291


>KYP56001.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 414

 Score =  168 bits (426), Expect = 3e-45
 Identities = 95/284 (33%), Positives = 142/284 (50%), Gaps = 2/284 (0%)
 Frame = +2

Query: 5   CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 184
           C  C    ET++H F  C+   +VW                +   W    I   G +   
Sbjct: 139 CRQCDSQEETVMHCFRDCHEVQEVWKILQFVSCDTFYQI-DNFKMWVNHGIKLGGALFLS 197

Query: 185 ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAWM 358
            +W IWL  N+L+F+  ++    +   A      +   +   E+NS L     P+ V W 
Sbjct: 198 TIWEIWLGWNRLVFEGSKTKAWQVALAAKSFSEAMTNVFLNHEVNSNL-----PKWVGW- 251

Query: 359 GTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHG 538
              S+  V+LN DGS ++    +GFGG+LR   G ++ GFYG+ D S ++  E+LG++ G
Sbjct: 252 SAPSENCVILNTDGSVMEDK--AGFGGVLRSSDGVWIHGFYGNVDGSDIIGVELLGILQG 309

Query: 539 LRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLRE 718
           LR+    G   V C ++SL AVK+I  GVSH H  +N +  I K + +DW   + H LRE
Sbjct: 310 LRIAQRLGLSRVYCQTDSLVAVKWIQGGVSHMHHYSNLVQEIHKLLDKDWAVSISHVLRE 369

Query: 719 GNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
            N+CAD+ AKLG    + L    +PP  + P+L ADA+G  F R
Sbjct: 370 CNKCADYFAKLGLNCPDRLTNFMEPPLDVIPMLQADAAGERFLR 413


>GAU39987.1 hypothetical protein TSUD_211080 [Trifolium subterraneum]
          Length = 192

 Score =  161 bits (407), Expect = 5e-45
 Identities = 82/187 (43%), Positives = 115/187 (61%)
 Frame = +2

Query: 275 LLNDIDKAYQEINSPLATTRMPREVAWMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDH 454
           LL+  +  +   +S +ATT  PR V W     +  V LNVDGS +     + +GGL+RD 
Sbjct: 7   LLHFCEAMFTPPHSSVATTAKPRLVTWT-KPVEGTVCLNVDGSLLGATNTASYGGLIRDS 65

Query: 455 TGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHC 634
               L GFYG+    S+L AE++ V+HGL++CWE G+R + C+S+SL  V  I +GVS  
Sbjct: 66  NRVILSGFYGTTSVQSILFAELMAVLHGLQICWESGFRRITCFSDSLQTVNLIRDGVSTH 125

Query: 635 HPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTDEPLLVIQQPPPAMGPL 814
           H  +NE+ II + +  DW+ ++ HT REGN CAD LAK+GA +D PL+ I  PP  +   
Sbjct: 126 HRSSNEVFIIHQLLANDWEVVIDHTFREGNACADVLAKMGAASDSPLVKISTPPCDLSMP 185

Query: 815 LLADASG 835
           LLADA G
Sbjct: 186 LLADAQG 192


>KYP32780.1 LINE-1 reverse transcriptase isogeny [Cajanus cajan]
          Length = 1123

 Score =  170 bits (431), Expect = 1e-43
 Identities = 97/284 (34%), Positives = 142/284 (50%), Gaps = 2/284 (0%)
 Frame = +2

Query: 5    CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAI 184
            C  C    ET++H F  C+   +VW                +   W    I   G +   
Sbjct: 848  CRQCDSQEETVMHCFRDCHEVQEVWRILQFVSCDTFYQI-DNFKMWVNHGIKLGGALFLS 906

Query: 185  ILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAY--QEINSPLATTRMPREVAWM 358
             +W IWL RN+L+F+  ++    +   A  L   +   +   E+NS L     P+ V W 
Sbjct: 907  TIWEIWLGRNRLVFEGSKTKAWQVALAAKSLSEAMTNVFLNHEVNSNL-----PKWVGW- 960

Query: 359  GTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHG 538
               S+  V+LN DGS ++    +GFGG+LR   G ++ GF G+ D   ++  E+LG++ G
Sbjct: 961  SAPSENCVILNTDGSVMEDK--AGFGGVLRSSNGAWIHGFCGNVDGYEIIGVELLGILQG 1018

Query: 539  LRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLRE 718
            LR+    G   V C +NSL AVK+I  GVSH H  +N +  I K + +DW   + H LRE
Sbjct: 1019 LRIAQRLGLSRVYCQTNSLAAVKWIQGGVSHMHHYSNLVQEIHKLLDKDWAVSISHVLRE 1078

Query: 719  GNQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
             N+CAD+ AKLG    + L    +PP  + PLL ADA+G  F R
Sbjct: 1079 CNKCADYFAKLGLNCPDRLTNFMEPPLDVIPLLQADAAGERFLR 1122


>GAU24479.1 hypothetical protein TSUD_319560 [Trifolium subterraneum]
          Length = 227

 Score =  158 bits (399), Expect = 2e-43
 Identities = 81/200 (40%), Positives = 121/200 (60%), Gaps = 4/200 (2%)
 Frame = +2

Query: 185 ILWTIWLTRNKLIFDNE----RSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVA 352
           ++W IW  RN +IF+++      + Q I+  + + L +  K  ++    +  +  PR V 
Sbjct: 19  VVWAIWRVRNDVIFNSKVPVIEEAFQGIISLSWKWLRE--KKKKKKKGAVTQSSNPRLVT 76

Query: 353 WMGTSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVM 532
           W     +  + LNVDG+ +    + G+GGLLR+H G F+ GFYG+    S+L AE++ V+
Sbjct: 77  W-ARPMEGTICLNVDGNLLGSLNYLGYGGLLRNHNGEFILGFYGTTSLKSILFAEIMVVL 135

Query: 533 HGLRLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTL 712
           HGL +CWE+GYR + C SNSL  V  I  GVS  H  ANEI  I++ + +DW+ +L HTL
Sbjct: 136 HGLTICWENGYRKINCLSNSLQLVNLIRSGVSLHHRFANEILSIRRLITRDWEVVLSHTL 195

Query: 713 REGNQCADHLAKLGAMTDEP 772
           REGN CAD LAK+G + + P
Sbjct: 196 REGNSCADVLAKMGVVANTP 215


>GAU50297.1 hypothetical protein TSUD_288310 [Trifolium subterraneum]
          Length = 545

 Score =  165 bits (418), Expect = 4e-43
 Identities = 102/283 (36%), Positives = 144/283 (50%), Gaps = 1/283 (0%)
 Frame = +2

Query: 5    CPYCIDVSETMLHSFFSCNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHS-NGIIVA 181
            CP C    E+ LH   +C    + W       +        ++  W R++I   +  +  
Sbjct: 270  CPRCNIEEESTLHCLRNCEFIKRFWKAIGF--LGQTFFQGDNLNDWLRNSIDGPSSFLFM 327

Query: 182  IILWTIWLTRNKLIFDNERSSPQLIVYRASRLLNDIDKAYQEINSPLATTRMPREVAWMG 361
              +W IW  RN+L  DNE  S   +      L   +   + + N   +T  M R  A  G
Sbjct: 328  AAVWWIWCARNQLCMDNEAISYFTLRTNTENLAQLLRMCFIKQNIS-STATMVRWNAHGG 386

Query: 362  TSSDQRVVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGL 541
                  ++LNVDGS+I +PG SGFGGL+R+  G ++ GF G+    ++L+AE+L + HGL
Sbjct: 387  IG----MILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHLNILQAELLAIYHGL 442

Query: 542  RLCWEHGYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREG 721
             L WE   +D+ CYS+S  A+K I + V+  H  A  I  IK  + ++W   LVH LREG
Sbjct: 443  VLAWELDIKDLCCYSDSKTALKLIYDHVNEWHQYAAIIYNIKDFLSRNWRVRLVHMLREG 502

Query: 722  NQCADHLAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
            N CAD L K GA   +    I  PP  M  LLLADASG  F+R
Sbjct: 503  NNCADILDKFGARNPKAYCSIAVPPDGMSLLLLADASGTIFSR 545


>GAU35042.1 hypothetical protein TSUD_30080 [Trifolium subterraneum]
          Length = 724

 Score =  163 bits (412), Expect = 2e-41
 Identities = 94/268 (35%), Positives = 141/268 (52%), Gaps = 3/268 (1%)
 Frame = +2

Query: 56   CNHATQVWDFFNLTHVAPLSSNRSDIAKWTRDAIHSNGIIVAII-LWTIWLTRNKLIFDN 232
            CN    +W     T          D + W R+ +  + + + +  +W IW TRN L  DN
Sbjct: 466  CNFVYTIWKSLGFTDRNFFQE--VDSSSWLRNGLSCSSMFLFMAAIWWIWRTRNALCLDN 523

Query: 233  ERSSPQLIVYRASRLLNDIDKAYQEINSPL--ATTRMPREVAWMGTSSDQRVVLNVDGSA 406
            E      ++ + S  +  +D A    N       T +P+ V W        ++LNVDGS 
Sbjct: 524  E------LIPQFSLKMRIVDYALLLKNCHFNHQVTTLPKIVRWNALGGTS-MILNVDGST 576

Query: 407  IQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEHGYRDVICYS 586
            I +PG SGFGGL+R+  G ++ GF+G+   +++L AE++ ++ GL L WE   +D++CYS
Sbjct: 577  IGNPGISGFGGLIRNADGAWIHGFFGNLGVTNILHAELMAILKGLLLAWELNIKDLLCYS 636

Query: 587  NSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQCADHLAKLGAMTD 766
            +S  A+K I E V   H  A  ++ IK  + +DW   + HT REGN CAD+LAK GA  +
Sbjct: 637  DSATAIKLITEPVDVWHHYAAILNNIKDILNRDWQVSIFHTFREGNACADYLAKHGAHNN 696

Query: 767  EPLLVIQQPPPAMGPLLLADASGVSFTR 850
                 I  PP  +   LLAD SG+ F+R
Sbjct: 697  IVFTTIAIPPAGLNLHLLADVSGIIFSR 724


>GAU34195.1 hypothetical protein TSUD_162960 [Trifolium subterraneum]
          Length = 168

 Score =  150 bits (378), Expect = 5e-41
 Identities = 78/157 (49%), Positives = 101/157 (64%)
 Frame = +2

Query: 380 VVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEH 559
           ++LNVDGS+I +PG SGFGGL+R+  G ++ GF G+   S++L AE+L + HGL L WE 
Sbjct: 12  MILNVDGSSIGNPGISGFGGLIRNSDGAWVHGFAGNIGHSNILHAELLAIYHGLVLAWEL 71

Query: 560 GYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQCADH 739
             +D+ CYS+S  A+K I + V+  H  A  I  IK  + ++W   LVHTLREGN CAD 
Sbjct: 72  DIKDLCCYSDSKTALKLIYDHVNEWHHYAAIIYNIKDFLSRNWRVRLVHTLREGNNCADF 131

Query: 740 LAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
           LAK GA   E    I  PP  M  LLLADASG  F+R
Sbjct: 132 LAKFGARNPEAYSSIAVPPDEMNLLLLADASGTIFSR 168


>GAU16646.1 hypothetical protein TSUD_325960 [Trifolium subterraneum]
          Length = 157

 Score =  148 bits (374), Expect = 1e-40
 Identities = 77/157 (49%), Positives = 101/157 (64%)
 Frame = +2

Query: 380 VVLNVDGSAIQHPGHSGFGGLLRDHTGTFLFGFYGSADQSSVLRAEVLGVMHGLRLCWEH 559
           ++LNVDGS+I +PG SGFGGL+R+  G ++ GF G+   S++L AE+L + HGL L WE 
Sbjct: 1   MILNVDGSSIGNPGISGFGGLIRNSDGAWIHGFAGNIGHSNILHAELLAIYHGLVLAWEL 60

Query: 560 GYRDVICYSNSLNAVKFINEGVSHCHPLANEISIIKKHMQQDWDCLLVHTLREGNQCADH 739
             +D+ CYS+S  A+K I + V+  H  A  I  IK  + ++W   LVHTLREGN CAD 
Sbjct: 61  DIKDLCCYSDSKTALKLIYDHVNEWHHYAAIIYNIKDFLSRNWRVRLVHTLREGNNCADF 120

Query: 740 LAKLGAMTDEPLLVIQQPPPAMGPLLLADASGVSFTR 850
           LAK GA + E    I  PP  M  LLLAD SG  F+R
Sbjct: 121 LAKFGARSPEAYSSIVVPPDGMNLLLLADDSGTIFSR 157

