BLASTX nr result

ID: Glycyrrhiza34_contig00021440 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Glycyrrhiza34_contig00021440
         (652 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

KYP54863.1 Putative ribonuclease H protein At1g65750 family [Caj...    71   1e-20
GAU33009.1 hypothetical protein TSUD_358760 [Trifolium subterran...    80   3e-20
XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [...    69   4e-20
KYP61726.1 Putative ribonuclease H protein At1g65750 family [Caj...    68   5e-20
XP_019447203.1 PREDICTED: uncharacterized protein LOC109350421 [...    70   5e-20
GAU20019.1 hypothetical protein TSUD_273540 [Trifolium subterran...    80   1e-19
GAU30482.1 hypothetical protein TSUD_18620 [Trifolium subterraneum]    79   2e-19
GAU47989.1 hypothetical protein TSUD_272340 [Trifolium subterran...    73   2e-18
GAU47519.1 hypothetical protein TSUD_138910 [Trifolium subterran...    77   2e-18
KYP73000.1 Putative ribonuclease H protein At1g65750 family [Caj...    76   4e-18
GAU36864.1 hypothetical protein TSUD_213880 [Trifolium subterran...    79   6e-18
KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus ca...    72   1e-17
GAU34857.1 hypothetical protein TSUD_259370 [Trifolium subterran...    77   3e-17
GAU40060.1 hypothetical protein TSUD_258530 [Trifolium subterran...    70   5e-17
KYP75188.1 Retrovirus-related Pol polyprotein LINE-1, partial [C...    68   6e-17
GAU10366.1 hypothetical protein TSUD_421750, partial [Trifolium ...    67   7e-17
ABN08132.1 Putative non-LTR retroelement reverse transcriptase, ...    57   4e-16
KYP59313.1 Putative ribonuclease H protein At1g65750 family [Caj...    66   7e-16
GAU36466.1 hypothetical protein TSUD_166320 [Trifolium subterran...    64   7e-16
KYP42973.1 Putative ribonuclease H protein At1g65750 family, par...    69   1e-15

>KYP54863.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 648

 Score = 70.9 bits (172), Expect(2) = 1e-20
 Identities = 51/135 (37%), Positives = 65/135 (48%), Gaps = 2/135 (1%)
 Frame = +2

Query: 251 LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
           LFQWE DLL  L A +    LK    D+   K  + G Y+VKSAYK V N    +   LL
Sbjct: 398 LFQWELDLLSQLAADLGSIVLKNDCCDRWCWKDSNDGIYNVKSAYKAVINGGI-YADFLL 456

Query: 431 *QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNVKM*SVL-CRGCEKEIE-AGHLFFK 604
            + +     P KV  F WK   NRIP+  NL++R V   S   C  C +++E   HL F 
Sbjct: 457 HKFLWSSCVPSKVSGFAWKALLNRIPSKCNLIKRKVLNISASGCAWCGEDLENTSHLLFG 516

Query: 605 CELYSKVWHKCLNWW 649
           C     VW     W+
Sbjct: 517 CYYAYFVWLSNFAWF 531



 Score = 57.0 bits (136), Expect(2) = 1e-20
 Identities = 25/55 (45%), Positives = 35/55 (63%)
 Frame = +3

Query: 57  FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSDTMSWREG 221
           FSSR  K +G+G NT+FW D W   GPL     RLFS+  +K+  V++ + WR+G
Sbjct: 332 FSSRCTKVVGDGRNTFFWKDGWSGQGPLCNRYSRLFSIASDKDVSVANMVLWRDG 386


>GAU33009.1 hypothetical protein TSUD_358760 [Trifolium subterraneum]
          Length = 821

 Score = 79.7 bits (195), Expect(2) = 3e-20
 Identities = 52/141 (36%), Positives = 73/141 (51%), Gaps = 9/141 (6%)
 Frame = +2

Query: 251  LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
            LF WE +L+      ++   L+G   D+ V K+  S  YSV+SAY  +         V  
Sbjct: 618  LFAWEVELVAQWVGVLANFVLQGDATDRWVWKLHPSQSYSVRSAYSYLM--------VSD 669

Query: 431  *QAMEPGGA-------PLKVKAFVWKLAQNRIPTSQNLVRRNV-KM*SVLC-RGCEKEIE 583
               ME   +       PLKV  F+W+L  NR+PT  NL+RR V ++  VLC   C K  +
Sbjct: 670  GSPMEDFASFLWMKSVPLKVNIFIWRLFLNRLPTKDNLLRRGVIEVHMVLCSTNCGKSED 729

Query: 584  AGHLFFKCELYSKVWHKCLNW 646
              HLF +C++YS+VW   LNW
Sbjct: 730  VVHLFLQCDVYSQVWQLVLNW 750



 Score = 46.6 bits (109), Expect(2) = 3e-20
 Identities = 23/66 (34%), Positives = 35/66 (53%)
 Frame = +3

Query: 3   WKDIQNLEMERTGFKQL*FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNK 182
           W+ + N+   R        S  I + +G+G +T FW DSW++ GPL  +  RL+ L  NK
Sbjct: 534 WRALNNVWSGRGLIDPRWLSDNIVRKIGDGRSTSFWVDSWLEVGPLARAFGRLYDLADNK 593

Query: 183 ECLVSD 200
              V+D
Sbjct: 594 NISVAD 599


>XP_019418409.1 PREDICTED: uncharacterized protein LOC109329191 [Lupinus
            angustifolius]
          Length = 953

 Score = 68.9 bits (167), Expect(2) = 4e-20
 Identities = 47/131 (35%), Positives = 65/131 (49%), Gaps = 3/131 (2%)
 Frame = +2

Query: 251  LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
            LF WE D ++DL   V    L    ED  +   + +G YSV++AYK+++N        L 
Sbjct: 715  LFLWEQDEVNDLLNKVEEVRLVQGNEDGWLWVHDKNGTYSVRNAYKVLQN-EVRNDNYLH 773

Query: 431  *QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNV--KM*SVLCRGC-EKEIEAGHLFF 601
             + +     P K+K F W+L    +PT  NL RR +   + S LC  C E E  + HLFF
Sbjct: 774  YKRLWASKVPSKLKCFAWRLFVGGVPTWMNLARRGIIGSLPSTLCAFCGELEESSDHLFF 833

Query: 602  KCELYSKVWHK 634
             C L   VW K
Sbjct: 834  TCSLSYSVWQK 844



 Score = 57.0 bits (136), Expect(2) = 4e-20
 Identities = 26/74 (35%), Positives = 40/74 (54%)
 Frame = +3

Query: 3   WKDIQNLEMERTGFKQL*FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNK 182
           W+D+  L     GF +  F+  +++ +G+G +T FW D WV    LK   ERLF +  NK
Sbjct: 631 WRDLGCLCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECLKNCFERLFQVTLNK 690

Query: 183 ECLVSDTMSWREGI 224
           +  +S    WR G+
Sbjct: 691 DACISSMGEWRNGV 704


>KYP61726.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 554

 Score = 68.2 bits (165), Expect(2) = 5e-20
 Identities = 49/137 (35%), Positives = 64/137 (46%), Gaps = 4/137 (2%)
 Frame = +2

Query: 251 LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
           LFQWE DLL  L A +    LK    D+   K  +   Y+VKSAYK V N    +   LL
Sbjct: 335 LFQWELDLLSQLAADLGSTVLKNDCCDRWCWKDSNDEIYNVKSAYKAVINDGI-YANFLL 393

Query: 431 *QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNVKM*SVLCRGC----EKEIEAGHLF 598
            + +     P KV  F WK   NRIP++ NL++R  K+  +   GC    E      HL 
Sbjct: 394 HKFLWSSCVPSKVSGFAWKALLNRIPSNCNLIKR--KVLDISASGCAWYGEDLENTSHLL 451

Query: 599 FKCELYSKVWHKCLNWW 649
           F C     VW    +W+
Sbjct: 452 FGCYYAYSVWLSIFDWF 468



 Score = 57.4 bits (137), Expect(2) = 5e-20
 Identities = 25/55 (45%), Positives = 35/55 (63%)
 Frame = +3

Query: 57  FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSDTMSWREG 221
           FSSR  K +G+G NT+FW D W   GPL     RLFS+  +K+  V++ + WR+G
Sbjct: 269 FSSRCTKVVGDGQNTFFWKDGWSGQGPLCNRYSRLFSIASDKDVSVANMVLWRDG 323


>XP_019447203.1 PREDICTED: uncharacterized protein LOC109350421 [Lupinus
           angustifolius]
          Length = 456

 Score = 69.7 bits (169), Expect(2) = 5e-20
 Identities = 47/131 (35%), Positives = 65/131 (49%), Gaps = 3/131 (2%)
 Frame = +2

Query: 251 LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
           LF WE D ++DL   V    L    ED  +   + +G YSV++AYK+++N        L 
Sbjct: 156 LFLWEQDEVNDLLNKVEEVRLVQGNEDGWLWVHDKNGTYSVRNAYKVLQN-EVRNDNYLH 214

Query: 431 *QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNV--KM*SVLCRGC-EKEIEAGHLFF 601
            + +     P K+K F W+L    +PT  NL RR +   + S LC  C E E  + HLFF
Sbjct: 215 YKRLWASKVPSKLKCFAWRLFVGGVPTRMNLARRGIIGSLPSTLCAFCGELEESSDHLFF 274

Query: 602 KCELYSKVWHK 634
            C L   VW K
Sbjct: 275 TCSLSYSVWQK 285



 Score = 55.8 bits (133), Expect(2) = 5e-20
 Identities = 25/74 (33%), Positives = 40/74 (54%)
 Frame = +3

Query: 3   WKDIQNLEMERTGFKQL*FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNK 182
           W+D+  +     GF +  F+  +++ +G+G +T FW D WV    LK   ERLF +  NK
Sbjct: 72  WRDLGCVCNRDNGFNKGWFNEGVRRRVGSGQSTLFWRDIWVGGECLKNCFERLFQVTLNK 131

Query: 183 ECLVSDTMSWREGI 224
           +  +S    WR G+
Sbjct: 132 DACISSMDEWRNGV 145


>GAU20019.1 hypothetical protein TSUD_273540 [Trifolium subterraneum]
          Length = 504

 Score = 79.7 bits (195), Expect(2) = 1e-19
 Identities = 50/141 (35%), Positives = 72/141 (51%), Gaps = 9/141 (6%)
 Frame = +2

Query: 251 LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
           LF WE +L+      ++   L+G   D+ V  +  S  YSV+SAY  +            
Sbjct: 272 LFVWEEELVAQCVGVLANFVLQGDATDRWVWNLHPSQSYSVRSAYSYLT--------ASD 323

Query: 431 *QAMEP-------GGAPLKVKAFVWKLAQNRIPTSQNLVRRNV-KM*SVLC-RGCEKEIE 583
             +ME           PLKV  F+W++  NR+PT  NL+RR V ++   LC   C K  +
Sbjct: 324 GSSMEDFASFLWVKSVPLKVNIFIWRIFLNRLPTKDNLLRRGVIEVHQELCSTNCGKAED 383

Query: 584 AGHLFFKCELYSKVWHKCLNW 646
           A HLF +C++YS+VWH  LNW
Sbjct: 384 AVHLFIQCDVYSQVWHLVLNW 404



 Score = 44.7 bits (104), Expect(2) = 1e-19
 Identities = 20/47 (42%), Positives = 29/47 (61%)
 Frame = +3

Query: 60  SSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSD 200
           S  I + +G+G +T FW+DSW++ GPL     RL+ L  NK   V+D
Sbjct: 207 SDNIVRKIGDGRSTAFWADSWLEVGPLARVFGRLYDLADNKHISVAD 253


>GAU30482.1 hypothetical protein TSUD_18620 [Trifolium subterraneum]
          Length = 361

 Score = 78.6 bits (192), Expect(2) = 2e-19
 Identities = 51/141 (36%), Positives = 71/141 (50%), Gaps = 9/141 (6%)
 Frame = +2

Query: 251 LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
           LF WE +L+      ++   L+G   D+ V  +  S  YSV+SAY  +            
Sbjct: 163 LFAWEEELVAQCVGVLANFVLQGDATDRWVWNLHPSQSYSVRSAYSYLT--------ASD 214

Query: 431 *QAMEP-------GGAPLKVKAFVWKLAQNRIPTSQNLVRRNV-KM*SVLC-RGCEKEIE 583
             +ME           PLKV  F+W+L  NR+PT   L+RR V ++   LC   C K  +
Sbjct: 215 GSSMEDFASFLWVKSVPLKVNIFIWRLFLNRLPTKDILLRRGVIEVHQDLCSTNCGKAED 274

Query: 584 AGHLFFKCELYSKVWHKCLNW 646
           A HLF KC++YS+VWH  LNW
Sbjct: 275 AVHLFIKCDVYSQVWHLVLNW 295



 Score = 45.4 bits (106), Expect(2) = 2e-19
 Identities = 20/47 (42%), Positives = 30/47 (63%)
 Frame = +3

Query: 60  SSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSD 200
           S  I + +G+G +T FW+DSW++ GPL  +  RL+ L  NK   V+D
Sbjct: 98  SDNIIRKIGDGRSTAFWADSWLEVGPLARAFGRLYDLADNKNISVAD 144


>GAU47989.1 hypothetical protein TSUD_272340 [Trifolium subterraneum]
          Length = 849

 Score = 73.2 bits (178), Expect(2) = 2e-18
 Identities = 48/135 (35%), Positives = 67/135 (49%), Gaps = 6/135 (4%)
 Frame = +2

Query: 260  WESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL*QA 439
            WE DL+ +    +S   ++   +DK V K+  S  Y+VKSAY  +         V L + 
Sbjct: 620  WEEDLVKECITRLSNVFMQVTEQDKWVWKLHPSSCYNVKSAYSYLTES-----DVHLNED 674

Query: 440  ----MEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNVKM*S-VLCRG-CEKEIEAGHLFF 601
                M     PLKV   +W+L  N++PT  NL+RR +   S +LC   C KE    HLFF
Sbjct: 675  YNRFMRVKSLPLKVNLLMWRLFLNKLPTKDNLLRRGILDGSGILCDTLCGKEENVDHLFF 734

Query: 602  KCELYSKVWHKCLNW 646
            +CE Y K+W     W
Sbjct: 735  QCEHYGKIWALISGW 749



 Score = 47.0 bits (110), Expect(2) = 2e-18
 Identities = 19/40 (47%), Positives = 28/40 (70%)
 Frame = +3

Query: 81  MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSD 200
           +G+G NT FW D+W+D GP++ S  RL++L  NK   V+D
Sbjct: 559 VGDGRNTLFWKDNWLDDGPVERSFSRLYALAENKLVTVAD 598


>GAU47519.1 hypothetical protein TSUD_138910 [Trifolium subterraneum]
          Length = 330

 Score = 76.6 bits (187), Expect(2) = 2e-18
 Identities = 49/141 (34%), Positives = 70/141 (49%), Gaps = 9/141 (6%)
 Frame = +2

Query: 251 LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
           LF WE +L+      ++   L+G   D+ V  +     YSV+SAY  +            
Sbjct: 100 LFVWEEELVAQCVGVLANFVLQGDATDRWVWNLHPLQSYSVRSAYSYLT--------ASD 151

Query: 431 *QAMEP-------GGAPLKVKAFVWKLAQNRIPTSQNLVRRNV-KM*SVLC-RGCEKEIE 583
             +ME           PLKV  F+W+L  NR+PT  NL+RR V ++   LC   C K  +
Sbjct: 152 GSSMEDFASFLWVKSVPLKVNIFIWRLFLNRLPTKDNLLRRGVIEVHQELCSTNCGKAED 211

Query: 584 AGHLFFKCELYSKVWHKCLNW 646
             HLF +C++YS+VWH  LNW
Sbjct: 212 VVHLFIQCDVYSQVWHLVLNW 232



 Score = 43.5 bits (101), Expect(2) = 2e-18
 Identities = 19/47 (40%), Positives = 29/47 (61%)
 Frame = +3

Query: 60  SSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSD 200
           S  I + +G+G +T FW+DSW++ GPL  +  R + L  NK   V+D
Sbjct: 35  SDNIVRKIGDGRSTDFWADSWLEVGPLARAFGRFYDLAVNKHISVAD 81


>KYP73000.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 616

 Score = 76.3 bits (186), Expect(2) = 4e-18
 Identities = 45/135 (33%), Positives = 66/135 (48%), Gaps = 2/135 (1%)
 Frame = +2

Query: 251 LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
           LFQWE +LL  LQ  +    L+    D  V      G+YSV+SAY ++ N    +    L
Sbjct: 383 LFQWEGELLQQLQVDIDFLHLQQGVNDHWVWSASKDGQYSVRSAYNVIVNKDI-FGEFPL 441

Query: 431 *QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNV-KM*SVLCRGCEKEI-EAGHLFFK 604
              +     P KV  F W+   N++PT QNL++R + +     C  C  ++    HLFF+
Sbjct: 442 YNYLWSKFLPSKVSGFTWRSMLNKLPTKQNLIKRGILQAGGGFCIWCGHDLGTVSHLFFE 501

Query: 605 CELYSKVWHKCLNWW 649
           C     +W  CLNW+
Sbjct: 502 CPFAYCIWMLCLNWF 516



 Score = 43.1 bits (100), Expect(2) = 4e-18
 Identities = 23/68 (33%), Positives = 37/68 (54%), Gaps = 2/68 (2%)
 Frame = +3

Query: 3   WKDIQNLEMERTGFKQL*FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSI--ERLFSLNP 176
           W D+  ++ E   F     +SR  K + +G+ T+FW ++W  CGP    +  ERLFS+  
Sbjct: 304 WVDLWRIDKENGWF-----ASRCNKVVRDGTYTFFWQEAW--CGPTAFCVKYERLFSIAT 356

Query: 177 NKECLVSD 200
           NK+  + D
Sbjct: 357 NKDATIDD 364


>GAU36864.1 hypothetical protein TSUD_213880 [Trifolium subterraneum]
          Length = 1204

 Score = 79.3 bits (194), Expect(2) = 6e-18
 Identities = 51/141 (36%), Positives = 72/141 (51%), Gaps = 9/141 (6%)
 Frame = +2

Query: 251  LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
            LF WE +L+      ++   L+G   D+ V  +  S  YSV+SAY  +            
Sbjct: 1043 LFAWEEELVAQCVGVLANFVLQGEETDRWVWNLHPSQSYSVRSAYSYLT--------ASD 1094

Query: 431  *QAMEPGGA-------PLKVKAFVWKLAQNRIPTSQNLVRRNV-KM*SVLC-RGCEKEIE 583
              +ME   +       PLKV  F+W+L  NR+PT  NL+RR V +    LC   C K  +
Sbjct: 1095 GSSMEDFASFLWVKSIPLKVNIFIWRLFLNRLPTKDNLLRRGVIETHQDLCSTNCGKAED 1154

Query: 584  AGHLFFKCELYSKVWHKCLNW 646
            A HLF +C++YS+VWH  LNW
Sbjct: 1155 AVHLFIQCDVYSQVWHLVLNW 1175



 Score = 39.3 bits (90), Expect(2) = 6e-18
 Identities = 18/47 (38%), Positives = 27/47 (57%)
 Frame = +3

Query: 60   SSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSD 200
            S  I + +G+G +T FW+DSW++ GPL  +  R +    NK   V D
Sbjct: 978  SDNIVRKIGDGRSTTFWADSWLEVGPLARAFGRHYDPADNKNISVVD 1024


>KYP44439.1 Retrovirus-related Pol polyprotein LINE-1 [Cajanus cajan]
          Length = 1142

 Score = 72.0 bits (175), Expect(2) = 1e-17
 Identities = 49/136 (36%), Positives = 67/136 (49%), Gaps = 4/136 (2%)
 Frame = +2

Query: 251  LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
            L  WE  LL+ L   ++        EDK +        Y+V SAYK++ N       V+ 
Sbjct: 913  LLVWEQQLLNTLANFINGTKFIISDEDKWLWIAAPERVYTVSSAYKVLRNDIIFASNVIF 972

Query: 431  *QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNV----KM*SVLCRGCEKEIEAGHLF 598
             + +    AP KV AF W++  NRIPT  NL RR V    ++   LCR   KE    HLF
Sbjct: 973  -RWIWTSIAPTKVSAFTWRVILNRIPTKDNLFRRGVLQATQLECGLCR--NKEETTSHLF 1029

Query: 599  FKCELYSKVWHKCLNW 646
            F+CE+  ++W  C NW
Sbjct: 1030 FECEVSFQLWMACFNW 1045



 Score = 45.4 bits (106), Expect(2) = 1e-17
 Identities = 24/75 (32%), Positives = 36/75 (48%)
 Frame = +3

Query: 3    WKDIQNLEMERTGFKQL*FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNK 182
            W D+  +E E         SS   K +GNG +T FW D WV  G L  +  RL+ +  NK
Sbjct: 830  WVDLNRIE-EGDLVSNEWMSSNCCKVIGNGVDTKFWLDKWVGHGILAHTFSRLYQIAINK 888

Query: 183  ECLVSDTMSWREGIL 227
               +++   W  G++
Sbjct: 889  NVSIAEMFEWEGGVV 903


>GAU34857.1 hypothetical protein TSUD_259370 [Trifolium subterraneum]
          Length = 1189

 Score = 76.6 bits (187), Expect(2) = 3e-17
 Identities = 51/138 (36%), Positives = 73/138 (52%), Gaps = 6/138 (4%)
 Frame = +2

Query: 251  LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
            LF WE +L+    A +S  SL+    D  V ++ +SG YSVKSAY  +         V L
Sbjct: 957  LFAWEEELVAGCIARLSNVSLQAGVPDSWVWQLHNSGCYSVKSAYSYLTAS-----EVRL 1011

Query: 431  *QAMEP----GGAPLKVKAFVWKLAQNRIPTSQNLVRR-NVKM*SVLCRG-CEKEIEAGH 592
             +  +        PLKV  FVW L  +R+PT  NL+RR ++   +V C   C K  +  H
Sbjct: 1012 NENFDKFLWLRSVPLKVNIFVWHLFLDRLPTKSNLLRRGSLGAENVYCSTMCGKTEDLNH 1071

Query: 593  LFFKCELYSKVWHKCLNW 646
            LFF+C++YS++W   L W
Sbjct: 1072 LFFQCDVYSRLWLMILQW 1089



 Score = 39.7 bits (91), Expect(2) = 3e-17
 Identities = 18/52 (34%), Positives = 29/52 (55%)
 Frame = +3

Query: 69   IKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSDTMSWREGI 224
            I++ +G G  + FW D W++  PL  S  RL+ L  +K  LV+D  +   G+
Sbjct: 895  IRRKVGGGRGSLFWLDPWLEDSPLSRSFSRLYVLAVDKNILVADMFAAGWGV 946


>GAU40060.1 hypothetical protein TSUD_258530 [Trifolium subterraneum]
          Length = 799

 Score = 70.5 bits (171), Expect(2) = 5e-17
 Identities = 48/135 (35%), Positives = 68/135 (50%), Gaps = 6/135 (4%)
 Frame = +2

Query: 260 WESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMV----ENC*C*W*GVL 427
           WE +L+ +    +S   L+    D+   K+ SS  YSV+SAY  +    EN    +   L
Sbjct: 570 WEEELVRECIMRLSNVVLQDNEHDRWAWKLHSSHVYSVQSAYDYLTATDENLNAGFDKFL 629

Query: 428 L*QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNVKM*SVL--CRGCEKEIEAGHLFF 601
             +++     PLKV  FVW+L  NR+PT  NL RR V   + L     C     A HLFF
Sbjct: 630 WLKSV-----PLKVNLFVWRLFLNRLPTKDNLHRRGVIGATQLTCVSSCGSVETADHLFF 684

Query: 602 KCELYSKVWHKCLNW 646
           +C+ Y ++WH   NW
Sbjct: 685 QCDFYGQLWHLLSNW 699



 Score = 45.1 bits (105), Expect(2) = 5e-17
 Identities = 21/51 (41%), Positives = 29/51 (56%)
 Frame = +3

Query: 69  IKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSDTMSWREG 221
           I K +G+G NT FW+D W++ GPL+    RL+ L  NK   + D   W  G
Sbjct: 505 IVKKIGDGRNTLFWTDCWLEDGPLERVYSRLYDLAENKNATIFD--MWEAG 553


>KYP75188.1 Retrovirus-related Pol polyprotein LINE-1, partial [Cajanus cajan]
          Length = 855

 Score = 68.2 bits (165), Expect(2) = 6e-17
 Identities = 43/134 (32%), Positives = 65/134 (48%), Gaps = 2/134 (1%)
 Frame = +2

Query: 251  LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
            LFQWE   L  L   ++   +    +D      + SG YSVKS Y ++ N    +   LL
Sbjct: 683  LFQWEQSQLSLLMMDLTCVQMDDTNDDSWKWSADPSGLYSVKSGYYIIVNASISY-FYLL 741

Query: 431  *QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNVKM*SVL-CRGCEKEIEAG-HLFFK 604
             + +       KV  F W++  +RIPT  NL +RN+ + S   C  C + ++   H+FF+
Sbjct: 742  QRFIWCRLVRFKVSCFAWRVMLDRIPTKVNLAKRNLLLSSNSGCVWCNQGLDTSYHIFFE 801

Query: 605  CELYSKVWHKCLNW 646
            C    +VW  CL W
Sbjct: 802  CSFAYQVWMLCLEW 815



 Score = 47.0 bits (110), Expect(2) = 6e-17
 Identities = 27/72 (37%), Positives = 35/72 (48%)
 Frame = +3

Query: 3   WKDIQNLEMERTGFKQL*FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNK 182
           W D+ N+E E        FS    + +GNG NT FW D W    P      RLFS++ NK
Sbjct: 600 WYDLWNIE-EGATITCNWFSKECVRVVGNGRNTSFWRDPWCTTKPFCERYSRLFSISNNK 658

Query: 183 ECLVSDTMSWRE 218
           +  V+D    RE
Sbjct: 659 DMSVADMKLCRE 670


>GAU10366.1 hypothetical protein TSUD_421750, partial [Trifolium subterraneum]
          Length = 373

 Score = 67.4 bits (163), Expect(2) = 7e-17
 Identities = 50/145 (34%), Positives = 70/145 (48%), Gaps = 6/145 (4%)
 Frame = +2

Query: 215 GGDFDXXXXXXXLFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMV 394
           G D +       L  WE +L+ +    +S   L+    D+ V K+ SS  YSV+SAY  +
Sbjct: 232 GVDGEAWKWRRSLRAWEEELVRECIMRLSNVVLQDNEHDRWVWKLHSSHVYSVQSAYGYI 291

Query: 395 ----ENC*C*W*GVLL*QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNV-KM*SVLC 559
               EN    +   L  +++     PLKV  FVW+L  NR+PT  NL RR V     + C
Sbjct: 292 TATDENLNAGFDKFLWLKSV-----PLKVNLFVWRLFLNRLPTKDNLHRRGVLGATQITC 346

Query: 560 -RGCEKEIEAGHLFFKCELYSKVWH 631
              C     A HLFF C+ Y ++WH
Sbjct: 347 VSSCGSVETADHLFFLCDFYGQLWH 371



 Score = 47.8 bits (112), Expect(2) = 7e-17
 Identities = 23/73 (31%), Positives = 35/73 (47%)
 Frame = +3

Query: 3   WKDIQNLEMERTGFKQL*FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNK 182
           W+D+  + +             I K +G+G NT FW+D W++ GPL+    RL+ L  NK
Sbjct: 160 WRDLNQIRVGTGLVDDRWLEENIVKKIGDGRNTLFWTDCWLEDGPLERVYSRLYDLADNK 219

Query: 183 ECLVSDTMSWREG 221
              + D   W  G
Sbjct: 220 NATIFD--MWEAG 230


>ABN08132.1 Putative non-LTR retroelement reverse transcriptase, related,
           partial [Medicago truncatula]
          Length = 532

 Score = 56.6 bits (135), Expect(2) = 4e-16
 Identities = 27/57 (47%), Positives = 35/57 (61%), Gaps = 2/57 (3%)
 Frame = +3

Query: 57  FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSD--TMSWREG 221
           F   I++ +G+G+NT+FW DSWV   PL     RLF L  NKEC V +  T+ W EG
Sbjct: 234 FDENIRRVVGDGNNTFFWYDSWVGEMPLCTKFPRLFDLAVNKECSVGEMVTLGWAEG 290



 Score = 55.8 bits (133), Expect(2) = 4e-16
 Identities = 40/138 (28%), Positives = 63/138 (45%), Gaps = 6/138 (4%)
 Frame = +2

Query: 251 LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
           L  WE D + +    +    L+    DK    ++    YSV+ +Y+ +        G  +
Sbjct: 300 LLAWEEDSVRECTLLLHNVVLQVNVPDKWSWLLDPINGYSVRESYRHITTS-----GEYV 354

Query: 431 *QAMEPGG----APLKVKAFVWKLAQNRIPTSQNLVRRNVKM*SVL--CRGCEKEIEAGH 592
            Q++         P KV  FVW+L +NR+PT  NL+RR + + +V+     C K   A H
Sbjct: 355 DQSVVDDVWHRYIPQKVSLFVWRLLRNRLPTKDNLMRRRIILANVVDCVYECGKLESATH 414

Query: 593 LFFKCELYSKVWHKCLNW 646
           LF  C + + VW    NW
Sbjct: 415 LFLDCRIPTMVWLHVQNW 432


>KYP59313.1 Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 462

 Score = 66.2 bits (160), Expect(2) = 7e-16
 Identities = 47/137 (34%), Positives = 67/137 (48%), Gaps = 4/137 (2%)
 Frame = +2

Query: 251 LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
           LFQWE D L  L   +    L     D    K +S G Y  KSAY+++ N    +    +
Sbjct: 231 LFQWEEDQLQLLYLELQSVKLFEEKFDGWRWKHDSGGSYYDKSAYQVIINQSI-YADFSM 289

Query: 431 *QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNVKM*SVL----CRGCEKEIEAGHLF 598
            + +     P KV +F W+   +RIPT QNL++R V   +V     C  CE+   + HLF
Sbjct: 290 YRYLWSKLIPSKVSSFGWRAILDRIPTKQNLIKRKVLPSNVASCVWCGLCEE--TSSHLF 347

Query: 599 FKCELYSKVWHKCLNWW 649
           F+C    K+W  CL W+
Sbjct: 348 FECFYAFKLWMSCLQWF 364



 Score = 45.4 bits (106), Expect(2) = 7e-16
 Identities = 22/55 (40%), Positives = 28/55 (50%)
 Frame = +3

Query: 57  FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSDTMSWREG 221
           FS    K +GNG NT FW D W     L     RL+S++ NK   ++D    REG
Sbjct: 165 FSKGCVKEVGNGENTMFWDDVWYGSSALSTRYARLYSISNNKSATLADMCLRREG 219


>GAU36466.1 hypothetical protein TSUD_166320 [Trifolium subterraneum]
          Length = 307

 Score = 64.3 bits (155), Expect(2) = 7e-16
 Identities = 47/131 (35%), Positives = 64/131 (48%), Gaps = 3/131 (2%)
 Frame = +2

Query: 263 ESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL*QAM 442
           E DL  DL   V+ +   GV ED  V K + SG +SV+SAY  +           L   +
Sbjct: 133 EFDLAHDLMQVVTQSPTLGV-EDSWVWKYDPSGRFSVRSAYLTLTGSEVVSDPNPLLSRV 191

Query: 443 EPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNV--KM*SVLCRGCEKEIEA-GHLFFKCEL 613
               AP KV  F W+L Q+R+ T QNL+RR V   +    C  C   +E+  HLF  C+ 
Sbjct: 192 WKSWAPSKVIVFSWQLLQDRVATRQNLLRRRVIRDISDSFCALCGVSVESVDHLFTSCDS 251

Query: 614 YSKVWHKCLNW 646
              VW+K + W
Sbjct: 252 IFPVWYKLVRW 262



 Score = 47.4 bits (111), Expect(2) = 7e-16
 Identities = 25/73 (34%), Positives = 36/73 (49%)
 Frame = +3

Query: 3   WKDIQNLEMERTGFKQL*FSSRIKK*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNK 182
           WKD+  L           FS  + + +GNG  T FW D W+   PLK   +RLF ++   
Sbjct: 46  WKDVSLLGDSTVTCSDW-FSDGMIRRVGNGRETAFWFDPWLGSVPLKNRFQRLFQVSEQC 104

Query: 183 ECLVSDTMSWREG 221
             L+ D +SW +G
Sbjct: 105 LNLIGDMISWVQG 117


>KYP42973.1 Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 370

 Score = 68.9 bits (167), Expect(2) = 1e-15
 Identities = 48/134 (35%), Positives = 64/134 (47%), Gaps = 2/134 (1%)
 Frame = +2

Query: 251 LFQWESDLLDDLQASVSLASLKGVGEDKRVRKMESSGEYSVKSAYKMVENC*C*W*GVLL 430
           L  WE  LL+ L   ++        EDK    +     Y V  AYK++ N       V+ 
Sbjct: 195 LLVWEQQLLNTLVNVINGLKFIVSDEDKWSWIVAPENVYIVSLAYKVLRNDIIFASNVIF 254

Query: 431 *QAMEPGGAPLKVKAFVWKLAQNRIPTSQNLVRRNVKM*SVL-CRGCE-KEIEAGHLFFK 604
            Q +    AP KV  F W++  NRIPT  NL RR V   + L C  C+ KE    HLFF+
Sbjct: 255 -QWIWTSIAPTKVSTFAWRVILNRIPTKDNLFRRGVLQATQLECGLCKNKEETTSHLFFE 313

Query: 605 CELYSKVWHKCLNW 646
           CE+  ++W  C NW
Sbjct: 314 CEVSFQLWMACFNW 327



 Score = 42.0 bits (97), Expect(2) = 1e-15
 Identities = 18/50 (36%), Positives = 26/50 (52%)
 Frame = +3

Query: 75  K*MGNGSNTYFWSDSWVDCGPLKGSIERLFSLNPNKECLVSDTMSWREGI 224
           K +GNG++T FW D WV  G L     RL+ +  NK   + +   W  G+
Sbjct: 135 KVIGNGADTKFWLDKWVGHGILAHRFSRLYQIAINKNASIVEMSEWEGGV 184


Top