BLASTX nr result

ID: Astragalus22_contig00020811 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00020811
         (820 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX71533.1| ribonuclease H [Trifolium pratense]                    207   6e-58
dbj|GAU38301.1| hypothetical protein TSUD_157860 [Trifolium subt...   193   1e-56
gb|PNY14301.1| ribonuclease H [Trifolium pratense]                    203   1e-55
gb|PNY15111.1| ribonuclease H [Trifolium pratense]                    201   4e-55
gb|PNX72264.1| ribonuclease H [Trifolium pratense]                    193   1e-52
dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subte...   194   2e-52
gb|AFK38936.1| unknown [Medicago truncatula]                          164   2e-45
dbj|GAU35964.1| hypothetical protein TSUD_207680 [Trifolium subt...   144   6e-39
dbj|GAU30026.1| hypothetical protein TSUD_161120 [Trifolium subt...   152   6e-38
dbj|GAU47092.1| hypothetical protein TSUD_369250 [Trifolium subt...   144   2e-37
dbj|GAU41525.1| hypothetical protein TSUD_140560 [Trifolium subt...   145   1e-35
dbj|GAU29911.1| hypothetical protein TSUD_148190 [Trifolium subt...   134   1e-32
dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subte...   131   1e-30
dbj|GAU39667.1| hypothetical protein TSUD_60340 [Trifolium subte...   129   7e-30
gb|PNX85413.1| ribonuclease H [Trifolium pratense] >gi|133524134...   117   2e-28
gb|PNX58626.1| ribonuclease H, partial [Trifolium pratense]           114   3e-27
gb|PNY04967.1| ribonuclease H [Trifolium pratense]                    114   3e-27
dbj|GAU21787.1| hypothetical protein TSUD_329120, partial [Trifo...   119   9e-27
dbj|GAU32098.1| hypothetical protein TSUD_292220 [Trifolium subt...   112   3e-26
dbj|GAU37237.1| hypothetical protein TSUD_375390 [Trifolium subt...   112   6e-26

>gb|PNX71533.1| ribonuclease H [Trifolium pratense]
          Length = 798

 Score =  207 bits (527), Expect = 6e-58
 Identities = 113/280 (40%), Positives = 162/280 (57%), Gaps = 14/280 (5%)
 Frame = +2

Query: 2    SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181
            SNL KK I LD+ CPLC++  E+     S+H+FL+C++ +L LFAS L  H P+  D+ +
Sbjct: 500  SNLHKKGITLDLLCPLCSSEEES-----SQHLFLKCDMFKLTLFASHLGSHIPIDIDLHD 554

Query: 182  WLLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSS 352
            W+LKWL C+D  G QLFC ++W+    +N  +FN  + DP  +A  A+ F+ E+  AN S
Sbjct: 555  WILKWLVCQDPLGVQLFCTLLWKFWAGRNAVIFNGWQMDPTFLALDALSFVQEFNEANPS 614

Query: 353  RLHFSLQEQQAPIPANHLGT---CLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI 523
            R   +L  Q    P+    T    ++VD G  N G T WGL + N +G+ ++SACKR++I
Sbjct: 615  RNRRALVSQSISEPSRSTCTSMNSMFVDAGCCNSGHTVWGLVLRNLNGETVFSACKREDI 674

Query: 524  --------AVDIRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLL 679
                    A+ +RW +Q   +      +IYSDA  V  CI+KR + A+I  I QDCR+L+
Sbjct: 675  TAEPLLAEALGVRWALQVATDQGINSVSIYSDAANVVNCINKRSNFAAINLIAQDCRNLM 734

Query: 680  ELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPPV 799
              +   SV  I R  N  AH LV LAK  G+  WLG  P+
Sbjct: 735  AGLGNVSVMFISRTQNCDAHNLVSLAKVVGSRTWLGVAPL 774


>dbj|GAU38301.1| hypothetical protein TSUD_157860 [Trifolium subterraneum]
          Length = 317

 Score =  193 bits (491), Expect = 1e-56
 Identities = 105/280 (37%), Positives = 152/280 (54%), Gaps = 12/280 (4%)
 Frame = +2

Query: 5   NLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEW 184
           NL KK I  D++CPLC+   E+     S H+F+ CN+ RL LFAS L  H P+  D+  W
Sbjct: 26  NLSKKGINFDLSCPLCHHGLES-----SNHLFMNCNLMRLTLFASNLGSHIPVSVDVSVW 80

Query: 185 LLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSR 355
           +L WLTC+D  G QLFC+++W+    +N+ +F     DPI +A  A  ++ E+  AN  R
Sbjct: 81  ILSWLTCKDMIGTQLFCVLLWKFWYGRNQVIFKGVVLDPIALAAEAALYVHEFNEANPRR 140

Query: 356 LHFSLQEQQAPIPANHLG-TCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD 532
               + +Q +    +      ++ D G FN+G T WG+ + N DG   +SACKR+ I V+
Sbjct: 141 CSQVVLQQASVSRLDDANMQLMFTDAGCFNNGYTGWGIVLRNVDGTTSFSACKREEIEVE 200

Query: 533 --------IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELI 688
                   +RW +Q  L+ +     I SDA  V  CI+KR  +ASI+ I QDCR LL   
Sbjct: 201 PAVAEALGVRWALQLSLDQHLDNFIILSDAANVVNCIAKRISLASIDLIAQDCRDLLCNF 260

Query: 689 LGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPPVNCN 808
              S+  + R  N+ AH +  LAK  G+  W+G  P   N
Sbjct: 261 SNVSIKFVGRALNIDAHNVASLAKCVGSRTWVGSAPTVSN 300


>gb|PNY14301.1| ribonuclease H [Trifolium pratense]
          Length = 1196

 Score =  203 bits (516), Expect = 1e-55
 Identities = 113/279 (40%), Positives = 158/279 (56%), Gaps = 13/279 (4%)
 Frame = +2

Query: 5    NLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEW 184
            NL KK I LD++CPLC+   E+     S H+F+ CN+ RL LFAS L  H P   D+  W
Sbjct: 905  NLAKKGINLDLSCPLCHHVLES-----SNHLFMHCNIMRLTLFASNLGSHIPHSVDLSVW 959

Query: 185  LLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSR 355
            +L WLTC+D  G QLFC+++W+    +N+ +F +   DPI +A  A+E++ E+  AN  R
Sbjct: 960  ILSWLTCKDMIGTQLFCVLLWKFWYGRNQVIFKDAVFDPILLAADAIEYVHEFNEANPRR 1019

Query: 356  LH-FSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD 532
             +   LQ   AP   +     ++ D G FN+G T WGL + N DG   +SACKR+NI V+
Sbjct: 1020 CNQVVLQHISAPRLDDSNMQLMFTDAGCFNNGYTGWGLVLRNVDGTTSFSACKRENIEVE 1079

Query: 533  --------IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELI 688
                    +RW ++  L  +     I SDA  V  CI+KR  +ASIE I QDCR LL   
Sbjct: 1080 PALAEALGVRWALEFALAQHLDNIIILSDAANVVNCIAKRTVLASIELIAQDCRDLLCNF 1139

Query: 689  LGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLG-PPPVN 802
               S+  + R  N+ AH +  LAK  G+  W+G  PPV+
Sbjct: 1140 SNVSIKFVSRVSNVDAHNVASLAKFVGSRTWIGNAPPVS 1178


>gb|PNY15111.1| ribonuclease H [Trifolium pratense]
          Length = 1334

 Score =  201 bits (512), Expect = 4e-55
 Identities = 110/284 (38%), Positives = 163/284 (57%), Gaps = 14/284 (4%)
 Frame = +2

Query: 2    SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181
            +NL +K +Q++  CP C++APET++HLF     L C++T+L  FAS+L    P    +  
Sbjct: 1045 ANLVRKGVQIENLCPQCHSAPETIDHLF-----LHCHLTQLTWFASQLGARVPQSVPVHI 1099

Query: 182  WLLKWLTCRDTEGAQLFCIM---IWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSS 352
            WLL+ LTC DT GAQLFC++   IW A+N  VFNNK  DPI IA+ A+ F+ E   + S 
Sbjct: 1100 WLLQGLTCDDTRGAQLFCVLMWKIWNARNNLVFNNKLVDPIAIAQEAMYFMQEL--SPSP 1157

Query: 353  RLHFSLQEQQAPIPANHLGTC---LYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI 523
              H +   Q A + A  + +     YVD G F+   T WG+ + N  G ++ SAC+++ I
Sbjct: 1158 HEHNATPMQDAVLAAQPMPSAPHVFYVDAGCFSGNATGWGMVIYNQSGRVVLSACRKELI 1217

Query: 524  --------AVDIRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLL 679
                    A+ +RW +Q+ +E+N     I SDA  V  CI+   HVA I+ ++QDC  L+
Sbjct: 1218 DVEPVLAEAIGVRWCLQKAIELNMTDIVIVSDAATVVSCINSNKHVAVIDLVIQDCNLLI 1277

Query: 680  ELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPPVNCNA 811
            E +    V H+RR  N+VAH L G +   G + W+G  P + +A
Sbjct: 1278 EQLDSVVVTHVRRHLNVVAHGLAGFSNVVGTKLWMGVVPNSISA 1321


>gb|PNX72264.1| ribonuclease H [Trifolium pratense]
          Length = 854

 Score =  193 bits (490), Expect = 1e-52
 Identities = 106/282 (37%), Positives = 157/282 (55%), Gaps = 15/282 (5%)
 Frame = +2

Query: 2    SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181
            +NL  K I LD+ CPLC    E+     S+H+FL+C++ +L LFAS L  H P+  D+ +
Sbjct: 557  TNLHNKGITLDLQCPLCFREEES-----SQHLFLKCDIFKLTLFASHLGSHIPMNIDLHD 611

Query: 182  WLLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSS 352
            W+L+WL C+D  G QLFC+++W+    +N AVFN  + DP R+A  A+ F+ ++  AN  
Sbjct: 612  WILEWLLCQDPMGVQLFCVLLWKFWAGRNAAVFNGVQLDPGRLAIDAMSFVHDFNEANPP 671

Query: 353  RLHFSLQEQQAPIPANHLGT----CLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDN 520
            R     +   A +P     T     L+VD G  N G T WGL + N+DG+ + SACKR++
Sbjct: 672  RCR---RAPVAHVPIQPGMTNPIFSLFVDAGCSNSGHTVWGLVLRNSDGETVLSACKRED 728

Query: 521  IAVD--------IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSL 676
              VD        +RW +Q V++      +IYSDA  V  CI++    A+I  I +DCR L
Sbjct: 729  FYVDPLMAEALGVRWALQLVVDQGINSVSIYSDAANVVNCINRNSSFAAINLIAEDCRKL 788

Query: 677  LELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPPVN 802
            +  +    V  + R  N  AH L  LA+  GN  W+G  P++
Sbjct: 789  MNRLTNVCVLFVSRTQNSDAHNLASLARIMGNRTWVGVVPLS 830


>dbj|GAU35627.1| hypothetical protein TSUD_30450 [Trifolium subterraneum]
          Length = 1475

 Score =  194 bits (492), Expect = 2e-52
 Identities = 107/277 (38%), Positives = 148/277 (53%), Gaps = 12/277 (4%)
 Frame = +2

Query: 2    SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181
            +NL KK I LD+ CPLC+   E+  HLF     L+C++ +L LFAS L  H PLQ D+ +
Sbjct: 1178 ANLHKKGISLDLQCPLCHHEVESTNHLF-----LQCDLMKLTLFASHLGSHMPLQVDLYD 1232

Query: 182  WLLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSS 352
            W+  WLTC DT   QLFC ++W+    +N  VF   K DP+ + +  + F+ E+  AN  
Sbjct: 1233 WIFSWLTCHDTLDTQLFCTLLWKFWATRNNVVFRGDKLDPVCLVDEVMSFVQEFNEANPP 1292

Query: 353  RL-HFSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAV 529
            R    SL         +     ++VD G   +G T WGL + N D    +SACK D+IAV
Sbjct: 1293 RQGRVSLPLTTVTPSISRPSFSVFVDAGCNLNGPTVWGLVLKNHDRITTFSACKYDDIAV 1352

Query: 530  D--------IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLEL 685
            +        +RW +Q V E       I+SDA  V  CI  +  + +IE + QDCR LL  
Sbjct: 1353 EPVMAEALGVRWAIQFVREQGLHSVCIFSDAANVVDCICNKVKLDAIEMVAQDCRELLSS 1412

Query: 686  ILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPP 796
            +   SV  +RRD N+ AH L  LA+  GN  W+G  P
Sbjct: 1413 LPNVSVLFVRRDQNIDAHNLASLARLVGNRTWVGAAP 1449


>gb|AFK38936.1| unknown [Medicago truncatula]
          Length = 297

 Score =  164 bits (415), Expect = 2e-45
 Identities = 94/280 (33%), Positives = 145/280 (51%), Gaps = 15/280 (5%)
 Frame = +2

Query: 5   NLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEW 184
           NLR++ + LD  CPLC  A E+     S H+F+ C +T    FAS L   PP QTD+  W
Sbjct: 14  NLRRRGVVLDTVCPLCFDADES-----SNHLFMACPMTLQVWFASPLGFQPPPQTDLNAW 68

Query: 185 LLKWLTCRDTEGAQLFCIMIWRA---KNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSR 355
           L  WL+ ++    QLFC+ +W+    +N+A+FN    +P  +A +A +F++E+  AN +R
Sbjct: 69  LQSWLSAKEPLAVQLFCVCLWKIWFFRNQAIFNQVVFEPRMVAASAHDFVSEFNLANPTR 128

Query: 356 ----LHFSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI 523
               L    Q   AP P + L     +D G    G   WGL + N + +++++A +  +I
Sbjct: 129 SVDRLQIPAQVWIAP-PTDFLKA--NIDAGRDKHGKVTWGLVIRNHESEVLFAATQSPDI 185

Query: 524 AVD--------IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLL 679
             D        +RWG+Q VLE+         DA VV  C +    +ASI   + DC  L 
Sbjct: 186 MADPLLVETLGLRWGIQTVLELQLSNVMFELDASVVVKCFNGLSTIASISPFISDCHDLF 245

Query: 680 ELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPPV 799
             ++G SV+ + R  N+ AH L  +AK  G+  W+G  P+
Sbjct: 246 GSLVGSSVSFVNRSCNVAAHELAQVAKSIGSRTWVGNAPL 285


>dbj|GAU35964.1| hypothetical protein TSUD_207680 [Trifolium subterraneum]
          Length = 198

 Score =  144 bits (363), Expect = 6e-39
 Identities = 79/180 (43%), Positives = 108/180 (60%), Gaps = 8/180 (4%)
 Frame = +2

Query: 2   SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181
           + L+ K I LD+ CPLC+     LE   + H+FL+C++ +  LFAS L  H PL TD+  
Sbjct: 13  AKLKNKGISLDLLCPLCH-----LEEESASHLFLQCDLMKFTLFASHLGFHVPLNTDLHY 67

Query: 182 WLLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAAN-- 346
           W+LKWLTC+D  G+QLFC ++W+   A N  VFN  + +P+RIAE A+ F+ EY AAN  
Sbjct: 68  WILKWLTCQDALGSQLFCTLLWKFWTAINNVVFNGIQLEPVRIAEEAMSFVQEYNAANPI 127

Query: 347 -SSRLHFSLQE--QQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRD 517
              R+  SL      AP P       ++VDVG    G T WGL + N D + ++SACKRD
Sbjct: 128 KRGRISSSLPNILPAAPRPL----FSIFVDVGCCVLGPTTWGLVIKNQDCNCVFSACKRD 183


>dbj|GAU30026.1| hypothetical protein TSUD_161120 [Trifolium subterraneum]
          Length = 1957

 Score =  152 bits (384), Expect = 6e-38
 Identities = 97/283 (34%), Positives = 146/283 (51%), Gaps = 20/283 (7%)
 Frame = +2

Query: 8    LRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEWL 187
            L++K + LD  CPLC  A E      SEH+F++C + +   F+S L +H P Q  +  W+
Sbjct: 1419 LQRKGVILDTICPLCFEAEEN-----SEHLFMKCRLAQQTWFSSCLGLHVPSQMSLKNWM 1473

Query: 188  LKWLTCRDTEGAQLFCIM---IWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRL 358
             +WL  ++   +QLF I    IW+ +N+ VF N   DP  IA AA +F  E+  AN    
Sbjct: 1474 CEWLISKNQSASQLFGITLSKIWKGRNQVVFQNALFDPCHIAIAAADFTLEFNCANPPN- 1532

Query: 359  HFSLQEQQAP-IPANHLGTC-------LYVDVGSFNDGTTCWGLCVVNADGDIIYSACKR 514
                 E   P I A     C       L VD G FNDG   +G+ V +  G++ ++A K 
Sbjct: 1533 -----EAAVPVITATETWCCPPTGMSKLNVDAGCFNDGLLGFGMVVRDNLGNVCFAATKL 1587

Query: 515  DNI--------AVDIRWGMQQVLEMNFVPDAIY-SDAQVVTLCISKRFHVASIEHIMQDC 667
            +          A+ +RW +  +L  N V   I  +D++VV  C+     ++ IE+I+ DC
Sbjct: 1588 EKKQASPTLAEALALRWCLHWILSSNQVGHFIVETDSEVVVKCLQGVSSLSEIENIILDC 1647

Query: 668  RSLLELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPP 796
              ++  +  CSV  IRR  N+VAH LVG+AK  G+ +W+G  P
Sbjct: 1648 SDIMSNLSNCSVVFIRRCKNIVAHSLVGVAKHVGSRSWVGYIP 1690


>dbj|GAU47092.1| hypothetical protein TSUD_369250 [Trifolium subterraneum]
          Length = 335

 Score =  144 bits (363), Expect = 2e-37
 Identities = 92/283 (32%), Positives = 142/283 (50%), Gaps = 14/283 (4%)
 Frame = +2

Query: 2   SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181
           + L KK + LD  CPLC    E  EHLF     + C   +L  FAS L +H P   D+  
Sbjct: 51  ARLAKKGLTLDPWCPLCYQQVEDYEHLF-----MSCPFAKLTWFASPLDLHAPSNVDVNS 105

Query: 182 WLLKWLTCRDTEGAQLFCIM--IWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSR 355
           W+L+ L+    EG Q+FC M  IW  +N+ +F  +   P  +A +A  F+ E+       
Sbjct: 106 WVLQGLSNPLVEGVQIFCTMSKIWFHRNKLIFKQQAFVPHEVASSASSFVAEFSPTFLRE 165

Query: 356 LHFSLQEQ-QAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI--- 523
           ++ +  +  +A    + +   + VD GSF++G+T WGL V + +  +I SAC+ + I   
Sbjct: 166 IYMNTSDVLEASQVVSPVCNRICVDAGSFSNGSTGWGLIVKDHESSVILSACRFEEIYTC 225

Query: 524 -----AVDIRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELI 688
                A+ IRW +Q  +++N+    I SDA  +   I  +   A +  I+QDC SL    
Sbjct: 226 PILAEALGIRWAIQTAIDLNYNQVTIVSDALTIVKGIEGKTCPAEVALIVQDCISLCSNF 285

Query: 689 LGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPP---VNCN 808
           +  +V +++R  N  AH LV L+K  G   W G  P   V CN
Sbjct: 286 MHVAVVYVKRTLNTEAHNLVQLSKHVGCRTWSGIIPNLAVVCN 328


>dbj|GAU41525.1| hypothetical protein TSUD_140560 [Trifolium subterraneum]
          Length = 1610

 Score =  145 bits (367), Expect = 1e-35
 Identities = 78/210 (37%), Positives = 120/210 (57%), Gaps = 12/210 (5%)
 Frame = +2

Query: 5    NLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEW 184
            NL+KK I LD +CPLC+   E   HLF     + CN+ +LALFAS L  HPP+  D+  W
Sbjct: 1406 NLKKKGISLDTSCPLCHNDSENAHHLF-----MHCNMLKLALFASPLGCHPPMNVDLNCW 1460

Query: 185  LLKWLTCRDTEGAQLFCIMIWR---AKNEAVFNNKKPDPIRIAEAAVEFITEYIAANS-S 352
            LL+WL C D  GAQLFC ++W+   A+N+ VFN    +P+R+A++A+ F+ E+  AN+ S
Sbjct: 1461 LLEWLNCSDKLGAQLFCTILWKFWFARNQYVFNGYPIEPLRLAQSALLFVQEFNEANNLS 1520

Query: 353  RLHFSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI--- 523
            R             A+     ++VD G F++  T WGL + +  G++ ++AC+R++I   
Sbjct: 1521 RSTHVATRVHNTNSASPCQFSMFVDAGCFSNARTGWGLVLKDQRGNVTWNACRREDIEVT 1580

Query: 524  -----AVDIRWGMQQVLEMNFVPDAIYSDA 598
                 A+++RW +Q  L       +   DA
Sbjct: 1581 PILAEALELRWAIQSALSQGIQSISFNCDA 1610


>dbj|GAU29911.1| hypothetical protein TSUD_148190 [Trifolium subterraneum]
          Length = 482

 Score =  134 bits (338), Expect = 1e-32
 Identities = 84/274 (30%), Positives = 137/274 (50%), Gaps = 14/274 (5%)
 Frame = +2

Query: 2    SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181
            + L KK + LD   PLC    E  EHLF     + C  ++L  FAS L +H P   D+  
Sbjct: 195  ARLAKKGLTLDPYFPLCYQQAEDYEHLF-----MSCPFSKLTWFASPLGLHAPSNVDVNS 249

Query: 182  WLLKWLTCRDTEGAQLFCIMIWRA---KNEAVFNNKK--PDPIRIAEAAVEFITEYIAAN 346
            W+L+ L+    EG Q+FC  +W+    +N+ +F  +   P    +A +A  F  E+    
Sbjct: 250  WVLQGLSNPLVEGVQIFCTSLWKIWFHRNKLIFEQQAFVPHEYEVASSASSFGAEFSPTF 309

Query: 347  SSRLHFSLQEQ-QAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNI 523
               +  +  +  +A    + +   + VD G F++G+T WGL V + +G +I+SAC+ + I
Sbjct: 310  LREIDMNTSDVLEASQVVSPICNRICVDAGCFSNGSTGWGLIVKDHEGSVIFSACRFEEI 369

Query: 524  --------AVDIRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLL 679
                    A+ IRW ++  +++N+    I SDA  +   I  +   A +E I+QDC SL 
Sbjct: 370  HTSPILAEALAIRWAIRTAIDLNYNQVTIVSDALTIVKDIEGKTCPAKVELIVQDCISLC 429

Query: 680  ELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENW 781
               +  +V +++R  N  AH LV L+K  G   W
Sbjct: 430  SNFMHVAVVYVKRTLNTEAHNLVQLSKHVGCRTW 463


>dbj|GAU39028.1| hypothetical protein TSUD_59840 [Trifolium subterraneum]
          Length = 1626

 Score =  131 bits (329), Expect = 1e-30
 Identities = 84/279 (30%), Positives = 138/279 (49%), Gaps = 15/279 (5%)
 Frame = +2

Query: 8    LRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEWL 187
            L KK I LD  CPLC    E      +EH+F++C +++   F+S L +H P    +  W+
Sbjct: 1341 LEKKGITLDTTCPLCFNDIEC-----NEHLFMQCPLSKQVWFSSPLGLHAPNNFSLNSWM 1395

Query: 188  LKWLTCRDTEGAQLFCI---MIWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRL 358
              WL+  D   +QLF     MIW+ +N+ +F N+K  PI +A A+ +F+ E+ +   S  
Sbjct: 1396 QLWLSNPDKLASQLFSTTLWMIWKGRNKLIFKNEKFCPIYVAAASSDFVAEFNSGTCSFE 1455

Query: 359  HFSLQEQQAPIPANHLGTC-LYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD- 532
            +    +          G   + +D G F++GTT WG+ + N  G + ++A   + I V  
Sbjct: 1456 NIPSCDNPGKWEHPEQGKLKVNIDAGCFSNGTTGWGMIMRNHLGMVEFAATHLEKIKVSS 1515

Query: 533  -------IRWGMQQVLEMNFVPDAIY-SDAQVVTLCISKRFHVASIEHIMQDCRSLLELI 688
                   +RW +Q +         I  SD++V   C++       +E+I+QDCR+ L  +
Sbjct: 1516 TLAETMALRWCLQWIQASTHHEHIIIESDSEVSVKCLNGSICDVLVENIIQDCRNFLSSL 1575

Query: 689  LGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLG--PPPV 799
                V  +RR  N+  H L  LA+  G ++W+G  P PV
Sbjct: 1576 PNVIVVFVRRSKNVATHELASLARTVGAKSWVGCVPGPV 1614


>dbj|GAU39667.1| hypothetical protein TSUD_60340 [Trifolium subterraneum]
          Length = 1063

 Score =  129 bits (323), Expect = 7e-30
 Identities = 84/288 (29%), Positives = 137/288 (47%), Gaps = 23/288 (7%)
 Frame = +2

Query: 8    LRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILEWL 187
            L +K + LD  CPLC    ET EHLF     + C V R   F S L +H P   ++ +W+
Sbjct: 773  LEQKGVALDPICPLCYDGEETQEHLF-----MHCQVIRRFWFLSPLGLHVPADVNLFKWM 827

Query: 188  LKWLTCRDTEGAQLFCIM---IWRAKNEAVFNNKKPDPIRIAE----------AAVEFIT 328
              WL+  +    QLF +    IW+ +N++VFN K P+ + + +           A   I+
Sbjct: 828  EHWLSNSNFMATQLFSLSLWTIWKMRNDSVFNKKYPNCMIVVQNVSILAEEFNLACNLIS 887

Query: 329  EYIAANSSRLHFSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSAC 508
              ++         ++ +  PI    +     +D G F +  TCWGL   N  G + ++A 
Sbjct: 888  NVVSEPIINSDVDVRWELPPIGFLKVN----IDAGCFKNNYTCWGLLDRNHKGIVQFAAT 943

Query: 509  KRDNI--------AVDIRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQD 664
            KR+ I        A+ ++W ++ + + N     +  DA+ V  C+ +R ++  IE I+ D
Sbjct: 944  KRERITCSPLLVEALSLKWCLRWIKDQNLQNVEVEMDAENVVNCLLRRINIVEIELIVVD 1003

Query: 665  CRSLLELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLG--PPPVN 802
            C  +L  +L  SV  ++   N  AH LVG+A   G+  W G  P PV+
Sbjct: 1004 CLYILLSLLNVSVLVVKSCKNKAAHGLVGVAMNLGSLLWFGNVPEPVS 1051


>gb|PNX85413.1| ribonuclease H [Trifolium pratense]
 gb|PNY00296.1| ribonuclease H [Trifolium pratense]
          Length = 207

 Score =  117 bits (293), Expect = 2e-28
 Identities = 70/182 (38%), Positives = 98/182 (53%), Gaps = 11/182 (6%)
 Frame = +2

Query: 287 DPIRIAEAAVEFITEYIAANSSRLHFSLQEQQAPIPANHLGTCL---YVDVGSFNDGTTC 457
           DP  +A  A+ F+ E+  AN SR   +L  Q    P+    T +   +VD G  N G T 
Sbjct: 2   DPTFLALDALSFVQEFNEANPSRNRRALVSQSISEPSRSTCTSMNSMFVDAGCCNSGHTV 61

Query: 458 WGLCVVNADGDIIYSACKRDNI--------AVDIRWGMQQVLEMNFVPDAIYSDAQVVTL 613
           WGL + N +G+ ++SACKR++I        A+ +RW +Q   +      +IYSDA  V  
Sbjct: 62  WGLVLRNLNGETVFSACKREDITAEPLLAEALGVRWALQVATDQGINSVSIYSDAANVVN 121

Query: 614 CISKRFHVASIEHIMQDCRSLLELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLGPP 793
           CI+KR + A+I  I QDCR+L+  +   SV  I R  N  AH LV LAK  G+  WLG  
Sbjct: 122 CINKRSNFAAINLIAQDCRNLMAGLGNVSVMFISRTQNCDAHNLVSLAKVVGSRTWLGVA 181

Query: 794 PV 799
           P+
Sbjct: 182 PL 183


>gb|PNX58626.1| ribonuclease H, partial [Trifolium pratense]
          Length = 217

 Score =  114 bits (286), Expect = 3e-27
 Identities = 69/195 (35%), Positives = 101/195 (51%), Gaps = 12/195 (6%)
 Frame = +2

Query: 254 KNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRLHFSLQEQQAPIPANHLGT----CLY 421
           +N AVFN  + DP R+A  A+ F+ ++  AN  R     +   A +P     T     L+
Sbjct: 2   RNAAVFNGVQLDPGRLAIDAMSFVHDFNEANPPRCR---RAPVAHVPIQPGMTNPIFSLF 58

Query: 422 VDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD--------IRWGMQQVLEMNFVP 577
           VD G  N G T WGL + N+DG+ + SACKR++  VD        +RW +Q V++     
Sbjct: 59  VDAGCSNSGHTVWGLVLRNSDGETVLSACKREDFYVDPLMAEALGVRWALQLVVDQGINS 118

Query: 578 DAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELILGCSVNHIRRDGNLVAHRLVGLA 757
            +IYSDA  V  CI++    A+I  I +DCR L+  +    V  + R  N  AH L  LA
Sbjct: 119 VSIYSDAANVVNCINRNSSFAAINLIAEDCRKLMNRLTNVCVLFVSRTQNSDAHNLASLA 178

Query: 758 KRFGNENWLGPPPVN 802
           +  GN  W+G  P++
Sbjct: 179 RIMGNRTWVGVVPLS 193


>gb|PNY04967.1| ribonuclease H [Trifolium pratense]
          Length = 207

 Score =  114 bits (285), Expect = 3e-27
 Identities = 67/190 (35%), Positives = 99/190 (52%), Gaps = 8/190 (4%)
 Frame = +2

Query: 242 IWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRLHFSLQEQQAPIPANHLGTCLY 421
           +W  +N+ VF  K P P  IA AA++ + E+  A   +     Q   +   A      + 
Sbjct: 1   MWFFRNQVVFQQKIPTPPDIAIAALDIVHEFNLAVPKKSKQRQQHAASEPAATLCSHLIQ 60

Query: 422 VDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD--------IRWGMQQVLEMNFVP 577
           VD G F DG T +G  + +  G I +SAC+++N+ VD        IRW +Q   + N   
Sbjct: 61  VDAGCFPDGYTTFGCVIKDCSGMISFSACRKENLLVDPLLAEALAIRWCLQVAKDQNLKE 120

Query: 578 DAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELILGCSVNHIRRDGNLVAHRLVGLA 757
             I SDA VV  CI     +A IE I+ DC+ L+      S+N++ RD N++AHRLVG A
Sbjct: 121 VIIQSDALVVVECIRGSNSIACIELIVTDCKLLMSTFSSVSINYVCRDLNVLAHRLVGYA 180

Query: 758 KRFGNENWLG 787
            + G ++WLG
Sbjct: 181 MQVGCKSWLG 190


>dbj|GAU21787.1| hypothetical protein TSUD_329120, partial [Trifolium subterraneum]
          Length = 734

 Score =  119 bits (299), Expect = 9e-27
 Identities = 81/273 (29%), Positives = 120/273 (43%), Gaps = 8/273 (2%)
 Frame = +2

Query: 2   SNLRKKCIQLDVACPLCNAAPETLEHLFSEHIFLECNVTRLALFASRLAIHPPLQTDILE 181
           +NL  K I+LD+ CPLC    E+     S+H+FL+C++ +L LFAS L            
Sbjct: 329 TNLHNKGIKLDLQCPLCFREEES-----SQHLFLKCDIFKLTLFASHLG----------- 372

Query: 182 WLLKWLTCRDTEGAQLFCIMIWRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRLH 361
                                   +N +VFN  K DP R+A     F+ ++  AN   + 
Sbjct: 373 ------------------------RNASVFNGIKLDPGRLALDVTSFVHDFNEANPPSM- 407

Query: 362 FSLQEQQAPIPANHLGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD--- 532
                                       G T WGL + N+DG+ I+SACKR+ I+VD   
Sbjct: 408 ---------------------------SGPTVWGLVLRNSDGETIFSACKREEISVDPLM 440

Query: 533 -----IRWGMQQVLEMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELILGC 697
                +RW +Q V++      +I+SDA  V  CI+++   A+I  I +DCR+L+  +   
Sbjct: 441 AEALGVRWALQLVVDQGINSVSIHSDAANVVNCINRKSSFAAINLIAEDCRNLMTCLANV 500

Query: 698 SVNHIRRDGNLVAHRLVGLAKRFGNENWLGPPP 796
               + R  N  AH L  LA+  GN  W G  P
Sbjct: 501 CDLFVSRTQNSDAHNLASLARIMGNRTWQGVAP 533


>dbj|GAU32098.1| hypothetical protein TSUD_292220 [Trifolium subterraneum]
          Length = 240

 Score =  112 bits (280), Expect = 3e-26
 Identities = 73/243 (30%), Positives = 115/243 (47%), Gaps = 11/243 (4%)
 Frame = +2

Query: 101 LECNVTRLALFASRLAIHPPLQTDILEWLLKWLTCRDTEGAQLFCIMIWRAKNEAVFNNK 280
           + CN+ +L LFAS L  HPPL  D+  WL                       N+ VF   
Sbjct: 1   MHCNLLKLVLFASPLGCHPPLNVDLNCWL-----------------------NQTVFKGT 37

Query: 281 KPDPIRIAEAAVEFITEYIAAN-SSRLHFSLQE--QQAPIPANHLGTCLYVDVGSFNDGT 451
             + + +A+ A+ F+ E+  AN  SR   +       +P+P+      ++VD    ++  
Sbjct: 38  PFEAVSLAQPALLFVQEFNDANIQSRPSQAATRVRNSSPVPSRQFS--MFVDASCLSNAQ 95

Query: 452 TCWGLCVVNADGDIIYSACKRDNI--------AVDIRWGMQQVLEMNFVPDAIYSDAQVV 607
             WG+   + +G +++SACKRDNI        A+ +RW +Q  +       +   DA  V
Sbjct: 96  IGWGIVFKDHNGAVLWSACKRDNIVVTPIIADALGLRWAIQTSISQGIQCLSFACDALEV 155

Query: 608 TLCISKRFHVASIEHIMQDCRSLLELILGCSVNHIRRDGNLVAHRLVGLAKRFGNENWLG 787
             CI+ +  VASI+ +++DC +LLE I    V H+ R  N  AH L  LA+  G+  W+G
Sbjct: 156 VNCINSKCVVASIDPVIKDCTNLLENIPYAMVYHVSRKLNREAHDLASLARYVGSRTWMG 215

Query: 788 PPP 796
             P
Sbjct: 216 NAP 218


>dbj|GAU37237.1| hypothetical protein TSUD_375390 [Trifolium subterraneum]
          Length = 246

 Score =  112 bits (279), Expect = 6e-26
 Identities = 69/199 (34%), Positives = 98/199 (49%), Gaps = 15/199 (7%)
 Frame = +2

Query: 245 WRAKNEAVFNNKKPDPIRIAEAAVEFITEYIAANSSRL------HFSLQEQQA-PIPANH 403
           W  +N  VFN  K DP R+A     F+ ++  AN  R       H S+Q     PI +  
Sbjct: 5   WNGRNATVFNGIKLDPGRLALDVTSFVHDFNEANPPRCRRAPVAHVSIQPSLVTPIFS-- 62

Query: 404 LGTCLYVDVGSFNDGTTCWGLCVVNADGDIIYSACKRDNIAVD--------IRWGMQQVL 559
               L+VD G    G   WGL + N+DG+ I S CKR+ I+VD        +RW +Q V+
Sbjct: 63  ----LFVDAGCSMSGPIVWGLVLRNSDGETILSVCKREEISVDPLMAETLGVRWALQLVI 118

Query: 560 EMNFVPDAIYSDAQVVTLCISKRFHVASIEHIMQDCRSLLELILGCSVNHIRRDGNLVAH 739
           +      +I+SDA  V  CI+++   A+I  I +DCR+L+  +    V  + R  N  AH
Sbjct: 119 DQGINSVSIHSDAANVVNCINRKSSFAAINLIAEDCRNLMTCLANVCVLFVSRTQNSDAH 178

Query: 740 RLVGLAKRFGNENWLGPPP 796
            L  LA+  GN  W G  P
Sbjct: 179 NLASLARIMGNRTWQGVSP 197


Top