BLASTX nr result

ID: Astragalus22_contig00036699 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00036699
         (825 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subte...   107   5e-41
dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subt...   110   1e-38
dbj|GAU37771.1| hypothetical protein TSUD_102880 [Trifolium subt...   100   3e-37
gb|PNX68200.1| pentatricopeptide repeat-containing protein, part...   102   3e-32
gb|KYP46236.1| Putative ribonuclease H protein At1g65750 family ...    87   5e-32
dbj|GAU48919.1| hypothetical protein TSUD_301740 [Trifolium subt...    88   4e-31
gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family ...    83   5e-31
dbj|GAU50352.1| hypothetical protein TSUD_288030 [Trifolium subt...    84   1e-29
gb|PNX73669.1| ribonuclease H [Trifolium pratense]                     79   7e-27
dbj|GAU44059.1| hypothetical protein TSUD_399580 [Trifolium subt...    76   2e-24
dbj|GAU45920.1| hypothetical protein TSUD_280610 [Trifolium subt...    74   4e-24
dbj|GAU42252.1| hypothetical protein TSUD_327300 [Trifolium subt...    76   7e-24
gb|KYP48093.1| Transposon TX1 uncharacterized [Cajanus cajan]          82   1e-23
gb|KYP48474.1| hypothetical protein KK1_029849 [Cajanus cajan]         87   3e-23
dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subt...    75   3e-23
gb|PNX93528.1| pentatricopeptide repeat-containing protein, part...    70   6e-23
dbj|GAU23639.1| hypothetical protein TSUD_386280 [Trifolium subt...    73   7e-23
gb|KYP76107.1| Putative ribonuclease H protein At1g65750 [Cajanu...    75   1e-22
dbj|GAU42401.1| hypothetical protein TSUD_324560 [Trifolium subt...    73   2e-22
dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subt...    74   4e-22

>dbj|GAU31768.1| hypothetical protein TSUD_22140 [Trifolium subterraneum]
          Length = 1601

 Score =  107 bits (266), Expect(3) = 5e-41
 Identities = 50/116 (43%), Positives = 69/116 (59%)
 Frame = +2

Query: 377  GILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILEA 556
            G LP R  L+ R VQC + CP C+   E++WHLF    +A   WR A  W++  SV+   
Sbjct: 1304 GCLPTRDRLQSRGVQCTDLCPHCETTYENDWHLFVSCNKAHEVWREANLWDEVCSVVETV 1363

Query: 557  DGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQVAFDLL 724
               KD  F  LA L E  +  F MM+ CLWK RN+KIW++  +PVR+ +Q+A D+L
Sbjct: 1364 SCIKDFIFAALAALAEPRRSEFVMMLWCLWKCRNDKIWEDKVQPVRVGMQLARDML 1419



 Score = 89.7 bits (221), Expect(3) = 5e-41
 Identities = 43/112 (38%), Positives = 63/112 (56%)
 Frame = +1

Query: 31   NYFILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLV 210
            N  + S MV G E      L   GT+SWN +LI +LFN+ D+  I  + +   +  +  +
Sbjct: 1189 NSHVTSAMVQGWEHTRIIDLIHQGTKSWNWELIGRLFNARDKEEISRLPIVGMEGKDKRI 1248

Query: 211  WNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWR 366
            W  +  G + VKS Y  AM+  I+  + +I GDW LIWK+ IP +VK+ LWR
Sbjct: 1249 WRYNNKGIYTVKSAYRFAMDTLINNEQYKIPGDWMLIWKLSIPQRVKIFLWR 1300



 Score = 21.2 bits (43), Expect(3) = 5e-41
 Identities = 5/9 (55%), Positives = 8/9 (88%)
 Frame = +2

Query: 2    WSDPWIKGE 28
            WS+PW++ E
Sbjct: 1179 WSEPWLRDE 1187


>dbj|GAU36460.1| hypothetical protein TSUD_166260 [Trifolium subterraneum]
          Length = 1012

 Score =  110 bits (274), Expect(2) = 1e-38
 Identities = 50/118 (42%), Positives = 71/118 (60%)
 Frame = +2

Query: 374  RGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILE 553
            RG LP R  L+R+ VQC + CP C+   E+EWH+F G ++A+  W  AG W+D   +++ 
Sbjct: 795  RGCLPTRDKLQRKGVQCTDLCPHCETTYENEWHVFLGCEKAKRIWIEAGLWDDIAQLVVA 854

Query: 554  ADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQVAFDLLV 727
            A+ F    F  +    E     F M++ CLWKRRNEKIW+ + KPV LS+  A + LV
Sbjct: 855  ANSFNSLVFSFMTVNLEQKCSDFVMIMWCLWKRRNEKIWEGVEKPVHLSINTAREYLV 912



 Score = 79.3 bits (194), Expect(2) = 1e-38
 Identities = 38/116 (32%), Positives = 58/116 (50%)
 Frame = +1

Query: 19   KRGTNYFILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMN 198
            ++  N +I S ++ G E L  + L D  T +W   LI  +FN  D   I+ M+       
Sbjct: 677  RKSENSYITSPLLQGGEHLKVADLMDAATCTWKWDLINAIFNDRDIREIKKMAAISGGET 736

Query: 199  NTLVWNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWR 366
            +  VW  +  G + VKS Y  +M   ID    ++ GDW  IW ++IP +VK  +WR
Sbjct: 737  DHKVWKFNNKGSYTVKSAYRYSMETLIDNEGYKLPGDWMQIWNLKIPQRVKKFMWR 792


>dbj|GAU37771.1| hypothetical protein TSUD_102880 [Trifolium subterraneum]
          Length = 1688

 Score = 99.8 bits (247), Expect(2) = 3e-37
 Identities = 46/117 (39%), Positives = 67/117 (57%)
 Frame = +2

Query: 374  RGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILE 553
            RG LP RY L+++ V CP  C  C    E++WH+FFG  +A+  W  AG W+  + +   
Sbjct: 1437 RGCLPTRYRLQQKGVNCPHTCAYCQNNFENDWHVFFGCVKAQEIWEEAGLWSFIEGMFES 1496

Query: 554  ADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQVAFDLL 724
             +GF   FF LL  L +     F     C+WKRRN+KIW++I     +SLQ+A D++
Sbjct: 1497 TEGFVSLFFSLLELLSQHKIILFVAAFWCIWKRRNQKIWEDIELHPSVSLQLASDII 1553



 Score = 84.7 bits (208), Expect(2) = 3e-37
 Identities = 39/115 (33%), Positives = 59/115 (51%)
 Frame = +1

Query: 31   NYFILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLV 210
            N ++ + M+ G E +    L + G   W   LI   FN  D   I    L      +   
Sbjct: 1323 NSYVTTPMILGREDMCVHDLIEEGGREWRRSLIMGSFNERDARCILSTPLFGDVQEDVPS 1382

Query: 211  WNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWRSTK 375
            W  SR+G + VKS YY  M   +D + LR+ G+WG IW+++IP K+K+ LWR+ +
Sbjct: 1383 WKHSRNGEYSVKSAYYYTMENLVDNTGLRVEGNWGKIWELKIPQKMKVFLWRAAR 1437


>gb|PNX68200.1| pentatricopeptide repeat-containing protein, partial [Trifolium
           pratense]
          Length = 220

 Score =  102 bits (253), Expect(2) = 3e-32
 Identities = 48/117 (41%), Positives = 68/117 (58%)
 Frame = +2

Query: 374 RGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILE 553
           RG LP RY L+R+ V CP  C  C    E++WH+FFG  +A+  W  AG W+  + +   
Sbjct: 62  RGCLPTRYRLQRKGVNCPHTCAYCQNNFENDWHVFFGCVKAQEIWEEAGLWSLIEGMFES 121

Query: 554 ADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQVAFDLL 724
           A+GF   FF LL  L +     F     C+WKRRN+KIW++I     +SLQ+A D++
Sbjct: 122 AEGFVSLFFSLLELLSQHNIILFVAAFWCIWKRRNQKIWEDIELRPSVSLQLATDII 178



 Score = 65.9 bits (159), Expect(2) = 3e-32
 Identities = 25/55 (45%), Positives = 37/55 (67%)
 Frame = +1

Query: 211 WNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWRSTK 375
           W  SR+G + VKS YY  M   +D + LR+ G+WG IW ++IP K+K+ LWR+ +
Sbjct: 8   WKHSRNGEYSVKSAYYYTMENLVDNTGLRVEGNWGKIWGLKIPQKMKVFLWRAAR 62


>gb|KYP46236.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 507

 Score = 86.7 bits (213), Expect(2) = 5e-32
 Identities = 37/107 (34%), Positives = 58/107 (54%)
 Frame = +2

Query: 368 LQRGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVI 547
           L R  LP R  L+++ V C   CP C+ A E+ WH+FFG Q+A+  W+  G W   +S+I
Sbjct: 303 LLRDCLPSRQRLQQKGVPCTSLCPHCEAAQENNWHIFFGCQEAQTVWQATGIWQHIKSLI 362

Query: 548 LEADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKP 688
              +G  +  F LL ++ +       + + C+W+RRN K+W     P
Sbjct: 363 DVGEGIVEVIFSLLGSISQSHIVEVVVTLSCIWRRRNAKVWDQGAPP 409



 Score = 80.5 bits (197), Expect(2) = 5e-32
 Identities = 38/110 (34%), Positives = 58/110 (52%)
 Frame = +1

Query: 37  FILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLVWN 216
           ++ S    G E +  + L D    +W   LI ++FN  D + I+ + L   D  +TL WN
Sbjct: 193 YVTSLPSAGYEEIRIADLIDFENGTWKFDLINRIFNQRDVDAIKDIPLLQLDEADTLTWN 252

Query: 217 RSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWR 366
            +R G + VKS YY+ M   +  S  R+ GDW  +W + IP  +K+ LWR
Sbjct: 253 LNRKGSYSVKSAYYLIMESLLCNSISRLPGDWKKLWALPIPHNMKIFLWR 302


>dbj|GAU48919.1| hypothetical protein TSUD_301740 [Trifolium subterraneum]
          Length = 640

 Score = 88.2 bits (217), Expect(2) = 4e-31
 Identities = 39/119 (32%), Positives = 66/119 (55%)
 Frame = +1

Query: 19  KRGTNYFILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMN 198
           ++  + ++ +  V G E L  + L +     W+  L+ QLFN  D   I  + L+  +  
Sbjct: 204 RKANHAYVTTNTVAGHEQLKVAGLINHNEGKWDVNLVQQLFNQNDTASIFQIPLQLTNEE 263

Query: 199 NTLVWNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWRSTK 375
           +  +W  SR+G++ V+S YY  M   ID + LR  G+W  +WK+ +P+KVK+ LWRS +
Sbjct: 264 DVPIWRFSRNGKYSVRSAYYQLMEVIIDNNHLREEGNWTKLWKLNVPNKVKIFLWRSLR 322



 Score = 75.9 bits (185), Expect(2) = 4e-31
 Identities = 36/101 (35%), Positives = 53/101 (52%)
 Frame = +2

Query: 374 RGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILE 553
           RG LP +  L  + VQC  +C  CD++ E+EWH FFG + A+  W  +  W      I  
Sbjct: 322 RGCLPVKERLIPKGVQCDSKCICCDVSGENEWHCFFGCKAAQEVWIESEFWESLHQKIEA 381

Query: 554 ADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKN 676
           A GFK   F L+ ++   +    AM++  LW RRN+  W +
Sbjct: 382 AVGFKQLVFSLIESMDSKSMAQVAMLLWTLWWRRNQLCWND 422


>gb|KYP45089.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 406

 Score = 83.2 bits (204), Expect(2) = 5e-31
 Identities = 38/115 (33%), Positives = 60/115 (52%)
 Frame = +2

Query: 368 LQRGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVI 547
           + R  LP R NL++R +     C  C +  E+EWH+FFG Q AE  W T G W    + I
Sbjct: 202 IARRCLPSRMNLQQRGIPRTSLCAHCSLNQENEWHIFFGCQTAESIWMTFGLWPSTNAYI 261

Query: 548 LEADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQVA 712
              + FKD  F L++ L         +++  +W+ RN+K+W + T P  +++  A
Sbjct: 262 DNGEDFKDTIFSLISNLHHDIACKVIIILWSIWRNRNDKVWSDTTTPPGIAVHKA 316



 Score = 80.5 bits (197), Expect(2) = 5e-31
 Identities = 38/117 (32%), Positives = 61/117 (52%)
 Frame = +1

Query: 28  TNYFILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTL 207
           TN ++ + ++ G E L  + L D     WN  L++ LF +ED   I  + L     ++T 
Sbjct: 89  TNSYVTTPILEGHENLIVAELIDMNEGKWNQDLLSTLFGTEDVRDICSIPLLNLHEHDTP 148

Query: 208 VWNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWRSTKR 378
            W  SR G + VKS YY  M   I  + L + G+W  IW +++ + +K+ LWR  +R
Sbjct: 149 SWKLSRKGSYYVKSAYYYVMESLISNTHLHVPGNWKQIWSLKVLNTMKIFLWRIARR 205


>dbj|GAU50352.1| hypothetical protein TSUD_288030 [Trifolium subterraneum]
          Length = 452

 Score = 84.0 bits (206), Expect(3) = 1e-29
 Identities = 41/119 (34%), Positives = 63/119 (52%)
 Frame = +2

Query: 368 LQRGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVI 547
           L RG LP R+NL RR VQC   C LC+ A EDE H F     A + W+    W   +   
Sbjct: 214 LLRGCLPTRFNLHRRGVQCQTICALCNNATEDELHPFTDCAHAILCWKEVNLWQSLEPQF 273

Query: 548 LEADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQVAFDLL 724
           L++  F    F ++++++E  +  F  ++  +W+ RNE IW+N       S ++ FDL+
Sbjct: 274 LQSGSFSSIIFSIISSMEETKQSVFVAVLWSIWRARNECIWENKQANPVASCRLDFDLI 332



 Score = 74.7 bits (182), Expect(3) = 1e-29
 Identities = 34/98 (34%), Positives = 56/98 (57%)
 Frame = +1

Query: 73  LNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLVWNRSRDGRFLVKST 252
           L+ S LFD+   +WN  L+  +FN++D   I  + L  R   +++VW  S DG + VK+ 
Sbjct: 116 LHVSHLFDSALNTWNYTLLNTVFNTQDVADICKILLHARAPQDSVVWKASPDGNYSVKTA 175

Query: 253 YYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWR 366
           Y + +N  +  S + + G+W  IW M++P K+K   WR
Sbjct: 176 YRLCLNLVLHDSSICVNGEWRKIWDMRLPPKLKHFCWR 213



 Score = 20.8 bits (42), Expect(3) = 1e-29
 Identities = 5/7 (71%), Positives = 7/7 (100%)
 Frame = +2

Query: 2   WSDPWIK 22
           W+DPWI+
Sbjct: 92  WTDPWIR 98


>gb|PNX73669.1| ribonuclease H [Trifolium pratense]
          Length = 275

 Score = 79.3 bits (194), Expect(2) = 7e-27
 Identities = 39/113 (34%), Positives = 58/113 (51%)
 Frame = +1

Query: 31  NYFILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLV 210
           N  + S  V   E +  S L       WN  LI QLFN  D   I  + + C    +  +
Sbjct: 14  NPCVTSMTVADYEGMRVSELMQPNERKWNEALIHQLFNLRDAAEILKIPISCMRDEDVPI 73

Query: 211 WNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWRS 369
           W  S++G + V+S YY  M   ID + LR  GD   IWK+++P++VK+ +WR+
Sbjct: 74  WRFSKNGIYSVRSAYYQLMEAIIDNTHLRFEGDSMKIWKLKVPNRVKIFIWRT 126



 Score = 70.5 bits (171), Expect(2) = 7e-27
 Identities = 36/98 (36%), Positives = 47/98 (47%)
 Frame = +2

Query: 377 GILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILEA 556
           G LP R  L ++ VQC   CP C  A E+E H F G   A+  WR  G W   +  +  A
Sbjct: 129 GCLPVR--LLQKGVQCEPNCPCCASATENECHCFIGCDVAQEVWREMGDWETMEQYVWNA 186

Query: 557 DGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIW 670
            G+ + FF LL  L         M +  +W RRN+K W
Sbjct: 187 QGYVELFFTLLQDLDSERMARNVMTLWMIWWRRNQKCW 224


>dbj|GAU44059.1| hypothetical protein TSUD_399580 [Trifolium subterraneum]
          Length = 1229

 Score = 76.3 bits (186), Expect(2) = 2e-24
 Identities = 39/109 (35%), Positives = 59/109 (54%)
 Frame = +1

Query: 73   LNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLVWNRSRDGRFLVKST 252
            L  S LFD  T SWN  LI  +FN +D   I  + L  R ++++++W  S +G + VKS 
Sbjct: 834  LCVSDLFDPITNSWNHTLIASIFNGQDTADICRIPLHSRALHDSIIWKSSPNGNYTVKSA 893

Query: 253  YYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWRSTKRHFTLPVQ 399
            Y + + +        ++GDW  IW MQIP K+K   WR  +  + LP +
Sbjct: 894  YKLCL-QLTSHDSFNVSGDWRKIWTMQIPPKLKHFCWRMLR--YCLPTR 939



 Score = 65.1 bits (157), Expect(2) = 2e-24
 Identities = 33/117 (28%), Positives = 54/117 (46%)
 Frame = +2

Query: 374  RGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILE 553
            R  LP R  L  R V C   C +C  A EDE HLFF    A   W+    W   +  + +
Sbjct: 933  RYCLPTRLKLHIRGVNCQTTCAVCSNATEDELHLFFDCPHAISCWKELNLWQRLEQKMHQ 992

Query: 554  ADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQVAFDLL 724
            +  F    F +LA L    +  F  ++  +W+ RN+ +W++       + ++A D++
Sbjct: 993  SGSFSSIIFAILADLDADTQARFVAILWSIWRTRNDCLWEHKQPSTVTTCRLATDIV 1049


>dbj|GAU45920.1| hypothetical protein TSUD_280610 [Trifolium subterraneum]
          Length = 1487

 Score = 73.9 bits (180), Expect(2) = 4e-24
 Identities = 40/118 (33%), Positives = 57/118 (48%), Gaps = 1/118 (0%)
 Frame = +2

Query: 368  LQRGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVI 547
            L RG LP R  L  RR +C   CP+CD  +EDE H+FF    A   W  AG  +   +  
Sbjct: 1302 LCRGCLPTRSRLLERREECTLNCPVCDEEIEDELHIFFRCAVARDSWSAAGLSSVLHNAT 1361

Query: 548  LEADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWK-NITKPVRLSLQVAFD 718
             +     D  F + +          AM++ C+W  RNEK+W  N+  P ++  + AFD
Sbjct: 1362 YQQTNAMDRIFAVCSNESSDTVGRVAMLLWCIWHNRNEKLWNDNVQMPHQIG-RYAFD 1418



 Score = 66.6 bits (161), Expect(2) = 4e-24
 Identities = 33/112 (29%), Positives = 54/112 (48%)
 Frame = +1

Query: 31   NYFILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLV 210
            N ++ S    G   L+   L     ++WN   +  LF+ +   +I  + L      + +V
Sbjct: 1190 NRWVPSPQPAGVYQLSVRDLLHENYKAWNIVKVRNLFSKDVAEKILEIPLVSSVREDKVV 1249

Query: 211  WNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWR 366
            W   R+G + VKS Y +AM   I + +  + G+W  IWK Q+P K +  LWR
Sbjct: 1250 WEEERNGCYSVKSGYQLAMRYIIGSDKYHVGGNWNGIWKAQVPHKARHLLWR 1301


>dbj|GAU42252.1| hypothetical protein TSUD_327300 [Trifolium subterraneum]
          Length = 347

 Score = 75.9 bits (185), Expect(2) = 7e-24
 Identities = 40/118 (33%), Positives = 60/118 (50%), Gaps = 1/118 (0%)
 Frame = +2

Query: 368 LQRGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVI 547
           L RG LP RY L  RRV+C   CP+CD  ++DE H+F     A   W  AG  +   S  
Sbjct: 123 LCRGCLPTRYRLLERRVECNLNCPVCDEDIDDELHIFVTCAVARDSWCAAGLSSVLHSAA 182

Query: 548 LEADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWK-NITKPVRLSLQVAFD 718
            +    +D  F + +  +       AM++ C+W  RN+K+W  N+  P ++  + AFD
Sbjct: 183 YQQSNARDRIFVVCSNERSDTVGRVAMLLWCIWHNRNDKLWNYNVQMPRQIG-RYAFD 239



 Score = 63.9 bits (154), Expect(2) = 7e-24
 Identities = 33/110 (30%), Positives = 53/110 (48%)
 Frame = +1

Query: 37  FILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLVWN 216
           ++ S    G   L+  +L +   ++WN   +  LF+ +   RI    L      + +VW 
Sbjct: 13  WVSSPQPAGVYQLSVRNLLNETYKAWNISKVRNLFSGDVAKRILETLLVSSVCEDKVVWE 72

Query: 217 RSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWR 366
             R+G + VKS Y + M   I + +  +AG+W  IWK Q P K +  LWR
Sbjct: 73  EERNGCYSVKSGYKLDMRYIIGSDKYHVAGNWNGIWKAQAPHKARHLLWR 122


>gb|KYP48093.1| Transposon TX1 uncharacterized [Cajanus cajan]
          Length = 748

 Score = 81.6 bits (200), Expect(2) = 1e-23
 Identities = 37/113 (32%), Positives = 59/113 (52%)
 Frame = +1

Query: 28  TNYFILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTL 207
           TN ++ + +  G E L  + L D     WN  L++ LF +ED   I  + L     ++T 
Sbjct: 578 TNSYVATPIPEGHENLTVAELIDMNERKWNQDLLSTLFGTEDIRDICSIPLLNLHEHDTP 637

Query: 208 VWNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWR 366
            W  SR G + VKS YY  M   I  + L + G+W  +W +++P+ +K+ LWR
Sbjct: 638 SWKLSRKGSYSVKSAYYYVMESLISNAHLHVPGNWKQLWSLKVPNTMKIFLWR 690



 Score = 57.4 bits (137), Expect(2) = 1e-23
 Identities = 23/53 (43%), Positives = 32/53 (60%)
 Frame = +2

Query: 368 LQRGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPW 526
           + RG LP R NL++R + C   C  C +  E+EWH+FFG Q  +  W T+G W
Sbjct: 691 IARGCLPSRMNLQQRGIPCTSLCAHCSLNQENEWHIFFGCQTTKSYWMTSGLW 743


>gb|KYP48474.1| hypothetical protein KK1_029849 [Cajanus cajan]
          Length = 547

 Score = 86.7 bits (213), Expect(2) = 3e-23
 Identities = 41/114 (35%), Positives = 60/114 (52%)
 Frame = +2

Query: 368 LQRGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVI 547
           L RG +P   NL+++ V C   CP C    E+EWHLF+    A   W  +G W     ++
Sbjct: 374 LLRGCIPTCLNLQQKGVSCTSSCPHCSANQENEWHLFYSCPAAISIWIDSGCWPRIARIV 433

Query: 548 LEADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQV 709
            +   F D  +KLL  L      SF +M+ C+W+ RN+K+WK    P R S+Q+
Sbjct: 434 EQGISFIDTTWKLLGHLTGSDLTSFTLMLWCIWRWRNDKVWKESAPPPRTSIQL 487



 Score = 50.8 bits (120), Expect(2) = 3e-23
 Identities = 19/56 (33%), Positives = 33/56 (58%)
 Frame = +1

Query: 199 NTLVWNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWR 366
           ++L W  + +G F V++ Y+  M   I  + LR+ GDW  +W ++IP   ++ LWR
Sbjct: 318 DSLTWRLTTNGSFSVRTAYHHLMEHVISNNTLRVQGDWMKLWSLKIPHSTQIFLWR 373


>dbj|GAU17471.1| hypothetical protein TSUD_340140 [Trifolium subterraneum]
          Length = 479

 Score = 75.5 bits (184), Expect(2) = 3e-23
 Identities = 34/104 (32%), Positives = 58/104 (55%)
 Frame = +2

Query: 374 RGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILE 553
           R +LP R  L  R VQC   C +C+ + ED  H+ F   ++   W+ AG WN   + +  
Sbjct: 179 RNVLPTRATLNSRSVQCLVHCAVCNDSAEDSIHILFLCPRSTECWQQAGLWNQIDAGLNT 238

Query: 554 ADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITK 685
           ++   D    +L +L +  ++ F++++  +WKRRN K+W NIT+
Sbjct: 239 SNNIADILLFILQSLNKEQQEIFSVLLWSIWKRRNAKVWDNITE 282



 Score = 62.0 bits (149), Expect(2) = 3e-23
 Identities = 32/87 (36%), Positives = 45/87 (51%)
 Frame = +1

Query: 106 ESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLVWNRSRDGRFLVKSTYYVAMNEFIDT 285
           +SW+   I+ +F+S    RI    L      + LVWN   DG + V+S Y   +N   + 
Sbjct: 90  KSWDIDKISSMFDSTTVRRIINTPLFASVRTDKLVWNLEHDGVYSVRSAYNYYVNNVGNQ 149

Query: 286 SELRIAGDWGLIWKMQIPSKVKLQLWR 366
               IAG+W  IW+ +IP KVK  LWR
Sbjct: 150 DNSGIAGNWHQIWRAKIPPKVKNLLWR 176


>gb|PNX93528.1| pentatricopeptide repeat-containing protein, partial [Trifolium
           pratense]
          Length = 231

 Score = 70.5 bits (171), Expect(2) = 6e-23
 Identities = 36/117 (30%), Positives = 56/117 (47%)
 Frame = +2

Query: 374 RGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILE 553
           RG  P R   R + + CP  C +C+   ED  H+F     A   WR +G WN  ++ +  
Sbjct: 95  RGCFPTRVRHRDKGIDCPSNCVVCNDNFEDTSHVFCLCPFAASIWRDSGLWNHVEAAVNS 154

Query: 554 ADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQVAFDLL 724
           ++   +  F LL  L+E      A+++  +WK RN K+W  +T+     L  A  LL
Sbjct: 155 SNTVAETIFMLLQNLEEQNSARLAVIMWSIWKHRNMKLWNRVTETKEQVLNRADHLL 211



 Score = 66.2 bits (160), Expect(2) = 6e-23
 Identities = 27/65 (41%), Positives = 44/65 (67%)
 Frame = +1

Query: 205 LVWNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWRSTKRHF 384
           L+W   ++G++LVKSTY + + E IDT+ LRI+  W  +W++++P KVK  +WRS +  F
Sbjct: 39  LIWKAEKNGQYLVKSTYRLCVEELIDTNHLRISSFWAGVWRLKVPPKVKNLIWRSCRGCF 98

Query: 385 TLPVQ 399
              V+
Sbjct: 99  PTRVR 103


>dbj|GAU23639.1| hypothetical protein TSUD_386280 [Trifolium subterraneum]
          Length = 399

 Score = 72.8 bits (177), Expect(2) = 7e-23
 Identities = 40/117 (34%), Positives = 56/117 (47%)
 Frame = +2

Query: 368 LQRGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVI 547
           L RG LP R  L  RRV+C   CP+CDI  EDE H+FF    A   W  AG  +   +V 
Sbjct: 201 LCRGCLPTRCRLLERRVECNLNCPVCDIETEDELHIFFRCAVARDSWCAAGLSSVLHNVA 260

Query: 548 LEADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQVAFD 718
            +     D  F + +          AM++  +W  RN+K+W +  +  R   + AFD
Sbjct: 261 YQQSNDMDRNFAVCSNESSDTVGRVAMLLWSIWHNRNDKLWNDNVQTPRHIGRYAFD 317



 Score = 63.5 bits (153), Expect(2) = 7e-23
 Identities = 34/110 (30%), Positives = 51/110 (46%)
 Frame = +1

Query: 37  FILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLVWN 216
           ++ S    G   L+   L     + WN   +  LF+ +   RI    L      + +VW 
Sbjct: 91  WVPSPQPAGVYQLSVRDLLHENYKVWNIAKVQNLFSRDAAKRILETPLVSSVREDKVVWE 150

Query: 217 RSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWR 366
             R+G + VKS Y +AM   I + +  +AG+W  IWK Q P K +  LWR
Sbjct: 151 EERNGCYSVKSGYKLAMRYMIGSDKHHVAGNWNGIWKAQAPHKARHLLWR 200


>gb|KYP76107.1| Putative ribonuclease H protein At1g65750 [Cajanus cajan]
          Length = 682

 Score = 75.5 bits (184), Expect(2) = 1e-22
 Identities = 34/102 (33%), Positives = 52/102 (50%)
 Frame = +2

Query: 374 RGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILE 553
           + ILP R  L+++ V CP  CP CD+ +E  WH  FG   A + W  +   +   S I +
Sbjct: 382 KNILPTRAQLQKKGVSCPISCPRCDVGIESSWHALFGCYDARICWLASNLKDKMSSNIDQ 441

Query: 554 ADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNI 679
           A+G  D   K+L    +     F MM+  +W  RN  +WK++
Sbjct: 442 ANGTSDLVNKILNLWSKSDAADFCMMMWSIWTSRNNLLWKDM 483



 Score = 60.1 bits (144), Expect(2) = 1e-22
 Identities = 30/101 (29%), Positives = 47/101 (46%)
 Frame = +1

Query: 73  LNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLVWNRSRDGRFLVKST 252
           L+ S L  + T  WN  +I  +F+      I    +     +++ VW  S DG F V+S 
Sbjct: 282 LHVSDLISSDTLEWNHNIIHHIFDEATTKDILATPVYSHCTDDSYVWKWSNDGCFTVRSA 341

Query: 253 YYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWRSTK 375
           Y    +  + T+ +     W +IW + IP+KVK   WR  K
Sbjct: 342 YKAITSSLLPTNTMDPDKTWNIIWSLPIPAKVKHHTWRVYK 382


>dbj|GAU42401.1| hypothetical protein TSUD_324560 [Trifolium subterraneum]
          Length = 958

 Score = 72.8 bits (177), Expect(2) = 2e-22
 Identities = 42/122 (34%), Positives = 58/122 (47%), Gaps = 5/122 (4%)
 Frame = +2

Query: 368  LQRGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVI 547
            L RG  P RY L  RRV+C   CP+CD  +EDE H+FF    A   W  AG      S +
Sbjct: 760  LCRGCFPTRYRLLERRVECNLNCPVCDEEIEDEIHIFFRCAVARDSWCAAG-----LSTV 814

Query: 548  LEADGFK-----DGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNITKPVRLSLQVA 712
            L  D ++     D  F +            AM++ C+ K RN+K+W +  +  R   + A
Sbjct: 815  LHNDAYQQSNAMDRIFAMCGNESSAIVGRVAMLLWCISKNRNDKLWNDNVQTPRQIGRYA 874

Query: 713  FD 718
            FD
Sbjct: 875  FD 876



 Score = 62.4 bits (150), Expect(2) = 2e-22
 Identities = 36/116 (31%), Positives = 55/116 (47%), Gaps = 1/116 (0%)
 Frame = +1

Query: 22  RGT-NYFILSTMVPGTETLNASSLFDTGTESWNTQLITQLFNSEDQNRIRGMSLRCRDMN 198
           RGT N ++ S        L+  +L     ++WN   +  LF+ +   RI    L      
Sbjct: 644 RGTVNKWMPSPQPAEVYQLSVRNLLHENYKAWNIAKVRNLFSGDVAERILETPLVNSVRK 703

Query: 199 NTLVWNRSRDGRFLVKSTYYVAMNEFIDTSELRIAGDWGLIWKMQIPSKVKLQLWR 366
           + +VW   R+G + VKS Y +AM   I + +  + G+W  IWK Q P K +  LWR
Sbjct: 704 DKVVWEEERNGCYSVKSGYKLAMRYIIGSDKYHVPGNWNGIWKAQAPHKARHLLWR 759


>dbj|GAU48398.1| hypothetical protein TSUD_405430 [Trifolium subterraneum]
          Length = 395

 Score = 73.6 bits (179), Expect(2) = 4e-22
 Identities = 34/91 (37%), Positives = 53/91 (58%)
 Frame = +1

Query: 106 ESWNTQLITQLFNSEDQNRIRGMSLRCRDMNNTLVWNRSRDGRFLVKSTYYVAMNEFIDT 285
           + W+T LI+ +F+     RI    L      +  +W+  R+G + V+S Y + + E IDT
Sbjct: 6   KEWDTHLISLIFDPIKAARILNTPLYPSVTEDRRLWSGERNGDYSVRSAYRLCVQELIDT 65

Query: 286 SELRIAGDWGLIWKMQIPSKVKLQLWRSTKR 378
           S LR+ GDW L+WK++ P KVK  +WR  +R
Sbjct: 66  SHLRVNGDWNLLWKIKAPPKVKNLIWRICRR 96



 Score = 60.5 bits (145), Expect(2) = 4e-22
 Identities = 32/103 (31%), Positives = 50/103 (48%)
 Frame = +2

Query: 374 RGILPCRYNLRRRRVQCPEECPLCDIAVEDEWHLFFGYQQAEVGWRTAGPWNDAQSVILE 553
           R  +  R  L+ + V CP  C LC+I  ED  H+FF    ++  W     +    SVI  
Sbjct: 95  RRCVSTRARLQDKGVNCPNLCALCNIEGEDSLHVFFKCPSSQNVWSMTSFFQVVSSVINN 154

Query: 554 ADGFKDGFFKLLATLQEGAKKSFAMMVLCLWKRRNEKIWKNIT 682
            +      F++L  L +     FA ++  +WK+RN +IW N+T
Sbjct: 155 ENEASAIVFQILRQLSKEDAALFACILWSIWKQRNNQIWNNVT 197