BLASTX nr result

ID: Astragalus23_contig00022363 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00022363
         (717 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU41450.1| hypothetical protein TSUD_98460 [Trifolium subte...   130   7e-38
gb|PNY17781.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   135   1e-32
gb|KYP68937.1| Retrotransposon-derived protein PEG10 [Cajanus ca...   129   3e-31
gb|PNY07310.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   130   7e-31
ref|XP_017423676.1| PREDICTED: uncharacterized protein LOC108332...   107   5e-29
ref|XP_014511429.1| uncharacterized protein LOC106770116 [Vigna ...   120   1e-27
gb|PNX79664.1| hypothetical protein L195_g035651 [Trifolium prat...   117   1e-26
ref|XP_017426291.1| PREDICTED: uncharacterized protein LOC108334...   116   6e-26
gb|KYP76287.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]    87   1e-25
ref|XP_006574291.1| PREDICTED: uncharacterized protein LOC102661...    79   2e-24
gb|PNX92353.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   112   2e-24
gb|KYP42639.1| Retrovirus-related Pol polyprotein from transposo...   111   2e-24
gb|KYP61806.1| hypothetical protein KK1_016317 [Cajanus cajan]         96   2e-22
dbj|GAU26773.1| hypothetical protein TSUD_317710 [Trifolium subt...   105   2e-22
gb|PNX92970.1| Ty3/gypsy retrotransposon protein, partial [Trifo...   105   5e-22
gb|KHN07600.1| Retrovirus-related Pol polyprotein from transposo...   103   1e-21
gb|PNX98954.1| retrotransposon-related protein, partial [Trifoli...   103   1e-21
dbj|GAU45274.1| hypothetical protein TSUD_99960 [Trifolium subte...   102   3e-21
gb|PNX92469.1| hypothetical protein L195_g015607 [Trifolium prat...   102   3e-21
dbj|GAU40605.1| hypothetical protein TSUD_28110 [Trifolium subte...   102   4e-21

>dbj|GAU41450.1| hypothetical protein TSUD_98460 [Trifolium subterraneum]
          Length = 1385

 Score =  130 bits (326), Expect(3) = 7e-38
 Identities = 83/191 (43%), Positives = 100/191 (52%), Gaps = 19/191 (9%)
 Frame = +2

Query: 200 RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTES----STSKNA 367
           R EVLAQQP  LSQAAGLARL E+K+ DLLRL R K P++P  T S T++    +T  N+
Sbjct: 199 RREVLAQQPVDLSQAAGLARLHEEKIQDLLRLARPKQPFTPWNTSSSTKTFAAPTTKPNS 258

Query: 368 ----TSXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXX 535
                S             RFRQL AAE+ADRREKGLCFN DERFS++HRCKAR      
Sbjct: 259 EITKNSPPPPLLPTPQPKTRFRQLYAAELADRREKGLCFNCDERFSRNHRCKARFLLLIA 318

Query: 536 XXXXXXXXXXXXXXXVWPIV-----------DPEAEPTQFALHTMTGAHTAHTFRVQGHI 682
                              +             EA+  Q + H M+G  TA T +V G I
Sbjct: 319 VDNDEEEKGGPEAEIGESEIPTDSLLALLGTQEEAQLAQLSYHAMSGIQTAQTIKVLGKI 378

Query: 683 DDEPVHILVDG 715
               VH+LVDG
Sbjct: 379 AQHSVHVLVDG 389



 Score = 39.3 bits (90), Expect(3) = 7e-38
 Identities = 15/19 (78%), Positives = 16/19 (84%)
 Frame = +3

Query: 81  YQWMHSNGQITSWTQFLTA 137
           YQW HSNG+I SWTQFL A
Sbjct: 120 YQWKHSNGEIVSWTQFLRA 138



 Score = 37.4 bits (85), Expect(3) = 7e-38
 Identities = 18/23 (78%), Positives = 20/23 (86%)
 Frame = +1

Query: 136 HRIVGLSTHNLLSCFVSGLKPEI 204
           +RIVGLS  +LLSCFVSGLK EI
Sbjct: 176 NRIVGLSPQDLLSCFVSGLKVEI 198


>gb|PNY17781.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1478

 Score =  135 bits (340), Expect = 1e-32
 Identities = 103/296 (34%), Positives = 127/296 (42%), Gaps = 64/296 (21%)
 Frame = +2

Query: 20  VFKISQFFAYHRTPETERITVSV------------------DAQQWPN------------ 109
           +FKISQFF YH TPE ERITV+                       WP             
Sbjct: 56  IFKISQFFTYHNTPEEERITVASFYLDGPALAWYQWMYRNGQIVSWPQVLQALELRFAPT 115

Query: 110 ------------HLLDTVSHRIASWGSLRTIY*VALSRV-----------SNRRFEVLAQ 220
                       H   TV+  ++ + SL     V LS             S  R EVLAQ
Sbjct: 116 AYDDPRGKLFKLHQTTTVASYLSDFESLANRI-VGLSPPDLLSCFISGLRSEIRREVLAQ 174

Query: 221 QPSSLSQAAGLARLQEDKVNDLLRLTRQKP--PWSPAQTPSRTESSTSKNATSXXXXXXX 394
           QP+SL+QAA LARLQE+K+ DLLRL + +   PWS     + + SS   N          
Sbjct: 175 QPTSLTQAAALARLQEEKIQDLLRLAKPRTTAPWS-----NPSSSSPRSNPAPTTASLLP 229

Query: 395 XXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXXXXX 574
                 R+RQLS  EM +RREKGLCFN DERFS++HRCKAR                   
Sbjct: 230 TPANHPRYRQLSPTEMNERREKGLCFNCDERFSRTHRCKARFLLFIADEDEELAGLDPGE 289

Query: 575 XXVWPIVDP---------EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715
               P  DP         E    Q + H ++G  +A T RV G +    V +LVDG
Sbjct: 290 TDPPPAADPLPDVSGLVEEFHSAQLSYHALSGVQSAQTIRVPGRVGAHSVRVLVDG 345


>gb|KYP68937.1| Retrotransposon-derived protein PEG10 [Cajanus cajan]
          Length = 507

 Score =  129 bits (325), Expect = 3e-31
 Identities = 100/295 (33%), Positives = 127/295 (43%), Gaps = 63/295 (21%)
 Frame = +2

Query: 20  VFKISQFFAYHRTPETERITVS---VDAQ-----QWP---------NHLLDTVSHRIASW 148
           +FKI+QFF YH TPE ERITV+   +D       QW           HLL  +  R A  
Sbjct: 78  IFKITQFFDYHNTPEEERITVASFYLDGAALAWFQWMYRNGQIHSWQHLLQALETRFAPT 137

Query: 149 G--------------SLRTIY*VALSRVSNR---------------------RFEVLAQQ 223
                          +  + +      V+NR                     R EV+AQQ
Sbjct: 138 AFDDPRGRLFKLTQTTTVSAFLTEFEAVANRVTGLSPQFLLSCFIFGLKPEIRREVIAQQ 197

Query: 224 PSSLSQAAGLARLQEDKVNDLLRLTRQKP--PWS--PAQTPSRTESSTSKNATSXXXXXX 391
           P SL+ A GLARL E+K+ DL R+ R KP  PWS  P        +     A        
Sbjct: 198 PPSLTHAVGLARLHEEKLQDLSRIQRAKPGAPWSSPPFSRTFTPFAPPQTIAPKPLPPLL 257

Query: 392 XXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXXXX 571
                  RFRQL+ AEMADRREKGLCFN D+++S+SHRC AR                  
Sbjct: 258 PSPPPKTRFRQLTEAEMADRREKGLCFNCDQKYSRSHRCPARFLLLIAEDDDPPSAPDLD 317

Query: 572 XXXVWP---IVDP----EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715
                P   +VDP       P Q +LH ++G     T R+ G I   P+ +LVDG
Sbjct: 318 FPAADPDPSLVDPSTVTSVHPAQISLHALSGTGAPKTLRLTGQIAHHPIRVLVDG 372


>gb|PNY07310.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
 gb|PNY07311.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1494

 Score =  130 bits (327), Expect = 7e-31
 Identities = 102/292 (34%), Positives = 133/292 (45%), Gaps = 55/292 (18%)
 Frame = +2

Query: 5   NAPALVFKISQFFAYHRTPETERITVS--------VDAQQWP---------NHLLDTVSH 133
           +A   +FKISQFF YH+TPE +RIT++        +   QW          N  L  +  
Sbjct: 69  DANGWIFKISQFFTYHQTPEEDRITIASFYLDGPALAWYQWMYRNSQIVSWNQFLRALET 128

Query: 134 RIASW------GSLRTI--------Y*VALSRVSNR---------------------RFE 208
           R A        G+L  +        Y      ++NR                     R E
Sbjct: 129 RFAPTAYDDPKGNLFKLTQSGSVNDYLTEFESLANRIVGLSPLDLLSCFISGLKVEIRRE 188

Query: 209 VLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQT-PSRTESSTSKNATSXXXX 385
           VLAQQP+SLSQAAGLARLQEDK+ D ++ +R K   SPA T PSR   +           
Sbjct: 189 VLAQQPNSLSQAAGLARLQEDKIQDQIKASRSK--LSPAYTAPSRPNFNLPGRPAPGLLP 246

Query: 386 XXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXX 565
                    RFR LS  E+A+RREKGLCFN D+++SK HRC  R                
Sbjct: 247 APPSKP---RFRHLSEPELAERREKGLCFNCDQKWSKQHRCGGRTFLLLADEEDEEVDPS 303

Query: 566 XXXXXVWPIVDP--EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715
                +     P  +    Q +LH + G+H   TFRV+G I  +PV+ILVDG
Sbjct: 304 QLESTIDIDTSPPDDTPQAQLSLHALAGSHATDTFRVEGQILKQPVNILVDG 355


>ref|XP_017423676.1| PREDICTED: uncharacterized protein LOC108332889 [Vigna angularis]
          Length = 556

 Score =  107 bits (268), Expect(3) = 5e-29
 Identities = 69/182 (37%), Positives = 88/182 (48%), Gaps = 10/182 (5%)
 Frame = +2

Query: 200 RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQ--TPSRTESSTSKNATS 373
           R EVL+QQP +LSQA+GLARL E+K  DL RL RQ+    P    T S T          
Sbjct: 173 RREVLSQQPQTLSQASGLARLHEEKFQDLTRLIRQRSGPGPLSLLTRSPTTPLVPLVPLK 232

Query: 374 XXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKAR--------XXXX 529
                        RF+QL+ AEMADRRE+GLCFN D++FS++HRC AR            
Sbjct: 233 QLPPLLPAPPPRTRFKQLTEAEMADRRERGLCFNCDQKFSRNHRCPARYMLLVAEEDNDS 292

Query: 530 XXXXXXXXXXXXXXXXXVWPIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILV 709
                            V P++  +    Q  L+ ++G     T RV G ID   V +LV
Sbjct: 293 CKSSLHAPILEPNPPDPVSPVITDDPNQAQLCLNALSGFGAPETLRVSGQIDQFQVTVLV 352

Query: 710 DG 715
           DG
Sbjct: 353 DG 354



 Score = 37.4 bits (85), Expect(3) = 5e-29
 Identities = 17/23 (73%), Positives = 19/23 (82%)
 Frame = +1

Query: 136 HRIVGLSTHNLLSCFVSGLKPEI 204
           +RIVGL    LLSCF+SGLKPEI
Sbjct: 150 NRIVGLQPQFLLSCFISGLKPEI 172



 Score = 31.6 bits (70), Expect(3) = 5e-29
 Identities = 12/19 (63%), Positives = 14/19 (73%)
 Frame = +3

Query: 81  YQWMHSNGQITSWTQFLTA 137
           +QWM+ NGQI SW Q L A
Sbjct: 94  FQWMYRNGQIHSWPQLLQA 112


>ref|XP_014511429.1| uncharacterized protein LOC106770116 [Vigna radiata var. radiata]
          Length = 851

 Score =  120 bits (302), Expect = 1e-27
 Identities = 96/299 (32%), Positives = 124/299 (41%), Gaps = 67/299 (22%)
 Frame = +2

Query: 20  VFKISQFFAYHRTPETERITVS---VDAQ-----QWP---------NHLLDTVSHRIA-- 142
           +FKI+QFF YH TPE ERI V+   +D       QW           HLL  +  R A  
Sbjct: 101 IFKINQFFDYHNTPEEERIIVASFYLDGAALAWFQWMYRNGQILSWTHLLQALETRFAPT 160

Query: 143 ------------SWGSLRTIY*VALSRVSNR---------------------RFEVLAQQ 223
                       S  S  + Y       +NR                     R EV+AQQ
Sbjct: 161 AFEDPRGKLFKLSQTSSVSAYLNEFEATANRVTGXSPPFLLSCFLSGLKSEXRREVVAQQ 220

Query: 224 PSSLSQAAGLARLQEDKVNDLLRLTRQKP--PWSPAQTPSRTESSTSKNATSXXXXXXXX 397
           P +LS A GLARLQE+K+ DL R+ R KP   W  +       +   +            
Sbjct: 221 PQTLSLAVGLARLQEEKLWDLSRVQRVKPLSSWPTSSLTRTVPTPIQQPPPKPLPPILPS 280

Query: 398 XXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXXXXXX 577
                R+RQL+ AEMADR EKGLCFN D+++S+SHRC AR                    
Sbjct: 281 PSPKTRYRQLTEAEMADRHEKGLCFNCDQKYSRSHRCPARFLLLIAEEDDSTGGLASNPT 340

Query: 578 XVWPIVDPEAE-------------PTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715
              P  DP  +             P Q +LH ++G     T R+ GHI   P+ +LVDG
Sbjct: 341 SFDP--DPRTDEQPPQPIDLLLDLPAQISLHALSGIGGPETLRLTGHIGQHPIRVLVDG 397


>gb|PNX79664.1| hypothetical protein L195_g035651 [Trifolium pratense]
          Length = 536

 Score =  117 bits (293), Expect = 1e-26
 Identities = 90/303 (29%), Positives = 129/303 (42%), Gaps = 65/303 (21%)
 Frame = +2

Query: 2   TNAPALVFKISQFFAYHRTPETERITVS---VDAQQWPNHLLDTVSHRIASWGSLR---- 160
           T+    +FKISQFF YH+TPE ERIT++   +D      +     + +IASW        
Sbjct: 61  TDTHGWIFKISQFFDYHQTPEEERITIASFYLDGAALAWYQWMYRNRQIASWAQFLEKLE 120

Query: 161 ------------------------TIY*VALSRVSNR---------------------RF 205
                                   + Y      ++NR                     R 
Sbjct: 121 TRFAPTAFDDPRGNLFKLTQSTTVSAYLTEFEALANRLEGLSDVDLLSCFISGLKSDVRR 180

Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTE-----SSTSKNAT 370
           EV+AQQP+S+SQAAGLARLQE+K+ D+ R +R    W P       +     +S +KN +
Sbjct: 181 EVVAQQPTSISQAAGLARLQEEKLQDIARASRPTSSWQPPSVARPIQKAPEVTSPAKNTS 240

Query: 371 SXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXX 550
                         RFR LS  E+ ++REKGLCFN D+++ K H+C AR           
Sbjct: 241 G----LLPTPPAKPRFRHLSGPELDEQREKGLCFNCDKKWPKQHKCGAR-VFVMLADNDD 295

Query: 551 XXXXXXXXXXVWPIVDP--------EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHIL 706
                     +   +DP        E +  Q +L  ++G   A T R+ G+I   PV +L
Sbjct: 296 SFSTTAAEELLTDSIDPSGVLTPQSEVQAAQLSLFALSGVPAADTIRILGYIGSHPVRVL 355

Query: 707 VDG 715
           VDG
Sbjct: 356 VDG 358


>ref|XP_017426291.1| PREDICTED: uncharacterized protein LOC108334872 [Vigna angularis]
          Length = 756

 Score =  116 bits (290), Expect = 6e-26
 Identities = 95/302 (31%), Positives = 130/302 (43%), Gaps = 70/302 (23%)
 Frame = +2

Query: 20  VFKISQFFAYHRTPETERITV--------SVDAQQWP---------NHLLDTVSHRIASW 148
           +FKI+QFF YH TPE ERITV        ++   QW            LL  +  R A  
Sbjct: 81  IFKITQFFDYHNTPEEERITVASFYLDGAALALFQWMYRNGQLHSWQQLLQALETRFAPT 140

Query: 149 ------GSLRTI--------Y*VALSRVSNR---------------------RFEVLAQQ 223
                 G L  +        +      ++NR                     R EV+AQQ
Sbjct: 141 AFDDPKGKLFKLAQTTTVSDFLTEFESIANRVAGLPPSFLLSCFISGLKPEIRREVVAQQ 200

Query: 224 PSSLSQAAGLARLQEDKVNDLLRLTRQKP--PWSPAQT----PSRTESSTSKNAT----- 370
           P +LS A GLARLQE+K+ DL R+ + K   PW P        S+T+S T+K+       
Sbjct: 201 PPTLSHAVGLARLQEEKIWDLNRVPKPKSVSPWPPPSINRTITSQTQSQTTKHLPPLLTS 260

Query: 371 ---SXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKAR----XXXX 529
                            R+RQL+ AEMADRREKGLCFN ++++S+SHRC AR        
Sbjct: 261 PIEKSLPPLLTPPPPKTRYRQLTEAEMADRREKGLCFNCEQKYSRSHRCPARFLFFIAEE 320

Query: 530 XXXXXXXXXXXXXXXXXVWPIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILV 709
                            + P+ D   +  Q +LH ++G     T R+ G I    + +LV
Sbjct: 321 ADSVGEGDLEMPTFDGVLEPMGDSPNQSAQISLHALSGTGAPETLRLMGQIGLHQISVLV 380

Query: 710 DG 715
           DG
Sbjct: 381 DG 382


>gb|KYP76287.1| Transposon Ty3-I Gag-Pol polyprotein [Cajanus cajan]
          Length = 712

 Score = 86.7 bits (213), Expect(4) = 1e-25
 Identities = 62/177 (35%), Positives = 82/177 (46%), Gaps = 7/177 (3%)
 Frame = +2

Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXXXX 385
           EV A +P+SL     LA+LQEDK+ +  R    KP    A TPS   +  +         
Sbjct: 131 EVQALRPASLDHTTQLAKLQEDKIEERRRAFFPKPQ---ALTPSSHTALPTPQPR----- 182

Query: 386 XXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXX 565
                     FR+LS  +MA RREKGLC+N DE F+ SHRCK +                
Sbjct: 183 --------VNFRRLSPDDMAARREKGLCYNCDELFTPSHRCKGKFFLLTTDDPIVDDFTP 234

Query: 566 XXXXXVWPIVDP-------EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715
                   ++DP       +  P+Q +LH  TG   + T R+QG I + PV ILVDG
Sbjct: 235 DPTL----VLDPPPPEPVSDDTPSQVSLHAFTGGVGSSTIRLQGQIRNNPVSILVDG 287



 Score = 36.2 bits (82), Expect(4) = 1e-25
 Identities = 14/22 (63%), Positives = 19/22 (86%)
 Frame = +2

Query: 20 VFKISQFFAYHRTPETERITVS 85
          +FKISQFF YH TP++ER+ V+
Sbjct: 17 IFKISQFFDYHNTPKSERLQVA 38



 Score = 31.2 bits (69), Expect(4) = 1e-25
 Identities = 13/21 (61%), Positives = 15/21 (71%)
 Frame = +3

Query: 75  SRYQWMHSNGQITSWTQFLTA 137
           S YQWM+ NGQI +W  FL A
Sbjct: 48  SWYQWMYWNGQIQTWFGFLRA 68



 Score = 31.2 bits (69), Expect(4) = 1e-25
 Identities = 14/23 (60%), Positives = 17/23 (73%)
 Frame = +1

Query: 136 HRIVGLSTHNLLSCFVSGLKPEI 204
           +RIVGL     L+CF+SGL PEI
Sbjct: 106 NRIVGLPAPFALNCFISGLTPEI 128


>ref|XP_006574291.1| PREDICTED: uncharacterized protein LOC102661730 [Glycine max]
          Length = 1588

 Score = 79.3 bits (194), Expect(4) = 2e-24
 Identities = 60/185 (32%), Positives = 82/185 (44%), Gaps = 13/185 (7%)
 Frame = +2

Query: 200 RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXX 379
           R EV A  P S+ QAAGLARLQ +KV D     R +PP +P   P +     S  A +  
Sbjct: 319 RREVQAHHPLSMVQAAGLARLQAEKVLDQRPSPRSRPP-NPTPFPPQLGPPPSLPAPTLP 377

Query: 380 XXXXXXXXXXX--------RFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXX 535
                                +++S  EMA RREKGLCFN DE++ + H+C +R      
Sbjct: 378 PLLNPPPPPRPPTTPMSTPTLKRVSPDEMALRREKGLCFNCDEKYHRGHKCSSRFFILIS 437

Query: 536 XXXXXXXXXXXXXXXV-WPIVDP----EAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVH 700
                             P  DP    +  PTQ +L+++ G     T R  G +  +P+ 
Sbjct: 438 DDLEPIPSHIPIPDLTHHPPPDPPDNLDLYPTQISLNSLAGHIAPETLRFVGQLSGQPML 497

Query: 701 ILVDG 715
           ILVDG
Sbjct: 498 ILVDG 502



 Score = 35.8 bits (81), Expect(4) = 2e-24
 Identities = 13/22 (59%), Positives = 19/22 (86%)
 Frame = +2

Query: 20  VFKISQFFAYHRTPETERITVS 85
           +FKI+QFF YH TPE +++TV+
Sbjct: 207 IFKINQFFEYHSTPEQDKLTVA 228



 Score = 34.7 bits (78), Expect(4) = 2e-24
 Identities = 15/22 (68%), Positives = 18/22 (81%)
 Frame = +1

Query: 139 RIVGLSTHNLLSCFVSGLKPEI 204
           R+VG+S   LLSCF+SGL PEI
Sbjct: 297 RVVGISPPLLLSCFISGLSPEI 318



 Score = 31.2 bits (69), Expect(4) = 2e-24
 Identities = 11/24 (45%), Positives = 16/24 (66%)
 Frame = +3

Query: 66  RSASRYQWMHSNGQITSWTQFLTA 137
           R+ + YQWM +N   TSW+ F+ A
Sbjct: 235 RALAWYQWMKANNHFTSWSSFIQA 258


>gb|PNX92353.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1502

 Score =  112 bits (279), Expect = 2e-24
 Identities = 91/293 (31%), Positives = 126/293 (43%), Gaps = 55/293 (18%)
 Frame = +2

Query: 2   TNAPALVFKISQFFAYHRTPETERITVSVDAQQWP-----------------NHLLDTVS 130
           ++A   +FKISQFF +H TPE +R+T++    + P                 + LL+ + 
Sbjct: 84  SDAMGWIFKISQFFEFHATPEADRLTIASFYMEGPALGWYQWMARNGQLTSWHGLLNAIE 143

Query: 131 HRIASW------GSLRTI--------Y*VALSRVSNR---------------------RF 205
            R A        GSL  +        Y  A   ++NR                     R 
Sbjct: 144 ARFAPSQYDDPKGSLFKLTQKGSVSEYLSAFETLANRIVGLQPPFLLSCFISGLIPEIRR 203

Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXXXX 385
           EV+A QP +L QAA LARLQE+K ND  R  R +   +P        SSTS+  T     
Sbjct: 204 EVMALQPLNLIQAASLARLQEEKFNDARRALRNRGILNPTPLQQIPPSSTSR--TPLALL 261

Query: 386 XXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKAR-XXXXXXXXXXXXXXX 562
                     F++LS  EMA RREKGLCFN DE+F   H+C +R                
Sbjct: 262 PPPPKPSPPTFKRLSPTEMAQRREKGLCFNCDEKFRPGHKCSSRFFILITDDDIDPDLTH 321

Query: 563 XXXXXXVWPIVDPEAEPT--QFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715
                   P  +P+ EP+  Q + H ++G     T R+ G I  + VHIL+DG
Sbjct: 322 IDPNSAAQPDPEPDLEPSQAQISFHALSGHLAPETLRLAGRIAHQRVHILMDG 374


>gb|KYP42639.1| Retrovirus-related Pol polyprotein from transposon 297 family
           [Cajanus cajan]
          Length = 894

 Score =  111 bits (278), Expect = 2e-24
 Identities = 98/302 (32%), Positives = 132/302 (43%), Gaps = 64/302 (21%)
 Frame = +2

Query: 2   TNAPALVFKISQFFAYHRTPETERITVS---VDAQ-----QWP---------NHLLDTVS 130
           ++A   +FKI+QFF YH TPE E ITV+   +D       QW          N +L  + 
Sbjct: 63  SDALGWIFKITQFFEYHNTPEEECITVASFYLDGSALAWFQWMYRNGQIHSWNQMLQALE 122

Query: 131 HRIASW------GSLR--------TIY*VALSRVSNR---------------------RF 205
           +R A        G L         T Y      ++NR                     R 
Sbjct: 123 NRFAPTAFDNPRGKLFKLTQSFSVTSYLTEFESLANRIVGLQPSFLLSCFISGLKPELRR 182

Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTR--QKPPWSPAQTPSRTESSTSKNAT-SX 376
           +V+A QPSSLSQA G ARL E+K+ D  R  R  Q P WS A   SRT S  S +     
Sbjct: 183 DVIAHQPSSLSQAVGYARLHEEKLFDSSRTHRPSQSPRWS-APPVSRTFSPLSPSPPPKS 241

Query: 377 XXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKAR---------XXXX 529
                       RF+QL+ AEMAD+REKGLCFN D++FS++HRC AR             
Sbjct: 242 LPPLLPPPPPKTRFKQLTEAEMADKREKGLCFNCDQKFSRNHRCLARYFLLIVDEDESPP 301

Query: 530 XXXXXXXXXXXXXXXXXVWPIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILV 709
                            V  ++D   +  Q +L+ ++G+ T  T R+ G +    V +LV
Sbjct: 302 PDSDGGSDLGAGSDPKLVEELLDLSPDSAQLSLNALSGSGTPETLRIVGLLAQYQVRVLV 361

Query: 710 DG 715
           DG
Sbjct: 362 DG 363


>gb|KYP61806.1| hypothetical protein KK1_016317 [Cajanus cajan]
          Length = 287

 Score = 95.9 bits (237), Expect(3) = 2e-22
 Identities = 56/112 (50%), Positives = 70/112 (62%), Gaps = 6/112 (5%)
 Frame = +2

Query: 200 RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWS-PAQTPSRTESSTSKN---- 364
           R EV+AQQP SL+ A GLARLQE++++DL R  R +P  S PA   +RT +S        
Sbjct: 77  RREVIAQQPQSLATAVGLARLQEERLSDLSRFQRPRPASSWPAPLLTRTITSAPPPQQPT 136

Query: 365 -ATSXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKAR 517
            A               RFRQL+ AEMADRREKGLCFN D+++S+SHRC AR
Sbjct: 137 AAPPRAPPLLPTPAPKTRFRQLTEAEMADRREKGLCFNCDQKYSRSHRCPAR 188



 Score = 33.9 bits (76), Expect(3) = 2e-22
 Identities = 15/23 (65%), Positives = 19/23 (82%)
 Frame = +1

Query: 136 HRIVGLSTHNLLSCFVSGLKPEI 204
           +R+ GLS   LLSCF+SGL+PEI
Sbjct: 54  NRVTGLSPPFLLSCFLSGLQPEI 76



 Score = 25.0 bits (53), Expect(3) = 2e-22
 Identities = 10/16 (62%), Positives = 11/16 (68%)
 Frame = +3

Query: 90  MHSNGQITSWTQFLTA 137
           M+ NGQI SWT  L A
Sbjct: 1   MYRNGQILSWTHLLQA 16


>dbj|GAU26773.1| hypothetical protein TSUD_317710 [Trifolium subterraneum]
          Length = 1395

 Score =  105 bits (263), Expect = 2e-22
 Identities = 92/301 (30%), Positives = 123/301 (40%), Gaps = 63/301 (20%)
 Frame = +2

Query: 2   TNAPALVFKISQFFAYHRTPETERITVSV-----DAQQWPNH------------LLDTVS 130
           T+A   +FKISQFF YH TPETER+TV+       A  W  +            LL  + 
Sbjct: 62  TDAMGWIFKISQFFDYHNTPETERLTVASFYMDGPALTWYQYMYRNGHINSWFGLLQALE 121

Query: 131 HRIA---------------SWGSLRTIY*VALSRVSNR---------------------R 202
            R A                 GSL   Y     R++NR                     R
Sbjct: 122 ARFAPSYYDDPSQALFKLTQRGSLNQ-YLTEFERLANRIIGLPQPFILNCFISGLAPEIR 180

Query: 203 FEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQK---------PPWSPAQTPSRTESST 355
            EV A QP++LS A  LA+LQEDK++D  R  + K         PP  P    S T+   
Sbjct: 181 REVQALQPATLSLATALAKLQEDKIDDRRRNFKTKQHTSSSSTTPPLLPTPLSSTTQPPN 240

Query: 356 SKNATSXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXX 535
           + +                +FR+LS+ +MA RREKGLC+N DE F   H+CK R      
Sbjct: 241 NPSRV--------------QFRKLSSEDMASRREKGLCYNCDETFIPGHKCKGRLYLLVS 286

Query: 536 XXXXXXXXXXXXXXXV-WPIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVD 712
                          +   I  P     Q + H++ G+    T R+ G I + PV IL+D
Sbjct: 287 DEPDPAESPPSQTPDLDHSIESPPDLEGQISFHSLAGSSATATLRIIGQIANHPVTILID 346

Query: 713 G 715
           G
Sbjct: 347 G 347


>gb|PNX92970.1| Ty3/gypsy retrotransposon protein, partial [Trifolium pratense]
          Length = 1483

 Score =  105 bits (261), Expect = 5e-22
 Identities = 92/300 (30%), Positives = 123/300 (41%), Gaps = 62/300 (20%)
 Frame = +2

Query: 2   TNAPALVFKISQFFAYHRTPETERITVSVDAQQWPNHLLDTVSHR---IASW-GSLRTI- 166
           ++A   +FKISQFF YH+TPE ER+TV+    + P        HR   I +W G L+ + 
Sbjct: 67  SDAMGWIFKISQFFDYHQTPEEERLTVASFYMEGPALSWFQWMHRNGQITTWFGLLQALE 126

Query: 167 --------------------------Y*VALSRVSNR---------------------RF 205
                                     Y     R++NR                     R 
Sbjct: 127 TRFAPSYYDDPSSSLFKLTQRTTVNEYLAEFERLANRIVGLQPPFLLSCFISGLSPEIRR 186

Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKP---PWSPAQTPSRTESSTSKNAT-S 373
           EV A +P SL+QA  LA+LQEDK+ D  R  + KP     S +  P     ST    T +
Sbjct: 187 EVQALRPMSLTQATALAKLQEDKIADRRRFFKNKPNSQQISSSSNPFGPPPSTPPLPTPN 246

Query: 374 XXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXX 553
                         FR+LS  EMA RREKGLC+N DE F+  H+C+ R            
Sbjct: 247 TLPLLPPPKPNRPNFRKLSPEEMASRREKGLCYNCDETFTPQHKCRGRFFLLVTEEPMES 306

Query: 554 XXXXXXXXXVWPIVDPEAEPT------QFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715
                      P    E  PT      Q +LH ++G   A T R+ G I + PV +L+DG
Sbjct: 307 PPDLIDFTE--PDPPNETTPTDAAIDAQISLHALSGCTVASTIRLMGCIANHPVTVLIDG 364


>gb|KHN07600.1| Retrovirus-related Pol polyprotein from transposon opus, partial
           [Glycine soja]
          Length = 466

 Score =  103 bits (256), Expect = 1e-21
 Identities = 87/264 (32%), Positives = 122/264 (46%), Gaps = 33/264 (12%)
 Frame = +2

Query: 20  VFKISQFFAYHRTPETERITVSV-----DAQQWPNHLLDTVSHRIASW-GSLRTIY*V-- 175
           +FKISQ F Y  TPE ERITV+       A  W   +    +  I SW G L+ +     
Sbjct: 60  IFKISQLFEYQNTPEEERITVAFFYLDGAALSWYQWMFR--NGFITSWSGFLQALESRFA 117

Query: 176 ---------ALSRVSNR----RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPW 316
                    AL +++ R    R EVLA QP SL QA  LA+LQEDK+ D     RQ PP 
Sbjct: 118 PSYYDDPKGALFKLTQRGTDIRREVLALQPISLPQAMALAKLQEDKIRD----RRQAPPR 173

Query: 317 SPAQTPSRTESSTSKNATSXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSK 496
           +   TPS + +  ++   S              + Q +  EMA RREKGLC+N +E++S 
Sbjct: 174 NH-NTPSASYAPPTRKPHST-------------YVQRTPDEMALRREKGLCYNCEEKWSS 219

Query: 497 SHRCKARXXXXXXXXXXXXXXXXXXXXXVWPIVDP------------EAEPTQFALHTMT 640
           +HRCK R                     + P+ +P            E  P   +LH ++
Sbjct: 220 THRCKGRVLLFIADNPSPTSDEPISEPPLLPLPEPTPACPPDLDSTSELTPPHVSLHALS 279

Query: 641 GAHTAHTFRVQGHIDDEPVHILVD 712
           G  ++ TFR+ G I+  P+ IL+D
Sbjct: 280 GLPSSETFRLVGIINHSPLTILID 303


>gb|PNX98954.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 957

 Score =  103 bits (258), Expect = 1e-21
 Identities = 88/302 (29%), Positives = 128/302 (42%), Gaps = 64/302 (21%)
 Frame = +2

Query: 2   TNAPALVFKISQFFAYHRTPETERITVS---VDAQ-----QWPNHLLDTVSHRIASW-GS 154
           ++A   +FKISQFF YH+TPE ER+TV+   ++ Q     QW +      ++++ +W G 
Sbjct: 65  SDAMGWIFKISQFFDYHQTPEEERLTVASFYMEGQALSWFQWMHR-----NNQLNTWFGF 119

Query: 155 LRTI---------------------------Y*VALSRVSNR------------------ 199
           L+ +                           Y     R++NR                  
Sbjct: 120 LQALETRFAPSFYDEPSSALFKLVQRSSVNNYLTEFERLANRIVGLPQPFLLSCFISGLS 179

Query: 200 ---RFEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQ-TPSRTESSTSKNA 367
              R EV A +P +L QA  LA+LQEDK++D  RL + K   S    TP  + S+     
Sbjct: 180 PEIRREVQALRPVTLCQATALAKLQEDKIDDRRRLFKSKNSTSTLNPTPIASSSAPPLLP 239

Query: 368 TSXXXXXXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXX 547
           T               FR+LS  EMA RREKGLC+N DE F+  H+C+ R          
Sbjct: 240 TPKPNNKV-------NFRKLSPEEMATRREKGLCYNCDETFTPLHKCRGRFFLLVADDDC 292

Query: 548 XXXXXXXXXXXVWPIVDPEAEPT------QFALHTMTGAHTAHTFRVQGHIDDEPVHILV 709
                      + P   P   PT      Q + H M+G+    T R+ G + + PV +L+
Sbjct: 293 DPDDIPDPPPDIDPTPPPPTLPTTEPSEAQISFHAMSGSADPATIRISGFLANHPVTVLI 352

Query: 710 DG 715
           DG
Sbjct: 353 DG 354


>dbj|GAU45274.1| hypothetical protein TSUD_99960 [Trifolium subterraneum]
          Length = 970

 Score =  102 bits (255), Expect = 3e-21
 Identities = 87/297 (29%), Positives = 126/297 (42%), Gaps = 59/297 (19%)
 Frame = +2

Query: 2   TNAPALVFKISQFFAYHRTPETERITVSVDAQQWPNHLLDTVSHR---IASW-GSLRTI- 166
           T+A   +FKISQFF +H+T E ER+TV+    + P        H+   I +W G L+ + 
Sbjct: 63  TDAMGWIFKISQFFDFHQTTEEERLTVASFYMEGPALSWYQWMHKNNQINTWFGFLQALE 122

Query: 167 --------------------------Y*VALSRVSNR---------------------RF 205
                                     Y +   R++NR                     R 
Sbjct: 123 MRFAPSYYDEPSSALFKLVQKTTVNSYLIKFERLTNRIVGLPQPFLLSCFISGLSPEIRR 182

Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXXXX 385
           EV A +P SL+QA  LA+LQEDK+ D  RL + K   + A T +     +S N  +    
Sbjct: 183 EVQALRPLSLTQATALAKLQEDKIEDRRRLFKTK---TSASTTTTNALPSSSNLPALLPN 239

Query: 386 XXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXX 565
                     FR+LS  EMA+RREKGLC+N DE F+  H+CK R                
Sbjct: 240 PKPPNRV--NFRKLSPEEMANRREKGLCYNCDETFTPQHKCKGRFFLLIADDDFDSDEPP 297

Query: 566 XXXXXVW-------PIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715
                +        PI+   +E  Q + H M+G+    T R+ G + + PV +L+DG
Sbjct: 298 IPPPTIESPPPESPPIITDPSE-AQISFHAMSGSTDQTTIRIPGRLANHPVTVLIDG 353


>gb|PNX92469.1| hypothetical protein L195_g015607 [Trifolium pratense]
          Length = 566

 Score =  102 bits (254), Expect = 3e-21
 Identities = 91/290 (31%), Positives = 119/290 (41%), Gaps = 52/290 (17%)
 Frame = +2

Query: 2   TNAPALVFKISQFFAYHRTPETERITVS--------VDAQQWP---------NHLLDTVS 130
           T+A   +FKI QFF YH TPE ERIT++        +   QW          N  L  + 
Sbjct: 61  TDAHGWIFKICQFFTYHETPEEERITIASFYLDGPALSWYQWMYRNSQLVSWNQFLQALE 120

Query: 131 HRIASW------GSLRTI--------Y*VALSRVSNR---------------------RF 205
            R A        G+L  +        Y V    ++NR                     R 
Sbjct: 121 TRFAPTAYDDPRGNLFKLTQSTTVAAYLVEFEALANRIVGLSSADLLSCFISGLKLDIRR 180

Query: 206 EVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXXXX 385
           EVLA+QP+SL+QAAGLARLQEDK+ D  R  R      P  TP     S+  + T     
Sbjct: 181 EVLARQPTSLTQAAGLARLQEDKLLDQQRANR------PKFTPPPPRYSSDSSITRPSPG 234

Query: 386 XXXXXXXXXRFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXXXX 565
                    RFR LS  E+A+RREKGLCF Y  R  ++   K                  
Sbjct: 235 LLPTPPAKPRFRHLSEPELAERREKGLCFXYQNRSWRNAGKKDYVSTVIKSDQELEAIVT 294

Query: 566 XXXXXVWPIVDPEAEPTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715
                     + ++   Q +LH ++G   + TF+V G I    V ILVDG
Sbjct: 295 TPDSS----NNDDSPSAQLSLHALSGHQASDTFKVTGKIATHTVDILVDG 340


>dbj|GAU40605.1| hypothetical protein TSUD_28110 [Trifolium subterraneum]
          Length = 1208

 Score =  102 bits (254), Expect = 4e-21
 Identities = 89/298 (29%), Positives = 123/298 (41%), Gaps = 60/298 (20%)
 Frame = +2

Query: 2   TNAPALVFKISQFFAYHRTPETERITVS--------VDAQQWPNH---------LLDTVS 130
           T+A   +FKISQFF YH TPE ER+TV+        +   QW            LL  + 
Sbjct: 63  TDAMGWIFKISQFFDYHNTPEEERLTVASFYMDGPALSWYQWMFRNGLITTWFALLQAIE 122

Query: 131 HRIA---------------SWGSLRTIY*VALSRVSNR---------------------R 202
            R A                 G L   Y     RV+NR                     R
Sbjct: 123 TRFAPSYYDDPSQALFKLTQRGPLNQ-YLTEFERVANRIVGLPQPFLLSCFISGLSPEIR 181

Query: 203 FEVLAQQPSSLSQAAGLARLQEDKVNDLLRLTRQKPPWSPAQTPSRTESSTSKNATSXXX 382
            EV A QP+SLS A  LA+LQEDK+ +  R       + P Q  + + SS++ N+T    
Sbjct: 182 REVQALQPASLSLATALAKLQEDKIEERRR------NYKPRQNNTSSSSSSNTNSTPLLP 235

Query: 383 XXXXXXXXXX-RFRQLSAAEMADRREKGLCFNYDERFSKSHRCKARXXXXXXXXXXXXXX 559
                      +FR+LS+ EM+ RREKGLC+N D+ F+  H+CK R              
Sbjct: 236 SPTTPSNPPRVQFRKLSSEEMSSRREKGLCYNCDDTFTPGHKCKGRFYLLVSDDPESPPV 295

Query: 560 XXXXXXXVWPIVDPEAE------PTQFALHTMTGAHTAHTFRVQGHIDDEPVHILVDG 715
                    P  D E          Q + H+++G+    T R+ G I +  V +L+DG
Sbjct: 296 EPLSIQS--PETDTENHLDTPDLDAQISFHSLSGSSATATLRIPGQIANHSVTVLIDG 351


Top