BLASTX nr result

ID: Astragalus23_contig00015334 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00015334
         (1176 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX86812.1| hypothetical protein L195_g042894, partial [Trifo...   294   4e-92
gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   280   9e-89
gb|AAO23078.1| polyprotein [Glycine max]                              285   4e-88
gb|PNX99332.1| retrotransposon-related protein [Trifolium pratense]   273   3e-86
gb|PNX93204.1| hypothetical protein L195_g016355 [Trifolium prat...   281   4e-86
dbj|GAU25507.1| hypothetical protein TSUD_279910 [Trifolium subt...   263   7e-85
gb|PNY16560.1| Ty3/gypsy retrotransposon protein [Trifolium prat...   273   1e-84
gb|PNX94483.1| retrotransposon-related protein, partial [Trifoli...   269   2e-84
dbj|GAU37387.1| hypothetical protein TSUD_22610 [Trifolium subte...   259   1e-83
dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subt...   265   3e-83
dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subt...   260   1e-82
ref|XP_014620186.1| PREDICTED: uncharacterized protein LOC106795...   268   5e-82
gb|KYP53387.1| Retrotransposon-derived protein PEG10 [Cajanus ca...   266   7e-82
dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subt...   259   4e-81
ref|XP_020206869.1| uncharacterized protein LOC109791920 [Cajanu...   264   1e-80
gb|KYP57088.1| Retrovirus-related Pol polyprotein from transposo...   251   2e-79
gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   275   2e-79
dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subte...   275   4e-79
ref|XP_014634047.1| PREDICTED: uncharacterized protein LOC106799...   264   4e-79
gb|KYP39589.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]   247   1e-77

>gb|PNX86812.1| hypothetical protein L195_g042894, partial [Trifolium pratense]
          Length = 487

 Score =  294 bits (753), Expect = 4e-92
 Identities = 160/346 (46%), Positives = 211/346 (60%), Gaps = 2/346 (0%)
 Frame = -1

Query: 1176 LFEEKYSSSNKNKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQN--KGTLPPLLP 1003
            LFEEKY++  K K                          NK    + QN  +  LPPLLP
Sbjct: 55   LFEEKYTTQTKPKT-----NPYKSSYTPNSYQNKISPNTNKPHPITQQNPQRAQLPPLLP 109

Query: 1002 TPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQISXX 823
            TPN KPM+       IK+M++AEIQLRRDKGLCYFCD+KFS TH+C N+ + M+LQ+   
Sbjct: 110  TPNQKPMS-------IKNMSSAEIQLRRDKGLCYFCDDKFSHTHRCPNR-RVMMLQLREE 161

Query: 822  XXXXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGGSS 643
                   DP +  L S +S   +HHLSLNAMKG +G G IRF+G IG I +Q L+DGGSS
Sbjct: 162  DDKELEPDPPEESLNSHTSDDNQHHLSLNAMKGISGRGIIRFTGMIGNIEVQVLVDGGSS 221

Query: 642  ENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXXXV 463
            + +LQPRIAQ LK+ +E  P F+VL+GNGQ +  EG +  L VQ+Q              
Sbjct: 222  DTYLQPRIAQFLKVPIETSPKFQVLVGNGQSLIVEGMVRQLHVQVQGHELTIPAYLLPVA 281

Query: 462  GADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNNTD 283
            GADLILGS+WLATLGPH+ADYA+  +KF+   K+IT +GD + QP ++Q HQL+RM++T+
Sbjct: 282  GADLILGSSWLATLGPHIADYASLTLKFYQNGKYITLQGDTRPQPLQSQLHQLRRMHHTN 341

Query: 282  AISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTPT 145
            AI+ECFT+QM+ P V                   LHTY+++F  P+
Sbjct: 342  AIAECFTIQMLAPEVPQDVLAELPSDIEPELAILLHTYQKLFHKPS 387



 Score = 76.3 bits (186), Expect = 2e-11
 Identities = 39/62 (62%), Positives = 43/62 (69%)
 Frame = -2

Query: 188 LHCYTLTDRCLIHLLTGLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEML 9
           LH Y    + L H  +GLPP R Q H I L EG  PVKV+PYRYPHSQKEQIE MV EML
Sbjct: 376 LHTY----QKLFHKPSGLPPPREQMHEIHLQEGTTPVKVRPYRYPHSQKEQIEKMVHEML 431

Query: 8   DQ 3
           +Q
Sbjct: 432 EQ 433


>gb|PNY17453.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1535

 Score =  280 bits (715), Expect(2) = 9e-89
 Identities = 152/348 (43%), Positives = 202/348 (58%), Gaps = 4/348 (1%)
 Frame = -1

Query: 1176 LFEEKYSSSNKNKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNS-NQNKGTLPPLLPT 1000
            ++EEKYS +NKN++ Y+                    + NK + N  +       P+L T
Sbjct: 224  VYEEKYSYNNKNQKNYSN-----------------SYSTNKPNTNKPDYTTRNTAPILNT 266

Query: 999  PNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQ---IS 829
            P  +PM+Q   N  IK M+ AE QLRRDKGLCY+CD+KFSFTHKC N+   ++     + 
Sbjct: 267  PPTRPMSQFQNNPNIKRMSQAERQLRRDKGLCYWCDDKFSFTHKCPNRQLMLIQNDDDLD 326

Query: 828  XXXXXXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGG 649
                        +  ++S  +   EHHLSLNAMKG +  G +RF+G I  I +Q L+DGG
Sbjct: 327  ADQVLDQLTQTTETTIKSLDTNQPEHHLSLNAMKGTSNMGVLRFAGSIEHIGVQILIDGG 386

Query: 648  SSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXX 469
            SS+NFLQPRIA+ LKL +EP P F VL+GNG+ M  EG I+NL ++IQ            
Sbjct: 387  SSDNFLQPRIAKFLKLPIEPGPQFNVLVGNGEIMTAEGVIQNLPLEIQGHKLEVPVFLLP 446

Query: 468  XVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNN 289
              GAD+ILG++WLATLGPHVADYA+  +KFF KDKF+T  G    +P  AQFH  KR+ N
Sbjct: 447  VAGADVILGASWLATLGPHVADYASLTLKFFLKDKFVTLTGQAVPRPTPAQFHHFKRLAN 506

Query: 288  TDAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTPT 145
            TD+I+ECFTVQ ++                      L+TY+ VF TPT
Sbjct: 507  TDSIAECFTVQCLKSTDDADIFKDLPTNTEPEIAMLLYTYKNVFKTPT 554



 Score = 77.8 bits (190), Expect(2) = 9e-89
 Identities = 36/47 (76%), Positives = 39/47 (82%)
 Frame = -2

Query: 143 TGLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           T LPP+R  NH I LIEGA PVKVKPYRYPHSQKEQIE M+Q+ML Q
Sbjct: 554 TALPPDRFHNHTIPLIEGATPVKVKPYRYPHSQKEQIEAMIQDMLHQ 600


>gb|AAO23078.1| polyprotein [Glycine max]
          Length = 1552

 Score =  285 bits (728), Expect(2) = 4e-88
 Identities = 157/343 (45%), Positives = 202/343 (58%)
 Frame = -1

Query: 1176 LFEEKYSSSNKNKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQNKGTLPPLLPTP 997
            LFEEKY+S  K K     +                    N+ ++N   N   LPPLLPTP
Sbjct: 246  LFEEKYTSPPKTKT----FSNLARNFTSNTSATQKYPPTNQKNDNPKPN---LPPLLPTP 298

Query: 996  NIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQISXXXX 817
            + KP   N +N  IK ++ AEIQLRR+K LCYFCDEKFS  HKC N+ + M+LQ+     
Sbjct: 299  STKPF--NLRNQNIKKISPAEIQLRREKNLCYFCDEKFSPAHKCPNR-QVMLLQLE--ET 353

Query: 816  XXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGGSSEN 637
                 D Q  V E  +   + HHLSLNAM+G  G GTIRF+G++G I ++ L+DGGSS+N
Sbjct: 354  DEDQTDEQVMVTEEANMDDDTHHLSLNAMRGSNGVGTIRFTGQVGGIAVKILVDGGSSDN 413

Query: 636  FLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXXXVGA 457
            F+QPR+AQ LKL VEP P+ RVL+GNGQ +  EG ++ L + IQ Q            GA
Sbjct: 414  FIQPRVAQVLKLPVEPAPNLRVLVGNGQILSAEGIVQQLPLHIQGQEVKVPVYLLQISGA 473

Query: 456  DLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNNTDAI 277
            D+ILGSTWLATLGPHVADYA   +KFF  DKFIT +G+  ++  +AQ H  +R+ NT +I
Sbjct: 474  DVILGSTWLATLGPHVADYAALTLKFFQNDKFITLQGEGNSEATQAQLHHFRRLQNTKSI 533

Query: 276  SECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTP 148
             ECF +Q++Q  V                   LHTY QVF  P
Sbjct: 534  EECFAIQLIQKEVPEDTLKDLPTNIDPELAILLHTYAQVFAVP 576



 Score = 70.5 bits (171), Expect(2) = 4e-88
 Identities = 31/45 (68%), Positives = 38/45 (84%)
 Frame = -2

Query: 137 LPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           LPP R Q+HAI L +G+ PVKV+PYRYPH+QK+QIE M+QEML Q
Sbjct: 579 LPPQREQDHAIPLKQGSGPVKVRPYRYPHTQKDQIEKMIQEMLVQ 623


>gb|PNX99332.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1084

 Score =  273 bits (697), Expect(2) = 3e-86
 Identities = 154/346 (44%), Positives = 198/346 (57%), Gaps = 2/346 (0%)
 Frame = -1

Query: 1176 LFEEKYSSSNKNKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQNKGTLPPLLPTP 997
            ++EEKY+++ K  Q Y                            NS +N     P+L TP
Sbjct: 225  VYEEKYTTNTKLPQTYQNNQITNKTYAAKP-------------ENSTRNSA---PILHTP 268

Query: 996  NIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQISXXXX 817
              +PM+ N +N  IK ++ AE Q+RRDKGLCY+CDEKFSFTHKC N+ + M+LQ      
Sbjct: 269  PTRPMHPNQRNPNIKRISPAERQIRRDKGLCYWCDEKFSFTHKCPNR-QLMLLQYDDGDT 327

Query: 816  XXXXXDPQQNVLESQSSASE--EHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGGSS 643
                  P    L + S  +   E HLS+NAMKG    G +RF+G IG I +Q L+DGGSS
Sbjct: 328  QLFDESPDPPDLTTNSLDTNLPELHLSMNAMKGTNNMGVMRFAGSIGHIDVQILIDGGSS 387

Query: 642  ENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXXXV 463
            +NF+QPRIA+ LKL VEP P F+VL+GNG+ M  EG I+ L + IQ              
Sbjct: 388  DNFVQPRIAKFLKLPVEPAPIFKVLVGNGEIMTAEGVIKQLPINIQSHKLEVSAYLLPVA 447

Query: 462  GADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNNTD 283
            GAD+ILG++WLATLGPHVADYA+  +KFF   KF+T  GD  A+P+ AQFH L+R  NTD
Sbjct: 448  GADVILGASWLATLGPHVADYASLTLKFFLNGKFVTLVGDPIARPSPAQFHHLQRFCNTD 507

Query: 282  AISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTPT 145
            AI+ECFTVQ V+P                     LH +  VF TPT
Sbjct: 508  AIAECFTVQCVKPHESSDIFRELPTDIEPEIALLLHNFHSVFQTPT 553



 Score = 76.3 bits (186), Expect(2) = 3e-86
 Identities = 36/47 (76%), Positives = 38/47 (80%)
 Frame = -2

Query: 143 TGLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           T LPP R  NHAI L+EG+ PVKVKPYRYPHSQK QIE MVQEML Q
Sbjct: 553 TTLPPTRAHNHAIPLLEGSDPVKVKPYRYPHSQKNQIEVMVQEMLQQ 599


>gb|PNX93204.1| hypothetical protein L195_g016355 [Trifolium pratense]
          Length = 576

 Score =  281 bits (720), Expect = 4e-86
 Identities = 142/301 (47%), Positives = 190/301 (63%)
 Frame = -1

Query: 1047 NNSNQNKGTLPPLLPTPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHK 868
            NN N N+ T+PPLLPTPN KP N  +KN  +K+MT AE+Q+RR+KGLCY CDEK+SF+H+
Sbjct: 261  NNPNPNRTTIPPLLPTPNSKPTNTYSKNQNVKNMTRAEMQIRREKGLCYTCDEKWSFSHR 320

Query: 867  CLNKHKFMVLQISXXXXXXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSGR 688
            C N+H  M+LQI             +N +++      E HLS NA++G TG GTI+F+G 
Sbjct: 321  CPNRH-LMILQIEDDDNDYEDA--SENQVDTGGDKQLELHLSFNALRGATGVGTIKFTGY 377

Query: 687  IGQIPIQSLLDGGSSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQI 508
            IG++PIQ L+DGGSS+NFLQPRIA  LKL + P P F+VL+GNG  +  EG I  L + +
Sbjct: 378  IGKMPIQILVDGGSSDNFLQPRIAHFLKLDIAPAPLFKVLVGNGNSLSPEGSIPELCIAV 437

Query: 507  QEQXXXXXXXXXXXVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQP 328
            Q+            VGADLILG+TWLATLGPHVADY    +KF+ +  F+T +G+K   P
Sbjct: 438  QQHNIKIPVYLLPIVGADLILGATWLATLGPHVADYQALSLKFYDQGNFVTLQGEKSTIP 497

Query: 327  AEAQFHQLKRMNNTDAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTP 148
             +AQ H ++R+  TDAI  CF++  V+P+                    L+ YR VF  P
Sbjct: 498  QQAQLHHMRRLYQTDAIEACFSIHRVEPINQQDYWLDLPTDMEPELVLLLNKYRDVFHKP 557

Query: 147  T 145
            T
Sbjct: 558  T 558


>dbj|GAU25507.1| hypothetical protein TSUD_279910 [Trifolium subterraneum]
          Length = 1389

 Score =  263 bits (671), Expect(2) = 7e-85
 Identities = 148/360 (41%), Positives = 205/360 (56%), Gaps = 16/360 (4%)
 Frame = -1

Query: 1176 LFEEKYSSSNKNKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQNKGTLP-----P 1012
            ++EEKY+SSNK +++                        N + N    N+  +      P
Sbjct: 219  VYEEKYTSSNKPQRI---------------------NTNNYSTNKPFMNRTEIQTRNATP 257

Query: 1011 LLPTPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQI 832
            +L TP  +PM+Q  KN  IK ++ AE+Q+RR+KGLCY+CD+KFSFTHKC N+ + M+L  
Sbjct: 258  ILNTPPTRPMSQFQKNPNIKRISPAEMQIRRNKGLCYWCDDKFSFTHKCPNR-QLMLLHY 316

Query: 831  SXXXXXXXXXDPQQNVLESQSSASE-----------EHHLSLNAMKGGTGTGTIRFSGRI 685
                        +  VL++ + ++E           EHHLSLNAMKG    G +RF+G I
Sbjct: 317  DEDSDN------EDKVLDTMTQSTEITTNSLDTNQPEHHLSLNAMKGTNNMGVLRFAGSI 370

Query: 684  GQIPIQSLLDGGSSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQ 505
              I +Q L+DGGSS+NFLQPRIA+ LKL +E  P F+VL+GNG+ M  EG + N+ ++IQ
Sbjct: 371  NNIGVQILIDGGSSDNFLQPRIAKFLKLPIESGPQFKVLVGNGEIMTAEGVVHNVPLEIQ 430

Query: 504  EQXXXXXXXXXXXVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPA 325
                          GAD+ILG++WLATLGPHVA YA+  +K F KDKFIT  G+   +P+
Sbjct: 431  GHKLEVPVFLLPVAGADVILGASWLATLGPHVAHYASLTLKNFWKDKFITLTGEVTHKPS 490

Query: 324  EAQFHQLKRMNNTDAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTPT 145
             AQFH  KR+  TDAI+ECFT+Q ++P                     LHTY+ +F  PT
Sbjct: 491  PAQFHHFKRLQTTDAIAECFTIQWLKPTDEEDVFKELPTNIEPEIAVLLHTYKGLFKPPT 550



 Score = 81.6 bits (200), Expect(2) = 7e-85
 Identities = 36/47 (76%), Positives = 40/47 (85%)
 Frame = -2

Query: 143 TGLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           T LPPNR+ NH I L+EG+ PVKVKPYRYPHSQKEQIE M+QEML Q
Sbjct: 550 TALPPNRSHNHTIPLMEGSNPVKVKPYRYPHSQKEQIEKMIQEMLQQ 596


>gb|PNY16560.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1525

 Score =  273 bits (699), Expect(2) = 1e-84
 Identities = 137/269 (50%), Positives = 184/269 (68%), Gaps = 1/269 (0%)
 Frame = -1

Query: 1047 NNSNQNKGTLPPLLPTPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHK 868
            NN N N+ T PPLLPTP+ KP     KN  +K++++AE+QLRR+KGLCY C++K+SF HK
Sbjct: 255  NNPNPNRPTQPPLLPTPSTKPSLFTQKNNIVKNISSAEMQLRREKGLCYTCEDKWSFNHK 314

Query: 867  CLNKHKFMVLQISXXXXXXXXXDPQQNVLESQSSASE-EHHLSLNAMKGGTGTGTIRFSG 691
            C NKH  ++L I            Q N  +  S+  + E HLSLNA+KG +G GTI+F+G
Sbjct: 315  CPNKH--VMLLIVEDDSQIEPEIDQSNHTQIDSNHQDLELHLSLNALKGASGLGTIKFTG 372

Query: 690  RIGQIPIQSLLDGGSSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQ 511
            ++   P+Q L+DGGSS+NFLQPRIAQ LKL +EP P F+VL+GNG  + +EG +++L+V 
Sbjct: 373  QVSNTPLQILVDGGSSDNFLQPRIAQFLKLDIEPAPLFKVLVGNGNALTSEGIVKDLKVS 432

Query: 510  IQEQXXXXXXXXXXXVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQ 331
            +Q Q           VGADLILG+TWLATLGPHVADY    +KFF +  FIT +GDK   
Sbjct: 433  VQGQELKLPVYLLPIVGADLILGATWLATLGPHVADYQALTLKFFQQGHFITLQGDKSTT 492

Query: 330  PAEAQFHQLKRMNNTDAISECFTVQMVQP 244
            P +AQ H ++R++ T+AISE FT+Q   P
Sbjct: 493  PQQAQLHHMRRLHTTEAISEYFTIQRSDP 521



 Score = 70.5 bits (171), Expect(2) = 1e-84
 Identities = 33/46 (71%), Positives = 38/46 (82%)
 Frame = -2

Query: 140 GLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           GLPP R Q+H I L  GAKPVKV+PYRYP SQKEQIE MV+EML++
Sbjct: 555 GLPPQRLQDHVIPLEPGAKPVKVRPYRYPQSQKEQIEIMVKEMLEE 600


>gb|PNX94483.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1287

 Score =  269 bits (687), Expect(2) = 2e-84
 Identities = 143/300 (47%), Positives = 189/300 (63%), Gaps = 11/300 (3%)
 Frame = -1

Query: 1014 PLLPTPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQ 835
            P+L TP  +PM+Q  KN  IK ++ AE Q+RRDKGLCY+CD+KFS+THKC N+ + M+LQ
Sbjct: 264  PILNTPPTRPMSQFQKNPNIKRISPAERQVRRDKGLCYWCDDKFSYTHKCPNR-QLMLLQ 322

Query: 834  ISXXXXXXXXXDPQQNVLESQSSASE-----------EHHLSLNAMKGGTGTGTIRFSGR 688
                         ++NV+E  S +SE           EHHLS NAMKG +  G +RFSG 
Sbjct: 323  YDDNE--------EENVVEIPSDSSELAINTLETTQPEHHLSFNAMKGNSSMGILRFSGT 374

Query: 687  IGQIPIQSLLDGGSSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQI 508
            I  I +Q L+DGGSS+NFLQPRIA+ LKL +EP P F+VL+GNG+ M  EG I+NL + I
Sbjct: 375  IEHIQVQILIDGGSSDNFLQPRIARFLKLPIEPGPVFKVLVGNGEIMTAEGVIQNLALNI 434

Query: 507  QEQXXXXXXXXXXXVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQP 328
            Q              GAD+ILG++WLATLGPHVADYA+  +KFF   KF+T +G+   +P
Sbjct: 435  QGTELQVPVFLLPVAGADVILGASWLATLGPHVADYASLTLKFFINGKFVTLQGEATPRP 494

Query: 327  AEAQFHQLKRMNNTDAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTP 148
            A AQFH  KR++ TDAI+ECFT+Q ++                      LHTY+++F TP
Sbjct: 495  AAAQFHHFKRLHYTDAIAECFTIQWLKSHTDEDIFRDLPTNIEPEIAILLHTYKELFKTP 554



 Score = 74.3 bits (181), Expect(2) = 2e-84
 Identities = 34/45 (75%), Positives = 38/45 (84%)
 Frame = -2

Query: 137 LPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           LPP+R+ NH+I LIEG  PVKVKPYRYPHSQK QIE MVQ+ML Q
Sbjct: 557 LPPHRSHNHSIPLIEGHNPVKVKPYRYPHSQKNQIELMVQDMLQQ 601


>dbj|GAU37387.1| hypothetical protein TSUD_22610 [Trifolium subterraneum]
          Length = 1418

 Score =  259 bits (662), Expect(2) = 1e-83
 Identities = 148/348 (42%), Positives = 193/348 (55%), Gaps = 4/348 (1%)
 Frame = -1

Query: 1176 LFEEKYSSSNK-NKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQNKGTLPPLLPT 1000
            ++EEKY+S+ K  K     Y                    NK++N +        P+L T
Sbjct: 200  VYEEKYASNQKLQKNNTTNYSTNKPLY-------------NKSENTTRN----AAPILNT 242

Query: 999  PNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQISXXX 820
               +PM+Q  KN  IK ++ AEIQ+RRDKGLCY+CDEKFSFTHKC N+ + M+LQ     
Sbjct: 243  SPTRPMSQFQKNPNIKRISPAEIQIRRDKGLCYWCDEKFSFTHKCPNR-QLMLLQYDDKD 301

Query: 819  XXXXXXDPQQNV---LESQSSASEEHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGG 649
                     Q       S  +   EHHLSLNAMKG    G +RF+G I  I +Q L+DGG
Sbjct: 302  EDPVLETLTQTTPITTNSPDTNQPEHHLSLNAMKGTRNMGVLRFAGSIEHIEVQVLIDGG 361

Query: 648  SSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXX 469
            SS NFLQPRIA+ LKL +EP P F+VL+GNG+ M  E  I  L ++IQ            
Sbjct: 362  SSNNFLQPRIAKFLKLPIEPRPQFKVLVGNGEIMTAERVINKLPLEIQGHKLDVPVFLLP 421

Query: 468  XVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNN 289
              GAD+ILG++W ATLGPHVADYA+  + FF KDKF+T  G+    P  AQFH  KR+ N
Sbjct: 422  VAGADVILGASWFATLGPHVADYASLTLNFFLKDKFVTLTGEVVPIPTPAQFHHFKRLTN 481

Query: 288  TDAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTPT 145
            TD+I+EC+T+Q ++                      LHTY+ +F  PT
Sbjct: 482  TDSIAECYTIQWLKSNDTEDIFKDLPTNIEPKIAMLLHTYKDLFKPPT 529



 Score = 81.3 bits (199), Expect(2) = 1e-83
 Identities = 37/47 (78%), Positives = 39/47 (82%)
 Frame = -2

Query: 143 TGLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           T LPPNR  NHAI LI+GA PVKVKPYRYPH QK QIENM+QEML Q
Sbjct: 529 TALPPNRAHNHAIPLIDGASPVKVKPYRYPHCQKTQIENMIQEMLQQ 575


>dbj|GAU11620.1| hypothetical protein TSUD_346120 [Trifolium subterraneum]
          Length = 1479

 Score =  265 bits (678), Expect(2) = 3e-83
 Identities = 145/347 (41%), Positives = 202/347 (58%), Gaps = 4/347 (1%)
 Frame = -1

Query: 1176 LFEEKYSSSNKNK-QMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQNKGTLPPLLPT 1000
            ++EEKY+ ++K++ + Y+ Y                        N S        P+L T
Sbjct: 224  VYEEKYAMNSKSQTRNYSNYS-----------------TNKPLYNKSEIATRNSAPILNT 266

Query: 999  PNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQISXXX 820
            P  +PM+Q  KN  IK ++ AE+Q+RRDKGLCY+CDEKFSFTHKC N+ + M+L      
Sbjct: 267  PPTRPMSQYQKNPNIKRISPAEMQVRRDKGLCYWCDEKFSFTHKCPNR-QLMLLHYDDSD 325

Query: 819  XXXXXXDP---QQNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGG 649
                       +   ++S  + + +HHLSLNAMKG    G +RF+G I Q  +Q L+DGG
Sbjct: 326  EEQLVEPSITLEPKTIDSSITNTPDHHLSLNAMKGNNTMGVLRFTGAIEQFKVQVLIDGG 385

Query: 648  SSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXX 469
            SS+NFLQPRIA+ LKL +EP P+FRVL+GNG+ M  EG I+ L + IQ            
Sbjct: 386  SSDNFLQPRIAKFLKLPIEPGPTFRVLVGNGEIMTAEGVIQELPLDIQGHKIHIPVFLLP 445

Query: 468  XVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNN 289
             VGAD++LG++WLATLGPHVADYA+  +KFF + KF+T  G+ + +P  AQ HQ KR +N
Sbjct: 446  VVGADIVLGASWLATLGPHVADYASLTLKFFLEGKFVTLVGEHENRPVTAQLHQFKRFHN 505

Query: 288  TDAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTP 148
            TDAI+ECFT+Q ++                      LH Y ++F +P
Sbjct: 506  TDAIAECFTIQCIKTTEPADILHELPSNMEPELAILLHNYSKLFQSP 552



 Score = 73.6 bits (179), Expect(2) = 3e-83
 Identities = 33/45 (73%), Positives = 38/45 (84%)
 Frame = -2

Query: 137 LPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           LPP+R+ NH I LIEG+ PVKVKPYRYPHSQK QIE MV++ML Q
Sbjct: 555 LPPSRSHNHCIPLIEGSSPVKVKPYRYPHSQKAQIEIMVEDMLQQ 599


>dbj|GAU25204.1| hypothetical protein TSUD_151040 [Trifolium subterraneum]
          Length = 1512

 Score =  260 bits (665), Expect(2) = 1e-82
 Identities = 139/301 (46%), Positives = 182/301 (60%), Gaps = 2/301 (0%)
 Frame = -1

Query: 1044 NSNQNKGTLPPLLPTPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKC 865
            NS +N     P+L TP  +PM+Q  KN  IK ++ AE+Q+RRDKGLCY+CDEKFSFTHKC
Sbjct: 256  NSTRNSA---PILNTPPTRPMSQFQKNPNIKRISPAEMQIRRDKGLCYWCDEKFSFTHKC 312

Query: 864  LNKHKFMVLQISXXXXXXXXXDPQ--QNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSG 691
             N+ + M+LQ            P+   +   S  +   +HHLS+NAMKG +  G IRF G
Sbjct: 313  PNR-QLMLLQYDDNETQLFDGSPEPPDSPTNSLDTNIPDHHLSMNAMKGTSNMGVIRFVG 371

Query: 690  RIGQIPIQSLLDGGSSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQ 511
             I  I +Q L+DGGSS+NF+QPRIA+ LKL +EP P F+VL+GNG+ M  EG I+ L + 
Sbjct: 372  SIEHIEVQILIDGGSSDNFVQPRIAKFLKLPIEPAPVFKVLVGNGEIMNAEGVIKQLPID 431

Query: 510  IQEQXXXXXXXXXXXVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQ 331
            IQ              G D++LG++WLATLGPHVADYA+  +KFF   KF+T  G+  A+
Sbjct: 432  IQGHKLEVPAFLLPVAGVDVVLGASWLATLGPHVADYASLTLKFFLNGKFVTLVGEPLAR 491

Query: 330  PAEAQFHQLKRMNNTDAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDT 151
            P   QFH LKR  NT AI+ECF VQ ++                      LH Y++VF T
Sbjct: 492  PEPTQFHHLKRCCNTKAIAECFIVQRLKTTEATDIFKELPTNTEPEIAMLLHNYQEVFKT 551

Query: 150  P 148
            P
Sbjct: 552  P 552



 Score = 76.6 bits (187), Expect(2) = 1e-82
 Identities = 34/46 (73%), Positives = 39/46 (84%)
 Frame = -2

Query: 140 GLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           GLPP R  NH+I L+EG+ PVKVKPYRYPHSQK QIE+MVQ+ML Q
Sbjct: 554 GLPPTRAHNHSIPLLEGSNPVKVKPYRYPHSQKTQIEHMVQDMLQQ 599


>ref|XP_014620186.1| PREDICTED: uncharacterized protein LOC106795283 [Glycine max]
          Length = 495

 Score =  268 bits (686), Expect = 5e-82
 Identities = 138/270 (51%), Positives = 175/270 (64%)
 Frame = -1

Query: 1056 KADNNSNQNKGTLPPLLPTPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSF 877
            KAD  +   K    PLL TPN KP+NQ      IK ++ AE+Q+RRDKGL Y+CD+KFSF
Sbjct: 218  KADVPNTNPKANQSPLLRTPNSKPLNQTQNKPKIKYISQAEMQVRRDKGLSYWCDDKFSF 277

Query: 876  THKCLNKHKFMVLQISXXXXXXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRF 697
            + KC NK + M+LQ++          P    + +       HHLSLNAMKG  G GTIRF
Sbjct: 278  SLKCPNK-QLMMLQLTDDSDLNEEIKPPDIDIATAEMPRGAHHLSLNAMKGFHGVGTIRF 336

Query: 696  SGRIGQIPIQSLLDGGSSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQ 517
            +G IG I +Q L+DG +SE+FLQPRIA  LKL +EP P FRVL+GNGQ M+TEGW++ L 
Sbjct: 337  TGNIGNIRVQILVDGDNSESFLQPRIAMFLKLPIEPEPHFRVLVGNGQIMETEGWVKQLA 396

Query: 516  VQIQEQXXXXXXXXXXXVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKK 337
            V IQ Q            GADLILGS WLATLGPHVADYA   +KFF+   F+T +G+  
Sbjct: 397  VDIQGQKLLVPVYLLPVSGADLILGSPWLATLGPHVADYAALTLKFFYHGNFLTLQGEVS 456

Query: 336  AQPAEAQFHQLKRMNNTDAISECFTVQMVQ 247
            + P +A  HQLKR+++T AIS  F + M+Q
Sbjct: 457  STPTQAHLHQLKRLHDTSAISNSFAIHMMQ 486


>gb|KYP53387.1| Retrotransposon-derived protein PEG10 [Cajanus cajan]
          Length = 431

 Score =  266 bits (680), Expect = 7e-82
 Identities = 146/313 (46%), Positives = 184/313 (58%), Gaps = 2/313 (0%)
 Frame = -1

Query: 1176 LFEEKY--SSSNKNKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQNKGTLPPLLP 1003
            LFEEKY  SS+ KN                         +  K D   +  K  L PLLP
Sbjct: 98   LFEEKYILSSAPKNPSYQP---------RATTFYPNRHSSNPKPDIPHSLPKSNLSPLLP 148

Query: 1002 TPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQISXX 823
             P+ KP  Q  KN  +K ++ AE+Q+RR+KGLCYFCDEKF FTHKC N+   M+  I   
Sbjct: 149  NPSTKPFPQTHKN-QVKKISPAEMQIRREKGLCYFCDEKFPFTHKCPNRQMMMLQLIDDE 207

Query: 822  XXXXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGGSS 643
                   DP          ++ EHHLSLNAMKG  G GTI F+G I  I I+ L+DGGSS
Sbjct: 208  LLDSREPDPPDLPQPDTEVSNPEHHLSLNAMKGVGGVGTIEFTGHIEPISIKVLVDGGSS 267

Query: 642  ENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXXXV 463
            ++FLQPRIA  LKL +E +P F V +GNGQ M TEG I+ L + IQ              
Sbjct: 268  DSFLQPRIAHFLKLPIELVPGFPVFVGNGQSMTTEGVIQQLAMTIQGHQLVVPVYLLSVF 327

Query: 462  GADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNNTD 283
            GADL+LGS+WLATLGPH+ADYAT+ +KFF   KF+  +G+   QP +AQ H + RM  T 
Sbjct: 328  GADLVLGSSWLATLGPHIADYATSTLKFFQHGKFVVLQGEHPIQPQQAQLHHMHRMQQTQ 387

Query: 282  AISECFTVQMVQP 244
            AI+ECF++Q+VQP
Sbjct: 388  AIAECFSIQLVQP 400


>dbj|GAU27453.1| hypothetical protein TSUD_161390 [Trifolium subterraneum]
          Length = 1531

 Score =  259 bits (661), Expect(2) = 4e-81
 Identities = 144/347 (41%), Positives = 201/347 (57%), Gaps = 3/347 (0%)
 Frame = -1

Query: 1176 LFEEKYSSSNKNKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQNKG-TLPPLLPT 1000
            ++EEKY+++ K ++ Y                     + NK  NN  +N      P+L T
Sbjct: 226  VYEEKYTTTMKPQKPYT-----------------QTYSTNKPYNNKPENSTRNTAPILNT 268

Query: 999  PNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQI--SX 826
            P  +PM+Q  KN  +K ++ AE+QLRRDKGLCY+CD+KFSFTHKC N+ + M+LQ   S 
Sbjct: 269  PPTRPMSQFQKNPNVKRISPAEMQLRRDKGLCYWCDDKFSFTHKCPNR-QLMLLQYEDSE 327

Query: 825  XXXXXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGGS 646
                    DP        ++   + HLS++AMKG +  G +RF+G I  I +Q L+DGGS
Sbjct: 328  DQVLDEITDPPDPTTNGLTTNLPKLHLSMSAMKGSSHMGVLRFTGAIEHIQVQILIDGGS 387

Query: 645  SENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXXX 466
            S+NF+QPRIA+ LKL +EP P F+VL+GNG+ M  EG ++ L + +Q             
Sbjct: 388  SDNFVQPRIAKFLKLPIEPAPIFKVLVGNGEVMTAEGIVKQLPLDVQGHRLQVPVYLLPV 447

Query: 465  VGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNNT 286
             GAD+ILG++WL+TLGPHVADYA+  IKFF  DKF+T  G+  A+P  AQFH +KR ++T
Sbjct: 448  AGADVILGASWLSTLGPHVADYASLTIKFFLHDKFVTLVGEPIARPEPAQFHHMKRFHHT 507

Query: 285  DAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTPT 145
            DAI ECF +Q  +                      L+ Y+ VF TPT
Sbjct: 508  DAIEECFAIQWFKDTEAADIFKELPTNTDPEIAMLLYNYQAVFKTPT 554



 Score = 73.2 bits (178), Expect(2) = 4e-81
 Identities = 33/45 (73%), Positives = 36/45 (80%)
 Frame = -2

Query: 143 TGLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEML 9
           T LPP R  NHAI L+EG  P+KVKPYRYPHSQK QIE MVQ+ML
Sbjct: 554 TMLPPTRAHNHAIPLLEGTNPIKVKPYRYPHSQKTQIETMVQDML 598


>ref|XP_020206869.1| uncharacterized protein LOC109791920 [Cajanus cajan]
          Length = 641

 Score =  264 bits (674), Expect(2) = 1e-80
 Identities = 145/344 (42%), Positives = 192/344 (55%)
 Frame = -1

Query: 1176 LFEEKYSSSNKNKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQNKGTLPPLLPTP 997
            LFEEKY    K++  Y  Y                     K  N+        PPLLPTP
Sbjct: 242  LFEEKYIPKPKSQPPYKPYTQTTPYTYP------------KLQNSQ-------PPLLPTP 282

Query: 996  NIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQISXXXX 817
             +KP  Q      IK MT  E+Q+RR+KGLCY CDE+FS  H+C NKH +M+LQ+     
Sbjct: 283  TVKPFAQPTN--PIKKMTPTEMQIRREKGLCYTCDERFSPNHRCPNKH-YMILQVDEEQC 339

Query: 816  XXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGGSSEN 637
                   Q + ++S    + +HHLS NA+KG +G GTI+F GRI   P+  LLD GSS+N
Sbjct: 340  THEDNVIQTDSVDSLPQDATDHHLSFNALKGSSGVGTIKFQGRINGCPVNILLDSGSSDN 399

Query: 636  FLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXXXVGA 457
            FLQPRIA  LKL +EP P+F+V++GNG  M  EG+I +LQV++Q              GA
Sbjct: 400  FLQPRIAHFLKLPIEPAPNFQVMVGNGNSMSAEGFISDLQVEVQGYTLQFPVYLLPVAGA 459

Query: 456  DLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNNTDAI 277
            DL+LG+ WLATLGPHVADY+    KF    KFIT  G+K+  P  AQFH +KR++ T AI
Sbjct: 460  DLVLGAAWLATLGPHVADYSALAFKFLLDGKFITLYGEKQNLPQLAQFHHIKRLHQTHAI 519

Query: 276  SECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTPT 145
            +E F++Q+ +                      LHTY++VF  P+
Sbjct: 520  AEVFSIQLQESDNKSEFCEEIPQNLEPALVLLLHTYKEVFTKPS 563



 Score = 66.2 bits (160), Expect(2) = 1e-80
 Identities = 30/45 (66%), Positives = 36/45 (80%)
 Frame = -2

Query: 143 TGLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEML 9
           + LPP R Q+H+I LIEG+ PVKV+PYRY HSQK QIE MV +ML
Sbjct: 563 SSLPPQRFQDHSIPLIEGSNPVKVRPYRYAHSQKAQIEKMVADML 607


>gb|KYP57088.1| Retrovirus-related Pol polyprotein from transposon 17.6 [Cajanus
            cajan]
          Length = 620

 Score =  251 bits (640), Expect(2) = 2e-79
 Identities = 134/292 (45%), Positives = 175/292 (59%)
 Frame = -1

Query: 1023 TLPPLLPTPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFM 844
            +LPPLLPTP     N    +  +K M+ AE+QLRR+KGLC+ CDEK+S  HKC NK +++
Sbjct: 49   SLPPLLPTPT----NPTFNSTNVKKMSPAEVQLRREKGLCFTCDEKYSPAHKCPNK-QYL 103

Query: 843  VLQISXXXXXXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSGRIGQIPIQS 664
             LQI           P     +     S EHHLS NA+KG +G GT+ F G I  I IQ 
Sbjct: 104  FLQIIEDETEILEPKPPDTNEQLDIVNSLEHHLSFNALKGSSGVGTMCFKGSINGIIIQV 163

Query: 663  LLDGGSSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXX 484
            LLD GSS+NFLQPR+A CLKL +EP P+F+VL+GNG  +  EG++  L+V IQ       
Sbjct: 164  LLDSGSSDNFLQPRLASCLKLPIEPAPNFQVLVGNGNSLIAEGFVSKLEVLIQGHTLQLP 223

Query: 483  XXXXXXVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQL 304
                   GADLILG+ WLATLGPH++DY T  +KF+   +F+TF G+K    + AQF+QL
Sbjct: 224  VYLLPVAGADLILGAAWLATLGPHISDYNTLTLKFYLGTQFVTFHGEKSTTASPAQFNQL 283

Query: 303  KRMNNTDAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTP 148
            +RM + +AI+E FT+Q+  PV                    LHTY QVF  P
Sbjct: 284  RRMYHMNAIAELFTLQVEPPVCPQDDWLDFPGDMEPELAILLHTYEQVFAVP 335



 Score = 75.5 bits (184), Expect(2) = 2e-79
 Identities = 33/46 (71%), Positives = 39/46 (84%)
 Frame = -2

Query: 140 GLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           GLPPNR+QNHAI L+    PVKV+PYRYPHSQK QIE M+Q+MLD+
Sbjct: 337 GLPPNRSQNHAIPLMPSTGPVKVRPYRYPHSQKLQIEKMIQDMLDE 382


>gb|KYP45652.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1210

 Score =  275 bits (703), Expect = 2e-79
 Identities = 139/271 (51%), Positives = 177/271 (65%)
 Frame = -1

Query: 1056 KADNNSNQNKGTLPPLLPTPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSF 877
            K D   +  K  LPPLLP P+ KP +Q  +N  +K ++ AE+Q+RR+KGLCYFCDEKFSF
Sbjct: 275  KPDIPHSLPKSNLPPLLPNPSTKPFSQTYQN-QVKKISPAEMQIRREKGLCYFCDEKFSF 333

Query: 876  THKCLNKHKFMVLQISXXXXXXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRF 697
             HKC N+H  M+  I          DP           + EHHLSLNAMKG  G GTI F
Sbjct: 334  NHKCPNRHMMMLQLIDDELVDSREPDPPDLPQPDIEVGNPEHHLSLNAMKGVGGVGTIGF 393

Query: 696  SGRIGQIPIQSLLDGGSSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQ 517
            +G IG I I+ L+DGGSS++FLQPRIA  LKL +E +  F+V +GNGQ M TEG I+ L 
Sbjct: 394  TGHIGPIAIKVLVDGGSSDSFLQPRIAHFLKLPIELVRGFQVFVGNGQSMTTEGVIQQLA 453

Query: 516  VQIQEQXXXXXXXXXXXVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKK 337
            V IQ              GADL+LGS+WLATLGPH+ADYAT+ +KFF  DKF+  +G+  
Sbjct: 454  VTIQGHQLVVPVYLLPVSGADLVLGSSWLATLGPHIADYATSTLKFFQHDKFVVLQGEYP 513

Query: 336  AQPAEAQFHQLKRMNNTDAISECFTVQMVQP 244
             QP +AQ H ++RM  T AI+ECF++Q+VQP
Sbjct: 514  IQPQQAQLHHMRRMQQTQAIAECFSIQLVQP 544


>dbj|GAU19157.1| hypothetical protein TSUD_79800 [Trifolium subterraneum]
          Length = 1500

 Score =  275 bits (704), Expect = 4e-79
 Identities = 151/347 (43%), Positives = 204/347 (58%), Gaps = 3/347 (0%)
 Frame = -1

Query: 1176 LFEEKYSSSNKNKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQNKG-TLPPLLPT 1000
            ++EEKYSS  K+++ Y+                      NK + N N        P+L T
Sbjct: 225  VYEEKYSSCLKSQKNYSNSQLT-----------------NKPNFNKNDTTTRNAAPVLNT 267

Query: 999  PNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQISXXX 820
            P  +PM+Q  KN  IK ++ AE+QLRRDKGLCY+CDEKFSFTHKC N+   ++       
Sbjct: 268  PPTRPMSQYQKNPNIKRISPAEMQLRRDKGLCYWCDEKFSFTHKCPNRQLMLLHYDDNDE 327

Query: 819  XXXXXXDPQQNVLESQSSASE--EHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGGS 646
                    QQ+ + + S  +   EHHLS NA+KG +  G IRF+G IG++ +Q L+DGGS
Sbjct: 328  DQVLDTLTQQDEITTDSPTTNLPEHHLSFNALKGNSNMGVIRFAGSIGKLGVQILIDGGS 387

Query: 645  SENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXXX 466
            S+NFLQPR+A+ LKL VEP P F VL+GNG+ M  EG I+ L V+IQ             
Sbjct: 388  SDNFLQPRVAKFLKLPVEPGPQFNVLVGNGEIMSAEGTIQKLPVEIQGHMIEIPVFLLPI 447

Query: 465  VGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNNT 286
             GAD+ILG++WLATLGPHVADYA+  +KFF KDKF+T  GD   + ++AQ H L+RM  T
Sbjct: 448  AGADVILGASWLATLGPHVADYASLTLKFFLKDKFVTLIGDPIPRSSQAQVHHLRRMATT 507

Query: 285  DAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTPT 145
            +AI+ECFTVQ ++                      L+TY+ +F +PT
Sbjct: 508  NAIAECFTVQCIKSTDGNDIFKDLPTNTDPEIAMLLYTYKSLFQSPT 554



 Score = 78.6 bits (192), Expect = 8e-12
 Identities = 37/56 (66%), Positives = 42/56 (75%)
 Frame = -2

Query: 170 TDRCLIHLLTGLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLDQ 3
           T + L    T LPP R  NH+I L+EG+ PVKVKPYRYPHSQKEQIE M+QEML Q
Sbjct: 545 TYKSLFQSPTTLPPTRPHNHSIPLLEGSAPVKVKPYRYPHSQKEQIETMIQEMLQQ 600


>ref|XP_014634047.1| PREDICTED: uncharacterized protein LOC106799639 [Glycine max]
          Length = 600

 Score =  264 bits (674), Expect = 4e-79
 Identities = 144/309 (46%), Positives = 188/309 (60%), Gaps = 4/309 (1%)
 Frame = -1

Query: 1059 NKADNNSNQNKGTLPPLLPTPNIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFS 880
            NK +N    N   L   LPT   +PMN N +N  IK ++ AE+QLRR+KGLCY+CD++FS
Sbjct: 266  NKPENTQKANHTPLLQTLPT---RPMNPNQRNPNIKRISPAEMQLRREKGLCYWCDDQFS 322

Query: 879  FTHKCLNKHKFMVLQISXXXXXXXXXDPQQNVLESQSS----ASEEHHLSLNAMKGGTGT 712
             THKC N+ + M+LQ            P++  L+   +     + +HHLSLNAMKG    
Sbjct: 323  LTHKCPNR-QVMMLQFDDSEKHIEPE-PEKAQLDMTCNEPDPTTNDHHLSLNAMKGTNSM 380

Query: 711  GTIRFSGRIGQIPIQSLLDGGSSENFLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGW 532
            G +RF+G+IGQI +Q L+DGGSS+NFLQPRIA+ LKL VEP P F+VL+GN Q M  EG 
Sbjct: 381  GILRFTGQIGQISVQVLIDGGSSDNFLQPRIAEFLKLPVEPGPCFKVLVGNVQTMTAEGV 440

Query: 531  IENLQVQIQEQXXXXXXXXXXXVGADLILGSTWLATLGPHVADYATAVIKFFHKDKFITF 352
            + NL + +Q              GAD+ILGS+WLATLGPHVADYA   +KF +K KF+T 
Sbjct: 441  VPNLSITLQGHELIVPVFLLPVAGADIILGSSWLATLGPHVADYAALTLKFLYKGKFVTL 500

Query: 351  KGDKKAQPAEAQFHQLKRMNNTDAISECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHT 172
            +G++   P  AQF+  +RM NTDAI+E F VQ++Q                      LHT
Sbjct: 501  QGERGTSPKLAQFNHCRRMQNTDAIAETFAVQLLQFHTEEDILKELPQDIAPEIALLLHT 560

Query: 171  YRQVFDTPT 145
            Y  VF TPT
Sbjct: 561  YSSVFQTPT 569



 Score = 60.1 bits (144), Expect = 5e-06
 Identities = 26/32 (81%), Positives = 28/32 (87%)
 Frame = -2

Query: 143 TGLPPNRNQNHAISLIEGAKPVKVKPYRYPHS 48
           T LPP R+QNHAI L+EG KPVKVKPYRYPHS
Sbjct: 569 TALPPPRSQNHAIPLMEGTKPVKVKPYRYPHS 600


>gb|KYP39589.1| Transposon Ty3-G Gag-Pol polyprotein [Cajanus cajan]
          Length = 1510

 Score =  247 bits (630), Expect(2) = 1e-77
 Identities = 139/344 (40%), Positives = 192/344 (55%)
 Frame = -1

Query: 1176 LFEEKYSSSNKNKQMYAGYXXXXXXXXXXXXXXXXXVAGNKADNNSNQNKGTLPPLLPTP 997
            LFEEKYS   +++Q +                     AGN++  N  Q      PLL TP
Sbjct: 209  LFEEKYSF--RSRQSFV-------------TRNTSHSAGNQSYTNPAQQ-----PLLNTP 248

Query: 996  NIKPMNQNAKNLAIKSMTAAEIQLRRDKGLCYFCDEKFSFTHKCLNKHKFMVLQISXXXX 817
            NIKP     +N A++ M+ AE+Q RR++GLC+ CDE+FS  H+C NK ++++LQ+     
Sbjct: 249  NIKPAAFPNRNTAVRKMSPAEMQSRRERGLCFTCDERFSANHRCPNK-QYLLLQVEDEEE 307

Query: 816  XXXXXDPQQNVLESQSSASEEHHLSLNAMKGGTGTGTIRFSGRIGQIPIQSLLDGGSSEN 637
                 +     LE +     EHHLS NA+KG    GT+RF+G I    +  LLD GSS+N
Sbjct: 308  LEETTNVDSTALEDEL----EHHLSFNALKGVATVGTMRFTGSIAGKEVHILLDSGSSDN 363

Query: 636  FLQPRIAQCLKLAVEPLPSFRVLIGNGQYMQTEGWIENLQVQIQEQXXXXXXXXXXXVGA 457
            FLQP++A  LKL +EP    +V++GNG  + TEG I NLQVQ+Q Q            GA
Sbjct: 364  FLQPKLAHYLKLPIEPAAGLQVMVGNGSSLSTEGKILNLQVQVQGQVLQLPVYLLSVSGA 423

Query: 456  DLILGSTWLATLGPHVADYATAVIKFFHKDKFITFKGDKKAQPAEAQFHQLKRMNNTDAI 277
            DL+LG+ WLATLGPH+ADY +  IKF+   K +T +G+K    A +QFH LKR+N+T  I
Sbjct: 424  DLVLGAAWLATLGPHIADYGSLTIKFYKDKKLVTLQGEKSRPAAMSQFHHLKRLNHTQGI 483

Query: 276  SECFTVQMVQPVVXXXXXXXXXXXXXXXXXXXLHTYRQVFDTPT 145
            +E +T+Q++   V                   LH YRQ+F  PT
Sbjct: 484  AEVYTLQLLSSFVETDQWKDIPDNVDPEIALLLHYYRQIFAKPT 527



 Score = 73.6 bits (179), Expect(2) = 1e-77
 Identities = 31/46 (67%), Positives = 41/46 (89%)
 Frame = -2

Query: 143 TGLPPNRNQNHAISLIEGAKPVKVKPYRYPHSQKEQIENMVQEMLD 6
           TGLPP R+QNH I L++G+ PVKV+PY+YPHSQK+QIE M++EML+
Sbjct: 527 TGLPPPRSQNHRIPLLQGSGPVKVRPYKYPHSQKQQIELMIKEMLE 572

