BLASTX nr result

ID: Astragalus22_contig00030707 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00030707
         (983 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP69348.1| Retrovirus-related Pol polyprotein from transposo...   281   5e-90
gb|KYP66912.1| Retrovirus-related Pol polyprotein from transposo...   284   2e-85
gb|PNX67781.1| hypothetical protein L195_g055811, partial [Trifo...   258   5e-82
dbj|GAU44851.1| hypothetical protein TSUD_112250 [Trifolium subt...   278   1e-81
gb|PNX98468.1| putative copia-type polyprotein, partial [Trifoli...   275   4e-80
dbj|GAU51473.1| hypothetical protein TSUD_95880 [Trifolium subte...   274   5e-80
gb|KYP48234.1| Retrovirus-related Pol polyprotein from transposo...   271   9e-80
gb|PNX93875.1| copia-type polyprotein [Trifolium pratense]            273   2e-79
dbj|GAU38708.1| hypothetical protein TSUD_396360 [Trifolium subt...   269   2e-79
dbj|GAU28814.1| hypothetical protein TSUD_21510 [Trifolium subte...   268   9e-79
dbj|GAU37826.1| hypothetical protein TSUD_63880 [Trifolium subte...   266   5e-78
dbj|GAU26253.1| hypothetical protein TSUD_224440 [Trifolium subt...   269   6e-78
dbj|GAU50018.1| hypothetical protein TSUD_331710 [Trifolium subt...   267   9e-78
dbj|GAU51495.1| hypothetical protein TSUD_413780, partial [Trifo...   265   1e-77
dbj|GAU44417.1| hypothetical protein TSUD_100640 [Trifolium subt...   266   4e-77
gb|KYP68287.1| Retrovirus-related Pol polyprotein from transposo...   266   8e-77
dbj|GAU30980.1| hypothetical protein TSUD_104940 [Trifolium subt...   266   1e-76
gb|KYP61818.1| Retrovirus-related Pol polyprotein from transposo...   265   1e-76
gb|KYP31826.1| Retrovirus-related Pol polyprotein from transposo...   265   2e-76
dbj|GAU46968.1| hypothetical protein TSUD_143100 [Trifolium subt...   264   2e-76

>gb|KYP69348.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 342

 Score =  281 bits (720), Expect = 5e-90
 Identities = 135/208 (64%), Positives = 170/208 (81%)
 Frame = -3

Query: 978 GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
           GEV KYK RLV +GF Q+AG+DY +VFAPVAR++T+R+VVA+ SL+ W M+Q+DVKS+FL
Sbjct: 116 GEVAKYKARLVAKGFLQKAGIDYGDVFAPVARIETVRLVVALASLKGWPMYQLDVKSAFL 175

Query: 798 NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
           NG IEEEVY+ QPPGF+I+G EEKVY+LRK LYGLKQ PRAWNKRIDSFL Q+ F +CT 
Sbjct: 176 NGEIEEEVYVAQPPGFKIKGQEEKVYKLRKDLYGLKQAPRAWNKRIDSFLHQMKFIKCTY 235

Query: 618 EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
           EHGVY+K++ ++++LI  LYVDDLLV GS +  +  F   M  EFEMTDLG L YFLGIE
Sbjct: 236 EHGVYVKSENNSDLLIACLYVDDLLVTGSNQGMVVDFKRSMMEEFEMTDLGHLSYFLGIE 295

Query: 438 FKRNDEGIVMSEHKYATDILKRFEMLNC 355
           FK+ ++GIVM + KYATDILK+F + +C
Sbjct: 296 FKKTEKGIVMHQCKYATDILKKFNLYDC 323


>gb|KYP66912.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 861

 Score =  284 bits (727), Expect = 2e-85
 Identities = 138/208 (66%), Positives = 166/208 (79%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            GEV KYK RLV +GF QR GLDY+EVFAPVAR++TIR+VV++ S   W +HQMDVKS+FL
Sbjct: 402  GEVAKYKARLVAKGFLQRQGLDYDEVFAPVARLETIRLVVSMASYHCWPIHQMDVKSAFL 461

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NGP+EEEV++ QPPGFEI+G +++VYRL KALYGLKQ PRAWNKRIDSFL Q+GF +CT+
Sbjct: 462  NGPLEEEVFVSQPPGFEIKGKKKQVYRLHKALYGLKQAPRAWNKRIDSFLLQLGFVKCTT 521

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            E+GVY K    T++LI+ LYVDDLLV GSK   I  F   MK EFEM DLG+L YFLGIE
Sbjct: 522  EYGVYTKGLNMTDLLIVCLYVDDLLVTGSKAKEIDAFKQDMKAEFEMNDLGKLSYFLGIE 581

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNC 355
            FK    GI M + KY TD+L+RF+MLNC
Sbjct: 582  FKETKGGIFMHQSKYTTDVLERFQMLNC 609


>gb|PNX67781.1| hypothetical protein L195_g055811, partial [Trifolium pratense]
          Length = 240

 Score =  258 bits (658), Expect = 5e-82
 Identities = 120/201 (59%), Positives = 153/201 (76%)
 Frame = -3

Query: 978 GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
           GE+ ++K RLVV+GF Q+ G+D+ EVFAPVARM+TIR+V A+    +W MHQMDVK +FL
Sbjct: 40  GEITRHKARLVVKGFLQKEGIDFNEVFAPVARMETIRLVTALTHYNKWSMHQMDVKCAFL 99

Query: 798 NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
           NGP+EEEVY+ QPPGF  + +E KVY+L KALYGLKQ PRAWNKRID FL  IGF +C +
Sbjct: 100 NGPLEEEVYVVQPPGFIDKENESKVYKLNKALYGLKQAPRAWNKRIDRFLSDIGFSKCIT 159

Query: 618 EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
           EHGVY+     ++ +I+ LYVDDLL+ GS EA IS F   M  EFEM DLG + YFLGIE
Sbjct: 160 EHGVYVMKSATSDTIILCLYVDDLLITGSNEAQISKFKVDMMKEFEMVDLGHISYFLGIE 219

Query: 438 FKRNDEGIVMSEHKYATDILK 376
           F++  EG+++ + KYA +ILK
Sbjct: 220 FQKTSEGLILHQRKYANEILK 240


>dbj|GAU44851.1| hypothetical protein TSUD_112250 [Trifolium subterraneum]
          Length = 1133

 Score =  278 bits (710), Expect = 1e-81
 Identities = 130/209 (62%), Positives = 164/209 (78%)
 Frame = -3

Query: 981  HGEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSF 802
            +GE+ KYK RLV RGF Q+AG+D+ EV+APVAR++TIRIVVAI +   WKMHQ+DV+S+F
Sbjct: 759  NGEIAKYKARLVSRGFLQKAGIDFNEVYAPVARLETIRIVVAIAAYNGWKMHQLDVESAF 818

Query: 801  LNGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCT 622
            LNGP+EEEVY+ QPPGFE++G E+KVYRLRKALYGLKQ PRAWNKRID FL +IGF +C 
Sbjct: 819  LNGPLEEEVYVKQPPGFEVKGQEQKVYRLRKALYGLKQAPRAWNKRIDGFLIKIGFTKCV 878

Query: 621  SEHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGI 442
            SEHG+Y+K     + +I  LYVDDLL+ G+ E  I  F   +  EFEM+DLG L YFLG+
Sbjct: 879  SEHGLYVKGSSKLDHIIRCLYVDDLLITGANEKEILKFKTSLMQEFEMSDLGNLSYFLGM 938

Query: 441  EFKRNDEGIVMSEHKYATDILKRFEMLNC 355
            EFK   +G+ + + KYA DIL RF+M+NC
Sbjct: 939  EFKHTKKGVFLHQKKYAEDILNRFKMVNC 967


>gb|PNX98468.1| putative copia-type polyprotein, partial [Trifolium pratense]
          Length = 1267

 Score =  275 bits (702), Expect = 4e-80
 Identities = 129/209 (61%), Positives = 165/209 (78%)
 Frame = -3

Query: 981  HGEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSF 802
            +GE+ KYK RLV +GF Q+ GLDY EVFAPVAR++TIR+VVA+ S R W MHQ+DVKS+F
Sbjct: 883  NGEIAKYKARLVAKGFLQKHGLDYNEVFAPVARLETIRLVVAVASYRGWTMHQLDVKSAF 942

Query: 801  LNGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCT 622
            LNGP++EEVY+ QPPGFEI+G E KV+RLRKALYGLKQ PRAWNKRIDSFL ++ F +CT
Sbjct: 943  LNGPLDEEVYVKQPPGFEIKGQEHKVFRLRKALYGLKQAPRAWNKRIDSFLIKLKFTKCT 1002

Query: 621  SEHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGI 442
            SE+GVY+K   + +++I+ LYVDDLL+ GS +  +  F   +  EFEM+DLGEL YFLG+
Sbjct: 1003 SENGVYVKGTSNEDLIILCLYVDDLLITGSNKNVLEKFKVDIMKEFEMSDLGELSYFLGM 1062

Query: 441  EFKRNDEGIVMSEHKYATDILKRFEMLNC 355
            EF +  +G  + + KYA DIL+RF M NC
Sbjct: 1063 EFVKTSKGFFLHQKKYAEDILRRFHMNNC 1091


>dbj|GAU51473.1| hypothetical protein TSUD_95880 [Trifolium subterraneum]
          Length = 1242

 Score =  274 bits (701), Expect = 5e-80
 Identities = 130/209 (62%), Positives = 162/209 (77%)
 Frame = -3

Query: 981  HGEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSF 802
            +GE+ K K RLV RGF Q+AG+D+ EV+APVAR++TIRIVVAI +   WKMHQ+DVKS+F
Sbjct: 788  NGEIAKCKARLVARGFLQKAGIDFNEVYAPVARLETIRIVVAIAAYNGWKMHQLDVKSAF 847

Query: 801  LNGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCT 622
            LNGP+EEEVY+ QPPGFE++G E+KVYRLRKALYGLKQ PRAWNKRID FL +IGF +C 
Sbjct: 848  LNGPLEEEVYVKQPPGFEVKGQEQKVYRLRKALYGLKQAPRAWNKRIDGFLIKIGFTKCV 907

Query: 621  SEHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGI 442
            SEHGVY+K     + +I+ LYVDDLL+ G+ E  I  F   +  EFEM DLG L YFLG+
Sbjct: 908  SEHGVYVKGLSKLDHIILCLYVDDLLITGANEKEIVKFKTSLMQEFEMYDLGNLSYFLGM 967

Query: 441  EFKRNDEGIVMSEHKYATDILKRFEMLNC 355
            EFK   + + + + KYA DIL RF+M+NC
Sbjct: 968  EFKHTKKDVFLHQKKYAEDILNRFKMVNC 996


>gb|KYP48234.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1029

 Score =  271 bits (694), Expect = 9e-80
 Identities = 131/203 (64%), Positives = 166/203 (81%)
 Frame = -3

Query: 981  HGEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSF 802
            +GEV KYK RLV +GF Q+AG+DY +VFAPVAR++T+R+VVA+ SL+ W M+Q+DVKS+F
Sbjct: 826  NGEVAKYKARLVAKGFLQKAGIDYGDVFAPVARIETVRLVVALASLKGWPMYQLDVKSAF 885

Query: 801  LNGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCT 622
            LNG IEEEVY+ QPP F+I+G EEKVY+LRKALYGLKQ PRAWNKRIDSFL Q+ F +CT
Sbjct: 886  LNGEIEEEVYVAQPPSFKIKGQEEKVYKLRKALYGLKQAPRAWNKRIDSFLHQMKFIKCT 945

Query: 621  SEHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGI 442
            SEHGVY+K++ ++++LI  LYVDDLLV GS +  +  F   M  EFEMTDLG L YFLGI
Sbjct: 946  SEHGVYVKSENNSDLLIACLYVDDLLVTGSNQGMVVDFKRSMMEEFEMTDLGHLSYFLGI 1005

Query: 441  EFKRNDEGIVMSEHKYATDILKR 373
            EFK+ ++ IVM + KYATDI ++
Sbjct: 1006 EFKKTEKEIVMHQCKYATDIFEK 1028


>gb|PNX93875.1| copia-type polyprotein [Trifolium pratense]
          Length = 1350

 Score =  273 bits (698), Expect = 2e-79
 Identities = 124/208 (59%), Positives = 163/208 (78%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            GE+ ++K RLV +GF QR G+DY EVFAPV RM+TIR+V AI ++  W M+QMDVKS+FL
Sbjct: 891  GEITRHKARLVAKGFLQREGIDYGEVFAPVTRMETIRLVTAIANINDWPMYQMDVKSAFL 950

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NGPI+EEVY+ QPPGF+++  E KVYRL+KALYGLKQ PRAWNKR+D FL +IGF++C +
Sbjct: 951  NGPIDEEVYVAQPPGFKVKNQESKVYRLKKALYGLKQAPRAWNKRMDKFLIEIGFEKCVT 1010

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            EHGVY+K      ++++ LYVDDLL+ GS ++ I  F   +K EFEMTDLG + YFLGIE
Sbjct: 1011 EHGVYVKKSDTKGIIVMCLYVDDLLITGSNDSYIGEFKSDLKKEFEMTDLGHMTYFLGIE 1070

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNC 355
            F R  +GI+M + KYA++ILK+F+M  C
Sbjct: 1071 FVRTKQGILMHQSKYASEILKKFDMDKC 1098


>dbj|GAU38708.1| hypothetical protein TSUD_396360 [Trifolium subterraneum]
          Length = 920

 Score =  269 bits (687), Expect = 2e-79
 Identities = 126/209 (60%), Positives = 164/209 (78%), Gaps = 1/209 (0%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            GE+ ++K RLV +GF QR G+DYEEV+APVAR++TIR+VVA+ +   W +HQMDVK +FL
Sbjct: 460  GEITRHKARLVAKGFLQREGIDYEEVYAPVARIETIRLVVAMANSNNWSIHQMDVKCAFL 519

Query: 798  -NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCT 622
             NGP+ EEV++ QPPGFE++G   KVY+L KALYGLKQ PRAWNKRID +L QIGF +C 
Sbjct: 520  KNGPLSEEVFVKQPPGFEVKGQTNKVYKLHKALYGLKQAPRAWNKRIDGYLSQIGFIKCV 579

Query: 621  SEHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGI 442
            +EHGVY++ D +  V+I+ LYVDDLL+ GS E  I+ F  QM  EFEMTD+G L YFLGI
Sbjct: 580  TEHGVYVRKDKNKGVIILCLYVDDLLITGSNEEYIADFKKQMMREFEMTDIGHLSYFLGI 639

Query: 441  EFKRNDEGIVMSEHKYATDILKRFEMLNC 355
            EF R   G++M + +YA++ILKRF+M+NC
Sbjct: 640  EFARCARGLMMHQKRYASEILKRFDMVNC 668


>dbj|GAU28814.1| hypothetical protein TSUD_21510 [Trifolium subterraneum]
          Length = 949

 Score =  268 bits (684), Expect = 9e-79
 Identities = 126/208 (60%), Positives = 158/208 (75%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            GE+ ++K RLVV+GF Q+ G+D+ EVFAPVARM+TIR+V A+     W MHQMDVK +FL
Sbjct: 491  GEITRHKARLVVKGFLQKEGIDFNEVFAPVARMETIRLVTALAHHNNWSMHQMDVKCAFL 550

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NGP++EEVY+ QPPGF  +  E KVY+L KALYGLKQ PRAWNKRID FL  I F++C +
Sbjct: 551  NGPLDEEVYVVQPPGFTSKEDEFKVYKLHKALYGLKQAPRAWNKRIDKFLGDIDFRKCVT 610

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            EHGVY+KN  +   +I+ LYVDDLL+ GS EA I  F   M  EFEMTDLG + YFLGIE
Sbjct: 611  EHGVYVKNCAEKGTIILCLYVDDLLITGSNEAHIREFKVDMMREFEMTDLGHISYFLGIE 670

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNC 355
            F+R  EG+++ + KYA++ILKRFEM  C
Sbjct: 671  FQRTSEGLILHQKKYASEILKRFEMDQC 698


>dbj|GAU37826.1| hypothetical protein TSUD_63880 [Trifolium subterraneum]
          Length = 1000

 Score =  266 bits (681), Expect = 5e-78
 Identities = 126/208 (60%), Positives = 158/208 (75%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            GE+ ++K RLVV+GF Q+ G+++ EVFAPVARM+TIR+V A+     W MHQMDVK +FL
Sbjct: 542  GEITRHKARLVVKGFLQKEGINFNEVFAPVARMETIRLVTALAHYNGWSMHQMDVKCAFL 601

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NG ++EE Y+ QPPGF I+  E KVY+L KALYGLKQ PRAWNKRID FL  IGF +C +
Sbjct: 602  NGLLDEEAYVTQPPGFIIKEDESKVYKLDKALYGLKQAPRAWNKRIDKFLSDIGFNKCVT 661

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            EHGVY+KN V+   +I+ LYVDDLL+ GS EA I  F   M  EFEMTDLG + YFLGIE
Sbjct: 662  EHGVYVKNCVEKGTIILCLYVDDLLITGSNEAHIREFKVDMMREFEMTDLGHISYFLGIE 721

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNC 355
            F+R  EG+++ + KYA++ILKRFEM  C
Sbjct: 722  FQRTSEGLMLHQKKYASEILKRFEMDQC 749


>dbj|GAU26253.1| hypothetical protein TSUD_224440 [Trifolium subterraneum]
          Length = 1312

 Score =  269 bits (687), Expect = 6e-78
 Identities = 126/208 (60%), Positives = 159/208 (76%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            GE+ ++K RLVV+GF Q+ G+D+ EVFAPVARM+TIR+V A+     W MHQMDVK +FL
Sbjct: 901  GEITRHKARLVVKGFLQKEGIDFNEVFAPVARMETIRLVTALAHHNNWSMHQMDVKCAFL 960

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NGP++EEVY+ QPPGF  +  E KVY+L KALYGLKQ PRAWNKRID FL  IGF++C +
Sbjct: 961  NGPLDEEVYVVQPPGFTSKEDEFKVYKLHKALYGLKQAPRAWNKRIDKFLGDIGFRKCVT 1020

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            EHGVY+KN  +   +I+ LYVDDLL+ GS EA I  F   M  EFEMTDLG + YFLGIE
Sbjct: 1021 EHGVYVKNCAEKGTIILCLYVDDLLITGSNEAHIREFKVDMMREFEMTDLGHISYFLGIE 1080

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNC 355
            F+R  +G+++ + KYA++ILKRFEM  C
Sbjct: 1081 FQRTSKGLILHQKKYASEILKRFEMDQC 1108


>dbj|GAU50018.1| hypothetical protein TSUD_331710 [Trifolium subterraneum]
          Length = 1150

 Score =  267 bits (683), Expect = 9e-78
 Identities = 126/208 (60%), Positives = 157/208 (75%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            GE+  +K RLVV+GF Q+ G+D+ EVFAPVARM+TIR+V A+     W MHQMDVK +FL
Sbjct: 692  GEITTHKARLVVKGFLQKEGIDFNEVFAPVARMETIRLVTALAHHNNWLMHQMDVKCAFL 751

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NGP++EEVY+ QPPGF  +  E KVY+L KALYG+KQ PRAWNKRID FL  IGF +C +
Sbjct: 752  NGPLDEEVYVVQPPGFTSKEDEFKVYKLHKALYGIKQAPRAWNKRIDKFLGDIGFSKCVT 811

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            EHGVY+KN  +   +I+ LYVDDLL+ GS EA I  F   M  EFEMTDLG + YFLGIE
Sbjct: 812  EHGVYVKNCAEKGTIILCLYVDDLLITGSNEAHIREFKVDMMREFEMTDLGHISYFLGIE 871

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNC 355
            F+R  EG+++ + KYA++ILKRFEM  C
Sbjct: 872  FQRTSEGLILHQKKYASEILKRFEMDQC 899


>dbj|GAU51495.1| hypothetical protein TSUD_413780, partial [Trifolium subterraneum]
          Length = 956

 Score =  265 bits (677), Expect = 1e-77
 Identities = 124/208 (59%), Positives = 160/208 (76%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            GE+ ++K RLV RGF QR G+DY EVFA V RM+TIR+V AI ++  W M+QMDVKS+FL
Sbjct: 488  GEITRHKARLVARGFLQREGIDYGEVFALVTRMETIRLVTAITNINDWPMYQMDVKSAFL 547

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NGPIEEEV++ QPPG++++  E KVYRL+KALYGLKQ PR WN+RID FL +IGF +C +
Sbjct: 548  NGPIEEEVFVTQPPGYKVKNQENKVYRLKKALYGLKQAPRDWNRRIDKFLIEIGFVKCVT 607

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            EHGVY+K   +  ++++ LYVDDLL+ GS +  IS F   +K EFEMTDLG + YFLGIE
Sbjct: 608  EHGVYVKKHDEKRLIVMCLYVDDLLITGSNDKYISEFKSDLKREFEMTDLGHMTYFLGIE 667

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNC 355
            F R D+GI M + +YA +ILK+FEM  C
Sbjct: 668  FLRTDQGIFMHQTRYAKEILKKFEMDKC 695


>dbj|GAU44417.1| hypothetical protein TSUD_100640 [Trifolium subterraneum]
          Length = 1318

 Score =  266 bits (681), Expect = 4e-77
 Identities = 124/209 (59%), Positives = 162/209 (77%)
 Frame = -3

Query: 981  HGEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSF 802
            +GE+ KYK RLV +GF Q+ GLD+ EVFAPVAR++TIR+VVAI S R W MHQ+DVKS+F
Sbjct: 857  NGEIAKYKARLVAKGFLQKQGLDFNEVFAPVARLETIRLVVAITSYRGWSMHQLDVKSAF 916

Query: 801  LNGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCT 622
            LNGP++EEVY+ QPP FEI+G EEKV++L+K LYGLKQ PR+WNK IDSFL ++ F +CT
Sbjct: 917  LNGPLDEEVYVKQPPSFEIKGQEEKVFKLKKTLYGLKQAPRSWNKTIDSFLIKLDFIKCT 976

Query: 621  SEHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGI 442
            SEHGVY+K   + +++++ LYVDDLL+ GS +  +  F   +  EFEM+DLGEL YFLG+
Sbjct: 977  SEHGVYVKGSNNEDLILLCLYVDDLLITGSNKNILQKFKTDIMREFEMSDLGELSYFLGM 1036

Query: 441  EFKRNDEGIVMSEHKYATDILKRFEMLNC 355
            EF +  +G  + + KY  DILKRF M NC
Sbjct: 1037 EFVKTSKGYFLHQKKYVEDILKRFHMSNC 1065


>gb|KYP68287.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1340

 Score =  266 bits (679), Expect = 8e-77
 Identities = 124/209 (59%), Positives = 161/209 (77%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            G + K+K RLV +GF Q+ G+DY EVFAPVAR++T+R++VA+ S R WK+ Q+DVKS+FL
Sbjct: 884  GSIAKHKARLVAKGFMQKEGIDYSEVFAPVARLETVRLIVALASWRNWKLWQLDVKSAFL 943

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NGP++EEV++ QPPGF  +G E KV RL+KALYGLKQ PRAWNKRIDSFL   GFQ+C+ 
Sbjct: 944  NGPLDEEVFVAQPPGFICKGKELKVLRLKKALYGLKQAPRAWNKRIDSFLTGFGFQKCSV 1003

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            EHGVY+K   +T +L++ LYVDDLL+ GS   +I      +K EFEMTDLG L YFLGIE
Sbjct: 1004 EHGVYIKTVSETEILVLCLYVDDLLITGSSLTAIESLKQGLKSEFEMTDLGILSYFLGIE 1063

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNCK 352
            F   ++GI M + KY +++LKRF+ML CK
Sbjct: 1064 FAYTEKGIFMHQRKYMSEVLKRFKMLGCK 1092


>dbj|GAU30980.1| hypothetical protein TSUD_104940 [Trifolium subterraneum]
          Length = 1449

 Score =  266 bits (679), Expect = 1e-76
 Identities = 126/208 (60%), Positives = 156/208 (75%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            GE+ ++K RLVV+ F Q+ G+D+ EVFAPVARM+TIR+V A+     W MHQMDVK +FL
Sbjct: 991  GEITRHKARLVVKSFLQKEGIDFNEVFAPVARMETIRLVTAMAHYNGWSMHQMDVKCAFL 1050

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NGP++EEVY+ QPPGF  +  E KVY+L KALYGL Q PRAWNKRID FL  IGF +C +
Sbjct: 1051 NGPLDEEVYVTQPPGFISKEDEFKVYKLHKALYGLNQAPRAWNKRIDKFLSDIGFNKCVT 1110

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            EHGVY+KN V    +I+ LYVDDLL+ GS EA I  F   M  EFEMTDLG + YFLGIE
Sbjct: 1111 EHGVYVKNCVKKGTIILCLYVDDLLITGSDEAHIREFKVDMMREFEMTDLGHISYFLGIE 1170

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNC 355
            F+R  EG+++ + KYA++ILKRFEM  C
Sbjct: 1171 FQRTSEGLILHQKKYASEILKRFEMDQC 1198


>gb|KYP61818.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
 gb|KYP63036.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1314

 Score =  265 bits (677), Expect = 1e-76
 Identities = 124/209 (59%), Positives = 161/209 (77%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            G + K+K RLV +GF Q+ G+DY EVFAPVAR++T+R++VA+ S R WK+ Q+DVKS+FL
Sbjct: 858  GSIAKHKARLVAKGFMQKEGIDYSEVFAPVARLETVRLIVALASWRNWKLWQLDVKSAFL 917

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NGP++EEV++ QPPGF  +G E KV RL+KALYGLKQ PRAWNKRIDSFL   GFQ+C+ 
Sbjct: 918  NGPLDEEVFVTQPPGFICKGKELKVLRLKKALYGLKQAPRAWNKRIDSFLTGFGFQKCSV 977

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            EHGVY+K   +T +L++ LYVDDLL+ GS   +I      +K EFEMTDLG L YFLGIE
Sbjct: 978  EHGVYIKTVSETEILVLCLYVDDLLITGSSLTAIESLKQGLKSEFEMTDLGILSYFLGIE 1037

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNCK 352
            F   ++GI M + KY +++LKRF+ML CK
Sbjct: 1038 FAYTEKGIFMHQRKYMSEVLKRFKMLGCK 1066


>gb|KYP31826.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94 [Cajanus
            cajan]
          Length = 1340

 Score =  265 bits (677), Expect = 2e-76
 Identities = 124/209 (59%), Positives = 161/209 (77%)
 Frame = -3

Query: 978  GEVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFL 799
            G + K+K RLV +GF Q+ G+DY EVFAPVAR++T+R++VA+ S R WK+ Q+DVKS+FL
Sbjct: 884  GSIAKHKARLVAKGFMQKEGIDYSEVFAPVARLETVRLIVALASWRNWKLWQLDVKSAFL 943

Query: 798  NGPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTS 619
            NGP++EEV++ QPPGF  +G E KV RL+KALYGLKQ PRAWNKRIDSFL   GFQ+C+ 
Sbjct: 944  NGPLDEEVFVTQPPGFICKGKELKVLRLKKALYGLKQAPRAWNKRIDSFLTGFGFQKCSV 1003

Query: 618  EHGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIE 439
            EHGVY+K   +T +L++ LYVDDLL+ GS   +I      +K EFEMTDLG L YFLGIE
Sbjct: 1004 EHGVYIKTVSETEILVLCLYVDDLLITGSSLTAIESLKQGLKSEFEMTDLGILSYFLGIE 1063

Query: 438  FKRNDEGIVMSEHKYATDILKRFEMLNCK 352
            F   ++GI M + KY +++LKRF+ML CK
Sbjct: 1064 FAYTEKGIFMHQRKYMSEVLKRFKMLGCK 1092


>dbj|GAU46968.1| hypothetical protein TSUD_143100 [Trifolium subterraneum]
          Length = 1293

 Score =  264 bits (675), Expect = 2e-76
 Identities = 123/207 (59%), Positives = 165/207 (79%)
 Frame = -3

Query: 975  EVLKYKTRLVVRGFQQRAGLDYEEVFAPVARMKTIRIVVAIGSLRQWKMHQMDVKSSFLN 796
            +V+K+K RLV +GF Q+ GLDY+EVF+PVAR +TIR+V+A+   R+W M  +DVKS+FLN
Sbjct: 831  QVIKHKARLVAKGFLQKQGLDYDEVFSPVARHETIRLVIALACSRRWPMFHLDVKSAFLN 890

Query: 795  GPIEEEVYIGQPPGFEIEGHEEKVYRLRKALYGLKQDPRAWNKRIDSFLCQIGFQRCTSE 616
            GP+EE+VY+ QPPGFE++G E++V +L KALYGLKQ PRAWNKRID FL   GF +C+ E
Sbjct: 891  GPLEEDVYVKQPPGFELKGKEDRVLKLNKALYGLKQAPRAWNKRIDQFLVMQGFVKCSVE 950

Query: 615  HGVYLKNDVDTNVLIIGLYVDDLLVAGSKEASISHFIFQMKWEFEMTDLGELPYFLGIEF 436
            +GVY+K+  D ++LII LYVDDLLV GS    I +F  QMK EF+MTDLG+L YFLG+E 
Sbjct: 951  YGVYVKHSDDKHMLIICLYVDDLLVTGSSPMEIENFKSQMKSEFQMTDLGKLTYFLGMEL 1010

Query: 435  KRNDEGIVMSEHKYATDILKRFEMLNC 355
                +G+++++ KYAT+ILK+FEML+C
Sbjct: 1011 LETPKGVILNQAKYATEILKKFEMLDC 1037


Top