BLASTX nr result

ID: Astragalus22_contig00019119 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00019119
         (304 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KYP58223.1| Retrovirus-related Pol polyprotein from transposo...   132   3e-37
gb|PNX93789.1| copia-type polyprotein [Trifolium pratense]            134   2e-34
gb|KYP35468.1| Retrovirus-related Pol polyprotein from transposo...   131   8e-34
gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposo...   131   1e-33
gb|PNX95763.1| retrotransposon-related protein, partial [Trifoli...   130   7e-33
dbj|GAU26746.1| hypothetical protein TSUD_317440 [Trifolium subt...   129   1e-32
dbj|GAU22946.1| hypothetical protein TSUD_326740 [Trifolium subt...   128   3e-32
gb|KYP73784.1| Retrovirus-related Pol polyprotein from transposo...   123   1e-31
gb|PNX69396.1| ubiquitin carboxyl-terminal hydrolase, partial [T...   117   1e-31
emb|CAN74984.1| hypothetical protein VITISV_035210 [Vitis vinifera]   125   2e-31
gb|PNX58304.1| retrovirus-related Pol polyprotein from transposo...   115   4e-31
dbj|GAU15733.1| hypothetical protein TSUD_235460 [Trifolium subt...   125   4e-31
gb|KYP42102.1| Retrovirus-related Pol polyprotein from transposo...   124   5e-31
dbj|GAU12447.1| hypothetical protein TSUD_229810 [Trifolium subt...   124   6e-31
gb|KYP63625.1| Retrovirus-related Pol polyprotein from transposo...   124   1e-30
gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]            123   2e-30
dbj|GAU27929.1| hypothetical protein TSUD_160240 [Trifolium subt...   122   3e-30
gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinen...   122   3e-30
dbj|GAU42405.1| hypothetical protein TSUD_324600 [Trifolium subt...   122   3e-30
dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subte...   122   4e-30

>gb|KYP58223.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 164

 Score =  132 bits (332), Expect = 3e-37
 Identities = 58/97 (59%), Positives = 73/97 (75%)
 Frame = -1

Query: 292 PIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQ 113
           P+ PTC   V+ED + LWH RFGHLS+  LQ L QKGMV GLP +K P+K C +CLIG+Q
Sbjct: 19  PVAPTCFNTVTEDVAQLWHCRFGHLSFKGLQTLQQKGMVEGLPMLKSPSKLCKDCLIGKQ 78

Query: 112 HREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           HR++   RS WRAS+ L+++H+DICGPI P SNS KR
Sbjct: 79  HRDSFPMRSSWRASQILQLVHADICGPIKPISNSKKR 115


>gb|PNX93789.1| copia-type polyprotein [Trifolium pratense]
          Length = 1347

 Score =  134 bits (338), Expect = 2e-34
 Identities = 62/97 (63%), Positives = 75/97 (77%)
 Frame = -1

Query: 292 PIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQ 113
           P   TC Q VSE++S LWH RFGHLSY  L+ L  K MV GLPS++ P K C ECLIG+Q
Sbjct: 417 PNESTCFQNVSENESYLWHCRFGHLSYQGLRTLFYKKMVNGLPSIQIPKKLCTECLIGKQ 476

Query: 112 HREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           HR++MSK+S WRAS KL+++H+DICGPI P SNSNKR
Sbjct: 477 HRDSMSKKSLWRASNKLQLVHADICGPIKPESNSNKR 513


>gb|KYP35468.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 508

 Score =  131 bits (329), Expect = 8e-34
 Identities = 56/92 (60%), Positives = 75/92 (81%)
 Frame = -1

Query: 277 CLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHREAM 98
           C+Q  + D + +WH+R+GHLSYG L+LL+QK MV GLP +KE  K C EC +G+QHR+A+
Sbjct: 84  CIQTSNVDSTEMWHKRYGHLSYGGLKLLNQKAMVKGLPELKEMDKVCPECAVGKQHRDAI 143

Query: 97  SKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           SK+S WRA+R+LE+IHSDICGP +P SNSN+R
Sbjct: 144 SKQSTWRATRRLELIHSDICGPSTPTSNSNRR 175


>gb|KYP40819.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 582

 Score =  131 bits (330), Expect = 1e-33
 Identities = 59/103 (57%), Positives = 79/103 (76%), Gaps = 2/103 (1%)
 Frame = -1

Query: 304 IMADPI--PPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAE 131
           I A P+     C+Q  + D + +WH+R+GHLSYG L+LL+QK MV GLP +KE  K C E
Sbjct: 402 IKAKPVITEAACIQTSNVDSTEMWHKRYGHLSYGGLKLLNQKAMVKGLPELKEMDKVCPE 461

Query: 130 CLIGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           C +G+QHR+A+SK+S WRA+R+LE+IHSDICGP +P SNSN+R
Sbjct: 462 CAVGKQHRDAISKQSTWRATRRLELIHSDICGPSTPTSNSNRR 504


>gb|PNX95763.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1327

 Score =  130 bits (326), Expect = 7e-33
 Identities = 56/94 (59%), Positives = 73/94 (77%)
 Frame = -1

Query: 283 PTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHRE 104
           P C+Q  ++D+S LWH RFGHL+Y  L+ L  +GMV GLP+VK P K C  CL+G+QHR+
Sbjct: 418 PKCMQVEADDESRLWHSRFGHLNYKGLRTLAYRGMVEGLPTVKTPQKLCTHCLVGKQHRD 477

Query: 103 AMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
            + KR+ WRA+ KL+++HSDICGPISP SNSNKR
Sbjct: 478 PIPKRNLWRATHKLQLVHSDICGPISPISNSNKR 511


>dbj|GAU26746.1| hypothetical protein TSUD_317440 [Trifolium subterraneum]
          Length = 1608

 Score =  129 bits (325), Expect = 1e-32
 Identities = 59/94 (62%), Positives = 75/94 (79%), Gaps = 2/94 (2%)
 Frame = -1

Query: 277 CLQA--VSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHRE 104
           CLQA  +SE ++ LWH RFGHL+Y  L  L  K MVIGLPS+K P K C  CLIG+QHRE
Sbjct: 422 CLQAEMMSEKETQLWHSRFGHLNYKGLNTLSNKKMVIGLPSLKSPKKICTTCLIGKQHRE 481

Query: 103 AMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           ++ K+S WRAS+KL+++H+DICGPI+P+SNSNKR
Sbjct: 482 SIPKKSSWRASKKLQLVHADICGPITPSSNSNKR 515


>dbj|GAU22946.1| hypothetical protein TSUD_326740 [Trifolium subterraneum]
          Length = 1222

 Score =  128 bits (321), Expect = 3e-32
 Identities = 57/100 (57%), Positives = 73/100 (73%)
 Frame = -1

Query: 301 MADPIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLI 122
           +A  I P C +    +++ LWH+R+GHLSY  L +L  K MV+GLP VKEPT  C+ C+ 
Sbjct: 340 LAPVIVPKCFKTTHSNENQLWHQRYGHLSYKGLGVLANKKMVLGLPCVKEPTDKCSNCMK 399

Query: 121 GRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           G+QHREA+ KRS WRAS KLE++HSDICGPI+P SN  KR
Sbjct: 400 GKQHREAIPKRSLWRASAKLELVHSDICGPITPESNGKKR 439


>gb|KYP73784.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 395

 Score =  123 bits (309), Expect = 1e-31
 Identities = 57/99 (57%), Positives = 74/99 (74%), Gaps = 2/99 (2%)
 Frame = -1

Query: 292 PIPPTCLQA--VSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIG 119
           P  P CLQA  VSE ++ LWH RFGHL+Y  L  L  K MV+G+PS+K P   C  CL+G
Sbjct: 10  PQHPLCLQAEDVSEKKTQLWHSRFGHLNYKGLSTLASKQMVLGIPSLKSPKTICTTCLVG 69

Query: 118 RQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           +QHR+++  +S WRAS KL++IH+DICGPISP+S+SNKR
Sbjct: 70  KQHRDSIPMQSSWRASTKLQLIHADICGPISPSSHSNKR 108


>gb|PNX69396.1| ubiquitin carboxyl-terminal hydrolase, partial [Trifolium pratense]
          Length = 149

 Score =  117 bits (293), Expect = 1e-31
 Identities = 51/96 (53%), Positives = 70/96 (72%)
 Frame = -1

Query: 289 IPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQH 110
           I P CL+   ++ + LWH R+GHLS+  L  L +K MVIGLP +++  +NC++CL G+QH
Sbjct: 41  IMPMCLKTAKQESTQLWHDRYGHLSFKGLNTLSKKQMVIGLPELEDSDENCSDCLTGKQH 100

Query: 109 REAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           R+ + K++ WRAS KLE+IHSDICGPISP SN   R
Sbjct: 101 RDIIPKQANWRASVKLELIHSDICGPISPQSNGGCR 136


>emb|CAN74984.1| hypothetical protein VITISV_035210 [Vitis vinifera]
          Length = 2408

 Score =  125 bits (315), Expect = 2e-31
 Identities = 53/94 (56%), Positives = 72/94 (76%)
 Frame = -1

Query: 283 PTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHRE 104
           PTC Q + ED + LWH R+GHLS+  L+ L  K MV GLP +K P+K C +C++G+QHR+
Sbjct: 357 PTCFQTILEDNTHLWHCRYGHLSFKGLRTLQYKQMVRGLPQLKAPSKICTDCMVGKQHRD 416

Query: 103 AMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           A+ KRS WRAS++L+++H+DICGPI P SNS KR
Sbjct: 417 AIPKRSLWRASQRLQLVHADICGPIKPISNSKKR 450


>gb|PNX58304.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 132

 Score =  115 bits (289), Expect = 4e-31
 Identities = 54/104 (51%), Positives = 74/104 (71%), Gaps = 3/104 (2%)
 Frame = -1

Query: 304 IMADPIPPTCLQAVSEDQSV---LWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCA 134
           I A  I P C++  S D +V   LWH+R+GHLS+  + +L QK MV GLP +K+P + C+
Sbjct: 13  IKAPVIVPHCMKT-SNDSNVNANLWHQRYGHLSFKGMSVLVQKEMVTGLPKLKQPNEECS 71

Query: 133 ECLIGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
            C+ G+QHR+ + K+S WRAS KLE++HSDICGPI+P SN  KR
Sbjct: 72  NCMKGKQHRKNVPKKSSWRASTKLELVHSDICGPINPESNGKKR 115


>dbj|GAU15733.1| hypothetical protein TSUD_235460 [Trifolium subterraneum]
          Length = 1067

 Score =  125 bits (313), Expect = 4e-31
 Identities = 54/94 (57%), Positives = 72/94 (76%)
 Frame = -1

Query: 283 PTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHRE 104
           P CLQ  +EDQ+ LWH R+GHLS   L+LL  K MV GLP+++E ++ C +CLIG+Q RE
Sbjct: 409 PMCLQTSNEDQTQLWHNRYGHLSVNGLKLLSSKDMVKGLPAIREMSERCIDCLIGKQQRE 468

Query: 103 AMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
            + K++KWRA+ KL++IHSDICGPI+P SN  KR
Sbjct: 469 VIPKQAKWRANTKLQLIHSDICGPINPCSNGGKR 502


>gb|KYP42102.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 538

 Score =  124 bits (310), Expect = 5e-31
 Identities = 55/97 (56%), Positives = 72/97 (74%)
 Frame = -1

Query: 292 PIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQ 113
           PI  TC +  +ED + LWH RFGHLS+  LQ L QK MV GLP +K P+K C +C+ G+Q
Sbjct: 414 PISSTCFKIATEDVAHLWHCRFGHLSFKGLQTLQQKEMVKGLPLLKSPSKLCKDCIAGKQ 473

Query: 112 HREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           HR++   +S WRAS+ L+++H+DICGPI PASNSNKR
Sbjct: 474 HRDSFPIKSSWRASQILQLVHADICGPIKPASNSNKR 510


>dbj|GAU12447.1| hypothetical protein TSUD_229810 [Trifolium subterraneum]
          Length = 1102

 Score =  124 bits (312), Expect = 6e-31
 Identities = 57/101 (56%), Positives = 72/101 (71%)
 Frame = -1

Query: 304 IMADPIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECL 125
           I A  I P C +A   +++ +WH+R+GHLSY  L +L  K MVIGLP VKEPT  C+ C+
Sbjct: 288 IRAPLIVPKCFKASHSNENQIWHQRYGHLSYKGLGVLANKKMVIGLPCVKEPTDKCSNCM 347

Query: 124 IGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
            G+QHREA+ K   WRAS KLE++HSDICGPI+P SN  KR
Sbjct: 348 KGKQHREAIPKMRLWRASAKLELVHSDICGPITPESNGKKR 388


>gb|KYP63625.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 930

 Score =  124 bits (310), Expect = 1e-30
 Identities = 55/97 (56%), Positives = 71/97 (73%)
 Frame = -1

Query: 292 PIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQ 113
           P+ PTC   V+ED + LWH RFGHLS+  LQ L QKGMV GL  +K P+K C + LIG+Q
Sbjct: 392 PVAPTCFNTVTEDVAQLWHCRFGHLSFKGLQTLQQKGMVEGLAMLKPPSKLCKDYLIGKQ 451

Query: 112 HREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           HR++   +S WRAS+ L+++H+DICGPI P SNS KR
Sbjct: 452 HRDSFPMKSSWRASQILQLVHADICGPIKPVSNSKKR 488


>gb|PNX95204.1| copia-type polyprotein [Trifolium pratense]
          Length = 1328

 Score =  123 bits (308), Expect = 2e-30
 Identities = 56/101 (55%), Positives = 73/101 (72%)
 Frame = -1

Query: 304 IMADPIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECL 125
           I A  + P CLQ  +E  S LWH+R+GHLSY  L  L +K MV GLP++KE +  C++CL
Sbjct: 409 IKATVLVPMCLQTTNEIDSQLWHKRYGHLSYKGLNTLVKKEMVRGLPALKEASDVCSDCL 468

Query: 124 IGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
            G+QHRE + K+  WRA+ KLE+IHSDICGPI+PASNS  +
Sbjct: 469 FGKQHREVIPKKVNWRATHKLELIHSDICGPINPASNSGNK 509


>dbj|GAU27929.1| hypothetical protein TSUD_160240 [Trifolium subterraneum]
          Length = 1197

 Score =  122 bits (307), Expect = 3e-30
 Identities = 54/101 (53%), Positives = 77/101 (76%)
 Frame = -1

Query: 304 IMADPIPPTCLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECL 125
           I+A  I   CLQ  +ED++ LWH R+GHLS   L+LL++K MV GLP+++E  + C +CL
Sbjct: 332 IVAPVIVLMCLQTTNEDKTQLWHHRYGHLSVKGLKLLNKKDMVKGLPALRELNEKCTDCL 391

Query: 124 IGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           +G+QHRE + K++KWRA+ KL++IH DICGPI+P+SN  KR
Sbjct: 392 MGKQHREVIPKQAKWRATAKLQLIHYDICGPINPSSNGGKR 432


>gb|PRQ17740.1| putative RNA-directed DNA polymerase [Rosa chinensis]
          Length = 1302

 Score =  122 bits (307), Expect = 3e-30
 Identities = 55/103 (53%), Positives = 75/103 (72%), Gaps = 2/103 (1%)
 Frame = -1

Query: 304 IMADPIPPT--CLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAE 131
           I+A+ +P    C Q VSED + LWH R+GHLS+  L+ L  + MV GLP  K P+K C +
Sbjct: 408 ILANMLPQASACFQTVSEDNTHLWHCRYGHLSFKGLKTLQYRNMVKGLPDFKMPSKLCKD 467

Query: 130 CLIGRQHREAMSKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
           C++G+QHRE++ K+S WRAS +L++IHSDICGPI P SN+ KR
Sbjct: 468 CMVGKQHRESIPKKSMWRASHRLQLIHSDICGPIKPLSNNRKR 510


>dbj|GAU42405.1| hypothetical protein TSUD_324600 [Trifolium subterraneum]
          Length = 1302

 Score =  122 bits (307), Expect = 3e-30
 Identities = 56/92 (60%), Positives = 71/92 (77%)
 Frame = -1

Query: 277 CLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHREAM 98
           CLQA +ED + LWH R+GHL+   LQ L QK MVIGLP  +E    CA+CL G+QHRE++
Sbjct: 413 CLQANTEDITQLWHCRYGHLNIKGLQNLQQKNMVIGLPKFEESNHVCADCLRGKQHRESI 472

Query: 97  SKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
            K S W++S++LE+IHSDICGPI+P SNSNKR
Sbjct: 473 PKTSNWKSSKRLELIHSDICGPITPVSNSNKR 504


>dbj|GAU32260.1| hypothetical protein TSUD_53880 [Trifolium subterraneum]
          Length = 1172

 Score =  122 bits (306), Expect = 4e-30
 Identities = 52/92 (56%), Positives = 67/92 (72%)
 Frame = -1

Query: 277 CLQAVSEDQSVLWHRRFGHLSYGRLQLLHQKGMVIGLPSVKEPTKNCAECLIGRQHREAM 98
           C +    D+  LWH+R+GHLS+  + +L QK MVIGLP +KEPT+ C  C+ G+Q RE +
Sbjct: 401 CFKITKNDEDTLWHKRYGHLSFKGINVLVQKNMVIGLPKLKEPTEKCTHCMKGKQQRENV 460

Query: 97  SKRSKWRASRKLEMIHSDICGPISPASNSNKR 2
            K+S WRAS KLE+IHSDICGPI+P SN  KR
Sbjct: 461 PKKSHWRASHKLELIHSDICGPINPESNGKKR 492