BLASTX nr result

ID: Astragalus22_contig00033532 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00033532
         (416 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNY15768.1| retrotransposon-related protein, partial [Trifoli...   111   9e-26
gb|PNX92003.1| retrotransposon-related protein [Trifolium pratense]   110   2e-25
dbj|GAU37691.1| hypothetical protein TSUD_164940 [Trifolium subt...   105   1e-23
gb|PNY11704.1| hypothetical protein L195_g008316 [Trifolium prat...   102   1e-22
gb|PNY12666.1| poly(ADP-ribose) polymerase domain protein [Trifo...   101   3e-22
dbj|GAU30534.1| hypothetical protein TSUD_65440 [Trifolium subte...   100   5e-22
dbj|GAU42300.1| hypothetical protein TSUD_136860 [Trifolium subt...   100   1e-21
dbj|GAU35788.1| hypothetical protein TSUD_56650 [Trifolium subte...    98   6e-21
gb|PNX92424.1| retrotransposon-related protein [Trifolium pratense]    97   9e-21
gb|PNX62727.1| hypothetical protein L195_g053129, partial [Trifo...    91   9e-21
gb|PNY07014.1| hypothetical protein L195_g003497, partial [Trifo...    91   1e-20
gb|PNY03891.1| transposon Ty3 gag-pol polyprotein [Trifolium pra...    96   2e-20
dbj|GAU22407.1| hypothetical protein TSUD_122930 [Trifolium subt...    96   2e-20
dbj|GAU25313.1| hypothetical protein TSUD_375770, partial [Trifo...    91   2e-18
gb|PNX77561.1| transposon Tf2-1 polyprotein [Trifolium pratense]       91   2e-18
gb|ABN06064.1| RNA-directed DNA polymerase (Reverse transcriptas...    91   2e-18
gb|PNY17486.1| Ty3/gypsy retrotransposon protein [Trifolium prat...    90   4e-18
gb|PNY11636.1| transposon Ty3 gag-pol polyprotein [Trifolium pra...    88   8e-18
dbj|GAU12466.1| hypothetical protein TSUD_229990, partial [Trifo...    88   1e-17
gb|KYP32789.1| Retrotransposable element Tf2, partial [Cajanus c...    87   2e-17

>gb|PNY15768.1| retrotransposon-related protein, partial [Trifolium pratense]
          Length = 1583

 Score =  111 bits (278), Expect = 9e-26
 Identities = 51/105 (48%), Positives = 69/105 (65%)
 Frame = -2

Query: 406  ESDNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPN 227
            E  ++ E++ +++E E VLAT K+K  GEE  QLL+H KGK  E+ TWEDE++IRS FP 
Sbjct: 1457 ELPDLMEESIDMYEPEAVLATRKIKQNGEESKQLLIHWKGKNVEEATWEDELMIRSQFPK 1516

Query: 226  FSPEDKAEVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMR 92
            F  EDK ++ GG IVR +     P   ++HQG  GP+ W VYS R
Sbjct: 1517 FDLEDKVDIEGGSIVRTQINEEMPREPVIHQGIGGPRPWLVYSRR 1561


>gb|PNX92003.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1571

 Score =  110 bits (276), Expect = 2e-25
 Identities = 52/101 (51%), Positives = 68/101 (67%)
 Frame = -2

Query: 394  MAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFSPE 215
            + E+  EVFE E+VLA   VK +GE+  Q+LVH KGK  E+ TWEDE++IRS FP F+ E
Sbjct: 1458 LMEEPIEVFEPESVLAARLVKQQGEDIKQVLVHWKGKTVEEATWEDELVIRSQFPKFNLE 1517

Query: 214  DKAEVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMR 92
            DK    GGG+V       +P++Q++HQ S GP  WKVYS R
Sbjct: 1518 DKVTAEGGGVVTTSVNTENPHQQMIHQRSHGPVTWKVYSRR 1558


>dbj|GAU37691.1| hypothetical protein TSUD_164940 [Trifolium subterraneum]
          Length = 1542

 Score =  105 bits (263), Expect = 1e-23
 Identities = 53/107 (49%), Positives = 71/107 (66%)
 Frame = -2

Query: 406  ESDNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPN 227
            E  ++ E+  +++E E +LAT KVKH GEE  Q+LVH KGK AE  TWEDE++IRS FP 
Sbjct: 1431 ELPDLLEELNDMYEPEAILATRKVKHSGEEVKQVLVHWKGKTAEDATWEDELMIRSQFPK 1490

Query: 226  FSPEDKAEVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMRVK 86
            FS EDKA    GGI R ++    P++QL+H  + G K W VY+ + K
Sbjct: 1491 FSLEDKAIAEEGGIDRDQSTAGMPHQQLIHHQTHGSKPWLVYTRKGK 1537


>gb|PNY11704.1| hypothetical protein L195_g008316 [Trifolium pratense]
          Length = 691

 Score =  102 bits (254), Expect = 1e-22
 Identities = 55/106 (51%), Positives = 67/106 (63%), Gaps = 1/106 (0%)
 Frame = -2

Query: 400 DNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFS 221
           D M  D PE+FE E VLAT  +++KGE  +Q+L+  KGK AE+ TWED I I+S FP FS
Sbjct: 578 DGMTGDQPEMFELERVLATRGIQNKGETILQVLIQWKGKLAEEATWEDVITIKSQFPKFS 637

Query: 220 PEDKAEVSGGGIVRVKARNADP-NRQLVHQGSSGPKVWKVYSMRVK 86
            EDKA + GG IVR   +   P +  LVH G  G K W VYS R K
Sbjct: 638 LEDKANLIGGSIVRPNEKELGPISESLVHDGRKGCKEWIVYSRRGK 683


>gb|PNY12666.1| poly(ADP-ribose) polymerase domain protein [Trifolium pratense]
          Length = 510

 Score =  101 bits (251), Expect = 3e-22
 Identities = 48/110 (43%), Positives = 71/110 (64%)
 Frame = -2

Query: 400 DNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFS 221
           DN+  D  ++ E E VLA   V+ +GEE MQ+L+  KG+P E+ TWE+  +IRS FP+F+
Sbjct: 393 DNLEGDGTDLTEPEPVLAARMVQKQGEEIMQVLLQWKGRPVEEATWEEAFMIRSQFPSFN 452

Query: 220 PEDKAEVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMRVKAEHAG 71
            E+K +  GG IVR +     PN +++H  S+G K+W+VYS R K  + G
Sbjct: 453 LENKVQALGGSIVRHREATHRPNNEMIHNDSAGAKIWRVYSRRGKRGNMG 502


>dbj|GAU30534.1| hypothetical protein TSUD_65440 [Trifolium subterraneum]
          Length = 1084

 Score =  100 bits (250), Expect = 5e-22
 Identities = 51/106 (48%), Positives = 69/106 (65%), Gaps = 1/106 (0%)
 Frame = -2

Query: 406  ESDNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPN 227
            E   + ++  ++FE E VLAT KV H+ EE  Q+LV  KG+ A++ TWEDE++IRS FP+
Sbjct: 972  ELPELMDEYTDLFEPETVLATRKVNHQNEEVKQVLVQWKGRTADEATWEDELVIRSQFPS 1031

Query: 226  FSPEDKAEVSGGGIVRVKARN-ADPNRQLVHQGSSGPKVWKVYSMR 92
            F  EDK    GGG  R +  + A P+ QL+H GS+GP  W VYS R
Sbjct: 1032 FDLEDKVSSEGGGNDRSQLNDRALPHGQLIHNGSNGPSTWLVYSRR 1077


>dbj|GAU42300.1| hypothetical protein TSUD_136860 [Trifolium subterraneum]
          Length = 1523

 Score = 99.8 bits (247), Expect = 1e-21
 Identities = 50/106 (47%), Positives = 67/106 (63%), Gaps = 1/106 (0%)
 Frame = -2

Query: 406  ESDNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPN 227
            E   + E+  ++FE E +LA  K+K   EE  Q+LVH +GK  E+ TWEDEI+IRS FP 
Sbjct: 1410 ELPELLEENSDMFEPETILAARKIKKYDEEVKQVLVHWRGKSVEEATWEDEIVIRSQFPK 1469

Query: 226  FSPEDKAEVSGGGIVRV-KARNADPNRQLVHQGSSGPKVWKVYSMR 92
            F+ EDK  V GG I R    ++  P+ QLV+ GS+ P+ W VYS R
Sbjct: 1470 FALEDKVTVEGGSIDRTWSTKDGMPHEQLVNDGSNEPRAWMVYSRR 1515


>dbj|GAU35788.1| hypothetical protein TSUD_56650 [Trifolium subterraneum]
          Length = 1311

 Score = 97.8 bits (242), Expect = 6e-21
 Identities = 50/106 (47%), Positives = 66/106 (62%), Gaps = 1/106 (0%)
 Frame = -2

Query: 406  ESDNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPN 227
            E   + E+  ++FE E VLA  KV H+ EE  Q+LV  KG+ A++ TWEDEI+IRS FP+
Sbjct: 1199 ELPELMEEYTDLFEPEAVLAARKVTHQNEEVKQVLVQWKGRTADEATWEDEIVIRSQFPS 1258

Query: 226  FSPEDKAEVSGGGIVRVKARNAD-PNRQLVHQGSSGPKVWKVYSMR 92
            F  EDK    GGG+ R +  N D P  Q++H  S+G   W VYS R
Sbjct: 1259 FDLEDKVSSEGGGMDRSQLTNRDLPREQMIHYDSNGSHTWLVYSRR 1304


>gb|PNX92424.1| retrotransposon-related protein [Trifolium pratense]
          Length = 1554

 Score = 97.4 bits (241), Expect = 9e-21
 Identities = 47/101 (46%), Positives = 65/101 (64%)
 Frame = -2

Query: 394  MAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFSPE 215
            + E+ P++ E E +LA  K+  +G+E  Q+LVH KGK AE+ TWEDE++IRS FP F+ E
Sbjct: 1449 LMEELPDLCEPETILAVRKITQQGDEVKQVLVHWKGKTAEEATWEDELMIRSQFPKFALE 1508

Query: 214  DKAEVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMR 92
            DK  + GGGI R +    +P   LV+  + GP  W VYS R
Sbjct: 1509 DKVNIEGGGIDRTQTVEEEP---LVNDSAIGPHTWLVYSRR 1546


>gb|PNX62727.1| hypothetical protein L195_g053129, partial [Trifolium pratense]
          Length = 118

 Score = 90.5 bits (223), Expect = 9e-21
 Identities = 50/111 (45%), Positives = 71/111 (63%), Gaps = 1/111 (0%)
 Frame = -2

Query: 400 DNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFS 221
           D +A +  EV+E E VLAT K++ +GEE  Q+L+  +GK AE+ TWE+ I++ S FP  S
Sbjct: 14  DLLAGEQVEVYEPEAVLATRKIQRQGEESKQVLIQWRGKTAEEATWEEAIMMTSQFPKLS 73

Query: 220 PEDKAEVSGGGIVRVKARNAD-PNRQLVHQGSSGPKVWKVYSMRVKAEHAG 71
            EDKA      IV  +A +    + +L+H  SSGPK+WKVYS R K  ++G
Sbjct: 74  LEDKA------IVEEEAIDVSLLHEELIHYESSGPKIWKVYSARGKRGNSG 118


>gb|PNY07014.1| hypothetical protein L195_g003497, partial [Trifolium pratense]
          Length = 120

 Score = 90.5 bits (223), Expect = 1e-20
 Identities = 49/104 (47%), Positives = 62/104 (59%), Gaps = 1/104 (0%)
 Frame = -2

Query: 400 DNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFS 221
           D M  +  E +E E VLA  K+K +GEE  QLLV  +GK  E+ TWEDEI+IRS FP F+
Sbjct: 10  DQMEGNITENYEPEAVLAVRKIKQQGEESKQLLVQWRGKTVEEATWEDEIMIRSQFPPFN 69

Query: 220 PEDKAEVSGGGIVRVKARNADPNR-QLVHQGSSGPKVWKVYSMR 92
            E K +  GG I R + +     R QLV+  + GPK W VY  R
Sbjct: 70  LEGKVDFEGGSIDRAQNKKEVMTRDQLVYHEAKGPKTWLVYYRR 113


>gb|PNY03891.1| transposon Ty3 gag-pol polyprotein [Trifolium pratense]
          Length = 535

 Score = 96.3 bits (238), Expect = 2e-20
 Identities = 52/103 (50%), Positives = 68/103 (66%)
 Frame = -2

Query: 400 DNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFS 221
           D++A +  EV E E +LAT KV+ + EE  Q+LVH KGK +E+ TWED ILI+S FP+F+
Sbjct: 431 DSLAGEQTEVCEPEAILATRKVQQQDEEVKQVLVHWKGKTSEEATWEDLILIKSQFPSFN 490

Query: 220 PEDKAEVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMR 92
            EDK    G GI R       P+ QLV+  + GPKVW+VYS R
Sbjct: 491 LEDKVMAEGEGIDR-----HVPHEQLVNVATKGPKVWRVYSRR 528


>dbj|GAU22407.1| hypothetical protein TSUD_122930 [Trifolium subterraneum]
          Length = 1490

 Score = 96.3 bits (238), Expect = 2e-20
 Identities = 49/106 (46%), Positives = 66/106 (62%), Gaps = 1/106 (0%)
 Frame = -2

Query: 406  ESDNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPN 227
            E   + E+  ++FE E +LA  K+K  GEE  Q+L+H KGK  E+ TWEDE++IR  FP 
Sbjct: 1377 ELPELMEEQSDLFEPEAMLAARKIKQHGEEVKQVLIHWKGKTVEEATWEDELVIRGQFPK 1436

Query: 226  FSPEDKAEVSGGGIVRVKARNADPNR-QLVHQGSSGPKVWKVYSMR 92
            F+ EDK    GG I R ++ + D +R QLV   S+GP  W VYS R
Sbjct: 1437 FALEDKVYTEGGSIDRTQSIDEDLSREQLVSNSSNGPHTWLVYSRR 1482


>dbj|GAU25313.1| hypothetical protein TSUD_375770, partial [Trifolium subterraneum]
          Length = 1110

 Score = 90.9 bits (224), Expect = 2e-18
 Identities = 43/103 (41%), Positives = 63/103 (61%)
 Frame = -2

Query: 394  MAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFSPE 215
            + ED  E +E   V+A+ K++  GEE  Q+L+  KG+  E+ TWED ++++S FP F+ E
Sbjct: 1003 LVEDQMENYEPMFVVASRKIRQNGEEVRQVLIQWKGQTVEEATWEDVVMMKSQFPGFNLE 1062

Query: 214  DKAEVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMRVK 86
            DK    GGG+ R     A P+ QL++  S+ PK W VYS R K
Sbjct: 1063 DKIIAEGGGVDRTHINTAMPHEQLIYNQSNNPKTWLVYSRRGK 1105


>gb|PNX77561.1| transposon Tf2-1 polyprotein [Trifolium pratense]
          Length = 506

 Score = 90.5 bits (223), Expect = 2e-18
 Identities = 50/106 (47%), Positives = 64/106 (60%), Gaps = 1/106 (0%)
 Frame = -2

Query: 406 ESDNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPN 227
           E   + E+  +VFE E VLAT K+K +GE   Q+L+  +GK AE+ TWEDEI++RS FP 
Sbjct: 393 ELPELLEEPIDVFEPEAVLATRKLKQQGEVINQVLIQWRGKTAEEATWEDEIMMRSQFPK 452

Query: 226 FSPEDKAEVSGGGIVRVK-ARNADPNRQLVHQGSSGPKVWKVYSMR 92
           F  E KA V   GI R + A    P  QL+H  + GPK   VYS R
Sbjct: 453 FGLEGKANVEEEGIDRTQPAEEELPREQLIHNAAGGPKALLVYSRR 498


>gb|ABN06064.1| RNA-directed DNA polymerase (Reverse transcriptase); Chromo; Zinc
            finger, CCHC-type; Peptidase aspartic, active site;
            Polynucleotidyl transferase, Ribonuclease H fold
            [Medicago truncatula]
          Length = 1297

 Score = 90.5 bits (223), Expect = 2e-18
 Identities = 50/109 (45%), Positives = 64/109 (58%)
 Frame = -2

Query: 397  NMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFSP 218
            ++ ED   V E E VL    ++ +GE+  Q+LVH  G+  E+ TWED ++IRS FPNF  
Sbjct: 1196 DLEEDKGVVIEPETVLTRRTIQVQGEKIDQVLVHWMGQKVEEATWEDTLIIRSQFPNFYL 1255

Query: 217  EDKAEVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMRVKAEHAG 71
            EDKA +SGG IVR           LVH  + GPKVW+VYS R K    G
Sbjct: 1256 EDKAMLSGGSIVRTA-------ESLVHNTTVGPKVWQVYSRRCKKPAKG 1297


>gb|PNY17486.1| Ty3/gypsy retrotransposon protein [Trifolium pratense]
          Length = 1357

 Score = 89.7 bits (221), Expect = 4e-18
 Identities = 46/110 (41%), Positives = 68/110 (61%)
 Frame = -2

Query: 400  DNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFS 221
            D M +DT E +E   VLAT K++ +GEE  Q+LV  +GK AE+ TWE+ I+++S FP F+
Sbjct: 1248 DLMGDDTGENYEPVAVLATRKIRQQGEEIKQVLVQWEGKDAEEATWEEAIMMKSQFPKFN 1307

Query: 220  PEDKAEVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMRVKAEHAG 71
             EDK     G I R +   + P+ QL++  +  PK+W+VY  R K   +G
Sbjct: 1308 LEDKVIAEEGSIDRTQNMESLPHEQLIYNETIKPKIWQVYYRRGKRVLSG 1357


>gb|PNY11636.1| transposon Ty3 gag-pol polyprotein [Trifolium pratense]
          Length = 395

 Score = 88.2 bits (217), Expect = 8e-18
 Identities = 52/107 (48%), Positives = 63/107 (58%)
 Frame = -2

Query: 400 DNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFS 221
           DN++ D  E+ E E +LA   V+ +GEE  Q+LV  KGK AE+VTWED I ++S FP F 
Sbjct: 291 DNLSGDGNEMVEPEQLLANRTVQQQGEEGQQVLVKWKGKSAEEVTWEDMITMQSQFPQFC 350

Query: 220 PEDKAEVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMRVKAE 80
             DKA    GGI R  A N      LVH    GPK W VYS RV+ E
Sbjct: 351 LADKAIGLEGGIDRGAASN------LVHNQQGGPKQWIVYSRRVRKE 391


>dbj|GAU12466.1| hypothetical protein TSUD_229990, partial [Trifolium subterraneum]
          Length = 1303

 Score = 88.2 bits (217), Expect = 1e-17
 Identities = 48/107 (44%), Positives = 66/107 (61%), Gaps = 4/107 (3%)
 Frame = -2

Query: 400  DNMAEDTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFS 221
            D ++++ PE +E E VL   K+K +GEE  Q+L+  KGK A++ TWE+     S FP F+
Sbjct: 1195 DLVSDEQPEAYEPEAVLGNRKIKQQGEEIKQVLIQWKGKNADEATWEE-----SQFPTFN 1249

Query: 220  PEDKAEVSGGGI----VRVKARNADPNRQLVHQGSSGPKVWKVYSMR 92
             EDK  V GGGI        + ++ P+ QLVHQG+SG K W VYS R
Sbjct: 1250 LEDKIIVEGGGIDSNLSAQSSASSAPHEQLVHQGASGAKPWLVYSRR 1296


>gb|KYP32789.1| Retrotransposable element Tf2, partial [Cajanus cajan]
          Length = 458

 Score = 87.4 bits (215), Expect = 2e-17
 Identities = 49/100 (49%), Positives = 60/100 (60%)
 Frame = -2

Query: 385 DTPEVFEHENVLATCKVKHKGEEKMQLLVHRKGKPAEKVTWEDEILIRSPFPNFSPEDKA 206
           + PE  E E+++AT K   +GE   QLLV  K KP E  TWEDE +IRS FP+FSPEDKA
Sbjct: 368 EIPESIEPESIIATRKSSKQGETTQQLLVKWKNKPMEDATWEDEFVIRSQFPSFSPEDKA 427

Query: 205 EVSGGGIVRVKARNADPNRQLVHQGSSGPKVWKVYSMRVK 86
           +  G G         +   QLV Q  S PKVW+VY+ R K
Sbjct: 428 DFIGEG---------NDRAQLVRQ-ESRPKVWRVYTRRNK 457


Top