BLASTX nr result

ID: Astragalus22_contig00038574 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00038574
         (350 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|KHN49021.1| hypothetical protein glysoja_031232, partial [Gly...   109   2e-26
gb|KHN21230.1| Retrovirus-related Pol polyprotein from transposo...   109   3e-25
gb|KYP30925.1| hypothetical protein KK1_049600 [Cajanus cajan]         95   3e-22
gb|KYP72170.1| hypothetical protein KK1_004754 [Cajanus cajan] >...    93   8e-22
gb|KYP45646.1| hypothetical protein KK1_032760, partial [Cajanus...    94   1e-21
gb|KYP61342.1| Retrovirus-related Pol polyprotein from transposo...    98   2e-21
gb|KYP47861.1| Retrovirus-related Pol polyprotein from transposo...    95   4e-21
gb|KYP35140.1| hypothetical protein KK1_043839 [Cajanus cajan]         94   5e-21
gb|KYP48791.1| hypothetical protein KK1_029520 [Cajanus cajan]         92   9e-21
gb|KYP40244.1| Retrovirus-related Pol polyprotein from transposo...    96   1e-20
gb|KYP46257.1| Retrovirus-related Pol polyprotein from transposo...    96   1e-20
gb|KYP34307.1| Retrovirus-related Pol polyprotein from transposo...    96   2e-20
ref|XP_020233181.1| uncharacterized protein LOC109813405 [Cajanu...    95   2e-20
ref|XP_020206509.1| uncharacterized protein LOC109791608 [Cajanu...    94   6e-20
gb|KYP45672.1| hypothetical protein KK1_032786 [Cajanus cajan]         93   8e-20
gb|KYP43730.1| hypothetical protein KK1_034810, partial [Cajanus...    92   2e-19
ref|XP_020204897.1| uncharacterized protein LOC109790192 [Cajanu...    91   5e-19
dbj|GAU35317.1| hypothetical protein TSUD_389420 [Trifolium subt...    89   1e-18
gb|KHN15272.1| hypothetical protein glysoja_044267, partial [Gly...    87   1e-18
ref|XP_014627013.1| PREDICTED: uncharacterized protein LOC106797...    88   3e-18

>gb|KHN49021.1| hypothetical protein glysoja_031232, partial [Glycine soja]
          Length = 323

 Score =  109 bits (272), Expect = 2e-26
 Identities = 56/115 (48%), Positives = 70/115 (60%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           D  NYLVWLQQ+EP L AH L H +CV  EIPP++ SE D  A  ENPA++ WE      
Sbjct: 24  DATNYLVWLQQIEPVLRAHRL-HRFCVTPEIPPQYASEHDRLANIENPAFSNWELQDQLL 82

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFTTRR 347
                          VI C+H +QLW+ +HQ+F SKTKA ARQL T+LR  TT++
Sbjct: 83  LAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARQLRTQLR--TTKK 135


>gb|KHN21230.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Glycine soja]
          Length = 1429

 Score =  109 bits (272), Expect = 3e-25
 Identities = 56/115 (48%), Positives = 70/115 (60%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           D  NYLVWLQQ+EP L AH L H +CV  EIPP++ SE D  A  ENPA++ WE      
Sbjct: 11  DATNYLVWLQQIEPVLRAHRL-HRFCVTPEIPPQYASEHDRLANIENPAFSNWELQDQLL 69

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFTTRR 347
                          VI C+H +QLW+ +HQ+F SKTKA ARQL T+LR  TT++
Sbjct: 70  LAWLQSSLSPAILPSVIGCKHTFQLWENIHQSFQSKTKAQARQLRTQLR--TTKK 122


>gb|KYP30925.1| hypothetical protein KK1_049600 [Cajanus cajan]
          Length = 166

 Score = 94.7 bits (234), Expect = 3e-22
 Identities = 48/112 (42%), Positives = 61/112 (54%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           D KNYL+W QQ+EP +  H L HHY VN +IP +F + A+ DAG  + +Y AWE      
Sbjct: 45  DTKNYLLWCQQVEPVIKGHRL-HHYLVNPQIPQKFATLANRDAGRISESYLAWEQQDQLL 103

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFT 338
                          VI C+ ++QLWD +H  FHS   A ARQL  ELR  T
Sbjct: 104 LSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFHSHMNAKARQLRNELRSTT 155


>gb|KYP72170.1| hypothetical protein KK1_004754 [Cajanus cajan]
 gb|KYP72182.1| hypothetical protein KK1_004768 [Cajanus cajan]
          Length = 146

 Score = 93.2 bits (230), Expect = 8e-22
 Identities = 47/116 (40%), Positives = 62/116 (53%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           D+ N+L+W QQ+EP + AH L  H+ V  + P RFL+E D DAG  NP Y AWE      
Sbjct: 7   DETNFLIWRQQVEPVIKAHRL-QHFVVCPKNPLRFLNETDRDAGKLNPEYIAWEQQDQIL 65

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFTTRRK 350
                          V+   H +Q+WD +H  FH +T+A ARQL TELR  +   K
Sbjct: 66  MLWLQSSLSPTILSRVLGSNHLYQVWDKIHDYFHKQTRARARQLRTELRSTSLEEK 121


>gb|KYP45646.1| hypothetical protein KK1_032760, partial [Cajanus cajan]
          Length = 202

 Score = 94.4 bits (233), Expect = 1e-21
 Identities = 45/112 (40%), Positives = 63/112 (56%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           D KNYL+W QQ+EP +  H L HH+ VN +IPP+FL+ +D D    +  Y AWE      
Sbjct: 16  DTKNYLLWCQQVEPVIKGHRL-HHFLVNPQIPPKFLTISDKDENCVSEEYLAWEQQDQLL 74

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFT 338
                          VI C+ ++Q+WD +H+ FH+ T A ARQL ++LR  T
Sbjct: 75  LSWLQSSMSKDMLTHVIGCKSSFQIWDKIHEYFHAHTNAKARQLRSDLRSTT 126


>gb|KYP61342.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 1358

 Score = 98.2 bits (243), Expect = 2e-21
 Identities = 49/112 (43%), Positives = 65/112 (58%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           DD NYL W QQ+EP + +H L   + VN +IPPR+L++AD D+   NPAY  WE      
Sbjct: 23  DDSNYLHWRQQIEPVIKSHKL-QRFVVNPQIPPRYLTDADRDSDIVNPAYETWEVQDQML 81

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFT 338
                          VI   H++Q+WD VH+ FH++TKA ARQL T+LR  T
Sbjct: 82  LTWLQSTLSKSILSRVIGSVHSYQVWDKVHEYFHTQTKARARQLRTDLRSTT 133


>gb|KYP47861.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Cajanus cajan]
          Length = 281

 Score = 94.7 bits (234), Expect = 4e-21
 Identities = 47/108 (43%), Positives = 63/108 (58%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           DD NYL W QQ+EP + +H L   + VN +IPPR+L++AD D+   NPAY  WE      
Sbjct: 10  DDSNYLYWRQQIEPVIKSHKL-QRFLVNPQIPPRYLTDADRDSDIVNPAYETWEVQDQML 68

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTEL 326
                          VI   H++Q+WD VH+ FH++TKA ARQL T+L
Sbjct: 69  LTWLQSTLSKSILSRVIGSVHSYQVWDKVHEYFHTQTKARARQLCTDL 116


>gb|KYP35140.1| hypothetical protein KK1_043839 [Cajanus cajan]
          Length = 255

 Score = 94.0 bits (232), Expect = 5e-21
 Identities = 47/108 (43%), Positives = 62/108 (57%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           DD NYL W QQ+EP + +H L   + VN +IPPR+L+ AD D+   NPAY  WE      
Sbjct: 23  DDSNYLHWRQQIEPAIKSHKL-QRFVVNPQIPPRYLTNADRDSDIVNPAYETWEVQDQML 81

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTEL 326
                          VI   H++Q+WD VH+ FH++TKA ARQL T+L
Sbjct: 82  LTWLQSTLSKTILSRVIGSVHSYQVWDKVHEYFHTQTKARARQLRTDL 129


>gb|KYP48791.1| hypothetical protein KK1_029520 [Cajanus cajan]
          Length = 189

 Score = 91.7 bits (226), Expect = 9e-21
 Identities = 44/109 (40%), Positives = 60/109 (55%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           DD+N++ W QQ+   + AH L   + VN +IP +FL+  D D+ T NP YT W+      
Sbjct: 25  DDRNFMTWQQQVTAVIRAHDL-ERFVVNPKIPLKFLTAEDRDSNTINPEYTVWDRKDSLL 83

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELR 329
                          V+ CRH++Q+WD V Q+FHS TK  A QLH ELR
Sbjct: 84  FSWLLSTLSESIQAHVVSCRHSYQIWDLVFQHFHSLTKVKAAQLHLELR 132


>gb|KYP40244.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 720

 Score = 95.9 bits (237), Expect = 1e-20
 Identities = 48/112 (42%), Positives = 64/112 (57%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           DD NYL W QQ++P + +H L   + VN +IPPR+L++AD D    NPAY  WE      
Sbjct: 23  DDSNYLHWRQQIKPIIKSHKL-QRFVVNPQIPPRYLTDADRDYDIVNPAYETWEVQDQML 81

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFT 338
                          VI   H++Q+WD VH+ FH++TKA ARQL T+LR  T
Sbjct: 82  LTWLQSMLSKTILSRVIGSVHSYQVWDKVHEYFHTQTKARARQLRTDLRSTT 133


>gb|KYP46257.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 1408

 Score = 95.9 bits (237), Expect = 1e-20
 Identities = 49/112 (43%), Positives = 61/112 (54%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           D KNYL+W QQ+EP +  H L HHY VN +IP +F + AD DAG  + +Y AWE      
Sbjct: 45  DTKNYLLWCQQVEPVIKGHRL-HHYLVNPQIPQKFATLADRDAGHISESYLAWEQQDQLL 103

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFT 338
                          VI C+ ++QLWD +H  FHS   A ARQL  ELR  T
Sbjct: 104 LSWLQSSMSKDMLTRVIGCKSSFQLWDKIHTYFHSHMNAKARQLRNELRSTT 155


>gb|KYP34307.1| Retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Cajanus cajan]
          Length = 1102

 Score = 95.5 bits (236), Expect = 2e-20
 Identities = 48/109 (44%), Positives = 60/109 (55%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           D KNYL+W QQ+EP +  H L HHY VN +IP +F + AD DAG  + +Y AWE      
Sbjct: 45  DTKNYLLWCQQVEPVIKGHRL-HHYLVNPQIPQKFATLADRDAGRISESYLAWEQQDQLL 103

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELR 329
                          VI C+ ++QLWD +H  FHS   A ARQL  ELR
Sbjct: 104 LSWLQSSMSKDMLTRVIGCKSSFQLWDKIHSYFHSHMNAKARQLRNELR 152


>ref|XP_020233181.1| uncharacterized protein LOC109813405 [Cajanus cajan]
          Length = 680

 Score = 95.1 bits (235), Expect = 2e-20
 Identities = 48/112 (42%), Positives = 62/112 (55%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           D KNYL+W QQ+EP +  H L HH+ VN +IPP+FLS  D DA   + AY AWE      
Sbjct: 135 DTKNYLLWCQQVEPVIKGHRL-HHFLVNPQIPPKFLSIFDRDANRISEAYLAWEQQDQLL 193

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFT 338
                          VI C+ ++Q+WD +H  FH+ T A ARQL  +LR  T
Sbjct: 194 LSWLQSSMSKDMLTRVIGCKSSFQIWDKIHAYFHAHTNAKARQLRGDLRGTT 245


>ref|XP_020206509.1| uncharacterized protein LOC109791608 [Cajanus cajan]
          Length = 973

 Score = 94.0 bits (232), Expect = 6e-20
 Identities = 47/108 (43%), Positives = 62/108 (57%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           DD NYL W QQ+EP + +H L   + VN +IPPR+L+ AD D+   NPAY  WE      
Sbjct: 147 DDSNYLHWRQQIEPAIKSHKL-QRFVVNPQIPPRYLTNADRDSDIVNPAYETWEVQDQML 205

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTEL 326
                          VI   H++Q+WD VH+ FH++TKA ARQL T+L
Sbjct: 206 LTWLQSTLSKTILSRVIGSVHSYQVWDKVHEYFHTQTKARARQLRTDL 253


>gb|KYP45672.1| hypothetical protein KK1_032786 [Cajanus cajan]
          Length = 439

 Score = 93.2 bits (230), Expect = 8e-20
 Identities = 45/112 (40%), Positives = 61/112 (54%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           D KNYL+W QQ EP +  H L HH+ VN +IPP+FL+ +D D    +  Y AWE      
Sbjct: 40  DTKNYLLWCQQAEPVIKGHRL-HHFLVNPQIPPKFLTVSDRDENRVSEEYLAWEQQDQLL 98

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFT 338
                          VI C+ ++Q+WD +H  FH+ T A ARQL ++LR  T
Sbjct: 99  LSWLQSSMSKDMLTRVIGCKSSFQIWDKIHAYFHAHTNAKARQLRSDLRSTT 150


>gb|KYP43730.1| hypothetical protein KK1_034810, partial [Cajanus cajan]
          Length = 363

 Score = 91.7 bits (226), Expect = 2e-19
 Identities = 46/109 (42%), Positives = 59/109 (54%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           D KNYL+W QQ++P +  H L HH+ VN +IP +FL+ AD D G  +  Y AWE      
Sbjct: 29  DTKNYLLWCQQVKPVIKGHRL-HHFLVNPQIPQKFLNLADRDVGRISEPYLAWEQQDQLL 87

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELR 329
                          VI C+ ++QLWD +H  FHS   A ARQL  ELR
Sbjct: 88  LSWLQSSMSKDMLTRVIGCKTSFQLWDKIHSYFHSHMNAKARQLRNELR 136


>ref|XP_020204897.1| uncharacterized protein LOC109790192 [Cajanus cajan]
          Length = 385

 Score = 90.5 bits (223), Expect = 5e-19
 Identities = 45/108 (41%), Positives = 61/108 (56%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           DD NYL W QQ+EP + +H L   + VN + PP++L+ AD D+   NPAY  WE      
Sbjct: 23  DDSNYLHWRQQIEPVIKSHKL-QRFVVNPQSPPQYLTNADRDSDIVNPAYETWEVQDQML 81

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTEL 326
                          VI   H++Q+WD VH+ FH++TKA ARQL T+L
Sbjct: 82  LTWLQSTLSKTILSHVIGSVHSYQVWDKVHEYFHTQTKACARQLRTDL 129


>dbj|GAU35317.1| hypothetical protein TSUD_389420 [Trifolium subterraneum]
          Length = 346

 Score = 89.4 bits (220), Expect = 1e-18
 Identities = 43/112 (38%), Positives = 62/112 (55%)
 Frame = +3

Query: 3   DDKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXX 182
           DD NYL W QQ+E  L    ++  Y V+ +IPP +LS+AD ++G+ENP YT WE      
Sbjct: 36  DDSNYLQWKQQVEGVLRGTKMVK-YVVSPQIPPVYLSDADRESGSENPLYTEWEEQDSLL 94

Query: 183 XXXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFT 338
                           +R RH+WQ+W+ VH   H++ +  +RQL +ELR  T
Sbjct: 95  CTWILSTISSSLLSRFVRLRHSWQVWEEVHSYCHTQMRTCSRQLRSELRSIT 146


>gb|KHN15272.1| hypothetical protein glysoja_044267, partial [Glycine soja]
          Length = 212

 Score = 86.7 bits (213), Expect = 1e-18
 Identities = 43/115 (37%), Positives = 57/115 (49%)
 Frame = +3

Query: 6   DKNYLVWLQQMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXXX 185
           + NYL+W +Q+EP L  H L HH+ VN  IP +F + AD D G  +P Y AWE       
Sbjct: 21  NSNYLLWCKQVEPVLKGHRL-HHFLVNPTIPSQFRTLADRDLGISSPKYLAWESQDQLLL 79

Query: 186 XXXXXXXXXXXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFTTRRK 350
                         ++ CR +W LWD  H +F S  +A  RQL TE R    + K
Sbjct: 80  SWLQSMISPEALPRLLGCRTSWYLWDSYHSHFCSSIRAKTRQLRTEFRNMQLQNK 134


>ref|XP_014627013.1| PREDICTED: uncharacterized protein LOC106797270 [Glycine max]
          Length = 329

 Score = 87.8 bits (216), Expect = 3e-18
 Identities = 46/105 (43%), Positives = 61/105 (58%)
 Frame = +3

Query: 33  QMEPFLHAHHLLHHYCVNLEIPPRFLSEADHDAGTENPAYTAWEXXXXXXXXXXXXXXXX 212
           ++EP L AH L H +CV  EIPP++ SE D  A  EN A++ WE                
Sbjct: 12  KIEPVLRAHRL-HRFCVTPEIPPQYASEHDRLANIENSAFSNWELQDQFFLAWLQSSLSP 70

Query: 213 XXXXXVIRCRHAWQLWD*VHQNFHSKTKASARQLHTELRKFTTRR 347
                VI C+H +QLW+ +HQ+F SKTKA ARQL T+LR  TT++
Sbjct: 71  AILPSVIGCKHTFQLWENIHQSFQSKTKAQARQLRTQLR--TTKK 113


Top