BLASTX nr result

ID: Astragalus23_contig00029657

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus23_contig00029657
         (340 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|PNX79728.1| hypothetical protein L195_g035716 [Trifolium prat...    79   4e-15
gb|PNY05487.1| integrase catalytic region [Trifolium pratense]         76   4e-14
ref|XP_013459804.1| hypothetical protein MTR_3g052072 [Medicago ...    72   6e-13
gb|PNX94461.1| retrovirus-related Pol polyprotein from transposo...    73   1e-12
gb|KYP66985.1| hypothetical protein KK1_013301 [Cajanus cajan]         72   2e-12
gb|PNX76805.1| hypothetical protein L195_g032764 [Trifolium prat...    72   2e-12
gb|KYP56862.1| hypothetical protein KK1_003111, partial [Cajanus...    69   4e-12
dbj|GAU31058.1| hypothetical protein TSUD_214940 [Trifolium subt...    71   5e-12
dbj|GAU51530.1| hypothetical protein TSUD_413950 [Trifolium subt...    70   9e-12
dbj|GAU36337.1| hypothetical protein TSUD_321760 [Trifolium subt...    70   9e-12
gb|PNX97245.1| hypothetical protein L195_g020471, partial [Trifo...    70   1e-11
dbj|GAU45704.1| hypothetical protein TSUD_86800 [Trifolium subte...    70   2e-11
dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subt...    70   2e-11
gb|PNX92077.1| retrovirus-related Pol polyprotein from transposo...    69   3e-11
ref|XP_019190428.1| PREDICTED: uncharacterized protein LOC109184...    69   4e-11
gb|PNX77860.1| retrovirus-related Pol polyprotein from transposo...    69   4e-11
gb|ABE88096.1| Integrase, catalytic region [Medicago truncatula]       69   4e-11
gb|PNX72611.1| peptide transporter PTR2 [Trifolium pratense]           69   4e-11
ref|XP_019164275.1| PREDICTED: uncharacterized protein LOC109160...    69   4e-11
ref|XP_019184395.1| PREDICTED: uncharacterized protein LOC109179...    68   6e-11

>gb|PNX79728.1| hypothetical protein L195_g035716 [Trifolium pratense]
          Length = 272

 Score = 78.6 bits (192), Expect = 4e-15
 Identities = 40/115 (34%), Positives = 65/115 (56%), Gaps = 6/115 (5%)
 Frame = -3

Query: 338 ADGKKPFNRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKSASNITDEP 159
           +D ++P   G+++F    NK   K C+YC +  H VE CYK+HG PP++ ++ +      
Sbjct: 161 SDSRRPLGCGRSSFNPQFNK--KKYCTYCGKDNHVVENCYKKHGFPPNFGRNINANNVNA 218

Query: 158 EEQTENDDSKTV------GISKDQYDKLVNMIQGITTLNSKAVNMIAGPSPSLTG 12
           E+  +NDD+++         +K QY+KLVN++Q   +LNS      AGPS  + G
Sbjct: 219 EDSMDNDDARSTKGTDSFTFTKSQYEKLVNLLQSNASLNS------AGPSTHING 267


>gb|PNY05487.1| integrase catalytic region [Trifolium pratense]
          Length = 272

 Score = 75.9 bits (185), Expect = 4e-14
 Identities = 40/115 (34%), Positives = 65/115 (56%), Gaps = 8/115 (6%)
 Frame = -3

Query: 338 ADGKKPFNRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKSASNI--TD 165
           +D +KPFNRGK+       K  ++ C++C++ GHTV+ CYK+HG P     +  N+  +D
Sbjct: 101 SDARKPFNRGKSLMNSGKGKGDTRHCTFCDKNGHTVDWCYKKHGNPNIRSNTGVNLVNSD 160

Query: 164 EPEEQTENDDSKTV------GISKDQYDKLVNMIQGITTLNSKAVNMIAGPSPSL 18
             +  T N +S  V       IS+++YD+LVN++Q          N+IA  SP++
Sbjct: 161 NVDSTTANGNSDLVASSSDANISQEKYDQLVNLLQ--------QANLIASASPTV 207


>ref|XP_013459804.1| hypothetical protein MTR_3g052072 [Medicago truncatula]
 gb|KEH33835.1| hypothetical protein MTR_3g052072 [Medicago truncatula]
          Length = 206

 Score = 71.6 bits (174), Expect = 6e-13
 Identities = 36/99 (36%), Positives = 55/99 (55%), Gaps = 13/99 (13%)
 Frame = -3

Query: 335 DGKKPFNRGKAN-FQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKSASN----- 174
           DG++ F  G+ N FQG G +   +VCS+C R  HTV+TCYK+HG PP+W +   N     
Sbjct: 28  DGQRQFGIGRGNGFQGRG-RGNLRVCSFCNRTNHTVKTCYKKHGYPPNWGRGGGNSFANA 86

Query: 173 --ITDEPEE-----QTENDDSKTVGISKDQYDKLVNMIQ 78
             +  E  E      T  +D   + ++KDQY  L+ +++
Sbjct: 87  NFVESEETELKGNASTGKNDENGIMLTKDQYQNLIALLE 125


>gb|PNX94461.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 1000

 Score = 72.8 bits (177), Expect = 1e-12
 Identities = 38/112 (33%), Positives = 64/112 (57%), Gaps = 8/112 (7%)
 Frame = -3

Query: 338 ADGKKPFNRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKSASNI--TD 165
           +D +KPFNRGK+       K  ++ C++C++ GHTV+ CYK+HG P     +  N+  +D
Sbjct: 101 SDARKPFNRGKSLMNSGKGKGDTRHCTFCDKNGHTVDWCYKKHGNPNIRSNTGVNLVNSD 160

Query: 164 EPEEQTENDDSKTV------GISKDQYDKLVNMIQGITTLNSKAVNMIAGPS 27
             +  T N +S  V       IS+++YD+L+N++Q    + S +  +  GPS
Sbjct: 161 NVDSTTANGNSDLVASSSGTNISQEKYDQLMNLLQQTNLIPSASPTV--GPS 210


>gb|KYP66985.1| hypothetical protein KK1_013301 [Cajanus cajan]
          Length = 341

 Score = 72.0 bits (175), Expect = 2e-12
 Identities = 35/86 (40%), Positives = 53/86 (61%), Gaps = 10/86 (11%)
 Frame = -3

Query: 335 DGKKPFNRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKSASNIT---- 168
           + KK F RG+ +   NG +   K+C+YC + GHTVETCYK+HG PPS+  ++S +     
Sbjct: 53  ENKKSFGRGRGSNFKNGGRGNGKMCTYCGKSGHTVETCYKKHGYPPSFGNNSSYVNNFVM 112

Query: 167 DEPEEQTEN------DDSKTVGISKD 108
           D+ E  T+N      D+S+++  SKD
Sbjct: 113 DDNEGSTDNHSMKDHDESRSMTFSKD 138


>gb|PNX76805.1| hypothetical protein L195_g032764 [Trifolium pratense]
 gb|PNX79761.1| hypothetical protein L195_g035749 [Trifolium pratense]
          Length = 435

 Score = 72.0 bits (175), Expect = 2e-12
 Identities = 40/115 (34%), Positives = 60/115 (52%), Gaps = 6/115 (5%)
 Frame = -3

Query: 338 ADGKKPFNRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKSASNITDEP 159
           +D ++P  RG+        K   K C+YC +  H VE CYK+HG PP++ ++A       
Sbjct: 234 SDSRRPQGRGRGRSNSQFGK--KKYCTYCGKDNHIVENCYKKHGFPPNFGRNAVANNANA 291

Query: 158 EEQTENDD------SKTVGISKDQYDKLVNMIQGITTLNSKAVNMIAGPSPSLTG 12
           EEQ +NDD      +++   +K QY+KLVN++Q      S      AGPS  + G
Sbjct: 292 EEQLDNDDIRSTKGTESFTFTKFQYEKLVNLLQSTPAPQS------AGPSTQVNG 340


>gb|KYP56862.1| hypothetical protein KK1_003111, partial [Cajanus cajan]
          Length = 200

 Score = 69.3 bits (168), Expect = 4e-12
 Identities = 34/80 (42%), Positives = 52/80 (65%), Gaps = 9/80 (11%)
 Frame = -3

Query: 290 NGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKS---ASNITDEPEEQTENDD----- 135
           NGNK    +C YC + GHT+ETCYKRHG PP+WQ++   +SN+  E  E  EN       
Sbjct: 125 NGNK----MCIYCGKSGHTIETCYKRHGYPPNWQRNGYGSSNVASETFEYKENASMNEEI 180

Query: 134 -SKTVGISKDQYDKLVNMIQ 78
            ++   ++++QY+KL+++IQ
Sbjct: 181 KAEPPMLTQEQYEKLLSLIQ 200


>dbj|GAU31058.1| hypothetical protein TSUD_214940 [Trifolium subterraneum]
          Length = 927

 Score = 71.2 bits (173), Expect = 5e-12
 Identities = 35/114 (30%), Positives = 61/114 (53%), Gaps = 13/114 (11%)
 Frame = -3

Query: 329 KKPFNRGKANFQGNG------NKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKSASN-- 174
           ++ + RG+ NF   G      N  T+KVCSYC + GHT++ CYK+HG PP+W  +  N  
Sbjct: 265 RRGYGRGRGNFSYQGGRGRGNNSNTAKVCSYCGKNGHTIDICYKKHGYPPNWGYTRGNNG 324

Query: 173 ----ITDEPEEQTENDDSKTVGISKDQYDKLVNMIQGITTLNSK-AVNMIAGPS 27
               + +   +  +   +  V ++KDQY+ L+ +++     N + + N + G S
Sbjct: 325 GNSSVNNVEVDHDDEGGNSNVSLTKDQYNSLLALLERNNLDNPQHSTNFVKGES 378


>dbj|GAU51530.1| hypothetical protein TSUD_413950 [Trifolium subterraneum]
          Length = 830

 Score = 70.5 bits (171), Expect = 9e-12
 Identities = 36/116 (31%), Positives = 63/116 (54%), Gaps = 16/116 (13%)
 Frame = -3

Query: 326 KPFNRGKANFQGNG------NKYTSKVCSYCERVGHTVETCYKRHGTPPSW--------- 192
           + + RG+ NF   G      N  T+KVC+YC + GHT++ CYK+HG PP+W         
Sbjct: 266 RDYGRGRGNFSYQGGRGRGNNSNTAKVCTYCGKNGHTIDICYKKHGYPPNWGYTRGNNGG 325

Query: 191 QKSASNITDEPEEQTENDDSKTVGISKDQYDKLVNMIQGITTLNSK-AVNMIAGPS 27
             S +N+  + +++  N +   V ++KDQY+ L+ +++     N + + N + G S
Sbjct: 326 NSSVNNVEADHDDEVGNSN---VSLTKDQYNSLLALLERNNLDNPQHSTNFVKGES 378


>dbj|GAU36337.1| hypothetical protein TSUD_321760 [Trifolium subterraneum]
          Length = 1094

 Score = 70.5 bits (171), Expect = 9e-12
 Identities = 35/111 (31%), Positives = 60/111 (54%), Gaps = 13/111 (11%)
 Frame = -3

Query: 320 FNRGKANFQGNG------NKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKSASN----- 174
           + RG+ NF   G      N  T+KVC+YC + GHT++ CYK+HG PP+W  + SN     
Sbjct: 242 YGRGRGNFSYQGGRGRGNNSNTTKVCTYCGKNGHTIDICYKKHGYPPNWGYTRSNNGGNS 301

Query: 173 -ITDEPEEQTENDDSKTVGISKDQYDKLVNMIQGITTLNSK-AVNMIAGPS 27
            + +   +  +   +  V ++KDQY+ L+ +++     N + + N + G S
Sbjct: 302 SVNNVEADHDDEVGNSNVSLTKDQYNSLLALLERNNLDNPQHSTNFVKGES 352


>gb|PNX97245.1| hypothetical protein L195_g020471, partial [Trifolium pratense]
          Length = 750

 Score = 70.1 bits (170), Expect = 1e-11
 Identities = 44/129 (34%), Positives = 71/129 (55%), Gaps = 27/129 (20%)
 Frame = -3

Query: 338 ADGKKPFNRGK----ANFQGNGNKYTS-----KVCSYCERVGHTVETCYKRHGTPPSWQK 186
           +D ++   RG+    ++F   GN+  S     K CSYC +  H VE CYK+HG PP + +
Sbjct: 77  SDSRRGQGRGRGSSHSSFAQGGNRSNSFSAKNKECSYCGKTNHVVENCYKKHGFPPHYGR 136

Query: 185 S--ASNITDEP-EEQTENDDSKTV---------GISKDQYDKLVNMIQGI------TTLN 60
           S  A+N + E  EE+ + DD+K+V         G +KDQY++L+N++Q          + 
Sbjct: 137 STTANNASLESFEEREDLDDTKSVKGNNSHDAFGFTKDQYNQLLNLVQASNASTSNNAIT 196

Query: 59  SKAVNMIAG 33
           S  VN+++G
Sbjct: 197 SSKVNIVSG 205


>dbj|GAU45704.1| hypothetical protein TSUD_86800 [Trifolium subterraneum]
          Length = 902

 Score = 69.7 bits (169), Expect = 2e-11
 Identities = 42/125 (33%), Positives = 69/125 (55%), Gaps = 14/125 (11%)
 Frame = -3

Query: 338 ADGKKPFNRG-KANFQ---GNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKS---- 183
           AD KKP+ +  K NFQ   G GN++    C+YC+R GHTV+ C+K+HG PP  Q++    
Sbjct: 253 ADSKKPYYKNSKPNFQSFNGKGNRH----CTYCDRQGHTVDGCFKKHGYPPHMQRNFGSV 308

Query: 182 ------ASNITDEPEEQTENDDSKTVGISKDQYDKLVNMIQGITTLNSKAVNMIAGPSPS 21
                  S+   +  E+ E+ +S    +++DQ+D+L+ ++Q      S  +N  +G   S
Sbjct: 309 HNTSTEGSDSQSQQMERGESSNSSPASLTQDQFDQLMLLLQ------SSGMNQSSG---S 359

Query: 20  LTGHE 6
            T H+
Sbjct: 360 QTSHQ 364


>dbj|GAU46782.1| hypothetical protein TSUD_351810 [Trifolium subterraneum]
          Length = 1512

 Score = 69.7 bits (169), Expect = 2e-11
 Identities = 39/122 (31%), Positives = 65/122 (53%), Gaps = 13/122 (10%)
 Frame = -3

Query: 338 ADGKKPFNRGKANFQGNGN-----KYTSK--VCSYCERVGHTVETCYKRHGTPPSWQKSA 180
           +D ++   RG++ F    N     +Y +K  VC+YC +  H VE CYK+HG PP + + +
Sbjct: 259 SDSRRSQGRGRSGFNSQYNSGFNPQYNNKKKVCTYCGKENHVVENCYKKHGFPPHYGRGS 318

Query: 179 SNITDEPEEQTENDDSKTV------GISKDQYDKLVNMIQGITTLNSKAVNMIAGPSPSL 18
           +       E  +NDD+++         +K QY++LVN++Q   + +S      AGPS S+
Sbjct: 319 TANNANAGELMDNDDARSTRGSDSFSFTKAQYEQLVNLLQTSASTSS------AGPSTSI 372

Query: 17  TG 12
            G
Sbjct: 373 NG 374


>gb|PNX92077.1| retrovirus-related Pol polyprotein from transposon TNT 1-94
           [Trifolium pratense]
          Length = 422

 Score = 68.9 bits (167), Expect = 3e-11
 Identities = 36/111 (32%), Positives = 62/111 (55%), Gaps = 7/111 (6%)
 Frame = -3

Query: 338 ADGKKPFNRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKSASNITDEP 159
           +D ++ FNRGK+       K  ++ C++C++ GHTV+ CYK+HG P     +  N+ +  
Sbjct: 252 SDARRSFNRGKSPMHSGKGKGDTRHCTFCDKNGHTVDWCYKKHGNPNIRSNTGVNLANSD 311

Query: 158 E-EQTENDDSKTV------GISKDQYDKLVNMIQGITTLNSKAVNMIAGPS 27
             + T N ++  V       IS+++YD+LVN++Q    + S  V+   GPS
Sbjct: 312 NVDSTANGNTDLVASSSGTNISQEKYDQLVNLLQQANLIPS--VSHTVGPS 360


>ref|XP_019190428.1| PREDICTED: uncharacterized protein LOC109184834 [Ipomoea nil]
          Length = 483

 Score = 68.6 bits (166), Expect = 4e-11
 Identities = 34/99 (34%), Positives = 53/99 (53%), Gaps = 3/99 (3%)
 Frame = -3

Query: 311 GKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQ---KSASNITDEPEEQTEN 141
           G  N + N NK  + VCS+C   GHT+E CYK+HG PP ++   K+          Q ++
Sbjct: 315 GNNNRRFNNNKKKTVVCSFCGFTGHTIEKCYKKHGYPPGYRGKGKAGGVANAAQVSQAQD 374

Query: 140 DDSKTVGISKDQYDKLVNMIQGITTLNSKAVNMIAGPSP 24
           D   T G ++DQY+K++ +I      ++   N   GP+P
Sbjct: 375 DTDYTRGFTRDQYEKILYLIGKEGQNSNPTPNFSLGPNP 413


>gb|PNX77860.1| retrovirus-related Pol polyprotein from transposon TNT 1-94,
           partial [Trifolium pratense]
          Length = 581

 Score = 68.6 bits (166), Expect = 4e-11
 Identities = 35/104 (33%), Positives = 61/104 (58%), Gaps = 8/104 (7%)
 Frame = -3

Query: 338 ADGKKPFNRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSW-QKSASNITDE 162
           +D ++   RGK ++ GNG     +VC+YC +  H V+ CYK+HG PP + + +A+N  + 
Sbjct: 259 SDARRGQGRGKGSY-GNGYGSKKRVCTYCGKDNHIVDNCYKKHGFPPGFGRNNATNSVNT 317

Query: 161 PEEQTEND-------DSKTVGISKDQYDKLVNMIQGITTLNSKA 51
            +    N+       D ++ G++K QY+KLVN++Q  T  ++ A
Sbjct: 318 EDSAPANNEDVGNTKDIESFGLTKAQYEKLVNLLQTTTLPSTSA 361


>gb|ABE88096.1| Integrase, catalytic region [Medicago truncatula]
          Length = 604

 Score = 68.6 bits (166), Expect = 4e-11
 Identities = 35/91 (38%), Positives = 54/91 (59%), Gaps = 4/91 (4%)
 Frame = -3

Query: 338 ADGKKPFNRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTP----PSWQKSASNI 171
           +D +KPF RGK N   +  K  SK CS+C +  HT+E CY++HG P     S   +A N 
Sbjct: 178 SDARKPFGRGKLNSGSHPPKNNSKYCSFCHKTNHTLEFCYQKHGFPNANKGSGSTNAVNS 237

Query: 170 TDEPEEQTENDDSKTVGISKDQYDKLVNMIQ 78
              PE Q  +  S+ +G++++QY  LV+++Q
Sbjct: 238 EGVPESQGSSAISQ-IGLTQEQYVHLVSLLQ 267


>gb|PNX72611.1| peptide transporter PTR2 [Trifolium pratense]
          Length = 845

 Score = 68.6 bits (166), Expect = 4e-11
 Identities = 36/121 (29%), Positives = 60/121 (49%), Gaps = 14/121 (11%)
 Frame = -3

Query: 338 ADGKKPFNRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSW-QKSASNITDE 162
           +D ++   RGK  F G    +  K C+YC +  H +E C+K+HG PP++ + +AS     
Sbjct: 168 SDNRRSQGRGKGGFNGQSGPFKKKYCTYCGKDNHVIENCFKKHGFPPNFGRNNASANHFG 227

Query: 161 PEEQTENDDSKTVGIS------KDQYDKLVNMIQG-------ITTLNSKAVNMIAGPSPS 21
            ++  +NDD K++  S      K QY+ LVN++Q        +    S +VN    P   
Sbjct: 228 TDDSMDNDDIKSLKASEPFTFTKSQYEHLVNLLQSHASSSTQVXASTSNSVNTFGHPKSG 287

Query: 20  L 18
           +
Sbjct: 288 I 288


>ref|XP_019164275.1| PREDICTED: uncharacterized protein LOC109160431 [Ipomoea nil]
          Length = 1108

 Score = 68.6 bits (166), Expect = 4e-11
 Identities = 32/86 (37%), Positives = 51/86 (59%), Gaps = 5/86 (5%)
 Frame = -3

Query: 317 NRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSW---QKSASNITDEPEE-- 153
           + G+  F  NG K   K C++C  +GHT+E CYK+HG PPSW    KS +    E ++  
Sbjct: 165 SNGRRKFNNNGGKNVPK-CTFCGMLGHTIEKCYKKHGYPPSWVAVYKSKNKQVQEVQQLS 223

Query: 152 QTENDDSKTVGISKDQYDKLVNMIQG 75
            T  +    +G+S DQ+ +L++++QG
Sbjct: 224 NTSVNQVGDIGLSNDQFQRLLSLLQG 249


>ref|XP_019184395.1| PREDICTED: uncharacterized protein LOC109179345 [Ipomoea nil]
          Length = 524

 Score = 68.2 bits (165), Expect = 6e-11
 Identities = 31/92 (33%), Positives = 48/92 (52%)
 Frame = -3

Query: 317 NRGKANFQGNGNKYTSKVCSYCERVGHTVETCYKRHGTPPSWQKSASNITDEPEEQTEND 138
           N G   F  NGNK    VC++C   GHT E CYK+HG PP W+  + N     + Q  + 
Sbjct: 284 NSGSKKFYSNGNK--KPVCTHCGFTGHTAEKCYKKHGYPPGWRPRSKNAGATNQVQLISQ 341

Query: 137 DSKTVGISKDQYDKLVNMIQGITTLNSKAVNM 42
              T+ +S+ +Y  L  ++Q   T+ +  ++M
Sbjct: 342 PEDTISLSQSEYMMLKQLLQKENTIQNSPLDM 373

