BLASTX nr result

ID: Astragalus24_contig00026048 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus24_contig00026048
         (489 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU14292.1| hypothetical protein TSUD_308520 [Trifolium subt...   116   1e-28
gb|PNX79078.1| ribonuclease H, partial [Trifolium pratense]           115   4e-27
dbj|GAU24540.1| hypothetical protein TSUD_156530 [Trifolium subt...   111   3e-25
gb|KYP57109.1| Putative ribonuclease H protein At1g65750 family ...   108   6e-25
gb|PNX71251.1| ribonuclease H, partial [Trifolium pratense]           105   1e-23
gb|PNY06444.1| ribonuclease H [Trifolium pratense]                    106   1e-23
gb|KYP72297.1| Putative ribonuclease H protein At1g65750 family ...    97   3e-23
dbj|GAU48967.1| hypothetical protein TSUD_188140 [Trifolium subt...   104   5e-23
gb|KYP35957.1| Putative ribonuclease H protein At1g65750 family,...   100   1e-22
gb|PNX60840.1| hypothetical protein L195_g052142, partial [Trifo...    99   4e-22
gb|PNX74924.1| ribonuclease H, partial [Trifolium pratense]           100   2e-21
gb|ABN08764.1| hypothetical protein MtrDRAFT_AC160012g33v2 [Medi...    91   1e-20
gb|PNY14872.1| ribonuclease H, partial [Trifolium pratense]            97   3e-20
ref|XP_020219669.1| uncharacterized protein LOC109802663 [Cajanu...    97   4e-20
dbj|GAU43217.1| hypothetical protein TSUD_301040 [Trifolium subt...    96   4e-20
dbj|GAU50504.1| hypothetical protein TSUD_409790 [Trifolium subt...    96   9e-20
dbj|GAU23316.1| hypothetical protein TSUD_237700 [Trifolium subt...    94   2e-19
dbj|GAU51472.1| hypothetical protein TSUD_95870 [Trifolium subte...    93   8e-19
dbj|GAU14768.1| hypothetical protein TSUD_204170 [Trifolium subt...    92   9e-19
gb|PNX57276.1| hypothetical protein L195_g058611, partial [Trifo...    87   2e-18

>dbj|GAU14292.1| hypothetical protein TSUD_308520 [Trifolium subterraneum]
          Length = 292

 Score =  116 bits (291), Expect = 1e-28
 Identities = 53/123 (43%), Positives = 72/123 (58%)
 Frame = -3

Query: 463 RSMMCIVVELGNFQISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGD 284
           +  +C  V     Q +  QP  VD+  D W W+ S  G Y+VK AY WL     IN   +
Sbjct: 25  KEKLCSFVPFVAIQDTQLQPYMVDDLPDVWTWHNSSVGLYTVKEAYEWLLPPLPINNQVN 84

Query: 283 WKWIWSLKIPANIQIFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYCN 104
           WKWIW L++P+NIQ FIWQ+ H SIPT+ VL  R + +++ C  C    ET+ HCLF+C 
Sbjct: 85  WKWIWQLQLPSNIQFFIWQMLHNSIPTREVLHHRWICVSNICPRCSTMVETIEHCLFWCV 144

Query: 103 RAL 95
            A+
Sbjct: 145 EAV 147


>gb|PNX79078.1| ribonuclease H, partial [Trifolium pratense]
          Length = 548

 Score =  115 bits (289), Expect = 4e-27
 Identities = 51/112 (45%), Positives = 71/112 (63%)
 Frame = -3

Query: 430 NFQISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGDWKWIWSLKIPA 251
           +F  ++ +P  VD+  D WVW+ S  G YS K AY WL     IN + +WKWIW+LK+PA
Sbjct: 264 DFVKNSIRPYIVDDIPDVWVWHNSSTGIYSAKDAYDWLLKPTPINNHTNWKWIWNLKVPA 323

Query: 250 NIQIFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYCNRAL 95
           +IQ F+WQ+ H SIPT+ VL  R +  ++ C  C    ET++HCLF C  A+
Sbjct: 324 SIQFFVWQVVHGSIPTREVLNHRQVCASNLCPRCSAMPETIVHCLFACVEAI 375


>dbj|GAU24540.1| hypothetical protein TSUD_156530 [Trifolium subterraneum]
          Length = 1147

 Score =  111 bits (277), Expect = 3e-25
 Identities = 50/105 (47%), Positives = 65/105 (61%)
 Frame = -3

Query: 409  QPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGDWKWIWSLKIPANIQIFIW 230
            QP  V++  D WVW  S  G Y+ K AY WL     IN + +WKWIW LK+PANIQ F+W
Sbjct: 784  QPYIVNDILDVWVWQNSSNGVYTTKDAYEWLLNPLPINSHLNWKWIWQLKLPANIQFFVW 843

Query: 229  QLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYCNRAL 95
            Q+ H SIPTK +L  R +  ++ C  C    ET++HCLF C  A+
Sbjct: 844  QILHGSIPTKYILNRRRVCASNICPRCTAMPETIVHCLFACTDAI 888


>gb|KYP57109.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 365

 Score =  108 bits (269), Expect = 6e-25
 Identities = 48/102 (47%), Positives = 63/102 (61%), Gaps = 1/102 (0%)
 Frame = -3

Query: 403 MFVDNGNDCWVWNQSMCGTYSVKSAYSWL-RMQNEINLNGDWKWIWSLKIPANIQIFIWQ 227
           M  +N  D W W   + G YSVKSAY+WL R Q++     +W WIW   +PA+IQ F+WQ
Sbjct: 1   MSSENVRDVWTWKTELTGLYSVKSAYTWLCRQQSDQLEETNWAWIWRTCLPASIQFFLWQ 60

Query: 226 LCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYCNR 101
           +CH ++PT+  L  RG+  NS C  C   +ETLLHCL  C R
Sbjct: 61  ICHEALPTRETLVHRGIIDNSNCPMCNAQQETLLHCLLECPR 102


>gb|PNX71251.1| ribonuclease H, partial [Trifolium pratense]
          Length = 384

 Score =  105 bits (261), Expect = 1e-23
 Identities = 48/108 (44%), Positives = 65/108 (60%)
 Frame = -3

Query: 421 ISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGDWKWIWSLKIPANIQ 242
           I + +P  V +  D WVW  +  G YS K AY WL     IN + +WKWIW +++P+NIQ
Sbjct: 19  ILSLKPRVVRDLPDVWVWKHASTGIYSPKEAYDWLLKPQPINNHSNWKWIWQVRLPSNIQ 78

Query: 241 IFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYCNRA 98
            F+WQ+ H SI TK VL  R +  ++ C  C +  ET+LHCLF C  A
Sbjct: 79  FFVWQVLHNSILTKDVLHHRRICDSNVCPRCLDAFETILHCLFDCTEA 126


>gb|PNY06444.1| ribonuclease H [Trifolium pratense]
          Length = 547

 Score =  106 bits (264), Expect = 1e-23
 Identities = 46/105 (43%), Positives = 61/105 (58%)
 Frame = -3

Query: 421 ISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGDWKWIWSLKIPANIQ 242
           I   QP  V N  D W W+ +  G YSVK AY+WLR    +  + +W+WIW L +PA IQ
Sbjct: 180 IEALQPRLVSNLPDVWTWSNATSGVYSVKDAYNWLRKPTPLQDHVNWQWIWQLPVPATIQ 239

Query: 241 IFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYC 107
            F+WQ+ H SIP + VL  R +   S C  C +  E++ HCLF C
Sbjct: 240 FFVWQIIHDSIPVREVLQHRHVCATSVCPRCTSTAESIEHCLFSC 284


>gb|KYP72297.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 106

 Score = 97.4 bits (241), Expect = 3e-23
 Identities = 41/97 (42%), Positives = 57/97 (58%), Gaps = 1/97 (1%)
 Frame = -3

Query: 394 DNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNG-DWKWIWSLKIPANIQIFIWQLCH 218
           +N  D W W   + G YSVKS Y+ L  +  + L+  DW WIW   +P +IQ F+WQ+C+
Sbjct: 9   ENVRDVWTWETELAGLYSVKSTYTSLFQRQTVQLDETDWVWIWQTCLPTSIQFFLWQICY 68

Query: 217 FSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYC 107
            ++PTK  L  R +  N+ C  C  H ETL+HCL  C
Sbjct: 69  EALPTKEALVHRSIIDNNNCLVCNVHHETLMHCLLDC 105


>dbj|GAU48967.1| hypothetical protein TSUD_188140 [Trifolium subterraneum]
          Length = 621

 Score =  104 bits (260), Expect = 5e-23
 Identities = 48/107 (44%), Positives = 64/107 (59%), Gaps = 3/107 (2%)
 Frame = -3

Query: 418 SNFQ---PMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGDWKWIWSLKIPAN 248
           S+FQ   P  V++  D W WN    G Y+ K AY WL   + +N N  W+WIW L++PA+
Sbjct: 376 SDFQFAIPCIVNDLPDIWTWNNPSSGIYTAKDAYRWLLEPSPVNNNNGWQWIWRLQLPAS 435

Query: 247 IQIFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYC 107
           IQ F+WQ  H SIPTK VL  R + +++ C  C    ET+ HCLF C
Sbjct: 436 IQFFVWQFLHESIPTKDVLHHRQVCISNLCPRCLLDTETIDHCLFLC 482


>gb|KYP35957.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
           cajan]
          Length = 307

 Score =  100 bits (250), Expect = 1e-22
 Identities = 44/107 (41%), Positives = 63/107 (58%), Gaps = 2/107 (1%)
 Frame = -3

Query: 421 ISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEI--NLNGDWKWIWSLKIPAN 248
           I +  P  V    D W+W+ +  GTY+V+ AY WL +Q+ +  ++ G WKWIW L +  N
Sbjct: 126 IISLSPRLVQGAIDVWIWSANTSGTYTVRKAYQWL-LQSSLTWSVEGSWKWIWQLPLSKN 184

Query: 247 IQIFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYC 107
           IQ F+W++CH SIPT+  LA R +     C  C    E++ HCLF C
Sbjct: 185 IQFFLWEVCHNSIPTRGTLAQRHITNVDICPRCHEVVESIYHCLFEC 231


>gb|PNX60840.1| hypothetical protein L195_g052142, partial [Trifolium pratense]
          Length = 273

 Score = 99.0 bits (245), Expect = 4e-22
 Identities = 46/105 (43%), Positives = 62/105 (59%)
 Frame = -3

Query: 421 ISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGDWKWIWSLKIPANIQ 242
           I + QP  V++  D W W+ S  G YS K AY WL     I  + +W+WIW L++PANIQ
Sbjct: 70  IKSLQPRMVNDLPDVWTWHHSSSGVYSTKDAYRWLLNPLPIVNHLNWQWIWQLQLPANIQ 129

Query: 241 IFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYC 107
            F+WQ  H SIPTK +L  R +  ++ C  C    E++ HCLF C
Sbjct: 130 FFVWQKLHRSIPTKEILHHRQVCNSNLCPRCLVAVESIEHCLFSC 174


>gb|PNX74924.1| ribonuclease H, partial [Trifolium pratense]
          Length = 593

 Score =  100 bits (248), Expect = 2e-21
 Identities = 46/103 (44%), Positives = 60/103 (58%)
 Frame = -3

Query: 421 ISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGDWKWIWSLKIPANIQ 242
           I++ QP  V++  D W W+ S  G Y+ K AY WL   +  N    W+WIW L +PANIQ
Sbjct: 386 ITSLQPCIVNDLPDIWTWHDSSSGIYTAKDAYCWLLEPSPCNNLTGWQWIWRLHLPANIQ 445

Query: 241 IFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLF 113
            FIWQL H SIPT ++L  R +  +  C  C    ET+ HCLF
Sbjct: 446 FFIWQLVHESIPTNAMLHHRQVCTSDLCHRCSLSSETIDHCLF 488


>gb|ABN08764.1| hypothetical protein MtrDRAFT_AC160012g33v2 [Medicago truncatula]
          Length = 124

 Score = 91.3 bits (225), Expect = 1e-20
 Identities = 42/97 (43%), Positives = 56/97 (57%), Gaps = 2/97 (2%)
 Frame = -3

Query: 382 DCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGD--WKWIWSLKIPANIQIFIWQLCHFSI 209
           D WVW     G YS++  Y WL M    +L G+  W WIW L +P+NI+ F+WQLCH S+
Sbjct: 6   DVWVWTNRTSGIYSIQEGYQWL-MGAHTSLLGEESWNWIWHLCVPSNIRFFLWQLCHDSV 64

Query: 208 PTKSVLATRGLGLNSACCFCGNHKETLLHCLFYCNRA 98
           P +SVL +R     + C  C    E +LH L+ C RA
Sbjct: 65  PFRSVLLSR-----NVCSICNQGSEDMLHALYPCPRA 96


>gb|PNY14872.1| ribonuclease H, partial [Trifolium pratense]
          Length = 1396

 Score = 97.1 bits (240), Expect = 3e-20
 Identities = 44/94 (46%), Positives = 60/94 (63%)
 Frame = -3

Query: 463  RSMMCIVVELGNFQISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGD 284
            +++  IV +    +I + QP  V+   D WVW  S  G Y+VK AY+WL   + IN + +
Sbjct: 1302 QNLYTIVPDFARKEIVSLQPRIVNGIPDIWVWQSSSVGIYTVKDAYNWLLEPSGINNHSN 1361

Query: 283  WKWIWSLKIPANIQIFIWQLCHFSIPTKSVLATR 182
            W+WIW L +PA+IQ FIWQL H SIPTK+VL  R
Sbjct: 1362 WQWIWRLTLPASIQFFIWQLAHDSIPTKAVLHHR 1395


>ref|XP_020219669.1| uncharacterized protein LOC109802663 [Cajanus cajan]
          Length = 1049

 Score = 96.7 bits (239), Expect = 4e-20
 Identities = 41/111 (36%), Positives = 59/111 (53%)
 Frame = -3

Query: 439  ELGNFQISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGDWKWIWSLK 260
            E    +IS   P  V    D W W   + G Y+V+SAY WL   + +  +  W WIW L 
Sbjct: 708  ETFRLKISCLTPHLVVGVPDLWTWAPDIFGKYTVRSAYRWLIEDHGLETSAGWNWIWKLS 767

Query: 259  IPANIQIFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYC 107
            +PA++  F+WQ CH ++P K VL  RG+  ++ C  C +  E+  HC F C
Sbjct: 768  LPASLCFFLWQSCHLALPIKEVLCQRGILASNTCPKCNSAVESFEHCFFTC 818


>dbj|GAU43217.1| hypothetical protein TSUD_301040 [Trifolium subterraneum]
          Length = 565

 Score = 96.3 bits (238), Expect = 4e-20
 Identities = 47/105 (44%), Positives = 54/105 (51%)
 Frame = -3

Query: 421 ISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGDWKWIWSLKIPANIQ 242
           I + Q   V +  D W W     G YS K AY WL     IN    W WIW L IPANIQ
Sbjct: 266 IKSLQQCIVSDLPDIWTWQNDNTGIYSTKDAYIWLLDPMHINNLTGWHWIWQLCIPANIQ 325

Query: 241 IFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYC 107
            F+WQL H SIPT++ L  R +     C  C    ET+ HCLF C
Sbjct: 326 FFLWQLVHESIPTRAFLHHRHVCSTDLCPRCSAAAETIDHCLFLC 370


>dbj|GAU50504.1| hypothetical protein TSUD_409790 [Trifolium subterraneum]
          Length = 902

 Score = 95.5 bits (236), Expect = 9e-20
 Identities = 43/110 (39%), Positives = 66/110 (60%), Gaps = 4/110 (3%)
 Frame = -3

Query: 421  ISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWL-RMQNEI---NLNGDWKWIWSLKIP 254
            I+N    F D+ +D ++W  +  G+Y+ KS Y+WL  +QN +   N +  W WIW L++P
Sbjct: 713  INNIHIKFNDSIDDAFIWTSNKNGSYTTKSGYNWLLSLQNLVTPHNPSLSWSWIWKLQLP 772

Query: 253  ANIQIFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYCN 104
              I+ F W  CH S+PT S+L  R + L++ C  CG  +ET LHC+  C+
Sbjct: 773  EKIKFFFWLACHNSVPTLSLLNHRKMNLSATCARCGLREETFLHCVRDCD 822


>dbj|GAU23316.1| hypothetical protein TSUD_237700 [Trifolium subterraneum]
          Length = 418

 Score = 94.0 bits (232), Expect = 2e-19
 Identities = 41/110 (37%), Positives = 67/110 (60%), Gaps = 4/110 (3%)
 Frame = -3

Query: 421 ISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWL-RMQNEI---NLNGDWKWIWSLKIP 254
           I+N    F D+ ++ ++W  +  G+Y+ KS ++WL  +QN +   N +  W WIW L++P
Sbjct: 77  INNIHIKFNDSIDNAFIWTSNKNGSYTTKSGFNWLFSLQNPVTPHNPSFSWSWIWKLQLP 136

Query: 253 ANIQIFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYCN 104
             I+ F W +CH S+PT S+L  R + L++ C  CG  +ET LHC+  C+
Sbjct: 137 EKIKFFFWLVCHNSVPTLSLLDHRKMNLSATCARCGLREETFLHCVRDCD 186


>dbj|GAU51472.1| hypothetical protein TSUD_95870 [Trifolium subterraneum]
          Length = 1682

 Score = 92.8 bits (229), Expect = 8e-19
 Identities = 45/106 (42%), Positives = 60/106 (56%), Gaps = 1/106 (0%)
 Frame = -3

Query: 421  ISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNG-DWKWIWSLKIPANI 245
            ISN+  +      DC+ WN S+ G Y+  S YSWL  + + N N   WKWIW+L+ P  I
Sbjct: 1537 ISNYHLILNIGVPDCFTWNDSLDGVYTSSSGYSWLLKKKQYNPNNKSWKWIWNLRAPEKI 1596

Query: 244  QIFIWQLCHFSIPTKSVLATRGLGLNSACCFCGNHKETLLHCLFYC 107
            + FIW + H SIPT  +L  R L  ++ C  C  + ET LHCL  C
Sbjct: 1597 KFFIWCISHNSIPTLDMLHHRHLAQDNICFRCLANVETTLHCLRDC 1642


>dbj|GAU14768.1| hypothetical protein TSUD_204170 [Trifolium subterraneum]
          Length = 503

 Score = 92.4 bits (228), Expect = 9e-19
 Identities = 42/93 (45%), Positives = 55/93 (59%), Gaps = 1/93 (1%)
 Frame = -3

Query: 382 DCWVWNQSMCGTYSVKSAYSWLRMQNEINLNG-DWKWIWSLKIPANIQIFIWQLCHFSIP 206
           DC+ WN S+ G Y+  S YSWL  + + N N   WKWIW+L+ P  I+  IW +CH SIP
Sbjct: 243 DCFTWNGSLDGVYTASSGYSWLLKKKQHNPNNKSWKWIWNLRAPEKIKFLIWCICHNSIP 302

Query: 205 TKSVLATRGLGLNSACCFCGNHKETLLHCLFYC 107
           T  +L  R L  ++ C  C  + ET LHCL  C
Sbjct: 303 TLDMLHHRHLAQDNICSRCLANVETTLHCLRDC 335


>gb|PNX57276.1| hypothetical protein L195_g058611, partial [Trifolium pratense]
          Length = 180

 Score = 87.4 bits (215), Expect = 2e-18
 Identities = 39/77 (50%), Positives = 46/77 (59%)
 Frame = -3

Query: 421 ISNFQPMFVDNGNDCWVWNQSMCGTYSVKSAYSWLRMQNEINLNGDWKWIWSLKIPANIQ 242
           I + QP  V +  D W W+ +  G YS K AY WL     IN    WKWIW L IPANIQ
Sbjct: 103 IKSLQPCIVSDLPDIWTWHNASSGVYSTKDAYIWLLDPMHINNLTGWKWIWKLCIPANIQ 162

Query: 241 IFIWQLCHFSIPTKSVL 191
            F+WQL H SIPT+ +L
Sbjct: 163 FFLWQLVHESIPTREIL 179


Top