BLASTX nr result

ID: Astragalus22_contig00020069

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00020069
         (1623 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007146655.1| hypothetical protein PHAVU_006G058500g [Phas...    83   7e-14
gb|KYP57461.1| Putative ribonuclease H protein At1g65750 family ...    81   2e-13
gb|KYP67585.1| Putative ribonuclease H protein At1g65750 family ...    82   4e-13
gb|KYP65942.1| Putative ribonuclease H protein At1g65750 family ...    83   4e-13
dbj|GAU30116.1| hypothetical protein TSUD_360090 [Trifolium subt...    77   1e-12
ref|XP_007158841.1| hypothetical protein PHAVU_002G186500g [Phas...    79   1e-12
gb|EOY31223.1| Uncharacterized protein TCM_038187 [Theobroma cacao]    77   5e-12
ref|XP_007156793.1| hypothetical protein PHAVU_002G018000g [Phas...    75   1e-11
gb|KHM99756.1| TMV resistance protein N [Glycine soja]                 79   2e-11
gb|OMO81372.1| hypothetical protein COLO4_23635, partial [Corcho...    73   2e-11
gb|KYP56682.1| Putative ribonuclease H protein At1g65750 family ...    75   2e-11
ref|XP_019152396.1| PREDICTED: uncharacterized protein LOC109149...    78   3e-11
ref|XP_019162186.1| PREDICTED: uncharacterized protein LOC109158...    78   3e-11
gb|KYP50862.1| Putative ribonuclease H protein At1g65750 family,...    73   3e-11
dbj|GAU31501.1| hypothetical protein TSUD_332760 [Trifolium subt...    72   3e-11
ref|XP_014617687.1| PREDICTED: TMV resistance protein N-like iso...    77   6e-11
ref|XP_014617686.1| PREDICTED: TMV resistance protein N-like iso...    77   7e-11
ref|XP_019163602.1| PREDICTED: uncharacterized protein LOC109159...    77   7e-11
dbj|GAU38343.1| hypothetical protein TSUD_395970 [Trifolium subt...    71   7e-11
ref|XP_012448545.1| PREDICTED: uncharacterized protein LOC105771...    75   9e-11

>ref|XP_007146655.1| hypothetical protein PHAVU_006G058500g [Phaseolus vulgaris]
 gb|ESW18649.1| hypothetical protein PHAVU_006G058500g [Phaseolus vulgaris]
          Length = 277

 Score = 82.8 bits (203), Expect = 7e-14
 Identities = 43/96 (44%), Positives = 57/96 (59%)
 Frame = +3

Query: 1335 QNSNTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNV 1514
            +N   + +  WRPP  G V I+ D +    G  AG GGV+RD  GNFI+ F+  L  C+V
Sbjct: 96   KNIRPSRLIWWRPPFEGFVKINCDGAFTMHGNKAGAGGVVRDWRGNFIFGFSSGLANCSV 155

Query: 1515 LEGELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
            L  EL AI  G       GY+N++VESD +VA+DII
Sbjct: 156  LTAELEAIKIGIETTISKGYKNLMVESDSKVAVDII 191


>gb|KYP57461.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 249

 Score = 80.9 bits (198), Expect = 2e-13
 Identities = 34/80 (42%), Positives = 50/80 (62%)
 Frame = +3

Query: 1365 WRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNVLEGELLAIYH 1544
            W PPP G +  +VDA +R  GT  GCGGVL D  G ++  F  KLEPC+++E E+  +  
Sbjct: 147  WVPPPVGWLKFNVDAVVRLGGTQVGCGGVLHDEKGTWVRGFCRKLEPCSIMEAEMHVVLT 206

Query: 1545 GFSIAGGYGYRNIVVESDCQ 1604
               IA  YG +N+ +E++C+
Sbjct: 207  SLEIAWEYGTKNLCIETNCR 226


>gb|KYP67585.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 363

 Score = 82.0 bits (201), Expect = 4e-13
 Identities = 36/87 (41%), Positives = 57/87 (65%), Gaps = 1/87 (1%)
 Frame = +3

Query: 1365 WRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNVLEGELLAIYH 1544
            W+PPP GS+ ++ D ++R  G   GCGG++R++ G FI  F+ KL  C++L+ EL AI+H
Sbjct: 150  WQPPPLGSIKLNCDGAVRGVGRKVGCGGIIRNYLGGFIMGFSCKLGQCSILQAELWAIFH 209

Query: 1545 GFSIAGGYGYR-NIVVESDCQVAIDII 1622
            G  I    G++ +I+VE D  +AI  +
Sbjct: 210  GLRIIKEKGFKEDIIVELDSSLAIKFL 236



 Score = 54.3 bits (129), Expect(2) = 4e-06
 Identities = 32/92 (34%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
 Frame = -1

Query: 402 YGAYHGLSIAKGLGFK-DLIIELDSDQAIHLLSCGSVTGHPLQHLVCSILDMSNGDMDVD 226
           +  +HGL I K  GFK D+I+ELDS  AI  L+ G    H    L+ SI+++++ + D++
Sbjct: 205 WAIFHGLRIIKEKGFKEDIIVELDSSLAIKFLNEGCSASHSCAPLINSIVELADMEQDLN 264

Query: 225 *KKI*RDTNCVADRLAKNSLTLETECVIYDLS 130
              I R+ N V D +A  S  ++ +  I++ S
Sbjct: 265 CSHIYREANQVNDAMANLSFDIQDKFQIFNNS 296



 Score = 26.9 bits (58), Expect(2) = 4e-06
 Identities = 10/22 (45%), Positives = 17/22 (77%)
 Frame = -3

Query: 457 FQMGFSQRIEPSTVLEAELWSL 392
           F MGFS ++   ++L+AELW++
Sbjct: 186 FIMGFSCKLGQCSILQAELWAI 207


>gb|KYP65942.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 457

 Score = 82.8 bits (203), Expect = 4e-13
 Identities = 36/92 (39%), Positives = 59/92 (64%), Gaps = 1/92 (1%)
 Frame = +3

Query: 1350 NTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNVLEGEL 1529
            +++  W+PPP GS+ ++ D ++   G  AGCGG+++D+ G FI  F  KL  C++L+ EL
Sbjct: 285  SSLIRWQPPPLGSIKLNCDRAVHGVGRKAGCGGIIKDYLGGFITGFPCKLGQCSILQAEL 344

Query: 1530 LAIYHGFSIAGGYGYR-NIVVESDCQVAIDII 1622
              I+HG  I    G++ +I+VESD  +AI  +
Sbjct: 345  WTIFHGLRIIKDKGFKEDIIVESDSSLAIKFL 376


>dbj|GAU30116.1| hypothetical protein TSUD_360090 [Trifolium subterraneum]
          Length = 173

 Score = 76.6 bits (187), Expect = 1e-12
 Identities = 34/99 (34%), Positives = 58/99 (58%), Gaps = 1/99 (1%)
 Frame = +3

Query: 1329 LSQNSNTNTIYT-WRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEP 1505
            + Q    +TI+  W+ PP G + ++ D   ++S   AGCGG+LRD  G +I  +  K+  
Sbjct: 50   IGQQRKKDTIFVGWKQPPEGWIKLNCDGVYKESLDLAGCGGLLRDSNGQWIHGYTQKIGA 109

Query: 1506 CNVLEGELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
            C+ L  E+  +Y G ++A   G  +I+VESD ++ +DI+
Sbjct: 110  CDALHAEMWGMYAGMNLARRQGVTHIIVESDSKLLVDIV 148


>ref|XP_007158841.1| hypothetical protein PHAVU_002G186500g [Phaseolus vulgaris]
 gb|ESW30835.1| hypothetical protein PHAVU_002G186500g [Phaseolus vulgaris]
          Length = 252

 Score = 78.6 bits (192), Expect = 1e-12
 Identities = 44/102 (43%), Positives = 59/102 (57%)
 Frame = +3

Query: 1317 NLDPLSQNSNTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMK 1496
            ++D    N   + +  W PP  G V I+ D +    G  AG GGV+RD  G FI+ F+  
Sbjct: 65   SIDVKEDNIIPSRLIWWTPPFEGFVKINCDGAFTMHGNKAGAGGVVRDWRGEFIFGFSSG 124

Query: 1497 LEPCNVLEGELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
            L+  +VL  EL AI  G  IA   GY+N++VESD +VAIDII
Sbjct: 125  LKNYSVLMAELEAIKIGIEIAISKGYKNLMVESDSKVAIDII 166


>gb|EOY31223.1| Uncharacterized protein TCM_038187 [Theobroma cacao]
          Length = 271

 Score = 77.4 bits (189), Expect = 5e-12
 Identities = 36/86 (41%), Positives = 53/86 (61%)
 Frame = +3

Query: 1365 WRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNVLEGELLAIYH 1544
            W PP +G ++++VD + + S   A   GVLRD  GN++  F+MKLE C+    EL  I+ 
Sbjct: 178  WSPPITGGIALNVDGAFKKSQRKAAAAGVLRDEHGNWLCGFSMKLEKCSAFRAELWGIFK 237

Query: 1545 GFSIAGGYGYRNIVVESDCQVAIDII 1622
            G S+A   GYRNI ++ D +VA+  I
Sbjct: 238  GLSLAWELGYRNIDLQIDNRVAVQSI 263


>ref|XP_007156793.1| hypothetical protein PHAVU_002G018000g [Phaseolus vulgaris]
 gb|ESW28787.1| hypothetical protein PHAVU_002G018000g [Phaseolus vulgaris]
          Length = 210

 Score = 75.1 bits (183), Expect = 1e-11
 Identities = 41/95 (43%), Positives = 55/95 (57%)
 Frame = +3

Query: 1338 NSNTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNVL 1517
            N   + +  WRPP  G V I+ D +    G  A  GGV+RD  GNFI+ F+  L+  +VL
Sbjct: 30   NIRPSRLIWWRPPFEGFVKINCDGAFTRHGNKASAGGVVRDWRGNFIFGFSSGLKNGSVL 89

Query: 1518 EGELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
              EL AI  G       GY+N++VESD +VAI+II
Sbjct: 90   TAELEAIKIGIETTISKGYKNLMVESDSKVAINII 124


>gb|KHM99756.1| TMV resistance protein N [Glycine soja]
          Length = 1174

 Score = 78.6 bits (192), Expect = 2e-11
 Identities = 36/100 (36%), Positives = 59/100 (59%), Gaps = 3/100 (3%)
 Frame = +3

Query: 1332 SQNSNTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEP-- 1505
            +Q  +++ ++ WRPP    + +++D +I     TA CGG+ RD+ G F+  F++KL+   
Sbjct: 1003 TQPCSSDHLFKWRPPVDPWLKLNMDGAIDPCSKTAACGGIFRDYSGRFVLGFSVKLDMEH 1062

Query: 1506 -CNVLEGELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
             C++ E E+  +YHG  IA  Y +  IVVESD   AI  +
Sbjct: 1063 YCSIDEAEIWGVYHGIKIARQYDFGKIVVESDSPKAISFV 1102


>gb|OMO81372.1| hypothetical protein COLO4_23635, partial [Corchorus olitorius]
          Length = 171

 Score = 73.2 bits (178), Expect = 2e-11
 Identities = 37/93 (39%), Positives = 51/93 (54%)
 Frame = +3

Query: 1344 NTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNVLEG 1523
            NT    +W+PPP G + I+ D + + +   AG  GV RD  GNFI   + KL  C     
Sbjct: 41   NTTLHISWKPPPDGFIKINTDGASQGNSGCAGASGVFRDSRGNFITCLSRKLGTCTSTCA 100

Query: 1524 ELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
            EL A+     IA   GY+NI++E D +VAI +I
Sbjct: 101  ELWAVRDALRIAVDNGYQNIMLECDSKVAIQLI 133


>gb|KYP56682.1| Putative ribonuclease H protein At1g65750 family [Cajanus cajan]
          Length = 278

 Score = 75.5 bits (184), Expect = 2e-11
 Identities = 33/78 (42%), Positives = 48/78 (61%)
 Frame = +3

Query: 1365 WRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNVLEGELLAIYH 1544
            W  PP G +  +VDA++R  GT  GCGGVLRD  G ++  F  KLEPC+++E E+ A+  
Sbjct: 188  WVSPPVGWLKFNVDAAVRLGGTQVGCGGVLRDEKGTWVRGFCRKLEPCSIMEAEMHAVLT 247

Query: 1545 GFSIAGGYGYRNIVVESD 1598
                A  YG + + +E+D
Sbjct: 248  SLETAWEYGTKYLCIETD 265


>ref|XP_019152396.1| PREDICTED: uncharacterized protein LOC109149189 [Ipomoea nil]
          Length = 1344

 Score = 78.2 bits (191), Expect = 3e-11
 Identities = 38/96 (39%), Positives = 55/96 (57%)
 Frame = +3

Query: 1335 QNSNTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNV 1514
            Q  +T     WRPP  G + ++ D   R  G +A CGGVLR++ G +I  F+ K+  C+ 
Sbjct: 1165 QREDTRRKQPWRPPNEGWIKVNTDGCARTKGHSA-CGGVLRNNEGQYIGGFSKKIGTCSA 1223

Query: 1515 LEGELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
            LE E+  IY G   A   GYR ++ E+DC  AI++I
Sbjct: 1224 LEAEVWGIYVGIQKAWELGYRKVMFETDCSKAINLI 1259


>ref|XP_019162186.1| PREDICTED: uncharacterized protein LOC109158742 [Ipomoea nil]
          Length = 1371

 Score = 78.2 bits (191), Expect = 3e-11
 Identities = 38/96 (39%), Positives = 55/96 (57%)
 Frame = +3

Query: 1335 QNSNTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNV 1514
            Q  +T     WRPP  G + ++ D   R  G +A CGGVLR++ G +I  F+ K+  C+ 
Sbjct: 1192 QREDTRRKQPWRPPNEGWIKVNTDGCARTKGHSA-CGGVLRNNEGQYIGGFSKKIGTCSA 1250

Query: 1515 LEGELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
            LE E+  IY G   A   GYR ++ E+DC  AI++I
Sbjct: 1251 LEAEVWGIYVGIQKAWELGYRKVMFETDCSKAINLI 1286


>gb|KYP50862.1| Putative ribonuclease H protein At1g65750 family, partial [Cajanus
            cajan]
          Length = 175

 Score = 72.8 bits (177), Expect = 3e-11
 Identities = 40/115 (34%), Positives = 63/115 (54%), Gaps = 1/115 (0%)
 Frame = +3

Query: 1281 VQAVIQGKLLLKNLDPLSQNSNTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRD 1460
            V  + +G +L   +D       TN    W  PP+ ++ ++ D ++ D    A CGGVLRD
Sbjct: 40   VNEIKKGSIL--KIDHAKSRLKTNQHIGWIRPPNNTLKLNCDGAV-DDNAHAACGGVLRD 96

Query: 1461 HGGNFIWAFAMKLEPCNVLEGELLAIYHGFSIAGGYGYR-NIVVESDCQVAIDII 1622
              G FI+ FA K+  C+VL+ EL AI+HG  I      + N ++ESD ++A+  +
Sbjct: 97   CLGKFIFGFAGKIGTCSVLQAELWAIFHGLRIIKEMNLKGNFLIESDSEIAVKFL 151


>dbj|GAU31501.1| hypothetical protein TSUD_332760 [Trifolium subterraneum]
          Length = 153

 Score = 72.0 bits (175), Expect = 3e-11
 Identities = 34/94 (36%), Positives = 54/94 (57%), Gaps = 1/94 (1%)
 Frame = +3

Query: 1344 NTNTIYT-WRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNVLE 1520
            N   +Y  W+ P  G V ++ D + ++ G TAGCGG+ RD  G +I  F  K+  C+ L 
Sbjct: 36   NREIVYIGWKKPQDGWVKLNCDRACKELGETAGCGGLFRDSDGRWIKGFTRKIGACDALH 95

Query: 1521 GELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
             E+  +Y G  IA   G  +++VESD +V I+++
Sbjct: 96   AEMWGMYLGIDIAWRDGLSHLIVESDSKVLINMV 129


>ref|XP_014617687.1| PREDICTED: TMV resistance protein N-like isoform X2 [Glycine max]
          Length = 1293

 Score = 77.0 bits (188), Expect = 6e-11
 Identities = 37/99 (37%), Positives = 57/99 (57%), Gaps = 3/99 (3%)
 Frame = +3

Query: 1335 QNSNTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEP--- 1505
            Q  +++ ++ WRPP    + ++VD +I     TA CGG+ RD+ G F+  F++KL+    
Sbjct: 1123 QPCSSDHLFKWRPPVDPWLKLNVDGAIDPCSKTAACGGIFRDYSGRFVLGFSVKLDMEYF 1182

Query: 1506 CNVLEGELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
            C+  E E+  +YHG  IA  Y +  IVVESD   AI  +
Sbjct: 1183 CSFDEAEIWGVYHGIKIARQYDFGKIVVESDSAKAIRFV 1221


>ref|XP_014617686.1| PREDICTED: TMV resistance protein N-like isoform X1 [Glycine max]
 gb|KRH38837.1| hypothetical protein GLYMA_09G161400 [Glycine max]
          Length = 1390

 Score = 77.0 bits (188), Expect = 7e-11
 Identities = 37/99 (37%), Positives = 57/99 (57%), Gaps = 3/99 (3%)
 Frame = +3

Query: 1335 QNSNTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEP--- 1505
            Q  +++ ++ WRPP    + ++VD +I     TA CGG+ RD+ G F+  F++KL+    
Sbjct: 1220 QPCSSDHLFKWRPPVDPWLKLNVDGAIDPCSKTAACGGIFRDYSGRFVLGFSVKLDMEYF 1279

Query: 1506 CNVLEGELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
            C+  E E+  +YHG  IA  Y +  IVVESD   AI  +
Sbjct: 1280 CSFDEAEIWGVYHGIKIARQYDFGKIVVESDSAKAIRFV 1318


>ref|XP_019163602.1| PREDICTED: uncharacterized protein LOC109159944 [Ipomoea nil]
          Length = 1610

 Score = 77.0 bits (188), Expect = 7e-11
 Identities = 34/101 (33%), Positives = 56/101 (55%)
 Frame = +3

Query: 1320 LDPLSQNSNTNTIYTWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKL 1499
            ++P+     +  + TW+ PP G++ +++D S+     TAGCGGV+R+  G +I  F  KL
Sbjct: 1425 MNPIPTVDQSWKMLTWKKPPPGTLKLNIDGSVAPLSLTAGCGGVIRNSSGEWITGFIAKL 1484

Query: 1500 EPCNVLEGELLAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
              C  LE E  +I  G   A   GY N+++ESD    ++ +
Sbjct: 1485 GTCTPLEAEAWSILKGIQFAIAKGYSNVLIESDSSDVVNFL 1525


>dbj|GAU38343.1| hypothetical protein TSUD_395970 [Trifolium subterraneum]
          Length = 144

 Score = 70.9 bits (172), Expect = 7e-11
 Identities = 31/91 (34%), Positives = 56/91 (61%), Gaps = 1/91 (1%)
 Frame = +3

Query: 1353 TIY-TWRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNVLEGEL 1529
            TIY +W+ P    + ++ D + +DS   AGCGG+ RD  G ++ A+ +++  C+ L  E+
Sbjct: 22   TIYISWKYPHGDWIKLNCDRAYKDSMNIAGCGGLFRDSDGRWLKAYTLRIGDCDALHAEM 81

Query: 1530 LAIYHGFSIAGGYGYRNIVVESDCQVAIDII 1622
              +Y G  +A   GY +++VESD ++ ID++
Sbjct: 82   WGMYTGMKMARRQGYTHLIVESDFKLLIDMV 112


>ref|XP_012448545.1| PREDICTED: uncharacterized protein LOC105771683 [Gossypium raimondii]
          Length = 350

 Score = 74.7 bits (182), Expect = 9e-11
 Identities = 34/86 (39%), Positives = 53/86 (61%)
 Frame = +3

Query: 1365 WRPPPSGSVSISVDASIRDSGTTAGCGGVLRDHGGNFIWAFAMKLEPCNVLEGELLAIYH 1544
            W+PPP G V  ++D SI    ++A  GG+LRDH GN+++ F M++    + + E  A+Y 
Sbjct: 242  WQPPPVGWVKGNIDGSIPKHTSSAAVGGMLRDHEGNWLFGFGMRIGRYGIFQTEARALYE 301

Query: 1545 GFSIAGGYGYRNIVVESDCQVAIDII 1622
            G  +A   G+R I VESD  + ID++
Sbjct: 302  GLVVAWHEGFRQIEVESDNAILIDVV 327

