BLASTX nr result

ID: Astragalus22_contig00016513 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Astragalus22_contig00016513
         (1888 letters)

Database: All non-redundant GenBank CDS
translations+PDB+SwissProt+PIR+PRF excluding environmental samples
from WGS projects 
           149,584,005 sequences; 54,822,741,787 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

dbj|GAU49586.1| hypothetical protein TSUD_138730 [Trifolium subt...    97   2e-18
gb|PNX59020.1| hypothetical protein L195_g051207, partial [Trifo...    97   3e-18
ref|XP_003623722.2| PPR repeat protein [Medicago truncatula] >gi...    83   7e-13
gb|PNX82056.1| pentatricopeptide repeat-containing protein [Trif...    73   2e-10
gb|PNX90652.1| hypothetical protein L195_g046777, partial [Trifo...    59   7e-06

>dbj|GAU49586.1| hypothetical protein TSUD_138730 [Trifolium subterraneum]
          Length = 298

 Score = 97.1 bits (240), Expect = 2e-18
 Identities = 66/132 (50%), Positives = 76/132 (57%), Gaps = 2/132 (1%)
 Frame = -1

Query: 1780 MIPYEEFLPLLPHEYSYALYLSYGFDLCMYDPMILNMVSERYPRD--EDYASNSNSVIDE 1607
            MI YE    LLPHEYSYALYLSY +DL  YD MILNMVSERYPRD  EDYAS   S   E
Sbjct: 1    MISYE----LLPHEYSYALYLSYEYDLHPYDSMILNMVSERYPRDEVEDYASAYGSKNCE 56

Query: 1606 DFSDMEDDIPPWLFVDLEEEDASPIHQEDDLLNIVFGNFDQDSYLETFSVAPLISIEPEL 1427
            +    EDD     F  +E +D            IVFGNF ++  LE    A  +SI P +
Sbjct: 57   ENQVQEDDSSS-DFASMENKDP----------EIVFGNFHEEPNLEK---AHTLSINPSI 102

Query: 1426 DEFSIKIGEIYC 1391
            +  SI+IG I C
Sbjct: 103  EGNSIEIGSITC 114


>gb|PNX59020.1| hypothetical protein L195_g051207, partial [Trifolium pratense]
          Length = 300

 Score = 96.7 bits (239), Expect = 3e-18
 Identities = 72/172 (41%), Positives = 94/172 (54%), Gaps = 21/172 (12%)
 Frame = -1

Query: 1753 LLPHEYSYALYLSYGFDLCMYDPMILNMVSERYPRDED-YAS------NSNSVI--DEDF 1601
            LLPHE+ YAL+LSY +DL  YDPMIL+MVSERYPRDED YAS       SNS I  DED 
Sbjct: 6    LLPHEHDYALFLSYVYDLWHYDPMILDMVSERYPRDEDAYASVEDSAAYSNSAIAEDEDL 65

Query: 1600 SDMEDDIPPWLFVDLEEEDASPIHQEDDLLNIVFGNFDQDSYLETFSVAPLISIEPELDE 1421
            S  ED              A  +H      +I FGNF  +S L T     +  I+P +D 
Sbjct: 66   SHDED----------MAISAPEVHSSSLCNHIHFGNFG-NSNLSTIQSPLITPIKPTIDG 114

Query: 1420 FSIKIGEIYCPV----------AAVKQSD--DEGENVAAPNLDSSTTKEIAT 1301
            F+IKIGEI C +          +A ++S+     E++   NLDS+++  + T
Sbjct: 115  FTIKIGEITCILVDSCCSIVEESAFQESEFAASEEHMKLKNLDSASSSLLTT 166


>ref|XP_003623722.2| PPR repeat protein [Medicago truncatula]
 gb|AES79940.2| PPR repeat protein [Medicago truncatula]
          Length = 499

 Score = 82.8 bits (203), Expect = 7e-13
 Identities = 57/151 (37%), Positives = 78/151 (51%), Gaps = 8/151 (5%)
 Frame = -1

Query: 1750 LPHEYSYALYLSYGFDLCMYDPMILNMVSERYPRDEDYASNSNSVIDEDFSDMEDDIPPW 1571
            LPHEYSYA  L+Y +DL  YD MI NMVSERYPR+ED    S    DED   +ED     
Sbjct: 7    LPHEYSYACSLAYIYDLSPYDSMIFNMVSERYPREEDEDDASVDDSDEDAVSVEDSAAIA 66

Query: 1570 LFVDLEEEDASPIHQEDDLL-----NIVFGNFDQDSYLETFSVAPLISIEPELDEFSIKI 1406
            +  +  +E +S      D +     NI+FG+F Q S  +   V+ +  I P ++    KI
Sbjct: 67   VSDEDPDEASSAEFVSSDFVTVPDSNIIFGSFGQAS--DAIPVSLITMITPSIEGKLFKI 124

Query: 1405 GEIYCPV---AAVKQSDDEGENVAAPNLDSS 1322
            GEI C +    +V  ++ +  NV      SS
Sbjct: 125  GEIDCFLLVSESVTVAEQQSSNVEFQGSSSS 155


>gb|PNX82056.1| pentatricopeptide repeat-containing protein [Trifolium pratense]
          Length = 292

 Score = 73.2 bits (178), Expect = 2e-10
 Identities = 51/124 (41%), Positives = 65/124 (52%)
 Frame = -1

Query: 1762 FLPLLPHEYSYALYLSYGFDLCMYDPMILNMVSERYPRDEDYASNSNSVIDEDFSDMEDD 1583
            F   LPHEYSYALYL+Y +DL  YD MI+NMVSERYPRDED   +  S  D + S  ED+
Sbjct: 3    FYEYLPHEYSYALYLAYEYDLHPYDSMIINMVSERYPRDED--EDCASAYDSE-SCEEDE 59

Query: 1582 IPPWLFVDLEEEDASPIHQEDDLLNIVFGNFDQDSYLETFSVAPLISIEPELDEFSIKIG 1403
                  +    +D      +D +L   F +          SV+ + SI  +   F IKIG
Sbjct: 60   SVDVAILSTSTQDLEFSTSQDSILFCSFAS-------PIISVS-IPSIASDFYGFFIKIG 111

Query: 1402 EIYC 1391
             I C
Sbjct: 112  AITC 115


>gb|PNX90652.1| hypothetical protein L195_g046777, partial [Trifolium pratense]
          Length = 230

 Score = 58.5 bits (140), Expect = 7e-06
 Identities = 50/148 (33%), Positives = 74/148 (50%), Gaps = 13/148 (8%)
 Frame = -1

Query: 1684 MILNMVSERYPR--DEDYASNSNSVIDEDFSDMEDDIPPWLFVDLEEEDASPIHQEDDLL 1511
            MILNMVSERY R  DED A++ +S IDED   +  D+   + V   E  +S I       
Sbjct: 1    MILNMVSERYLRDEDEDSAADFDSAIDED-EALSHDVA--MEVSASEAQSSTICN----- 52

Query: 1510 NIVFGNFDQDSYLETFSVAPLISIEPELDEFSIKIGEIYCPVA----------AVKQSDD 1361
            +I FG+F    +      A +  I+P +D F+IKIGEI C +           A  + D+
Sbjct: 53   HICFGSFGNSDFF-AVQFASITPIKPTIDGFTIKIGEIACVLVASSCNNKSAFAASEYDE 111

Query: 1360 EGENVAAPNLDSSTTKEI-ATGCEFGIS 1280
            + +N+ +  + S   K +     EFG+S
Sbjct: 112  KSKNLDSKVVVSEAIKGLNECKSEFGVS 139


Top