BLASTX nr result

ID: Salvia21_contig00000707 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Salvia21_contig00000707
         (2295 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002509811.1| conserved hypothetical protein [Ricinus comm...   117   2e-23
ref|XP_002511864.1| conserved hypothetical protein [Ricinus comm...   105   4e-20
ref|NP_182114.1| Phosphatidylinositol N-acetyglucosaminlytransfe...   100   1e-18
ref|XP_003551662.1| PREDICTED: uncharacterized protein LOC100782...    99   5e-18
ref|XP_003533608.1| PREDICTED: uncharacterized protein LOC100783...    98   8e-18

>ref|XP_002509811.1| conserved hypothetical protein [Ricinus communis]
            gi|223549710|gb|EEF51198.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 942

 Score =  117 bits (292), Expect = 2e-23
 Identities = 102/371 (27%), Positives = 177/371 (47%), Gaps = 24/371 (6%)
 Frame = +1

Query: 1141 KDSCFSPRRESQ----YFSGSAQVSLISPFRQSVYLSPARASRDVAPFDGLTTGSVATTE 1308
            K SC SP R++Q    +     Q+        ++         DV     +++ +   TE
Sbjct: 588  KKSCLSPPRQNQEAPPWAVNKKQLQTYEKISDNILY-------DVQEHIDISSSNNDLTE 640

Query: 1309 TSSSLVPTTDEEADEARITPTRNIKFHDDVHRSEDAMISSAERSAEIKLMHSIMDSISEN 1488
              +  +  + E +    ++       +DDV++S +A+  + ER+  I L+ S MDS  EN
Sbjct: 641  IETKYIENSKEISSSVVVSKPDG-SCNDDVNQSTEAL-DACERN--IPLVFSRMDSPVEN 696

Query: 1489 EASFSNAADDFPSSP----TLDMCDSNDNQEEYQSPVSVLDQFFAEHSNHTLKP------ 1638
            + S +   DD+ SSP    ++   D   +  E  SPVSVLDQF+ E  N  L        
Sbjct: 697  QTS-TIPVDDYSSSPLNSWSVGEFDRIKDNVEQPSPVSVLDQFYTEDMNSPLNVDFQPVL 755

Query: 1639 ---RRLHF--EEPNTASSSQDTPPKSDPCRHDQDY-TISNYVRLILEASCLNWDQLSAMR 1800
               R LH   EE   A        K +     +DY ++  YV  +L+A CL WD+L    
Sbjct: 756  PSVRLLHIGIEEGCLAGIRFPLDVKINSSSSTEDYGSVIKYVTAVLQACCLEWDELMRKF 815

Query: 1801 LLSEHLLHPSLFDQALFL----HLDTRLLFDHMNEVLLEMQRSHFLSPYWPAYAKPRICF 1968
              S+ LL+ SL D           D+RLLFD++NEV++++ + +     W ++ KPRI  
Sbjct: 816  HFSDQLLNQSLLDDLDVWPNQSRGDSRLLFDYINEVIVDVCQCYLRCSPWLSFIKPRILS 875

Query: 1969 SPLEEAVVDEIMRDAEFYLLPRTQRRTLDQLVAKDLSGPRSRPDVRPETDHIILHISDYI 2148
              +  +V+ E+M++ ++ LL     +TL++ + KD     +  D+R + + I+  + D +
Sbjct: 876  KIITGSVLHEVMKNVDWNLLSAPPLQTLEKTIEKD----GTWMDIRIDAEDIVREMVDSL 931

Query: 2149 LEESVLDVILE 2181
            +EE  +++ +E
Sbjct: 932  VEELTIEIAIE 942


>ref|XP_002511864.1| conserved hypothetical protein [Ricinus communis]
            gi|223549044|gb|EEF50533.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 999

 Score =  105 bits (263), Expect = 4e-20
 Identities = 79/317 (24%), Positives = 149/317 (47%), Gaps = 15/317 (4%)
 Frame = +1

Query: 1273 DGLTTGSVATTETSSSLVPTTDEEADEARITPTRNIKFHDDVHRSEDAMISSAERSAEIK 1452
            DG T+        +  +V   D    ++   P+ +    DD       +  +   S  ++
Sbjct: 660  DGTTSEGDVEIIKADEIVVQGDVNILDSLSEPSNSCITRDDQTGGLSEVSDAKGYSDSLR 719

Query: 1453 LMHSIMDSISENEASFSNAADDFPSSPTLDMCDSNDNQEEYQSPVSVLDQFFAEH----- 1617
            ++ S+ D   +   S        P +   +  + +    +  SPVSVL+  F E      
Sbjct: 720  VVKSLTDEEIQPLPSLLTTLSSSPVTKKENDQECSVEVSDRPSPVSVLEPLFTEEDISPA 779

Query: 1618 ------SNHTLKPRRLHFEEPNTASSSQDTPPKSDPCRHDQDYTISNYVRLILEASCLNW 1779
                  +   + P R+ FEE   +S+   T  K+  C  D++ ++  Y++ +LEAS LNW
Sbjct: 780  STRYQPAELPMPPLRIQFEEHGPSSTDLGTHLKA--CIQDKE-SVFEYIKAVLEASELNW 836

Query: 1780 DQLSAMRLLSEHLLHPSLFDQALF----LHLDTRLLFDHMNEVLLEMQRSHFLSPYWPAY 1947
            D+   M   S+ LL PS++D+  F    L  D +LLFD ++EVL+E+   +F  P   ++
Sbjct: 837  DEFYIMSNSSDPLLDPSIYDEVGFYPNQLCYDRKLLFDCISEVLMEVYERYFGCPLGLSF 896

Query: 1948 AKPRICFSPLEEAVVDEIMRDAEFYLLPRTQRRTLDQLVAKDLSGPRSRPDVRPETDHII 2127
             KP +  +P  +  +  +     +Y+LP     TL+Q+V KD++   S  D+R +++ ++
Sbjct: 897  GKPTVQPAPDMKYAIHAVWEGVYWYILPLPLPHTLEQIVKKDMAKTGSWMDLRCDSETMV 956

Query: 2128 LHISDYILEESVLDVIL 2178
            + I D I ++ + + +L
Sbjct: 957  IEIGDAIFKDLIGETVL 973


>ref|NP_182114.1| Phosphatidylinositol N-acetyglucosaminlytransferase subunit P-like
            protein [Arabidopsis thaliana] gi|3386619|gb|AAC28549.1|
            hypothetical protein [Arabidopsis thaliana]
            gi|330255521|gb|AEC10615.1| Phosphatidylinositol
            N-acetyglucosaminlytransferase subunit P-like protein
            [Arabidopsis thaliana]
          Length = 720

 Score =  100 bits (250), Expect = 1e-18
 Identities = 140/570 (24%), Positives = 227/570 (39%), Gaps = 48/570 (8%)
 Frame = +1

Query: 610  KPFSDALQVLSSDKDFLFLSEPQNCIKNLESDKQTCPKSNTALTLKAELQKRQDSGRFAC 789
            K   DA QVL S ++ L +  P +       D Q   ++ T + LK E     D G    
Sbjct: 174  KNLVDAFQVLDSKEESLNIGTPTS------GDSQRIKETQTIVILKPE-PNTLDVGS--- 223

Query: 790  HCSSGQPGGGSRVK--------TLSEIRRKLKHTFG---------SSRRKDADQLPRFRF 918
              S G P   ++ K        +LS IRR+LK   G         S    DAD L     
Sbjct: 224  --SPGTPSTDNKAKNEKFSSRFSLSRIRRRLKFAVGKNPCNAQHDSDPDPDADALSSSMS 281

Query: 919  HGCIYSGVETRDPLSS---TTNAKAKEKPGKKQGLIKVEKEPL--VAGEAETARKKI--- 1074
              C        +P S      +  +K +  K+    + EK+    + G    A+K +   
Sbjct: 282  QNCCLGEEIETNPGSDGEILPDIASKGEANKEDTFHESEKDSKKSMCGIYIAAKKHLSEM 341

Query: 1075 ----DVSSDLVDVAATSPWRVMSSLEKDSCFSPRRESQYFSGSAQVSLISPFRQSVYLSP 1242
                D+ +DL D     P  +   L     F+P    +         +  P  Q      
Sbjct: 342  LAEGDIDADLPDKEV--PRILGKILALPEFFTPENSPRVTLALDHQIIEKPNIQQC---- 395

Query: 1243 ARASRDVAPFDGLTTGSVATTETSSSLVPT-TDEEADEARITPTRNIKFHDDVHRSEDAM 1419
              +S+D   ++ L   S    ET    VP  T  E +E  +  + +      + + +DA 
Sbjct: 396  --SSKDYY-YEPLRLDSNNHEETEFMPVPEDTRMEEEEQTVMDSLSEAISSSIIQ-QDAY 451

Query: 1420 ISSAERSAEIKLMHSIMDSISENEASFSNAADDFPSSPTLDMCDSNDNQEEY---QSPVS 1590
            I   E              + E E      +   P + ++ M +  +N  +     SPVS
Sbjct: 452  IDEDEHK-----------QLLEKEVLKEGQSPCSPPNSSVRMSECQENTTDVLGKSSPVS 500

Query: 1591 VLDQFFAEHSNHT-----------LKPRRLHFEEPNTASSSQDTPPKSDPCRHDQDYTIS 1737
            VL+ FF +                ++P  + F+EP+     +D   K+   R D      
Sbjct: 501  VLEPFFTDDDTSPNSSRFSSAEMRMQPLCIRFDEPDFPGPEKDNDVKT---RMDDKELAL 557

Query: 1738 NYVRLILEASCLNWDQLSAMRLLSEHLLHPSLFDQALF----LHLDTRLLFDHMNEVLLE 1905
             Y++ ++++S LNW++L A    SE +L  +L D   F    L  D +LLFD +NEVL+E
Sbjct: 558  EYIQAVVKSSELNWEELLARSFYSEKILEQALMDDIDFCSTNLCSDKKLLFDCINEVLME 617

Query: 1906 MQRSHFLSPYWPAYAKPRICFSPLEEAVVDEIMRDAEFYLLPRTQRRTLDQLVAKDLSGP 2085
                    P W ++ KP + F P  E  V+ +  +  ++LLP     TLDQ+V KDL+  
Sbjct: 618  FCGH---GP-WISFVKPAMHFFPDMENAVEVVQEEVYWHLLPLPSPHTLDQIVRKDLART 673

Query: 2086 RSRPDVRPETDHIILHISDYILEESVLDVI 2175
             +  D+R +   I+    + IL+E + ++I
Sbjct: 674  GNWMDLRFDIGCIVSETGEIILDELLEEII 703


>ref|XP_003551662.1| PREDICTED: uncharacterized protein LOC100782204 [Glycine max]
          Length = 929

 Score = 99.0 bits (245), Expect = 5e-18
 Identities = 135/538 (25%), Positives = 225/538 (41%), Gaps = 87/538 (16%)
 Frame = +1

Query: 832  TLSEIRRKLKHTFGSSRRKDADQLPRFRF--------HGCIYSGVETRDP------LSST 969
            +L+EI+RKLK   G  R  + + +PR            G        R P      +   
Sbjct: 385  SLTEIKRKLKCAMGKERHGNPELIPRKLPVERQNKLPRGKCKDNAGMRSPNKDHFFIEKI 444

Query: 970  TNAKAKEKPGKKQGLIK-----VEKEPLVAGEAET-----ARKKIDVSSDLVDVAATSPW 1119
            T        G K G +K     VE E  +  ++ +     ARK +    D  D       
Sbjct: 445  TRPMFNVVKGNKTGTMKDSELNVEHESGIPNQSVSNIYIEARKHLCEMLDNADENTNISS 504

Query: 1120 RVMS-------SLEKDSCFSPRRESQYFSGSAQVSLISPFRQSVYLSPARASRDVAPFDG 1278
            R M        SL + +  SP R+ ++ S +AQ +  S  +    +S  + S   A   G
Sbjct: 505  RQMPKTLGRILSLPEYNFSSPGRDLEHHSVTAQATFSSSDKTRE-VSEDKLSPKPATCIG 563

Query: 1279 LTTGSVATTETSSSLVPT-TDEEADEARITPTRNIKFHDDVH----------RSEDAMIS 1425
            L    +  +E  SS+    +D +  E ++    +   HD  H          R E     
Sbjct: 564  LPDQEINNSEKQSSICDERSDNKVQEIKLVSNLS---HDVNHVNTSEACYPVRDEIVTEG 620

Query: 1426 SAERSAEIKLMHSIMD------------SISE------------NEASFSNAADDFPSSP 1533
            + E + E   + S +D             ISE             +    N +    SSP
Sbjct: 621  NVESTKEKNDLESSLDPNGFIIGKDQNIDISEIPDGAGCSECLNQDIPEENQSSSLLSSP 680

Query: 1534 T------LDMCDSNDNQEEYQSPVSVLDQFFAE------HSNHT-----LKPRRLHFEEP 1662
                   ++  ++  +     SPVSVLD  F++      HS +      ++P ++ FEE 
Sbjct: 681  QSSITKKIEELENGTDVSGRPSPVSVLDTSFSDDDFGPGHSRYQPVKLPVQPLQIKFEEH 740

Query: 1663 NTASSSQDTPPKSDPCRHDQDYTISNYVRLILEASCLNWDQLSAMRLLSEHLLHPSLFDQ 1842
            +++ + Q    K   C  + +  I +Y++ +L AS L  DQL    L S+ +L PSLFDQ
Sbjct: 741  DSSPAEQFDRRKY--CFEESEL-IYDYIKAVLHASGLTTDQLLMKCLSSDKILDPSLFDQ 797

Query: 1843 A-LFLHL---DTRLLFDHMNEVLLEMQRSHFLSPYWPAYAKPRICFSPLEEAVVDEIMRD 2010
              LF +L   + +LLFD +NEVL+E+ + +F +  W ++  P    +P  + V  ++   
Sbjct: 798  VELFSNLLCNNQKLLFDSINEVLMEICQHYFGASPWVSFVNPSTRLTPSMKRVTLKVWEG 857

Query: 2011 AEFYLLPRTQRRTLDQLVAKDLSGPRSRPDVRPETDHIILHISDYILEESVLDVILEL 2184
              +++LP    RTL+Q+V KD++   +  D+  +T+ I   + + IL E + D IL L
Sbjct: 858  VCWHMLPLPPPRTLEQIVRKDMARRGTWMDLGLDTETIGFEMGEAILAELMEDTILSL 915


>ref|XP_003533608.1| PREDICTED: uncharacterized protein LOC100783243 [Glycine max]
          Length = 932

 Score = 98.2 bits (243), Expect = 8e-18
 Identities = 84/318 (26%), Positives = 151/318 (47%), Gaps = 23/318 (7%)
 Frame = +1

Query: 1306 ETSSSLVPTTDEEADEARITPTRNIKFHDDVHRSEDAMISSAERSAEIKLMHS------- 1464
            +TS +  P  DE   E  +   +  K   ++  + +  I+  +++ +I  +         
Sbjct: 607  DTSEARYPVRDEIVTEGNVESAKE-KNDLELSLNPNGFITGKDQNIDISEIPDGAGCSER 665

Query: 1465 -IMDSISENEASFSNAADDFPSSPTLDMCDSNDNQEEYQSPVSVLDQFFAEHS---NHT- 1629
               D   EN+ S    +  F  +  ++  ++  +  E  SPVSVLD  F++      H+ 
Sbjct: 666  LNQDITEENQPSSPPPSPHFSVTKKIEELENGTDVSERPSPVSVLDTSFSDDDFCPGHSR 725

Query: 1630 -------LKPRRLHFEEPNTASSSQDTPPKSDPCRHDQDYTISNYVRLILEASCLNWDQL 1788
                   ++ R++ FEE + +   Q    K   C  + +  I +Y++ +L AS L  DQL
Sbjct: 726  CEPVKLPVQARQIQFEEHDCSPPEQFDRGKY--CFEESEL-IYDYIKAVLHASGLTTDQL 782

Query: 1789 SAMRLLSEHLLHPSLFDQALF----LHLDTRLLFDHMNEVLLEMQRSHFLSPYWPAYAKP 1956
                L S+ +L PSLFDQ  +    L  D +LLFD +NEVL+E+ + +F +  W ++  P
Sbjct: 783  LMKCLSSDKILDPSLFDQVEYFSNLLCHDQKLLFDSINEVLMEICQHYFGASPWVSFVNP 842

Query: 1957 RICFSPLEEAVVDEIMRDAEFYLLPRTQRRTLDQLVAKDLSGPRSRPDVRPETDHIILHI 2136
                +P  + V  ++     +++LP    RTL+Q+V KD++   +  D+  + + I   +
Sbjct: 843  STRLTPSMKRVTLKVWEGVCWHILPLPPPRTLEQIVRKDMARRGTWMDLGLDAETIGFEM 902

Query: 2137 SDYILEESVLDVILELHS 2190
             + IL E + D IL L S
Sbjct: 903  GEDILGELMEDTILSLVS 920


Top