BLASTX nr result

ID: Scutellaria22_contig00004398

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00004398
         (2381 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...   176   3e-41
gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thali...   163   2e-37
ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related...   163   2e-37
ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arab...   160   1e-36
gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thali...   159   2e-36

>ref|XP_002514588.1| conserved hypothetical protein [Ricinus communis]
            gi|223546192|gb|EEF47694.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 488

 Score =  176 bits (446), Expect = 3e-41
 Identities = 145/431 (33%), Positives = 196/431 (45%), Gaps = 23/431 (5%)
 Frame = +2

Query: 1001 FGTDANMFTDKNVLECDVPELEVCYSDIDCQILKDICVDEGRPEKDINVIE-SFEDEKPG 1177
            F  D+  + DKNV+E ++PEL +CY +    ++KDICVDEG P ++  + + S + EK  
Sbjct: 106  FDKDSVFYIDKNVMEPELPELVLCYKENTYHVVKDICVDEGVPSQENFLFDTSVDQEKLC 165

Query: 1178 HLFSQPPDDNDHFESTEESTKEHTDGLDAN----QCGSKKAIERNTLVSQGSFESSVDNF 1345
                   D     +         T  L  N    +C SK+++    +      E  + N+
Sbjct: 166  PYLIPEKDIKSEIQKERVDLDMSTQYLSKNDNSFKCDSKESMAIAEIEDDAMEE--IANY 223

Query: 1346 FEKDTTKSCDPESLGEAETGGSPE---ECSHVGENLPMVDCTEENEIMQPPNQILNEQAD 1516
              K+T       SLGE      PE   E SH    L   D  E+  I +P   I+   A 
Sbjct: 224  TSKETF------SLGELLL--MPEVVAELSHSKSLLNSTDEAEQLSIQRPSENIVLATAS 275

Query: 1517 SESHDASLEEAGGTGEDVQDSSLLYKSNVETGIITFNFNSSTPVAVGSGITENV------ 1678
            +        E         D  +    + E  + T   +SS P A   G  E +      
Sbjct: 276  ACEESKYATEQFLLVTPAVDPLVEESGHEEAKLGTLTSDSS-PKASDHGHDEVILASLAP 334

Query: 1679 ----EEQSSGSRDVLNDLHTV-NLSDANVQEQPLVNENSDASSVQCVRSEDTNNENTHDH 1843
                EE  +G++   +  HT+ ++SD N    P  +   + S V      ++ N + H  
Sbjct: 335  SYATEEPENGAKAAKSPSHTLDSVSDLN-SSAPTASGGEEGSQVGGSEHLESRNSSRH-- 391

Query: 1844 QSPEDKEATKSDDLPGQIPTKKSQSPTNDDPMHQRRNSGNVSVVNQLQHDEGETSFSAA- 2020
               ED   T+                                   QLQ+  GE+SFSAA 
Sbjct: 392  ---EDTSITE-------------------------------PFSGQLQYSHGESSFSAAG 417

Query: 2021 ---SLITYSGPIAFSGSLSHRSDGSTTSAKSFAFPILQSEWNSSPVRMAKADRRHFRKHK 2191
                LI+YSGPIA+SGSLS RSD STTS +SFAFPILQSEWNSSPVRMAKADRRHFRKH+
Sbjct: 418  PLSGLISYSGPIAYSGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHFRKHR 477

Query: 2192 GWRSGLLCCRF 2224
             WR GLLCCRF
Sbjct: 478  SWRQGLLCCRF 488


>gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thaliana]
          Length = 439

 Score =  163 bits (413), Expect = 2e-37
 Identities = 144/417 (34%), Positives = 194/417 (46%), Gaps = 12/417 (2%)
 Frame = +2

Query: 1010 DANMFTDKNVLECDVPELEVCYSDIDCQILKDICVDEGRPEKDINVIESFEDEKPGHLFS 1189
            D   + DKNV  CD+PE+ VCY +    I+KDICVDEG P     V E F       LF 
Sbjct: 86   DPVFYMDKNVTACDLPEIVVCYKENTYHIVKDICVDEGVP-----VQEKF-------LFG 133

Query: 1190 QPPDDNDHFESTEESTKEHTDGLDANQCGSKKAIERNTLVSQGSFESSVDNFFEKDTTKS 1369
            +   D+    STE+  K   D  + N   +K A                    E   +K 
Sbjct: 134  EK--DSVKSSSTEDLMK--ADKTNVNPSETKSA--------------------EDSISKV 169

Query: 1370 CDPESLGEAETGGSPEECSHVGENLPMVDCTEENEIMQPPNQILNEQADSESHDASLEEA 1549
             D E   + +T    EE S  GE+    + T  N   Q    +  E   S +H  S  E 
Sbjct: 170  DDSEFCNDHKTDRDVEESS--GEDFADAEGTSSN-YNQEHLIVTEEVKASPTHGLSPSEI 226

Query: 1550 GGTGEDVQDSSLLYKSNVETGIITFNFNSSTPVAVGSGITENVEEQSSGSRDVLNDLHTV 1729
                E+ +D   + + N     +T              I    +EQ S ++D ++     
Sbjct: 227  E-PDENSKDEVAISQDNDSKECLTLG-----------DILSREDEQKSLNQDNIS----- 269

Query: 1730 NLSDANVQEQPLVNENSDASSVQCVRSEDTNNENTHDHQSPEDKEATKSDDLPGQIPTKK 1909
              SD++ ++ P   ++ +  S++    E T  E T + +  E+K ++ S     Q P K 
Sbjct: 270  --SDSHEEQSPSQLQDKEKRSLETTAIE-TELEKTEEPKQGEEKLSSVSTTT-SQEPNKT 325

Query: 1910 SQSPTNDDPMHQRRNSGNVSVVNQLQHDE------GETSFSAASL------ITYSGPIAF 2053
               P  + P  +  +  N  V N  + D+      GETSFSAA        ITYSGPIA+
Sbjct: 326  CNEP--EKPETENHHQQNCLVENSYEDDKFSSSRFGETSFSAADSVSISGHITYSGPIAY 383

Query: 2054 SGSLSHRSDGSTTSAKSFAFPILQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 2224
            SGSLS RSD STTS +SFAFPILQSEWNSSPVRMAKAD+R  R+  GWR  LLCCRF
Sbjct: 384  SGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWRHTLLCCRF 438


>ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|42570677|ref|NP_973412.1| 18S pre-ribosomal
            assembly protein gar2-related protein [Arabidopsis
            thaliana] gi|79316683|ref|NP_001030966.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|186499149|ref|NP_001118260.1|
            18S pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250656|gb|AEC05750.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250657|gb|AEC05751.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250658|gb|AEC05752.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana] gi|330250659|gb|AEC05753.1| 18S
            pre-ribosomal assembly protein gar2-related protein
            [Arabidopsis thaliana]
          Length = 439

 Score =  163 bits (413), Expect = 2e-37
 Identities = 144/417 (34%), Positives = 194/417 (46%), Gaps = 12/417 (2%)
 Frame = +2

Query: 1010 DANMFTDKNVLECDVPELEVCYSDIDCQILKDICVDEGRPEKDINVIESFEDEKPGHLFS 1189
            D   + DKNV  CD+PE+ VCY +    I+KDICVDEG P     V E F       LF 
Sbjct: 86   DPVFYMDKNVTACDLPEIVVCYKENTYHIVKDICVDEGVP-----VQEKF-------LFG 133

Query: 1190 QPPDDNDHFESTEESTKEHTDGLDANQCGSKKAIERNTLVSQGSFESSVDNFFEKDTTKS 1369
            +   D+    STE+  K   D  + N   +K A                    E   +K 
Sbjct: 134  EK--DSVKSSSTEDLMK--ADKTNVNPSETKSA--------------------EDSISKV 169

Query: 1370 CDPESLGEAETGGSPEECSHVGENLPMVDCTEENEIMQPPNQILNEQADSESHDASLEEA 1549
             D E   + +T    EE S  GE+    + T  N   Q    +  E   S +H  S  E 
Sbjct: 170  DDSEFCNDHKTDRDVEESS--GEDFADAEGTSSN-YNQEHLIVTEEVKASPTHGLSPSEI 226

Query: 1550 GGTGEDVQDSSLLYKSNVETGIITFNFNSSTPVAVGSGITENVEEQSSGSRDVLNDLHTV 1729
                E+ +D   + + N     +T              I    +EQ S ++D ++     
Sbjct: 227  E-PDENSKDEVAISQDNDSKECLTLG-----------DILSREDEQKSLNQDNIS----- 269

Query: 1730 NLSDANVQEQPLVNENSDASSVQCVRSEDTNNENTHDHQSPEDKEATKSDDLPGQIPTKK 1909
              SD++ ++ P   ++ +  S++    E T  E T + +  E+K ++ S     Q P K 
Sbjct: 270  --SDSHEEQSPSQLQDKEKRSLETTAIE-TELEKTEEPKQGEEKLSSVSTTT-SQEPNKT 325

Query: 1910 SQSPTNDDPMHQRRNSGNVSVVNQLQHDE------GETSFSAASL------ITYSGPIAF 2053
               P  + P  +  +  N  V N  + D+      GETSFSAA        ITYSGPIA+
Sbjct: 326  CNEP--EKPETENHHQQNCLVENSYEDDKFSSSRFGETSFSAADSVSISGHITYSGPIAY 383

Query: 2054 SGSLSHRSDGSTTSAKSFAFPILQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 2224
            SGSLS RSD STTS +SFAFPILQSEWNSSPVRMAKAD+R  R+  GWR  LLCCRF
Sbjct: 384  SGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWRHTLLCCRF 438


>ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp.
            lyrata] gi|297321067|gb|EFH51488.1| hypothetical protein
            ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata]
          Length = 435

 Score =  160 bits (405), Expect = 1e-36
 Identities = 143/416 (34%), Positives = 197/416 (47%), Gaps = 13/416 (3%)
 Frame = +2

Query: 1010 DANMFTDKNVLECDVPELEVCYSDIDCQILKDICVDEGRPEKDINVIESFEDEKPGHLFS 1189
            D   + DKNV  CD+PE+ VCY +    ++KDICVDEG P     V E F       LF 
Sbjct: 86   DPVFYMDKNVTACDLPEIVVCYKENTYHVVKDICVDEGVP-----VQEKF-------LFG 133

Query: 1190 QPPDDNDHFESTEESTKEHTDGLDANQCGSKKAIERNTLVSQGSFESSVDNFFEKDTTKS 1369
            +   D+    STE+ TK   D  + N   SK A + NT V    F ++     ++D  +S
Sbjct: 134  E--KDSVKSSSTEDLTK--ADKTNVNPSESKSAEDSNTKVDDSEFCNNCKT--DRDVEES 187

Query: 1370 CDPESLGEAETGGSPEECSHVGENLPMVDCTEENEIMQPPNQILNEQAD-SESHDASLEE 1546
               E   +AE G S     H+                     I+ E+A  S SH      
Sbjct: 188  -SREDFADAE-GSSAYNQEHL---------------------IVTEEAKASPSH------ 218

Query: 1547 AGGTGEDVQDSSLLYKSNVETGIITFNFNSSTPVAVGSGITENVEEQSSGSRDVLNDLHT 1726
             G    +++       SN E   I+   +S   + +G  ++   E++S    ++ +D H 
Sbjct: 219  -GLNPSEIEPDE---NSNDEVA-ISSETDSKESLTLGDILSREDEQKSLNHGNISSDSHE 273

Query: 1727 VNLSDANVQEQPLVNENSDASSVQCVRSEDTNNENTHDHQSPEDKEATKSDDLPGQIPTK 1906
                    ++ P   ++ +  S++    E T  E T + +  E+K  + S     Q P K
Sbjct: 274  --------EQSPSQLQDKEKRSLETAAIE-TELEKTEEPKPVEEKLPSASTTTL-QEPNK 323

Query: 1907 KSQSPTNDDPMHQRRNSGNVSVVNQLQHDE------GETSFSAASL------ITYSGPIA 2050
                P  + P  +  +  N  V N  + D+      GETSFSAA        ITYSGPIA
Sbjct: 324  TCNDP--EKPETENHHQQNSLVENSYEDDKLSSSRFGETSFSAAESVSISGHITYSGPIA 381

Query: 2051 FSGSLSHRSDGSTTSAKSFAFPILQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCC 2218
            +SGSLS RSD STTS +SFAFPILQSEWNSSPVRMAKAD+R  R+  GWR  LLCC
Sbjct: 382  YSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWRHTLLCC 435


>gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thaliana]
            gi|41059759|gb|AAR99354.1| hypothetical protein At2g03810
            [Arabidopsis thaliana]
          Length = 439

 Score =  159 bits (403), Expect = 2e-36
 Identities = 142/417 (34%), Positives = 192/417 (46%), Gaps = 12/417 (2%)
 Frame = +2

Query: 1010 DANMFTDKNVLECDVPELEVCYSDIDCQILKDICVDEGRPEKDINVIESFEDEKPGHLFS 1189
            D   + DKNV  CD+PE+  CY +    I+KDICVDE  P     V E F       LF 
Sbjct: 86   DPVFYMDKNVTACDLPEIVACYKENTYHIVKDICVDESVP-----VQEKF-------LFG 133

Query: 1190 QPPDDNDHFESTEESTKEHTDGLDANQCGSKKAIERNTLVSQGSFESSVDNFFEKDTTKS 1369
            +   D+    STE+  K   D  + N   +K A                    E   +K 
Sbjct: 134  EK--DSVKSSSTEDLMK--ADKTNVNPSETKSA--------------------EDSISKV 169

Query: 1370 CDPESLGEAETGGSPEECSHVGENLPMVDCTEENEIMQPPNQILNEQADSESHDASLEEA 1549
             D E   + +T    EE S  GE+    + T  N   Q    +  E   S +H  S  E 
Sbjct: 170  DDSEFCNDHKTDRDVEESS--GEDFADAEGTSSN-YNQEHLIVTEEVXASPTHGLSPSEI 226

Query: 1550 GGTGEDVQDSSLLYKSNVETGIITFNFNSSTPVAVGSGITENVEEQSSGSRDVLNDLHTV 1729
                E+ +D   + + N     +T              I    +EQ S ++D ++     
Sbjct: 227  E-PDENSKDEVAISQDNDSKECLTLG-----------DILSREDEQKSLNQDNIS----- 269

Query: 1730 NLSDANVQEQPLVNENSDASSVQCVRSEDTNNENTHDHQSPEDKEATKSDDLPGQIPTKK 1909
              SD++ ++ P   ++ +  S++    E T  E T + +  E+K ++ S     Q P K 
Sbjct: 270  --SDSHEEQSPSQLQDKEKRSLETTAIE-TELEKTEEPKQGEEKLSSVSTTT-SQEPNKT 325

Query: 1910 SQSPTNDDPMHQRRNSGNVSVVNQLQHDE------GETSFSAASL------ITYSGPIAF 2053
               P  + P  +  +  N  V N  + D+      GETSFSAA        ITYSGPIA+
Sbjct: 326  CNEP--EKPETENHHQQNCLVENSYEDDKFSSSRFGETSFSAADSVSISGHITYSGPIAY 383

Query: 2054 SGSLSHRSDGSTTSAKSFAFPILQSEWNSSPVRMAKADRRHFRKHKGWRSGLLCCRF 2224
            SGSLS RSD STTS +SFAFPILQSEWNSSPVRMAKAD+R  R+  GWR  LLCCRF
Sbjct: 384  SGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWRHTLLCCRF 438

