BLASTX nr result

ID: Scutellaria23_contig00004195 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria23_contig00004195
         (1936 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN80069.1| hypothetical protein VITISV_019030 [Vitis vinifera]   122   3e-25
ref|XP_002285142.1| PREDICTED: uncharacterized protein LOC100248...   121   6e-25
ref|NP_200204.1| uncharacterized protein [Arabidopsis thaliana] ...    93   2e-16
ref|XP_002866006.1| hypothetical protein ARALYDRAFT_918499 [Arab...    91   1e-15
ref|XP_003541837.1| PREDICTED: uncharacterized protein LOC100805...    89   3e-15

>emb|CAN80069.1| hypothetical protein VITISV_019030 [Vitis vinifera]
          Length = 582

 Score =  122 bits (306), Expect = 3e-25
 Identities = 130/430 (30%), Positives = 179/430 (41%), Gaps = 106/430 (24%)
 Frame = +1

Query: 691  RGTADGKRRRSKVIPDKEVQRLSH--KRQRNHEVRSPSYSSCDRGND-PSDGALLTMNDS 861
            RG ++ K R  +      + ++ H  KR R+      SYS C   +   S       N+S
Sbjct: 161  RGRSERKERDKR-----NLGKVKHVNKRSRHRSRSCSSYSRCSESSGYQSVERWDAENNS 215

Query: 862  RRLRSVITVVNQPEDERENRWEMDLHNEEIVYDQND-YPSSKSVDCNEGGDRMDSGIHCH 1038
            RRLRSVITVV +PE+E     + D H EEI+YD +D YPS +S D N+GG + +   H  
Sbjct: 216  RRLRSVITVVREPEEEDGRELDKDAHKEEIIYDHDDGYPSCRSNDSNDGGGKRELTYHSE 275

Query: 1039 GGL-------------------------------------GVVETENEVDVNSPRGEILE 1107
                                                    GV E +NE   +      LE
Sbjct: 276  KRKQIESGKEAFVSNIRTTEDKESDKDCGTQNDGSNPSFHGVXENKNEASDDIGH---LE 332

Query: 1108 SILRQKALENLRKFKGRHQTGL-RSSVVETNNESDMETIDVTQNKSINQGS--------- 1257
            SILRQ+A+ENLRKF+G     L R S  ++       T  V+ N  + Q +         
Sbjct: 333  SILRQRAIENLRKFRGLSIHPLQRLSWCKSRPSRVDGTRAVSANPVVEQSNMPTVGREFT 392

Query: 1258 -SNSNAHKIDERK------GLSSEG----------------DFSLKEVKKLGDDEHTERD 1368
             S+ N  KI + +      G S  G                D S K       ++     
Sbjct: 393  YSSQNLGKIPDGRYSENEPGASERGVVCPPEKVAXTCAPNDDNSSKTAVNAFGNKSKPGT 452

Query: 1369 NEMAKQSF--------------VHPPNEEALLECSEEDKRATSRAV-----------SST 1473
            + + ++SF               H PN          +  AT++ V           S T
Sbjct: 453  SVLRRESFGTSTPLKQASISQEXHRPNLLVTRPSVNTNSAATAQTVLWSSKDNGQQVSDT 512

Query: 1474 LETSSS------QPVSGEHCLEQ-GNEAKDCSQFEQKTMSVMRGGEVVQVSYKVFIPKKV 1632
               ++S      +P+SGE   ++   EAK+ SQFEQKTMSVMRGGE+VQVSYKV+IPKK 
Sbjct: 513  XGPAASNPPPELKPISGEQSSKEVQGEAKEGSQFEQKTMSVMRGGEMVQVSYKVYIPKKA 572

Query: 1633 SPLARRQLRR 1662
                 RQL R
Sbjct: 573  PASGWRQLPR 582


>ref|XP_002285142.1| PREDICTED: uncharacterized protein LOC100248740 [Vitis vinifera]
            gi|302142970|emb|CBI20265.3| unnamed protein product
            [Vitis vinifera]
          Length = 597

 Score =  121 bits (304), Expect = 6e-25
 Identities = 129/445 (28%), Positives = 181/445 (40%), Gaps = 121/445 (27%)
 Frame = +1

Query: 691  RGTADGKRRRSKVIPDKEVQRLSH--KRQRNHEVRSPSYSSCDRGND-PSDGALLTMNDS 861
            RG ++ K R  +      + ++ H  KR R+      SYS C   +   S       N+S
Sbjct: 161  RGRSERKERDKR-----NLGKVKHVNKRSRHRSRSCSSYSRCSESSGYQSVERWDAENNS 215

Query: 862  RRLRSVITVVNQPEDERENRWEMDLHNEEIVYDQND-YPSSKSVDCNEGGDRMDSGIHCH 1038
            RRLRSVITVV +PE+E     + D H EEI+YD +D YPS +S D N+GG + +   H  
Sbjct: 216  RRLRSVITVVREPEEEDGRELDKDAHKEEIIYDHDDGYPSCRSNDSNDGGGKRELTYHSE 275

Query: 1039 GGL-------------------------------------GVVETENEVDVNSPRGEILE 1107
                                                    GV E +NE   +      LE
Sbjct: 276  KRKQIESGKEAFVSNIRTTEDKESDKDCGTQNDGSNPSFHGVKENKNEASDDIGH---LE 332

Query: 1108 SILRQKALENLRKFKG----------------RHQTGLRSSVVETNNESDMETIDVTQNK 1239
            SILRQ+A+ENLRKF+G                +H    ++ +V+        T  V+ N 
Sbjct: 333  SILRQRAIENLRKFRGVQTNAKTTPKDVTAAVKHSPTSKAELVQIKASRVDGTRAVSANP 392

Query: 1240 SINQGS----------SNSNAHKIDERK------GLSSEG----------------DFSL 1323
             + Q +          S+ N  KI + +      G S  G                D S 
Sbjct: 393  VVEQSNMPTVGREFTYSSQNLGKIPDGRYSENEPGASERGVVCPPEKVATTCAPNDDNSS 452

Query: 1324 KEVKKLGDDEHTERDNEMAKQSF--------------VHPPNEEALLECSEEDKRATSRA 1461
            K       ++     + + ++SF               H PN          +  AT++ 
Sbjct: 453  KTAVNAFGNKSKPGTSVLRRESFGTSTPLKQASISQEPHRPNLLVTRPSVNTNSAATAQT 512

Query: 1462 V-----------SSTLETSSS------QPVSGEHCLEQ-GNEAKDCSQFEQKTMSVMRGG 1587
            V           S T   ++S      +P+SGE   ++   EAK+ SQFEQKTMSVMRGG
Sbjct: 513  VLWSSKDNGQQVSDTAGPAASNPPPELKPISGEQSSKEVQGEAKEGSQFEQKTMSVMRGG 572

Query: 1588 EVVQVSYKVFIPKKVSPLARRQLRR 1662
            E+VQVSYKV+IPKK      RQL R
Sbjct: 573  EMVQVSYKVYIPKKAPASGWRQLPR 597


>ref|NP_200204.1| uncharacterized protein [Arabidopsis thaliana]
            gi|10177255|dbj|BAB10723.1| unnamed protein product
            [Arabidopsis thaliana] gi|26453018|dbj|BAC43585.1|
            unknown protein [Arabidopsis thaliana]
            gi|28973163|gb|AAO63906.1| unknown protein [Arabidopsis
            thaliana] gi|332009047|gb|AED96430.1| uncharacterized
            protein [Arabidopsis thaliana]
          Length = 529

 Score = 93.2 bits (230), Expect = 2e-16
 Identities = 111/400 (27%), Positives = 164/400 (41%), Gaps = 76/400 (19%)
 Frame = +1

Query: 691  RGTADGKRRRSKVIPDKEVQRLSHKRQRNH-EVRSPSYSSCDRGNDPSDGALLTMNDSRR 867
            R + D  RR  KV       + S  R R+  E  S     C +G D     ++   + RR
Sbjct: 144  RWSRDRGRRLGKV-------KDSRSRSRDELEGESEEPDECWQGEDE----VIPEKNPRR 192

Query: 868  LRSVITV-VNQPEDERENRWEMDLH------NEEIVYDQN-DYPSSKSVDCNEGGDRMDS 1023
            L+S++ V  N    ER+   + D++      N E+ Y ++ D    +S+D        D+
Sbjct: 193  LKSIVVVSYNYGNGERKEEDDRDVYMTRGGGNRELGYSEDSDEMDGESIDSYSRIRADDN 252

Query: 1024 GIHCHGGLGVVETENEVDV-NSPRGEILESILRQKALENLRKFKGRHQTG---------- 1170
            G    G     ET       NS + + LE+IL+++ALENL++F+G  Q            
Sbjct: 253  GF---GEYNKSETSKVSHTDNSLKDDDLEAILKKRALENLKRFRGVTQKSGIAKKEVSSV 309

Query: 1171 -------LRSSVVET--------------------NNESDMETIDVTQ------NKSINQ 1251
                   + S  VE+                    N+E  +  I+V +      N +  Q
Sbjct: 310  SEGEPMQIESEKVESQDHDLMEQKLCDSAVSKDLENSEKILHVINVKESGTALANSASQQ 369

Query: 1252 GSSNSNAHKIDERKGLSS-------------EGDFSLKEVKKLGDDEHTERDNEMAKQSF 1392
               + +  K+    GLSS             +   +L   K+    +  E ++       
Sbjct: 370  DQQSGDTAKVKVSSGLSSCTTKRKLVRPVLSKDSLNLASKKEASGSQDAEAESIDGSTVD 429

Query: 1393 VHPPNEEALLECSEEDKRATSRAVSSTLETSSSQ--------PVSGEHCLEQG--NEAKD 1542
             +       L    E +      VSSTL   SS          V G    EQ   +E KD
Sbjct: 430  KNCLESTLALVTKNEGEHIEPTKVSSTLNAESSSHADTEEVDEVKGGSQSEQKTIDETKD 489

Query: 1543 CSQFEQKTMSVMRGGEVVQVSYKVFIPKKVSPLARRQLRR 1662
             SQ+EQKTM+VMRGGE+VQVSYKV+IPKK S L RR+L R
Sbjct: 490  ESQYEQKTMTVMRGGEMVQVSYKVYIPKKASSLGRRKLNR 529


>ref|XP_002866006.1| hypothetical protein ARALYDRAFT_918499 [Arabidopsis lyrata subsp.
            lyrata] gi|297311841|gb|EFH42265.1| hypothetical protein
            ARALYDRAFT_918499 [Arabidopsis lyrata subsp. lyrata]
          Length = 528

 Score = 90.5 bits (223), Expect = 1e-15
 Identities = 99/394 (25%), Positives = 158/394 (40%), Gaps = 73/394 (18%)
 Frame = +1

Query: 700  ADGKRR-RSKVIPDKEVQRLSHKRQRNHEVRSPSYSSCDRGNDPSDGALLTMNDSRRLRS 876
            +DGKRR R +     EV+    + +   E  S     C +     +G ++   + RRL+S
Sbjct: 144  SDGKRRSRDRGRRLGEVKDARSRSRDGLEGESEEPDECWQ----VEGEVIPEKNPRRLKS 199

Query: 877  VITV-VNQPEDERENRWEMDLH-----NEEIVYDQNDYPSSKSVDCNEGGDRMDSGIHCH 1038
            ++ V  +   DER+   + D++     N E+   +           +    R D     +
Sbjct: 200  IVVVSYSYGNDERKEEDDRDVYMTRGGNRELGDSEESDERDGETTVSYSRTRAD-----Y 254

Query: 1039 GGLGVVETENEVDVNSPRGEILESILRQKALENLRKFKGRHQTG---------------- 1170
             GL  V  +   + NS + + LE+IL+++ALENL++F+G  Q                  
Sbjct: 255  NGLKTVGYDEFGESNSMKDDNLEAILKKRALENLKRFRGVTQKSGIAKKEVSSVSEGEPM 314

Query: 1171 ---------------LRSSVVETNNESDMETID-------------VTQNKSINQGSSNS 1266
                           +   V ++    D+ET++                N +  Q   + 
Sbjct: 315  QIESEKVEESQDHGLMEQKVCDSEVSKDLETLEKILHVVNVKESGTALANSASQQDQQSG 374

Query: 1267 NAHKIDERKGLSS-------------EGDFSLKEVKKLGDDEHTERDNEMAKQSFVHPPN 1407
            +  K+    G+SS             +   +L   K+    +  E ++        +   
Sbjct: 375  DTAKVKASSGISSCSTKRKLVRPVLGKDSLNLASRKEATGSQDVEAESIGGSTIDKNCLE 434

Query: 1408 EEALLECSEEDKRATSRAVSSTLETSSSQ--------PVSGEHCLEQG-NEAKDCSQFEQ 1560
                L    E +      V STL   SS          + G    EQ  +E KD SQ+EQ
Sbjct: 435  STLALVTKNEGEHIEPTKVRSTLNAESSSHADTEAVDEIKGRSQSEQKMDETKDESQYEQ 494

Query: 1561 KTMSVMRGGEVVQVSYKVFIPKKVSPLARRQLRR 1662
            KTM+VMRGGE+VQVSYKV+IPKK S L RR+L R
Sbjct: 495  KTMTVMRGGEMVQVSYKVYIPKKTSSLGRRKLNR 528


>ref|XP_003541837.1| PREDICTED: uncharacterized protein LOC100805450 [Glycine max]
          Length = 582

 Score = 89.4 bits (220), Expect = 3e-15
 Identities = 105/411 (25%), Positives = 164/411 (39%), Gaps = 111/411 (27%)
 Frame = +1

Query: 763  KRQRNHEVRSPSYSSCDRGNDPSDGALLTMNDSRRLRSVITVVNQPED-----ERENRWE 927
            K+   +  RS S  S +   + ++      N+SR LRSVITV  + E+       EN+ E
Sbjct: 184  KKSSRYRARSCSPCSIENSYEVTEEKYAGENNSRWLRSVITVTEEAEEYGELCRNENKDE 243

Query: 928  MDLHNEEIVYDQNDYPSSKSVDCNEGGDRMDSGIHCHGG---LGVVE------------- 1059
            +D        D +DYP  +S D N+GG + +   H       LG+ E             
Sbjct: 244  ID--------DDHDYPC-RSSDSNDGGTKTELDHHTLASEEKLGIEEEAGDMNADLNFTE 294

Query: 1060 ---------------------TENEVDVNSPRG-----EILESILRQKALENLRKFKGRH 1161
                                 TE+  + +   G     + LESILRQ+ALENLRKF+   
Sbjct: 295  PKFRDRSYNDSSNLKAYSGETTESMKETSETSGANVNDDDLESILRQRALENLRKFREIQ 354

Query: 1162 QTGL----RSSVVETNNESDMETIDVTQNKSINQGSSNSNAHKIDERKGLSSEGDFSLKE 1329
             +      ++ +V    +   +  ++ Q KS+    +++   K  +++    E +  +  
Sbjct: 355  SSAKAPDQKNKIVSQVKQPITDKHELVQGKSV---VNDATVGKKFDKQTPGEETNLPIGR 411

Query: 1330 VKKLGDDEHTER----DNEMAKQSFVHPPNEEALLECSEEDKRA---------------- 1449
               +    + ER    D +++  +  HP N       S+   R                 
Sbjct: 412  RNLIACPRNNERILNMDKDVSGSAKCHPVNAPEKGIDSDNPSRTITESTNYNNTINLELI 471

Query: 1450 --TSRAVSSTLETSSSQPVSGEHCLE-----QGNEAK----------------------- 1539
              T ++   +L+TS+S   +    L      + N AK                       
Sbjct: 472  KQTQKSRGDSLQTSTSHEAANAKLLVTEGDVESNAAKTPHAAIQSVNNNVGDVDVSSVEN 531

Query: 1540 ----------DCSQFEQKTMSVMRGGEVVQVSYKVFIPKKVSPLARRQLRR 1662
                        SQFE+KTM+VMRGGE+VQVSYKV+IP KV  LARRQL+R
Sbjct: 532  KTGKLLDESNQGSQFEKKTMNVMRGGEMVQVSYKVYIPNKVPALARRQLKR 582


Top