BLASTX nr result

ID: Scutellaria22_contig00020720 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Scutellaria22_contig00020720
         (1788 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CBI18449.3| unnamed protein product [Vitis vinifera]              457   e-126
ref|NP_173124.2| hydrolase domain-containing protein [Arabidopsi...   273   8e-71
gb|EEE69980.1| hypothetical protein OsJ_29879 [Oryza sativa Japo...   268   3e-69
ref|XP_002892921.1| hypothetical protein ARALYDRAFT_312653 [Arab...   246   2e-62
gb|AAG09081.1|AC026237_2 Similar to tRNA-splicing endonuclease p...   242   2e-61

>emb|CBI18449.3| unnamed protein product [Vitis vinifera]
          Length = 2154

 Score =  457 bits (1175), Expect = e-126
 Identities = 257/522 (49%), Positives = 338/522 (64%), Gaps = 12/522 (2%)
 Frame = -1

Query: 1779 DEEEEILSGLDIEDKENDSCWKEFTLQHKIISQVDGNWMCIPMLWFDVLVEIDPFCLPLS 1600
            ++++E+   LD+E+K + S W E++ Q KI SQ    W CIPMLW +VLVEI+P  LP+S
Sbjct: 565  EDDDELPFVLDVEEKHSSS-WSEYSEQSKITSQDCRRWRCIPMLWLEVLVEINPSVLPIS 623

Query: 1599 FSKAVIWALSHFSLIETERSTETSLSVRNWLATCASEICYLLGWKVPTGSXXXXXGTESR 1420
             SKAV WA S F+L+E E++ E  + V+NWL+  A EI    GWKVPTGS     G ES+
Sbjct: 624  VSKAVFWARSRFALVEPEKNAEMEVPVKNWLSFSAKEISSSFGWKVPTGSDDGGDGKESQ 683

Query: 1419 NSIRTSTMCRPLVKTFKRLTTHYTVRMEQGELRKQWIWEPMMSNSLILLLVDPNDNTRQV 1240
            NS++ STMC PL++TFKRLT HY V+MEQ ELRKQWIWEP M  SLILLL++PNDN RQV
Sbjct: 684  NSMKVSTMCIPLIRTFKRLTAHYIVQMEQEELRKQWIWEPRMGESLILLLLEPNDNVRQV 743

Query: 1239 GRLILEQVSNTRGLTCGLQFLCSTPSSLLAVLMGLKHALKLVQLDSVLLNFQTLHHLFFI 1060
            G+ +LEQVSN RGL   LQFLCS   S+ A   GL+HAL+LVQ+DSVLLNF+TLHH FF+
Sbjct: 744  GKCLLEQVSNMRGLAHCLQFLCSCTLSMSATYNGLRHALRLVQVDSVLLNFETLHHFFFV 803

Query: 1059 LCKLLKEGNSSTQTTVQNLQDVAGISKFSLQGGFLKQPVFDSSPTDGGQRLSIVSSTLWK 880
            LCKLLKEG   T    ++   +  ISKFS QGGFL+QP FDS P +     S+  S   +
Sbjct: 804  LCKLLKEGVICTSDPQRHSSGIKNISKFSSQGGFLRQPAFDSFPENVNGHSSVDDSKSRE 863

Query: 879  KFSCLLSQVAWPSILKCLDGGKTSTDYSVSQMTCIRLLEVIPVVFERLPQNSGMMLEIFC 700
            KFSCLLS++ WP I KCL  GK   DY +SQ+T   L E   ++ +R   +    + IF 
Sbjct: 864  KFSCLLSEITWPFIRKCLVEGKAFVDYKISQLTLGYLFENHALLSKRTKAS----VRIFS 919

Query: 699  DK---------KWLHDFAEWGKSSLAVVVRYWKQTLSYLLFQIKASCSNKSSSAIIEIEK 547
             K         +++     WG       V YW+QT+  LL  +K SCS+KS+S I  IE 
Sbjct: 920  LKDISYRLVLPRFIFYQIRWGLRLSFCWVGYWRQTMISLLHLLKGSCSDKSASFIRAIEN 979

Query: 546  LISYESFSMDELSKQIARLSVALTDEGSA-MNKTATQPKCSPSEESLNWRNCSAQS-EVL 373
            LIS +S  MDEL++Q+A LSV+L++E S  + KT  + K   SE+S   R  SA   +  
Sbjct: 980  LISCDSLMMDELTEQVAHLSVSLSNEASCIVGKTDLKSKAFFSEDSSFERQHSASDLQPF 1039

Query: 372  RVDEAKVNVLDSGPLVDFQEDN-VIILSDDEKESEISAHARF 250
              D+  V +LDS  + +  ++N VIILSDDE E +IS++ +F
Sbjct: 1040 ASDDMDVQILDSVTVSNKMDNNSVIILSDDETEKQISSNKQF 1081


>ref|NP_173124.2| hydrolase domain-containing protein [Arabidopsis thaliana]
            gi|332191377|gb|AEE29498.1| hydrolase domain-containing
            protein [Arabidopsis thaliana]
          Length = 2127

 Score =  273 bits (699), Expect = 8e-71
 Identities = 163/375 (43%), Positives = 222/375 (59%), Gaps = 19/375 (5%)
 Frame = -1

Query: 1557 IETERSTETSLSVRNWLATCASEICYLLGWKVPTGSXXXXXGTESRNSIRTSTMCRPLVK 1378
            +E+E++ E ++ +  WL++ A EI   LGWKV TGS     G ES+NS+  S MC  L++
Sbjct: 463  MESEKNDEMTVDIETWLSSSAVEIKGTLGWKVATGSDDGGPGKESKNSVTVSKMCLTLIR 522

Query: 1377 TFKRLTTHYTVRMEQGELRKQWIWEPMMSNSLILLLVDPNDNTRQVGRLILEQVSNTRGL 1198
            T KRLTT Y V++   E RKQW W P M  + IL L DP+DN RQ G+ +LE VSNTRGL
Sbjct: 523  TLKRLTTCYLVQIGD-ECRKQWTWVPEMGETFILSLSDPDDNVRQFGKSMLEHVSNTRGL 581

Query: 1197 TCGLQFLCSTPSSLLAVLMGLKHALKLVQLDSVLLNFQTLHHLFFILCKLLKEGNSSTQT 1018
            +CGL+FLCS  S LL V  G++H L+ V L SVL +FQ LHH FF+L KLLKE   +   
Sbjct: 582  SCGLKFLCSQTSHLLFVSSGVRHVLQQVHLSSVLQSFQILHHFFFLLFKLLKEEEVAITD 641

Query: 1017 TVQNLQDVAGISKFSLQGGFLKQPVFDSSPTDGGQRLSIVSSTLWKKFSCLLSQVAWPSI 838
             V+           S  GGFL+QP F++ P   G R  + S+    KF  LL++VAW  I
Sbjct: 642  VVK-----------SSAGGFLRQPNFNALPVSEG-RNPLSSTPELLKFQYLLAEVAWGII 689

Query: 837  LKCLDGGKTSTDYSVSQMTCIRLLEVIPVVFERL------PQNSGMMLEIFCDKKWLHDF 676
             KCL  GKT    S+ QMTC+RLLE++PVV  +L        ++   L+   D KWL D 
Sbjct: 690  RKCLVEGKTFIHQSLCQMTCVRLLEILPVVLGKLRVSREESCDTRGTLKDASDLKWLPDL 749

Query: 675  AEWGKSSLAVVVRYWKQTLSYLLFQIKASCSNKSSSAIIEIE-------------KLISY 535
             +WG+S L VVV YWK+ L  LL  ++ S S+  SSA+  I               L++ 
Sbjct: 750  IDWGRSQLKVVVAYWKRALVALLDILQGSNSDACSSAVQAIRHVLSSGDTIDNALTLLNS 809

Query: 534  ESFSMDELSKQIARL 490
            +   +++L++QI+RL
Sbjct: 810  DDVDIEQLAEQISRL 824


>gb|EEE69980.1| hypothetical protein OsJ_29879 [Oryza sativa Japonica Group]
          Length = 2215

 Score =  268 bits (685), Expect = 3e-69
 Identities = 171/534 (32%), Positives = 274/534 (51%), Gaps = 8/534 (1%)
 Frame = -1

Query: 1779 DEEEEILSGLDIEDKENDSCWKEFTLQHKIISQVDGNWMCIPMLWFDVLVEIDPFCLPLS 1600
            D+++E+    D E+ +   CW +F + +K+  +   +W C+P+LW+ ++V+++P  LP++
Sbjct: 436  DDDDELPVFCDAEEMDY-GCWNDFNVLYKLTCRECKDWRCVPLLWYLIMVQLEPSELPMA 494

Query: 1599 FSKAVIWALSHFSLIETERSTETSLSVRNWLATCASEICYLLGWKVPTGSXXXXXGTESR 1420
            FSKAV WALSH S++E   STE+S+ V +WL++ A E+     W+VP G+     G E  
Sbjct: 495  FSKAVFWALSHISVLEPGVSTESSVPVNDWLSSHAGEVLPTFSWQVPNGADDGGVGKECI 554

Query: 1419 NSIRTSTMCRPLVKTFKRLTTHYTVRMEQGELRKQWIWEPMMSNSLILLLVDPNDNTRQV 1240
            N+++                                                   N RQV
Sbjct: 555  NTLK---------------------------------------------------NLRQV 563

Query: 1239 GRLILEQVSNTRGLTCGLQFLCSTPSSLLAVLMGLKHALKLVQLDSVLLNFQTLHHLFFI 1060
            GR +LE  S  RGLT GLQFLCS+ SSL A  +GL++A++ V+  SVL +F +LHHLFF+
Sbjct: 564  GRAVLELASQGRGLTSGLQFLCSSASSLTATFLGLRYAVQSVETKSVLADFPSLHHLFFV 623

Query: 1059 LCKLLKEGNSSTQTTVQNLQDVAGISKFSLQGGFLKQPVFDSSPTDGGQRLSIVSSTLWK 880
            +CKLLK+        VQ  Q    +     +GGFL+Q   + S       + I+S   W+
Sbjct: 624  ICKLLKD------VVVQ--QPSVALQAKPFEGGFLRQSFSNVSVNLPQHSVDIIS---WE 672

Query: 879  KFSCLLSQVAWPSILKCLDGGKTSTDYSVSQMTCIRLLEVIPVVFERLPQNSG------- 721
            KFS LLS   WP I  CL  G    +    Q++C+RLLE++P+V+ER+   S        
Sbjct: 673  KFSTLLSGALWPFIFTCLRKGDDLINTKQCQISCVRLLELVPLVYERVSSYSSAKSCGVP 732

Query: 720  -MMLEIFCDKKWLHDFAEWGKSSLAVVVRYWKQTLSYLLFQIKASCSNKSSSAIIEIEKL 544
             M+L+   D  WL     WGKSSL V++R+WKQ +  L+  +K S        I ++  +
Sbjct: 733  TMVLDP-TDITWLFHLINWGKSSLLVIIRHWKQCMLSLIKILKGSLGGTVQHYIEDLGSI 791

Query: 543  ISYESFSMDELSKQIARLSVALTDEGSAMNKTATQPKCSPSEESLNWRNCSAQSEVLRVD 364
            IS+++ ++DELS++I+ L +AL+ E SA ++       S   E +      A       +
Sbjct: 792  ISHDAVNIDELSEKISDLKLALSKEASAKSERRVVAGVSMFTEPIAGIPSPATQTAQERN 851

Query: 363  EAKVNVLDSGPLVDFQEDNVIILSDDEKESEISAHARFSNSWSSIRTHNDNHTA 202
              + NV           +++I+LSD E E+ ++A     +  SS++  + + T+
Sbjct: 852  TGRDNVETMKSSRSTCTEHIILLSDSE-ENSLTADVSGEDVLSSVKDSDGSGTS 904


>ref|XP_002892921.1| hypothetical protein ARALYDRAFT_312653 [Arabidopsis lyrata subsp.
            lyrata] gi|297338763|gb|EFH69180.1| hypothetical protein
            ARALYDRAFT_312653 [Arabidopsis lyrata subsp. lyrata]
          Length = 2129

 Score =  246 bits (627), Expect = 2e-62
 Identities = 162/451 (35%), Positives = 224/451 (49%), Gaps = 20/451 (4%)
 Frame = -1

Query: 1782 ADEEEEILSGLDIEDKENDSCWKEFTLQHKIISQVDGNWMCIPMLWFDVLVEIDPFCLPL 1603
            +D+++  L    I +  +D  W +FT Q KI       WMCIPMLW   L   +   LP+
Sbjct: 362  SDDDDSNLPFSHIAEDVSDRSWSDFTQQSKITLGECKEWMCIPMLWITTLTNTNLLNLPV 421

Query: 1602 SFSKAVIWALSHFSLIETERSTETSLSVRNWLATCASEICYLLGWKVPTGSXXXXXGTES 1423
            S S+AV WA S F L+E+E++ E ++ +  WL++ A EI   LGWKV TGS     G ES
Sbjct: 422  SLSQAVFWARSRFCLVESEKNDEMTVDIETWLSSSAVEIKGTLGWKVATGSDDGGPGKES 481

Query: 1422 RNSIRTSTMCRPLVKTFKRLTTHYTVRMEQGELRKQWIWEPMMSNSLILLLVDPNDNTRQ 1243
            +NS+  S MC  L++T KRLTT Y V+M + E RKQW W P M  + IL L DP+DN RQ
Sbjct: 482  KNSVTVSKMCLTLIRTLKRLTTCYLVQMGE-ECRKQWTWVPGMGETFILSLSDPDDNVRQ 540

Query: 1242 VGRLILEQVSNTRGLTCGLQFLCSTPSSLLAVLMGLKHALKLVQLDSVLLNFQTLHHLFF 1063
             G+ +LE VSNTRGL+CGL+FLCS  S LL V  G++H L+                   
Sbjct: 541  FGKSMLEHVSNTRGLSCGLKFLCSQTSHLLFVSSGVRHVLQ------------------- 581

Query: 1062 ILCKLLKEGNSSTQTTVQNLQDVAGISKFSLQGGFLKQPVFDSSPTDGGQRLSIVSSTLW 883
               +LLKE   +       + DV  IS     GGFL+QP F+  P               
Sbjct: 582  ---QLLKEEEVA-------ITDVVKIS----AGGFLRQPNFNVLP--------------- 612

Query: 882  KKFSCLLSQVAWPSILKCLDGGKTSTDYSVSQMTCIRLLEVIPVVFERLP-------QNS 724
                                            MTC+RLLE++PVV  +L           
Sbjct: 613  --------------------------------MTCVRLLEILPVVLGKLRVSREESFHTR 640

Query: 723  GMMLEIFCDKKWLHDFAEWGKSSLAVVVRYWKQTLSYLLFQIKASCSNKSSSAIIEIE-- 550
            G + ++    KWL D  +WG+S L VVV YWK+ L  LL  ++ S S+  SSA+  I   
Sbjct: 641  GTLKDV-SGLKWLPDLIDWGRSQLKVVVAYWKRALVALLDILQGSNSDACSSAVQAIRHV 699

Query: 549  -----------KLISYESFSMDELSKQIARL 490
                        L++ +   +++L++QI+RL
Sbjct: 700  LASGDTSHNALTLLNSDDVDIEQLAEQISRL 730


>gb|AAG09081.1|AC026237_2 Similar to tRNA-splicing endonuclease positive effector SEN1
            [Arabidopsis thaliana]
          Length = 2142

 Score =  242 bits (618), Expect = 2e-61
 Identities = 158/449 (35%), Positives = 221/449 (49%), Gaps = 19/449 (4%)
 Frame = -1

Query: 1779 DEEEEILSGLDIEDKENDSCWKEFTLQHKIISQVDGNWMCIPMLWFDVLVEIDPFCLPLS 1600
            D+++  L      +  +D  W +FT Q KI       WMCIPMLW   L   +   LP+S
Sbjct: 363  DDDDSNLPFSHTTEDVSDRSWSDFTQQSKITLGECKEWMCIPMLWITTLTNTNFLNLPVS 422

Query: 1599 FSKAVIWALSHFSLIETERSTETSLSVRNWLATCASEICYLLGWKVPTGSXXXXXGTESR 1420
             S+AV W+ S F L+E+E++ E ++ +  WL++ A EI   LGWKV TGS     G ES+
Sbjct: 423  LSQAVFWSRSRFCLVESEKNDEMTVDIETWLSSSAVEIKGTLGWKVATGSDDGGPGKESK 482

Query: 1419 NSIRTSTMCRPLVKTFKRLTTHYTVRMEQGELRKQWIWEPMMSNSLILLLVDPNDNTRQV 1240
            NS+  S MC  L++T KRLTT Y V++   E RKQW W P M  + IL L DP+DN RQ 
Sbjct: 483  NSVTVSKMCLTLIRTLKRLTTCYLVQIGD-ECRKQWTWVPEMGETFILSLSDPDDNVRQF 541

Query: 1239 GRLILEQVSNTRGLTCGLQFLCSTPSSLLAVLMGLKHALKLVQLDSVLLNFQTLHHLFFI 1060
            G+ +LE VSNTRGL+CGL+FLCS  S LL V  G++H L+                    
Sbjct: 542  GKSMLEHVSNTRGLSCGLKFLCSQTSHLLFVSSGVRHVLQ-------------------- 581

Query: 1059 LCKLLKEGNSSTQTTVQNLQDVAGISKFSLQGGFLKQPVFDSSPTDGGQRLSIVSSTLWK 880
              +LLKE   +    V+           S  GGFL+QP F++ P                
Sbjct: 582  --QLLKEEEVAITDVVK-----------SSAGGFLRQPNFNALP---------------- 612

Query: 879  KFSCLLSQVAWPSILKCLDGGKTSTDYSVSQMTCIRLLEVIPVVFERL------PQNSGM 718
                                           MTC+RLLE++PVV  +L        ++  
Sbjct: 613  -------------------------------MTCVRLLEILPVVLGKLRVSREESCDTRG 641

Query: 717  MLEIFCDKKWLHDFAEWGKSSLAVVVRYWKQTLSYLLFQIKASCSNKSSSAIIEIE---- 550
             L+   D KWL D  +WG+S L VVV YWK+ L  LL  ++ S S+  SSA+  I     
Sbjct: 642  TLKDASDLKWLPDLIDWGRSQLKVVVAYWKRALVALLDILQGSNSDACSSAVQAIRHVLS 701

Query: 549  ---------KLISYESFSMDELSKQIARL 490
                      L++ +   +++L++QI+RL
Sbjct: 702  SGDTIDNALTLLNSDDVDIEQLAEQISRL 730