BLASTX nr result

ID: Angelica23_contig00035222 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00035222
         (1632 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002876031.1| hypothetical protein ARALYDRAFT_485395 [Arab...   254   5e-65
ref|NP_190612.3| uncharacterized protein [Arabidopsis thaliana] ...   251   4e-64
emb|CAB62472.1| hypothetical protein [Arabidopsis thaliana]           239   2e-60
ref|XP_003519917.1| PREDICTED: uncharacterized protein LOC100812...   221   6e-55
ref|XP_002522353.1| conserved hypothetical protein [Ricinus comm...   211   4e-52

>ref|XP_002876031.1| hypothetical protein ARALYDRAFT_485395 [Arabidopsis lyrata subsp.
            lyrata] gi|297321869|gb|EFH52290.1| hypothetical protein
            ARALYDRAFT_485395 [Arabidopsis lyrata subsp. lyrata]
          Length = 648

 Score =  254 bits (649), Expect = 5e-65
 Identities = 165/442 (37%), Positives = 228/442 (51%), Gaps = 7/442 (1%)
 Frame = +2

Query: 89   LFSGNLVQFFCSLAAHGYASDASAVCKDEPVACIISDAM---PKILAWCLSKQEDRNKTS 259
            +F GN VQF CS+  H    + S     EP   I+   +   P ++ WC  K + ++ + 
Sbjct: 260  VFLGNFVQFLCSMVQHVRVVEDSD--DSEPSHLILQKTIKLVPDLIRWCQPKLKSQSGSC 317

Query: 260  TSQYXXXXXXXXXXXXTYQMHLQCQVLVSWLNIIGKYFQDLLAEPLTRVENNLDDSLEGS 439
             S+Y            T + +++C +L+SWL  + +  Q  L   LT+ +   D+ LEGS
Sbjct: 318  MSRYLGHKLLVLMIRLTDKSNIKCTILLSWLQYLQRDSQGFLQHTLTKFKPVQDNCLEGS 377

Query: 440  PFLSSFS-KESKGISDRHLQRLAVFLFLRCSLSLVCQRDGIHEHCICAKINPSLLSEQMG 616
            PF  S S +E       HLQRL+VFLFLRCS +L+       + C               
Sbjct: 378  PFFVSLSDREINETHSNHLQRLSVFLFLRCSFTLIYSSRHNGKQC--------------- 422

Query: 617  NHSCCNRKQGLLGLYQWLHGHFTRDISVDNDIYIQKCASFSLSFLRLYIHEDDILFKVLL 796
               C  RK+G+  +++W+       I  D+ IY +K   FS SF+RL++HEDD+LFKVLL
Sbjct: 423  EFDC--RKKGMAEMFKWIVRQIPGIICSDHRIYSKKSVEFSASFVRLFMHEDDLLFKVLL 480

Query: 797  QLFTIPLSVK--PVCEGSKTPQKVEDEENM-LHLISDLLNPICLFHLFLAELLYDHKVLL 967
            QL ++PL  +  P  EG      +EDEE + L   S L NP+ LF +FL+EL YDH+VLL
Sbjct: 481  QLLSVPLHRQELPNVEGGS----LEDEEQITLFRFSTLFNPVTLFCIFLSELHYDHQVLL 536

Query: 968  DYLMSKDTGASSAEYLLRCLRAVCDSWTFFVEFSWDEEVTNSRNSKKRKVSVDVLDFEGG 1147
            DYL+SKD G S AEYLLRCLRAVCDSWT FVEF + E  TN+ + K+RKV  +  + E  
Sbjct: 537  DYLISKDIGDSCAEYLLRCLRAVCDSWTLFVEFPF-EGSTNASSPKRRKVLPETSEVE-- 593

Query: 1148 EISVLRENDDSPLSLLKRCMTEDVYTSRQHTTTRLSFEDAMECXXXXXXXXXXXXXXXXF 1327
                                       R H     +FEDA +C                F
Sbjct: 594  ------------------------QNWRLHPQ---AFEDAKDCLLSLQNSVVKLHQKKLF 626

Query: 1328 PYNPDVLLRRLTKFEELCFKEK 1393
            PYNP+ LLRRL++F+ELC   +
Sbjct: 627  PYNPEALLRRLSRFQELCLSHE 648


>ref|NP_190612.3| uncharacterized protein [Arabidopsis thaliana]
            gi|28973649|gb|AAO64145.1| unknown protein [Arabidopsis
            thaliana] gi|110737253|dbj|BAF00574.1| hypothetical
            protein [Arabidopsis thaliana]
            gi|332645145|gb|AEE78666.1| uncharacterized protein
            [Arabidopsis thaliana]
          Length = 642

 Score =  251 bits (641), Expect = 4e-64
 Identities = 163/442 (36%), Positives = 233/442 (52%), Gaps = 7/442 (1%)
 Frame = +2

Query: 89   LFSGNLVQFFCSLAAHGYASDASAVCKDEPVACIISDAM---PKILAWCLSKQEDRNKTS 259
            +F G+ VQF CS+    +  + S     EP   I+   +   P +L WC  K + ++ + 
Sbjct: 254  VFLGSFVQFLCSMVQQVHVVEDSD--DFEPSYLILQKTIKLIPDLLRWCQPKLKSQSGSC 311

Query: 260  TSQYXXXXXXXXXXXXTYQMHLQCQVLVSWLNIIGKYFQDLLAEPLTRVENNLDDSLEGS 439
             S+Y            T +  ++C +L+SWL  + +  Q  L   LT+ +   D+ LEGS
Sbjct: 312  MSRYLGHKLLVLMIRLTDKSKIKCTILLSWLQYLQRDSQGFLQHTLTKFKPVQDNCLEGS 371

Query: 440  PFLSSFS-KESKGISDRHLQRLAVFLFLRCSLSLVCQRDGIHEHCICAKINPSLLSEQMG 616
            PF  S S +E   +   HLQRL+VFLFLRCS +L+                 S  ++++ 
Sbjct: 372  PFFVSLSDREVNEMHSNHLQRLSVFLFLRCSFTLIYS---------------SRHNDKLC 416

Query: 617  NHSCCNRKQGLLGLYQWLHGHFTRDISVDNDIYIQKCASFSLSFLRLYIHEDDILFKVLL 796
               C  RK+G+  +++W+      ++  D+ IY +K   FS SF+RL++HEDD+LFKVLL
Sbjct: 417  EFDC--RKKGMAEMFKWIERQIPGNMFSDHRIYSKKNVEFSASFVRLFMHEDDLLFKVLL 474

Query: 797  QLFTIPLSVK--PVCEGSKTPQKVEDEENM-LHLISDLLNPICLFHLFLAELLYDHKVLL 967
            QL ++PL  +  P  EG      +EDEE + L  +S L NP+ LF +FL+EL YDH+VLL
Sbjct: 475  QLLSVPLHRQELPNVEGGS----LEDEEQITLFRLSTLFNPVRLFCIFLSELHYDHQVLL 530

Query: 968  DYLMSKDTGASSAEYLLRCLRAVCDSWTFFVEFSWDEEVTNSRNSKKRKVSVDVLDFEGG 1147
            DYL+SKD GAS AEYLLRCLRAVCDSWT FVEF + E  T++ + K+RKV  +  + E  
Sbjct: 531  DYLISKDIGASCAEYLLRCLRAVCDSWTLFVEFPF-EGSTDAPSPKRRKVLPETSEVE-- 587

Query: 1148 EISVLRENDDSPLSLLKRCMTEDVYTSRQHTTTRLSFEDAMECXXXXXXXXXXXXXXXXF 1327
                                       R H     +FEDA +C                F
Sbjct: 588  ------------------------QNWRLHAQ---AFEDAKDCLLSLQNSVVKLHQKKLF 620

Query: 1328 PYNPDVLLRRLTKFEELCFKEK 1393
            PYNP+ LLRRL++F ELC   +
Sbjct: 621  PYNPEALLRRLSRFHELCLSHE 642


>emb|CAB62472.1| hypothetical protein [Arabidopsis thaliana]
          Length = 730

 Score =  239 bits (609), Expect = 2e-60
 Identities = 158/430 (36%), Positives = 225/430 (52%), Gaps = 7/430 (1%)
 Frame = +2

Query: 89   LFSGNLVQFFCSLAAHGYASDASAVCKDEPVACIISDAM---PKILAWCLSKQEDRNKTS 259
            +F G+ VQF CS+    +  + S     EP   I+   +   P +L WC  K + ++ + 
Sbjct: 247  VFLGSFVQFLCSMVQQVHVVEDSD--DFEPSYLILQKTIKLIPDLLRWCQPKLKSQSGSC 304

Query: 260  TSQYXXXXXXXXXXXXTYQMHLQCQVLVSWLNIIGKYFQDLLAEPLTRVENNLDDSLEGS 439
             S+Y            T +  ++C +L+SWL  + +  Q  L   LT+ +   D+ LEGS
Sbjct: 305  MSRYLGHKLLVLMIRLTDKSKIKCTILLSWLQYLQRDSQGFLQHTLTKFKPVQDNCLEGS 364

Query: 440  PFLSSFS-KESKGISDRHLQRLAVFLFLRCSLSLVCQRDGIHEHCICAKINPSLLSEQMG 616
            PF  S S +E   +   HLQRL+VFLFLRCS +L+                 S  ++++ 
Sbjct: 365  PFFVSLSDREVNEMHSNHLQRLSVFLFLRCSFTLIYS---------------SRHNDKLC 409

Query: 617  NHSCCNRKQGLLGLYQWLHGHFTRDISVDNDIYIQKCASFSLSFLRLYIHEDDILFKVLL 796
               C  RK+G+  +++W+      ++  D+ IY +K   FS SF+RL++HEDD+LFKVLL
Sbjct: 410  EFDC--RKKGMAEMFKWIERQIPGNMFSDHRIYSKKNVEFSASFVRLFMHEDDLLFKVLL 467

Query: 797  QLFTIPLSVK--PVCEGSKTPQKVEDEENM-LHLISDLLNPICLFHLFLAELLYDHKVLL 967
            QL ++PL  +  P  EG      +EDEE + L  +S L NP+ LF +FL+EL YDH+VLL
Sbjct: 468  QLLSVPLHRQELPNVEGGS----LEDEEQITLFRLSTLFNPVRLFCIFLSELHYDHQVLL 523

Query: 968  DYLMSKDTGASSAEYLLRCLRAVCDSWTFFVEFSWDEEVTNSRNSKKRKVSVDVLDFEGG 1147
            DYL+SKD GAS AEYLLRCLRAVCDSWT FVEF + E  T++ + K+RKV  +  + E  
Sbjct: 524  DYLISKDIGASCAEYLLRCLRAVCDSWTLFVEFPF-EGSTDAPSPKRRKVLPETSEVE-- 580

Query: 1148 EISVLRENDDSPLSLLKRCMTEDVYTSRQHTTTRLSFEDAMECXXXXXXXXXXXXXXXXF 1327
                                       R H     +FEDA +C                F
Sbjct: 581  ------------------------QNWRLHAQ---AFEDAKDCLLSLQNSVVKLHQKKLF 613

Query: 1328 PYNPDVLLRR 1357
            PYNP+ LLRR
Sbjct: 614  PYNPEALLRR 623


>ref|XP_003519917.1| PREDICTED: uncharacterized protein LOC100812484 [Glycine max]
          Length = 639

 Score =  221 bits (562), Expect = 6e-55
 Identities = 133/384 (34%), Positives = 204/384 (53%), Gaps = 5/384 (1%)
 Frame = +2

Query: 62   NLQNFT---PLMLFSGNLVQFFCSLAAHGYASDASAVCKDE-PVACIISDAMPKILAWCL 229
            +L+NF+   P+M F G  +Q  CSL     + +      D+ P+   + + +P++  WCL
Sbjct: 243  HLKNFSVMDPVMNFLGTFLQLLCSLVYRNDSVETGCDSVDKHPLFLTVVNLIPRLAKWCL 302

Query: 230  SKQEDRNKTSTSQYXXXXXXXXXXXXTYQMHLQCQVLVSWLNIIGKYFQDLLAEPLTRVE 409
            S+QE+  +     Y                 L C++ +SWL ++  YFQ+LL +PLT+  
Sbjct: 303  SEQENNAEMHAIHYLKHKLLILMIRLGSLTGLDCRIRLSWLELLHNYFQELLQQPLTQFL 362

Query: 410  NNLDDSLEGSPFLSSFSK-ESKGISDRHLQRLAVFLFLRCSLSLVCQRDGIHEHCICAKI 586
            ++  D LE SPFL S    E+      HL+R AV+L L CS SL+C+R  I  HC  + +
Sbjct: 363  SDQIDCLEDSPFLWSLCDGEACMKRSDHLRRQAVYLLLACSFSLICKRGEIANHCNNSTL 422

Query: 587  NPSLLSEQMGNHSCCNRKQGLLGLYQWLHGHFTRDISVDNDIYIQKCASFSLSFLRLYIH 766
              S  +     H    RK+G L L++W+ GH    IS++++ Y+Q C +F  SFL+LY+ 
Sbjct: 423  CSSFTTNPDSEHDYFCRKKGSLELFKWILGHLPTAISINHEKYMQMCMNFISSFLQLYLR 482

Query: 767  EDDILFKVLLQLFTIPLSVKPVCEGSKTPQKVEDEENMLHLISDLLNPICLFHLFLAELL 946
            EDD+LF+VLL LF+I  S++         ++ E ++   H                 ++ 
Sbjct: 483  EDDLLFEVLLLLFSISSSLQ---------EQSESKDAAYH-----------------DIH 516

Query: 947  YDHKVLLDYLMSKDTGASSAEYLLRCLRAVCDSWTFFVEFSWDEEVTNSRNSKKRKVSVD 1126
            YDH+VLLDYL+SKDTG S A+YLLRCL  +C+SW  FVEF    E  +  + K+RK+  D
Sbjct: 517  YDHQVLLDYLISKDTGISCAKYLLRCLHLICNSWKLFVEFPLFGEFLDQSSCKRRKIVGD 576

Query: 1127 VLDFEGGEISVLRENDDSPLSLLK 1198
             L F    +    +N  S +  +K
Sbjct: 577  GLHFLADGMPTSIDNSGSIILHIK 600


>ref|XP_002522353.1| conserved hypothetical protein [Ricinus communis]
            gi|223538431|gb|EEF40037.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 535

 Score =  211 bits (537), Expect = 4e-52
 Identities = 117/301 (38%), Positives = 168/301 (55%), Gaps = 2/301 (0%)
 Frame = +2

Query: 44   DALVHNNLQNFTPLMLFSGNLVQFFCSLAAHGYASDASAVCKDE-PVACIISDAMPKILA 220
            DAL   N+      ++F GN +Q  CSL       +     +D  PV C+I+  +PK+++
Sbjct: 231  DALFLKNVGEQQKKIVFLGNFIQLLCSLVEQSCDVEVKVGSQDHHPVLCLITSFVPKVVS 290

Query: 221  WCLSKQEDRNKTSTSQYXXXXXXXXXXXXTYQMHLQCQVLVSWLNIIGKYFQDLLAEPLT 400
             CL  Q +    S SQY            +YQ  L    L+SWL ++  YF+ LL +P+ 
Sbjct: 291  CCLGGQGNCVSASVSQYFRHKLLMLMLRLSYQTCLDYFTLISWLQLLHDYFEVLLWKPII 350

Query: 401  RVENNLDDSLEGSPFLSSFSK-ESKGISDRHLQRLAVFLFLRCSLSLVCQRDGIHEHCIC 577
            ++E   D+SLE SPFLSS S  +  GI+  HLQR A+ LFLRC   L+       + C C
Sbjct: 351  KLEFPQDESLEDSPFLSSLSDGDIHGINSHHLQRWAILLFLRCCFGLISLTRDKSKKCTC 410

Query: 578  AKINPSLLSEQMGNHSCCNRKQGLLGLYQWLHGHFTRDISVDNDIYIQKCASFSLSFLRL 757
              +N       + +  CC RK+G L +Y+WL GHF  D+SV  ++Y +KC  F+ SFL+L
Sbjct: 411  GTLN-CCSGYSISDMDCCGRKKGFLEIYKWLQGHFPIDMSVGQEMYFEKCIGFTFSFLQL 469

Query: 758  YIHEDDILFKVLLQLFTIPLSVKPVCEGSKTPQKVEDEENMLHLISDLLNPICLFHLFLA 937
            Y+HEDD+LFKVLLQL +I   ++ +    K   +   E+ + H IS + NP+ LFHLFLA
Sbjct: 470  YMHEDDVLFKVLLQLLSINSCLEQLLNRVKWTSEDVKEDILFH-ISHIFNPVYLFHLFLA 528

Query: 938  E 940
            E
Sbjct: 529  E 529


Top