BLASTX nr result

ID: Angelica23_contig00024995

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00024995
         (1657 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002274931.1| PREDICTED: uncharacterized protein LOC100241...   584   e-164
ref|XP_002510544.1| conserved hypothetical protein [Ricinus comm...   561   e-157
ref|XP_002306933.1| predicted protein [Populus trichocarpa] gi|2...   519   e-145
ref|NP_564362.1| uncharacterized protein [Arabidopsis thaliana] ...   471   e-130
ref|XP_002893642.1| hypothetical protein ARALYDRAFT_473306 [Arab...   469   e-130

>ref|XP_002274931.1| PREDICTED: uncharacterized protein LOC100241980 [Vitis vinifera]
            gi|302142673|emb|CBI19876.3| unnamed protein product
            [Vitis vinifera]
          Length = 669

 Score =  584 bits (1505), Expect = e-164
 Identities = 313/523 (59%), Positives = 391/523 (74%), Gaps = 2/523 (0%)
 Frame = -1

Query: 1576 MGGLCTGGTIKHHRDVQGNNDYDHCSDNYSKDVSKKTNDGVEEDENYDGTNDPGFQIYDK 1397
            MG +C+GG +K +    G N          K + K+  D      +Y   N  GF+    
Sbjct: 51   MGAVCSGGMMKRN---SGKNLGFSGKLKKVKSLRKQKEDSY----SYSNPNVDGFE---- 99

Query: 1396 KRTRHIFDSGELNFNISTQFLNRRSAAARTPPTTKGPHMNSFFGKAGIVGLERAVDVLDT 1217
             RT  ++D GEL+F+IS +   + S  ART   +K P   SF G+AG+VGLE+AV+VLDT
Sbjct: 100  -RTPQMYDPGELSFSISREL--KPSTPARTG-ASKVPQKTSFLGRAGVVGLEKAVEVLDT 155

Query: 1216 LGSSMTNMHGGSGFAYNMASRGNKVSILAFEVANTIAKGSNLLQSISEENIRILKDEILQ 1037
            LGSSM++++  SGF   +ASRGNK+SILAFEVANTIAKG+NL  S+SEENI+ LK EIL 
Sbjct: 156  LGSSMSSLNPHSGFVSGIASRGNKISILAFEVANTIAKGANLQHSLSEENIQFLKKEILH 215

Query: 1036 SDGVLRLVSTDMTELLRIAAIDQREEFNIFSREVIRFGDLCRDPQWHNLDRYFSKFDSDP 857
            S+GV +LVSTDMTELL IAA D+REEF++FSREVIRFGDLC+DPQWHNLDRYFSK D+D 
Sbjct: 216  SEGVQQLVSTDMTELLSIAAADKREEFDVFSREVIRFGDLCKDPQWHNLDRYFSKLDTDD 275

Query: 856  GVGKLRREEAEMTMQELLSLAQYTSELYHELHALDRFEQDYHRKVEEVESLHLPRKGEGL 677
               K  REE E+T+QEL +LAQ+TSELYHEL+A+DRFEQDY RK+EEVESLHLPR+GE L
Sbjct: 276  PSHKQLREEIEVTVQELTTLAQHTSELYHELNAVDRFEQDYRRKLEEVESLHLPRRGESL 335

Query: 676  LILHSELKHQRXXXXXXXXXXXXXXXXXXXVEKLVDIVTFIHQHILEVFGDNVLGSSSTE 497
             +LHSELKHQR                   VEKLVD+ TFIHQ ILE F  N  G + T 
Sbjct: 336  TMLHSELKHQRKLVRSLKKKSLWSRNLEEIVEKLVDVATFIHQEILEAFRSN--GLTLTI 393

Query: 496  EKASSKPDERLGIAGLALHYAHVVTQIDNIASRPTSLPPNMRDGLYSGLPVNVKKSLRSR 317
            ++ S+ P +RLG AGL+LHYA+++ Q+DNIASRPTSLPPNMRD LY GLP +VK +LRS+
Sbjct: 394  KEPSNCP-QRLGAAGLSLHYANIINQMDNIASRPTSLPPNMRDTLYHGLPASVKTALRSQ 452

Query: 316  LQSLDVHDVLSIPQMKTQMERTLKWLAPIAADTTKAHQGFGWVGEWANTGIEFGK--TTK 143
            LQ++D  + L+IPQ+K +ME+TL+WL P+  +TTKAHQGFGWVGEWANTG EFGK  TT+
Sbjct: 453  LQAVDAKEELTIPQIKAEMEKTLQWLVPVVTNTTKAHQGFGWVGEWANTGNEFGKKTTTQ 512

Query: 142  NSVIRLQTLYHADKGKMDRYILELVICLHRLISLVKYKDNGSK 14
            N++IRLQTLYHADK K+D+YILELVI LHRLI+LV+++D+G K
Sbjct: 513  NNLIRLQTLYHADKQKIDQYILELVIWLHRLINLVRHRDHGFK 555


>ref|XP_002510544.1| conserved hypothetical protein [Ricinus communis]
            gi|223551245|gb|EEF52731.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 620

 Score =  561 bits (1446), Expect = e-157
 Identities = 301/526 (57%), Positives = 373/526 (70%), Gaps = 2/526 (0%)
 Frame = -1

Query: 1576 MGGLCTGGTIKHHRDVQGNNDYDHCSDNYSKDVSKKTNDGVEEDENYDGTNDPGFQIYDK 1397
            MGG+C+GGT   H  V    +         K V   +    +    Y+   D  F    K
Sbjct: 1    MGGVCSGGTKPRHAKVGDGENNKSGFSGKLKSVKSFSKLKEKNSHLYNTNKDDDF---GK 57

Query: 1396 KRTRHIFDSGELNFNISTQFLNRRSAAARTPPTTKGPHMNSFFGKAGIVGLERAVDVLDT 1217
            + TR  ++SGEL  N S +   + S  AR     K    +SF GKAG V LE+AV+VLDT
Sbjct: 58   RTTRSRYNSGELLLNFSREL--KPSTPARVG-AVKDSQKSSFIGKAGAVSLEKAVEVLDT 114

Query: 1216 LGSSMTNMHGGSGFAYNMASRGNKVSILAFEVANTIAKGSNLLQSISEENIRILKDEILQ 1037
            LGSSM+N++  SGF   MASRGN++SILAFEVANTIAKG+NL QS+SEEN++ L+ EIL 
Sbjct: 115  LGSSMSNLNARSGFVSGMASRGNRISILAFEVANTIAKGANLFQSLSEENVQFLRKEILH 174

Query: 1036 SDGVLRLVSTDMTELLRIAAIDQREEFNIFSREVIRFGDLCRDPQWHNLDRYFSKFDSDP 857
            S+GV +LVSTDMTELL IAA D+REE ++F+REVIRFGDLC+DPQWHNL RYFSK DS+ 
Sbjct: 175  SEGVQQLVSTDMTELLCIAASDKREELDVFAREVIRFGDLCKDPQWHNLGRYFSKLDSEY 234

Query: 856  GVGKLRREEAEMTMQELLSLAQYTSELYHELHALDRFEQDYHRKVEEVESLHLPRKGEGL 677
               K  REE+EM MQEL +LAQ+TSELYHEL+ALDRFEQDY +K+EEVESL LPRKGE L
Sbjct: 235  STDKQPREESEMIMQELTTLAQHTSELYHELNALDRFEQDYQQKLEEVESLQLPRKGESL 294

Query: 676  LILHSELKHQRXXXXXXXXXXXXXXXXXXXVEKLVDIVTFIHQHILEVFGDNVLGSSSTE 497
             IL SEL+ QR                   +EK VDIVT++HQ I++ FG++ +G ++  
Sbjct: 295  SILQSELRQQRKLVRSLKKKSLWSKSLAEVMEKFVDIVTYLHQIIVDAFGNSGVGLAN-- 352

Query: 496  EKASSKPDERLGIAGLALHYAHVVTQIDNIASRPTSLPPNMRDGLYSGLPVNVKKSLRSR 317
             +   K  +RLG AGLALHYA+V+ QIDNIASRPTSLPPN RD LY GLP  VKK+LRS+
Sbjct: 353  -ERPGKNSQRLGAAGLALHYANVIHQIDNIASRPTSLPPNTRDNLYRGLPTYVKKALRSQ 411

Query: 316  LQSLDVHDVLSIPQMKTQMERTLKWLAPIAADTTKAHQGFGWVGEWANTGIEFGK--TTK 143
            LQ +D  + L++ Q+K +ME+TL WL P+A +TTKAHQGFGWVGEWANTG EFGK  TT+
Sbjct: 412  LQMVDNKEELTVVQVKAEMEKTLHWLVPVATNTTKAHQGFGWVGEWANTGNEFGKNSTTQ 471

Query: 142  NSVIRLQTLYHADKGKMDRYILELVICLHRLISLVKYKDNGSKAQP 5
            N++IRLQTLYHADK K D YI ELV  LHRLI+LV+++D+G K  P
Sbjct: 472  NNLIRLQTLYHADKQKTDNYIFELVTWLHRLINLVRHRDHGLKTMP 517


>ref|XP_002306933.1| predicted protein [Populus trichocarpa] gi|222856382|gb|EEE93929.1|
            predicted protein [Populus trichocarpa]
          Length = 594

 Score =  519 bits (1337), Expect = e-145
 Identities = 287/517 (55%), Positives = 357/517 (69%), Gaps = 5/517 (0%)
 Frame = -1

Query: 1576 MGGLCTGGTIKHHRDVQGNNDYDHCSDNYSKDVSKKTNDGVEEDENYDGTNDPGFQIYDK 1397
            MGG+C+GG  +    V G  + ++   N S  +    +   + + +Y   N   F     
Sbjct: 1    MGGVCSGGAKRKSVKV-GGEENNNGGINTSGKLRSLHSTCKKRENSYRNNNGDDFGRTTP 59

Query: 1396 KRTRHIFDSGELNFNISTQFLNRRSAAARTPPTTKGPHMN---SFFGKAGIVGLERAVDV 1226
            +R+    +SGE   + S      R     TP  T+   +N   SF GKAG VGLE+AV+V
Sbjct: 60   QRS----NSGEFLSSFS------RELKPSTPVRTEADKINQKKSFLGKAGTVGLEKAVEV 109

Query: 1225 LDTLGSSMTNMHGGSGFAYNMASRGNKVSILAFEVANTIAKGSNLLQSISEENIRILKDE 1046
            LDTLGSSM+N++   GFA  + SRGN++SILAFEVANTIAKG+NL  S+SEEN+  LK E
Sbjct: 110  LDTLGSSMSNLNPKGGFATGIGSRGNRISILAFEVANTIAKGANLFHSLSEENVESLKKE 169

Query: 1045 ILQSDGVLRLVSTDMTELLRIAAIDQREEFNIFSREVIRFGDLCRDPQWHNLDRYFSKFD 866
            +L S+GV +LVSTDM ELL IAA D+REEF++FSREVIRFGDLC+DPQWHNL RYFSK D
Sbjct: 170  VLHSEGVHKLVSTDMEELLIIAAADKREEFDVFSREVIRFGDLCKDPQWHNLGRYFSKLD 229

Query: 865  SDPGVGKLRREEAEMTMQELLSLAQYTSELYHELHALDRFEQDYHRKVEEVESLHLPRKG 686
            S+  + +  R EAE+TMQEL++L Q TSELYHEL+ALDRFEQDY +KVEEV+SL+L  KG
Sbjct: 230  SEYSIERQHRTEAEVTMQELITLVQNTSELYHELNALDRFEQDYRQKVEEVQSLNLSVKG 289

Query: 685  EGLLILHSELKHQRXXXXXXXXXXXXXXXXXXXVEKLVDIVTFIHQHILEVFGDNVLGSS 506
            E L ILHSELK QR                   +EKLVDIVT++ Q ILE FG+N     
Sbjct: 290  ECLTILHSELKQQRKLVRSLKKKSLWSKNVEEIMEKLVDIVTYLQQAILEAFGNN---GV 346

Query: 505  STEEKASSKPDERLGIAGLALHYAHVVTQIDNIASRPTSLPPNMRDGLYSGLPVNVKKSL 326
               +K      +RLG +GLALHYA+++ QIDNI SRP SLPPN RD LY G+P +VK +L
Sbjct: 347  ILVDKEPGNSRQRLGTSGLALHYANLINQIDNITSRPASLPPNTRDSLYRGIPNSVKAAL 406

Query: 325  RSRLQSLDVHDVLSIPQMKTQMERTLKWLAPIAADTTKAHQGFGWVGEWANTGIEFGKTT 146
            RSRLQ +D  + L+I  +K +ME+TL WLAPIA +TTKAHQGFGWVGEWANTGIEFGK T
Sbjct: 407  RSRLQMVDTKEELTIALVKAEMEKTLHWLAPIATNTTKAHQGFGWVGEWANTGIEFGKNT 466

Query: 145  --KNSVIRLQTLYHADKGKMDRYILELVICLHRLISL 41
               +++IRLQTL+HADK K D YILELV  LHRLI+L
Sbjct: 467  AGNSNLIRLQTLHHADKQKTDLYILELVTWLHRLINL 503


>ref|NP_564362.1| uncharacterized protein [Arabidopsis thaliana]
            gi|20466802|gb|AAM20718.1| unknown protein [Arabidopsis
            thaliana] gi|332193147|gb|AEE31268.1| uncharacterized
            protein [Arabidopsis thaliana]
          Length = 615

 Score =  471 bits (1213), Expect = e-130
 Identities = 268/539 (49%), Positives = 359/539 (66%), Gaps = 16/539 (2%)
 Frame = -1

Query: 1576 MGGLCTGGTIKHHRDVQGNNDYDHCSDNYSKDVSK----KTNDGVEEDENYDGTNDPGFQ 1409
            MGG+C+       +  +  ++ D  S  +S  +      KT+D    D NY G+      
Sbjct: 1    MGGVCSCVFKDDDKKKKLRSNDDDKSRGFSGKLKSMRRSKTSDSYYSD-NYGGSRRKS-- 57

Query: 1408 IYDKKRTRHIFD-SGELN-----FNISTQFLNRRSAAARTPPTTKGPHMNSFFGKAGIVG 1247
                K    +F+ SGEL       N ST+F+ R                NSF G+AG++G
Sbjct: 58   ---SKPDEVVFNFSGELGPMPPLRNDSTKFMQR----------------NSFMGRAGVMG 98

Query: 1246 LERAVDVLDTLGSSMTNMHGGSGFAYNM-ASRGNKVSILAFEVANTIAKGSNLLQSISEE 1070
            LE+AV+VLDTLGSSMT M+  + +   + +SRG KV+ILAFEVANTIAKG+ LLQS+SEE
Sbjct: 99   LEKAVEVLDTLGSSMTRMNPSNAYLSGVTSSRGGKVTILAFEVANTIAKGAALLQSLSEE 158

Query: 1069 NIRILKDEILQSDGVLRLVSTDMTELLRIAAIDQREEFNIFSREVIRFGDLCRDPQWHNL 890
            N++ +K ++L S+ V +LVSTD TEL  +AA D+REE ++FS EVIRFG++C+D QWHNL
Sbjct: 159  NLKFMKKDMLHSEEVKKLVSTDTTELQILAASDKREELDLFSGEVIRFGNMCKDLQWHNL 218

Query: 889  DRYFSKFDSDPGVGKLRREEAEMTMQELLSLAQYTSELYHELHALDRFEQDYHRKVEEVE 710
            DRYF K D++    KL +++AE  MQEL++LA+ TSELYHEL ALDRFEQDY RK+ EVE
Sbjct: 219  DRYFMKLDTENSQHKLLKDDAEARMQELVTLARITSELYHELQALDRFEQDYRRKLAEVE 278

Query: 709  SLHLPRKGEGLLILHSELKHQRXXXXXXXXXXXXXXXXXXXVEKLVDIVTFIHQHILEVF 530
            SL+LPR+GEG++IL +ELK Q+                   +EKLVD+V++I Q I+EVF
Sbjct: 279  SLNLPRRGEGIVILQNELKQQKKLVKSLQKKSLWSQNLAEIIEKLVDVVSYIRQTIVEVF 338

Query: 529  GDNVLGSSSTEEKASSKPDERLGIAGLALHYAHVVTQIDNIASRPTSLPPNMRDGLYSGL 350
            G+N L  +  E+       ERLG AGL+LHYA+++ QIDNIASRP+SLP N+RD LY+ L
Sbjct: 339  GNNGLRDNEGEQGR-----ERLGEAGLSLHYANLIQQIDNIASRPSSLPSNVRDTLYNAL 393

Query: 349  PVNVKKSLRSRLQSLDVHDVLSIPQMKTQMERTLKWLAPIAADTTKAHQGFGWVGEWANT 170
            P  VK +LR RLQ+LD  + LS+P++K +ME++L+WL P A +TTKAHQGFGWVGEWAN+
Sbjct: 394  PATVKTALRPRLQTLDQEEELSVPEIKAEMEKSLQWLVPFAENTTKAHQGFGWVGEWANS 453

Query: 169  GIEFGK-----TTKNSVIRLQTLYHADKGKMDRYILELVICLHRLISLVKYKDNGSKAQ 8
             IEFGK         +  RLQTL+HADK  +D Y+LELV+ LHRL+   K + +G K Q
Sbjct: 454  RIEFGKGKGKGENNGNPTRLQTLHHADKPIVDSYVLELVVWLHRLMKSSKKRAHGVKLQ 512


>ref|XP_002893642.1| hypothetical protein ARALYDRAFT_473306 [Arabidopsis lyrata subsp.
            lyrata] gi|297339484|gb|EFH69901.1| hypothetical protein
            ARALYDRAFT_473306 [Arabidopsis lyrata subsp. lyrata]
          Length = 613

 Score =  469 bits (1207), Expect = e-130
 Identities = 258/518 (49%), Positives = 344/518 (66%), Gaps = 19/518 (3%)
 Frame = -1

Query: 1504 CSDNYSKDVSKKTNDGVEEDENYDG---------TNDPGFQIYDKKRTRHIFDSGELNFN 1352
            CS  Y     KK     ++   + G         T+D  +  +     R      E+ FN
Sbjct: 5    CSCVYKDGDKKKLRSNDDKTRGFSGKLKSMRRRRTSDSYYSDHYGSSRRKSSKPDEVVFN 64

Query: 1351 ISTQFLNRRSAAARTPP----TTKGPHMNSFFGKAGIVGLERAVDVLDTLGSSMTNMHGG 1184
             S +           PP    +TK    NSF G+AG++GLE+AV+VLDTLGSSM+ M+  
Sbjct: 65   FSGEL-------GPMPPLRNDSTKFMQRNSFMGRAGVMGLEKAVEVLDTLGSSMSRMNPS 117

Query: 1183 SGFAYNM-ASRGNKVSILAFEVANTIAKGSNLLQSISEENIRILKDEILQSDGVLRLVST 1007
            S +   + +SRG KV+ILAFEVANTIAKG+ LLQS+SEEN++ +K E+L+S GV +LVST
Sbjct: 118  SAYLSGVTSSRGGKVTILAFEVANTIAKGAALLQSLSEENLKFMKKEMLRSKGVKKLVST 177

Query: 1006 DMTELLRIAAIDQREEFNIFSREVIRFGDLCRDPQWHNLDRYFSKFDSDPGVGKLRREEA 827
            D  EL  +AA D+REE ++FS EVIRFG++C+D QWHNLDRYF K D++    KL ++EA
Sbjct: 178  DTAELQILAASDKREELDLFSGEVIRFGNMCKDMQWHNLDRYFMKLDTENSQHKLLKDEA 237

Query: 826  EMTMQELLSLAQYTSELYHELHALDRFEQDYHRKVEEVESLHLPRKGEGLLILHSELKHQ 647
            E  MQEL++LA++TSELYHEL ALDRFEQDY RK+ E+ESL+LPR+GEG++IL +ELK Q
Sbjct: 238  EAKMQELVTLARFTSELYHELQALDRFEQDYRRKLAEIESLNLPRRGEGIVILQNELKQQ 297

Query: 646  RXXXXXXXXXXXXXXXXXXXVEKLVDIVTFIHQHILEVFGDNVLGSSSTEEKASSKPDER 467
            R                   +EKLVD+V +I Q I+EVFG+N L     ++    +  ER
Sbjct: 298  RKLVKSLQKKSLWSQNLEEIIEKLVDVVCYIRQTIVEVFGNNGL-----KDNEGKQGRER 352

Query: 466  LGIAGLALHYAHVVTQIDNIASRPTSLPPNMRDGLYSGLPVNVKKSLRSRLQSLDVHDVL 287
            LG AGL+LHYA+++ QID+IASRP+SLP N+RD LY+ LP  VK +LR RLQ+LD  + +
Sbjct: 353  LGEAGLSLHYANLIQQIDSIASRPSSLPSNVRDTLYNALPATVKTALRPRLQTLDPEEEV 412

Query: 286  SIPQMKTQMERTLKWLAPIAADTTKAHQGFGWVGEWANTGIEFGK-----TTKNSVIRLQ 122
             + ++K +ME++L+WL P A +TTKAHQGFGWVGEWAN+ IEFGK         +  RLQ
Sbjct: 413  LVSEIKAEMEKSLQWLVPFAENTTKAHQGFGWVGEWANSRIEFGKGKGKGENNGNPTRLQ 472

Query: 121  TLYHADKGKMDRYILELVICLHRLISLVKYKDNGSKAQ 8
            TL+HADK K+D Y+LELV+ LHRL+   K +  G K Q
Sbjct: 473  TLHHADKPKVDSYVLELVVWLHRLMKSSKKRVQGVKLQ 510

