BLASTX nr result

ID: Angelica23_contig00005947

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00005947
         (1617 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002285669.1| PREDICTED: uncharacterized protein LOC100259...   296   1e-77
ref|XP_002297812.1| predicted protein [Populus trichocarpa] gi|2...   287   6e-75
ref|XP_002518117.1| conserved hypothetical protein [Ricinus comm...   281   3e-73
ref|NP_193103.2| protein plastid transcriptionally active 5 [Ara...   278   2e-72
ref|XP_002870363.1| predicted protein [Arabidopsis lyrata subsp....   278   4e-72

>ref|XP_002285669.1| PREDICTED: uncharacterized protein LOC100259626 [Vitis vinifera]
          Length = 372

 Score =  296 bits (758), Expect = 1e-77
 Identities = 157/322 (48%), Positives = 205/322 (63%), Gaps = 6/322 (1%)
 Frame = +3

Query: 432  SRWIHEREALLGEIETLRSKIEELENANDRNLNVGGVLQVLR-----NEVSRIAERGSSA 596
            SRW  ER++LL EI  L+ +I++LE+ +  + ++  +  +L+      EV+RIAE GSSA
Sbjct: 73   SRWTTERQSLLREISELKFRIQQLEHQSSVSASIPDIAALLQLPKDSAEVARIAESGSSA 132

Query: 597  APLELESLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXITLRKGSEGDDVRI 776
             P+ LES                                        TLR GSEG++VR 
Sbjct: 133  LPMVLESKEVKEEKVGDQKKRK-------------------------TLRVGSEGEEVRA 167

Query: 777  MQEALQKLGFYCGEEDEEYSMFSSGTERAIKTWQASLRIPEDGIMTPELLEKLYGEQRND 956
            MQEALQ LGFY GEED E+S FSSGTERA+KTWQASL  PE+GIMT ELLE+L+ EQ  +
Sbjct: 168  MQEALQNLGFYSGEEDVEFSSFSSGTERAVKTWQASLGAPENGIMTAELLERLFMEQHIE 227

Query: 957  VSWLTEKADIKGTDVATMQKASNGAAATHTTKVPEIQKRDDKEKDAAESKVPHSRVFLLG 1136
             + L    D K  D +  ++  NGA     T++ EIQ++  KE+   E +V   RVFLLG
Sbjct: 228  AAGLKRNIDPKENDASPPKEGVNGALVASVTEISEIQQKVLKEEGFTEVEVSQQRVFLLG 287

Query: 1137 ENRWEDSSRLIGRNKQDGGSINTE-TTRCISCRGEGRLLCSECDGTGEPNIEEQFMEWVD 1313
            ENRWE+ SRL+GR+K+ GG+   + TT+C++CRGEGRL+C+ECDGTGEPNIE QF++WVD
Sbjct: 288  ENRWEEPSRLVGRDKKGGGNKPKDATTKCLTCRGEGRLMCTECDGTGEPNIEPQFLDWVD 347

Query: 1314 EGAKCPYCNGVGFEICDVCNGK 1379
            EG KCPYC G+G  ICD C GK
Sbjct: 348  EGVKCPYCEGLGHTICDACEGK 369


>ref|XP_002297812.1| predicted protein [Populus trichocarpa] gi|222845070|gb|EEE82617.1|
            predicted protein [Populus trichocarpa]
          Length = 395

 Score =  287 bits (734), Expect = 6e-75
 Identities = 182/413 (44%), Positives = 227/413 (54%), Gaps = 20/413 (4%)
 Frame = +3

Query: 201  FPLCLNPL-FTTKPHTNNTISPHTYNN-------SFQSLSKS-YICFSISSDNSSFXXXX 353
            FPL LNP  F     +++T SP  ++        S  +LSK  YICFS + D        
Sbjct: 10   FPLSLNPKPFHPHKQSHSTHSPLQFSKHTTVLPLSRSTLSKPHYICFSSNPDREE----- 64

Query: 354  XXXXXXXXXXXXXXXXXXXXXXXXXXSRWIHEREALLGEIETLRSKIEELEN-------A 512
                                       RW  +RE+LL +I++L+ +IE LEN        
Sbjct: 65   -----SRWLREEQRWLREEERWLREEKRWSCDRESLLAQIQSLKLQIEALENRISVLQGG 119

Query: 513  NDRNLNVGGVLQVLR--NEVSRIAERGSSAAPLELESLXXXXXXXXXXXXXXXXXXXXXX 686
             D    VG +LQVL+  N  + IAE GSSA PL LE                        
Sbjct: 120  EDTVAKVGLLLQVLKDKNNNNLIAESGSSARPLVLEENVVEEQKEVIDRVLEEKKERK-- 177

Query: 687  XXXXXXXXXXXXXXXXITLRKGSEGDDVRIMQEALQKLGFYCGEEDEEYSMFSSGTERAI 866
                             TLRKGSEG+ V+ MQ+ALQKLGFY GEED EYS FSSGTERA+
Sbjct: 178  -----------------TLRKGSEGEQVKEMQDALQKLGFYSGEEDMEYSSFSSGTERAV 220

Query: 867  KTWQASLRIPEDGIMTPELLEKLYGEQRNDVSWLTEKADIKGT-DVATMQKASNGAAATH 1043
            +TWQASL   EDGIMT ELL++LY EQ  D    +     KG+      ++ ++GAA T 
Sbjct: 221  RTWQASLGASEDGIMTTELLKRLYMEQHIDARMPSISETQKGSAQTVPAEEGADGAAVTS 280

Query: 1044 TTKVPEIQKRDDKEKDAAESKVPHSRVFLLGENRWEDSSRLIGRNKQDGGSINTETTR-C 1220
             T++ EI ++  KE +  E  V H RVFLLGENRWE+ SRL GR KQ  GS   ++T+ C
Sbjct: 281  VTEISEIHQKVVKE-EVTEVDVSHHRVFLLGENRWEEPSRLNGRKKQVSGSKTKDSTKQC 339

Query: 1221 ISCRGEGRLLCSECDGTGEPNIEEQFMEWVDEGAKCPYCNGVGFEICDVCNGK 1379
            ++CRGEGRLLC+ECDGTGEPN+E QF+EWV EGA CPYC G G+ ICDVC GK
Sbjct: 340  LTCRGEGRLLCTECDGTGEPNVEPQFLEWVGEGANCPYCEGQGYTICDVCAGK 392


>ref|XP_002518117.1| conserved hypothetical protein [Ricinus communis]
            gi|223542713|gb|EEF44250.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 386

 Score =  281 bits (719), Expect = 3e-73
 Identities = 172/381 (45%), Positives = 216/381 (56%), Gaps = 17/381 (4%)
 Frame = +3

Query: 288  SLSKSYICFSISSDNSSFXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSRWIHEREALLG 467
            S SKS++CFS S  N+                                 RW+ ERE+LL 
Sbjct: 28   SFSKSHLCFSSSLPNTP---SPSDGKDFLWLREEQRWLREEQRWLREEQRWLRERESLLH 84

Query: 468  EIETLRSKIEELEN---------ANDRNLNVGGVLQVLRNEVSRIAERGSSAA------P 602
            EI++L+ +I+ LE            +   +V  +LQVL NE +RIAE GS+++      P
Sbjct: 85   EIQSLKLQIKALEQRISVQEVDLVPENIASVRALLQVL-NEKNRIAESGSTSSSNPDPNP 143

Query: 603  LELESLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXITLRKGSEGDDVRIMQ 782
            + +E                                        ITLRKGSEGD+VR MQ
Sbjct: 144  IAVEE--------------------KVEEVKEVIGVLKKEEKRRITLRKGSEGDEVREMQ 183

Query: 783  EALQKLGFYCGEEDEEYSMFSSGTERAIKTWQASLRIPEDGIMTPELLEKLYGEQRNDVS 962
            EAL  LGFY GEED E+S FSSGTERA+KTWQASL  PEDGIMT ELLE+LY  Q+N V+
Sbjct: 184  EALLNLGFYSGEEDMEFSSFSSGTERAVKTWQASLGAPEDGIMTAELLERLYVGQQNKVT 243

Query: 963  WLTEKADIKGTDVATMQKAS-NGAAATHTTKVPEIQKRDDKEKDAAESKVPHSRVFLLGE 1139
              T   D K + +   QK S NGAA    T++ E Q++  K+    E K    RVFLLGE
Sbjct: 244  GSTISIDQKESSLTVSQKESANGAAVASITEISETQQKIVKD-GVTEVKGSQQRVFLLGE 302

Query: 1140 NRWEDSSRLIGRNKQDGGSINTET-TRCISCRGEGRLLCSECDGTGEPNIEEQFMEWVDE 1316
            NRWE+ SRL+ ++KQ G S   ++ T+C+SCRGEGRLLC+ECDGTGEPNIE QF+EWV E
Sbjct: 303  NRWEEPSRLVSKDKQVGVSKPKDSMTKCLSCRGEGRLLCTECDGTGEPNIEPQFLEWVGE 362

Query: 1317 GAKCPYCNGVGFEICDVCNGK 1379
            G KCPYC G+G+  CDVC GK
Sbjct: 363  GMKCPYCEGLGYTTCDVCEGK 383


>ref|NP_193103.2| protein plastid transcriptionally active 5 [Arabidopsis thaliana]
            gi|119360137|gb|ABL66797.1| At4g13670 [Arabidopsis
            thaliana] gi|332657911|gb|AEE83311.1| protein plastid
            transcriptionally active 5 [Arabidopsis thaliana]
          Length = 387

 Score =  278 bits (712), Expect = 2e-72
 Identities = 159/326 (48%), Positives = 198/326 (60%), Gaps = 11/326 (3%)
 Frame = +3

Query: 435  RWIHEREALLGEIETLRSKIEELENAN--------DRNLNVGGVLQVLRNEVSRIAERGS 590
            RWI ERE+LL EI  L+ +I+ LE+ N        D   N+  +LQVL+ E +RI+E G 
Sbjct: 74   RWIRERESLLQEISDLQLRIQSLESRNSQLGNSIPDTISNIAALLQVLK-EKNRISESGL 132

Query: 591  SAAPLELESLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXITLRKGSEGDDV 770
            SA P+ LES                                         L+ GSEGDDV
Sbjct: 133  SATPMVLESTREQIVEEVEEEEKRVIIAEEKVRVSEPVKKIKRRI-----LKVGSEGDDV 187

Query: 771  RIMQEALQKLGFYCGEEDEEYSMFSSGTERAIKTWQASLRIPEDGIMTPELLEKLYGEQR 950
            + +QEAL KLGFY GEED E+S FSSGT  A+KTWQASL + EDG+MT ELL++L+    
Sbjct: 188  QALQEALLKLGFYSGEEDMEFSSFSSGTASAVKTWQASLGVREDGVMTAELLQRLF---- 243

Query: 951  NDVSWLTEKADIKGTDVATMQK--ASNGAAATHTTKVPEIQKRDDKEKDAAESKVPHSRV 1124
                 + E  +    + +TM+K  A NGA  T  T+VPE ++   K++   E  V  +RV
Sbjct: 244  -----MDEDVETDKDEASTMKKEEAGNGAVFTSVTQVPEKKQSIVKDQSDREVDVTQNRV 298

Query: 1125 FLLGENRWEDSSRLIGRNKQDGGSINTET-TRCISCRGEGRLLCSECDGTGEPNIEEQFM 1301
            FLLGENRWED SRLIGRNK    S +T T TRCI+CRGEGRL+C ECDGTGEPNIE QFM
Sbjct: 299  FLLGENRWEDPSRLIGRNKPVDRSESTNTKTRCITCRGEGRLMCLECDGTGEPNIEPQFM 358

Query: 1302 EWVDEGAKCPYCNGVGFEICDVCNGK 1379
            EWV E  KCPYC G+G+ +CDVC+GK
Sbjct: 359  EWVGEDTKCPYCEGLGYTVCDVCDGK 384


>ref|XP_002870363.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297316199|gb|EFH46622.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 394

 Score =  278 bits (710), Expect = 4e-72
 Identities = 159/326 (48%), Positives = 199/326 (61%), Gaps = 11/326 (3%)
 Frame = +3

Query: 435  RWIHEREALLGEIETLRSKIEELENAN--------DRNLNVGGVLQVLRNEVSRIAERGS 590
            RWI ERE+LL EI  L+ KI+ LE+ N        D   N+  +LQ L+ E +RI+E G 
Sbjct: 77   RWIRERESLLQEISNLQLKIQALESRNSQLGTSVPDTISNIAALLQGLK-EKNRISESGL 135

Query: 591  SAAPLELESLXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXITLRKGSEGDDV 770
            SA P+ LES                                        TL+ GSEGDDV
Sbjct: 136  SATPMVLESTREQIVEEVVEVEEEEKRVVIAEEKVMVSEPVKKKKKRR-TLKVGSEGDDV 194

Query: 771  RIMQEALQKLGFYCGEEDEEYSMFSSGTERAIKTWQASLRIPEDGIMTPELLEKLYGEQR 950
            + +QEAL KLGFY GEED E+S FSSGT  A+KTWQASL + EDGIMT ELL++L+    
Sbjct: 195  QALQEALLKLGFYSGEEDMEFSSFSSGTASAVKTWQASLGVREDGIMTAELLQRLF---- 250

Query: 951  NDVSWLTEKADIKGTDVATMQK--ASNGAAATHTTKVPEIQKRDDKEKDAAESKVPHSRV 1124
                 + E  +    + +TM+K  ASNG+  +  T+VPE ++   K++   E  V  +RV
Sbjct: 251  -----MDEDVETDKDEASTMKKEEASNGSVFSSVTQVPEKKQSIIKDQSNREDDVTQNRV 305

Query: 1125 FLLGENRWEDSSRLIGRNKQDGGSINTET-TRCISCRGEGRLLCSECDGTGEPNIEEQFM 1301
            +LLGENRWED SRLIGRNK    S +T T TRCI+CRGEGRL+C ECDGTGEPNIE QFM
Sbjct: 306  YLLGENRWEDPSRLIGRNKPVDSSKSTITKTRCITCRGEGRLMCLECDGTGEPNIEPQFM 365

Query: 1302 EWVDEGAKCPYCNGVGFEICDVCNGK 1379
            EWV E  KCPYC G+G+ +CDVC+GK
Sbjct: 366  EWVGEDTKCPYCEGLGYTVCDVCDGK 391

