BLASTX nr result

ID: Angelica23_contig00009565 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00009565
         (1500 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_003635095.1| PREDICTED: uncharacterized protein At4g37920...   552   e-154
ref|XP_003517682.1| PREDICTED: uncharacterized protein At4g37920...   516   e-144
ref|XP_002866923.1| hypothetical protein ARALYDRAFT_327981 [Arab...   508   e-141
ref|NP_195505.2| uncharacterized protein [Arabidopsis thaliana] ...   507   e-141
emb|CAB37532.1| putative protein [Arabidopsis thaliana] gi|72707...   507   e-141

>ref|XP_003635095.1| PREDICTED: uncharacterized protein At4g37920, chloroplastic-like
            [Vitis vinifera] gi|297736678|emb|CBI25695.3| unnamed
            protein product [Vitis vinifera]
          Length = 431

 Score =  552 bits (1422), Expect = e-154
 Identities = 287/430 (66%), Positives = 342/430 (79%), Gaps = 4/430 (0%)
 Frame = +3

Query: 54   MSNLLGFXXXXXXXXXXPSQLNTTLSFFNTEMPPTILTXXXXXXXXXXXXXXNHFLTSRP 233
            MSNLLGF          PS LN  L F  + + P+I +               +  T R 
Sbjct: 1    MSNLLGFKLLLCNTTK-PSVLNQNL-FSASILLPSISSPPLFLPSKQSDSLTPNSRT-RK 57

Query: 234  TKPTSH----KFTKKMFSRKVGCVQVEEQASEVEIAEGYTMTKFLDKIIDLFLNEKPRVK 401
             + TS      F     +  +G  +V EQ  EVE+A GYT+T+F DKIID+F+NEKP++K
Sbjct: 58   GRGTSDAVLSNFRANSTANSIGAAEVAEQV-EVEVANGYTITQFCDKIIDVFMNEKPKLK 116

Query: 402  DWRKYLVFREDWKKYRDGFYDRCHTLAMSQTDSAMKQKLISLGRKVKRIDDEMERHTELL 581
            +WRKYLVFRE+W KYR+ FY+RC T A ++TD  +K+KLI LGRKVK+IDDEMERHTELL
Sbjct: 117  EWRKYLVFREEWNKYREAFYNRCQTRAYAETDPVIKKKLIELGRKVKKIDDEMERHTELL 176

Query: 582  KEIEGSPLDINAIVSKRRKDFTEEFFRHLTLLSETYDGLEDRDAVARLGARCLAAVGAYD 761
            +E++ SP+D+NAIV +RRKDFT EFFRHL+LLSETYD LEDRDA+ARLGARCL+AV AYD
Sbjct: 177  EEVQSSPMDVNAIVVRRRKDFTGEFFRHLSLLSETYDSLEDRDAMARLGARCLSAVSAYD 236

Query: 762  NTIEIVDTLDSAQAKFDDILNCPSVEEACEKINSLAKAKELDSSLILLINSAWASAKEST 941
            NT+EIV+TLD AQAKFDDILN PS++ ACEKI SLAKAKELDSSLILLINSAW++AKEST
Sbjct: 237  NTLEIVETLDVAQAKFDDILNSPSIDVACEKIKSLAKAKELDSSLILLINSAWSAAKEST 296

Query: 942  TMKNEVKDIMYRLYRATKSSLKSIEPKEIKLLKYLLNITDPEERFSALATAFSPGDDHVA 1121
            TMKNEVKDIMY LY+ATKSSL+SI PKEIKLLK+LLNITDPEERFSALA+AFSPGDD  A
Sbjct: 297  TMKNEVKDIMYHLYKATKSSLRSIAPKEIKLLKHLLNITDPEERFSALASAFSPGDDREA 356

Query: 1122 RDPKAVYTTPKELHKWIKIMLDAHHLHKEDTEIREAKQMTPPMVIQRLFILKETIEVEYL 1301
            +DP A+YTTPKELHKWIKIMLDA+HL+KE+T+IREA+QMT P+VIQRLFILKETIE EYL
Sbjct: 357  KDPNALYTTPKELHKWIKIMLDAYHLNKEETDIREARQMTEPVVIQRLFILKETIEEEYL 416

Query: 1302 EQNVNKENKT 1331
            E+ +  E +T
Sbjct: 417  ERGIKTEQET 426


>ref|XP_003517682.1| PREDICTED: uncharacterized protein At4g37920, chloroplastic-like
            [Glycine max]
          Length = 432

 Score =  516 bits (1330), Expect = e-144
 Identities = 251/346 (72%), Positives = 305/346 (88%)
 Frame = +3

Query: 294  QVEEQASEVEIAEGYTMTKFLDKIIDLFLNEKPRVKDWRKYLVFREDWKKYRDGFYDRCH 473
            Q EE   EVEIA+GYTMT+F DK+ID FLNEK + K+WRKYL+FRE+WKKYRD FY+RC 
Sbjct: 80   QAEEH--EVEIAKGYTMTQFCDKMIDFFLNEKTKSKEWRKYLIFREEWKKYRDRFYNRCQ 137

Query: 474  TLAMSQTDSAMKQKLISLGRKVKRIDDEMERHTELLKEIEGSPLDINAIVSKRRKDFTEE 653
              A  + D  MK+K ISL RK+K+IDDEME H ELL EI+ SP+DINAIV++RRKDFT E
Sbjct: 138  RRADMENDPVMKEKFISLRRKLKKIDDEMEGHYELLMEIQDSPMDINAIVARRRKDFTGE 197

Query: 654  FFRHLTLLSETYDGLEDRDAVARLGARCLAAVGAYDNTIEIVDTLDSAQAKFDDILNCPS 833
            FF +L+L+S+TYD LEDRD ++RLG+RCL+AV AYDNT+E ++TLD+AQAKFDDILN PS
Sbjct: 198  FFHYLSLISDTYDSLEDRDGISRLGSRCLSAVSAYDNTLENIETLDAAQAKFDDILNSPS 257

Query: 834  VEEACEKINSLAKAKELDSSLILLINSAWASAKESTTMKNEVKDIMYRLYRATKSSLKSI 1013
            ++ AC+KI SLAKAKELDSSLILLI+SAWA AKESTTMKNEVKDIMY+LYRATKSSL+SI
Sbjct: 258  IDIACQKIKSLAKAKELDSSLILLISSAWAKAKESTTMKNEVKDIMYQLYRATKSSLRSI 317

Query: 1014 EPKEIKLLKYLLNITDPEERFSALATAFSPGDDHVARDPKAVYTTPKELHKWIKIMLDAH 1193
             PKEIKLLK+LLNI DPEERFSALATAF+PGD+H A+DP A+YTTPKELHKWIKIMLDA+
Sbjct: 318  TPKEIKLLKHLLNIIDPEERFSALATAFTPGDEHEAKDPNALYTTPKELHKWIKIMLDAY 377

Query: 1194 HLHKEDTEIREAKQMTPPMVIQRLFILKETIEVEYLEQNVNKENKT 1331
            HL+KE+T++REA+QMT P+VIQRLFILK+TIE EY+E++  ++++T
Sbjct: 378  HLNKEETDLREARQMTDPVVIQRLFILKDTIEQEYMEKDTTQKSET 423


>ref|XP_002866923.1| hypothetical protein ARALYDRAFT_327981 [Arabidopsis lyrata subsp.
            lyrata] gi|297312759|gb|EFH43182.1| hypothetical protein
            ARALYDRAFT_327981 [Arabidopsis lyrata subsp. lyrata]
          Length = 643

 Score =  508 bits (1307), Expect = e-141
 Identities = 254/350 (72%), Positives = 296/350 (84%), Gaps = 7/350 (2%)
 Frame = +3

Query: 303  EQASEVEIAEGYTMTKFLDKIIDLFLNEKPRVKDWRKYLVFREDWKKYRDGFYDRCHTLA 482
            E   EVE+AEGYTM +F DKIIDLFLNEKP+VK W+ YLV R++W KY   FY RC   A
Sbjct: 287  EDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYRRCRIRA 346

Query: 483  MSQTDSAMKQKLISLGRKVKRIDDEMERHTELLKEIEGSPLDINAIVSKRRKDFTEEFFR 662
             S+TD  +KQKL+SL  KVK+ID+EME+H +LLKEI+ +P DINAI +KRR+DFT EFFR
Sbjct: 347  DSETDPILKQKLLSLESKVKKIDEEMEKHNDLLKEIQENPTDINAIAAKRRRDFTGEFFR 406

Query: 663  HLTLLSETYDGLEDRDAVARLGARCLAAVGAYDNTIEIVDTLDSAQAKFDDILNCPSVEE 842
            ++ LLSET DGLEDRDAVARL  RCL+AV AYDNT+E V+TLDSAQAKFDDILN PSV+ 
Sbjct: 407  YVALLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDSAQAKFDDILNSPSVDA 466

Query: 843  ACEKINSLAKAKELDSSLILLINSAWASAKESTTMKNEVKDIMYRLYRATKSSLKSIEPK 1022
            ACEKI SLAK+KELDSSLILLINSA+A+AKES T+ NE KD+MY LY+ATKSSL+SI PK
Sbjct: 467  ACEKIRSLAKSKELDSSLILLINSAYAAAKESQTVTNEAKDVMYHLYKATKSSLRSITPK 526

Query: 1023 EIKLLKYLLNITDPEERFSALATAFSPGDDHVARDPKAVYTTPKELHKWIKIMLDAHHLH 1202
            EIKLLKYLLNITDPEERFSALATAFSPGDDH A+DPKA+YTTPKELHKWIKIMLDA+HL+
Sbjct: 527  EIKLLKYLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLN 586

Query: 1203 KEDTEIREAKQMTPPMVIQRLFILKETIEVEYL-------EQNVNKENKT 1331
            KE+T+I+EAKQM+ P+VIQRLFILK+TIE EYL       ++N  KE  T
Sbjct: 587  KEETDIKEAKQMSQPIVIQRLFILKDTIEDEYLDKKTIVADENPKKEEDT 636


>ref|NP_195505.2| uncharacterized protein [Arabidopsis thaliana]
            gi|209574320|sp|Q84WN0.2|Y4920_ARATH RecName:
            Full=Uncharacterized protein At4g37920, chloroplastic;
            Flags: Precursor gi|332661453|gb|AEE86853.1|
            uncharacterized protein [Arabidopsis thaliana]
          Length = 427

 Score =  507 bits (1306), Expect = e-141
 Identities = 251/343 (73%), Positives = 295/343 (86%)
 Frame = +3

Query: 303  EQASEVEIAEGYTMTKFLDKIIDLFLNEKPRVKDWRKYLVFREDWKKYRDGFYDRCHTLA 482
            E   EVE+AEGYTM +F DKIIDLFLNEKP+VK W+ YLV R++W KY   FY RC   A
Sbjct: 70   EDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYKRCRIRA 129

Query: 483  MSQTDSAMKQKLISLGRKVKRIDDEMERHTELLKEIEGSPLDINAIVSKRRKDFTEEFFR 662
             ++TD  +KQKL+SL  KVK+ID EME+H +LLKEI+ +P DINAI +KRR+DFT EFFR
Sbjct: 130  DTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQENPTDINAIAAKRRRDFTGEFFR 189

Query: 663  HLTLLSETYDGLEDRDAVARLGARCLAAVGAYDNTIEIVDTLDSAQAKFDDILNCPSVEE 842
            ++TLLSET DGLEDRDAVARL  RCL+AV AYDNT+E V+TLD+AQAKF+DILN PSV+ 
Sbjct: 190  YVTLLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDILNSPSVDS 249

Query: 843  ACEKINSLAKAKELDSSLILLINSAWASAKESTTMKNEVKDIMYRLYRATKSSLKSIEPK 1022
            ACEKI SLAKAKELDSSLILLINSA+A+AKES T+ NE KDIMY LY+ATKSSL+SI PK
Sbjct: 250  ACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATKSSLRSITPK 309

Query: 1023 EIKLLKYLLNITDPEERFSALATAFSPGDDHVARDPKAVYTTPKELHKWIKIMLDAHHLH 1202
            EIKLLKYLLNITDPEERFSALATAFSPGDDH A+DPKA+YTTPKELHKWIKIMLDA+HL+
Sbjct: 310  EIKLLKYLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLN 369

Query: 1203 KEDTEIREAKQMTPPMVIQRLFILKETIEVEYLEQNVNKENKT 1331
            KE+T+I+EAKQM+ P+VIQRLFILK+TIE EYL++     ++T
Sbjct: 370  KEETDIKEAKQMSQPIVIQRLFILKDTIEDEYLDKKTIVADET 412


>emb|CAB37532.1| putative protein [Arabidopsis thaliana] gi|7270775|emb|CAB80457.1|
            putative protein [Arabidopsis thaliana]
          Length = 673

 Score =  507 bits (1306), Expect = e-141
 Identities = 251/343 (73%), Positives = 295/343 (86%)
 Frame = +3

Query: 303  EQASEVEIAEGYTMTKFLDKIIDLFLNEKPRVKDWRKYLVFREDWKKYRDGFYDRCHTLA 482
            E   EVE+AEGYTM +F DKIIDLFLNEKP+VK W+ YLV R++W KY   FY RC   A
Sbjct: 316  EDPMEVEVAEGYTMAQFCDKIIDLFLNEKPKVKQWKTYLVLRDEWNKYSVNFYKRCRIRA 375

Query: 483  MSQTDSAMKQKLISLGRKVKRIDDEMERHTELLKEIEGSPLDINAIVSKRRKDFTEEFFR 662
             ++TD  +KQKL+SL  KVK+ID EME+H +LLKEI+ +P DINAI +KRR+DFT EFFR
Sbjct: 376  DTETDPILKQKLVSLESKVKKIDKEMEKHNDLLKEIQENPTDINAIAAKRRRDFTGEFFR 435

Query: 663  HLTLLSETYDGLEDRDAVARLGARCLAAVGAYDNTIEIVDTLDSAQAKFDDILNCPSVEE 842
            ++TLLSET DGLEDRDAVARL  RCL+AV AYDNT+E V+TLD+AQAKF+DILN PSV+ 
Sbjct: 436  YVTLLSETLDGLEDRDAVARLATRCLSAVSAYDNTLESVETLDTAQAKFEDILNSPSVDS 495

Query: 843  ACEKINSLAKAKELDSSLILLINSAWASAKESTTMKNEVKDIMYRLYRATKSSLKSIEPK 1022
            ACEKI SLAKAKELDSSLILLINSA+A+AKES T+ NE KDIMY LY+ATKSSL+SI PK
Sbjct: 496  ACEKIRSLAKAKELDSSLILLINSAYAAAKESQTVTNEAKDIMYHLYKATKSSLRSITPK 555

Query: 1023 EIKLLKYLLNITDPEERFSALATAFSPGDDHVARDPKAVYTTPKELHKWIKIMLDAHHLH 1202
            EIKLLKYLLNITDPEERFSALATAFSPGDDH A+DPKA+YTTPKELHKWIKIMLDA+HL+
Sbjct: 556  EIKLLKYLLNITDPEERFSALATAFSPGDDHEAKDPKALYTTPKELHKWIKIMLDAYHLN 615

Query: 1203 KEDTEIREAKQMTPPMVIQRLFILKETIEVEYLEQNVNKENKT 1331
            KE+T+I+EAKQM+ P+VIQRLFILK+TIE EYL++     ++T
Sbjct: 616  KEETDIKEAKQMSQPIVIQRLFILKDTIEDEYLDKKTIVADET 658


Top