BLASTX nr result

ID: Cnidium21_contig00007579 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Cnidium21_contig00007579
         (1898 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002267174.1| PREDICTED: digestive organ expansion factor ...   659   0.0  
ref|XP_004161801.1| PREDICTED: U3 small nucleolar RNA-associated...   569   e-160
ref|NP_564032.1| uncharacterized protein [Arabidopsis thaliana] ...   561   e-157
gb|AAL07149.1| unknown protein [Arabidopsis thaliana]                 561   e-157
ref|XP_002892963.1| hypothetical protein ARALYDRAFT_471979 [Arab...   559   e-156

>ref|XP_002267174.1| PREDICTED: digestive organ expansion factor homolog [Vitis vinifera]
            gi|296090307|emb|CBI40126.3| unnamed protein product
            [Vitis vinifera]
          Length = 753

 Score =  659 bits (1699), Expect = 0.0
 Identities = 348/580 (60%), Positives = 421/580 (72%), Gaps = 14/580 (2%)
 Frame = +3

Query: 201  RRFVTAGGSRR-KGHENAYKNGPKKKVSSSRDE-VADSPSPCTSGDSDVEHTDIVSDGES 374
            +RF    G+R  KG     K    K+V    +E VA SPSP +SG+S  E     S G +
Sbjct: 3    KRFGRKRGARGLKGTNTPAKYDTSKRVRRRTEEKVASSPSPSSSGESSDEQ----SSGVA 58

Query: 375  EEMVEVHKENTMYDDLLKTLGSASKSLASANRRRQRDEEGKXXXXXXXXXXXXXXXXXXX 554
             E   V+KE +MYD+LL TLG+ S+SLA+A +RR+RDEEG+                   
Sbjct: 59   SEEEVVYKEPSMYDNLLFTLGTGSESLANAYKRRRRDEEGRSDSEEDINGGSESLTVSEE 118

Query: 555  XXLQMDYGGSKHQESSLVDVIEQANNDDTYVDN-TFDSDDECDLNVSGHSIATAPAITSC 731
               +        +  +   ++EQ+ + +T  DN   D+D   DL+V   S   A   TS 
Sbjct: 119  EDNEEGTDNESARGHASESIMEQSEDAETEDDNEASDTDQMHDLDVDDQSAVGASESTS- 177

Query: 732  SSFDEHFGYKLSKDEVNDLMRGKWKNKWKVPAVGMSKCKWIGTGERFLENDDINLAYDLK 911
             SF  H G+KLSK+EV++L R KWK  W++PA  MS CKW+GTGE FL++ + +  YDLK
Sbjct: 178  -SFTSHVGHKLSKEEVDNLTRRKWKYNWEMPAFDMSSCKWMGTGESFLKDVNTSSGYDLK 236

Query: 912  PRLYNHWLDNYKANGGKDFHSSRPRSFFSLCNSYRDILHHNKKPFYLKGVEEDSSIMDAY 1091
             +LY HWL+NYK +GG DFHSS+ R FFSLCNSYRDIL+ NKKPFYLKG EEDS IMDAY
Sbjct: 237  LKLYKHWLENYKTSGGNDFHSSKQRLFFSLCNSYRDILYCNKKPFYLKGQEEDSCIMDAY 296

Query: 1092 IMHSLNHIFKSGDLVNKNDAKVAKLQETTKNEIPSNDEFLDRGFTRPKILILLPLASIAF 1271
            IMH+LNHIF++GDLV KND+KVAK QET K EI + D FLD GFTRPK+LILLPLASIA 
Sbjct: 297  IMHALNHIFRTGDLVTKNDSKVAKHQETVKEEILTGDSFLDHGFTRPKVLILLPLASIAL 356

Query: 1272 RVVKRLIQLTPSKHKVNVENNDRFTDEYGTATDEIDET-----------ENSKSQKSSKP 1418
            RVVKRLIQLTPS  KVNVE+ DRF+DE+GT   E D+            ENS  QKSSKP
Sbjct: 357  RVVKRLIQLTPSSSKVNVEHIDRFSDEFGTGVVEDDQEQNELFQKVQDYENSIVQKSSKP 416

Query: 1419 SDFQALFGGNNNDHFMIGIKFTRRSMKLYNDFYSSDMIVASPLGLITKIGEAEIDKEKDT 1598
            SDFQALFG NNNDHFMIGIKFTRR++KLY+DFYSSDMI+ASPLGLITKIGEAE++KEKD 
Sbjct: 417  SDFQALFGANNNDHFMIGIKFTRRTIKLYSDFYSSDMIIASPLGLITKIGEAEVEKEKDV 476

Query: 1599 DYLSSIEVLVIDHADVIAMQNWSHVSTVIDKLNRIPSKQHGTDIMRIRPWYLDGQAKFYR 1778
            DYLSSIEVLVIDHADVI+MQNWSHV++V+++LNRIPSKQHGTDIMRIR WYLDG A+FYR
Sbjct: 477  DYLSSIEVLVIDHADVISMQNWSHVNSVVEQLNRIPSKQHGTDIMRIRQWYLDGHAQFYR 536

Query: 1779 QSIILGSHLNPDINAMFNRHCLNYRGKIKLDYDHKGVLPK 1898
            Q+IILGS+LNPD+NA FN HC+NY+GK+KL  ++KG L K
Sbjct: 537  QTIILGSYLNPDMNASFNHHCVNYQGKVKLVCEYKGALAK 576


>ref|XP_004161801.1| PREDICTED: U3 small nucleolar RNA-associated protein 25-like [Cucumis
            sativus]
          Length = 710

 Score =  569 bits (1467), Expect = e-160
 Identities = 303/526 (57%), Positives = 376/526 (71%), Gaps = 16/526 (3%)
 Frame = +3

Query: 369  ESEEMVEVHKENTMYDDLLKTLGSASKSLASANRRRQRDEEGKXXXXXXXXXXXXXXXXX 548
            E+EE+ EV  E + Y++LL  L S  K +A++  +RQR EEGK                 
Sbjct: 13   EAEEL-EVFTEQSNYENLLMQLQSRHKDVAASCMKRQRQEEGKSDTEDDEDNCSESSSAL 71

Query: 549  XXXXLQMDY------GGSKHQESSLVDVIEQANNDDTYVD-NTFDSDDECDLNVSGHSIA 707
                 + +         S+   SS   + E   N +T  D ++ D+D E +L    HS  
Sbjct: 72   EEEEEEEEEEEEVTDDESRRSPSSGNSMYEPVLNVETEDDADSSDTDQENELEFGSHSGP 131

Query: 708  TAPAITSCSSFDEHFGYKLSKDEVNDLMRGKWKNKWKVPAVGMSKCKWIGTGERFLENDD 887
            +   ITS  SF++H  +KLS+ EV + ++ KWK  W VPAVGM  CKW GTGE FL+  D
Sbjct: 132  STSDITS--SFNKHMEHKLSEGEVENFLKMKWKYTWAVPAVGMPNCKWSGTGECFLKELD 189

Query: 888  IN-LAYDLKPRLYNHWLDNYKANGGKDFHSSRPRSFFSLCNSYRDILHHNKKPFYLKGVE 1064
            +   +YDLK RLY HWLD YK++ G DFHSSR R FFSLCNSYRDIL+ NKKPFYLKG+E
Sbjct: 190  MKPSSYDLKLRLYEHWLDTYKSSRGTDFHSSRQRFFFSLCNSYRDILYCNKKPFYLKGLE 249

Query: 1065 EDSSIMDAYIMHSLNHIFKSGDLVNKNDAKVAKLQETTKNEIPSNDEFLDRGFTRPKILI 1244
            EDSSIMD+YIMHSLNH+FK+ DL+ KND+KVAK Q+    EI S ++FLD GFTRPK+LI
Sbjct: 250  EDSSIMDSYIMHSLNHVFKARDLIAKNDSKVAKHQDCA--EILSGEKFLDHGFTRPKVLI 307

Query: 1245 LLPLASIAFRVVKRLIQLTPSKHKVNVENNDRFTDEYGTATDEIDET--------ENSKS 1400
            LLPLASIAFRV+KRL+ LTPS +KV VE  DR   ++G   D  ++         ++S S
Sbjct: 308  LLPLASIAFRVIKRLVHLTPSANKVTVEYLDRLFKDFGNGDDGKNQDMVELSLNDQSSSS 367

Query: 1401 QKSSKPSDFQALFGGNNNDHFMIGIKFTRRSMKLYNDFYSSDMIVASPLGLITKIGEAEI 1580
            QKSSKPSDFQALFGGNN D FMIGIKFTR+S+KL++DFYSSD+IVASPLGLITK+GE E 
Sbjct: 368  QKSSKPSDFQALFGGNNEDLFMIGIKFTRKSIKLFSDFYSSDIIVASPLGLITKLGEIEK 427

Query: 1581 DKEKDTDYLSSIEVLVIDHADVIAMQNWSHVSTVIDKLNRIPSKQHGTDIMRIRPWYLDG 1760
            +KEKD DYLSSIEVL+IDHAD+IAMQNWSHV+TVI+ +N+IPSKQHGTD+MRIR WYLDG
Sbjct: 428  NKEKDVDYLSSIEVLIIDHADIIAMQNWSHVNTVIEHMNKIPSKQHGTDVMRIRQWYLDG 487

Query: 1761 QAKFYRQSIILGSHLNPDINAMFNRHCLNYRGKIKLDYDHKGVLPK 1898
             A+FYRQS++LG H NPDIN  F R+C N+ GK+KL  ++KGVLPK
Sbjct: 488  HARFYRQSVVLGFHSNPDINGFFVRYCNNFEGKVKLLCEYKGVLPK 533


>ref|NP_564032.1| uncharacterized protein [Arabidopsis thaliana]
            gi|23297505|gb|AAN12983.1| unknown protein [Arabidopsis
            thaliana] gi|332191501|gb|AEE29622.1| uncharacterized
            protein [Arabidopsis thaliana]
          Length = 754

 Score =  561 bits (1446), Expect = e-157
 Identities = 303/577 (52%), Positives = 379/577 (65%), Gaps = 20/577 (3%)
 Frame = +3

Query: 228  RRKGHENAYKNGPKKKVSSSRDEVADSPSPCTSGDSDVEHTDIVSDGESEEMVEVHKENT 407
            R + HE   K    KK       +  +PS  +  +S +E        ESE MV  ++E T
Sbjct: 11   RHRSHEKFDKKRDTKKHKHVEKTIVSNPSTDSPEESSIE-------AESEAMV--YREPT 61

Query: 408  MYDDLLKTLGSASKSLASANRRRQRDEEGKXXXXXXXXXXXXXXXXXXXXXLQMDYGGSK 587
             Y +LL +LGS++K +A  N+RRQR+EEGK                           G  
Sbjct: 62   QYQNLLVSLGSSNKVVADMNKRRQREEEGKSDTEEDEDDEDEDEEENSGSDDLSSTDGED 121

Query: 588  HQESSLVDVIEQANNDDTYVDNTFDSDDEC-----------DLNVSGHSIATAPAITSCS 734
             +             DDT  DN   S++E            +L+ +G S   A +  S S
Sbjct: 122  DKSQGDDQETLGGLTDDTQEDNDNQSEEEDPDDYETDEEVHELSTNGQSFVDASS--SIS 179

Query: 735  SFDEHFGYKLSKDEVNDLMRGKWKNKWKVPAVGMSKCKWIGTGERFLENDDINLAYDLKP 914
            +F EH  +KLS +EV  L +GKWK KW+ PA  M  CKW GT E FL+    +  Y LKP
Sbjct: 180  AFSEHLSHKLSSEEVETLPKGKWKFKWESPAFDMPNCKWKGTSENFLDGIQSDAPYGLKP 239

Query: 915  RLYNHWLDNYKANGGKDFHSSRPRSFFSLCNSYRDILHHNKKPFYLKGVEEDSSIMDAYI 1094
            +LYNHWL  YK  GGKD  SS+ R FFS+CNSY DILH NKKPFY  G +EDSS MDAY+
Sbjct: 240  KLYNHWLQLYKKCGGKDLDSSKRRKFFSICNSYLDILHSNKKPFYHCGSDEDSSAMDAYL 299

Query: 1095 MHSLNHIFKSGDLVNKNDAKVAKLQETTKNEIPSNDEFLDRGFTRPKILILLPLASIAFR 1274
            MHSLNHIFK+ DLV KN++K+AK +ET++ EI S+D FLD+GFTRPK+LILLPL SIAFR
Sbjct: 300  MHSLNHIFKTRDLVKKNESKIAKHRETSEEEILSDDGFLDQGFTRPKVLILLPLRSIAFR 359

Query: 1275 VVKRLIQLTPSKHKVNVENNDRFTDEYGTATDEID--------ETENSKSQKSSKPSDFQ 1430
            VVKRLIQLTP   +VNVE+ DRF DE+G   D  D        +  NS  QKSSKPSD+Q
Sbjct: 360  VVKRLIQLTPESQRVNVEHLDRFNDEFGCEEDTDDCDGEKTTSKNGNSIKQKSSKPSDWQ 419

Query: 1431 ALFGGNNN-DHFMIGIKFTRRSMKLYNDFYSSDMIVASPLGLITKIGEAEIDKEKDTDYL 1607
            ALFG NNN D FM+GIK TR+S++LY DFYSSD+IVASPL L   IG AE +KE+D DYL
Sbjct: 420  ALFGANNNDDEFMLGIKHTRKSIRLYGDFYSSDIIVASPLKLHMAIGAAEENKERDVDYL 479

Query: 1608 SSIEVLVIDHADVIAMQNWSHVSTVIDKLNRIPSKQHGTDIMRIRPWYLDGQAKFYRQSI 1787
            SSIEVLVIDHAD+I+MQNWS ++TV+D LNR+P+KQHGT++MRIRP YLDG A+FYRQSI
Sbjct: 480  SSIEVLVIDHADIISMQNWSFLATVVDYLNRLPTKQHGTNVMRIRPLYLDGHARFYRQSI 539

Query: 1788 ILGSHLNPDINAMFNRHCLNYRGKIKLDYDHKGVLPK 1898
            IL S+L P++N++F RHCLNY+GK+K+  ++KGVL K
Sbjct: 540  ILSSYLTPEMNSLFGRHCLNYKGKMKMACEYKGVLEK 576


>gb|AAL07149.1| unknown protein [Arabidopsis thaliana]
          Length = 754

 Score =  561 bits (1446), Expect = e-157
 Identities = 303/577 (52%), Positives = 379/577 (65%), Gaps = 20/577 (3%)
 Frame = +3

Query: 228  RRKGHENAYKNGPKKKVSSSRDEVADSPSPCTSGDSDVEHTDIVSDGESEEMVEVHKENT 407
            R + HE   K    KK       +  +PS  +  +S +E        ESE MV  ++E T
Sbjct: 11   RHRSHEKFDKKRDTKKHKHVEKTIVSNPSTDSPEESSIE-------AESEAMV--YREPT 61

Query: 408  MYDDLLKTLGSASKSLASANRRRQRDEEGKXXXXXXXXXXXXXXXXXXXXXLQMDYGGSK 587
             Y +LL +LGS++K +A  N+RRQR+EEGK                           G  
Sbjct: 62   QYQNLLVSLGSSNKVVADMNKRRQREEEGKSDTEEDEDDEDEDEEENSGSDDLSSTDGED 121

Query: 588  HQESSLVDVIEQANNDDTYVDNTFDSDDEC-----------DLNVSGHSIATAPAITSCS 734
             +             DDT  DN   S++E            +L+ +G S   A +  S S
Sbjct: 122  DKSQGDDQETLGGLTDDTQEDNDNQSEEEDPDDYETDEEVHELSTNGQSFVDASS--SIS 179

Query: 735  SFDEHFGYKLSKDEVNDLMRGKWKNKWKVPAVGMSKCKWIGTGERFLENDDINLAYDLKP 914
            +F EH  +KLS +EV  L +GKWK KW+ PA  M  CKW GT E FL+    +  Y LKP
Sbjct: 180  AFSEHLSHKLSSEEVETLPKGKWKFKWESPAFDMPNCKWKGTSENFLDGIQSDAPYGLKP 239

Query: 915  RLYNHWLDNYKANGGKDFHSSRPRSFFSLCNSYRDILHHNKKPFYLKGVEEDSSIMDAYI 1094
            +LYNHWL  YK  GGKD  SS+ R FFS+CNSY DILH NKKPFY  G +EDSS MDAY+
Sbjct: 240  KLYNHWLQLYKKCGGKDLDSSKRRKFFSICNSYLDILHSNKKPFYHCGSDEDSSAMDAYL 299

Query: 1095 MHSLNHIFKSGDLVNKNDAKVAKLQETTKNEIPSNDEFLDRGFTRPKILILLPLASIAFR 1274
            MHSLNHIFK+ DLV KN++K+AK +ET++ EI S+D FLD+GFTRPK+LILLPL SIAFR
Sbjct: 300  MHSLNHIFKTRDLVKKNESKIAKHRETSEEEILSDDGFLDQGFTRPKVLILLPLRSIAFR 359

Query: 1275 VVKRLIQLTPSKHKVNVENNDRFTDEYGTATDEID--------ETENSKSQKSSKPSDFQ 1430
            VVKRLIQLTP   +VNVE+ DRF DE+G   D  D        +  NS  QKSSKPSD+Q
Sbjct: 360  VVKRLIQLTPESQRVNVEHLDRFNDEFGCEEDTDDCDGEKTTSKNGNSIKQKSSKPSDWQ 419

Query: 1431 ALFGGNNN-DHFMIGIKFTRRSMKLYNDFYSSDMIVASPLGLITKIGEAEIDKEKDTDYL 1607
            ALFG NNN D FM+GIK TR+S++LY DFYSSD+IVASPL L   IG AE +KE+D DYL
Sbjct: 420  ALFGANNNDDEFMLGIKHTRKSIRLYGDFYSSDIIVASPLKLHMAIGAAEENKERDVDYL 479

Query: 1608 SSIEVLVIDHADVIAMQNWSHVSTVIDKLNRIPSKQHGTDIMRIRPWYLDGQAKFYRQSI 1787
            SSIEVLVIDHAD+I+MQNWS ++TV+D LNR+P+KQHGT++MRIRP YLDG A+FYRQSI
Sbjct: 480  SSIEVLVIDHADIISMQNWSFLATVVDYLNRLPTKQHGTNVMRIRPLYLDGHARFYRQSI 539

Query: 1788 ILGSHLNPDINAMFNRHCLNYRGKIKLDYDHKGVLPK 1898
            IL S+L P++N++F RHCLNY+GK+K+  ++KGVL K
Sbjct: 540  ILSSYLTPEMNSLFGRHCLNYKGKMKMACEYKGVLEK 576


>ref|XP_002892963.1| hypothetical protein ARALYDRAFT_471979 [Arabidopsis lyrata subsp.
            lyrata] gi|297338805|gb|EFH69222.1| hypothetical protein
            ARALYDRAFT_471979 [Arabidopsis lyrata subsp. lyrata]
          Length = 748

 Score =  559 bits (1440), Expect = e-156
 Identities = 297/569 (52%), Positives = 387/569 (68%), Gaps = 12/569 (2%)
 Frame = +3

Query: 228  RRKGHENAYKNGPKKKVSSSRDEVADSPSPCTSGDSDVEHTDIVSDGESEEMVEVHKENT 407
            R + HE   K    KK       V  +PS     D D    D +   E+E    +++E T
Sbjct: 11   RHRSHEKFDKKRDTKKHKHVEKAVVSNPS---IEDPDSPEEDSI---ETESEAMLYREPT 64

Query: 408  MYDDLLKTLGSASKSLASANRRRQRDEEGKXXXXXXXXXXXXXXXXXXXXXLQMDYGGSK 587
             Y  LL +LGS++K +A  N+RRQR+EEGK                     +  D    +
Sbjct: 65   QYQHLLASLGSSNKVVADMNKRRQREEEGKSDTEEDEDEEENSGSEDLSS-IDGDDDKIQ 123

Query: 588  HQESSLVDVIEQANNDDTYVD---NTFDSDDECDLNVSGHSIATAPAITSCSSFDEHFGY 758
              +   +  +   NND+   +   + +++D+E DL+ +G S   A +  S S+F EH  +
Sbjct: 124  GDDQETLRGLMMENNDNQSEEEDPDDYETDEEHDLSTNGQSFVDASS--SVSAFSEHLSH 181

Query: 759  KLSKDEVNDLMRGKWKNKWKVPAVGMSKCKWIGTGERFLENDDINLAYDLKPRLYNHWLD 938
            KLS +EVN L +GKWK KW+  A  M  C+W GT E FL+    +  Y LKP+LY HWL 
Sbjct: 182  KLSSEEVNTLPKGKWKFKWESLAFDMPNCRWKGTSENFLDGIQSDATYGLKPKLYKHWLQ 241

Query: 939  NYKANGGKDFHSSRPRSFFSLCNSYRDILHHNKKPFYLKGVEEDSSIMDAYIMHSLNHIF 1118
             YK +GGKD  SS+ R FFS+CN+Y DILH NKKPFY  G +EDSS MDAY+MHSLNHIF
Sbjct: 242  LYKKSGGKDLDSSKRRKFFSICNNYLDILHSNKKPFYHSGSDEDSSAMDAYLMHSLNHIF 301

Query: 1119 KSGDLVNKNDAKVAKLQETTKNEIPSNDEFLDRGFTRPKILILLPLASIAFRVVKRLIQL 1298
            K+ DLV KN++K+AK +E ++ EI S+D FLD+GFTRPK+LILLPL SIAFRVVKRLIQL
Sbjct: 302  KTRDLVKKNESKIAKHREISEEEILSDDGFLDQGFTRPKVLILLPLRSIAFRVVKRLIQL 361

Query: 1299 TPSKHKVNVENNDRFTDEYG--TATDEID------ETENSKSQKSSKPSDFQALFG-GNN 1451
            TP   +V+VE+ DRF DE+G   ATD+ D      +  NS +QKSSKPSD+Q+LFG  NN
Sbjct: 362  TPESQRVSVEHLDRFNDEFGCEEATDDGDVEKNTSKKGNSTTQKSSKPSDWQSLFGASNN 421

Query: 1452 NDHFMIGIKFTRRSMKLYNDFYSSDMIVASPLGLITKIGEAEIDKEKDTDYLSSIEVLVI 1631
            +D FM+GIK TR+S++LY DFYSSD+I+ASPL L    G+AE +KE+D DYLSSIEVLVI
Sbjct: 422  DDEFMLGIKHTRKSIRLYGDFYSSDIIIASPLKLQMTFGQAEENKERDVDYLSSIEVLVI 481

Query: 1632 DHADVIAMQNWSHVSTVIDKLNRIPSKQHGTDIMRIRPWYLDGQAKFYRQSIILGSHLNP 1811
            DHAD+I+MQNWS ++TV+D LNR+PSKQHGT++MRIRP YLDG A+FYRQSIIL S+L P
Sbjct: 482  DHADIISMQNWSFLATVVDHLNRLPSKQHGTNVMRIRPLYLDGHARFYRQSIILSSYLTP 541

Query: 1812 DINAMFNRHCLNYRGKIKLDYDHKGVLPK 1898
            ++N++FNRHCLNY+GK+KL  ++KGVL K
Sbjct: 542  EMNSLFNRHCLNYKGKMKLACEYKGVLEK 570

