BLASTX nr result

ID: Angelica23_contig00019586 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica23_contig00019586
         (1411 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN80011.1| hypothetical protein VITISV_017818 [Vitis vinifera]   241   4e-61
emb|CBI32226.3| unnamed protein product [Vitis vinifera]              201   3e-49
ref|XP_002531864.1| hypothetical protein RCOM_1439490 [Ricinus c...   171   6e-40
ref|NP_849582.1| uncharacterized protein [Arabidopsis thaliana] ...   116   1e-23
ref|XP_003601493.1| hypothetical protein MTR_3g082270 [Medicago ...   115   2e-23

>emb|CAN80011.1| hypothetical protein VITISV_017818 [Vitis vinifera]
          Length = 824

 Score =  241 bits (614), Expect = 4e-61
 Identities = 174/493 (35%), Positives = 258/493 (52%), Gaps = 73/493 (14%)
 Frame = +3

Query: 135  QSISEDEDCEGNVHGGWSISLGETTETKGRGQVLSQLEILRDARGHEHVIGLS--NTLQK 308
            +++SE+ED   N      ISLG  TE K     L Q+ + R   G + +  +   N +  
Sbjct: 50   RNVSEEEDWPQNC-----ISLGGATEKKDSRTFLVQVHMKRITEGKQQICHMRHVNPIDL 104

Query: 309  RDKLFCIQDEVD------------------DKRQ-------------------LTCLADR 377
                F ++ E+D                  D++Q                   L     +
Sbjct: 105  LFSCFVLKVELDFPWELNVIKRLPFDIHKFDQKQTGFSFEDDDEMSVFHNGGHLILSPRK 164

Query: 378  VSTCTSDEENISDEE---------VIKENNMLLVRS-------ISSGCGNFQKVNFPVSR 509
             S C S+EE + D+E         ++  N +LLV+        ++SG  NFQKV+     
Sbjct: 165  GSKCNSEEERVYDDEALSLLQCSVLVARNCVLLVKGNKFSTFPLNSGAKNFQKVDLHKFG 224

Query: 510  IARNDEECAWSAVNREVEELVHVNENAICSSSPTIFSKLSKSGKGVRGKAKPKFSFHLQS 689
             A+ D +  WSAV +E EELVH+NENA CSSS   +S+ +K  KG + K KPKFSF  QS
Sbjct: 225  SAKQDGD-TWSAVTKETEELVHLNENAGCSSSHVSYSRGNKFSKGGKTKXKPKFSFRFQS 283

Query: 690  HKNANGDTSMPLL---------------EASESIKPVTTEQSMSDLLDFFKGGRIEQSNI 824
            +K    D+  P +               E  E+I+  T E ++++ +D F G +++++ I
Sbjct: 284  NKE---DSFGPFITNEKSSRSSKVDQVPEGLEAIEHKTMEGAIAEFVDGFHGEKLKETEI 340

Query: 825  HAVQDDVSVGENCTKHSMAVLLDSFRENNALPQGDAKVISRTK-ERLQLVVKRNIHSLGD 1001
            HAVQ D +VG  C+KHS+A LL+  +E N L  G +K+  R K  R+QLV+K+NI +  D
Sbjct: 341  HAVQGDQTVGHGCSKHSVAELLNDLQEKNGLLGGKSKMCCRRKGRRVQLVIKKNISTSED 400

Query: 1002 RTFDADKSPDILXXXXXXXXXEKVFHQNLEHIIPESKRKKSMADQFQEAIGVGSARDEEI 1181
            RT   ++ PD           E    QN    + E++RK +MAD+F +A+G  S  DE  
Sbjct: 401  RTMQ-NEDPDEPMASGPSSDDEANM-QNTRLNVSEAERK-TMADRFHDALGAASVNDEAP 457

Query: 1182 SFALSTHMGTGLFGKLQHVIQNEKERDMNFMKQLQ--AQACMKGSYLDVRILSRWLEAKL 1355
             F +    GTGLFGKLQ V+Q+EKERDMNF+K+LQ  A    + S +DV+ILSR+ EAKL
Sbjct: 458  LFVVPKPSGTGLFGKLQRVMQSEKERDMNFLKKLQIGASPNYEASCIDVKILSRFFEAKL 517

Query: 1356 TVCSCTFIRHKED 1394
            TVC+C+ + ++E+
Sbjct: 518  TVCNCSLVENEEE 530


>emb|CBI32226.3| unnamed protein product [Vitis vinifera]
          Length = 336

 Score =  201 bits (512), Expect = 3e-49
 Identities = 132/332 (39%), Positives = 192/332 (57%), Gaps = 28/332 (8%)
 Frame = +3

Query: 423  VIKENNMLLVRS-------ISSGCGNFQKVNFPVSRIARNDEECAWSAVNREVEELVHVN 581
            ++  N +LLV+        ++SG  NFQKV+      A+ D +  WSAV +E EELVH+N
Sbjct: 12   LVARNCVLLVKGNKFSTFPLNSGAKNFQKVDLHKFGSAKQDGD-TWSAVTKETEELVHLN 70

Query: 582  ENAICSSSPTIFSKLSKSGKGVRGKAKPKFSFHLQSHKNANGDTSMPLL----------- 728
            ENA CSSS   +S+ +K  KG + K+KPKFSF  QS+K    D+  P +           
Sbjct: 71   ENAGCSSSHVSYSRGNKFSKGGKTKSKPKFSFRFQSNKE---DSFGPFITNEKSSRSSKV 127

Query: 729  ----EASESIKPVTTEQSMSDLLDFFKGGRIEQSNIHAVQDDVSVGENCTKHSMAVLLDS 896
                E  E+I+  T E ++++ +D F G +++++ IHAVQ D +VG  C+KHS+A LL+ 
Sbjct: 128  DQVPEGLEAIEHKTMEGAIAEFVDGFHGEKLKETEIHAVQGDQTVGHGCSKHSVAELLND 187

Query: 897  FRENNALPQGDAKVISRTK-ERLQLVVKRNIHSLGDRTFDADKSPDILXXXXXXXXXEKV 1073
             +E N L  G +K+  R K  R+QLV+K+NI +  DRT   ++ PD           E  
Sbjct: 188  LQEKNGLLGGKSKMCCRRKGRRVQLVIKKNISTSEDRTMQ-NEDPDEPMASGPSSDDEAN 246

Query: 1074 FHQNLEHIIPESKRKKSMADQFQEAIGVGSARDEEISFALSTHMGTGLFGKLQHVIQNEK 1253
              QN    + E++RK +MAD+F +A+G  S  DE   F +    GTGLFGKLQ V+Q+EK
Sbjct: 247  M-QNTRLNVSEAERK-TMADRFHDALGAASVNDEAPLFVVPKPSGTGLFGKLQRVMQSEK 304

Query: 1254 ERDMNFMKQLQAQAC-----MKGSYLDVRILS 1334
            ERDMNF+K+LQ  A      +K  Y  +++LS
Sbjct: 305  ERDMNFLKKLQIGASPNCKRLKMIYFFLKVLS 336


>ref|XP_002531864.1| hypothetical protein RCOM_1439490 [Ricinus communis]
            gi|223528472|gb|EEF30501.1| hypothetical protein
            RCOM_1439490 [Ricinus communis]
          Length = 501

 Score =  171 bits (432), Expect = 6e-40
 Identities = 134/436 (30%), Positives = 220/436 (50%), Gaps = 21/436 (4%)
 Frame = +3

Query: 147  EDEDCEGNVHGGWSISLGETTETKGRGQVLSQLEILRDARGHEHVIGLSNTLQKRDKLFC 326
            ++++  G++    S SL   TE +    + S+LEILR    ++     SN        F 
Sbjct: 11   DEDELNGDLREN-SFSLTTRTEKEQGLLLQSRLEILRGKDDNQ-----SN--------FF 56

Query: 327  IQDEVDD-----KRQLTCLADRVSTCTSDEENISDEEVIKENNMLLVRSISSGCGNFQK- 488
            ++D+V+      +        + STC  D+E ISD E   ++      + SSG    QK 
Sbjct: 57   VEDDVEMPDFPYEGGSILSRGKESTCHPDQEIISDSE---DDCGFSKHATSSGARKLQKD 113

Query: 489  VNFPVSRIARNDEECAWSAVNREVEELVHVNENAICSSSPTIFSKLSKSGKGVRGKAKPK 668
             +    R    D  C WS +N+E E L+H+N+   C SS + FSK +KS KGV GK KPK
Sbjct: 114  SSLNTYRNEIQDGACTWSMINKEAEALIHLNDG--CFSSASTFSKANKSYKGVTGKVKPK 171

Query: 669  FSFHLQSHK------------NANGDTSMPLLEASESIKPVTTEQSMSDLLDFFKGGRIE 812
            FS H + HK            N +  ++  + E  E     T + S  + L  + G   +
Sbjct: 172  FSLHFKLHKDGLPQHFNFMDENISSSSAHGVPEQFEPTDDGTADNSDLEFLKDYHGENDK 231

Query: 813  QSNIHAVQDDVSVGENCTKHSMAVLLDSFRENNALPQGDAKVISRTKE-RLQLVVKRNIH 989
            Q      + + +      KH+M+ LLD  ++ N +P+ ++K+  RTK+ R QL+VK+++ 
Sbjct: 232  QLKFLPTEME-AFANRLNKHTMSELLDGLQDRNVVPRRNSKMSGRTKDKRTQLIVKKSLL 290

Query: 990  SLGDRTFDADKSPDILXXXXXXXXXEKVFHQNLEHIIPESKRKKSMADQFQEAIGVGSAR 1169
             LG R  + ++ P+ L           + H NL ++   + +K++MADQFQEA+   S  
Sbjct: 291  QLGKRIINNEEQPE-LVVSGSSDDEASIQHINLANL---AMKKQTMADQFQEALAAASLS 346

Query: 1170 DEEISFALSTHMGTGLFGKLQHVIQNEKERDMNFMKQLQ--AQACMKGSYLDVRILSRWL 1343
            +E +    +      L GKLQ V+Q+EKE D +F++++Q       +   + V+ILSR+L
Sbjct: 347  NEGVHVTAAK-----LSGKLQQVMQSEKEMDADFLRRIQLGPSTSDESHSIVVKILSRYL 401

Query: 1344 EAKLTVCSCTFIRHKE 1391
            +AKL VC C+F +++E
Sbjct: 402  DAKLIVCRCSFGKNRE 417


>ref|NP_849582.1| uncharacterized protein [Arabidopsis thaliana]
            gi|26452912|dbj|BAC43534.1| unknown protein [Arabidopsis
            thaliana] gi|29824301|gb|AAP04111.1| unknown protein
            [Arabidopsis thaliana] gi|332189384|gb|AEE27505.1|
            uncharacterized protein [Arabidopsis thaliana]
          Length = 471

 Score =  116 bits (291), Expect = 1e-23
 Identities = 119/458 (25%), Positives = 206/458 (44%), Gaps = 23/458 (5%)
 Frame = +3

Query: 93   MDHHKSIPENYESDQSISEDEDCEGNVHGGWSISLGETTETKGRGQVL-SQLEILRDARG 269
            M    ++P N +SDQSISE+E+          IS  E  + + RG +L ++LE L     
Sbjct: 1    MWRRSNLPGNPDSDQSISEEEEEN-------DISPKENAKEQERGLLLQTKLEKLIGCGE 53

Query: 270  HEHV-----IGLSNTLQKRDKLFCIQDEVDDKRQLTCLADRVST---CTSDEENISDEEV 425
              +      + L  TL+K         EV D  + + ++    T   C S +EN SD+E 
Sbjct: 54   LNYARETGDVTLEATLRKISSSLEDVVEVPDSPEESYISSSRRTGLACISAQENGSDDE- 112

Query: 426  IKENNMLLVRSISSGCGNFQKVNFPVSRIARNDEECAWSAVNREVEELVHVNENAICSSS 605
                                        I  +++   WSA+++E + L+H+N  A  +SS
Sbjct: 113  ----------------------------ITHDEQVATWSAISKETKSLIHLNGVASVASS 144

Query: 606  PTIFSKLSKSGKGVRGKAKPKFSFHLQSHKNANGDTSMPLLEASESIKPVTTEQSMSD-- 779
                 +  KS  G++   +PKFSF+  +H    G+TS  + + +E  +P   +Q++ +  
Sbjct: 145  HLSGFRAKKSSNGLKDHGRPKFSFNSHTH----GETSSKISDMAEIFEPDVEDQAIEEDP 200

Query: 780  LLDFFKG--GRIEQSNIHAVQDDVSVGENCTKHSMAVLLDSFRENNALPQGDAKVISRTK 953
            +++       R E     +V +   V   CTK ++  L +       +P    ++I R+ 
Sbjct: 201  IIECPNSFDERSENRQGVSVAESREVLHECTKDAVPKLQE-------IPLDKIRLIKRSS 253

Query: 954  E---RLQLVVKRNIHSLGDRTFDADKSPDIL---XXXXXXXXXEKVFHQNLEHIIPESKR 1115
            E   R +   ++  H      F    + D L            E  +  ++ +I   +++
Sbjct: 254  ELDSRHEAKSRKFTHKGNSSNFQDSDTDDELPGPMDSGSSSDDEPSYQSSVPNI--SNQK 311

Query: 1116 KKSMADQFQEAIGVGSARDEEISF-ALSTHMGTGLFGKLQHVIQNEKERDMNFMKQLQA- 1289
            K+ + D+F EAI   S   E + F +     G+ L+GKLQ +++ EKE +M  M++LQ+ 
Sbjct: 312  KQFVGDRFDEAIKASSLSKEGLLFGSPKLSGGSSLYGKLQQIMKQEKETEMEIMRKLQSG 371

Query: 1290 --QACMKGSYLDVRILSRWLEAKLTVCSCTFIRHKEDS 1397
              +A   G Y+DV+++SR LE KL VC C+ I    DS
Sbjct: 372  IGEADSSG-YVDVKVMSRHLEGKLVVCKCSVIDLSGDS 408


>ref|XP_003601493.1| hypothetical protein MTR_3g082270 [Medicago truncatula]
            gi|355490541|gb|AES71744.1| hypothetical protein
            MTR_3g082270 [Medicago truncatula]
          Length = 475

 Score =  115 bits (289), Expect = 2e-23
 Identities = 120/439 (27%), Positives = 195/439 (44%), Gaps = 14/439 (3%)
 Frame = +3

Query: 120  NYESDQSISEDEDCEGNVHGGWSISLGETTETKGRGQVLSQLEILRDARGHEHVIGLSNT 299
            N  S+ SIS+D+D   N+    S+    T E      +  +L++L++ R   H    S  
Sbjct: 6    NPHSEDSISDDDDHYANL---LSLQNHATNEQDEELPLPVRLDLLKE-RFCNHNSSSSFQ 61

Query: 300  LQKRDKLFCIQD--EVDDKRQLTCLADRVSTCTSDEENISDEEVIKENNMLLVRSISSGC 473
                D+   + D  E D+  +L      V   + DE+ +S++E     + +L   +    
Sbjct: 62   KLPEDEEVEMPDFNEGDNHDEL----GEVVADSDDEDTVSEDE----GSTILSTRLLQRY 113

Query: 474  GNFQKVNFPVSRIARNDEECAWSAVNREVEELVHVNENAICSSSPTIFSKLSKSGKGVRG 653
            G         S+  RNDE        R+ E L+   E+A   SS   FSK + SG  +  
Sbjct: 114  G---------SQFLRNDEV---KDAKRKSEALLCFQESA---SSHATFSKANSSGTCIWN 158

Query: 654  KAKPKFSFHLQSHKNANGDTSMPLLEASESIKPVTTEQSMSDLLDFFKGGRIEQSNIHAV 833
            KAKP+ S    +HK  +   S+  ++    I      +  + L     G  +E  +I  +
Sbjct: 159  KAKPRISLSSVTHKYGHTGPSISNIDRLPEIMKAVDHRPSASL----DGNHLEDDDIVEI 214

Query: 834  QDDVSVGENCTKHS------MAVLLDSFRENNALPQGDAKVISRTKER----LQLVVKRN 983
              D    E   +        MA L D+ ++   L  G  +   R  +R    +QL  K +
Sbjct: 215  NLDTEPSETEARPHEFNLPLMADLFDNLQDKTDLVGGKFRNYPRDNQRKGKAVQLFQKSS 274

Query: 984  IHSLGDRTFDADKSPDILXXXXXXXXXEKVFHQNLEHIIPESKRKKSMADQFQEAIGVGS 1163
            I  L +   D++ SP+ +         E   H  +       K+ ++MAD+F  A+G  S
Sbjct: 275  ISHLPETVVDSEDSPEPVDSGSSSDNEETDQHMRITF---PGKKMQTMADRFHNALGTSS 331

Query: 1164 ARDEEISFALSTHMGTGLFGKLQHVIQNEKERDMNFMKQLQAQACMKGSY--LDVRILSR 1337
               E +    S  + TG+F KLQ  +Q EKERD++F K+LQA A   G +  +DV I+SR
Sbjct: 332  VITESVGAHNS--LRTGIFEKLQQAMQKEKERDIDFSKKLQAGAKPDGEFGCVDVNIISR 389

Query: 1338 WLEAKLTVCSCTFIRHKED 1394
            +L+ KL VC C+F ++ E+
Sbjct: 390  YLDGKLIVCHCSFSKYTEN 408


Top