BLASTX nr result

ID: Angelica22_contig00016977 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica22_contig00016977
         (1854 letters)

Database: ./nr 
           23,641,837 sequences; 8,123,359,852 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN62190.1| hypothetical protein VITISV_020113 [Vitis vinifera]   643   0.0  
ref|XP_002279030.1| PREDICTED: protein notum homolog isoform 1 [...   640   0.0  
ref|XP_002512590.1| pectin acetylesterase, putative [Ricinus com...   634   e-179
ref|XP_002329987.1| predicted protein [Populus trichocarpa] gi|2...   634   e-179
ref|XP_003555771.1| PREDICTED: uncharacterized protein LOC100793...   629   e-178

>emb|CAN62190.1| hypothetical protein VITISV_020113 [Vitis vinifera]
          Length = 423

 Score =  643 bits (1658), Expect = 0.0
 Identities = 304/425 (71%), Positives = 359/425 (84%), Gaps = 10/425 (2%)
 Frame = +3

Query: 165  MTKLLLWSVLVIGILES-FCNGYEYDEY---------LNKTEVLYKDFAFGGSAAAYSPL 314
            M K++   V ++G++ S +  G+E++E          LN TE+ + D ++G SAA+  P+
Sbjct: 1    MVKVVGVVVAIVGLVFSKWVYGFEFEENGSWVHGLDDLNVTELSFSD-SYGVSAAS-RPM 58

Query: 315  TVGLTIIQGAGSRGAVCLDGTLPGYHLHRGYGSGANSWLIQLEGGGWCDTVRSCVYRKKT 494
             VGLT+I  A ++GAVCLDGTLPGYHLHRGYGSGANSWLIQLEGGGWC+++R+CVYRKKT
Sbjct: 59   MVGLTLIHAAAAKGAVCLDGTLPGYHLHRGYGSGANSWLIQLEGGGWCNSIRTCVYRKKT 118

Query: 495  RRGSLNYMEKQLVFSGILSNKAEENPDFFNWNRVKLRYCDGASFAGDSEDKAARLQFRGQ 674
            RRGS  YMEKQ+ F+GILSN  EENPDFFNWNRVKLRYCDGASF GDS+++AA+L FRGQ
Sbjct: 119  RRGSSIYMEKQIPFTGILSNNPEENPDFFNWNRVKLRYCDGASFTGDSQNQAAQLNFRGQ 178

Query: 675  RIWWAAMEDLKAKGMRYANQALLSGCSAGGLASILKCDDFRGIFSRRTKVKCLSDGGLFL 854
            RIW AA+EDL +KGMRYANQALLSGCSAGGLA+IL CD+FRG F R TKVKCLSD GLFL
Sbjct: 179  RIWSAAIEDLMSKGMRYANQALLSGCSAGGLAAILHCDEFRGFFPRNTKVKCLSDAGLFL 238

Query: 855  NAVDVAGGRTLRNMFGGVVKLQGVGKNLPRSCTNHLDPTTCFFPENLIANIQTPIFLLNA 1034
            +++DV+GGRTLRN+F GVV LQGV +NLP  C N LDPT+CFFP+N+I+NI+TP+FLLNA
Sbjct: 239  DSIDVSGGRTLRNLFSGVVNLQGVQRNLPSFCLNRLDPTSCFFPQNVISNIKTPLFLLNA 298

Query: 1035 AYDSWQVTVSLAPPSADPHGYWTACKANYAHCSASQIQFLQRFRDHMVNVVQRFSRSPKN 1214
            AYDSWQV  SLAPPSADPHGYW  CK N+A CS SQIQFLQ FR+ M+N ++ FS S +N
Sbjct: 299  AYDSWQVQASLAPPSADPHGYWNECKKNHAQCSPSQIQFLQGFRNQMLNAIKGFSMSKQN 358

Query: 1215 GLFINSCFAHCQSERQDTWFSDNSPLIGNKGIALAVGDWYFDRAGCKAIDCPYPCDKTCH 1394
            GLFINSCFAHCQ+ERQDTWF+DNSP+I NKGIALAVGDWYFDR+G KAIDCPYPCDKTCH
Sbjct: 359  GLFINSCFAHCQTERQDTWFADNSPIIKNKGIALAVGDWYFDRSGIKAIDCPYPCDKTCH 418

Query: 1395 NLVFR 1409
            NLVFR
Sbjct: 419  NLVFR 423


>ref|XP_002279030.1| PREDICTED: protein notum homolog isoform 1 [Vitis vinifera]
          Length = 423

 Score =  640 bits (1651), Expect = 0.0
 Identities = 303/425 (71%), Positives = 358/425 (84%), Gaps = 10/425 (2%)
 Frame = +3

Query: 165  MTKLLLWSVLVIGILES-FCNGYEYDEY---------LNKTEVLYKDFAFGGSAAAYSPL 314
            M K++   V ++G++ S +  G+E++E          LN TE+ +   ++G SAA+  P+
Sbjct: 1    MVKVVGVVVAIVGLVFSKWVYGFEFEENGSWVHGLDDLNVTELSFS-VSYGVSAAS-RPM 58

Query: 315  TVGLTIIQGAGSRGAVCLDGTLPGYHLHRGYGSGANSWLIQLEGGGWCDTVRSCVYRKKT 494
             VGLT+I  A ++GAVCLDGTLPGYHLHRGYGSGANSWLIQLEGGGWC+++R+CVYRKKT
Sbjct: 59   MVGLTLIHAAAAKGAVCLDGTLPGYHLHRGYGSGANSWLIQLEGGGWCNSIRTCVYRKKT 118

Query: 495  RRGSLNYMEKQLVFSGILSNKAEENPDFFNWNRVKLRYCDGASFAGDSEDKAARLQFRGQ 674
            RRGS  YMEKQ+ F+GILSN  EENPDFFNWNRVKLRYCDGASF GDS+++AA+L FRGQ
Sbjct: 119  RRGSSIYMEKQIPFTGILSNNPEENPDFFNWNRVKLRYCDGASFTGDSQNQAAQLNFRGQ 178

Query: 675  RIWWAAMEDLKAKGMRYANQALLSGCSAGGLASILKCDDFRGIFSRRTKVKCLSDGGLFL 854
            RIW AA+EDL +KGMRYANQALLSGCSAGGLA+IL CD+FRG F R TKVKCLSD GLFL
Sbjct: 179  RIWSAAIEDLMSKGMRYANQALLSGCSAGGLAAILHCDEFRGFFPRNTKVKCLSDAGLFL 238

Query: 855  NAVDVAGGRTLRNMFGGVVKLQGVGKNLPRSCTNHLDPTTCFFPENLIANIQTPIFLLNA 1034
            +++DV+GGRTLRN+F GVV LQGV +NLP  C N LDPT+CFFP+N+I+NI+TP+FLLNA
Sbjct: 239  DSIDVSGGRTLRNLFSGVVNLQGVQRNLPSFCLNRLDPTSCFFPQNVISNIKTPLFLLNA 298

Query: 1035 AYDSWQVTVSLAPPSADPHGYWTACKANYAHCSASQIQFLQRFRDHMVNVVQRFSRSPKN 1214
            AYDSWQV  SLAPPSADPHGYW  CK N+A CS SQIQFLQ FR+ M+N ++ FS S +N
Sbjct: 299  AYDSWQVQASLAPPSADPHGYWNECKKNHAQCSPSQIQFLQGFRNQMLNAIKGFSMSKQN 358

Query: 1215 GLFINSCFAHCQSERQDTWFSDNSPLIGNKGIALAVGDWYFDRAGCKAIDCPYPCDKTCH 1394
            GLFINSCFAHCQ+ERQDTWF+DNSP+I NKGIALAVGDWYFDR+G KAIDCPYPCDKTCH
Sbjct: 359  GLFINSCFAHCQTERQDTWFADNSPIIKNKGIALAVGDWYFDRSGIKAIDCPYPCDKTCH 418

Query: 1395 NLVFR 1409
            NLVFR
Sbjct: 419  NLVFR 423


>ref|XP_002512590.1| pectin acetylesterase, putative [Ricinus communis]
            gi|223548551|gb|EEF50042.1| pectin acetylesterase,
            putative [Ricinus communis]
          Length = 418

 Score =  634 bits (1635), Expect = e-179
 Identities = 297/420 (70%), Positives = 348/420 (82%), Gaps = 5/420 (1%)
 Frame = +3

Query: 165  MTKLLLWSVLVIGILESFCNGYEYDEYLNKTEVLYKDFA-----FGGSAAAYSPLTVGLT 329
            M KLL    +++ ++  + NG E +E   + E+LY   A     F  S    + L VGLT
Sbjct: 1    MGKLLWVMFVMVSVIGKWANGLELNE--TEPEILYTGVASDEGYFNESLVFNNALMVGLT 58

Query: 330  IIQGAGSRGAVCLDGTLPGYHLHRGYGSGANSWLIQLEGGGWCDTVRSCVYRKKTRRGSL 509
            +I+ AG+RGAVCLDGTLPGYHLHRGYGSGANSWLIQLEGGGWC+ +R+CVYRKKTRRGS 
Sbjct: 59   LIRSAGARGAVCLDGTLPGYHLHRGYGSGANSWLIQLEGGGWCNNIRNCVYRKKTRRGSS 118

Query: 510  NYMEKQLVFSGILSNKAEENPDFFNWNRVKLRYCDGASFAGDSEDKAARLQFRGQRIWWA 689
             YMEKQL F+GILSNK +ENPDFFNWNRVKLRYCDGASF+GD+E+KAA+LQFRGQRIW A
Sbjct: 119  KYMEKQLAFTGILSNKPQENPDFFNWNRVKLRYCDGASFSGDNENKAAQLQFRGQRIWLA 178

Query: 690  AMEDLKAKGMRYANQALLSGCSAGGLASILKCDDFRGIFSRRTKVKCLSDGGLFLNAVDV 869
            AM+DL +KGMRYANQALLSGCSAGGLASIL CD+FR +F RRT+VKCLSD GLFL+AVDV
Sbjct: 179  AMQDLMSKGMRYANQALLSGCSAGGLASILHCDEFRNLFPRRTRVKCLSDAGLFLDAVDV 238

Query: 870  AGGRTLRNMFGGVVKLQGVGKNLPRSCTNHLDPTTCFFPENLIANIQTPIFLLNAAYDSW 1049
            +GGRTLRNM+ GVV LQGV  NLPR CTNHLDPT+CFFP+N+I N++TP+F+LNAAYDSW
Sbjct: 239  SGGRTLRNMYSGVVGLQGVQNNLPRICTNHLDPTSCFFPQNIIGNVKTPLFILNAAYDSW 298

Query: 1050 QVTVSLAPPSADPHGYWTACKANYAHCSASQIQFLQRFRDHMVNVVQRFSRSPKNGLFIN 1229
            Q+  SLAPPSADPHGYW  C+ N+A CSA QIQFLQ FR+ M+  ++ FS S +NGLFIN
Sbjct: 299  QIQSSLAPPSADPHGYWNECRKNHAKCSAPQIQFLQGFRNQMLRAIRGFSMSKQNGLFIN 358

Query: 1230 SCFAHCQSERQDTWFSDNSPLIGNKGIALAVGDWYFDRAGCKAIDCPYPCDKTCHNLVFR 1409
            SCFAHCQSERQDTWF+D+SP+IGNK +A+AVGDWYFDR+G K IDCPYPCD      VFR
Sbjct: 359  SCFAHCQSERQDTWFADDSPVIGNKAVAIAVGDWYFDRSGVKLIDCPYPCDTPATIWVFR 418


>ref|XP_002329987.1| predicted protein [Populus trichocarpa] gi|222871412|gb|EEF08543.1|
            predicted protein [Populus trichocarpa]
          Length = 393

 Score =  634 bits (1635), Expect = e-179
 Identities = 295/393 (75%), Positives = 343/393 (87%), Gaps = 5/393 (1%)
 Frame = +3

Query: 246  LNKTEVLYKDF--AFGGSAAAYSP---LTVGLTIIQGAGSRGAVCLDGTLPGYHLHRGYG 410
            +N+ E+ Y +   ++   + AY+P   L VGLT+I+ A ++GAVCLDGTLPGYH HRGYG
Sbjct: 1    MNEEEIFYTEANASYYIESKAYNPNNALLVGLTLIKSAAAKGAVCLDGTLPGYHWHRGYG 60

Query: 411  SGANSWLIQLEGGGWCDTVRSCVYRKKTRRGSLNYMEKQLVFSGILSNKAEENPDFFNWN 590
            SGANSWLIQLEGGGWC++VR+CVYRK TRRGS NYMEKQL F+GILSNKA ENPDFFNWN
Sbjct: 61   SGANSWLIQLEGGGWCNSVRACVYRKTTRRGSSNYMEKQLAFTGILSNKAVENPDFFNWN 120

Query: 591  RVKLRYCDGASFAGDSEDKAARLQFRGQRIWWAAMEDLKAKGMRYANQALLSGCSAGGLA 770
            RVKLRYCDGASF GDSE KAA+LQFRGQRIW AAMEDL +KGMRYANQALLSGCSAGGLA
Sbjct: 121  RVKLRYCDGASFTGDSEHKAAQLQFRGQRIWSAAMEDLMSKGMRYANQALLSGCSAGGLA 180

Query: 771  SILKCDDFRGIFSRRTKVKCLSDGGLFLNAVDVAGGRTLRNMFGGVVKLQGVGKNLPRSC 950
            SIL CD+FR  F R+T+VKCLSD GLFL+AVDV+GGRTLRN++GGVV LQGV  NLPR C
Sbjct: 181  SILHCDEFRNFFPRKTRVKCLSDAGLFLDAVDVSGGRTLRNLYGGVVGLQGVQNNLPRIC 240

Query: 951  TNHLDPTTCFFPENLIANIQTPIFLLNAAYDSWQVTVSLAPPSADPHGYWTACKANYAHC 1130
             NHLDPT+CFFP+N+I N++TP+F+LNAAYDSWQ+  SLAPPSADP GYW+ C+ +++ C
Sbjct: 241  INHLDPTSCFFPQNVIGNVKTPLFILNAAYDSWQIQSSLAPPSADPAGYWSNCRKDHSKC 300

Query: 1131 SASQIQFLQRFRDHMVNVVQRFSRSPKNGLFINSCFAHCQSERQDTWFSDNSPLIGNKGI 1310
            SASQIQFLQ FR+ M+N ++ FSRS +NGLFINSCFAHCQSERQDTWF+DNSP++GNK I
Sbjct: 301  SASQIQFLQGFRNQMLNAIKGFSRSRQNGLFINSCFAHCQSERQDTWFADNSPVLGNKPI 360

Query: 1311 ALAVGDWYFDRAGCKAIDCPYPCDKTCHNLVFR 1409
            ALAVGDWYFDR+G KAIDCPYPCD +CHNLVFR
Sbjct: 361  ALAVGDWYFDRSGEKAIDCPYPCDSSCHNLVFR 393


>ref|XP_003555771.1| PREDICTED: uncharacterized protein LOC100793403 [Glycine max]
          Length = 424

 Score =  629 bits (1623), Expect = e-178
 Identities = 294/419 (70%), Positives = 343/419 (81%), Gaps = 8/419 (1%)
 Frame = +3

Query: 177  LLWSVLVIGILESF------CNGYEYDEYLNKTEV--LYKDFAFGGSAAAYSPLTVGLTI 332
            L W  +V  ++ SF       N Y    + N+TE   L        S    +PL VGLT+
Sbjct: 6    LFWLGIVAALVFSFWVDAFSANQYYPHHHFNETEFSSLESQEQAHSSLLGRTPLMVGLTL 65

Query: 333  IQGAGSRGAVCLDGTLPGYHLHRGYGSGANSWLIQLEGGGWCDTVRSCVYRKKTRRGSLN 512
            IQ A ++GAVCLDGTLPGYHLHRGYGSGANSW++ LEGGGWC+ VRSCVYRKKTRRGS  
Sbjct: 66   IQSAAAKGAVCLDGTLPGYHLHRGYGSGANSWVVNLEGGGWCNDVRSCVYRKKTRRGSST 125

Query: 513  YMEKQLVFSGILSNKAEENPDFFNWNRVKLRYCDGASFAGDSEDKAARLQFRGQRIWWAA 692
            +MEKQ+ F+GILSN AE+NPDFFNWNRVK+RYCDGASFAGD EDK A+LQFRGQRIW AA
Sbjct: 126  FMEKQIPFTGILSNSAEDNPDFFNWNRVKIRYCDGASFAGDGEDKVAQLQFRGQRIWLAA 185

Query: 693  MEDLKAKGMRYANQALLSGCSAGGLASILKCDDFRGIFSRRTKVKCLSDGGLFLNAVDVA 872
            MEDLK+KGMR+A QALLSGCSAGGLA+I+ CD+FRG F   TKVKCLSD GLFL+A+DV+
Sbjct: 186  MEDLKSKGMRFAKQALLSGCSAGGLATIIHCDEFRGFFPETTKVKCLSDAGLFLDAIDVS 245

Query: 873  GGRTLRNMFGGVVKLQGVGKNLPRSCTNHLDPTTCFFPENLIANIQTPIFLLNAAYDSWQ 1052
             G T++N+F GVV+LQGV KNLP  CTNHLDPT+CFFP+NLIA I+TP+F+LN AYDSWQ
Sbjct: 246  RGHTIKNLFSGVVRLQGVQKNLPHFCTNHLDPTSCFFPQNLIAGIRTPLFILNTAYDSWQ 305

Query: 1053 VTVSLAPPSADPHGYWTACKANYAHCSASQIQFLQRFRDHMVNVVQRFSRSPKNGLFINS 1232
            V  SLAP SADPHG+W  C+ N+A C++SQIQ+LQ FR+ M+N ++ FSRSP+NGLFINS
Sbjct: 306  VQTSLAPSSADPHGFWHDCRLNHAKCTSSQIQYLQGFRNQMLNAIKGFSRSPQNGLFINS 365

Query: 1233 CFAHCQSERQDTWFSDNSPLIGNKGIALAVGDWYFDRAGCKAIDCPYPCDKTCHNLVFR 1409
            CFAHCQSERQDTWF+DNSP+IGNK IALAVGDWYFDRA  KAIDCPYPCD TCH+LVFR
Sbjct: 366  CFAHCQSERQDTWFADNSPVIGNKAIALAVGDWYFDRAVVKAIDCPYPCDNTCHHLVFR 424

