BLASTX nr result

ID: Angelica27_contig00010101

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00010101
         (1178 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_017230718.1 PREDICTED: membrane protein of ER body-like prote...   436   e-145
KZN12081.1 hypothetical protein DCAR_004737 [Daucus carota subsp...   422   e-140
KZN12082.1 hypothetical protein DCAR_004738 [Daucus carota subsp...   418   e-139
XP_017230719.1 PREDICTED: membrane protein of ER body-like prote...   328   e-104
XP_017243826.1 PREDICTED: membrane protein of ER body-like prote...   100   6e-20
XP_011089220.1 PREDICTED: membrane protein of ER body-like prote...    76   3e-11
CDP03901.1 unnamed protein product [Coffea canephora]                  74   2e-10

>XP_017230718.1 PREDICTED: membrane protein of ER body-like protein isoform X1
            [Daucus carota subsp. sativus]
          Length = 645

 Score =  436 bits (1122), Expect = e-145
 Identities = 231/352 (65%), Positives = 258/352 (73%), Gaps = 13/352 (3%)
 Frame = +3

Query: 162  MESTNGDAVVVGEILVQ-QRREVVVPKDPGLQGLDHQ---SSSDDTDVGAGMHIKSGTNG 329
            M+ST GD V VGE+L+Q QRREVVVP+DPGLQGLD     SSSDD+D G GM IK  +NG
Sbjct: 1    MDSTEGDMVAVGEMLIQPQRREVVVPRDPGLQGLDQDHRSSSSDDSDGGDGMDIKKASNG 60

Query: 330  NGHHVTKGEKENFGKSVYFDEDTDSTDEWSLSKTNNGAAESDVKQNISDANQGVSKGNEI 509
            NGHHV K E ENFGKSVYFDE +DS DE SL K  NG AESDVKQ ISD  QG S+ NEI
Sbjct: 61   NGHHVMKAEAENFGKSVYFDEGSDSADELSLHKEKNGVAESDVKQKISDPIQGYSEDNEI 120

Query: 510  SPEISHDQKIKSKIPSAQVKSKHMKLENXXXXXXXXXXXXXXXFERDIERRDTHTMHCPK 689
             PE SH+ +IK +IPSA+VKSKH+KLEN               FER+I+RRDTHTMHCP 
Sbjct: 121  LPETSHEPEIKPEIPSAKVKSKHVKLENLIEESDDDEDVIELEFEREIKRRDTHTMHCPN 180

Query: 690  CNATIXXXXXXXXXXXXXXSATHGTPQPEPVDLLGCLSCCSVFVPSGNCFSWIRLLANGG 869
            CNA I              SAT   PQPEPVDLLGCLSCCSVFVPSGNCFSWIRLLANGG
Sbjct: 181  CNAVITKVVLRKKRTKKGLSAT--PPQPEPVDLLGCLSCCSVFVPSGNCFSWIRLLANGG 238

Query: 870  EESEDIPQQTNVPAAGLTGENENMVIDKEGDCFSLFRIFGNKPEKKSTQKPLQESSYDSG 1049
            E+SE +PQQTN+  +G TG + NMVIDKEGDCFSLFRIFGNKPEKKSTQKP ++SSYDS 
Sbjct: 239  EDSE-VPQQTNILQSGSTGNDGNMVIDKEGDCFSLFRIFGNKPEKKSTQKPPEQSSYDSA 297

Query: 1050 HAVGDTMINQEDGHAQDGDRDLEGTPNV---------QVDQNDVLPVQNGSS 1178
             AVG+ + NQ DGHAQD D++LEG PNV          V+QN V PV NGSS
Sbjct: 298  LAVGNPITNQGDGHAQDKDQNLEGIPNVVQPVPNGSAMVNQNVVRPVPNGSS 349


>KZN12081.1 hypothetical protein DCAR_004737 [Daucus carota subsp. sativus]
          Length = 671

 Score =  422 bits (1085), Expect = e-140
 Identities = 231/378 (61%), Positives = 258/378 (68%), Gaps = 39/378 (10%)
 Frame = +3

Query: 162  MESTNGDAVVVGEILVQ-QRREVVVPKDPGLQGLDHQ---SSSDDTDVGAGMHIKSGTNG 329
            M+ST GD V VGE+L+Q QRREVVVP+DPGLQGLD     SSSDD+D G GM IK  +NG
Sbjct: 1    MDSTEGDMVAVGEMLIQPQRREVVVPRDPGLQGLDQDHRSSSSDDSDGGDGMDIKKASNG 60

Query: 330  NGHHVTKGEKENFGKSVYFDEDT--------------------------DSTDEWSLSKT 431
            NGHHV K E ENFGKSVYFDE +                          DS DE SL K 
Sbjct: 61   NGHHVMKAEAENFGKSVYFDEGSGFWKCRHCVWTYGMGNRRRFNLSDFRDSADELSLHKE 120

Query: 432  NNGAAESDVKQNISDANQGVSKGNEISPEISHDQKIKSKIPSAQVKSKHMKLENXXXXXX 611
             NG AESDVKQ ISD  QG S+ NEI PE SH+ +IK +IPSA+VKSKH+KLEN      
Sbjct: 121  KNGVAESDVKQKISDPIQGYSEDNEILPETSHEPEIKPEIPSAKVKSKHVKLENLIEESD 180

Query: 612  XXXXXXXXXFERDIERRDTHTMHCPKCNATIXXXXXXXXXXXXXXSATHGTPQPEPVDLL 791
                     FER+I+RRDTHTMHCP CNA I              SAT   PQPEPVDLL
Sbjct: 181  DDEDVIELEFEREIKRRDTHTMHCPNCNAVITKVVLRKKRTKKGLSAT--PPQPEPVDLL 238

Query: 792  GCLSCCSVFVPSGNCFSWIRLLANGGEESEDIPQQTNVPAAGLTGENENMVIDKEGDCFS 971
            GCLSCCSVFVPSGNCFSWIRLLANGGE+SE +PQQTN+  +G TG + NMVIDKEGDCFS
Sbjct: 239  GCLSCCSVFVPSGNCFSWIRLLANGGEDSE-VPQQTNILQSGSTGNDGNMVIDKEGDCFS 297

Query: 972  LFRIFGNKPEKKSTQKPLQESSYDSGHAVGDTMINQEDGHAQDGDRDLEGTPNV------ 1133
            LFRIFGNKPEKKSTQKP ++SSYDS  AVG+ + NQ DGHAQD D++LEG PNV      
Sbjct: 298  LFRIFGNKPEKKSTQKPPEQSSYDSALAVGNPITNQGDGHAQDKDQNLEGIPNVVQPVPN 357

Query: 1134 ---QVDQNDVLPVQNGSS 1178
                V+QN V PV NGSS
Sbjct: 358  GSAMVNQNVVRPVPNGSS 375


>KZN12082.1 hypothetical protein DCAR_004738 [Daucus carota subsp. sativus]
          Length = 601

 Score =  418 bits (1075), Expect = e-139
 Identities = 225/371 (60%), Positives = 255/371 (68%), Gaps = 34/371 (9%)
 Frame = +3

Query: 162  MESTNGDAVVVGEILVQ-QRREVVVPKDPGLQGLDHQSSSDDTDVGAGMHIKSGTNGNGH 338
            ME ++GD V VGEIL+Q QR EVVVPKD GLQGLDHQS+SDD  VGAGM IK    GN H
Sbjct: 1    MEFSDGDMVAVGEILIQPQRPEVVVPKDQGLQGLDHQSTSDDNVVGAGMDIKKTNKGNAH 60

Query: 339  HVTKGEKENFGKSVYFDEDTDSTDEWSLSKTNNGAAESDVKQNISDANQGVSKGNEISPE 518
            H TK E ENFGKSVYFDE +DS DEWSL+KTNNGAAESDVKQ       GVS+ +EIS E
Sbjct: 61   HFTKAEAENFGKSVYFDEGSDSNDEWSLTKTNNGAAESDVKQ-------GVSREDEISSE 113

Query: 519  ISHDQKIKSKIPSAQVKSKHMKLENXXXXXXXXXXXXXXXFERDIERRDTHTMHCPKCNA 698
            ISH+Q+IK +IPSA+VKSKH+KLE+               FER+I+R DTHTMHCP CNA
Sbjct: 114  ISHEQEIKPEIPSAKVKSKHVKLEDLIEESDNDEDVIELEFEREIKRLDTHTMHCPNCNA 173

Query: 699  TIXXXXXXXXXXXXXXSATHGTPQPEPVDLLGCLSCCSVFVPSGNCFSWIRLLANGGEES 878
             I              SA    PQPEPVDLLGCLSCCSVFVPSGNCFSWIRLL NGGE+S
Sbjct: 174  VITKVVLRKRRTKKGLSAA--PPQPEPVDLLGCLSCCSVFVPSGNCFSWIRLLENGGEDS 231

Query: 879  EDIPQQTNVPAAGLTGENENMVIDKEGDCFSLFRIFGNKPEKKSTQKPLQESSYDSGHAV 1058
            E+IPQQTN+ A+G TG + +MVIDKEGDCFSLF+IFGNKPEKKS QKPL++SS  S  AV
Sbjct: 232  EEIPQQTNLLASGFTGNDGDMVIDKEGDCFSLFQIFGNKPEKKSAQKPLEQSSDVSAQAV 291

Query: 1059 GDTMINQEDGHAQDGDRDLEGT---------------------------------PNVQV 1139
            G+T+ NQE  HAQ GD++L+ T                                 PNVQV
Sbjct: 292  GNTITNQEYAHAQGGDQNLDRTPNVALPDQIGSAYPQPPLSTSPVDVIIDLGEHVPNVQV 351

Query: 1140 DQNDVLPVQNG 1172
            DQN  LP QNG
Sbjct: 352  DQNVALPDQNG 362


>XP_017230719.1 PREDICTED: membrane protein of ER body-like protein isoform X2
            [Daucus carota subsp. sativus]
          Length = 575

 Score =  328 bits (840), Expect = e-104
 Identities = 173/269 (64%), Positives = 193/269 (71%), Gaps = 9/269 (3%)
 Frame = +3

Query: 399  DSTDEWSLSKTNNGAAESDVKQNISDANQGVSKGNEISPEISHDQKIKSKIPSAQVKSKH 578
            DS DE SL K  NG AESDVKQ ISD  QG S+ NEI PE SH+ +IK +IPSA+VKSKH
Sbjct: 14   DSADELSLHKEKNGVAESDVKQKISDPIQGYSEDNEILPETSHEPEIKPEIPSAKVKSKH 73

Query: 579  MKLENXXXXXXXXXXXXXXXFERDIERRDTHTMHCPKCNATIXXXXXXXXXXXXXXSATH 758
            +KLEN               FER+I+RRDTHTMHCP CNA I              SAT 
Sbjct: 74   VKLENLIEESDDDEDVIELEFEREIKRRDTHTMHCPNCNAVITKVVLRKKRTKKGLSAT- 132

Query: 759  GTPQPEPVDLLGCLSCCSVFVPSGNCFSWIRLLANGGEESEDIPQQTNVPAAGLTGENEN 938
              PQPEPVDLLGCLSCCSVFVPSGNCFSWIRLLANGGE+SE +PQQTN+  +G TG + N
Sbjct: 133  -PPQPEPVDLLGCLSCCSVFVPSGNCFSWIRLLANGGEDSE-VPQQTNILQSGSTGNDGN 190

Query: 939  MVIDKEGDCFSLFRIFGNKPEKKSTQKPLQESSYDSGHAVGDTMINQEDGHAQDGDRDLE 1118
            MVIDKEGDCFSLFRIFGNKPEKKSTQKP ++SSYDS  AVG+ + NQ DGHAQD D++LE
Sbjct: 191  MVIDKEGDCFSLFRIFGNKPEKKSTQKPPEQSSYDSALAVGNPITNQGDGHAQDKDQNLE 250

Query: 1119 GTPNV---------QVDQNDVLPVQNGSS 1178
            G PNV          V+QN V PV NGSS
Sbjct: 251  GIPNVVQPVPNGSAMVNQNVVRPVPNGSS 279


>XP_017243826.1 PREDICTED: membrane protein of ER body-like protein [Daucus carota
            subsp. sativus]
          Length = 350

 Score =  100 bits (248), Expect = 6e-20
 Identities = 57/111 (51%), Positives = 65/111 (58%), Gaps = 33/111 (29%)
 Frame = +3

Query: 939  MVIDKEGDCFSLFRIFGNKPEKKSTQKPLQESSYDSGHAVGDTMINQEDGHAQDGDRDLE 1118
            MVIDKEGDCFSLF+IFGNKPEKKS QKPL++SS  S  AVG+T+ NQE  HAQ GD++L+
Sbjct: 1    MVIDKEGDCFSLFQIFGNKPEKKSAQKPLEQSSDVSAQAVGNTITNQEYAHAQGGDQNLD 60

Query: 1119 GT---------------------------------PNVQVDQNDVLPVQNG 1172
             T                                 PNVQVDQN  LP QNG
Sbjct: 61   RTPNVALPDQIGSAYPQPPLSTSPVDVIIDLGEHVPNVQVDQNVALPDQNG 111


>XP_011089220.1 PREDICTED: membrane protein of ER body-like protein [Sesamum indicum]
          Length = 667

 Score = 75.9 bits (185), Expect = 3e-11
 Identities = 71/262 (27%), Positives = 112/262 (42%), Gaps = 27/262 (10%)
 Frame = +3

Query: 384  FDEDTDS-TDEWSLSKTNNGAAESDV---------KQNISDANQGVSKGNEISPEISHDQ 533
            F+ DT   T + SLS  N G  + D+          QN  D    +  G  +   +S  +
Sbjct: 85   FNHDTRGPTLQLSLSGENLGLEDLDIPKLINTVVNNQNPGDLKDKIVSGLSVPKLVSTSR 144

Query: 534  KIKSKIPSA--QVKSKHMKLENXXXXXXXXXXXXXXXFERDIERRDTHTMHCPKCNATIX 707
            + +  I ++  +VK K +K++                FER +E+  THT +CP C++ I 
Sbjct: 145  EEEENIGTSTGRVKEK-VKID---IEEESDEEVIELEFERAVEKVHTHTAYCPNCSSQIT 200

Query: 708  XXXXXXXXXXXXXSAT---HGTPQPEPVDLLGCLSCCSVFVPSGNCFSWIRLLANGGEES 878
                          +T    G  + +PVDLLGCL+C S+F+PSGN  +  R+   GG+  
Sbjct: 201  KVVLRRKIRGSRTRSTGDGDGHRRDKPVDLLGCLACFSIFIPSGNGLNPFRIF--GGKGK 258

Query: 879  EDIPQ----------QTNVPAAGLTGENENMVIDKEGDCFSLFRIFGNKPEKKSTQ--KP 1022
             + PQ            +  A G T    +  + + G  F LF +F  + EK S Q  K 
Sbjct: 259  HESPQTVQEQPASGASLSAVAGGSTNAGGSTTVTQGG--FDLFWMFRKRGEKDSAQESKG 316

Query: 1023 LQESSYDSGHAVGDTMINQEDG 1088
             ++  +D G    +   N E G
Sbjct: 317  KEQIKHDGGSKNEEEHFNDESG 338


>CDP03901.1 unnamed protein product [Coffea canephora]
          Length = 664

 Score = 73.6 bits (179), Expect = 2e-10
 Identities = 82/317 (25%), Positives = 123/317 (38%), Gaps = 17/317 (5%)
 Frame = +3

Query: 189  VVGEILVQQRREVVVPKDPGLQG--------LDHQSSSDDTDVGAGMHIKS-GTNGNGHH 341
            VV E L+ +RR   +P      G         +  ++ D  D    M + S   +G+   
Sbjct: 25   VVEEALLLRRRSANIPTVAANSGGNATVFPAAEEVAAVDLLDTTPKMEVASVSVDGDQKE 84

Query: 342  VTKGEKENFGKSVYFDEDTDSTDEWSLSKTNNGAAESDVKQNISDANQGVSKGNEISPEI 521
            V K  +E     VYFD++  S  EWS    N    E   K    D NQ +S        +
Sbjct: 85   VEKTSEE----IVYFDKNEGSGPEWS--HFNVKVEELGAKLKDVDINQVLSA-------V 131

Query: 522  SHDQKIK---SKIPSAQVKSKHMKLENXXXXXXXXXXXXXXXFERDIERRDTH---TMHC 683
             H++++K   S  P  Q ++      +               FER +ER  TH   +M+C
Sbjct: 132  IHNKEVKLPTSTNPQLQEETDERVEVDDVLEETSDEEEIELEFERAVERIHTHDDYSMYC 191

Query: 684  PKCNATIXXXXXXXXXXXXXXSATHGTPQPEPVDLLGCLSCCSVFVPSGNCFSWIRLL-- 857
            P C++ I                   T +    DL GCL+C SVF+PSGN  +  R+   
Sbjct: 192  PNCSSRITKVVLRRKIRQRRVQTPKETQRD---DLFGCLACFSVFIPSGNRLNPFRIFGA 248

Query: 858  ANGGEESEDIPQQTNVPAAGLTGENENMVIDKEGDCFSLFRIFGNKPEKKSTQKPLQESS 1037
              G E S  + Q+   P   +     ++    +G  F LF IFG +   +     L  SS
Sbjct: 249  GKGPESSPLLQQEQQTPDVSMP---TSVTAQDKGKGFDLFWIFGKRRPHEKLSADLDSSS 305

Query: 1038 YDSGHAVGDTMINQEDG 1088
                +   D   +Q DG
Sbjct: 306  PKVTNEPRDQANDQGDG 322

