BLASTX nr result

ID: Angelica27_contig00021024 seq

BLASTX 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Angelica27_contig00021024
         (775 letters)

Database: ./nr 
           115,041,592 sequences; 42,171,959,267 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

XP_017240581.1 PREDICTED: uncharacterized protein LOC108213307 [...   225   6e-72
KZN10435.1 hypothetical protein DCAR_003091 [Daucus carota subsp...   175   9e-53
OMO90838.1 hypothetical protein COLO4_18845 [Corchorus olitorius]     156   7e-45
XP_018812873.1 PREDICTED: uncharacterized protein LOC108985144 i...   155   1e-44
XP_019416759.1 PREDICTED: uncharacterized protein LOC109327996 [...   155   2e-44
XP_004136477.1 PREDICTED: uncharacterized protein LOC101218754 [...   154   3e-44
XP_012076583.1 PREDICTED: uncharacterized protein LOC105637647 [...   154   6e-44
XP_010250102.1 PREDICTED: uncharacterized protein LOC104592429 [...   154   6e-44
XP_017646197.1 PREDICTED: uncharacterized protein LOC108486585 i...   154   7e-44
XP_016754937.1 PREDICTED: uncharacterized protein LOC107962897 i...   154   7e-44
XP_016754936.1 PREDICTED: uncharacterized protein LOC107962897 i...   154   7e-44
XP_017646196.1 PREDICTED: uncharacterized protein LOC108486585 i...   154   7e-44
OMO61673.1 hypothetical protein CCACVL1_23322, partial [Corchoru...   152   7e-44
CBI21032.3 unnamed protein product, partial [Vitis vinifera]          153   8e-44
EEF46973.1 conserved hypothetical protein [Ricinus communis]          153   9e-44
XP_002515524.2 PREDICTED: uncharacterized protein LOC8259462 [Ri...   153   1e-43
KDP33602.1 hypothetical protein JCGZ_07173 [Jatropha curcas]          154   1e-43
XP_007011875.1 PREDICTED: uncharacterized protein LOC18587804 [T...   153   1e-43
XP_003631795.1 PREDICTED: uncharacterized protein LOC100854903 [...   153   2e-43
EOY29496.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein i...   153   2e-43

>XP_017240581.1 PREDICTED: uncharacterized protein LOC108213307 [Daucus carota
           subsp. sativus]
          Length = 129

 Score =  225 bits (573), Expect = 6e-72
 Identities = 104/128 (81%), Positives = 111/128 (86%), Gaps = 6/128 (4%)
 Frame = -3

Query: 662 MAALTTPIYVHLNTIVAHRPG------FCSTGCTNQKLRGTVKFDCVKAKATNSDQNTKK 501
           MAA TT +YVH++ IVAHRPG      FC     NQKLRGTVKF  VKAKATNSDQ+TKK
Sbjct: 1   MAASTTQLYVHIDAIVAHRPGLSTSSCFCRANFINQKLRGTVKFSSVKAKATNSDQDTKK 60

Query: 500 NSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKVGESCWLCGGKKDMLCGNCNGAGFI 321
           NSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFK GESCWLCGGKKDMLCGNC+GAGF+
Sbjct: 61  NSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKAGESCWLCGGKKDMLCGNCSGAGFV 120

Query: 320 GGFMNSFD 297
           GGFMN+FD
Sbjct: 121 GGFMNTFD 128


>KZN10435.1 hypothetical protein DCAR_003091 [Daucus carota subsp. sativus]
          Length = 104

 Score =  175 bits (444), Expect = 9e-53
 Identities = 83/104 (79%), Positives = 87/104 (83%), Gaps = 6/104 (5%)
 Frame = -3

Query: 662 MAALTTPIYVHLNTIVAHRPG------FCSTGCTNQKLRGTVKFDCVKAKATNSDQNTKK 501
           MAA TT +YVH++ IVAHRPG      FC     NQKLRGTVKF  VKAKATNSDQ+TKK
Sbjct: 1   MAASTTQLYVHIDAIVAHRPGLSTSSCFCRANFINQKLRGTVKFSSVKAKATNSDQDTKK 60

Query: 500 NSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKVGESCWLCG 369
           NSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFK GESCWLCG
Sbjct: 61  NSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKAGESCWLCG 104


>OMO90838.1 hypothetical protein COLO4_18845 [Corchorus olitorius]
          Length = 139

 Score =  156 bits (395), Expect = 7e-45
 Identities = 70/86 (81%), Positives = 74/86 (86%)
 Frame = -3

Query: 554 KFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKVGESCWL 375
           KF  VKAKA  S +NTK NSVICGDCDGNGAV CSQCKG GVNSVD FNGQFK G+SCWL
Sbjct: 53  KFHSVKAKAVPSSRNTKPNSVICGDCDGNGAVTCSQCKGSGVNSVDFFNGQFKAGDSCWL 112

Query: 374 CGGKKDMLCGNCNGAGFIGGFMNSFD 297
           CGGKK MLCGNCNGAGFIGGFM++ D
Sbjct: 113 CGGKKQMLCGNCNGAGFIGGFMSTDD 138


>XP_018812873.1 PREDICTED: uncharacterized protein LOC108985144 isoform X1 [Juglans
           regia]
          Length = 130

 Score =  155 bits (393), Expect = 1e-44
 Identities = 67/86 (77%), Positives = 76/86 (88%)
 Frame = -3

Query: 554 KFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKVGESCWL 375
           KF  +KAKA  SDQ+ K NSVIC DCDGNGAVLCSQCKG GVNSVD FNGQFK G+SCWL
Sbjct: 44  KFGNIKAKAEKSDQSPKPNSVICADCDGNGAVLCSQCKGSGVNSVDLFNGQFKAGDSCWL 103

Query: 374 CGGKKDMLCGNCNGAGFIGGFMNSFD 297
           CGG+K+MLCGNCNGAGF+GGF+++FD
Sbjct: 104 CGGRKEMLCGNCNGAGFVGGFLSTFD 129


>XP_019416759.1 PREDICTED: uncharacterized protein LOC109327996 [Lupinus
           angustifolius] OIV96487.1 hypothetical protein
           TanjilG_07879 [Lupinus angustifolius]
          Length = 136

 Score =  155 bits (392), Expect = 2e-44
 Identities = 70/105 (66%), Positives = 84/105 (80%), Gaps = 10/105 (9%)
 Frame = -3

Query: 581 TNQK-LRGT---------VKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDG 432
           TNQK +R T          K+ C+ AKA + ++NTK NSVIC DCDGNGAVLCSQCKG G
Sbjct: 31  TNQKHIRATSVFQVPSPIAKYHCIIAKAASGNRNTKPNSVICADCDGNGAVLCSQCKGSG 90

Query: 431 VNSVDHFNGQFKVGESCWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           VNSVD FNGQFK G+SCWLCGG+K+MLCGNCNGAGF+GGF++++D
Sbjct: 91  VNSVDLFNGQFKAGDSCWLCGGRKEMLCGNCNGAGFVGGFLSTYD 135


>XP_004136477.1 PREDICTED: uncharacterized protein LOC101218754 [Cucumis sativus]
           KGN60105.1 hypothetical protein Csa_3G878740 [Cucumis
           sativus]
          Length = 132

 Score =  154 bits (390), Expect = 3e-44
 Identities = 69/106 (65%), Positives = 82/106 (77%), Gaps = 10/106 (9%)
 Frame = -3

Query: 584 CTNQKLR----------GTVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGD 435
           C+NQKL              +F  +  KA  +D+NTK NSVICGDCDGNGAV+CSQCKG 
Sbjct: 26  CSNQKLNLIHNGFHDYSSPARFPHLILKAAKNDRNTKPNSVICGDCDGNGAVVCSQCKGK 85

Query: 434 GVNSVDHFNGQFKVGESCWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           GVN+VD FNGQFK GESCWLCGG+K+MLCGNCNGAGFIGGF++++D
Sbjct: 86  GVNAVDFFNGQFKAGESCWLCGGRKEMLCGNCNGAGFIGGFLSTYD 131


>XP_012076583.1 PREDICTED: uncharacterized protein LOC105637647 [Jatropha curcas]
           XP_012076584.1 PREDICTED: uncharacterized protein
           LOC105637647 [Jatropha curcas]
          Length = 131

 Score =  154 bits (388), Expect = 6e-44
 Identities = 72/111 (64%), Positives = 81/111 (72%), Gaps = 9/111 (8%)
 Frame = -3

Query: 602 GFCSTGCTNQKLRG---------TVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCS 450
           G     C  Q++ G         T +   VK +A+ S+QN K  SVIC DCDGNGAVLCS
Sbjct: 20  GAYKASCGKQEILGIHDFLHSSSTARRHSVKTRASPSNQNPKPKSVICADCDGNGAVLCS 79

Query: 449 QCKGDGVNSVDHFNGQFKVGESCWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           QCKG GVNSVD FNGQFK G+SCWLCGGKKDMLCGNCNGAGFIGGFM++FD
Sbjct: 80  QCKGSGVNSVDVFNGQFKAGDSCWLCGGKKDMLCGNCNGAGFIGGFMSTFD 130


>XP_010250102.1 PREDICTED: uncharacterized protein LOC104592429 [Nelumbo nucifera]
          Length = 131

 Score =  154 bits (388), Expect = 6e-44
 Identities = 64/88 (72%), Positives = 77/88 (87%)
 Frame = -3

Query: 560 TVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKVGESC 381
           T +F  ++AKA +S+++TK NS++C DCDGNGAVLCSQCKG GVNSVDHFNGQFK G  C
Sbjct: 43  TARFQSIEAKAADSNESTKSNSLLCADCDGNGAVLCSQCKGTGVNSVDHFNGQFKAGGLC 102

Query: 380 WLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           WLC GK+D+LCGNCNGAGF+GGFM+SFD
Sbjct: 103 WLCRGKRDILCGNCNGAGFVGGFMSSFD 130


>XP_017646197.1 PREDICTED: uncharacterized protein LOC108486585 isoform X2
           [Gossypium arboreum]
          Length = 133

 Score =  154 bits (388), Expect = 7e-44
 Identities = 69/96 (71%), Positives = 79/96 (82%)
 Frame = -3

Query: 584 CTNQKLRGTVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNG 405
           C+  KL+  +K     AKA  S++NTK NSVICGDCDGNGAV+CSQCKG GVN VD FNG
Sbjct: 42  CSATKLQSVIK-----AKAAPSNRNTKPNSVICGDCDGNGAVVCSQCKGSGVNPVDFFNG 96

Query: 404 QFKVGESCWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           QFK G+SCWLCGG+K+MLCGNCNGAGFIGGFMN+ D
Sbjct: 97  QFKAGDSCWLCGGRKEMLCGNCNGAGFIGGFMNTDD 132


>XP_016754937.1 PREDICTED: uncharacterized protein LOC107962897 isoform X2
           [Gossypium hirsutum]
          Length = 133

 Score =  154 bits (388), Expect = 7e-44
 Identities = 69/96 (71%), Positives = 79/96 (82%)
 Frame = -3

Query: 584 CTNQKLRGTVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNG 405
           C+  KL+  +K     AKA  S++NTK NSVICGDCDGNGAV+CSQCKG GVN VD FNG
Sbjct: 42  CSATKLQSVIK-----AKAAPSNRNTKPNSVICGDCDGNGAVVCSQCKGSGVNPVDFFNG 96

Query: 404 QFKVGESCWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           QFK G+SCWLCGG+K+MLCGNCNGAGFIGGFMN+ D
Sbjct: 97  QFKAGDSCWLCGGRKEMLCGNCNGAGFIGGFMNTDD 132


>XP_016754936.1 PREDICTED: uncharacterized protein LOC107962897 isoform X1
           [Gossypium hirsutum]
          Length = 134

 Score =  154 bits (388), Expect = 7e-44
 Identities = 69/96 (71%), Positives = 79/96 (82%)
 Frame = -3

Query: 584 CTNQKLRGTVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNG 405
           C+  KL+  +K     AKA  S++NTK NSVICGDCDGNGAV+CSQCKG GVN VD FNG
Sbjct: 43  CSATKLQSVIK-----AKAAPSNRNTKPNSVICGDCDGNGAVVCSQCKGSGVNPVDFFNG 97

Query: 404 QFKVGESCWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           QFK G+SCWLCGG+K+MLCGNCNGAGFIGGFMN+ D
Sbjct: 98  QFKAGDSCWLCGGRKEMLCGNCNGAGFIGGFMNTDD 133


>XP_017646196.1 PREDICTED: uncharacterized protein LOC108486585 isoform X1
           [Gossypium arboreum] KHG04360.1 Zinc finger protein
           [Gossypium arboreum]
          Length = 134

 Score =  154 bits (388), Expect = 7e-44
 Identities = 69/96 (71%), Positives = 79/96 (82%)
 Frame = -3

Query: 584 CTNQKLRGTVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNG 405
           C+  KL+  +K     AKA  S++NTK NSVICGDCDGNGAV+CSQCKG GVN VD FNG
Sbjct: 43  CSATKLQSVIK-----AKAAPSNRNTKPNSVICGDCDGNGAVVCSQCKGSGVNPVDFFNG 97

Query: 404 QFKVGESCWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           QFK G+SCWLCGG+K+MLCGNCNGAGFIGGFMN+ D
Sbjct: 98  QFKAGDSCWLCGGRKEMLCGNCNGAGFIGGFMNTDD 133


>OMO61673.1 hypothetical protein CCACVL1_23322, partial [Corchorus capsularis]
          Length = 102

 Score =  152 bits (385), Expect = 7e-44
 Identities = 68/86 (79%), Positives = 73/86 (84%)
 Frame = -3

Query: 554 KFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKVGESCWL 375
           KF  VKAKA  S +NTK NSVICGDCDGNGAV CSQCKG GVN VD FNG+FK G+SCWL
Sbjct: 16  KFHSVKAKAVPSSRNTKPNSVICGDCDGNGAVPCSQCKGSGVNPVDFFNGEFKAGDSCWL 75

Query: 374 CGGKKDMLCGNCNGAGFIGGFMNSFD 297
           CGGKK MLCGNCNGAGFIGGFM++ D
Sbjct: 76  CGGKKQMLCGNCNGAGFIGGFMSTDD 101


>CBI21032.3 unnamed protein product, partial [Vitis vinifera]
          Length = 117

 Score =  153 bits (386), Expect = 8e-44
 Identities = 74/104 (71%), Positives = 81/104 (77%)
 Frame = -3

Query: 608 RPGFCSTGCTNQKLRGTVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGV 429
           RPGF       QK   T KF   KA  +NS + TK NSV+C DCDGNGAVLCSQCKG GV
Sbjct: 22  RPGF-------QKFPAT-KFSPPKAAQSNS-KGTKPNSVVCADCDGNGAVLCSQCKGSGV 72

Query: 428 NSVDHFNGQFKVGESCWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           NSVD FNGQFK GESCWLCGGKKD+LCGNCNGAGF+GGFM++FD
Sbjct: 73  NSVDFFNGQFKAGESCWLCGGKKDILCGNCNGAGFVGGFMSTFD 116


>EEF46973.1 conserved hypothetical protein [Ricinus communis]
          Length = 131

 Score =  153 bits (387), Expect = 9e-44
 Identities = 68/89 (76%), Positives = 75/89 (84%)
 Frame = -3

Query: 563 GTVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKVGES 384
           GT +   +K +A  SDQN K  SVIC DCDGNGAVLCSQCKG GVNSVD FNGQFK G+S
Sbjct: 42  GTARRCYLKTRAAPSDQNPKPKSVICTDCDGNGAVLCSQCKGSGVNSVDIFNGQFKAGDS 101

Query: 383 CWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           CWLCGGKKDMLCGNCNGAGF+GGFM++FD
Sbjct: 102 CWLCGGKKDMLCGNCNGAGFLGGFMSTFD 130


>XP_002515524.2 PREDICTED: uncharacterized protein LOC8259462 [Ricinus communis]
          Length = 133

 Score =  153 bits (387), Expect = 1e-43
 Identities = 68/89 (76%), Positives = 75/89 (84%)
 Frame = -3

Query: 563 GTVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKVGES 384
           GT +   +K +A  SDQN K  SVIC DCDGNGAVLCSQCKG GVNSVD FNGQFK G+S
Sbjct: 44  GTARRCYLKTRAAPSDQNPKPKSVICTDCDGNGAVLCSQCKGSGVNSVDIFNGQFKAGDS 103

Query: 383 CWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           CWLCGGKKDMLCGNCNGAGF+GGFM++FD
Sbjct: 104 CWLCGGKKDMLCGNCNGAGFLGGFMSTFD 132


>KDP33602.1 hypothetical protein JCGZ_07173 [Jatropha curcas]
          Length = 153

 Score =  154 bits (388), Expect = 1e-43
 Identities = 72/111 (64%), Positives = 81/111 (72%), Gaps = 9/111 (8%)
 Frame = -3

Query: 602 GFCSTGCTNQKLRG---------TVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCS 450
           G     C  Q++ G         T +   VK +A+ S+QN K  SVIC DCDGNGAVLCS
Sbjct: 42  GAYKASCGKQEILGIHDFLHSSSTARRHSVKTRASPSNQNPKPKSVICADCDGNGAVLCS 101

Query: 449 QCKGDGVNSVDHFNGQFKVGESCWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           QCKG GVNSVD FNGQFK G+SCWLCGGKKDMLCGNCNGAGFIGGFM++FD
Sbjct: 102 QCKGSGVNSVDVFNGQFKAGDSCWLCGGKKDMLCGNCNGAGFIGGFMSTFD 152


>XP_007011875.1 PREDICTED: uncharacterized protein LOC18587804 [Theobroma cacao]
           EOY29494.1 DnaJ/Hsp40 cysteine-rich domain superfamily
           protein isoform 1 [Theobroma cacao]
          Length = 133

 Score =  153 bits (386), Expect = 1e-43
 Identities = 69/86 (80%), Positives = 75/86 (87%)
 Frame = -3

Query: 554 KFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKVGESCWL 375
           KF  VKAKAT  ++NTK NSVIC DCDGNGAVLCSQC+G GVNSVD FNGQFK G+SCWL
Sbjct: 47  KFLPVKAKATPRNRNTKPNSVICADCDGNGAVLCSQCQGSGVNSVDFFNGQFKAGDSCWL 106

Query: 374 CGGKKDMLCGNCNGAGFIGGFMNSFD 297
           CGGKK MLCGNCNGAGFIGGFM++ D
Sbjct: 107 CGGKKQMLCGNCNGAGFIGGFMSTDD 132


>XP_003631795.1 PREDICTED: uncharacterized protein LOC100854903 [Vitis vinifera]
          Length = 139

 Score =  153 bits (386), Expect = 2e-43
 Identities = 74/104 (71%), Positives = 81/104 (77%)
 Frame = -3

Query: 608 RPGFCSTGCTNQKLRGTVKFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGV 429
           RPGF       QK   T KF   KA  +NS + TK NSV+C DCDGNGAVLCSQCKG GV
Sbjct: 44  RPGF-------QKFPAT-KFSPPKAAQSNS-KGTKPNSVVCADCDGNGAVLCSQCKGSGV 94

Query: 428 NSVDHFNGQFKVGESCWLCGGKKDMLCGNCNGAGFIGGFMNSFD 297
           NSVD FNGQFK GESCWLCGGKKD+LCGNCNGAGF+GGFM++FD
Sbjct: 95  NSVDFFNGQFKAGESCWLCGGKKDILCGNCNGAGFVGGFMSTFD 138


>EOY29496.1 DnaJ/Hsp40 cysteine-rich domain superfamily protein isoform 3,
           partial [Theobroma cacao]
          Length = 140

 Score =  153 bits (386), Expect = 2e-43
 Identities = 69/86 (80%), Positives = 75/86 (87%)
 Frame = -3

Query: 554 KFDCVKAKATNSDQNTKKNSVICGDCDGNGAVLCSQCKGDGVNSVDHFNGQFKVGESCWL 375
           KF  VKAKAT  ++NTK NSVIC DCDGNGAVLCSQC+G GVNSVD FNGQFK G+SCWL
Sbjct: 54  KFLPVKAKATPRNRNTKPNSVICADCDGNGAVLCSQCQGSGVNSVDFFNGQFKAGDSCWL 113

Query: 374 CGGKKDMLCGNCNGAGFIGGFMNSFD 297
           CGGKK MLCGNCNGAGFIGGFM++ D
Sbjct: 114 CGGKKQMLCGNCNGAGFIGGFMSTDD 139

