BLASTX nr result
ID: Rehmannia32_contig00019995
seq
BLASTX 2.2.26 [Sep-21-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Rehmannia32_contig00019995 (418 letters) Database: All non-redundant GenBank CDS translations+PDB+SwissProt+PIR+PRF excluding environmental samples from WGS projects 149,584,005 sequences; 54,822,741,787 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|PIN00799.1| hypothetical protein CDL12_26697 [Handroanthus im... 152 3e-45 ref|XP_011096407.1| uncharacterized protein LOC105175607 [Sesamu... 151 7e-45 ref|XP_012854738.1| PREDICTED: uncharacterized protein LOC105974... 130 7e-37 ref|XP_022875963.1| protein PHOTOSYSTEM I ASSEMBLY 2, chloroplas... 125 8e-35 ref|XP_006350324.1| PREDICTED: protein EMBRYO SAC DEVELOPMENT AR... 120 1e-32 ref|XP_009782305.1| PREDICTED: uncharacterized protein LOC104231... 118 1e-31 ref|XP_019265479.1| PREDICTED: uncharacterized protein LOC109243... 117 3e-31 ref|XP_015056978.1| PREDICTED: protein EMBRYO SAC DEVELOPMENT AR... 116 4e-31 dbj|GAV60316.1| hypothetical protein CFOL_v3_03847 [Cephalotus f... 116 5e-31 gb|KZV40884.1| hypothetical protein F511_05129 [Dorcoceras hygro... 114 3e-30 ref|XP_024026653.1| protein PHOTOSYSTEM I ASSEMBLY 2, chloroplas... 114 6e-30 ref|XP_021767004.1| uncharacterized protein LOC110731460 [Chenop... 114 7e-30 ref|XP_009599534.1| PREDICTED: uncharacterized protein LOC104095... 113 1e-29 ref|XP_021748719.1| uncharacterized protein LOC110714497 [Chenop... 112 2e-29 ref|XP_004250427.1| PREDICTED: protein EMBRYO SAC DEVELOPMENT AR... 112 3e-29 ref|XP_021912274.1| protein EMBRYO SAC DEVELOPMENT ARREST 3, chl... 112 3e-29 gb|PIA47192.1| hypothetical protein AQUCO_01400107v1 [Aquilegia ... 111 4e-29 ref|XP_009782304.1| PREDICTED: uncharacterized protein LOC104231... 111 8e-29 gb|PON49705.1| Heat shock protein DnaJ, cysteine-rich domain con... 110 1e-28 ref|XP_022719989.1| protein PHOTOSYSTEM I ASSEMBLY 2, chloroplas... 110 2e-28 >gb|PIN00799.1| hypothetical protein CDL12_26697 [Handroanthus impetiginosus] Length = 121 Score = 152 bits (385), Expect = 3e-45 Identities = 76/121 (62%), Positives = 86/121 (71%), Gaps = 8/121 (6%) Frame = -3 Query: 344 MKSFYLCSTS----LTFPTQFLHNQLFVQISEETEKQQRGRDFRPQAAKSGGFSLKS--- 186 MKSFYLCS+ TFP+ H QL +SEE +K + GR+F P+AAKSGGFSLKS Sbjct: 1 MKSFYLCSSRSSSLTTFPSLSTHPQLIASLSEEFKKHKTGRNFNPKAAKSGGFSLKSIIN 60 Query: 185 -CQKCKGQGAIECPXXXXXXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGE 9 C+ C GQGAIECP NIFERWKC++CQGFGLKSCPVCGKGGLTPEQRGE Sbjct: 61 KCKTCSGQGAIECPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPVCGKGGLTPEQRGE 120 Query: 8 R 6 R Sbjct: 121 R 121 >ref|XP_011096407.1| uncharacterized protein LOC105175607 [Sesamum indicum] Length = 118 Score = 151 bits (382), Expect = 7e-45 Identities = 76/118 (64%), Positives = 83/118 (70%), Gaps = 5/118 (4%) Frame = -3 Query: 344 MKSFYLCSTSLTFPTQF-LHNQLFVQISEETEKQQRGRDFRPQAAKSGGFSLKS----CQ 180 MKS +LCS+SLTFP LH L + SEE KQ RGRD AA+SGGFSL S C+ Sbjct: 1 MKSLHLCSSSLTFPPLLHLHPHLIISTSEENRKQHRGRDLTLHAARSGGFSLNSITNRCK 60 Query: 179 KCKGQGAIECPXXXXXXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGER 6 CKGQGAIECP NIFERWKC++CQGFGLKSCPVCGKGGLTPEQRGER Sbjct: 61 TCKGQGAIECPGCKGTGKNKKNGNIFERWKCFECQGFGLKSCPVCGKGGLTPEQRGER 118 >ref|XP_012854738.1| PREDICTED: uncharacterized protein LOC105974219 [Erythranthe guttata] gb|EYU22803.1| hypothetical protein MIMGU_mgv1a016872mg [Erythranthe guttata] Length = 103 Score = 130 bits (328), Expect = 7e-37 Identities = 64/113 (56%), Positives = 73/113 (64%) Frame = -3 Query: 344 MKSFYLCSTSLTFPTQFLHNQLFVQISEETEKQQRGRDFRPQAAKSGGFSLKSCQKCKGQ 165 MKS YL S+ LTFP + + T ++ +F AAKSGGFS KSCQ CKGQ Sbjct: 1 MKSLYLLSSRLTFPP----------LIQPTPNRKNSGNFTVHAAKSGGFSFKSCQTCKGQ 50 Query: 164 GAIECPXXXXXXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGER 6 GA+EC NIFERWKC+DCQGFG+K CPVCGKGGLTPEQRGER Sbjct: 51 GAVECQGCKGTGKNKKNGNIFERWKCFDCQGFGMKGCPVCGKGGLTPEQRGER 103 >ref|XP_022875963.1| protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic isoform X1 [Olea europaea var. sylvestris] Length = 107 Score = 125 bits (315), Expect = 8e-35 Identities = 67/117 (57%), Positives = 75/117 (64%), Gaps = 4/117 (3%) Frame = -3 Query: 344 MKSFYLCSTSLTFPTQFLHNQLFVQISEETEKQQRGRDFRPQAAKSGGFSLKS----CQK 177 M S ++C TQF N E+EKQQRG+ F AAKSGGFSLKS CQ Sbjct: 1 MASLFICINC----TQFKLN------ISESEKQQRGKTFTLHAAKSGGFSLKSITSKCQT 50 Query: 176 CKGQGAIECPXXXXXXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGER 6 C+G+GAIEC NIFERWKC++CQGFG KSCPVCGKGGLTPEQRGER Sbjct: 51 CRGEGAIECSGCKGTGKNKKNGNIFERWKCFECQGFGFKSCPVCGKGGLTPEQRGER 107 >ref|XP_006350324.1| PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic [Solanum tuberosum] Length = 122 Score = 120 bits (302), Expect = 1e-32 Identities = 67/129 (51%), Positives = 78/129 (60%), Gaps = 4/129 (3%) Frame = -3 Query: 380 KKLSEFPNQENNMKSFYLCSTSLTFPTQFLHNQLFVQISEETEKQQRGRDFRPQAAKSGG 201 KK F QE M + +C F ++ + IS+ RG+ F AAKSGG Sbjct: 2 KKCVSFLKQEK-MANVSICCC-------FANSSITPTISQFNTVNLRGQRFITSAAKSGG 53 Query: 200 FSLKS----CQKCKGQGAIECPXXXXXXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGG 33 FSLKS C+ C+G+GAIECP NIFERWKC+DCQGFGLKSCPVCGKGG Sbjct: 54 FSLKSIGNRCEGCEGKGAIECPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPVCGKGG 113 Query: 32 LTPEQRGER 6 LTPEQRGER Sbjct: 114 LTPEQRGER 122 >ref|XP_009782305.1| PREDICTED: uncharacterized protein LOC104231074 isoform X2 [Nicotiana sylvestris] Length = 121 Score = 118 bits (295), Expect = 1e-31 Identities = 57/83 (68%), Positives = 61/83 (73%), Gaps = 4/83 (4%) Frame = -3 Query: 242 RGRDFRPQAAKSGGFSLKS----CQKCKGQGAIECPXXXXXXXXXXXXNIFERWKCYDCQ 75 RG+ F A+KSGGFSLKS C+ C GQGAIECP NIFERWKC+DCQ Sbjct: 39 RGQRFVTSASKSGGFSLKSIVNRCENCGGQGAIECPGCKGTGKNKKNGNIFERWKCFDCQ 98 Query: 74 GFGLKSCPVCGKGGLTPEQRGER 6 GFGLKSCPVCGKGGLTPEQRGER Sbjct: 99 GFGLKSCPVCGKGGLTPEQRGER 121 >ref|XP_019265479.1| PREDICTED: uncharacterized protein LOC109243046 isoform X1 [Nicotiana attenuata] gb|OIT35685.1| hypothetical protein A4A49_02881 [Nicotiana attenuata] Length = 121 Score = 117 bits (293), Expect = 3e-31 Identities = 56/83 (67%), Positives = 61/83 (73%), Gaps = 4/83 (4%) Frame = -3 Query: 242 RGRDFRPQAAKSGGFSLKS----CQKCKGQGAIECPXXXXXXXXXXXXNIFERWKCYDCQ 75 RG+ F A+KSGGFSLKS C+ C GQGAIECP NIFERWKC+DCQ Sbjct: 39 RGQRFITSASKSGGFSLKSIVNRCENCGGQGAIECPGCKGTGKNKKNGNIFERWKCFDCQ 98 Query: 74 GFGLKSCPVCGKGGLTPEQRGER 6 GFG+KSCPVCGKGGLTPEQRGER Sbjct: 99 GFGMKSCPVCGKGGLTPEQRGER 121 >ref|XP_015056978.1| PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X1 [Solanum pennellii] Length = 111 Score = 116 bits (291), Expect = 4e-31 Identities = 61/102 (59%), Positives = 68/102 (66%), Gaps = 5/102 (4%) Frame = -3 Query: 296 FLHNQLFVQISEETEKQQRG-RDFRPQAAKSGGFSLKS----CQKCKGQGAIECPXXXXX 132 F ++ + IS+ RG R F AAKSGGFSLKS C+ C G+GAIECP Sbjct: 10 FANSSITPTISQFNTINLRGQRSFVTSAAKSGGFSLKSIGSRCEGCGGKGAIECPGCKGT 69 Query: 131 XXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGER 6 NIFERWKC+DCQGFGLKSCPVCGKGGLTPEQRGER Sbjct: 70 GKNKKNGNIFERWKCFDCQGFGLKSCPVCGKGGLTPEQRGER 111 >dbj|GAV60316.1| hypothetical protein CFOL_v3_03847 [Cephalotus follicularis] Length = 120 Score = 116 bits (291), Expect = 5e-31 Identities = 63/113 (55%), Positives = 71/113 (62%), Gaps = 6/113 (5%) Frame = -3 Query: 326 CSTSLTFPTQFLHNQLFVQISEETE--KQQRGRDFRPQAAKSGGFS----LKSCQKCKGQ 165 C+ S+T P H Q I +++ K+QR AAKSGGFS LK CQ C G+ Sbjct: 8 CNNSMTAPALPGHLQFNSIIWKKSNYNKEQRYTTSTVSAAKSGGFSVNSILKGCQNCGGK 67 Query: 164 GAIECPXXXXXXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGER 6 GAIECP NIFERWKC+DCQGFGLKSCP CGKGGLTPEQRGER Sbjct: 68 GAIECPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPKCGKGGLTPEQRGER 120 >gb|KZV40884.1| hypothetical protein F511_05129 [Dorcoceras hygrometricum] Length = 120 Score = 114 bits (286), Expect = 3e-30 Identities = 65/123 (52%), Positives = 75/123 (60%), Gaps = 10/123 (8%) Frame = -3 Query: 344 MKSFYLCSTS---LTFPTQFLHNQLFVQISEETEKQQRGRD---FRPQAAKSGGFSLKS- 186 MK+ L S+S + FP + + + ++ Q R R F AAKS GFS S Sbjct: 1 MKNMRLISSSALTIMFPQPHI---ITINTEKKGTPQSRERTMMIFTIHAAKSAGFSKNSI 57 Query: 185 ---CQKCKGQGAIECPXXXXXXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQR 15 CQKC+GQGAIECP NIFERWKC+DCQGFGLKSCPVCGKGGLTPEQR Sbjct: 58 SRKCQKCEGQGAIECPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPVCGKGGLTPEQR 117 Query: 14 GER 6 GER Sbjct: 118 GER 120 >ref|XP_024026653.1| protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Morus notabilis] Length = 120 Score = 114 bits (284), Expect = 6e-30 Identities = 63/121 (52%), Positives = 74/121 (61%), Gaps = 8/121 (6%) Frame = -3 Query: 344 MKSFYLCSTSL----TFPTQFLHNQLFVQISEETEKQQRGRDFRPQAAKSGGFSLKS--- 186 M + ++C SL P Q +N+L + + +QQR R +KSGGFSL S Sbjct: 1 MANLHVCCNSLILNPALPPQS-YNKLRPGNPKRSTEQQRRRTTTVFVSKSGGFSLNSILK 59 Query: 185 -CQKCKGQGAIECPXXXXXXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGE 9 CQ C G+GAIECP NIFERWKC+DCQGFGLKSCP CGKGGLTPEQRGE Sbjct: 60 RCQTCGGKGAIECPGCKGTGKNKKNGNIFERWKCFDCQGFGLKSCPNCGKGGLTPEQRGE 119 Query: 8 R 6 R Sbjct: 120 R 120 >ref|XP_021767004.1| uncharacterized protein LOC110731460 [Chenopodium quinoa] Length = 124 Score = 114 bits (284), Expect = 7e-30 Identities = 57/103 (55%), Positives = 66/103 (64%), Gaps = 8/103 (7%) Frame = -3 Query: 290 HNQLFVQISEETEKQQ------RGRDFRPQAAKSGGFS--LKSCQKCKGQGAIECPXXXX 135 HN+LF I + ++ + F P AAKSG FS K C+ C+GQGAIECP Sbjct: 22 HNKLFHHIKPDFKESYVVGPIIQPSRFTPSAAKSGAFSSIFKKCEACRGQGAIECPGCKG 81 Query: 134 XXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGER 6 NIFERWKC++CQGFGLKSCP CGKGGLTPEQRGER Sbjct: 82 TGRNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER 124 >ref|XP_009599534.1| PREDICTED: uncharacterized protein LOC104095185 isoform X1 [Nicotiana tomentosiformis] ref|XP_016493609.1| PREDICTED: uncharacterized protein LOC107812933 isoform X1 [Nicotiana tabacum] ref|XP_016493911.1| PREDICTED: uncharacterized protein LOC107813195 isoform X1 [Nicotiana tabacum] Length = 121 Score = 113 bits (282), Expect = 1e-29 Identities = 54/83 (65%), Positives = 60/83 (72%), Gaps = 4/83 (4%) Frame = -3 Query: 242 RGRDFRPQAAKSGGFSLKS----CQKCKGQGAIECPXXXXXXXXXXXXNIFERWKCYDCQ 75 R + F A+KSGGFSLKS C+ C GQGAI+CP NIFERWKC+DCQ Sbjct: 39 RRQRFITSASKSGGFSLKSIVNRCENCGGQGAIDCPGCKGTGKNKKNGNIFERWKCFDCQ 98 Query: 74 GFGLKSCPVCGKGGLTPEQRGER 6 GFG+KSCPVCGKGGLTPEQRGER Sbjct: 99 GFGMKSCPVCGKGGLTPEQRGER 121 >ref|XP_021748719.1| uncharacterized protein LOC110714497 [Chenopodium quinoa] Length = 124 Score = 112 bits (281), Expect = 2e-29 Identities = 65/117 (55%), Positives = 73/117 (62%), Gaps = 11/117 (9%) Frame = -3 Query: 323 STSLT-FPTQFLH-NQLFVQISEETEKQ-------QRGRDFRPQAAKSGGFS--LKSCQK 177 S SLT P+Q H N+LF I ++ Q R F P AAKSG FS K C+ Sbjct: 9 SCSLTSLPSQHNHHNELFHHIKPVFKESYVVGPIIQHSR-FTPCAAKSGAFSSIFKKCEA 67 Query: 176 CKGQGAIECPXXXXXXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGER 6 C+GQGAIECP NIFERWKC++CQGFGLKSCP CGKGGLTPEQRGER Sbjct: 68 CRGQGAIECPGCKGTGRNKKNGNIFERWKCFECQGFGLKSCPQCGKGGLTPEQRGER 124 >ref|XP_004250427.1| PREDICTED: protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X1 [Solanum lycopersicum] Length = 111 Score = 112 bits (279), Expect = 3e-29 Identities = 59/102 (57%), Positives = 66/102 (64%), Gaps = 5/102 (4%) Frame = -3 Query: 296 FLHNQLFVQISEETEKQQRG-RDFRPQAAKSGGFSLKS----CQKCKGQGAIECPXXXXX 132 F ++ + IS+ RG R AAKSGGFSLKS C+ C G+GAIECP Sbjct: 10 FANSSITPTISQFNTINSRGQRSLVTSAAKSGGFSLKSIGSRCEGCGGKGAIECPGCKGT 69 Query: 131 XXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGER 6 NIFERWKC+DCQGFGLKSCPVCGK GLTPEQRGER Sbjct: 70 GKNKKNGNIFERWKCFDCQGFGLKSCPVCGKEGLTPEQRGER 111 >ref|XP_021912274.1| protein EMBRYO SAC DEVELOPMENT ARREST 3, chloroplastic isoform X1 [Carica papaya] Length = 113 Score = 112 bits (279), Expect = 3e-29 Identities = 55/85 (64%), Positives = 60/85 (70%), Gaps = 4/85 (4%) Frame = -3 Query: 248 QQRGRDFRPQAAKSGGFS----LKSCQKCKGQGAIECPXXXXXXXXXXXXNIFERWKCYD 81 QQR AAKSGGF LKSCQKC+G+GAIECP NIFERWKC++ Sbjct: 29 QQRRTTIPTLAAKSGGFPFNSLLKSCQKCEGKGAIECPGCKGTGKNKKNGNIFERWKCFN 88 Query: 80 CQGFGLKSCPVCGKGGLTPEQRGER 6 CQGFGLKSCP CG+GGLTPEQRGER Sbjct: 89 CQGFGLKSCPNCGRGGLTPEQRGER 113 >gb|PIA47192.1| hypothetical protein AQUCO_01400107v1 [Aquilegia coerulea] Length = 115 Score = 111 bits (278), Expect = 4e-29 Identities = 57/112 (50%), Positives = 66/112 (58%), Gaps = 4/112 (3%) Frame = -3 Query: 329 LCSTSLTFPTQFLHNQLFVQISEETEKQQRGRDFRPQAAKSGGFSL----KSCQKCKGQG 162 LC TS+ H+ ++ + K + F A+K GGFS K C+KC GQG Sbjct: 6 LCGTSVIPSKLVFHSSYPRELKVQPAKAKA--TFTTSASKPGGFSFNPLAKKCEKCAGQG 63 Query: 161 AIECPXXXXXXXXXXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGER 6 IECP NIFERWKCYDCQGFG+KSCP CGKGGLTPEQRGER Sbjct: 64 GIECPGCKGTGRNKKNGNIFERWKCYDCQGFGMKSCPECGKGGLTPEQRGER 115 >ref|XP_009782304.1| PREDICTED: uncharacterized protein LOC104231074 isoform X1 [Nicotiana sylvestris] Length = 125 Score = 111 bits (277), Expect = 8e-29 Identities = 56/87 (64%), Positives = 61/87 (70%), Gaps = 8/87 (9%) Frame = -3 Query: 242 RGRDFRPQAAKSGGFSLKS----CQKCKGQGAIECPXXXXXXXXXXXXNIFERW----KC 87 RG+ F A+KSGGFSLKS C+ C GQGAIECP NIFERW +C Sbjct: 39 RGQRFVTSASKSGGFSLKSIVNRCENCGGQGAIECPGCKGTGKNKKNGNIFERWNKFCRC 98 Query: 86 YDCQGFGLKSCPVCGKGGLTPEQRGER 6 +DCQGFGLKSCPVCGKGGLTPEQRGER Sbjct: 99 FDCQGFGLKSCPVCGKGGLTPEQRGER 125 >gb|PON49705.1| Heat shock protein DnaJ, cysteine-rich domain containing protein [Parasponia andersonii] Length = 120 Score = 110 bits (275), Expect = 1e-28 Identities = 58/99 (58%), Positives = 66/99 (66%), Gaps = 4/99 (4%) Frame = -3 Query: 290 HNQLFVQISEETEKQQRGRDFRPQAAKSGGFSLKS----CQKCKGQGAIECPXXXXXXXX 123 HN+L +Q S T ++R AAKSGGFSL S C+ C G+GAIECP Sbjct: 25 HNKL-IQSSSGTSSERRQTPVF--AAKSGGFSLNSILKRCETCSGKGAIECPGCKGTGKN 81 Query: 122 XXXXNIFERWKCYDCQGFGLKSCPVCGKGGLTPEQRGER 6 NIFERWKC+DCQGFGL+SCP CGKGGLTPEQRGER Sbjct: 82 KKNGNIFERWKCFDCQGFGLRSCPNCGKGGLTPEQRGER 120 >ref|XP_022719989.1| protein PHOTOSYSTEM I ASSEMBLY 2, chloroplastic [Durio zibethinus] Length = 136 Score = 110 bits (276), Expect = 2e-28 Identities = 55/84 (65%), Positives = 59/84 (70%), Gaps = 2/84 (2%) Frame = -3 Query: 251 KQQRGRDFRPQAAKSGGFS--LKSCQKCKGQGAIECPXXXXXXXXXXXXNIFERWKCYDC 78 KQQR R AAKSG + LK CQKC G+GAIECP NIFERWKC+DC Sbjct: 53 KQQRSRVVTAFAAKSGPLNSILKRCQKCGGKGAIECPGCKGTGKNKKNGNIFERWKCFDC 112 Query: 77 QGFGLKSCPVCGKGGLTPEQRGER 6 QGFGLKSCP CG+GGLTPEQRGER Sbjct: 113 QGFGLKSCPKCGQGGLTPEQRGER 136