BLASTX nr result
ID: Zanthoxylum22_contig00014031
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Zanthoxylum22_contig00014031
         (1009 letters)

Database: ./nr
           77,306,371 sequences; 28,104,191,420 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_006444209.1| hypothetical protein CICLE_v10022350mg [Citr...  219  3e-54
ref|XP_012070589.1| PREDICTED: U1 small nuclear ribonucleoprotei...  171  9e-40
ref|XP_002304170.1| proline-rich family protein [Populus trichoc...  163  2e-37
ref|XP_007010252.1| C2H2 and C2HC zinc fingers superfamily prote...  163  2e-37
ref|XP_012490469.1| PREDICTED: U1 small nuclear ribonucleoprotei...  160  1e-36
ref|XP_011039992.1| PREDICTED: U1 small nuclear ribonucleoprotei...  160  1e-36
ref|XP_012490474.1| PREDICTED: U1 small nuclear ribonucleoprotei...  160  2e-36
ref|XP_007010255.1| C2H2 and C2HC zinc fingers superfamily prote...  131  5e-36
ref|XP_007010256.1| C2H2 and C2HC zinc fingers superfamily prote...  131  5e-36
ref|XP_010044039.1| PREDICTED: U1 small nuclear ribonucleoprotei...  155  5e-35
gb|KCW86043.1| hypothetical protein EUGRSUZ_B02749 [Eucalyptus g...  155  5e-35
ref|XP_012473506.1| PREDICTED: U1 small nuclear ribonucleoprotei...  155  6e-35
gb|KHG09880.1| U1 small nuclear ribonucleoprotein C [Gossypium a...  154  1e-34
ref|XP_010087443.1| U1 small nuclear ribonucleoprotein C [Morus ...  153  2e-34
ref|XP_002270246.1| PREDICTED: U1 small nuclear ribonucleoprotei...  153  2e-34
ref|XP_004148886.1| PREDICTED: U1 small nuclear ribonucleoprotei...  153  2e-34
ref|XP_008451425.1| PREDICTED: U1 small nuclear ribonucleoprotei...  152  3e-34
ref|XP_007010254.1| C2H2 and C2HC zinc fingers superfamily prote... 
152 3e-34 ref|XP_007010253.1| C2H2 and C2HC zinc fingers superfamily prote... 152 3e-34 ref|XP_010265587.1| PREDICTED: U1 small nuclear ribonucleoprotei... 152 4e-34 >ref|XP_006444209.1| hypothetical protein CICLE_v10022350mg [Citrus clementina] gi|568852371|ref|XP_006479851.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like [Citrus sinensis] gi|557546471|gb|ESR57449.1| hypothetical protein CICLE_v10022350mg [Citrus clementina] gi|641868734|gb|KDO87418.1| hypothetical protein CISIN_1g029190mg [Citrus sinensis] gi|641868735|gb|KDO87419.1| hypothetical protein CISIN_1g029190mg [Citrus sinensis] Length = 197 Score = 219 bits (558), Expect = 3e-54 Identities = 114/171 (66%), Positives = 122/171 (71%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQR RLPVLP Sbjct: 27 GYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRPRLPVLP 86 Query: 827 TPVMPMHGGAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQRPY 648 TPVMPM G AP + GQ+NG RP Sbjct: 87 TPVMPMTGSAPLVPGMRPPVLPRPGPSPPGYVSAPGMPPMMAPPGAPSAPGQLNGFPRPP 146 Query: 647 SMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSGNVNAQAPESNH 495 ++MNPTAV+GS APPASS+GAP MA PQTYQANPTVP SGN+NAQAPE NH Sbjct: 147 AVMNPTAVSGSAAPPASSSGAPSMATPQTYQANPTVPTSGNLNAQAPEMNH 197 >ref|XP_012070589.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like [Jatropha curcas] gi|802585732|ref|XP_012070590.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like [Jatropha curcas] gi|643732248|gb|KDP39407.1| hypothetical protein JCGZ_03689 [Jatropha curcas] Length = 207 Score = 171 bits (433), Expect = 9e-40 Identities = 98/181 (54%), Positives = 107/181 (59%), Gaps = 10/181 (5%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQR RLPVLP Sbjct: 27 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRPRLPVLP 86 Query: 827 TPVMPMHG------GAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVN 666 TPVMP+ 
G + GQ N Sbjct: 87 TPVMPIAGNGQLPMNTALLPGIRPPVLPRPVPGAPGYMPGQVMQPIMAPPGAPSIPGQAN 146 Query: 665 GLQRPYSMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSGNV----NAQAPESN 498 G+ RP MM PT+V GSTA PA S+G P + P YQ NP VP SG NA A E+N Sbjct: 147 GMPRPPMMMPPTSVPGSTAVPAPSSGTPSIVPPPNYQPNPAVPTSGGFDSFNNAPASETN 206 Query: 497 H 495 H Sbjct: 207 H 207 >ref|XP_002304170.1| proline-rich family protein [Populus trichocarpa] gi|222841602|gb|EEE79149.1| proline-rich family protein [Populus trichocarpa] Length = 202 Score = 163 bits (413), Expect = 2e-37 Identities = 96/177 (54%), Positives = 105/177 (59%), Gaps = 6/177 (3%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR YYQQFEEQQTQS+IDQRIKEHLGQTAAFQQVGAAYNQHL+ QR RLPVLP Sbjct: 27 GYKHKANVRIYYQQFEEQQTQSIIDQRIKEHLGQTAAFQQVGAAYNQHLMVQRPRLPVLP 86 Query: 827 TPVMPMHG-GAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQRP 651 TPVMP+ G AP GQ+NG+ RP Sbjct: 87 TPVMPIGGNNAPLFPGMRPPVLPRPMPGAPGYMNPPMMPPMMAPPGAPSLPGQMNGIPRP 146 Query: 650 YSMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSG-----NVNAQAPESNH 495 +M+ V GSTA P S G P M P TYQAN SG NVNA APE+NH Sbjct: 147 PTMIAQPNVPGSTAAPTPS-GPPSMGPPVTYQANQAATTSGGFDSFNVNAAAPEANH 202 >ref|XP_007010252.1| C2H2 and C2HC zinc fingers superfamily protein isoform 1 [Theobroma cacao] gi|508727165|gb|EOY19062.1| C2H2 and C2HC zinc fingers superfamily protein isoform 1 [Theobroma cacao] Length = 208 Score = 163 bits (412), Expect = 2e-37 Identities = 91/182 (50%), Positives = 104/182 (57%), Gaps = 11/182 (6%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAA+NQHL+AQR RLPVLP Sbjct: 27 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAFNQHLMAQRPRLPVLP 86 Query: 827 TPVMPMHGGA------PSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVN 666 TPVMP+ G A P + GQ+N Sbjct: 87 TPVMPIPGAAPLPMNQPMVPGIRPPVLPRPLPGPPGYVPAPGMPPMVAPPGAPSLPGQIN 146 Query: 665 GLQRPYSMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSG-----NVNAQAPES 501 G+ RP ++ T V G+ P SS 
AP M P +YQ NP P G N NAQ E+ Sbjct: 147 GVPRPPTLAPLTTVPGTATTPTSSNAAPTMVTPASYQTNPAAPTGGGFDNFNANAQPSEA 206 Query: 500 NH 495 NH Sbjct: 207 NH 208 >ref|XP_012490469.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like isoform X1 [Gossypium raimondii] gi|823188311|ref|XP_012490470.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like isoform X1 [Gossypium raimondii] gi|823188316|ref|XP_012490471.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like isoform X1 [Gossypium raimondii] gi|823188319|ref|XP_012490472.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like isoform X1 [Gossypium raimondii] gi|823188322|ref|XP_012490473.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like isoform X1 [Gossypium raimondii] gi|763774816|gb|KJB41939.1| hypothetical protein B456_007G132200 [Gossypium raimondii] gi|763774817|gb|KJB41940.1| hypothetical protein B456_007G132200 [Gossypium raimondii] gi|763774818|gb|KJB41941.1| hypothetical protein B456_007G132200 [Gossypium raimondii] gi|763774819|gb|KJB41942.1| hypothetical protein B456_007G132200 [Gossypium raimondii] gi|763774821|gb|KJB41944.1| hypothetical protein B456_007G132200 [Gossypium raimondii] Length = 208 Score = 160 bits (406), Expect = 1e-36 Identities = 91/181 (50%), Positives = 105/181 (58%), Gaps = 11/181 (6%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQ AAF QVGAA+NQHL+AQR RLPVLP Sbjct: 27 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAFNQHLMAQRPRLPVLP 86 Query: 827 TPVMPMHGGA------PSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVN 666 TPVMP+ GA P + GQ+N Sbjct: 87 TPVMPIPRGAPLPINQPMVPGIRPPVLPRPVPGAPGYVPVPGMPPTVAPPGAPSFPGQIN 146 Query: 665 GLQRPYSMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSG-----NVNAQAPES 501 GL +P ++ P+ VT + P SS AP MA P YQ+NP P SG N NAQ E+ Sbjct: 147 GLPQPPTLAPPSTVTVTATTPTSSNAAPTMATPALYQSNPAAPASGGFDNFNANAQPSEA 206 Query: 500 N 498 N Sbjct: 207 N 207 >ref|XP_011039992.1| PREDICTED: U1 small nuclear ribonucleoprotein C [Populus 
euphratica] Length = 202 Score = 160 bits (406), Expect = 1e-36 Identities = 95/177 (53%), Positives = 104/177 (58%), Gaps = 6/177 (3%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR YYQQFEEQQTQS+IDQRIKEHLGQTAAFQQVGAAYNQHL+ QR RLPVLP Sbjct: 27 GYKHKANVRIYYQQFEEQQTQSIIDQRIKEHLGQTAAFQQVGAAYNQHLMVQRPRLPVLP 86 Query: 827 TPVMPMHG-GAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQRP 651 TPVMP+ G A GQ+NG+ RP Sbjct: 87 TPVMPIGGNNAQLFPGMRPPVLPRPMPGAPGYMNPPMMPPMMAPPGAPSLPGQMNGVPRP 146 Query: 650 YSMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSG-----NVNAQAPESNH 495 +M+ V GSTA P S G P M P TYQAN SG NVNA APE+NH Sbjct: 147 PTMIAQPTVPGSTAAPTPS-GPPSMVPPVTYQANQAATTSGGFDSFNVNATAPEANH 202 >ref|XP_012490474.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like isoform X2 [Gossypium raimondii] gi|823188328|ref|XP_012490475.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like isoform X2 [Gossypium raimondii] gi|763774820|gb|KJB41943.1| hypothetical protein B456_007G132200 [Gossypium raimondii] gi|763774822|gb|KJB41945.1| hypothetical protein B456_007G132200 [Gossypium raimondii] gi|763774823|gb|KJB41946.1| hypothetical protein B456_007G132200 [Gossypium raimondii] gi|763774824|gb|KJB41947.1| hypothetical protein B456_007G132200 [Gossypium raimondii] Length = 205 Score = 160 bits (405), Expect = 2e-36 Identities = 91/178 (51%), Positives = 105/178 (58%), Gaps = 8/178 (4%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQ AAF QVGAA+NQHL+AQR RLPVLP Sbjct: 27 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQAAAFHQVGAAFNQHLMAQRPRLPVLP 86 Query: 827 TPVMPMHGGAP---SIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQ 657 TPVMP+ GAP + GQ+NGL Sbjct: 87 TPVMPIPRGAPLPINQPMVPGIRPPVLPRPVPGYVPVPGMPPTVAPPGAPSFPGQINGLP 146 Query: 656 RPYSMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSG-----NVNAQAPESN 498 +P ++ P+ VT + P SS AP MA P YQ+NP P SG N NAQ E+N Sbjct: 147 
QPPTLAPPSTVTVTATTPTSSNAAPTMATPALYQSNPAAPASGGFDNFNANAQPSEAN 204 >ref|XP_007010255.1| C2H2 and C2HC zinc fingers superfamily protein isoform 4, partial [Theobroma cacao] gi|590566524|ref|XP_007010257.1| C2H2 and C2HC zinc fingers superfamily protein isoform 4, partial [Theobroma cacao] gi|508727168|gb|EOY19065.1| C2H2 and C2HC zinc fingers superfamily protein isoform 4, partial [Theobroma cacao] gi|508727170|gb|EOY19067.1| C2H2 and C2HC zinc fingers superfamily protein isoform 4, partial [Theobroma cacao] Length = 163 Score = 131 bits (329), Expect(2) = 5e-36 Identities = 62/71 (87%), Positives = 67/71 (94%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAA+NQHL+AQR RLPVLP Sbjct: 17 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAFNQHLMAQRPRLPVLP 76 Query: 827 TPVMPMHGGAP 795 TPVMP+ G AP Sbjct: 77 TPVMPIPGAAP 87 Score = 48.9 bits (115), Expect(2) = 5e-36 Identities = 20/44 (45%), Positives = 26/44 (59%) Frame = -2 Query: 729 CSRHATNDGTSRCSFCTWSSKWSPKTLFYDESNSCYW*YSSTCF 598 CSRHATN GT+RCSF WS+KW + ++ W + T F Sbjct: 119 CSRHATNGGTTRCSFLAWSNKWCSTASYIGSPDNSSWNCNDTNF 162 >ref|XP_007010256.1| C2H2 and C2HC zinc fingers superfamily protein isoform 5, partial [Theobroma cacao] gi|508727169|gb|EOY19066.1| C2H2 and C2HC zinc fingers superfamily protein isoform 5, partial [Theobroma cacao] Length = 161 Score = 131 bits (329), Expect(2) = 5e-36 Identities = 62/71 (87%), Positives = 67/71 (94%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAA+NQHL+AQR RLPVLP Sbjct: 17 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAFNQHLMAQRPRLPVLP 76 Query: 827 TPVMPMHGGAP 795 TPVMP+ G AP Sbjct: 77 TPVMPIPGAAP 87 Score = 48.9 bits (115), Expect(2) = 5e-36 Identities = 20/44 (45%), Positives = 26/44 (59%) Frame = -2 Query: 729 CSRHATNDGTSRCSFCTWSSKWSPKTLFYDESNSCYW*YSSTCF 598 CSRHATN GT+RCSF WS+KW + ++ W + T F Sbjct: 117 
CSRHATNGGTTRCSFLAWSNKWCSTASYIGSPDNSSWNCNDTNF 160 >ref|XP_010044039.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like [Eucalyptus grandis] Length = 201 Score = 155 bits (392), Expect = 5e-35 Identities = 88/176 (50%), Positives = 101/176 (57%), Gaps = 5/176 (2%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR YYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAAYNQHL+AQR RLP+LP Sbjct: 27 GYKHKANVRIYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLMAQRPRLPILP 86 Query: 827 TPVMPMHGGAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQRPY 648 TP+MPM G + Q+NG+ RP Sbjct: 87 TPMMPM-GPNMQLAPGMRPPVLPRPPPGAPGYIPPPGMPAMAPAPGAPSMPQMNGMPRPP 145 Query: 647 SMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSG-----NVNAQAPESNH 495 ++ P + GS SS AP M P YQ NP P SG N NA+AP++NH Sbjct: 146 TLNVPPSGPGSALNLTSSGSAPSMVPPPAYQMNPAGPTSGGFDNFNANAKAPDANH 201 >gb|KCW86043.1| hypothetical protein EUGRSUZ_B02749 [Eucalyptus grandis] Length = 242 Score = 155 bits (392), Expect = 5e-35 Identities = 88/176 (50%), Positives = 101/176 (57%), Gaps = 5/176 (2%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR YYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAAYNQHL+AQR RLP+LP Sbjct: 68 GYKHKANVRIYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAYNQHLMAQRPRLPILP 127 Query: 827 TPVMPMHGGAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQRPY 648 TP+MPM G + Q+NG+ RP Sbjct: 128 TPMMPM-GPNMQLAPGMRPPVLPRPPPGAPGYIPPPGMPAMAPAPGAPSMPQMNGMPRPP 186 Query: 647 SMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSG-----NVNAQAPESNH 495 ++ P + GS SS AP M P YQ NP P SG N NA+AP++NH Sbjct: 187 TLNVPPSGPGSALNLTSSGSAPSMVPPPAYQMNPAGPTSGGFDNFNANAKAPDANH 242 >ref|XP_012473506.1| PREDICTED: U1 small nuclear ribonucleoprotein C [Gossypium raimondii] gi|763755215|gb|KJB22546.1| hypothetical protein B456_004G053600 [Gossypium raimondii] Length = 199 Score = 155 bits (391), Expect = 6e-35 Identities = 90/175 (51%), Positives = 104/175 (59%), Gaps = 5/175 (2%) Frame = -3 Query: 1007 
GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAA+NQHL+AQR RLPV+ Sbjct: 27 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAFNQHLMAQRPRLPVMS 86 Query: 827 TPVMPMHGGAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQRPY 648 P P+ P + GQVNGL RP Sbjct: 87 IPGAPLPVNQPMV-PGMRPPVLPRPIPGAPGYIPVPGMPPMMAPPGAPLPGQVNGLPRPP 145 Query: 647 SMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSG-----NVNAQAPESN 498 ++ PT V+G+ P SS GAP ++AP YQANPT P SG N NAQ E+N Sbjct: 146 TLAPPTTVSGTVTTPTSSNGAPTISAP--YQANPTAPTSGGFDNFNANAQPSEAN 198 >gb|KHG09880.1| U1 small nuclear ribonucleoprotein C [Gossypium arboreum] Length = 199 Score = 154 bits (389), Expect = 1e-34 Identities = 90/175 (51%), Positives = 103/175 (58%), Gaps = 5/175 (2%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAA+NQHL+AQR RLPV+ Sbjct: 27 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAFNQHLMAQRPRLPVMS 86 Query: 827 TPVMPMHGGAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQRPY 648 P P+ P + GQVNGL RP Sbjct: 87 IPGAPLPVNQPMV-PGMRPPVLPRPIPGAPGYIPVPGMPPMMAPPGAPLPGQVNGLPRPP 145 Query: 647 SMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSG-----NVNAQAPESN 498 ++ PT V+G+ P SS GAP +AP YQANPT P SG N NAQ E+N Sbjct: 146 TLAPPTTVSGTATTPTSSNGAPTTSAP--YQANPTAPTSGGFDNFNANAQPSEAN 198 >ref|XP_010087443.1| U1 small nuclear ribonucleoprotein C [Morus notabilis] gi|587838420|gb|EXB29124.1| U1 small nuclear ribonucleoprotein C [Morus notabilis] Length = 250 Score = 153 bits (387), Expect = 2e-34 Identities = 91/177 (51%), Positives = 100/177 (56%), Gaps = 7/177 (3%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAA-FQQVGAAYNQHLLAQRIRLPVL 831 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQ AA + VGAAYNQHLL QR RLPVL Sbjct: 73 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQAAAAYHHVGAAYNQHLLVQRPRLPVL 132 Query: 830 PTPVMPMHGGAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQRP 651 PTP+MP G + GQVN QRP Sbjct: 133 
PTPIMPQVPGGAPLIPGIRPPVLPRPVPGAPGYGPPTMPLMVAPPGAPSISGQVNVPQRP 192 Query: 650 YSMMNPTAVTGSTAPPASSTGAPHMAAPQT-YQANPTVPMSG-----NVNAQAPESN 498 ++ PT + GS A P G P M AP YQANP P SG NVN QAPES+ Sbjct: 193 PTLSVPTTIPGSLATPTPLNGGPLMTAPTAIYQANPVAPTSGGFDSFNVNMQAPESS 249 >ref|XP_002270246.1| PREDICTED: U1 small nuclear ribonucleoprotein C [Vitis vinifera] gi|363805533|sp|F6HQ26.1|RU1C_VITVI RecName: Full=U1 small nuclear ribonucleoprotein C; Short=U1 snRNP C; Short=U1-C; Short=U1C Length = 213 Score = 153 bits (387), Expect = 2e-34 Identities = 95/188 (50%), Positives = 104/188 (55%), Gaps = 17/188 (9%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLA-----QRIR 843 GYKHK+NVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHL++ R R Sbjct: 27 GYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLVSFPGNPPRPR 86 Query: 842 LPVLPTPVMPMHGGAP-------SIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 684 LPVLPTP MP+ G AP Sbjct: 87 LPVLPTPGMPVAGSAPLPMNSPLVPGMRPPVLPRPVPGAPGYMPAPGMPSMMAPPGAPSM 146 Query: 683 XXGQVNGLQRPYSMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSG-----NVN 519 +N L RP +M P AV GST+ P S GAP M YQANP P SG N+N Sbjct: 147 PMPPLNSLPRPPTMNVPPAVPGSTSTPTSG-GAPSMMTQPMYQANPAGPTSGGFDSFNIN 205 Query: 518 AQAPESNH 495 AQ PE+NH Sbjct: 206 AQGPEANH 213 >ref|XP_004148886.1| PREDICTED: U1 small nuclear ribonucleoprotein C [Cucumis sativus] gi|700189602|gb|KGN44835.1| hypothetical protein Csa_7G390140 [Cucumis sativus] Length = 197 Score = 153 bits (387), Expect = 2e-34 Identities = 93/177 (52%), Positives = 103/177 (58%), Gaps = 6/177 (3%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVRSYYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAA+NQHLL QR RLPVLP Sbjct: 27 GYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAFNQHLLGQRPRLPVLP 86 Query: 827 TPVMPMHGGAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQRPY 648 TPVMP G AP + GQVN RP Sbjct: 87 TPVMP--GAAPGLMPGIRPPVLPRPIPGAPGYLPTPTMPPMMAPPGAPIPGQVNIPSRP- 143 Query: 647 SMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSGNVNA------QAPESNH 495 P + 
GS P+S+ GAP +AAP TYQANP P SG ++ + ESNH Sbjct: 144 --PPPAPLPGSAPQPSSTNGAP-LAAPSTYQANPAAPGSGGYDSFTSMAQPSSESNH 197 >ref|XP_008451425.1| PREDICTED: U1 small nuclear ribonucleoprotein C [Cucumis melo] Length = 197 Score = 152 bits (385), Expect = 3e-34 Identities = 92/177 (51%), Positives = 103/177 (58%), Gaps = 6/177 (3%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVRSYYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAA+NQHLL QR RLPVLP Sbjct: 27 GYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAFNQHLLGQRPRLPVLP 86 Query: 827 TPVMPMHGGAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVNGLQRPY 648 TPV+P G AP + GQVN RP Sbjct: 87 TPVIP--GAAPGLMPGIRPPVLPRPIPGAPGYLPTPTMPPMMAPPGAPIPGQVNIPSRP- 143 Query: 647 SMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSGNVNA------QAPESNH 495 P + GS P+S+ GAP +AAP TYQANP P SG ++ + ESNH Sbjct: 144 --PPPAPIPGSAPQPSSTNGAP-LAAPSTYQANPAAPGSGGYDSFTSMAQPSSESNH 197 >ref|XP_007010254.1| C2H2 and C2HC zinc fingers superfamily protein isoform 3 [Theobroma cacao] gi|508727167|gb|EOY19064.1| C2H2 and C2HC zinc fingers superfamily protein isoform 3 [Theobroma cacao] Length = 211 Score = 152 bits (385), Expect = 3e-34 Identities = 85/167 (50%), Positives = 98/167 (58%), Gaps = 6/167 (3%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAA+NQHL+AQR RLPVLP Sbjct: 43 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAFNQHLMAQRPRLPVLP 102 Query: 827 TPVMPMHGGA------PSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVN 666 TPVMP+ G A P + GQ+N Sbjct: 103 TPVMPIPGAAPLPMNQPMVPGIRPPVLPRPLPGPPGYVPAPGMPPMVAPPGAPSLPGQIN 162 Query: 665 GLQRPYSMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSGN 525 G+ RP ++ T V G+ P SS AP M P +YQ NP P +GN Sbjct: 163 GVPRPPTLAPLTTVPGTATTPTSSNAAPTMVTPASYQTNPAAP-TGN 208 >ref|XP_007010253.1| C2H2 and C2HC zinc fingers superfamily protein isoform 2 [Theobroma cacao] gi|508727166|gb|EOY19063.1| C2H2 and C2HC zinc fingers superfamily protein isoform 2 [Theobroma cacao] Length = 
195 Score = 152 bits (385), Expect = 3e-34 Identities = 85/167 (50%), Positives = 98/167 (58%), Gaps = 6/167 (3%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLAQRIRLPVLP 828 GYKHK+NVR+YYQQFEEQQTQSLIDQRIKEHLGQ AAFQQVGAA+NQHL+AQR RLPVLP Sbjct: 27 GYKHKANVRTYYQQFEEQQTQSLIDQRIKEHLGQAAAFQQVGAAFNQHLMAQRPRLPVLP 86 Query: 827 TPVMPMHGGA------PSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQVN 666 TPVMP+ G A P + GQ+N Sbjct: 87 TPVMPIPGAAPLPMNQPMVPGIRPPVLPRPLPGPPGYVPAPGMPPMVAPPGAPSLPGQIN 146 Query: 665 GLQRPYSMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSGN 525 G+ RP ++ T V G+ P SS AP M P +YQ NP P +GN Sbjct: 147 GVPRPPTLAPLTTVPGTATTPTSSNAAPTMVTPASYQTNPAAP-TGN 192 >ref|XP_010265587.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like isoform X2 [Nelumbo nucifera] gi|720030675|ref|XP_010265588.1| PREDICTED: U1 small nuclear ribonucleoprotein C-like isoform X2 [Nelumbo nucifera] Length = 237 Score = 152 bits (384), Expect = 4e-34 Identities = 88/173 (50%), Positives = 100/173 (57%), Gaps = 9/173 (5%) Frame = -3 Query: 1007 GYKHKSNVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLLA-----QRIR 843 GYKHK+NVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHL + QR R Sbjct: 27 GYKHKANVRSYYQQFEEQQTQSLIDQRIKEHLGQTAAFQQVGAAYNQHLASFQANPQRPR 86 Query: 842 LPVLPTPVMPMHG----GAPSIXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG 675 LP+LPTP++P G S+ Sbjct: 87 LPILPTPILPTGAPQVPGTSSLVPGIRPPVLPRPVAGAPGYAAAPSMPSVFGPSGTPLPM 146 Query: 674 QVNGLQRPYSMMNPTAVTGSTAPPASSTGAPHMAAPQTYQANPTVPMSGNVNA 516 QVNGL RP ++ +PT V G TA P +S GAP M P YQANPT SG ++ Sbjct: 147 QVNGLPRPPTINSPTTVPGGTAAP-TSNGAPSMVTPVMYQANPTTATSGGFDS 198