BLASTX nr result
ID: Forsythia23_contig00011506
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Forsythia23_contig00011506 (1041 letters) Database: ./nr 69,698,275 sequences; 24,982,196,650 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_011094009.1| PREDICTED: DNA-directed RNA polymerase III s... 377 e-101 ref|XP_012851133.1| PREDICTED: DNA-directed RNA polymerase III s... 327 7e-87 ref|XP_009631588.1| PREDICTED: uncharacterized protein LOC104121... 325 3e-86 ref|XP_010656128.1| PREDICTED: uncharacterized protein LOC100256... 314 8e-83 ref|XP_006362806.1| PREDICTED: uncharacterized protein LOC102600... 310 8e-82 ref|XP_010321121.1| PREDICTED: uncharacterized protein LOC101251... 307 7e-81 ref|XP_007012676.1| DNA-directed RNA polymerase III subunit RPC4... 300 9e-79 ref|XP_010049526.1| PREDICTED: DNA-directed RNA polymerase III s... 294 8e-77 ref|XP_010049527.1| PREDICTED: DNA-directed RNA polymerase III s... 294 8e-77 gb|KJB83020.1| hypothetical protein B456_013G225500 [Gossypium r... 293 2e-76 ref|XP_012461771.1| PREDICTED: uncharacterized protein LOC105781... 293 2e-76 ref|XP_010270534.1| PREDICTED: uncharacterized protein LOC104606... 290 9e-76 gb|KHG12357.1| DNA-directed RNA polymerase III subunit RPC4 [Gos... 288 3e-75 ref|XP_007154594.1| hypothetical protein PHAVU_003G132000g [Phas... 288 6e-75 ref|XP_012448093.1| PREDICTED: uncharacterized protein LOC105771... 286 2e-74 emb|CDP03102.1| unnamed protein product [Coffea canephora] 286 2e-74 ref|XP_012448095.1| PREDICTED: uncharacterized protein LOC105771... 285 5e-74 gb|KJB54458.1| hypothetical protein B456_009G035100 [Gossypium r... 285 5e-74 ref|XP_002516293.1| DNA binding protein, putative [Ricinus commu... 284 6e-74 gb|KDO54063.1| hypothetical protein CISIN_1g022055mg [Citrus sin... 283 2e-73 >ref|XP_011094009.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4 [Sesamum indicum] Length = 299 Score = 377 bits (967), Expect = e-101 Identities = 184/300 (61%), Positives = 226/300 (75%), Gaps = 1/300 (0%) Frame = +2 Query: 23 MDSESLAASKA-NXXXXXXXXXXXXXXXXXXXVLPKVEKIENDIDAARAQDLLRRFNESS 199 MD +SLAAS N VLPK EK+E+D++ A+A+ LLRR NE+S Sbjct: 1 MDPDSLAASSTTNAPRKVRFAPKAPPKREQKPVLPKAEKVESDLEEAKAEQLLRRLNEAS 60 Query: 200 MKAKPKFERKVGHNQIAFGYGGSSTTLKSYGAANRINRKPGSSSDGGCVEQRVEKEYKEP 379 +K KPK ERK G Q+AFGYGGSS +L+SYG INR PGSSSDGG +QR+EKEYKEP Sbjct: 61 LKGKPKVERKAGPTQVAFGYGGSSNSLRSYGVKKNINRIPGSSSDGGA-DQRIEKEYKEP 119 Query: 380 WNYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSIDDEYATNPALKLGLMEENLEDSM 559 W+YY+YYP LPLRRPYSGNPELLD+EEF +D S DE A N AL+LGL++EN+E+++ Sbjct: 120 WDYYTYYPTTLPLRRPYSGNPELLDEEEFAEDPQNSTYDESAENSALELGLLDENMEETI 179 Query: 560 FFVQLPTAMPMTKPCNNAEGREQGSNTNPVKGARPSQKPCGMEALPAGFMGKMLVYRSGA 739 FF+QLP+ +P TK NAE E NP KGA SQKPC +E LPAG MGKMLVYRSGA Sbjct: 180 FFLQLPSILPTTKQSTNAEVPEADKKANPGKGAEASQKPCRLEDLPAGLMGKMLVYRSGA 239 Query: 740 VKLKLGDTLYDVSAGLECVFAHDVVALNTEEKHCCSVGELNKRVVITPDVDSILDSMSDL 919 +KLKLGDTLYDVSAGL CVF DVVA+NT++KHCC++GE++KR +ITPD DS+LD+++DL Sbjct: 240 IKLKLGDTLYDVSAGLNCVFGQDVVAINTDDKHCCNMGEISKRAIITPDTDSMLDAIADL 299 >ref|XP_012851133.1| PREDICTED: DNA-directed RNA polymerase III subunit rpc4-like [Erythranthe guttatus] Length = 294 Score = 327 bits (839), Expect = 7e-87 Identities = 167/269 (62%), Positives = 204/269 (75%), Gaps = 1/269 (0%) Frame = +2 Query: 116 VLPKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYGA 295 VLPKVEK+E D +A++L+RR+NESSM K K ERKV Q+AFG+GGSS L+SYGA Sbjct: 31 VLPKVEKVEEVEDDIKAEELMRRYNESSMNRKTKAERKVAPVQVAFGFGGSSNALRSYGA 90 Query: 296 ANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDD 475 I + GSS+DG + VEKEYKEPW+YY+YYP+ +PLRRPYSGNPELLD+ EFE + Sbjct: 91 HKGIKKNLGSSNDGTAIN--VEKEYKEPWDYYTYYPVTVPLRRPYSGNPELLDEGEFEKE 148 Query: 476 STRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGS-NTNPVK 652 DE ATN A +LGL+EEN E++MF ++ P +PM K + AE RE G+ N K Sbjct: 149 PDY---DENATNDAAELGLVEENAENNMFLLKFPENLPMVKQPDRAEAREPGNIPKNTQK 205 Query: 653 GARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTEE 832 GA QK C +E LPAGFMGKMLVY+SGAVKLKLGDTLYDVSAGL+CVFA +VVA+N EE Sbjct: 206 GAGKPQKTCNLEELPAGFMGKMLVYKSGAVKLKLGDTLYDVSAGLDCVFAQEVVAVNAEE 265 Query: 833 KHCCSVGELNKRVVITPDVDSILDSMSDL 919 K CCSVGEL+KR +TPD+DS+L +MSDL Sbjct: 266 KKCCSVGELHKRASVTPDIDSVLKAMSDL 294 >ref|XP_009631588.1| PREDICTED: uncharacterized protein LOC104121328 [Nicotiana tomentosiformis] gi|697154708|ref|XP_009631589.1| PREDICTED: uncharacterized protein LOC104121328 [Nicotiana tomentosiformis] gi|697154710|ref|XP_009631590.1| PREDICTED: uncharacterized protein LOC104121328 [Nicotiana tomentosiformis] Length = 289 Score = 325 bits (833), Expect = 3e-86 Identities = 168/297 (56%), Positives = 208/297 (70%), Gaps = 1/297 (0%) Frame = +2 Query: 23 MDSESLAASKANXXXXXXXXXXXXXXXXXXXVLPKVEKIENDIDAARAQDLLRRFNESSM 202 MDS+ LA++ VLPK E IE D+DAA+A++L++RFNE S Sbjct: 1 MDSDPLASNTTKAPRKVRFAPKGPPRRAQKIVLPKPENIEEDVDAAKAEELMQRFNEVSA 60 Query: 203 KAKPKFERKVGHNQIAFGYGGSSTTLKSYGAANRINRKPGSSSDGGCVEQRVEKEYKEPW 382 K KPK E+K G Q+AFGYGGSS+ LKSYG + S S+GG Q+V+KEY EPW Sbjct: 61 KVKPKTEKK-GPTQVAFGYGGSSSALKSYGPLKGHKKVDSSMSNGGTGVQQVQKEYTEPW 119 Query: 383 NYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSIDDEYATNPALKLGLMEENLEDSMF 562 +YY+ YP+ LPLRRPYSGNPELLD++EF + S DE + PA++LGLMEENLE+ MF Sbjct: 120 DYYTNYPMTLPLRRPYSGNPELLDEQEFREASQSLSYDENSIKPAMELGLMEENLEEKMF 179 Query: 563 FVQLPTAMPMTKPCNNAEGREQGSNTNPVKGARPSQ-KPCGMEALPAGFMGKMLVYRSGA 739 F+QLPTAMPM K EG E S +RPS+ K M LP GFMGKMLVY+SGA Sbjct: 180 FIQLPTAMPMLKQSVKTEGSEASS-------SRPSKVKAYSMNELPRGFMGKMLVYKSGA 232 Query: 740 VKLKLGDTLYDVSAGLECVFAHDVVALNTEEKHCCSVGELNKRVVITPDVDSILDSM 910 VKLKLG+TLYDVS G++C FA DVVA+NTEEKHC ++GEL KR+++TPDVDSILDS+ Sbjct: 233 VKLKLGETLYDVSPGMDCAFAQDVVAVNTEEKHCSNIGELTKRIIVTPDVDSILDSI 289 >ref|XP_010656128.1| PREDICTED: uncharacterized protein LOC100256088 [Vitis vinifera] Length = 315 Score = 314 bits (804), Expect = 8e-83 Identities = 164/285 (57%), Positives = 205/285 (71%), Gaps = 17/285 (5%) Frame = +2 Query: 116 VLPKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYGA 295 V+PK E E+D DAA+A +L+R FNE+SMK KPK E+K+ Q+AFGYGG+S +++SYG Sbjct: 31 VVPKSEVAEDD-DAAQANELMRHFNEASMKGKPKAEKKLAPTQVAFGYGGASASIRSYGT 89 Query: 296 ---ANRINRKPGSSSDGGCVEQRVE--KEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKE 460 A +R +S GG + KEYKEPW+YY+YYP+ LPLRRPYSGNPELLD+E Sbjct: 90 PRGATNSSRYQDPASGGGLYGSGLSDHKEYKEPWDYYTYYPVTLPLRRPYSGNPELLDEE 149 Query: 461 EFEDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNT 640 EF + S + DE +TNPA++LGLM+EN E SM F+QLP MPM K AE +E S++ Sbjct: 150 EFGEASESTAYDENSTNPAMELGLMDENQEASMLFLQLPATMPMIKQAATAEVKENASSS 209 Query: 641 NPVKGA------RPS------QKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAG 784 P + A +PS QK C +E LP+GFMGKMLVY+SGA+KLKLGDTLYDVS G Sbjct: 210 KPPEDAGQANRLKPSEGAGSIQKTCRLEELPSGFMGKMLVYKSGAIKLKLGDTLYDVSPG 269 Query: 785 LECVFAHDVVALNTEEKHCCSVGELNKRVVITPDVDSILDSMSDL 919 L+CVFA DVVA+NTE+K CC +GEL KR V+TPDVDS L SM DL Sbjct: 270 LDCVFAQDVVAINTEDKCCCVLGELKKRAVVTPDVDSALSSMDDL 314 >ref|XP_006362806.1| PREDICTED: uncharacterized protein LOC102600766 [Solanum tuberosum] Length = 283 Score = 310 bits (795), Expect = 8e-82 Identities = 157/265 (59%), Positives = 202/265 (76%) Frame = +2 Query: 116 VLPKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYGA 295 VLPK E +E D DAA+A++L++RFNE+S K K K E+K G Q+AFGYGGSS++LKSYG Sbjct: 29 VLPKPENVEADGDAAKAEELMQRFNEASAKVKHKVEKK-GPTQVAFGYGGSSSSLKSYGH 87 Query: 296 ANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDD 475 N+++ GS SDGG +RV+KEY EPW+YY+ YP+ LP+RRPYSGNPELLD+EEF + Sbjct: 88 YNKVS---GSMSDGGIGGERVQKEYTEPWDYYTNYPVTLPVRRPYSGNPELLDEEEFGEA 144 Query: 476 STRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKG 655 S DE + PA+ LGLMEE+LE+ MF VQLPT MPM K EG E +++ P K Sbjct: 145 SRSLTYDENSIKPAMDLGLMEESLEEKMFLVQLPT-MPMLKQSIKTEGSEMANSSKPSKA 203 Query: 656 ARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTEEK 835 K C + LPAGFMGKMLVY+SGAVKLKLG+TL+++S G++C FA DVVA+NTEEK Sbjct: 204 -----KACSLNELPAGFMGKMLVYKSGAVKLKLGETLFNLSPGMDCSFAQDVVAVNTEEK 258 Query: 836 HCCSVGELNKRVVITPDVDSILDSM 910 +C ++GEL KR++ITPDVDS+LDS+ Sbjct: 259 YCSNIGELTKRIIITPDVDSLLDSI 283 >ref|XP_010321121.1| PREDICTED: uncharacterized protein LOC101251183 isoform X1 [Solanum lycopersicum] Length = 284 Score = 307 bits (787), Expect = 7e-81 Identities = 155/264 (58%), Positives = 199/264 (75%) Frame = +2 Query: 116 VLPKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYGA 295 VLPK E +E D+DAA+A++L++RFNE+S K K K E+K G Q+AFGYGGSS++LKSYG Sbjct: 30 VLPKTENVEADVDAAKAEELMQRFNEASAKIKHKVEKK-GPTQVAFGYGGSSSSLKSYGH 88 Query: 296 ANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDD 475 +++ GS SDGG +RV+KEY EPW+YY+ YP+ LP+RRPYSGNPELLD+EEF + Sbjct: 89 YTKVS---GSMSDGGINGERVQKEYTEPWDYYTNYPVTLPVRRPYSGNPELLDEEEFGEA 145 Query: 476 STRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKG 655 S DE + PA+ LGLMEENLE+ MF VQLPT MPM K EG E +++ K Sbjct: 146 SQSLTYDENSIKPAMDLGLMEENLEEKMFLVQLPT-MPMLKQSIKTEGSEMANSSKTSKA 204 Query: 656 ARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTEEK 835 K C + LPAG MGK+LVY+SGAVKLKLG+TL++VS G++C FA DVVA+NTEEK Sbjct: 205 -----KACSLNELPAGLMGKLLVYKSGAVKLKLGETLFNVSPGMDCSFAQDVVAVNTEEK 259 Query: 836 HCCSVGELNKRVVITPDVDSILDS 907 +C ++GEL KR++ITPDVDS+LDS Sbjct: 260 YCSNIGELTKRIIITPDVDSLLDS 283 >ref|XP_007012676.1| DNA-directed RNA polymerase III subunit RPC4, putative [Theobroma cacao] gi|508783039|gb|EOY30295.1| DNA-directed RNA polymerase III subunit RPC4, putative [Theobroma cacao] Length = 294 Score = 300 bits (769), Expect = 9e-79 Identities = 150/271 (55%), Positives = 196/271 (72%), Gaps = 5/271 (1%) Frame = +2 Query: 122 PKVE---KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYG 292 PK+E ++ D DA +A+DLL+R N++S K KPK E+KV +Q+AFG+GG+S ++K +G Sbjct: 26 PKLEVKTEVVEDTDAVQARDLLQRLNQTSAKTKPKVEKKVASSQVAFGHGGASASMKLFG 85 Query: 293 AANRINRKPGSSSDGGCVEQ--RVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEF 466 + +R + +G R EKEY+EPW+YYSYYP+ LP+RRPYSGNPE LD+EEF Sbjct: 86 VSKGASRTSRETLNGVVHTPGLREEKEYREPWDYYSYYPVTLPMRRPYSGNPEFLDEEEF 145 Query: 467 EDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNP 646 ++ DE + PA++LGLM+ENLE SMFF+QLP +PM K G E S++ P Sbjct: 146 ASENITF--DENSVEPAVELGLMDENLEPSMFFLQLPPTLPMIKQSGTTAGLEVDSSSKP 203 Query: 647 VKGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNT 826 +K CG+E LPAG MGKMLV++SGAVKLKLGDTLYDV+ GL CVFA DVVA+NT Sbjct: 204 AARVGSVKKTCGLEELPAGLMGKMLVHKSGAVKLKLGDTLYDVTPGLNCVFAQDVVAVNT 263 Query: 827 EEKHCCSVGELNKRVVITPDVDSILDSMSDL 919 EK CC VGEL+KR V+TPDVDS+L+SM+DL Sbjct: 264 AEKQCCVVGELDKRAVLTPDVDSVLNSMADL 294 >ref|XP_010049526.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4 isoform X1 [Eucalyptus grandis] Length = 304 Score = 294 bits (752), Expect = 8e-77 Identities = 153/267 (57%), Positives = 189/267 (70%), Gaps = 7/267 (2%) Frame = +2 Query: 140 ENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYGAA---NRIN 310 ++D D A+A+DLLR FNE +K KPK ERKV +QIAFGYGG+S +LKSY N +N Sbjct: 41 DDDDDEAKAKDLLRHFNEGILKEKPKIERKVAPSQIAFGYGGTSASLKSYHVQKDQNNVN 100 Query: 311 RKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSI 490 G+SS G R KEY+EPW+YYSYYP+ LPLRRPYSGNPELLD+EEF + + Sbjct: 101 SYQGTSSGPGL---RGMKEYREPWDYYSYYPVTLPLRRPYSGNPELLDEEEFGEAPSSVT 157 Query: 491 DDEYATNPALKLGLMEE----NLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKGA 658 +E N A++L LM+ +LE SMFF+QLP +PM K A G E ++ Sbjct: 158 YNENILNTAMELDLMDSQRDGSLEPSMFFIQLPPTVPMAKRSTTAAGNETTESSTSSNVL 217 Query: 659 RPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTEEKH 838 +K C ++ LPAG MGKMLVYRSGAVKLKLGDTLYDVS+GL+CVFA DVVA+N EKH Sbjct: 218 GSLEKSCSLDELPAGLMGKMLVYRSGAVKLKLGDTLYDVSSGLDCVFAQDVVAVNRTEKH 277 Query: 839 CCSVGELNKRVVITPDVDSILDSMSDL 919 C VGELNKR ++TPDVDS+L+SMS+L Sbjct: 278 FCVVGELNKRAILTPDVDSVLESMSEL 304 >ref|XP_010049527.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4 isoform X2 [Eucalyptus grandis] gi|629117484|gb|KCW82159.1| hypothetical protein EUGRSUZ_C03553 [Eucalyptus grandis] gi|629117485|gb|KCW82160.1| hypothetical protein EUGRSUZ_C03553 [Eucalyptus grandis] Length = 304 Score = 294 bits (752), Expect = 8e-77 Identities = 153/267 (57%), Positives = 189/267 (70%), Gaps = 7/267 (2%) Frame = +2 Query: 140 ENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYGAA---NRIN 310 ++D D A+A+DLLR FNE +K KPK ERKV +QIAFGYGG+S +LKSY N +N Sbjct: 41 DDDDDEAKAKDLLRHFNEGILKEKPKIERKVAPSQIAFGYGGTSASLKSYHVQKDQNNVN 100 Query: 311 RKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSI 490 G+SS G R KEY+EPW+YYSYYP+ LPLRRPYSGNPELLD+EEF + + Sbjct: 101 SYQGTSSGPGL---RGMKEYREPWDYYSYYPVTLPLRRPYSGNPELLDEEEFGEAPSSVT 157 Query: 491 DDEYATNPALKLGLMEE----NLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKGA 658 +E N A++L LM+ +LE SMFF+QLP +PM K A G E ++ Sbjct: 158 YNENILNTAMELDLMDSQRDGSLEPSMFFIQLPPTVPMAKRSTTAAGNETTESSTSSNVL 217 Query: 659 RPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTEEKH 838 +K C ++ LPAG MGKMLVYRSGAVKLKLGDTLYDVS+GL+CVFA DVVA+N EKH Sbjct: 218 GSLEKSCSLDELPAGLMGKMLVYRSGAVKLKLGDTLYDVSSGLDCVFAQDVVAVNRTEKH 277 Query: 839 CCSVGELNKRVVITPDVDSILDSMSDL 919 C VGELNKR ++TPDVDS+L+SMS+L Sbjct: 278 FCVVGELNKRAILTPDVDSVLESMSEL 304 >gb|KJB83020.1| hypothetical protein B456_013G225500 [Gossypium raimondii] Length = 276 Score = 293 bits (749), Expect = 2e-76 Identities = 146/271 (53%), Positives = 194/271 (71%), Gaps = 5/271 (1%) Frame = +2 Query: 122 PKVE---KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYG 292 PK+E ++ DIDA +A+DLL+R N++S + KPK E+KV +Q+AFG+ G ++K++G Sbjct: 13 PKLEVKTEVVEDIDAVQARDLLQRLNQTSARTKPKVEKKVSSSQVAFGFVGGGASIKTFG 72 Query: 293 AANRINRKPGSSSDGGCVEQ--RVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEF 466 + N + G + GG RVEKEYKEPW+YYSYYPL LP+RRPYSGNPE LD+EEF Sbjct: 73 TSRGANHRSGETFGGGVRGPGLRVEKEYKEPWDYYSYYPLTLPMRRPYSGNPEFLDEEEF 132 Query: 467 EDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNP 646 + DE + PA+ LGLMEENLE M F+QLP +P+ K G E S+T Sbjct: 133 AAQNVAY--DENSIEPAVGLGLMEENLEPMMLFLQLPPTLPIIKA-----GHEGASSTGS 185 Query: 647 VKGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNT 826 + R ++K CG+ LPAG MGKMLVY+SGAVKLKLGDT+YDV+ GL CVFA DVVA++T Sbjct: 186 SRTVRSAKKTCGLTELPAGLMGKMLVYKSGAVKLKLGDTIYDVNPGLSCVFAQDVVAVDT 245 Query: 827 EEKHCCSVGELNKRVVITPDVDSILDSMSDL 919 +K CC VGE+NK V++TPD+DS+L+S+S+L Sbjct: 246 AKKQCCVVGEVNKHVIVTPDMDSVLNSLSEL 276 >ref|XP_012461771.1| PREDICTED: uncharacterized protein LOC105781790 [Gossypium raimondii] gi|763816166|gb|KJB83018.1| hypothetical protein B456_013G225500 [Gossypium raimondii] Length = 289 Score = 293 bits (749), Expect = 2e-76 Identities = 146/271 (53%), Positives = 194/271 (71%), Gaps = 5/271 (1%) Frame = +2 Query: 122 PKVE---KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYG 292 PK+E ++ DIDA +A+DLL+R N++S + KPK E+KV +Q+AFG+ G ++K++G Sbjct: 26 PKLEVKTEVVEDIDAVQARDLLQRLNQTSARTKPKVEKKVSSSQVAFGFVGGGASIKTFG 85 Query: 293 AANRINRKPGSSSDGGCVEQ--RVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEF 466 + N + G + GG RVEKEYKEPW+YYSYYPL LP+RRPYSGNPE LD+EEF Sbjct: 86 TSRGANHRSGETFGGGVRGPGLRVEKEYKEPWDYYSYYPLTLPMRRPYSGNPEFLDEEEF 145 Query: 467 EDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNP 646 + DE + PA+ LGLMEENLE M F+QLP +P+ K G E S+T Sbjct: 146 AAQNVAY--DENSIEPAVGLGLMEENLEPMMLFLQLPPTLPIIKA-----GHEGASSTGS 198 Query: 647 VKGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNT 826 + R ++K CG+ LPAG MGKMLVY+SGAVKLKLGDT+YDV+ GL CVFA DVVA++T Sbjct: 199 SRTVRSAKKTCGLTELPAGLMGKMLVYKSGAVKLKLGDTIYDVNPGLSCVFAQDVVAVDT 258 Query: 827 EEKHCCSVGELNKRVVITPDVDSILDSMSDL 919 +K CC VGE+NK V++TPD+DS+L+S+S+L Sbjct: 259 AKKQCCVVGEVNKHVIVTPDMDSVLNSLSEL 289 >ref|XP_010270534.1| PREDICTED: uncharacterized protein LOC104606837 [Nelumbo nucifera] Length = 302 Score = 290 bits (743), Expect = 9e-76 Identities = 149/269 (55%), Positives = 187/269 (69%), Gaps = 4/269 (1%) Frame = +2 Query: 122 PKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYGAAN 301 PK+E++E D+ A + ++LL R NESS+ +PK ERK G Q+AFG+G SS SYG++ Sbjct: 33 PKIEEVE-DVKAFQTRELLXRVNESSVNGRPKMERKSGPAQVAFGFGPSSNYFMSYGSSK 91 Query: 302 --RINRKPGSSSDGGCVE--QRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFE 469 ++ G SD G +R+EKEYKEPW+YYSYYP LPLRRPYSG+P LLD EEF Sbjct: 92 VGSSSKYQGLGSDDGVHSSARRMEKEYKEPWDYYSYYPAALPLRRPYSGDPVLLDDEEFG 151 Query: 470 DDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPV 649 + S DE + A +L LMEEN E M F+QLP+++P+ K S Sbjct: 152 EASEEIAYDESSVKLATELDLMEENKEARMIFLQLPSSLPLVKRSATTNNDGTNSGLKQF 211 Query: 650 KGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTE 829 +G S+KPC +E LP GFMGKMLVY SGA+KLKLGDTLYDVS+G+ CVFA DVVA+NTE Sbjct: 212 RGGVSSEKPCKLEELPVGFMGKMLVYESGAIKLKLGDTLYDVSSGMNCVFAQDVVAINTE 271 Query: 830 EKHCCSVGELNKRVVITPDVDSILDSMSD 916 EKHCC +GELNKR VITP++DSIL+SM D Sbjct: 272 EKHCCILGELNKRAVITPNIDSILNSMID 300 >gb|KHG12357.1| DNA-directed RNA polymerase III subunit RPC4 [Gossypium arboreum] Length = 288 Score = 288 bits (738), Expect = 3e-75 Identities = 146/271 (53%), Positives = 195/271 (71%), Gaps = 5/271 (1%) Frame = +2 Query: 122 PKVE---KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYG 292 PK+E ++ DIDA +A+DLL+R N++S + KPK E+KV +Q+AFG+GG ++ +K++G Sbjct: 26 PKLEVKTEVVEDIDAVQARDLLQRLNQTSARTKPKVEKKVSSSQVAFGFGGGAS-IKTFG 84 Query: 293 AANRINRKPGSSSDGGCVEQ--RVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEF 466 + N G + GG RVEKEYKEPW+YYSYYPL LP+RRPYSG+PE LD+EEF Sbjct: 85 TSKGANHSSGETFGGGVHGSGLRVEKEYKEPWDYYSYYPLTLPMRRPYSGSPEFLDEEEF 144 Query: 467 EDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNP 646 + DE + PA+ LGLMEENLE M F+QLP +P+ K G E S++ Sbjct: 145 AAQNVAY--DENSIEPAVGLGLMEENLEPMMLFLQLPPTLPIIKA-----GHEGASSSGS 197 Query: 647 VKGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNT 826 + R ++K CG+ LPAG MGKMLVY+SGAVKLKLGDT+YDV+ GL CVFA DVVA++T Sbjct: 198 SRTVRSAKKTCGLTELPAGLMGKMLVYKSGAVKLKLGDTIYDVNPGLSCVFAQDVVAVDT 257 Query: 827 EEKHCCSVGELNKRVVITPDVDSILDSMSDL 919 +K CC VGE+NK VV+TPD+DS+L+S+S+L Sbjct: 258 AKKQCCVVGEVNKHVVVTPDLDSVLNSLSEL 288 >ref|XP_007154594.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|593783109|ref|XP_007154595.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|561027948|gb|ESW26588.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|561027949|gb|ESW26589.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] Length = 291 Score = 288 bits (736), Expect = 6e-75 Identities = 145/269 (53%), Positives = 194/269 (72%), Gaps = 4/269 (1%) Frame = +2 Query: 125 KVEKIEN-DIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYG--- 292 K E +E+ DA +A+DLLRRFNES+MKA+ K E+KV +QIAFGYGG ST+LKSYG Sbjct: 30 KAEVVEDAQADANQAKDLLRRFNESAMKARNKVEKKVSASQIAFGYGGESTSLKSYGIGR 89 Query: 293 AANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFED 472 +N P S+S EKEY EPW+YYS YP+ LPLRRPYSGNPELLD+EEF + Sbjct: 90 GGRNVNINPNSTSSAVA-----EKEYTEPWDYYSNYPVTLPLRRPYSGNPELLDEEEFGE 144 Query: 473 DSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVK 652 + DE ATN A++LGL+EENLE +MF ++LP+ +P+ + G++ + + P Sbjct: 145 AAEARTYDEEATNSAMELGLLEENLEANMFLIKLPSKLPIISTADG--GKDVNAKSKPPV 202 Query: 653 GARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTEE 832 G + ++ C ++ LP+GFMGKMLVY+SG +KLKLG+TLYDVS+G+ C F+ DVVA+N E Sbjct: 203 GTKKGERLCELKDLPSGFMGKMLVYKSGKIKLKLGNTLYDVSSGMNCSFSQDVVAINKAE 262 Query: 833 KHCCSVGELNKRVVITPDVDSILDSMSDL 919 K CS+GE++K V ITPD+D ILD++SDL Sbjct: 263 KTLCSIGEISKHVTITPDIDDILDNLSDL 291 >ref|XP_012448093.1| PREDICTED: uncharacterized protein LOC105771231 isoform X1 [Gossypium raimondii] gi|763787464|gb|KJB54460.1| hypothetical protein B456_009G035100 [Gossypium raimondii] Length = 284 Score = 286 bits (732), Expect = 2e-74 Identities = 143/269 (53%), Positives = 188/269 (69%), Gaps = 3/269 (1%) Frame = +2 Query: 122 PKVE---KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYG 292 PK+E ++ D DA +A+DLL+R N+ S K KPK E+KV +Q+AFG+G ST++K++G Sbjct: 27 PKLEVKTEVVEDTDAVQARDLLQRLNQISAKTKPKVEKKVASSQVAFGFGAGSTSIKTFG 86 Query: 293 AANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFED 472 A+ PG R EKEYKEPW+YYSYYP+ LP+RRPYSGNPE LD+EEF Sbjct: 87 ASKGSVPTPGL---------REEKEYKEPWDYYSYYPVTLPMRRPYSGNPEFLDEEEFA- 136 Query: 473 DSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVK 652 + +E + PA++LGLMEEN E +MFF+QLP +PMTK N G E S + P Sbjct: 137 -LANATFEEDSVEPAVELGLMEENSEATMFFIQLPPTLPMTKQTGNISGNETNSRSKPAA 195 Query: 653 GARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTEE 832 ++K G+E LPAGFMGKMLVYRSGAVKLKLGD+LYDV+ G F+ DVVA+NT + Sbjct: 196 SVGSAKKTRGIEELPAGFMGKMLVYRSGAVKLKLGDSLYDVTPGCNSEFSQDVVAVNTGK 255 Query: 833 KHCCSVGELNKRVVITPDVDSILDSMSDL 919 KHCC VGE++KR ++TPDV S+ + ++DL Sbjct: 256 KHCCGVGEIDKRAILTPDVYSVFNYLTDL 284 >emb|CDP03102.1| unnamed protein product [Coffea canephora] Length = 262 Score = 286 bits (732), Expect = 2e-74 Identities = 154/299 (51%), Positives = 190/299 (63%) Frame = +2 Query: 23 MDSESLAASKANXXXXXXXXXXXXXXXXXXXVLPKVEKIENDIDAARAQDLLRRFNESSM 202 MD ESLA + N V+ K EK+E+ +DAA+A++LLRR NESS+ Sbjct: 1 MDPESLATTTTNAPRKVRFAPKVPPRRDQKTVVTKAEKVEDAVDAAQAEELLRRLNESSV 60 Query: 203 KAKPKFERKVGHNQIAFGYGGSSTTLKSYGAANRINRKPGSSSDGGCVEQRVEKEYKEPW 382 KPKFERK G GA +RVEKEYKEPW Sbjct: 61 NVKPKFERKAG------------------GAM-----------------RRVEKEYKEPW 85 Query: 383 NYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSIDDEYATNPALKLGLMEENLEDSMF 562 +YY+ YP+ LPLRRPYSG+PE LD+EEF++ S DE +TN A++LGL E +++M Sbjct: 86 DYYTNYPVTLPLRRPYSGDPEHLDQEEFDEASESLNYDECSTNAAVELGLTEG--KETML 143 Query: 563 FVQLPTAMPMTKPCNNAEGREQGSNTNPVKGARPSQKPCGMEALPAGFMGKMLVYRSGAV 742 F+QLP +MPM K N G E + P K QK C ++ LPAGFMGK+LVYRSGAV Sbjct: 144 FLQLPASMPMIKQLPNTAGSEMADTSKPTKSGELLQKSCSLDELPAGFMGKILVYRSGAV 203 Query: 743 KLKLGDTLYDVSAGLECVFAHDVVALNTEEKHCCSVGELNKRVVITPDVDSILDSMSDL 919 KLKLGD LYDVS GL+CVFA DVVA+N EEKHCC+VGEL+KRV+ITPDVDS+LD M+DL Sbjct: 204 KLKLGDNLYDVSVGLDCVFAQDVVAINDEEKHCCTVGELDKRVIITPDVDSMLDGMADL 262 >ref|XP_012448095.1| PREDICTED: uncharacterized protein LOC105771231 isoform X2 [Gossypium raimondii] gi|763787463|gb|KJB54459.1| hypothetical protein B456_009G035100 [Gossypium raimondii] Length = 273 Score = 285 bits (728), Expect = 5e-74 Identities = 140/262 (53%), Positives = 184/262 (70%) Frame = +2 Query: 134 KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYGAANRINR 313 ++ D DA +A+DLL+R N+ S K KPK E+KV +Q+AFG+G ST++K++GA+ Sbjct: 23 EVVEDTDAVQARDLLQRLNQISAKTKPKVEKKVASSQVAFGFGAGSTSIKTFGASKGSVP 82 Query: 314 KPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSID 493 PG R EKEYKEPW+YYSYYP+ LP+RRPYSGNPE LD+EEF + Sbjct: 83 TPGL---------REEKEYKEPWDYYSYYPVTLPMRRPYSGNPEFLDEEEFA--LANATF 131 Query: 494 DEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKGARPSQK 673 +E + PA++LGLMEEN E +MFF+QLP +PMTK N G E S + P ++K Sbjct: 132 EEDSVEPAVELGLMEENSEATMFFIQLPPTLPMTKQTGNISGNETNSRSKPAASVGSAKK 191 Query: 674 PCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTEEKHCCSVG 853 G+E LPAGFMGKMLVYRSGAVKLKLGD+LYDV+ G F+ DVVA+NT +KHCC VG Sbjct: 192 TRGIEELPAGFMGKMLVYRSGAVKLKLGDSLYDVTPGCNSEFSQDVVAVNTGKKHCCGVG 251 Query: 854 ELNKRVVITPDVDSILDSMSDL 919 E++KR ++TPDV S+ + ++DL Sbjct: 252 EIDKRAILTPDVYSVFNYLTDL 273 >gb|KJB54458.1| hypothetical protein B456_009G035100 [Gossypium raimondii] Length = 294 Score = 285 bits (728), Expect = 5e-74 Identities = 140/262 (53%), Positives = 184/262 (70%) Frame = +2 Query: 134 KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYGAANRINR 313 ++ D DA +A+DLL+R N+ S K KPK E+KV +Q+AFG+G ST++K++GA+ Sbjct: 44 EVVEDTDAVQARDLLQRLNQISAKTKPKVEKKVASSQVAFGFGAGSTSIKTFGASKGSVP 103 Query: 314 KPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSID 493 PG R EKEYKEPW+YYSYYP+ LP+RRPYSGNPE LD+EEF + Sbjct: 104 TPGL---------REEKEYKEPWDYYSYYPVTLPMRRPYSGNPEFLDEEEFA--LANATF 152 Query: 494 DEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKGARPSQK 673 +E + PA++LGLMEEN E +MFF+QLP +PMTK N G E S + P ++K Sbjct: 153 EEDSVEPAVELGLMEENSEATMFFIQLPPTLPMTKQTGNISGNETNSRSKPAASVGSAKK 212 Query: 674 PCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTEEKHCCSVG 853 G+E LPAGFMGKMLVYRSGAVKLKLGD+LYDV+ G F+ DVVA+NT +KHCC VG Sbjct: 213 TRGIEELPAGFMGKMLVYRSGAVKLKLGDSLYDVTPGCNSEFSQDVVAVNTGKKHCCGVG 272 Query: 854 ELNKRVVITPDVDSILDSMSDL 919 E++KR ++TPDV S+ + ++DL Sbjct: 273 EIDKRAILTPDVYSVFNYLTDL 294 >ref|XP_002516293.1| DNA binding protein, putative [Ricinus communis] gi|223544779|gb|EEF46295.1| DNA binding protein, putative [Ricinus communis] Length = 286 Score = 284 bits (727), Expect = 6e-74 Identities = 149/269 (55%), Positives = 190/269 (70%), Gaps = 4/269 (1%) Frame = +2 Query: 125 KVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYGAAN- 301 K EK E++ DA +A L+++F E SM+AKPK E+KV +QIAFG+G +S ++KSY A Sbjct: 30 KSEKAEDE-DATQAMKLMKQFQERSMRAKPKAEKKVQASQIAFGFGAASPSIKSYAAPKV 88 Query: 302 --RINRKPGSSSDGGCVEQRV-EKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFED 472 +N GSS +GG + EKEY EPWNYYSYYP+ LPLRRPYSGNP L+ EEF + Sbjct: 89 GAAVNHNQGSSVNGGAYSSELGEKEYIEPWNYYSYYPVTLPLRRPYSGNPATLNAEEFGE 148 Query: 473 DSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVK 652 S S DE +TN A+ LGLMEEN+E +MFF+QLP +PM K A+G + VK Sbjct: 149 ASDTSEYDENSTNSAINLGLMEENVEANMFFLQLPPTVPMIKRLATADGHK-------VK 201 Query: 653 GARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALNTEE 832 +K C ++ LPAG MGKMLVYRSGAVKLKLGDTLYDVS GL+ FA D+ A+NT E Sbjct: 202 ----EEKTCKLDELPAGHMGKMLVYRSGAVKLKLGDTLYDVSPGLDFAFAQDIAAINTAE 257 Query: 833 KHCCSVGELNKRVVITPDVDSILDSMSDL 919 KHCC V E++K ++TPDVD+I++SM+DL Sbjct: 258 KHCCVVAEIDKHAIVTPDVDAIINSMADL 286 >gb|KDO54063.1| hypothetical protein CISIN_1g022055mg [Citrus sinensis] Length = 303 Score = 283 bits (723), Expect = 2e-73 Identities = 145/272 (53%), Positives = 194/272 (71%), Gaps = 7/272 (2%) Frame = +2 Query: 125 KVEKIENDIDAARAQDLLRRFN--ESSMKAKPKFERKVGHNQIAFGYGGSSTTLKSYG-- 292 K E +EN DAA+A DLL+RFN + ++K +PK E+KV +QIAFG GG+ST +KSYG Sbjct: 33 KTEMVEN-ADAAQAMDLLQRFNANQGALKGRPKVEKKVAPSQIAFGQGGASTFIKSYGIP 91 Query: 293 -AANRINRKPGSSSDGGCVEQ--RVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEE 463 + +R GS+ +GG R+ KEY+EPW+YYSYYP+ LPLRRPYSG+PELLD+EE Sbjct: 92 KGGSSSSRGQGSAVNGGAHASGTRLGKEYQEPWDYYSYYPVSLPLRRPYSGSPELLDEEE 151 Query: 464 FEDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTN 643 F + S DE + NPA +LGLMEENLE +M F+QLP +P+ K R+ +++ Sbjct: 152 FGEASETINYDESSMNPAEELGLMEENLEPNMIFLQLPPTLPLKKQPATGNERQVNESSS 211 Query: 644 PVKGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLECVFAHDVVALN 823 +GA +K + LP GFMGK+LVYRSGAVKLKLGDT+Y+V+ G++C+FA DVV +N Sbjct: 212 KHEGATAKEKTSSLSELPGGFMGKLLVYRSGAVKLKLGDTVYNVTPGMDCMFAQDVVVIN 271 Query: 824 TEEKHCCSVGELNKRVVITPDVDSILDSMSDL 919 T EKH C GELNKR +++PDVD IL++ +DL Sbjct: 272 TAEKHFCVAGELNKRAILSPDVDFILNNFADL 303