BLASTX nr result
ID: Forsythia22_contig00019724
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00019724
         (1168 letters)

Database: ./nr
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                          (bits)  Value

ref|XP_011094009.1| PREDICTED: DNA-directed RNA polymerase III s...   379   e-102
ref|XP_012851133.1| PREDICTED: DNA-directed RNA polymerase III s...   330   1e-87
ref|XP_009631588.1| PREDICTED: uncharacterized protein LOC104121...   328   4e-87
ref|XP_010656128.1| PREDICTED: uncharacterized protein LOC100256...   317   8e-84
ref|XP_006362806.1| PREDICTED: uncharacterized protein LOC102600...   314   9e-83
ref|XP_010321121.1| PREDICTED: uncharacterized protein LOC101251...   311   8e-82
ref|XP_007012676.1| DNA-directed RNA polymerase III subunit RPC4...   301   8e-79
ref|XP_010049526.1| PREDICTED: DNA-directed RNA polymerase III s...   295   3e-77
ref|XP_010049527.1| PREDICTED: DNA-directed RNA polymerase III s...   295   3e-77
gb|KJB83020.1| hypothetical protein B456_013G225500 [Gossypium r...   293   2e-76
ref|XP_012461771.1| PREDICTED: uncharacterized protein LOC105781...   293   2e-76
ref|XP_010270534.1| PREDICTED: uncharacterized protein LOC104606...   291   5e-76
gb|KHG12357.1| DNA-directed RNA polymerase III subunit RPC4 [Gos...   288   4e-75
ref|XP_007154594.1| hypothetical protein PHAVU_003G132000g [Phas...   288   5e-75
emb|CDP03102.1| unnamed protein product [Coffea canephora]            288   7e-75
ref|XP_012448093.1| PREDICTED: uncharacterized protein LOC105771...   286   2e-74
ref|XP_002516293.1| DNA binding protein, putative [Ricinus commu...   286   3e-74
ref|XP_012448095.1| PREDICTED: uncharacterized protein LOC105771...   285   5e-74
gb|KJB54458.1| hypothetical protein B456_009G035100 [Gossypium r...
285 5e-74 ref|XP_007204524.1| hypothetical protein PRUPE_ppa017748mg [Prun... 285 6e-74 >ref|XP_011094009.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4 [Sesamum indicum] Length = 299 Score = 379 bits (973), Expect = e-102 Identities = 186/300 (62%), Positives = 229/300 (76%), Gaps = 1/300 (0%) Frame = -1 Query: 1042 MDSESLAASKA-NXXXXXXXXXXXXXXXXXXPVLPKVEKIENDIDAARAQDLLRRFNESS 866 MD +SLAAS N PVLPK EK+E+D++ A+A+ LLRR NE+S Sbjct: 1 MDPDSLAASSTTNAPRKVRFAPKAPPKREQKPVLPKAEKVESDLEEAKAEQLLRRLNEAS 60 Query: 865 MKAKPKFERKVGHTQIAFGYGGSSTTLKSYGAANRINRKPGSSSDGGCVEQRVEKEYKEP 686 +K KPK ERK G TQ+AFGYGGSS +L+SYG INR PGSSSDGG +QR+EKEYKEP Sbjct: 61 LKGKPKVERKAGPTQVAFGYGGSSNSLRSYGVKKNINRIPGSSSDGGA-DQRIEKEYKEP 119 Query: 685 WNYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSIDDEYATNPALKLGLMEENLEDSM 506 W+YY+YYP LPLRRPYSGNPELLD+EEF +D S DE A N AL+LGL++EN+E+++ Sbjct: 120 WDYYTYYPTTLPLRRPYSGNPELLDEEEFAEDPQNSTYDESAENSALELGLLDENMEETI 179 Query: 505 FFVQLPTAMPMTKPCNNAEGREQGSNTNPVKGARPSQKPCGMEALPAGFMGKMLVYRSGA 326 FF+QLP+ +P TK NAE E NP KGA SQKPC +E LPAG MGKMLVYRSGA Sbjct: 180 FFLQLPSILPTTKQSTNAEVPEADKKANPGKGAEASQKPCRLEDLPAGLMGKMLVYRSGA 239 Query: 325 VKLKLGDTLYDVSAGLDCVFAHDVVALNTEEKHCCSVGELNKRVVITPDVDSILDSMSDL 146 +KLKLGDTLYDVSAGL+CVF DVVA+NT++KHCC++GE++KR +ITPD DS+LD+++DL Sbjct: 240 IKLKLGDTLYDVSAGLNCVFGQDVVAINTDDKHCCNMGEISKRAIITPDTDSMLDAIADL 299 >ref|XP_012851133.1| PREDICTED: DNA-directed RNA polymerase III subunit rpc4-like [Erythranthe guttatus] Length = 294 Score = 330 bits (846), Expect = 1e-87 Identities = 168/269 (62%), Positives = 204/269 (75%), Gaps = 1/269 (0%) Frame = -1 Query: 949 VLPKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGA 770 VLPKVEK+E D +A++L+RR+NESSM K K ERKV Q+AFG+GGSS L+SYGA Sbjct: 31 VLPKVEKVEEVEDDIKAEELMRRYNESSMNRKTKAERKVAPVQVAFGFGGSSNALRSYGA 90 Query: 769 ANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDD 590 I + GSS+DG + VEKEYKEPW+YY+YYP+ +PLRRPYSGNPELLD+ EFE + Sbjct: 91 HKGIKKNLGSSNDGTAIN--VEKEYKEPWDYYTYYPVTVPLRRPYSGNPELLDEGEFEKE 148 
Query: 589 STRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGS-NTNPVK 413 DE ATN A +LGL+EEN E++MF ++ P +PM K + AE RE G+ N K Sbjct: 149 PDY---DENATNDAAELGLVEENAENNMFLLKFPENLPMVKQPDRAEAREPGNIPKNTQK 205 Query: 412 GARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEE 233 GA QK C +E LPAGFMGKMLVY+SGAVKLKLGDTLYDVSAGLDCVFA +VVA+N EE Sbjct: 206 GAGKPQKTCNLEELPAGFMGKMLVYKSGAVKLKLGDTLYDVSAGLDCVFAQEVVAVNAEE 265 Query: 232 KHCCSVGELNKRVVITPDVDSILDSMSDL 146 K CCSVGEL+KR +TPD+DS+L +MSDL Sbjct: 266 KKCCSVGELHKRASVTPDIDSVLKAMSDL 294 >ref|XP_009631588.1| PREDICTED: uncharacterized protein LOC104121328 [Nicotiana tomentosiformis] gi|697154708|ref|XP_009631589.1| PREDICTED: uncharacterized protein LOC104121328 [Nicotiana tomentosiformis] gi|697154710|ref|XP_009631590.1| PREDICTED: uncharacterized protein LOC104121328 [Nicotiana tomentosiformis] Length = 289 Score = 328 bits (842), Expect = 4e-87 Identities = 170/297 (57%), Positives = 209/297 (70%), Gaps = 1/297 (0%) Frame = -1 Query: 1042 MDSESLAASKANXXXXXXXXXXXXXXXXXXPVLPKVEKIENDIDAARAQDLLRRFNESSM 863 MDS+ LA++ VLPK E IE D+DAA+A++L++RFNE S Sbjct: 1 MDSDPLASNTTKAPRKVRFAPKGPPRRAQKIVLPKPENIEEDVDAAKAEELMQRFNEVSA 60 Query: 862 KAKPKFERKVGHTQIAFGYGGSSTTLKSYGAANRINRKPGSSSDGGCVEQRVEKEYKEPW 683 K KPK E+K G TQ+AFGYGGSS+ LKSYG + S S+GG Q+V+KEY EPW Sbjct: 61 KVKPKTEKK-GPTQVAFGYGGSSSALKSYGPLKGHKKVDSSMSNGGTGVQQVQKEYTEPW 119 Query: 682 NYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSIDDEYATNPALKLGLMEENLEDSMF 503 +YY+ YP+ LPLRRPYSGNPELLD++EF + S DE + PA++LGLMEENLE+ MF Sbjct: 120 DYYTNYPMTLPLRRPYSGNPELLDEQEFREASQSLSYDENSIKPAMELGLMEENLEEKMF 179 Query: 502 FVQLPTAMPMTKPCNNAEGREQGSNTNPVKGARPSQ-KPCGMEALPAGFMGKMLVYRSGA 326 F+QLPTAMPM K EG E S +RPS+ K M LP GFMGKMLVY+SGA Sbjct: 180 FIQLPTAMPMLKQSVKTEGSEASS-------SRPSKVKAYSMNELPRGFMGKMLVYKSGA 232 Query: 325 VKLKLGDTLYDVSAGLDCVFAHDVVALNTEEKHCCSVGELNKRVVITPDVDSILDSM 155 VKLKLG+TLYDVS G+DC FA DVVA+NTEEKHC ++GEL KR+++TPDVDSILDS+ Sbjct: 233 VKLKLGETLYDVSPGMDCAFAQDVVAVNTEEKHCSNIGELTKRIIVTPDVDSILDSI 289 
>ref|XP_010656128.1| PREDICTED: uncharacterized protein LOC100256088 [Vitis vinifera] Length = 315 Score = 317 bits (813), Expect = 8e-84 Identities = 166/285 (58%), Positives = 206/285 (72%), Gaps = 17/285 (5%) Frame = -1 Query: 949 VLPKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGA 770 V+PK E E+D DAA+A +L+R FNE+SMK KPK E+K+ TQ+AFGYGG+S +++SYG Sbjct: 31 VVPKSEVAEDD-DAAQANELMRHFNEASMKGKPKAEKKLAPTQVAFGYGGASASIRSYGT 89 Query: 769 ---ANRINRKPGSSSDGGCVEQRVE--KEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKE 605 A +R +S GG + KEYKEPW+YY+YYP+ LPLRRPYSGNPELLD+E Sbjct: 90 PRGATNSSRYQDPASGGGLYGSGLSDHKEYKEPWDYYTYYPVTLPLRRPYSGNPELLDEE 149 Query: 604 EFEDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNT 425 EF + S + DE +TNPA++LGLM+EN E SM F+QLP MPM K AE +E S++ Sbjct: 150 EFGEASESTAYDENSTNPAMELGLMDENQEASMLFLQLPATMPMIKQAATAEVKENASSS 209 Query: 424 NPVKGA------RPS------QKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAG 281 P + A +PS QK C +E LP+GFMGKMLVY+SGA+KLKLGDTLYDVS G Sbjct: 210 KPPEDAGQANRLKPSEGAGSIQKTCRLEELPSGFMGKMLVYKSGAIKLKLGDTLYDVSPG 269 Query: 280 LDCVFAHDVVALNTEEKHCCSVGELNKRVVITPDVDSILDSMSDL 146 LDCVFA DVVA+NTE+K CC +GEL KR V+TPDVDS L SM DL Sbjct: 270 LDCVFAQDVVAINTEDKCCCVLGELKKRAVVTPDVDSALSSMDDL 314 >ref|XP_006362806.1| PREDICTED: uncharacterized protein LOC102600766 [Solanum tuberosum] Length = 283 Score = 314 bits (804), Expect = 9e-83 Identities = 159/265 (60%), Positives = 203/265 (76%) Frame = -1 Query: 949 VLPKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGA 770 VLPK E +E D DAA+A++L++RFNE+S K K K E+K G TQ+AFGYGGSS++LKSYG Sbjct: 29 VLPKPENVEADGDAAKAEELMQRFNEASAKVKHKVEKK-GPTQVAFGYGGSSSSLKSYGH 87 Query: 769 ANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDD 590 N+++ GS SDGG +RV+KEY EPW+YY+ YP+ LP+RRPYSGNPELLD+EEF + Sbjct: 88 YNKVS---GSMSDGGIGGERVQKEYTEPWDYYTNYPVTLPVRRPYSGNPELLDEEEFGEA 144 Query: 589 STRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKG 410 S DE + PA+ LGLMEE+LE+ MF VQLPT MPM K EG E +++ P K Sbjct: 145 
SRSLTYDENSIKPAMDLGLMEESLEEKMFLVQLPT-MPMLKQSIKTEGSEMANSSKPSKA 203 Query: 409 ARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEEK 230 K C + LPAGFMGKMLVY+SGAVKLKLG+TL+++S G+DC FA DVVA+NTEEK Sbjct: 204 -----KACSLNELPAGFMGKMLVYKSGAVKLKLGETLFNLSPGMDCSFAQDVVAVNTEEK 258 Query: 229 HCCSVGELNKRVVITPDVDSILDSM 155 +C ++GEL KR++ITPDVDS+LDS+ Sbjct: 259 YCSNIGELTKRIIITPDVDSLLDSI 283 >ref|XP_010321121.1| PREDICTED: uncharacterized protein LOC101251183 isoform X1 [Solanum lycopersicum] Length = 284 Score = 311 bits (796), Expect = 8e-82 Identities = 157/264 (59%), Positives = 200/264 (75%) Frame = -1 Query: 949 VLPKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGA 770 VLPK E +E D+DAA+A++L++RFNE+S K K K E+K G TQ+AFGYGGSS++LKSYG Sbjct: 30 VLPKTENVEADVDAAKAEELMQRFNEASAKIKHKVEKK-GPTQVAFGYGGSSSSLKSYGH 88 Query: 769 ANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDD 590 +++ GS SDGG +RV+KEY EPW+YY+ YP+ LP+RRPYSGNPELLD+EEF + Sbjct: 89 YTKVS---GSMSDGGINGERVQKEYTEPWDYYTNYPVTLPVRRPYSGNPELLDEEEFGEA 145 Query: 589 STRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKG 410 S DE + PA+ LGLMEENLE+ MF VQLPT MPM K EG E +++ K Sbjct: 146 SQSLTYDENSIKPAMDLGLMEENLEEKMFLVQLPT-MPMLKQSIKTEGSEMANSSKTSKA 204 Query: 409 ARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEEK 230 K C + LPAG MGK+LVY+SGAVKLKLG+TL++VS G+DC FA DVVA+NTEEK Sbjct: 205 -----KACSLNELPAGLMGKLLVYKSGAVKLKLGETLFNVSPGMDCSFAQDVVAVNTEEK 259 Query: 229 HCCSVGELNKRVVITPDVDSILDS 158 +C ++GEL KR++ITPDVDS+LDS Sbjct: 260 YCSNIGELTKRIIITPDVDSLLDS 283 >ref|XP_007012676.1| DNA-directed RNA polymerase III subunit RPC4, putative [Theobroma cacao] gi|508783039|gb|EOY30295.1| DNA-directed RNA polymerase III subunit RPC4, putative [Theobroma cacao] Length = 294 Score = 301 bits (770), Expect = 8e-79 Identities = 150/271 (55%), Positives = 197/271 (72%), Gaps = 5/271 (1%) Frame = -1 Query: 943 PKVE---KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYG 773 PK+E ++ D DA +A+DLL+R N++S K KPK E+KV +Q+AFG+GG+S ++K +G Sbjct: 
26 PKLEVKTEVVEDTDAVQARDLLQRLNQTSAKTKPKVEKKVASSQVAFGHGGASASMKLFG 85 Query: 772 AANRINRKPGSSSDGGCVEQ--RVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEF 599 + +R + +G R EKEY+EPW+YYSYYP+ LP+RRPYSGNPE LD+EEF Sbjct: 86 VSKGASRTSRETLNGVVHTPGLREEKEYREPWDYYSYYPVTLPMRRPYSGNPEFLDEEEF 145 Query: 598 EDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNP 419 ++ DE + PA++LGLM+ENLE SMFF+QLP +PM K G E S++ P Sbjct: 146 ASENITF--DENSVEPAVELGLMDENLEPSMFFLQLPPTLPMIKQSGTTAGLEVDSSSKP 203 Query: 418 VKGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNT 239 +K CG+E LPAG MGKMLV++SGAVKLKLGDTLYDV+ GL+CVFA DVVA+NT Sbjct: 204 AARVGSVKKTCGLEELPAGLMGKMLVHKSGAVKLKLGDTLYDVTPGLNCVFAQDVVAVNT 263 Query: 238 EEKHCCSVGELNKRVVITPDVDSILDSMSDL 146 EK CC VGEL+KR V+TPDVDS+L+SM+DL Sbjct: 264 AEKQCCVVGELDKRAVLTPDVDSVLNSMADL 294 >ref|XP_010049526.1| PREDICTED: DNA-directed RNA polymerase III subunit RPC4 isoform X1 [Eucalyptus grandis] Length = 304 Score = 295 bits (756), Expect = 3e-77 Identities = 154/267 (57%), Positives = 189/267 (70%), Gaps = 7/267 (2%) Frame = -1 Query: 925 ENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGAA---NRIN 755 ++D D A+A+DLLR FNE +K KPK ERKV +QIAFGYGG+S +LKSY N +N Sbjct: 41 DDDDDEAKAKDLLRHFNEGILKEKPKIERKVAPSQIAFGYGGTSASLKSYHVQKDQNNVN 100 Query: 754 RKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSI 575 G+SS G R KEY+EPW+YYSYYP+ LPLRRPYSGNPELLD+EEF + + Sbjct: 101 SYQGTSSGPGL---RGMKEYREPWDYYSYYPVTLPLRRPYSGNPELLDEEEFGEAPSSVT 157 Query: 574 DDEYATNPALKLGLMEE----NLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKGA 407 +E N A++L LM+ +LE SMFF+QLP +PM K A G E ++ Sbjct: 158 YNENILNTAMELDLMDSQRDGSLEPSMFFIQLPPTVPMAKRSTTAAGNETTESSTSSNVL 217 Query: 406 RPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEEKH 227 +K C ++ LPAG MGKMLVYRSGAVKLKLGDTLYDVS+GLDCVFA DVVA+N EKH Sbjct: 218 GSLEKSCSLDELPAGLMGKMLVYRSGAVKLKLGDTLYDVSSGLDCVFAQDVVAVNRTEKH 277 Query: 226 CCSVGELNKRVVITPDVDSILDSMSDL 146 C VGELNKR ++TPDVDS+L+SMS+L Sbjct: 278 FCVVGELNKRAILTPDVDSVLESMSEL 304 >ref|XP_010049527.1| PREDICTED: 
DNA-directed RNA polymerase III subunit RPC4 isoform X2 [Eucalyptus grandis] gi|629117484|gb|KCW82159.1| hypothetical protein EUGRSUZ_C03553 [Eucalyptus grandis] gi|629117485|gb|KCW82160.1| hypothetical protein EUGRSUZ_C03553 [Eucalyptus grandis] Length = 304 Score = 295 bits (756), Expect = 3e-77 Identities = 154/267 (57%), Positives = 189/267 (70%), Gaps = 7/267 (2%) Frame = -1 Query: 925 ENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGAA---NRIN 755 ++D D A+A+DLLR FNE +K KPK ERKV +QIAFGYGG+S +LKSY N +N Sbjct: 41 DDDDDEAKAKDLLRHFNEGILKEKPKIERKVAPSQIAFGYGGTSASLKSYHVQKDQNNVN 100 Query: 754 RKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSI 575 G+SS G R KEY+EPW+YYSYYP+ LPLRRPYSGNPELLD+EEF + + Sbjct: 101 SYQGTSSGPGL---RGMKEYREPWDYYSYYPVTLPLRRPYSGNPELLDEEEFGEAPSSVT 157 Query: 574 DDEYATNPALKLGLMEE----NLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKGA 407 +E N A++L LM+ +LE SMFF+QLP +PM K A G E ++ Sbjct: 158 YNENILNTAMELDLMDSQRDGSLEPSMFFIQLPPTVPMAKRSTTAAGNETTESSTSSNVL 217 Query: 406 RPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEEKH 227 +K C ++ LPAG MGKMLVYRSGAVKLKLGDTLYDVS+GLDCVFA DVVA+N EKH Sbjct: 218 GSLEKSCSLDELPAGLMGKMLVYRSGAVKLKLGDTLYDVSSGLDCVFAQDVVAVNRTEKH 277 Query: 226 CCSVGELNKRVVITPDVDSILDSMSDL 146 C VGELNKR ++TPDVDS+L+SMS+L Sbjct: 278 FCVVGELNKRAILTPDVDSVLESMSEL 304 >gb|KJB83020.1| hypothetical protein B456_013G225500 [Gossypium raimondii] Length = 276 Score = 293 bits (749), Expect = 2e-76 Identities = 146/271 (53%), Positives = 194/271 (71%), Gaps = 5/271 (1%) Frame = -1 Query: 943 PKVE---KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYG 773 PK+E ++ DIDA +A+DLL+R N++S + KPK E+KV +Q+AFG+ G ++K++G Sbjct: 13 PKLEVKTEVVEDIDAVQARDLLQRLNQTSARTKPKVEKKVSSSQVAFGFVGGGASIKTFG 72 Query: 772 AANRINRKPGSSSDGGCVEQ--RVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEF 599 + N + G + GG RVEKEYKEPW+YYSYYPL LP+RRPYSGNPE LD+EEF Sbjct: 73 TSRGANHRSGETFGGGVRGPGLRVEKEYKEPWDYYSYYPLTLPMRRPYSGNPEFLDEEEF 132 Query: 598 EDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNP 419 
+ DE + PA+ LGLMEENLE M F+QLP +P+ K G E S+T Sbjct: 133 AAQNVAY--DENSIEPAVGLGLMEENLEPMMLFLQLPPTLPIIKA-----GHEGASSTGS 185 Query: 418 VKGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNT 239 + R ++K CG+ LPAG MGKMLVY+SGAVKLKLGDT+YDV+ GL CVFA DVVA++T Sbjct: 186 SRTVRSAKKTCGLTELPAGLMGKMLVYKSGAVKLKLGDTIYDVNPGLSCVFAQDVVAVDT 245 Query: 238 EEKHCCSVGELNKRVVITPDVDSILDSMSDL 146 +K CC VGE+NK V++TPD+DS+L+S+S+L Sbjct: 246 AKKQCCVVGEVNKHVIVTPDMDSVLNSLSEL 276 >ref|XP_012461771.1| PREDICTED: uncharacterized protein LOC105781790 [Gossypium raimondii] gi|763816166|gb|KJB83018.1| hypothetical protein B456_013G225500 [Gossypium raimondii] Length = 289 Score = 293 bits (749), Expect = 2e-76 Identities = 146/271 (53%), Positives = 194/271 (71%), Gaps = 5/271 (1%) Frame = -1 Query: 943 PKVE---KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYG 773 PK+E ++ DIDA +A+DLL+R N++S + KPK E+KV +Q+AFG+ G ++K++G Sbjct: 26 PKLEVKTEVVEDIDAVQARDLLQRLNQTSARTKPKVEKKVSSSQVAFGFVGGGASIKTFG 85 Query: 772 AANRINRKPGSSSDGGCVEQ--RVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEF 599 + N + G + GG RVEKEYKEPW+YYSYYPL LP+RRPYSGNPE LD+EEF Sbjct: 86 TSRGANHRSGETFGGGVRGPGLRVEKEYKEPWDYYSYYPLTLPMRRPYSGNPEFLDEEEF 145 Query: 598 EDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNP 419 + DE + PA+ LGLMEENLE M F+QLP +P+ K G E S+T Sbjct: 146 AAQNVAY--DENSIEPAVGLGLMEENLEPMMLFLQLPPTLPIIKA-----GHEGASSTGS 198 Query: 418 VKGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNT 239 + R ++K CG+ LPAG MGKMLVY+SGAVKLKLGDT+YDV+ GL CVFA DVVA++T Sbjct: 199 SRTVRSAKKTCGLTELPAGLMGKMLVYKSGAVKLKLGDTIYDVNPGLSCVFAQDVVAVDT 258 Query: 238 EEKHCCSVGELNKRVVITPDVDSILDSMSDL 146 +K CC VGE+NK V++TPD+DS+L+S+S+L Sbjct: 259 AKKQCCVVGEVNKHVIVTPDMDSVLNSLSEL 289 >ref|XP_010270534.1| PREDICTED: uncharacterized protein LOC104606837 [Nelumbo nucifera] Length = 302 Score = 291 bits (746), Expect = 5e-76 Identities = 149/269 (55%), Positives = 188/269 (69%), Gaps = 4/269 (1%) Frame = -1 Query: 943 
PKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGAAN 764 PK+E++E D+ A + ++LL R NESS+ +PK ERK G Q+AFG+G SS SYG++ Sbjct: 33 PKIEEVE-DVKAFQTRELLXRVNESSVNGRPKMERKSGPAQVAFGFGPSSNYFMSYGSSK 91 Query: 763 --RINRKPGSSSDGGCVE--QRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFE 596 ++ G SD G +R+EKEYKEPW+YYSYYP LPLRRPYSG+P LLD EEF Sbjct: 92 VGSSSKYQGLGSDDGVHSSARRMEKEYKEPWDYYSYYPAALPLRRPYSGDPVLLDDEEFG 151 Query: 595 DDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPV 416 + S DE + A +L LMEEN E M F+QLP+++P+ K S Sbjct: 152 EASEEIAYDESSVKLATELDLMEENKEARMIFLQLPSSLPLVKRSATTNNDGTNSGLKQF 211 Query: 415 KGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTE 236 +G S+KPC +E LP GFMGKMLVY SGA+KLKLGDTLYDVS+G++CVFA DVVA+NTE Sbjct: 212 RGGVSSEKPCKLEELPVGFMGKMLVYESGAIKLKLGDTLYDVSSGMNCVFAQDVVAINTE 271 Query: 235 EKHCCSVGELNKRVVITPDVDSILDSMSD 149 EKHCC +GELNKR VITP++DSIL+SM D Sbjct: 272 EKHCCILGELNKRAVITPNIDSILNSMID 300 >gb|KHG12357.1| DNA-directed RNA polymerase III subunit RPC4 [Gossypium arboreum] Length = 288 Score = 288 bits (738), Expect = 4e-75 Identities = 146/271 (53%), Positives = 195/271 (71%), Gaps = 5/271 (1%) Frame = -1 Query: 943 PKVE---KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYG 773 PK+E ++ DIDA +A+DLL+R N++S + KPK E+KV +Q+AFG+GG ++ +K++G Sbjct: 26 PKLEVKTEVVEDIDAVQARDLLQRLNQTSARTKPKVEKKVSSSQVAFGFGGGAS-IKTFG 84 Query: 772 AANRINRKPGSSSDGGCVEQ--RVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEF 599 + N G + GG RVEKEYKEPW+YYSYYPL LP+RRPYSG+PE LD+EEF Sbjct: 85 TSKGANHSSGETFGGGVHGSGLRVEKEYKEPWDYYSYYPLTLPMRRPYSGSPEFLDEEEF 144 Query: 598 EDDSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNP 419 + DE + PA+ LGLMEENLE M F+QLP +P+ K G E S++ Sbjct: 145 AAQNVAY--DENSIEPAVGLGLMEENLEPMMLFLQLPPTLPIIKA-----GHEGASSSGS 197 Query: 418 VKGARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNT 239 + R ++K CG+ LPAG MGKMLVY+SGAVKLKLGDT+YDV+ GL CVFA DVVA++T Sbjct: 198 SRTVRSAKKTCGLTELPAGLMGKMLVYKSGAVKLKLGDTIYDVNPGLSCVFAQDVVAVDT 257 Query: 238 EEKHCCSVGELNKRVVITPDVDSILDSMSDL 146 +K CC 
VGE+NK VV+TPD+DS+L+S+S+L Sbjct: 258 AKKQCCVVGEVNKHVVVTPDLDSVLNSLSEL 288 >ref|XP_007154594.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|593783109|ref|XP_007154595.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|561027948|gb|ESW26588.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] gi|561027949|gb|ESW26589.1| hypothetical protein PHAVU_003G132000g [Phaseolus vulgaris] Length = 291 Score = 288 bits (737), Expect = 5e-75 Identities = 145/269 (53%), Positives = 195/269 (72%), Gaps = 4/269 (1%) Frame = -1 Query: 940 KVEKIEN-DIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYG--- 773 K E +E+ DA +A+DLLRRFNES+MKA+ K E+KV +QIAFGYGG ST+LKSYG Sbjct: 30 KAEVVEDAQADANQAKDLLRRFNESAMKARNKVEKKVSASQIAFGYGGESTSLKSYGIGR 89 Query: 772 AANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFED 593 +N P S+S EKEY EPW+YYS YP+ LPLRRPYSGNPELLD+EEF + Sbjct: 90 GGRNVNINPNSTSSAVA-----EKEYTEPWDYYSNYPVTLPLRRPYSGNPELLDEEEFGE 144 Query: 592 DSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVK 413 + DE ATN A++LGL+EENLE +MF ++LP+ +P+ + G++ + + P Sbjct: 145 AAEARTYDEEATNSAMELGLLEENLEANMFLIKLPSKLPIISTADG--GKDVNAKSKPPV 202 Query: 412 GARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEE 233 G + ++ C ++ LP+GFMGKMLVY+SG +KLKLG+TLYDVS+G++C F+ DVVA+N E Sbjct: 203 GTKKGERLCELKDLPSGFMGKMLVYKSGKIKLKLGNTLYDVSSGMNCSFSQDVVAINKAE 262 Query: 232 KHCCSVGELNKRVVITPDVDSILDSMSDL 146 K CS+GE++K V ITPD+D ILD++SDL Sbjct: 263 KTLCSIGEISKHVTITPDIDDILDNLSDL 291 >emb|CDP03102.1| unnamed protein product [Coffea canephora] Length = 262 Score = 288 bits (736), Expect = 7e-75 Identities = 153/299 (51%), Positives = 188/299 (62%) Frame = -1 Query: 1042 MDSESLAASKANXXXXXXXXXXXXXXXXXXPVLPKVEKIENDIDAARAQDLLRRFNESSM 863 MD ESLA + N V+ K EK+E+ +DAA+A++LLRR NESS+ Sbjct: 1 MDPESLATTTTNAPRKVRFAPKVPPRRDQKTVVTKAEKVEDAVDAAQAEELLRRLNESSV 60 Query: 862 KAKPKFERKVGHTQIAFGYGGSSTTLKSYGAANRINRKPGSSSDGGCVEQRVEKEYKEPW 683 KPKFERK G +RVEKEYKEPW Sbjct: 61 
NVKPKFERKAGGAM-----------------------------------RRVEKEYKEPW 85 Query: 682 NYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSIDDEYATNPALKLGLMEENLEDSMF 503 +YY+ YP+ LPLRRPYSG+PE LD+EEF++ S DE +TN A++LGL E +++M Sbjct: 86 DYYTNYPVTLPLRRPYSGDPEHLDQEEFDEASESLNYDECSTNAAVELGLTEG--KETML 143 Query: 502 FVQLPTAMPMTKPCNNAEGREQGSNTNPVKGARPSQKPCGMEALPAGFMGKMLVYRSGAV 323 F+QLP +MPM K N G E + P K QK C ++ LPAGFMGK+LVYRSGAV Sbjct: 144 FLQLPASMPMIKQLPNTAGSEMADTSKPTKSGELLQKSCSLDELPAGFMGKILVYRSGAV 203 Query: 322 KLKLGDTLYDVSAGLDCVFAHDVVALNTEEKHCCSVGELNKRVVITPDVDSILDSMSDL 146 KLKLGD LYDVS GLDCVFA DVVA+N EEKHCC+VGEL+KRV+ITPDVDS+LD M+DL Sbjct: 204 KLKLGDNLYDVSVGLDCVFAQDVVAINDEEKHCCTVGELDKRVIITPDVDSMLDGMADL 262 >ref|XP_012448093.1| PREDICTED: uncharacterized protein LOC105771231 isoform X1 [Gossypium raimondii] gi|763787464|gb|KJB54460.1| hypothetical protein B456_009G035100 [Gossypium raimondii] Length = 284 Score = 286 bits (733), Expect = 2e-74 Identities = 143/269 (53%), Positives = 189/269 (70%), Gaps = 3/269 (1%) Frame = -1 Query: 943 PKVE---KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYG 773 PK+E ++ D DA +A+DLL+R N+ S K KPK E+KV +Q+AFG+G ST++K++G Sbjct: 27 PKLEVKTEVVEDTDAVQARDLLQRLNQISAKTKPKVEKKVASSQVAFGFGAGSTSIKTFG 86 Query: 772 AANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFED 593 A+ PG R EKEYKEPW+YYSYYP+ LP+RRPYSGNPE LD+EEF Sbjct: 87 ASKGSVPTPGL---------REEKEYKEPWDYYSYYPVTLPMRRPYSGNPEFLDEEEFA- 136 Query: 592 DSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVK 413 + +E + PA++LGLMEEN E +MFF+QLP +PMTK N G E S + P Sbjct: 137 -LANATFEEDSVEPAVELGLMEENSEATMFFIQLPPTLPMTKQTGNISGNETNSRSKPAA 195 Query: 412 GARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEE 233 ++K G+E LPAGFMGKMLVYRSGAVKLKLGD+LYDV+ G + F+ DVVA+NT + Sbjct: 196 SVGSAKKTRGIEELPAGFMGKMLVYRSGAVKLKLGDSLYDVTPGCNSEFSQDVVAVNTGK 255 Query: 232 KHCCSVGELNKRVVITPDVDSILDSMSDL 146 KHCC VGE++KR ++TPDV S+ + ++DL Sbjct: 256 KHCCGVGEIDKRAILTPDVYSVFNYLTDL 284 >ref|XP_002516293.1| DNA binding protein, putative [Ricinus 
communis] gi|223544779|gb|EEF46295.1| DNA binding protein, putative [Ricinus communis] Length = 286 Score = 286 bits (731), Expect = 3e-74 Identities = 150/269 (55%), Positives = 190/269 (70%), Gaps = 4/269 (1%) Frame = -1 Query: 940 KVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGAAN- 764 K EK E++ DA +A L+++F E SM+AKPK E+KV +QIAFG+G +S ++KSY A Sbjct: 30 KSEKAEDE-DATQAMKLMKQFQERSMRAKPKAEKKVQASQIAFGFGAASPSIKSYAAPKV 88 Query: 763 --RINRKPGSSSDGGCVEQRV-EKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFED 593 +N GSS +GG + EKEY EPWNYYSYYP+ LPLRRPYSGNP L+ EEF + Sbjct: 89 GAAVNHNQGSSVNGGAYSSELGEKEYIEPWNYYSYYPVTLPLRRPYSGNPATLNAEEFGE 148 Query: 592 DSTRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVK 413 S S DE +TN A+ LGLMEEN+E +MFF+QLP +PM K A+G + VK Sbjct: 149 ASDTSEYDENSTNSAINLGLMEENVEANMFFLQLPPTVPMIKRLATADGHK-------VK 201 Query: 412 GARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEE 233 +K C ++ LPAG MGKMLVYRSGAVKLKLGDTLYDVS GLD FA D+ A+NT E Sbjct: 202 ----EEKTCKLDELPAGHMGKMLVYRSGAVKLKLGDTLYDVSPGLDFAFAQDIAAINTAE 257 Query: 232 KHCCSVGELNKRVVITPDVDSILDSMSDL 146 KHCC V E++K ++TPDVD+I++SM+DL Sbjct: 258 KHCCVVAEIDKHAIVTPDVDAIINSMADL 286 >ref|XP_012448095.1| PREDICTED: uncharacterized protein LOC105771231 isoform X2 [Gossypium raimondii] gi|763787463|gb|KJB54459.1| hypothetical protein B456_009G035100 [Gossypium raimondii] Length = 273 Score = 285 bits (729), Expect = 5e-74 Identities = 140/262 (53%), Positives = 185/262 (70%) Frame = -1 Query: 931 KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGAANRINR 752 ++ D DA +A+DLL+R N+ S K KPK E+KV +Q+AFG+G ST++K++GA+ Sbjct: 23 EVVEDTDAVQARDLLQRLNQISAKTKPKVEKKVASSQVAFGFGAGSTSIKTFGASKGSVP 82 Query: 751 KPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSID 572 PG R EKEYKEPW+YYSYYP+ LP+RRPYSGNPE LD+EEF + Sbjct: 83 TPGL---------REEKEYKEPWDYYSYYPVTLPMRRPYSGNPEFLDEEEFA--LANATF 131 Query: 571 DEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKGARPSQK 392 +E + PA++LGLMEEN E +MFF+QLP +PMTK N G E S + P ++K Sbjct: 132 
EEDSVEPAVELGLMEENSEATMFFIQLPPTLPMTKQTGNISGNETNSRSKPAASVGSAKK 191 Query: 391 PCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEEKHCCSVG 212 G+E LPAGFMGKMLVYRSGAVKLKLGD+LYDV+ G + F+ DVVA+NT +KHCC VG Sbjct: 192 TRGIEELPAGFMGKMLVYRSGAVKLKLGDSLYDVTPGCNSEFSQDVVAVNTGKKHCCGVG 251 Query: 211 ELNKRVVITPDVDSILDSMSDL 146 E++KR ++TPDV S+ + ++DL Sbjct: 252 EIDKRAILTPDVYSVFNYLTDL 273 >gb|KJB54458.1| hypothetical protein B456_009G035100 [Gossypium raimondii] Length = 294 Score = 285 bits (729), Expect = 5e-74 Identities = 140/262 (53%), Positives = 185/262 (70%) Frame = -1 Query: 931 KIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGAANRINR 752 ++ D DA +A+DLL+R N+ S K KPK E+KV +Q+AFG+G ST++K++GA+ Sbjct: 44 EVVEDTDAVQARDLLQRLNQISAKTKPKVEKKVASSQVAFGFGAGSTSIKTFGASKGSVP 103 Query: 751 KPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDDSTRSID 572 PG R EKEYKEPW+YYSYYP+ LP+RRPYSGNPE LD+EEF + Sbjct: 104 TPGL---------REEKEYKEPWDYYSYYPVTLPMRRPYSGNPEFLDEEEFA--LANATF 152 Query: 571 DEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKGARPSQK 392 +E + PA++LGLMEEN E +MFF+QLP +PMTK N G E S + P ++K Sbjct: 153 EEDSVEPAVELGLMEENSEATMFFIQLPPTLPMTKQTGNISGNETNSRSKPAASVGSAKK 212 Query: 391 PCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEEKHCCSVG 212 G+E LPAGFMGKMLVYRSGAVKLKLGD+LYDV+ G + F+ DVVA+NT +KHCC VG Sbjct: 213 TRGIEELPAGFMGKMLVYRSGAVKLKLGDSLYDVTPGCNSEFSQDVVAVNTGKKHCCGVG 272 Query: 211 ELNKRVVITPDVDSILDSMSDL 146 E++KR ++TPDV S+ + ++DL Sbjct: 273 EIDKRAILTPDVYSVFNYLTDL 294 >ref|XP_007204524.1| hypothetical protein PRUPE_ppa017748mg [Prunus persica] gi|462400055|gb|EMJ05723.1| hypothetical protein PRUPE_ppa017748mg [Prunus persica] Length = 281 Score = 285 bits (728), Expect = 6e-74 Identities = 146/268 (54%), Positives = 186/268 (69%) Frame = -1 Query: 949 VLPKVEKIENDIDAARAQDLLRRFNESSMKAKPKFERKVGHTQIAFGYGGSSTTLKSYGA 770 V +V+ + DA +A++LL+RFNE S +A+ K E+KV TQI FGYGG+STT+KSYGA Sbjct: 16 VKTEVDHGAEESDAEKAKELLKRFNEQSSRARLKVEKKVVPTQIVFGYGGASTTMKSYGA 75 Query: 769 
ANRINRKPGSSSDGGCVEQRVEKEYKEPWNYYSYYPLQLPLRRPYSGNPELLDKEEFEDD 590 S+++ G + EKEY PW+ YSYYP+ LPLR PYSGNPE+ ++EEF + Sbjct: 76 PK--GGSASSATNAGASGVKEEKEYSSPWDQYSYYPVTLPLRPPYSGNPEIRNEEEFGEG 133 Query: 589 STRSIDDEYATNPALKLGLMEENLEDSMFFVQLPTAMPMTKPCNNAEGREQGSNTNPVKG 410 S S DE +T PA LGL+EEN SMFF+QLP MP K A+ +E ++ P G Sbjct: 134 SEESTYDENSTTPANDLGLLEENKATSMFFLQLPPNMPTIKRSATADSQEVTKSSGPPGG 193 Query: 409 ARPSQKPCGMEALPAGFMGKMLVYRSGAVKLKLGDTLYDVSAGLDCVFAHDVVALNTEEK 230 AR QKPC + LPAGFMGKMLVYRSGAVK+K+GD+L+DVS G++C FA DVV +N EK Sbjct: 194 ARNMQKPCSLSELPAGFMGKMLVYRSGAVKMKIGDSLFDVSPGMNCDFAQDVVVVNKAEK 253 Query: 229 HCCSVGELNKRVVITPDVDSILDSMSDL 146 C +GELNKR +ITPDVDSIL S+ L Sbjct: 254 GCGIIGELNKRAIITPDVDSILASIDGL 281