BLASTX nr result
ID: Cocculus22_contig00012235
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus22_contig00012235
         (1624 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260...   705   0.0
ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626...   705   0.0
ref|XP_007225709.1| hypothetical protein PRUPE_ppa005050mg [Prun...   704   0.0
ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292...   702   0.0
ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citr...   700   0.0
ref|XP_007018895.1| DNA-directed RNA polymerase II protein isofo...   697   0.0
ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri...   691   0.0
ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776...   679   0.0
ref|XP_007141122.1| hypothetical protein PHAVU_008G169200g [Phas...   676   0.0
ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Popu...   676   0.0
ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813...   670   0.0
ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago ...   661   0.0
gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]     660   0.0
ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Popu...   659   0.0
gb|EYU43567.1| hypothetical protein MIMGU_mgv1a005543mg [Mimulus...   658   0.0
ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264...   648   0.0
ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590...   648   0.0
ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutr...   633   e-179
ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217...
615 e-173 ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabido... 613 e-173 >ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera] gi|302141899|emb|CBI19102.3| unnamed protein product [Vitis vinifera] Length = 478 Score = 705 bits (1820), Expect = 0.0 Identities = 349/478 (73%), Positives = 403/478 (84%), Gaps = 5/478 (1%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M+RK+ SC+ICE SNLASICA CV YRLN+Y T LKS K RDSLYLRL+ LVAK KAD Sbjct: 1 MTRKTSSCSICEKSNLASICAVCVNYRLNEYNTSLKSSKGRRDSLYLRLSEVLVAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQ++WRVLQNEK+ +LRE++ + K Q GKAKV+K+SN+LK+KY LESAM ML++NRV Sbjct: 61 DQINWRVLQNEKLARLREKLRHRKEQYLDGKAKVEKMSNDLKLKYGLLESAMSMLEKNRV 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPNLICTQ+LGLMAITSER HKQSVVIKQICKLFPQRRV+ D E KDGS+ YD Sbjct: 121 EQLEKFYPNLICTQNLGLMAITSERFHKQSVVIKQICKLFPQRRVNIDGEKKDGSSRPYD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC+ RLPR LDPHSVP +ELA SLGYM+QLLNLVV+NLAAPALH SGFAGSCSRIWQR+ Sbjct: 181 QICNVRLPRVLDPHSVPSDELAASLGYMVQLLNLVVYNLAAPALHNSGFAGSCSRIWQRE 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSAG-VS 1074 +YW+ RPSSRSNEYPLFIPRQN C GENSW++RSSSNFG+ASMES++KP L+S+G S Sbjct: 241 SYWNPRPSSRSNEYPLFIPRQNLCSTNGENSWSERSSSNFGIASMESDRKPRLESSGSSS 300 Query: 1075 FNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATL 1254 FN+ H VE HKDLQKGISLLKKSVAC+T YCY+SL LD +EASTFEAFAKLLA L Sbjct: 301 FNYSSASLHSVETHKDLQKGISLLKKSVACLTTYCYSSLCLDVPTEASTFEAFAKLLAIL 360 Query: 1255 SSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCD--- 1425 SSSKEVRS FS+KMA SRSCKQV Q+NKS+W++NSA +SS+LLES H + RN+ D Sbjct: 361 SSSKEVRSVFSLKMACSRSCKQVQQLNKSIWNMNSAISSSTLLESAHTLPMTRNIFDNNL 420 Query: 1426 -NSSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 NS+ SFLYT EMSD+GK ES++E WD+VEH FPPPPSQ EDIEHWTRAM IDATKK Sbjct: 421 
PNSAASFLYTTEMSDIGKNESLIEEWDLVEHANFPPPPSQTEDIEHWTRAMIIDATKK 478 >ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626964 isoform X1 [Citrus sinensis] gi|568837325|ref|XP_006472676.1| PREDICTED: uncharacterized protein LOC102626964 isoform X2 [Citrus sinensis] Length = 478 Score = 705 bits (1819), Expect = 0.0 Identities = 348/478 (72%), Positives = 403/478 (84%), Gaps = 5/478 (1%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M++K+ +CAICENSN ASICA CV YRL++ T LKSLKS RD+LY+RL+ LVAK KAD Sbjct: 1 MNKKASNCAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQL+WRVLQNEK+ LRE++ +K QLSQGK K++K S +LKV+YA L+SA M+++NR Sbjct: 61 DQLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKVRYAILDSARSMMEKNRA 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPN+ICTQSLG MAI SE LHKQSVVIKQICKLFPQRRV+ D E +DGS+GQYD Sbjct: 121 EQLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQYD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC ARLP+GLDPHSVP EELA SLGYM+QLLNLVV NLA P LH SGFAGSCSRIWQRD Sbjct: 181 QICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPILHNSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSA-GVS 1074 +YWDARPSSRSNEYPLFIPRQN C GENSWTDRSSSNFGVASMESE++P LDS+ S Sbjct: 241 SYWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSTS 300 Query: 1075 FNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATL 1254 FN+ H VE HKDLQKGISLLKKSVAC+TAYCYNSL LD +EASTFEAFAKLLATL Sbjct: 301 FNYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLATL 360 Query: 1255 SSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCDN-- 1428 SSSKEVRS FS+KMA SRSCKQV ++N+SVW++NSA +S++LLES H + +N+ DN Sbjct: 361 SSSKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSDNNL 420 Query: 1429 --SSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 S+ SFLY EMSD+GK ES+++GWD+VEHP FPPPPSQ 
ED+EHWTRAM IDATKK Sbjct: 421 PSSAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478 >ref|XP_007225709.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica] gi|596287022|ref|XP_007225710.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica] gi|462422645|gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica] gi|462422646|gb|EMJ26909.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica] Length = 479 Score = 704 bits (1817), Expect = 0.0 Identities = 353/479 (73%), Positives = 407/479 (84%), Gaps = 5/479 (1%) Frame = +1 Query: 175 MMSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKA 354 MM+RKS +CAICE+SNLAS+CA CV YRL +Y + LK+LKS RDSLY RLT LVAK KA Sbjct: 1 MMNRKSSNCAICESSNLASVCAICVNYRLTEYNSSLKALKSRRDSLYSRLTEALVAKGKA 60 Query: 355 DDQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNR 534 DDQL+WRVLQNEK+V+LRE++ K QL QGKAK++K S +LKVK LESA+ +L++NR Sbjct: 61 DDQLNWRVLQNEKLVRLREKLRCNKEQLVQGKAKIEKTSYDLKVKSGVLESALAVLEKNR 120 Query: 535 VEQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQY 714 EQL+KFYPN ICTQ+LG MAITSERLHKQSVVIKQICKLFPQRRV+ D + KD S GQY Sbjct: 121 AEQLEKFYPNFICTQNLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKDASGGQY 180 Query: 715 DQICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQR 894 DQIC+A LPRGLDPHSVP EELA SLGYM+QLLNLVV NLAAPALH SGFAGSCSRIWQR Sbjct: 181 DQICNACLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLAAPALHNSGFAGSCSRIWQR 240 Query: 895 DTYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSAG-V 1071 D+YWDARPSSRSNEYPLFIPRQN C GENSW+DRSSSNFGVAS++SE+KP+LDS+G Sbjct: 241 DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIDSERKPHLDSSGSS 300 Query: 1072 SFNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLAT 1251 SFN+ H VE HKDLQ+GISLLKKSVACITAYCYNSL LD SEASTFEAFAKLLAT Sbjct: 301 SFNYTSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLAT 360 Query: 1252 LSSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCDNS 1431 LSSSKEV S FS+KMA SRSCKQV Q+NKSVW+VNSA +S++LL+S HA ++ +N+ + + Sbjct: 361 
LSSSKEVHSVFSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAHAMTMTKNLYEYN 420 Query: 1432 ----STSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 +TS L + E+SD GK ES+VEGWD+VEHP FPPPPSQ+EDIEHWTRAMFIDA +K Sbjct: 421 LPTYATSSLCSTELSDSGKNESLVEGWDLVEHPTFPPPPSQSEDIEHWTRAMFIDAKRK 479 >ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292418 [Fragaria vesca subsp. vesca] Length = 478 Score = 702 bits (1813), Expect = 0.0 Identities = 353/479 (73%), Positives = 403/479 (84%), Gaps = 5/479 (1%) Frame = +1 Query: 175 MMSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKA 354 M ++KS +CAICENSNLASICA CV YRLNDY LK+LKS RD LY RL+ LVAK KA Sbjct: 1 MTNKKSSNCAICENSNLASICAVCVNYRLNDYNNSLKALKSRRDLLYSRLSDALVAKGKA 60 Query: 355 DDQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNR 534 DDQL+WR+LQ+EK+V+LRE++ K QL QGKAK++K S +LKVKY LESA+ ML++NR Sbjct: 61 DDQLNWRILQDEKLVRLREKLRRNKEQLVQGKAKIEKTSYDLKVKYGVLESALSMLEKNR 120 Query: 535 VEQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQY 714 EQL+KFYPNLICTQSLG MAITSERLHKQSVVIKQICKLFPQRRV+ D + K+GS GQY Sbjct: 121 AEQLEKFYPNLICTQSLGHMAITSERLHKQSVVIKQICKLFPQRRVTVDAKRKEGSGGQY 180 Query: 715 DQICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQR 894 DQIC+A LPRGLDPHSVP EELA SLGYM+QLLNLVV NL APALH SGFAGSCSRIWQR Sbjct: 181 DQICNASLPRGLDPHSVPSEELAASLGYMVQLLNLVVQNLGAPALHNSGFAGSCSRIWQR 240 Query: 895 DTYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSAG-V 1071 D+YWDARPSSRSNEYPLFIPRQN C GENSW+DRSSSNFGVAS+ESE+KP LDS+G Sbjct: 241 DSYWDARPSSRSNEYPLFIPRQNYCSTSGENSWSDRSSSNFGVASIESERKPRLDSSGSS 300 Query: 1072 SFNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLAT 1251 SFN+ H VE HKDLQ+GISLLKKSVACITAYCYNSL LD SEASTFEAFAKLL+T Sbjct: 301 SFNYSSASQHSVETHKDLQRGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLST 360 Query: 1252 LSSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCDNS 1431 LSSSKEV S FS+KMA SRSCKQV Q+NKSVW+VNSA +S++LL+S H ++ +N +N+ Sbjct: 361 LSSSKEVHSVFSLKMACSRSCKQVQQLNKSVWNVNSAISSTTLLDSAHTMTMTKNFYENN 420 Query: 1432 
----STSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 +TSFL + EMSDVGK E +EGWD+VEHP PPPSQ+EDIEHWTRAMFID TK+ Sbjct: 421 IPNYATSFLSSTEMSDVGKNECTIEGWDLVEHPTL-PPPSQSEDIEHWTRAMFIDVTKR 478 >ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|567883029|ref|XP_006434073.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|567883031|ref|XP_006434074.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|567883033|ref|XP_006434075.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|557536194|gb|ESR47312.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|557536195|gb|ESR47313.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|557536196|gb|ESR47314.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|557536197|gb|ESR47315.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] Length = 478 Score = 700 bits (1807), Expect = 0.0 Identities = 346/478 (72%), Positives = 401/478 (83%), Gaps = 5/478 (1%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M++K+ +CAICENSN ASICA CV YRL++ T LKSLKS RD+LY+RL+ LVAK KAD Sbjct: 1 MNKKASNCAICENSNRASICAACVNYRLSECNTLLKSLKSRRDALYMRLSEVLVAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQL+WRVLQNEK+ LRE++ +K QLSQGK K++K S +LK +YA L+SA M+++NR Sbjct: 61 DQLNWRVLQNEKLTNLREKLRRSKEQLSQGKLKIEKSSYDLKGRYAILDSARSMMEKNRA 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPN+ICTQSLG MAI SE LHKQSVVIKQICKLFPQRRV+ D E +DGS+GQYD Sbjct: 121 EQLEKFYPNIICTQSLGHMAIVSELLHKQSVVIKQICKLFPQRRVNIDGERRDGSSGQYD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC ARLP+GLDPHSVP EELA SLGYM+QLLNLVV NLA P LH SGFAGSCSRIWQRD Sbjct: 181 QICGARLPKGLDPHSVPSEELAASLGYMVQLLNLVVLNLAVPVLHNSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSA-GVS 1074 +YWDARPSSRSNEYPLFIPRQN C 
GENSWTDRSSSNFGVASMESE++P LDS+ S Sbjct: 241 SYWDARPSSRSNEYPLFIPRQNYCSTSGENSWTDRSSSNFGVASMESERRPQLDSSRSAS 300 Query: 1075 FNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATL 1254 FN+ H VE HKDLQKGISLLKKSVAC+TAYCYNSL LD +EASTFEAFAKLLATL Sbjct: 301 FNYTSASTHSVETHKDLQKGISLLKKSVACLTAYCYNSLCLDVPAEASTFEAFAKLLATL 360 Query: 1255 SSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCDN-- 1428 S SKEVRS FS+KMA SRSCKQV ++N+SVW++NSA +S++LLES H + +N+ DN Sbjct: 361 SLSKEVRSVFSLKMACSRSCKQVQKLNRSVWNMNSAISSTTLLESAHMFPITKNLSDNNL 420 Query: 1429 --SSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 S+ SFLY EMSD+GK ES+++GWD+VEHP FPPPPSQ ED+EHWTRAM IDATKK Sbjct: 421 PSSAASFLYATEMSDIGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMIIDATKK 478 >ref|XP_007018895.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao] gi|508724223|gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao] Length = 479 Score = 697 bits (1800), Expect = 0.0 Identities = 346/479 (72%), Positives = 402/479 (83%), Gaps = 5/479 (1%) Frame = +1 Query: 175 MMSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKA 354 MMS+K+ +CAIC+NSN ASICA CV YRLN+Y + LKSLKS RD LY +L L AK KA Sbjct: 1 MMSKKASNCAICDNSNRASICAVCVNYRLNEYNSLLKSLKSRRDFLYSKLDEVLAAKRKA 60 Query: 355 DDQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNR 534 DDQL+W++LQNEK+ L+E++ +K QL+QGKAK++++S +LKVKY LESA ML++NR Sbjct: 61 DDQLNWKILQNEKLTDLKEKLRRSKEQLAQGKAKIERVSYDLKVKYGVLESARGMLEKNR 120 Query: 535 VEQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQY 714 VE+L+KFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRV+ D E +DGS GQY Sbjct: 121 VEKLEKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVNLDGEGRDGSCGQY 180 Query: 715 DQICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQR 894 D IC+ LPRGLDPHSVP E+LA SLGYM+QLLNLVVHNLAAPALH SGFAGSCSRIWQR Sbjct: 181 DLICNVGLPRGLDPHSVPSEQLAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQR 240 Query: 895 DTYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSAGV- 1071 D+YW+ARPSSRSNEYPLFIPRQN C 
G+NSWTDRSSSNFGVASMESE++P LDS+G Sbjct: 241 DSYWNARPSSRSNEYPLFIPRQNYCSTSGDNSWTDRSSSNFGVASMESERRPRLDSSGSN 300 Query: 1072 SFNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLAT 1251 SFN+ H VE HKDLQ GISLLKKSVACITA+CYNSL LD +EASTFEAF+KLLAT Sbjct: 301 SFNYSSASSHTVETHKDLQIGISLLKKSVACITAFCYNSLCLDVPTEASTFEAFSKLLAT 360 Query: 1252 LSSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCD-- 1425 LSS+KEVRS FS+KMA SRS KQ Q+NKSVW+VNSA +SS LLES H L +N+ D Sbjct: 361 LSSTKEVRSVFSLKMACSRSSKQAQQLNKSVWNVNSAMSSSMLLESAHMLPLTKNLSDHN 420 Query: 1426 --NSSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 +S+ SFL+ EM D+GK ES++E WD+VEHP FPPPPSQ ED+EHWTRAMFIDATK+ Sbjct: 421 LPSSAASFLFATEMPDIGKNESLIEEWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKR 479 >ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis] gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase II, putative [Ricinus communis] Length = 478 Score = 691 bits (1783), Expect = 0.0 Identities = 343/478 (71%), Positives = 396/478 (82%), Gaps = 5/478 (1%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M++KS CAICENSN ASIC CV YRLN+Y T LKSLKS RD LY RL+ LVAK KAD Sbjct: 1 MNKKSSCCAICENSNRASICTVCVNYRLNEYSTLLKSLKSRRDLLYSRLSEVLVAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQL+WRV QNEK+ LRE++ +K QL Q KAK +K+S++L KY LES+ L++NRV Sbjct: 61 DQLNWRVHQNEKLANLREKLLRSKEQLIQAKAKTEKMSSDLNAKYGLLESSRSALEKNRV 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 +QL+K++PNLICTQSLG MAITSE LH SV +KQICKLFPQRRV + E KDGS+GQYD Sbjct: 121 DQLEKYFPNLICTQSLGHMAITSELLHNLSVTVKQICKLFPQRRVIVEGEKKDGSSGQYD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC+ARLPRGLDPHS+P EELA SLGYM+QLLNLVVHNLAAPALH SGFAGSCSRIWQRD Sbjct: 181 QICNARLPRGLDPHSIPSEELAASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSA-GVS 1074 +YW+ARPSSRSNEYPLFIPRQ C 
GENSWTDRSSSNFGVASMESE++ LDS+ S Sbjct: 241 SYWNARPSSRSNEYPLFIPRQRYCSTSGENSWTDRSSSNFGVASMESERRARLDSSRSSS 300 Query: 1075 FNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATL 1254 FN++ PH VE HKDLQKGISL+KKSVAC+TAY YN L LD +EASTFEAFAKLLATL Sbjct: 301 FNYNSASPHSVETHKDLQKGISLMKKSVACVTAYGYNLLCLDVPAEASTFEAFAKLLATL 360 Query: 1255 SSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCD--- 1425 SSSKEVRS FS+KMA SRSCKQV ++NKSVW+VNS +SS+L+ES HA L +N+ D Sbjct: 361 SSSKEVRSVFSLKMACSRSCKQVQKLNKSVWNVNSIISSSTLMESAHAPHLTKNINDNNL 420 Query: 1426 -NSSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 NS+TSFL+ E+SD GK ES+++GWD+VEHP FPPPPSQ ED+EHWTRAMFIDATKK Sbjct: 421 RNSATSFLFANEISDAGKNESLIDGWDLVEHPTFPPPPSQTEDVEHWTRAMFIDATKK 478 >ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoformX1 [Glycine max] Length = 475 Score = 679 bits (1753), Expect = 0.0 Identities = 336/476 (70%), Positives = 401/476 (84%), Gaps = 3/476 (0%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M+RK+ +CAICENSN ASIC+ CV YRLN+Y T LK LK RDSLYL+L+ LV K K D Sbjct: 1 MARKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYLKLSEVLVRKGKGD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQ +WRVLQ+EK+ +L+E++ +K Q++QG+AK++ +S +LK+KY LESA+ L++NRV Sbjct: 61 DQANWRVLQHEKLARLKEKLRQSKEQVTQGRAKIETMSADLKLKYGLLESALSTLEKNRV 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPNLICTQSLG +AITSE LHK+SVVIKQICKLFPQRRV + E +DG +GQYD Sbjct: 121 EQLEKFYPNLICTQSLGHVAITSELLHKESVVIKQICKLFPQRRVVIEGERRDGCSGQYD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC+ARLPR LDPHSVP EEL+ SLGYM+QLLNLV+HNLAAPALH SGFAGSCSRIWQRD Sbjct: 181 QICNARLPRALDPHSVPSEELSTSLGYMVQLLNLVIHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSAG-VS 1074 +YWDARPSSRSNEYPLFIPRQN C GENSW++RSSSNFGVAS+ESE++ LDS+G S Sbjct: 241 
SYWDARPSSRSNEYPLFIPRQNYCSTDGENSWSERSSSNFGVASVESERRHRLDSSGSTS 300 Query: 1075 FNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATL 1254 FN+ H V+ HKDLQKGISLLKKSV CITAYCYNSL LD SEASTFEAFAKLLATL Sbjct: 301 FNYSLASSHSVQTHKDLQKGISLLKKSVVCITAYCYNSLCLDVPSEASTFEAFAKLLATL 360 Query: 1255 SSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHA--TSLMRNVCDN 1428 +SSKEVRS FS+KMA SR+CKQV Q+NKSVW++NSA +S++LLES H+ T+ + N + Sbjct: 361 ASSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTRIENYLPS 420 Query: 1429 SSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 S+ SFLY A++SD GK E ++EGWDIVEHP FPPPPSQ+ED+EHWTRAMFIDA K Sbjct: 421 STGSFLYAADLSD-GKNECLIEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKGK 475 >ref|XP_007141122.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris] gi|593488511|ref|XP_007141123.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris] gi|561014255|gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris] gi|561014256|gb|ESW13117.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris] Length = 476 Score = 676 bits (1744), Expect = 0.0 Identities = 338/477 (70%), Positives = 401/477 (84%), Gaps = 4/477 (0%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M+RK+ +CAICENSN ASIC+ CV YRLN+Y T LKSLK RDSLY +L+ LV K K D Sbjct: 1 MARKTSNCAICENSNQASICSICVNYRLNEYNTSLKSLKDRRDSLYSKLSEVLVQKGKGD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQ ++ VLQNEK+ +L+E++ +K Q++QG+AK++ +S +LK KY LESA+ L++NRV Sbjct: 61 DQENYIVLQNEKLARLKEKLHRSKEQVTQGRAKIETVSADLKHKYGLLESALSTLEKNRV 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPNLICTQSLG +AITSERLHKQSVVIKQICKLFPQRRV + E++DG +GQYD Sbjct: 121 EQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEIRDGCSGQYD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC+ARLPR LDPHSVP EEL+ SLGYM+QLLNLVVHNLAAPALH SGFAGSCSRIWQRD Sbjct: 181 
QICNARLPRALDPHSVPSEELSASLGYMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSW-TDRSSSNFGVASMESEKKPYLDSAGVS 1074 +YWDARPSSRSNEYPLFIPRQN C GENSW TD+SSSNFGVASMESEK+ LDS+G S Sbjct: 241 SYWDARPSSRSNEYPLFIPRQNYCSTAGENSWSTDKSSSNFGVASMESEKRNRLDSSGNS 300 Query: 1075 -FNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLAT 1251 FN+ H V+ HKDLQKGISLLKKSVACITAYCYNSL LD SEASTFE+FAKLLAT Sbjct: 301 NFNYSLASLHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDAPSEASTFESFAKLLAT 360 Query: 1252 LSSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHA--TSLMRNVCD 1425 LSSSKEVRS FS+KMA SR+CKQV Q+NKSVW++NS +S++LLES H+ T+ + N Sbjct: 361 LSSSKEVRSVFSLKMAQSRTCKQVQQLNKSVWNMNSVISSTTLLESAHSVPTTRIENYLP 420 Query: 1426 NSSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 +S+ SFLY +++D GK E ++EGWDI+EHP FPPPPSQ+ED+EHWTRAMFIDA +K Sbjct: 421 SSTASFLYATDLND-GKNECLIEGWDIIEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 476 >ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|566157047|ref|XP_006386388.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|566157050|ref|XP_006386389.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|222843996|gb|EEE81543.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|550344610|gb|ERP64185.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|550344611|gb|ERP64186.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] Length = 475 Score = 676 bits (1743), Expect = 0.0 Identities = 342/478 (71%), Positives = 391/478 (81%), Gaps = 5/478 (1%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M++KS CAICENSN ASIC CV YRLN+Y T LKSL S RDSLY +L+ L+AK KAD Sbjct: 1 MNKKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQ +WRV QNEK+ RE++ K QL+QGKAKV+KLS +LK K LESA ++L++NR+ Sbjct: 61 DQFNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRM 120 Query: 538 
EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPNLICTQSLG MAITSE LHKQSVVIKQICKLFPQRRV+ D E +GQYD Sbjct: 121 EQLEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYD 178 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC+ARLPRGLDPHSV EELA SLGYM+QLLNLV HNLAAP LH +GFAGSCSRIWQRD Sbjct: 179 QICNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRD 238 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSA-GVS 1074 +YW+A PSSRSNEYPLFIPRQN C ENSWTD+SSSNFGVASMESE++P+LDS S Sbjct: 239 SYWNACPSSRSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRSNS 298 Query: 1075 FNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATL 1254 FN+ PH VE HKDLQKG+SLLKKSVAC+TAYCYN L LD S+ STFEAFAKLL+TL Sbjct: 299 FNYSSVSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLSTL 358 Query: 1255 SSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCD--- 1425 SSSKEVRS F++KMA SRSCKQV ++NKSVW+VNSA +SS+LLES HA LM+N D Sbjct: 359 SSSKEVRSVFNLKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDNNL 418 Query: 1426 -NSSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 NS+ SFL+ +SD GK ES ++GWD+VEHP FPPPPSQ EDIEHWTRAMFIDATKK Sbjct: 419 PNSAASFLFATGISD-GKNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 475 >ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813297 [Glycine max] Length = 474 Score = 670 bits (1729), Expect = 0.0 Identities = 335/476 (70%), Positives = 394/476 (82%), Gaps = 3/476 (0%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M+RK+ +CAICENSN ASIC+ CV YRLN+Y T LK LK RDSLY +L+ LV K K D Sbjct: 1 MARKTSNCAICENSNQASICSICVNYRLNEYNTSLKLLKDRRDSLYSKLSEVLVRKGKGD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQ +WRVLQ+EK+ +L+E++ K Q++QG+AK++ S +LK+KY LESA+ L++NRV Sbjct: 61 DQANWRVLQHEKLARLKEKLRQGKEQVTQGRAKIETKSADLKLKYGLLESALSTLEKNRV 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPNLICTQSLG +AITSERLHKQSVVIKQICKLFPQRRV 
+ E DG GQ+D Sbjct: 121 EQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGERGDGCCGQFD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC+ARLPR LDP SVP EEL+ SLGYM+QLLNL+VHNLAAPALH SGFAGSCSRIWQRD Sbjct: 181 QICNARLPRALDPRSVPSEELSTSLGYMVQLLNLIVHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSAG-VS 1074 +YWDARPSSRSNEYPLFIPRQN C GGENSW++RSSSNFGVASMESE++ LDS+G S Sbjct: 241 SYWDARPSSRSNEYPLFIPRQNYCSTGGENSWSERSSSNFGVASMESERRHRLDSSGSSS 300 Query: 1075 FNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATL 1254 FN+ H V+ HKDLQKGISLLKKSVACITAYCYNSL LD SEASTFEAFAKLLATL Sbjct: 301 FNYSLASSHSVQTHKDLQKGISLLKKSVACITAYCYNSLCLDVPSEASTFEAFAKLLATL 360 Query: 1255 SSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHA--TSLMRNVCDN 1428 SSSKEVRS FS+KM SR+CKQV Q+NKSVW++NSA +S++LLES H+ T+ + N + Sbjct: 361 SSSKEVRSVFSLKMPRSRTCKQVQQLNKSVWNMNSAISSTTLLESAHSVPTTRIENYLPS 420 Query: 1429 SSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 ++ SFLY + GK E +VEGWDIVEHP FPPPPSQ+ED+EHWTRAMFIDA +K Sbjct: 421 ATASFLYATDSD--GKNECLVEGWDIVEHPTFPPPPSQSEDVEHWTRAMFIDAKRK 474 >ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago truncatula] gi|355516236|gb|AES97859.1| hypothetical protein MTR_5g061040 [Medicago truncatula] Length = 501 Score = 661 bits (1706), Expect = 0.0 Identities = 333/486 (68%), Positives = 394/486 (81%), Gaps = 16/486 (3%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M+RKS +CAICEN N SIC+ CV YRLN+Y + LKSLK RDSLY +L+ LV K K D Sbjct: 1 MARKSTNCAICENLNQPSICSVCVNYRLNEYNSSLKSLKERRDSLYSKLSEVLVRKGKGD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQ +WRVL++EK+ + RE++ + K Q++QG+AK+ +S +LK+KY LESA+ ML++NRV Sbjct: 61 DQTNWRVLRHEKLARSREKLRHNKEQVTQGRAKIQAMSADLKLKYGVLESALSMLEKNRV 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPNLICTQSLG +AITSERLHKQSVVIKQICKLFPQRRV + E D +GQYD Sbjct: 121 
EQLEKFYPNLICTQSLGHVAITSERLHKQSVVIKQICKLFPQRRVVIEGEKGDDCSGQYD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC+ARLPR LDPHSVP EEL+ SLGYM+QLLNLV HNLAAPALH SGFAGSCSRIWQRD Sbjct: 181 QICNARLPRALDPHSVPSEELSASLGYMVQLLNLVAHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSR-------------SNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMES 1038 +YWDARPSSR SNEYPLFIPRQN C GENSW+++SSSNFGVASMES Sbjct: 241 SYWDARPSSRSKNFFNLKYSLFFSNEYPLFIPRQNYCSTSGENSWSEKSSSNFGVASMES 300 Query: 1039 EKKPYLDSAG-VSFNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEA 1215 +++P LDS+G SFN+ H V++HKDLQKGISLLKKSVACITAYCYNSL D SEA Sbjct: 301 DRRPRLDSSGSSSFNYSLASSHSVQSHKDLQKGISLLKKSVACITAYCYNSLCFDIPSEA 360 Query: 1216 STFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGH 1395 STFEAFAKLLATLSSSKEVRS FS+KMA SR+CKQV Q+NKSVW++NSA +S++LLES H Sbjct: 361 STFEAFAKLLATLSSSKEVRSVFSLKMARSRTCKQVQQLNKSVWNMNSANSSTTLLESTH 420 Query: 1396 A--TSLMRNVCDNSSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTR 1569 + T+ + N NS+ SFLY + SD K+E ++EGWDIVEHP PPPPSQ+ED+EHWTR Sbjct: 421 SVPTTRIENYMPNSAASFLYPTDSSD-RKSECLIEGWDIVEHPTLPPPPSQSEDVEHWTR 479 Query: 1570 AMFIDA 1587 AMFIDA Sbjct: 480 AMFIDA 485 >gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis] Length = 478 Score = 660 bits (1702), Expect = 0.0 Identities = 337/478 (70%), Positives = 386/478 (80%), Gaps = 5/478 (1%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M+RKS SCA+CENSNL SIC+ CV YRL D+Y LKS KS RDSLY RL L+AK KAD Sbjct: 1 MNRKSTSCALCENSNLPSICSICVNYRLADHYNILKSNKSHRDSLYSRLEEVLLAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQ+ WR+ QNEK+ KLRE+ +K +L QGKAKV+++ +LKVK LE+A ML+ NR+ Sbjct: 61 DQVGWRMSQNEKLAKLREKHRRSKERLVQGKAKVERMHYDLKVKSGVLEAARSMLENNRM 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPN ICTQ+LG MAITSERLHKQSVVIKQICKLFP RRV D E K+GS QYD Sbjct: 121 EQLEKFYPNFICTQTLGHMAITSERLHKQSVVIKQICKLFPHRRVIIDGERKNGSAEQYD 180 Query: 718 
QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC+ARLPRG+DPHSV EEL SLGYM+QLLNL+V LAAPALH SGFAGS SRIWQRD Sbjct: 181 QICNARLPRGVDPHSVASEELGASLGYMVQLLNLIVRILAAPALHNSGFAGSNSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSAGV-S 1074 +YWDARPSSRSNEYPLFIPRQN C ENSW+DRSSSNFGV S+ESE+K LDS+G S Sbjct: 241 SYWDARPSSRSNEYPLFIPRQNYCSTSVENSWSDRSSSNFGVTSIESERKVRLDSSGSNS 300 Query: 1075 FNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATL 1254 FN+ PH +E HKDLQKGISLLKKSVACIT YCYNSL LD SEASTFEAFAKLLATL Sbjct: 301 FNYSSASPHSIETHKDLQKGISLLKKSVACITTYCYNSLCLDVPSEASTFEAFAKLLATL 360 Query: 1255 SSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCDNS- 1431 SSSKE+RS S+K A SRS KQV Q+NKSVW+VNSA S++LL+S H + M+N+ +N+ Sbjct: 361 SSSKELRSVCSIKSACSRSNKQVQQLNKSVWNVNSAFASTTLLDSAHTVASMKNIGENNL 420 Query: 1432 ---STSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 +TSFLY E SD GK E I+EGWD++EHP FPPPPSQ ED+EHWTRAMFIDATKK Sbjct: 421 PNPATSFLYATE-SDAGKNEFIIEGWDLIEHPTFPPPPSQCEDVEHWTRAMFIDATKK 477 >ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|550344612|gb|ERP64187.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] Length = 506 Score = 659 bits (1701), Expect = 0.0 Identities = 342/509 (67%), Positives = 391/509 (76%), Gaps = 36/509 (7%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M++KS CAICENSN ASIC CV YRLN+Y T LKSL S RDSLY +L+ L+AK KAD Sbjct: 1 MNKKSSCCAICENSNRASICPICVNYRLNEYGTLLKSLNSRRDSLYSKLSVVLIAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQ +WRV QNEK+ RE++ K QL+QGKAKV+KLS +LK K LESA ++L++NR+ Sbjct: 61 DQFNWRVQQNEKLASSREKLHRNKEQLAQGKAKVEKLSQDLKKKNGMLESARNVLEKNRM 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPNLICTQSLG MAITSE LHKQSVVIKQICKLFPQRRV+ D E +GQYD Sbjct: 121 EQLEKFYPNLICTQSLGHMAITSELLHKQSVVIKQICKLFPQRRVNVDGER--NFSGQYD 178 Query: 718 
QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC+ARLPRGLDPHSV EELA SLGYM+QLLNLV HNLAAP LH +GFAGSCSRIWQRD Sbjct: 179 QICNARLPRGLDPHSVSSEELAASLGYMVQLLNLVAHNLAAPTLHNAGFAGSCSRIWQRD 238 Query: 898 TYWDARPSSR-------------------------------SNEYPLFIPRQNCCFPGGE 984 +YW+A PSSR SNEYPLFIPRQN C E Sbjct: 239 SYWNACPSSRRYFDWKSLCFGISVAKFELLLLSELNILCACSNEYPLFIPRQNYCSTSSE 298 Query: 985 NSWTDRSSSNFGVASMESEKKPYLDSA-GVSFNHHPTCPHLVENHKDLQKGISLLKKSVA 1161 NSWTD+SSSNFGVASMESE++P+LDS SFN+ PH VE HKDLQKG+SLLKKSVA Sbjct: 299 NSWTDKSSSNFGVASMESERRPHLDSTRSNSFNYSSVSPHSVETHKDLQKGVSLLKKSVA 358 Query: 1162 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 1341 C+TAYCYN L LD S+ STFEAFAKLL+TLSSSKEVRS F++KMA SRSCKQV ++NKS Sbjct: 359 CVTAYCYNLLCLDVPSDTSTFEAFAKLLSTLSSSKEVRSVFNLKMACSRSCKQVQKLNKS 418 Query: 1342 VWHVNSAGTSSSLLESGHATSLMRNVCD----NSSTSFLYTAEMSDVGKTESIVEGWDIV 1509 VW+VNSA +SS+LLES HA LM+N D NS+ SFL+ +SD GK ES ++GWD+V Sbjct: 419 VWNVNSAISSSALLESAHALQLMKNTSDNNLPNSAASFLFATGISD-GKNESFIDGWDLV 477 Query: 1510 EHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 EHP FPPPPSQ EDIEHWTRAMFIDATKK Sbjct: 478 EHPTFPPPPSQVEDIEHWTRAMFIDATKK 506 >gb|EYU43567.1| hypothetical protein MIMGU_mgv1a005543mg [Mimulus guttatus] Length = 479 Score = 658 bits (1698), Expect = 0.0 Identities = 330/476 (69%), Positives = 387/476 (81%), Gaps = 4/476 (0%) Frame = +1 Query: 181 SRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKADD 360 +RK+ SCAICE SNLASIC CV YRLN+Y L+ LKS RD+LY +LT LVAK KADD Sbjct: 6 TRKTSSCAICETSNLASICTVCVNYRLNEYNGNLRLLKSKRDALYSKLTQVLVAKGKADD 65 Query: 361 QLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRVE 540 Q SWRVL NEK+ +LR+++ K Q+ QGKAK++K S++LK+KY LESAM +++NR+E Sbjct: 66 QHSWRVLHNEKLARLRDKLRQRKEQILQGKAKIEKRSHDLKLKYELLESAMDTMEKNRLE 125 Query: 541 QLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYDQ 720 Q++K+YPNLICTQSLG MAITSERLHKQSV+IKQICKLFPQRRV+ D E KDG GQYD Sbjct: 126 QIEKYYPNLICTQSLGHMAITSERLHKQSVIIKQICKLFPQRRVNIDGESKDGYGGQYDT 185 Query: 721 
ICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRDT 900 IC+ARLPRGLDPHSVP EELA SLGYM+QLLNLV+H + APALH SGFAGSCSRIWQR++ Sbjct: 186 ICNARLPRGLDPHSVPSEELAASLGYMVQLLNLVIHTVCAPALHHSGFAGSCSRIWQRES 245 Query: 901 YWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSAGVSFN 1080 YWDARPS RS EYPLFIPRQN C GGE SW++RSSSNFGVASMES +KP L+S+G SFN Sbjct: 246 YWDARPSPRS-EYPLFIPRQNFCTTGGETSWSERSSSNFGVASMESVRKPRLESSGGSFN 304 Query: 1081 HHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATLSS 1260 + H VE HKDLQKGISLLKKSVACITAYCYNSL L+ +EASTFEAF+KLLATLSS Sbjct: 305 YSSASQHSVEIHKDLQKGISLLKKSVACITAYCYNSLSLEVPAEASTFEAFSKLLATLSS 364 Query: 1261 SKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCDN---- 1428 SKEVR+ SM+ SSRS K Q+N SVW+V SA +SS+LLES + +MRN DN Sbjct: 365 SKEVRTVLSMRTVSSRS-KPGQQLNTSVWNVESAFSSSTLLESANVLPIMRNTFDNYLPS 423 Query: 1429 SSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 S+ S+LY E +D+GK E+++EGWD VEHP PPPPS ED+EHWTRAMFIDATKK Sbjct: 424 SAGSYLYGNEFADLGKNENLIEGWDFVEHPTLPPPPSHTEDVEHWTRAMFIDATKK 479 >ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264619 [Solanum lycopersicum] Length = 481 Score = 648 bits (1672), Expect = 0.0 Identities = 320/481 (66%), Positives = 388/481 (80%), Gaps = 8/481 (1%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M+RK+ C ICENSNL S+C CV YRLN+Y T LKSLK R++L +L+ L+AK KAD Sbjct: 1 MTRKTSCCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGQLSEILLAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQLSWRV +NEK+ +LRE++ K Q+SQGKAK++K+S++LKV+Y L SA ML++NR Sbjct: 61 DQLSWRVPRNEKLARLREKLRQQKEQVSQGKAKIEKMSHDLKVQYELLGSATRMLEKNRA 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPNLICTQ+LG MAITSE LHKQSVV+KQICKLFPQRRV+ D + KDGS+GQYD Sbjct: 121 EQLEKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 IC+ARLP+GLDPHSVP +EL+ SLGYM+QLLNLVV + 
APALH SGFAGSCSRIWQRD Sbjct: 181 SICNARLPKGLDPHSVPSDELSASLGYMVQLLNLVVRCVCAPALHNSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRS------SSNFGVASMESEKKPYLD 1059 +YWDARPSSRS EYPLFIPRQN C GGE SW DRS SSNFGV SMES++KP LD Sbjct: 241 SYWDARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSSSNSGTSSNFGVTSMESDRKPRLD 300 Query: 1060 -SAGVSFNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFA 1236 S+ SFN+ H +E HKDLQKGI+LLKKSVACITAYCYN+L L+ +EASTFE FA Sbjct: 301 SSSSSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFA 360 Query: 1237 KLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHA-TSLMR 1413 +LLATLSSSKEVRS FS+KM+ SR+ KQV +NKSVW+V+SAG+SS+L+ESGH + Sbjct: 361 RLLATLSSSKEVRSVFSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGHVPRNTFE 420 Query: 1414 NVCDNSSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATK 1593 +S + +Y E+S+VG+ E+++E WD++EHP FPPPPS ED+EHWTRAMFIDATK Sbjct: 421 KSLPSSGGNLMYATEVSNVGRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDATK 480 Query: 1594 K 1596 K Sbjct: 481 K 481 >ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590673 isoform X1 [Solanum tuberosum] gi|565375051|ref|XP_006354054.1| PREDICTED: uncharacterized protein LOC102590673 isoform X2 [Solanum tuberosum] Length = 483 Score = 648 bits (1671), Expect = 0.0 Identities = 320/483 (66%), Positives = 387/483 (80%), Gaps = 10/483 (2%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M+ K+ C ICENSNL S+C CV YRLN+Y T LKSLK R++L +L+ L+AK KAD Sbjct: 1 MTLKTSCCGICENSNLPSVCTLCVNYRLNEYSTVLKSLKGRREALCGKLSEILLAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQLSWRV +NEK+ +LRE++ K Q+SQGKAK++K+S++LKV+Y L SA ML++NR Sbjct: 61 DQLSWRVPRNEKLARLREKLRQQKEQISQGKAKIEKMSHDLKVQYELLGSATRMLEKNRA 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+KFYPNLICTQ+LG MAITSE LHKQSVV+KQICKLFPQRRV+ D + KDGS+GQYD Sbjct: 121 EQLEKFYPNLICTQNLGHMAITSELLHKQSVVVKQICKLFPQRRVTIDGDKKDGSSGQYD 180 Query: 718 
QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 IC+ARLP+GLDPHSVP +EL+ SLGYM+QLLNLV+ + APALH SGFAGSCSRIWQRD Sbjct: 181 SICNARLPKGLDPHSVPSDELSASLGYMVQLLNLVIRCVCAPALHNSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRS------SSNFGVASMESEKKPYLD 1059 +YWDARPSSRS EYPLFIPRQN C GGE SW DRS SSNFGV SMES++KP LD Sbjct: 241 SYWDARPSSRSGEYPLFIPRQNFCSSGGEASWYDRSCSNSGTSSNFGVTSMESDRKPRLD 300 Query: 1060 -SAGVSFNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFA 1236 S+ SFN+ H +E HKDLQKGI+LLKKSVACITAYCYN+L L+ +EASTFE FA Sbjct: 301 SSSSSSFNYASASLHSIETHKDLQKGIALLKKSVACITAYCYNTLCLEVPAEASTFETFA 360 Query: 1237 KLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSL--- 1407 +LLATLSSSKEVRS FS+KM+ SR+ KQV +NKSVW+V+SAG+SS+L+ESGH L Sbjct: 361 RLLATLSSSKEVRSVFSLKMSGSRASKQVQPLNKSVWNVDSAGSSSTLMESGHVPVLRNT 420 Query: 1408 MRNVCDNSSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDA 1587 N +SS + +Y E+SD + E+++E WD++EHP FPPPPS ED+EHWTRAMFIDA Sbjct: 421 FENALPSSSGNLIYATEVSDARRNENLIEDWDLIEHPPFPPPPSHTEDVEHWTRAMFIDA 480 Query: 1588 TKK 1596 TKK Sbjct: 481 TKK 483 >ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum] gi|557098297|gb|ESQ38733.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum] Length = 474 Score = 633 bits (1632), Expect = e-179 Identities = 317/476 (66%), Positives = 388/476 (81%), Gaps = 3/476 (0%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M ++S +CAICEN+N ASIC+ CV YRL +Y T LKSLK+ RD+LY +L+ L AK KAD Sbjct: 1 MIKRSSNCAICENTNRASICSVCVNYRLIEYSTLLKSLKTRRDALYSKLSELLEAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQ +W+++QNEK+ L+ + K Q++QGKAK+++ S +LK+KY L+SA L+R RV Sbjct: 61 DQKNWKLIQNEKLSGLKNNLRRNKEQVTQGKAKIERESRDLKLKYGVLDSARSTLERIRV 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQ++K++PNLICTQSLG MAI+SERLHKQSVV+KQ+CKLFPQRRVS D E ++GS GQY+ Sbjct: 121 
EQVEKYFPNLICTQSLGHMAISSERLHKQSVVMKQVCKLFPQRRVSFDGESQNGSVGQYN 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 IC++RLP+GLDPHS+P EELA SLG M+QLLNLVVHNLAAPALH SGFAGSCSRIWQRD Sbjct: 181 LICNSRLPKGLDPHSIPSEELAASLGLMVQLLNLVVHNLAAPALHNSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKK-PYLDSAG-V 1071 +YWDARPS+RSNEYPLFIPRQN C ENSWTD++SSNFGVASMES++K LDS G Sbjct: 241 SYWDARPSTRSNEYPLFIPRQNYCSTSVENSWTDKNSSNFGVASMESDRKEARLDSTGRN 300 Query: 1072 SFNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLAT 1251 SFN+ PH VE+H+DLQKGI+LLKKSVAC+TAYCYNSL L+ EASTFEAFAKLLAT Sbjct: 301 SFNYSSASPHSVESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLAT 360 Query: 1252 LSSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGH-ATSLMRNVCDN 1428 LSSSKEVRS FS+KMASSRSCKQ Q+NKS+W+ +S SSS+LES H + N N Sbjct: 361 LSSSKEVRSVFSLKMASSRSCKQAQQLNKSIWNAHSV-ISSSILESSHLPRNASYNQDPN 419 Query: 1429 SSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 S+ S+L E+S++ K+ + GWD+VEHPK+PPPPSQ+ED+EHWTRAMFIDA KK Sbjct: 420 SAASYLSGTELSEIRKSND-MNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 474 >ref|XP_004138644.1| PREDICTED: uncharacterized protein LOC101217421 [Cucumis sativus] gi|449524750|ref|XP_004169384.1| PREDICTED: uncharacterized LOC101217421 [Cucumis sativus] Length = 476 Score = 615 bits (1586), Expect = e-173 Identities = 315/477 (66%), Positives = 375/477 (78%), Gaps = 4/477 (0%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M+RK +CAICENSN ASIC CV RLNDY + LKSL++ RD LY RL+ LVAK KAD Sbjct: 1 MNRKFCNCAICENSNQASICTGCVNLRLNDYNSSLKSLRARRDVLYSRLSDVLVAKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQL+WRV +NEK+ LRE++ ++ QL QGKA+++ S +L++KYA LESA +L++ R+ Sbjct: 61 DQLNWRVTRNEKLTSLREKLRRSREQLEQGKAEIEMKSFDLQLKYAMLESARSVLEKQRL 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQL+K YP+LI T++LG MAITSERLHKQSVVIKQ+CKLFPQRRV E + G +D Sbjct: 121 
EQLEKAYPDLISTKNLGHMAITSERLHKQSVVIKQLCKLFPQRRVLVRGEKEVGPGEPFD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 QIC+ LPR LDPHSV P EL+ SLGYM+QLLNLVV LAAPALH SGFAGSCSRIWQRD Sbjct: 181 QICNVSLPRSLDPHSVEPYELSASLGYMVQLLNLVVQYLAAPALHTSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDS-AGVS 1074 +YW+A PSSRSNEYP+F+PRQ+ C GENSW+D+SSSNFGVAS+ESE+KP L S S Sbjct: 241 SYWNACPSSRSNEYPVFMPRQSYCSTSGENSWSDKSSSNFGVASLESERKPQLSSLENRS 300 Query: 1075 FNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATL 1254 FN+ PH +E+HKDLQKGI+LLKKSVAC+TAY YNSL LD SEASTFEAFAKLLATL Sbjct: 301 FNYSSASPHSIESHKDLQKGIALLKKSVACVTAYGYNSLSLDVPSEASTFEAFAKLLATL 360 Query: 1255 SSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCDN-- 1428 SSSKEVRS FS+KMASSRS K + + KS W+VNS SS L ESGH+ + N N Sbjct: 361 SSSKEVRSVFSLKMASSRSTKHIQKPIKSTWNVNSI-ASSMLFESGHSQIMKTNYESNLP 419 Query: 1429 -SSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 S++S+LY E SD GK +S +EGWD+VEHP FPPPPSQ EDIEHWTRAM IDATK+ Sbjct: 420 SSASSYLYATEFSDTGKNDSSIEGWDLVEHPTFPPPPSQAEDIEHWTRAMIIDATKQ 476 >ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabidopsis thaliana] gi|23297675|gb|AAN13006.1| unknown protein [Arabidopsis thaliana] gi|332657255|gb|AEE82655.1| DNA-directed RNA polymerase II protein [Arabidopsis thaliana] Length = 473 Score = 613 bits (1580), Expect = e-173 Identities = 310/476 (65%), Positives = 378/476 (79%), Gaps = 3/476 (0%) Frame = +1 Query: 178 MSRKSGSCAICENSNLASICATCVKYRLNDYYTYLKSLKSARDSLYLRLTSELVAKSKAD 357 M+++S +CAIC+N+N IC CV +RL +Y T LKSLK+ RDSL R L +K KAD Sbjct: 1 MTKRSSNCAICDNTNRPCICTACVNHRLIEYNTLLKSLKTRRDSLLSRFNELLESKGKAD 60 Query: 358 DQLSWRVLQNEKIVKLRERVCYTKRQLSQGKAKVDKLSNELKVKYASLESAMHMLQRNRV 537 DQ +WR++QNEKI KL++++ K ++QGK K+++ S++LKVKY L+SA L++ RV Sbjct: 61 DQKNWRLIQNEKISKLKKKLKSNKELVTQGKVKIERGSSDLKVKYGVLDSARSTLEKTRV 120 Query: 538 EQLDKFYPNLICTQSLGLMAITSERLHKQSVVIKQICKLFPQRRVSADEEMKDGSTGQYD 717 EQ++K++PNLICTQSLG 
MAI+SERLHKQSVV+KQICKLFP RRVS D E ++GS QYD Sbjct: 121 EQVEKYFPNLICTQSLGHMAISSERLHKQSVVVKQICKLFPLRRVSFDGESQNGSVRQYD 180 Query: 718 QICSARLPRGLDPHSVPPEELAVSLGYMIQLLNLVVHNLAAPALHCSGFAGSCSRIWQRD 897 IC++RLP GLDPHS+P EELAVSLGYM+QLLNLVVHNLAAPALH SGFAGSCSRIWQRD Sbjct: 181 VICNSRLPSGLDPHSIPSEELAVSLGYMVQLLNLVVHNLAAPALHSSGFAGSCSRIWQRD 240 Query: 898 TYWDARPSSRSNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKK-PYLDSAGV- 1071 +YWD R S+RSNEYPLFIPR+N C ENSWTD++SSNFGVASMES++K P LDS G Sbjct: 241 SYWDGRTSTRSNEYPLFIPRRNYCSTSVENSWTDKNSSNFGVASMESDRKEPRLDSPGSN 300 Query: 1072 SFNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLAT 1251 SF + PH +E+H+DLQKGI+LLKKSVAC+TAYCYNSL L+ EASTFEAFAKLLAT Sbjct: 301 SFKYSSASPHSIESHRDLQKGIALLKKSVACLTAYCYNSLCLEVPPEASTFEAFAKLLAT 360 Query: 1252 LSSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGH-ATSLMRNVCDN 1428 LSSSKEVRS FS+KMASSRS KQ Q+NKS+W+ +S SSSLLES H + N N Sbjct: 361 LSSSKEVRSVFSLKMASSRSGKQAQQLNKSIWNAHSV-ISSSLLESAHLPRNTSYNQDPN 419 Query: 1429 SSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 1596 S S+L E+S + + + GWD+VEHPK+PPPPSQ+ED+EHWTRAMFIDA KK Sbjct: 420 SPASYLSATELST--RKNNDMNGWDLVEHPKYPPPPSQSEDVEHWTRAMFIDAKKK 473