BLASTX nr result
ID: Cocculus23_contig00007111
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Cocculus23_contig00007111
         (1227 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260...   588   e-165
ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626...   587   e-165
ref|XP_007018895.1| DNA-directed RNA polymerase II protein isofo...   585   e-164
ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292...   583   e-164
ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citr...   582   e-164
ref|XP_007225709.1| hypothetical protein PRUPE_ppa005050mg [Prun...   581   e-163
ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ri...   576   e-162
ref|XP_004514262.1| PREDICTED: uncharacterized protein LOC101503...   573   e-161
ref|XP_007141122.1| hypothetical protein PHAVU_008G169200g [Phas...   572   e-160
ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776...   570   e-160
ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Popu...   567   e-159
ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813...   564   e-158
ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago ...   558   e-156
gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis]     551   e-154
gb|EYU43567.1| hypothetical protein MIMGU_mgv1a005543mg [Mimulus...   551   e-154
ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Popu...   551   e-154
ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590...   545   e-152
ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264...
544 e-152 ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutr... 536 e-150 ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabido... 523 e-146 >ref|XP_002285818.1| PREDICTED: uncharacterized protein LOC100260778 [Vitis vinifera] gi|302141899|emb|CBI19102.3| unnamed protein product [Vitis vinifera] Length = 478 Score = 588 bits (1517), Expect = e-165 Identities = 291/389 (74%), Positives = 333/389 (85%), Gaps = 5/389 (1%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAKV+K+SN+LK+KY LESAM ML++NRVEQL+KFYPNLICTQ+LGLMAITSER HKQ Sbjct: 90 GKAKVEKMSNDLKLKYGLLESAMSMLEKNRVEQLEKFYPNLICTQNLGLMAITSERFHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV+ D E KDGS+ YDQIC+ RLPR LDPHSVP +ELA SLGYM+ Sbjct: 150 SVVIKQICKLFPQRRVNIDGEKKDGSSRPYDQICNVRLPRVLDPHSVPSDELAASLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVV+NLAAPALH SGFAGSCSRIWQR++YW+ RPSSRSNEYPLFIPRQN C GE Sbjct: 210 QLLNLVVYNLAAPALHNSGFAGSCSRIWQRESYWNPRPSSRSNEYPLFIPRQNLCSTNGE 269 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSAG-VSFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSW++RSSSNFG+ASMES++KP L+S+G SFN+ H VE HKDLQKGISLLKKSVA Sbjct: 270 NSWSERSSSNFGIASMESDRKPRLESSGSSSFNYSSASLHSVETHKDLQKGISLLKKSVA 329 Query: 509 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 C+T YCY+SL LD +EASTFEAFAKLLA LSSSKEVRS FS+KMA SRSCKQV Q+NKS Sbjct: 330 CLTTYCYSSLCLDVPTEASTFEAFAKLLAILSSSKEVRSVFSLKMACSRSCKQVQQLNKS 389 Query: 329 VWHVNSAGTSSSLLESGHATSLMRNVCD----NSSTSFLYTAEMSDVGKTESIVEGWDIV 162 +W++NSA +SS+LLES H + RN+ D NS+ SFLYT EMSD+GK ES++E WD+V Sbjct: 390 IWNMNSAISSSTLLESAHTLPMTRNIFDNNLPNSAASFLYTTEMSDIGKNESLIEEWDLV 449 Query: 161 EHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 EH FPPPPSQ EDIEHWTRAM IDATKK Sbjct: 450 EHANFPPPPSQTEDIEHWTRAMIIDATKK 478 >ref|XP_006472675.1| PREDICTED: uncharacterized protein LOC102626964 isoform X1 [Citrus sinensis] gi|568837325|ref|XP_006472676.1| PREDICTED: uncharacterized 
protein LOC102626964 isoform X2 [Citrus sinensis] Length = 478 Score = 587 bits (1513), Expect = e-165 Identities = 289/389 (74%), Positives = 330/389 (84%), Gaps = 5/389 (1%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GK K++K S +LKV+YA L+SA M+++NR EQL+KFYPN+ICTQSLG MAI SE LHKQ Sbjct: 90 GKLKIEKSSYDLKVRYAILDSARSMMEKNRAEQLEKFYPNIICTQSLGHMAIVSELLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV+ D E +DGS+GQYDQIC ARLP+GLDPHSVP EELA SLGYM+ Sbjct: 150 SVVIKQICKLFPQRRVNIDGERRDGSSGQYDQICGARLPKGLDPHSVPSEELAASLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVV NLA P LH SGFAGSCSRIWQRD+YWDARPSSRSNEYPLFIPRQN C GE Sbjct: 210 QLLNLVVLNLAVPILHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGE 269 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSA-GVSFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSWTDRSSSNFGVASMESE++P LDS+ SFN+ H VE HKDLQKGISLLKKSVA Sbjct: 270 NSWTDRSSSNFGVASMESERRPQLDSSRSTSFNYTSASTHSVETHKDLQKGISLLKKSVA 329 Query: 509 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 C+TAYCYNSL LD +EASTFEAFAKLLATLSSSKEVRS FS+KMA SRSCKQV ++N+S Sbjct: 330 CLTAYCYNSLCLDVPAEASTFEAFAKLLATLSSSKEVRSVFSLKMACSRSCKQVQKLNRS 389 Query: 329 VWHVNSAGTSSSLLESGHATSLMRNVCDN----SSTSFLYTAEMSDVGKTESIVEGWDIV 162 VW++NSA +S++LLES H + +N+ DN S+ SFLY EMSD+GK ES+++GWD+V Sbjct: 390 VWNMNSAISSTTLLESAHMFPITKNLSDNNLPSSAASFLYATEMSDIGKNESLIDGWDLV 449 Query: 161 EHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 EHP FPPPPSQ ED+EHWTRAM IDATKK Sbjct: 450 EHPTFPPPPSQTEDVEHWTRAMIIDATKK 478 >ref|XP_007018895.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao] gi|508724223|gb|EOY16120.1| DNA-directed RNA polymerase II protein isoform 1 [Theobroma cacao] Length = 479 Score = 585 bits (1507), Expect = e-164 Identities = 291/389 (74%), Positives = 331/389 (85%), Gaps = 5/389 (1%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAK++++S +LKVKY LESA 
ML++NRVE+L+KFYPNLICTQSLGLMAITSERLHKQ Sbjct: 91 GKAKIERVSYDLKVKYGVLESARGMLEKNRVEKLEKFYPNLICTQSLGLMAITSERLHKQ 150 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV+ D E +DGS GQYD IC+ LPRGLDPHSVP E+LA SLGYM+ Sbjct: 151 SVVIKQICKLFPQRRVNLDGEGRDGSCGQYDLICNVGLPRGLDPHSVPSEQLAASLGYMV 210 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVVHNLAAPALH SGFAGSCSRIWQRD+YW+ARPSSRSNEYPLFIPRQN C G+ Sbjct: 211 QLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQNYCSTSGD 270 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSAGV-SFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSWTDRSSSNFGVASMESE++P LDS+G SFN+ H VE HKDLQ GISLLKKSVA Sbjct: 271 NSWTDRSSSNFGVASMESERRPRLDSSGSNSFNYSSASSHTVETHKDLQIGISLLKKSVA 330 Query: 509 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 CITA+CYNSL LD +EASTFEAF+KLLATLSS+KEVRS FS+KMA SRS KQ Q+NKS Sbjct: 331 CITAFCYNSLCLDVPTEASTFEAFSKLLATLSSTKEVRSVFSLKMACSRSSKQAQQLNKS 390 Query: 329 VWHVNSAGTSSSLLESGHATSLMRNVCD----NSSTSFLYTAEMSDVGKTESIVEGWDIV 162 VW+VNSA +SS LLES H L +N+ D +S+ SFL+ EM D+GK ES++E WD+V Sbjct: 391 VWNVNSAMSSSMLLESAHMLPLTKNLSDHNLPSSAASFLFATEMPDIGKNESLIEEWDLV 450 Query: 161 EHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 EHP FPPPPSQ ED+EHWTRAMFIDATK+ Sbjct: 451 EHPTFPPPPSQTEDVEHWTRAMFIDATKR 479 >ref|XP_004300664.1| PREDICTED: uncharacterized protein LOC101292418 [Fragaria vesca subsp. 
vesca] Length = 478 Score = 583 bits (1503), Expect = e-164 Identities = 293/389 (75%), Positives = 331/389 (85%), Gaps = 5/389 (1%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAK++K S +LKVKY LESA+ ML++NR EQL+KFYPNLICTQSLG MAITSERLHKQ Sbjct: 91 GKAKIEKTSYDLKVKYGVLESALSMLEKNRAEQLEKFYPNLICTQSLGHMAITSERLHKQ 150 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV+ D + K+GS GQYDQIC+A LPRGLDPHSVP EELA SLGYM+ Sbjct: 151 SVVIKQICKLFPQRRVTVDAKRKEGSGGQYDQICNASLPRGLDPHSVPSEELAASLGYMV 210 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVV NL APALH SGFAGSCSRIWQRD+YWDARPSSRSNEYPLFIPRQN C GE Sbjct: 211 QLLNLVVQNLGAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGE 270 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSAG-VSFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSW+DRSSSNFGVAS+ESE+KP LDS+G SFN+ H VE HKDLQ+GISLLKKSVA Sbjct: 271 NSWSDRSSSNFGVASIESERKPRLDSSGSSSFNYSSASQHSVETHKDLQRGISLLKKSVA 330 Query: 509 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 CITAYCYNSL LD SEASTFEAFAKLL+TLSSSKEV S FS+KMA SRSCKQV Q+NKS Sbjct: 331 CITAYCYNSLCLDVPSEASTFEAFAKLLSTLSSSKEVHSVFSLKMACSRSCKQVQQLNKS 390 Query: 329 VWHVNSAGTSSSLLESGHATSLMRNVCDNS----STSFLYTAEMSDVGKTESIVEGWDIV 162 VW+VNSA +S++LL+S H ++ +N +N+ +TSFL + EMSDVGK E +EGWD+V Sbjct: 391 VWNVNSAISSTTLLDSAHTMTMTKNFYENNIPNYATSFLSSTEMSDVGKNECTIEGWDLV 450 Query: 161 EHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 EHP PPPSQ+EDIEHWTRAMFID TK+ Sbjct: 451 EHPTL-PPPSQSEDIEHWTRAMFIDVTKR 478 >ref|XP_006434072.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|567883029|ref|XP_006434073.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|567883031|ref|XP_006434074.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|567883033|ref|XP_006434075.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|557536194|gb|ESR47312.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|557536195|gb|ESR47313.1| 
hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|557536196|gb|ESR47314.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] gi|557536197|gb|ESR47315.1| hypothetical protein CICLE_v10001001mg [Citrus clementina] Length = 478 Score = 582 bits (1501), Expect = e-164 Identities = 287/389 (73%), Positives = 328/389 (84%), Gaps = 5/389 (1%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GK K++K S +LK +YA L+SA M+++NR EQL+KFYPN+ICTQSLG MAI SE LHKQ Sbjct: 90 GKLKIEKSSYDLKGRYAILDSARSMMEKNRAEQLEKFYPNIICTQSLGHMAIVSELLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV+ D E +DGS+GQYDQIC ARLP+GLDPHSVP EELA SLGYM+ Sbjct: 150 SVVIKQICKLFPQRRVNIDGERRDGSSGQYDQICGARLPKGLDPHSVPSEELAASLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVV NLA P LH SGFAGSCSRIWQRD+YWDARPSSRSNEYPLFIPRQN C GE Sbjct: 210 QLLNLVVLNLAVPVLHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGE 269 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSA-GVSFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSWTDRSSSNFGVASMESE++P LDS+ SFN+ H VE HKDLQKGISLLKKSVA Sbjct: 270 NSWTDRSSSNFGVASMESERRPQLDSSRSASFNYTSASTHSVETHKDLQKGISLLKKSVA 329 Query: 509 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 C+TAYCYNSL LD +EASTFEAFAKLLATLS SKEVRS FS+KMA SRSCKQV ++N+S Sbjct: 330 CLTAYCYNSLCLDVPAEASTFEAFAKLLATLSLSKEVRSVFSLKMACSRSCKQVQKLNRS 389 Query: 329 VWHVNSAGTSSSLLESGHATSLMRNVCDN----SSTSFLYTAEMSDVGKTESIVEGWDIV 162 VW++NSA +S++LLES H + +N+ DN S+ SFLY EMSD+GK ES+++GWD+V Sbjct: 390 VWNMNSAISSTTLLESAHMFPITKNLSDNNLPSSAASFLYATEMSDIGKNESLIDGWDLV 449 Query: 161 EHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 EHP FPPPPSQ ED+EHWTRAM IDATKK Sbjct: 450 EHPTFPPPPSQTEDVEHWTRAMIIDATKK 478 >ref|XP_007225709.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica] gi|596287022|ref|XP_007225710.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica] gi|462422645|gb|EMJ26908.1| hypothetical protein PRUPE_ppa005050mg [Prunus 
persica] gi|462422646|gb|EMJ26909.1| hypothetical protein PRUPE_ppa005050mg [Prunus persica] Length = 479 Score = 581 bits (1498), Expect = e-163 Identities = 291/389 (74%), Positives = 333/389 (85%), Gaps = 5/389 (1%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAK++K S +LKVK LESA+ +L++NR EQL+KFYPN ICTQ+LG MAITSERLHKQ Sbjct: 91 GKAKIEKTSYDLKVKSGVLESALAVLEKNRAEQLEKFYPNFICTQNLGHMAITSERLHKQ 150 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV+ D + KD S GQYDQIC+A LPRGLDPHSVP EELA SLGYM+ Sbjct: 151 SVVIKQICKLFPQRRVTVDAKRKDASGGQYDQICNACLPRGLDPHSVPSEELAASLGYMV 210 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVV NLAAPALH SGFAGSCSRIWQRD+YWDARPSSRSNEYPLFIPRQN C GE Sbjct: 211 QLLNLVVQNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGE 270 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSAG-VSFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSW+DRSSSNFGVAS++SE+KP+LDS+G SFN+ H VE HKDLQ+GISLLKKSVA Sbjct: 271 NSWSDRSSSNFGVASIDSERKPHLDSSGSSSFNYTSASQHSVETHKDLQRGISLLKKSVA 330 Query: 509 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 CITAYCYNSL LD SEASTFEAFAKLLATLSSSKEV S FS+KMA SRSCKQV Q+NKS Sbjct: 331 CITAYCYNSLCLDVPSEASTFEAFAKLLATLSSSKEVHSVFSLKMACSRSCKQVQQLNKS 390 Query: 329 VWHVNSAGTSSSLLESGHATSLMRNVCDNS----STSFLYTAEMSDVGKTESIVEGWDIV 162 VW+VNSA +S++LL+S HA ++ +N+ + + +TS L + E+SD GK ES+VEGWD+V Sbjct: 391 VWNVNSAISSTTLLDSAHAMTMTKNLYEYNLPTYATSSLCSTELSDSGKNESLVEGWDLV 450 Query: 161 EHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 EHP FPPPPSQ+EDIEHWTRAMFIDA +K Sbjct: 451 EHPTFPPPPSQSEDIEHWTRAMFIDAKRK 479 >ref|XP_002533875.1| DNA-directed RNA polymerase II, putative [Ricinus communis] gi|223526176|gb|EEF28506.1| DNA-directed RNA polymerase II, putative [Ricinus communis] Length = 478 Score = 576 bits (1484), Expect = e-162 Identities = 284/388 (73%), Positives = 328/388 (84%), Gaps = 5/388 (1%) Frame = -2 Query: 1223 KAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQS 1044 
KAK +K+S++L KY LES+ L++NRV+QL+K++PNLICTQSLG MAITSE LH S Sbjct: 91 KAKTEKMSSDLNAKYGLLESSRSALEKNRVDQLEKYFPNLICTQSLGHMAITSELLHNLS 150 Query: 1043 VVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMIQ 864 V +KQICKLFPQRRV + E KDGS+GQYDQIC+ARLPRGLDPHS+P EELA SLGYM+Q Sbjct: 151 VTVKQICKLFPQRRVIVEGEKKDGSSGQYDQICNARLPRGLDPHSIPSEELAASLGYMVQ 210 Query: 863 LLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGEN 684 LLNLVVHNLAAPALH SGFAGSCSRIWQRD+YW+ARPSSRSNEYPLFIPRQ C GEN Sbjct: 211 LLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWNARPSSRSNEYPLFIPRQRYCSTSGEN 270 Query: 683 SWTDRSSSNFGVASMESEKKPYLDSA-GVSFNHHPTCPHLVENHKDLQKGISLLKKSVAC 507 SWTDRSSSNFGVASMESE++ LDS+ SFN++ PH VE HKDLQKGISL+KKSVAC Sbjct: 271 SWTDRSSSNFGVASMESERRARLDSSRSSSFNYNSASPHSVETHKDLQKGISLMKKSVAC 330 Query: 506 ITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKSV 327 +TAY YN L LD +EASTFEAFAKLLATLSSSKEVRS FS+KMA SRSCKQV ++NKSV Sbjct: 331 VTAYGYNLLCLDVPAEASTFEAFAKLLATLSSSKEVRSVFSLKMACSRSCKQVQKLNKSV 390 Query: 326 WHVNSAGTSSSLLESGHATSLMRNVCD----NSSTSFLYTAEMSDVGKTESIVEGWDIVE 159 W+VNS +SS+L+ES HA L +N+ D NS+TSFL+ E+SD GK ES+++GWD+VE Sbjct: 391 WNVNSIISSSTLMESAHAPHLTKNINDNNLRNSATSFLFANEISDAGKNESLIDGWDLVE 450 Query: 158 HPKFPPPPSQNEDIEHWTRAMFIDATKK 75 HP FPPPPSQ ED+EHWTRAMFIDATKK Sbjct: 451 HPTFPPPPSQTEDVEHWTRAMFIDATKK 478 >ref|XP_004514262.1| PREDICTED: uncharacterized protein LOC101503483 isoform X1 [Cicer arietinum] Length = 427 Score = 573 bits (1477), Expect = e-161 Identities = 287/387 (74%), Positives = 331/387 (85%), Gaps = 3/387 (0%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 G+AKV+ LS +LK+KY LESA+ ML++NRVEQL+KFYPNLICTQSLG +AITSERLHKQ Sbjct: 42 GRAKVEALSADLKLKYGVLESALSMLEKNRVEQLEKFYPNLICTQSLGHVAITSERLHKQ 101 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV + E +D +GQYDQIC+ARLPR LDPHSVP E L+ SLGYM+ Sbjct: 102 SVVIKQICKLFPQRRVVIEGERRDDCSGQYDQICNARLPRALDPHSVPSEALSTSLGYMV 161 Query: 866 
QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVVHNLAAPALH SGFAGSCSRIWQRD+YWDARPSSRSNEYPLFIPRQN C GE Sbjct: 162 QLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSGE 221 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSAG-VSFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSW+D+SSSNFGVASMES+++P LDS+G SFN+ H V+ HKDLQKGISLLKKSVA Sbjct: 222 NSWSDKSSSNFGVASMESDRRPRLDSSGSSSFNYSLGSSHSVQTHKDLQKGISLLKKSVA 281 Query: 509 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 CITAYCYNSL LD EASTFEAFAKLLATLSSSKEVRS FS+KMA SR+CKQV Q+NKS Sbjct: 282 CITAYCYNSLCLDVPIEASTFEAFAKLLATLSSSKEVRSVFSLKMARSRTCKQVQQLNKS 341 Query: 329 VWHVNSAGTSSSLLESGHA--TSLMRNVCDNSSTSFLYTAEMSDVGKTESIVEGWDIVEH 156 VW++NSA +S++LLES H+ T+ + N +S+ SFLY + SD GK+E ++EGWDIVEH Sbjct: 342 VWNMNSAISSTTLLESAHSVPTTRIENYMPSSAASFLYPTDSSD-GKSECLIEGWDIVEH 400 Query: 155 PKFPPPPSQNEDIEHWTRAMFIDATKK 75 P PPPPSQ+ED+EHWTRAMFIDA +K Sbjct: 401 PTLPPPPSQSEDVEHWTRAMFIDAKRK 427 >ref|XP_007141122.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris] gi|593488511|ref|XP_007141123.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris] gi|561014255|gb|ESW13116.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris] gi|561014256|gb|ESW13117.1| hypothetical protein PHAVU_008G169200g [Phaseolus vulgaris] Length = 476 Score = 572 bits (1474), Expect = e-160 Identities = 286/388 (73%), Positives = 332/388 (85%), Gaps = 4/388 (1%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 G+AK++ +S +LK KY LESA+ L++NRVEQL+KFYPNLICTQSLG +AITSERLHKQ Sbjct: 90 GRAKIETVSADLKHKYGLLESALSTLEKNRVEQLEKFYPNLICTQSLGHVAITSERLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV + E++DG +GQYDQIC+ARLPR LDPHSVP EEL+ SLGYM+ Sbjct: 150 SVVIKQICKLFPQRRVVIEGEIRDGCSGQYDQICNARLPRALDPHSVPSEELSASLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVVHNLAAPALH SGFAGSCSRIWQRD+YWDARPSSRSNEYPLFIPRQN C GE Sbjct: 210 
QLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTAGE 269 Query: 686 NSW-TDRSSSNFGVASMESEKKPYLDSAGVS-FNHHPTCPHLVENHKDLQKGISLLKKSV 513 NSW TD+SSSNFGVASMESEK+ LDS+G S FN+ H V+ HKDLQKGISLLKKSV Sbjct: 270 NSWSTDKSSSNFGVASMESEKRNRLDSSGNSNFNYSLASLHSVQTHKDLQKGISLLKKSV 329 Query: 512 ACITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNK 333 ACITAYCYNSL LD SEASTFE+FAKLLATLSSSKEVRS FS+KMA SR+CKQV Q+NK Sbjct: 330 ACITAYCYNSLCLDAPSEASTFESFAKLLATLSSSKEVRSVFSLKMAQSRTCKQVQQLNK 389 Query: 332 SVWHVNSAGTSSSLLESGHA--TSLMRNVCDNSSTSFLYTAEMSDVGKTESIVEGWDIVE 159 SVW++NS +S++LLES H+ T+ + N +S+ SFLY +++D GK E ++EGWDI+E Sbjct: 390 SVWNMNSVISSTTLLESAHSVPTTRIENYLPSSTASFLYATDLND-GKNECLIEGWDIIE 448 Query: 158 HPKFPPPPSQNEDIEHWTRAMFIDATKK 75 HP FPPPPSQ+ED+EHWTRAMFIDA +K Sbjct: 449 HPTFPPPPSQSEDVEHWTRAMFIDAKRK 476 >ref|XP_003544777.1| PREDICTED: uncharacterized protein LOC100776426 isoformX1 [Glycine max] Length = 475 Score = 570 bits (1468), Expect = e-160 Identities = 283/387 (73%), Positives = 331/387 (85%), Gaps = 3/387 (0%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 G+AK++ +S +LK+KY LESA+ L++NRVEQL+KFYPNLICTQSLG +AITSE LHK+ Sbjct: 90 GRAKIETMSADLKLKYGLLESALSTLEKNRVEQLEKFYPNLICTQSLGHVAITSELLHKE 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV + E +DG +GQYDQIC+ARLPR LDPHSVP EEL+ SLGYM+ Sbjct: 150 SVVIKQICKLFPQRRVVIEGERRDGCSGQYDQICNARLPRALDPHSVPSEELSTSLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLV+HNLAAPALH SGFAGSCSRIWQRD+YWDARPSSRSNEYPLFIPRQN C GE Sbjct: 210 QLLNLVIHNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTDGE 269 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSAG-VSFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSW++RSSSNFGVAS+ESE++ LDS+G SFN+ H V+ HKDLQKGISLLKKSV Sbjct: 270 NSWSERSSSNFGVASVESERRHRLDSSGSTSFNYSLASSHSVQTHKDLQKGISLLKKSVV 329 Query: 509 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 CITAYCYNSL LD SEASTFEAFAKLLATL+SSKEVRS 
FS+KMA SR+CKQV Q+NKS Sbjct: 330 CITAYCYNSLCLDVPSEASTFEAFAKLLATLASSKEVRSVFSLKMARSRTCKQVQQLNKS 389 Query: 329 VWHVNSAGTSSSLLESGHA--TSLMRNVCDNSSTSFLYTAEMSDVGKTESIVEGWDIVEH 156 VW++NSA +S++LLES H+ T+ + N +S+ SFLY A++SD GK E ++EGWDIVEH Sbjct: 390 VWNMNSAISSTTLLESAHSVPTTRIENYLPSSTGSFLYAADLSD-GKNECLIEGWDIVEH 448 Query: 155 PKFPPPPSQNEDIEHWTRAMFIDATKK 75 P FPPPPSQ+ED+EHWTRAMFIDA K Sbjct: 449 PTFPPPPSQSEDVEHWTRAMFIDAKGK 475 >ref|XP_002302270.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|566157047|ref|XP_006386388.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|566157050|ref|XP_006386389.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|222843996|gb|EEE81543.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|550344610|gb|ERP64185.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|550344611|gb|ERP64186.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] Length = 475 Score = 567 bits (1461), Expect = e-159 Identities = 287/389 (73%), Positives = 325/389 (83%), Gaps = 5/389 (1%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAKV+KLS +LK K LESA ++L++NR+EQL+KFYPNLICTQSLG MAITSE LHKQ Sbjct: 90 GKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQLEKFYPNLICTQSLGHMAITSELLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV+ D E +GQYDQIC+ARLPRGLDPHSV EELA SLGYM+ Sbjct: 150 SVVIKQICKLFPQRRVNVDGERN--FSGQYDQICNARLPRGLDPHSVSSEELAASLGYMV 207 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLV HNLAAP LH +GFAGSCSRIWQRD+YW+A PSSRSNEYPLFIPRQN C E Sbjct: 208 QLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSYWNACPSSRSNEYPLFIPRQNYCSTSSE 267 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSA-GVSFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSWTD+SSSNFGVASMESE++P+LDS SFN+ PH VE HKDLQKG+SLLKKSVA Sbjct: 268 NSWTDKSSSNFGVASMESERRPHLDSTRSNSFNYSSVSPHSVETHKDLQKGVSLLKKSVA 327 Query: 509 
CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 C+TAYCYN L LD S+ STFEAFAKLL+TLSSSKEVRS F++KMA SRSCKQV ++NKS Sbjct: 328 CVTAYCYNLLCLDVPSDTSTFEAFAKLLSTLSSSKEVRSVFNLKMACSRSCKQVQKLNKS 387 Query: 329 VWHVNSAGTSSSLLESGHATSLMRNVCD----NSSTSFLYTAEMSDVGKTESIVEGWDIV 162 VW+VNSA +SS+LLES HA LM+N D NS+ SFL+ +SD GK ES ++GWD+V Sbjct: 388 VWNVNSAISSSALLESAHALQLMKNTSDNNLPNSAASFLFATGISD-GKNESFIDGWDLV 446 Query: 161 EHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 EHP FPPPPSQ EDIEHWTRAMFIDATKK Sbjct: 447 EHPTFPPPPSQVEDIEHWTRAMFIDATKK 475 >ref|XP_003542641.1| PREDICTED: uncharacterized protein LOC100813297 [Glycine max] Length = 474 Score = 564 bits (1453), Expect = e-158 Identities = 283/387 (73%), Positives = 326/387 (84%), Gaps = 3/387 (0%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 G+AK++ S +LK+KY LESA+ L++NRVEQL+KFYPNLICTQSLG +AITSERLHKQ Sbjct: 90 GRAKIETKSADLKLKYGLLESALSTLEKNRVEQLEKFYPNLICTQSLGHVAITSERLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV + E DG GQ+DQIC+ARLPR LDP SVP EEL+ SLGYM+ Sbjct: 150 SVVIKQICKLFPQRRVVIEGERGDGCCGQFDQICNARLPRALDPRSVPSEELSTSLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNL+VHNLAAPALH SGFAGSCSRIWQRD+YWDARPSSRSNEYPLFIPRQN C GGE Sbjct: 210 QLLNLIVHNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTGGE 269 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSAG-VSFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSW++RSSSNFGVASMESE++ LDS+G SFN+ H V+ HKDLQKGISLLKKSVA Sbjct: 270 NSWSERSSSNFGVASMESERRHRLDSSGSSSFNYSLASSHSVQTHKDLQKGISLLKKSVA 329 Query: 509 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 CITAYCYNSL LD SEASTFEAFAKLLATLSSSKEVRS FS+KM SR+CKQV Q+NKS Sbjct: 330 CITAYCYNSLCLDVPSEASTFEAFAKLLATLSSSKEVRSVFSLKMPRSRTCKQVQQLNKS 389 Query: 329 VWHVNSAGTSSSLLESGHA--TSLMRNVCDNSSTSFLYTAEMSDVGKTESIVEGWDIVEH 156 VW++NSA +S++LLES H+ T+ + N +++ SFLY + GK E +VEGWDIVEH Sbjct: 390 VWNMNSAISSTTLLESAHSVPTTRIENYLPSATASFLYATDSD--GKNECLVEGWDIVEH 
447 Query: 155 PKFPPPPSQNEDIEHWTRAMFIDATKK 75 P FPPPPSQ+ED+EHWTRAMFIDA +K Sbjct: 448 PTFPPPPSQSEDVEHWTRAMFIDAKRK 474 >ref|XP_003614901.1| hypothetical protein MTR_5g061040 [Medicago truncatula] gi|355516236|gb|AES97859.1| hypothetical protein MTR_5g061040 [Medicago truncatula] Length = 501 Score = 558 bits (1439), Expect = e-156 Identities = 283/397 (71%), Positives = 327/397 (82%), Gaps = 16/397 (4%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 G+AK+ +S +LK+KY LESA+ ML++NRVEQL+KFYPNLICTQSLG +AITSERLHKQ Sbjct: 90 GRAKIQAMSADLKLKYGVLESALSMLEKNRVEQLEKFYPNLICTQSLGHVAITSERLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV + E D +GQYDQIC+ARLPR LDPHSVP EEL+ SLGYM+ Sbjct: 150 SVVIKQICKLFPQRRVVIEGEKGDDCSGQYDQICNARLPRALDPHSVPSEELSASLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSR-------------SNEYPL 726 QLLNLV HNLAAPALH SGFAGSCSRIWQRD+YWDARPSSR SNEYPL Sbjct: 210 QLLNLVAHNLAAPALHNSGFAGSCSRIWQRDSYWDARPSSRSKNFFNLKYSLFFSNEYPL 269 Query: 725 FIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSAG-VSFNHHPTCPHLVENHKD 549 FIPRQN C GENSW+++SSSNFGVASMES+++P LDS+G SFN+ H V++HKD Sbjct: 270 FIPRQNYCSTSGENSWSEKSSSNFGVASMESDRRPRLDSSGSSSFNYSLASSHSVQSHKD 329 Query: 548 LQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMAS 369 LQKGISLLKKSVACITAYCYNSL D SEASTFEAFAKLLATLSSSKEVRS FS+KMA Sbjct: 330 LQKGISLLKKSVACITAYCYNSLCFDIPSEASTFEAFAKLLATLSSSKEVRSVFSLKMAR 389 Query: 368 SRSCKQVPQMNKSVWHVNSAGTSSSLLESGHA--TSLMRNVCDNSSTSFLYTAEMSDVGK 195 SR+CKQV Q+NKSVW++NSA +S++LLES H+ T+ + N NS+ SFLY + SD K Sbjct: 390 SRTCKQVQQLNKSVWNMNSANSSTTLLESTHSVPTTRIENYMPNSAASFLYPTDSSD-RK 448 Query: 194 TESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDA 84 +E ++EGWDIVEHP PPPPSQ+ED+EHWTRAMFIDA Sbjct: 449 SECLIEGWDIVEHPTLPPPPSQSEDVEHWTRAMFIDA 485 >gb|EXC06677.1| hypothetical protein L484_021513 [Morus notabilis] Length = 478 Score = 551 bits (1421), Expect = e-154 Identities = 281/389 (72%), Positives = 319/389 (82%), Gaps = 5/389 (1%) 
Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAKV+++ +LKVK LE+A ML+ NR+EQL+KFYPN ICTQ+LG MAITSERLHKQ Sbjct: 90 GKAKVERMHYDLKVKSGVLEAARSMLENNRMEQLEKFYPNFICTQTLGHMAITSERLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFP RRV D E K+GS QYDQIC+ARLPRG+DPHSV EEL SLGYM+ Sbjct: 150 SVVIKQICKLFPHRRVIIDGERKNGSAEQYDQICNARLPRGVDPHSVASEELGASLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNL+V LAAPALH SGFAGS SRIWQRD+YWDARPSSRSNEYPLFIPRQN C E Sbjct: 210 QLLNLIVRILAAPALHNSGFAGSNSRIWQRDSYWDARPSSRSNEYPLFIPRQNYCSTSVE 269 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSAGV-SFNHHPTCPHLVENHKDLQKGISLLKKSVA 510 NSW+DRSSSNFGV S+ESE+K LDS+G SFN+ PH +E HKDLQKGISLLKKSVA Sbjct: 270 NSWSDRSSSNFGVTSIESERKVRLDSSGSNSFNYSSASPHSIETHKDLQKGISLLKKSVA 329 Query: 509 CITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKS 330 CIT YCYNSL LD SEASTFEAFAKLLATLSSSKE+RS S+K A SRS KQV Q+NKS Sbjct: 330 CITTYCYNSLCLDVPSEASTFEAFAKLLATLSSSKELRSVCSIKSACSRSNKQVQQLNKS 389 Query: 329 VWHVNSAGTSSSLLESGHATSLMRNVCDNS----STSFLYTAEMSDVGKTESIVEGWDIV 162 VW+VNSA S++LL+S H + M+N+ +N+ +TSFLY E SD GK E I+EGWD++ Sbjct: 390 VWNVNSAFASTTLLDSAHTVASMKNIGENNLPNPATSFLYATE-SDAGKNEFIIEGWDLI 448 Query: 161 EHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 EHP FPPPPSQ ED+EHWTRAMFIDATKK Sbjct: 449 EHPTFPPPPSQCEDVEHWTRAMFIDATKK 477 >gb|EYU43567.1| hypothetical protein MIMGU_mgv1a005543mg [Mimulus guttatus] Length = 479 Score = 551 bits (1420), Expect = e-154 Identities = 276/388 (71%), Positives = 321/388 (82%), Gaps = 4/388 (1%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAK++K S++LK+KY LESAM +++NR+EQ++K+YPNLICTQSLG MAITSERLHKQ Sbjct: 94 GKAKIEKRSHDLKLKYELLESAMDTMEKNRLEQIEKYYPNLICTQSLGHMAITSERLHKQ 153 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SV+IKQICKLFPQRRV+ D E KDG GQYD IC+ARLPRGLDPHSVP EELA SLGYM+ Sbjct: 154 
SVIIKQICKLFPQRRVNIDGESKDGYGGQYDTICNARLPRGLDPHSVPSEELAASLGYMV 213 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLV+H + APALH SGFAGSCSRIWQR++YWDARPS RS EYPLFIPRQN C GGE Sbjct: 214 QLLNLVIHTVCAPALHHSGFAGSCSRIWQRESYWDARPSPRS-EYPLFIPRQNFCTTGGE 272 Query: 686 NSWTDRSSSNFGVASMESEKKPYLDSAGVSFNHHPTCPHLVENHKDLQKGISLLKKSVAC 507 SW++RSSSNFGVASMES +KP L+S+G SFN+ H VE HKDLQKGISLLKKSVAC Sbjct: 273 TSWSERSSSNFGVASMESVRKPRLESSGGSFNYSSASQHSVEIHKDLQKGISLLKKSVAC 332 Query: 506 ITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNKSV 327 ITAYCYNSL L+ +EASTFEAF+KLLATLSSSKEVR+ SM+ SSRS K Q+N SV Sbjct: 333 ITAYCYNSLSLEVPAEASTFEAFSKLLATLSSSKEVRTVLSMRTVSSRS-KPGQQLNTSV 391 Query: 326 WHVNSAGTSSSLLESGHATSLMRNVCDN----SSTSFLYTAEMSDVGKTESIVEGWDIVE 159 W+V SA +SS+LLES + +MRN DN S+ S+LY E +D+GK E+++EGWD VE Sbjct: 392 WNVESAFSSSTLLESANVLPIMRNTFDNYLPSSAGSYLYGNEFADLGKNENLIEGWDFVE 451 Query: 158 HPKFPPPPSQNEDIEHWTRAMFIDATKK 75 HP PPPPS ED+EHWTRAMFIDATKK Sbjct: 452 HPTLPPPPSHTEDVEHWTRAMFIDATKK 479 >ref|XP_006386390.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] gi|550344612|gb|ERP64187.1| hypothetical protein POPTR_0002s09150g [Populus trichocarpa] Length = 506 Score = 551 bits (1419), Expect = e-154 Identities = 287/420 (68%), Positives = 325/420 (77%), Gaps = 36/420 (8%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAKV+KLS +LK K LESA ++L++NR+EQL+KFYPNLICTQSLG MAITSE LHKQ Sbjct: 90 GKAKVEKLSQDLKKKNGMLESARNVLEKNRMEQLEKFYPNLICTQSLGHMAITSELLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVVIKQICKLFPQRRV+ D E +GQYDQIC+ARLPRGLDPHSV EELA SLGYM+ Sbjct: 150 SVVIKQICKLFPQRRVNVDGERN--FSGQYDQICNARLPRGLDPHSVSSEELAASLGYMV 207 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSR------------------- 744 QLLNLV HNLAAP LH +GFAGSCSRIWQRD+YW+A PSSR Sbjct: 208 QLLNLVAHNLAAPTLHNAGFAGSCSRIWQRDSYWNACPSSRRYFDWKSLCFGISVAKFEL 267 Query: 743 
------------SNEYPLFIPRQNCCFPGGENSWTDRSSSNFGVASMESEKKPYLDSA-G 603 SNEYPLFIPRQN C ENSWTD+SSSNFGVASMESE++P+LDS Sbjct: 268 LLLSELNILCACSNEYPLFIPRQNYCSTSSENSWTDKSSSNFGVASMESERRPHLDSTRS 327 Query: 602 VSFNHHPTCPHLVENHKDLQKGISLLKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLA 423 SFN+ PH VE HKDLQKG+SLLKKSVAC+TAYCYN L LD S+ STFEAFAKLL+ Sbjct: 328 NSFNYSSVSPHSVETHKDLQKGVSLLKKSVACVTAYCYNLLCLDVPSDTSTFEAFAKLLS 387 Query: 422 TLSSSKEVRSAFSMKMASSRSCKQVPQMNKSVWHVNSAGTSSSLLESGHATSLMRNVCD- 246 TLSSSKEVRS F++KMA SRSCKQV ++NKSVW+VNSA +SS+LLES HA LM+N D Sbjct: 388 TLSSSKEVRSVFNLKMACSRSCKQVQKLNKSVWNVNSAISSSALLESAHALQLMKNTSDN 447 Query: 245 ---NSSTSFLYTAEMSDVGKTESIVEGWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 NS+ SFL+ +SD GK ES ++GWD+VEHP FPPPPSQ EDIEHWTRAMFIDATKK Sbjct: 448 NLPNSAASFLFATGISD-GKNESFIDGWDLVEHPTFPPPPSQVEDIEHWTRAMFIDATKK 506 >ref|XP_006354053.1| PREDICTED: uncharacterized protein LOC102590673 isoform X1 [Solanum tuberosum] gi|565375051|ref|XP_006354054.1| PREDICTED: uncharacterized protein LOC102590673 isoform X2 [Solanum tuberosum] Length = 483 Score = 545 bits (1405), Expect = e-152 Identities = 269/394 (68%), Positives = 321/394 (81%), Gaps = 10/394 (2%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAK++K+S++LKV+Y L SA ML++NR EQL+KFYPNLICTQ+LG MAITSE LHKQ Sbjct: 90 GKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQLEKFYPNLICTQNLGHMAITSELLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVV+KQICKLFPQRRV+ D + KDGS+GQYD IC+ARLP+GLDPHSVP +EL+ SLGYM+ Sbjct: 150 SVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSICNARLPKGLDPHSVPSDELSASLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLV+ + APALH SGFAGSCSRIWQRD+YWDARPSSRS EYPLFIPRQN C GGE Sbjct: 210 QLLNLVIRCVCAPALHNSGFAGSCSRIWQRDSYWDARPSSRSGEYPLFIPRQNFCSSGGE 269 Query: 686 NSWTDRS------SSNFGVASMESEKKPYLD-SAGVSFNHHPTCPHLVENHKDLQKGISL 528 SW DRS SSNFGV SMES++KP LD S+ SFN+ H +E HKDLQKGI+L Sbjct: 270 ASWYDRSCSNSGTSSNFGVTSMESDRKPRLDSSSSSSFNYASASLHSIETHKDLQKGIAL 329 
Query: 527 LKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQV 348 LKKSVACITAYCYN+L L+ +EASTFE FA+LLATLSSSKEVRS FS+KM+ SR+ KQV Sbjct: 330 LKKSVACITAYCYNTLCLEVPAEASTFETFARLLATLSSSKEVRSVFSLKMSGSRASKQV 389 Query: 347 PQMNKSVWHVNSAGTSSSLLESGHATSL---MRNVCDNSSTSFLYTAEMSDVGKTESIVE 177 +NKSVW+V+SAG+SS+L+ESGH L N +SS + +Y E+SD + E+++E Sbjct: 390 QPLNKSVWNVDSAGSSSTLMESGHVPVLRNTFENALPSSSGNLIYATEVSDARRNENLIE 449 Query: 176 GWDIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 WD++EHP FPPPPS ED+EHWTRAMFIDATKK Sbjct: 450 DWDLIEHPPFPPPPSHTEDVEHWTRAMFIDATKK 483 >ref|XP_004237862.1| PREDICTED: uncharacterized protein LOC101264619 [Solanum lycopersicum] Length = 481 Score = 544 bits (1401), Expect = e-152 Identities = 268/392 (68%), Positives = 321/392 (81%), Gaps = 8/392 (2%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAK++K+S++LKV+Y L SA ML++NR EQL+KFYPNLICTQ+LG MAITSE LHKQ Sbjct: 90 GKAKIEKMSHDLKVQYELLGSATRMLEKNRAEQLEKFYPNLICTQNLGHMAITSELLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVV+KQICKLFPQRRV+ D + KDGS+GQYD IC+ARLP+GLDPHSVP +EL+ SLGYM+ Sbjct: 150 SVVVKQICKLFPQRRVTIDGDKKDGSSGQYDSICNARLPKGLDPHSVPSDELSASLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVV + APALH SGFAGSCSRIWQRD+YWDARPSSRS EYPLFIPRQN C GGE Sbjct: 210 QLLNLVVRCVCAPALHNSGFAGSCSRIWQRDSYWDARPSSRSGEYPLFIPRQNFCSSGGE 269 Query: 686 NSWTDRS------SSNFGVASMESEKKPYLD-SAGVSFNHHPTCPHLVENHKDLQKGISL 528 SW DRS SSNFGV SMES++KP LD S+ SFN+ H +E HKDLQKGI+L Sbjct: 270 ASWYDRSSSNSGTSSNFGVTSMESDRKPRLDSSSSSSFNYASASLHSIETHKDLQKGIAL 329 Query: 527 LKKSVACITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQV 348 LKKSVACITAYCYN+L L+ +EASTFE FA+LLATLSSSKEVRS FS+KM+ SR+ KQV Sbjct: 330 LKKSVACITAYCYNTLCLEVPAEASTFETFARLLATLSSSKEVRSVFSLKMSGSRASKQV 389 Query: 347 PQMNKSVWHVNSAGTSSSLLESGHA-TSLMRNVCDNSSTSFLYTAEMSDVGKTESIVEGW 171 +NKSVW+V+SAG+SS+L+ESGH + +S + +Y E+S+VG+ E+++E W Sbjct: 390 
QPLNKSVWNVDSAGSSSTLMESGHVPRNTFEKSLPSSGGNLMYATEVSNVGRNENLIEDW 449 Query: 170 DIVEHPKFPPPPSQNEDIEHWTRAMFIDATKK 75 D++EHP FPPPPS ED+EHWTRAMFIDATKK Sbjct: 450 DLIEHPPFPPPPSHTEDVEHWTRAMFIDATKK 481 >ref|XP_006397280.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum] gi|557098297|gb|ESQ38733.1| hypothetical protein EUTSA_v10028627mg [Eutrema salsugineum] Length = 474 Score = 536 bits (1382), Expect = e-150 Identities = 270/387 (69%), Positives = 322/387 (83%), Gaps = 3/387 (0%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GKAK+++ S +LK+KY L+SA L+R RVEQ++K++PNLICTQSLG MAI+SERLHKQ Sbjct: 90 GKAKIERESRDLKLKYGVLDSARSTLERIRVEQVEKYFPNLICTQSLGHMAISSERLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVV+KQ+CKLFPQRRVS D E ++GS GQY+ IC++RLP+GLDPHS+P EELA SLG M+ Sbjct: 150 SVVMKQVCKLFPQRRVSFDGESQNGSVGQYNLICNSRLPKGLDPHSIPSEELAASLGLMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVVHNLAAPALH SGFAGSCSRIWQRD+YWDARPS+RSNEYPLFIPRQN C E Sbjct: 210 QLLNLVVHNLAAPALHNSGFAGSCSRIWQRDSYWDARPSTRSNEYPLFIPRQNYCSTSVE 269 Query: 686 NSWTDRSSSNFGVASMESEKK-PYLDSAG-VSFNHHPTCPHLVENHKDLQKGISLLKKSV 513 NSWTD++SSNFGVASMES++K LDS G SFN+ PH VE+H+DLQKGI+LLKKSV Sbjct: 270 NSWTDKNSSNFGVASMESDRKEARLDSTGRNSFNYSSASPHSVESHRDLQKGIALLKKSV 329 Query: 512 ACITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNK 333 AC+TAYCYNSL L+ EASTFEAFAKLLATLSSSKEVRS FS+KMASSRSCKQ Q+NK Sbjct: 330 ACLTAYCYNSLCLEVPPEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSCKQAQQLNK 389 Query: 332 SVWHVNSAGTSSSLLESGH-ATSLMRNVCDNSSTSFLYTAEMSDVGKTESIVEGWDIVEH 156 S+W+ +S SSS+LES H + N NS+ S+L E+S++ K+ + GWD+VEH Sbjct: 390 SIWNAHSV-ISSSILESSHLPRNASYNQDPNSAASYLSGTELSEIRKSND-MNGWDLVEH 447 Query: 155 PKFPPPPSQNEDIEHWTRAMFIDATKK 75 PK+PPPPSQ+ED+EHWTRAMFIDA KK Sbjct: 448 PKYPPPPSQSEDVEHWTRAMFIDAKKK 474 >ref|NP_192594.2| DNA-directed RNA polymerase II protein [Arabidopsis thaliana] gi|23297675|gb|AAN13006.1| unknown protein 
[Arabidopsis thaliana] gi|332657255|gb|AEE82655.1| DNA-directed RNA polymerase II protein [Arabidopsis thaliana] Length = 473 Score = 523 bits (1348), Expect = e-146 Identities = 266/387 (68%), Positives = 315/387 (81%), Gaps = 3/387 (0%) Frame = -2 Query: 1226 GKAKVDKLSNELKVKYASLESAMHMLQRNRVEQLDKFYPNLICTQSLGLMAITSERLHKQ 1047 GK K+++ S++LKVKY L+SA L++ RVEQ++K++PNLICTQSLG MAI+SERLHKQ Sbjct: 90 GKVKIERGSSDLKVKYGVLDSARSTLEKTRVEQVEKYFPNLICTQSLGHMAISSERLHKQ 149 Query: 1046 SVVIKQICKLFPQRRVSADEEMKDGSTGQYDQICSARLPRGLDPHSVPPEELAVSLGYMI 867 SVV+KQICKLFP RRVS D E ++GS QYD IC++RLP GLDPHS+P EELAVSLGYM+ Sbjct: 150 SVVVKQICKLFPLRRVSFDGESQNGSVRQYDVICNSRLPSGLDPHSIPSEELAVSLGYMV 209 Query: 866 QLLNLVVHNLAAPALHCSGFAGSCSRIWQRDTYWDARPSSRSNEYPLFIPRQNCCFPGGE 687 QLLNLVVHNLAAPALH SGFAGSCSRIWQRD+YWD R S+RSNEYPLFIPR+N C E Sbjct: 210 QLLNLVVHNLAAPALHSSGFAGSCSRIWQRDSYWDGRTSTRSNEYPLFIPRRNYCSTSVE 269 Query: 686 NSWTDRSSSNFGVASMESEKK-PYLDSAGV-SFNHHPTCPHLVENHKDLQKGISLLKKSV 513 NSWTD++SSNFGVASMES++K P LDS G SF + PH +E+H+DLQKGI+LLKKSV Sbjct: 270 NSWTDKNSSNFGVASMESDRKEPRLDSPGSNSFKYSSASPHSIESHRDLQKGIALLKKSV 329 Query: 512 ACITAYCYNSLFLDGLSEASTFEAFAKLLATLSSSKEVRSAFSMKMASSRSCKQVPQMNK 333 AC+TAYCYNSL L+ EASTFEAFAKLLATLSSSKEVRS FS+KMASSRS KQ Q+NK Sbjct: 330 ACLTAYCYNSLCLEVPPEASTFEAFAKLLATLSSSKEVRSVFSLKMASSRSGKQAQQLNK 389 Query: 332 SVWHVNSAGTSSSLLESGH-ATSLMRNVCDNSSTSFLYTAEMSDVGKTESIVEGWDIVEH 156 S+W+ +S SSSLLES H + N NS S+L E+S + + + GWD+VEH Sbjct: 390 SIWNAHSV-ISSSLLESAHLPRNTSYNQDPNSPASYLSATELST--RKNNDMNGWDLVEH 446 Query: 155 PKFPPPPSQNEDIEHWTRAMFIDATKK 75 PK+PPPPSQ+ED+EHWTRAMFIDA KK Sbjct: 447 PKYPPPPSQSEDVEHWTRAMFIDAKKK 473