BLASTX nr result
ID: Mentha29_contig00021198
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00021198 (1201 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score    E
Sequences producing significant alignments:                          (bits) Value

gb|EYU33944.1| hypothetical protein MIMGU_mgv1a001000mg [Mimulus...    424   e-116
gb|EPS66208.1| hypothetical protein M569_08569, partial [Genlise...    330   6e-88
ref|XP_004233815.1| PREDICTED: uncharacterized protein LOC101266...    328   2e-87
ref|XP_006347153.1| PREDICTED: uncharacterized protein LOC102600...    323   7e-86
ref|XP_006347154.1| PREDICTED: uncharacterized protein LOC102600...    318   4e-84
ref|XP_007025830.1| Lysine-specific demethylase 3B, putative iso...    316   1e-83
ref|XP_007025837.1| Lysine-specific demethylase 3B, putative iso...    310   8e-82
ref|XP_007025836.1| Lysine-specific demethylase 3B, putative iso...    310   8e-82
ref|XP_007025835.1| Lysine-specific demethylase 3B, putative iso...    310   8e-82
ref|XP_007025833.1| Lysine-specific demethylase 3B, putative iso...    310   8e-82
ref|XP_007025832.1| Lysine-specific demethylase 3B, putative iso...    310   8e-82
ref|XP_007025831.1| Lysine-specific demethylase 3B, putative iso...    310   8e-82
ref|XP_006467914.1| PREDICTED: uncharacterized protein LOC102608...    305   3e-80
ref|XP_006467915.1| PREDICTED: uncharacterized protein LOC102608...    300   6e-79
ref|XP_002524700.1| conserved hypothetical protein [Ricinus comm...    280   9e-73
ref|XP_007159238.1| hypothetical protein PHAVU_002G220900g [Phas...    274   5e-71
ref|XP_003532564.1| PREDICTED: uncharacterized protein LOC100810...    270   7e-70
ref|XP_003528426.1| PREDICTED: uncharacterized protein LOC100787...
263 1e-67 ref|XP_007213684.1| hypothetical protein PRUPE_ppa000920mg [Prun... 258 3e-66 ref|XP_006347155.1| PREDICTED: uncharacterized protein LOC102600... 257 8e-66 >gb|EYU33944.1| hypothetical protein MIMGU_mgv1a001000mg [Mimulus guttatus] Length = 916 Score = 424 bits (1091), Expect = e-116 Identities = 226/401 (56%), Positives = 263/401 (65%), Gaps = 2/401 (0%) Frame = +2 Query: 2 AKKPTVAA--ERRRRRGVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKEREFKEI 175 AKKP AA E+RRRR VSEALDEALK+M KR DLHL+LIR+FLKRQVEKKKE+E KE Sbjct: 83 AKKPVAAAVAEKRRRRCVSEALDEALKRMKLKRDDLHLDLIRVFLKRQVEKKKEKELKE- 141 Query: 176 AVAPVADETRVLPCGVMAISQSHCNLQSVQENDVLDVKIGGVSNADSLLQRHFRSKNIEP 355 P+ DETR LPCG+MAISQ+H +LQ END L+VK+G S+ LLQRHFRSKNIEP Sbjct: 142 -TTPIGDETRELPCGIMAISQAHSSLQKFPENDGLNVKVGVDSSNGFLLQRHFRSKNIEP 200 Query: 356 LPISTMQVFPLTGNVKAKKIKKCHWCRGSKCRCLIKCLTCKKRFFCLECIKERYFEKQEV 535 LPISTMQV P NVK K IK+CHWCR +K RCLIKCLTC+KRFFC++CIKERYFEKQEV Sbjct: 201 LPISTMQVVPFADNVKKKMIKRCHWCRDTKYRCLIKCLTCRKRFFCVDCIKERYFEKQEV 260 Query: 536 KAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXXXXXXXXXXKKVNRDQ 715 K++CPACRG C C LC+KQ++R HKE + GGRKLDRKQ KKVNRDQ Sbjct: 261 KSKCPACRGTCSCNLCIKQQMRANNHKECHRGGRKLDRKQLLHYLIYMLLPVLKKVNRDQ 320 Query: 716 KIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHRTCTKCSYNLCLSCCQ 895 ELE E G E+++ Q +L SF +KC+ I+D HRTCT+CSYNLCLSCC+ Sbjct: 321 NDELETESKVTGMRILELIL-QLRL---VSF--NKCRNSIVDYHRTCTECSYNLCLSCCR 374 Query: 896 ELSQRSSSGSFMLKSCKKRKISGDVKQIISRHNSRRPPSQSVISSQNWETSENGSIPCPP 1075 ELS+ S G Sbjct: 375 ELSRHSLHG--------------------------------------------------- 383 Query: 1076 TDIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILDSYNLPTT 1198 +IGGCG S LDLRC+FP NW RDLE KAE+IL SY+LP T Sbjct: 384 -NIGGCGDSFLDLRCMFPLNWTRDLEVKAEEILCSYHLPET 423 >gb|EPS66208.1| hypothetical protein M569_08569, partial [Genlisea aurea] Length = 861 Score = 330 bits (847), Expect = 6e-88 Identities = 190/398 (47%), Positives = 239/398 (60%), Gaps = 9/398 (2%) Frame = +2 Query: 26 ERRRRRGVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKERE-------FKEIAVA 
184 E++RRR VSEALD+ALKKM KRGDL LELIR+FLKRQVEKKKE+E +E+ Sbjct: 88 EKKRRRRVSEALDDALKKMKLKRGDLQLELIRVFLKRQVEKKKEKEKEKEKEKAQEVEEN 147 Query: 185 PVADETRVLPCGVMAIS-QSHCNLQSVQENDVLDVKIGGVSNADSLLQRHFRSKNIEPLP 361 +ETR LP GVMAIS S L + + LD K+G S S+LQRHFRSKNIEPLP Sbjct: 148 APENETRELPNGVMAISGSSSSGLLNHCKYGGLDFKVGDGSYDKSVLQRHFRSKNIEPLP 207 Query: 362 ISTMQVFPLTGNVKAKKIKKCHWCRGSKCRCLIKCLTCKKRFFCLECIKERYFEKQEVKA 541 IST++ P + KK K+CH+CR SK CLIKCL CKKRFFC++CIK+R+ +KQEVK Sbjct: 208 ISTVKAVPFVELLNKKKTKRCHFCRESKYGCLIKCLACKKRFFCVDCIKKRHLKKQEVKV 267 Query: 542 ECPACRGICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXXXXXXXXXXKKVNRDQKI 721 CPAC G+C CK+C+KQ+++ HK YG GRKLDRK KKVN D Sbjct: 268 RCPACSGLCRCKICMKQRVKAYNHKVCYGDGRKLDRKYLLHYLIYRLLPLLKKVNIDHSS 327 Query: 722 ELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHRTCTKCSYNLCLSCCQEL 901 EL E GK S L S K+ + CC+E+ Sbjct: 328 ELTTESRVTGKNDSFDLF---------FHALSLLKSNV-----------------CCREI 361 Query: 902 SQRSSSGSFMLKSCKKRKI-SGDVKQIISRHNSRRPPSQSVISSQNWETSENGSIPCPPT 1078 S+ S + + KKRK+ S D I+ ++SR P Q + TS+ ++PCPP Sbjct: 362 SKNGSCETSKPRRSKKRKMDSSDDNISINENDSRDNPLQIL------GTSKYCAVPCPPL 415 Query: 1079 DIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILDSYNLP 1192 GGCG SLLDLRC+FPFNW RDLE KAE++L +Y++P Sbjct: 416 IAGGCGESLLDLRCLFPFNWTRDLEVKAEELLCNYHVP 453 >ref|XP_004233815.1| PREDICTED: uncharacterized protein LOC101266484 [Solanum lycopersicum] Length = 1005 Score = 328 bits (842), Expect = 2e-87 Identities = 182/405 (44%), Positives = 244/405 (60%), Gaps = 20/405 (4%) Frame = +2 Query: 47 VSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKEREFKEIAVAPVADETRVLPCGVM 226 VSEALDEAL++M KRGDL LELIR+FLKRQ+EKK E+E K + A+ R P +M Sbjct: 101 VSEALDEALRRMELKRGDLPLELIRVFLKRQLEKKNEKESKNAS----AEVMREFPNALM 156 Query: 227 AI----SQSHCNLQSVQENDVLDVKIGGVSNADSLLQRHFRSKNIEPLPISTMQVFPLTG 394 AI +++ N SV LDVK+G S+++ RHFRSKNIEPLPISTMQ P Sbjct: 157 AIPVIPAENFNNAGSV-----LDVKLGLDSSSNPFSLRHFRSKNIEPLPISTMQALPFAR 211 Query: 395 NVK-AKKIKK---CHWCRGSKCRCLIKCLTCKKRFFCLECIKERYFEKQEVKAECPACRG 562 N K + K+K+ CHWCR S R LIKC +CKK++FCL+CIKER 
E+QE+K +CP CR Sbjct: 212 NGKNSSKVKRRRLCHWCRRSSYRVLIKCSSCKKQYFCLDCIKERRLEQQEIKVKCPICRR 271 Query: 563 ICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXXXXXXXXXXKKVNRDQKIELEKECT 742 C C++C + +++P HKES RK+ + Q +K+N +Q+IE+E E Sbjct: 272 DCSCRICKRSELKPNIHKESLRHKRKVPKVQLLNYLVHLLLPVLEKINEEQRIEVEIEAN 331 Query: 743 AAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHRTCTKCSYNLCLSCCQELSQRS--- 913 +GK +S+I I Q G + CS C T I+D HR C+KCSY LCL+CC++ S Sbjct: 332 ISGKGESDIQIQQASAGDGKLYHCSNCNTSILDYHRICSKCSYRLCLNCCRDSRHGSLTE 391 Query: 914 ---SSGSFMLKSCKKRKISGDVKQIISRHNSRRPPSQSVI------SSQNWETSENGSIP 1066 S GS ++C S +Q H S S S I S N++ +GSI Sbjct: 392 DCKSEGSNEEQACS----SNFERQSRMNHTSTSRQSFSGIHYPSSRSCSNYQACADGSIS 447 Query: 1067 CPPTDIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILDSYNLPTTD 1201 CPP + GGC S L+LRC+FP+ WI++LE A+ IL SYN+ T+ Sbjct: 448 CPPAEYGGCSDSFLNLRCVFPYTWIKELEISADAILCSYNIQETE 492 >ref|XP_006347153.1| PREDICTED: uncharacterized protein LOC102600140 isoform X1 [Solanum tuberosum] Length = 1005 Score = 323 bits (829), Expect = 7e-86 Identities = 176/401 (43%), Positives = 238/401 (59%), Gaps = 16/401 (3%) Frame = +2 Query: 47 VSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKEREFKEIAVAPVADETRVLPCGVM 226 VSEALDEAL++M KRGDL LELIR+FLKRQ+EKK E+E K + A+ R P +M Sbjct: 101 VSEALDEALRRMELKRGDLPLELIRVFLKRQLEKKNEKESKNAS----AEVMREFPNALM 156 Query: 227 AI----SQSHCNLQSVQENDVLDVKIGGVSNADSLLQRHFRSKNIEPLPISTMQVFPLTG 394 AI +++ N SV LDVK+G S+++ R FRSKNIEPLPISTMQ P Sbjct: 157 AIPIIPAKNFNNAGSV-----LDVKLGLDSSSNPFSLRRFRSKNIEPLPISTMQALPFAR 211 Query: 395 NVK----AKKIKKCHWCRGSKCRCLIKCLTCKKRFFCLECIKERYFEKQEVKAECPACRG 562 NVK K+ + CHWCR S R LIKC +CKK++FCL+CIKER E+QE++ +CP CR Sbjct: 212 NVKNLSKVKRRRLCHWCRRSSYRVLIKCSSCKKQYFCLDCIKERNLEQQEIRVKCPICRR 271 Query: 563 ICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXXXXXXXXXXKKVNRDQKIELEKECT 742 C C++C + +++P +HKES RK+ + Q +K+N +Q+IE+E E Sbjct: 272 DCSCRICKRSELKPNSHKESSRHKRKVPKVQLLYYLVHLLLPILEKINEEQRIEVEIEAN 331 Query: 743 AAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHRTCTKCSYNLCLSCCQELSQRSSSG 922 +GK +S+I I Q G + CS C T I+D HR C+KCSY+LCL 
CC++ S + Sbjct: 332 ISGKGESDIQIQQASAGDGKLYHCSNCNTSILDYHRICSKCSYSLCLYCCRDSRHGSLTE 391 Query: 923 SFMLKSCKKRKISGDVKQIISRHNSRRPPSQSVI--------SSQNWETSENGSIPCPPT 1078 + + + + SR N QS S N + +GSI CPP Sbjct: 392 DCKSEGSNEEQACSSNFERQSRMNYTSTSRQSFSGIHYPSSRSCSNNQACADGSISCPPA 451 Query: 1079 DIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILDSYNLPTTD 1201 + GGC S LDLRC+FP+ WI++LE AE IL SYN+ T+ Sbjct: 452 EYGGCSDSFLDLRCVFPYPWIKELEISAEAILCSYNIQDTE 492 >ref|XP_006347154.1| PREDICTED: uncharacterized protein LOC102600140 isoform X2 [Solanum tuberosum] Length = 1004 Score = 318 bits (814), Expect = 4e-84 Identities = 175/401 (43%), Positives = 238/401 (59%), Gaps = 16/401 (3%) Frame = +2 Query: 47 VSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKEREFKEIAVAPVADETRVLPCGVM 226 VSEALDEAL++M KRGDL LELIR+FLKRQ+EKK E+E K + A+ R P +M Sbjct: 101 VSEALDEALRRMELKRGDLPLELIRVFLKRQLEKKNEKESKNAS----AEVMREFPNALM 156 Query: 227 AI----SQSHCNLQSVQENDVLDVKIGGVSNADSLLQRHFRSKNIEPLPISTMQVFPLTG 394 AI +++ N SV LDVK+G S+++ R FRSKNIEPLPISTMQ P Sbjct: 157 AIPIIPAKNFNNAGSV-----LDVKLGLDSSSNPFSLRRFRSKNIEPLPISTMQALPFAR 211 Query: 395 NVK----AKKIKKCHWCRGSKCRCLIKCLTCKKRFFCLECIKERYFEKQEVKAECPACRG 562 NVK K+ + CHWCR S R LIKC +CKK++FCL+CIKER E+QE++ +CP CR Sbjct: 212 NVKNLSKVKRRRLCHWCRRSSYRVLIKCSSCKKQYFCLDCIKERNLEQQEIRVKCPICRR 271 Query: 563 ICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXXXXXXXXXXKKVNRDQKIELEKECT 742 C C++C + +++P +HKES RK+ + Q +K+N +Q+IE+E E Sbjct: 272 DCSCRICKRSELKPNSHKESSRHKRKVPKVQLLYYLVHLLLPILEKINEEQRIEVEIEAN 331 Query: 743 AAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHRTCTKCSYNLCLSCCQELSQRSSSG 922 +GK +S+I I Q G + C+ C T I+D HR C+KCSY+LCL CC++ S + Sbjct: 332 ISGKGESDIQIQQASAGDGKLYHCN-CNTSILDYHRICSKCSYSLCLYCCRDSRHGSLTE 390 Query: 923 SFMLKSCKKRKISGDVKQIISRHNSRRPPSQSVI--------SSQNWETSENGSIPCPPT 1078 + + + + SR N QS S N + +GSI CPP Sbjct: 391 DCKSEGSNEEQACSSNFERQSRMNYTSTSRQSFSGIHYPSSRSCSNNQACADGSISCPPA 450 Query: 1079 DIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILDSYNLPTTD 1201 + GGC S LDLRC+FP+ WI++LE AE IL SYN+ T+ Sbjct: 451 
EYGGCSDSFLDLRCVFPYPWIKELEISAEAILCSYNIQDTE 491 >ref|XP_007025830.1| Lysine-specific demethylase 3B, putative isoform 1 [Theobroma cacao] gi|508781196|gb|EOY28452.1| Lysine-specific demethylase 3B, putative isoform 1 [Theobroma cacao] Length = 1034 Score = 316 bits (809), Expect = 1e-83 Identities = 181/436 (41%), Positives = 246/436 (56%), Gaps = 39/436 (8%) Frame = +2 Query: 2 AKKPTVAAERRRRR---GVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKERE--- 163 AK +A +R+R G SEALDEA++KM KRGDL LELIRM LKR++EKKK +E Sbjct: 76 AKLLKLAKPMKRKRVIGGESEALDEAVRKMKLKRGDLPLELIRMVLKREIEKKKRKESDC 135 Query: 164 --FKEIAVAPVADETRVLPCGVMAISQSHCNLQSVQE------------NDVLDVKIGGV 301 F + D R LP G+MAIS S + + +VK+G Sbjct: 136 SDFDDEEEEEKGDLMRELPNGLMAISSSSPHFDNAGSCSGSGSGSGSVSGSCFNVKVGET 195 Query: 302 -SNADSLLQRHFRSKNIEPLPISTMQVFPLTG---NVKAKKIKKCHWCRGSKCRCLIKCL 469 +N ++ +R FRSKNIEPLP+ T+QV P N++ + +CHWCR R LIKC Sbjct: 196 ETNTVAITRRRFRSKNIEPLPVGTLQVVPYKKDMVNLRRGRRIRCHWCRKGGVRSLIKCS 255 Query: 470 TCKKRFFCLECIKERYFEKQE-VKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLD 646 +C+++FFCL+CIKE+YF QE VK CP CRG CGCK C + R KE K+D Sbjct: 256 SCRQQFFCLDCIKEQYFVMQEEVKIACPVCRGTCGCKACSVSQHRDTESKEFLRDKNKVD 315 Query: 647 RKQXXXXXXXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCK 826 + K++N+DQ +E+E E GK+ S+I + + G N +CCS CK Sbjct: 316 KVLHFHYLICMLLPVLKQINQDQSVEIEVEAKVKGKKLSDIQVQPAEFGGNKQYCCSNCK 375 Query: 827 TMIIDCHRTCTKCSYNLCLSCCQELSQRSSSGSFMLKSCK---KRKI---------SGDV 970 T I+D HR+C+KCSYNLCLSCC++ Q S GS +CK +RK V Sbjct: 376 TFILDFHRSCSKCSYNLCLSCCRDNFQGSLVGSIKEINCKCPNRRKTCVPGIRLSHKKSV 435 Query: 971 KQIISRHNSRRPPSQSVISSQNWETSENGSIP--CPPTDIGGCGGSLLDLRCIFPFNWIR 1144 + ++SR S + + S+ + +G++P CPPT+ GGCG LLDLRCI P W + Sbjct: 436 RTSKKNYDSRYFDSSASLPSRK---APDGNVPISCPPTEFGGCGDGLLDLRCILPLRWFK 492 Query: 1145 DLEDKAEDILDSYNLP 1192 +LE AE+I+ SY LP Sbjct: 493 ELEISAEEIVGSYELP 508 >ref|XP_007025837.1| Lysine-specific demethylase 3B, putative isoform 8 [Theobroma cacao] gi|508781203|gb|EOY28459.1| Lysine-specific demethylase 3B, putative isoform 8 
[Theobroma cacao] Length = 970 Score = 310 bits (794), Expect = 8e-82 Identities = 180/436 (41%), Positives = 246/436 (56%), Gaps = 39/436 (8%) Frame = +2 Query: 2 AKKPTVAAERRRRR---GVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKERE--- 163 AK +A +R+R G SEALDEA++KM KRGDL LELIRM LKR++EKKK +E Sbjct: 76 AKLLKLAKPMKRKRVIGGESEALDEAVRKMKLKRGDLPLELIRMVLKREIEKKKRKESDC 135 Query: 164 --FKEIAVAPVADETRVLPCGVMAISQSHCNLQSVQE------------NDVLDVKIGGV 301 F + D R LP G+MAIS S + + +VK+G Sbjct: 136 SDFDDEEEEEKGDLMRELPNGLMAISSSSPHFDNAGSCSGSGSGSGSVSGSCFNVKVGET 195 Query: 302 -SNADSLLQRHFRSKNIEPLPISTMQVFPLTG---NVKAKKIKKCHWCRGSKCRCLIKCL 469 +N ++ +R FRSKNIEPLP+ T+QV P N++ + +CHWCR R LIKC Sbjct: 196 ETNTVAITRRRFRSKNIEPLPVGTLQVVPYKKDMVNLRRGRRIRCHWCRKGGVRSLIKCS 255 Query: 470 TCKKRFFCLECIKERYFEKQE-VKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLD 646 +C+++FFCL+CIKE+YF QE VK CP CRG CGCK C + R KE K+D Sbjct: 256 SCRQQFFCLDCIKEQYFVMQEEVKIACPVCRGTCGCKACSVSQHRDTESKEFLRDKNKVD 315 Query: 647 RKQXXXXXXXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCK 826 + K++N+DQ +E+E E GK+ S+I + + G N +CC+ CK Sbjct: 316 KVLHFHYLICMLLPVLKQINQDQSVEIEVEAKVKGKKLSDIQVQPAEFGGNKQYCCN-CK 374 Query: 827 TMIIDCHRTCTKCSYNLCLSCCQELSQRSSSGSFMLKSCK-----KRKISG-------DV 970 T I+D HR+C+KCSYNLCLSCC++ Q S GS +CK K + G V Sbjct: 375 TFILDFHRSCSKCSYNLCLSCCRDNFQGSLVGSIKEINCKCPNRRKTCVPGIRLSHKKSV 434 Query: 971 KQIISRHNSRRPPSQSVISSQNWETSENGSIP--CPPTDIGGCGGSLLDLRCIFPFNWIR 1144 + ++SR S + + S+ + +G++P CPPT+ GGCG LLDLRCI P W + Sbjct: 435 RTSKKNYDSRYFDSSASLPSRK---APDGNVPISCPPTEFGGCGDGLLDLRCILPLRWFK 491 Query: 1145 DLEDKAEDILDSYNLP 1192 +LE AE+I+ SY LP Sbjct: 492 ELEISAEEIVGSYELP 507 >ref|XP_007025836.1| Lysine-specific demethylase 3B, putative isoform 7 [Theobroma cacao] gi|508781202|gb|EOY28458.1| Lysine-specific demethylase 3B, putative isoform 7 [Theobroma cacao] Length = 897 Score = 310 bits (794), Expect = 8e-82 Identities = 180/436 (41%), Positives = 246/436 (56%), Gaps = 39/436 (8%) Frame = +2 Query: 2 
AKKPTVAAERRRRR---GVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKERE--- 163 AK +A +R+R G SEALDEA++KM KRGDL LELIRM LKR++EKKK +E Sbjct: 20 AKLLKLAKPMKRKRVIGGESEALDEAVRKMKLKRGDLPLELIRMVLKREIEKKKRKESDC 79 Query: 164 --FKEIAVAPVADETRVLPCGVMAISQSHCNLQSVQE------------NDVLDVKIGGV 301 F + D R LP G+MAIS S + + +VK+G Sbjct: 80 SDFDDEEEEEKGDLMRELPNGLMAISSSSPHFDNAGSCSGSGSGSGSVSGSCFNVKVGET 139 Query: 302 -SNADSLLQRHFRSKNIEPLPISTMQVFPLTG---NVKAKKIKKCHWCRGSKCRCLIKCL 469 +N ++ +R FRSKNIEPLP+ T+QV P N++ + +CHWCR R LIKC Sbjct: 140 ETNTVAITRRRFRSKNIEPLPVGTLQVVPYKKDMVNLRRGRRIRCHWCRKGGVRSLIKCS 199 Query: 470 TCKKRFFCLECIKERYFEKQE-VKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLD 646 +C+++FFCL+CIKE+YF QE VK CP CRG CGCK C + R KE K+D Sbjct: 200 SCRQQFFCLDCIKEQYFVMQEEVKIACPVCRGTCGCKACSVSQHRDTESKEFLRDKNKVD 259 Query: 647 RKQXXXXXXXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCK 826 + K++N+DQ +E+E E GK+ S+I + + G N +CC+ CK Sbjct: 260 KVLHFHYLICMLLPVLKQINQDQSVEIEVEAKVKGKKLSDIQVQPAEFGGNKQYCCN-CK 318 Query: 827 TMIIDCHRTCTKCSYNLCLSCCQELSQRSSSGSFMLKSCK-----KRKISG-------DV 970 T I+D HR+C+KCSYNLCLSCC++ Q S GS +CK K + G V Sbjct: 319 TFILDFHRSCSKCSYNLCLSCCRDNFQGSLVGSIKEINCKCPNRRKTCVPGIRLSHKKSV 378 Query: 971 KQIISRHNSRRPPSQSVISSQNWETSENGSIP--CPPTDIGGCGGSLLDLRCIFPFNWIR 1144 + ++SR S + + S+ + +G++P CPPT+ GGCG LLDLRCI P W + Sbjct: 379 RTSKKNYDSRYFDSSASLPSRK---APDGNVPISCPPTEFGGCGDGLLDLRCILPLRWFK 435 Query: 1145 DLEDKAEDILDSYNLP 1192 +LE AE+I+ SY LP Sbjct: 436 ELEISAEEIVGSYELP 451 >ref|XP_007025835.1| Lysine-specific demethylase 3B, putative isoform 6 [Theobroma cacao] gi|508781201|gb|EOY28457.1| Lysine-specific demethylase 3B, putative isoform 6 [Theobroma cacao] Length = 1022 Score = 310 bits (794), Expect = 8e-82 Identities = 180/436 (41%), Positives = 246/436 (56%), Gaps = 39/436 (8%) Frame = +2 Query: 2 AKKPTVAAERRRRR---GVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKERE--- 163 AK +A +R+R G SEALDEA++KM KRGDL LELIRM LKR++EKKK +E Sbjct: 76 AKLLKLAKPMKRKRVIGGESEALDEAVRKMKLKRGDLPLELIRMVLKREIEKKKRKESDC 135 Query: 164 
--FKEIAVAPVADETRVLPCGVMAISQSHCNLQSVQE------------NDVLDVKIGGV 301 F + D R LP G+MAIS S + + +VK+G Sbjct: 136 SDFDDEEEEEKGDLMRELPNGLMAISSSSPHFDNAGSCSGSGSGSGSVSGSCFNVKVGET 195 Query: 302 -SNADSLLQRHFRSKNIEPLPISTMQVFPLTG---NVKAKKIKKCHWCRGSKCRCLIKCL 469 +N ++ +R FRSKNIEPLP+ T+QV P N++ + +CHWCR R LIKC Sbjct: 196 ETNTVAITRRRFRSKNIEPLPVGTLQVVPYKKDMVNLRRGRRIRCHWCRKGGVRSLIKCS 255 Query: 470 TCKKRFFCLECIKERYFEKQE-VKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLD 646 +C+++FFCL+CIKE+YF QE VK CP CRG CGCK C + R KE K+D Sbjct: 256 SCRQQFFCLDCIKEQYFVMQEEVKIACPVCRGTCGCKACSVSQHRDTESKEFLRDKNKVD 315 Query: 647 RKQXXXXXXXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCK 826 + K++N+DQ +E+E E GK+ S+I + + G N +CC+ CK Sbjct: 316 KVLHFHYLICMLLPVLKQINQDQSVEIEVEAKVKGKKLSDIQVQPAEFGGNKQYCCN-CK 374 Query: 827 TMIIDCHRTCTKCSYNLCLSCCQELSQRSSSGSFMLKSCK-----KRKISG-------DV 970 T I+D HR+C+KCSYNLCLSCC++ Q S GS +CK K + G V Sbjct: 375 TFILDFHRSCSKCSYNLCLSCCRDNFQGSLVGSIKEINCKCPNRRKTCVPGIRLSHKKSV 434 Query: 971 KQIISRHNSRRPPSQSVISSQNWETSENGSIP--CPPTDIGGCGGSLLDLRCIFPFNWIR 1144 + ++SR S + + S+ + +G++P CPPT+ GGCG LLDLRCI P W + Sbjct: 435 RTSKKNYDSRYFDSSASLPSRK---APDGNVPISCPPTEFGGCGDGLLDLRCILPLRWFK 491 Query: 1145 DLEDKAEDILDSYNLP 1192 +LE AE+I+ SY LP Sbjct: 492 ELEISAEEIVGSYELP 507 >ref|XP_007025833.1| Lysine-specific demethylase 3B, putative isoform 4 [Theobroma cacao] gi|508781199|gb|EOY28455.1| Lysine-specific demethylase 3B, putative isoform 4 [Theobroma cacao] Length = 1034 Score = 310 bits (794), Expect = 8e-82 Identities = 180/436 (41%), Positives = 246/436 (56%), Gaps = 39/436 (8%) Frame = +2 Query: 2 AKKPTVAAERRRRR---GVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKERE--- 163 AK +A +R+R G SEALDEA++KM KRGDL LELIRM LKR++EKKK +E Sbjct: 76 AKLLKLAKPMKRKRVIGGESEALDEAVRKMKLKRGDLPLELIRMVLKREIEKKKRKESDC 135 Query: 164 --FKEIAVAPVADETRVLPCGVMAISQSHCNLQSVQE------------NDVLDVKIGGV 301 F + D R LP G+MAIS S + + +VK+G Sbjct: 136 SDFDDEEEEEKGDLMRELPNGLMAISSSSPHFDNAGSCSGSGSGSGSVSGSCFNVKVGET 195 Query: 302 
-SNADSLLQRHFRSKNIEPLPISTMQVFPLTG---NVKAKKIKKCHWCRGSKCRCLIKCL 469 +N ++ +R FRSKNIEPLP+ T+QV P N++ + +CHWCR R LIKC Sbjct: 196 ETNTVAITRRRFRSKNIEPLPVGTLQVVPYKKDMVNLRRGRRIRCHWCRKGGVRSLIKCS 255 Query: 470 TCKKRFFCLECIKERYFEKQE-VKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLD 646 +C+++FFCL+CIKE+YF QE VK CP CRG CGCK C + R KE K+D Sbjct: 256 SCRQQFFCLDCIKEQYFVMQEEVKIACPVCRGTCGCKACSVSQHRDTESKEFLRDKNKVD 315 Query: 647 RKQXXXXXXXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCK 826 + K++N+DQ +E+E E GK+ S+I + + G N +CC+ CK Sbjct: 316 KVLHFHYLICMLLPVLKQINQDQSVEIEVEAKVKGKKLSDIQVQPAEFGGNKQYCCN-CK 374 Query: 827 TMIIDCHRTCTKCSYNLCLSCCQELSQRSSSGSFMLKSCK-----KRKISG-------DV 970 T I+D HR+C+KCSYNLCLSCC++ Q S GS +CK K + G V Sbjct: 375 TFILDFHRSCSKCSYNLCLSCCRDNFQGSLVGSIKEINCKCPNRRKTCVPGIRLSHKKSV 434 Query: 971 KQIISRHNSRRPPSQSVISSQNWETSENGSIP--CPPTDIGGCGGSLLDLRCIFPFNWIR 1144 + ++SR S + + S+ + +G++P CPPT+ GGCG LLDLRCI P W + Sbjct: 435 RTSKKNYDSRYFDSSASLPSRK---APDGNVPISCPPTEFGGCGDGLLDLRCILPLRWFK 491 Query: 1145 DLEDKAEDILDSYNLP 1192 +LE AE+I+ SY LP Sbjct: 492 ELEISAEEIVGSYELP 507 >ref|XP_007025832.1| Lysine-specific demethylase 3B, putative isoform 3 [Theobroma cacao] gi|508781198|gb|EOY28454.1| Lysine-specific demethylase 3B, putative isoform 3 [Theobroma cacao] Length = 1033 Score = 310 bits (794), Expect = 8e-82 Identities = 180/436 (41%), Positives = 246/436 (56%), Gaps = 39/436 (8%) Frame = +2 Query: 2 AKKPTVAAERRRRR---GVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKERE--- 163 AK +A +R+R G SEALDEA++KM KRGDL LELIRM LKR++EKKK +E Sbjct: 76 AKLLKLAKPMKRKRVIGGESEALDEAVRKMKLKRGDLPLELIRMVLKREIEKKKRKESDC 135 Query: 164 --FKEIAVAPVADETRVLPCGVMAISQSHCNLQSVQE------------NDVLDVKIGGV 301 F + D R LP G+MAIS S + + +VK+G Sbjct: 136 SDFDDEEEEEKGDLMRELPNGLMAISSSSPHFDNAGSCSGSGSGSGSVSGSCFNVKVGET 195 Query: 302 -SNADSLLQRHFRSKNIEPLPISTMQVFPLTG---NVKAKKIKKCHWCRGSKCRCLIKCL 469 +N ++ +R FRSKNIEPLP+ T+QV P N++ + +CHWCR R LIKC Sbjct: 196 ETNTVAITRRRFRSKNIEPLPVGTLQVVPYKKDMVNLRRGRRIRCHWCRKGGVRSLIKCS 255 Query: 470 
TCKKRFFCLECIKERYFEKQE-VKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLD 646 +C+++FFCL+CIKE+YF QE VK CP CRG CGCK C + R KE K+D Sbjct: 256 SCRQQFFCLDCIKEQYFVMQEEVKIACPVCRGTCGCKACSVSQHRDTESKEFLRDKNKVD 315 Query: 647 RKQXXXXXXXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCK 826 + K++N+DQ +E+E E GK+ S+I + + G N +CC+ CK Sbjct: 316 KVLHFHYLICMLLPVLKQINQDQSVEIEVEAKVKGKKLSDIQVQPAEFGGNKQYCCN-CK 374 Query: 827 TMIIDCHRTCTKCSYNLCLSCCQELSQRSSSGSFMLKSCK-----KRKISG-------DV 970 T I+D HR+C+KCSYNLCLSCC++ Q S GS +CK K + G V Sbjct: 375 TFILDFHRSCSKCSYNLCLSCCRDNFQGSLVGSIKEINCKCPNRRKTCVPGIRLSHKKSV 434 Query: 971 KQIISRHNSRRPPSQSVISSQNWETSENGSIP--CPPTDIGGCGGSLLDLRCIFPFNWIR 1144 + ++SR S + + S+ + +G++P CPPT+ GGCG LLDLRCI P W + Sbjct: 435 RTSKKNYDSRYFDSSASLPSRK---APDGNVPISCPPTEFGGCGDGLLDLRCILPLRWFK 491 Query: 1145 DLEDKAEDILDSYNLP 1192 +LE AE+I+ SY LP Sbjct: 492 ELEISAEEIVGSYELP 507 >ref|XP_007025831.1| Lysine-specific demethylase 3B, putative isoform 2 [Theobroma cacao] gi|508781197|gb|EOY28453.1| Lysine-specific demethylase 3B, putative isoform 2 [Theobroma cacao] Length = 1045 Score = 310 bits (794), Expect = 8e-82 Identities = 180/436 (41%), Positives = 246/436 (56%), Gaps = 39/436 (8%) Frame = +2 Query: 2 AKKPTVAAERRRRR---GVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKERE--- 163 AK +A +R+R G SEALDEA++KM KRGDL LELIRM LKR++EKKK +E Sbjct: 76 AKLLKLAKPMKRKRVIGGESEALDEAVRKMKLKRGDLPLELIRMVLKREIEKKKRKESDC 135 Query: 164 --FKEIAVAPVADETRVLPCGVMAISQSHCNLQSVQE------------NDVLDVKIGGV 301 F + D R LP G+MAIS S + + +VK+G Sbjct: 136 SDFDDEEEEEKGDLMRELPNGLMAISSSSPHFDNAGSCSGSGSGSGSVSGSCFNVKVGET 195 Query: 302 -SNADSLLQRHFRSKNIEPLPISTMQVFPLTG---NVKAKKIKKCHWCRGSKCRCLIKCL 469 +N ++ +R FRSKNIEPLP+ T+QV P N++ + +CHWCR R LIKC Sbjct: 196 ETNTVAITRRRFRSKNIEPLPVGTLQVVPYKKDMVNLRRGRRIRCHWCRKGGVRSLIKCS 255 Query: 470 TCKKRFFCLECIKERYFEKQE-VKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLD 646 +C+++FFCL+CIKE+YF QE VK CP CRG CGCK C + R KE K+D Sbjct: 256 SCRQQFFCLDCIKEQYFVMQEEVKIACPVCRGTCGCKACSVSQHRDTESKEFLRDKNKVD 315 Query: 647 
RKQXXXXXXXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCK 826 + K++N+DQ +E+E E GK+ S+I + + G N +CC+ CK Sbjct: 316 KVLHFHYLICMLLPVLKQINQDQSVEIEVEAKVKGKKLSDIQVQPAEFGGNKQYCCN-CK 374 Query: 827 TMIIDCHRTCTKCSYNLCLSCCQELSQRSSSGSFMLKSCK-----KRKISG-------DV 970 T I+D HR+C+KCSYNLCLSCC++ Q S GS +CK K + G V Sbjct: 375 TFILDFHRSCSKCSYNLCLSCCRDNFQGSLVGSIKEINCKCPNRRKTCVPGIRLSHKKSV 434 Query: 971 KQIISRHNSRRPPSQSVISSQNWETSENGSIP--CPPTDIGGCGGSLLDLRCIFPFNWIR 1144 + ++SR S + + S+ + +G++P CPPT+ GGCG LLDLRCI P W + Sbjct: 435 RTSKKNYDSRYFDSSASLPSRK---APDGNVPISCPPTEFGGCGDGLLDLRCILPLRWFK 491 Query: 1145 DLEDKAEDILDSYNLP 1192 +LE AE+I+ SY LP Sbjct: 492 ELEISAEEIVGSYELP 507 >ref|XP_006467914.1| PREDICTED: uncharacterized protein LOC102608274 isoform X1 [Citrus sinensis] Length = 1004 Score = 305 bits (781), Expect = 3e-80 Identities = 172/427 (40%), Positives = 241/427 (56%), Gaps = 28/427 (6%) Frame = +2 Query: 2 AKKPTVAAERRRRR--GVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKEREFKEI 175 A+K ++++R G SEALDEALKKM KRGDL LELIRM LKR+VEK+K ++ + Sbjct: 74 ARKSKKLKRKKKKRVIGESEALDEALKKMKLKRGDLQLELIRMVLKREVEKRKRQKNFDF 133 Query: 176 AVAPVADE----------TRVLPCGVMAISQSHCNLQSVQENDVLDVKIGGVSNADSLLQ 325 D TR LP G+MAIS ++ S VKIG + A ++ + Sbjct: 134 EDEENCDNSNYSDSDRELTRELPNGLMAISSTN----SDNAGTSCAVKIG--AEAAAVNR 187 Query: 326 RHFRSKNIEPLPISTMQVFPLTGNV----KAKKIKKCHWCRGSKCRCLIKCLTCKKRFFC 493 R FRSKNIEP+P+ T+QV P +V + ++ K+CHWCR + + LIKC +C+K FFC Sbjct: 188 RRFRSKNIEPMPVGTLQVVPYKRDVVSLRRRRRRKRCHWCR-RRGQSLIKCSSCRKLFFC 246 Query: 494 LECIKERYFEKQE-VKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXX 670 ++C+KE YF+ QE VK CP CRG CGCK C + R +K+ ++D+ Sbjct: 247 VDCVKEWYFDTQEDVKKACPVCRGTCGCKACSSSQYRDIDYKDLLKANNEVDKVLHFHYL 306 Query: 671 XXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHR 850 +++N+DQ +ELE E G+ SE+ I + + N +CCS CKT I+D HR Sbjct: 307 ICMLLPIVRQINQDQNVELEIEAKIKGQNPSEVQIQEAEFKYNRLYCCSSCKTSIVDYHR 366 Query: 851 TCTKCSYNLCLSCCQELSQRSSSGSFMLKSCK---KRKISGDVKQIISRHNSRRPPS--- 1012 +C CSY LCLSCC+++ Q S SG + CK RK+ +I+ 
+ + R Sbjct: 367 SCASCSYTLCLSCCRDILQGSLSGCVRARLCKCPNGRKVCTSGVRILEKKSLRTYKEGYG 426 Query: 1013 ----QSVISSQNWETSE-NGSIPCPPTDIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILD 1177 S +S +W+ + I CPP + GGCG S LDLRC+FP W ++LE AE I+ Sbjct: 427 STYFDSSAASPSWKAPDGTAGILCPPMEFGGCGDSFLDLRCVFPSCWTKELEINAEQIVG 486 Query: 1178 SYNLPTT 1198 Y LP T Sbjct: 487 CYELPET 493 >ref|XP_006467915.1| PREDICTED: uncharacterized protein LOC102608274 isoform X2 [Citrus sinensis] Length = 1003 Score = 300 bits (769), Expect = 6e-79 Identities = 172/427 (40%), Positives = 241/427 (56%), Gaps = 28/427 (6%) Frame = +2 Query: 2 AKKPTVAAERRRRR--GVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKEREFKEI 175 A+K ++++R G SEALDEALKKM KRGDL LELIRM LKR+VEK+K ++ + Sbjct: 74 ARKSKKLKRKKKKRVIGESEALDEALKKMKLKRGDLQLELIRMVLKREVEKRKRQKNFDF 133 Query: 176 AVAPVADE----------TRVLPCGVMAISQSHCNLQSVQENDVLDVKIGGVSNADSLLQ 325 D TR LP G+MAIS ++ S VKIG + A ++ + Sbjct: 134 EDEENCDNSNYSDSDRELTRELPNGLMAISSTN----SDNAGTSCAVKIG--AEAAAVNR 187 Query: 326 RHFRSKNIEPLPISTMQVFPLTGNV----KAKKIKKCHWCRGSKCRCLIKCLTCKKRFFC 493 R FRSKNIEP+P+ T+QV P +V + ++ K+CHWCR + + LIKC +C+K FFC Sbjct: 188 RRFRSKNIEPMPVGTLQVVPYKRDVVSLRRRRRRKRCHWCR-RRGQSLIKCSSCRKLFFC 246 Query: 494 LECIKERYFEKQE-VKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXX 670 ++C+KE YF+ QE VK CP CRG CGCK C + R +K+ ++D+ Sbjct: 247 VDCVKEWYFDTQEDVKKACPVCRGTCGCKACSSSQYRDIDYKDLLKANNEVDKVLHFHYL 306 Query: 671 XXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHR 850 +++N+DQ +ELE E G+ SE+ I + + N +CCS CKT I+D HR Sbjct: 307 ICMLLPIVRQINQDQNVELEIEAKIKGQNPSEVQIQEAEFKYNRLYCCS-CKTSIVDYHR 365 Query: 851 TCTKCSYNLCLSCCQELSQRSSSGSFMLKSCK---KRKISGDVKQIISRHNSRRPPS--- 1012 +C CSY LCLSCC+++ Q S SG + CK RK+ +I+ + + R Sbjct: 366 SCASCSYTLCLSCCRDILQGSLSGCVRARLCKCPNGRKVCTSGVRILEKKSLRTYKEGYG 425 Query: 1013 ----QSVISSQNWETSE-NGSIPCPPTDIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILD 1177 S +S +W+ + I CPP + GGCG S LDLRC+FP W ++LE AE I+ Sbjct: 426 STYFDSSAASPSWKAPDGTAGILCPPMEFGGCGDSFLDLRCVFPSCWTKELEINAEQIVG 485 Query: 1178 SYNLPTT 1198 Y LP T Sbjct: 
486 CYELPET 492 >ref|XP_002524700.1| conserved hypothetical protein [Ricinus communis] gi|223536061|gb|EEF37719.1| conserved hypothetical protein [Ricinus communis] Length = 1033 Score = 280 bits (716), Expect = 9e-73 Identities = 166/412 (40%), Positives = 235/412 (57%), Gaps = 43/412 (10%) Frame = +2 Query: 92 RGDLHLELIRMFLKRQVEKKKEREFKEI--------AVAPVADET--------------- 202 RG+L LELIRM LKR+VEK+K+++ K+I AV + + Sbjct: 109 RGNLQLELIRMVLKREVEKRKKKKKKKIKNKNKKVVAVEEINSDNDNIDVDSSSNSEEGE 168 Query: 203 --RVLPCGVMAISQSHCNLQSVQENDVL--DVKIGGVS-NADSLLQRHFRSKNIEPLPIS 367 R LP G+MAIS + NL + D+KIGG + ++ + +R FRSKNIEP+PI Sbjct: 169 LMRDLPNGLMAISPAKHNLSNAASCSTTPCDIKIGGAAADSSAFTRRCFRSKNIEPMPIG 228 Query: 368 TMQVFPLTGNV---KAKKIKKCHWCRGSKCRCLIKCLTCKKRFFCLECIKERYFEKQE-V 535 T+QV P ++ + K KKCH+CR S + LI+C +C+K+FFC++CIK++YF QE V Sbjct: 229 TLQVVPFKKDMVRLRKGKRKKCHFCRRSGLKTLIRCSSCRKQFFCMDCIKDQYFNMQEEV 288 Query: 536 KAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXXXXXXXXXXKKVNRDQ 715 K C CRG C CK C + R K K+++ K++N+DQ Sbjct: 289 KIACSVCRGTCSCKACSAIQCRNIECKGFSKDKSKVNKVLHFHYLICMLLPVLKEINQDQ 348 Query: 716 KIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHRTCTKCSYNLCLSCCQ 895 IELE E G++ S++ I Q ++G N +CC CKT I+D HR+C CSYNLCLSCCQ Sbjct: 349 SIELEIEAKIRGQKPSDLQIQQAEVGCNKRWCCDNCKTSIMDFHRSCPSCSYNLCLSCCQ 408 Query: 896 ELSQRS--SSGSFMLKSCKKRK---ISG----DVKQIIS-RHNSRRPPSQSVISSQNWET 1045 ++ Q S S +L C RK +SG ++K + + + N+ S +S + + Sbjct: 409 DIYQGSLLRSVKGLLCKCPNRKKACLSGKQFSEMKSVCTYKQNNGIKYSDFSMSLLSLKA 468 Query: 1046 SE-NGSIPCPPTDIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILDSYNLPTT 1198 + NG IPCPPT+ GGCG SLLDL CIFP +W ++LE AE+I+ Y LP T Sbjct: 469 PDGNGGIPCPPTEFGGCGKSLLDLCCIFPSSWTKELEISAEEIIGCYELPET 520 >ref|XP_007159238.1| hypothetical protein PHAVU_002G220900g [Phaseolus vulgaris] gi|561032653|gb|ESW31232.1| hypothetical protein PHAVU_002G220900g [Phaseolus vulgaris] Length = 1030 Score = 274 bits (701), Expect = 5e-71 Identities = 166/428 (38%), Positives = 233/428 (54%), Gaps = 34/428 (7%) Frame = +2 Query: 17 
VAAERRRRRGVSEALDEAL----KKMNFKRGDLHLELIRMFLKRQVEKK----------- 151 + ++RR SEAL A KK K+GD+ LELIRM LKR+ EKK Sbjct: 90 IVKKKRRLFEGSEALVVAAPSPAKKKALKQGDMQLELIRMVLKREAEKKNKNNKSKKKNK 149 Query: 152 ------KEREFKEIAVAPVADETRVLPCGVMAISQSHCNLQSVQENDVLDVKIGGVSNAD 313 K++E +E + R LP GVM IS + DVK+G ++ Sbjct: 150 KKNKKKKKKEEEEELCYGEGELRRELPNGVMEISPASPTRDYDNVASHFDVKVG--VDSK 207 Query: 314 SLLQRHFRSKNIEPLPISTMQVFPLTGNVKAK---KIKKCHWCRGSKCRCLIKCLTCKKR 484 ++ R+FRSKN++ +P+ +Q+ P N+K K KKCHWC+ S+ LI+CL+C++ Sbjct: 208 TVTPRYFRSKNVDRVPVGKLQIVPYGSNLKKGTKGKRKKCHWCQRSESCNLIQCLSCERE 267 Query: 485 FFCLECIKERYFEKQ-EVKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLDRKQXX 661 FFC++CIKERY + Q EVK CP CRG C CK C + + + KE G ++DR Sbjct: 268 FFCMDCIKERYLDTQNEVKKACPVCRGTCSCKDCSASQCKDSESKEYLTGKSRVDRILHF 327 Query: 662 XXXXXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCKTMIID 841 K ++ DQ IELE E GK S+I I Q + G N C+ CKT I+D Sbjct: 328 HYLICMLLPVLKHISEDQNIELETEAKVKGKNISDIQIKQVEFGCNEKNYCNHCKTPILD 387 Query: 842 CHRTCTKCSYNLCLSCCQELSQRSSSGSFMLKSC----KKRKISGDVKQIISRHNSRRPP 1009 HR+C CSY+LC SCCQELSQ +S L + K + S QI+ + + Sbjct: 388 LHRSCPSCSYSLCSSCCQELSQGKASAEINLSTFNRPDKMKTSSASESQIL---DEKAIS 444 Query: 1010 SQSVISSQ---NWETSENG--SIPCPPTDIGGCGGSLLDLRCIFPFNWIRDLEDKAEDIL 1174 S ++I + W T+ NG + CPPT++GGCG S L+LR +FP NWI+++E KAE+I+ Sbjct: 445 SGNLIDTSVMPEW-TNCNGIDCLSCPPTELGGCGNSHLELRSVFPSNWIKEMEVKAEEIV 503 Query: 1175 DSYNLPTT 1198 SY+ P T Sbjct: 504 CSYDFPET 511 >ref|XP_003532564.1| PREDICTED: uncharacterized protein LOC100810673 [Glycine max] Length = 1047 Score = 270 bits (691), Expect = 7e-70 Identities = 154/423 (36%), Positives = 225/423 (53%), Gaps = 34/423 (8%) Frame = +2 Query: 32 RRRRGVSEALDEAL-----KKMNFKRGDLHLELIRMFLKRQVEK---------------- 148 +++R +SE D + +K K+GD+ LEL+RM LKR+ EK Sbjct: 113 KKKRMLSEDSDASASSPPARKKALKQGDMQLELLRMVLKREAEKNKNKSKSKNKKNNNKK 172 Query: 149 ------KKEREFKEIAVAPVADETRVLPCGVMAISQSHCNLQSVQENDVLDVKIGGVSNA 310 K+ +E KE + R LP GVM IS + DVK+G ++ Sbjct: 173 
KNKKKEKRRKEEKEELCYTKEELRRELPNGVMEISPASPTRDYNNVGSHCDVKVG--VDS 230 Query: 311 DSLLQRHFRSKNIEPLPISTMQVFPLTGNVKAKKIKKCHWCRGSKCRCLIKCLTCKKRFF 490 ++ R+FRSKN++ +P +Q+ P N+K K KKCHWC+ S+ LI+C +C++ FF Sbjct: 231 KTVTPRYFRSKNVDRVPAGKLQIVPYGSNLKKGKRKKCHWCQRSESGNLIQCSSCQREFF 290 Query: 491 CLECIKERYFEKQ-EVKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXX 667 C++C+KERYF+ + E+K CP CRG C CK C + + + KE G ++DR Sbjct: 291 CMDCVKERYFDAENEIKKACPVCRGTCPCKYCSASQCKDSESKECLTGKSRVDRILHFHY 350 Query: 668 XXXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCH 847 K+++ DQ IELE E GK S+I I Q + G + C+ CKT I+D H Sbjct: 351 LICMLLPVLKQISEDQNIELETEVKIKGKNISDIQIKQVEFGCSEKNYCNHCKTPILDLH 410 Query: 848 RTCTKCSYNLCLSCCQELSQRSSSG----SFMLKSCKKRKISGDVKQIISRHNSRRPPSQ 1015 R+C CSY+LC SCCQELSQ +SG S + K + S + + Sbjct: 411 RSCPSCSYSLCSSCCQELSQGKASGAMNSSVFKRPDKMKPCSASENHTLEERATSIGNLT 470 Query: 1016 SVISSQNWETSENG--SIPCPPTDIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILDSYNL 1189 W T+ NG S+ CPPT++GGCG S L+LR +FP +WI+++E KAE+I+ SY+ Sbjct: 471 DTSVLPEW-TNGNGIDSLSCPPTELGGCGKSHLELRSVFPSSWIKEMEAKAEEIVCSYDF 529 Query: 1190 PTT 1198 P T Sbjct: 530 PET 532 >ref|XP_003528426.1| PREDICTED: uncharacterized protein LOC100787798 [Glycine max] Length = 1030 Score = 263 bits (672), Expect = 1e-67 Identities = 159/425 (37%), Positives = 227/425 (53%), Gaps = 31/425 (7%) Frame = +2 Query: 17 VAAERRRRRGVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKK-------------- 154 + ++R G S+ A KK K+GD+ LEL+RM LKR+ EKKK Sbjct: 102 IVKKKRMLSGDSDDGSPARKKA-LKQGDMQLELLRMVLKREAEKKKSKNKRNNNNKKKNN 160 Query: 155 -------EREFKEIAVAPVADETRVLPCGVMAISQSHCNLQSVQENDVLDVKIGGVSNAD 313 ++E KE + R LP GVM IS + DVK+G ++ Sbjct: 161 KKKENKKKKEEKEELCYTKEELRRELPNGVMEISPASPTRDYNNVGSHCDVKVG--VDSK 218 Query: 314 SLLQRHFRSKNIEPLPISTMQVFPLTGNVKAKKIKKCHWCRGSKCRCLIKCLTCKKRFFC 493 ++ R+FRSKN++ +P +Q+ P K KK CHWC+ S+ LI+CL+C++ FFC Sbjct: 219 TVAPRYFRSKNVDRVPAGKLQIVPYGSKGKRKK---CHWCQRSESGNLIQCLSCQREFFC 275 Query: 494 LECIKERYFEKQ-EVKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXX 670 ++C+KERYF+ Q E+K CP 
C G C CK C + + + KE G K+DR Sbjct: 276 MDCVKERYFDTQNEIKKACPVCCGTCTCKDCSASQCKDSESKEYLTGKSKVDRILHFHYL 335 Query: 671 XXXXXXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHR 850 K++++DQ IELE E GK S+I I Q G N C+ CKT I+D HR Sbjct: 336 ICMLLPVLKQISKDQNIELEAEAKVKGKNISDIQIKQVGFGYNEKNYCNHCKTPILDLHR 395 Query: 851 TCTKCSYNLCLSCCQELSQRSSSG---SFMLKSCKKRKISGDVKQIISRHN--SRRPPSQ 1015 +C CSY+LC SCCQELSQ +SG S + K K K G + HN + S Sbjct: 396 SCPSCSYSLCSSCCQELSQGKASGEINSSVFKRPGKMKPCGANES----HNLDEKATSSG 451 Query: 1016 SVISSQNWETSENG----SIPCPPTDIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILDSY 1183 ++ + +NG ++ CPPT++GGCG S L+LR +FP +WI+++E KAE+I+ SY Sbjct: 452 NLTDTSMLPEWKNGNGIDTLSCPPTELGGCGKSHLELRSVFPSSWIKEMEVKAEEIVCSY 511 Query: 1184 NLPTT 1198 + P T Sbjct: 512 DFPET 516 >ref|XP_007213684.1| hypothetical protein PRUPE_ppa000920mg [Prunus persica] gi|462409549|gb|EMJ14883.1| hypothetical protein PRUPE_ppa000920mg [Prunus persica] Length = 961 Score = 258 bits (660), Expect = 3e-66 Identities = 148/409 (36%), Positives = 213/409 (52%), Gaps = 18/409 (4%) Frame = +2 Query: 26 ERRRRRGVSEALDEALKKMNFKRGDLHLELIRMFLKRQVEKKKEREFKEIAVAPVADE-- 199 +R+R + + KKM K+ +L+LELIRM LKR+V+K+ + + K++ D+ Sbjct: 85 KRKRSEETLKKSKKRKKKMKLKKSELNLELIRMVLKREVDKRNQTKKKKVVEEESEDDDD 144 Query: 200 ------TRVLPCGVMAISQSHCNLQSVQE-----NDVLDVKIGGVSNADSLLQRHFRSKN 346 TR LP G+MAIS S ++ N D K+G ++ +R FRSKN Sbjct: 145 DDHDDLTRDLPNGLMAISSSSSQSPLLRSGNAGSNSSSDGKVGVDMGPAAMRRRCFRSKN 204 Query: 347 IEPLPISTMQVFPLT-GNVKAKKIKKCHWCRGSKC---RCLIKCLTCKKRFFCLECIKER 514 IEP+P T+QV P G ++ K K+CHWC+ S CL KC +C+K FFCL CIKER Sbjct: 205 IEPMPAGTLQVLPYNVGKLRRGKRKRCHWCQRSGSGVSSCLTKCSSCQKHFFCLGCIKER 264 Query: 515 YFEKQ-EVKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXXXXXXXXX 691 YF+ Q EVK CP CRG C CK C + + + A K+ G K++ Sbjct: 265 YFDTQDEVKMACPVCRGTCTCKECSENQSKDAESKDYLGVKNKVEVILHFHYLICMLLPV 324 Query: 692 XKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHRTCTKCSY 871 K++N+DQK+ELE E G++ SE+ I + + N CC+KCK I+D HR+C CSY Sbjct: 325 
LKQINQDQKVELEAEAKMRGEKLSEVHIKKAEYSCNEQQCCNKCKASIVDLHRSCPNCSY 384 Query: 872 NLCLSCCQELSQRSSSGSFMLKSCKKRKISGDVKQIISRHNSRRPPSQSVISSQNWETSE 1051 NLCLSCC+++ S + G + +S+H++++ Sbjct: 385 NLCLSCCRDIFNGS--------------LLGGINTSLSKHSNKKK--------------- 415 Query: 1052 NGSIPCPPTDIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILDSYNLPTT 1198 CG LL LRC+FP +WI +LE AE+I+ SY P T Sbjct: 416 -----------NCCGDGLLHLRCVFPLSWINELEVSAEEIVCSYEFPET 453 >ref|XP_006347155.1| PREDICTED: uncharacterized protein LOC102600140 isoform X3 [Solanum tuberosum] Length = 824 Score = 257 bits (656), Expect = 8e-66 Identities = 128/301 (42%), Positives = 175/301 (58%), Gaps = 12/301 (3%) Frame = +2 Query: 335 RSKNIEPLPISTMQVFPLTGNVK----AKKIKKCHWCRGSKCRCLIKCLTCKKRFFCLEC 502 RSKNIEPLPISTMQ P NVK K+ + CHWCR S R LIKC +CKK++FCL+C Sbjct: 11 RSKNIEPLPISTMQALPFARNVKNLSKVKRRRLCHWCRRSSYRVLIKCSSCKKQYFCLDC 70 Query: 503 IKERYFEKQEVKAECPACRGICGCKLCLKQKIRPATHKESYGGGRKLDRKQXXXXXXXXX 682 IKER E+QE++ +CP CR C C++C + +++P +HKES RK+ + Q Sbjct: 71 IKERNLEQQEIRVKCPICRRDCSCRICKRSELKPNSHKESSRHKRKVPKVQLLYYLVHLL 130 Query: 683 XXXXKKVNRDQKIELEKECTAAGKEQSEILIPQTKLGTNTSFCCSKCKTMIIDCHRTCTK 862 +K+N +Q+IE+E E +GK +S+I I Q G + CS C T I+D HR C+K Sbjct: 131 LPILEKINEEQRIEVEIEANISGKGESDIQIQQASAGDGKLYHCSNCNTSILDYHRICSK 190 Query: 863 CSYNLCLSCCQELSQRSSSGSFMLKSCKKRKISGDVKQIISRHNSRRPPSQSVI------ 1024 CSY+LCL CC++ S + + + + + SR N QS Sbjct: 191 CSYSLCLYCCRDSRHGSLTEDCKSEGSNEEQACSSNFERQSRMNYTSTSRQSFSGIHYPS 250 Query: 1025 --SSQNWETSENGSIPCPPTDIGGCGGSLLDLRCIFPFNWIRDLEDKAEDILDSYNLPTT 1198 S N + +GSI CPP + GGC S LDLRC+FP+ WI++LE AE IL SYN+ T Sbjct: 251 SRSCSNNQACADGSISCPPAEYGGCSDSFLDLRCVFPYPWIKELEISAEAILCSYNIQDT 310 Query: 1199 D 1201 + Sbjct: 311 E 311