BLASTX nr result
ID: Mentha29_contig00011160
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00011160 (3786 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score  E
Sequences producing significant alignments:                          (bits) Value

gb|EYU32492.1| hypothetical protein MIMGU_mgv1a0001072mg, partia...  545  e-152
emb|CBI18961.3| unnamed protein product [Vitis vinifera]             238  2e-59
ref|XP_006357327.1| PREDICTED: uncharacterized protein LOC102595...  220  3e-54
ref|XP_006357328.1| PREDICTED: uncharacterized protein LOC102595...  218  2e-53
ref|XP_007221926.1| hypothetical protein PRUPE_ppa000052mg [Prun...  216  8e-53
gb|EXB28444.1| Zinc finger CCCH domain-containing protein 7 [Mor...  215  1e-52
ref|XP_007019228.1| Zinc finger C-x8-C-x5-C-x3-H type family pro...  215  1e-52
ref|XP_007019227.1| Zinc finger C-x8-C-x5-C-x3-H type family pro...  215  1e-52
ref|XP_007019226.1| Zinc finger C-x8-C-x5-C-x3-H type family pro...  215  1e-52
ref|XP_006434296.1| hypothetical protein CICLE_v10000009mg [Citr...  214  2e-52
ref|XP_004237575.1| PREDICTED: uncharacterized protein LOC101244...  213  4e-52
ref|XP_006472862.1| PREDICTED: uncharacterized protein At1g21580...  211  3e-51
ref|XP_002520303.1| protein with unknown function [Ricinus commu...  207  2e-50
ref|XP_004292729.1| PREDICTED: uncharacterized protein LOC101310...  198  2e-47
ref|XP_006593806.1| PREDICTED: uncharacterized protein LOC100788...  186  5e-44
ref|XP_007161425.1| hypothetical protein PHAVU_001G067600g [Phas...  184  2e-43
ref|XP_007161424.1| hypothetical protein PHAVU_001G067600g [Phas...  184  2e-43
gb|EPS73988.1| hypothetical protein M569_00768, partial [Genlise...  184  4e-43
ref|XP_006596227.1| PREDICTED: uncharacterized protein At1g21580...
178 2e-41 ref|XP_002302217.2| zinc finger family protein [Populus trichoca... 177 3e-41 >gb|EYU32492.1| hypothetical protein MIMGU_mgv1a0001072mg, partial [Mimulus guttatus] Length = 1562 Score = 545 bits (1405), Expect = e-152 Identities = 397/1024 (38%), Positives = 526/1024 (51%), Gaps = 42/1024 (4%) Frame = +1 Query: 841 ADENLSSNKSASYGDRLGFNQHVNIASNYIHDKTSVYDAEEGEILPTDQIIELDSDRDHR 1020 A E L N+S +GD L FNQH + ++ + SV E+ I DQ L +D Sbjct: 371 AHETLLVNESTGHGDSLYFNQHEKNVNKFVENGASVLRTEKRAIWSIDQYTRLGTDE--- 427 Query: 1021 VHDYSEHPDVSVPSEDIVSIKNHDPAGSRETRISEVHE-DNFFSENHKLVPSPKSACEVM 1197 +SE HE D+ + +L+ PK Sbjct: 428 -------------------------------YLSEFHESDSSLESDSRLLQRPKC----- 451 Query: 1198 KMTEDAVVSDVGFKDANSGGFLPNQCNSLQIILAAKDSKSLEG----------------- 1326 + SDVG ANS PNQ N+ QI + A + +S +G Sbjct: 452 -----VISSDVGSAYANSEELHPNQVNTPQIPVHAVEVRSSDGVVRDCSNVNIGITTRSD 506 Query: 1327 ----------QAVKVSSPDNMGTLLSHNIDSVEGGEICFYNDNLFPKGITDHETCLSVDN 1476 V+VS PD +G S ++DSVEG P I++ ETC +D Sbjct: 507 VPSADCKRISAQVEVSFPDALGEKSSTDVDSVEGS----------PTAISNVETCSHMDY 556 Query: 1477 SSKVIRKRKHAGDQLGLLSGTKATVNVRTVRLRSI---VTRCMAEDIVPAMDGNLVGEKD 1647 SSK+IRKRK G+ S + NV RSI V ++ + +P ++ +L+ E D Sbjct: 557 SSKIIRKRKARSAPEGVYS---LSTNVLVGTGRSIGGEVASLLSNNHIPDVEVDLLVEND 613 Query: 1648 SCKEDTNLQQGSSGVKDSLLEVHLSANGSFPGNQKKQTLCSPRXXXXXXXEDDTTAGGMS 1827 +C ED + S V+D+ LEV ANG +KK+ PR +DD AGG++ Sbjct: 614 TCNEDDFFNKELSEVEDTTLEVDSGANGLCFNYRKKKKGSCPRSNLISPLKDDPVAGGVT 673 Query: 1828 NP---LILEQHFFRPSEWEAEQSGNTPPASITISQCETTGVEGEVGINDSSVVSLDENML 1998 + L+L + +EW AE+ ++P S + ++C + +E +V + V LD+N+ Sbjct: 674 SDCSGLVLRS--IKLAEWGAERREDSPSGSTSTTECAISDMEDKVVCENFYVADLDKNLS 731 Query: 1999 DVDRSLGKDDLALNASGL-LCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSS 2175 DV++ D A+ A+ L C +R G + S+S ++ LAS FDM SCM+SPEEL SD S Sbjct: 732 DVNKLYTAGDQAVIANSLSSCGDRTGVAASNSHEDLLASGFDMGSCMSSPEELLSYSDLS 791 Query: 2176 FLENTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKCSGAAVSKSQSNIGIPQSLL 2355 F N AC NE + AA SQ+N + SLL Sbjct: 792 
FSRNV---ACQTKNEEFMKK-----------------------AAADNSQTNGKLSPSLL 825 Query: 2356 KDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSHATKSRTWCRT 2535 + +S++VKK +H KLT SKNQ T ++K P Sbjct: 826 EGSSKMVKKSNFVHGKLTMSKNQPT--VSKASP--------------------------- 856 Query: 2536 VNSTAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASHGVPGSS-S 2712 V EPK +PS +P S+ T AR++ SSYIRKGNSLVR S + G GS S Sbjct: 857 ------VTEPKSQPSLLPPSHETKLARNMQSSYIRKGNSLVRNPSSTGATPTGYHGSGCS 910 Query: 2713 VYRLSHCTNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAVTLNHSGNSLKCTPCHL 2892 VYRL+ CT+ KN A+D D +A T+ R ++ TS K LNH+ C+ Sbjct: 911 VYRLTTCTDNLKNSQASDSEIDDVNASTLLRIKEVHTSAFPKEPPLNHT--------CNS 962 Query: 2893 EEPLSLSNPHINDCPPRT-LDVTKERIRSSVVSECQTDSVINSDSQSTARDGNSEKKIIY 3069 + LS + D P + LD E I+SS V EC+TD V N D QS GN EKKI+Y Sbjct: 963 GDSLS-----VGDTPRNSGLD---ETIKSSAVPECRTDPVSNPDGQSKLA-GNLEKKILY 1013 Query: 3070 VKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNHVAKGNAHATVS 3249 VKRRSNQL+AA +S D S+ G D T++ LSDGYYKS+ NQL+RASS+NHV K +A+ + Sbjct: 1014 VKRRSNQLIAASSSIDTSIPGADKTQASLSDGYYKSKKNQLVRASSENHVKKEDANVNLL 1073 Query: 3250 GLVPQSVVPKTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSK 3429 L P + +P+TS R SGFAKSCR+SKFSSVWKL D QSSEK+KNS+ PRKVWPH F K Sbjct: 1074 RLAPHTNLPRTSKRPVSGFAKSCRHSKFSSVWKLHDKQSSEKHKNSVVPRKVWPHLFPWK 1133 Query: 3430 RAAYWRSLM--LGIKP---SLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSL 3594 RA Y R+ M LG KP SLS SQKLL+SRKRGAIYTRS+HGYSL+MSKVLSVG SSL Sbjct: 1134 RATYLRNFMHALGAKPNSSSLSTTSQKLLLSRKRGAIYTRSTHGYSLRMSKVLSVGASSL 1193 Query: 3595 KWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGER 3774 KWSKSIE++S+ G V IA++SRNHVSR ER Sbjct: 1194 KWSKSIERNSKMANEEATRAVAAAEKKKKEETGAVPIATRSRNHVSR-----------ER 1242 Query: 3775 IFRI 3786 IFRI Sbjct: 1243 IFRI 1246 Score = 137 bits (346), Expect = 3e-29 Identities = 90/216 (41%), Positives = 131/216 (60%), Gaps = 16/216 (7%) Frame = +1 Query: 148 NRQPEFAEDDLSLGRFSGRLG--VSKGEFIR---KQRLQKKNTLLKTPLGKVRSKHNGGS 312 N++PEF +D++ LGRF+GRLG V EF+R K++LQKKN + + P+GK SKH+G Sbjct: 1 
NKEPEFMQDNMRLGRFAGRLGRRVHNEEFVRSNKKRKLQKKNAIHRIPVGKDCSKHSGPP 60 Query: 313 KARHFTKDSNGGSFKVKEK-GYGKMQTRMEYEREREQSPMELAISFKSNALVAKAIQVPS 489 K +HF K+ + G KEK G+ ++QTR+ ++EREQSPMELAISFKSNALVAKAI PS Sbjct: 61 KNQHFKKNLSSGISGCKEKEGFQQLQTRIADKKEREQSPMELAISFKSNALVAKAILAPS 120 Query: 490 NPSTELCEKSNSVKERLGCHVSALPAVKS---VVVS-------DIQSDSWGTSKEHMDKX 639 P+ S++V + ++S P+ KS VV + D++S+S TS E +D+ Sbjct: 121 GPAVRSVIDSSNVNNKTVYNMSDSPSAKSSNGVVKTHCLTHGLDLRSESHRTSIEVLDEA 180 Query: 640 XXXXXXXXXXNCAGDLVEMALENACLKVKTLKGSTM 747 + A E A++N + L+ ST+ Sbjct: 181 SVSGSGFVGVDGAISFGENAIKNEPPRCMNLQSSTV 216 >emb|CBI18961.3| unnamed protein product [Vitis vinifera] Length = 2149 Score = 238 bits (606), Expect = 2e-59 Identities = 222/634 (35%), Positives = 299/634 (47%), Gaps = 42/634 (6%) Frame = +1 Query: 2011 SLGKDDLALNASGLL--CSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLE 2184 +L KDD S L ++ NG S ++S DE + S D S M SPE L + L+ Sbjct: 1252 TLMKDDKQPTVSNYLSIAADGNGVSPTNSNDELMQSLPDTLSNMASPETLPLIPGLHTLD 1311 Query: 2185 NTKASACLLDNEMICQSDNIFNEKHVFADPNTI-SHGKCSGAAVS--KSQSNIGIPQSL- 2352 T+ S + ++ C D +EK + + + +H CS ++ S K IG S+ Sbjct: 1312 -TELSVEQISDQKGCGDDRKSDEKPMVDCGSVLFAHNSCSQSSESNFKLDDAIGSDNSIN 1370 Query: 2353 -------LKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSS-HA 2508 +DT + + I +L SKN + + + +V P L+NS+K SS H Sbjct: 1371 GKTVQPSSQDTKRTTHSVNLISGELNGSKNHLNNLVPRVFPAPSSFFLANSKKTASSTHI 1430 Query: 2509 TKSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSP---SD 2679 K RTW RT S++++ +P P PQ + +SYIRKGNSLVRK +P Sbjct: 1431 AKPRTWYRTGASSSSLKKPLSIAFP-PQRQLKKIGKVQGTSYIRKGNSLVRKPAPVAVIP 1489 Query: 2680 DASHGVPGSSSVYRL--SHCTNTRK---NDLATDYM-------TGDADAPTVKRRGQIST 2823 SHG+ SSSVYRL S RK ++ TD + TG DAP Sbjct: 1490 QGSHGL--SSSVYRLNPSGVDEMRKRTGSESRTDVIDPSNRSSTGATDAP---------- 1537 Query: 2824 SVMTKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERIRSSVVSECQTD 3003 S + L +S KCT I+ P + D K SS +E QT Sbjct: 1538 SERPQTPPLPYSTKLPKCTT-------------ISSVPMSSEDGAK----SSGSTENQTG 1580 Query: 3004 
SVINSDSQSTARDGNSE----KKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYY 3171 + N +SQS DGNSE K++ YVKR+SNQLVAA N DMS+ D T + SD Sbjct: 1581 LINNLESQSVLNDGNSESSKLKRVTYVKRKSNQLVAASNPHDMSVQNADKTPALSSDDDG 1640 Query: 3172 KSRGNQLLRASSKNHVAKGNAHATVSGLVPQSVVPKTSTRRQSG--FAKSCRYSKFSSVW 3345 + Q P+ V K+S++R S +K+ SKFS VW Sbjct: 1641 SNSEGQ---------------------RPPKLVSSKSSSKRPSDKVLSKTREPSKFSLVW 1679 Query: 3346 KLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLM-----LGIKPSLSNISQKLLVSR 3510 L+ +QSSEK NS+ + V P F KRA YWRS M + SLS IS+KLL+ R Sbjct: 1680 TLRGAQSSEKDGNSVHSQGVLPSLFPWKRATYWRSFMHNPASIPNSTSLSMISRKLLLLR 1739 Query: 3511 KRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXX 3690 KR +YTRS+ G+SL+ SKVL VGGSSLKWSKSIE+ S+ Sbjct: 1740 KRDTVYTRSTGGFSLRKSKVLGVGGSSLKWSKSIERQSKKANEEATLAVAAVERKKREQN 1799 Query: 3691 GCVSIAS--KSRNHVSRKWVLSVKLRPGERIFRI 3786 G S+ S +SRNH SR ERIFR+ Sbjct: 1800 GAASVISETESRNHSSR-----------ERIFRV 1822 >ref|XP_006357327.1| PREDICTED: uncharacterized protein LOC102595922 isoform X1 [Solanum tuberosum] Length = 1952 Score = 220 bits (561), Expect = 3e-54 Identities = 208/616 (33%), Positives = 288/616 (46%), Gaps = 25/616 (4%) Frame = +1 Query: 2014 LGKDDLALNASGL--LCSERNGPSGSDSGDESLASSF-DMRSCMTSPEELHVNSDSSFLE 2184 L +DD+ L A L ++ + S D S SF D+ + S E + +S SS + Sbjct: 1044 LDRDDMPLLADNLSLFANKVSVKSMESVPDMSPLVSFPDLTNSSVSEEPIDKSSMSSEIV 1103 Query: 2185 NTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKCSGAAVSKSQS-NIGIPQSLLKD 2361 KA +D I DNI + + +D G+ S V N+ ++ Sbjct: 1104 IEKALR--VDENSITAYDNISSSEKTSSD--AFEFGRSSDHKVGGDPLVNVSTVALSSQN 1159 Query: 2362 TSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSHATKSRTWCRTVN 2541 T + K + K NQ + A +VL V+ P + R + K TW RT N Sbjct: 1160 TVKSSKNVSSQGWKPNLGANQQSPAGPRVLSVR-PSSFITPRNV--PVPKKPLTWHRTGN 1216 Query: 2542 STAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASHGVPG-SSSVY 2718 S+++V + S +P + S + SYIRKGNSLVR SP G SSS Y Sbjct: 1217 SSSSVVGRGSQMSALPPQSHLSKDTAKVGSYIRKGNSLVRNPSPVGSVPKGYHAPSSSTY 1276 Query: 2719 
RL--SHCTNTRKNDLATDYMTGDADA---------------PTVKRRGQISTSVMTKAVT 2847 RL S + R+ +TG PT T V T + Sbjct: 1277 RLNSSGVNDLRRKCENRAEITGSPSCRGTPEVNAPSERPKTPTQSESFSCITLVSTSSPV 1336 Query: 2848 LNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERIRSSVVSECQTDSVINSDSQ 3027 +H GN T +P+ +++ + L ++ SS V ECQ +S SQ Sbjct: 1337 EDHPGNGSIATN---SDPMEVTDNIL------ALKPSEHPSTSSAVPECQIGLGGDSGSQ 1387 Query: 3028 STARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASS 3207 +T +G+S+K I+YVK+RSNQL+AA + S SDGYYK R NQL+RAS Sbjct: 1388 NTLDEGSSKKNIVYVKQRSNQLLAASDKTQTS-----------SDGYYKRRKNQLIRASG 1436 Query: 3208 KNHVAKGNAHATVSGLVPQSVVP-KTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKN 3384 NH+ + + +++VP + T+R +G AK+ + SKFS VWKL D+QSS KY Sbjct: 1437 NNHMKQRI-------VTTKTIVPFQRGTKRLNGLAKTSKLSKFSLVWKLGDTQSSRKYGG 1489 Query: 3385 SLGPRKVWPHFFSSKRAAYWRSLMLGIKPS--LSNISQKLLVSRKRGAIYTRSSHGYSLK 3558 ++ K+WP+ F KRA+Y RS L PS S I +KLL+S+KR IYTRS HG SL+ Sbjct: 1490 TVEYEKLWPYLFPWKRASYRRSF-LSSSPSDNSSIIRRKLLLSKKRETIYTRSIHGLSLR 1548 Query: 3559 MSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRK 3738 SKVLSV GSSLKWSKSIE+ S+ G + S S N+VSR Sbjct: 1549 RSKVLSVSGSSLKWSKSIEQRSKKATEEAALAVAAVDKRKRGQYG-FNADSMSGNNVSR- 1606 Query: 3739 WVLSVKLRPGERIFRI 3786 ERIFRI Sbjct: 1607 ----------ERIFRI 1612 Score = 67.0 bits (162), Expect = 6e-08 Identities = 45/109 (41%), Positives = 67/109 (61%), Gaps = 4/109 (3%) Frame = +1 Query: 190 RFSGRLGVSKGEFIR---KQRLQKKNTLLKTPLGKVRSKHNGGSKARHFTKDSNGGSFKV 360 RFS RL V K E R K+++QKK+ LL+ GK ++ +R+ D + G+ + Sbjct: 251 RFSNRLRVDKEEIHRSPQKKQVQKKSALLRIQCGKANNR------SRNQDHDLSSGAVRG 304 Query: 361 KEKG-YGKMQTRMEYEREREQSPMELAISFKSNALVAKAIQVPSNPSTE 504 K+K + +++ R+E ERE S MEL +SFKSNALVAKAI PS+ + + Sbjct: 305 KQKDVFERLERRVE---EREGSQMELDVSFKSNALVAKAIMTPSSSAID 350 >ref|XP_006357328.1| PREDICTED: uncharacterized protein LOC102595922 isoform X2 [Solanum tuberosum] Length = 1946 Score = 218 bits (555), Expect = 2e-53 Identities = 208/615 (33%), Positives = 282/615 (45%), Gaps = 24/615 (3%) Frame = +1 Query: 2014 
LGKDDLALNASGL--LCSERNGPSGSDSGDESLASSF-DMRSCMTSPEELHVNSDSSFLE 2184 L +DD+ L A L ++ + S D S SF D+ + S E + +S SS + Sbjct: 1044 LDRDDMPLLADNLSLFANKVSVKSMESVPDMSPLVSFPDLTNSSVSEEPIDKSSMSSEIV 1103 Query: 2185 NTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKCSGAAVSKSQS-NIGIPQSLLKD 2361 KA +D I DNI + + +D G+ S V N+ ++ Sbjct: 1104 IEKALR--VDENSITAYDNISSSEKTSSD--AFEFGRSSDHKVGGDPLVNVSTVALSSQN 1159 Query: 2362 TSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSHATKSRTWCRTVN 2541 T + K + K NQ + A +VL V+ P + R + K TW RT N Sbjct: 1160 TVKSSKNVSSQGWKPNLGANQQSPAGPRVLSVR-PSSFITPRNV--PVPKKPLTWHRTGN 1216 Query: 2542 STAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASHGVPG-SSSVY 2718 S+++V + S +P + S + SYIRKGNSLVR SP G SSS Y Sbjct: 1217 SSSSVVGRGSQMSALPPQSHLSKDTAKVGSYIRKGNSLVRNPSPVGSVPKGYHAPSSSTY 1276 Query: 2719 RL--SHCTNTRKNDLATDYMTGDADA---------------PTVKRRGQISTSVMTKAVT 2847 RL S + R+ +TG PT T V T + Sbjct: 1277 RLNSSGVNDLRRKCENRAEITGSPSCRGTPEVNAPSERPKTPTQSESFSCITLVSTSSPV 1336 Query: 2848 LNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERIRSSVVSECQTDSVINSDSQ 3027 +H GN T +P+ +++ + L ++ SS V ECQ +S SQ Sbjct: 1337 EDHPGNGSIATN---SDPMEVTDNIL------ALKPSEHPSTSSAVPECQIGLGGDSGSQ 1387 Query: 3028 STARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASS 3207 +T +G+S+K I+YVK+RSNQL+AA + S SDGYYK R NQL+RAS Sbjct: 1388 NTLDEGSSKKNIVYVKQRSNQLLAASDKTQTS-----------SDGYYKRRKNQLIRASG 1436 Query: 3208 KNHVAKGNAHATVSGLVPQSVVPKTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNS 3387 NH+ + + V KT Q G AK+ + SKFS VWKL D+QSS KY + Sbjct: 1437 NNHMKQ------------RIVTTKTIVPFQRGLAKTSKLSKFSLVWKLGDTQSSRKYGGT 1484 Query: 3388 LGPRKVWPHFFSSKRAAYWRSLMLGIKPS--LSNISQKLLVSRKRGAIYTRSSHGYSLKM 3561 + K+WP+ F KRA+Y RS L PS S I +KLL+S+KR IYTRS HG SL+ Sbjct: 1485 VEYEKLWPYLFPWKRASYRRSF-LSSSPSDNSSIIRRKLLLSKKRETIYTRSIHGLSLRR 1543 Query: 3562 SKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKW 3741 SKVLSV GSSLKWSKSIE+ S+ G + S S N+VSR Sbjct: 1544 SKVLSVSGSSLKWSKSIEQRSKKATEEAALAVAAVDKRKRGQYG-FNADSMSGNNVSR-- 1600 Query: 3742 VLSVKLRPGERIFRI 3786 ERIFRI Sbjct: 
1601 ---------ERIFRI 1606 Score = 67.0 bits (162), Expect = 6e-08 Identities = 45/109 (41%), Positives = 67/109 (61%), Gaps = 4/109 (3%) Frame = +1 Query: 190 RFSGRLGVSKGEFIR---KQRLQKKNTLLKTPLGKVRSKHNGGSKARHFTKDSNGGSFKV 360 RFS RL V K E R K+++QKK+ LL+ GK ++ +R+ D + G+ + Sbjct: 251 RFSNRLRVDKEEIHRSPQKKQVQKKSALLRIQCGKANNR------SRNQDHDLSSGAVRG 304 Query: 361 KEKG-YGKMQTRMEYEREREQSPMELAISFKSNALVAKAIQVPSNPSTE 504 K+K + +++ R+E ERE S MEL +SFKSNALVAKAI PS+ + + Sbjct: 305 KQKDVFERLERRVE---EREGSQMELDVSFKSNALVAKAIMTPSSSAID 350 >ref|XP_007221926.1| hypothetical protein PRUPE_ppa000052mg [Prunus persica] gi|462418862|gb|EMJ23125.1| hypothetical protein PRUPE_ppa000052mg [Prunus persica] Length = 2092 Score = 216 bits (549), Expect = 8e-53 Identities = 206/659 (31%), Positives = 296/659 (44%), Gaps = 62/659 (9%) Frame = +1 Query: 1996 LDVDRSLGKDDLALNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSS 2175 ++ D KD L ++ LL + + + +E + S D S SPE + Sbjct: 1133 MESDHVSVKDSLPFASNRLLLCANDNEVSTTNSNEGVESVPDTLSDTGSPET-STDVPGV 1191 Query: 2176 FLENTKASACLLDNEMICQSDNIFNEKHVF---------------ADPNTISHGKCSGAA 2310 + S + + C D K V N SH G Sbjct: 1192 QMRTCSPSVIKISDGKDCGDDQKLGLKSVVEVGCSASARNSLSECTKSNLTSHPVTEGGQ 1251 Query: 2311 VSKSQSNIGIPQSLLKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRK 2490 S + +P +K T+ + L S++ KNQ+ A +++P S S+K Sbjct: 1252 -SVMGKTVALPLQDIKKTAHGLN-LVTAESRV---KNQLGQATRRIVPGHSYSVFSTSKK 1306 Query: 2491 LYSS-HATKSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVP--------SSYIRK 2643 SS H K RTW R N++A+ P+ +P S+ R++P +SY+RK Sbjct: 1307 TGSSTHMAKPRTWHRNGNASASSL-----PASMPFSSTVPPQRNLPQKDGKLQSNSYVRK 1361 Query: 2644 GNSLVRKDSPS---DDASHGVPGSSSVYRLSHC-TNTRKNDLATDYMTGDADAPTVKRRG 2811 GNSLVRK P +SHG SS+VYRL+ + K + ++ + P++ R G Sbjct: 1362 GNSLVRKPVPVAALPQSSHGF--SSAVYRLNSLGIDGLKKNAGSESRVDVKNPPSLMRTG 1419 Query: 2812 QISTSVMTKAVTLNHSGNSLKC--------TPCHLEEPLSLSNPHINDCPPRTLDVTKER 2967 +++ L + C T L EPL LS +++D P L+ + Sbjct: 1420 EMNAPFDRPRPPLPNGAKLSTCDAISLGVCTSSQLAEPL-LSGENMSD-PMNCLETKDAK 1477 Query: 2968 
I---RSSVVSECQTD--SVINS-DSQSTARDGNSE----KKIIYVKRRSNQLVAACNSGD 3117 I S V SE Q + NS ++Q+ DGNS K I+YVK + NQLVA+ + D Sbjct: 1478 IVVNDSLVTSETQENHSGPFNSLENQTELHDGNSAPSNTKNIVYVKHKLNQLVASSSPCD 1537 Query: 3118 MSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNHVAKGNAHAT---------VSGLVPQSV 3270 + + D + DGYYK R NQL+R SS+ H + + VS +VP + Sbjct: 1538 LPVHNTDKIQHSSFDGYYKRRKNQLIRTSSEGHAKQAVITSNDNLNSQVQKVSKIVPSRI 1597 Query: 3271 VPKTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRS 3450 K R Q AK+ + K S VW + +QSS +S +KV PH F KRA +WR+ Sbjct: 1598 YGKK--RSQKVIAKTSKTGKHSLVWTPRGTQSSNNDGDSFDHQKVLPHLFPWKRARHWRT 1655 Query: 3451 LMLGIKP-----SLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIE 3615 M S S IS+KLL+SR+R +YTRS+HG+SL+M KVLSVGGSSLKWSKSIE Sbjct: 1656 SMQSQASNFKYSSASTISKKLLLSRRRDTVYTRSTHGFSLRMYKVLSVGGSSLKWSKSIE 1715 Query: 3616 KSSRXXXXXXXXXXXXXXXXXXXXXG--CVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786 S+ G CVS SK RN++S G+RIFRI Sbjct: 1716 NRSKKANEEATRAVAAVEKKKREHSGAACVSSGSKFRNNIS-----------GKRIFRI 1763 >gb|EXB28444.1| Zinc finger CCCH domain-containing protein 7 [Morus notabilis] Length = 2046 Score = 215 bits (547), Expect = 1e-52 Identities = 202/642 (31%), Positives = 286/642 (44%), Gaps = 48/642 (7%) Frame = +1 Query: 2005 DRSLGKDDLALNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLE 2184 D L KDD N L S N SG+ S DE++ D + +SP+ D + + Sbjct: 1164 DNLLVKDDFP-NLPNYLSSP-NDCSGATSTDEAMDFVPDSPTMTSSPQTSLDVPDVNMSD 1221 Query: 2185 NTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKC----------SGAAVSKSQSNI 2334 T S + IC+ D +K + + +S K S +A Q+ Sbjct: 1222 VTSVSQI---SNQICREDEKLVQKSLDDKGSEVSAQKSFSQCTKSNLTSDSATECDQAIG 1278 Query: 2335 GIPQSL-LKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSHAT 2511 G L L+D + + + KNQ+ A+++ P + L+ +K +S Sbjct: 1279 GKTAPLSLQDCRSTSRGVNIESVESNEQKNQLDQAVSRTFPGRSSFRLTTFKKRANSTHA 1338 Query: 2512 KSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVP--------SSYIRKGNSLVRKD 2667 RTW R VNS+A P S + + +P +SY+RKGNSLVRK Sbjct: 1339 NPRTWHRNVNSSACAL-----PGSKTFSKNVPSQKQLPERDEKVQSTSYVRKGNSLVRKP 1393 Query: 2668 
SPSDDASHGVPGSSSVYRLSHC-TNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAV 2844 SP+ S G P S VYRL+ ++ K + +D + + R G+ S Sbjct: 1394 SPTAALSQGPPSFSPVYRLNSAGSDELKRSIESDNRVSLGNTHDLSRVGETKASCNNPGP 1453 Query: 2845 TLNHSGNSLK---------CTPCHLEEPLSLSNPHINDCPPRTLD--VTKERIRSSVVSE 2991 SG+ L CT LS N P + + T + S+ SE Sbjct: 1454 LPIQSGSKLPNSVAISPGDCTASPSAGLLSNDRCETNSDPISSTENNETPNLVEDSLTSE 1513 Query: 2992 C------QTDSVINSDSQSTARDGNSE-KKIIYVKRRSNQLVAACNSGDMSMLGVDNTRS 3150 Q +S+ N S A +S K+I+YVKR+SNQLVA NS D ++ Sbjct: 1514 AFENQNGQLNSLDNQTELSNANLASSNMKQIVYVKRKSNQLVATSNS-----TSADKIQT 1568 Query: 3151 QLSDGYYKSRGNQLLRASSKNHVAKG---NAHATVSGLVPQSVVPKTSTRR-QSGFAKSC 3318 SDGYYK + NQL+R S ++H + + + + + V+P S RR K+ Sbjct: 1569 SSSDGYYKRKKNQLIRTSLESHTKQPVMPDDNFNLGVQMTLGVIPNRSKRRGHKVVPKTF 1628 Query: 3319 RYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLMLG----IKPSLSNI 3486 + S S VW L ++S++ SL +KV+PH F KR YWRS ML K S I Sbjct: 1629 KRSTNSLVWTLCSTESTKVNSGSLYHQKVFPHLFPWKRTTYWRSFMLNSNLIYKSSSLAI 1688 Query: 3487 SQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR--XXXXXXXXXXX 3660 S+KLL+SRKR +YTRS +G+SL+ SKVLSVGG+SLKWSKS+E S+ Sbjct: 1689 SKKLLLSRKRDTLYTRSLNGFSLRKSKVLSVGGASLKWSKSLENRSKKVNEEATLAVVAV 1748 Query: 3661 XXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786 C+S SKSRNH SR ERIFRI Sbjct: 1749 DKKKREQKEATCISSGSKSRNHSSR-----------ERIFRI 1779 >ref|XP_007019228.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 3 [Theobroma cacao] gi|508724556|gb|EOY16453.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 3 [Theobroma cacao] Length = 1935 Score = 215 bits (547), Expect = 1e-52 Identities = 220/660 (33%), Positives = 302/660 (45%), Gaps = 71/660 (10%) Frame = +1 Query: 2020 KDDLALNASGLLCS-ERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLENTKA 2196 KDDL L+ + N S ++S DE + + D+ S + SP N D+ + A Sbjct: 1162 KDDLPSALISLVFGVDANEVSATNSNDEVMPAP-DIVSDVGSP----YNHDNFVIS---A 1213 Query: 2197 SACLLDNEMICQSDNIFNEKHVFAD-------PNTISHGKCSGAAVSKSQSNIGIPQS-- 2349 S C +CQ +EK F D P G S A VS SQ + I +S Sbjct: 1214 
STC---KAPLCQQ----SEKQAFGDEKFSDDKPMAEGAGNVS-ALVSYSQHSRTILKSND 1265 Query: 2350 LLKDTSQVVKK--LYPIH-SKLTWSKNQVTSA-------IAKVLPVQHPQNLS-----NS 2484 ++ V K L P H SK T S N ++ A ++ V+P +P S + Sbjct: 1266 AIQTNQSVAGKEVLLPSHDSKNTNSPNSISGATRRRKNPLSHVVPKSYPTRSSFVFSASK 1325 Query: 2485 RKLYSSHATKSRTWCRTVNSTAAVA---EPKIEPSPIPQSNGTSAARSVPSSYIRKGNSL 2655 S++ TK RTW RT NS+A+ +P +P+ + AA SYIRKGNSL Sbjct: 1326 NTTPSTNITKPRTWHRTNNSSASPLSGNKPSSSANPLQRQMPKKAAFFQSPSYIRKGNSL 1385 Query: 2656 VRKD---SPSDDASHGVPGSSSVYRLS--------HCTNTRKNDLATDYMTGDADA---- 2790 VRK SH + SSSVYR++ T A D TG A+A Sbjct: 1386 VRKPVAVPALPQGSHSL--SSSVYRMNPGVVDEVKKGTGPNSRVGAVDLRTGGANASFER 1443 Query: 2791 PTVKRRGQISTSVMTKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERI 2970 PT +S T N G +CT L EP I+DC ++ Sbjct: 1444 PTTPPLSSVSK---VPNCTSNSPG---ECTSSPLAEP------SISDCCETAINHASSME 1491 Query: 2971 RSSVVSEC-----------QTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAAC 3105 + V++ Q SV N + + + N + K++ YVK +SNQLVA Sbjct: 1492 INDVLNSPEDGLKTFETLNQNGSVNNLEECTEQSESNLVPSNAKRLTYVKPKSNQLVATS 1551 Query: 3106 NSGDMSMLGVDNTR--SQLSDGYYKSRGNQLLRASSKNHVAKG-----NAHATVSGLVPQ 3264 G S+L D + S SDGYYK NQL+R + ++H+ + N +V + + Sbjct: 1552 ECGRTSILNADKNQNFSAPSDGYYKKSKNQLIRTALESHIKQAVTMSDNKTNSVGQVAAK 1611 Query: 3265 SVVPKTSTRRQSG--FAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAA 3438 + +T +RQS K+ + SKFS VW L ++ S+ NSL KV P F KR Sbjct: 1612 VMPSRTVGKRQSNKVVGKTHKPSKFSLVWTLHSARLSKNDGNSLRRPKVLPQLFPWKRMT 1671 Query: 3439 YWRSLMLG----IKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSK 3606 YWRS L SLS IS+K+L+SRKR +YTRS +G+S++ SKV SVGGSSLKWSK Sbjct: 1672 YWRSFKLNSVSSCNSSLSTISRKMLLSRKRNTVYTRSINGFSIRKSKVFSVGGSSLKWSK 1731 Query: 3607 SIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786 SIE++SR G VS K R++ K V +LRPGERIFRI Sbjct: 1732 SIERNSRKANEEATLAVAEAERKKREQKGTVSRTGK-RSYSCHKVVHGTELRPGERIFRI 1790 >ref|XP_007019227.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 2 [Theobroma cacao] gi|508724555|gb|EOY16452.1| Zinc finger C-x8-C-x5-C-x3-H type 
family protein, putative isoform 2 [Theobroma cacao] Length = 1962 Score = 215 bits (547), Expect = 1e-52 Identities = 220/660 (33%), Positives = 302/660 (45%), Gaps = 71/660 (10%) Frame = +1 Query: 2020 KDDLALNASGLLCS-ERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLENTKA 2196 KDDL L+ + N S ++S DE + + D+ S + SP N D+ + A Sbjct: 1162 KDDLPSALISLVFGVDANEVSATNSNDEVMPAP-DIVSDVGSP----YNHDNFVIS---A 1213 Query: 2197 SACLLDNEMICQSDNIFNEKHVFAD-------PNTISHGKCSGAAVSKSQSNIGIPQS-- 2349 S C +CQ +EK F D P G S A VS SQ + I +S Sbjct: 1214 STC---KAPLCQQ----SEKQAFGDEKFSDDKPMAEGAGNVS-ALVSYSQHSRTILKSND 1265 Query: 2350 LLKDTSQVVKK--LYPIH-SKLTWSKNQVTSA-------IAKVLPVQHPQNLS-----NS 2484 ++ V K L P H SK T S N ++ A ++ V+P +P S + Sbjct: 1266 AIQTNQSVAGKEVLLPSHDSKNTNSPNSISGATRRRKNPLSHVVPKSYPTRSSFVFSASK 1325 Query: 2485 RKLYSSHATKSRTWCRTVNSTAAVA---EPKIEPSPIPQSNGTSAARSVPSSYIRKGNSL 2655 S++ TK RTW RT NS+A+ +P +P+ + AA SYIRKGNSL Sbjct: 1326 NTTPSTNITKPRTWHRTNNSSASPLSGNKPSSSANPLQRQMPKKAAFFQSPSYIRKGNSL 1385 Query: 2656 VRKD---SPSDDASHGVPGSSSVYRLS--------HCTNTRKNDLATDYMTGDADA---- 2790 VRK SH + SSSVYR++ T A D TG A+A Sbjct: 1386 VRKPVAVPALPQGSHSL--SSSVYRMNPGVVDEVKKGTGPNSRVGAVDLRTGGANASFER 1443 Query: 2791 PTVKRRGQISTSVMTKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERI 2970 PT +S T N G +CT L EP I+DC ++ Sbjct: 1444 PTTPPLSSVSK---VPNCTSNSPG---ECTSSPLAEP------SISDCCETAINHASSME 1491 Query: 2971 RSSVVSEC-----------QTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAAC 3105 + V++ Q SV N + + + N + K++ YVK +SNQLVA Sbjct: 1492 INDVLNSPEDGLKTFETLNQNGSVNNLEECTEQSESNLVPSNAKRLTYVKPKSNQLVATS 1551 Query: 3106 NSGDMSMLGVDNTR--SQLSDGYYKSRGNQLLRASSKNHVAKG-----NAHATVSGLVPQ 3264 G S+L D + S SDGYYK NQL+R + ++H+ + N +V + + Sbjct: 1552 ECGRTSILNADKNQNFSAPSDGYYKKSKNQLIRTALESHIKQAVTMSDNKTNSVGQVAAK 1611 Query: 3265 SVVPKTSTRRQSG--FAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAA 3438 + +T +RQS K+ + SKFS VW L ++ S+ NSL KV P F KR Sbjct: 1612 VMPSRTVGKRQSNKVVGKTHKPSKFSLVWTLHSARLSKNDGNSLRRPKVLPQLFPWKRMT 1671 Query: 3439 
YWRSLMLG----IKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSK 3606 YWRS L SLS IS+K+L+SRKR +YTRS +G+S++ SKV SVGGSSLKWSK Sbjct: 1672 YWRSFKLNSVSSCNSSLSTISRKMLLSRKRNTVYTRSINGFSIRKSKVFSVGGSSLKWSK 1731 Query: 3607 SIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786 SIE++SR G VS K R++ K V +LRPGERIFRI Sbjct: 1732 SIERNSRKANEEATLAVAEAERKKREQKGTVSRTGK-RSYSCHKVVHGTELRPGERIFRI 1790 >ref|XP_007019226.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 1 [Theobroma cacao] gi|508724554|gb|EOY16451.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 1 [Theobroma cacao] Length = 2110 Score = 215 bits (547), Expect = 1e-52 Identities = 220/660 (33%), Positives = 302/660 (45%), Gaps = 71/660 (10%) Frame = +1 Query: 2020 KDDLALNASGLLCS-ERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLENTKA 2196 KDDL L+ + N S ++S DE + + D+ S + SP N D+ + A Sbjct: 1162 KDDLPSALISLVFGVDANEVSATNSNDEVMPAP-DIVSDVGSP----YNHDNFVIS---A 1213 Query: 2197 SACLLDNEMICQSDNIFNEKHVFAD-------PNTISHGKCSGAAVSKSQSNIGIPQS-- 2349 S C +CQ +EK F D P G S A VS SQ + I +S Sbjct: 1214 STC---KAPLCQQ----SEKQAFGDEKFSDDKPMAEGAGNVS-ALVSYSQHSRTILKSND 1265 Query: 2350 LLKDTSQVVKK--LYPIH-SKLTWSKNQVTSA-------IAKVLPVQHPQNLS-----NS 2484 ++ V K L P H SK T S N ++ A ++ V+P +P S + Sbjct: 1266 AIQTNQSVAGKEVLLPSHDSKNTNSPNSISGATRRRKNPLSHVVPKSYPTRSSFVFSASK 1325 Query: 2485 RKLYSSHATKSRTWCRTVNSTAAVA---EPKIEPSPIPQSNGTSAARSVPSSYIRKGNSL 2655 S++ TK RTW RT NS+A+ +P +P+ + AA SYIRKGNSL Sbjct: 1326 NTTPSTNITKPRTWHRTNNSSASPLSGNKPSSSANPLQRQMPKKAAFFQSPSYIRKGNSL 1385 Query: 2656 VRKD---SPSDDASHGVPGSSSVYRLS--------HCTNTRKNDLATDYMTGDADA---- 2790 VRK SH + SSSVYR++ T A D TG A+A Sbjct: 1386 VRKPVAVPALPQGSHSL--SSSVYRMNPGVVDEVKKGTGPNSRVGAVDLRTGGANASFER 1443 Query: 2791 PTVKRRGQISTSVMTKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERI 2970 PT +S T N G +CT L EP I+DC ++ Sbjct: 1444 PTTPPLSSVSK---VPNCTSNSPG---ECTSSPLAEP------SISDCCETAINHASSME 1491 Query: 2971 RSSVVSEC-----------QTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAAC 3105 + V++ Q SV N 
+ + + N + K++ YVK +SNQLVA Sbjct: 1492 INDVLNSPEDGLKTFETLNQNGSVNNLEECTEQSESNLVPSNAKRLTYVKPKSNQLVATS 1551 Query: 3106 NSGDMSMLGVDNTR--SQLSDGYYKSRGNQLLRASSKNHVAKG-----NAHATVSGLVPQ 3264 G S+L D + S SDGYYK NQL+R + ++H+ + N +V + + Sbjct: 1552 ECGRTSILNADKNQNFSAPSDGYYKKSKNQLIRTALESHIKQAVTMSDNKTNSVGQVAAK 1611 Query: 3265 SVVPKTSTRRQSG--FAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAA 3438 + +T +RQS K+ + SKFS VW L ++ S+ NSL KV P F KR Sbjct: 1612 VMPSRTVGKRQSNKVVGKTHKPSKFSLVWTLHSARLSKNDGNSLRRPKVLPQLFPWKRMT 1671 Query: 3439 YWRSLMLG----IKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSK 3606 YWRS L SLS IS+K+L+SRKR +YTRS +G+S++ SKV SVGGSSLKWSK Sbjct: 1672 YWRSFKLNSVSSCNSSLSTISRKMLLSRKRNTVYTRSINGFSIRKSKVFSVGGSSLKWSK 1731 Query: 3607 SIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786 SIE++SR G VS K R++ K V +LRPGERIFRI Sbjct: 1732 SIERNSRKANEEATLAVAEAERKKREQKGTVSRTGK-RSYSCHKVVHGTELRPGERIFRI 1790 >ref|XP_006434296.1| hypothetical protein CICLE_v10000009mg [Citrus clementina] gi|557536418|gb|ESR47536.1| hypothetical protein CICLE_v10000009mg [Citrus clementina] Length = 2165 Score = 214 bits (545), Expect = 2e-52 Identities = 195/591 (32%), Positives = 284/591 (48%), Gaps = 53/591 (8%) Frame = +1 Query: 2014 LGKDDLALNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLENTK 2193 L + DL+ A L ++ +G S ++S DE + FD S + SPE L + L N + Sbjct: 1213 LERGDLS-RAYRALVADGDGVSTTNSYDEMM--EFDSISELGSPEILSTVPVMNAL-NHE 1268 Query: 2194 ASACLLDNEMICQSDNIFNEKHV-------FADPNTISHGKCS-------GAAVSKSQSN 2331 ASA + NE +C+ + I +E+ V A + H K + +A +Q Sbjct: 1269 ASASQISNEKVCRIEKIPSEEPVDEGFFNLSAHTSPSEHAKINLKLDDMLESAHLVAQRT 1328 Query: 2332 IGIPQSLLKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSH-- 2505 + +P +KDT L P+ + K+Q + ++++ P + + SR L SS Sbjct: 1329 VSLPAQDVKDTGLT---LNPMSGETNGKKHQASHCVSRIHPRRSSSVFTASRDLASSTRT 1385 Query: 2506 --ATKSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSPS- 2676 T+ RTW RT +S+A+ A P A+ SYIRKGNSLVRK +P Sbjct: 1386 TCTTRPRTWHRTESSSASPAPGNKSLLPPQNQLPKKVAKYQSMSYIRKGNSLVRKPAPVA 
1445 Query: 2677 --DDASHGVPGSSSVYRLS-----HCTNTRKNDLATDYMTGDA-----DAPTVKRRGQIS 2820 SHG+ +SSVY L+ TR ++ D + + +AP + R + Sbjct: 1446 AVSQISHGL--TSSVYWLNSSGIGESKKTRGSEGGADVVDPPSFLRGVNAPLERPR---T 1500 Query: 2821 TSVMTKAVTLNHSGNSL-KCTPCHLEEPLSLSNPHINDCPPRTLDVTKE------RIRSS 2979 + A NH+ +S T + EPL + +++ E + S Sbjct: 1501 PPLPVVAKVPNHATSSTGDYTSSPVAEPLPNGCSETKSDTQKLMEINDELNFSNAALNIS 1560 Query: 2980 VVSECQTDSVINSDSQSTARDG----NSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTR 3147 QT SV +SQ DG ++ K+I Y+KR+SNQL+AA N +S+ D T+ Sbjct: 1561 KTPVNQTGSVNGLESQGELNDGTLCTSNVKRITYLKRKSNQLIAASNGCSLSVQNPDKTQ 1620 Query: 3148 SQLSDGYYKSRGNQLLRASSKNH----VAKGNAHATVSGLVPQSVVPKTSTRRQSGFA-- 3309 S SDGYYK R NQL+R ++H V+ + T G + + S QS A Sbjct: 1621 STASDGYYKRRKNQLIRTPLESHINQTVSLADGSFTSEGEKCAKDIFRRSDMSQSYKAVK 1680 Query: 3310 KSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLM-----LGIKPS 3474 K C+ +FS VW L QSS+ + L KV P F KR YWR + + S Sbjct: 1681 KICKPIRFSLVWTLNSMQSSKSDDHFLYRGKVLPSLFPWKRTLYWRRFVQDPVSISNNSS 1740 Query: 3475 LSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 3627 LS IS+KLL+ RKR +YTRS+HG+SL+ KVLSVGGSSLKWSKSIE S+ Sbjct: 1741 LSAISRKLLLLRKRDTVYTRSNHGFSLRKYKVLSVGGSSLKWSKSIENRSK 1791 >ref|XP_004237575.1| PREDICTED: uncharacterized protein LOC101244480 [Solanum lycopersicum] Length = 1167 Score = 213 bits (543), Expect = 4e-52 Identities = 203/603 (33%), Positives = 280/603 (46%), Gaps = 31/603 (5%) Frame = +1 Query: 1912 TISQCETTGVEGEVGINDSSVVSLDENMLDVDRS-----LGKDDLALNASGL--LCSERN 2070 ++ ET + V + S V +D+ + L +DD+ L A L ++ + Sbjct: 226 SVVSIETLKMADRVSDDQGSSVGIDQKLAPESHESCHYVLDRDDMPLLADNLSLFANKVS 285 Query: 2071 GPSGSDSGDESLASSF-DMRSCMTSPEELHVNSDSSFLENTKASACLLDNEMICQSDNIF 2247 S D S SF D+ +C S E + +S SS + KA +D DNI Sbjct: 286 VKSMESVPDMSPLLSFPDLTNCSVSEEPIDKSSVSSEIVIEKALR--VDENSRTAYDNIS 343 Query: 2248 NEKHVFADP---NTISHGKCSGAAVSKSQSNIGIPQSLLKDTSQVVKKLYPIHSKLTWSK 2418 + +D + S K G V + Q+ +K + V + + K Sbjct: 344 SSVKTSSDAFEFDRSSDHKVGGNPVVNINTVALSSQNTVKSSKNVSSQGW----KPNLGA 399 Query: 2419 
NQVTSAIAKVLPVQHPQNLSNSRKLYSSHATKSRTWCRTVNSTAAVAEPKIEPSPIPQSN 2598 NQ A ++VL V+ P + R + K TW RT NS ++V + + +P + Sbjct: 400 NQQIPAGSRVLSVR-PSSFITPRNV--PVPKKPLTWHRTGNSFSSVVGRGSQMNSLPPQS 456 Query: 2599 GTSAARSVPSSYIRKGNSLVRKDSPSDDASHGV-PGSSSVYRL--SHCTNTRKNDLATDY 2769 S + SYIRKGNSLVR SP G SSS YRL S + R+ Sbjct: 457 HLSKDTAKVGSYIRKGNSLVRNPSPVGSLPKGYHASSSSTYRLNSSGVNDLRRKCENRAE 516 Query: 2770 MTGDADA---------------PTVKRRGQISTSVMTKAVTLNHSGNSLKCTPCHLEEPL 2904 +TG PT T + T + ++H GN T +P+ Sbjct: 517 ITGSPSCRGTPEVNAPSERPKTPTQSESFSCVTLMSTSSPVVDHPGNGDIATN---SDPM 573 Query: 2905 SLSNPHINDCPPRTLDVTKERIRSSVVSECQTDSVINSDSQSTARDGNSEKKIIYVKRRS 3084 +++ +I P L T SS V ECQ +S SQ+T +G+S K I+YVK+RS Sbjct: 574 EVTD-NILALKPSELPST-----SSAVLECQIGLGGDSGSQNTLDEGSSRKVIVYVKQRS 627 Query: 3085 NQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNHVAKGNAHATVSGLVPQ 3264 NQLVAA + S SDGYYK R NQL+RAS N + + AT +VP Sbjct: 628 NQLVAASDKTQTS-----------SDGYYKRRKNQLIRASGNNQMKQ--RVATTKNIVPF 674 Query: 3265 SVVPKTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYW 3444 Q G AK+ + SKFS VWKL D+QSS KY ++ K+WP F KRA+Y Sbjct: 675 ----------QRGLAKTSKLSKFSLVWKLGDTQSSRKYGGTVEYEKLWPFLFPWKRASYR 724 Query: 3445 RSLMLGIKPS--LSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEK 3618 R+ L PS S I +KLL+S+KR IYTRS HG SL+ SKVLSV GSSLKWSKSIE+ Sbjct: 725 RNF-LSSSPSDNSSIIRRKLLLSKKRETIYTRSIHGLSLRRSKVLSVSGSSLKWSKSIEQ 783 Query: 3619 SSR 3627 S+ Sbjct: 784 RSK 786 >ref|XP_006472862.1| PREDICTED: uncharacterized protein At1g21580-like [Citrus sinensis] Length = 2164 Score = 211 bits (536), Expect = 3e-51 Identities = 189/591 (31%), Positives = 279/591 (47%), Gaps = 53/591 (8%) Frame = +1 Query: 2014 LGKDDLALNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLENTK 2193 L + DL+ A L ++ +G S ++S DE + FD S + SPE L + L N + Sbjct: 1212 LERGDLS-RAYRALVADGDGVSTTNSYDEMM--EFDSISELGSPEILSTVPVMNAL-NHE 1267 Query: 2194 ASACLLDNEMICQSDNIFNEKHV-------FADPNTISHGKCS-------GAAVSKSQSN 2331 ASA + NE +C+ + I +E+ V A + H K + +A +Q Sbjct: 1268 
ASASQISNEKVCRIEKIPSEEPVDEGFFNLSAHTSPSEHAKINLKLDDMLESAHLVAQRT 1327 Query: 2332 IGIPQSLLKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSSH-- 2505 + +P +KDT L P+ + K+Q + ++++ P + + SR L SS Sbjct: 1328 VSLPAQDVKDTGLT---LNPMSGETNGKKHQASHCVSRIHPRRSSSVFTASRDLASSTRT 1384 Query: 2506 --ATKSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVPSSYIRKGNSLVRKDSPS- 2676 T+ RTW RT +S+A+ A P A+ SYIRKGNSLVRK +P Sbjct: 1385 TCTTRPRTWHRTESSSASPAPGNKSLLPPQNQLPKKVAKFQSMSYIRKGNSLVRKPAPVA 1444 Query: 2677 --DDASHGVPGSSSVYRLS-----HCTNTRKNDLATDYMTGDA-----DAPTVKRRGQIS 2820 SHG+ +SSVY L+ TR ++ D + + +AP + R + Sbjct: 1445 AVSQVSHGL--TSSVYWLNSSGIGESKKTRGSEGGADVVDPTSFLRGVNAPLERPR---T 1499 Query: 2821 TSVMTKAVTLNHSGNSL-KCTPCHLEEPLSLSNPHINDCPPRTLDVTKE------RIRSS 2979 + A NH+ +S T + EPL + +++ E + S Sbjct: 1500 PPLPVVAKVPNHATSSTGDYTSSPVAEPLPNGCSETKSDTQKLMEINDELNFSNAALNIS 1559 Query: 2980 VVSECQTDSVINSDSQSTARDG----NSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTR 3147 QT SV +SQ DG ++ K+I Y+KR+SNQL+AA N +S+ D T+ Sbjct: 1560 KTPVNQTGSVNGLESQGELNDGTLCTSNVKRITYLKRKSNQLIAASNGCSLSVQNPDKTQ 1619 Query: 3148 SQLSDGYYKSRGNQLLRASSKNHV------AKGNAHATVSGLVPQSVVPKTSTRRQSGFA 3309 S SDGYYK R NQL+R ++ + A G+ + ++ Sbjct: 1620 STASDGYYKRRKNQLIRTPLESQINQTVSLADGSFTSEGEKCAKDIFTRSDMSQSYKAVK 1679 Query: 3310 KSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLM-----LGIKPS 3474 K C+ +FS VW L QSS+ + L KV P F KR YWR + + S Sbjct: 1680 KICKPIRFSLVWTLNSMQSSKSDDHFLYRGKVLPSLFPWKRTLYWRRFVQDPVSISNNSS 1739 Query: 3475 LSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 3627 LS IS+KLL+ RKR +YTRS+HG+SL+ KVLSVGGSSLKWSKSIE S+ Sbjct: 1740 LSAISRKLLLLRKRDTVYTRSNHGFSLRKYKVLSVGGSSLKWSKSIENRSK 1790 >ref|XP_002520303.1| protein with unknown function [Ricinus communis] gi|223540522|gb|EEF42089.1| protein with unknown function [Ricinus communis] Length = 2030 Score = 207 bits (528), Expect = 2e-50 Identities = 215/711 (30%), Positives = 305/711 (42%), Gaps = 69/711 (9%) Frame = +1 Query: 1861 PSEWEAEQSGNTPPASITISQCETTGVEGEVGINDSSVVSLDENMLDVDRSLGKDDLALN 2040 P+ +E E+ T P IS + + E G 
+ V E L VD + Sbjct: 1018 PTGFEGEKIAGTTPVMAGISH-QNNSIHAESGEGEKMDVDAVEEQLIVDSGTSQCQCPSE 1076 Query: 2041 ASGLLCSER------------NGPSGSDSGDESLASSFDMRSCM------TSPEELHVNS 2166 L ER + +G S +L F +R C TS E + + Sbjct: 1077 VQSLNSDERMPVVNVEDENCLDAKNGLPSASNNL---FSLRDCNGTSTTDTSGEAMVLVP 1133 Query: 2167 DSSFLENTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKCSGAAVSKSQSNIGIPQ 2346 D+ L N L D I QS + + G+ +S S I + Sbjct: 1134 DT--LPNMDYQETLPDAPSILQSSLSIKQAGGNDEILLGMSATQGGSGISAVTSGSLITE 1191 Query: 2347 SLLKDTSQVVKKLYPIHSKLTWSKNQVTSAIAKVLPVQHPQN-------------LSNSR 2487 + + + S+ T S Q +A++K + + + L+++ Sbjct: 1192 DHAVENANSFGGKATLPSQDTKSSTQTLNAMSKEISGRKSHHNIAAYPGRSSFVFLASTS 1251 Query: 2488 KLYSSHATKSRTWCRTVNSTA-AVAEPKIEPSPIPQSNGT--SAARSVPSSYIRKGNSLV 2658 S+H +K RTW RT +S A A+ K+ S +P + +SYIRKGNSLV Sbjct: 1252 TAPSNHISKPRTWHRTDSSFAPALPGNKVFSSTVPTKCQLPKKVTKFHNTSYIRKGNSLV 1311 Query: 2659 RKDS---PSDDASHGVPGSSSVYRLSHCTNTRKNDLATDYMTGDADAPTVKRRGQISTSV 2829 RK + SHG+ S+ S +KN TD TG AD P + G ++ Sbjct: 1312 RKPTLVAAQPLGSHGLSSSAYWLNSSGKYEVKKN---TDTRTGVADPPNFVKSGVGASFE 1368 Query: 2830 MTKAVTL-------NHSGNSL-KCTPCHLEEPLSL------SNPHINDCPPRTLDVTKER 2967 + L NH NS+ C L E L + S+P + L +++ Sbjct: 1369 RPRTPPLPSSTKISNHPTNSMGDCLSSPLVERLHICAAEAASDPVTSTESNDVLKSSEDT 1428 Query: 2968 IRSSVVSECQTDSVINSDSQSTARDGNS----EKKIIYVKRRSNQLVAACNSGDMSMLGV 3135 ++ S QT + N D ++ DGN+ K I YVKR+SNQL+A N +SM Sbjct: 1429 VKVSEKHMFQTGQINNLDCETEQNDGNAVSSNAKSIKYVKRKSNQLIATSNPCSLSMKNS 1488 Query: 3136 DNTRSQLSDGYYKSRGNQLLRASSKNH----VAKGNAHATVSGLVPQSVVPKTS-TRRQS 3300 +T + SDGYYK R NQL+R S +NH + + G ++ S T+R+S Sbjct: 1489 HSTAALPSDGYYKRRKNQLIRTSVENHEKPTASMPDESVNTEGQALHNITSGRSLTKRRS 1548 Query: 3301 G--FAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLM-----L 3459 AK+ + SKFSSVW L +QS + +SL +KV P KRA WRS + + Sbjct: 1549 RKVVAKTRKPSKFSSVWTLHSAQSLKDDSHSLHSQKVLPQLLPWKRATSWRSFIPSSAAI 1608 Query: 3460 GIKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXX 3639 I S S IS+KLL+ RKR +YTRS HGYSL+ SKVLSVGGSSLKWSKSIE+ S+ Sbjct: 1609 
SINGSSSLISRKLLLLRKRDTVYTRSKHGYSLRKSKVLSVGGSSLKWSKSIERQSKKANE 1668 Query: 3640 XXXXXXXXXXXXXXXXXGC--VSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786 G V +K+RN SR ERIFRI Sbjct: 1669 EATLAVAEAERKKRERFGASHVDTGTKNRNSSSR-----------ERIFRI 1708 >ref|XP_004292729.1| PREDICTED: uncharacterized protein LOC101310670 [Fragaria vesca subsp. vesca] Length = 1908 Score = 198 bits (503), Expect = 2e-47 Identities = 196/635 (30%), Positives = 295/635 (46%), Gaps = 38/635 (5%) Frame = +1 Query: 1996 LDVDRSLGKDDLALNASGLLC-SERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDS 2172 +++D KD L S LL ++ N + ++S DE + S D S +PE +D+ Sbjct: 975 MELDYLCVKDKLPFVPSCLLSIAKGNEVTATNSIDEGMKSVPDTLSDTGTPETSTSITDA 1034 Query: 2173 SFLENTKASACLLDNEMICQSDNIFNEKHVFADP-NTISHGKCSGAAVSKSQSNIGIPQS 2349 L + + D E +C D F K A N S K + + ++ + + Sbjct: 1035 HLLICNPSVVKMFD-EKVCGDDQKFELKSEVASAGNFFSETKTNLTLDNVTEGHQSVTGK 1093 Query: 2350 LLKDTSQVVKKL-YPIH--SKLTWSKNQVTSAIAKVLPVQHPQNLSNSRKLYSS-HATKS 2517 + Q KK + +H S + K+Q+ A K++P + S+K SS H +K Sbjct: 1094 TVPLKLQESKKTSHGLHLLSAESALKSQLGQATHKIVPGHPYPTFTTSQKTTSSTHISKP 1153 Query: 2518 RTWCRTVNSTAAVAEPKIEPSP--IPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASH 2691 RTW R NS+A+ P +PQ NG + +SY+RKGN+LVR+ + Sbjct: 1154 RTWHRNANSSASPLHASTLPPQRQLPQRNGKFES----NSYVRKGNTLVRRPASVAAVPQ 1209 Query: 2692 GVPG-SSSVYRL--SHCTNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAVTLNHSG 2862 G +SSVY+L S ++KN +D + ++ R G+I L Sbjct: 1210 SSQGLNSSVYQLNISGIDGSKKN-AGSDGRVDIKNPSSLMRTGKIIAPSDRPTAPLPSEV 1268 Query: 2863 NSLKCTPCHLEEPLSLSNPHINDC------PPRTLDV------TKERIRSSVVSECQTDS 3006 L P ++ P ++D P D+ K+ + +S E + Sbjct: 1269 KMYTSAAISLGTPSQVAEPPLSDFFGTKSDPMNCSDMKDAEGSVKDLLATSDPPEHHSGP 1328 Query: 3007 VINSDSQSTARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGN 3186 V NS S A ++ KK+IYVKR+ NQLVA+ N D+S+ DN +Q SDGYYK R + Sbjct: 1329 VTNSHDGSLA--SSNVKKVIYVKRKLNQLVASSNPSDLSVHNADN--NQPSDGYYKRRKH 1384 Query: 3187 QLLRASSKNH------VAKGNAHATVSGLVPQSVVP-KTSTRRQSGFAKSCRYSKFSSVW 3345 QL+R+S +++ + N ++ V + V+P +T +++S A + K S VW Sbjct: 1385 
QLIRSSLESNGKDTVLLPTDNLNSRVQKAL--KVIPSRTFNKKRSLKAVARTGKKNSLVW 1442 Query: 3346 KLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLMLGIKP-----SLSNISQKLLVSR 3510 +QSS +S +KV PH F KRA WR++M S S IS+KLL+SR Sbjct: 1443 TPSGTQSSNNNGSSFDHQKVLPHLFPWKRARSWRTVMQTQASNFNYSSSSTISKKLLLSR 1502 Query: 3511 KRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXX 3690 R +YTRS+HG+SL+ KVLSVGGSSLKWSKSIE S+ Sbjct: 1503 MRDTVYTRSTHGFSLRKYKVLSVGGSSLKWSKSIESRSK-------------KVNEEATR 1549 Query: 3691 GCVSIASKSRNHVSRKWV---LSVKLRPGERIFRI 3786 +A K R H L ++ PG+RIFRI Sbjct: 1550 AVAEVAKKKREHNGATCASSGLKIRNSPGKRIFRI 1584 >ref|XP_006593806.1| PREDICTED: uncharacterized protein LOC100788859 [Glycine max] Length = 2025 Score = 186 bits (473), Expect = 5e-44 Identities = 200/712 (28%), Positives = 311/712 (43%), Gaps = 73/712 (10%) Frame = +1 Query: 1870 WEAEQSGNTPPASITISQCETTGVEGEVGINDSSVV---SLDENMLDVDRSLG-KDDLAL 2037 W ++ +P + + + +EGE N + +V ++ ++L V G K DL Sbjct: 1004 WHSDIVSFSPCEDLAFPNVQFSSLEGECKENTTPIVPTSNIQTDILAVGNIAGEKTDLQA 1063 Query: 2038 NASGLLCSE---RNGPSGSDSGDES----LASSFDMRSCMTSPEELHVN-SDSSFLENTK 2193 E R+ + + D + L + +++ SC S +E+ N S+ +E+ Sbjct: 1064 VEENYQYREHVQRSPRADMEPNDHNMKNDLLAQWNLMSCPASGDEVTTNNSNDEVIEDAP 1123 Query: 2194 ASACLLDNEMICQSDN-------IFNEKHVFA---DPNTIS----HGKCSGAAVSKSQSN 2331 + + M+ + + N++++F +P+ IS + +++ +++ N Sbjct: 1124 GLSDMFSQGMVSEVPDRRVLEFTAINDENIFGVQENPDNISMVGHDSNLNTSSIQQTKKN 1183 Query: 2332 IG----------IPQSLLKDTSQVVKKL--YPIHSK---LTWSKNQVTSAIAKVLPVQHP 2466 + I + + + SQV K+ ++S L+ +KNQ S I K P H Sbjct: 1184 MKSDHAIEHSNLITKKTMSEQSQVSSKVTTQALNSYCFGLSGTKNQSGSIIPKTFP-GHS 1242 Query: 2467 QNLSNSRKLYSSHATKSRTWCRTVNSTAAVAEPKIEPS--------PIPQSNGTSAARSV 2622 S + S H +K RTW RT N+ A + P+I+PS PI + G Sbjct: 1243 FTFSKT-SASSPHVSKPRTWHRTGNNPPA-SLPRIKPSLGTVPPKKPILEMKGNFQN--- 1297 Query: 2623 PSSYIRKGNSLVRKDSPSDDASHGVPGSSSVYRLSHCTNTRKNDLATDYMTGDADAPTVK 2802 +SY+RKGNSLVRK +P H SSV + S + + + D Sbjct: 1298 -TSYVRKGNSLVRKPTPVSTLPH----ISSVNQTSLGIDEIPKSIKSGGRADVTDKQMYL 1352 Query: 2803 
RRGQISTSVMTKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTK------- 2961 R G + + + L S + T L EP S C D+ K Sbjct: 1353 RTGA-TNAPQQRTPPLPIDTKSEENTSSSLVEPPS------GGCCENASDLRKFIETDNI 1405 Query: 2962 ------ERIRSSVVSECQTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNS 3111 + ++ E Q N DSQ A DGN + K+I+Y+K ++NQLVA NS Sbjct: 1406 APNSSEDALKHYETLENQPGPSDNGDSQGEAIDGNVFPLNTKRIVYIKPKTNQLVATSNS 1465 Query: 3112 GDMSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNHV------AKGNAHATVSGLVPQSVV 3273 D+S+ DN ++ SDGYYK R NQL+R + ++H+ + A++ G Sbjct: 1466 CDVSVSTDDNLQTAFSDGYYKRRKNQLIRTTFESHINQTVAMSNNTAYSGGQGTSNALCN 1525 Query: 3274 PKTSTRRQSGFAKS-CRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRS 3450 + S RR +S C+ S+ S VW L SSE ++S ++ P F KR + S Sbjct: 1526 RRFSKRRTHKVGRSSCKRSRASLVWTLCSKNSSENDRDSQHYQRALPQLFPWKRPTFASS 1585 Query: 3451 LMLGIKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRX 3630 L SLS IS+KLL RKR +YTRS HG+SL+ S+VL VGG SLKWSKSIEK S+ Sbjct: 1586 LN---NSSLSAISKKLLQLRKRDTVYTRSIHGFSLQKSRVLGVGGCSLKWSKSIEKKSKL 1642 Query: 3631 XXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLSVKLRPGERIFRI 3786 V I+S+S+ GERIFRI Sbjct: 1643 ANEEATLAVAAVERKRREQKNAVCISSQSKTADC----------AGERIFRI 1684 >ref|XP_007161425.1| hypothetical protein PHAVU_001G067600g [Phaseolus vulgaris] gi|561034889|gb|ESW33419.1| hypothetical protein PHAVU_001G067600g [Phaseolus vulgaris] Length = 1984 Score = 184 bits (468), Expect = 2e-43 Identities = 188/622 (30%), Positives = 286/622 (45%), Gaps = 42/622 (6%) Frame = +1 Query: 2035 LNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLEN-TKASACLL 2211 LN L +++N S SGDE S+ S +EL V++ + + ++ A + Sbjct: 1061 LNVKNDLLAQQNLMSCPASGDEVTTSN--------SNDELIVDAPGALSDIFSQGMASEV 1112 Query: 2212 DNEMICQSDNIFNEKHVFADPNT--ISHGKCSGAAVSKSQSNIGIPQSLLKDTSQVVKK- 2382 + + + I +E + NT + K +G + N+ I +++ ++SQV K Sbjct: 1113 PDRRVLELTAINDENICGVEENTSSVQEMKQNGRSDHAFGHNMMIKKTI-SESSQVSSKV 1171 Query: 2383 ----LYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSR---KLYSSHATKSRTWCRTVN 2541 L L+ +KNQ S I K P H S S S+H +K RTW RT N Sbjct: 1172 
TTQALNSYRFGLSGTKNQSGSVIPKTFP-GHSLTFSRSETKSSASSTHVSKPRTWHRTGN 1230 Query: 2542 STAAVAEPKIE-----PS--PIPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASHGVP 2700 ++ P+I PS PI + G +SY+RKGNSLVRK +P +P Sbjct: 1231 PPISL--PRINSVGTIPSKRPILERKGNFQN----TSYVRKGNSLVRKPTPVS----ALP 1280 Query: 2701 GSSSVYRLSHC-----TNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAVTLNHSGN 2865 SSV + S + K++ D + P R G + + L + Sbjct: 1281 QISSVNQSSSLGFDDVSKGTKSESRVDL----TNQPMYLRAGATYSQQRQRTPPLPINTK 1336 Query: 2866 SLKCTPCHLEEPLSLSNPHINDCPPRTLDV-------TKERIRSSVVSECQTDSVINSDS 3024 S + T L EP S + P +++ +++ ++ + E Q + N +S Sbjct: 1337 SEENTSSSLVEPPSGGSCENVSDPTSFIEINNNVRNSSEDTLKHYEIPENQPVPLDNGES 1396 Query: 3025 QSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQL 3192 Q A +GN + K+I+Y+K ++NQLVA NS D+S+ DN ++ SD YYK R NQL Sbjct: 1397 QVEANNGNPLSLNTKRIVYIKPKTNQLVATSNSCDVSVPADDNGQTAFSDAYYKRRKNQL 1456 Query: 3193 LRASSKNH------VAKGNAHATVSGLVPQSVVPKTSTRRQSGFAKS-CRYSKFSSVWKL 3351 +R + ++H V G A++ G + S +R + +S C+ S+ S VW L Sbjct: 1457 VRTTFESHNNQTAIVPNGKANSDGQGTSNALCNRRFSKKRLNKVGRSSCKRSRASLVWTL 1516 Query: 3352 QDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLMLGIKPSLSNISQKLLVSRKRGAIYT 3531 SSE +NS +KV P F KRA + S S+S IS+KLL RKR +YT Sbjct: 1517 CSKSSSENDRNSRHYQKVLPQLFPWKRATFASSFN---SSSVSAISKKLLQLRKRDTVYT 1573 Query: 3532 RSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIAS 3711 RS HG+SL S+VL VGG SLKWSKSIEK+S+ V I+S Sbjct: 1574 RSKHGFSLWKSRVLGVGGCSLKWSKSIEKNSKQANEEATLAVAAVEKKKREQKNAVCISS 1633 Query: 3712 KS-RNHVSRKWVLSVKLRPGER 3774 +S R + R + ++ P R Sbjct: 1634 QSKRERIFRFGSVRYRMDPSRR 1655 >ref|XP_007161424.1| hypothetical protein PHAVU_001G067600g [Phaseolus vulgaris] gi|561034888|gb|ESW33418.1| hypothetical protein PHAVU_001G067600g [Phaseolus vulgaris] Length = 1979 Score = 184 bits (468), Expect = 2e-43 Identities = 188/622 (30%), Positives = 286/622 (45%), Gaps = 42/622 (6%) Frame = +1 Query: 2035 LNASGLLCSERNGPSGSDSGDESLASSFDMRSCMTSPEELHVNSDSSFLEN-TKASACLL 2211 LN L +++N S SGDE S+ S +EL V++ + + ++ A + Sbjct: 1061 
LNVKNDLLAQQNLMSCPASGDEVTTSN--------SNDELIVDAPGALSDIFSQGMASEV 1112 Query: 2212 DNEMICQSDNIFNEKHVFADPNT--ISHGKCSGAAVSKSQSNIGIPQSLLKDTSQVVKK- 2382 + + + I +E + NT + K +G + N+ I +++ ++SQV K Sbjct: 1113 PDRRVLELTAINDENICGVEENTSSVQEMKQNGRSDHAFGHNMMIKKTI-SESSQVSSKV 1171 Query: 2383 ----LYPIHSKLTWSKNQVTSAIAKVLPVQHPQNLSNSR---KLYSSHATKSRTWCRTVN 2541 L L+ +KNQ S I K P H S S S+H +K RTW RT N Sbjct: 1172 TTQALNSYRFGLSGTKNQSGSVIPKTFP-GHSLTFSRSETKSSASSTHVSKPRTWHRTGN 1230 Query: 2542 STAAVAEPKIE-----PS--PIPQSNGTSAARSVPSSYIRKGNSLVRKDSPSDDASHGVP 2700 ++ P+I PS PI + G +SY+RKGNSLVRK +P +P Sbjct: 1231 PPISL--PRINSVGTIPSKRPILERKGNFQN----TSYVRKGNSLVRKPTPVS----ALP 1280 Query: 2701 GSSSVYRLSHC-----TNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAVTLNHSGN 2865 SSV + S + K++ D + P R G + + L + Sbjct: 1281 QISSVNQSSSLGFDDVSKGTKSESRVDL----TNQPMYLRAGATYSQQRQRTPPLPINTK 1336 Query: 2866 SLKCTPCHLEEPLSLSNPHINDCPPRTLDV-------TKERIRSSVVSECQTDSVINSDS 3024 S + T L EP S + P +++ +++ ++ + E Q + N +S Sbjct: 1337 SEENTSSSLVEPPSGGSCENVSDPTSFIEINNNVRNSSEDTLKHYEIPENQPVPLDNGES 1396 Query: 3025 QSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQL 3192 Q A +GN + K+I+Y+K ++NQLVA NS D+S+ DN ++ SD YYK R NQL Sbjct: 1397 QVEANNGNPLSLNTKRIVYIKPKTNQLVATSNSCDVSVPADDNGQTAFSDAYYKRRKNQL 1456 Query: 3193 LRASSKNH------VAKGNAHATVSGLVPQSVVPKTSTRRQSGFAKS-CRYSKFSSVWKL 3351 +R + ++H V G A++ G + S +R + +S C+ S+ S VW L Sbjct: 1457 VRTTFESHNNQTAIVPNGKANSDGQGTSNALCNRRFSKKRLNKVGRSSCKRSRASLVWTL 1516 Query: 3352 QDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLMLGIKPSLSNISQKLLVSRKRGAIYT 3531 SSE +NS +KV P F KRA + S S+S IS+KLL RKR +YT Sbjct: 1517 CSKSSSENDRNSRHYQKVLPQLFPWKRATFASSFN---SSSVSAISKKLLQLRKRDTVYT 1573 Query: 3532 RSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIAS 3711 RS HG+SL S+VL VGG SLKWSKSIEK+S+ V I+S Sbjct: 1574 RSKHGFSLWKSRVLGVGGCSLKWSKSIEKNSKQANEEATLAVAAVEKKKREQKNAVCISS 1633 Query: 3712 KS-RNHVSRKWVLSVKLRPGER 3774 +S R + R + ++ P R Sbjct: 1634 QSKRERIFRFGSVRYRMDPSRR 1655 >gb|EPS73988.1| hypothetical protein M569_00768, partial 
[Genlisea aurea] Length = 694 Score = 184 bits (466), Expect = 4e-43 Identities = 143/432 (33%), Positives = 212/432 (49%), Gaps = 2/432 (0%) Frame = +1 Query: 2497 SSHATKSRTWCRTVNSTAAVAEPKIEPSPIPQSNGTSAARSVPSS-YIRKGNSLVRKDSP 2673 S+ KSRTW R+ N + VAEP+ + S +P + + ++PS+ YIR+GNSLVR SP Sbjct: 27 SACIAKSRTWHRSGNVSVVVAEPRSQSSTLPVVHKSKMTENMPSTAYIRQGNSLVRNPSP 86 Query: 2674 SDDASHGVPGS-SSVYRLSHCTNTRKNDLATDYMTGDADAPTVKRRGQISTSVMTKAVTL 2850 S + S SVY+L+ KN + DA ++ ++ S K++ L Sbjct: 87 SGVFPPAIRSSVKSVYKLASLE--AKNTQQFKGKVHEVDASSLLAVKPVTVS---KSLAL 141 Query: 2851 NHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTKERIRSSVVSECQTDSVINSDSQS 3030 N + ++ C SL +++ C RT D K+ + +V +C +DS N++ Sbjct: 142 NRNVKAVNCP-----SDKSLLTRNVSPC--RTSDALKDTKETILVPKCPSDSRDNAECLH 194 Query: 3031 TARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASSK 3210 T + +K+I+Y+KR+ NQLVAA S S++ +D ++ L+DGYYKS+ NQL+R SS Sbjct: 195 TPEE-EQKKEIVYIKRKRNQLVAASTSTSRSVVQLDKSKVSLTDGYYKSKKNQLVRKSSS 253 Query: 3211 NHVAKGNAHATVSGLVPQSVVPKTSTRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNSL 3390 NH K A S ++P + K STR+ S +K + VW L ++S + Sbjct: 254 NH-TKRRASLNFSKVLPLKTITKPSTRQMSTLSK-------AFVWNLHATESPK------ 299 Query: 3391 GPRKVWPHFFSSKRAAYWRSLMLGIKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKV 3570 RKV P KRA +WRS + ++ + L SRKRG +Y RSSHGYSLKMS V Sbjct: 300 -TRKVLPLIVPWKRATHWRSCKYAL--NIRQNVRALPTSRKRGTVYLRSSHGYSLKMSGV 356 Query: 3571 LSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIASKSRNHVSRKWVLS 3750 SV SLK + S C S A+ +N+ V Sbjct: 357 RSVAECSLKGQNPPDMKSE---------------NTNEGDACTSTATMEQNNDEGPAVHM 401 Query: 3751 VKLRPGERIFRI 3786 ER+FRI Sbjct: 402 PSTSRRERVFRI 413 >ref|XP_006596227.1| PREDICTED: uncharacterized protein At1g21580-like [Glycine max] Length = 1672 Score = 178 bits (451), Expect = 2e-41 Identities = 158/477 (33%), Positives = 224/477 (46%), Gaps = 34/477 (7%) Frame = +1 Query: 2299 SGAAVSKSQSNIGIPQSLLKDTSQVVKK-----LYPIHSKLTWSKNQVTSAIAKVLPVQH 2463 SG A+ S I + + + SQV + L L+ +KNQ S I K P H Sbjct: 1156 
SGHAIEHSNL---ITKKTMSEPSQVSSRVTTQALNSYRFGLSGTKNQSGSVIPKTFP-GH 1211 Query: 2464 PQNLSNSRKLYSSHATKSRTWCRTVN---STAAVAEPKIEPSPIPQSNGTSAARSVPSSY 2634 S + S H +K RTW RT N ++ +P +E P + + +SY Sbjct: 1212 SFTFSKA-SASSPHVSKPRTWLRTGNIPPTSVLRIKPSVETVPPKRPILETKGNFQNTSY 1270 Query: 2635 IRKGNSLVRKDSPSDDASHGVPGSSSVYRLSHC-TNTRKNDLATDYMTGDADAPTVKRRG 2811 +RKGNSLVRK +P +P SSV + S + + + D P + G Sbjct: 1271 VRKGNSLVRKPTPVST----LPQISSVNQTSSLGIDEIPKSIKSGRRADGTDKPMYLKTG 1326 Query: 2812 QISTSVM-TKAVTLNHSGNSLKCTPCHLEEPLSLSNPHINDCPPRTLDVTK--------- 2961 I+ T + ++ T SL P C DV K Sbjct: 1327 AINAPQQRTPPLPID--------TKLEENRSSSLVEPPSGGCCENASDVRKFIETDNIAP 1378 Query: 2962 ----ERIRSSVVSECQTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNSGD 3117 + ++ E Q+ N +SQ A DGN + K+I+Y+K ++NQLVA NS D Sbjct: 1379 NSSEDALKHCETPENQSGPSDNGESQGEANDGNVFPLNTKRIVYIKPKTNQLVATSNSYD 1438 Query: 3118 MSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNH----VAKGNAHATVSGLVPQSVV--PK 3279 +S+ DN ++ SDGYYK R NQL+R + ++H VA N A G + + + Sbjct: 1439 VSVSTDDNLQTAFSDGYYKRRKNQLVRTTIESHINQTVAMPNNTANSDGQGTSNALCNRR 1498 Query: 3280 TSTRRQSGFAKSC-RYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFSSKRAAYWRSLM 3456 S +R +S + S+ S VW L SSE ++S ++ P F KRAA+ SL Sbjct: 1499 FSKKRTHKVGRSSFKRSRASLVWTLCSKNSSENDRDSRHYQRALPLLFPWKRAAFASSLN 1558 Query: 3457 LGIKPSLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 3627 SLS IS+KLL RKR +YTRS HG+SL+ S+VL VGG SLKWSKSIEK+S+ Sbjct: 1559 ---NSSLSAISKKLLQLRKRDTVYTRSIHGFSLRKSRVLGVGGCSLKWSKSIEKNSK 1612 >ref|XP_002302217.2| zinc finger family protein [Populus trichocarpa] gi|550344506|gb|EEE81490.2| zinc finger family protein [Populus trichocarpa] Length = 2120 Score = 177 bits (449), Expect = 3e-41 Identities = 211/672 (31%), Positives = 294/672 (43%), Gaps = 70/672 (10%) Frame = +1 Query: 1981 LDENMLDVDRSLGKDDLALNASGLLCSERN------GPSGSDSGDESLASSFDMRSCMTS 2142 LDEN+ +D G A N S + + + G S ++SGDE + + S S Sbjct: 1161 LDENIPSIDVDDGGFHGAKNDSPCMSNNPSSFGDGFGVSFTNSGDELVEIVPETLSDRGS 1220 Query: 2143 PEELHVNSDSSFLENTKASACLLDNEMICQSDNIFNEKHVFADPNTISHGKCSGAAVSKS 
2322 PE L +S +N+ E I ++D+ + A+ I+ G S ++S S Sbjct: 1221 PETLPDVMGTSLSKNSV--------EKIHENDD-----KIPAERPVINVGSDSSMSISSS 1267 Query: 2323 QSN---IGIPQSLLKDTSQVVKK--LYPIHSKLTWS------------KNQVTSAIAKVL 2451 Q+ + + ++ +D K L SK+T KN + I+K+ Sbjct: 1268 QNAKVVLNLDHAVERDQLLTGKTGHLPSQDSKITTQMPNAKSGDLYGKKNHSSHPISKIY 1327 Query: 2452 PVQHPQNLSNSRK-LYSSHATKSRTWCRTVN-STAAVAEPKIEPSPIPQSN--GTSAARS 2619 + S S+ SS +K+RTW R N S +A K S +P +S Sbjct: 1328 SGRSSFVFSASKSSASSSRISKTRTWHRNDNCSDSAPPSNKAFSSTVPAQRLFPRKGDKS 1387 Query: 2620 VPSSYIRKGNSLVRKDSPSDDASHGVPGSSSVYRL-SHCTNTRKNDLATDYMTGDADAPT 2796 +SYIRKGNSLVRK + + SSSVY+L S T+ K +D AD Sbjct: 1388 QRTSYIRKGNSLVRKPTSVAQSPGPHALSSSVYQLNSSGTDEPKKSAGSDSRIDLADPLN 1447 Query: 2797 VKRRGQISTS--------VMTKAVTLNHSGNSL--KCTPCHLEEPLSLSNPHI------- 2925 V R G + S + + + N + NSL + + E SL + Sbjct: 1448 VLRTGGMDASFEKPRTPSLSSVSKISNRASNSLGGRASSPLAEHLHSLCTETVTVPAKLL 1507 Query: 2926 --NDCPPRTLDVTKERIRSSVVSECQTDSVINSDSQSTARDGNSE-----KKIIYVKRRS 3084 ND P + DV K I S ++ Q + N + S DGN+ K + YVKR+S Sbjct: 1508 ESNDVPKSSDDVLK--ISGSPIT--QNSQISNLECHSDTNDGNTVALANGKSLTYVKRKS 1563 Query: 3085 NQLVAACNSGDMSMLGVDNTRSQLSDGYYKSRGNQLLRASSKNHVAKGNAHATVSGLVPQ 3264 NQLVA+ N S+ NT S D YYK R NQL+R S ++ + K A L + Sbjct: 1564 NQLVASSNPCASSVQNAHNTSS---DSYYKRRKNQLIRTSLESQI-KQTASIPDESLNSE 1619 Query: 3265 SVVPKTS-------TRRQSGFAKSCRYSKFSSVWKLQDSQSSEKYKNSLGPRKVWPHFFS 3423 S R++ K+C+ SK S VW L +Q S+ +S KV PH F Sbjct: 1620 GQTALNSFSRNFSKRRQRKVVTKTCKPSKLSLVWTLHGAQLSKNDGDSSHCGKVLPHLFP 1679 Query: 3424 SKRAAYWRSLM-----LGIKPSLSNISQ----KLLVSRKRGAIYTRSSHGYSLKMSKVLS 3576 KRA Y RS + + SLS I KLL+ RKR YTRS HG+SL+ SKVLS Sbjct: 1680 WKRATYRRSSLPNSSSISDHSSLSTIGYNNWWKLLLLRKRNTEYTRSKHGFSLRKSKVLS 1739 Query: 3577 VGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXXXXXXGCVSIA--SKSRNHVSRKWVLS 3750 VGGSSLKWSKSIEK S+ G +A +KSRN +SR Sbjct: 1740 VGGSSLKWSKSIEKHSKKANEEATLAVAAAERKKREQRGAAHVACPTKSRN-ISR----- 1793 Query: 3751 VKLRPGERIFRI 3786 ERIFR+ Sbjct: 1794 ------ERIFRV 1799