BLASTX nr result
ID: Mentha22_contig00020549
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha22_contig00020549 (794 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EYU32492.1| hypothetical protein MIMGU_mgv1a0001072mg, partia... 294 2e-77
ref|XP_006357327.1| PREDICTED: uncharacterized protein LOC102595... 181 3e-43
ref|XP_006357328.1| PREDICTED: uncharacterized protein LOC102595... 179 1e-42
ref|XP_004237575.1| PREDICTED: uncharacterized protein LOC101244... 172 1e-40
ref|XP_007221926.1| hypothetical protein PRUPE_ppa000052mg [Prun... 168 2e-39
ref|XP_002520303.1| protein with unknown function [Ricinus commu... 166 9e-39
gb|EXB28444.1| Zinc finger CCCH domain-containing protein 7 [Mor... 163 8e-38
emb|CBI18961.3| unnamed protein product [Vitis vinifera] 162 1e-37
ref|XP_006434296.1| hypothetical protein CICLE_v10000009mg [Citr... 160 4e-37
ref|XP_007019228.1| Zinc finger C-x8-C-x5-C-x3-H type family pro... 160 5e-37
ref|XP_007019227.1| Zinc finger C-x8-C-x5-C-x3-H type family pro... 160 5e-37
ref|XP_007019226.1| Zinc finger C-x8-C-x5-C-x3-H type family pro... 160 5e-37
ref|XP_006472862.1| PREDICTED: uncharacterized protein At1g21580... 157 6e-36
ref|XP_007161425.1| hypothetical protein PHAVU_001G067600g [Phas... 154 4e-35
ref|XP_007161424.1| hypothetical protein PHAVU_001G067600g [Phas... 154 4e-35
ref|XP_004498428.1| PREDICTED: uncharacterized protein At1g21580... 154 5e-35
ref|XP_006593806.1| PREDICTED: uncharacterized protein LOC100788... 153 6e-35
ref|XP_006596227.1| PREDICTED: uncharacterized protein At1g21580... 151 2e-34
ref|XP_004292729.1| PREDICTED: uncharacterized protein LOC101310...
149 9e-34 ref|XP_002302217.2| zinc finger family protein [Populus trichoca... 142 1e-31 >gb|EYU32492.1| hypothetical protein MIMGU_mgv1a0001072mg, partial [Mimulus guttatus] Length = 1562 Score = 294 bits (753), Expect = 2e-77 Identities = 161/265 (60%), Positives = 190/265 (71%), Gaps = 5/265 (1%) Frame = -1 Query: 794 SSVVSECQTDSVINSDSQSTARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQ 615 SS V EC+TD V N D QS GN EKKI+YVKRRSNQL+AA +S D S+ G D T++ Sbjct: 983 SSAVPECRTDPVSNPDGQSKLA-GNLEKKILYVKRRSNQLIAASSSIDTSIPGADKTQAS 1041 Query: 614 LSDGYYKSRGNQLLRASSENHVAKGNANATVSGLVPQSVVPKTSTRRQSGFAKSCRYSKF 435 LSDGYYKS+ NQL+RASSENHV K +AN + L P + +P+TS R SGFAKSCR+SKF Sbjct: 1042 LSDGYYKSKKNQLVRASSENHVKKEDANVNLLRLAPHTNLPRTSKRPVSGFAKSCRHSKF 1101 Query: 434 SFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLM--LGIKP---SLSNISQKL 270 S VWKL D QSSEK+KNS+ PRKVWPHLF KRA Y R+ M LG KP SLS SQKL Sbjct: 1102 SSVWKLHDKQSSEKHKNSVVPRKVWPHLFPWKRATYLRNFMHALGAKPNSSSLSTTSQKL 1161 Query: 269 LVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXXXXXX 90 L+SRKRGAIYTRS+HGYSL+MSKVLSVG SSLKWSKSIE++S+ Sbjct: 1162 LLSRKRGAIYTRSTHGYSLRMSKVLSVGASSLKWSKSIERNSKMANEEATRAVAAAEKKK 1221 Query: 89 XXXKGCVSIASKSRNHVSRKWVLSV 15 G V IA++SRNHVSR+ + + Sbjct: 1222 KEETGAVPIATRSRNHVSRERIFRI 1246 >ref|XP_006357327.1| PREDICTED: uncharacterized protein LOC102595922 isoform X1 [Solanum tuberosum] Length = 1952 Score = 181 bits (459), Expect = 3e-43 Identities = 110/221 (49%), Positives = 142/221 (64%), Gaps = 3/221 (1%) Frame = -1 Query: 794 SSVVSECQTDSVINSDSQSTARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQ 615 SS V ECQ +S SQ+T +G+S+K I+YVK+RSNQL+AA + S Sbjct: 1370 SSAVPECQIGLGGDSGSQNTLDEGSSKKNIVYVKQRSNQLLAASDKTQTS---------- 1419 Query: 614 LSDGYYKSRGNQLLRASSENHVAKGNANATVSGLVPQSVVP-KTSTRRQSGFAKSCRYSK 438 SDGYYK R NQL+RAS NH+ + +++VP + T+R +G AK+ + SK Sbjct: 1420 -SDGYYKRRKNQLIRASGNNHMKQRIVTT-------KTIVPFQRGTKRLNGLAKTSKLSK 1471 Query: 437 FSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLGIKPS--LSNISQKLLV 264 FS VWKL D+QSS KY ++ K+WP+LF KRA+Y RS L PS S I 
+KLL+ Sbjct: 1472 FSLVWKLGDTQSSRKYGGTVEYEKLWPYLFPWKRASYRRS-FLSSSPSDNSSIIRRKLLL 1530 Query: 263 SRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 141 S+KR IYTRS HG SL+ SKVLSV GSSLKWSKSIE+ S+ Sbjct: 1531 SKKRETIYTRSIHGLSLRRSKVLSVSGSSLKWSKSIEQRSK 1571 >ref|XP_006357328.1| PREDICTED: uncharacterized protein LOC102595922 isoform X2 [Solanum tuberosum] Length = 1946 Score = 179 bits (453), Expect = 1e-42 Identities = 110/220 (50%), Positives = 137/220 (62%), Gaps = 2/220 (0%) Frame = -1 Query: 794 SSVVSECQTDSVINSDSQSTARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQ 615 SS V ECQ +S SQ+T +G+S+K I+YVK+RSNQL+AA + S Sbjct: 1370 SSAVPECQIGLGGDSGSQNTLDEGSSKKNIVYVKQRSNQLLAASDKTQTS---------- 1419 Query: 614 LSDGYYKSRGNQLLRASSENHVAKGNANATVSGLVPQSVVPKTSTRRQSGFAKSCRYSKF 435 SDGYYK R NQL+RAS NH+ + + V KT Q G AK+ + SKF Sbjct: 1420 -SDGYYKRRKNQLIRASGNNHMKQ------------RIVTTKTIVPFQRGLAKTSKLSKF 1466 Query: 434 SFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLGIKPS--LSNISQKLLVS 261 S VWKL D+QSS KY ++ K+WP+LF KRA+Y RS L PS S I +KLL+S Sbjct: 1467 SLVWKLGDTQSSRKYGGTVEYEKLWPYLFPWKRASYRRSF-LSSSPSDNSSIIRRKLLLS 1525 Query: 260 RKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 141 +KR IYTRS HG SL+ SKVLSV GSSLKWSKSIE+ S+ Sbjct: 1526 KKRETIYTRSIHGLSLRRSKVLSVSGSSLKWSKSIEQRSK 1565 >ref|XP_004237575.1| PREDICTED: uncharacterized protein LOC101244480 [Solanum lycopersicum] Length = 1167 Score = 172 bits (437), Expect = 1e-40 Identities = 110/220 (50%), Positives = 135/220 (61%), Gaps = 2/220 (0%) Frame = -1 Query: 794 SSVVSECQTDSVINSDSQSTARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQ 615 SS V ECQ +S SQ+T +G+S K I+YVK+RSNQLVAA + S Sbjct: 591 SSAVLECQIGLGGDSGSQNTLDEGSSRKVIVYVKQRSNQLVAASDKTQTS---------- 640 Query: 614 LSDGYYKSRGNQLLRASSENHVAKGNANATVSGLVPQSVVPKTSTRRQSGFAKSCRYSKF 435 SDGYYK R NQL+RAS N + + AT +VP Q G AK+ + SKF Sbjct: 641 -SDGYYKRRKNQLIRASGNNQMKQ--RVATTKNIVPF----------QRGLAKTSKLSKF 687 Query: 434 SFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLGIKPS--LSNISQKLLVS 261 S VWKL D+QSS KY ++ K+WP LF KRA+Y R+ L PS S I +KLL+S Sbjct: 688 
SLVWKLGDTQSSRKYGGTVEYEKLWPFLFPWKRASYRRNF-LSSSPSDNSSIIRRKLLLS 746 Query: 260 RKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 141 +KR IYTRS HG SL+ SKVLSV GSSLKWSKSIE+ S+ Sbjct: 747 KKRETIYTRSIHGLSLRRSKVLSVSGSSLKWSKSIEQRSK 786 >ref|XP_007221926.1| hypothetical protein PRUPE_ppa000052mg [Prunus persica] gi|462418862|gb|EMJ23125.1| hypothetical protein PRUPE_ppa000052mg [Prunus persica] Length = 2092 Score = 168 bits (426), Expect = 2e-39 Identities = 116/283 (40%), Positives = 152/283 (53%), Gaps = 23/283 (8%) Frame = -1 Query: 794 SSVVSECQTD--SVINS-DSQSTARDGNSE----KKIIYVKRRSNQLVAACNSGDMSMLG 636 S V SE Q + NS ++Q+ DGNS K I+YVK + NQLVA+ + D+ + Sbjct: 1483 SLVTSETQENHSGPFNSLENQTELHDGNSAPSNTKNIVYVKHKLNQLVASSSPCDLPVHN 1542 Query: 635 VDNTRSQLSDGYYKSRGNQLLRASSENHVAKG------NANATV---SGLVPQSVVPKTS 483 D + DGYYK R NQL+R SSE H + N N+ V S +VP + K Sbjct: 1543 TDKIQHSSFDGYYKRRKNQLIRTSSEGHAKQAVITSNDNLNSQVQKVSKIVPSRIYGKK- 1601 Query: 482 TRRQSGFAKSCRYSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLGI 303 R Q AK+ + K S VW + +QSS +S +KV PHLF KRA +WR+ M Sbjct: 1602 -RSQKVIAKTSKTGKHSLVWTPRGTQSSNNDGDSFDHQKVLPHLFPWKRARHWRTSMQSQ 1660 Query: 302 KP-----SLSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRX 138 S S IS+KLL+SR+R +YTRS+HG+SL+M KVLSVGGSSLKWSKSIE S+ Sbjct: 1661 ASNFKYSSASTISKKLLLSRRRDTVYTRSTHGFSLRMYKVLSVGGSSLKWSKSIENRSKK 1720 Query: 137 XXXXXXXXXXXXXXXXXXXKG--CVSIASKSRNHVSRKWVLSV 15 G CVS SK RN++S K + + Sbjct: 1721 ANEEATRAVAAVEKKKREHSGAACVSSGSKFRNNISGKRIFRI 1763 >ref|XP_002520303.1| protein with unknown function [Ricinus communis] gi|223540522|gb|EEF42089.1| protein with unknown function [Ricinus communis] Length = 2030 Score = 166 bits (420), Expect = 9e-39 Identities = 111/271 (40%), Positives = 147/271 (54%), Gaps = 18/271 (6%) Frame = -1 Query: 773 QTDSVINSDSQSTARDGNS----EKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSD 606 QT + N D ++ DGN+ K I YVKR+SNQL+A N +SM +T + SD Sbjct: 1438 QTGQINNLDCETEQNDGNAVSSNAKSIKYVKRKSNQLIATSNPCSLSMKNSHSTAALPSD 1497 Query: 605 
GYYKSRGNQLLRASSENH----VAKGNANATVSGLVPQSVVPKTS-TRRQSG--FAKSCR 447 GYYK R NQL+R S ENH + + + G ++ S T+R+S AK+ + Sbjct: 1498 GYYKRRKNQLIRTSVENHEKPTASMPDESVNTEGQALHNITSGRSLTKRRSRKVVAKTRK 1557 Query: 446 YSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLM-----LGIKPSLSNI 282 SKFS VW L +QS + +SL +KV P L KRA WRS + + I S S I Sbjct: 1558 PSKFSSVWTLHSAQSLKDDSHSLHSQKVLPQLLPWKRATSWRSFIPSSAAISINGSSSLI 1617 Query: 281 SQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXXX 102 S+KLL+ RKR +YTRS HGYSL+ SKVLSVGGSSLKWSKSIE+ S+ Sbjct: 1618 SRKLLLLRKRDTVYTRSKHGYSLRKSKVLSVGGSSLKWSKSIERQSKKANEEATLAVAEA 1677 Query: 101 XXXXXXXKGC--VSIASKSRNHVSRKWVLSV 15 G V +K+RN SR+ + + Sbjct: 1678 ERKKRERFGASHVDTGTKNRNSSSRERIFRI 1708 >gb|EXB28444.1| Zinc finger CCCH domain-containing protein 7 [Morus notabilis] Length = 2046 Score = 163 bits (412), Expect = 8e-38 Identities = 108/264 (40%), Positives = 145/264 (54%), Gaps = 11/264 (4%) Frame = -1 Query: 773 QTDSVINSDSQSTARDGNSE-KKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYY 597 Q +S+ N S A +S K+I+YVKR+SNQLVA NS D ++ SDGYY Sbjct: 1521 QLNSLDNQTELSNANLASSNMKQIVYVKRKSNQLVATSNS-----TSADKIQTSSSDGYY 1575 Query: 596 KSRGNQLLRASSENHVAKG---NANATVSGLVPQSVVPKTSTRR-QSGFAKSCRYSKFSF 429 K + NQL+R S E+H + + N + + V+P S RR K+ + S S Sbjct: 1576 KRKKNQLIRTSLESHTKQPVMPDDNFNLGVQMTLGVIPNRSKRRGHKVVPKTFKRSTNSL 1635 Query: 428 VWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLG----IKPSLSNISQKLLVS 261 VW L ++S++ SL +KV+PHLF KR YWRS ML K S IS+KLL+S Sbjct: 1636 VWTLCSTESTKVNSGSLYHQKVFPHLFPWKRTTYWRSFMLNSNLIYKSSSLAISKKLLLS 1695 Query: 260 RKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR--XXXXXXXXXXXXXXXXXX 87 RKR +YTRS +G+SL+ SKVLSVGG+SLKWSKS+E S+ Sbjct: 1696 RKRDTLYTRSLNGFSLRKSKVLSVGGASLKWSKSLENRSKKVNEEATLAVVAVDKKKREQ 1755 Query: 86 XXKGCVSIASKSRNHVSRKWVLSV 15 C+S SKSRNH SR+ + + Sbjct: 1756 KEATCISSGSKSRNHSSRERIFRI 1779 >emb|CBI18961.3| unnamed protein product [Vitis vinifera] Length = 2149 Score = 162 bits (410), Expect = 1e-37 Identities = 114/273 (41%), Positives = 146/273 (53%), Gaps = 13/273 (4%) Frame 
= -1 Query: 794 SSVVSECQTDSVINSDSQSTARDGNSE----KKIIYVKRRSNQLVAACNSGDMSMLGVDN 627 SS +E QT + N +SQS DGNSE K++ YVKR+SNQLVAA N DMS+ D Sbjct: 1571 SSGSTENQTGLINNLESQSVLNDGNSESSKLKRVTYVKRKSNQLVAASNPHDMSVQNADK 1630 Query: 626 TRSQLSDGYYKSRGNQLLRASSENHVAKGNANATVSGLVPQSVVPKTSTRRQSG--FAKS 453 T + SD + Q P+ V K+S++R S +K+ Sbjct: 1631 TPALSSDDDGSNSEGQR---------------------PPKLVSSKSSSKRPSDKVLSKT 1669 Query: 452 CRYSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLM-----LGIKPSLS 288 SKFS VW L+ +QSSEK NS+ + V P LF KRA YWRS M + SLS Sbjct: 1670 REPSKFSLVWTLRGAQSSEKDGNSVHSQGVLPSLFPWKRATYWRSFMHNPASIPNSTSLS 1729 Query: 287 NISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXX 108 IS+KLL+ RKR +YTRS+ G+SL+ SKVL VGGSSLKWSKSIE+ S+ Sbjct: 1730 MISRKLLLLRKRDTVYTRSTGGFSLRKSKVLGVGGSSLKWSKSIERQSKKANEEATLAVA 1789 Query: 107 XXXXXXXXXKGCVSIAS--KSRNHVSRKWVLSV 15 G S+ S +SRNH SR+ + V Sbjct: 1790 AVERKKREQNGAASVISETESRNHSSRERIFRV 1822 >ref|XP_006434296.1| hypothetical protein CICLE_v10000009mg [Citrus clementina] gi|557536418|gb|ESR47536.1| hypothetical protein CICLE_v10000009mg [Citrus clementina] Length = 2165 Score = 160 bits (406), Expect = 4e-37 Identities = 102/226 (45%), Positives = 132/226 (58%), Gaps = 15/226 (6%) Frame = -1 Query: 773 QTDSVINSDSQSTARDG----NSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSD 606 QT SV +SQ DG ++ K+I Y+KR+SNQL+AA N +S+ D T+S SD Sbjct: 1566 QTGSVNGLESQGELNDGTLCTSNVKRITYLKRKSNQLIAASNGCSLSVQNPDKTQSTASD 1625 Query: 605 GYYKSRGNQLLRASSENH----VAKGNANATVSGLVPQSVVPKTSTRRQSGFA--KSCRY 444 GYYK R NQL+R E+H V+ + + T G + + S QS A K C+ Sbjct: 1626 GYYKRRKNQLIRTPLESHINQTVSLADGSFTSEGEKCAKDIFRRSDMSQSYKAVKKICKP 1685 Query: 443 SKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLM-----LGIKPSLSNIS 279 +FS VW L QSS+ + L KV P LF KR YWR + + SLS IS Sbjct: 1686 IRFSLVWTLNSMQSSKSDDHFLYRGKVLPSLFPWKRTLYWRRFVQDPVSISNNSSLSAIS 1745 Query: 278 QKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 141 +KLL+ RKR +YTRS+HG+SL+ KVLSVGGSSLKWSKSIE S+ Sbjct: 1746 RKLLLLRKRDTVYTRSNHGFSLRKYKVLSVGGSSLKWSKSIENRSK 1791 
>ref|XP_007019228.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 3 [Theobroma cacao] gi|508724556|gb|EOY16453.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 3 [Theobroma cacao] Length = 1935 Score = 160 bits (405), Expect = 5e-37 Identities = 114/274 (41%), Positives = 150/274 (54%), Gaps = 17/274 (6%) Frame = -1 Query: 773 QTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTR--SQL 612 Q SV N + + + N + K++ YVK +SNQLVA G S+L D + S Sbjct: 1511 QNGSVNNLEECTEQSESNLVPSNAKRLTYVKPKSNQLVATSECGRTSILNADKNQNFSAP 1570 Query: 611 SDGYYKSRGNQLLRASSENHVAKG---NANATVS-GLVPQSVVP-KTSTRRQSG--FAKS 453 SDGYYK NQL+R + E+H+ + + N T S G V V+P +T +RQS K+ Sbjct: 1571 SDGYYKKSKNQLIRTALESHIKQAVTMSDNKTNSVGQVAAKVMPSRTVGKRQSNKVVGKT 1630 Query: 452 CRYSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLG----IKPSLSN 285 + SKFS VW L ++ S+ NSL KV P LF KR YWRS L SLS Sbjct: 1631 HKPSKFSLVWTLHSARLSKNDGNSLRRPKVLPQLFPWKRMTYWRSFKLNSVSSCNSSLST 1690 Query: 284 ISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXX 105 IS+K+L+SRKR +YTRS +G+S++ SKV SVGGSSLKWSKSIE++SR Sbjct: 1691 ISRKMLLSRKRNTVYTRSINGFSIRKSKVFSVGGSSLKWSKSIERNSRKANEEATLAVAE 1750 Query: 104 XXXXXXXXKGCVSIASKSRNHVSRKWVLSVKLRP 3 KG VS K R++ K V +LRP Sbjct: 1751 AERKKREQKGTVSRTGK-RSYSCHKVVHGTELRP 1783 >ref|XP_007019227.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 2 [Theobroma cacao] gi|508724555|gb|EOY16452.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 2 [Theobroma cacao] Length = 1962 Score = 160 bits (405), Expect = 5e-37 Identities = 114/274 (41%), Positives = 150/274 (54%), Gaps = 17/274 (6%) Frame = -1 Query: 773 QTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTR--SQL 612 Q SV N + + + N + K++ YVK +SNQLVA G S+L D + S Sbjct: 1511 QNGSVNNLEECTEQSESNLVPSNAKRLTYVKPKSNQLVATSECGRTSILNADKNQNFSAP 1570 Query: 611 SDGYYKSRGNQLLRASSENHVAKG---NANATVS-GLVPQSVVP-KTSTRRQSG--FAKS 453 SDGYYK NQL+R + E+H+ + + N T S G V V+P +T +RQS K+ Sbjct: 1571 
SDGYYKKSKNQLIRTALESHIKQAVTMSDNKTNSVGQVAAKVMPSRTVGKRQSNKVVGKT 1630 Query: 452 CRYSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLG----IKPSLSN 285 + SKFS VW L ++ S+ NSL KV P LF KR YWRS L SLS Sbjct: 1631 HKPSKFSLVWTLHSARLSKNDGNSLRRPKVLPQLFPWKRMTYWRSFKLNSVSSCNSSLST 1690 Query: 284 ISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXX 105 IS+K+L+SRKR +YTRS +G+S++ SKV SVGGSSLKWSKSIE++SR Sbjct: 1691 ISRKMLLSRKRNTVYTRSINGFSIRKSKVFSVGGSSLKWSKSIERNSRKANEEATLAVAE 1750 Query: 104 XXXXXXXXKGCVSIASKSRNHVSRKWVLSVKLRP 3 KG VS K R++ K V +LRP Sbjct: 1751 AERKKREQKGTVSRTGK-RSYSCHKVVHGTELRP 1783 >ref|XP_007019226.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 1 [Theobroma cacao] gi|508724554|gb|EOY16451.1| Zinc finger C-x8-C-x5-C-x3-H type family protein, putative isoform 1 [Theobroma cacao] Length = 2110 Score = 160 bits (405), Expect = 5e-37 Identities = 114/274 (41%), Positives = 150/274 (54%), Gaps = 17/274 (6%) Frame = -1 Query: 773 QTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTR--SQL 612 Q SV N + + + N + K++ YVK +SNQLVA G S+L D + S Sbjct: 1511 QNGSVNNLEECTEQSESNLVPSNAKRLTYVKPKSNQLVATSECGRTSILNADKNQNFSAP 1570 Query: 611 SDGYYKSRGNQLLRASSENHVAKG---NANATVS-GLVPQSVVP-KTSTRRQSG--FAKS 453 SDGYYK NQL+R + E+H+ + + N T S G V V+P +T +RQS K+ Sbjct: 1571 SDGYYKKSKNQLIRTALESHIKQAVTMSDNKTNSVGQVAAKVMPSRTVGKRQSNKVVGKT 1630 Query: 452 CRYSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLG----IKPSLSN 285 + SKFS VW L ++ S+ NSL KV P LF KR YWRS L SLS Sbjct: 1631 HKPSKFSLVWTLHSARLSKNDGNSLRRPKVLPQLFPWKRMTYWRSFKLNSVSSCNSSLST 1690 Query: 284 ISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXXXXXX 105 IS+K+L+SRKR +YTRS +G+S++ SKV SVGGSSLKWSKSIE++SR Sbjct: 1691 ISRKMLLSRKRNTVYTRSINGFSIRKSKVFSVGGSSLKWSKSIERNSRKANEEATLAVAE 1750 Query: 104 XXXXXXXXKGCVSIASKSRNHVSRKWVLSVKLRP 3 KG VS K R++ K V +LRP Sbjct: 1751 AERKKREQKGTVSRTGK-RSYSCHKVVHGTELRP 1783 >ref|XP_006472862.1| PREDICTED: uncharacterized protein At1g21580-like [Citrus sinensis] Length = 2164 Score = 157 bits (396), 
Expect = 6e-36 Identities = 96/226 (42%), Positives = 126/226 (55%), Gaps = 15/226 (6%) Frame = -1 Query: 773 QTDSVINSDSQSTARDG----NSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSD 606 QT SV +SQ DG ++ K+I Y+KR+SNQL+AA N +S+ D T+S SD Sbjct: 1565 QTGSVNGLESQGELNDGTLCTSNVKRITYLKRKSNQLIAASNGCSLSVQNPDKTQSTASD 1624 Query: 605 GYYKSRGNQLLRASSENHV------AKGNANATVSGLVPQSVVPKTSTRRQSGFAKSCRY 444 GYYK R NQL+R E+ + A G+ + ++ K C+ Sbjct: 1625 GYYKRRKNQLIRTPLESQINQTVSLADGSFTSEGEKCAKDIFTRSDMSQSYKAVKKICKP 1684 Query: 443 SKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLM-----LGIKPSLSNIS 279 +FS VW L QSS+ + L KV P LF KR YWR + + SLS IS Sbjct: 1685 IRFSLVWTLNSMQSSKSDDHFLYRGKVLPSLFPWKRTLYWRRFVQDPVSISNNSSLSAIS 1744 Query: 278 QKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 141 +KLL+ RKR +YTRS+HG+SL+ KVLSVGGSSLKWSKSIE S+ Sbjct: 1745 RKLLLLRKRDTVYTRSNHGFSLRKYKVLSVGGSSLKWSKSIENRSK 1790 >ref|XP_007161425.1| hypothetical protein PHAVU_001G067600g [Phaseolus vulgaris] gi|561034889|gb|ESW33419.1| hypothetical protein PHAVU_001G067600g [Phaseolus vulgaris] Length = 1984 Score = 154 bits (389), Expect = 4e-35 Identities = 98/226 (43%), Positives = 132/226 (58%), Gaps = 11/226 (4%) Frame = -1 Query: 785 VSECQTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRS 618 + E Q + N +SQ A +GN + K+I+Y+K ++NQLVA NS D+S+ DN ++ Sbjct: 1383 IPENQPVPLDNGESQVEANNGNPLSLNTKRIVYIKPKTNQLVATSNSCDVSVPADDNGQT 1442 Query: 617 QLSDGYYKSRGNQLLRASSENH------VAKGNANATVSGLVPQSVVPKTSTRRQSGFAK 456 SD YYK R NQL+R + E+H V G AN+ G + S +R + + Sbjct: 1443 AFSDAYYKRRKNQLVRTTFESHNNQTAIVPNGKANSDGQGTSNALCNRRFSKKRLNKVGR 1502 Query: 455 S-CRYSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLGIKPSLSNIS 279 S C+ S+ S VW L SSE +NS +KV P LF KRA + S S+S IS Sbjct: 1503 SSCKRSRASLVWTLCSKSSSENDRNSRHYQKVLPQLFPWKRATFASSFN---SSSVSAIS 1559 Query: 278 QKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 141 +KLL RKR +YTRS HG+SL S+VL VGG SLKWSKSIEK+S+ Sbjct: 1560 KKLLQLRKRDTVYTRSKHGFSLWKSRVLGVGGCSLKWSKSIEKNSK 1605 >ref|XP_007161424.1| hypothetical protein PHAVU_001G067600g 
[Phaseolus vulgaris] gi|561034888|gb|ESW33418.1| hypothetical protein PHAVU_001G067600g [Phaseolus vulgaris] Length = 1979 Score = 154 bits (389), Expect = 4e-35 Identities = 98/226 (43%), Positives = 132/226 (58%), Gaps = 11/226 (4%) Frame = -1 Query: 785 VSECQTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRS 618 + E Q + N +SQ A +GN + K+I+Y+K ++NQLVA NS D+S+ DN ++ Sbjct: 1383 IPENQPVPLDNGESQVEANNGNPLSLNTKRIVYIKPKTNQLVATSNSCDVSVPADDNGQT 1442 Query: 617 QLSDGYYKSRGNQLLRASSENH------VAKGNANATVSGLVPQSVVPKTSTRRQSGFAK 456 SD YYK R NQL+R + E+H V G AN+ G + S +R + + Sbjct: 1443 AFSDAYYKRRKNQLVRTTFESHNNQTAIVPNGKANSDGQGTSNALCNRRFSKKRLNKVGR 1502 Query: 455 S-CRYSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLGIKPSLSNIS 279 S C+ S+ S VW L SSE +NS +KV P LF KRA + S S+S IS Sbjct: 1503 SSCKRSRASLVWTLCSKSSSENDRNSRHYQKVLPQLFPWKRATFASSFN---SSSVSAIS 1559 Query: 278 QKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 141 +KLL RKR +YTRS HG+SL S+VL VGG SLKWSKSIEK+S+ Sbjct: 1560 KKLLQLRKRDTVYTRSKHGFSLWKSRVLGVGGCSLKWSKSIEKNSK 1605 >ref|XP_004498428.1| PREDICTED: uncharacterized protein At1g21580-like [Cicer arietinum] Length = 2014 Score = 154 bits (388), Expect = 5e-35 Identities = 110/280 (39%), Positives = 146/280 (52%), Gaps = 25/280 (8%) Frame = -1 Query: 779 ECQTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQL 612 E QT N +SQ+ DGN + KKI+Y+K ++NQLVA +S D+ D ++ Sbjct: 1416 ENQTGPSSNGESQAEGNDGNVSSLNSKKIVYIKPKTNQLVATSSSCDIIASIDDKGQTAC 1475 Query: 611 SDGYYKSRGNQLLRASSENHVAKGNANATVSGLVPQSVVP-------------KTSTRRQ 471 SD YYK R NQL+R + ENHV N TV+ +P ++V K + RR Sbjct: 1476 SDSYYKRRKNQLVRTTFENHV-----NQTVA--MPNNIVNHDGQGARKVLCNRKFTKRRS 1528 Query: 470 SGFAK-SCRYSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLGIKPS 294 + A SC+ S+ S VW L+ SS +++ +KV PHLF KR Y RS + S Sbjct: 1529 NKVAGVSCKSSRASLVWTLRSKNSSGNDRDAWHHQKVLPHLFPWKRTTYSRSFIHNSASS 1588 Query: 293 -----LSNISQKLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR--XX 135 LS + +KLL+ RKR +YTRS+ G+SL SKVL VGGSSLKWSKSIEK S+ Sbjct: 1589 
FNSGSLSAVGKKLLMLRKRDTVYTRSTRGFSLWKSKVLGVGGSSLKWSKSIEKHSKKANE 1648 Query: 134 XXXXXXXXXXXXXXXXXXKGCVSIASKSRNHVSRKWVLSV 15 CVS +KSR H S K + V Sbjct: 1649 EATLAVAAVEKKKREQKDPACVSRQTKSRKHFSMKRIFRV 1688 >ref|XP_006593806.1| PREDICTED: uncharacterized protein LOC100788859 [Glycine max] Length = 2025 Score = 153 bits (387), Expect = 6e-35 Identities = 98/216 (45%), Positives = 128/216 (59%), Gaps = 11/216 (5%) Frame = -1 Query: 755 NSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGYYKSR 588 N DSQ A DGN + K+I+Y+K ++NQLVA NS D+S+ DN ++ SDGYYK R Sbjct: 1429 NGDSQGEAIDGNVFPLNTKRIVYIKPKTNQLVATSNSCDVSVSTDDNLQTAFSDGYYKRR 1488 Query: 587 GNQLLRASSENH----VAKGNANATVSGLVPQSVV--PKTSTRRQSGFAKS-CRYSKFSF 429 NQL+R + E+H VA N A G + + + S RR +S C+ S+ S Sbjct: 1489 KNQLIRTTFESHINQTVAMSNNTAYSGGQGTSNALCNRRFSKRRTHKVGRSSCKRSRASL 1548 Query: 428 VWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLGIKPSLSNISQKLLVSRKRG 249 VW L SSE ++S ++ P LF KR + SL SLS IS+KLL RKR Sbjct: 1549 VWTLCSKNSSENDRDSQHYQRALPQLFPWKRPTFASSLN---NSSLSAISKKLLQLRKRD 1605 Query: 248 AIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 141 +YTRS HG+SL+ S+VL VGG SLKWSKSIEK S+ Sbjct: 1606 TVYTRSIHGFSLQKSRVLGVGGCSLKWSKSIEKKSK 1641 >ref|XP_006596227.1| PREDICTED: uncharacterized protein At1g21580-like [Glycine max] Length = 1672 Score = 151 bits (382), Expect = 2e-34 Identities = 97/224 (43%), Positives = 132/224 (58%), Gaps = 11/224 (4%) Frame = -1 Query: 779 ECQTDSVINSDSQSTARDGN----SEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQL 612 E Q+ N +SQ A DGN + K+I+Y+K ++NQLVA NS D+S+ DN ++ Sbjct: 1392 ENQSGPSDNGESQGEANDGNVFPLNTKRIVYIKPKTNQLVATSNSYDVSVSTDDNLQTAF 1451 Query: 611 SDGYYKSRGNQLLRASSENHVAK------GNANATVSGLVPQSVVPKTSTRRQSGFAKSC 450 SDGYYK R NQL+R + E+H+ + AN+ G + S +R +S Sbjct: 1452 SDGYYKRRKNQLVRTTIESHINQTVAMPNNTANSDGQGTSNALCNRRFSKKRTHKVGRSS 1511 Query: 449 -RYSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLGIKPSLSNISQK 273 + S+ S VW L SSE ++S ++ P LF KRAA+ SL SLS IS+K Sbjct: 1512 FKRSRASLVWTLCSKNSSENDRDSRHYQRALPLLFPWKRAAFASSLN---NSSLSAISKK 1568 Query: 272 
LLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 141 LL RKR +YTRS HG+SL+ S+VL VGG SLKWSKSIEK+S+ Sbjct: 1569 LLQLRKRDTVYTRSIHGFSLRKSRVLGVGGCSLKWSKSIEKNSK 1612 >ref|XP_004292729.1| PREDICTED: uncharacterized protein LOC101310670 [Fragaria vesca subsp. vesca] Length = 1908 Score = 149 bits (377), Expect = 9e-34 Identities = 100/225 (44%), Positives = 135/225 (60%), Gaps = 12/225 (5%) Frame = -1 Query: 779 ECQTDSVINSDSQSTARDGNSEKKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLSDGY 600 E + V NS S A ++ KK+IYVKR+ NQLVA+ N D+S+ DN +Q SDGY Sbjct: 1323 EHHSGPVTNSHDGSLA--SSNVKKVIYVKRKLNQLVASSNPSDLSVHNADN--NQPSDGY 1378 Query: 599 YKSRGNQLLRASSENH------VAKGNANATVSGLVPQSVVP-KTSTRRQSGFAKSCRYS 441 YK R +QL+R+S E++ + N N+ V + V+P +T +++S A + Sbjct: 1379 YKRRKHQLIRSSLESNGKDTVLLPTDNLNSRVQKAL--KVIPSRTFNKKRSLKAVARTGK 1436 Query: 440 KFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLMLGIKP-----SLSNISQ 276 K S VW +QSS +S +KV PHLF KRA WR++M S S IS+ Sbjct: 1437 KNSLVWTPSGTQSSNNNGSSFDHQKVLPHLFPWKRARSWRTVMQTQASNFNYSSSSTISK 1496 Query: 275 KLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSR 141 KLL+SR R +YTRS+HG+SL+ KVLSVGGSSLKWSKSIE S+ Sbjct: 1497 KLLLSRMRDTVYTRSTHGFSLRKYKVLSVGGSSLKWSKSIESRSK 1541 >ref|XP_002302217.2| zinc finger family protein [Populus trichocarpa] gi|550344506|gb|EEE81490.2| zinc finger family protein [Populus trichocarpa] Length = 2120 Score = 142 bits (359), Expect = 1e-31 Identities = 108/276 (39%), Positives = 139/276 (50%), Gaps = 23/276 (8%) Frame = -1 Query: 773 QTDSVINSDSQSTARDGNSE-----KKIIYVKRRSNQLVAACNSGDMSMLGVDNTRSQLS 609 Q + N + S DGN+ K + YVKR+SNQLVA+ N S+ NT S Sbjct: 1529 QNSQISNLECHSDTNDGNTVALANGKSLTYVKRKSNQLVASSNPCASSVQNAHNTSS--- 1585 Query: 608 DGYYKSRGNQLLRASSENHVAKGNANATVSGLVPQSVVPKTS-------TRRQSGFAKSC 450 D YYK R NQL+R S E+ + K A+ L + S R++ K+C Sbjct: 1586 DSYYKRRKNQLIRTSLESQI-KQTASIPDESLNSEGQTALNSFSRNFSKRRQRKVVTKTC 1644 Query: 449 RYSKFSFVWKLQDSQSSEKYKNSLGPRKVWPHLFSSKRAAYWRSLM-----LGIKPSLSN 285 + SK S VW L +Q S+ +S KV PHLF KRA Y RS + + SLS Sbjct: 1645 
KPSKLSLVWTLHGAQLSKNDGDSSHCGKVLPHLFPWKRATYRRSSLPNSSSISDHSSLST 1704 Query: 284 ISQ----KLLVSRKRGAIYTRSSHGYSLKMSKVLSVGGSSLKWSKSIEKSSRXXXXXXXX 117 I KLL+ RKR YTRS HG+SL+ SKVLSVGGSSLKWSKSIEK S+ Sbjct: 1705 IGYNNWWKLLLLRKRNTEYTRSKHGFSLRKSKVLSVGGSSLKWSKSIEKHSKKANEEATL 1764 Query: 116 XXXXXXXXXXXXKGCVSIA--SKSRNHVSRKWVLSV 15 +G +A +KSRN +SR+ + V Sbjct: 1765 AVAAAERKKREQRGAAHVACPTKSRN-ISRERIFRV 1799