BLASTX nr result
ID: Mentha29_contig00000898
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha29_contig00000898 (2612 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU32993.1| hypothetical protein MIMGU_mgv1a004259mg [Mimulus...  282   6e-73
ref|XP_006352121.1| PREDICTED: uncharacterized protein LOC102591...  270   2e-69
ref|XP_004247650.1| PREDICTED: uncharacterized protein LOC101245...  270   2e-69
emb|CBI26371.3| unnamed protein product [Vitis vinifera]             259   6e-66
ref|XP_006403213.1| hypothetical protein EUTSA_v10003150mg [Eutr...  250   2e-63
ref|NP_199176.1| zinc knuckle (CCHC-type) family protein [Arabid...  250   2e-63
gb|EYU32992.1| hypothetical protein MIMGU_mgv1a004259mg [Mimulus...  234   2e-58
ref|XP_002865410.1| zinc knuckle (CCHC-type) family protein [Ara...  228   8e-57
gb|EXB29868.1| RuBisCO large subunit-binding protein subunit alp...  189   4e-45
gb|EYU32519.1| hypothetical protein MIMGU_mgv1a004371mg [Mimulus...  189   6e-45
ref|XP_007034986.1| Zinc knuckle family protein, putative isofor...  188   9e-45
ref|XP_007034984.1| Zinc knuckle family protein, putative isofor...  188   9e-45
ref|XP_006280040.1| hypothetical protein CARUB_v10025917mg [Caps...  181   2e-42
ref|XP_004134425.1| PREDICTED: uncharacterized protein LOC101216...  176   6e-41
ref|XP_004511402.1| PREDICTED: uncharacterized protein LOC101494...  172   9e-40
ref|XP_004170660.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...  171   2e-39
ref|XP_002312573.2| hypothetical protein POPTR_0008s16240g [Popu...  168   1e-38
ref|XP_006420121.1| hypothetical protein CICLE_v10004215mg [Citr...  159   5e-36
ref|XP_006489529.1| PREDICTED: dentin sialophosphoprotein-like i...
158 1e-35 ref|XP_006489528.1| PREDICTED: dentin sialophosphoprotein-like i... 158 1e-35 >gb|EYU32993.1| hypothetical protein MIMGU_mgv1a004259mg [Mimulus guttatus] Length = 482 Score = 282 bits (721), Expect = 6e-73 Identities = 216/590 (36%), Positives = 292/590 (49%), Gaps = 33/590 (5%) Frame = -1 Query: 1802 LPIEGSCES-----SKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSRLSTQG 1638 + IEG E+ S+ICL QE K KALS D +DD+H S S NS +T Sbjct: 1 MEIEGCAENDLAQNSRICLSQENGKEKALS---------DDKDDSHVSMESCNS--ATLF 49 Query: 1637 KKGVKRQIFDED-MVTGSKRTKTQIHGSN------TKPDSSFMKWISNMTRGGLSGLNLE 1479 KGVK++ ++ + SK+ K+QI G+N KPDSSFM WISNM +G LS N + Sbjct: 50 SKGVKKRFLEQGHQLVESKKMKSQIEGNNYGSTSVVKPDSSFMNWISNMVKG-LSDSNNK 108 Query: 1478 DSLSPLPLACSNGVLSRKYDENFMCFRPQNS-KNLSTGFQTVFQSLYCRETDASKKXXXX 1302 S L L RP+N K+ + GFQ+VF+S+Y + A + Sbjct: 109 KDPSALALVS----------------RPENDCKSPNAGFQSVFRSMYTSDKKAYEG---- 148 Query: 1301 XXXXXXXXXXXXEGSPENLRESDERIFGNSSEQTIPSSKEDDRSRDPSES---------- 1152 E D NS +Q + S+K+ ++ + Sbjct: 149 --------------------EKD-----NSCKQIVLSNKDVNQRTSGGSNVHPINPWIFS 183 Query: 1151 -ENQVIRAGPLDEETKVGTKEAYPDIPLATSSVLERSDRRVSLWISRLSTKTLRSERGKE 975 +++V +G +E K T+ P+IPL K + Sbjct: 184 LKDEVSPSGSFSKEAKTTTENTSPNIPLPEK-------------------KPFFPKAANN 224 Query: 974 ILVEADLDSNEESADLKSVNELCTVLPSRRFSSEAMASDFARRLDALKHITSSKKGIYST 795 + E ++N +S DLK P R+ S+ A+ Sbjct: 225 LDCEKISEANGDSFDLK---------PPRKSSTCALI----------------------- 252 Query: 794 CLFCGGC-HDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIRCSKPDHWAVSCP--- 627 C +CG H +R C +T +E++ L +K +FD +VEES CFCIRC + DHWA+SCP Sbjct: 253 CFYCGRSDHYLRKCPELTETEIKGLQVKIGSFD-KVEESCCFCIRCFRFDHWAISCPSVA 311 Query: 626 VGPTSRFGA---RRLMLCGGEDQSLRLRDTSDVNNNSA--KSEIFDAIRSLRLTRADILR 462 V P R A ++ + L+ N+ A + EIF AI+ LR++R+DILR Sbjct: 312 VPPRRRHVACTSKKSSFASDSENYLKFPRGIFANSQDAVAEGEIFRAIKKLRMSRSDILR 371 Query: 461 WMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYVACITGDVKGNAGCTSKNSILVDVGGI 282 MNSN+ S+HLNGFFLRLRLG +EAGLG + YYVA ITG SK SILVDVGGI Sbjct: 372 
LMNSNISSTHLNGFFLRLRLGKLEAGLGWTGYYVARITGYTTEIIDYKSKKSILVDVGGI 431 Query: 281 LSSVACQYISNHDFLEDEIKAWWSRTEGSGCRLPSLAELNSKFNDRQSLG 132 SSV QY+SNHDFLEDEIKAWWSR +G ++P L ELNSKF DR+SLG Sbjct: 432 KSSVGSQYVSNHDFLEDEIKAWWSRLSKTGDKIPLLDELNSKFEDRESLG 481 >ref|XP_006352121.1| PREDICTED: uncharacterized protein LOC102591467 isoform X1 [Solanum tuberosum] gi|565371045|ref|XP_006352122.1| PREDICTED: uncharacterized protein LOC102591467 isoform X2 [Solanum tuberosum] Length = 979 Score = 270 bits (691), Expect = 2e-69 Identities = 232/714 (32%), Positives = 310/714 (43%), Gaps = 142/714 (19%) Frame = -1 Query: 1847 DECSLVEPELSIKNTLPIEGSCESSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESAS 1668 + C E +L +++P E S+ Y+ K K KALS + + S+ E+D+HES Sbjct: 278 ETCDQNEEQLLRGSSVPPETPPTHSRSSSYRRKGKAKALSDGNSNTKMSNDEEDSHESVE 337 Query: 1667 SSNSRLSTQGKKGVKRQIFDEDMVTGSKRTKTQIHG-----SNTKPDSSFMKWISNMTRG 1503 S NS + KG KR F++ GSKR +T IH S +SSF+ WISNM +G Sbjct: 338 SCNS--TGLNPKGKKRWHFEQQFFVGSKRIRTDIHRDPATESTVAHNSSFVTWISNMVKG 395 Query: 1502 GLSGLNLEDSLSPLPLACSNGVLSR----KYDENFMCFRPQNSKNLSTGFQTVFQSLYCR 1335 LS LE S + N S + E M + +S + S GF++VFQSLYC Sbjct: 396 -LSKSKLEGSPTLALTFTPNNEESHGKETNHQEIVMYDKDHDSGSRSMGFRSVFQSLYCP 454 Query: 1334 ETDASKKXXXXXXXXXXXXXXXXEGSPENLRESDERIFGNSSEQTIPSSKEDDR----SR 1167 S+ G P+ L +D+ + P D S Sbjct: 455 TLKVSETEIPKEDHSV--------GEPKKLSSADKILIDVPPISCHPGGDMLDAHMLMSN 506 Query: 1166 DPSESENQVIRAGPLDEE---------------TKVGTKEAYPDIPLATSSVLE------ 1050 D S + PL E T K + + +S+ E Sbjct: 507 DNSNQSTVACKEVPLMETQITPAVVAPREVSRTTSAENKASNGSMSRLRTSICEEKNTSH 566 Query: 1049 --------RSDRRVSLWISRLSTKT--------------------LRSERGKEILVE-AD 957 R+ SLWI+R S KT R E+ + E +D Sbjct: 567 SSEYDMSSRNQSLRSLWITRFSNKTPGTVVNIDNSKPTTHETSVVCRIEQANSDVKETSD 626 Query: 956 LDSNEESADL----------KSVNELCTVLPSRRFS-SEAMASDFARRLDALKHITS-SK 813 D ++ A +S+N L ++ S +F SEA+AS F+RRLDALK I S Sbjct: 627 KDQYDDVAASSKEIRDNNYERSMNNLQPIVSSAKFKKSEALASLFSRRLDALKFIGPFST 686 Query: 812 KGIYS----TCLFCGGC-HDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIRCSKPD 648 + YS 
TC FCG HD+R CS V SELE L+ A++ EES C CIRC + D Sbjct: 687 RNEYSYTRTTCFFCGKSGHDLRNCSEVIESELEVLIRSIRAYEG-AEESSCLCIRCFQLD 745 Query: 647 HWAVSCPVGPTSRFGARRLM---------------------------------------- 588 HWA+SCP ++R R++ Sbjct: 746 HWAISCPTSASNRSDNLRVLSGNECLPSQLEIKQGHPIELANRVHHSRDRSSSDLMHNRK 805 Query: 587 -----LCGGEDQSLRLRDTSDVNNNSAKSEI-----------------FDAIRSLRLTRA 474 + G +Q L+ R TSD NS K I FD IR LRL+R Sbjct: 806 QFLFAITSGSNQVLKQR-TSDSTENSLKENIISSNFVTKETADVPRGIFDVIRGLRLSRI 864 Query: 473 DILRWMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYVACITGDVKGNAGCTSKNSILVD 294 DIL+WMNS+ SHL+GFFLRLRLG EAGLGG+ YYVACI G N S N I V+ Sbjct: 865 DILKWMNSHTSLSHLDGFFLRLRLGRSEAGLGGTGYYVACINGLKGENLERDSNNCIYVN 924 Query: 293 VGGILSSVACQYISNHDFLEDEIKAWWSRTEGSGCRLPSLAELNSKFNDRQSLG 132 V G+ V QYISN DFLEDE+ WW + SG ++P +L K ++R LG Sbjct: 925 VCGVKCPVGSQYISNQDFLEDELSTWWHKMLESGGKVPEEGDLRLKLDERMKLG 978 Score = 70.5 bits (171), Expect = 4e-09 Identities = 50/147 (34%), Positives = 69/147 (46%), Gaps = 29/147 (19%) Frame = -1 Query: 2351 MMNIEDKDVDSEHE----PNFRIRAKLLSTSSGAGVNAGSKVNRAFATCDPLSELVWSPK 2184 M N D D+D + AKL + GAGVNA S FA DPLSELVWSP+ Sbjct: 1 MTNFNDDDIDLGLALGCTTTRNVHAKL-KDAVGAGVNASSTGGMTFAASDPLSELVWSPR 59 Query: 2183 NGVELKCANFRADDNRKPFLLWNVG-----------------------LKPVVDQGNLTV 2073 G+ LKCA D +KPF LWNVG + ++DQ L + Sbjct: 60 KGLSLKCAESGLAD-KKPFRLWNVGPTTLITAPSQSDRFKGTYDENAAYEKIIDQERLEI 118 Query: 2072 SQMVLDANDII--IGKATVLKDSGGLE 1998 ++MVL + + I K ++ + G++ Sbjct: 119 NKMVLKSGNEIGCSSKVKIMNTADGVD 145 >ref|XP_004247650.1| PREDICTED: uncharacterized protein LOC101245795 [Solanum lycopersicum] Length = 981 Score = 270 bits (690), Expect = 2e-69 Identities = 230/719 (31%), Positives = 313/719 (43%), Gaps = 147/719 (20%) Frame = -1 Query: 1847 DECSLVEPELSIKNTLPIEGSCESSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESAS 1668 + C E +L +++P E S+ Y+ K K KALS + + S+ E+D+HES Sbjct: 278 ETCDQNEEQLLRGSSVPPETPPTHSRSSSYRRKGKAKALSDGNSNNKMSNDEEDSHESVE 337 Query: 1667 SSNSRLSTQGKKGVKRQIFDEDMVTGSKRTKTQIHG-----SNTKPDSSFMKWISNMTRG 1503 S NS + KG KR 
F++ GSKR +T +H S +SSF+ WISNM +G Sbjct: 338 SCNS--TGLNPKGKKRWHFEKQFFVGSKRIRTDVHRDPSTESTVAHNSSFVTWISNMVKG 395 Query: 1502 GLSGLNLEDS----LSPLPLACSNGVLSRKYDENFMCFRPQNSKNLSTGFQTVFQSLYCR 1335 L NLEDS L+ P N V + E + +S + S GFQ++FQSLYC Sbjct: 396 -LPKSNLEDSPTLALTFTPNNEENHVKETNHQEIVAYEKDHDSASRSMGFQSLFQSLYCP 454 Query: 1334 ETDASKKXXXXXXXXXXXXXXXXEGSPENLRESDERIFGNSSEQTIPSSKEDDR------ 1173 S+ G P+ + +D+ + I +E D Sbjct: 455 TLKVSETEIPKEDHSV--------GEPKKIPSADKILI---DFPLISCHREGDMLDTHML 503 Query: 1172 -SRDPSESENQVIRAGPL---------------DEETKVGTKEAYPDIPLATSSVLE--- 1050 S D S + PL T V K + + +S+ E Sbjct: 504 MSNDKSNQSTVACKEVPLMQTHIMPAVVAPREVSRNTSVENKASNDSLSRLRTSICEEKN 563 Query: 1049 -----------RSDRRVSLWISRLSTKT------------LRSERGKEILVE-------- 963 R+ SLWI+R S KT E E +E Sbjct: 564 TSHSSEYDMSSRNQSLRSLWITRFSNKTPGTVVNIDDSKPTTHETSVECRIEQASSDVKG 623 Query: 962 -ADLDSNEESADL----------KSVNELCTVLPSRRFS-SEAMASDFARRLDALKHITS 819 +D D +++ A +S+N L ++ S +F SEA++S F+RRLDALK I Sbjct: 624 TSDKDQHDDVAASSKEIRDNNFERSMNNLHPIVSSPKFKKSEALSSLFSRRLDALKLIGP 683 Query: 818 -SKKGIYS------TCLFCGGC-HDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIR 663 S + YS TC FCG HD+R CS VT SELE L+ A++ E S C CIR Sbjct: 684 FSTRNEYSSSYTRTTCFFCGKSGHDLRNCSEVTESELEVLIRSIRAYEG-AEGSSCLCIR 742 Query: 662 CSKPDHWAVSCPVGPTSRFGARRLM----------------------------------- 588 C + DHWA+SCP ++R R++ Sbjct: 743 CFQLDHWAISCPTSASNRGNNLRVVSVNECLPSQLEIKQSHPIELANRVHHSRDKSSSDL 802 Query: 587 ----------LCGGEDQSLRLRDTSDVNNNSAKSEI-----------------FDAIRSL 489 + G +Q + R TS+ NS K I FD IR L Sbjct: 803 MHKRKQFLFAITSGSNQVPKQR-TSESTENSLKEHIISSNFVSKEIAVVPKGIFDVIRGL 861 Query: 488 RLTRADILRWMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYVACITGDVKGNAGCTSKN 309 RL+R DIL+WMNS+ SHL+GFFLRLRLG EAGLGG+ YYVACI G S N Sbjct: 862 RLSRIDILKWMNSHTSLSHLDGFFLRLRLGRSEAGLGGTGYYVACINGLKGEKLERDSNN 921 Query: 308 SILVDVGGILSSVACQYISNHDFLEDEIKAWWSRTEGSGCRLPSLAELNSKFNDRQSLG 132 I VDV G+ V QYISN DFLEDE+ WW + SG ++P ++L K ++R LG Sbjct: 922 CICVDVCGVKCPVGSQYISNQDFLEDELSTWWHKMLESGGKVPEESDLRLKLDERMKLG 980 Score = 
73.6 bits (179), Expect = 4e-10 Identities = 42/84 (50%), Positives = 49/84 (58%), Gaps = 3/84 (3%) Frame = -1 Query: 2351 MMNIEDKDVDSEHEPNFRIRAKL---LSTSSGAGVNAGSKVNRAFATCDPLSELVWSPKN 2181 M NI D D+D + L + GAGVNA S V+ AFA DPLSELVWSP+ Sbjct: 1 MTNINDDDIDLGLALGCTTTRNVHTKLKDAVGAGVNASSTVDMAFAESDPLSELVWSPRK 60 Query: 2180 GVELKCANFRADDNRKPFLLWNVG 2109 G+ LKCA D +KPF LWNVG Sbjct: 61 GLSLKCAESSLAD-KKPFRLWNVG 83 >emb|CBI26371.3| unnamed protein product [Vitis vinifera] Length = 975 Score = 259 bits (661), Expect = 6e-66 Identities = 210/607 (34%), Positives = 280/607 (46%), Gaps = 50/607 (8%) Frame = -1 Query: 1802 LPIEGSCESSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSR-LSTQGKKGV 1626 LP+ S S + ++ K KGKALS D GR S+ EDD+ ES S NS L + GKK Sbjct: 446 LPVNNSPNKSGMYRHRTKGKGKALSDGDRSGRKSNKEDDSDESVESCNSAALFSTGKK-- 503 Query: 1625 KRQIFDEDMVTGSKRTKTQIHGSN-----TKPDSSFMKWISNMTRGGLSGLNLEDSLS-P 1464 R +++ ++TGSKR + QI+GS + DSSFM WISNM +G LS N +++ S Sbjct: 504 -RWGYEQQLITGSKRIRKQINGSPGSTSFVRQDSSFMSWISNMMKG-LSKSNQDETPSLA 561 Query: 1463 LPLACSNGVLSRKYDENFM-CFRPQNSKNLSTGFQTVFQSLYCRETDASKKXXXXXXXXX 1287 L LA N YD+ + C + Q+ + GFQ++FQSLYC T + Sbjct: 562 LTLARPN---HDNYDQKLVTCNKNQDPGCRNIGFQSIFQSLYCPTTKVQESRTL------ 612 Query: 1286 XXXXXXXEGSPENLRESDERIFGNSSEQTIPSSKEDDRSRDPSESENQVI--RAGPLDEE 1113 N+ QT SKE + + RAGP + Sbjct: 613 -----------------------NADNQTGEGSKEFCLANKLCDFNQSTFGNRAGPSTQP 649 Query: 1112 TKVGTKEAYPDIPLATSSVLE----RSDRRVSLWISRLSTKTL----------RSERGKE 975 + K A TSS + +SD SLW++R S KT ++ +E Sbjct: 650 KVLSAKFAVSQENYKTSSTIHNFGYKSDLLGSLWVTRFSPKTSSPTCKVDHCNQNTGTRE 709 Query: 974 ILVEADLD--------------------SNEESADLKSVNELCTVLPSRRF-SSEAMASD 858 E L N + S+ +L + PS+RF SSEAMAS Sbjct: 710 YCTEEPLTIVGAELQNCSGGTEVSFGFKKNNAHNNQNSIYKLNPISPSQRFKSSEAMASL 769 Query: 857 FARRLDALKHI-----TSSKKGIYSTCLFCGGCHDVRGCSGVTRSELEYLLLKSSAFDSR 693 FARRLDALK+I T ++ TC FCG GC + L K+ + Sbjct: 770 FARRLDALKNIITLNQTDTEARATPTCFFCGIRAQSLGCC------MSQCLEKAKSIR-- 821 Query: 692 
VEESPCFCIRCSKPDHWAVSCPVGPTSRFGARRLMLCGGEDQSLRLRDTSDVNNNSAKSE 513 M C E Q + L + + + Sbjct: 822 ----------------------------------MWCFFESQIIPLCNFVNPQISDVPKG 847 Query: 512 IFDAIRSLRLTRADILRWMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYVACITGDVKG 333 IFDAI+ LRL+R DIL+WMNS P SHLNGFFLRLRLG E GLGG+ YYVACI+G K Sbjct: 848 IFDAIKRLRLSRGDILKWMNSVFPFSHLNGFFLRLRLGKWEEGLGGTGYYVACISGAQKE 907 Query: 332 NAGCTSKNSILVDVGGILSSVACQYISNHDFLEDEIKAWWSRTEGSGCRLPSLAELNSKF 153 +SKN I V++GG+ V QYISNHDFLEDE+ AWW T +G ++PS +L K Sbjct: 908 RPSQSSKNPIAVNIGGVKCLVQSQYISNHDFLEDELMAWWGATTRAGGKIPSEEDLKVKL 967 Query: 152 NDRQSLG 132 +R+ G Sbjct: 968 EERKKFG 974 Score = 71.2 bits (173), Expect = 2e-09 Identities = 37/95 (38%), Positives = 53/95 (55%), Gaps = 6/95 (6%) Frame = -1 Query: 2288 KLLSTSSGAGVNAGSKVNRAFATCDPLSELVWSPKNGVELKCANFRADDNRKPFLLWNVG 2109 K L+ SGAG NAGS+V+ DPLSELVWSP G+ LKCA + D ++P LLW VG Sbjct: 98 KALNNDSGAGANAGSRVDMTLVATDPLSELVWSPHKGLSLKCAE-NSTDEKRPSLLWGVG 156 Query: 2108 LKPVVD------QGNLTVSQMVLDANDIIIGKATV 2022 ++ T+S + +++ +AT+ Sbjct: 157 PSNMIHSPPQGISARKTISDEPMGEGNLVTSQATL 191 >ref|XP_006403213.1| hypothetical protein EUTSA_v10003150mg [Eutrema salsugineum] gi|567185350|ref|XP_006403214.1| hypothetical protein EUTSA_v10003150mg [Eutrema salsugineum] gi|557104326|gb|ESQ44666.1| hypothetical protein EUTSA_v10003150mg [Eutrema salsugineum] gi|557104327|gb|ESQ44667.1| hypothetical protein EUTSA_v10003150mg [Eutrema salsugineum] Length = 814 Score = 250 bits (639), Expect = 2e-63 Identities = 219/640 (34%), Positives = 301/640 (47%), Gaps = 60/640 (9%) Frame = -1 Query: 1871 DLCRLTEKDECSLVEPELSIKNTLPIEGSCESSKICLYQEKEKGKALSGEDIYGRSSDVE 1692 DL K+EC L E E ++ P +K + K K KALS G D + Sbjct: 194 DLVVFESKEEC-LAEDETDVEKAGPSGSYRRRAK----ELKGKEKALSD----GNFDDAD 244 Query: 1691 DDNHESASSSNSRLSTQGKKGVKRQIFDEDMVTGSKRTKT---QIHGSNTK--PDSSFMK 1527 DD+ S + + +G KR F++ ++ GSKR KT + GS +K DSSFM Sbjct: 245 DDDESFGSVESCNSAGLLLRGKKRPGFEQQLILGSKRLKTLSQECLGSTSKLKQDSSFMN 304 Query: 1526 
WISNMTRGGLSGLNLEDSLSPLPLACS---NGVLSRKYDENFMCFRPQNSKNLSTGFQTV 1356 WISNMT+G G ED+ + L + NG ++ D+ + + +NS +TGFQ+ Sbjct: 305 WISNMTKGIWKGNEEEDNSPFVALTTTSDANGQVNAIVDQQQLSLK-ENSGCRNTGFQSF 363 Query: 1355 FQSLYC---RETDASKKXXXXXXXXXXXXXXXXEGSPENLRESDERIFGNSSEQTIPSSK 1185 F S+YC R DA + ++L S I G +E I S K Sbjct: 364 FHSIYCPKKRSQDAVEMDSTDDAKVASLQELCLITKGDHLSSSGNEI-GPVTEHNISSEK 422 Query: 1184 -------EDDRSRDPSESENQVIRAGPLDEETKVGTKEAYPDIP-LATSSVLERSDRRVS 1029 E S E + I L + G +A + T + R+ S Sbjct: 423 VGFNKTSETFSSEKKHEDKEPNISLLSLSKSKTNGELKACGEADEKVTQCLTNRNSGLES 482 Query: 1028 LWISRLSTKTLRS------ERGKEILVEADLDSNEESADLKSVNELC------TVLPS-- 891 LWISR S+K+ ER + + DS E+A + + T+LP Sbjct: 483 LWISRFSSKSSSPQKKNLHERITNEVAKVANDSATEAAKTRDSQRMLIDNNPNTILPIVS 542 Query: 890 --RRFSSEAMASDFARRLDALKHITSSKKGIYS--------TCLFCGGC-HDVRGCSGVT 744 R SSEAMAS FARRL+A+KHI S + C +CG H ++ C VT Sbjct: 543 SLRIESSEAMASLFARRLEAMKHIMPSSSLAENEEEGQANLVCFYCGKKGHRLQDCLEVT 602 Query: 743 RSELEYLLLKSSAFDSRVEESPCFCIRCSKPDHWAVSCPVGPTSRFGARRLMLCGGEDQS 564 +EL L+ S+ + R EE CIRC + HWA +CP P GA ED++ Sbjct: 603 DTELRDLVQNISSHNGR-EEGSSLCIRCFQLSHWAATCPNAPPYSSGA--------EDRA 653 Query: 563 LR--LRDTSDVNN-----NSAKSEIFDAIRSLRLTRADILRWMNSNVPSSHLNGFFLRLR 405 ++ L TS A +FDA++ LRLTR D+L+W+N+ S L GFFLRLR Sbjct: 654 MKHALASTSGTKLPLSGFTDAPKAVFDAVQVLRLTRTDVLKWINTKKSVSGLEGFFLRLR 713 Query: 404 LGSVEAGLGGSSYYVACITGDVKG-NAGCTSKN-SILVDVGGILSSVACQYISNHDFLED 231 LG E GLGG+ YYVA I G +G N+ S+N SI V VGG+ V Q+ISNHDFLE+ Sbjct: 714 LGKWEEGLGGTGYYVARIDGATEGQNSRKHSENSSISVKVGGMTCFVESQFISNHDFLEE 773 Query: 230 EIKAWW-------SRTEGSGCRLPSLAELNSKFNDRQSLG 132 E+KAWW R+ G +PS EL+ K R+ LG Sbjct: 774 ELKAWWRSAEKIARRSGDGGDGIPSAEELSRKIQQRKMLG 813 >ref|NP_199176.1| zinc knuckle (CCHC-type) family protein [Arabidopsis thaliana] gi|10178202|dbj|BAB11626.1| unnamed protein product [Arabidopsis thaliana] gi|28393193|gb|AAO42027.1| unknown protein [Arabidopsis thaliana] gi|28973589|gb|AAO64119.1| unknown protein [Arabidopsis thaliana] gi|332007606|gb|AED94989.1| zinc 
knuckle (CCHC-type) family protein [Arabidopsis thaliana] Length = 831 Score = 250 bits (639), Expect = 2e-63 Identities = 247/826 (29%), Positives = 358/826 (43%), Gaps = 112/826 (13%) Frame = -1 Query: 2273 SSGAGVNAGSKVNRAFATCDPLSELVWSPKNGVELKCANFRADDNRKPFL--LWNVGLKP 2100 SSG A ++ FA D ++ELVWSP NG+ L+CA+ K +++GL Sbjct: 22 SSGTAGAANAEARMKFAAVDAITELVWSPSNGLSLRCADISFTGKAKLLSPNFFDIGLTN 81 Query: 2099 VVDQGNLTVSQMVLDANDIIIGKATVLKDS--GGLESDRE---LEEKADKQDAILXXXXX 1935 + N T + D D+ + + + GG D + +E+K + D I Sbjct: 82 MAIHSNSTSIEDQEDHVDVELRNRDQVNQAMIGGSVEDMKPEMVEDKVETNDDIKNEEAG 141 Query: 1934 XXXXXXXXXXXXXXXXXXXE--------------------NDLCRLTEKDECSLVEPELS 1815 N + RL DE +L + Sbjct: 142 CSKRSSDSPKAMEGETRDLLVNEQLRMESAGSQEEGDKAHNRVDRLESMDENNLATLAVV 201 Query: 1814 I---KNTLPIEGSCESSKICLYQEKEKGK--ALSGEDIYGRSSDVEDDNHESASSSNSR- 1653 K EG S +EK KGK ALS E+ G D ++++ S S NS Sbjct: 202 ACEGKGDYLPEGEAGPSGSYRRREKAKGKEKALSDENFGGDGEDEDEESFGSVESCNSAG 261 Query: 1652 LSTQGKKGVKRQIFDEDMVTGSKRTKT---QIHGSNTK--PDSSFMKWISNMTRGGLSGL 1488 L ++GKK R F+E ++ GSKR KT + GS +K DSSFM WISNMT+G G Sbjct: 262 LLSRGKK---RPGFEEQLIFGSKRLKTLNQECLGSTSKLKQDSSFMNWISNMTKGIWKG- 317 Query: 1487 NLEDS-----LSPLPLACSNGVLSRKYDENFM--CFRPQNSKNLSTGFQTVFQSLYCRET 1329 N ED+ L+ A +G ++ D+ + C +NS +TGFQ+ FQS+YC + Sbjct: 318 NEEDNSPFVALTTTSNANGHGQVNAIVDQQQLSPCCVKENSGCRNTGFQSFFQSIYCPKK 377 Query: 1328 DASK----------------------KXXXXXXXXXXXXXXXXEGSPENLRESDERIFGN 1215 + + G S ++ N Sbjct: 378 QSQDVVDMDFPNDVNAAPLQELPWIPEHCDISKGDDLSSSGNEIGPVAEPNISSGKVVFN 437 Query: 1214 SSEQTIPSSKEDDRSRDPSESENQVIRAGPLDEETKVGTKEAYPDIPLATSSVLERSDRR 1035 + +T SS+ ++P+ S + ++ P +E G + + + R+ Sbjct: 438 QTSKT-QSSENKREDKEPNISLMSLSKSKPNEEPKTCGEADGK-----VSPCLTNRNSGL 491 Query: 1034 VSLWISRLSTK-TLRSERGKEILVEADLDSNEESADLKSVNELC---------------- 906 SLWISR S+K + ++ E EA+ +++ + S L Sbjct: 492 KSLWISRFSSKGSFPQKKASETAKEANASASDAAKTRDSRKMLADKNVIRPSISSVDGPD 551 Query: 905 ---TVLP----SRRFSSEAMASDFARRLDALKHITSSKKGIYST--------CLFCGGC- 774 TVLP R SSEAMAS FARRL+A+K I S + C +CG 
Sbjct: 552 KPDTVLPIVSSMRIESSEAMASLFARRLEAMKSIMPSGSLAENAEEEQRDLICFYCGKKG 611 Query: 773 HDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIRCSKPDHWAVSCPVGPTSRFGAR- 597 H +R C VT +EL L+ S + R EE+ CIRC + HWA +CP P GA Sbjct: 612 HCLRDCLEVTDTELRDLVQNISVRNGR-EEASSLCIRCFQLSHWAATCPNAPLYGSGAEG 670 Query: 596 ---RLMLCGGEDQSLRLRDTSDVNNNSAKSEIFDAIRSLRLTRADILRWMNSNVPSSHLN 426 + L L + +DV +FDA++ LRL+R D+L+W+N+ S L Sbjct: 671 RAMKNALASTSGMKLPISGFTDVPR-----AVFDAVQVLRLSRTDVLKWINTKKSVSGLE 725 Query: 425 GFFLRLRLGSVEAGLGGSSYYVACITGDVKGNAG--CTSKNSILVDVGGILSSVACQYIS 252 GFFLRLRLG E GLGG+ YYVA I GD +G + + K+ I V V G+ V Q+IS Sbjct: 726 GFFLRLRLGKWEEGLGGTGYYVARIDGDTEGQSSRRHSEKSLISVKVKGVTCLVESQFIS 785 Query: 251 NHDFLEDEIKAWW------SRTEGSGCRLPSLAELNSKFNDRQSLG 132 N DFLE+E+KAWW +RT G +PS EL+ K R+ LG Sbjct: 786 NQDFLEEELKAWWQSAGKSARTSGYD-GIPSAEELSRKIQQRKMLG 830 >gb|EYU32992.1| hypothetical protein MIMGU_mgv1a004259mg [Mimulus guttatus] Length = 537 Score = 234 bits (597), Expect = 2e-58 Identities = 193/556 (34%), Positives = 265/556 (47%), Gaps = 33/556 (5%) Frame = -1 Query: 1802 LPIEGSCES-----SKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSRLSTQG 1638 + IEG E+ S+ICL QE K KALS D +DD+H S S NS +T Sbjct: 1 MEIEGCAENDLAQNSRICLSQENGKEKALS---------DDKDDSHVSMESCNS--ATLF 49 Query: 1637 KKGVKRQIFDED-MVTGSKRTKTQIHGSN------TKPDSSFMKWISNMTRGGLSGLNLE 1479 KGVK++ ++ + SK+ K+QI G+N KPDSSFM WISNM +G LS N + Sbjct: 50 SKGVKKRFLEQGHQLVESKKMKSQIEGNNYGSTSVVKPDSSFMNWISNMVKG-LSDSNNK 108 Query: 1478 DSLSPLPLACSNGVLSRKYDENFMCFRPQNS-KNLSTGFQTVFQSLYCRETDASKKXXXX 1302 S L L RP+N K+ + GFQ+VF+S+Y + A + Sbjct: 109 KDPSALALVS----------------RPENDCKSPNAGFQSVFRSMYTSDKKAYEG---- 148 Query: 1301 XXXXXXXXXXXXEGSPENLRESDERIFGNSSEQTIPSSKEDDRSRDPSES---------- 1152 E D NS +Q + S+K+ ++ + Sbjct: 149 --------------------EKD-----NSCKQIVLSNKDVNQRTSGGSNVHPINPWIFS 183 Query: 1151 -ENQVIRAGPLDEETKVGTKEAYPDIPLATSSVLERSDRRVSLWISRLSTKTLRSERGKE 975 +++V +G +E K T+ P+IPL K + Sbjct: 184 LKDEVSPSGSFSKEAKTTTENTSPNIPLPEK-------------------KPFFPKAANN 224 Query: 974 
ILVEADLDSNEESADLKSVNELCTVLPSRRFSSEAMASDFARRLDALKHITSSKKGIYST 795 + E ++N +S DLK P R+ S+ A+ Sbjct: 225 LDCEKISEANGDSFDLK---------PPRKSSTCALI----------------------- 252 Query: 794 CLFCGGC-HDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIRCSKPDHWAVSCP--- 627 C +CG H +R C +T +E++ L +K +FD +VEES CFCIRC + DHWA+SCP Sbjct: 253 CFYCGRSDHYLRKCPELTETEIKGLQVKIGSFD-KVEESCCFCIRCFRFDHWAISCPSVA 311 Query: 626 VGPTSRFGA---RRLMLCGGEDQSLRLRDTSDVNNNSA--KSEIFDAIRSLRLTRADILR 462 V P R A ++ + L+ N+ A + EIF AI+ LR++R+DILR Sbjct: 312 VPPRRRHVACTSKKSSFASDSENYLKFPRGIFANSQDAVAEGEIFRAIKKLRMSRSDILR 371 Query: 461 WMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYVACITGDVKGNAGCTSKNSILVDVGGI 282 MNSN+ S+HLNGFFLRLRLG +EAGLG + YYVA ITG SK SILVDVGGI Sbjct: 372 LMNSNISSTHLNGFFLRLRLGKLEAGLGWTGYYVARITGYTTEIIDYKSKKSILVDVGGI 431 Query: 281 LSSVACQYISNHDFLE 234 SSV QY+SNHDFLE Sbjct: 432 KSSVGSQYVSNHDFLE 447 >ref|XP_002865410.1| zinc knuckle (CCHC-type) family protein [Arabidopsis lyrata subsp. lyrata] gi|297311245|gb|EFH41669.1| zinc knuckle (CCHC-type) family protein [Arabidopsis lyrata subsp. 
lyrata] Length = 759 Score = 228 bits (582), Expect = 8e-57 Identities = 234/800 (29%), Positives = 359/800 (44%), Gaps = 85/800 (10%) Frame = -1 Query: 2279 STSSGAGVNAGSKVNRAFATCDPLSELVWSPKNGVELKCANFRADDNRKPFL--LWNVGL 2106 S S +A ++ FA+ D ++ELVWSP NG+ L+CA+ K +++GL Sbjct: 18 SRRSSGPSSANAEARMKFASVDAITELVWSPGNGLSLRCADISFTGKAKLVSPNFFDIGL 77 Query: 2105 KPVVDQGNLTVSQMVLDANDIIIGKATVLKDSGGLESDRE--LEEKADKQDAILXXXXXX 1932 + N T + D + + G +E + +E+K + D I Sbjct: 78 TNMAIHSNSTSIEHQEDVELRSRDQVNQERIGGSVEDMKPEMVEDKVETDDDI------- 130 Query: 1931 XXXXXXXXXXXXXXXXXXENDLCRLTEK--DECSLVEPE---LSIKNTLPIEGSCESSKI 1767 +N++ +++ D ++E E L + L +E + Sbjct: 131 ------------------KNEVAGSSKRSSDSPKVMEGETRDLLVNEQLRMESA------ 166 Query: 1766 CLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSN-SRLSTQGKKGV-KRQIFDEDMVT 1593 ++G G+ G ++ D ES +N + L+ +G + + +E + Sbjct: 167 ----GSQEGPTNRGDKDEGDKANNRIDRLESMDENNLATLAVVACEGQGEYSLENEAGPS 222 Query: 1592 GSKRTKTQIHGSNTKPDSSFMKWISNMTRGGLSGLNLEDS----LSPLPLACSNGVLSRK 1425 GS R K DSSFM WISNMT+G G +DS L+ A +G ++ Sbjct: 223 GSYR--------RPKQDSSFMNWISNMTKGIWKGNEEDDSPFAALTTTSDANGHGQVNAI 274 Query: 1424 YDENFM--CFRPQNSKNLSTGFQTVFQSLYC---RETDA------SKKXXXXXXXXXXXX 1278 D+ + C +NS +TGFQ++FQS+YC R DA + Sbjct: 275 VDQQQLSPCCVKENSGCRNTGFQSLFQSIYCPKKRSQDAVEMDFPNDANATSLQELPWIP 334 Query: 1277 XXXXEGSPENLRESDERI--------------FGNSSEQTIPSSKEDDRSRDPSESENQV 1140 ++L SD I F SE +K +D+ +P+ S + Sbjct: 335 EQCGIAKGDDLSSSDNDIGPVAEPNISSGKVGFNQRSETLSSENKREDK--EPNISLMSL 392 Query: 1139 IRAGPLDEETKVGTKEAYPDIPLATSSVLERSDRRVSLWISRLSTKT-----LRSERGKE 975 ++ P +EE K+ + P + R+ SLWISR S+K+ SE KE Sbjct: 393 SKSKP-NEEPKICGEAGGKVSPCLNN----RNSGLQSLWISRFSSKSPFPQKKTSETAKE 447 Query: 974 ILVEAD------------LDSNEESADLKSVN---ELCTVLP----SRRFSSEAMASDFA 852 + A +++N + SV+ +L TVLP R SSEAMAS FA Sbjct: 448 VNASASDTAKTHDSQKMLVNNNVVIPSISSVDGLDKLNTVLPIVSSMRIESSEAMASLFA 507 Query: 851 RRLDALKHI--------TSSKKGIYSTCLFCGGC-HDVRGCSGVTRSELEYLLLKSSAFD 699 RRL+A+KHI + ++ C +CG H ++ C VT +EL L+ S+ + Sbjct: 508 
RRLEAMKHIIPAGSLAENAEEEQPNLICFYCGKKGHCLQDCLEVTDTELRDLVQNISSRN 567 Query: 698 SRVEESPCFCIRCSKPDHWAVSCPVGPTSRFGARRLMLCGGEDQSLR--LRDTSDVNN-- 531 R EE+ CIRC + HWA +CP GP L G ED++++ L TS + Sbjct: 568 GR-EEASSLCIRCFQLSHWAATCPNGP--------LYSSGAEDRAMKHTLASTSGMKLPV 618 Query: 530 ---NSAKSEIFDAIRSLRLTRADILRWMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYV 360 +F+A++ LRL+R D+L+W+N+ S L GFFLRLRLG E GLGG+ YYV Sbjct: 619 SGFTDVPKAVFEAVQVLRLSRTDVLKWINTKKSVSGLEGFFLRLRLGKWEEGLGGTGYYV 678 Query: 359 ACITGDVKGNAGCTSKNSILVDVGGILSSVACQYISNHDFLEDEIKAWW----SRTEGSG 192 A I + + + + K+SI V V G+ V Q+ISNHDFLE+E+KAWW E SG Sbjct: 679 ARI-DEGQSSRRPSEKSSISVKVKGVTCLVESQFISNHDFLEEELKAWWRSAGKSAERSG 737 Query: 191 CR-LPSLAELNSKFNDRQSL 135 C +PS EL+ K R+ L Sbjct: 738 CEGIPSAEELSRKIQQRKML 757 >gb|EXB29868.1| RuBisCO large subunit-binding protein subunit alpha [Morus notabilis] Length = 1599 Score = 189 bits (481), Expect = 4e-45 Identities = 123/333 (36%), Positives = 170/333 (51%), Gaps = 67/333 (20%) Frame = -1 Query: 932 DLKSVNELCTVLPSRRFS-SEAMASDFARRLDALKHITSSK-----KGIYSTCLFCG-GC 774 D KS+ +L VLP + + S+AMAS FA+RLDA KHITSS+ TC FCG Sbjct: 705 DTKSMYKLTPVLPFPQLNHSDAMASVFAKRLDAFKHITSSRVTSDAAHATMTCFFCGVKG 764 Query: 773 HDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIRCSKPDHWAVSCP-VGPTSRFGAR 597 H++R CS + ++ELE LL + S +EE PC CIRC + HWAV+CP P+ R Sbjct: 765 HNLRDCSEIKQTELEELLRNLNTC-SGIEELPCLCIRCFQRSHWAVACPKTSPSKRLQLE 823 Query: 596 ------RLMLCGGEDQSLRLRDTSDV---------------------------------- 537 ++ G SL+L+ D+ Sbjct: 824 SNASFSEMLPSTGNRDSLKLQSDEDMITETDFNSKVDEMMNFQKKLSSTSPVKKHIASVP 883 Query: 536 -------------------NNNSAKSEIFDAIRSLRLTRADILRWMNSNVPSSHLNGFFL 414 N+ +FDA++ LRL+R+ I++W +S + S L+GFFL Sbjct: 884 EENMSIENRIMPFQYIVSEQNSDVPKGLFDAVKRLRLSRSHIIKWKSSRMSLSQLDGFFL 943 Query: 413 RLRLGSVEAGLGGSSYYVACITGDVKGNAGCTSKNSILVDVGGILSSVACQYISNHDFLE 234 RLRLG E GLGG+ Y+VACI G ++ SILV VGGI V ++ISNHDFLE Sbjct: 944 RLRLGKWEEGLGGTGYHVACIIGAQGDGKTQDAEGSILVKVGGIKCLVGSRFISNHDFLE 1003 Query: 233 DEIKAWWSRTEGSGCRLPSLAELNSKFNDRQSL 135 DE+ AWWS T +G ++PS 
+L K+ ++L Sbjct: 1004 DELLAWWSITSRNGDKIPSEEDLGVKYVTGEAL 1036 Score = 73.6 bits (179), Expect = 4e-10 Identities = 61/174 (35%), Positives = 91/174 (52%), Gaps = 11/174 (6%) Frame = -1 Query: 1805 TLPIEGSCESSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSR-LSTQGKKG 1629 T+ E S SS++ + ++K K KALS G +DD+HES S NS L GK+ Sbjct: 318 TVSAEHSLTSSRVRVKRKKGKEKALSD----GMMPKDDDDSHESVESCNSAGLFPTGKR- 372 Query: 1628 VKRQIFDEDMVTGSKRTKTQIH---GSNT--KPDSSFMKWISNMTRGGLSGLNLEDSLSP 1464 R+ F+ED+V G+K K QIH GS + + +SSFM WISNM + + E +P Sbjct: 373 --RRSFEEDLVVGTKGFKKQIHCLDGSTSVARQNSSFMNWISNMMKRFSQSVQDE---AP 427 Query: 1463 LPLACSNGVLSRKYDENF-----MCFRPQNSKNLSTGFQTVFQSLYCRETDASK 1317 PL+ V EN + Q++ + GFQ++FQS+YC + + + Sbjct: 428 FPLSI---VRPDDRHENIDKRLTTVDKNQDAGSKIIGFQSIFQSMYCGKAEVQE 478 Score = 60.5 bits (145), Expect = 4e-06 Identities = 34/67 (50%), Positives = 40/67 (59%), Gaps = 5/67 (7%) Frame = -1 Query: 2282 LSTSSGAGVNAGSKVNRAFATCDPLSELVWSPKNGVELKCANFRADDNRKPFLLW----- 2118 L+ SGAG NAGS +N F +PLSELVWSP G+ LKCA+ D+ K L W Sbjct: 29 LNNGSGAGANAGSGLNMTFVAQNPLSELVWSPHKGLNLKCADSSLADS-KTSLFWGAGPS 87 Query: 2117 NVGLKPV 2097 NV L PV Sbjct: 88 NVALLPV 94 >gb|EYU32519.1| hypothetical protein MIMGU_mgv1a004371mg [Mimulus guttatus] Length = 531 Score = 189 bits (480), Expect = 6e-45 Identities = 92/135 (68%), Positives = 107/135 (79%) Frame = -1 Query: 536 NNNSAKSEIFDAIRSLRLTRADILRWMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYVA 357 NN + S++F AIR+LRLTRADILRWMNS V SHL+GFFLRLRLG+VEAG G SYYVA Sbjct: 396 NNTAVSSKVFHAIRNLRLTRADILRWMNSGVSLSHLSGFFLRLRLGNVEAGQEGGSYYVA 455 Query: 356 CITGDVKGNAGCTSKNSILVDVGGILSSVACQYISNHDFLEDEIKAWWSRTEGSGCRLPS 177 CITGD + + G SK S+LVDVGGI+SSV QY+SN +FLEDEI+AWW R SGC++PS Sbjct: 456 CITGDGREHKGSRSKKSVLVDVGGIISSVESQYVSNQEFLEDEIEAWWCRIMDSGCKIPS 515 Query: 176 LAELNSKFNDRQSLG 132 L ELNSK DR LG Sbjct: 516 LDELNSKLKDRHILG 530 Score = 95.1 bits (235), Expect = 1e-16 Identities = 100/301 (33%), Positives = 143/301 (47%), Gaps = 15/301 (4%) Frame = -1 Query: 1829 
EPELSIKNTLPIEGSCESSKICLYQEKEKGKALS-GEDIYGRSSDVEDDNHESASSSNSR 1653 E ++ +N+ +EGS S++ L+Q+K K K LS G+ G S+D E++ HES S NS Sbjct: 56 EMKVHEENSRSVEGSPTGSRVFLHQDKGKEKVLSDGDRNVGPSTDDEENTHESVESCNSA 115 Query: 1652 LSTQGKKGVKRQIFDEDMVTGSKRTKTQIHGSNTKPDSSFMKWISNMTRGGLSGLN---- 1485 + KGVKRQ D ++V SKR K + + DSSFM WISNM + G+S N Sbjct: 116 V-LYCPKGVKRQSCD-NLVLESKRMKKEDSSFILRHDSSFMNWISNMVK-GISDSNKEYS 172 Query: 1484 -LEDSLS---PLPLACSNGVLSRKYDENFMCFRPQNSKNLSTGFQTVFQSLYCRETDASK 1317 LEDS S L LACS V +K +SKNLS G TV ++ + S+ Sbjct: 173 PLEDSPSAHLALTLACSTDVYGKK---------THDSKNLSMGNTTVIY----KDKEESR 219 Query: 1316 KXXXXXXXXXXXXXXXXEGSPENLRESDERIFGNSS-----EQTIPSSKEDDRSRDPSE- 1155 + SPE+ S E+ NSS E + + + SE Sbjct: 220 ------------VMVAEKSSPESPSLSSEKDKNNSSGLCNKEANPITVGQTSKPWIFSEY 267 Query: 1154 SENQVIRAGPLDEETKVGTKEAYPDIPLATSSVLERSDRRVSLWISRLSTKTLRSERGKE 975 EN+ + + E G K +I + ++S+ SLWI+RLSTK R E+ E Sbjct: 268 VENEDLAKKGVMESDSSGEK---TNITAEIKATPDKSNPLTSLWITRLSTKNPRLEKSDE 324 Query: 974 I 972 + Sbjct: 325 V 325 >ref|XP_007034986.1| Zinc knuckle family protein, putative isoform 3 [Theobroma cacao] gi|508714015|gb|EOY05912.1| Zinc knuckle family protein, putative isoform 3 [Theobroma cacao] Length = 909 Score = 188 bits (478), Expect = 9e-45 Identities = 150/404 (37%), Positives = 201/404 (49%), Gaps = 88/404 (21%) Frame = -1 Query: 1079 IPLATSSVLERSDRRVSLWISRLSTKTLRSERGKEIL-----VEADLDSNEESA--DLKS 921 IP + ++ S+ ++ + + K L S GKE+ +EA + N+ + D KS Sbjct: 509 IPCSQNNFNASSNLKIMEASQKCAEKPLTSS-GKELPNCATEIEASIGFNKITVQNDQKS 567 Query: 920 VNELCTVLPSRRFS-SEAMASDFARRLDALKHI-----TSSKKGIYSTCLFCGGC-HDVR 762 ++ T+LPS R SEAMAS FARRLDALKHI + S TC FCG H ++ Sbjct: 568 KYKVSTILPSPRLKDSEAMASLFARRLDALKHIMPSGVSDSTASSTITCFFCGRKGHHLQ 627 Query: 761 GCSGVTRSELEYLL--LKSSAFDSRVEESP-----CF-----CIRC----SKPDHWAVS- 633 C +T +E+E LL +KSS SR+EE P CF + C S+ H + Sbjct: 628 YCPEITDNEIEDLLRNMKSS---SRLEELPCVCIRCFELNHWAVACPNTSSRGQHQSAHR 684 Query: 632 ------CPVGPTSRFGARRLMLCGGEDQ-----------------------SLRLRDTSD 540 C + +RF + +L ED + 
++R ++ Sbjct: 685 ASLANLCKLHCYARFEEHKRLLDDNEDAIASPTVCDGVDTGKGPGTDYGVTAEKVRSNTN 744 Query: 539 VNNN----SAKS------------------------EIFDAIRSLRLTRADILRWMNSNV 444 VN S+K IF A+R LRL+R DIL+W NS + Sbjct: 745 VNKKYVAYSSKEIELKENQITPWGNFINQQVSGMPKAIFSAVRMLRLSRTDILKWTNSQI 804 Query: 443 PSSHLNGFFLRLRLGSVEAGLGGSSYYVACITGDVKGNAGCTSKNSILVDVGGILSSVAC 264 SHL GFFLRLRLG E GLGG+ YYVACITG + + SK+S+ V VGGI V Sbjct: 805 SISHLEGFFLRLRLGKWEEGLGGTGYYVACITGAHRQSTQRNSKSSVSVSVGGIKCLVES 864 Query: 263 QYISNHDFLEDEIKAWWSRTEGSGCRLPSLAELNSKFNDRQSLG 132 QYISNHDFLEDE+ AWWS T SG ++PS EL SK +R+ LG Sbjct: 865 QYISNHDFLEDELMAWWSATTRSGGKIPSEEELTSKVKERRMLG 908 Score = 82.0 bits (201), Expect = 1e-12 Identities = 63/162 (38%), Positives = 85/162 (52%), Gaps = 12/162 (7%) Frame = -1 Query: 1778 SSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSR-LSTQGKKGVKRQIFDED 1602 +S+I + K K K LS D+ G S EDD+HES S NS L + GK KR F+++ Sbjct: 184 NSRIHRFSRKGKEKVLSDGDVKGMMSKEEDDSHESVESCNSTGLFSTGK---KRWGFEQE 240 Query: 1601 MVTGSKRTKTQI-----HGSNTKPDSSFMKWISNMTRGGLSGLNLEDSLSPLPLACSNGV 1437 ++ GSK K QI S K DSSFM WISNM +G +D PL L +N Sbjct: 241 LIVGSKIVKKQIDESPCSSSFVKQDSSFMNWISNMMKGFSKS---KDETPPLALTVANPK 297 Query: 1436 LSRK-YDENFMCFRPQNSKN-----LSTGFQTVFQSLYCRET 1329 S + D+N N+KN + GFQ++FQS+Y +T Sbjct: 298 QSHEGPDKNL----DANNKNQDPGCRNIGFQSIFQSIYSPKT 335 >ref|XP_007034984.1| Zinc knuckle family protein, putative isoform 1 [Theobroma cacao] gi|590658913|ref|XP_007034985.1| Zinc knuckle family protein, putative isoform 1 [Theobroma cacao] gi|508714013|gb|EOY05910.1| Zinc knuckle family protein, putative isoform 1 [Theobroma cacao] gi|508714014|gb|EOY05911.1| Zinc knuckle family protein, putative isoform 1 [Theobroma cacao] Length = 1087 Score = 188 bits (478), Expect = 9e-45 Identities = 150/404 (37%), Positives = 201/404 (49%), Gaps = 88/404 (21%) Frame = -1 Query: 1079 IPLATSSVLERSDRRVSLWISRLSTKTLRSERGKEIL-----VEADLDSNEESA--DLKS 921 IP + ++ S+ ++ + + K L S GKE+ +EA + N+ + D KS Sbjct: 687 
IPCSQNNFNASSNLKIMEASQKCAEKPLTSS-GKELPNCATEIEASIGFNKITVQNDQKS 745 Query: 920 VNELCTVLPSRRFS-SEAMASDFARRLDALKHI-----TSSKKGIYSTCLFCGGC-HDVR 762 ++ T+LPS R SEAMAS FARRLDALKHI + S TC FCG H ++ Sbjct: 746 KYKVSTILPSPRLKDSEAMASLFARRLDALKHIMPSGVSDSTASSTITCFFCGRKGHHLQ 805 Query: 761 GCSGVTRSELEYLL--LKSSAFDSRVEESP-----CF-----CIRC----SKPDHWAVS- 633 C +T +E+E LL +KSS SR+EE P CF + C S+ H + Sbjct: 806 YCPEITDNEIEDLLRNMKSS---SRLEELPCVCIRCFELNHWAVACPNTSSRGQHQSAHR 862 Query: 632 ------CPVGPTSRFGARRLMLCGGEDQ-----------------------SLRLRDTSD 540 C + +RF + +L ED + ++R ++ Sbjct: 863 ASLANLCKLHCYARFEEHKRLLDDNEDAIASPTVCDGVDTGKGPGTDYGVTAEKVRSNTN 922 Query: 539 VNNN----SAKS------------------------EIFDAIRSLRLTRADILRWMNSNV 444 VN S+K IF A+R LRL+R DIL+W NS + Sbjct: 923 VNKKYVAYSSKEIELKENQITPWGNFINQQVSGMPKAIFSAVRMLRLSRTDILKWTNSQI 982 Query: 443 PSSHLNGFFLRLRLGSVEAGLGGSSYYVACITGDVKGNAGCTSKNSILVDVGGILSSVAC 264 SHL GFFLRLRLG E GLGG+ YYVACITG + + SK+S+ V VGGI V Sbjct: 983 SISHLEGFFLRLRLGKWEEGLGGTGYYVACITGAHRQSTQRNSKSSVSVSVGGIKCLVES 1042 Query: 263 QYISNHDFLEDEIKAWWSRTEGSGCRLPSLAELNSKFNDRQSLG 132 QYISNHDFLEDE+ AWWS T SG ++PS EL SK +R+ LG Sbjct: 1043 QYISNHDFLEDELMAWWSATTRSGGKIPSEEELTSKVKERRMLG 1086 Score = 82.0 bits (201), Expect = 1e-12 Identities = 63/162 (38%), Positives = 85/162 (52%), Gaps = 12/162 (7%) Frame = -1 Query: 1778 SSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSR-LSTQGKKGVKRQIFDED 1602 +S+I + K K K LS D+ G S EDD+HES S NS L + GK KR F+++ Sbjct: 362 NSRIHRFSRKGKEKVLSDGDVKGMMSKEEDDSHESVESCNSTGLFSTGK---KRWGFEQE 418 Query: 1601 MVTGSKRTKTQI-----HGSNTKPDSSFMKWISNMTRGGLSGLNLEDSLSPLPLACSNGV 1437 ++ GSK K QI S K DSSFM WISNM +G +D PL L +N Sbjct: 419 LIVGSKIVKKQIDESPCSSSFVKQDSSFMNWISNMMKGFSKS---KDETPPLALTVANPK 475 Query: 1436 LSRK-YDENFMCFRPQNSKN-----LSTGFQTVFQSLYCRET 1329 S + D+N N+KN + GFQ++FQS+Y +T Sbjct: 476 QSHEGPDKNL----DANNKNQDPGCRNIGFQSIFQSIYSPKT 513 Score = 62.4 bits (150), Expect = 1e-06 Identities = 31/63 (49%), Positives = 38/63 (60%) Frame = -1 Query: 2282 
LSTSSGAGVNAGSKVNRAFATCDPLSELVWSPKNGVELKCANFRADDNRKPFLLWNVGLK 2103 LS GAG NA S+++ F T DPLSELVWSP NG LKC + D +K L+W G Sbjct: 29 LSNDLGAGANAASRIDMTFVTTDPLSELVWSPHNGPSLKCTDCCFSD-KKQSLVWGAGPS 87 Query: 2102 PVV 2094 V+ Sbjct: 88 NVI 90 >ref|XP_006280040.1| hypothetical protein CARUB_v10025917mg [Capsella rubella] gi|482548744|gb|EOA12938.1| hypothetical protein CARUB_v10025917mg [Capsella rubella] Length = 780 Score = 181 bits (458), Expect = 2e-42 Identities = 180/544 (33%), Positives = 250/544 (45%), Gaps = 70/544 (12%) Frame = -1 Query: 1766 CLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSR-LSTQGKKGVKRQIFDEDMVTG 1590 C +EK K KALS + G +D D++ S S NS L T+GKK R F++ ++ G Sbjct: 249 CRAKEKGKEKALSDGNSEGDEND--DESFGSVESCNSAGLLTRGKK---RPGFEQQLILG 303 Query: 1589 SKRTKT---QIHGSNTK--PDSSFMKWISNMTRGGLSGLNLEDS----LSPLPLACSNGV 1437 SKR KT + GS +K DSSFM WISNMT+G G +DS L+ A +G Sbjct: 304 SKRLKTLSQECLGSTSKLKQDSSFMNWISNMTKGIWKGNEEDDSPFVALTTTSDANGHGQ 363 Query: 1436 LSRKYDENFM--CFRPQNSKNLSTGFQTVFQSLYCRETDASKKXXXXXXXXXXXXXXXXE 1263 ++ D+ + C +NS +TGFQ+ FQS+YC + ++ Sbjct: 364 VNVIADQQKLSPCCVKENSGCRNTGFQSFFQSIYCPKKESQDAVEMDFANDANATSLQEL 423 Query: 1262 G-SPENLRESDERIFGNS----SEQTIPSSK-------EDDRSRDPSESENQVIRAGPL- 1122 PE + GN +E I S K E S D E + I PL Sbjct: 424 PWIPEECDITKVTSSGNEIGPVAEPNISSEKVGFNQTSEPLSSADKHEVKEPNISLMPLS 483 Query: 1121 ----DEETKVGTKEAYPDIPLATSSVLERSDRRVSLWISRLSTKTLRSERGKEILVEADL 954 +EE K+ + P T+ R+ SLWISR S+ + + ++ +E E + Sbjct: 484 KSKLNEEPKICGEADGKVSPCLTN----RNSGLESLWISRFSSISSQ-KKARETAKEGN- 537 Query: 953 DSNEESADLKSVNELC-----------------------TVLP----SRRFSSEAMASDF 855 DS ++ + ++ T LP R SSEAMAS F Sbjct: 538 DSASDATQTRDSQKMLEDNNVFIEPKPNISLLYRLDKQNTALPIVSSMRIDSSEAMASLF 597 Query: 854 ARRLDALKHITSS-----KKGIYSTCLFCGGC----HDVRGCSGVTRSELEYLLLKSSAF 702 AR+L+A+KHI S + T L C C H +R C VT EL L+ SA Sbjct: 598 ARKLEAMKHIMLSGDIAENAEVEQTNLICFYCGKKGHCLRDCLEVTDIELRDLVQNISAR 657 Query: 701 DSRVEESPCFCIRCSKPDHWAVSCPVGPTSRFGA-----RRLMLCGGEDQSLRLRDTSDV 537 + R EE+ CIRC + HWA +CP P GA ++ + + L L +DV 
Sbjct: 658 NGR-EEASSLCIRCFQLSHWAATCPNAPLYSSGAEDRAVKQALASTSGTKLLPLSGFTDV 716 Query: 536 NNNSAKSEIFDAIRSLRLTRADILRWMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYVA 357 +FDA++ LRLTR +L+W+N+ S L GFFLRLRLG E GLGG+ YYVA Sbjct: 717 -----PKAVFDAVQVLRLTRTHVLKWLNTKKSVSGLEGFFLRLRLGKWEEGLGGTGYYVA 771 Query: 356 CITG 345 I G Sbjct: 772 RIDG 775 >ref|XP_004134425.1| PREDICTED: uncharacterized protein LOC101216376 [Cucumis sativus] Length = 1004 Score = 176 bits (445), Expect = 6e-41 Identities = 113/314 (35%), Positives = 165/314 (52%), Gaps = 43/314 (13%) Frame = -1 Query: 944 EESADLKSVNELCTVLPSRRFSS-EAMASDFARRLDALKHITSSKKGIYS-----TCLFC 783 ++ ++ KS+++ + L S + S EAMAS FARRL ALKHI S I TC FC Sbjct: 701 KDHSEQKSISKFKSALRSPKIRSPEAMASVFARRLGALKHIIPSDLTINVGNETVTCFFC 760 Query: 782 GGC-HDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIRCSKPDHWAVSCPVGPTS-- 612 G H++ CS +T E+E L ++ F + + PC CIRC + +HWA++CP+ P Sbjct: 761 GTKGHNLHNCSEITEREIEDLS-RNIRFCNETVDPPCSCIRCFQLNHWAIACPLAPARCQ 819 Query: 611 ---------------------------------RFGARRLMLCGGEDQSLRLRDTSDVNN 531 RF + L + +++ D N Sbjct: 820 QQSDSHVSLADRYDSVTEQVKSAAISFPKCVPPRFPEKSLK----GSEMVQVDSFVDNQN 875 Query: 530 NSAKSEIFDAIRSLRLTRADILRWMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYVACI 351 ++ + +A++ LRL+R+++L+ M+S+ S L+GFFLR+RLG E GLGG+ Y+VACI Sbjct: 876 SNISHAVLNAVKKLRLSRSNVLKCMSSHTSLSLLDGFFLRIRLGKWEEGLGGTGYHVACI 935 Query: 350 TGDVKGNAGCTSKNSILVDVGGILSSVACQYISNHDFLEDEIKAWWSRTEGSGCR-LPSL 174 G +KNSI V V G+ V QYISNHDFLEDE++AWW GC LP Sbjct: 936 RG------AQLTKNSISVIVRGVECQVQTQYISNHDFLEDELRAWWCTISRDGCNALPLA 989 Query: 173 AELNSKFNDRQSLG 132 A+L +K ++ LG Sbjct: 990 ADLRAKVKKKRELG 1003 Score = 68.6 bits (166), Expect = 1e-08 Identities = 50/155 (32%), Positives = 78/155 (50%), Gaps = 6/155 (3%) Frame = -1 Query: 1787 SCESSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSRLSTQGKKGVKRQIFD 1608 S S ++ Q K K KALS D++GR +D+++ S S NS + K +R F+ Sbjct: 328 SPSSCRMHWIQRKGKEKALSDGDVHGRMLKKDDNSYGSVESCNSAFRSTSK---RRWSFE 384 Query: 1607 EDMVTGSKRTKTQIHG-----SNTKPDSSFMKWISNMTRGGLSGLNLEDSLSPLPLACSN 1443 + ++ 
G+KR K Q SN DSSFM WISNM +G +++D L L + Sbjct: 385 QRLIVGNKRAKKQDGNASGPTSNLGQDSSFMIWISNMMKG--FSESIQDEAPTLDLTLAK 442 Query: 1442 GVLSRKYDENFMCFRPQNSKNLS-TGFQTVFQSLY 1341 + + ++ N+ S GFQ++F+SLY Sbjct: 443 CDVEQGGPNEEPIYKKINAPGFSGIGFQSIFRSLY 477 Score = 59.7 bits (143), Expect = 7e-06 Identities = 33/95 (34%), Positives = 54/95 (56%), Gaps = 4/95 (4%) Frame = -1 Query: 2282 LSTSSGAGVNAGSKVNRAFATCDPLSELVWSPKNGVELKCANFRADDNRKPFLLWNVGLK 2103 L+ SG G NAGS V+ + T D LSELVWSP G+ L+CA+ + +NRK +LW+ Sbjct: 29 LTNRSGVGANAGSMVDVKYVTTDSLSELVWSPHKGLSLRCAD-SSFNNRKTSILWDA--- 84 Query: 2102 PVVDQGNLTVSQMVL----DANDIIIGKATVLKDS 2010 ++ N + Q V+ +N+++ + +L + Sbjct: 85 -AANKANFALPQSVIAEKSTSNNLLDNRTIILSQA 118 >ref|XP_004511402.1| PREDICTED: uncharacterized protein LOC101494426 isoform X5 [Cicer arietinum] Length = 836 Score = 172 bits (435), Expect = 9e-40 Identities = 217/854 (25%), Positives = 330/854 (38%), Gaps = 134/854 (15%) Frame = -1 Query: 2288 KLLSTSSGAGVNAGS-KVNRAFATCDPLSELVWSP------------------------- 2187 K+L SGAG NA S + + A DPLSE+VWSP Sbjct: 28 KILKNESGAGANAASSRADMTLAANDPLSEIVWSPEKERASNLPDDQDGNPTKDWEKNTG 87 Query: 2186 -KNGVEL-KCANFRADDNRKP---FLLWNVGLKPVVDQGNLTVSQMVLDANDIIIGKATV 2022 K G E K + R+P L+ + KP + + N + + + DI GK Sbjct: 88 DKAGTETDKLSTISGQVGRRPSYNILIQSDEPKPTIMERNTSPRRPSNEGIDIDTGK--- 144 Query: 2021 LKDSGGLESDREL--EEKADKQDAILXXXXXXXXXXXXXXXXXXXXXXXXENDLCRLTEK 1848 K++G + D + E K + +D ENDL + + Sbjct: 145 -KEAGTTDDDLHISFEPKIEYKDL----GASGTNLTSSTRNFLEKPESGAENDLRNVETE 199 Query: 1847 DECSLV---------------EPELSIKNTLPIEGSCESSKICLYQEKEKGKALSGEDIY 1713 C++ E L L + SS+I + ++K K K+LS D Sbjct: 200 TACAVTCGVIVNETKNESQDNEMTLLCDKVLSVSHYPCSSRIHMTKDKGKEKSLSDGDAN 259 Query: 1712 GRSSDVEDDNHESASSSNSR--LSTQGKKGVKRQIFDEDMVTGSKRTKTQIHGSN----- 1554 R +++D+H S S NS ST G KR+ + ++ GSKR K I ++ Sbjct: 260 VRLP-MDNDSHSSVESRNSAGFFST----GKKRRSIQQQLIIGSKRVKKNIEETSGSKPC 314 Query: 1553 TKPDSSFMKWISNMTRGGLSGLNLEDSLSPLPLACSNGVLSRKYDENFMCFRPQNSKNLS 1374 TK DSSF WIS+M +G + PL LA +R ++ C Q++ + Sbjct: 315 
TKQDSSFKNWISSMVKGLSQSIQHNSDTLPLSLANPYHRHARPDEKLISCKMNQDTVPKN 374 Query: 1373 TG----FQTVF---------QSLYCRETDASKKXXXXXXXXXXXXXXXXEGSPENLRE-- 1239 TG FQ+++ + L+ E + +L E Sbjct: 375 TGFKSIFQSMYRPSLKNVRTRMLHQEEESNEDSEPSKMIHGINATPITCFAANNSLAEQR 434 Query: 1238 --------SDERIFGNSSEQTIPSSK----EDDRSRDPSESENQVIRAGPLDEETKVGTK 1095 S R SE TI ++ +P E+EN + D+E Sbjct: 435 FQSNKFEASPARYDAGPSEPTIAPLNFFNCQESIKNNPVENENCSNLSLSKDKEEMASNS 494 Query: 1094 EAY--------------PDIPLATSSVLERSDRRVSLWISRLSTKTLR------------ 993 + P ++ R D SLWI+R ++K+ Sbjct: 495 SSSRQNANNTDNVDSNAPSERKEAQNICHRRDNLGSLWITRFASKSTPPLTISDRLNERS 554 Query: 992 ---------SERGKEILVEADLDSNEESADLKSVNELCTVLPSRRFS-SEAMASDFARRL 843 + + + L ++ + D KS + S F+ SE MAS FARR Sbjct: 555 FHCKIEETGEQPANDTKISIGLKEDKGNNDHKSHYLFNNISSSPGFTNSEQMASIFARRF 614 Query: 842 DALKHI-------TSSKKGIYSTCLFCGGC-HDVRGCSGVTRSELEYLLLKSSAFDSRVE 687 A+KHI ++ + + C+ GG HD G+ + + LK + D Sbjct: 615 IAIKHIMPTNNEGSARPQPDEADCILSGGTIHD-----GIDHETDQNINLKRKSNDIIT- 668 Query: 686 ESPCFCIRCSKPDHWAVSCPVGPTSRFGARRLMLCGGEDQSLRLRDT--------SDVNN 531 F I CS C + +LRD ++ N Sbjct: 669 ----FKIECSAS----------------------CKSTSKENKLRDKPITSPFRMAEKNI 702 Query: 530 NSAKSEIFDAIRSLRLTRADILRWMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYVACI 351 + IFDA+++L+L+R++IL+W+ + S LNGFFLRLRLG E G G + Y+VA I Sbjct: 703 SHVPEGIFDAVKNLQLSRSEILKWITVHGSISQLNGFFLRLRLGKWEEGHGRTGYHVAYI 762 Query: 350 TGDVKGNAGCTSKNSILVDVGGILSSVACQYISNHDFLEDEIKAWWSRTEGSGCRLPSLA 171 + + S+ V V G+ V YISNHDFLE+EI WWS T +G +PS Sbjct: 763 NETERHSLEQHMTKSLSVKVRGMKCMVESHYISNHDFLEEEIMEWWSTTSETGVEIPSEQ 822 Query: 170 ELNSKFNDRQSLGL 129 +L +KF +Q LGL Sbjct: 823 DLIAKFKKKQMLGL 836 >ref|XP_004170660.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC101224596 [Cucumis sativus] Length = 1004 Score = 171 bits (432), Expect = 2e-39 Identities = 110/314 (35%), Positives = 162/314 (51%), Gaps = 43/314 (13%) Frame = -1 Query: 944 EESADLKSVNELCTVLPSRRFSS-EAMASDFARRLDALKHITSSKKGIYS-----TCLFC 783 ++ ++ KS+++ + L S + S EAMAS FARRL ALKHI S I TC FC 
Sbjct: 701 KDHSEQKSISKFKSALRSPKIRSPEAMASVFARRLGALKHIIPSDLTINVGNETVTCFFC 760 Query: 782 GGC-HDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIRCSKPDHWAVSCPVGPTS-- 612 G H++ CS +T E+E L ++ F + + PC CIRC + +HWA++CP+ P Sbjct: 761 GTKGHNLHNCSEITEREIEDLS-RNIRFCNETVDPPCSCIRCFQLNHWAIACPLAPARCQ 819 Query: 611 ---------------------------------RFGARRLMLCGGEDQSLRLRDTSDVNN 531 RF + L + +++ D N Sbjct: 820 QQSDSHVSLADRYDSVTEQVKSAAISFPKCVPPRFPEKSLK----GSEMVQVDSFVDNQN 875 Query: 530 NSAKSEIFDAIRSLRLTRADILRWMNSNVPSSHLNGFFLRLRLGSVEAGLGGSSYYVACI 351 ++ + +A++ LRL+R+++L+ + +N S + FFLR+RLG E GLGG+ Y+VACI Sbjct: 876 SNISHAVLNAVKKLRLSRSNVLKXVGTNFCPSSIRWFFLRIRLGKWEEGLGGTGYHVACI 935 Query: 350 TGDVKGNAGCTSKNSILVDVGGILSSVACQYISNHDFLEDEIKAWWSRTEGSGCR-LPSL 174 G +KNSI V V G+ V QYISNHDFLEDE++AWW GC LP Sbjct: 936 RG------AQLTKNSISVIVRGVECQVQTQYISNHDFLEDELRAWWCTISRDGCNALPLA 989 Query: 173 AELNSKFNDRQSLG 132 A+L +K ++ LG Sbjct: 990 ADLRAKVKKKRELG 1003 Score = 68.6 bits (166), Expect = 1e-08 Identities = 50/155 (32%), Positives = 78/155 (50%), Gaps = 6/155 (3%) Frame = -1 Query: 1787 SCESSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSRLSTQGKKGVKRQIFD 1608 S S ++ Q K K KALS D++GR +D+++ S S NS + K +R F+ Sbjct: 328 SPSSCRMHWIQRKGKEKALSDGDVHGRMLKKDDNSYGSVESCNSAFRSTSK---RRWSFE 384 Query: 1607 EDMVTGSKRTKTQIHG-----SNTKPDSSFMKWISNMTRGGLSGLNLEDSLSPLPLACSN 1443 + ++ G+KR K Q SN DSSFM WISNM +G +++D L L + Sbjct: 385 QRLIVGNKRAKKQDGNASGPTSNLGQDSSFMIWISNMMKG--FSESIQDEAPTLDLTLAK 442 Query: 1442 GVLSRKYDENFMCFRPQNSKNLS-TGFQTVFQSLY 1341 + + ++ N+ S GFQ++F+SLY Sbjct: 443 CDVEQGGPNEEPIYKKINAPGFSGIGFQSIFRSLY 477 Score = 59.7 bits (143), Expect = 7e-06 Identities = 33/95 (34%), Positives = 54/95 (56%), Gaps = 4/95 (4%) Frame = -1 Query: 2282 LSTSSGAGVNAGSKVNRAFATCDPLSELVWSPKNGVELKCANFRADDNRKPFLLWNVGLK 2103 L+ SG G NAGS V+ + T D LSELVWSP G+ L+CA+ + +NRK +LW+ Sbjct: 29 LTNRSGVGANAGSMVDVKYVTTDSLSELVWSPHKGLSLRCAD-SSFNNRKTSILWDA--- 84 Query: 2102 PVVDQGNLTVSQMVL----DANDIIIGKATVLKDS 2010 ++ N + Q V+ +N+++ + +L + Sbjct: 85 
-AANKANFALPQSVIAEKSTSNNLLDNRTIILSQA 118 >ref|XP_002312573.2| hypothetical protein POPTR_0008s16240g [Populus trichocarpa] gi|550333200|gb|EEE89940.2| hypothetical protein POPTR_0008s16240g [Populus trichocarpa] Length = 1045 Score = 168 bits (426), Expect = 1e-38 Identities = 120/309 (38%), Positives = 153/309 (49%), Gaps = 76/309 (24%) Frame = -1 Query: 932 DLKSVNELCTVLPSRRF-SSEAMASDFARRLDALKHI-----TSSKKGIYSTCLFCG-GC 774 D KS+ ++ + LP RF +SEAMAS FARRLDALKHI T TC FCG Sbjct: 737 DEKSMCKVNSTLPFSRFRNSEAMASVFARRLDALKHIMPSYGTDDSSHGNLTCFFCGIKG 796 Query: 773 HDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIRCSKPDHWAVSCPVGPT-----SR 609 H VR C + SEL +L +++F+ E PC CIRC + +HWAV+CP + + Sbjct: 797 HHVRDCPEIIDSELADILRNANSFNG-ANEFPCVCIRCFQSNHWAVACPSASSRTRHQAE 855 Query: 608 FGAR--------RLML----------CGGEDQSLRLRDTSDVNNN--------------- 528 +GA +++L G+D L+ D V N Sbjct: 856 YGASLVHESSPCKILLNPRNEDDAKQSDGKDSQLQAADAPTVCNGKLHEASASRKMNMNM 915 Query: 527 ------------------------SAKSEIFD-------AIRSLRLTRADILRWMNSNVP 441 S S+I D A++ LRL+R IL+WMNS+ P Sbjct: 916 KPFERDTASSSGEKKLKENQVMPLSINSQILDVPKGIFDAVKRLRLSRTIILKWMNSHTP 975 Query: 440 SSHLNGFFLRLRLGSVEAGLGGSSYYVACITGDVKGNAGCTSKNSILVDVGGILSSVACQ 261 SHL+GFFLRLRLG E GLGG+ YYVACITG ++ KNSI V VGG+ V Q Sbjct: 976 PSHLDGFFLRLRLGKWEQGLGGTGYYVACITGVQSQSSKQKFKNSIAVIVGGVKCLVESQ 1035 Query: 260 YISNHDFLE 234 YISNHDF E Sbjct: 1036 YISNHDFTE 1044 Score = 85.9 bits (211), Expect = 9e-14 Identities = 66/172 (38%), Positives = 90/172 (52%), Gaps = 8/172 (4%) Frame = -1 Query: 1808 NTLPIEGSCESSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNS-RLSTQGKK 1632 N I+ S S+ YQ K K KALS ++ R D++DD+HES S NS L + GK Sbjct: 345 NDCAIKQSPTYSRTRRYQMKGKAKALSDGNLNERMLDMDDDSHESVESCNSVGLFSTGK- 403 Query: 1631 GVKRQIFDEDMVTGSKRTKTQIH-----GSNTKPDSSFMKWISNMTRGGLSGLNLEDSLS 1467 +++ FD GSK KT+I S K D SFM WISNM +G L + ED Sbjct: 404 --RQRNFDPHSYVGSKSIKTKIQESPGSSSFVKHDGSFMNWISNMMKGFLK--SNEDEAP 459 Query: 1466 PLPLACSNGVLSRK-YDENFM-CFRPQNSKNLSTGFQTVFQSLYCRETDASK 1317 L L +N + D+N + C R Q+ + GF ++FQSLYC +T A 
+ Sbjct: 460 SLALTLANHKHGHEDRDKNLISCNRNQDQGCKTMGFHSLFQSLYCPKTKAQE 511 Score = 59.7 bits (143), Expect = 7e-06 Identities = 36/95 (37%), Positives = 49/95 (51%), Gaps = 6/95 (6%) Frame = -1 Query: 2348 MNIEDKDVDSEHEPNFRIR------AKLLSTSSGAGVNAGSKVNRAFATCDPLSELVWSP 2187 M+ DK+++ + F + + L SGAG NA S V+ F + LSELVWSP Sbjct: 1 MDTNDKNIEPVIDLGFSLGYSNQCIQRRLKNDSGAGANAASSVDMTFVATNALSELVWSP 60 Query: 2186 KNGVELKCANFRADDNRKPFLLWNVGLKPVVDQGN 2082 K G+ LKCA+ N+KP LL G +V N Sbjct: 61 KKGLSLKCAD-GTFSNQKPSLLRGAGPSDMVSGSN 94 >ref|XP_006420121.1| hypothetical protein CICLE_v10004215mg [Citrus clementina] gi|567854004|ref|XP_006420122.1| hypothetical protein CICLE_v10004215mg [Citrus clementina] gi|567854006|ref|XP_006420123.1| hypothetical protein CICLE_v10004215mg [Citrus clementina] gi|557521994|gb|ESR33361.1| hypothetical protein CICLE_v10004215mg [Citrus clementina] gi|557521995|gb|ESR33362.1| hypothetical protein CICLE_v10004215mg [Citrus clementina] gi|557521996|gb|ESR33363.1| hypothetical protein CICLE_v10004215mg [Citrus clementina] Length = 1093 Score = 159 bits (403), Expect = 5e-36 Identities = 124/345 (35%), Positives = 165/345 (47%), Gaps = 77/345 (22%) Frame = -1 Query: 932 DLKSVNELCTVLPSRRFSSEAMASDFARRLDALKHITSSKKGIYS-----TCLFCG-GCH 771 D KS +L ++PS RF + AMAS FARRLDAL+HIT S + TC +CG H Sbjct: 752 DQKSKCKLNPIIPSPRFQNSAMASVFARRLDALRHITPSAVTDNAACTAITCFYCGRKGH 811 Query: 770 DVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIRCSKPDHWAVSCPVGPTSRFGARRL 591 +R CS ++ EL+ L ++++ EE C CIRC + DHWAVSCP + Sbjct: 812 PLRDCSEISDGELKDLTRNINSYNG-AEELHCLCIRCFELDHWAVSCPNATSRSQSLLEG 870 Query: 590 MLCGGEDQSLRLRDTSDVN----NN-----SAKSEIFDA-----------IRSL------ 489 CG + L R+ N NN + I+D IR L Sbjct: 871 CNCGPNEFQLNKRNDESKNLLYGNNCLYQATGSHTIYDRDDPQREADPKFIRKLPEVVTS 930 Query: 488 --RLTRADILRWMNS------NVPSSHLN-------GFFLRLRL---------------- 402 + A +++ N+ NV + H++ F R+RL Sbjct: 931 DRMIPNAYLIKDCNASGSGEKNVVNRHISEVPKGIFDFIKRIRLSRTDILKCMNSHMSLA 990 Query: 401 
-----------GSVEAGLGGSSYYVACITG---DVKGNAGCTSKNSILVDVGGILSSVAC 264 G + GLGG+ YYVACITG ++ AG SKNSI V+VGGI V Sbjct: 991 HLKGFFLRLRLGKWDEGLGGTGYYVACITGAQREISSPAG--SKNSISVNVGGINCLVES 1048 Query: 263 QYISNHDFLEDEIKAWWSRTEGSGCRLPSLAELNSKFNDRQSLGL 129 QYISNHDFLEDE+ AWWS T SG ++PS +L K +R+ LGL Sbjct: 1049 QYISNHDFLEDELMAWWSATVKSGSKIPSEEDLIPKIKERKMLGL 1093 Score = 89.7 bits (221), Expect = 6e-15 Identities = 67/167 (40%), Positives = 96/167 (57%), Gaps = 7/167 (4%) Frame = -1 Query: 1793 EGSCESSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSRLSTQGKKGVKRQI 1614 E S +S+I Y+ K K KALS D+ R S +DD+HES S NS K KR Sbjct: 358 EHSPTTSRIRRYRRKGKEKALSDGDVNERMSKDDDDSHESVESCNSTGLFSTCK--KRWS 415 Query: 1613 FDEDMVTGSKRTKTQIH---GSNT--KPDSSFMKWISNMTRGGLSGLNLEDSLS-PLPLA 1452 F++ ++ GSK+ K QI GS + K DSSFM WI NM + G NL++S S L LA Sbjct: 416 FEQQLIVGSKKVKKQIRETTGSTSCVKQDSSFMNWILNMMK-GFPKSNLDNSPSVDLTLA 474 Query: 1451 CSNGVLSRKYDENFMCFRP-QNSKNLSTGFQTVFQSLYCRETDASKK 1314 C+N + D+ F+ ++ Q+S+ + GFQ++FQSLY +T ++ Sbjct: 475 CTN-YGHKCSDQKFITYKKNQDSECRNVGFQSIFQSLYRPKTKGQER 520 Score = 61.2 bits (147), Expect = 2e-06 Identities = 33/91 (36%), Positives = 54/91 (59%), Gaps = 6/91 (6%) Frame = -1 Query: 2348 MNIEDKDVDSEHEPNFRIR------AKLLSTSSGAGVNAGSKVNRAFATCDPLSELVWSP 2187 MN+E+++++ + + + L++ SGAG NAGS+++ F +PLSELVWS Sbjct: 4 MNVENENIEPVTDLGLALGYSSQCVQRRLNSDSGAGANAGSRIDMKFVAANPLSELVWSS 63 Query: 2186 KNGVELKCANFRADDNRKPFLLWNVGLKPVV 2094 +NG+ LKCA+ D +K +L+ G VV Sbjct: 64 RNGLSLKCADSSFVD-KKSYLILGAGPSNVV 93 >ref|XP_006489529.1| PREDICTED: dentin sialophosphoprotein-like isoform X6 [Citrus sinensis] Length = 1040 Score = 158 bits (400), Expect = 1e-35 Identities = 131/368 (35%), Positives = 173/368 (47%), Gaps = 83/368 (22%) Frame = -1 Query: 983 GKEI---LVEADLDSN----EESADLKSVNELCTVLPSRRFSSEAMASDFARRLDALKHI 825 GKEI EA+ S E + KS +L ++PS RF + AMAS FARRLDAL+HI Sbjct: 676 GKEIQNCAAEAETSSGFNRIEGHDEQKSKCKLNPIIPSPRFQNSAMASVFARRLDALRHI 735 Query: 824 TSSKKGIYS-----TCLFCG-GCHDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIR 663 T S + TC +CG 
H +R CS ++ EL+ L ++++ EE C CIR Sbjct: 736 TPSAVTDNAACTAITCFYCGRKGHHLRDCSEISDGELKDLTRNINSYNG-AEELHCLCIR 794 Query: 662 CSKPDHWAVSCPVGPTSRFGARRLMLCGGEDQSLRLRDTSD---VNNN-----SAKSEIF 507 C K DHW VSCP + CG + L R+ S NN + I+ Sbjct: 795 CFKLDHWDVSCPKATSRSQSLLEGCNCGPNEFQLNKRNESKNLLYGNNCLYQATGSHTIY 854 Query: 506 DA-----------IRSL--------RLTRADILRWMNS------NVPSSHLN-------G 423 D IR L + A +++ N+ NV + H++ Sbjct: 855 DRDDPQREADPKFIRKLPEVVTSDQLIPNAYLIKDCNASGSGEKNVVNRHISEVPKGIFD 914 Query: 422 FFLRLRL---------------------------GSVEAGLGGSSYYVACITG---DVKG 333 F R+RL G + GLGG+ YYVACITG ++ Sbjct: 915 FIKRIRLSRTDILKCMNSHMSCAHLKGFFLRLRLGKWDEGLGGTGYYVACITGAQREISS 974 Query: 332 NAGCTSKNSILVDVGGILSSVACQYISNHDFLEDEIKAWWSRTEGSGCRLPSLAELNSKF 153 AG SKNSI V+VGGI V QYISNHDFLEDE+ AWWS T SG ++PS +L K Sbjct: 975 PAG--SKNSISVNVGGINCLVESQYISNHDFLEDELMAWWSATVKSGSKIPSEEDLIPKI 1032 Query: 152 NDRQSLGL 129 +R+ LGL Sbjct: 1033 KERKMLGL 1040 Score = 84.0 bits (206), Expect = 3e-13 Identities = 61/163 (37%), Positives = 87/163 (53%), Gaps = 3/163 (1%) Frame = -1 Query: 1793 EGSCESSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSRLSTQGKKGVKRQI 1614 E S +S+I YQ K K KALS D+ R S +DD+HES S NS K KR Sbjct: 309 EHSPTTSRIRRYQRKGKEKALSDGDVNERMSKDDDDSHESVESCNSTGLFSTCK--KRWS 366 Query: 1613 FDEDMVTGSKRTKTQIHGSNTKPD--SSFMKWISNMTRGGLSGLNLEDSLSPLPLACSNG 1440 F++ ++ GSK +T + S K D SSFM WISNM + G NL++S S Sbjct: 367 FEQQLIVGSKIQETPVSTSCVKQDSSSSFMNWISNMMK-GFPKSNLDESPSVDRTLAHTN 425 Query: 1439 VLSRKYDENFMCFRP-QNSKNLSTGFQTVFQSLYCRETDASKK 1314 + D F+ ++ Q+S+ + GFQ++FQSLY +T ++ Sbjct: 426 YGHKCSDPKFITYKKNQDSECRNVGFQSIFQSLYRPKTKGQER 468 >ref|XP_006489528.1| PREDICTED: dentin sialophosphoprotein-like isoform X5 [Citrus sinensis] Length = 1064 Score = 158 bits (400), Expect = 1e-35 Identities = 131/368 (35%), Positives = 173/368 (47%), Gaps = 83/368 (22%) Frame = -1 Query: 983 GKEI---LVEADLDSN----EESADLKSVNELCTVLPSRRFSSEAMASDFARRLDALKHI 825 GKEI EA+ S E + KS +L ++PS RF + AMAS FARRLDAL+HI Sbjct: 700 
GKEIQNCAAEAETSSGFNRIEGHDEQKSKCKLNPIIPSPRFQNSAMASVFARRLDALRHI 759 Query: 824 TSSKKGIYS-----TCLFCG-GCHDVRGCSGVTRSELEYLLLKSSAFDSRVEESPCFCIR 663 T S + TC +CG H +R CS ++ EL+ L ++++ EE C CIR Sbjct: 760 TPSAVTDNAACTAITCFYCGRKGHHLRDCSEISDGELKDLTRNINSYNG-AEELHCLCIR 818 Query: 662 CSKPDHWAVSCPVGPTSRFGARRLMLCGGEDQSLRLRDTSD---VNNN-----SAKSEIF 507 C K DHW VSCP + CG + L R+ S NN + I+ Sbjct: 819 CFKLDHWDVSCPKATSRSQSLLEGCNCGPNEFQLNKRNESKNLLYGNNCLYQATGSHTIY 878 Query: 506 DA-----------IRSL--------RLTRADILRWMNS------NVPSSHLN-------G 423 D IR L + A +++ N+ NV + H++ Sbjct: 879 DRDDPQREADPKFIRKLPEVVTSDQLIPNAYLIKDCNASGSGEKNVVNRHISEVPKGIFD 938 Query: 422 FFLRLRL---------------------------GSVEAGLGGSSYYVACITG---DVKG 333 F R+RL G + GLGG+ YYVACITG ++ Sbjct: 939 FIKRIRLSRTDILKCMNSHMSCAHLKGFFLRLRLGKWDEGLGGTGYYVACITGAQREISS 998 Query: 332 NAGCTSKNSILVDVGGILSSVACQYISNHDFLEDEIKAWWSRTEGSGCRLPSLAELNSKF 153 AG SKNSI V+VGGI V QYISNHDFLEDE+ AWWS T SG ++PS +L K Sbjct: 999 PAG--SKNSISVNVGGINCLVESQYISNHDFLEDELMAWWSATVKSGSKIPSEEDLIPKI 1056 Query: 152 NDRQSLGL 129 +R+ LGL Sbjct: 1057 KERKMLGL 1064 Score = 84.0 bits (206), Expect = 3e-13 Identities = 61/163 (37%), Positives = 87/163 (53%), Gaps = 3/163 (1%) Frame = -1 Query: 1793 EGSCESSKICLYQEKEKGKALSGEDIYGRSSDVEDDNHESASSSNSRLSTQGKKGVKRQI 1614 E S +S+I YQ K K KALS D+ R S +DD+HES S NS K KR Sbjct: 333 EHSPTTSRIRRYQRKGKEKALSDGDVNERMSKDDDDSHESVESCNSTGLFSTCK--KRWS 390 Query: 1613 FDEDMVTGSKRTKTQIHGSNTKPD--SSFMKWISNMTRGGLSGLNLEDSLSPLPLACSNG 1440 F++ ++ GSK +T + S K D SSFM WISNM + G NL++S S Sbjct: 391 FEQQLIVGSKIQETPVSTSCVKQDSSSSFMNWISNMMK-GFPKSNLDESPSVDRTLAHTN 449 Query: 1439 VLSRKYDENFMCFRP-QNSKNLSTGFQTVFQSLYCRETDASKK 1314 + D F+ ++ Q+S+ + GFQ++FQSLY +T ++ Sbjct: 450 YGHKCSDPKFITYKKNQDSECRNVGFQSIFQSLYRPKTKGQER 492