BLASTX nr result
ID: Akebia23_contig00011766
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00011766 (4289 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                     Score     E
Sequences producing significant alignments:                         (bits)   Value

ref|XP_002515032.1| hypothetical protein RCOM_1082870 [Ricinus c...    129   1e-26
emb|CBI16839.3| unnamed protein product [Vitis vinifera]               124   4e-25
gb|EXB82454.1| hypothetical protein L484_027628 [Morus notabilis]      102   2e-18
ref|XP_007044929.1| Uncharacterized protein isoform 1 [Theobroma...    101   3e-18
ref|XP_006438166.1| hypothetical protein CICLE_v10030561mg [Citr...     99   1e-17
ref|XP_006484006.1| PREDICTED: uncharacterized protein LOC102606...     98   3e-17
gb|EYU30581.1| hypothetical protein MIMGU_mgv1a000837mg [Mimulus...     95   3e-16
ref|XP_004170528.1| PREDICTED: uncharacterized protein LOC101231...     91   4e-15
ref|XP_004143053.1| PREDICTED: uncharacterized protein LOC101207...     91   4e-15
ref|XP_006601919.1| PREDICTED: dentin sialophosphoprotein-like i...     89   2e-14
ref|XP_007044930.1| Uncharacterized protein isoform 2 [Theobroma...     89   2e-14
ref|XP_003552797.1| PREDICTED: dentin sialophosphoprotein-like i...     89   2e-14
ref|XP_002314574.2| COP1-interacting protein 4.1 [Populus tricho...     88   4e-14
ref|XP_003601863.1| hypothetical protein MTR_3g086220 [Medicago ...     88   4e-14
ref|XP_004502350.1| PREDICTED: dentin sialophosphoprotein-like [...     87   9e-14
ref|XP_003537551.1| PREDICTED: dentin sialophosphoprotein-like [...     87   9e-14
ref|XP_006283028.1| hypothetical protein CARUB_v10004020mg [Caps...     84   4e-13
ref|XP_006857783.1| hypothetical protein AMTR_s00061p00209430 [A...
81 5e-12 dbj|BAB32952.1| COP1-interacting protein 4.1 [Arabidopsis thaliana] 80 6e-12 dbj|BAB32951.1| COP1-interacting protein 4 [Arabidopsis thaliana] 80 6e-12 >ref|XP_002515032.1| hypothetical protein RCOM_1082870 [Ricinus communis] gi|223546083|gb|EEF47586.1| hypothetical protein RCOM_1082870 [Ricinus communis] Length = 1078 Score = 129 bits (324), Expect = 1e-26 Identities = 128/494 (25%), Positives = 199/494 (40%), Gaps = 17/494 (3%) Frame = -2 Query: 1726 VPEHADDSAGDIVPLRSDHDKSLDSDRPGGDTNREEDNILSQTEETKESKIKSLNAPLLG 1547 V EH + P + K+ + D G + N+EE N+ +E KE +A LL Sbjct: 606 VIEHVEGFVSGTSPTKPH--KATNGDHSGDNVNKEESNV--PPKEGKEVSEMETSASLLA 661 Query: 1546 FDKKDDQ------QRITQVTRVEETTNYSEDKDRDLTPPTDELKAEENPPXXXXXXXXXX 1385 +K+ D + + Q+ +VE + + K R T ++ P Sbjct: 662 TEKEIDDVIRNAMESVQQIGQVEVSAENMDGKSRKKTKKKGTSDVKDLPELKNENEKLSA 721 Query: 1384 XXXXXXXSINHLTDSTMEAGKEYQISGDVEKPQLPSDTVECNVQGSPLVNSRTDEANNDI 1205 + ++ +++ + Q K +E V G+P + E ++ Sbjct: 722 PAGNKIREAEYSSNGPLKS-QSSQGQPHKTKSNREGRCLEAAVNGNPSKSGHAIEGTCNL 780 Query: 1204 EIPNVDREVNFIDYFLPKQDDQEIV----------TPTXXXXXXXXXXKVQGGVSDKSPL 1055 ++ +NF +YF+P+Q +IV T T + + + S Sbjct: 781 DVSCESSGINFKNYFVPRQQSNKIVGSDEALVDKATKTMEAYGEMKGNENKKKLGAHSHG 840 Query: 1054 QVPLSKNEQSRQASQPNGKASIIAGDS-VKAPTSNDHDKIDAFPXXXXXXXXXXXSGTIS 878 P +N S G + DS VKAP + DK+D+ T S Sbjct: 841 PSPDLQNSYSLTEDHGVGAKPLKVSDSEVKAPLPSKSDKLDS-----------ASENTRS 889 Query: 877 PVHAPPVRSLFNANANTPXXXXXXXXENGVYENKRHGERQLQSNHRRVTVXXXXXXXXXK 698 P S N ++ + N+R QL + R+ + Sbjct: 890 NALKPSATSTHAKNKKAGSVSSLESSKDTNFLNRRVNGPQLHEDDNRMNSRRTSTINSRE 949 Query: 697 VLNNSNNEKSLLATANTIFKDSSTESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAI 518 V+N S +++SL+ +++IFKD + E+S E +NSDA S+G A Sbjct: 950 VVNGSQHKRSLIGVSDSIFKDVTDEASSTED--DNSDASTRTPSDKSLSSDYSDGESNAD 1007 Query: 517 MESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTE 338 SP NG KR + G KP S G +TLD I RSS YKKAKLTA+Q QL+DTE Sbjct: 1008 FNSPLNGSNSCKRKDGGQKTIRKPLSSG---LTLDAILRSSSRYKKAKLTAAQLQLEDTE 1064 Query: 337 SQPIDFVLDSQA*P 296 
SQP++FV DSQA P Sbjct: 1065 SQPVEFVPDSQAKP 1078 >emb|CBI16839.3| unnamed protein product [Vitis vinifera] Length = 1309 Score = 124 bits (311), Expect = 4e-25 Identities = 90/242 (37%), Positives = 128/242 (52%) Frame = -2 Query: 1057 LQVPLSKNEQSRQASQPNGKASIIAGDSVKAPTSNDHDKIDAFPXXXXXXXXXXXSGTIS 878 LQ PLS + ++ + K S ++ + +K+P +D K D P SGT S Sbjct: 996 LQDPLSVDGHNKLMPESVSKFSKVSRNDLKSP--HDIGKFDTIPEEIRWPNVVNASGTSS 1053 Query: 877 PVHAPPVRSLFNANANTPXXXXXXXXENGVYENKRHGERQLQSNHRRVTVXXXXXXXXXK 698 HA ++ A+ +T + Y+NKR G+RQ + RVTV + Sbjct: 1054 TAHAF-LKENGKASLSTSSSDSSE---DRTYQNKR-GKRQSNLDRYRVTVRKAPRKNPGE 1108 Query: 697 VLNNSNNEKSLLATANTIFKDSSTESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAI 518 V+N+S+ KSLLAT +IF D +ESS+D GV NSDA +EG + Sbjct: 1109 VVNSSHQRKSLLATYGSIFNDGGSESSEDHDGVENSDASTRTPSDSSASSDYTEGENNQH 1168 Query: 517 MESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTE 338 ++S +G Y KR E+G + K S G KN+T+D+I RSS +KKAKLTASQS+L+DTE Sbjct: 1169 LDS-SHGLYSTKRNESGAKSIGKSNSSGSKNVTMDVILRSSSRFKKAKLTASQSELNDTE 1227 Query: 337 SQ 332 SQ Sbjct: 1228 SQ 1229 Score = 107 bits (267), Expect = 5e-20 Identities = 52/82 (63%), Positives = 64/82 (78%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV DLK+KI++EHP CF S+G+I IHALKV+RK FFYHLSDSM V+SAFDGV R+WF++ Sbjct: 36 DTVADLKKKILLEHPLCFSSIGKIKIHALKVKRKGFFYHLSDSMFVRSAFDGVKRTWFLH 95 Query: 4108 ADATSSQLGHIGNQLHLEPGFG 4043 DA+SS + NQL P G Sbjct: 96 VDASSS-VEQSENQLACNPDSG 116 >gb|EXB82454.1| hypothetical protein L484_027628 [Morus notabilis] Length = 1284 Score = 102 bits (254), Expect = 2e-18 Identities = 99/347 (28%), Positives = 146/347 (42%), Gaps = 1/347 (0%) Frame = -2 Query: 1345 DSTMEAGKEYQISGDVEKPQLPSDTVECNVQGSPLVNSRTDEANNDIEIPNVDREVNFID 1166 D + A K+ +ISG + LP + LV+ +T AN D + P + D Sbjct: 951 DPSDGANKDIEISGAGSEKPLP------DTSSGGLVDKKTG-ANKDAKTPKSKTNIENPD 1003 Query: 1165 YFLPKQDDQEIVTPTXXXXXXXXXXKVQGGVSDKSPLQVPLSKNEQSRQASQPNGKASII 986 + K + K G S +PLQ LSK+ A QP K Sbjct: 1004 
TYSDKISSA-FQSSQKANRKQGIEKKAPAGKSSTTPLQ-SLSKDNPDESAVQPTEKLQKA 1061 Query: 985 AGDSVKAPTSNDHDKIDAFPXXXXXXXXXXXSGTISPVHAPPVRSLFNANANTPXXXXXX 806 + KA ++ K+++ SGT ++S N + Sbjct: 1062 SKTEAKASPTDVSGKLNSTRKETKMQHAVGVSGT-------NIQSEKNTGLASVSNSPME 1114 Query: 805 XXENGVYENKRHGERQLQSNHRRVTVXXXXXXXXXKVLNNSNNEKSLLATANTIFKDS-S 629 N + ++ + Q + R K++N+ K L+AT TIF+D S Sbjct: 1115 SSRNIISKDVGSNKHQPGMHSYRAANIKAAVKGDGKIVNSLEPTKKLIATPGTIFRDDDS 1174 Query: 628 TESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAIMESPENGPYVRKRVENGGNNTPK 449 ESS+DEGG ++SD S+G + SPE G Y R+++GG +T K Sbjct: 1175 GESSEDEGGTDDSDTSTRTPSDYSQSSDYSDGESNSNFNSPERGSYASNRMKSGGRSTIK 1234 Query: 448 PQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTESQPIDFVLDS 308 S +NMT D I +SS +K+AK TASQ QL+D ESQP +FV DS Sbjct: 1235 SCSSSARNMTFDSILKSSSRFKRAKETASQLQLED-ESQPDEFVPDS 1280 Score = 90.1 bits (222), Expect = 8e-15 Identities = 62/203 (30%), Positives = 96/203 (47%), Gaps = 17/203 (8%) Frame = -2 Query: 4285 TVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIYA 4106 +V D K+KI EH CF ++G I++HALKV+RK YHLSDSM VK AFDG +++WF+ Sbjct: 39 SVSDFKRKIEKEHTLCFTNIGNIVVHALKVKRKGHLYHLSDSMFVKDAFDGASKNWFLSV 98 Query: 4105 DAT-------SSQLGHIGNQLHLEPGFG--AKKTSDSEVLKNHDLVSEGNEVPGTHTGYK 3953 DA+ +L H + +L +G +++ L +G P T + Sbjct: 99 DASIVEEKRDEKRLVHNPDSHNLLTCYGLLCNASANGVDLSLDGSPDQGTSDPNNATSPQ 158 Query: 3952 KGKSKKRPCD--------DKFGETLKKHKNEKKIEEAFSCPVKDPFNEGDSRTFVSSKER 3797 K +K DK GE + + K + F V++ +GD+ V + Sbjct: 159 KHVERKSDVSNHGILGKCDKSGEEASQSDHAAKRKRKFDDEVRNEHLQGDNIDTVKDISK 218 Query: 3796 TQPEKKIFPENTLEDNEKIGNVA 3728 ++I ++ L D EK N A Sbjct: 219 ----REIISQHALGDEEKSKNAA 237 >ref|XP_007044929.1| Uncharacterized protein isoform 1 [Theobroma cacao] gi|508708864|gb|EOY00761.1| Uncharacterized protein isoform 1 [Theobroma cacao] Length = 1112 Score = 101 bits (251), Expect = 3e-18 Identities = 45/67 (67%), Positives = 58/67 (86%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV DLK+KI+ EHP CFP++GEI I+ALKV+RK + YHLSDSM 
VKSAFDGV++SWF+ Sbjct: 37 DTVSDLKKKILYEHPLCFPNIGEIKINALKVKRKGYLYHLSDSMFVKSAFDGVSKSWFLS 96 Query: 4108 ADATSSQ 4088 DA+S++ Sbjct: 97 VDASSAE 103 Score = 89.0 bits (219), Expect = 2e-14 Identities = 57/132 (43%), Positives = 72/132 (54%) Frame = -2 Query: 697 VLNNSNNEKSLLATANTIFKDSSTESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAI 518 V+N+ N+KSLLATA IFK ESS D+ ++ D+ + Sbjct: 986 VVNSLENKKSLLATAGPIFKHDDKESSDDDVVDDSDDSTRSPLDNSSSDDDSNMN----- 1040 Query: 517 MESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTE 338 S +NG + E GG P S PK+M+L I R+S SYKKAKLTASQSQLDD + Sbjct: 1041 SSSSQNGSH-NSEGEGGGRERKNPGSTSPKSMSLHAILRNSSSYKKAKLTASQSQLDDLD 1099 Query: 337 SQPIDFVLDSQA 302 S P +FV DSQA Sbjct: 1100 SLPDEFVPDSQA 1111 >ref|XP_006438166.1| hypothetical protein CICLE_v10030561mg [Citrus clementina] gi|557540362|gb|ESR51406.1| hypothetical protein CICLE_v10030561mg [Citrus clementina] Length = 1128 Score = 99.4 bits (246), Expect = 1e-17 Identities = 82/260 (31%), Positives = 119/260 (45%), Gaps = 77/260 (29%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV DLK+KIM EHP CFP +G I IHALKV+R+ +FYHLSDSM V+SAF GV++SWF+ Sbjct: 37 DTVSDLKKKIMDEHPLCFPEIGGINIHALKVKRRGYFYHLSDSMFVRSAFHGVSKSWFLS 96 Query: 4108 ADATS----SQLGHIG------------NQLHLEPGFGAKK------------------- 4034 +A++ S+ H+G + L+L P A K Sbjct: 97 VEASNVGEQSESRHLGVARFGIMNKPSADGLNLLPYGPATKLSNSDYSSLPQVQRHQIAG 156 Query: 4033 ----------------------TSDSEVLKNHDLVSEGNEVPGTHTGYKKGKSKKRPCDD 3920 SD+E+ +NHDL + E P HT YK+ S+ D Sbjct: 157 MNPAADHSAHNNCNILSLETNHRSDTELQENHDLNIKEYEDPVRHTEYKEDSSRNVTGDA 216 Query: 3919 KFGETLK---KHKN-EKKIEEAFSCPV-----KDPFNEGD-----------SRTFVSSKE 3800 + +L+ KH + KK + P K GD + VS K+ Sbjct: 217 QVNVSLEGSPKHGSVSKKRRVSLEGPAAKKRSKRKKRRGDEVHNHALKQDIASASVSDKD 276 Query: 3799 RTQPEKKIFPENTLEDNEKI 3740 +Q + + P+N+L + E++ Sbjct: 277 ASQ-QDNVVPDNSLLNQERV 295 Score = 84.3 bits (207), Expect = 4e-13 Identities = 56/134 (41%), Positives = 75/134 (55%) Frame = -2 Query: 697 
VLNNSNNEKSLLATANTIFKDSSTESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAI 518 V+N S +KSLLA + TIF+D S SS DE GV+NSD S+GV A Sbjct: 1003 VVNCSKPKKSLLAKSGTIFEDDSNGSSDDEVGVDNSDGSTKAPSDNSLSSNYSDGVSTA- 1061 Query: 517 MESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTE 338 +NG + +++ N KP S + LD I R S +YK +KLTASQ QL+ + Sbjct: 1062 ---KQNGSVSSQIMDSARRNIIKPNS----GVKLDKIMRRSDNYKASKLTASQLQLEAPK 1114 Query: 337 SQPIDFVLDSQA*P 296 SQP++FV DS+A P Sbjct: 1115 SQPVEFVPDSEANP 1128 >ref|XP_006484006.1| PREDICTED: uncharacterized protein LOC102606666 [Citrus sinensis] Length = 1128 Score = 98.2 bits (243), Expect = 3e-17 Identities = 81/260 (31%), Positives = 119/260 (45%), Gaps = 77/260 (29%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV DLK+KIM EHP CFP +G I IHALKV+R+ +FYHLSDSM V++AF GV++SWF+ Sbjct: 37 DTVSDLKKKIMDEHPLCFPEIGGINIHALKVKRRGYFYHLSDSMFVRTAFHGVSKSWFLS 96 Query: 4108 ADATS----SQLGHIG------------NQLHLEPGFGAKK------------------- 4034 +A++ S+ H+G + L+L P A K Sbjct: 97 VEASNVGEQSESRHLGVARFGIMNKPSADGLNLLPYGPATKLSNSDYSSLPQVQRHQIAG 156 Query: 4033 ----------------------TSDSEVLKNHDLVSEGNEVPGTHTGYKKGKSKKRPCDD 3920 SD+E+ +NHDL + E P HT YK+ S+ D Sbjct: 157 MNPAADHSAHNNCNILSLETNHRSDTELQENHDLNIKEYEDPVRHTEYKEDSSRNVTGDA 216 Query: 3919 KFGETLK---KHKN-EKKIEEAFSCPV-----KDPFNEGD-----------SRTFVSSKE 3800 + +L+ KH + KK + P K GD + VS K+ Sbjct: 217 QVNVSLEGSPKHGSVSKKRRVSLEGPAAKKRSKRKKRRGDEVHNHALKQDVASASVSDKD 276 Query: 3799 RTQPEKKIFPENTLEDNEKI 3740 +Q + + P+N+L + E++ Sbjct: 277 ASQ-QDNVVPDNSLLNQERV 295 Score = 84.3 bits (207), Expect = 4e-13 Identities = 56/134 (41%), Positives = 75/134 (55%) Frame = -2 Query: 697 VLNNSNNEKSLLATANTIFKDSSTESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAI 518 V+N S +KSLLA + TIF+D S SS DE GV+NSD S+GV A Sbjct: 1003 VVNCSKPKKSLLAKSGTIFEDDSNGSSDDEVGVDNSDGSTKSPSDNSLSSNYSDGVSTA- 1061 Query: 517 MESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTE 338 +NG + +++ N KP S + LD I R S +YK +KLTASQ QL+ + Sbjct: 1062 
---KQNGSVSSQIMDSARRNIIKPNS----GVKLDKIMRRSDNYKASKLTASQLQLEAPK 1114 Query: 337 SQPIDFVLDSQA*P 296 SQP++FV DS+A P Sbjct: 1115 SQPVEFVPDSEANP 1128 >gb|EYU30581.1| hypothetical protein MIMGU_mgv1a000837mg [Mimulus guttatus] Length = 967 Score = 94.7 bits (234), Expect = 3e-16 Identities = 60/173 (34%), Positives = 95/173 (54%), Gaps = 2/173 (1%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV D K+K +EH RCFP +GEI IH+LKV+R+A FYHL ++M V+SA N SWF+ Sbjct: 41 DTVSDFKRKTALEHMRCFPEIGEIHIHSLKVKRRAVFYHLPETMLVRSALQAGNSSWFLS 100 Query: 4108 ADATSSQLGHIG-NQLHLEPGFGAKKTSDSEVLKN-HDLVSEGNEVPGTHTGYKKGKSKK 3935 ADA+++ + N L L+P +G K + + N DL+ N + G S+K Sbjct: 101 ADASATPARQLNQNSLQLDPSYGVKMDAKNIDDNNCRDLLPVVNVLQALPMPLPDGVSEK 160 Query: 3934 RPCDDKFGETLKKHKNEKKIEEAFSCPVKDPFNEGDSRTFVSSKERTQPEKKI 3776 + E + + +K +E+A P + ++E D + + R + ++KI Sbjct: 161 ----NLASEMIPACEVDKSLEKAIEIP-SNSYSEED--CIGTGESRAKKKRKI 206 >ref|XP_004170528.1| PREDICTED: uncharacterized protein LOC101231424 [Cucumis sativus] Length = 843 Score = 91.3 bits (225), Expect = 4e-15 Identities = 67/193 (34%), Positives = 99/193 (51%), Gaps = 11/193 (5%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV D+K+KI EHP CFP +G I IHA+KV R+ +FYHLSDSM++KSAF G + SWF+ Sbjct: 38 DTVFDVKEKIEKEHPLCFPHLGAIKIHAIKVTRRGYFYHLSDSMYLKSAFVGYDDSWFLS 97 Query: 4108 ADATSSQLGHIGNQLHLEPGFGA-KKTSDSEVLKNHDLVSEGNEVPGTHTGYK---KGKS 3941 DA++ GH +P G+ + + S L N+D + V + + S Sbjct: 98 IDASTVD-GH-----STDPNTGSVARNNHSGHLPNYDAQKLKDIVAQQYVNEEAPDSCHS 151 Query: 3940 KKRPCDDKFGETLKKHKNEKKIEEAFSCPVKDPFNEG-DSRTFVSSKERTQPEKKI---- 3776 KR + E KN K + + + + FNE +S V R++ K I Sbjct: 152 SKRDLMIEKAEVTHSVKNRSKHQSSRTMNDCEGFNEKLESLPAVKQNHRSKKSKTILINE 211 Query: 3775 --FPENTLEDNEK 3743 F +T +DN++ Sbjct: 212 HKFANHTSDDNDQ 224 >ref|XP_004143053.1| PREDICTED: uncharacterized protein LOC101207835 [Cucumis sativus] Length = 1107 Score = 91.3 bits (225), Expect = 4e-15 Identities = 67/193 (34%), 
Positives = 99/193 (51%), Gaps = 11/193 (5%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV D+K+KI EHP CFP +G I IHA+KV R+ +FYHLSDSM++KSAF G + SWF+ Sbjct: 38 DTVFDVKEKIEKEHPLCFPHLGAIKIHAIKVTRRGYFYHLSDSMYLKSAFVGYDDSWFLS 97 Query: 4108 ADATSSQLGHIGNQLHLEPGFGA-KKTSDSEVLKNHDLVSEGNEVPGTHTGYK---KGKS 3941 DA++ GH +P G+ + + S L N+D + V + + S Sbjct: 98 IDASTVD-GH-----STDPNTGSVARNNHSGHLPNYDAQKLKDIVAQQYVNEEAPDSCHS 151 Query: 3940 KKRPCDDKFGETLKKHKNEKKIEEAFSCPVKDPFNEG-DSRTFVSSKERTQPEKKI---- 3776 KR + E KN K + + + + FNE +S V R++ K I Sbjct: 152 SKRDLMIEKAEVTHSVKNRSKHQSSRTMNDCEGFNEKLESLPAVKQNHRSKKSKTILINE 211 Query: 3775 --FPENTLEDNEK 3743 F +T +DN++ Sbjct: 212 HKFANHTSDDNDQ 224 Score = 85.1 bits (209), Expect = 3e-13 Identities = 52/126 (41%), Positives = 72/126 (57%) Frame = -2 Query: 682 NNEKSLLATANTIFKDSSTESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAIMESPE 503 + +++L T+ IFKD+S++SS+DE G+ +SDA +E++ Sbjct: 989 SQRRNVLLTSGGIFKDASSDSSEDEAGIVDSDASTKSPDNSQISDFSDGESNESVDLERT 1048 Query: 502 NGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTESQPID 323 N R++ N P P+N+TLD I RSS YKKAK+TASQ Q DDTESQP+D Sbjct: 1049 NIRRSRRK------NDPS----SPENLTLDTILRSSSRYKKAKMTASQLQQDDTESQPVD 1098 Query: 322 FVLDSQ 305 FV DSQ Sbjct: 1099 FVPDSQ 1104 >ref|XP_006601919.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Glycine max] Length = 1013 Score = 89.0 bits (219), Expect = 2e-14 Identities = 37/64 (57%), Positives = 49/64 (76%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV DLK+ I+ EHP CFP +G++ IH +KV RK +FYHL+DSM V+SAF G+N SWF+ Sbjct: 29 DTVSDLKKSILSEHPLCFPKIGQVQIHGIKVERKGYFYHLTDSMPVRSAFRGINGSWFLS 88 Query: 4108 ADAT 4097 D + Sbjct: 89 VDVS 92 Score = 63.9 bits (154), Expect = 6e-07 Identities = 52/132 (39%), Positives = 73/132 (55%) Frame = -2 Query: 697 VLNNSNNEKSLLATANTIFKDSSTESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAI 518 V + +KSLL+ A IFKD S+ +S DE V+NSDA S+G ++ Sbjct: 895 
VASKIQQKKSLLSGA--IFKDDSSGTSVDE--VDNSDASTRTPSYNPLLSDFSDGDSSSV 950 Query: 517 MESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTE 338 NG + +ENG ++ K + G K M++D + RSS YKKA+ TA SQL++T+ Sbjct: 951 ----SNGG---RSLENGARSSIKARLSGTKGMSIDHVLRSSSRYKKARTTA--SQLEETQ 1001 Query: 337 SQPIDFVLDSQA 302 SQP FV DS A Sbjct: 1002 SQP-KFVPDSLA 1012 >ref|XP_007044930.1| Uncharacterized protein isoform 2 [Theobroma cacao] gi|508708865|gb|EOY00762.1| Uncharacterized protein isoform 2 [Theobroma cacao] Length = 1033 Score = 89.0 bits (219), Expect = 2e-14 Identities = 57/132 (43%), Positives = 72/132 (54%) Frame = -2 Query: 697 VLNNSNNEKSLLATANTIFKDSSTESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAI 518 V+N+ N+KSLLATA IFK ESS D+ ++ D+ + Sbjct: 907 VVNSLENKKSLLATAGPIFKHDDKESSDDDVVDDSDDSTRSPLDNSSSDDDSNMN----- 961 Query: 517 MESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTE 338 S +NG + E GG P S PK+M+L I R+S SYKKAKLTASQSQLDD + Sbjct: 962 SSSSQNGSH-NSEGEGGGRERKNPGSTSPKSMSLHAILRNSSSYKKAKLTASQSQLDDLD 1020 Query: 337 SQPIDFVLDSQA 302 S P +FV DSQA Sbjct: 1021 SLPDEFVPDSQA 1032 >ref|XP_003552797.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Glycine max] Length = 1133 Score = 89.0 bits (219), Expect = 2e-14 Identities = 37/64 (57%), Positives = 49/64 (76%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV DLK+ I+ EHP CFP +G++ IH +KV RK +FYHL+DSM V+SAF G+N SWF+ Sbjct: 29 DTVSDLKKSILSEHPLCFPKIGQVQIHGIKVERKGYFYHLTDSMPVRSAFRGINGSWFLS 88 Query: 4108 ADAT 4097 D + Sbjct: 89 VDVS 92 Score = 63.9 bits (154), Expect = 6e-07 Identities = 52/132 (39%), Positives = 73/132 (55%) Frame = -2 Query: 697 VLNNSNNEKSLLATANTIFKDSSTESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAI 518 V + +KSLL+ A IFKD S+ +S DE V+NSDA S+G ++ Sbjct: 1015 VASKIQQKKSLLSGA--IFKDDSSGTSVDE--VDNSDASTRTPSYNPLLSDFSDGDSSSV 1070 Query: 517 MESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTE 338 NG + +ENG ++ K + G K M++D + RSS YKKA+ TA SQL++T+ Sbjct: 1071 
----SNGG---RSLENGARSSIKARLSGTKGMSIDHVLRSSSRYKKARTTA--SQLEETQ 1121 Query: 337 SQPIDFVLDSQA 302 SQP FV DS A Sbjct: 1122 SQP-KFVPDSLA 1132 >ref|XP_002314574.2| COP1-interacting protein 4.1 [Populus trichocarpa] gi|550329199|gb|EEF00745.2| COP1-interacting protein 4.1 [Populus trichocarpa] Length = 1153 Score = 87.8 bits (216), Expect = 4e-14 Identities = 39/65 (60%), Positives = 51/65 (78%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV DLK+KI+ EH CFP+ G+I IHALKV+R+ YHLS+SM VKSAFDG ++WF+ Sbjct: 25 DTVSDLKKKILHEHKLCFPTNGDIKIHALKVKRRGILYHLSESMFVKSAFDGTGKNWFVS 84 Query: 4108 ADATS 4094 DA++ Sbjct: 85 VDAST 89 >ref|XP_003601863.1| hypothetical protein MTR_3g086220 [Medicago truncatula] gi|355490911|gb|AES72114.1| hypothetical protein MTR_3g086220 [Medicago truncatula] Length = 1188 Score = 87.8 bits (216), Expect = 4e-14 Identities = 52/137 (37%), Positives = 77/137 (56%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV DLK+ I+ EH CFP +G+I IH +KV+R FYHLSDSM V+SAF GVN+SWF+ Sbjct: 30 DTVSDLKKLIVSEHASCFPKIGQIQIHGIKVKRNGHFYHLSDSMVVRSAFIGVNKSWFLS 89 Query: 4108 ADATSSQLGHIGNQLHLEPGFGAKKTSDSEVLKNHDLVSEGNEVPGTHTGYKKGKSKKRP 3929 D ++ + +L G+ + +S + N+ LV G + G P Sbjct: 90 VDVSALEDSRPNEKL---LPHGSLRQVESIGIVNNALVGSGGDNNGIIL----------P 136 Query: 3928 CDDKFGETLKKHKNEKK 3878 C+ +F L ++K +K+ Sbjct: 137 CNSQF--QLLENKKDKR 151 Score = 66.6 bits (161), Expect = 9e-08 Identities = 49/122 (40%), Positives = 67/122 (54%), Gaps = 2/122 (1%) Frame = -2 Query: 697 VLNNSNNEKSLLATANTIFKDSSTESSKDEGG--VNNSDAXXXXXXXXXXXXXXSEGVHE 524 V++ S +KSLL A TIFKD S+ SS DEG V+NSDA +G Sbjct: 1064 VVSKSQQKKSLLEGA-TIFKDDSSSSSDDEGQEKVDNSDASTRTPSDNSHANYL-DGYDS 1121 Query: 523 AIMESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDD 344 ++S +NG Y +R+EN + K G M++D + R S YK+A++TA SQLDD Sbjct: 1122 PGVDSRQNGSYDGERLENDERSPFKAGLSGTTKMSIDDVVRRSTRYKQARMTA--SQLDD 1179 Query: 343 TE 338 TE Sbjct: 1180 TE 1181 >ref|XP_004502350.1| 
PREDICTED: dentin sialophosphoprotein-like [Cicer arietinum] Length = 1421 Score = 86.7 bits (213), Expect = 9e-14 Identities = 41/77 (53%), Positives = 54/77 (70%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV DLK++I+ EH CFP VG+I IH +KV+R+ +FYHLSDSM V++AF G N++WF+ Sbjct: 30 DTVSDLKKRIVSEHTSCFPKVGQIQIHGIKVKRRGYFYHLSDSMVVRTAFIGFNKNWFLS 89 Query: 4108 ADATSSQLGHIGNQLHL 4058 D S LG HL Sbjct: 90 VDV--SALGECKQNDHL 104 Score = 78.6 bits (192), Expect = 2e-11 Identities = 54/133 (40%), Positives = 75/133 (56%), Gaps = 2/133 (1%) Frame = -2 Query: 694 LNNSNNEKSLLATANTIFKDSSTESSKDEGG--VNNSDAXXXXXXXXXXXXXXSEGVHEA 521 +NN+ +KSLL A IFKD S+ +S+DE V+NSDA +G Sbjct: 1292 VNNTQQKKSLLEGA--IFKDDSSSASEDEDEDQVDNSDASTRTPSINSLASDFLDGYDSP 1349 Query: 520 IMESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDT 341 ++S +NG + K +EN ++ K K M++D + RSS YKKAK+ A SQLD++ Sbjct: 1350 GLDSQQNGSHDGKSLENSKGSSLKASLSDTKGMSIDCVLRSSSRYKKAKIIA--SQLDES 1407 Query: 340 ESQPIDFVLDSQA 302 ESQP DFV DS A Sbjct: 1408 ESQP-DFVPDSFA 1419 >ref|XP_003537551.1| PREDICTED: dentin sialophosphoprotein-like [Glycine max] Length = 1131 Score = 86.7 bits (213), Expect = 9e-14 Identities = 37/64 (57%), Positives = 48/64 (75%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 DTV +LK+ I+ EHP CFP +G+I IH +KV RK +FYHL+DSM V+SAF GV SWF+ Sbjct: 29 DTVSNLKKSILSEHPLCFPKIGKIQIHGIKVERKGYFYHLTDSMPVRSAFSGVKESWFLT 88 Query: 4108 ADAT 4097 D + Sbjct: 89 VDVS 92 Score = 61.2 bits (147), Expect = 4e-06 Identities = 48/132 (36%), Positives = 71/132 (53%) Frame = -2 Query: 697 VLNNSNNEKSLLATANTIFKDSSTESSKDEGGVNNSDAXXXXXXXXXXXXXXSEGVHEAI 518 V + + EKSLL+ A IFKD S+ +S+DE V+NSDA S+G ++ Sbjct: 1014 VASKTQQEKSLLSGA--IFKDDSSSTSEDE--VDNSDASTRTPSYNPLMSDFSDGDSSSV 1069 Query: 517 MESPENGPYVRKRVENGGNNTPKPQSVGPKNMTLDMIFRSSKSYKKAKLTASQSQLDDTE 338 Y + ENG ++ K G K M++D + RSS +KKA+ + S L++T+ Sbjct: 1070 S-------YGGRSQENGARSSVKASFSGTKGMSIDDVLRSSSRFKKAR---TASLLEETQ 
1119 Query: 337 SQPIDFVLDSQA 302 SQP +FV DS A Sbjct: 1120 SQP-EFVPDSLA 1130 >ref|XP_006283028.1| hypothetical protein CARUB_v10004020mg [Capsella rubella] gi|482551733|gb|EOA15926.1| hypothetical protein CARUB_v10004020mg [Capsella rubella] Length = 1149 Score = 84.3 bits (207), Expect = 4e-13 Identities = 48/138 (34%), Positives = 74/138 (53%), Gaps = 7/138 (5%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 + + D K K+ EH R FP +GEI + ALKV+R+ FYH ++SM+V AFDGV R+WFIY Sbjct: 32 EIISDFKDKLRYEHKRAFPEIGEINVSALKVKRRRKFYHFAESMNVYKAFDGVGRNWFIY 91 Query: 4108 ADATSSQLGHI-------GNQLHLEPGFGAKKTSDSEVLKNHDLVSEGNEVPGTHTGYKK 3950 DA + + ++ +LE K+ + + + DL+ E G T + Sbjct: 92 VDAVRVEKSEVLAIMDADEHRSNLEMVEKKKEIAIVDGMHTKDLILE----EGLETEVVE 147 Query: 3949 GKSKKRPCDDKFGETLKK 3896 K++KR G+T +K Sbjct: 148 SKTRKRKIRSSDGKTSRK 165 >ref|XP_006857783.1| hypothetical protein AMTR_s00061p00209430 [Amborella trichopoda] gi|548861879|gb|ERN19250.1| hypothetical protein AMTR_s00061p00209430 [Amborella trichopoda] Length = 403 Score = 80.9 bits (198), Expect = 5e-12 Identities = 54/158 (34%), Positives = 82/158 (51%), Gaps = 21/158 (13%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 D+V +LK+ + EHP FP++GEI++ AL V+RK++FYHL DS+ +KSA +G+ SWF++ Sbjct: 25 DSVGNLKRILREEHPLSFPNLGEIMVQALMVKRKSYFYHLPDSLPIKSALEGLRGSWFLF 84 Query: 4108 ADATSSQLGHI--GNQLH--LEPGFGAKKTSDSEVLKNHDLVSEGNEVPGTHTGYKKGKS 3941 DA +L + GN + + S+ V + VSE +E H+ + K+ Sbjct: 85 MDAILMELPEVSKGNVISDTVRGTLHDMGKSNVTVQFHESNVSEKSE---KHSALRNQKA 141 Query: 3940 KKRPCDDKFGE-----------------TLKKHKNEKK 3878 KRP GE T K+ KNE K Sbjct: 142 GKRPRHQHNGEHVQIEGNNVLLCKRLDNTRKRRKNENK 179 >dbj|BAB32952.1| COP1-interacting protein 4.1 [Arabidopsis thaliana] Length = 371 Score = 80.5 bits (197), Expect = 6e-12 Identities = 47/153 (30%), Positives = 76/153 (49%), Gaps = 12/153 (7%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 + + D K K++ EH 
+ FP +GEI I A+KV+R+ FYH S+S++V AFDG++ WF+Y Sbjct: 33 EIISDFKDKVLKEHKQVFPEIGEINISAMKVKRRREFYHFSESLNVCKAFDGISTDWFMY 92 Query: 4108 ADATSSQLGHIGNQLHLEPGFGAKKTSDSEVLKNHDLVSEGNEVP------------GTH 3965 DA G K + +V +N +LV + E+P G Sbjct: 93 IDAVRVDKG--------------KTLAIMDVDQNLELVEKKEEIPNGKNTKDLTIGEGLE 138 Query: 3964 TGYKKGKSKKRPCDDKFGETLKKHKNEKKIEEA 3866 T + K++KR G+T +K ++ + A Sbjct: 139 TQLVEKKTRKRRIVSSGGKTSRKKSKDQSVVAA 171 >dbj|BAB32951.1| COP1-interacting protein 4 [Arabidopsis thaliana] Length = 915 Score = 80.5 bits (197), Expect = 6e-12 Identities = 61/217 (28%), Positives = 93/217 (42%), Gaps = 34/217 (15%) Frame = -2 Query: 4288 DTVRDLKQKIMIEHPRCFPSVGEIIIHALKVRRKAFFYHLSDSMHVKSAFDGVNRSWFIY 4109 + + D K +++ EH + FP +GEI I ALKV+R+ FYH SDS+HV AFDG++R+WF+Y Sbjct: 33 EIISDFKDRLLKEHKQVFPEIGEIQISALKVKRRRKFYHFSDSLHVCKAFDGISRNWFMY 92 Query: 4108 ADATSSQLG-------------------HIGNQLHLEPGFGAKKTSDSEVLKNHDLVSEG 3986 DA G I N L L K + E L+ ++V E Sbjct: 93 IDAIRVDKGKMYAIMAADQNLELVEKKEEIANGLVLVDDMNNKDLTSGEGLET-EVVEEK 151 Query: 3985 NE-----VPGTHTGYKKGKSKKRP---------CDDKFGETLKKHKNEKKIEEAFSCPVK 3848 PG +T KK K P C GE + + E+ + Sbjct: 152 TRKRRIISPGGNTSPKKSKVDLSPSAVAATTELCGKVKGEVVSQSCAVSPREKLDDVVTR 211 Query: 3847 DPFNEGD-SRTFVSSKERTQPEKKIFPENTLEDNEKI 3740 G+ S + K++T +++ E L N ++ Sbjct: 212 ADIESGEKSGLSMGEKQQTSVTERLLEEKNLTVNSEL 248