BLASTX nr result
ID: Catharanthus23_contig00013628
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Catharanthus23_contig00013628 (1896 letters) Database: ./nr 37,332,560 sequences; 13,225,080,153 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002269690.2| PREDICTED: uncharacterized protein LOC100253... 548 e-153 emb|CBI25860.3| unnamed protein product [Vitis vinifera] 546 e-152 ref|XP_006373553.1| hypothetical protein POPTR_0016s00330g [Popu... 538 e-150 gb|EOY29057.1| Zinc finger family protein isoform 2 [Theobroma c... 518 e-144 gb|EOY29056.1| Zinc finger family protein isoform 1 [Theobroma c... 516 e-143 ref|XP_002308735.1| hypothetical protein POPTR_0006s00300g [Popu... 512 e-142 ref|XP_002525234.1| conserved hypothetical protein [Ricinus comm... 508 e-141 ref|XP_006449970.1| hypothetical protein CICLE_v10014880mg [Citr... 508 e-141 ref|XP_006467242.1| PREDICTED: uncharacterized protein LOC102628... 505 e-140 ref|XP_006350231.1| PREDICTED: uncharacterized protein LOC102590... 504 e-140 ref|XP_004236639.1| PREDICTED: uncharacterized protein LOC101252... 504 e-140 gb|EXB64631.1| hypothetical protein L484_017963 [Morus notabilis] 490 e-135 gb|ESW19556.1| hypothetical protein PHAVU_006G134900g [Phaseolus... 486 e-134 ref|XP_003535004.1| PREDICTED: uncharacterized protein LOC100780... 486 e-134 ref|XP_003594365.1| hypothetical protein MTR_2g027780 [Medicago ... 486 e-134 gb|ABN08848.1| Zinc finger, RING-type [Medicago truncatula] 484 e-134 gb|EMJ13897.1| hypothetical protein PRUPE_ppa018588mg [Prunus pe... 479 e-132 ref|XP_006597631.1| PREDICTED: uncharacterized protein LOC100785... 477 e-132 ref|XP_003546214.1| PREDICTED: uncharacterized protein LOC100785... 477 e-132 ref|XP_004293426.1| PREDICTED: uncharacterized protein LOC101308... 471 e-130 >ref|XP_002269690.2| PREDICTED: uncharacterized protein LOC100253188 [Vitis vinifera] gi|147840889|emb|CAN66503.1| hypothetical protein VITISV_035496 [Vitis vinifera] Length = 523 Score = 548 bits (1412), Expect = e-153 Identities = 301/515 (58%), Positives = 359/515 (69%), Gaps = 27/515 (5%) Frame = -3 Query: 1579 CGSFFWSRQQQ--------AHALQTISNA-AAVSPDSRTFPNIICXXXXXXXXXXXXXNK 1427 CGSF SR+Q A TI+ A AA+S + N+ K Sbjct: 20 CGSF--SRRQSLVDPVLGDTSADATIATATAAISSSPKWGGNVSENAADEAESCNALLTK 77 Query: 1426 NLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPRD 1247 NLCAICLDPL+YS G+S PAIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLPR+ Sbjct: 78 NLCAICLDPLSYSTGTSPGPAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLPRN 137 Query: 1246 PKSH-CSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHIS 1070 CSL GNQ+DPIL+ILDDSIA+FRVHRRSFLRSARYDDDDP+EP+ + NH RLH+S Sbjct: 138 LNPPPCSLAGNQTDPILRILDDSIANFRVHRRSFLRSARYDDDDPIEPDHSPNHPRLHLS 197 Query: 1069 LLPIQLTHPTTIHFFSESAERLAAVAQKPFVCASST-----------------RAYLCVR 941 L+P+ LTHPT + +A + Q + +SS+ RAYL V+ Sbjct: 198 LIPLPLTHPTFHPYTLNNAFSYLSPLQN--LTSSSSLLPTPEHYSATGQTLYHRAYLSVK 255 Query: 940 LAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFPLKRMTS 761 LAHQ DL+LV SPNGPHLRL+KQ+MALV+FSLRP+DRLAIV YS+AA R+FPL+RMTS Sbjct: 256 LAHQQATDLVLVASPNGPHLRLLKQSMALVVFSLRPVDRLAIVTYSSAAARVFPLRRMTS 315 Query: 760 YGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTRSYHHFN 581 YGKRTALQVIDR+F G AD EGLKKG+KIL DR +KNPQSCILHLSDSP TRSYH N Sbjct: 316 YGKRTALQVIDRLFYMGQADPIEGLKKGIKILEDRAHKNPQSCILHLSDSP-TRSYHAMN 374 Query: 580 VEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKIKDGNRIIKLGELR 401 ++VP I I VMHEFE+FLAR+LGG IR+I+L+I D RII+LGELR Sbjct: 375 MQVP-IPIHRFHVGFGFGASNGFVMHEFEEFLARLLGGVIRDIQLRIGDDGRIIRLGELR 433 Query: 400 DGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQHAAGDRERSDGLD 221 GEERRIPL + H+CV YSY++ GG ++ + GET+VC D+ + G D Sbjct: 434 GGEERRIPLDMGDCEHVCVGYSYME---GGIDDCIRTGETVVCAEDKTETSESAEVGGGD 490 Query: 220 GTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 + G GRTS+VE+WDYHD +MARRWAKHLHGYR Sbjct: 491 VSLG---GRTSSVESWDYHDPYMARRWAKHLHGYR 522 >emb|CBI25860.3| unnamed protein product [Vitis vinifera] Length = 518 Score = 546 bits (1407), Expect = e-152 Identities = 286/456 (62%), Positives = 339/456 (74%), Gaps = 18/456 (3%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPR 1250 KNLCAICLDPL+YS G+S PAIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLPR Sbjct: 72 KNLCAICLDPLSYSTGTSPGPAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLPR 131 Query: 1249 DPKSH-CSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHI 1073 + CSL GNQ+DPIL+ILDDSIA+FRVHRRSFLRSARYDDDDP+EP+ + NH RLH+ Sbjct: 132 NLNPPPCSLAGNQTDPILRILDDSIANFRVHRRSFLRSARYDDDDPIEPDHSPNHPRLHL 191 Query: 1072 SLLPIQLTHPTTIHFFSESAERLAAVAQKPFVCASST-----------------RAYLCV 944 SL+P+ LTHPT + +A + Q + +SS+ RAYL V Sbjct: 192 SLIPLPLTHPTFHPYTLNNAFSYLSPLQN--LTSSSSLLPTPEHYSATGQTLYHRAYLSV 249 Query: 943 RLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFPLKRMT 764 +LAHQ DL+LV SPNGPHLRL+KQ+MALV+FSLRP+DRLAIV YS+AA R+FPL+RMT Sbjct: 250 KLAHQQATDLVLVASPNGPHLRLLKQSMALVVFSLRPVDRLAIVTYSSAAARVFPLRRMT 309 Query: 763 SYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTRSYHHF 584 SYGKRTALQVIDR+F G AD EGLKKG+KIL DR +KNPQSCILHLSDSP TRSYH Sbjct: 310 SYGKRTALQVIDRLFYMGQADPIEGLKKGIKILEDRAHKNPQSCILHLSDSP-TRSYHAM 368 Query: 583 NVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKIKDGNRIIKLGEL 404 N++VP I I VMHEFE+FLAR+LGG IR+I+L+I D RII+LGEL Sbjct: 369 NMQVP-IPIHRFHVGFGFGASNGFVMHEFEEFLARLLGGVIRDIQLRIGDDGRIIRLGEL 427 Query: 403 RDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQHAAGDRERSDGL 224 R GEERRIPL + H+CV YSY++ GG ++ + GET+VC D+ + G Sbjct: 428 RGGEERRIPLDMGDCEHVCVGYSYME---GGIDDCIRTGETVVCAEDKTETSESAEVGGG 484 Query: 223 DGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 D + G GRTS+VE+WDYHD +MARRWAKHLHGYR Sbjct: 485 DVSLG---GRTSSVESWDYHDPYMARRWAKHLHGYR 517 >ref|XP_006373553.1| hypothetical protein POPTR_0016s00330g [Populus trichocarpa] gi|550320463|gb|ERP51350.1| hypothetical protein POPTR_0016s00330g [Populus trichocarpa] Length = 539 Score = 538 bits (1385), Expect = e-150 Identities = 286/451 (63%), Positives = 338/451 (74%), Gaps = 13/451 (2%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPR 1250 KNLCAICLDPL+YS G+S AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLPR Sbjct: 94 KNLCAICLDPLSYSTGNSPGQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLPR 153 Query: 1249 DPKSHCSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHIS 1070 + CSL N +DPI QILDDSIA+FRVHRRSFLRSARYDDDDP+EP+ T NH RL S Sbjct: 154 NLNIPCSLSCNHADPIFQILDDSIANFRVHRRSFLRSARYDDDDPIEPDQTPNHPRLDFS 213 Query: 1069 LLPIQLT---HPTTIHF---FSESAERLAAVAQKPFVCASS-----TRAYLCVRLAHQPG 923 L+PI LT HP T H+ ++ ++ L + + C SS T AYL VRLA+Q Sbjct: 214 LVPIPLTIFHHPRTQHYQHHYNLTSSSLLSHPPASYACTSSSNRRTTAAYLSVRLANQRP 273 Query: 922 IDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFPLKRMTSYGKRTA 743 D++LV SPNGPHLRL+KQ+MALV+FSLRP+DRLAIV YS+AA R+FPL+RMTSYGKRTA Sbjct: 274 TDMVLVASPNGPHLRLLKQSMALVVFSLRPIDRLAIVTYSSAAARVFPLRRMTSYGKRTA 333 Query: 742 LQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTRSYHHFNVEVPSI 563 LQVIDR+F G AD EGLKKG+KIL DR +KNPQS ILHLSDSP TRSY+ N++VP I Sbjct: 334 LQVIDRLFYMGQADPIEGLKKGIKILEDRAHKNPQSTILHLSDSP-TRSYNAINLQVP-I 391 Query: 562 TIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKIKDGNRIIKLGELRDGEERR 383 I VMHEFE+FLAR+LGG IR+++L+I D RII+LGELR GEERR Sbjct: 392 PIHRFHVGFGFGTSNGFVMHEFEEFLARLLGGVIRDVQLRIGDEARIIRLGELRGGEERR 451 Query: 382 IPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQHAAG--DRERSDGLDGTRG 209 I L + +SG++CV YSYID GG EE + GET V + D+ A DRE G D + Sbjct: 452 IVLEVGESGYVCVGYSYID---GGVEEFNRTGETAVALGDKREANEDDREAVVGRDSS-S 507 Query: 208 GISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 + GR+S+VE+WDYHD +MARRWAKHLHGYR Sbjct: 508 ILGGRSSSVESWDYHDPYMARRWAKHLHGYR 538 >gb|EOY29057.1| Zinc finger family protein isoform 2 [Theobroma cacao] Length = 604 Score = 518 bits (1333), Expect = e-144 Identities = 274/473 (57%), Positives = 335/473 (70%), Gaps = 35/473 (7%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPR 1250 KNLCAICL+ L+YS GSS AIFTAQCSHAFHF+CISSN+ HGS+TCPICRAHWTQLPR Sbjct: 140 KNLCAICLEVLSYSSGSSPGQAIFTAQCSHAFHFSCISSNVRHGSITCPICRAHWTQLPR 199 Query: 1249 DPKSH-CSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHI 1073 + CSL NQSDP+ +ILDDSIA+FRVHRRSFLRSARYDDDDP+EP+ T NH RL + Sbjct: 200 NLNPPACSLSCNQSDPVFRILDDSIATFRVHRRSFLRSARYDDDDPIEPDHTQNHPRLDL 259 Query: 1072 SLLPIQ---LTHPTTI---------------------------HFFSESAERLAAVAQKP 983 +L+P+Q LTHP HF S S+ L ++ Sbjct: 260 ALIPLQPAVLTHPCCFRRQSCSHSSSLQMPGIGHNSNHHHHHHHFSSSSSSSLLLQPRQT 319 Query: 982 --FVCASSTR--AYLCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAI 815 ++C+SS R AYLC++L H D++LV SPNGPHLRL+KQ+MALV+FSLRP+DRLAI Sbjct: 320 PSYLCSSSNRRPAYLCIKLTHPRATDMVLVASPNGPHLRLLKQSMALVVFSLRPIDRLAI 379 Query: 814 VIYSTAAMRMFPLKRMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQS 635 V YS+AA R+FPL+RMTSYGKR+ALQVIDR+F G AD EGLKKG+KIL DR +KNPQS Sbjct: 380 VTYSSAAARVFPLRRMTSYGKRSALQVIDRLFYMGQADPIEGLKKGIKILEDRAHKNPQS 439 Query: 634 CILHLSDSPTTRSYHHFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIRE 455 CILHLSDSP TRSYH N+++P I I VMHEFE+FL ++LGG IR+ Sbjct: 440 CILHLSDSP-TRSYHAMNLQLP-IPIHRFHVGFGFGTSNGFVMHEFEEFLRQLLGGVIRD 497 Query: 454 IKLKIKDGNRIIKLGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLV 275 I+L+I + +II+LG+LR GEERR+ L L + H+ V YSY++ GGN+E K GET+V Sbjct: 498 IQLRIGEEAKIIRLGDLRGGEERRVLLDLGECVHVSVGYSYVE---GGNDECIKTGETMV 554 Query: 274 CIPDQHAAGDRERSDGLDGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 I D+ D +R D + GRTS+VE WDYHD +MARRWAKHLHGYR Sbjct: 555 SIEDKRETDDGDR----DTAISIVGGRTSSVEGWDYHDPYMARRWAKHLHGYR 603 >gb|EOY29056.1| Zinc finger family protein isoform 1 [Theobroma cacao] Length = 605 Score = 516 bits (1329), Expect = e-143 Identities = 273/473 (57%), Positives = 335/473 (70%), Gaps = 35/473 (7%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPR 1250 +NLCAICL+ L+YS GSS AIFTAQCSHAFHF+CISSN+ HGS+TCPICRAHWTQLPR Sbjct: 141 QNLCAICLEVLSYSSGSSPGQAIFTAQCSHAFHFSCISSNVRHGSITCPICRAHWTQLPR 200 Query: 1249 DPKSH-CSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHI 1073 + CSL NQSDP+ +ILDDSIA+FRVHRRSFLRSARYDDDDP+EP+ T NH RL + Sbjct: 201 NLNPPACSLSCNQSDPVFRILDDSIATFRVHRRSFLRSARYDDDDPIEPDHTQNHPRLDL 260 Query: 1072 SLLPIQ---LTHPTTI---------------------------HFFSESAERLAAVAQKP 983 +L+P+Q LTHP HF S S+ L ++ Sbjct: 261 ALIPLQPAVLTHPCCFRRQSCSHSSSLQMPGIGHNSNHHHHHHHFSSSSSSSLLLQPRQT 320 Query: 982 --FVCASSTR--AYLCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAI 815 ++C+SS R AYLC++L H D++LV SPNGPHLRL+KQ+MALV+FSLRP+DRLAI Sbjct: 321 PSYLCSSSNRRPAYLCIKLTHPRATDMVLVASPNGPHLRLLKQSMALVVFSLRPIDRLAI 380 Query: 814 VIYSTAAMRMFPLKRMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQS 635 V YS+AA R+FPL+RMTSYGKR+ALQVIDR+F G AD EGLKKG+KIL DR +KNPQS Sbjct: 381 VTYSSAAARVFPLRRMTSYGKRSALQVIDRLFYMGQADPIEGLKKGIKILEDRAHKNPQS 440 Query: 634 CILHLSDSPTTRSYHHFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIRE 455 CILHLSDSP TRSYH N+++P I I VMHEFE+FL ++LGG IR+ Sbjct: 441 CILHLSDSP-TRSYHAMNLQLP-IPIHRFHVGFGFGTSNGFVMHEFEEFLRQLLGGVIRD 498 Query: 454 IKLKIKDGNRIIKLGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLV 275 I+L+I + +II+LG+LR GEERR+ L L + H+ V YSY++ GGN+E K GET+V Sbjct: 499 IQLRIGEEAKIIRLGDLRGGEERRVLLDLGECVHVSVGYSYVE---GGNDECIKTGETMV 555 Query: 274 CIPDQHAAGDRERSDGLDGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 I D+ D +R D + GRTS+VE WDYHD +MARRWAKHLHGYR Sbjct: 556 SIEDKRETDDGDR----DTAISIVGGRTSSVEGWDYHDPYMARRWAKHLHGYR 604 >ref|XP_002308735.1| hypothetical protein POPTR_0006s00300g [Populus trichocarpa] gi|222854711|gb|EEE92258.1| hypothetical protein POPTR_0006s00300g [Populus trichocarpa] Length = 542 Score = 512 bits (1318), Expect = e-142 Identities = 278/455 (61%), Positives = 327/455 (71%), Gaps = 17/455 (3%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPR 1250 KNLCAICLDPL+YS +S AIFTAQC HAFHFACISSN+ HGSVTCPICRA WTQLPR Sbjct: 96 KNLCAICLDPLSYSTSNSPGQAIFTAQCRHAFHFACISSNVRHGSVTCPICRARWTQLPR 155 Query: 1249 DPKSHCSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHIS 1070 + CSL NQ+DPILQILDDSIA+FRVHR SFLRSARYDDDDP+EP+ T N+ RL S Sbjct: 156 NLNMPCSLSCNQTDPILQILDDSIANFRVHRHSFLRSARYDDDDPIEPDQTPNYPRLDFS 215 Query: 1069 LLPIQLT---HPTTIH------------FFSESAERLAAVAQKPFVCASSTRAYLCVRLA 935 ++PI LT HP T H FFS A + + ST AYL V+LA Sbjct: 216 IVPIPLTIFHHPRTQHYQHHHNLTAGSSFFSHPPASYACTSSSNRI---STAAYLSVKLA 272 Query: 934 HQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFPLKRMTSYG 755 +Q DLILV SPNGPHLRL+KQ+MALV+FSLRP+DRLAIV YS+AA R+FPL+RMT YG Sbjct: 273 NQRPTDLILVASPNGPHLRLLKQSMALVVFSLRPIDRLAIVTYSSAAARVFPLRRMTFYG 332 Query: 754 KRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTRSYHHFNVE 575 KRTALQVIDR++ G AD EGLKKG+KIL DR +KNPQS ILHLSDSP TRSYH N++ Sbjct: 333 KRTALQVIDRLYFMGQADPIEGLKKGIKILEDRAHKNPQSTILHLSDSP-TRSYHTINMQ 391 Query: 574 VPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKIKDGNRIIKLGELRDG 395 VP I I VMHEFE+FLAR+LGG IR+++L+I D RI +LGELR G Sbjct: 392 VP-IPIHRFHVGFGFGTSNGFVMHEFEEFLARMLGGVIRDVQLRIGDEARITRLGELRGG 450 Query: 394 EERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQHAAGD--RERSDGLD 221 EERRI L L +S ++ V YSYID G G E + GET+V + ++ A + RE G D Sbjct: 451 EERRIVLELGESNYVSVGYSYIDGGVG---ECNRTGETVVTLGEKWEANEDGREAVAGRD 507 Query: 220 GTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 + GR+S+VE+WDYHD +MARRWAKHLHGYR Sbjct: 508 SS-SIFGGRSSSVESWDYHDPYMARRWAKHLHGYR 541 >ref|XP_002525234.1| conserved hypothetical protein [Ricinus communis] gi|223535531|gb|EEF37200.1| conserved hypothetical protein [Ricinus communis] Length = 520 Score = 508 bits (1308), Expect = e-141 Identities = 267/451 (59%), Positives = 322/451 (71%), Gaps = 13/451 (2%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPR 1250 +NLCAICL+ L+YS G+S AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLPR Sbjct: 77 QNLCAICLEALSYSTGNSPGQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLPR 136 Query: 1249 DPKSHCSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHIS 1070 + CSL NQSDPI QILDDSIA+FRVHRRSFLRSARY+DDDP+EP+ TS+H RL S Sbjct: 137 NLNPPCSLSCNQSDPIFQILDDSIATFRVHRRSFLRSARYNDDDPIEPDDTSSHPRLDFS 196 Query: 1069 LLPI----------QLTHPTTIHFFSESAERLAAVAQKPFVCASSTR---AYLCVRLAHQ 929 L+ I Q HP H + S+ L + P+ S+ R AYL V+ Q Sbjct: 197 LVSIPPLPFRHRCTQYQHP---HHITGSSSSLFSYPPTPYSYTSNRRLAAAYLSVKSIQQ 253 Query: 928 PGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFPLKRMTSYGKR 749 +DL+LV SPNGPHLRL+KQ+MALV+FSLRP+DRLA+V YS+ A R+FPL+RMTSYGKR Sbjct: 254 RAMDLVLVASPNGPHLRLVKQSMALVVFSLRPIDRLAVVTYSSFAARVFPLRRMTSYGKR 313 Query: 748 TALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTRSYHHFNVEVP 569 TALQVIDR+F G AD EGLKKG+KIL DR +KNPQSC+LHLSDSP TRSYH FN+++P Sbjct: 314 TALQVIDRLFFMGQADPMEGLKKGIKILEDRAHKNPQSCLLHLSDSP-TRSYHTFNMQIP 372 Query: 568 SITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKIKDGNRIIKLGELRDGEE 389 I VMHEFE+FL R+LGG IR+++L+I + RII+LGELR EE Sbjct: 373 -FPIHRFHVGFGFGTSNGFVMHEFEEFLVRLLGGVIRDVQLRIGEEGRIIRLGELRGNEE 431 Query: 388 RRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQHAAGDRERSDGLDGTRG 209 RRI L L + H+ V YSY++ GN+E GET+V + ++ D R Sbjct: 432 RRILLDLGEREHVFVGYSYVED---GNDECAITGETIVSVAEKREPHDTSREAPAGRDVN 488 Query: 208 GISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 I GRTS+VE+WDYHD +MARRWAKHLHGYR Sbjct: 489 IIGGRTSSVESWDYHDPYMARRWAKHLHGYR 519 >ref|XP_006449970.1| hypothetical protein CICLE_v10014880mg [Citrus clementina] gi|557552581|gb|ESR63210.1| hypothetical protein CICLE_v10014880mg [Citrus clementina] Length = 530 Score = 508 bits (1307), Expect = e-141 Identities = 278/477 (58%), Positives = 334/477 (70%), Gaps = 39/477 (8%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPR 1250 KNLCAICL+ L+YS G S AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLPR Sbjct: 59 KNLCAICLEALSYSSGGSPGQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLPR 118 Query: 1249 DP-KSHCSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHI 1073 + + CS+ NQ+DP+ +ILDDSIA+FRVHRRSFLRSARYDDDDP+EP+ ++NH RL Sbjct: 119 NLYPAACSISCNQNDPVFRILDDSIATFRVHRRSFLRSARYDDDDPIEPDHSTNHPRLDF 178 Query: 1072 SLLPIQLT----------------------------------HPTTIHFFSESAERLAAV 995 SL P+ T +PT+ S S + Sbjct: 179 SLTPVPPTLLSHSCGFQHHPRAHSSRHTSGNGQTPHHLHHHNYPTSSSSSSSSLLFQTPI 238 Query: 994 AQKP-FVCASSTR--AYLCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDR 824 Q P +V A S R AYL V+LAHQP DL+LV SPNGPHLRL+KQ+MALV+FSLRP+DR Sbjct: 239 GQTPSYVRAPSNRRAAYLSVKLAHQPATDLVLVASPNGPHLRLLKQSMALVVFSLRPIDR 298 Query: 823 LAIVIYSTAAMRMFPLKRMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKN 644 LAIV YS+AA R+FPLKRMTSYGKR ALQVIDR+F G AD EGLKKG+KIL DR +KN Sbjct: 299 LAIVTYSSAAARVFPLKRMTSYGKRMALQVIDRLFYMGQADPIEGLKKGIKILEDRAHKN 358 Query: 643 PQSCILHLSDSPTTRSYHHFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGA 464 PQSCILHLSD+P TR+YH N++VP + VMHEFE+FLA +LGG Sbjct: 359 PQSCILHLSDTP-TRTYHAINLQVP-FPVHRFHVGFGFGSSNGFVMHEFEEFLATLLGGN 416 Query: 463 IREIKLKIKDGNRIIKLGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGE 284 +REI+L+I++ RII+LGELR GEERRI L L + + VEYSY++ GG +E + GE Sbjct: 417 VREIQLRIREEARIIRLGELRGGEERRILLDLGECEDVRVEYSYVE---GGIDECIRTGE 473 Query: 283 TLVCIPDQHAAGDRERSDGLDGTRGG-ISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 TLV I D+ A + ER + + GT I GRTS+VE+WDYHD +MARRWAKHLHGYR Sbjct: 474 TLVNIEDKREASN-ERIEPVSGTDVSIIGGRTSSVESWDYHDPYMARRWAKHLHGYR 529 >ref|XP_006467242.1| PREDICTED: uncharacterized protein LOC102628285 [Citrus sinensis] Length = 529 Score = 505 bits (1301), Expect = e-140 Identities = 278/477 (58%), Positives = 333/477 (69%), Gaps = 39/477 (8%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPR 1250 KNLCAICL+ L+YS G S AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLPR Sbjct: 58 KNLCAICLEALSYSSGGSPGQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLPR 117 Query: 1249 DP-KSHCSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHI 1073 + + CS+ NQ+DP+ +ILDDSIA+FRVHRRSFLRSARYDDDDP+EP+ ++NH RL Sbjct: 118 NLYPAACSISCNQNDPVFRILDDSIATFRVHRRSFLRSARYDDDDPIEPDHSTNHPRLDF 177 Query: 1072 SLLPIQLT----------------------------------HPTTIHFFSESAERLAAV 995 SL P+ T +PT+ S S + Sbjct: 178 SLTPVPPTLLSHSCGFQHHPRAHSSWHTSGNGQTPHHLHHHNYPTSSSSSSSSLLFQTPI 237 Query: 994 AQKP-FVCASSTR--AYLCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDR 824 Q P +V ASS R AYL V+LAHQP DL+LV SPNGPHLRL+KQ+MALV+FSLRP DR Sbjct: 238 GQTPSYVRASSNRRAAYLSVKLAHQPATDLVLVASPNGPHLRLLKQSMALVVFSLRPNDR 297 Query: 823 LAIVIYSTAAMRMFPLKRMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKN 644 LAIV YS+AA R+FPLKRMTSYGKR ALQVIDR+F G AD EGLKKG+KIL DR +KN Sbjct: 298 LAIVTYSSAAARVFPLKRMTSYGKRMALQVIDRLFYMGQADPIEGLKKGIKILEDRAHKN 357 Query: 643 PQSCILHLSDSPTTRSYHHFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGA 464 PQSCILHLSD+P TR+YH N++VP + VMHEFE+FLA +LGG Sbjct: 358 PQSCILHLSDTP-TRTYHAINLQVP-FPVHRFHVGFGFGSSNGFVMHEFEEFLATLLGGN 415 Query: 463 IREIKLKIKDGNRIIKLGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGE 284 ++EI+L+I + RII+LGELR GEERRI L L + + VEYSY++ GG +E + GE Sbjct: 416 VQEIQLRIGEEARIIRLGELRGGEERRILLDLGECEDVRVEYSYVE---GGIDECIRTGE 472 Query: 283 TLVCIPDQHAAGDRERSDGLDGTRGG-ISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 TLV I D+ A + ER + + GT I GRTS+VE+WDYHD +MARRWAKHLHGYR Sbjct: 473 TLVNIEDKREASN-ERIEPVSGTDVSIIGGRTSSVESWDYHDPYMARRWAKHLHGYR 528 >ref|XP_006350231.1| PREDICTED: uncharacterized protein LOC102590531 [Solanum tuberosum] Length = 530 Score = 504 bits (1299), Expect = e-140 Identities = 273/459 (59%), Positives = 325/459 (70%), Gaps = 21/459 (4%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPR 1250 KN C ICLD L+YSC SS AIFTAQCSHAFHFACISSNI HG+VTCP+CRAHWTQLPR Sbjct: 84 KNFCPICLDSLSYSCDSSPGQAIFTAQCSHAFHFACISSNIRHGNVTCPVCRAHWTQLPR 143 Query: 1249 DPKSHCSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHIS 1070 K H S H NQ+DPILQILD+SIA+ RVHRRSFLRSARYDDDDPVEP+ TSN RLH+S Sbjct: 144 TLKMHYSPHSNQADPILQILDESIATSRVHRRSFLRSARYDDDDPVEPDHTSNIHRLHLS 203 Query: 1069 LLPIQLTHPTTI--------------HFFSESAERLAAVAQ-------KPFVCASSTRAY 953 L P+ H T++ H+ E + AQ P VC+SS+ AY Sbjct: 204 LSPV--PHSTSVFDPCSNPKSSFSSCHYPQHCLEPSQSAAQHFVETGLSPLVCSSSSSAY 261 Query: 952 LCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFPLK 773 LC++LAHQP DL+LV SPNGPHLRLMKQAMA V+FSLRP+DRLAIV YS+AA R+FPLK Sbjct: 262 LCLKLAHQPATDLVLVASPNGPHLRLMKQAMAFVVFSLRPIDRLAIVTYSSAAARIFPLK 321 Query: 772 RMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTRSY 593 MTSYGKRTALQVIDR+F G AD EGLKKGVKIL +R ++N S ILHLSD+P TRS+ Sbjct: 322 CMTSYGKRTALQVIDRLFYMGQADPVEGLKKGVKILRERSHQNTHSFILHLSDNP-TRSF 380 Query: 592 HHFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKIKDGNRIIKL 413 H F++E+P ITI VMHEFE FLA++L GA+R+I L I + RI++L Sbjct: 381 HGFHLELP-ITIHKFHVGFGFGTSNGFVMHEFERFLAKILCGAVRDIALMIGEDTRIVRL 439 Query: 412 GELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQHAAGDRERS 233 GELR GEERRIPL L + + V Y+YID ++S K GE +V + D+ +E + Sbjct: 440 GELRGGEERRIPLLLEELDKVRVVYTYIDC---MMDDSVKTGEVVVGVGDR-----KELT 491 Query: 232 DGLDGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 D +D GR+S+VE W+YHD FMARRWAK LHGYR Sbjct: 492 DTIDIVE-NTGGRSSSVEGWEYHDPFMARRWAKRLHGYR 529 >ref|XP_004236639.1| PREDICTED: uncharacterized protein LOC101252274 [Solanum lycopersicum] Length = 532 Score = 504 bits (1299), Expect = e-140 Identities = 271/459 (59%), Positives = 324/459 (70%), Gaps = 21/459 (4%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPR 1250 KN C ICLD L+YSC SS AIFTAQCSHAFHFACISSNI HG+VTCP+CRAHWTQLPR Sbjct: 86 KNFCPICLDSLSYSCDSSPGQAIFTAQCSHAFHFACISSNIRHGNVTCPVCRAHWTQLPR 145 Query: 1249 DPKSHCSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLHIS 1070 K H S H N++DPILQILD+SIA+ RVHRRSFLRSARYDDDDPVEP+ TSN RLH+S Sbjct: 146 TLKMHYSPHSNRADPILQILDESIATSRVHRRSFLRSARYDDDDPVEPDRTSNIHRLHLS 205 Query: 1069 LLPIQLTHPTTI--------------HF-------FSESAERLAAVAQKPFVCASSTRAY 953 L P+ H T++ H+ +A+ Q P VC+SS+ AY Sbjct: 206 LSPV--PHSTSVFDPCSNPKSSFSSCHYPQHCLESSQSAAQHFVETGQSPLVCSSSSSAY 263 Query: 952 LCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFPLK 773 LC++LAHQP DL+LV SPNGPHLRLMKQAMA V+FSLRP+DRLAIV YS+AA R+FPLK Sbjct: 264 LCLKLAHQPATDLVLVASPNGPHLRLMKQAMAFVVFSLRPIDRLAIVTYSSAAARIFPLK 323 Query: 772 RMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTRSY 593 MTSYGKRTALQVIDR+F G AD EGLKKGVKIL +R ++N S ILHLSD+P TRS+ Sbjct: 324 CMTSYGKRTALQVIDRLFYMGQADPVEGLKKGVKILRERSHQNTHSFILHLSDNP-TRSF 382 Query: 592 HHFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKIKDGNRIIKL 413 H F++E+P ITI VMHEFE FLA++L GA+R+I L I + RI++L Sbjct: 383 HGFHLELP-ITIHKFHVGFGFGTSNGFVMHEFERFLAKILCGAVRDIALMIGEDTRIVRL 441 Query: 412 GELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQHAAGDRERS 233 GELR GEERRIPL L + V Y+YID ++S K GE +V + D+ +E + Sbjct: 442 GELRGGEERRIPLLLEDMDKVRVVYTYIDC---MMDDSVKTGEVVVGVGDR-----KELT 493 Query: 232 DGLDGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 D +D GR+S+VE W+YHD FMARRWAK LHGYR Sbjct: 494 DTIDIVE-NTGGRSSSVEGWEYHDPFMARRWAKRLHGYR 531 >gb|EXB64631.1| hypothetical protein L484_017963 [Morus notabilis] Length = 569 Score = 490 bits (1261), Expect = e-135 Identities = 273/478 (57%), Positives = 326/478 (68%), Gaps = 40/478 (8%) Frame = -3 Query: 1429 KNLCAICLDPLNY-SCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLP 1253 KNLCAICLDPL+Y S G S AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLP Sbjct: 97 KNLCAICLDPLSYNSRGGSPSQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLP 156 Query: 1252 RDPKSHC-SLHG-NQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRL 1079 R+ C SL NQ+DPIL+ILDDSIA+FR+HRRSFLRSARYDDDDP+EP+ N RL Sbjct: 157 RNLNPPCGSLSSCNQNDPILRILDDSIATFRIHRRSFLRSARYDDDDPIEPDDMPNCPRL 216 Query: 1078 HISLLPIQLTHPTT----------IH----------FFSESAERLAAVAQKPFVCASSTR 959 H+SL+P+ T PTT +H F + +L+ V +C SS + Sbjct: 217 HLSLVPVPTTSPTTNFQPYPYHQNLHAHPPICGSSSFLQSPSRQLSYV-----MCTSSNK 271 Query: 958 AYLCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFP 779 YL V+LA+Q DL+LV SPNGPHLRL+KQ MALV+FSLRP+DRLAIV YS+AA R+FP Sbjct: 272 GYLSVKLANQRATDLVLVASPNGPHLRLLKQCMALVVFSLRPIDRLAIVTYSSAAARVFP 331 Query: 778 LKRMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTR 599 L+RMTSYGKRTALQVIDR+F G AD EGLKKG+KIL DR +KNP S ILHLSDSPT Sbjct: 332 LRRMTSYGKRTALQVIDRLFYMGQADPVEGLKKGIKILQDRAHKNPDSSILHLSDSPTQS 391 Query: 598 SYHHFNVEVP-SITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKIKDG--- 431 + +VE+P + +MHEFE+FLA +LGG IR+I+L+I+ G Sbjct: 392 YHAAMDVEIPIPVHRFHVGFGFGIGTSNGFIMHEFEEFLAGLLGGVIRDIQLRIRVGEES 451 Query: 430 --NRIIKLGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQH 257 +RI+++GELR EERRI L L +SGH CVEYSY + G +E GETLV + D Sbjct: 452 SCSRIVRIGELRGDEERRILLDLGESGHACVEYSYCEDVGEVDERFI-TGETLVSLGDSK 510 Query: 256 AAGDRERSDG-----LDGTRGG------ISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 + E S G T GG GR S+VE+WDYHD +MARRWAKHLHGYR Sbjct: 511 STNAAEASAGPAVAEAGATGGGGRRDAIGGGRPSSVESWDYHDPYMARRWAKHLHGYR 568 >gb|ESW19556.1| hypothetical protein PHAVU_006G134900g [Phaseolus vulgaris] Length = 559 Score = 486 bits (1251), Expect = e-134 Identities = 270/471 (57%), Positives = 327/471 (69%), Gaps = 33/471 (7%) Frame = -3 Query: 1429 KNLCAICLDPLNY-SCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLP 1253 KNLCAICLDPL+Y S GSS AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLP Sbjct: 91 KNLCAICLDPLSYHSKGSSPGQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLP 150 Query: 1252 RDPKSHCS---LHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLR 1082 R+ ++ NQSDPIL+ILDDSIA+FRVHRRS LRSARYDDDDPVEP+ T + + Sbjct: 151 RNLNNNLGGPLTSTNQSDPILRILDDSIATFRVHRRSLLRSARYDDDDPVEPDDTPDSPK 210 Query: 1081 LHISLLPIQLTHPTTIH--------------------FFSESAERLAAVAQKPFV-CASS 965 L SL+PI PT+ H S S+ + Q P++ C SS Sbjct: 211 LCFSLVPIPPNAPTSYHPALQVTKHASCPCHLSLHPLTCSSSSLVQSPPMQTPYIMCPSS 270 Query: 964 TRAYLCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRM 785 RAYL V+L H+ DL+LV SPNGPHLRL+KQAMALV+FSLR +DRLAIV YS+AA R+ Sbjct: 271 NRAYLSVKLTHERATDLVLVASPNGPHLRLLKQAMALVVFSLRQIDRLAIVTYSSAAARV 330 Query: 784 FPLKRMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPT 605 FPL+RMTSYGKRTALQVIDR+F G AD EGLKKG+KIL DRV+KNP+SCILHLSD+P Sbjct: 331 FPLRRMTSYGKRTALQVIDRLFYMGQADPVEGLKKGIKILEDRVHKNPESCILHLSDNP- 389 Query: 604 TRSYHHFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKI----- 440 TR YH ++E+PS I V+ EFE+FLA++LGG +REI+L+I Sbjct: 390 TRPYHAVSMELPSTPIHRFHVGFGFGTSSGFVIQEFEEFLAKMLGGIVREIQLRICGAGE 449 Query: 439 -KDGNRIIKLGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPD 263 R+I++GE+R GEERRI L L H+ VEYSYI+ G +E + GET+V + + Sbjct: 450 EVGSGRVIRIGEIRGGEERRILLDLGDCTHVYVEYSYIE--GEIDECVRRTGETVVGVGE 507 Query: 262 Q--HAAGDRERSDGLDGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 A+ D E + T GG GRTS+VE+WD+H +MARRWAKHLHGYR Sbjct: 508 HKGDASEDGEETVRDMNTGGGGGGRTSSVESWDFHGPYMARRWAKHLHGYR 558 >ref|XP_003535004.1| PREDICTED: uncharacterized protein LOC100780745 [Glycine max] Length = 550 Score = 486 bits (1251), Expect = e-134 Identities = 269/468 (57%), Positives = 329/468 (70%), Gaps = 30/468 (6%) Frame = -3 Query: 1429 KNLCAICLDPLNY-SCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLP 1253 KNLCAICLDPL+Y S GSS AIFTAQCSH FHFACISSN+ HGSVTCPICRAHWTQLP Sbjct: 89 KNLCAICLDPLSYHSKGSSPGQAIFTAQCSHTFHFACISSNVRHGSVTCPICRAHWTQLP 148 Query: 1252 RDPKSHCS--LHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRL 1079 R+ ++ NQSDPIL+ILDDSIA+FRVHRRS LRSARYDDDDPVEP+ T +L Sbjct: 149 RNLNNNLGPFTSSNQSDPILRILDDSIATFRVHRRSLLRSARYDDDDPVEPDETPESPKL 208 Query: 1078 HISLLPIQLTHPTT------------------IHFFSESAERL--AAVAQKPFV-CASST 962 SL+PI PT+ +H + S+ L + QKP+V C SS Sbjct: 209 CFSLVPIPPNAPTSYNPALQVTKHASCPCHLSLHPLTCSSLSLLQSPPMQKPYVMCPSSN 268 Query: 961 RAYLCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMF 782 RAYL V+L+H+ DL+LV SPNGPHLRL+KQAMALV+FSLR +DRLAIV YS+AA R+F Sbjct: 269 RAYLSVKLSHERATDLVLVASPNGPHLRLLKQAMALVVFSLRHIDRLAIVTYSSAAARVF 328 Query: 781 PLKRMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTT 602 PL+RMTSYGKRTALQVIDR+F G AD EGLKKG+KIL DRV+KNP+SCILHLSD+P T Sbjct: 329 PLRRMTSYGKRTALQVIDRLFYMGQADPVEGLKKGIKILEDRVHKNPESCILHLSDNP-T 387 Query: 601 RSYHHFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKI------ 440 R YH ++E+PS I V+ EFE+FLA++LGG +REI+L+I Sbjct: 388 RPYHAVSMELPSTPIHRFHVGFGFGTSSGFVIQEFEEFLAKMLGGIVREIQLRICGAGEE 447 Query: 439 KDGNRIIKLGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQ 260 R+I++GE+R GEERRI L L H+ VEYSYI+ G +E + GET+V + + Sbjct: 448 VGSGRVIRIGEIRGGEERRILLDLGDCTHVYVEYSYIE--GEIDECVRRTGETVVGVGEH 505 Query: 259 HAAGDRERSDGLDGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 + S+ + T GG GR+S+VE+WD+HD +MARRWAKHLHGYR Sbjct: 506 KG----DVSENGENTGGGGGGRSSSVESWDFHDPYMARRWAKHLHGYR 549 >ref|XP_003594365.1| hypothetical protein MTR_2g027780 [Medicago truncatula] gi|355483413|gb|AES64616.1| hypothetical protein MTR_2g027780 [Medicago truncatula] Length = 554 Score = 486 bits (1250), Expect = e-134 Identities = 265/465 (56%), Positives = 327/465 (70%), Gaps = 27/465 (5%) Frame = -3 Query: 1429 KNLCAICLDPLNY-SCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLP 1253 K LCAICLDPL+Y S GSS AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLP Sbjct: 92 KGLCAICLDPLSYHSKGSSPGQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLP 151 Query: 1252 RDPKSHCS---LHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLR 1082 R+ + S NQSDPIL+ILDDSIA+FRVHRRS LR+ARYDDDDPVEPN + + + Sbjct: 152 RNLNNTLSGPFASSNQSDPILRILDDSIATFRVHRRSILRTARYDDDDPVEPNDSPDTPK 211 Query: 1081 LHISLLPIQLTHPTTIHFF-----------SESAERLAAVAQKPFV-CASSTRAYLCVRL 938 L SL PI PT+ H S S+ ++ P++ C SS RAYL V+L Sbjct: 212 LCFSLEPIPPNAPTSFHQALQVTNHASCPCSSSSMLHSSPMHTPYITCPSSNRAYLSVKL 271 Query: 937 AHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFPLKRMTSY 758 AH+ DL+LV SPNGPHLRL+KQAMALV+FSLR +DRLAIV YS+AA R+FPL+RMT+Y Sbjct: 272 AHERATDLVLVASPNGPHLRLLKQAMALVVFSLRHIDRLAIVTYSSAAARVFPLRRMTTY 331 Query: 757 GKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTRSYHHFNV 578 GKRTALQVIDR+F G AD EGLKKG+KIL DR+++NP+SCILHLSD+P TR YH ++ Sbjct: 332 GKRTALQVIDRLFYMGQADPVEGLKKGIKILEDRLHRNPESCILHLSDNP-TRPYHAISM 390 Query: 577 EVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKI----KDG--NRIIK 416 E+PS I VM EFE+FLA++LGG IREI+L+I +DG R+++ Sbjct: 391 ELPSTPIHRFHVGFGFGTSSGFVMQEFEEFLAKMLGGIIREIQLRICGAGEDGRNGRVVR 450 Query: 415 LGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQHAAGD--- 245 +GE+R GEERRI L L H+ +EYSYI+ G +E + GE++V + D+H D Sbjct: 451 IGEIRGGEERRIVLDLGDCSHVYLEYSYIE--GEIDECVRRTGESVVGVEDEHKGDDVSE 508 Query: 244 --RERSDGLDGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 E D GR+S+VE+WD+HD +MARRWAK+LHGYR Sbjct: 509 DGEENESERDMNTTNTGGRSSSVESWDFHDPYMARRWAKYLHGYR 553 >gb|ABN08848.1| Zinc finger, RING-type [Medicago truncatula] Length = 463 Score = 484 bits (1245), Expect = e-134 Identities = 264/463 (57%), Positives = 326/463 (70%), Gaps = 27/463 (5%) Frame = -3 Query: 1423 LCAICLDPLNY-SCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLPRD 1247 LCAICLDPL+Y S GSS AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLPR+ Sbjct: 3 LCAICLDPLSYHSKGSSPGQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLPRN 62 Query: 1246 PKSHCS---LHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRLH 1076 + S NQSDPIL+ILDDSIA+FRVHRRS LR+ARYDDDDPVEPN + + +L Sbjct: 63 LNNTLSGPFASSNQSDPILRILDDSIATFRVHRRSILRTARYDDDDPVEPNDSPDTPKLC 122 Query: 1075 ISLLPIQLTHPTTIHFF-----------SESAERLAAVAQKPFV-CASSTRAYLCVRLAH 932 SL PI PT+ H S S+ ++ P++ C SS RAYL V+LAH Sbjct: 123 FSLEPIPPNAPTSFHQALQVTNHASCPCSSSSMLHSSPMHTPYITCPSSNRAYLSVKLAH 182 Query: 931 QPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFPLKRMTSYGK 752 + DL+LV SPNGPHLRL+KQAMALV+FSLR +DRLAIV YS+AA R+FPL+RMT+YGK Sbjct: 183 ERATDLVLVASPNGPHLRLLKQAMALVVFSLRHIDRLAIVTYSSAAARVFPLRRMTTYGK 242 Query: 751 RTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTRSYHHFNVEV 572 RTALQVIDR+F G AD EGLKKG+KIL DR+++NP+SCILHLSD+P TR YH ++E+ Sbjct: 243 RTALQVIDRLFYMGQADPVEGLKKGIKILEDRLHRNPESCILHLSDNP-TRPYHAISMEL 301 Query: 571 PSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKI----KDG--NRIIKLG 410 PS I VM EFE+FLA++LGG IREI+L+I +DG R++++G Sbjct: 302 PSTPIHRFHVGFGFGTSSGFVMQEFEEFLAKMLGGIIREIQLRICGAGEDGRNGRVVRIG 361 Query: 409 ELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQHAAGD----- 245 E+R GEERRI L L H+ +EYSYI+ G +E + GE++V + D+H D Sbjct: 362 EIRGGEERRIVLDLGDCSHVYLEYSYIE--GEIDECVRRTGESVVGVEDEHKGDDVSEDG 419 Query: 244 RERSDGLDGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 E D GR+S+VE+WD+HD +MARRWAK+LHGYR Sbjct: 420 EENESERDMNTTNTGGRSSSVESWDFHDPYMARRWAKYLHGYR 462 >gb|EMJ13897.1| hypothetical protein PRUPE_ppa018588mg [Prunus persica] Length = 552 Score = 479 bits (1233), Expect = e-132 Identities = 261/460 (56%), Positives = 319/460 (69%), Gaps = 22/460 (4%) Frame = -3 Query: 1429 KNLCAICLDPLNYSCGSSSCP-AIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLP 1253 KNLCAICLDPL+Y SS+ AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLP Sbjct: 95 KNLCAICLDPLSYHSKSSTPGLAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLP 154 Query: 1252 RD--PKSHCSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRL 1079 R+ P N+ DPILQILDDSIA+FR+HRRSFLRSA YDDDDP+EP+ N RL Sbjct: 155 RNLNPPGGSLSSCNRPDPILQILDDSIATFRIHRRSFLRSAHYDDDDPIEPDHMPNWPRL 214 Query: 1078 HISLLPIQLT-----------HPTTIHFFSESAERLAAVAQ-KPF-VCASSTRAYLCVRL 938 +SL+PI + HP+ H S+ L + + KPF +CA S RAYL V+L Sbjct: 215 QLSLIPIPPSAPPSWCTPYPYHPSPHHQSCSSSSLLQSPTRPKPFTLCAFSDRAYLSVKL 274 Query: 937 AHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMFPLKRMTSY 758 AHQ DL+LV SPNGPHLRL+KQ MALV+FSLRP+DRLAIV YS+AA R+FPL+RMTSY Sbjct: 275 AHQRATDLVLVASPNGPHLRLLKQCMALVVFSLRPIDRLAIVTYSSAAARLFPLRRMTSY 334 Query: 757 GKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTTRSYHHFNV 578 GKRTA QVIDR+F G AD EG+KKG+KIL DR YKNP+S ILHLSDSPT + ++ Sbjct: 335 GKRTAQQVIDRLFYMGQADPIEGIKKGIKILEDRAYKNPESSILHLSDSPTQSYHAAMSM 394 Query: 577 EVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKIKDG----NRIIKLG 410 E+P I + +MHEFE+ L ++GG +RE++L+I+ G +RI+++G Sbjct: 395 ELP-IPVHRFHVGFGFGTSNGFIMHEFEELLGTLIGGIVREVQLRIRIGEEASSRIVRIG 453 Query: 409 ELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQHAAGDRERSD 230 ELR GEER+I + L GHICV YSYI+ G +E GET+V I D + + Sbjct: 454 ELRGGEERKILVELGVCGHICVGYSYIEE--GEIDEPFTTGETVVSIGDSKSKATEGAAA 511 Query: 229 GLDGTRGGISGRTS--NVENWDYHDTFMARRWAKHLHGYR 116 + I GR+S +VE WDYHD +MARRWAKHLHGYR Sbjct: 512 ASGTSDAIICGRSSSASVETWDYHDPYMARRWAKHLHGYR 551 >ref|XP_006597631.1| PREDICTED: uncharacterized protein LOC100785882 isoform X2 [Glycine max] Length = 562 Score = 477 bits (1228), Expect = e-132 Identities = 268/470 (57%), Positives = 327/470 (69%), Gaps = 32/470 (6%) Frame = -3 Query: 1429 KNLCAICLDPLNY-SCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLP 1253 KNLCAICLDPL+Y S GSS AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLP Sbjct: 98 KNLCAICLDPLSYQSKGSSPGQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLP 157 Query: 1252 RDPKSHCS--LHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRL 1079 R+ ++ NQSDPIL+ILDDSIA+FRVHRRS LRSARYDDDDPVEP+ T +L Sbjct: 158 RNLNNNLGPFTSSNQSDPILRILDDSIATFRVHRRSLLRSARYDDDDPVEPDETHESPKL 217 Query: 1078 HISLLPIQLTHPT------------------TIHFFSESAERL--AAVAQKPFV-CASST 962 SL+PI PT ++H S S+ L + Q P++ C SS Sbjct: 218 GFSLVPIPPNAPTGYHPALQVTKHASCPCHLSLHPLSCSSSSLLQSPPMQTPYIMCPSSN 277 Query: 961 RAYLCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMF 782 RAYL V+L H+ DL+LV SPNGPHLRL+KQAMALV+FSLR +DRLAIV YS+AA R+F Sbjct: 278 RAYLSVKLTHERATDLVLVASPNGPHLRLLKQAMALVVFSLRHIDRLAIVTYSSAAARVF 337 Query: 781 PLKRMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTT 602 PL+RMTSYGKRTALQVIDR+F G +D EGLKKG+KIL DRV+KNP+SCILHLSD+P T Sbjct: 338 PLRRMTSYGKRTALQVIDRLFYMGQSDPVEGLKKGIKILEDRVHKNPESCILHLSDNP-T 396 Query: 601 RSYHHFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKI------ 440 R YH ++E+PS I V+ EFE+FLA++LGG +REI+L+I Sbjct: 397 RPYHAVSMELPSTPIHRFHVGFGFGTSSGFVIQEFEEFLAKMLGGIVREIQLRICGAGEE 456 Query: 439 KDGNRIIKLGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQ 260 R+I++GE+R G+ERRI L L H+ VEYSYI+ G +E + GET+V + + Sbjct: 457 VGSGRVIRIGEIRGGKERRILLDLGDFTHVYVEYSYIE--GEIDECVRRTGETVVGV-GE 513 Query: 259 HAAGDRERSDGL--DGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 H E + D GG GR+S+VE+WD+HD +MARRWAKHLHGYR Sbjct: 514 HKDDVLENGEETVRDMNTGG--GRSSSVESWDFHDPYMARRWAKHLHGYR 561 >ref|XP_003546214.1| PREDICTED: uncharacterized protein LOC100785882 isoform X1 [Glycine max] Length = 553 Score = 477 bits (1228), Expect = e-132 Identities = 268/470 (57%), Positives = 327/470 (69%), Gaps = 32/470 (6%) Frame = -3 Query: 1429 KNLCAICLDPLNY-SCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLP 1253 KNLCAICLDPL+Y S GSS AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLP Sbjct: 89 KNLCAICLDPLSYQSKGSSPGQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLP 148 Query: 1252 RDPKSHCS--LHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEPNCTSNHLRL 1079 R+ ++ NQSDPIL+ILDDSIA+FRVHRRS LRSARYDDDDPVEP+ T +L Sbjct: 149 RNLNNNLGPFTSSNQSDPILRILDDSIATFRVHRRSLLRSARYDDDDPVEPDETHESPKL 208 Query: 1078 HISLLPIQLTHPT------------------TIHFFSESAERL--AAVAQKPFV-CASST 962 SL+PI PT ++H S S+ L + Q P++ C SS Sbjct: 209 GFSLVPIPPNAPTGYHPALQVTKHASCPCHLSLHPLSCSSSSLLQSPPMQTPYIMCPSSN 268 Query: 961 RAYLCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMRMF 782 RAYL V+L H+ DL+LV SPNGPHLRL+KQAMALV+FSLR +DRLAIV YS+AA R+F Sbjct: 269 RAYLSVKLTHERATDLVLVASPNGPHLRLLKQAMALVVFSLRHIDRLAIVTYSSAAARVF 328 Query: 781 PLKRMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSPTT 602 PL+RMTSYGKRTALQVIDR+F G +D EGLKKG+KIL DRV+KNP+SCILHLSD+P T Sbjct: 329 PLRRMTSYGKRTALQVIDRLFYMGQSDPVEGLKKGIKILEDRVHKNPESCILHLSDNP-T 387 Query: 601 RSYHHFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKI------ 440 R YH ++E+PS I V+ EFE+FLA++LGG +REI+L+I Sbjct: 388 RPYHAVSMELPSTPIHRFHVGFGFGTSSGFVIQEFEEFLAKMLGGIVREIQLRICGAGEE 447 Query: 439 KDGNRIIKLGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPDQ 260 R+I++GE+R G+ERRI L L H+ VEYSYI+ G +E + GET+V + + Sbjct: 448 VGSGRVIRIGEIRGGKERRILLDLGDFTHVYVEYSYIE--GEIDECVRRTGETVVGV-GE 504 Query: 259 HAAGDRERSDGL--DGTRGGISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 H E + D GG GR+S+VE+WD+HD +MARRWAKHLHGYR Sbjct: 505 HKDDVLENGEETVRDMNTGG--GRSSSVESWDFHDPYMARRWAKHLHGYR 552 >ref|XP_004293426.1| PREDICTED: uncharacterized protein LOC101308157 isoform 2 [Fragaria vesca subsp. vesca] Length = 552 Score = 471 bits (1211), Expect = e-130 Identities = 269/471 (57%), Positives = 323/471 (68%), Gaps = 33/471 (7%) Frame = -3 Query: 1429 KNLCAICLDPLNY-SCGSSSCPAIFTAQCSHAFHFACISSNIHHGSVTCPICRAHWTQLP 1253 KNLCAICLDPL+Y S G SS AIFTAQCSHAFHFACISSN+ HGSVTCPICRAHWTQLP Sbjct: 87 KNLCAICLDPLSYHSKGKSSGQAIFTAQCSHAFHFACISSNVRHGSVTCPICRAHWTQLP 146 Query: 1252 RD--PKSHCSLHGNQSDPILQILDDSIASFRVHRRSFLRSARYDDDDPVEP-NCTSNHLR 1082 R+ P N +DPILQILDDSIA+FR+HRRSFLRSARYDDDDP+EP + N R Sbjct: 147 RNLNPPGGSLSSCNPTDPILQILDDSIATFRIHRRSFLRSARYDDDDPIEPPDHMPNFPR 206 Query: 1081 LHISLLPIQ-------------LTHP--TTIHFFSESAERLAAVAQKPF-------VCAS 968 L SL P+ THP H S S+ ++ Q P + AS Sbjct: 207 LQFSLAPLPPSTPPHSFQPSWCATHPYHPPPHHLSYSSS--ISLLQSPTGLKSYNTMYAS 264 Query: 967 STRAYLCVRLAHQPGIDLILVVSPNGPHLRLMKQAMALVIFSLRPMDRLAIVIYSTAAMR 788 S RAYL V+L+ Q DL+LV SPNGPHLRL+KQ MALV+FSLRP+DRLAIV YS+AA R Sbjct: 265 SDRAYLSVKLSQQRATDLVLVASPNGPHLRLLKQCMALVVFSLRPIDRLAIVTYSSAAAR 324 Query: 787 MFPLKRMTSYGKRTALQVIDRIFCTGHADSSEGLKKGVKILGDRVYKNPQSCILHLSDSP 608 +FPL+RMTSYGKRTA QVIDR+F G AD+ EGLKKG+KIL DRVYKN S ILHLSD+P Sbjct: 325 VFPLRRMTSYGKRTAQQVIDRLFYMGQADAVEGLKKGIKILQDRVYKNQDSRILHLSDTP 384 Query: 607 TTRSYH-HFNVEVPSITIXXXXXXXXXXXXXXXVMHEFEDFLARVLGGAIREIKLKI--- 440 TRSYH N++VP I + +MHEFE+FL R++GG +R+I+L+I Sbjct: 385 -TRSYHAALNMDVP-IPVHRFHVGFGVGTSNGFIMHEFEEFLRRLIGGVVRDIQLRINTT 442 Query: 439 -KDGNRIIKLGELRDGEERRIPLTLHKSGHICVEYSYIDSGGGGNEESTKCGETLVCIPD 263 + +RI+++GELR+GEER+I + L GHICV YSYI+ G +E GET+V I D Sbjct: 443 GEASSRIVRIGELRNGEERKILVDLRVCGHICVGYSYIED--GEIDEPITTGETVVSIGD 500 Query: 262 QHA-AGDRERSDGLDGTRGG-ISGRTSNVENWDYHDTFMARRWAKHLHGYR 116 + E + GT G I GR S+ ++WDYHD +MARRWAKHLHGYR Sbjct: 501 DKSNINLDEAASAGSGTSGPIIGGRPSSADSWDYHDPYMARRWAKHLHGYR 551