BLASTX nr result
ID: Paeonia23_contig00007543
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Paeonia23_contig00007543 (2414 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854... 874 0.0 emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera] 873 0.0 ref|XP_002527444.1| protein dimerization, putative [Ricinus comm... 852 0.0 ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615... 825 0.0 ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citr... 824 0.0 ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobrom... 235 8e-59 ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [A... 226 5e-56 ref|XP_004299161.1| PREDICTED: uncharacterized protein LOC101293... 218 1e-53 ref|XP_003543854.2| PREDICTED: uncharacterized protein LOC100780... 207 2e-50 ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]... 207 2e-50 ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prun... 207 2e-50 ref|XP_006577689.1| PREDICTED: uncharacterized protein LOC102662... 205 9e-50 ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307... 204 1e-49 ref|XP_002509591.1| DNA binding protein, putative [Ricinus commu... 200 2e-48 ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627... 200 3e-48 ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, part... 199 5e-48 ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 198 8e-48 ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222... 198 8e-48 ref|XP_006603987.1| PREDICTED: uncharacterized protein LOC102660... 196 5e-47 ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660... 194 2e-46 >ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854857 [Vitis vinifera] Length = 635 Score = 874 bits (2257), Expect = 0.0 Identities = 432/613 (70%), Positives = 504/613 (82%), Gaps = 4/613 (0%) Frame = +1 Query: 175 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 354 M +ESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP Sbjct: 1 MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 60 Query: 355 AIDRSMREAFQILEEERLARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVAR 534 AIDRS+REAFQILEEERLARKKKRTSGSGK GKRIRTSQ S+ +WK+I+KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120 Query: 535 FFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRE 714 FFYA+GL+ ++ NSPYF E+ +A+FGPGYE P+ +KLS FL KEKA+IEK++A VRE Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180 Query: 715 SWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAI 894 SWP TGCTILCVN L T+G N+FVSSPRGLMF KA+ +NDGD ++N+F +SDAI Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240 Query: 895 MEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIV 1074 MEV TNVLQI+ NLGH SESFESL++ KF +FWSPCTSHSI LME+I +LDWIKPIV Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300 Query: 1075 LCAKGIEQCMLTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXX 1254 LCAK I++C+LT+QRSS V T +SSDPLS KFAPSY +V RIFE+KQAL Sbjct: 301 LCAKEIDECILTYQRSSLCVLT---LESSDPLSTKFAPSYCIVERIFELKQALLGVVVSE 357 Query: 1255 XXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDW 1434 L T+ ED +++E +ILGDNFWS LQ EPFVRLL T +I+KSVMGDVF+W Sbjct: 358 EWKQWKL-TIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416 Query: 1435 RMWALEAIRRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGKNQAKDKTAMR 1614 R+ ALEA++ KG+DD LNQ+E+L+ES+WDM FSPLHA+GYILNP+YFGK Q+KDKT MR Sbjct: 417 RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476 Query: 1615 GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 1794 GWK+TL+RYESDS RRVLREQLSSYWR+EGS G+EDAVDCRDKMDPVAWWENFGFETPH Sbjct: 477 GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536 Query: 1795 LQTLAVKILCQVSSVGICQV----SDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQRIGN 1962 LQTLA+KIL QVSSV + Q ++ CQ AVN L VER EDLVFV+NNLRLHSQR GN Sbjct: 537 LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERTEDLVFVRNNLRLHSQRNGN 596 Query: 1963 LNSPYGVKHGMAS 2001 +S G ++ +S Sbjct: 597 SSSSPGNRNQSSS 609 >emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera] Length = 635 Score = 873 bits (2255), Expect = 0.0 Identities = 431/613 (70%), Positives = 504/613 (82%), Gaps = 4/613 (0%) Frame = +1 Query: 175 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 354 M +ESDKWGWKHVSVFGGFDKGSGTKRWKCNHCN+RYNGSYSRVRAHLLGFTGVGVKSCP Sbjct: 1 MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNIRYNGSYSRVRAHLLGFTGVGVKSCP 60 Query: 355 AIDRSMREAFQILEEERLARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVAR 534 AIDRS+REAFQILEEERLARKKKRTSGSGK GKRIRTSQ S+ +WK+I+KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120 Query: 535 FFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRE 714 FFYA+GL+ ++ NSPYF E+ +A+FGPGYE P+ +KLS FL KEKA+IEK++A VRE Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180 Query: 715 SWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAI 894 SWP TGCTILCVN L T+G N+FVSSPRGLMF KA+ +NDGD ++N+F +SDAI Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240 Query: 895 MEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIV 1074 MEV TNVLQI+ NLGH SESFESL++ KF +FWSPCTSHSI LME+I +LDWIKPIV Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300 Query: 1075 LCAKGIEQCMLTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXX 1254 LCAK I++C+LT+QRSS V T +SSDPLS KFAPSY +V RIFE+KQAL Sbjct: 301 LCAKEIDECILTYQRSSLCVLT---LESSDPLSTKFAPSYCIVERIFELKQALLGVVVSE 357 Query: 1255 XXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDW 1434 L T+ ED +++E +ILGDNFWS LQ EPFVRLL T +I+KSVMGDVF+W Sbjct: 358 EWKQWKL-TIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416 Query: 1435 RMWALEAIRRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGKNQAKDKTAMR 1614 R+ ALEA++ KG+DD LNQ+E+L+ES+WDM FSPLHA+GYILNP+YFGK Q+KDKT MR Sbjct: 417 RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476 Query: 1615 GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 1794 GWK+TL+RYESDS RRVLREQLSSYWR+EGS G+EDAVDCRDKMDPVAWWENFGFETPH Sbjct: 477 GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536 Query: 1795 LQTLAVKILCQVSSVGICQV----SDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQRIGN 1962 LQTLA+KIL QVSSV + Q ++ CQ AVN L VER EDLVFV+NNLRLHSQR GN Sbjct: 537 LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERAEDLVFVRNNLRLHSQRNGN 596 Query: 1963 LNSPYGVKHGMAS 2001 +S G ++ +S Sbjct: 597 SSSSPGNRNQSSS 609 >ref|XP_002527444.1| protein dimerization, putative [Ricinus communis] gi|223533179|gb|EEF34936.1| protein dimerization, putative [Ricinus communis] Length = 633 Score = 852 bits (2200), Expect = 0.0 Identities = 415/613 (67%), Positives = 500/613 (81%), Gaps = 4/613 (0%) Frame = +1 Query: 175 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 354 M SESDKWGW+HVSVFGGFD+GSGTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 355 AIDRSMREAFQILEEERLARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVAR 534 AIDRS+REAFQILEEERL RKKK+ S +GKPGKR R SQ S++ WK+I+KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLVRKKKKNSANGKPGKRTRISQASIS--WKTITKEDVDDIVAR 118 Query: 535 FFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRE 714 FFYA+GLNI V NSPYF+E++ + +FG GYE PS+DKLS SFL KEK RIEKSLA +RE Sbjct: 119 FFYADGLNIDVVNSPYFHEMVKAIGAFGSGYELPSIDKLSDSFLGKEKGRIEKSLALLRE 178 Query: 715 SWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAI 894 SWP TGCTILCV LD GC IN+FVSSPRGL+F KAV V+D D ++V GA+SDAI Sbjct: 179 SWPHTGCTILCVGRLDGAIGCFHINIFVSSPRGLIFLKAVDVDDCDEGDHVLAGALSDAI 238 Query: 895 MEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIV 1074 +EVG +NVLQI+ +LG +S ES ++ KFP IFWSPCTSHSI LME IAEL+W+KPIV Sbjct: 239 LEVGPSNVLQIISHLGDACKSSESYILSKFPHIFWSPCTSHSILMLMEEIAELEWVKPIV 298 Query: 1075 LCAKGIEQCMLTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXX 1254 LCA+ IEQC++T+Q ++ +F Q K+S D +SAKFAPSY+ V RIFE++Q LQ Sbjct: 299 LCARRIEQCIMTYQHATSCIFMQSPKESCDLISAKFAPSYFFVQRIFELRQTLQEVVVSE 358 Query: 1255 XXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDW 1434 ++ ++ SIE++ILGD+FWS +HL LQL EPF++LL +IDKSV+G V+DW Sbjct: 359 QWKH----SIGDNVESIESAILGDDFWSKSHLLLQLYEPFIKLLGLLDIDKSVIGAVYDW 414 Query: 1435 RMWALEAIRRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGKNQAKDKTAMR 1614 R+ ALEA+R K IDDD LNQ+EVL+E++WD+ FSPLHA GYILNPRY GK Q KDK+ MR Sbjct: 415 RVQALEALRSKAIDDDILNQLEVLIENKWDVLFSPLHATGYILNPRYIGKFQTKDKSVMR 474 Query: 1615 GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 1794 GWK+TLERYE +S ARRVLREQLSSYWR+EGSLGDEDAVDCRDKMDPVAWWENFGFETP Sbjct: 475 GWKATLERYEGESTARRVLREQLSSYWRLEGSLGDEDAVDCRDKMDPVAWWENFGFETPS 534 Query: 1795 LQTLAVKILCQVSSVGIC----QVSDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQRIGN 1962 LQTLA+K+L QVSSV +C Q +D CQEA NRL V+RVEDL+FV+NNLRLH Q+ N Sbjct: 535 LQTLAIKVLSQVSSVALCQEIWQTNDFSCQEAANRLGVQRVEDLLFVRNNLRLHYQKNCN 594 Query: 1963 LNSPYGVKHGMAS 2001 L++ G+++ ++S Sbjct: 595 LSTSPGLRNTISS 607 >ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615434 isoform X1 [Citrus sinensis] gi|568863036|ref|XP_006484969.1| PREDICTED: uncharacterized protein LOC102615434 isoform X2 [Citrus sinensis] Length = 636 Score = 825 bits (2130), Expect = 0.0 Identities = 409/604 (67%), Positives = 481/604 (79%), Gaps = 4/604 (0%) Frame = +1 Query: 175 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 354 M SESDKWGW+HVSVFGGF++GSGTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 355 AIDRSMREAFQILEEERLARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVAR 534 AIDRSMRE FQILEEER+ARKKKRTSG K GKRIR Q S+ + K+ISKEDVD++VAR Sbjct: 61 AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSI--VSKAISKEDVDEMVAR 118 Query: 535 FFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRE 714 FFYA GLN++V NSPYF E++ ++A+FG GY+ PS++ LS SFL KEK +IEK +ASVRE Sbjct: 119 FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178 Query: 715 SWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAI 894 SWP TGCTILCV+ LD GC +FVSSPRGL+F KA+ ++D D EN+F +SDAI Sbjct: 179 SWPHTGCTILCVSSLDGRLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238 Query: 895 MEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIV 1074 +EVG NVLQI+ +LGH +S+ESL++ KFP IF SPCT SI ME IA L+WIK V Sbjct: 239 LEVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298 Query: 1075 LCAKGIEQCMLTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXX 1254 LCAK IEQ ++ +Q + P +F +LK+SSD +S K APSY V RI E+KQ LQ Sbjct: 299 LCAKRIEQHIMYYQHAYPCLFPHNLKESSDQVSTKIAPSYCFVQRIIELKQVLQEAVVSE 358 Query: 1255 XXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDW 1434 L ++ D+ +E++ILGD+FW HLFLQLCEPFVRLLATF+IDKSVMG V+DW Sbjct: 359 EFKQWKL-SMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417 Query: 1435 RMWALEAIRRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGKNQAKDKTAMR 1614 R ALEA+R KGID ALNQ+EVL E+RWD FSPLHAAGYILNPRYFG+ Q KDKT MR Sbjct: 418 RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477 Query: 1615 GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 1794 GWKSTLERYESDS RR+LREQLSSYWR+EGSLG+EDAVD RDKM+PVAWWENFGFE H Sbjct: 478 GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537 Query: 1795 LQTLAVKILCQVSSVGICQV----SDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQRIGN 1962 LQTLA+K+L QVSSV ICQ +D PC+EA NR VER EDL+FV+NNLRLH+QR N Sbjct: 538 LQTLAIKVLSQVSSVAICQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNVN 597 Query: 1963 LNSP 1974 L+SP Sbjct: 598 LSSP 601 >ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citrus clementina] gi|557526284|gb|ESR37590.1| hypothetical protein CICLE_v10028008mg [Citrus clementina] Length = 636 Score = 824 bits (2128), Expect = 0.0 Identities = 408/604 (67%), Positives = 481/604 (79%), Gaps = 4/604 (0%) Frame = +1 Query: 175 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 354 M SESDKWGW+HVSVFGGF++GSGTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 355 AIDRSMREAFQILEEERLARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVAR 534 AIDRSMRE FQILEEER+ARKKKRTSG K GKRIR Q S+ + K+ISKEDVD++VAR Sbjct: 61 AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSI--VSKAISKEDVDEMVAR 118 Query: 535 FFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRE 714 FFYA GLN++V NSPYF E++ ++A+FG GY+ PS++ LS SFL KEK +IEK +ASVRE Sbjct: 119 FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178 Query: 715 SWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISDAI 894 SWP TGCTILCV+ LD GC +FVSSPRGL+F KA+ ++D D EN+F +SDAI Sbjct: 179 SWPHTGCTILCVSSLDGQLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238 Query: 895 MEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIV 1074 ++VG NVLQI+ +LGH +S+ESL++ KFP IF SPCT SI ME IA L+WIK V Sbjct: 239 LDVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298 Query: 1075 LCAKGIEQCMLTFQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXX 1254 LCAK IEQ +L +Q + P +F +LK+SSD +S K APSY V RI E+KQ LQ Sbjct: 299 LCAKRIEQHILYYQHAYPCLFPHNLKESSDQVSTKIAPSYCFVQRIIELKQVLQEAVVSE 358 Query: 1255 XXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDW 1434 L ++ D+ +E++ILGD+FW HLFLQLCEPFVRLLATF+IDKSVMG V+DW Sbjct: 359 EFKQWKL-SMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417 Query: 1435 RMWALEAIRRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGKNQAKDKTAMR 1614 R ALEA+R KGID ALNQ+EVL E+RWD FSPLHAAGYILNPRYFG+ Q KDKT MR Sbjct: 418 RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477 Query: 1615 GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 1794 GWKSTLERYESDS RR+LREQLSSYWR+EGSLG+EDAVD RDKM+PVAWWENFGFE H Sbjct: 478 GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537 Query: 1795 LQTLAVKILCQVSSVGICQV----SDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQRIGN 1962 LQTLA+K+L QVSSV +CQ +D PC+EA NR VER EDL+FV+NNLRLH+QR N Sbjct: 538 LQTLAIKVLSQVSSVAVCQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNVN 597 Query: 1963 LNSP 1974 L+SP Sbjct: 598 LSSP 601 >ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobroma cacao] gi|508784897|gb|EOY32153.1| Uncharacterized protein TCM_039722 [Theobroma cacao] Length = 381 Score = 235 bits (599), Expect = 8e-59 Identities = 114/191 (59%), Positives = 142/191 (74%) Frame = +1 Query: 1279 TLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMWALEAI 1458 ++ +D + IEASILGD FWS H+ LQL +PF +LLA +IDKSVMG ++DWR+ ALE + Sbjct: 163 SILKDILIIEASILGDEFWSNAHMMLQLFKPFAKLLAMLDIDKSVMGAIYDWRVQALEVV 222 Query: 1459 RRKGIDDDALNQVEVLLESRWDMYFSPLHAAGYILNPRYFGKNQAKDKTAMRGWKSTLER 1638 R K ID+ ALNQ+EVL+E++W++ FS LHAAGYILNP YFGK Sbjct: 223 RSKEIDETALNQLEVLIENKWNVLFSLLHAAGYILNPGYFGK------------------ 264 Query: 1639 YESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPHLQTLAVKI 1818 AR VLR+QLSSYWR+EGS G+EDA+DCRDKMD VAWWENFGFETPHLQTLA+K+ Sbjct: 265 ------ARWVLRKQLSSYWRLEGSFGEEDALDCRDKMDLVAWWENFGFETPHLQTLAIKV 318 Query: 1819 LCQVSSVGICQ 1851 L QVS++ +CQ Sbjct: 319 LSQVSTISMCQ 329 Score = 167 bits (422), Expect = 3e-38 Identities = 77/91 (84%), Positives = 84/91 (92%) Frame = +1 Query: 175 MQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 354 M SE DKWGW+HV+VFG FD+GSGTKRWKCNHCNLRYNGSYSRVRAHLL F+GVGVKSC Sbjct: 1 MASEFDKWGWEHVTVFGVFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLRFSGVGVKSCL 60 Query: 355 AIDRSMREAFQILEEERLARKKKRTSGSGKP 447 AI+R++REAF ILEEERLARKKKRT GSGKP Sbjct: 61 AINRTLREAFHILEEERLARKKKRTFGSGKP 91 Score = 79.0 bits (193), Expect = 1e-11 Identities = 36/52 (69%), Positives = 40/52 (76%) Frame = +1 Query: 610 SFGPGYESPSVDKLSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDC 765 +FG GYE PS+DKLS FL KEK RIEKS+ VRESWP TG T+LCV CL C Sbjct: 92 TFGCGYEPPSMDKLSDCFLSKEKGRIEKSITLVRESWPHTGYTVLCVGCLGC 143 >ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda] gi|548843859|gb|ERN03513.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda] Length = 732 Score = 226 bits (575), Expect = 5e-56 Identities = 173/642 (26%), Positives = 277/642 (43%), Gaps = 58/642 (9%) Frame = +1 Query: 202 WKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREA 381 W ++ G G G +C C + GSY+RV++HLLG G GVK C ID Sbjct: 35 WAYMEKIGRCHTGGGNWMLRCVLCKAEFKGSYTRVKSHLLGKVGTGVKRCLGIDNETLAT 94 Query: 382 FQILEEERLARKKKRTSGSGKPGKRIRTSQLSL------NHIWKSIS---KEDVDDVVAR 534 L +E RK + +S S P ++ + + L N + K + K+ +D ++AR Sbjct: 95 LLRLNDEGSTRKIRSSSRSSVPLLKVNSGSIGLKKRRGANDLVKLLDLAPKDVLDRMIAR 154 Query: 535 FFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVDKLSGSFLIKEKARIEKSLASVR 711 FYA G+++++ SPYF ++I GY P+ D L S L EKA IE+S+ R Sbjct: 155 CFYASGISLNLIRSPYFRDMIRYACENSLEGYVLPTFDNLRTSLLDAEKANIEQSVKPFR 214 Query: 712 ESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKA----VRVNDGDTLENVFTGA 879 SW G ++L D T IN +S G +F KA V + + D ++N+F Sbjct: 215 SSWGSRGVSLLTDGWTDTTAKRPLINFMAASDIGSIFLKAIDSSVEMMNTDYMKNLFL-- 272 Query: 880 ISDAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELD- 1056 + + EVG T+V+QI+ + + P IFW+PC H++ ++NI D Sbjct: 273 --EMVAEVGPTSVVQIITDNSPICRVAGQRVEGMHPYIFWTPCVIHTLNLALKNICSPDD 330 Query: 1057 -----------WIKPIVLCAKGI------EQCMLTFQRSSPNVFTQDLKQSSDPLSAKFA 1185 WI+ + K I +LT P + + +S +FA Sbjct: 331 ERKAEKYLHCQWIRDLDRDVKMIRSFVVDHNAVLTIYSQYPTLRLLSVTES------RFA 384 Query: 1186 PSYYLVHRIFEIKQALQXXXXXXXXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLC 1365 + +V RI E+K AL + +E +++ ++ D +W + Sbjct: 385 STVIIVKRIKEVKPAL-CRMVVDSYWKVLVEEDAEKARRVKSCLVDDLWWEKIEFLIAFT 443 Query: 1366 EPFVRLLATFNIDKSVMGDVFDWRMWALEAIRRKGI------------DDDALNQVEVLL 1509 EP + +L + D+ + +V+D MWA +GI + + +L Sbjct: 444 EPILAMLRAIDTDEPTLHEVYD--MWATMIEEVRGIIFRNEGKNIFLNESSFYEDIHRIL 501 Query: 1510 ESRWDMYFSPLHAAGYILNPRYFGKN----------QAKDKTAMRGWKSTLERYESDSGA 1659 W+ +PL + LNP+Y+ KD+ G R Sbjct: 502 VGSWNKSKTPLQCLAHSLNPKYYSDEWLGEVPSRLPPHKDREVSDGRNVCFARLFPAPSE 561 Query: 1660 RRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPHLQTLAVKILCQVSSV 1839 + + E+ + +G G D + R M P++WWENFG P L LA ++L Q SS Sbjct: 562 LQKVHEEFEMFSMCKGHFGHWDVMSSRFSMSPISWWENFGAHVPRLAKLADRLLSQPSSS 621 Query: 1840 GICQ----VSDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQR 1953 C+ + + NRL +R EDLV+V +NLRL S+R Sbjct: 622 SCCERNWGTFSLIKKIKQNRLASQRAEDLVYVHSNLRLLSRR 663 >ref|XP_004299161.1| PREDICTED: uncharacterized protein LOC101293587 [Fragaria vesca subsp. vesca] Length = 730 Score = 218 bits (555), Expect = 1e-53 Identities = 167/656 (25%), Positives = 295/656 (44%), Gaps = 72/656 (10%) Frame = +1 Query: 202 WKHVSVFGGFDKGSGTK-RWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDR-SMR 375 WK+V++ KG G ++C+ C +++NGS+ RV+ HLL G GV+ C I R Sbjct: 31 WKYVTITREAKKGQGGNCEFQCSFCKIKFNGSHYRVKHHLLQIIGKGVRKCEKIPPPKKR 90 Query: 376 EAFQILEEERLARK------------KKRTSGSGKP---------------GKRIRTSQL 474 E ++E L++K K S SG K+ + Sbjct: 91 ELMALMESYELSKKMAGPRLVPLPSSSKDPSSSGSTFGFGQDLLDDIVVDTSKKRKEVGG 150 Query: 475 SLNHIWKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVDKL 651 SL + + ++E +D +AR FY GL+ ++A +P++ N ++ GY P+ + L Sbjct: 151 SLEKSFNNGAREQLDGEIARMFYTGGLSFNLAKNPHYIRAFNRACAYPIAGYRPPNYNAL 210 Query: 652 SGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKA 831 + L KE+ IE+ L ++ +W Q G ++ C + T+ IN+ + G MF +A Sbjct: 211 RTTLLEKERNHIERLLEPIKLTWKQKGVSV-CSDGWSDTQRRPLINVMAACESGPMFLRA 269 Query: 832 VRVNDGDTLENVFTGAISDAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCT 1011 ++ + + ++I+E+G T+V+Q++ + ++ +++ +FP IFW+PC Sbjct: 270 ENCEGESKDKHFISDLLIESILEIGPTHVVQVITDNASNCKAAGAIINARFPHIFWTPCV 329 Query: 1012 SHSIRQLMENIA-------------ELDWIKPIVLCAKGIEQCMLTFQRSSPNVFTQ--D 1146 H++ ++NI E WI I ++ ++ +F Q + Sbjct: 330 VHTLNLALKNICAPSSIPTKRAAYDECHWISEIADDVYFVKNFIMNHGMRLA-MFNQHSE 388 Query: 1147 LKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXXXXXLMTLSEDNI----SIEAS 1314 LK S +FA + ++ R +IKQ+LQ T +D++ ++ Sbjct: 389 LKMLS-VAETRFASAVVMLKRFKKIKQSLQRMMISDEWD-----TYKDDDVGKARAVSDY 442 Query: 1315 ILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMWALEAIRR-------KGI 1473 IL + +W + P +L + DK + V++W E ++ K Sbjct: 443 ILSNEWWRKIDYIISFTLPIYTMLRRCDTDKPCLHKVYEWWDTMFEEVKVAIYINECKEY 502 Query: 1474 DDDA--LNQVEVLLESRWDMYFSPLHAAGYILNPRYFGKNQA----------KDKTAMRG 1617 ++++ N V +L SRW +PLH + LNPRY+ +D + Sbjct: 503 EEESPFYNVVYSILLSRWTKSSTPLHCMAHSLNPRYYSTEYLSGAPNRTPPHQDSEIAKE 562 Query: 1618 WKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPHL 1797 K L++Y ++ R++ E+ +S+ + D++ R KMDP+ WW G TP+L Sbjct: 563 RKECLKKYYANEDQMRLVNEEFASFSACLDEFANSDSMSDRGKMDPMKWWIVHGSTTPNL 622 Query: 1798 QTLAVKILCQVSSVGICQ----VSDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQR 1953 Q +A+K+L Q S C+ NR+ +R EDLVFV NNLRL S R Sbjct: 623 QKIALKLLGQPCSSSCCERNWSTYTFIHSLRRNRITPQRAEDLVFVHNNLRLLSTR 678 >ref|XP_003543854.2| PREDICTED: uncharacterized protein LOC100780312 [Glycine max] Length = 701 Score = 207 bits (527), Expect = 2e-50 Identities = 175/660 (26%), Positives = 295/660 (44%), Gaps = 63/660 (9%) Frame = +1 Query: 160 PSHESMQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVG 339 PS Q + K W +V+ G G KCN C+ +NGSY+RVRAHLL TG G Sbjct: 6 PSQAKEQDDDTKPLWTYVTKIKSV-AGGGNYEIKCNICDFTFNGSYTRVRAHLLKMTGKG 64 Query: 340 VKSCPAIDRSMREAFQILEEE---RLARKKKR--------------TSGSGKPGKRIRTS 468 V+ C + + + ++ E R+ R K + T+ G K+ +TS Sbjct: 65 VRVCQKVTVAKLIDLKKIDNEATLRVERSKTKSVSLPPVSTQHQMDTNTLGIDPKKRKTS 124 Query: 469 QLSLNHIWKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVD 645 S+ + + ++E +D +AR FY+ GL H+A +P++ + A+ GY+ P + Sbjct: 125 --SVENAFNLQARETLDHEIARMFYSSGLPFHLARNPHYRKAFAYAANNQISGYQPPGYN 182 Query: 646 KLSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFF 825 KL + L E+ +E L ++ +W Q G +I+ D R S IN V + G MF Sbjct: 183 KLRTTLLQNERRHVENLLQPIKNAWSQKGVSIVSDGWSDPQRR-SLINFMVVTESGPMFL 241 Query: 826 KAVRVNDGDTLENVFTGAISDAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSP 1005 KA+ ++ ++ + + IMEVG +NV+QI+ + ++ ++ +FP I+W+P Sbjct: 242 KAIDCSNEIKDKDFIAKHMREVIMEVGHSNVVQIVTDNAAVCKAAGLIIEAEFPSIYWTP 301 Query: 1006 CTSHSIRQLMENIA-------------ELDWIKPIVLCAKGIEQCML--TFQRSSPNVFT 1140 C H++ ++NI E WI I A ++ ++ + + S N F Sbjct: 302 CVVHTLNLALKNICAAKNTEKNNVAYEECSWITQIADDAMFVKNFVMSHSMRLSIFNSFN 361 Query: 1141 QDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXXXXXLMTLSEDNIS----IE 1308 S P +FA + ++ R ++K+ LQ + ED+++ ++ Sbjct: 362 SLKLLSIAP--TRFASTIVMLKRFKQLKKGLQEMVISDQW-----SSYKEDDVAKAKFVK 414 Query: 1309 ASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMW---------ALEAIR 1461 ++L D +W L P +L + + S + V++ MW A+ Sbjct: 415 DTLLDDKWWDKVDYILSFTSPIYDVLRRTDTEASSLHLVYE--MWDSMIEKVKNAIYQYE 472 Query: 1462 RKGIDDDA--LNQVEVLLESRWDMYFSPLHAAGYILNPRYFGKN----------QAKDKT 1605 RK + + V +L RW +PLH + LNPRY+ +D Sbjct: 473 RKEESEGSTFYEVVHSILIDRWTKSSTPLHCLAHSLNPRYYSHEWLSEDSNRVPPHQDME 532 Query: 1606 AMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFE 1785 R +R+ D RR + + +++ D D+++ R +MDP AWW G Sbjct: 533 LTRERLKCFKRFFLDVDVRRKVNIEFANFSDGREGFDDLDSLNDRGQMDPKAWWLVHGIN 592 Query: 1786 TPHLQTLAVKILCQVSSVGICQ-----VSDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQ 1950 P LQ +A+K+L Q S C+ S I + N++ R EDLVFV +NLRL S+ Sbjct: 593 APILQKIALKLLAQPCSSSCCERNWSTYSFIHSLKR-NKMTPHRAEDLVFVHSNLRLLSR 651 >ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao] gi|508777206|gb|EOY24462.1| HAT transposon superfamily [Theobroma cacao] Length = 674 Score = 207 bits (526), Expect = 2e-50 Identities = 154/609 (25%), Positives = 280/609 (45%), Gaps = 46/609 (7%) Frame = +1 Query: 259 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREAFQILEEERLARKKKRTSGS 438 +CN+C+ ++G R++ HL + C + +R+ Q + + KK++T Sbjct: 23 RCNYCHREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRDHIQTILN---SPKKQKTPKK 79 Query: 439 GKPGKRIRTSQ---------LSLNH---------------------------IWKSISKE 510 K K + Q L LNH + +E Sbjct: 80 PKVDKAVANDQQNSSSASGGLHLNHGSSGQHGSTCPSLLFPRPSPSEQPAVDDGQKQKQE 139 Query: 511 DVDDVVARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIE 690 D D +A FF+ + A S Y+ E+++ +A G GY++PS + L + L K K I Sbjct: 140 DADKKIAVFFFHNSIPFSAAKSMYYQEMVDAIAKCGVGYKAPSYENLRSTLLEKVKGDIH 199 Query: 691 KSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVF 870 R+ W +TGCTILC + D R S + V+ P+G +F K+V V+ + + Sbjct: 200 DCYKKYRDEWKETGCTILCDSWSD-GRTKSFVIFSVTCPKGTLFLKSVDVSGHEDDASYL 258 Query: 871 TGAISDAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAE 1050 + ++EVG NV+Q++ + L+M K+ +FWSPC S+ I +++E+I++ Sbjct: 259 FELLESVVLEVGLENVIQVITDTAASYVYAGRLLMAKYSSLFWSPCASYCINKMLEDISK 318 Query: 1051 LDWIKPIVLCAKGIEQCMLT--FQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIK 1224 +W+ ++ AK I Q + + + + FT ++ P +F +Y + I I+ Sbjct: 319 QEWVGIVLEEAKSIVQYIYSHAWIVNMMRKFTGG-RELMRPRITRFVANYLTLRSII-IQ 376 Query: 1225 QALQXXXXXXXXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNID 1404 + + + D +I++ + + FW H + + EP V++L + D Sbjct: 377 EDNLKHMFSHSEWLSSIYSRRSDAQAIKSLLYLERFWKSAHEAVSVSEPLVKILRIVDGD 436 Query: 1405 KSVMGDVFDWRMWALEAIRR--KGIDDDALNQVEVLLESRWDMYF-SPLHAAGYILNPRY 1575 MG +++ A AI+ KG+++ + + +++ RW+M SPLHAA LNP Sbjct: 437 MPAMGYIYEGIERAKVAIKAYYKGLEEKYM-PIWDIIDRRWNMQLHSPLHAAAAFLNPSI 495 Query: 1576 FGKNQAKDKTAMR-GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMD 1752 F K MR G++ + + + + + ++ Y +G+LG + A+ R Sbjct: 496 FYNPNFKIDLRMRNGFQEAMLKLATTDKDKIEITKEHPMYINAQGALGTDFAIMGRTLNA 555 Query: 1753 PVAWWENFGFETPHLQTLAVKILCQVSSVGICQ----VSDIPCQEAVNRLKVERVEDLVF 1920 P WW ++G+E P LQ +A++IL Q S C+ + + N++++E+ DLVF Sbjct: 556 PGDWWASYGYEIPTLQRVAIRILSQPCSSHWCRWNWSTFESIHTKKRNKVELEKFNDLVF 615 Query: 1921 VQNNLRLHS 1947 V NL L + Sbjct: 616 VHCNLCLQA 624 >ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica] gi|462411014|gb|EMJ16063.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica] Length = 805 Score = 207 bits (526), Expect = 2e-50 Identities = 176/670 (26%), Positives = 289/670 (43%), Gaps = 70/670 (10%) Frame = +1 Query: 154 IEPSHESMQSESDKWG-WKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFT 330 + PS + ++D WK+V K G ++CN+C + GSY RV++HLL Sbjct: 111 LAPSRQLKHGDNDNTPLWKYVKKLEKDGKAGGNTSFQCNYCQKTFKGSYFRVKSHLLKLK 170 Query: 331 GVGVKSCPAIDRS-MREAFQILEEERLARKKKR-------TSGSGKPGKRIRTSQLSLNH 486 G GV SC + S + E +++EE L K + TS + G +S L ++ Sbjct: 171 GNGVASCTKVTNSHLMEMEKVVEEAELRVKMAQLRDVPLPTSNTSSQGGS--SSGLGMSS 228 Query: 487 IWKSISK----------------EDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFG 618 W S SK E +D +AR FY GL+ + +P++ S Sbjct: 229 NWCSDSKKRKGNPIEKAFNNNLREQLDGEIARMFYTGGLSFQFSRNPHYVNAFRIACSKT 288 Query: 619 -PGYESPSVDKLSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMF 795 PGY+ P + L + L KEK IE+ ++ + W L IN+ Sbjct: 289 LPGYQPPGYNMLRTTLLQKEKNNIEEWVSVCSDGWSDAQRRPL-------------INVM 335 Query: 796 VSSPRGLMFFKAVRVNDGDTLENVF-TGAISDAIMEVGSTNVLQIMLNLGHGSESFESLM 972 G MF KA+ +G+ + F + ++I E+G NV+Q++ + ++ ++ Sbjct: 336 AICESGPMFLKAINC-EGECKDKFFMANLLIESIREIGPQNVVQVVTDNAPVCKAAGHIV 394 Query: 973 MPKFPRIFWSPCTSHSIRQLMENIAELDWIKPIVLCAKGIEQCMLTFQRSSPNVFTQD-- 1146 KF IFW+PC H++ ++NI P+ + EQC SS F ++ Sbjct: 395 EAKFKHIFWTPCVVHTLNLALKNICS-----PVPRNPEVYEQCSWISTISSDAWFIKNFI 449 Query: 1147 ------LKQSSDPLSAK--------FAPSYYLVHRIFEIKQALQXXXXXXXXXXXXLMTL 1284 L +D K FA + ++ R ++KQ L+ Sbjct: 450 MNHNMRLSMYNDHCKLKLLSVAETRFASTIVMLRRFKQVKQGLEQMVISEQWDIY----- 504 Query: 1285 SEDNI----SIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMWALE 1452 ED++ +++ IL + FW L P +L + D + +++W +E Sbjct: 505 KEDDVVKARTVKEKILDECFWEDIDYILNFTSPIYEMLRLSDTDMPCLHLIYEWWDSMIE 564 Query: 1453 AIR-------RKGIDDDAL--NQVEVLLESRWDMYFSPLHAAGYILNPRYFGKNQA---- 1593 ++ RK ++++++ N V +L RW +PLH + LNP+Y+ K Sbjct: 565 KVKTIIYRKERKQLNEESMFFNVVHEILVDRWTKSSTPLHCFAHSLNPKYYCKEWLDMAH 624 Query: 1594 ------KDKTAMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDP 1755 KD R K +ER+ S+ RR + E+ +S+ D++ R M P Sbjct: 625 NRCPPHKDIEITRERKQCIERFFSNEVERRAVNEEYASFSACIEDFSGMDSMKDRGFMAP 684 Query: 1756 VAWWENFGFETPHLQTLAVKILCQVSSVGICQ--VSDIPCQEAVNRLKV--ERVEDLVFV 1923 V WW G TP LQT+A+K+L SS C+ S ++ R K+ ER EDLVFV Sbjct: 685 VKWWVIHGASTPKLQTIALKLLGHPSSSSCCERNWSTYNFIHSIKRNKITPERAEDLVFV 744 Query: 1924 QNNLRLHSQR 1953 +NLRL S++ Sbjct: 745 HSNLRLLSRK 754 >ref|XP_006577689.1| PREDICTED: uncharacterized protein LOC102662659 [Glycine max] Length = 847 Score = 205 bits (521), Expect = 9e-50 Identities = 172/658 (26%), Positives = 295/658 (44%), Gaps = 61/658 (9%) Frame = +1 Query: 160 PSHESMQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVG 339 PS Q + K W +V+ G GT KCN C+ +NGSY+RVRAHLL TG G Sbjct: 152 PSQAKEQDDDTKPLWTYVTKIKSV-AGGGTYEIKCNICDFTFNGSYTRVRAHLLKMTGKG 210 Query: 340 VKSCPAIDRSMREAFQILEEE---RLARKKKR--------------TSGSGKPGKRIRTS 468 V+ C + + + ++ E R+ R K + T+ G K+ +TS Sbjct: 211 VRVCQKVTVAKLIDLKKIDNEATLRVERSKTKSVSLPPVSTQHQMDTNTLGVDPKKRKTS 270 Query: 469 QLSLNHIWKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVD 645 S+ + + ++E +D +AR FY+ GL H+A +P++ + A+ GY+ P + Sbjct: 271 --SVENAFNLQARETLDHEIARMFYSSGLPFHLARNPHYRKAFAYAANNQISGYQPPGYN 328 Query: 646 KLSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFF 825 KL + L E+ +E L ++ +W Q G +I+ R S IN V + G MF Sbjct: 329 KLRITLLQNERRHVENLLQPIKNAWSQKGVSIVSDGWSGPQRR-SLINFMVVTESGPMFL 387 Query: 826 KAVRVNDGDTLENVFTGAISDAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSP 1005 KA+ ++ ++ + + IMEVG +NV+QI+ + ++ ++ +FP I+W+P Sbjct: 388 KAIDCSNEIKDKDFIAKHMREVIMEVGHSNVVQIVTDNAAVCKAAGLIIEAEFPSIYWTP 447 Query: 1006 CTSHSIRQLMENIA-------------ELDWIKPIVLCAKGIEQCML--TFQRSSPNVFT 1140 C H++ ++NI E WI I A ++ ++ + + S N F Sbjct: 448 CVVHTLNLALKNICAAKNTEKNNVAYEECSWITQIADDAMFVKNFVMSHSMRLSIFNSFN 507 Query: 1141 QDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXXXXXLMTLSEDNIS----IE 1308 S P +FA + ++ R ++K+ LQ + ED+++ ++ Sbjct: 508 SLKLLSIAP--TRFASTIVMLKRFKQLKKGLQEMVISDQW-----SSYKEDDVAKAKFVK 560 Query: 1309 ASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMWALEAIR------RKG 1470 ++L D +W L P +L + + S + V++ +E ++ + Sbjct: 561 DTLLDDKWWDKVDYILSFTSPIYDVLRRTDTEASSLHLVYEMWDSMIEKVKNAIYQYERN 620 Query: 1471 IDDDALNQVEV---LLESRWDMYFSPLHAAGYILNPRYFGKN----------QAKDKTAM 1611 + + EV +L RW +PLH + LNPRY+ +D Sbjct: 621 EESEGSTFYEVVHSILIDRWTKSSTPLHCLAHSLNPRYYSHEWLSEDSNRVPPHQDMELT 680 Query: 1612 RGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETP 1791 R +R+ D RR + + +++ D D+++ R +MDP AWW G P Sbjct: 681 RERLKCFKRFFLDVDVRRKVNIEFANFSDGREGFDDLDSLNDRGQMDPKAWWLVHGINAP 740 Query: 1792 HLQTLAVKILCQVSSVGICQ-----VSDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQ 1950 LQ +A+K+L Q S C+ S I + N++ R EDLVFV +NLRL S+ Sbjct: 741 ILQKIALKLLAQPCSSSCCERNWSTYSFIHSLKR-NKMTPHRAEDLVFVHSNLRLLSR 797 >ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307174 [Fragaria vesca subsp. vesca] Length = 719 Score = 204 bits (520), Expect = 1e-49 Identities = 158/661 (23%), Positives = 288/661 (43%), Gaps = 62/661 (9%) Frame = +1 Query: 157 EPSHESMQSES-DKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTG 333 EPS ES +S+ D WK+V++ G DK G + CN C + GS+SRV++HLL G Sbjct: 9 EPSVESTKSQRLDAPLWKYVTITSGSDKSGGNVAFTCNFCGGKLTGSHSRVKSHLLRIKG 68 Query: 334 VGVKSCPAIDRSMREAFQIL-----------EEERLARKKKRTSGSG----------KPG 450 GVK P I R Q L + ++A +GSG Sbjct: 69 TGVKIYPTITRDQTVELQALLDHCDQQLNAKAQHKVALPPSSMTGSGISYFPLREREDEV 128 Query: 451 KRIRTSQLSLNHIWKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFGPGYE 630 K+ R L+ ++ + + D VAR FY+ GL +VA +P + E ++AS PGY Sbjct: 129 KKRRGLSPQLSKAFRQEDRRECDASVARLFYSSGLAFNVARNPNYRESY-SLASKIPGYV 187 Query: 631 SPSVDKLSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPR 810 P + L + L EK IE++L ++++W +TG + LC + + INM ++ Sbjct: 188 PPGYNALRTTLLDNEKRHIERTLLPIKKTWKETGVS-LCSDGWTDGQKRPLINMMAAAKD 246 Query: 811 GLMFFKAVRVNDGDTLENVFTGAISDAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPR 990 G M KA+ + + ++I E+G NV+Q++ + S + +++ P Sbjct: 247 GAMMLKAINCEGVTKSKEEIGRLLLESINEIGPENVVQVVTDNAPVSAAAGAIVEITHPH 306 Query: 991 IFWSPCTSHSIRQLMEN-------------IAELDWIKPIVLCAKGIEQCMLTFQRSSPN 1131 IFW+PC H++ +++ + EL W+ + I+ ++ Sbjct: 307 IFWTPCVVHTLNLALKDLLKAKSYLPGETVVEELGWLMEVYNDVWFIKNFVVNHNMRLAM 366 Query: 1132 VFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXXXXXLMTLSEDNISIEA 1311 + +FA + ++ R ++K LQ S+ + ++ Sbjct: 367 YHEHCALRLLQVAPTRFASHFIVLKRFRDVKSGLQQMVISQRWDLYKEDDASKARV-VKE 425 Query: 1312 SILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMWALEAIRRKGIDDDAL- 1488 +L + FW + L P ++ ++D+ + V++W +E +++ + + + Sbjct: 426 MLLKEKFWEQIDFLIALMGPIYEMIRMSDMDRPCLHLVYEWWNSMIEKVKKAVFNPEFVH 485 Query: 1489 ------------NQVEVLLESRWDMYFSPLHAAGYILNPRYFGKN----------QAKDK 1602 + V +L +RW +PLH + LNP+Y+ +D Sbjct: 486 VITEHCDVTRFYDVVYPILTARWTKSCTPLHCLAHSLNPKYYSSQWLEEDPNRVPPHRDA 545 Query: 1603 TAMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGF 1782 + ++ DS R + E+ + + G DA++ + +P+ WW ++G Sbjct: 546 ELNNERRRCFQKLFPDSQTRNKVMEEFARFSLNMGDFSSSDALENKFCFEPLTWWVSYGP 605 Query: 1783 ETPHLQTLAVKILCQVSSVGICQ--VSDIPCQEAV--NRLKVERVEDLVFVQNNLRLHSQ 1950 TP LQ+LA+K+L Q S C+ S + + N+L+ R +DLV+V NLRL ++ Sbjct: 606 STPLLQSLALKLLNQPCSSSCCERNWSTYAFIQGLKRNKLQPRRAQDLVYVHTNLRLLAR 665 Query: 1951 R 1953 + Sbjct: 666 K 666 >ref|XP_002509591.1| DNA binding protein, putative [Ricinus communis] gi|223549490|gb|EEF50978.1| DNA binding protein, putative [Ricinus communis] Length = 670 Score = 200 bits (509), Expect = 2e-48 Identities = 154/614 (25%), Positives = 284/614 (46%), Gaps = 46/614 (7%) Frame = +1 Query: 259 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREAFQIL----EEERLARKKK- 423 +CN+CN ++G R++ HL + C + +R Q + ++++ +K+K Sbjct: 23 RCNYCNREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRNHIQSILSTPKKQKTPKKQKT 82 Query: 424 -------------------RTSGSGKPG---------KRIRTSQLSLNHIWKSISKEDVD 519 SG+ G + + TSQ ++ ++ + + D Sbjct: 83 DQAENGQDNSSSASGGVHPNRGSSGQHGSTCPSLLFSRPLPTSQPVVDDA-QNEKQNNAD 141 Query: 520 DVVARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSL 699 +A FF+ + A S Y+ E+ + VA G GY++PS +KL S L K K I Sbjct: 142 KRIAVFFFHNSIAFSAAKSIYYQEMFDAVAECGQGYKAPSFEKLRSSLLEKVKGDIHDWY 201 Query: 700 ASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGA 879 R+ W +TGCTILC D R S I V+ P+G +F K+V ++ + N Sbjct: 202 RKYRDDWKETGCTILCDGWSD-GRTKSVIVFSVTCPKGTLFLKSVDISGHENDANYLFEL 260 Query: 880 ISDAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDW 1059 + ++EVG NV+Q++ + L+M K+ +FWSPC S+ + +++E+I++ +W Sbjct: 261 LESILLEVGVENVIQVITDSTASYVYAGRLLMAKYSSLFWSPCASYCVNKMLEDISKQEW 320 Query: 1060 IKPIVLCAKGIEQCMLT--FQRSSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQAL 1233 + ++ A I + + + + + FT ++ P ++ S YL R I++ Sbjct: 321 VGTVMEEANTITKYIYSHAWTLNMMRRFTGG-RELIRPRITRYV-SNYLSLRAIVIQEDN 378 Query: 1234 QXXXXXXXXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSV 1413 + + D +++ + D FW H + + EP +++L + D Sbjct: 379 LKHMFSHSEWLSSMHSRRPDAQIVKSFLSQDRFWKFAHEAVSISEPLIKILRIVDGDMPA 438 Query: 1414 MGDVFDWRMWALEAIRR--KGIDDDALNQVEVLLESRWDMYF-SPLHAAGYILNPRYFGK 1584 MG +++ A +I+ KGI+D + E+ ++ RW++ SPLHAA LNP F Sbjct: 439 MGYIYEVLERAKVSIKAYYKGIEDKYMPIWEI-IDRRWNIQLHSPLHAAAAFLNPSIFYN 497 Query: 1585 NQAKDKTAMR-GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVA 1761 K MR G++ + + + + + ++ Y +G+LG + A+ R P Sbjct: 498 QNFKIDLRMRNGFQEAMIKMATSDIDKIEITKEHPIYINGQGALGTDFAIMGRTLNSPGD 557 Query: 1762 WWENFGFETPHLQTLAVKILCQVSSVGICQ----VSDIPCQEAVNRLKVERVEDLVFVQN 1929 WW +G+E P LQ +A+++L Q S C+ + + N+ ++E++ DLVFV Sbjct: 558 WWAGYGYEIPTLQRVAIRLLSQPCSSHWCRWNWSTFESIHTKKRNKAELEKLNDLVFVHC 617 Query: 1930 NL---RLHSQRIGN 1962 NL ++ R+GN Sbjct: 618 NLWLQAIYQSRVGN 631 >ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627361 [Citrus sinensis] Length = 674 Score = 200 bits (508), Expect = 3e-48 Identities = 157/611 (25%), Positives = 280/611 (45%), Gaps = 48/611 (7%) Frame = +1 Query: 259 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREAFQ-ILEEERLARKKKR--- 426 +CN+C ++G R++ HL + C + +R+ Q IL + + KR Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPDDVRDHIQRILSIPKKQKNPKRPKV 82 Query: 427 -----------TSGSGKPGKRIRTS--------QLSLNHIWKSIS----------KEDVD 519 +S SG + R+S L H SI ++D D Sbjct: 83 EKATANGQQNSSSASGGIHQNNRSSGQHGSSCPSLLFRHPSPSIQPIVDDTQKQRQDDTD 142 Query: 520 DVVARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSL 699 +A FF+ + A S Y+ E++N +A G GY +PS +KL + L K K I+ Sbjct: 143 KKIAVFFFHNSIPFSAAKSMYYQEMVNAIAECGVGYIAPSYEKLRSTLLEKVKVDIDDCC 202 Query: 700 ASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGA 879 RE W +TGCTILC N D R S + V+ P+G +F K+V V+ + Sbjct: 203 KKYREEWKETGCTILCDNWSD-ERTKSLVVFSVACPKGTLFLKSVDVSGHEEDATFLFEL 261 Query: 880 ISDAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDW 1059 + +++VG NV+Q++ + L+M K+ +FWSPC ++ I +++E+I++ +W Sbjct: 262 LESVVLDVGVENVIQVITDSAACYVYAGRLLMTKYSSLFWSPCAAYCIDKMLEDISKQEW 321 Query: 1060 IKPIVLCAKGIEQCMLTFQRSSPNVFTQDL-------KQSSDPLSAKFAPSYYLVHRIFE 1218 + ++ AK I + + + +T ++ ++ P +F +Y + I Sbjct: 322 VAMVLEEAKTITKYFYS------HAWTLNMMRKLTGGRELIRPRITRFVANYLSLRSIVI 375 Query: 1219 IKQALQXXXXXXXXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFN 1398 ++ L+ + + D +I++ + D FW H + + EP V++L + Sbjct: 376 HEENLK-HMFSHSEWLSSIYSRRPDAQAIKSLLYLDRFWRSAHEVVSVSEPLVKILRIVD 434 Query: 1399 IDKSVMGDVFDWRMWALEAIRR--KGIDDDALNQVEVLLESRWDMYF-SPLHAAGYILNP 1569 D MG +++ A AI+ KG+++ + + +++ RW+M SPLHAA LNP Sbjct: 435 GDMPAMGYMYEGIERAKLAIQAYYKGVEEKYV-PIWDIIDRRWNMQLHSPLHAAAAFLNP 493 Query: 1570 RYFGKNQAKDKTAMR-GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDK 1746 F K MR G++ + + + + + ++ Y +G+LG + AV R Sbjct: 494 SIFYNPNFKIDLRMRNGFQEAMIKLATADKDKIEITKEHPVYINAQGALGTDFAVLGRKL 553 Query: 1747 MDPVAWWENFGFETPHLQTLAVKILCQVSSV----GICQVSDIPCQEAVNRLKVERVEDL 1914 P WW ++G+E P LQ A++IL Q S + + N++++E+ DL Sbjct: 554 NAPGDWWASYGYEIPTLQRAAIRILSQPCSSYWYRWNWSTFESIHNKKRNKVEMEKFNDL 613 Query: 1915 VFVQNNLRLHS 1947 +FV NLRL + Sbjct: 614 LFVHCNLRLQA 624 >ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris] gi|561034735|gb|ESW33265.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris] Length = 702 Score = 199 bits (506), Expect = 5e-48 Identities = 159/618 (25%), Positives = 272/618 (44%), Gaps = 32/618 (5%) Frame = +1 Query: 199 GWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMRE 378 GWKH G D K+ KC++C+ +G R + HL G T + C ++ +R+ Sbjct: 24 GWKH-----GIDINGNGKKVKCSYCSKTMSGGIFRFKHHLAG-TREDSEPCCSVPEEIRD 77 Query: 379 AF-QILEEERLARKKKR----------------------TSGSGKPGKRIRTSQLSLNHI 489 +I+ E + A KKR + G K G R Q ++N + Sbjct: 78 LMIKIVAEAKQASLKKRKLNIIDEDQGCEGLEERQHIFGSKGKEKVGSR-GAVQATINQM 136 Query: 490 WKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLI 669 K KE+VD VA FFY + +V +P F ++ + +G GY+ PS + L Sbjct: 137 MKKGYKEEVDAQVAEFFYTSAIPFNVIKNPAFTKMCEMIGKYGAGYKPPSYHDIREKLLK 196 Query: 670 KEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVND- 846 + + + L +E W +TGCTI+ D R C N V+SP+G +F ++ +D Sbjct: 197 QAIDKTDLVLQEYKEEWKKTGCTIMSDGWTDKKRRSIC-NFLVNSPKGTVFMYSLDTSDI 255 Query: 847 GDTLENVFTGAISDAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIR 1026 T + VF + D + VG NV+Q++ + ++ L+M K ++W+PC +H I Sbjct: 256 SKTADKVFK-MLDDVVELVGEENVVQVVTDNAANFKAAGELLMQKREHLYWTPCAAHCID 314 Query: 1027 QLMENIAELDWIKPIVLCAKGIEQCMLTFQRSSPNVFTQDLKQSSD---PLSAKFAPSYY 1197 E+ + + + + KG + + RS + + D P +FA +Y Sbjct: 315 LSFEDFEKKLKVHELTI-KKGRKITTYIYGRSMLISMLKKFTKERDLIRPGVTRFATAYL 373 Query: 1198 LVHRIFEIKQALQXXXXXXXXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFV 1377 + + E+K +L T S++ +E IL + FW L++ P + Sbjct: 374 TLGCLHELKASLLTMFSSEEWKTSKFGT-SQEGKKVENMILDNRFWKNISTCLKVAAPLM 432 Query: 1378 RLLATFNID-KSVMGDVFDWRMWALEAIRRK-GIDDDALNQVEVLLESRWD-MYFSPLHA 1548 +L + D K MG +++ A E I+ + +V ++++RWD PLHA Sbjct: 433 VVLRLVDSDAKPAMGFIYEEMDRAKEKIKNNFNHIKKSYEEVWKIIDARWDNQLHRPLHA 492 Query: 1549 AGYILNPR--YFGKNQAKDKTAMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDE 1722 A Y LNP+ Y + ++ D G +++ R D+ RR++ QL Y G+ + Sbjct: 493 AAYYLNPQFHYEPEFRSDDPEVKEGLYTSMRRLVKDAAERRIINVQLVEYHFGRGAFAMD 552 Query: 1723 DAVDCRDKMDPVAWWENFGFETPHLQTLAVKILCQVSSVGICQVSDIPCQEAVNRLKVER 1902 DA + R + P WWE FG+ TP L+ N L ++ Sbjct: 553 DAKESRKTILPGEWWEMFGYRTPELKRR-------------------------NHLHQKK 587 Query: 1903 VEDLVFVQNNLRLHSQRI 1956 + DL++V NL+L +++I Sbjct: 588 MNDLLYVMYNLKLSNKQI 605 >ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101222344 [Cucumis sativus] Length = 673 Score = 198 bits (504), Expect = 8e-48 Identities = 150/603 (24%), Positives = 275/603 (45%), Gaps = 42/603 (6%) Frame = +1 Query: 259 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREAFQIL----EEERLARKKKR 426 +CN+C ++G R++ HL + C + +R+ Q + ++++ +K K Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKV 82 Query: 427 TSGSGKPGKRIRTSQLSLNHIWKS---------------------------ISKEDVDDV 525 + G++ +S H S K++ D Sbjct: 83 DMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETDKK 142 Query: 526 VARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLAS 705 VA FF+ + A S Y+ E+++ +A +G GY++PS +KL + L K K I S Sbjct: 143 VAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKK 202 Query: 706 VRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAIS 885 R+ W +TGCTILC + D + S + + V+ +G +F K+V ++ + + + Sbjct: 203 HRDEWKETGCTILCDSWSD-GQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLE 261 Query: 886 DAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIK 1065 I+EVG NV+QI+ + L+M K+ +FWSPC S+ + Q++E+I++++W+ Sbjct: 262 TIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVS 321 Query: 1066 PIVLCAKGIEQCMLTFQR--SSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQX 1239 ++ AK I + + + ++ FT K+ P +F ++ + I ++ L+ Sbjct: 322 AVLEEAKIITRYIYSHASILNTMRKFTGG-KELIRPRITRFVTNFLSLRSIVILEDNLK- 379 Query: 1240 XXXXXXXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMG 1419 + + D +I + + D FW H + +CEP +R+L + D MG Sbjct: 380 HMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMG 439 Query: 1420 DVFDWRMWALEAIRR--KGIDDDALNQVEVLLESRWDMYF-SPLHAAGYILNP-RYFGKN 1587 +F+ A I+ G +D + E ++ RW++ + LH A LNP ++ N Sbjct: 440 YIFEGIERAKVEIKTYYNGFEDKYMPIWET-IDRRWNLQLHTTLHTAAAFLNPSXFYNPN 498 Query: 1588 QAKDKTAMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWW 1767 D G++ + + + + + + +Y +G+LG + A+ R P WW Sbjct: 499 FKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWW 558 Query: 1768 ENFGFETPHLQTLAVKILCQVSSVGICQVSDIPCQEAV-----NRLKVERVEDLVFVQNN 1932 +G+E P LQ AV+IL Q S C + E + +R + E++ DLVFVQ N Sbjct: 559 SGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCN 618 Query: 1933 LRL 1941 L L Sbjct: 619 LWL 621 >ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222344 [Cucumis sativus] Length = 673 Score = 198 bits (504), Expect = 8e-48 Identities = 150/603 (24%), Positives = 275/603 (45%), Gaps = 42/603 (6%) Frame = +1 Query: 259 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSMREAFQIL----EEERLARKKKR 426 +CN+C ++G R++ HL + C + +R+ Q + ++++ +K K Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRDHIQGILSTPKKQKAPKKPKV 82 Query: 427 TSGSGKPGKRIRTSQLSLNHIWKS---------------------------ISKEDVDDV 525 + G++ +S H S K++ D Sbjct: 83 DMETATNGQQHSSSASGGIHHGSSGQNESNCPSTYPCLSPSAQPPIDDAQKQKKDETDKK 142 Query: 526 VARFFYAEGLNIHVANSPYFYELINTVASFGPGYESPSVDKLSGSFLIKEKARIEKSLAS 705 VA FF+ + A S Y+ E+++ +A +G GY++PS +KL + L K K I S Sbjct: 143 VAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSYKK 202 Query: 706 VRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAIS 885 R+ W +TGCTILC + D + S + + V+ +G +F K+V ++ + + + Sbjct: 203 HRDEWKETGCTILCDSWSD-GQTKSFLVISVTCSKGTLFLKSVDISGHEDDATYLSDLLE 261 Query: 886 DAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIAELDWIK 1065 I+EVG NV+QI+ + L+M K+ +FWSPC S+ + Q++E+I++++W+ Sbjct: 262 TIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEWVS 321 Query: 1066 PIVLCAKGIEQCMLTFQR--SSPNVFTQDLKQSSDPLSAKFAPSYYLVHRIFEIKQALQX 1239 ++ AK I + + + ++ FT K+ P +F ++ + I ++ L+ Sbjct: 322 AVLEEAKIITRYIYSHASILNTMRKFTGG-KELIRPRITRFVTNFLSLRSIVILEDNLK- 379 Query: 1240 XXXXXXXXXXXLMTLSEDNISIEASILGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMG 1419 + + D +I + + D FW H + +CEP +R+L + D MG Sbjct: 380 HMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMG 439 Query: 1420 DVFDWRMWALEAIRR--KGIDDDALNQVEVLLESRWDMYF-SPLHAAGYILNPR-YFGKN 1587 +F+ A I+ G +D + E ++ RW++ + LH A LNP ++ N Sbjct: 440 YIFEGIERAKVEIKTYYNGFEDKYMPIWET-IDRRWNLQLHTTLHTAAAFLNPSVFYNPN 498 Query: 1588 QAKDKTAMRGWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWW 1767 D G++ + + + + + + +Y +G+LG + A+ R P WW Sbjct: 499 FKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWW 558 Query: 1768 ENFGFETPHLQTLAVKILCQVSSVGICQVSDIPCQEAV-----NRLKVERVEDLVFVQNN 1932 +G+E P LQ AV+IL Q S C + E + +R + E++ DLVFVQ N Sbjct: 559 SGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCN 618 Query: 1933 LRL 1941 L L Sbjct: 619 LWL 621 >ref|XP_006603987.1| PREDICTED: uncharacterized protein LOC102660926 [Glycine max] Length = 698 Score = 196 bits (497), Expect = 5e-47 Identities = 168/657 (25%), Positives = 292/657 (44%), Gaps = 61/657 (9%) Frame = +1 Query: 163 SHESMQSESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGV 342 S Q + K W +++ G G KCN C+ +NGSY+RVRAHLL TG GV Sbjct: 7 SQAKEQDDDTKPIWTYITKIKSV-AGGGNYEIKCNICDFTFNGSYTRVRAHLLKMTGKGV 65 Query: 343 KSCPAIDRSMREAFQILEEE---RLARKKKR--------------TSGSGKPGKRIRTSQ 471 + C + + A + ++ + R+ R K + T+ G K+ +TS Sbjct: 66 RVCQKVTVAKLIALKKIDNKATLRVVRSKTKSVSLPPVSTQHQMDTNTLGVDPKKRKTS- 124 Query: 472 LSLNHIWKSISKEDVDDVVARFFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVDK 648 S+ + + ++E +D +AR FY+ GL H+A +P++ + A+ GY+ +K Sbjct: 125 -SVENAFNLQARETLDHEIARMFYSSGLPFHLARNPHYRKTFAYAANNQISGYQPSGYNK 183 Query: 649 LSGSFLIKEKARIEKSLASVRESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFK 828 L + L E+ +E L ++ +W Q G +I+ D R S IN V + G MF K Sbjct: 184 LRTTLLQNERRHVENLLQPIKNAWNQKGVSIVSDGWSDPQRR-SLINFMVVTESGPMFLK 242 Query: 829 AVRVNDGDTLENVFTGAISDAIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPC 1008 A+ ++ ++ + + IMEVG +NV+QI+++ ++ ++ +FP I+W+PC Sbjct: 243 AIDCSNEIKDKDFIAKHMREVIMEVGHSNVVQIVIDNAAVCKAAGLIIEAEFPSIYWTPC 302 Query: 1009 TSHSIRQLMENIA-------------ELDWIKPIVLCAKGIEQCMLTFQRSSPNVFTQDL 1149 H++ ++NI E WI I A ++ +++ ++F Sbjct: 303 VVHTLNLALKNICAAKNTEKNNVAYEECSWITQIADDAMFVKIFIMSHSMRL-SIFNSLK 361 Query: 1150 KQSSDPLSAKFAPSYYLVHRIFEIKQALQXXXXXXXXXXXXLMTLSEDNIS----IEASI 1317 S P +FA + ++ R ++K+ LQ + ED+++ ++ ++ Sbjct: 362 LLSIAP--TRFASTIVMLKRFKQLKKGLQEMVISDQW-----SSYKEDDVAKAKFVKDTL 414 Query: 1318 LGDNFWSGTHLFLQLCEPFVRLLATFNIDKSVMGDVFDWRMW---------ALEAIRRKG 1470 L D +W L P +L + S + V++ MW A+ RK Sbjct: 415 LDDKWWDKVDYILSFTSPIYDVLRRTDTKVSSLHLVYE--MWDSMIEKVKNAIYQYERKE 472 Query: 1471 IDDDA--LNQVEVLLESRWDMYFSPLHAAGYILNPRYFGKN----------QAKDKTAMR 1614 + + V +L RW +PLH + LNPRY+ +D R Sbjct: 473 ESEGSTFYEVVHSILIDRWTKSSTPLHCLAHSLNPRYYSHEWLSEDSNRVPPHQDMELTR 532 Query: 1615 GWKSTLERYESDSGARRVLREQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPH 1794 +R+ D RR + + +++ D D+++ R +MDP AWW P Sbjct: 533 ERLKCFKRFFLDVDVRRKVNIEFANFSDGREGFDDLDSLNDRGQMDPKAWWLVHDINAPI 592 Query: 1795 LQTLAVKILCQVSSVGICQ-----VSDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQ 1950 LQ +A+K+L Q S C+ S I + N++ R E+LVFV +NLRL S+ Sbjct: 593 LQKIALKLLAQPCSSSCCERNWSTYSFIHSLKR-NKMTPHRAENLVFVHSNLRLLSR 648 >ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660479 [Glycine max] Length = 765 Score = 194 bits (493), Expect = 2e-46 Identities = 165/638 (25%), Positives = 280/638 (43%), Gaps = 54/638 (8%) Frame = +1 Query: 202 WKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAID------ 363 W V++ G G + W CN C SYSRV+AHLL G G+ +CP + Sbjct: 21 WSFVTIKEKIGDGGGNRLWSCNFCEKVVKSSYSRVKAHLLRICGSGIDTCPKVTDAYLVY 80 Query: 364 --RSMREAFQILEEER--LARKKKRTSGSGKPGKRIRTSQLSLNHIWKSISKEDVDDVVA 531 R EA IL+ + L K+ + P KR ++S ++ + + + +A Sbjct: 81 LRRVCEEAESILKSKNVPLPTDKRTPTPPTLPPKRRKSS--NIESAFNIEDRNHLRAEIA 138 Query: 532 RFFYAEGLNIHVANSPYFYELINTVASFG-PGYESPSVDKLSGSFLIKEKARIEKSLASV 708 R FY+ L+ H+A +PYF + A+ G+ PS + L S L +E++ IE+ L + Sbjct: 139 RMFYSASLSFHLARNPYFVSSYSFAANCNLSGFLPPSYNALRTSLLQQERSYIERLLQPI 198 Query: 709 RESWPQTGCTILCVNCLDCTRGCSCINMFVSSPRGLMFFKAVRVNDGDTLENVFTGAISD 888 + W G T++ D + IN S G MF KA+ + ++ + D Sbjct: 199 KSLWSLKGVTLVVDGWTD-AQIRPLINFMAISEEGPMFLKAIDGSKEYKDKHYMFDLLKD 257 Query: 889 AIMEVGSTNVLQIMLNLGHGSESFESLMMPKFPRIFWSPCTSHSIRQLMENIA------- 1047 I EVG +V+Q++ + + ++ L+ +FP IFW+PC H++ ++NI Sbjct: 258 VIKEVGPQSVVQVITDNAYVCKAAGLLIEVEFPHIFWTPCVVHTLNLGVKNICAAKNVDG 317 Query: 1048 ------ELDWIKPIVLCAKGIEQCMLT--FQRSSPNVFTQDLKQSSDPLSAKFAPSYYLV 1203 E WI ++ A I+ ++T + + N F+ LK S +FA ++ Sbjct: 318 NENVFNEGGWIAEVIGDASFIKVFIMTHSMRLAIFNEFS-SLKLLS-IAETRFASMIVML 375 Query: 1204 HRIFEIKQALQXXXXXXXXXXXXLMTLSEDNI----SIEASILGDNFWSGTHLFLQLCEP 1371 R+ +K+ LQ + ED++ ++ IL D +W L +P Sbjct: 376 KRLKLLKRCLQNMVISDQWN-----SYREDDVRKAAHVKELILNDIWWDKVDYILSFMDP 430 Query: 1372 FVRLLATFNIDKSVMGDVFDWRMWALEAIRRKGIDDDALNQVEV---------LLESRWD 1524 ++ + + S + V++ +E ++ D + + EV +L SRW Sbjct: 431 IYSMIRICDTNASNLHLVYEMWDSMIEKVKTTIYRHDEVLENEVSTFFEVIHEILNSRWS 490 Query: 1525 MYFSPLHAAGYILNPRYFGKN----------QAKDKTAMRGWKSTLERYESDSGARRVLR 1674 +PLH + LNPRY+ N +D L+RY + R + Sbjct: 491 KSCNPLHCLAHSLNPRYYSDNWLNEVPNRVPPHRDDELSSQRNKCLKRYFPNVNVRTKVY 550 Query: 1675 EQLSSYWRVEGSLGDEDAVDCRDKMDPVAWWENFGFETPHLQTLAVKILCQVSSVGICQ- 1851 E+ S + G G D ++ R +D WW G TP LQ +A+K+L Q S C+ Sbjct: 551 EEFSKFSSCAGDFGSFDIIEDRWALDSKTWWVMHGSSTPILQKVALKLLVQPCSSSCCER 610 Query: 1852 ----VSDIPCQEAVNRLKVERVEDLVFVQNNLRLHSQR 1953 S I + N++ ++ +DLVFV +NLRL S++ Sbjct: 611 NWSTYSFIHSLKR-NKMDPKKAKDLVFVHSNLRLLSRK 647