BLASTX nr result
ID: Akebia22_contig00015914
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00015914 (2472 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score  E
Sequences producing significant alignments:                         (bits) Value

emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera] 840 0.0
ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854... 840 0.0
ref|XP_002527444.1| protein dimerization, putative [Ricinus comm... 803 0.0
ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citr... 770 0.0
ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615... 768 0.0
ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobrom... 263 2e-67
ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prun... 243 2e-61
ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [A... 243 4e-61
ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]... 236 5e-59
ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627... 229 5e-57
ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805... 229 6e-57
ref|XP_002443069.1| hypothetical protein SORBIDRAFT_08g007560 [S... 225 9e-56
ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307... 224 1e-55
ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660... 223 3e-55
ref|XP_002509591.1| DNA binding protein, putative [Ricinus commu... 222 7e-55
ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [A... 220 2e-54
ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 219 5e-54
ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222... 219 5e-54
ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, part... 
218 8e-54 ref|XP_006577689.1| PREDICTED: uncharacterized protein LOC102662... 218 1e-53 >emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera] Length = 635 Score = 840 bits (2171), Expect = 0.0 Identities = 406/597 (68%), Positives = 489/597 (81%) Frame = +1 Query: 13 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 192 MP+ESDKWGWKHVSVFGGF+ +GTKRWKCNHCN+RYNGSYSRVRAHLLGFTGVGVKSCP Sbjct: 1 MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNIRYNGSYSRVRAHLLGFTGVGVKSCP 60 Query: 193 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 372 AIDRSLREAF I EEERL RKKK+ SGK+ KRIR+SQ ++T V K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120 Query: 373 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 552 FFYADGL+FNI+ SPYF +M KAIA+FGPGYEPP+ +KL D FL+KEKA+++KA++ VRE Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180 Query: 553 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 732 SWP TGCTI C+++L T + NIFVSSPRGL+FL+ +DI GDG D++F +VL+ AI Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240 Query: 733 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 912 M+V P NVLQ+I + G +S+ SLI SKF H+FWS CT+H I +LMEDIT+LDW+KP V Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300 Query: 913 SYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1092 AK I++CIL DP+S KFAP+Y +V RI +LKQAL VV S Sbjct: 301 LCAKEIDECILTYQRSSLCVLTLESS----DPLSTKFAPSYCIVERIFELKQALLGVVVS 356 Query: 1093 EEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1272 EEW+QWKL EDVL++E A+LG++FW RA +LQ EPFVRLL +L++++SVMGDV+NW Sbjct: 357 EEWKQWKLTIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416 Query: 1273 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKGQAKDKTVMR 1452 RVQALE V+SK +DD++L QLE+++E++W+MLFSPLHASGYILNP+YFGKGQ+KDKT+MR Sbjct: 417 RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476 Query: 1453 
GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQ 1632 GWKATLDRYESD RRVLREQLSSYWRL+GS GEEDA+DCRDKMDPVAWWENFG ETP Sbjct: 477 GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536 Query: 1633 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 1803 LQTLAIKILSQ+SSV+ +Q +W DN CQ AVN LG ERAEDLVFVRNNLRLHS++ Sbjct: 537 LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERAEDLVFVRNNLRLHSQR 593 >ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854857 [Vitis vinifera] Length = 635 Score = 840 bits (2169), Expect = 0.0 Identities = 406/597 (68%), Positives = 488/597 (81%) Frame = +1 Query: 13 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 192 MP+ESDKWGWKHVSVFGGF+ +GTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP Sbjct: 1 MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 60 Query: 193 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 372 AIDRSLREAF I EEERL RKKK+ SGK+ KRIR+SQ ++T V K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120 Query: 373 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 552 FFYADGL+FNI+ SPYF +M KAIA+FGPGYEPP+ +KL D FL+KEKA+++KA++ VRE Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180 Query: 553 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 732 SWP TGCTI C+++L T + NIFVSSPRGL+FL+ +DI GDG D++F +VL+ AI Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240 Query: 733 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 912 M+V P NVLQ+I + G +S+ SLI SKF H+FWS CT+H I +LMEDIT+LDW+KP V Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300 Query: 913 SYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1092 AK I++CIL DP+S KFAP+Y +V RI +LKQAL VV S Sbjct: 301 LCAKEIDECILTYQRSSLCVLTLESS----DPLSTKFAPSYCIVERIFELKQALLGVVVS 356 Query: 1093 EEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1272 EEW+QWKL EDVL++E A+LG++FW RA +LQ EPFVRLL +L++++SVMGDV+NW Sbjct: 357 
EEWKQWKLTIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416 Query: 1273 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKGQAKDKTVMR 1452 RVQALE V+SK +DD++L QLE+++E++W+MLFSPLHASGYILNP+YFGKGQ+KDKT+MR Sbjct: 417 RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476 Query: 1453 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQ 1632 GWKATLDRYESD RRVLREQLSSYWRL+GS GEEDA+DCRDKMDPVAWWENFG ETP Sbjct: 477 GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536 Query: 1633 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 1803 LQTLAIKILSQ+SSV+ +Q +W DN CQ AVN LG ER EDLVFVRNNLRLHS++ Sbjct: 537 LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERTEDLVFVRNNLRLHSQR 593 >ref|XP_002527444.1| protein dimerization, putative [Ricinus communis] gi|223533179|gb|EEF34936.1| protein dimerization, putative [Ricinus communis] Length = 633 Score = 803 bits (2073), Expect = 0.0 Identities = 392/597 (65%), Positives = 472/597 (79%) Frame = +1 Query: 13 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 192 MPSESDKWGW+HVSVFGGF+ +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 193 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 372 AIDRSLREAF I EEERL RKKKK +GK KR R SQ +I+ K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLVRKKKKNSANGKPGKRTRISQASIS--WKTITKEDVDDIVAR 118 Query: 373 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 552 FFYADGLN +++ SPYFH+M KAI +FG GYE PS+DKL DSFL KEK R++K+++ +RE Sbjct: 119 FFYADGLNIDVVNSPYFHEMVKAIGAFGSGYELPSIDKLSDSFLGKEKGRIEKSLALLRE 178 Query: 553 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 732 SWP TGCTI C+ +LDG + CF+INIFVSSPRGL+FL+ +D++ D D V L+ AI Sbjct: 179 SWPHTGCTILCVGRLDGAIGCFHINIFVSSPRGLIFLKAVDVDDCDEGDHVLAGALSDAI 238 Query: 733 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 912 ++VGP+NVLQ+I H G + K S S I SKFPHIFWS CT+H I +LME+I EL+W+KP V Sbjct: 239 
LEVGPSNVLQIISHLGDACKSSESYILSKFPHIFWSPCTSHSILMLMEEIAELEWVKPIV 298 Query: 913 SYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1092 A+ IEQCI+ D ISAKFAP+Y V RI +L+Q LQEVV S Sbjct: 299 LCARRIEQCIMTYQHATSCIFMQSPKESC-DLISAKFAPSYFFVQRIFELRQTLQEVVVS 357 Query: 1093 EEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1272 E QWK ++V SIE+A+LG+DFW ++HL+LQL EPF++LLG L++D+SV+G VY+W Sbjct: 358 E---QWKHSIGDNVESIESAILGDDFWSKSHLLLQLYEPFIKLLGLLDIDKSVIGAVYDW 414 Query: 1273 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKGQAKDKTVMR 1452 RVQALE +RSK IDD +L QLEV++EN+W++LFSPLHA+GYILNPRY GK Q KDK+VMR Sbjct: 415 RVQALEALRSKAIDDDILNQLEVLIENKWDVLFSPLHATGYILNPRYIGKFQTKDKSVMR 474 Query: 1453 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQ 1632 GWKATL+RYE + ARRVLREQLSSYWRL+GSLG+EDA+DCRDKMDPVAWWENFG ETP Sbjct: 475 GWKATLERYEGESTARRVLREQLSSYWRLEGSLGDEDAVDCRDKMDPVAWWENFGFETPS 534 Query: 1633 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 1803 LQTLAIK+LSQ+SSV Q W N +CQEA N LG +R EDL+FVRNNLRLH +K Sbjct: 535 LQTLAIKVLSQVSSVALCQEIWQTNDFSCQEAANRLGVQRVEDLLFVRNNLRLHYQK 591 >ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citrus clementina] gi|557526284|gb|ESR37590.1| hypothetical protein CICLE_v10028008mg [Citrus clementina] Length = 636 Score = 770 bits (1989), Expect = 0.0 Identities = 380/599 (63%), Positives = 455/599 (75%) Frame = +1 Query: 13 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 192 MPSESDKWGW+HVSVFGGFE +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 193 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 372 AIDRS+RE F I EEER+ RKKK+ K KRIR+ Q +I S KA KEDVD++VAR Sbjct: 61 AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSIVS--KAISKEDVDEMVAR 118 Query: 373 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 552 FFYA GLN N++ SPYF +M ++IA+FG GY+ PS++ L DSFL+KEK +++K ++ VRE Sbjct: 119 
FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178 Query: 553 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 732 SWP TGCTI C+S LDG L CF IFVSSPRGL+FL+ +D++ D +++F VL+ AI Sbjct: 179 SWPHTGCTILCVSSLDGQLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238 Query: 733 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 912 +DVGP NVLQ+I H G + K SL+ SKFPHIF S CT I + ME+I L+W+K V Sbjct: 239 LDVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298 Query: 913 SYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1092 AK IEQ IL D +S K AP+Y V RII+LKQ LQE V S Sbjct: 299 LCAKRIEQHILYYQHAYPCLFPHNLKESS-DQVSTKIAPSYCFVQRIIELKQVLQEAVVS 357 Query: 1093 EEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1272 EE++QWKL P D +E+A+LG+DFWG+AHL LQLCEPFVRLL + ++D+SVMG VY+W Sbjct: 358 EEFKQWKLSMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417 Query: 1273 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKGQAKDKTVMR 1452 R QALE VR K ID L QLEV+ ENRW+ LFSPLHA+GYILNPRYFG+GQ KDKTVMR Sbjct: 418 RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477 Query: 1453 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQ 1632 GWK+TL+RYESD RR+LREQLSSYWRL+GSLGEEDA+D RDKM+PVAWWENFG E Sbjct: 478 GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537 Query: 1633 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKKLV 1809 LQTLAIK+LSQ+SSV Q W DN C+EA N G ER EDL+FVRNNLRLH+++ V Sbjct: 538 LQTLAIKVLSQVSSVAVCQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNV 596 >ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615434 isoform X1 [Citrus sinensis] gi|568863036|ref|XP_006484969.1| PREDICTED: uncharacterized protein LOC102615434 isoform X2 [Citrus sinensis] Length = 636 Score = 768 bits (1982), Expect = 0.0 Identities = 378/599 (63%), Positives = 455/599 (75%) Frame = +1 Query: 13 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 192 MPSESDKWGW+HVSVFGGFE +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 
MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 193 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 372 AIDRS+RE F I EEER+ RKKK+ K KRIR+ Q +I S KA KEDVD++VAR Sbjct: 61 AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSIVS--KAISKEDVDEMVAR 118 Query: 373 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 552 FFYA GLN N++ SPYF +M ++IA+FG GY+ PS++ L DSFL+KEK +++K ++ VRE Sbjct: 119 FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178 Query: 553 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 732 SWP TGCTI C+S LDG L CF IFVSSPRGL+FL+ +D++ D +++F VL+ AI Sbjct: 179 SWPHTGCTILCVSSLDGRLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238 Query: 733 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 912 ++VGP NVLQ+I H G + K SL+ SKFPHIF S CT I + ME+I L+W+K V Sbjct: 239 LEVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298 Query: 913 SYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1092 AK IEQ I+ D +S K AP+Y V RII+LKQ LQE V S Sbjct: 299 LCAKRIEQHIMYYQHAYPCLFPHNLKESS-DQVSTKIAPSYCFVQRIIELKQVLQEAVVS 357 Query: 1093 EEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1272 EE++QWKL P D +E+A+LG+DFWG+AHL LQLCEPFVRLL + ++D+SVMG VY+W Sbjct: 358 EEFKQWKLSMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417 Query: 1273 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKGQAKDKTVMR 1452 R QALE VR K ID L QLEV+ ENRW+ LFSPLHA+GYILNPRYFG+GQ KDKTVMR Sbjct: 418 RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477 Query: 1453 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQ 1632 GWK+TL+RYESD RR+LREQLSSYWRL+GSLGEEDA+D RDKM+PVAWWENFG E Sbjct: 478 GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537 Query: 1633 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKKLV 1809 LQTLAIK+LSQ+SSV Q W DN C+EA N G ER EDL+FVRNNLRLH+++ V Sbjct: 538 LQTLAIKVLSQVSSVAICQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNV 596 >ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobroma 
cacao] gi|508784897|gb|EOY32153.1| Uncharacterized protein TCM_039722 [Theobroma cacao] Length = 381 Score = 263 bits (673), Expect = 2e-67 Identities = 128/213 (60%), Positives = 159/213 (74%) Frame = +1 Query: 1066 QALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDR 1245 +ALQ+VV SEEW+QWK +D+L IEA++LG++FW AH+MLQL +PF +LL L++D+ Sbjct: 146 KALQDVVVSEEWKQWKHSILKDILIIEASILGDEFWSNAHMMLQLFKPFAKLLAMLDIDK 205 Query: 1246 SVMGDVYNWRVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKG 1425 SVMG +Y+WRVQALEVVRSK ID+ L QLEV++EN+W +LFS LHA+GYILNP YFGK Sbjct: 206 SVMGAIYDWRVQALEVVRSKEIDETALNQLEVLIENKWNVLFSLLHAAGYILNPGYFGK- 264 Query: 1426 QAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWW 1605 AR VLR+QLSSYWRL+GS GEEDA+DCRDKMD VAWW Sbjct: 265 -----------------------ARWVLRKQLSSYWRLEGSFGEEDALDCRDKMDLVAWW 301 Query: 1606 ENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 1704 ENFG ETP LQTLAIK+LSQ+S+++ Q W D Sbjct: 302 ENFGFETPHLQTLAIKVLSQVSTISMCQDIWQD 334 Score = 181 bits (459), Expect = 1e-42 Identities = 100/201 (49%), Positives = 117/201 (58%) Frame = +1 Query: 13 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 192 M SE DKWGW+HV+VFG F+ +GTKRWKCNHCNLRYNGSYSRVRAHLL F+GVGVKSC Sbjct: 1 MASEFDKWGWEHVTVFGVFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLRFSGVGVKSCL 60 Query: 193 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 372 AI+R+LREAFHI EEERL R KK T G Sbjct: 61 AINRTLREAFHILEEERLAR--KKKRTFGSGKP--------------------------- 91 Query: 373 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 552 +FG GYEPPS+DKL D FL+KEK R++K+++ VRE Sbjct: 92 -------------------------TFGCGYEPPSMDKLSDCFLSKEKGRIEKSITLVRE 126 Query: 553 SWPLTGCTIFCLSQLDGTLSC 615 SWP TG T+ C+ G L C Sbjct: 127 SWPHTGYTVLCV----GCLGC 143 >ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica] gi|462411014|gb|EMJ16063.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica] Length = 805 Score = 243 bits (621), Expect = 2e-61 Identities = 183/648 (28%), Positives = 297/648 (45%), Gaps = 
60/648 (9%) Frame = +1 Query: 40 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRS-LRE 216 WK+V G ++CN+C + GSY RV++HLL G GV SC + S L E Sbjct: 128 WKYVKKLEKDGKAGGNTSFQCNYCQKTFKGSYFRVKSHLLKLKGNGVASCTKVTNSHLME 187 Query: 217 AFHIQEEERLTRKKKKI-----PTSGKSSKRIRSSQLAITS-------------VGKAFG 342 + EE L K ++ PTS SS+ SS L ++S + KAF Sbjct: 188 MEKVVEEAELRVKMAQLRDVPLPTSNTSSQGGSSSGLGMSSNWCSDSKKRKGNPIEKAFN 247 Query: 343 ---KEDVDDVVARFFYADGLNFNIIKSPYFHDMAK-AIASFGPGYEPPSVDKLLDSFLTK 510 +E +D +AR FY GL+F ++P++ + + A + PGY+PP + L + L K Sbjct: 248 NNLREQLDGEIARMFYTGGLSFQFSRNPHYVNAFRIACSKTLPGYQPPGYNMLRTTLLQK 307 Query: 511 EKARMDKAVSPVRESW------PLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTI 672 EK +++ VS + W PL IN+ G +FL+ I Sbjct: 308 EKNNIEEWVSVCSDGWSDAQRRPL-------------------INVMAICESGPMFLKAI 348 Query: 673 DIEKGDGEDDVF-TEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCT 849 + E G+ +D F +L ++I ++GP NV+QV+ K +G ++ +KF HIFW+ C Sbjct: 349 NCE-GECKDKFFMANLLIESIREIGPQNVVQVVTDNAPVCKAAGHIVEAKFKHIFWTPCV 407 Query: 850 AHCIQLLMEDI-----------TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXX 996 H + L +++I + W+ S A I+ I+ Sbjct: 408 VHTLNLALKNICSPVPRNPEVYEQCSWISTISSDAWFIKNFIM-NHNMRLSMYNDHCKLK 466 Query: 997 XIDPISAKFAPTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWG 1176 + +FA T M+ R ++KQ L+++V SE+W +K +++ +L FW Sbjct: 467 LLSVAETRFASTIVMLRRFKQVKQGLEQMVISEQWDIYKEDDVVKARTVKEKILDECFWE 526 Query: 1177 RAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNWRVQALEVVRS-------KRIDD--MVLK 1329 +L P +L + D + +Y W +E V++ K++++ M Sbjct: 527 DIDYILNFTSPIYEMLRLSDTDMPCLHLIYEWWDSMIEKVKTIIYRKERKQLNEESMFFN 586 Query: 1330 QLEVVLENRWEMLFSPLHASGYILNPRYFGK----------GQAKDKTVMRGWKATLDRY 1479 + +L +RW +PLH + LNP+Y+ K KD + R K ++R+ Sbjct: 587 VVHEILVDRWTKSSTPLHCFAHSLNPKYYCKEWLDMAHNRCPPHKDIEITRERKQCIERF 646 Query: 1480 ESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKIL 1659 S+ + RR + E+ +S+ D+M R M PV WW G+ TP+LQT+A+K+L Sbjct: 647 FSNEVERRAVNEEYASFSACIEDFSGMDSMKDRGFMAPVKWWVIHGASTPKLQTIALKLL 706 Query: 1660 
SQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 1803 SS + + +W N + ERAEDLVFV +NLRL S+K Sbjct: 707 GHPSSSSCCERNWSTYNFIHSIKRNKITPERAEDLVFVHSNLRLLSRK 754 >ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda] gi|548843859|gb|ERN03513.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda] Length = 732 Score = 243 bits (619), Expect = 4e-61 Identities = 168/630 (26%), Positives = 279/630 (44%), Gaps = 42/630 (6%) Frame = +1 Query: 40 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREA 219 W ++ G T G +C C + GSY+RV++HLLG G GVK C ID Sbjct: 35 WAYMEKIGRCHTGGGNWMLRCVLCKAEFKGSYTRVKSHLLGKVGTGVKRCLGIDNETLAT 94 Query: 220 FHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFG---------KEDVDDVVAR 372 +E TRK + S ++ S + + A K+ +D ++AR Sbjct: 95 LLRLNDEGSTRKIRSSSRSSVPLLKVNSGSIGLKKRRGANDLVKLLDLAPKDVLDRMIAR 154 Query: 373 FFYADGLNFNIIKSPYFHDMAK-AIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVR 549 FYA G++ N+I+SPYF DM + A + GY P+ D L S L EKA ++++V P R Sbjct: 155 CFYASGISLNLIRSPYFRDMIRYACENSLEGYVLPTFDNLRTSLLDAEKANIEQSVKPFR 214 Query: 550 ESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKA 729 SW G ++ D T IN +S G +FL+ ID D + + Sbjct: 215 SSWGSRGVSLLTDGWTDTTAKRPLINFMAASDIGSIFLKAIDSSVEMMNTDYMKNLFLEM 274 Query: 730 IMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD----- 894 + +VGP +V+Q+I +++G + P+IFW+ C H + L +++I D Sbjct: 275 VAEVGPTSVVQIITDNSPICRVAGQRVEGMHPYIFWTPCVIHTLNLALKNICSPDDERKA 334 Query: 895 -------WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRI 1053 W++ K I ++ + ++FA T +V RI Sbjct: 335 EKYLHCQWIRDLDRDVKMIRSFVV-DHNAVLTIYSQYPTLRLLSVTESRFASTVIIVKRI 393 Query: 1054 IKLKQALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSL 1233 ++K AL +V W+ E +++ ++ + +W + ++ EP + +L ++ Sbjct: 394 KEVKPALCRMVVDSYWKVLVEEDAEKARRVKSCLVDDLWWEKIEFLIAFTEPILAMLRAI 453 Query: 1234 NVDRSVMGDVYNWRVQALEVVRS-------KRI---DDMVLKQLEVVLENRWEMLFSPLH 1383 + D + +VY+ +E VR K I + + + +L W +PL Sbjct: 454 DTDEPTLHEVYDMWATMIEEVRGIIFRNEGKNIFLNESSFYEDIHRILVGSWNKSKTPLQ 513 
Query: 1384 ASGYILNPRYFGK---GQA-------KDKTVMRGWKATLDRYESDGMARRVLREQLSSYW 1533 + LNP+Y+ G+ KD+ V G R + + E+ + Sbjct: 514 CLAHSLNPKYYSDEWLGEVPSRLPPHKDREVSDGRNVCFARLFPAPSELQKVHEEFEMFS 573 Query: 1534 RLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGS 1713 G G D M R M P++WWENFG+ P+L LA ++LSQ SS + + +W Sbjct: 574 MCKGHFGHWDVMSSRFSMSPISWWENFGAHVPRLAKLADRLLSQPSSSSCCERNWGTFSL 633 Query: 1714 TCQEAVNLLGAERAEDLVFVRNNLRLHSKK 1803 + N L ++RAEDLV+V +NLRL S++ Sbjct: 634 IKKIKQNRLASQRAEDLVYVHSNLRLLSRR 663 >ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao] gi|508777206|gb|EOY24462.1| HAT transposon superfamily [Theobroma cacao] Length = 674 Score = 236 bits (601), Expect = 5e-59 Identities = 165/607 (27%), Positives = 276/607 (45%), Gaps = 40/607 (6%) Frame = +1 Query: 97 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLTRKKKKIPTS 276 +CN+C+ ++G R++ HL + C + +R+ HIQ + KK+K P Sbjct: 23 RCNYCHREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRD--HIQTILN-SPKKQKTPKK 79 Query: 277 GKSSKRIRSSQLAITSV------------------------------------GKAFGKE 348 K K + + Q +S G+ +E Sbjct: 80 PKVDKAVANDQQNSSSASGGLHLNHGSSGQHGSTCPSLLFPRPSPSEQPAVDDGQKQKQE 139 Query: 349 DVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMD 528 D D +A FF+ + + F+ KS Y+ +M AIA G GY+ PS + L + L K K + Sbjct: 140 DADKKIAVFFFHNSIPFSAAKSMYYQEMVDAIAKCGVGYKAPSYENLRSTLLEKVKGDIH 199 Query: 529 KAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVF 708 R+ W TGCTI C S DG F I V+ P+G LFL+++D+ + + Sbjct: 200 DCYKKYRDEWKETGCTILCDSWSDGRTKSFVI-FSVTCPKGTLFLKSVDVSGHEDDASYL 258 Query: 709 TEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITE 888 E+L +++VG NV+QVI S +G L+ +K+ +FWS C ++CI ++EDI++ Sbjct: 259 FELLESVVLEVGLENVIQVITDTAASYVYAGRLLMAKYSSLFWSPCASYCINKMLEDISK 318 Query: 889 LDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQ 1068 +W+ + AK I Q I + + P +F Y + II + Sbjct: 319 QEWVGIVLEEAKSIVQYIYSHAWIVNMMRKFTGGRELMRPRITRFVANYLTLRSIIIQED 378 Query: 1069 
ALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRS 1248 L+ + EW D +I++ + FW AH + + EP V++L ++ D Sbjct: 379 NLKHMFSHSEWLSSIYSRRSDAQAIKSLLYLERFWKSAHEAVSVSEPLVKILRIVDGDMP 438 Query: 1249 VMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYFG 1419 MG +Y +A +++ K +++ + +++ + RW M L SPLHA+ LNP F Sbjct: 439 AMGYIYEGIERAKVAIKAYYKGLEEKYMPIWDII-DRRWNMQLHSPLHAAAAFLNPSIFY 497 Query: 1420 KGQAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPV 1596 K D + G++ + + + + + ++ Y G+LG + A+ R P Sbjct: 498 NPNFKIDLRMRNGFQEAMLKLATTDKDKIEITKEHPMYINAQGALGTDFAIMGRTLNAPG 557 Query: 1597 AWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVR 1776 WW ++G E P LQ +AI+ILSQ S + +W S + N + E+ DLVFV Sbjct: 558 DWWASYGYEIPTLQRVAIRILSQPCSSHWCRWNWSTFESIHTKKRNKVELEKFNDLVFVH 617 Query: 1777 NNLRLHS 1797 NL L + Sbjct: 618 CNLCLQA 624 >ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627361 [Citrus sinensis] Length = 674 Score = 229 bits (584), Expect = 5e-57 Identities = 164/608 (26%), Positives = 275/608 (45%), Gaps = 41/608 (6%) Frame = +1 Query: 97 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLTRKKKKIPTS 276 +CN+C ++G R++ HL + C + +R+ HIQ + +K+K P Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPDDVRD--HIQRILSIPKKQKN-PKR 79 Query: 277 GKSSKRIRSSQLAITSVGKAFGK------------------------------------E 348 K K + Q +S + + Sbjct: 80 PKVEKATANGQQNSSSASGGIHQNNRSSGQHGSSCPSLLFRHPSPSIQPIVDDTQKQRQD 139 Query: 349 DVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMD 528 D D +A FF+ + + F+ KS Y+ +M AIA G GY PS +KL + L K K +D Sbjct: 140 DTDKKIAVFFFHNSIPFSAAKSMYYQEMVNAIAECGVGYIAPSYEKLRSTLLEKVKVDID 199 Query: 529 KAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVF 708 RE W TGCTI C + D + V+ P+G LFL+++D+ G ED F Sbjct: 200 DCCKKYREEWKETGCTILCDNWSDERTKSLVV-FSVACPKGTLFLKSVDVS-GHEEDATF 257 Query: 709 T-EVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDIT 885 E+L ++DVG NV+QVI +G L+ +K+ +FWS C A+CI ++EDI+ Sbjct: 258 
LFELLESVVLDVGVENVIQVITDSAACYVYAGRLLMTKYSSLFWSPCAAYCIDKMLEDIS 317 Query: 886 ELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLK 1065 + +W+ + AK I + + I P +F Y + I+ + Sbjct: 318 KQEWVAMVLEEAKTITKYFYSHAWTLNMMRKLTGGRELIRPRITRFVANYLSLRSIVIHE 377 Query: 1066 QALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDR 1245 + L+ + EW D +I++ + + FW AH ++ + EP V++L ++ D Sbjct: 378 ENLKHMFSHSEWLSSIYSRRPDAQAIKSLLYLDRFWRSAHEVVSVSEPLVKILRIVDGDM 437 Query: 1246 SVMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYF 1416 MG +Y +A +++ K +++ + +++ + RW M L SPLHA+ LNP F Sbjct: 438 PAMGYMYEGIERAKLAIQAYYKGVEEKYVPIWDII-DRRWNMQLHSPLHAAAAFLNPSIF 496 Query: 1417 GKGQAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDP 1593 K D + G++ + + + + + ++ Y G+LG + A+ R P Sbjct: 497 YNPNFKIDLRMRNGFQEAMIKLATADKDKIEITKEHPVYINAQGALGTDFAVLGRKLNAP 556 Query: 1594 VAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFV 1773 WW ++G E P LQ AI+ILSQ S ++ +W S + N + E+ DL+FV Sbjct: 557 GDWWASYGYEIPTLQRAAIRILSQPCSSYWYRWNWSTFESIHNKKRNKVEMEKFNDLLFV 616 Query: 1774 RNNLRLHS 1797 NLRL + Sbjct: 617 HCNLRLQA 624 >ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805582 isoform X1 [Glycine max] gi|571487050|ref|XP_006590550.1| PREDICTED: uncharacterized protein LOC100805582 isoform X2 [Glycine max] Length = 675 Score = 229 bits (583), Expect = 6e-57 Identities = 155/607 (25%), Positives = 267/607 (43%), Gaps = 40/607 (6%) Frame = +1 Query: 97 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLTRKKKKIPTS 276 +CN+C ++G R++ HL + C + +R+ HIQ +K K Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQSILSAPKKPKTPKKQ 80 Query: 277 GKSSKRIRSSQLAITSVGKAFG------------------------------------KE 348 + + Q +S F ++ Sbjct: 81 KTDQATVANGQQNSSSASGGFHHNHGYSGQNGSACPSLLFPNPSPSAQPLEHDAQKQKQD 140 Query: 349 DVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMD 528 D D +A FF+ + + F+ KS Y+ +M A+A G GY+ PS +KL + L K KA + Sbjct: 141 DADRKLAIFFFHNSIPFSAAKSIYYQEMVDAVAQCGVGYKAPSYEKLRSTLLEKVKADIH 200 
Query: 529 KAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVF 708 R+ W TGCT+ C + DG + V+ P+G LFL+++D+ + + Sbjct: 201 SDYKKYRDEWKETGCTVLCDNWSDGRTGSLAV-FSVACPKGTLFLKSVDVSGHENDSTYL 259 Query: 709 TEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITE 888 E+L +++VG NV+QVI S +G L+ +++ +FWS C A+CI ++EDI Sbjct: 260 FELLESVVLEVGAENVVQVITDASASYVCAGRLLIARYSFLFWSPCVAYCIDKMLEDIGR 319 Query: 889 LDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQ 1068 DW+ + AK I Q I + I P +F + + I+ + Sbjct: 320 QDWVGTVLEEAKTITQYIYSHAWILNIMRKFTGGKELIRPKITRFVTNFLSLKSIVMQED 379 Query: 1069 ALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRS 1248 ++ + EW D +I + + + FW AH + + EP V+ L ++ D Sbjct: 380 NIKHMFSHSEWLSSIYRRRPDAQAINSLLYSDRFWKYAHEAVSVSEPLVKCLRMVDGDMP 439 Query: 1249 VMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRY-F 1416 MG VY +A +++ K I++ + +++ + RW M + S LHA+ LNP + Sbjct: 440 AMGYVYEGIERAKVAIKAYYKGIEEKYIPIWDII-DRRWNMQIHSSLHAAAAFLNPSISY 498 Query: 1417 GKGQAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPV 1596 KD + G++ + R + + ++L +Y G+LG + A+ R P Sbjct: 499 NPNFKKDLRMRNGFQEAMLRLAITDKDKMEITKELPTYINAQGALGTDFAVLGRTLNAPG 558 Query: 1597 AWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVR 1776 WW ++G E P LQ A++ILSQ S ++ +W S N + E+ +LVFV Sbjct: 559 DWWASYGYEIPTLQKAAVRILSQPCSSLWYRWNWSTFESIHNRKRNRVELEKFSELVFVH 618 Query: 1777 NNLRLHS 1797 +NL L + Sbjct: 619 SNLWLQT 625 >ref|XP_002443069.1| hypothetical protein SORBIDRAFT_08g007560 [Sorghum bicolor] gi|241943762|gb|EES16907.1| hypothetical protein SORBIDRAFT_08g007560 [Sorghum bicolor] Length = 713 Score = 225 bits (573), Expect = 9e-56 Identities = 164/633 (25%), Positives = 285/633 (45%), Gaps = 45/633 (7%) Frame = +1 Query: 40 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSL--- 210 W HV + G W+C + L Y GSYSR+++HLL +G G+K C A+D+ + Sbjct: 23 WNHVVLLEK-AAAGGNAVWRCKYYKLEYKGSYSRIKSHLLRISGGGIKICTAVDKFILAQ 81 Query: 211 ---REAFHIQEEERLTRKKKKIPTSG-KSSKRIRSSQLAITSVGKAFGKE---DVDDVVA 369 
A E ER K +P +S +R+ + +++ KAF E +D ++ Sbjct: 82 LKSEVAEAADEIERSKAKVIPLPVENVDASNSMRNKRQRSSALEKAFDMETRNQLDAIIG 141 Query: 370 RFFYADGLNFNIIKSPYFHDMAKAIASFG-PGYEPPSVDKLLDSFLTKEKARMDKAVSPV 546 R FY+ G++FNI ++PY+ + + AS GY PPS +KL + L +E+A ++ + + Sbjct: 142 RLFYSGGVSFNIARNPYYRESYRFAASHNLDGYVPPSYNKLRTTLLKQERAHVESLLDRM 201 Query: 547 RESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTK 726 + W G TI C + IN +FL+ ID + E L + Sbjct: 202 KSVWAEKGVTI-CSDGWSDSQRRPLINFIAVCKGKPMFLRAIDASGEEKTKFFIAEKLIQ 260 Query: 727 AIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDIT------- 885 + +VGP NV+Q+I + K +G ++ K+ +IFW+ C H + L +++I Sbjct: 261 VVEEVGPKNVVQIITDNAANCKGAGLIVQQKYDNIFWTPCIVHTLNLALKNICAAKLPRT 320 Query: 886 --------ELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNM 1041 EL W+ A I+ I+ + +FA M Sbjct: 321 EEQEIVYDELHWITLVAGDANMIKNYIM-NHSMRLSMFNEFSKLKLLAVAETRFASVVVM 379 Query: 1042 VWRIIKLKQALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRL 1221 + R + +K+ALQ +V S+ W +K + +L + +W ++ +P + Sbjct: 380 LTRFLMVKRALQRMVISDAWESYKDDNAGTAKHVREKILCSKWWDNVQYIVDFTDPIYEM 439 Query: 1222 LGSLNVDRSVMGDVYN-W-----RVQALEVVRSKRIDD---MVLKQLEVVLENRWEMLFS 1374 L + DR + +Y W +V+ + + K+ +D ++ +L +RW + Sbjct: 440 LRMADTDRPCLHLIYEMWDTMIAKVKKVVYTKEKKNNDEQSTFFSTVQDILLDRWTKSNT 499 Query: 1375 PLHASGYILNPRYF-------GKGQA---KDKTVMRGWKATLDRYESDGMARRVLREQLS 1524 PL + LNPRY+ +G+ KD + ++ G ++++ S Sbjct: 500 PLICLAHSLNPRYYHEKWISENEGREPPHKDLEISVQRMKCFRKFFPVGKDLNQVKDEYS 559 Query: 1525 SYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 1704 + L + D++ R +DP+ WW N G P LQ LA+K+L+Q +S ++ + +W Sbjct: 560 RFATCSEELNDFDSIYDRWILDPLKWWANHGQSIPMLQKLALKLLNQPASSSSCERNWST 619 Query: 1705 NGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 1803 N L E AEDLVF+ NNLRL ++K Sbjct: 620 YSFVHSMLRNKLAPECAEDLVFIHNNLRLLARK 652 >ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307174 [Fragaria vesca subsp. 
vesca] Length = 719 Score = 224 bits (571), Expect = 1e-55 Identities = 157/645 (24%), Positives = 286/645 (44%), Gaps = 57/645 (8%) Frame = +1 Query: 40 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLR-- 213 WK+V++ G + G + CN C + GS+SRV++HLL G GVK P I R Sbjct: 25 WKYVTITSGSDKSGGNVAFTCNFCGGKLTGSHSRVKSHLLRIKGTGVKIYPTITRDQTVE 84 Query: 214 ---------EAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITS-------VGKAFGK 345 + + + + ++ + SG S +R + + + KAF + Sbjct: 85 LQALLDHCDQQLNAKAQHKVALPPSSMTGSGISYFPLREREDEVKKRRGLSPQLSKAFRQ 144 Query: 346 ED---VDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEK 516 ED D VAR FY+ GL FN+ ++P + + + ++AS PGY PP + L + L EK Sbjct: 145 EDRRECDASVARLFYSSGLAFNVARNPNYRE-SYSLASKIPGYVPPGYNALRTTLLDNEK 203 Query: 517 ARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGE 696 +++ + P++++W TG ++ DG IN+ ++ G + L+ I+ E Sbjct: 204 RHIERTLLPIKKTWKETGVSLCSDGWTDGQKRPL-INMMAAAKDGAMMLKAINCEGVTKS 262 Query: 697 DDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLME 876 + +L ++I ++GP NV+QV+ S +G+++ PHIFW+ C H + L ++ Sbjct: 263 KEEIGRLLLESINEIGPENVVQVVTDNAPVSAAAGAIVEITHPHIFWTPCVVHTLNLALK 322 Query: 877 D-------------ITELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISA 1017 D + EL W+ + I+ ++ + Sbjct: 323 DLLKAKSYLPGETVVEELGWLMEVYNDVWFIKNFVV-NHNMRLAMYHEHCALRLLQVAPT 381 Query: 1018 KFAPTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQ 1197 +FA + ++ R +K LQ++V S+ W +K ++ +L FW + ++ Sbjct: 382 RFASHFIVLKRFRDVKSGLQQMVISQRWDLYKEDDASKARVVKEMLLKEKFWEQIDFLIA 441 Query: 1198 LCEPFVRLLGSLNVDRSVMGDVYNWRVQALEVVRSKRIDD----MVLKQLEV-------- 1341 L P ++ ++DR + VY W +E V+ + ++ + +V Sbjct: 442 LMGPIYEMIRMSDMDRPCLHLVYEWWNSMIEKVKKAVFNPEFVHVITEHCDVTRFYDVVY 501 Query: 1342 -VLENRWEMLFSPLHASGYILNPRYFGKGQA----------KDKTVMRGWKATLDRYESD 1488 +L RW +PLH + LNP+Y+ +D + + + D Sbjct: 502 PILTARWTKSCTPLHCLAHSLNPKYYSSQWLEEDPNRVPPHRDAELNNERRRCFQKLFPD 561 Query: 1489 GMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQI 1668 R + E+ + + G DA++ + +P+ WW ++G TP LQ+LA+K+L+Q Sbjct: 562 
SQTRNKVMEEFARFSLNMGDFSSSDALENKFCFEPLTWWVSYGPSTPLLQSLALKLLNQP 621 Query: 1669 SSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 1803 S + + +W N L RA+DLV+V NLRL ++K Sbjct: 622 CSSSCCERNWSTYAFIQGLKRNKLQPRRAQDLVYVHTNLRLLARK 666 >ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660479 [Glycine max] Length = 765 Score = 223 bits (569), Expect = 3e-55 Identities = 168/634 (26%), Positives = 279/634 (44%), Gaps = 46/634 (7%) Frame = +1 Query: 40 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAI-DRSLRE 216 W V++ G + W CN C SYSRV+AHLL G G+ +CP + D L Sbjct: 21 WSFVTIKEKIGDGGGNRLWSCNFCEKVVKSSYSRVKAHLLRICGSGIDTCPKVTDAYLVY 80 Query: 217 AFHIQEEERLTRKKKKIPTSGKSS---------KRIRSSQLAITSVGKAFGKEDVDDV-- 363 + EE K K +P KR +SS ++ AF ED + + Sbjct: 81 LRRVCEEAESILKSKNVPLPTDKRTPTPPTLPPKRRKSS-----NIESAFNIEDRNHLRA 135 Query: 364 -VARFFYADGLNFNIIKSPYFHDMAKAIASFG-PGYEPPSVDKLLDSFLTKEKARMDKAV 537 +AR FY+ L+F++ ++PYF A+ G+ PPS + L S L +E++ +++ + Sbjct: 136 EIARMFYSASLSFHLARNPYFVSSYSFAANCNLSGFLPPSYNALRTSLLQQERSYIERLL 195 Query: 538 SPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEV 717 P++ W L G T+ D + IN S G +FL+ ID K + ++ Sbjct: 196 QPIKSLWSLKGVTLVVDGWTDAQIRPL-INFMAISEEGPMFLKAIDGSKEYKDKHYMFDL 254 Query: 718 LTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDI----- 882 L I +VGP +V+QVI K +G LI +FPHIFW+ C H + L +++I Sbjct: 255 LKDVIKEVGPQSVVQVITDNAYVCKAAGLLIEVEFPHIFWTPCVVHTLNLGVKNICAAKN 314 Query: 883 --------TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYN 1038 E W+ + A I+ I+ + +FA Sbjct: 315 VDGNENVFNEGGWIAEVIGDASFIKVFIMT-HSMRLAIFNEFSSLKLLSIAETRFASMIV 373 Query: 1039 MVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVR 1218 M+ R+ LK+ LQ +V S++W ++ ++ +L + +W + +L +P Sbjct: 374 MLKRLKLLKRCLQNMVISDQWNSYREDDVRKAAHVKELILNDIWWDKVDYILSFMDPIYS 433 Query: 1219 LLGSLNVDRSVMGDVYNWRVQALEVVRSK--RIDDMVLKQLEV-------VLENRWEMLF 1371 ++ + + S + VY +E V++ R D+++ ++ +L +RW Sbjct: 434 MIRICDTNASNLHLVYEMWDSMIEKVKTTIYRHDEVLENEVSTFFEVIHEILNSRWSKSC 493 Query: 1372 
SPLHASGYILNPRYFGKG----------QAKDKTVMRGWKATLDRYESDGMARRVLREQL 1521 +PLH + LNPRY+ +D + L RY + R + E+ Sbjct: 494 NPLHCLAHSLNPRYYSDNWLNEVPNRVPPHRDDELSSQRNKCLKRYFPNVNVRTKVYEEF 553 Query: 1522 SSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWH 1701 S + G G D ++ R +D WW GS TP LQ +A+K+L Q S + + +W Sbjct: 554 SKFSSCAGDFGSFDIIEDRWALDSKTWWVMHGSSTPILQKVALKLLVQPCSSSCCERNWS 613 Query: 1702 DNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 1803 N + ++A+DLVFV +NLRL S+K Sbjct: 614 TYSFIHSLKRNKMDPKKAKDLVFVHSNLRLLSRK 647 >ref|XP_002509591.1| DNA binding protein, putative [Ricinus communis] gi|223549490|gb|EEF50978.1| DNA binding protein, putative [Ricinus communis] Length = 670 Score = 222 bits (565), Expect = 7e-55 Identities = 160/605 (26%), Positives = 272/605 (44%), Gaps = 38/605 (6%) Frame = +1 Query: 97 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQE-----EERLTRKKK 261 +CN+CN ++G R++ HL + C + +R HIQ +++ T KK+ Sbjct: 23 RCNYCNREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRN--HIQSILSTPKKQKTPKKQ 80 Query: 262 KIP------------TSGKSSKRIRSSQLAITSVGKAFGK-----------------EDV 354 K + G R S Q T F + + Sbjct: 81 KTDQAENGQDNSSSASGGVHPNRGSSGQHGSTCPSLLFSRPLPTSQPVVDDAQNEKQNNA 140 Query: 355 DDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKA 534 D +A FF+ + + F+ KS Y+ +M A+A G GY+ PS +KL S L K K + Sbjct: 141 DKRIAVFFFHNSIAFSAAKSIYYQEMFDAVAECGQGYKAPSFEKLRSSLLEKVKGDIHDW 200 Query: 535 VSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTE 714 R+ W TGCTI C DG I V+ P+G LFL+++DI + + + E Sbjct: 201 YRKYRDDWKETGCTILCDGWSDGRTKSV-IVFSVTCPKGTLFLKSVDISGHENDANYLFE 259 Query: 715 VLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD 894 +L +++VG NV+QVI S +G L+ +K+ +FWS C ++C+ ++EDI++ + Sbjct: 260 LLESILLEVGVENVIQVITDSTASYVYAGRLLMAKYSSLFWSPCASYCVNKMLEDISKQE 319 Query: 895 WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQAL 1074 W+ + A I + I + I P ++ Y + I+ + L Sbjct: 320 WVGTVMEEANTITKYIYSHAWTLNMMRRFTGGRELIRPRITRYVSNYLSLRAIVIQEDNL 379 Query: 1075 
QEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVM 1254 + + EW D +++ + + FW AH + + EP +++L ++ D M Sbjct: 380 KHMFSHSEWLSSMHSRRPDAQIVKSFLSQDRFWKFAHEAVSISEPLIKILRIVDGDMPAM 439 Query: 1255 GDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYFGKG 1425 G +Y +A +++ K I+D + E++ + RW + L SPLHA+ LNP F Sbjct: 440 GYIYEVLERAKVSIKAYYKGIEDKYMPIWEII-DRRWNIQLHSPLHAAAAFLNPSIFYNQ 498 Query: 1426 QAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAW 1602 K D + G++ + + + + + + ++ Y G+LG + A+ R P W Sbjct: 499 NFKIDLRMRNGFQEAMIKMATSDIDKIEITKEHPIYINGQGALGTDFAIMGRTLNSPGDW 558 Query: 1603 WENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNN 1782 W +G E P LQ +AI++LSQ S + +W S + N E+ DLVFV N Sbjct: 559 WAGYGYEIPTLQRVAIRLLSQPCSSHWCRWNWSTFESIHTKKRNKAELEKLNDLVFVHCN 618 Query: 1783 LRLHS 1797 L L + Sbjct: 619 LWLQA 623 >ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda] gi|548861623|gb|ERN18994.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda] Length = 863 Score = 220 bits (561), Expect = 2e-54 Identities = 156/603 (25%), Positives = 265/603 (43%), Gaps = 36/603 (5%) Frame = +1 Query: 97 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHI---QEEERLTRKKKKI 267 +CN+C ++G R++ HL + C + +R+ ++ T KK KI Sbjct: 209 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSDVPNDVRDLIQSVLNTPRKQKTPKKPKI 268 Query: 268 PTSGKSSKRIRSSQ----LAITSVGKAFG-------------------------KEDVDD 360 + S S+ L + S G+ +E+ D Sbjct: 269 EQTPNSPHNSSSASGGFHLNVGSSGQRGSTCPSLLFPHPSPSGQPILDDSQRQKQEEADK 328 Query: 361 VVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVS 540 +A FF+ + + F+ KS Y+H M AIA G GY PS D+L + L K K + + Sbjct: 329 KIALFFFHNSIPFSSSKSIYYHGMVDAIADCGVGYRAPSYDRLRTTLLEKVKVEITDSYK 388 Query: 541 PVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVL 720 R+ W +GCTI DG S F I V+ PRG LFL+++D + E+L Sbjct: 389 TYRDEWRESGCTIMSDGWTDGR-SKFLIVFSVACPRGTLFLKSVDASAHVDDAHYLFELL 447 Query: 721 TKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWM 900 +++VG ++QVI + +G 
L+ +K+P +FWS C ++CI ++EDI++ +W+ Sbjct: 448 ESVVLEVGLEYIVQVITDSAANYVYAGRLLTAKYPSLFWSPCASYCIDRMLEDISKQEWV 507 Query: 901 KPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQALQE 1080 + A+ I + I + +F + + I+ + L+ Sbjct: 508 STVIEEARSITKYIYGHSWVLNLMKRFTGGKELLRSRITRFVTHFLSLRSIVIHEDNLKH 567 Query: 1081 VVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGD 1260 + EW D ++ + + + FW A ++ L EP +++L ++ D MG Sbjct: 568 MFSHTEWLSSLYSKKSDAQAVRSLIYLDRFWKSAQEVVNLSEPLIKVLRIVDGDMPAMGY 627 Query: 1261 VYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYFGKGQA 1431 +Y +A +++ K +D + E++ + RW + L SPLHA+ LNP F Sbjct: 628 IYEGIERAKVAIKAYYKGSEDKYMPIWEII-DRRWNLQLHSPLHAAAAFLNPAIFYNPSF 686 Query: 1432 K-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWE 1608 K D + G+ + + + + L ++ Y G+LG + AM R P WW Sbjct: 687 KIDSKIRNGFHEAMMKMVLNDKDKMELTKETPMYINAHGALGNDFAMMARTLNTPGDWWA 746 Query: 1609 NFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLR 1788 +G E P LQ AI+ILSQ S + +W + + N L E+ DLV+V NLR Sbjct: 747 GYGYEVPVLQRAAIRILSQPCSSYWCRWNWGTFENVHTKKRNRLEQEKFNDLVYVHCNLR 806 Query: 1789 LHS 1797 + Sbjct: 807 FQA 809 >ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101222344 [Cucumis sativus] Length = 673 Score = 219 bits (558), Expect = 5e-54 Identities = 159/604 (26%), Positives = 274/604 (45%), Gaps = 39/604 (6%) Frame = +1 Query: 97 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLTRKKKKIP-- 270 +CN+C ++G R++ HL + C + +R+ HIQ T KK+K P Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQGILS-TPKKQKAPKK 79 Query: 271 --------TSGKSSKRIRSSQLAITSVGKAFG------------------------KEDV 354 T+G+ S + S G+ K++ Sbjct: 80 PKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDET 139 Query: 355 DDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKA 534 D VA FF+ + + F+ KS Y+ +M AIA +G GY+ PS +KL + L K K + + Sbjct: 140 DKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSS 199 Query: 535 VSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTE 714 R+ 
W TGCTI C S DG F + I V+ +G LFL+++DI + + ++ Sbjct: 200 YKKHRDEWKETGCTILCDSWSDGQTKSFLV-ISVTCSKGTLFLKSVDISGHEDDATYLSD 258 Query: 715 VLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD 894 +L I++VG NV+Q+I S +G L+ +K+ +FWS C ++C+ ++EDI++++ Sbjct: 259 LLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIE 318 Query: 895 WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQAL 1074 W+ + AK I + I + I P +F + + I+ L+ L Sbjct: 319 WVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNL 378 Query: 1075 QEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVM 1254 + + EW D +I + + + FW AH + +CEP +R+L ++ D M Sbjct: 379 KHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAM 438 Query: 1255 GDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYFGKG 1425 G ++ +A +++ +D + E + + RW + L + LH + LNP F Sbjct: 439 GYIFEGIERAKVEIKTYYNGFEDKYMPIWETI-DRRWNLQLHTTLHTAAAFLNPSXFYNP 497 Query: 1426 QAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAW 1602 K D + G++ + + + + + + +Y G+LG + A+ R P W Sbjct: 498 NFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW 557 Query: 1603 WENFGSETPQLQTLAIKILSQISSVTTFQG-SWHDNGSTCQEAVNLLGAERAEDLVFVRN 1779 W +G E P LQ A++ILSQ S G +W + + + E+ DLVFV+ Sbjct: 558 WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQC 617 Query: 1780 NLRL 1791 NL L Sbjct: 618 NLWL 621 >ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222344 [Cucumis sativus] Length = 673 Score = 219 bits (558), Expect = 5e-54 Identities = 159/604 (26%), Positives = 274/604 (45%), Gaps = 39/604 (6%) Frame = +1 Query: 97 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLTRKKKKIP-- 270 +CN+C ++G R++ HL + C + +R+ HIQ T KK+K P Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQGILS-TPKKQKAPKK 79 Query: 271 --------TSGKSSKRIRSSQLAITSVGKAFG------------------------KEDV 354 T+G+ S + S G+ K++ Sbjct: 80 PKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTYPCLSPSAQPPIDDAQKQKKDET 139 Query: 355 DDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKA 534 
D VA FF+ + + F+ KS Y+ +M AIA +G GY+ PS +KL + L K K + + Sbjct: 140 DKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSS 199 Query: 535 VSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTE 714 R+ W TGCTI C S DG F + I V+ +G LFL+++DI + + ++ Sbjct: 200 YKKHRDEWKETGCTILCDSWSDGQTKSFLV-ISVTCSKGTLFLKSVDISGHEDDATYLSD 258 Query: 715 VLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD 894 +L I++VG NV+Q+I S +G L+ +K+ +FWS C ++C+ ++EDI++++ Sbjct: 259 LLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIE 318 Query: 895 WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFAPTYNMVWRIIKLKQAL 1074 W+ + AK I + I + I P +F + + I+ L+ L Sbjct: 319 WVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNL 378 Query: 1075 QEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVM 1254 + + EW D +I + + + FW AH + +CEP +R+L ++ D M Sbjct: 379 KHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAM 438 Query: 1255 GDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYFGKG 1425 G ++ +A +++ +D + E + + RW + L + LH + LNP F Sbjct: 439 GYIFEGIERAKVEIKTYYNGFEDKYMPIWETI-DRRWNLQLHTTLHTAAAFLNPSVFYNP 497 Query: 1426 QAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAW 1602 K D + G++ + + + + + + +Y G+LG + A+ R P W Sbjct: 498 NFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW 557 Query: 1603 WENFGSETPQLQTLAIKILSQISSVTTFQG-SWHDNGSTCQEAVNLLGAERAEDLVFVRN 1779 W +G E P LQ A++ILSQ S G +W + + + E+ DLVFV+ Sbjct: 558 WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQC 617 Query: 1780 NLRL 1791 NL L Sbjct: 618 NLWL 621 >ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris] gi|561034735|gb|ESW33265.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris] Length = 702 Score = 218 bits (556), Expect = 8e-54 Identities = 167/629 (26%), Positives = 274/629 (43%), Gaps = 32/629 (5%) Frame = +1 Query: 16 PSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPA 195 P GWKH G + K+ KC++C+ +G R + HL G T + C + Sbjct: 
17 PGNRTDVGWKH-----GIDINGNGKKVKCSYCSKTMSGGIFRFKHHLAG-TREDSEPCCS 70 Query: 196 IDRSLREAFH--IQEEERLTRKKKKIP-------------------TSGKSSKRIRSS-Q 309 + +R+ + E ++ + KK+K+ + GK R + Q Sbjct: 71 VPEEIRDLMIKIVAEAKQASLKKRKLNIIDEDQGCEGLEERQHIFGSKGKEKVGSRGAVQ 130 Query: 310 LAITSVGKAFGKEDVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKL 489 I + K KE+VD VA FFY + FN+IK+P F M + I +G GY+PPS + Sbjct: 131 ATINQMMKKGYKEEVDAQVAEFFYTSAIPFNVIKNPAFTKMCEMIGKYGAGYKPPSYHDI 190 Query: 490 LDSFLTKEKARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQT 669 + L + + D + +E W TGCTI D N V+SP+G +F+ + Sbjct: 191 REKLLKQAIDKTDLVLQEYKEEWKKTGCTIMSDGWTDKKRRSI-CNFLVNSPKGTVFMYS 249 Query: 670 IDIEKGDGEDDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCT 849 +D D ++L + VG NV+QV+ + K +G L+ K H++W+ C Sbjct: 250 LDTSDISKTADKVFKMLDDVVELVGEENVVQVVTDNAANFKAAGELLMQKREHLYWTPCA 309 Query: 850 AHCIQLLMEDI-TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXIDPISAKFA 1026 AHCI L ED +L + + + I I I P +FA Sbjct: 310 AHCIDLSFEDFEKKLKVHELTIKKGRKITTYIYGRSMLISMLKKFTKERDLIRPGVTRFA 369 Query: 1027 PTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAHLMLQLCE 1206 Y + + +LK +L + SEEW+ K ++ +E +L N FW L++ Sbjct: 370 TAYLTLGCLHELKASLLTMFSSEEWKTSKFGTSQEGKKVENMILDNRFWKNISTCLKVAA 429 Query: 1207 PFVRLLGSLNVD-RSVMGDVYNWRVQALEVVRS-----KRIDDMVLKQLEVVLENRWE-M 1365 P + +L ++ D + MG +Y +A E +++ K+ + V K +++ RW+ Sbjct: 430 PLMVVLRLVDSDAKPAMGFIYEEMDRAKEKIKNNFNHIKKSYEEVWK----IIDARWDNQ 485 Query: 1366 LFSPLHASGYILNPR--YFGKGQAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRL 1539 L PLHA+ Y LNP+ Y + ++ D V G ++ R D RR++ QL Y Sbjct: 486 LHRPLHAAAYYLNPQFHYEPEFRSDDPEVKEGLYTSMRRLVKDAAERRIINVQLVEYHFG 545 Query: 1540 DGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTC 1719 G+ +DA + R + P WWE FG TP+L+ Sbjct: 546 RGAFAMDDAKESRKTILPGEWWEMFGYRTPELKRR------------------------- 580 Query: 1720 QEAVNLLGAERAEDLVFVRNNLRLHSKKL 1806 N L ++ DL++V NL+L +K++ Sbjct: 581 ----NHLHQKKMNDLLYVMYNLKLSNKQI 605 >ref|XP_006577689.1| PREDICTED: uncharacterized protein LOC102662659 [Glycine max] 
Length = 847 Score = 218 bits (554), Expect = 1e-53 Identities = 171/644 (26%), Positives = 286/644 (44%), Gaps = 57/644 (8%) Frame = +1 Query: 40 WKHV----SVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAID-R 204 W +V SV GG GT KCN C+ +NGSY+RVRAHLL TG GV+ C + Sbjct: 166 WTYVTKIKSVAGG-----GTYEIKCNICDFTFNGSYTRVRAHLLKMTGKGVRVCQKVTVA 220 Query: 205 SLREAFHIQEE-----ERLTRKKKKIP-----------TSGKSSKRIRSSQLAITSVGKA 336 L + I E ER K +P T G K+ ++S SV A Sbjct: 221 KLIDLKKIDNEATLRVERSKTKSVSLPPVSTQHQMDTNTLGVDPKKRKTS-----SVENA 275 Query: 337 FG---KEDVDDVVARFFYADGLNFNIIKSPYFHD-MAKAIASFGPGYEPPSVDKLLDSFL 504 F +E +D +AR FY+ GL F++ ++P++ A A + GY+PP +KL + L Sbjct: 276 FNLQARETLDHEIARMFYSSGLPFHLARNPHYRKAFAYAANNQISGYQPPGYNKLRITLL 335 Query: 505 TKEKARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEK 684 E+ ++ + P++ +W G +I G IN V + G +FL+ ID Sbjct: 336 QNERRHVENLLQPIKNAWSQKGVSIVS-DGWSGPQRRSLINFMVVTESGPMFLKAIDCSN 394 Query: 685 GDGEDDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQ 864 + D + + + IM+VG +NV+Q++ K +G +I ++FP I+W+ C H + Sbjct: 395 EIKDKDFIAKHMREVIMEVGHSNVVQIVTDNAAVCKAAGLIIEAEFPSIYWTPCVVHTLN 454 Query: 865 LLMEDI-------------TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXXID 1005 L +++I E W+ A ++ +++ + Sbjct: 455 LALKNICAAKNTEKNNVAYEECSWITQIADDAMFVKNFVMS-HSMRLSIFNSFNSLKLLS 513 Query: 1006 PISAKFAPTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVLSIEAAVLGNDFWGRAH 1185 +FA T M+ R +LK+ LQE+V S++W +K ++ +L + +W + Sbjct: 514 IAPTRFASTIVMLKRFKQLKKGLQEMVISDQWSSYKEDDVAKAKFVKDTLLDDKWWDKVD 573 Query: 1186 LMLQLCEPFVRLLGSLNVDRSVMGDVYNWRVQALEVVRS---------KRIDDMVLKQLE 1338 +L P +L + + S + VY +E V++ + + + Sbjct: 574 YILSFTSPIYDVLRRTDTEASSLHLVYEMWDSMIEKVKNAIYQYERNEESEGSTFYEVVH 633 Query: 1339 VVLENRWEMLFSPLHASGYILNPRYFGK----------GQAKDKTVMRGWKATLDRYESD 1488 +L +RW +PLH + LNPRY+ +D + R R+ D Sbjct: 634 SILIDRWTKSSTPLHCLAHSLNPRYYSHEWLSEDSNRVPPHQDMELTRERLKCFKRFFLD 693 Query: 1489 GMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQI 1668 RR + + +++ + D+++ R +MDP AWW G P LQ +A+K+L+Q Sbjct: 694 
VDVRRKVNIEFANFSDGREGFDDLDSLNDRGQMDPKAWWLVHGINAPILQKIALKLLAQP 753 Query: 1669 SSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSK 1800 S + + +W N + RAEDLVFV +NLRL S+ Sbjct: 754 CSSSCCERNWSTYSFIHSLKRNKMTPHRAEDLVFVHSNLRLLSR 797