BLASTX nr result
ID: Akebia25_contig00003172
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia25_contig00003172 (2474 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera] 832 0.0 ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854... 831 0.0 ref|XP_002527444.1| protein dimerization, putative [Ricinus comm... 797 0.0 ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citr... 774 0.0 ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615... 771 0.0 ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobrom... 258 1e-65 ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prun... 244 1e-61 ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [A... 242 7e-61 ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]... 238 1e-59 ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805... 232 5e-58 ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627... 231 9e-58 ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307... 227 2e-56 ref|XP_002443069.1| hypothetical protein SORBIDRAFT_08g007560 [S... 225 9e-56 ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660... 224 1e-55 ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 223 3e-55 ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222... 223 3e-55 ref|XP_002509591.1| DNA binding protein, putative [Ricinus commu... 223 4e-55 ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [A... 220 3e-54 ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, part... 219 4e-54 ref|XP_003618961.1| hypothetical protein MTR_6g029340 [Medicago ... 219 4e-54 >emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera] Length = 635 Score = 832 bits (2149), Expect = 0.0 Identities = 403/597 (67%), Positives = 487/597 (81%) Frame = -3 Query: 2454 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2275 MP+ESDKWGWKHVSVFGGF+ +GTKRWKCNHCN+RYNGSYSRVRAHLLGFTGVGVKSCP Sbjct: 1 MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNIRYNGSYSRVRAHLLGFTGVGVKSCP 60 Query: 2274 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2095 AIDRSLREAF I EEERLARKKK+ SGK+ KRIR+SQ ++T V K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120 Query: 2094 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1915 FFYADGL+FNI+ SPYF +M KAIA+FGPGYEPP+ +KL D FL+KEKA+++KA++ VRE Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180 Query: 1914 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1735 SWP TGCTI C+++L T + NIFVSSPRGL+FL+ +DI GDG D++F +VL+ AI Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240 Query: 1734 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1555 M+V P NVLQ+I + G +S+ SLI SKF H+FWS CT+H I +LMEDIT+LDW+KP V Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300 Query: 1554 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1375 AK I++CIL DP+S KFAP+Y +V RI +LKQAL VV S Sbjct: 301 LCAKEIDECILTYQRSSLCVLTLESS----DPLSTKFAPSYCIVERIFELKQALLGVVVS 356 Query: 1374 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1195 EEW+QWKL EDV ++E A+LG++FW RA +LQ EPFVRLL +L++++SVMGDV+NW Sbjct: 357 EEWKQWKLTIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416 Query: 1194 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQAKDKTVMR 1015 RVQALE V+SK +DD++L QLE+++E++W+MLFSPLHA+GYILNP+YFGKGQ+KDKT+MR Sbjct: 417 RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476 Query: 1014 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQ 835 GWKATLDRYESD RRVLREQLSSYWRL+GS GEEDA+D RDKMDPVAWWENFG ETP Sbjct: 477 GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536 Query: 834 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFVRNNLRLHSKK 664 LQTLAIKILSQ+SSV+ +Q +W DN CQ VN LG ERAEDLVFVRNNLRLHS++ Sbjct: 537 LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERAEDLVFVRNNLRLHSQR 593 >ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854857 [Vitis vinifera] Length = 635 Score = 831 bits (2147), Expect = 0.0 Identities = 403/597 (67%), Positives = 486/597 (81%) Frame = -3 Query: 2454 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2275 MP+ESDKWGWKHVSVFGGF+ +GTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP Sbjct: 1 MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 60 Query: 2274 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2095 AIDRSLREAF I EEERLARKKK+ SGK+ KRIR+SQ ++T V K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120 Query: 2094 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1915 FFYADGL+FNI+ SPYF +M KAIA+FGPGYEPP+ +KL D FL+KEKA+++KA++ VRE Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180 Query: 1914 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1735 SWP TGCTI C+++L T + NIFVSSPRGL+FL+ +DI GDG D++F +VL+ AI Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240 Query: 1734 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1555 M+V P NVLQ+I + G +S+ SLI SKF H+FWS CT+H I +LMEDIT+LDW+KP V Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300 Query: 1554 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1375 AK I++CIL DP+S KFAP+Y +V RI +LKQAL VV S Sbjct: 301 LCAKEIDECILTYQRSSLCVLTLESS----DPLSTKFAPSYCIVERIFELKQALLGVVVS 356 Query: 1374 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1195 EEW+QWKL EDV ++E A+LG++FW RA +LQ EPFVRLL +L++++SVMGDV+NW Sbjct: 357 EEWKQWKLTIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416 Query: 1194 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQAKDKTVMR 1015 RVQALE V+SK +DD++L QLE+++E++W+MLFSPLHA+GYILNP+YFGKGQ+KDKT+MR Sbjct: 417 RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476 Query: 1014 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQ 835 GWKATLDRYESD RRVLREQLSSYWRL+GS GEEDA+D RDKMDPVAWWENFG ETP Sbjct: 477 GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536 Query: 834 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFVRNNLRLHSKK 664 LQTLAIKILSQ+SSV+ +Q +W DN CQ VN LG ER EDLVFVRNNLRLHS++ Sbjct: 537 LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERTEDLVFVRNNLRLHSQR 593 >ref|XP_002527444.1| protein dimerization, putative [Ricinus communis] gi|223533179|gb|EEF34936.1| protein dimerization, putative [Ricinus communis] Length = 633 Score = 797 bits (2058), Expect = 0.0 Identities = 390/597 (65%), Positives = 469/597 (78%) Frame = -3 Query: 2454 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2275 MPSESDKWGW+HVSVFGGF+ +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 2274 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2095 AIDRSLREAF I EEERL RKKKK +GK KR R SQ +I+ K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLVRKKKKNSANGKPGKRTRISQASIS--WKTITKEDVDDIVAR 118 Query: 2094 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1915 FFYADGLN +++ SPYFH+M KAI +FG GYE PS+DKL DSFL KEK R++K+++ +RE Sbjct: 119 FFYADGLNIDVVNSPYFHEMVKAIGAFGSGYELPSIDKLSDSFLGKEKGRIEKSLALLRE 178 Query: 1914 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1735 SWP TGCTI C+ +LDG + CF+INIFVSSPRGL+FL+ +D++ D D V L+ AI Sbjct: 179 SWPHTGCTILCVGRLDGAIGCFHINIFVSSPRGLIFLKAVDVDDCDEGDHVLAGALSDAI 238 Query: 1734 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1555 ++VGP+NVLQ+I H G + K S S I SKFPHIFWS CT+H I +LME+I EL+W+KP V Sbjct: 239 LEVGPSNVLQIISHLGDACKSSESYILSKFPHIFWSPCTSHSILMLMEEIAELEWVKPIV 298 Query: 1554 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1375 A+ IEQCI+ D ISAKFAP+Y V RI +L+Q LQEVV S Sbjct: 299 LCARRIEQCIMTYQHATSCIFMQSPKESC-DLISAKFAPSYFFVQRIFELRQTLQEVVVS 357 Query: 1374 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1195 E QWK ++V SIE+A+LG+DFW ++HL+LQL EPF++LLG L++D+SV+G VY+W Sbjct: 358 E---QWKHSIGDNVESIESAILGDDFWSKSHLLLQLYEPFIKLLGLLDIDKSVIGAVYDW 414 Query: 1194 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQAKDKTVMR 1015 RVQALE +RSK IDD +L QLEV++EN+W++LFSPLHA GYILNPRY GK Q KDK+VMR Sbjct: 415 RVQALEALRSKAIDDDILNQLEVLIENKWDVLFSPLHATGYILNPRYIGKFQTKDKSVMR 474 Query: 1014 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQ 835 GWKATL+RYE + ARRVLREQLSSYWRL+GSLG+EDA+D RDKMDPVAWWENFG ETP Sbjct: 475 GWKATLERYEGESTARRVLREQLSSYWRLEGSLGDEDAVDCRDKMDPVAWWENFGFETPS 534 Query: 834 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFVRNNLRLHSKK 664 LQTLAIK+LSQ+SSV Q W N +CQE N LG +R EDL+FVRNNLRLH +K Sbjct: 535 LQTLAIKVLSQVSSVALCQEIWQTNDFSCQEAANRLGVQRVEDLLFVRNNLRLHYQK 591 >ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citrus clementina] gi|557526284|gb|ESR37590.1| hypothetical protein CICLE_v10028008mg [Citrus clementina] Length = 636 Score = 774 bits (1998), Expect = 0.0 Identities = 382/599 (63%), Positives = 457/599 (76%) Frame = -3 Query: 2454 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2275 MPSESDKWGW+HVSVFGGFE +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 2274 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2095 AIDRS+RE F I EEER+ARKKK+ K KRIR+ Q +I S KA KEDVD++VAR Sbjct: 61 AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSIVS--KAISKEDVDEMVAR 118 Query: 2094 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1915 FFYA GLN N++ SPYF +M ++IA+FG GY+ PS++ L DSFL+KEK +++K ++ VRE Sbjct: 119 FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178 Query: 1914 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1735 SWP TGCTI C+S LDG L CF IFVSSPRGL+FL+ +D++ D +++F VL+ AI Sbjct: 179 SWPHTGCTILCVSSLDGQLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238 Query: 1734 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1555 +DVGP NVLQ+I H G + K SL+ SKFPHIF S CT I + ME+I L+W+K V Sbjct: 239 LDVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298 Query: 1554 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1375 AK IEQ IL S D +S K AP+Y V RII+LKQ LQE V S Sbjct: 299 LCAKRIEQHILYYQHAYPCLFPHNLKESS-DQVSTKIAPSYCFVQRIIELKQVLQEAVVS 357 Query: 1374 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1195 EE++QWKL P D +E+A+LG+DFWG+AHL LQLCEPFVRLL + ++D+SVMG VY+W Sbjct: 358 EEFKQWKLSMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417 Query: 1194 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQAKDKTVMR 1015 R QALE VR K ID L QLEV+ ENRW+ LFSPLHAAGYILNPRYFG+GQ KDKTVMR Sbjct: 418 RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477 Query: 1014 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQ 835 GWK+TL+RYESD RR+LREQLSSYWRL+GSLGEEDA+D+RDKM+PVAWWENFG E Sbjct: 478 GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537 Query: 834 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFVRNNLRLHSKKLV 658 LQTLAIK+LSQ+SSV Q W DN C+E N G ER EDL+FVRNNLRLH+++ V Sbjct: 538 LQTLAIKVLSQVSSVAVCQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNV 596 >ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615434 isoform X1 [Citrus sinensis] gi|568863036|ref|XP_006484969.1| PREDICTED: uncharacterized protein LOC102615434 isoform X2 [Citrus sinensis] Length = 636 Score = 771 bits (1991), Expect = 0.0 Identities = 380/599 (63%), Positives = 457/599 (76%) Frame = -3 Query: 2454 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2275 MPSESDKWGW+HVSVFGGFE +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 2274 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2095 AIDRS+RE F I EEER+ARKKK+ K KRIR+ Q +I S KA KEDVD++VAR Sbjct: 61 AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSIVS--KAISKEDVDEMVAR 118 Query: 2094 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1915 FFYA GLN N++ SPYF +M ++IA+FG GY+ PS++ L DSFL+KEK +++K ++ VRE Sbjct: 119 FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178 Query: 1914 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1735 SWP TGCTI C+S LDG L CF IFVSSPRGL+FL+ +D++ D +++F VL+ AI Sbjct: 179 SWPHTGCTILCVSSLDGRLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238 Query: 1734 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1555 ++VGP NVLQ+I H G + K SL+ SKFPHIF S CT I + ME+I L+W+K V Sbjct: 239 LEVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298 Query: 1554 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1375 AK IEQ I+ S D +S K AP+Y V RII+LKQ LQE V S Sbjct: 299 LCAKRIEQHIMYYQHAYPCLFPHNLKESS-DQVSTKIAPSYCFVQRIIELKQVLQEAVVS 357 Query: 1374 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1195 EE++QWKL P D +E+A+LG+DFWG+AHL LQLCEPFVRLL + ++D+SVMG VY+W Sbjct: 358 EEFKQWKLSMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417 Query: 1194 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQAKDKTVMR 1015 R QALE VR K ID L QLEV+ ENRW+ LFSPLHAAGYILNPRYFG+GQ KDKTVMR Sbjct: 418 RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477 Query: 1014 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQ 835 GWK+TL+RYESD RR+LREQLSSYWRL+GSLGEEDA+D+RDKM+PVAWWENFG E Sbjct: 478 GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537 Query: 834 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFVRNNLRLHSKKLV 658 LQTLAIK+LSQ+SSV Q W DN C+E N G ER EDL+FVRNNLRLH+++ V Sbjct: 538 LQTLAIKVLSQVSSVAICQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNV 596 >ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobroma cacao] gi|508784897|gb|EOY32153.1| Uncharacterized protein TCM_039722 [Theobroma cacao] Length = 381 Score = 258 bits (658), Expect = 1e-65 Identities = 127/213 (59%), Positives = 157/213 (73%) Frame = -3 Query: 1401 QALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDR 1222 +ALQ+VV SEEW+QWK +D+ IEA++LG++FW AH+MLQL +PF +LL L++D+ Sbjct: 146 KALQDVVVSEEWKQWKHSILKDILIIEASILGDEFWSNAHMMLQLFKPFAKLLAMLDIDK 205 Query: 1221 SVMGDVYNWRVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKG 1042 SVMG +Y+WRVQALEVVRSK ID+ L QLEV++EN+W +LFS LHAAGYILNP YFGK Sbjct: 206 SVMGAIYDWRVQALEVVRSKEIDETALNQLEVLIENKWNVLFSLLHAAGYILNPGYFGK- 264 Query: 1041 QAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWW 862 AR VLR+QLSSYWRL+GS GEEDA+D RDKMD VAWW Sbjct: 265 -----------------------ARWVLRKQLSSYWRLEGSFGEEDALDCRDKMDLVAWW 301 Query: 861 ENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 763 ENFG ETP LQTLAIK+LSQ+S+++ Q W D Sbjct: 302 ENFGFETPHLQTLAIKVLSQVSTISMCQDIWQD 334 Score = 182 bits (463), Expect = 5e-43 Identities = 101/201 (50%), Positives = 118/201 (58%) Frame = -3 Query: 2454 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2275 M SE DKWGW+HV+VFG F+ +GTKRWKCNHCNLRYNGSYSRVRAHLL F+GVGVKSC Sbjct: 1 MASEFDKWGWEHVTVFGVFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLRFSGVGVKSCL 60 Query: 2274 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2095 AI+R+LREAFHI EEERLAR KK T G Sbjct: 61 AINRTLREAFHILEEERLAR--KKKRTFGSGKP--------------------------- 91 Query: 2094 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1915 +FG GYEPPS+DKL D FL+KEK R++K+++ VRE Sbjct: 92 -------------------------TFGCGYEPPSMDKLSDCFLSKEKGRIEKSITLVRE 126 Query: 1914 SWPLTGCTIFCLSQLDGTLSC 1852 SWP TG T+ C+ G L C Sbjct: 127 SWPHTGYTVLCV----GCLGC 143 >ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica] gi|462411014|gb|EMJ16063.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica] Length = 805 Score = 244 bits (623), Expect = 1e-61 Identities = 183/648 (28%), Positives = 298/648 (45%), Gaps = 60/648 (9%) Frame = -3 Query: 2427 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRS-LRE 2251 WK+V G ++CN+C + GSY RV++HLL G GV SC + S L E Sbjct: 128 WKYVKKLEKDGKAGGNTSFQCNYCQKTFKGSYFRVKSHLLKLKGNGVASCTKVTNSHLME 187 Query: 2250 AFHIQEEERLARKKKKI-----PTSGKSSKRIRSSQLAITS-------------VGKAFG 2125 + EE L K ++ PTS SS+ SS L ++S + KAF Sbjct: 188 MEKVVEEAELRVKMAQLRDVPLPTSNTSSQGGSSSGLGMSSNWCSDSKKRKGNPIEKAFN 247 Query: 2124 ---KEDVDDVVARFFYADGLNFNIIKSPYFHDMAK-AIASFGPGYEPPSVDKLLDSFLTK 1957 +E +D +AR FY GL+F ++P++ + + A + PGY+PP + L + L K Sbjct: 248 NNLREQLDGEIARMFYTGGLSFQFSRNPHYVNAFRIACSKTLPGYQPPGYNMLRTTLLQK 307 Query: 1956 EKARMDKAVSPVRESW------PLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTI 1795 EK +++ VS + W PL IN+ G +FL+ I Sbjct: 308 EKNNIEEWVSVCSDGWSDAQRRPL-------------------INVMAICESGPMFLKAI 348 Query: 1794 DIEKGDGEDDVF-TEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCT 1618 + E G+ +D F +L ++I ++GP NV+QV+ K +G ++ +KF HIFW+ C Sbjct: 349 NCE-GECKDKFFMANLLIESIREIGPQNVVQVVTDNAPVCKAAGHIVEAKFKHIFWTPCV 407 Query: 1617 AHCIQLLMEDIT-----------ELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXX 1471 H + L +++I + W+ S A I+ I+ Sbjct: 408 VHTLNLALKNICSPVPRNPEVYEQCSWISTISSDAWFIKNFIM-NHNMRLSMYNDHCKLK 466 Query: 1470 SIDPISAKFAPTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWG 1291 + +FA T M+ R ++KQ L+++V SE+W +K +++ +L FW Sbjct: 467 LLSVAETRFASTIVMLRRFKQVKQGLEQMVISEQWDIYKEDDVVKARTVKEKILDECFWE 526 Query: 1290 RAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNWRVQALEVVRS-------KRIDD--MVLK 1138 +L P +L + D + +Y W +E V++ K++++ M Sbjct: 527 DIDYILNFTSPIYEMLRLSDTDMPCLHLIYEWWDSMIEKVKTIIYRKERKQLNEESMFFN 586 Query: 1137 QLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQA----------KDKTVMRGWKATLDRY 988 + +L +RW +PLH + LNP+Y+ K KD + R K ++R+ Sbjct: 587 VVHEILVDRWTKSSTPLHCFAHSLNPKYYCKEWLDMAHNRCPPHKDIEITRERKQCIERF 646 Query: 987 ESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKIL 808 S+ + RR + E+ +S+ D+M R M PV WW G+ TP+LQT+A+K+L Sbjct: 647 FSNEVERRAVNEEYASFSACIEDFSGMDSMKDRGFMAPVKWWVIHGASTPKLQTIALKLL 706 Query: 807 SQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFVRNNLRLHSKK 664 SS + + +W + N + ERAEDLVFV +NLRL S+K Sbjct: 707 GHPSSSSCCERNWSTYNFIHSIKRNKITPERAEDLVFVHSNLRLLSRK 754 >ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda] gi|548843859|gb|ERN03513.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda] Length = 732 Score = 242 bits (617), Expect = 7e-61 Identities = 169/633 (26%), Positives = 282/633 (44%), Gaps = 45/633 (7%) Frame = -3 Query: 2427 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREA 2248 W ++ G T G +C C + GSY+RV++HLLG G GVK C ID Sbjct: 35 WAYMEKIGRCHTGGGNWMLRCVLCKAEFKGSYTRVKSHLLGKVGTGVKRCLGIDNETLAT 94 Query: 2247 FHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFG---------KEDVDDVVAR 2095 +E RK + S ++ S + + A K+ +D ++AR Sbjct: 95 LLRLNDEGSTRKIRSSSRSSVPLLKVNSGSIGLKKRRGANDLVKLLDLAPKDVLDRMIAR 154 Query: 2094 FFYADGLNFNIIKSPYFHDMAK-AIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVR 1918 FYA G++ N+I+SPYF DM + A + GY P+ D L S L EKA ++++V P R Sbjct: 155 CFYASGISLNLIRSPYFRDMIRYACENSLEGYVLPTFDNLRTSLLDAEKANIEQSVKPFR 214 Query: 1917 ESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKA 1738 SW G ++ D T IN +S G +FL+ ID D + + Sbjct: 215 SSWGSRGVSLLTDGWTDTTAKRPLINFMAASDIGSIFLKAIDSSVEMMNTDYMKNLFLEM 274 Query: 1737 IMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD----- 1573 + +VGP +V+Q+I +++G + P+IFW+ C H + L +++I D Sbjct: 275 VAEVGPTSVVQIITDNSPICRVAGQRVEGMHPYIFWTPCVIHTLNLALKNICSPDDERKA 334 Query: 1572 -------WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRI 1414 W++ K I ++ + ++FA T +V RI Sbjct: 335 EKYLHCQWIRDLDRDVKMIRSFVV-DHNAVLTIYSQYPTLRLLSVTESRFASTVIIVKRI 393 Query: 1413 IKLKQALQEVVGSEEWRQWKLMYPEDVPS---IEAAVLGNDFWGRAHLMLQLCEPFVRLL 1243 ++K AL +V WK++ ED +++ ++ + +W + ++ EP + +L Sbjct: 394 KEVKPALCRMVVDS---YWKVLVEEDAEKARRVKSCLVDDLWWEKIEFLIAFTEPILAML 450 Query: 1242 GSLNVDRSVMGDVYNWRVQALEVVRS-------KRI---DDMVLKQLEVVLENRWEMLFS 1093 +++ D + +VY+ +E VR K I + + + +L W + Sbjct: 451 RAIDTDEPTLHEVYDMWATMIEEVRGIIFRNEGKNIFLNESSFYEDIHRILVGSWNKSKT 510 Query: 1092 PLHAAGYILNPRYFGK---GQA-------KDKTVMRGWKATLDRYESDGMARRVLREQLS 943 PL + LNP+Y+ G+ KD+ V G R + + E+ Sbjct: 511 PLQCLAHSLNPKYYSDEWLGEVPSRLPPHKDREVSDGRNVCFARLFPAPSELQKVHEEFE 570 Query: 942 SYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 763 + G G D M R M P++WWENFG+ P+L LA ++LSQ SS + + +W Sbjct: 571 MFSMCKGHFGHWDVMSSRFSMSPISWWENFGAHVPRLAKLADRLLSQPSSSSCCERNWGT 630 Query: 762 NGSTCQEEVNLLGAERAEDLVFVRNNLRLHSKK 664 + + N L ++RAEDLV+V +NLRL S++ Sbjct: 631 FSLIKKIKQNRLASQRAEDLVYVHSNLRLLSRR 663 >ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao] gi|508777206|gb|EOY24462.1| HAT transposon superfamily [Theobroma cacao] Length = 674 Score = 238 bits (607), Expect = 1e-59 Identities = 166/607 (27%), Positives = 277/607 (45%), Gaps = 40/607 (6%) Frame = -3 Query: 2370 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLARKKKKIPTS 2191 +CN+C+ ++G R++ HL + C + +R+ HIQ + KK+K P Sbjct: 23 RCNYCHREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRD--HIQTILN-SPKKQKTPKK 79 Query: 2190 GKSSKRIRSSQLAITSV------------------------------------GKAFGKE 2119 K K + + Q +S G+ +E Sbjct: 80 PKVDKAVANDQQNSSSASGGLHLNHGSSGQHGSTCPSLLFPRPSPSEQPAVDDGQKQKQE 139 Query: 2118 DVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMD 1939 D D +A FF+ + + F+ KS Y+ +M AIA G GY+ PS + L + L K K + Sbjct: 140 DADKKIAVFFFHNSIPFSAAKSMYYQEMVDAIAKCGVGYKAPSYENLRSTLLEKVKGDIH 199 Query: 1938 KAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVF 1759 R+ W TGCTI C S DG F I V+ P+G LFL+++D+ + + Sbjct: 200 DCYKKYRDEWKETGCTILCDSWSDGRTKSFVI-FSVTCPKGTLFLKSVDVSGHEDDASYL 258 Query: 1758 TEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITE 1579 E+L +++VG NV+QVI S +G L+ +K+ +FWS C ++CI ++EDI++ Sbjct: 259 FELLESVVLEVGLENVIQVITDTAASYVYAGRLLMAKYSSLFWSPCASYCINKMLEDISK 318 Query: 1578 LDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQ 1399 +W+ + AK I Q I + + P +F Y + II + Sbjct: 319 QEWVGIVLEEAKSIVQYIYSHAWIVNMMRKFTGGRELMRPRITRFVANYLTLRSIIIQED 378 Query: 1398 ALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRS 1219 L+ + EW D +I++ + FW AH + + EP V++L ++ D Sbjct: 379 NLKHMFSHSEWLSSIYSRRSDAQAIKSLLYLERFWKSAHEAVSVSEPLVKILRIVDGDMP 438 Query: 1218 VMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYFG 1048 MG +Y +A +++ K +++ + +++ + RW M L SPLHAA LNP F Sbjct: 439 AMGYIYEGIERAKVAIKAYYKGLEEKYMPIWDII-DRRWNMQLHSPLHAAAAFLNPSIFY 497 Query: 1047 KGQAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPV 871 K D + G++ + + + + + ++ Y G+LG + A+ R P Sbjct: 498 NPNFKIDLRMRNGFQEAMLKLATTDKDKIEITKEHPMYINAQGALGTDFAIMGRTLNAPG 557 Query: 870 AWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFVR 691 WW ++G E P LQ +AI+ILSQ S + +W S ++ N + E+ DLVFV Sbjct: 558 DWWASYGYEIPTLQRVAIRILSQPCSSHWCRWNWSTFESIHTKKRNKVELEKFNDLVFVH 617 Query: 690 NNLRLHS 670 NL L + Sbjct: 618 CNLCLQA 624 >ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805582 isoform X1 [Glycine max] gi|571487050|ref|XP_006590550.1| PREDICTED: uncharacterized protein LOC100805582 isoform X2 [Glycine max] Length = 675 Score = 232 bits (592), Expect = 5e-58 Identities = 160/608 (26%), Positives = 273/608 (44%), Gaps = 41/608 (6%) Frame = -3 Query: 2370 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLARKKKKIPTS 2191 +CN+C ++G R++ HL + C + +R+ HIQ A KK K P Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQSILS-APKKPKTPKK 79 Query: 2190 GKSSKR-IRSSQLAITSVGKAFG------------------------------------K 2122 K+ + + + Q +S F + Sbjct: 80 QKTDQATVANGQQNSSSASGGFHHNHGYSGQNGSACPSLLFPNPSPSAQPLEHDAQKQKQ 139 Query: 2121 EDVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARM 1942 +D D +A FF+ + + F+ KS Y+ +M A+A G GY+ PS +KL + L K KA + Sbjct: 140 DDADRKLAIFFFHNSIPFSAAKSIYYQEMVDAVAQCGVGYKAPSYEKLRSTLLEKVKADI 199 Query: 1941 DKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDV 1762 R+ W TGCT+ C + DG + V+ P+G LFL+++D+ + + Sbjct: 200 HSDYKKYRDEWKETGCTVLCDNWSDGRTGSLAV-FSVACPKGTLFLKSVDVSGHENDSTY 258 Query: 1761 FTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDIT 1582 E+L +++VG NV+QVI S +G L+ +++ +FWS C A+CI ++EDI Sbjct: 259 LFELLESVVLEVGAENVVQVITDASASYVCAGRLLIARYSFLFWSPCVAYCIDKMLEDIG 318 Query: 1581 ELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLK 1402 DW+ + AK I Q I + I P +F + + I+ + Sbjct: 319 RQDWVGTVLEEAKTITQYIYSHAWILNIMRKFTGGKELIRPKITRFVTNFLSLKSIVMQE 378 Query: 1401 QALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDR 1222 ++ + EW D +I + + + FW AH + + EP V+ L ++ D Sbjct: 379 DNIKHMFSHSEWLSSIYRRRPDAQAINSLLYSDRFWKYAHEAVSVSEPLVKCLRMVDGDM 438 Query: 1221 SVMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRY- 1054 MG VY +A +++ K I++ + +++ + RW M + S LHAA LNP Sbjct: 439 PAMGYVYEGIERAKVAIKAYYKGIEEKYIPIWDII-DRRWNMQIHSSLHAAAAFLNPSIS 497 Query: 1053 FGKGQAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDP 874 + KD + G++ + R + + ++L +Y G+LG + A+ R P Sbjct: 498 YNPNFKKDLRMRNGFQEAMLRLAITDKDKMEITKELPTYINAQGALGTDFAVLGRTLNAP 557 Query: 873 VAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFV 694 WW ++G E P LQ A++ILSQ S ++ +W S + N + E+ +LVFV Sbjct: 558 GDWWASYGYEIPTLQKAAVRILSQPCSSLWYRWNWSTFESIHNRKRNRVELEKFSELVFV 617 Query: 693 RNNLRLHS 670 +NL L + Sbjct: 618 HSNLWLQT 625 >ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627361 [Citrus sinensis] Length = 674 Score = 231 bits (590), Expect = 9e-58 Identities = 165/608 (27%), Positives = 276/608 (45%), Gaps = 41/608 (6%) Frame = -3 Query: 2370 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLARKKKKIPTS 2191 +CN+C ++G R++ HL + C + +R+ HIQ + +K+K P Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPDDVRD--HIQRILSIPKKQKN-PKR 79 Query: 2190 GKSSKRIRSSQLAITSVGKAFGK------------------------------------E 2119 K K + Q +S + + Sbjct: 80 PKVEKATANGQQNSSSASGGIHQNNRSSGQHGSSCPSLLFRHPSPSIQPIVDDTQKQRQD 139 Query: 2118 DVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMD 1939 D D +A FF+ + + F+ KS Y+ +M AIA G GY PS +KL + L K K +D Sbjct: 140 DTDKKIAVFFFHNSIPFSAAKSMYYQEMVNAIAECGVGYIAPSYEKLRSTLLEKVKVDID 199 Query: 1938 KAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVF 1759 RE W TGCTI C + D + V+ P+G LFL+++D+ G ED F Sbjct: 200 DCCKKYREEWKETGCTILCDNWSDERTKSLVV-FSVACPKGTLFLKSVDVS-GHEEDATF 257 Query: 1758 T-EVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDIT 1582 E+L ++DVG NV+QVI +G L+ +K+ +FWS C A+CI ++EDI+ Sbjct: 258 LFELLESVVLDVGVENVIQVITDSAACYVYAGRLLMTKYSSLFWSPCAAYCIDKMLEDIS 317 Query: 1581 ELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLK 1402 + +W+ + AK I + + I P +F Y + I+ + Sbjct: 318 KQEWVAMVLEEAKTITKYFYSHAWTLNMMRKLTGGRELIRPRITRFVANYLSLRSIVIHE 377 Query: 1401 QALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDR 1222 + L+ + EW D +I++ + + FW AH ++ + EP V++L ++ D Sbjct: 378 ENLKHMFSHSEWLSSIYSRRPDAQAIKSLLYLDRFWRSAHEVVSVSEPLVKILRIVDGDM 437 Query: 1221 SVMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYF 1051 MG +Y +A +++ K +++ + +++ + RW M L SPLHAA LNP F Sbjct: 438 PAMGYMYEGIERAKLAIQAYYKGVEEKYVPIWDII-DRRWNMQLHSPLHAAAAFLNPSIF 496 Query: 1050 GKGQAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDP 874 K D + G++ + + + + + ++ Y G+LG + A+ R P Sbjct: 497 YNPNFKIDLRMRNGFQEAMIKLATADKDKIEITKEHPVYINAQGALGTDFAVLGRKLNAP 556 Query: 873 VAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFV 694 WW ++G E P LQ AI+ILSQ S ++ +W S ++ N + E+ DL+FV Sbjct: 557 GDWWASYGYEIPTLQRAAIRILSQPCSSYWYRWNWSTFESIHNKKRNKVEMEKFNDLLFV 616 Query: 693 RNNLRLHS 670 NLRL + Sbjct: 617 HCNLRLQA 624 >ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307174 [Fragaria vesca subsp. vesca] Length = 719 Score = 227 bits (579), Expect = 2e-56 Identities = 158/645 (24%), Positives = 288/645 (44%), Gaps = 57/645 (8%) Frame = -3 Query: 2427 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLR-- 2254 WK+V++ G + G + CN C + GS+SRV++HLL G GVK P I R Sbjct: 25 WKYVTITSGSDKSGGNVAFTCNFCGGKLTGSHSRVKSHLLRIKGTGVKIYPTITRDQTVE 84 Query: 2253 ---------EAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITS-------VGKAFGK 2122 + + + + ++A + SG S +R + + + KAF + Sbjct: 85 LQALLDHCDQQLNAKAQHKVALPPSSMTGSGISYFPLREREDEVKKRRGLSPQLSKAFRQ 144 Query: 2121 ED---VDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEK 1951 ED D VAR FY+ GL FN+ ++P + + + ++AS PGY PP + L + L EK Sbjct: 145 EDRRECDASVARLFYSSGLAFNVARNPNYRE-SYSLASKIPGYVPPGYNALRTTLLDNEK 203 Query: 1950 ARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGE 1771 +++ + P++++W TG ++ DG IN+ ++ G + L+ I+ E Sbjct: 204 RHIERTLLPIKKTWKETGVSLCSDGWTDGQKRPL-INMMAAAKDGAMMLKAINCEGVTKS 262 Query: 1770 DDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLME 1591 + +L ++I ++GP NV+QV+ S +G+++ PHIFW+ C H + L ++ Sbjct: 263 KEEIGRLLLESINEIGPENVVQVVTDNAPVSAAAGAIVEITHPHIFWTPCVVHTLNLALK 322 Query: 1590 D-------------ITELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISA 1450 D + EL W+ + I+ ++ + Sbjct: 323 DLLKAKSYLPGETVVEELGWLMEVYNDVWFIKNFVV-NHNMRLAMYHEHCALRLLQVAPT 381 Query: 1449 KFAPTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQ 1270 +FA + ++ R +K LQ++V S+ W +K ++ +L FW + ++ Sbjct: 382 RFASHFIVLKRFRDVKSGLQQMVISQRWDLYKEDDASKARVVKEMLLKEKFWEQIDFLIA 441 Query: 1269 LCEPFVRLLGSLNVDRSVMGDVYNWRVQALEVVRSKRIDD----MVLKQLEV-------- 1126 L P ++ ++DR + VY W +E V+ + ++ + +V Sbjct: 442 LMGPIYEMIRMSDMDRPCLHLVYEWWNSMIEKVKKAVFNPEFVHVITEHCDVTRFYDVVY 501 Query: 1125 -VLENRWEMLFSPLHAAGYILNPRYFGKGQA----------KDKTVMRGWKATLDRYESD 979 +L RW +PLH + LNP+Y+ +D + + + D Sbjct: 502 PILTARWTKSCTPLHCLAHSLNPKYYSSQWLEEDPNRVPPHRDAELNNERRRCFQKLFPD 561 Query: 978 GMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQI 799 R + E+ + + G DA++ + +P+ WW ++G TP LQ+LA+K+L+Q Sbjct: 562 SQTRNKVMEEFARFSLNMGDFSSSDALENKFCFEPLTWWVSYGPSTPLLQSLALKLLNQP 621 Query: 798 SSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFVRNNLRLHSKK 664 S + + +W + N L RA+DLV+V NLRL ++K Sbjct: 622 CSSSCCERNWSTYAFIQGLKRNKLQPRRAQDLVYVHTNLRLLARK 666 >ref|XP_002443069.1| hypothetical protein SORBIDRAFT_08g007560 [Sorghum bicolor] gi|241943762|gb|EES16907.1| hypothetical protein SORBIDRAFT_08g007560 [Sorghum bicolor] Length = 713 Score = 225 bits (573), Expect = 9e-56 Identities = 164/633 (25%), Positives = 285/633 (45%), Gaps = 45/633 (7%) Frame = -3 Query: 2427 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSL--- 2257 W HV + G W+C + L Y GSYSR+++HLL +G G+K C A+D+ + Sbjct: 23 WNHVVLLEK-AAAGGNAVWRCKYYKLEYKGSYSRIKSHLLRISGGGIKICTAVDKFILAQ 81 Query: 2256 ---REAFHIQEEERLARKKKKIPTSG-KSSKRIRSSQLAITSVGKAFGKE---DVDDVVA 2098 A E ER K +P +S +R+ + +++ KAF E +D ++ Sbjct: 82 LKSEVAEAADEIERSKAKVIPLPVENVDASNSMRNKRQRSSALEKAFDMETRNQLDAIIG 141 Query: 2097 RFFYADGLNFNIIKSPYFHDMAKAIASFG-PGYEPPSVDKLLDSFLTKEKARMDKAVSPV 1921 R FY+ G++FNI ++PY+ + + AS GY PPS +KL + L +E+A ++ + + Sbjct: 142 RLFYSGGVSFNIARNPYYRESYRFAASHNLDGYVPPSYNKLRTTLLKQERAHVESLLDRM 201 Query: 1920 RESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTK 1741 + W G TI C + IN +FL+ ID + E L + Sbjct: 202 KSVWAEKGVTI-CSDGWSDSQRRPLINFIAVCKGKPMFLRAIDASGEEKTKFFIAEKLIQ 260 Query: 1740 AIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDI-------- 1585 + +VGP NV+Q+I + K +G ++ K+ +IFW+ C H + L +++I Sbjct: 261 VVEEVGPKNVVQIITDNAANCKGAGLIVQQKYDNIFWTPCIVHTLNLALKNICAAKLPRT 320 Query: 1584 -------TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNM 1426 EL W+ A I+ I+ + +FA M Sbjct: 321 EEQEIVYDELHWITLVAGDANMIKNYIM-NHSMRLSMFNEFSKLKLLAVAETRFASVVVM 379 Query: 1425 VWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRL 1246 + R + +K+ALQ +V S+ W +K + +L + +W ++ +P + Sbjct: 380 LTRFLMVKRALQRMVISDAWESYKDDNAGTAKHVREKILCSKWWDNVQYIVDFTDPIYEM 439 Query: 1245 LGSLNVDRSVMGDVYN-W-----RVQALEVVRSKRIDD---MVLKQLEVVLENRWEMLFS 1093 L + DR + +Y W +V+ + + K+ +D ++ +L +RW + Sbjct: 440 LRMADTDRPCLHLIYEMWDTMIAKVKKVVYTKEKKNNDEQSTFFSTVQDILLDRWTKSNT 499 Query: 1092 PLHAAGYILNPRYF-------GKGQA---KDKTVMRGWKATLDRYESDGMARRVLREQLS 943 PL + LNPRY+ +G+ KD + ++ G ++++ S Sbjct: 500 PLICLAHSLNPRYYHEKWISENEGREPPHKDLEISVQRMKCFRKFFPVGKDLNQVKDEYS 559 Query: 942 SYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 763 + L + D++ R +DP+ WW N G P LQ LA+K+L+Q +S ++ + +W Sbjct: 560 RFATCSEELNDFDSIYDRWILDPLKWWANHGQSIPMLQKLALKLLNQPASSSSCERNWST 619 Query: 762 NGSTCQEEVNLLGAERAEDLVFVRNNLRLHSKK 664 N L E AEDLVF+ NNLRL ++K Sbjct: 620 YSFVHSMLRNKLAPECAEDLVFIHNNLRLLARK 652 >ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660479 [Glycine max] Length = 765 Score = 224 bits (572), Expect = 1e-55 Identities = 168/634 (26%), Positives = 280/634 (44%), Gaps = 46/634 (7%) Frame = -3 Query: 2427 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAI-DRSLRE 2251 W V++ G + W CN C SYSRV+AHLL G G+ +CP + D L Sbjct: 21 WSFVTIKEKIGDGGGNRLWSCNFCEKVVKSSYSRVKAHLLRICGSGIDTCPKVTDAYLVY 80 Query: 2250 AFHIQEEERLARKKKKIPTSGKSS---------KRIRSSQLAITSVGKAFGKEDVDDV-- 2104 + EE K K +P KR +SS ++ AF ED + + Sbjct: 81 LRRVCEEAESILKSKNVPLPTDKRTPTPPTLPPKRRKSS-----NIESAFNIEDRNHLRA 135 Query: 2103 -VARFFYADGLNFNIIKSPYFHDMAKAIASFG-PGYEPPSVDKLLDSFLTKEKARMDKAV 1930 +AR FY+ L+F++ ++PYF A+ G+ PPS + L S L +E++ +++ + Sbjct: 136 EIARMFYSASLSFHLARNPYFVSSYSFAANCNLSGFLPPSYNALRTSLLQQERSYIERLL 195 Query: 1929 SPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEV 1750 P++ W L G T+ D + IN S G +FL+ ID K + ++ Sbjct: 196 QPIKSLWSLKGVTLVVDGWTDAQIRPL-INFMAISEEGPMFLKAIDGSKEYKDKHYMFDL 254 Query: 1749 LTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDI----- 1585 L I +VGP +V+QVI K +G LI +FPHIFW+ C H + L +++I Sbjct: 255 LKDVIKEVGPQSVVQVITDNAYVCKAAGLLIEVEFPHIFWTPCVVHTLNLGVKNICAAKN 314 Query: 1584 --------TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYN 1429 E W+ + A I+ I+ + +FA Sbjct: 315 VDGNENVFNEGGWIAEVIGDASFIKVFIMTHSMRLAIFNEFSSLKL-LSIAETRFASMIV 373 Query: 1428 MVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVR 1249 M+ R+ LK+ LQ +V S++W ++ ++ +L + +W + +L +P Sbjct: 374 MLKRLKLLKRCLQNMVISDQWNSYREDDVRKAAHVKELILNDIWWDKVDYILSFMDPIYS 433 Query: 1248 LLGSLNVDRSVMGDVYNWRVQALEVVRSK--RIDDMVLKQLEV-------VLENRWEMLF 1096 ++ + + S + VY +E V++ R D+++ ++ +L +RW Sbjct: 434 MIRICDTNASNLHLVYEMWDSMIEKVKTTIYRHDEVLENEVSTFFEVIHEILNSRWSKSC 493 Query: 1095 SPLHAAGYILNPRYFGKGQA----------KDKTVMRGWKATLDRYESDGMARRVLREQL 946 +PLH + LNPRY+ +D + L RY + R + E+ Sbjct: 494 NPLHCLAHSLNPRYYSDNWLNEVPNRVPPHRDDELSSQRNKCLKRYFPNVNVRTKVYEEF 553 Query: 945 SSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWH 766 S + G G D ++ R +D WW GS TP LQ +A+K+L Q S + + +W Sbjct: 554 SKFSSCAGDFGSFDIIEDRWALDSKTWWVMHGSSTPILQKVALKLLVQPCSSSCCERNWS 613 Query: 765 DNGSTCQEEVNLLGAERAEDLVFVRNNLRLHSKK 664 + N + ++A+DLVFV +NLRL S+K Sbjct: 614 TYSFIHSLKRNKMDPKKAKDLVFVHSNLRLLSRK 647 >ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101222344 [Cucumis sativus] Length = 673 Score = 223 bits (568), Expect = 3e-55 Identities = 159/603 (26%), Positives = 277/603 (45%), Gaps = 38/603 (6%) Frame = -3 Query: 2370 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQ-----EEERLARKKK 2206 +CN+C ++G R++ HL + C + +R+ HIQ +++ A KK Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQGILSTPKKQKAPKKP 80 Query: 2205 KIP----TSGKSSKRIRSSQLAITSVGKAFG------------------------KEDVD 2110 K+ T+G+ S + S G+ K++ D Sbjct: 81 KVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETD 140 Query: 2109 DVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAV 1930 VA FF+ + + F+ KS Y+ +M AIA +G GY+ PS +KL + L K K + + Sbjct: 141 KKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY 200 Query: 1929 SPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEV 1750 R+ W TGCTI C S DG F + I V+ +G LFL+++DI + + +++ Sbjct: 201 KKHRDEWKETGCTILCDSWSDGQTKSFLV-ISVTCSKGTLFLKSVDISGHEDDATYLSDL 259 Query: 1749 LTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDW 1570 L I++VG NV+Q+I S +G L+ +K+ +FWS C ++C+ ++EDI++++W Sbjct: 260 LETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEW 319 Query: 1569 MKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQ 1390 + + AK I + I + I P +F + + I+ L+ L+ Sbjct: 320 VSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLK 379 Query: 1389 EVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMG 1210 + EW D +I + + + FW AH + +CEP +R+L ++ D MG Sbjct: 380 HMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMG 439 Query: 1209 DVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYFGKGQ 1039 ++ +A +++ +D + E + + RW + L + LH A LNP F Sbjct: 440 YIFEGIERAKVEIKTYYNGFEDKYMPIWETI-DRRWNLQLHTTLHTAAAFLNPSXFYNPN 498 Query: 1038 AK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWW 862 K D + G++ + + + + + + +Y G+LG + A+ R P WW Sbjct: 499 FKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWW 558 Query: 861 ENFGSETPQLQTLAIKILSQISSVTTFQG-SWHDNGSTCQEEVNLLGAERAEDLVFVRNN 685 +G E P LQ A++ILSQ S G +W + ++ + E+ DLVFV+ N Sbjct: 559 SGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCN 618 Query: 684 LRL 676 L L Sbjct: 619 LWL 621 >ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222344 [Cucumis sativus] Length = 673 Score = 223 bits (568), Expect = 3e-55 Identities = 159/603 (26%), Positives = 277/603 (45%), Gaps = 38/603 (6%) Frame = -3 Query: 2370 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQ-----EEERLARKKK 2206 +CN+C ++G R++ HL + C + +R+ HIQ +++ A KK Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQGILSTPKKQKAPKKP 80 Query: 2205 KIP----TSGKSSKRIRSSQLAITSVGKAFG------------------------KEDVD 2110 K+ T+G+ S + S G+ K++ D Sbjct: 81 KVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTYPCLSPSAQPPIDDAQKQKKDETD 140 Query: 2109 DVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAV 1930 VA FF+ + + F+ KS Y+ +M AIA +G GY+ PS +KL + L K K + + Sbjct: 141 KKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY 200 Query: 1929 SPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEV 1750 R+ W TGCTI C S DG F + I V+ +G LFL+++DI + + +++ Sbjct: 201 KKHRDEWKETGCTILCDSWSDGQTKSFLV-ISVTCSKGTLFLKSVDISGHEDDATYLSDL 259 Query: 1749 LTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDW 1570 L I++VG NV+Q+I S +G L+ +K+ +FWS C ++C+ ++EDI++++W Sbjct: 260 LETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEW 319 Query: 1569 MKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQ 1390 + + AK I + I + I P +F + + I+ L+ L+ Sbjct: 320 VSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLK 379 Query: 1389 EVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMG 1210 + EW D +I + + + FW AH + +CEP +R+L ++ D MG Sbjct: 380 HMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMG 439 Query: 1209 DVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYFGKGQ 1039 ++ +A +++ +D + E + + RW + L + LH A LNP F Sbjct: 440 YIFEGIERAKVEIKTYYNGFEDKYMPIWETI-DRRWNLQLHTTLHTAAAFLNPSVFYNPN 498 Query: 1038 AK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWW 862 K D + G++ + + + + + + +Y G+LG + A+ R P WW Sbjct: 499 FKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWW 558 Query: 861 ENFGSETPQLQTLAIKILSQISSVTTFQG-SWHDNGSTCQEEVNLLGAERAEDLVFVRNN 685 +G E P LQ A++ILSQ S G +W + ++ + E+ DLVFV+ N Sbjct: 559 SGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCN 618 Query: 684 LRL 676 L L Sbjct: 619 LWL 621 >ref|XP_002509591.1| DNA binding protein, putative [Ricinus communis] gi|223549490|gb|EEF50978.1| DNA binding protein, putative [Ricinus communis] Length = 670 Score = 223 bits (567), Expect = 4e-55 Identities = 159/605 (26%), Positives = 273/605 (45%), Gaps = 38/605 (6%) Frame = -3 Query: 2370 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQ------EEERLARKK 2209 +CN+CN ++G R++ HL + C + +R HIQ ++++ +K+ Sbjct: 23 RCNYCNREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRN--HIQSILSTPKKQKTPKKQ 80 Query: 2208 K-----------KIPTSGKSSKRIRSSQLAITSVGKAFGK-----------------EDV 2113 K + G R S Q T F + + Sbjct: 81 KTDQAENGQDNSSSASGGVHPNRGSSGQHGSTCPSLLFSRPLPTSQPVVDDAQNEKQNNA 140 Query: 2112 DDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKA 1933 D +A FF+ + + F+ KS Y+ +M A+A G GY+ PS +KL S L K K + Sbjct: 141 DKRIAVFFFHNSIAFSAAKSIYYQEMFDAVAECGQGYKAPSFEKLRSSLLEKVKGDIHDW 200 Query: 1932 VSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTE 1753 R+ W TGCTI C DG I V+ P+G LFL+++DI + + + E Sbjct: 201 YRKYRDDWKETGCTILCDGWSDGRTKSV-IVFSVTCPKGTLFLKSVDISGHENDANYLFE 259 Query: 1752 VLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD 1573 +L +++VG NV+QVI S +G L+ +K+ +FWS C ++C+ ++EDI++ + Sbjct: 260 LLESILLEVGVENVIQVITDSTASYVYAGRLLMAKYSSLFWSPCASYCVNKMLEDISKQE 319 Query: 1572 WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQAL 1393 W+ + A I + I + I P ++ Y + I+ + L Sbjct: 320 WVGTVMEEANTITKYIYSHAWTLNMMRRFTGGRELIRPRITRYVSNYLSLRAIVIQEDNL 379 Query: 1392 QEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVM 1213 + + EW D +++ + + FW AH + + EP +++L ++ D M Sbjct: 380 KHMFSHSEWLSSMHSRRPDAQIVKSFLSQDRFWKFAHEAVSISEPLIKILRIVDGDMPAM 439 Query: 1212 GDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYFGKG 1042 G +Y +A +++ K I+D + E++ + RW + L SPLHAA LNP F Sbjct: 440 GYIYEVLERAKVSIKAYYKGIEDKYMPIWEII-DRRWNIQLHSPLHAAAAFLNPSIFYNQ 498 Query: 1041 QAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAW 865 K D + G++ + + + + + + ++ Y G+LG + A+ R P W Sbjct: 499 NFKIDLRMRNGFQEAMIKMATSDIDKIEITKEHPIYINGQGALGTDFAIMGRTLNSPGDW 558 Query: 864 WENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFVRNN 685 W +G E P LQ +AI++LSQ S + +W S ++ N E+ DLVFV N Sbjct: 559 WAGYGYEIPTLQRVAIRLLSQPCSSHWCRWNWSTFESIHTKKRNKAELEKLNDLVFVHCN 618 Query: 684 LRLHS 670 L L + Sbjct: 619 LWLQA 623 >ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda] gi|548861623|gb|ERN18994.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda] Length = 863 Score = 220 bits (560), Expect = 3e-54 Identities = 156/603 (25%), Positives = 265/603 (43%), Gaps = 36/603 (5%) Frame = -3 Query: 2370 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHI---QEEERLARKKKKI 2200 +CN+C ++G R++ HL + C + +R+ ++ KK KI Sbjct: 209 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSDVPNDVRDLIQSVLNTPRKQKTPKKPKI 268 Query: 2199 PTSGKSSKRIRSSQ----LAITSVGKAFG-------------------------KEDVDD 2107 + S S+ L + S G+ +E+ D Sbjct: 269 EQTPNSPHNSSSASGGFHLNVGSSGQRGSTCPSLLFPHPSPSGQPILDDSQRQKQEEADK 328 Query: 2106 VVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVS 1927 +A FF+ + + F+ KS Y+H M AIA G GY PS D+L + L K K + + Sbjct: 329 KIALFFFHNSIPFSSSKSIYYHGMVDAIADCGVGYRAPSYDRLRTTLLEKVKVEITDSYK 388 Query: 1926 PVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVL 1747 R+ W +GCTI DG S F I V+ PRG LFL+++D + E+L Sbjct: 389 TYRDEWRESGCTIMSDGWTDGR-SKFLIVFSVACPRGTLFLKSVDASAHVDDAHYLFELL 447 Query: 1746 TKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWM 1567 +++VG ++QVI + +G L+ +K+P +FWS C ++CI ++EDI++ +W+ Sbjct: 448 ESVVLEVGLEYIVQVITDSAANYVYAGRLLTAKYPSLFWSPCASYCIDRMLEDISKQEWV 507 Query: 1566 KPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQE 1387 + A+ I + I + +F + + I+ + L+ Sbjct: 508 STVIEEARSITKYIYGHSWVLNLMKRFTGGKELLRSRITRFVTHFLSLRSIVIHEDNLKH 567 Query: 1386 VVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGD 1207 + EW D ++ + + + FW A ++ L EP +++L ++ D MG Sbjct: 568 MFSHTEWLSSLYSKKSDAQAVRSLIYLDRFWKSAQEVVNLSEPLIKVLRIVDGDMPAMGY 627 Query: 1206 VYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYFGKGQA 1036 +Y +A +++ K +D + E++ + RW + L SPLHAA LNP F Sbjct: 628 IYEGIERAKVAIKAYYKGSEDKYMPIWEII-DRRWNLQLHSPLHAAAAFLNPAIFYNPSF 686 Query: 1035 K-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWE 859 K D + G+ + + + + L ++ Y G+LG + AM R P WW Sbjct: 687 KIDSKIRNGFHEAMMKMVLNDKDKMELTKETPMYINAHGALGNDFAMMARTLNTPGDWWA 746 Query: 858 NFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNLLGAERAEDLVFVRNNLR 679 +G E P LQ AI+ILSQ S + +W + ++ N L E+ DLV+V NLR Sbjct: 747 GYGYEVPVLQRAAIRILSQPCSSYWCRWNWGTFENVHTKKRNRLEQEKFNDLVYVHCNLR 806 Query: 678 LHS 670 + Sbjct: 807 FQA 809 >ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris] gi|561034735|gb|ESW33265.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris] Length = 702 Score = 219 bits (559), Expect = 4e-54 Identities = 168/629 (26%), Positives = 274/629 (43%), Gaps = 32/629 (5%) Frame = -3 Query: 2451 PSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPA 2272 P GWKH G + K+ KC++C+ +G R + HL G T + C + Sbjct: 17 PGNRTDVGWKH-----GIDINGNGKKVKCSYCSKTMSGGIFRFKHHLAG-TREDSEPCCS 70 Query: 2271 IDRSLREAFH--IQEEERLARKKKKIP-------------------TSGKSSKRIRSS-Q 2158 + +R+ + E ++ + KK+K+ + GK R + Q Sbjct: 71 VPEEIRDLMIKIVAEAKQASLKKRKLNIIDEDQGCEGLEERQHIFGSKGKEKVGSRGAVQ 130 Query: 2157 LAITSVGKAFGKEDVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKL 1978 I + K KE+VD VA FFY + FN+IK+P F M + I +G GY+PPS + Sbjct: 131 ATINQMMKKGYKEEVDAQVAEFFYTSAIPFNVIKNPAFTKMCEMIGKYGAGYKPPSYHDI 190 Query: 1977 LDSFLTKEKARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQT 1798 + L + + D + +E W TGCTI D N V+SP+G +F+ + Sbjct: 191 REKLLKQAIDKTDLVLQEYKEEWKKTGCTIMSDGWTDKKRRSI-CNFLVNSPKGTVFMYS 249 Query: 1797 IDIEKGDGEDDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCT 1618 +D D ++L + VG NV+QV+ + K +G L+ K H++W+ C Sbjct: 250 LDTSDISKTADKVFKMLDDVVELVGEENVVQVVTDNAANFKAAGELLMQKREHLYWTPCA 309 Query: 1617 AHCIQLLMEDI-TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFA 1441 AHCI L ED +L + + + I I I P +FA Sbjct: 310 AHCIDLSFEDFEKKLKVHELTIKKGRKITTYIYGRSMLISMLKKFTKERDLIRPGVTRFA 369 Query: 1440 PTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCE 1261 Y + + +LK +L + SEEW+ K ++ +E +L N FW L++ Sbjct: 370 TAYLTLGCLHELKASLLTMFSSEEWKTSKFGTSQEGKKVENMILDNRFWKNISTCLKVAA 429 Query: 1260 PFVRLLGSLNVD-RSVMGDVYNWRVQALEVVRS-----KRIDDMVLKQLEVVLENRWE-M 1102 P + +L ++ D + MG +Y +A E +++ K+ + V K +++ RW+ Sbjct: 430 PLMVVLRLVDSDAKPAMGFIYEEMDRAKEKIKNNFNHIKKSYEEVWK----IIDARWDNQ 485 Query: 1101 LFSPLHAAGYILNPR--YFGKGQAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRL 928 L PLHAA Y LNP+ Y + ++ D V G ++ R D RR++ QL Y Sbjct: 486 LHRPLHAAAYYLNPQFHYEPEFRSDDPEVKEGLYTSMRRLVKDAAERRIINVQLVEYHFG 545 Query: 927 DGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTC 748 G+ +DA + R + P WWE FG TP+L Sbjct: 546 RGAFAMDDAKESRKTILPGEWWEMFGYRTPEL---------------------------- 577 Query: 747 QEEVNLLGAERAEDLVFVRNNLRLHSKKL 661 + N L ++ DL++V NL+L +K++ Sbjct: 578 -KRRNHLHQKKMNDLLYVMYNLKLSNKQI 605 >ref|XP_003618961.1| hypothetical protein MTR_6g029340 [Medicago truncatula] gi|355493976|gb|AES75179.1| hypothetical protein MTR_6g029340 [Medicago truncatula] Length = 725 Score = 219 bits (559), Expect = 4e-54 Identities = 168/622 (27%), Positives = 274/622 (44%), Gaps = 33/622 (5%) Frame = -3 Query: 2430 GWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLRE 2251 GWK+ G + ++ KC+ C +G R + HL G + D E Sbjct: 35 GWKY-----GTDVNGDARKVKCSFCAKVISGGVYRFKHHLAGTSDDSGPCAQVSDEVKME 89 Query: 2250 AFH-IQEEERLARKKKKIPTSGKSS---------------KRIRS------SQLAITSVG 2137 + E A +K+K+ + + +++R +Q I ++ Sbjct: 90 MLKWVATLEEAAERKRKMAEIAQGNVTEDPAFEVEVSQHLQKVRGKASASGTQTKIDAIA 149 Query: 2136 KAFGKEDVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTK 1957 K K + DD VA FFY + FN I++P F M AI +GP Y+PPS + D L + Sbjct: 150 KKPLKVEADDAVAEFFYTSAIAFNCIRNPAFAKMCVAIGKYGPDYKPPSYRDISDKLLVR 209 Query: 1956 EKARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGD 1777 R ++ V +E W TGC+I D N V+SP+G +FL ++D Sbjct: 210 AVDRTNEIVDKFKEEWKTTGCSIMSDGWTDRKRRSI-CNFMVNSPKGTVFLYSLDTSDIS 268 Query: 1776 GEDDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLL 1597 D ++L + VG NV+QV+ + K G L+ K +FW+ C AHCI L+ Sbjct: 269 KTADKVFKMLDDVVEAVGEDNVIQVVTDNAANFKAGGELLMLKRTKLFWTPCAAHCIDLI 328 Query: 1596 MEDI-TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVW 1420 +ED E+ + A+ + I I P +FA Y + Sbjct: 329 LEDFEKEMIIHNVTIKNARKLTTYIYNRTMLITMVRKFTNGRDLIRPALTRFATAYLTIG 388 Query: 1419 RIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLG 1240 + LK +L + S +W+ + E+ + + +L FW + L+ P + +L Sbjct: 389 CLNDLKSSLINMFDSNDWKSSRFATTEEGKKMASGILDQRFWKNIGVCLKTAAPLMDVLH 448 Query: 1239 SLNVD-RSVMGDVYNWRVQALEVVRSKRIDDM--VLKQLEVV---LENRW-EMLFSPLHA 1081 ++ D + MG +Y +A++ + + ++ V K E V ++ RW L PLHA Sbjct: 449 LVDSDEKPAMGYIY----EAMDACKKQIQNNFNNVQKCYEPVCKIIDQRWMGQLHRPLHA 504 Query: 1080 AGYILNPR-YFGKG-QAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSL-GE 910 AGY LNP+ +FG + D + G + + + SD R + QL+ + G L G Sbjct: 505 AGYYLNPQIHFGPNFKGNDIDIKNGLFSVISKLVSDAAERSKINSQLADFHFSRGPLFGS 564 Query: 909 EDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEEVNL 730 E A R +M P WWE +G TP+L+ AI+ILS S + + +W ++ N Sbjct: 565 EYAKKARAEMHPGQWWEMYGDYTPELKRFAIRILSLTCSSSGCERNWSAFEMVHTKKRNR 624 Query: 729 LGAERAEDLVFVRNNLRLHSKK 664 L ++ DLV+V N+RL K+ Sbjct: 625 LRQQKMNDLVYVMANMRLTRKE 646