BLASTX nr result
ID: Akebia24_contig00019217
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Akebia24_contig00019217 (2476 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854... 833 0.0 emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera] 832 0.0 ref|XP_002527444.1| protein dimerization, putative [Ricinus comm... 800 0.0 ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citr... 775 0.0 ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615... 773 0.0 ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobrom... 258 1e-65 ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prun... 242 7e-61 ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [A... 239 3e-60 ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]... 238 1e-59 ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805... 232 7e-58 ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627... 231 1e-57 ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307... 225 9e-56 ref|XP_002443069.1| hypothetical protein SORBIDRAFT_08g007560 [S... 224 1e-55 ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri... 223 3e-55 ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222... 223 3e-55 ref|XP_002509591.1| DNA binding protein, putative [Ricinus commu... 223 4e-55 ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660... 222 6e-55 ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, part... 220 2e-54 ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [A... 219 4e-54 ref|XP_003618961.1| hypothetical protein MTR_6g029340 [Medicago ... 219 4e-54 >ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854857 [Vitis vinifera] Length = 635 Score = 833 bits (2152), Expect = 0.0 Identities = 404/597 (67%), Positives = 487/597 (81%) Frame = -3 Query: 2456 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2277 MP+ESDKWGWKHVSVFGGF+ +GTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP Sbjct: 1 MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 60 Query: 2276 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2097 AIDRSLREAF I EEERLARKKK+ SGK+ KRIR+SQ ++T V K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120 Query: 2096 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1917 FFYADGL+FNI+ SPYF +M KAIA+FGPGYEPP+ +KL D FL+KEKA+++KA++ VRE Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180 Query: 1916 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1737 SWP TGCTI C+++L T + NIFVSSPRGL+FL+ +DI GDG D++F +VL+ AI Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240 Query: 1736 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1557 M+V P NVLQ+I + G +S+ SLI SKF H+FWS CT+H I +LMEDIT+LDW+KP V Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300 Query: 1556 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1377 AK I++CIL DP+S KFAP+Y +V RI +LKQAL VV S Sbjct: 301 LCAKEIDECILTYQRSSLCVLTLESS----DPLSTKFAPSYCIVERIFELKQALLGVVVS 356 Query: 1376 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1197 EEW+QWKL EDV ++E A+LG++FW RA +LQ EPFVRLL +L++++SVMGDV+NW Sbjct: 357 EEWKQWKLTIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416 Query: 1196 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQAKDKTVMR 1017 RVQALE V+SK +DD++L QLE+++E++W+MLFSPLHA+GYILNP+YFGKGQ+KDKT+MR Sbjct: 417 RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476 Query: 1016 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQ 837 GWKATLDRYESD RRVLREQLSSYWRL+GS GEEDA+D RDKMDPVAWWENFG ETP Sbjct: 477 GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536 Query: 836 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFVRNNLRLHSKK 666 LQTLAIKILSQ+SSV+ +Q +W DN CQ AVN LG ER EDLVFVRNNLRLHS++ Sbjct: 537 LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERTEDLVFVRNNLRLHSQR 593 >emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera] Length = 635 Score = 832 bits (2150), Expect = 0.0 Identities = 403/597 (67%), Positives = 487/597 (81%) Frame = -3 Query: 2456 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2277 MP+ESDKWGWKHVSVFGGF+ +GTKRWKCNHCN+RYNGSYSRVRAHLLGFTGVGVKSCP Sbjct: 1 MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNIRYNGSYSRVRAHLLGFTGVGVKSCP 60 Query: 2276 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2097 AIDRSLREAF I EEERLARKKK+ SGK+ KRIR+SQ ++T V K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120 Query: 2096 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1917 FFYADGL+FNI+ SPYF +M KAIA+FGPGYEPP+ +KL D FL+KEKA+++KA++ VRE Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180 Query: 1916 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1737 SWP TGCTI C+++L T + NIFVSSPRGL+FL+ +DI GDG D++F +VL+ AI Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240 Query: 1736 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1557 M+V P NVLQ+I + G +S+ SLI SKF H+FWS CT+H I +LMEDIT+LDW+KP V Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300 Query: 1556 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1377 AK I++CIL DP+S KFAP+Y +V RI +LKQAL VV S Sbjct: 301 LCAKEIDECILTYQRSSLCVLTLESS----DPLSTKFAPSYCIVERIFELKQALLGVVVS 356 Query: 1376 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1197 EEW+QWKL EDV ++E A+LG++FW RA +LQ EPFVRLL +L++++SVMGDV+NW Sbjct: 357 EEWKQWKLTIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416 Query: 1196 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQAKDKTVMR 1017 RVQALE V+SK +DD++L QLE+++E++W+MLFSPLHA+GYILNP+YFGKGQ+KDKT+MR Sbjct: 417 RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476 Query: 1016 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQ 837 GWKATLDRYESD RRVLREQLSSYWRL+GS GEEDA+D RDKMDPVAWWENFG ETP Sbjct: 477 GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536 Query: 836 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFVRNNLRLHSKK 666 LQTLAIKILSQ+SSV+ +Q +W DN CQ AVN LG ER EDLVFVRNNLRLHS++ Sbjct: 537 LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERAEDLVFVRNNLRLHSQR 593 >ref|XP_002527444.1| protein dimerization, putative [Ricinus communis] gi|223533179|gb|EEF34936.1| protein dimerization, putative [Ricinus communis] Length = 633 Score = 800 bits (2067), Expect = 0.0 Identities = 392/597 (65%), Positives = 471/597 (78%) Frame = -3 Query: 2456 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2277 MPSESDKWGW+HVSVFGGF+ +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 2276 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2097 AIDRSLREAF I EEERL RKKKK +GK KR R SQ +I+ K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLVRKKKKNSANGKPGKRTRISQASIS--WKTITKEDVDDIVAR 118 Query: 2096 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1917 FFYADGLN +++ SPYFH+M KAI +FG GYE PS+DKL DSFL KEK R++K+++ +RE Sbjct: 119 FFYADGLNIDVVNSPYFHEMVKAIGAFGSGYELPSIDKLSDSFLGKEKGRIEKSLALLRE 178 Query: 1916 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1737 SWP TGCTI C+ +LDG + CF+INIFVSSPRGL+FL+ +D++ D D V L+ AI Sbjct: 179 SWPHTGCTILCVGRLDGAIGCFHINIFVSSPRGLIFLKAVDVDDCDEGDHVLAGALSDAI 238 Query: 1736 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1557 ++VGP+NVLQ+I H G + K S S I SKFPHIFWS CT+H I +LME+I EL+W+KP V Sbjct: 239 LEVGPSNVLQIISHLGDACKSSESYILSKFPHIFWSPCTSHSILMLMEEIAELEWVKPIV 298 Query: 1556 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1377 A+ IEQCI+ D ISAKFAP+Y V RI +L+Q LQEVV S Sbjct: 299 LCARRIEQCIMTYQHATSCIFMQSPKESC-DLISAKFAPSYFFVQRIFELRQTLQEVVVS 357 Query: 1376 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1197 E QWK ++V SIE+A+LG+DFW ++HL+LQL EPF++LLG L++D+SV+G VY+W Sbjct: 358 E---QWKHSIGDNVESIESAILGDDFWSKSHLLLQLYEPFIKLLGLLDIDKSVIGAVYDW 414 Query: 1196 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQAKDKTVMR 1017 RVQALE +RSK IDD +L QLEV++EN+W++LFSPLHA GYILNPRY GK Q KDK+VMR Sbjct: 415 RVQALEALRSKAIDDDILNQLEVLIENKWDVLFSPLHATGYILNPRYIGKFQTKDKSVMR 474 Query: 1016 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQ 837 GWKATL+RYE + ARRVLREQLSSYWRL+GSLG+EDA+D RDKMDPVAWWENFG ETP Sbjct: 475 GWKATLERYEGESTARRVLREQLSSYWRLEGSLGDEDAVDCRDKMDPVAWWENFGFETPS 534 Query: 836 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFVRNNLRLHSKK 666 LQTLAIK+LSQ+SSV Q W N +CQEA N LG +RVEDL+FVRNNLRLH +K Sbjct: 535 LQTLAIKVLSQVSSVALCQEIWQTNDFSCQEAANRLGVQRVEDLLFVRNNLRLHYQK 591 >ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citrus clementina] gi|557526284|gb|ESR37590.1| hypothetical protein CICLE_v10028008mg [Citrus clementina] Length = 636 Score = 775 bits (2002), Expect = 0.0 Identities = 383/599 (63%), Positives = 458/599 (76%) Frame = -3 Query: 2456 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2277 MPSESDKWGW+HVSVFGGFE +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 2276 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2097 AIDRS+RE F I EEER+ARKKK+ K KRIR+ Q +I S KA KEDVD++VAR Sbjct: 61 AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSIVS--KAISKEDVDEMVAR 118 Query: 2096 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1917 FFYA GLN N++ SPYF +M ++IA+FG GY+ PS++ L DSFL+KEK +++K ++ VRE Sbjct: 119 FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178 Query: 1916 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1737 SWP TGCTI C+S LDG L CF IFVSSPRGL+FL+ +D++ D +++F VL+ AI Sbjct: 179 SWPHTGCTILCVSSLDGQLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238 Query: 1736 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1557 +DVGP NVLQ+I H G + K SL+ SKFPHIF S CT I + ME+I L+W+K V Sbjct: 239 LDVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298 Query: 1556 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1377 AK IEQ IL S D +S K AP+Y V RII+LKQ LQE V S Sbjct: 299 LCAKRIEQHILYYQHAYPCLFPHNLKESS-DQVSTKIAPSYCFVQRIIELKQVLQEAVVS 357 Query: 1376 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1197 EE++QWKL P D +E+A+LG+DFWG+AHL LQLCEPFVRLL + ++D+SVMG VY+W Sbjct: 358 EEFKQWKLSMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417 Query: 1196 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQAKDKTVMR 1017 R QALE VR K ID L QLEV+ ENRW+ LFSPLHAAGYILNPRYFG+GQ KDKTVMR Sbjct: 418 RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477 Query: 1016 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQ 837 GWK+TL+RYESD RR+LREQLSSYWRL+GSLGEEDA+D+RDKM+PVAWWENFG E Sbjct: 478 GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537 Query: 836 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFVRNNLRLHSKKLV 660 LQTLAIK+LSQ+SSV Q W DN C+EA N G ER EDL+FVRNNLRLH+++ V Sbjct: 538 LQTLAIKVLSQVSSVAVCQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNV 596 >ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615434 isoform X1 [Citrus sinensis] gi|568863036|ref|XP_006484969.1| PREDICTED: uncharacterized protein LOC102615434 isoform X2 [Citrus sinensis] Length = 636 Score = 773 bits (1995), Expect = 0.0 Identities = 381/599 (63%), Positives = 458/599 (76%) Frame = -3 Query: 2456 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2277 MPSESDKWGW+HVSVFGGFE +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 2276 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2097 AIDRS+RE F I EEER+ARKKK+ K KRIR+ Q +I S KA KEDVD++VAR Sbjct: 61 AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSIVS--KAISKEDVDEMVAR 118 Query: 2096 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1917 FFYA GLN N++ SPYF +M ++IA+FG GY+ PS++ L DSFL+KEK +++K ++ VRE Sbjct: 119 FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178 Query: 1916 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1737 SWP TGCTI C+S LDG L CF IFVSSPRGL+FL+ +D++ D +++F VL+ AI Sbjct: 179 SWPHTGCTILCVSSLDGRLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238 Query: 1736 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1557 ++VGP NVLQ+I H G + K SL+ SKFPHIF S CT I + ME+I L+W+K V Sbjct: 239 LEVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298 Query: 1556 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1377 AK IEQ I+ S D +S K AP+Y V RII+LKQ LQE V S Sbjct: 299 LCAKRIEQHIMYYQHAYPCLFPHNLKESS-DQVSTKIAPSYCFVQRIIELKQVLQEAVVS 357 Query: 1376 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1197 EE++QWKL P D +E+A+LG+DFWG+AHL LQLCEPFVRLL + ++D+SVMG VY+W Sbjct: 358 EEFKQWKLSMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417 Query: 1196 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKGQAKDKTVMR 1017 R QALE VR K ID L QLEV+ ENRW+ LFSPLHAAGYILNPRYFG+GQ KDKTVMR Sbjct: 418 RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477 Query: 1016 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQ 837 GWK+TL+RYESD RR+LREQLSSYWRL+GSLGEEDA+D+RDKM+PVAWWENFG E Sbjct: 478 GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537 Query: 836 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFVRNNLRLHSKKLV 660 LQTLAIK+LSQ+SSV Q W DN C+EA N G ER EDL+FVRNNLRLH+++ V Sbjct: 538 LQTLAIKVLSQVSSVAICQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNV 596 >ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobroma cacao] gi|508784897|gb|EOY32153.1| Uncharacterized protein TCM_039722 [Theobroma cacao] Length = 381 Score = 258 bits (658), Expect = 1e-65 Identities = 127/213 (59%), Positives = 157/213 (73%) Frame = -3 Query: 1403 QALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDR 1224 +ALQ+VV SEEW+QWK +D+ IEA++LG++FW AH+MLQL +PF +LL L++D+ Sbjct: 146 KALQDVVVSEEWKQWKHSILKDILIIEASILGDEFWSNAHMMLQLFKPFAKLLAMLDIDK 205 Query: 1223 SVMGDVYNWRVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHAAGYILNPRYFGKG 1044 SVMG +Y+WRVQALEVVRSK ID+ L QLEV++EN+W +LFS LHAAGYILNP YFGK Sbjct: 206 SVMGAIYDWRVQALEVVRSKEIDETALNQLEVLIENKWNVLFSLLHAAGYILNPGYFGK- 264 Query: 1043 QAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWW 864 AR VLR+QLSSYWRL+GS GEEDA+D RDKMD VAWW Sbjct: 265 -----------------------ARWVLRKQLSSYWRLEGSFGEEDALDCRDKMDLVAWW 301 Query: 863 ENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 765 ENFG ETP LQTLAIK+LSQ+S+++ Q W D Sbjct: 302 ENFGFETPHLQTLAIKVLSQVSTISMCQDIWQD 334 Score = 182 bits (463), Expect = 5e-43 Identities = 101/201 (50%), Positives = 118/201 (58%) Frame = -3 Query: 2456 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2277 M SE DKWGW+HV+VFG F+ +GTKRWKCNHCNLRYNGSYSRVRAHLL F+GVGVKSC Sbjct: 1 MASEFDKWGWEHVTVFGVFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLRFSGVGVKSCL 60 Query: 2276 AIDRSLREAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2097 AI+R+LREAFHI EEERLAR KK T G Sbjct: 61 AINRTLREAFHILEEERLAR--KKKRTFGSGKP--------------------------- 91 Query: 2096 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1917 +FG GYEPPS+DKL D FL+KEK R++K+++ VRE Sbjct: 92 -------------------------TFGCGYEPPSMDKLSDCFLSKEKGRIEKSITLVRE 126 Query: 1916 SWPLTGCTIFCLSQLDGTLSC 1854 SWP TG T+ C+ G L C Sbjct: 127 SWPHTGYTVLCV----GCLGC 143 >ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica] gi|462411014|gb|EMJ16063.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica] Length = 805 Score = 242 bits (617), Expect = 7e-61 Identities = 182/648 (28%), Positives = 296/648 (45%), Gaps = 60/648 (9%) Frame = -3 Query: 2429 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRS-LRE 2253 WK+V G ++CN+C + GSY RV++HLL G GV SC + S L E Sbjct: 128 WKYVKKLEKDGKAGGNTSFQCNYCQKTFKGSYFRVKSHLLKLKGNGVASCTKVTNSHLME 187 Query: 2252 AFHIQEEERLARKKKKI-----PTSGKSSKRIRSSQLAITS-------------VGKAFG 2127 + EE L K ++ PTS SS+ SS L ++S + KAF Sbjct: 188 MEKVVEEAELRVKMAQLRDVPLPTSNTSSQGGSSSGLGMSSNWCSDSKKRKGNPIEKAFN 247 Query: 2126 ---KEDVDDVVARFFYADGLNFNIIKSPYFHDMAK-AIASFGPGYEPPSVDKLLDSFLTK 1959 +E +D +AR FY GL+F ++P++ + + A + PGY+PP + L + L K Sbjct: 248 NNLREQLDGEIARMFYTGGLSFQFSRNPHYVNAFRIACSKTLPGYQPPGYNMLRTTLLQK 307 Query: 1958 EKARMDKAVSPVRESW------PLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTI 1797 EK +++ VS + W PL IN+ G +FL+ I Sbjct: 308 EKNNIEEWVSVCSDGWSDAQRRPL-------------------INVMAICESGPMFLKAI 348 Query: 1796 DIEKGDGEDDVF-TEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCT 1620 + E G+ +D F +L ++I ++GP NV+QV+ K +G ++ +KF HIFW+ C Sbjct: 349 NCE-GECKDKFFMANLLIESIREIGPQNVVQVVTDNAPVCKAAGHIVEAKFKHIFWTPCV 407 Query: 1619 AHCIQLLMEDI-----------TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXX 1473 H + L +++I + W+ S A I+ I+ Sbjct: 408 VHTLNLALKNICSPVPRNPEVYEQCSWISTISSDAWFIKNFIM-NHNMRLSMYNDHCKLK 466 Query: 1472 SIDPISAKFAPTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWG 1293 + +FA T M+ R ++KQ L+++V SE+W +K +++ +L FW Sbjct: 467 LLSVAETRFASTIVMLRRFKQVKQGLEQMVISEQWDIYKEDDVVKARTVKEKILDECFWE 526 Query: 1292 RAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNWRVQALEVVRS-------KRIDD--MVLK 1140 +L P +L + D + +Y W +E V++ K++++ M Sbjct: 527 DIDYILNFTSPIYEMLRLSDTDMPCLHLIYEWWDSMIEKVKTIIYRKERKQLNEESMFFN 586 Query: 1139 QLEVVLENRWEMLFSPLHAAGYILNPRYFGK----------GQAKDKTVMRGWKATLDRY 990 + +L +RW +PLH + LNP+Y+ K KD + R K ++R+ Sbjct: 587 VVHEILVDRWTKSSTPLHCFAHSLNPKYYCKEWLDMAHNRCPPHKDIEITRERKQCIERF 646 Query: 989 ESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKIL 810 S+ + RR + E+ +S+ D+M R M PV WW G+ TP+LQT+A+K+L Sbjct: 647 FSNEVERRAVNEEYASFSACIEDFSGMDSMKDRGFMAPVKWWVIHGASTPKLQTIALKLL 706 Query: 809 SQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFVRNNLRLHSKK 666 SS + + +W N + ER EDLVFV +NLRL S+K Sbjct: 707 GHPSSSSCCERNWSTYNFIHSIKRNKITPERAEDLVFVHSNLRLLSRK 754 >ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda] gi|548843859|gb|ERN03513.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda] Length = 732 Score = 239 bits (611), Expect = 3e-60 Identities = 168/633 (26%), Positives = 280/633 (44%), Gaps = 45/633 (7%) Frame = -3 Query: 2429 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREA 2250 W ++ G T G +C C + GSY+RV++HLLG G GVK C ID Sbjct: 35 WAYMEKIGRCHTGGGNWMLRCVLCKAEFKGSYTRVKSHLLGKVGTGVKRCLGIDNETLAT 94 Query: 2249 FHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITSVGKAFG---------KEDVDDVVAR 2097 +E RK + S ++ S + + A K+ +D ++AR Sbjct: 95 LLRLNDEGSTRKIRSSSRSSVPLLKVNSGSIGLKKRRGANDLVKLLDLAPKDVLDRMIAR 154 Query: 2096 FFYADGLNFNIIKSPYFHDMAK-AIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVR 1920 FYA G++ N+I+SPYF DM + A + GY P+ D L S L EKA ++++V P R Sbjct: 155 CFYASGISLNLIRSPYFRDMIRYACENSLEGYVLPTFDNLRTSLLDAEKANIEQSVKPFR 214 Query: 1919 ESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKA 1740 SW G ++ D T IN +S G +FL+ ID D + + Sbjct: 215 SSWGSRGVSLLTDGWTDTTAKRPLINFMAASDIGSIFLKAIDSSVEMMNTDYMKNLFLEM 274 Query: 1739 IMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD----- 1575 + +VGP +V+Q+I +++G + P+IFW+ C H + L +++I D Sbjct: 275 VAEVGPTSVVQIITDNSPICRVAGQRVEGMHPYIFWTPCVIHTLNLALKNICSPDDERKA 334 Query: 1574 -------WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRI 1416 W++ K I ++ + ++FA T +V RI Sbjct: 335 EKYLHCQWIRDLDRDVKMIRSFVV-DHNAVLTIYSQYPTLRLLSVTESRFASTVIIVKRI 393 Query: 1415 IKLKQALQEVVGSEEWRQWKLMYPEDVPS---IEAAVLGNDFWGRAHLMLQLCEPFVRLL 1245 ++K AL +V WK++ ED +++ ++ + +W + ++ EP + +L Sbjct: 394 KEVKPALCRMVVDS---YWKVLVEEDAEKARRVKSCLVDDLWWEKIEFLIAFTEPILAML 450 Query: 1244 GSLNVDRSVMGDVYNWRVQALEVVRS-------KRI---DDMVLKQLEVVLENRWEMLFS 1095 +++ D + +VY+ +E VR K I + + + +L W + Sbjct: 451 RAIDTDEPTLHEVYDMWATMIEEVRGIIFRNEGKNIFLNESSFYEDIHRILVGSWNKSKT 510 Query: 1094 PLHAAGYILNPRYFGK---GQA-------KDKTVMRGWKATLDRYESDGMARRVLREQLS 945 PL + LNP+Y+ G+ KD+ V G R + + E+ Sbjct: 511 PLQCLAHSLNPKYYSDEWLGEVPSRLPPHKDREVSDGRNVCFARLFPAPSELQKVHEEFE 570 Query: 944 SYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 765 + G G D M R M P++WWENFG+ P+L LA ++LSQ SS + + +W Sbjct: 571 MFSMCKGHFGHWDVMSSRFSMSPISWWENFGAHVPRLAKLADRLLSQPSSSSCCERNWGT 630 Query: 764 NGSTCQEAVNLLGAERVEDLVFVRNNLRLHSKK 666 + N L ++R EDLV+V +NLRL S++ Sbjct: 631 FSLIKKIKQNRLASQRAEDLVYVHSNLRLLSRR 663 >ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao] gi|508777206|gb|EOY24462.1| HAT transposon superfamily [Theobroma cacao] Length = 674 Score = 238 bits (606), Expect = 1e-59 Identities = 166/607 (27%), Positives = 276/607 (45%), Gaps = 40/607 (6%) Frame = -3 Query: 2372 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLARKKKKIPTS 2193 +CN+C+ ++G R++ HL + C + +R+ HIQ + KK+K P Sbjct: 23 RCNYCHREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRD--HIQTILN-SPKKQKTPKK 79 Query: 2192 GKSSKRIRSSQLAITSV------------------------------------GKAFGKE 2121 K K + + Q +S G+ +E Sbjct: 80 PKVDKAVANDQQNSSSASGGLHLNHGSSGQHGSTCPSLLFPRPSPSEQPAVDDGQKQKQE 139 Query: 2120 DVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMD 1941 D D +A FF+ + + F+ KS Y+ +M AIA G GY+ PS + L + L K K + Sbjct: 140 DADKKIAVFFFHNSIPFSAAKSMYYQEMVDAIAKCGVGYKAPSYENLRSTLLEKVKGDIH 199 Query: 1940 KAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVF 1761 R+ W TGCTI C S DG F I V+ P+G LFL+++D+ + + Sbjct: 200 DCYKKYRDEWKETGCTILCDSWSDGRTKSFVI-FSVTCPKGTLFLKSVDVSGHEDDASYL 258 Query: 1760 TEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITE 1581 E+L +++VG NV+QVI S +G L+ +K+ +FWS C ++CI ++EDI++ Sbjct: 259 FELLESVVLEVGLENVIQVITDTAASYVYAGRLLMAKYSSLFWSPCASYCINKMLEDISK 318 Query: 1580 LDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQ 1401 +W+ + AK I Q I + + P +F Y + II + Sbjct: 319 QEWVGIVLEEAKSIVQYIYSHAWIVNMMRKFTGGRELMRPRITRFVANYLTLRSIIIQED 378 Query: 1400 ALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRS 1221 L+ + EW D +I++ + FW AH + + EP V++L ++ D Sbjct: 379 NLKHMFSHSEWLSSIYSRRSDAQAIKSLLYLERFWKSAHEAVSVSEPLVKILRIVDGDMP 438 Query: 1220 VMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYFG 1050 MG +Y +A +++ K +++ + +++ + RW M L SPLHAA LNP F Sbjct: 439 AMGYIYEGIERAKVAIKAYYKGLEEKYMPIWDII-DRRWNMQLHSPLHAAAAFLNPSIFY 497 Query: 1049 KGQAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPV 873 K D + G++ + + + + + ++ Y G+LG + A+ R P Sbjct: 498 NPNFKIDLRMRNGFQEAMLKLATTDKDKIEITKEHPMYINAQGALGTDFAIMGRTLNAPG 557 Query: 872 AWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFVR 693 WW ++G E P LQ +AI+ILSQ S + +W S + N + E+ DLVFV Sbjct: 558 DWWASYGYEIPTLQRVAIRILSQPCSSHWCRWNWSTFESIHTKKRNKVELEKFNDLVFVH 617 Query: 692 NNLRLHS 672 NL L + Sbjct: 618 CNLCLQA 624 >ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805582 isoform X1 [Glycine max] gi|571487050|ref|XP_006590550.1| PREDICTED: uncharacterized protein LOC100805582 isoform X2 [Glycine max] Length = 675 Score = 232 bits (591), Expect = 7e-58 Identities = 160/608 (26%), Positives = 272/608 (44%), Gaps = 41/608 (6%) Frame = -3 Query: 2372 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLARKKKKIPTS 2193 +CN+C ++G R++ HL + C + +R+ HIQ A KK K P Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQSILS-APKKPKTPKK 79 Query: 2192 GKSSKR-IRSSQLAITSVGKAFG------------------------------------K 2124 K+ + + + Q +S F + Sbjct: 80 QKTDQATVANGQQNSSSASGGFHHNHGYSGQNGSACPSLLFPNPSPSAQPLEHDAQKQKQ 139 Query: 2123 EDVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARM 1944 +D D +A FF+ + + F+ KS Y+ +M A+A G GY+ PS +KL + L K KA + Sbjct: 140 DDADRKLAIFFFHNSIPFSAAKSIYYQEMVDAVAQCGVGYKAPSYEKLRSTLLEKVKADI 199 Query: 1943 DKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDV 1764 R+ W TGCT+ C + DG + V+ P+G LFL+++D+ + + Sbjct: 200 HSDYKKYRDEWKETGCTVLCDNWSDGRTGSLAV-FSVACPKGTLFLKSVDVSGHENDSTY 258 Query: 1763 FTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDIT 1584 E+L +++VG NV+QVI S +G L+ +++ +FWS C A+CI ++EDI Sbjct: 259 LFELLESVVLEVGAENVVQVITDASASYVCAGRLLIARYSFLFWSPCVAYCIDKMLEDIG 318 Query: 1583 ELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLK 1404 DW+ + AK I Q I + I P +F + + I+ + Sbjct: 319 RQDWVGTVLEEAKTITQYIYSHAWILNIMRKFTGGKELIRPKITRFVTNFLSLKSIVMQE 378 Query: 1403 QALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDR 1224 ++ + EW D +I + + + FW AH + + EP V+ L ++ D Sbjct: 379 DNIKHMFSHSEWLSSIYRRRPDAQAINSLLYSDRFWKYAHEAVSVSEPLVKCLRMVDGDM 438 Query: 1223 SVMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRY- 1056 MG VY +A +++ K I++ + +++ + RW M + S LHAA LNP Sbjct: 439 PAMGYVYEGIERAKVAIKAYYKGIEEKYIPIWDII-DRRWNMQIHSSLHAAAAFLNPSIS 497 Query: 1055 FGKGQAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDP 876 + KD + G++ + R + + ++L +Y G+LG + A+ R P Sbjct: 498 YNPNFKKDLRMRNGFQEAMLRLAITDKDKMEITKELPTYINAQGALGTDFAVLGRTLNAP 557 Query: 875 VAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFV 696 WW ++G E P LQ A++ILSQ S ++ +W S N + E+ +LVFV Sbjct: 558 GDWWASYGYEIPTLQKAAVRILSQPCSSLWYRWNWSTFESIHNRKRNRVELEKFSELVFV 617 Query: 695 RNNLRLHS 672 +NL L + Sbjct: 618 HSNLWLQT 625 >ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627361 [Citrus sinensis] Length = 674 Score = 231 bits (589), Expect = 1e-57 Identities = 165/608 (27%), Positives = 275/608 (45%), Gaps = 41/608 (6%) Frame = -3 Query: 2372 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLARKKKKIPTS 2193 +CN+C ++G R++ HL + C + +R+ HIQ + +K+K P Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPDDVRD--HIQRILSIPKKQKN-PKR 79 Query: 2192 GKSSKRIRSSQLAITSVGKAFGK------------------------------------E 2121 K K + Q +S + + Sbjct: 80 PKVEKATANGQQNSSSASGGIHQNNRSSGQHGSSCPSLLFRHPSPSIQPIVDDTQKQRQD 139 Query: 2120 DVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMD 1941 D D +A FF+ + + F+ KS Y+ +M AIA G GY PS +KL + L K K +D Sbjct: 140 DTDKKIAVFFFHNSIPFSAAKSMYYQEMVNAIAECGVGYIAPSYEKLRSTLLEKVKVDID 199 Query: 1940 KAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVF 1761 RE W TGCTI C + D + V+ P+G LFL+++D+ G ED F Sbjct: 200 DCCKKYREEWKETGCTILCDNWSDERTKSLVV-FSVACPKGTLFLKSVDVS-GHEEDATF 257 Query: 1760 T-EVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDIT 1584 E+L ++DVG NV+QVI +G L+ +K+ +FWS C A+CI ++EDI+ Sbjct: 258 LFELLESVVLDVGVENVIQVITDSAACYVYAGRLLMTKYSSLFWSPCAAYCIDKMLEDIS 317 Query: 1583 ELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLK 1404 + +W+ + AK I + + I P +F Y + I+ + Sbjct: 318 KQEWVAMVLEEAKTITKYFYSHAWTLNMMRKLTGGRELIRPRITRFVANYLSLRSIVIHE 377 Query: 1403 QALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDR 1224 + L+ + EW D +I++ + + FW AH ++ + EP V++L ++ D Sbjct: 378 ENLKHMFSHSEWLSSIYSRRPDAQAIKSLLYLDRFWRSAHEVVSVSEPLVKILRIVDGDM 437 Query: 1223 SVMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYF 1053 MG +Y +A +++ K +++ + +++ + RW M L SPLHAA LNP F Sbjct: 438 PAMGYMYEGIERAKLAIQAYYKGVEEKYVPIWDII-DRRWNMQLHSPLHAAAAFLNPSIF 496 Query: 1052 GKGQAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDP 876 K D + G++ + + + + + ++ Y G+LG + A+ R P Sbjct: 497 YNPNFKIDLRMRNGFQEAMIKLATADKDKIEITKEHPVYINAQGALGTDFAVLGRKLNAP 556 Query: 875 VAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFV 696 WW ++G E P LQ AI+ILSQ S ++ +W S + N + E+ DL+FV Sbjct: 557 GDWWASYGYEIPTLQRAAIRILSQPCSSYWYRWNWSTFESIHNKKRNKVEMEKFNDLLFV 616 Query: 695 RNNLRLHS 672 NLRL + Sbjct: 617 HCNLRLQA 624 >ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307174 [Fragaria vesca subsp. vesca] Length = 719 Score = 225 bits (573), Expect = 9e-56 Identities = 157/645 (24%), Positives = 286/645 (44%), Gaps = 57/645 (8%) Frame = -3 Query: 2429 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLR-- 2256 WK+V++ G + G + CN C + GS+SRV++HLL G GVK P I R Sbjct: 25 WKYVTITSGSDKSGGNVAFTCNFCGGKLTGSHSRVKSHLLRIKGTGVKIYPTITRDQTVE 84 Query: 2255 ---------EAFHIQEEERLARKKKKIPTSGKSSKRIRSSQLAITS-------VGKAFGK 2124 + + + + ++A + SG S +R + + + KAF + Sbjct: 85 LQALLDHCDQQLNAKAQHKVALPPSSMTGSGISYFPLREREDEVKKRRGLSPQLSKAFRQ 144 Query: 2123 ED---VDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEK 1953 ED D VAR FY+ GL FN+ ++P + + + ++AS PGY PP + L + L EK Sbjct: 145 EDRRECDASVARLFYSSGLAFNVARNPNYRE-SYSLASKIPGYVPPGYNALRTTLLDNEK 203 Query: 1952 ARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGE 1773 +++ + P++++W TG ++ DG IN+ ++ G + L+ I+ E Sbjct: 204 RHIERTLLPIKKTWKETGVSLCSDGWTDGQKRPL-INMMAAAKDGAMMLKAINCEGVTKS 262 Query: 1772 DDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLME 1593 + +L ++I ++GP NV+QV+ S +G+++ PHIFW+ C H + L ++ Sbjct: 263 KEEIGRLLLESINEIGPENVVQVVTDNAPVSAAAGAIVEITHPHIFWTPCVVHTLNLALK 322 Query: 1592 D-------------ITELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISA 1452 D + EL W+ + I+ ++ + Sbjct: 323 DLLKAKSYLPGETVVEELGWLMEVYNDVWFIKNFVV-NHNMRLAMYHEHCALRLLQVAPT 381 Query: 1451 KFAPTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQ 1272 +FA + ++ R +K LQ++V S+ W +K ++ +L FW + ++ Sbjct: 382 RFASHFIVLKRFRDVKSGLQQMVISQRWDLYKEDDASKARVVKEMLLKEKFWEQIDFLIA 441 Query: 1271 LCEPFVRLLGSLNVDRSVMGDVYNWRVQALEVVRSKRIDD----MVLKQLEV-------- 1128 L P ++ ++DR + VY W +E V+ + ++ + +V Sbjct: 442 LMGPIYEMIRMSDMDRPCLHLVYEWWNSMIEKVKKAVFNPEFVHVITEHCDVTRFYDVVY 501 Query: 1127 -VLENRWEMLFSPLHAAGYILNPRYFGKGQA----------KDKTVMRGWKATLDRYESD 981 +L RW +PLH + LNP+Y+ +D + + + D Sbjct: 502 PILTARWTKSCTPLHCLAHSLNPKYYSSQWLEEDPNRVPPHRDAELNNERRRCFQKLFPD 561 Query: 980 GMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQI 801 R + E+ + + G DA++ + +P+ WW ++G TP LQ+LA+K+L+Q Sbjct: 562 SQTRNKVMEEFARFSLNMGDFSSSDALENKFCFEPLTWWVSYGPSTPLLQSLALKLLNQP 621 Query: 800 SSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFVRNNLRLHSKK 666 S + + +W N L R +DLV+V NLRL ++K Sbjct: 622 CSSSCCERNWSTYAFIQGLKRNKLQPRRAQDLVYVHTNLRLLARK 666 >ref|XP_002443069.1| hypothetical protein SORBIDRAFT_08g007560 [Sorghum bicolor] gi|241943762|gb|EES16907.1| hypothetical protein SORBIDRAFT_08g007560 [Sorghum bicolor] Length = 713 Score = 224 bits (571), Expect = 1e-55 Identities = 163/633 (25%), Positives = 284/633 (44%), Gaps = 45/633 (7%) Frame = -3 Query: 2429 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSL--- 2259 W HV + G W+C + L Y GSYSR+++HLL +G G+K C A+D+ + Sbjct: 23 WNHVVLLEK-AAAGGNAVWRCKYYKLEYKGSYSRIKSHLLRISGGGIKICTAVDKFILAQ 81 Query: 2258 ---REAFHIQEEERLARKKKKIPTSG-KSSKRIRSSQLAITSVGKAFGKE---DVDDVVA 2100 A E ER K +P +S +R+ + +++ KAF E +D ++ Sbjct: 82 LKSEVAEAADEIERSKAKVIPLPVENVDASNSMRNKRQRSSALEKAFDMETRNQLDAIIG 141 Query: 2099 RFFYADGLNFNIIKSPYFHDMAKAIASFG-PGYEPPSVDKLLDSFLTKEKARMDKAVSPV 1923 R FY+ G++FNI ++PY+ + + AS GY PPS +KL + L +E+A ++ + + Sbjct: 142 RLFYSGGVSFNIARNPYYRESYRFAASHNLDGYVPPSYNKLRTTLLKQERAHVESLLDRM 201 Query: 1922 RESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTK 1743 + W G TI C + IN +FL+ ID + E L + Sbjct: 202 KSVWAEKGVTI-CSDGWSDSQRRPLINFIAVCKGKPMFLRAIDASGEEKTKFFIAEKLIQ 260 Query: 1742 AIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDIT------- 1584 + +VGP NV+Q+I + K +G ++ K+ +IFW+ C H + L +++I Sbjct: 261 VVEEVGPKNVVQIITDNAANCKGAGLIVQQKYDNIFWTPCIVHTLNLALKNICAAKLPRT 320 Query: 1583 --------ELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNM 1428 EL W+ A I+ I+ + +FA M Sbjct: 321 EEQEIVYDELHWITLVAGDANMIKNYIM-NHSMRLSMFNEFSKLKLLAVAETRFASVVVM 379 Query: 1427 VWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRL 1248 + R + +K+ALQ +V S+ W +K + +L + +W ++ +P + Sbjct: 380 LTRFLMVKRALQRMVISDAWESYKDDNAGTAKHVREKILCSKWWDNVQYIVDFTDPIYEM 439 Query: 1247 LGSLNVDRSVMGDVYN-W-----RVQALEVVRSKRIDD---MVLKQLEVVLENRWEMLFS 1095 L + DR + +Y W +V+ + + K+ +D ++ +L +RW + Sbjct: 440 LRMADTDRPCLHLIYEMWDTMIAKVKKVVYTKEKKNNDEQSTFFSTVQDILLDRWTKSNT 499 Query: 1094 PLHAAGYILNPRYF-------GKGQA---KDKTVMRGWKATLDRYESDGMARRVLREQLS 945 PL + LNPRY+ +G+ KD + ++ G ++++ S Sbjct: 500 PLICLAHSLNPRYYHEKWISENEGREPPHKDLEISVQRMKCFRKFFPVGKDLNQVKDEYS 559 Query: 944 SYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 765 + L + D++ R +DP+ WW N G P LQ LA+K+L+Q +S ++ + +W Sbjct: 560 RFATCSEELNDFDSIYDRWILDPLKWWANHGQSIPMLQKLALKLLNQPASSSSCERNWST 619 Query: 764 NGSTCQEAVNLLGAERVEDLVFVRNNLRLHSKK 666 N L E EDLVF+ NNLRL ++K Sbjct: 620 YSFVHSMLRNKLAPECAEDLVFIHNNLRLLARK 652 >ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101222344 [Cucumis sativus] Length = 673 Score = 223 bits (568), Expect = 3e-55 Identities = 159/603 (26%), Positives = 277/603 (45%), Gaps = 38/603 (6%) Frame = -3 Query: 2372 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQ-----EEERLARKKK 2208 +CN+C ++G R++ HL + C + +R+ HIQ +++ A KK Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQGILSTPKKQKAPKKP 80 Query: 2207 KIP----TSGKSSKRIRSSQLAITSVGKAFG------------------------KEDVD 2112 K+ T+G+ S + S G+ K++ D Sbjct: 81 KVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDETD 140 Query: 2111 DVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAV 1932 VA FF+ + + F+ KS Y+ +M AIA +G GY+ PS +KL + L K K + + Sbjct: 141 KKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY 200 Query: 1931 SPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEV 1752 R+ W TGCTI C S DG F + I V+ +G LFL+++DI + + +++ Sbjct: 201 KKHRDEWKETGCTILCDSWSDGQTKSFLV-ISVTCSKGTLFLKSVDISGHEDDATYLSDL 259 Query: 1751 LTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDW 1572 L I++VG NV+Q+I S +G L+ +K+ +FWS C ++C+ ++EDI++++W Sbjct: 260 LETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEW 319 Query: 1571 MKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQ 1392 + + AK I + I + I P +F + + I+ L+ L+ Sbjct: 320 VSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLK 379 Query: 1391 EVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMG 1212 + EW D +I + + + FW AH + +CEP +R+L ++ D MG Sbjct: 380 HMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMG 439 Query: 1211 DVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYFGKGQ 1041 ++ +A +++ +D + E + + RW + L + LH A LNP F Sbjct: 440 YIFEGIERAKVEIKTYYNGFEDKYMPIWETI-DRRWNLQLHTTLHTAAAFLNPSXFYNPN 498 Query: 1040 AK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWW 864 K D + G++ + + + + + + +Y G+LG + A+ R P WW Sbjct: 499 FKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWW 558 Query: 863 ENFGSETPQLQTLAIKILSQISSVTTFQG-SWHDNGSTCQEAVNLLGAERVEDLVFVRNN 687 +G E P LQ A++ILSQ S G +W + + + E++ DLVFV+ N Sbjct: 559 SGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCN 618 Query: 686 LRL 678 L L Sbjct: 619 LWL 621 >ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222344 [Cucumis sativus] Length = 673 Score = 223 bits (568), Expect = 3e-55 Identities = 159/603 (26%), Positives = 277/603 (45%), Gaps = 38/603 (6%) Frame = -3 Query: 2372 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQ-----EEERLARKKK 2208 +CN+C ++G R++ HL + C + +R+ HIQ +++ A KK Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQGILSTPKKQKAPKKP 80 Query: 2207 KIP----TSGKSSKRIRSSQLAITSVGKAFG------------------------KEDVD 2112 K+ T+G+ S + S G+ K++ D Sbjct: 81 KVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTYPCLSPSAQPPIDDAQKQKKDETD 140 Query: 2111 DVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAV 1932 VA FF+ + + F+ KS Y+ +M AIA +G GY+ PS +KL + L K K + + Sbjct: 141 KKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSSY 200 Query: 1931 SPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEV 1752 R+ W TGCTI C S DG F + I V+ +G LFL+++DI + + +++ Sbjct: 201 KKHRDEWKETGCTILCDSWSDGQTKSFLV-ISVTCSKGTLFLKSVDISGHEDDATYLSDL 259 Query: 1751 LTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDW 1572 L I++VG NV+Q+I S +G L+ +K+ +FWS C ++C+ ++EDI++++W Sbjct: 260 LETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIEW 319 Query: 1571 MKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQ 1392 + + AK I + I + I P +F + + I+ L+ L+ Sbjct: 320 VSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNLK 379 Query: 1391 EVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMG 1212 + EW D +I + + + FW AH + +CEP +R+L ++ D MG Sbjct: 380 HMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAMG 439 Query: 1211 DVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYFGKGQ 1041 ++ +A +++ +D + E + + RW + L + LH A LNP F Sbjct: 440 YIFEGIERAKVEIKTYYNGFEDKYMPIWETI-DRRWNLQLHTTLHTAAAFLNPSVFYNPN 498 Query: 1040 AK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWW 864 K D + G++ + + + + + + +Y G+LG + A+ R P WW Sbjct: 499 FKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDWW 558 Query: 863 ENFGSETPQLQTLAIKILSQISSVTTFQG-SWHDNGSTCQEAVNLLGAERVEDLVFVRNN 687 +G E P LQ A++ILSQ S G +W + + + E++ DLVFV+ N Sbjct: 559 SGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQCN 618 Query: 686 LRL 678 L L Sbjct: 619 LWL 621 >ref|XP_002509591.1| DNA binding protein, putative [Ricinus communis] gi|223549490|gb|EEF50978.1| DNA binding protein, putative [Ricinus communis] Length = 670 Score = 223 bits (567), Expect = 4e-55 Identities = 159/605 (26%), Positives = 273/605 (45%), Gaps = 38/605 (6%) Frame = -3 Query: 2372 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQ------EEERLARKK 2211 +CN+CN ++G R++ HL + C + +R HIQ ++++ +K+ Sbjct: 23 RCNYCNREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRN--HIQSILSTPKKQKTPKKQ 80 Query: 2210 K-----------KIPTSGKSSKRIRSSQLAITSVGKAFGK-----------------EDV 2115 K + G R S Q T F + + Sbjct: 81 KTDQAENGQDNSSSASGGVHPNRGSSGQHGSTCPSLLFSRPLPTSQPVVDDAQNEKQNNA 140 Query: 2114 DDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKA 1935 D +A FF+ + + F+ KS Y+ +M A+A G GY+ PS +KL S L K K + Sbjct: 141 DKRIAVFFFHNSIAFSAAKSIYYQEMFDAVAECGQGYKAPSFEKLRSSLLEKVKGDIHDW 200 Query: 1934 VSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTE 1755 R+ W TGCTI C DG I V+ P+G LFL+++DI + + + E Sbjct: 201 YRKYRDDWKETGCTILCDGWSDGRTKSV-IVFSVTCPKGTLFLKSVDISGHENDANYLFE 259 Query: 1754 VLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD 1575 +L +++VG NV+QVI S +G L+ +K+ +FWS C ++C+ ++EDI++ + Sbjct: 260 LLESILLEVGVENVIQVITDSTASYVYAGRLLMAKYSSLFWSPCASYCVNKMLEDISKQE 319 Query: 1574 WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQAL 1395 W+ + A I + I + I P ++ Y + I+ + L Sbjct: 320 WVGTVMEEANTITKYIYSHAWTLNMMRRFTGGRELIRPRITRYVSNYLSLRAIVIQEDNL 379 Query: 1394 QEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVM 1215 + + EW D +++ + + FW AH + + EP +++L ++ D M Sbjct: 380 KHMFSHSEWLSSMHSRRPDAQIVKSFLSQDRFWKFAHEAVSISEPLIKILRIVDGDMPAM 439 Query: 1214 GDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYFGKG 1044 G +Y +A +++ K I+D + E++ + RW + L SPLHAA LNP F Sbjct: 440 GYIYEVLERAKVSIKAYYKGIEDKYMPIWEII-DRRWNIQLHSPLHAAAAFLNPSIFYNQ 498 Query: 1043 QAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAW 867 K D + G++ + + + + + + ++ Y G+LG + A+ R P W Sbjct: 499 NFKIDLRMRNGFQEAMIKMATSDIDKIEITKEHPIYINGQGALGTDFAIMGRTLNSPGDW 558 Query: 866 WENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFVRNN 687 W +G E P LQ +AI++LSQ S + +W S + N E++ DLVFV N Sbjct: 559 WAGYGYEIPTLQRVAIRLLSQPCSSHWCRWNWSTFESIHTKKRNKAELEKLNDLVFVHCN 618 Query: 686 LRLHS 672 L L + Sbjct: 619 LWLQA 623 >ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660479 [Glycine max] Length = 765 Score = 222 bits (566), Expect = 6e-55 Identities = 167/634 (26%), Positives = 278/634 (43%), Gaps = 46/634 (7%) Frame = -3 Query: 2429 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAI-DRSLRE 2253 W V++ G + W CN C SYSRV+AHLL G G+ +CP + D L Sbjct: 21 WSFVTIKEKIGDGGGNRLWSCNFCEKVVKSSYSRVKAHLLRICGSGIDTCPKVTDAYLVY 80 Query: 2252 AFHIQEEERLARKKKKIPTSGKSS---------KRIRSSQLAITSVGKAFGKEDVDDV-- 2106 + EE K K +P KR +SS ++ AF ED + + Sbjct: 81 LRRVCEEAESILKSKNVPLPTDKRTPTPPTLPPKRRKSS-----NIESAFNIEDRNHLRA 135 Query: 2105 -VARFFYADGLNFNIIKSPYFHDMAKAIASFG-PGYEPPSVDKLLDSFLTKEKARMDKAV 1932 +AR FY+ L+F++ ++PYF A+ G+ PPS + L S L +E++ +++ + Sbjct: 136 EIARMFYSASLSFHLARNPYFVSSYSFAANCNLSGFLPPSYNALRTSLLQQERSYIERLL 195 Query: 1931 SPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEV 1752 P++ W L G T+ D + IN S G +FL+ ID K + ++ Sbjct: 196 QPIKSLWSLKGVTLVVDGWTDAQIRPL-INFMAISEEGPMFLKAIDGSKEYKDKHYMFDL 254 Query: 1751 LTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDI----- 1587 L I +VGP +V+QVI K +G LI +FPHIFW+ C H + L +++I Sbjct: 255 LKDVIKEVGPQSVVQVITDNAYVCKAAGLLIEVEFPHIFWTPCVVHTLNLGVKNICAAKN 314 Query: 1586 --------TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYN 1431 E W+ + A I+ I+ + +FA Sbjct: 315 VDGNENVFNEGGWIAEVIGDASFIKVFIMT-HSMRLAIFNEFSSLKLLSIAETRFASMIV 373 Query: 1430 MVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVR 1251 M+ R+ LK+ LQ +V S++W ++ ++ +L + +W + +L +P Sbjct: 374 MLKRLKLLKRCLQNMVISDQWNSYREDDVRKAAHVKELILNDIWWDKVDYILSFMDPIYS 433 Query: 1250 LLGSLNVDRSVMGDVYNWRVQALEVVRSK--RIDDMVLKQLEV-------VLENRWEMLF 1098 ++ + + S + VY +E V++ R D+++ ++ +L +RW Sbjct: 434 MIRICDTNASNLHLVYEMWDSMIEKVKTTIYRHDEVLENEVSTFFEVIHEILNSRWSKSC 493 Query: 1097 SPLHAAGYILNPRYFGKG----------QAKDKTVMRGWKATLDRYESDGMARRVLREQL 948 +PLH + LNPRY+ +D + L RY + R + E+ Sbjct: 494 NPLHCLAHSLNPRYYSDNWLNEVPNRVPPHRDDELSSQRNKCLKRYFPNVNVRTKVYEEF 553 Query: 947 SSYWRLDGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWH 768 S + G G D ++ R +D WW GS TP LQ +A+K+L Q S + + +W Sbjct: 554 SKFSSCAGDFGSFDIIEDRWALDSKTWWVMHGSSTPILQKVALKLLVQPCSSSCCERNWS 613 Query: 767 DNGSTCQEAVNLLGAERVEDLVFVRNNLRLHSKK 666 N + ++ +DLVFV +NLRL S+K Sbjct: 614 TYSFIHSLKRNKMDPKKAKDLVFVHSNLRLLSRK 647 >ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris] gi|561034735|gb|ESW33265.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris] Length = 702 Score = 220 bits (561), Expect = 2e-54 Identities = 168/629 (26%), Positives = 275/629 (43%), Gaps = 32/629 (5%) Frame = -3 Query: 2453 PSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPA 2274 P GWKH G + K+ KC++C+ +G R + HL G T + C + Sbjct: 17 PGNRTDVGWKH-----GIDINGNGKKVKCSYCSKTMSGGIFRFKHHLAG-TREDSEPCCS 70 Query: 2273 IDRSLREAFH--IQEEERLARKKKKIP-------------------TSGKSSKRIRSS-Q 2160 + +R+ + E ++ + KK+K+ + GK R + Q Sbjct: 71 VPEEIRDLMIKIVAEAKQASLKKRKLNIIDEDQGCEGLEERQHIFGSKGKEKVGSRGAVQ 130 Query: 2159 LAITSVGKAFGKEDVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKL 1980 I + K KE+VD VA FFY + FN+IK+P F M + I +G GY+PPS + Sbjct: 131 ATINQMMKKGYKEEVDAQVAEFFYTSAIPFNVIKNPAFTKMCEMIGKYGAGYKPPSYHDI 190 Query: 1979 LDSFLTKEKARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQT 1800 + L + + D + +E W TGCTI D N V+SP+G +F+ + Sbjct: 191 REKLLKQAIDKTDLVLQEYKEEWKKTGCTIMSDGWTDKKRRSI-CNFLVNSPKGTVFMYS 249 Query: 1799 IDIEKGDGEDDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCT 1620 +D D ++L + VG NV+QV+ + K +G L+ K H++W+ C Sbjct: 250 LDTSDISKTADKVFKMLDDVVELVGEENVVQVVTDNAANFKAAGELLMQKREHLYWTPCA 309 Query: 1619 AHCIQLLMEDI-TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFA 1443 AHCI L ED +L + + + I I I P +FA Sbjct: 310 AHCIDLSFEDFEKKLKVHELTIKKGRKITTYIYGRSMLISMLKKFTKERDLIRPGVTRFA 369 Query: 1442 PTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCE 1263 Y + + +LK +L + SEEW+ K ++ +E +L N FW L++ Sbjct: 370 TAYLTLGCLHELKASLLTMFSSEEWKTSKFGTSQEGKKVENMILDNRFWKNISTCLKVAA 429 Query: 1262 PFVRLLGSLNVD-RSVMGDVYNWRVQALEVVRS-----KRIDDMVLKQLEVVLENRWE-M 1104 P + +L ++ D + MG +Y +A E +++ K+ + V K +++ RW+ Sbjct: 430 PLMVVLRLVDSDAKPAMGFIYEEMDRAKEKIKNNFNHIKKSYEEVWK----IIDARWDNQ 485 Query: 1103 LFSPLHAAGYILNPR--YFGKGQAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRL 930 L PLHAA Y LNP+ Y + ++ D V G ++ R D RR++ QL Y Sbjct: 486 LHRPLHAAAYYLNPQFHYEPEFRSDDPEVKEGLYTSMRRLVKDAAERRIINVQLVEYHFG 545 Query: 929 DGSLGEEDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTC 750 G+ +DA + R + P WWE FG TP+L+ Sbjct: 546 RGAFAMDDAKESRKTILPGEWWEMFGYRTPELKRR------------------------- 580 Query: 749 QEAVNLLGAERVEDLVFVRNNLRLHSKKL 663 N L +++ DL++V NL+L +K++ Sbjct: 581 ----NHLHQKKMNDLLYVMYNLKLSNKQI 605 >ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda] gi|548861623|gb|ERN18994.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda] Length = 863 Score = 219 bits (559), Expect = 4e-54 Identities = 156/603 (25%), Positives = 264/603 (43%), Gaps = 36/603 (5%) Frame = -3 Query: 2372 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHI---QEEERLARKKKKI 2202 +CN+C ++G R++ HL + C + +R+ ++ KK KI Sbjct: 209 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSDVPNDVRDLIQSVLNTPRKQKTPKKPKI 268 Query: 2201 PTSGKSSKRIRSSQ----LAITSVGKAFG-------------------------KEDVDD 2109 + S S+ L + S G+ +E+ D Sbjct: 269 EQTPNSPHNSSSASGGFHLNVGSSGQRGSTCPSLLFPHPSPSGQPILDDSQRQKQEEADK 328 Query: 2108 VVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVS 1929 +A FF+ + + F+ KS Y+H M AIA G GY PS D+L + L K K + + Sbjct: 329 KIALFFFHNSIPFSSSKSIYYHGMVDAIADCGVGYRAPSYDRLRTTLLEKVKVEITDSYK 388 Query: 1928 PVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVL 1749 R+ W +GCTI DG S F I V+ PRG LFL+++D + E+L Sbjct: 389 TYRDEWRESGCTIMSDGWTDGR-SKFLIVFSVACPRGTLFLKSVDASAHVDDAHYLFELL 447 Query: 1748 TKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWM 1569 +++VG ++QVI + +G L+ +K+P +FWS C ++CI ++EDI++ +W+ Sbjct: 448 ESVVLEVGLEYIVQVITDSAANYVYAGRLLTAKYPSLFWSPCASYCIDRMLEDISKQEWV 507 Query: 1568 KPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQE 1389 + A+ I + I + +F + + I+ + L+ Sbjct: 508 STVIEEARSITKYIYGHSWVLNLMKRFTGGKELLRSRITRFVTHFLSLRSIVIHEDNLKH 567 Query: 1388 VVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGD 1209 + EW D ++ + + + FW A ++ L EP +++L ++ D MG Sbjct: 568 MFSHTEWLSSLYSKKSDAQAVRSLIYLDRFWKSAQEVVNLSEPLIKVLRIVDGDMPAMGY 627 Query: 1208 VYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHAAGYILNPRYFGKGQA 1038 +Y +A +++ K +D + E++ + RW + L SPLHAA LNP F Sbjct: 628 IYEGIERAKVAIKAYYKGSEDKYMPIWEII-DRRWNLQLHSPLHAAAAFLNPAIFYNPSF 686 Query: 1037 K-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDYRDKMDPVAWWE 861 K D + G+ + + + + L ++ Y G+LG + AM R P WW Sbjct: 687 KIDSKIRNGFHEAMMKMVLNDKDKMELTKETPMYINAHGALGNDFAMMARTLNTPGDWWA 746 Query: 860 NFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERVEDLVFVRNNLR 681 +G E P LQ AI+ILSQ S + +W + + N L E+ DLV+V NLR Sbjct: 747 GYGYEVPVLQRAAIRILSQPCSSYWCRWNWGTFENVHTKKRNRLEQEKFNDLVYVHCNLR 806 Query: 680 LHS 672 + Sbjct: 807 FQA 809 >ref|XP_003618961.1| hypothetical protein MTR_6g029340 [Medicago truncatula] gi|355493976|gb|AES75179.1| hypothetical protein MTR_6g029340 [Medicago truncatula] Length = 725 Score = 219 bits (559), Expect = 4e-54 Identities = 168/622 (27%), Positives = 274/622 (44%), Gaps = 33/622 (5%) Frame = -3 Query: 2432 GWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLRE 2253 GWK+ G + ++ KC+ C +G R + HL G + D E Sbjct: 35 GWKY-----GTDVNGDARKVKCSFCAKVISGGVYRFKHHLAGTSDDSGPCAQVSDEVKME 89 Query: 2252 AFH-IQEEERLARKKKKIPTSGKSS---------------KRIRS------SQLAITSVG 2139 + E A +K+K+ + + +++R +Q I ++ Sbjct: 90 MLKWVATLEEAAERKRKMAEIAQGNVTEDPAFEVEVSQHLQKVRGKASASGTQTKIDAIA 149 Query: 2138 KAFGKEDVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTK 1959 K K + DD VA FFY + FN I++P F M AI +GP Y+PPS + D L + Sbjct: 150 KKPLKVEADDAVAEFFYTSAIAFNCIRNPAFAKMCVAIGKYGPDYKPPSYRDISDKLLVR 209 Query: 1958 EKARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGD 1779 R ++ V +E W TGC+I D N V+SP+G +FL ++D Sbjct: 210 AVDRTNEIVDKFKEEWKTTGCSIMSDGWTDRKRRSI-CNFMVNSPKGTVFLYSLDTSDIS 268 Query: 1778 GEDDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLL 1599 D ++L + VG NV+QV+ + K G L+ K +FW+ C AHCI L+ Sbjct: 269 KTADKVFKMLDDVVEAVGEDNVIQVVTDNAANFKAGGELLMLKRTKLFWTPCAAHCIDLI 328 Query: 1598 MEDI-TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVW 1422 +ED E+ + A+ + I I P +FA Y + Sbjct: 329 LEDFEKEMIIHNVTIKNARKLTTYIYNRTMLITMVRKFTNGRDLIRPALTRFATAYLTIG 388 Query: 1421 RIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLG 1242 + LK +L + S +W+ + E+ + + +L FW + L+ P + +L Sbjct: 389 CLNDLKSSLINMFDSNDWKSSRFATTEEGKKMASGILDQRFWKNIGVCLKTAAPLMDVLH 448 Query: 1241 SLNVD-RSVMGDVYNWRVQALEVVRSKRIDDM--VLKQLEVV---LENRW-EMLFSPLHA 1083 ++ D + MG +Y +A++ + + ++ V K E V ++ RW L PLHA Sbjct: 449 LVDSDEKPAMGYIY----EAMDACKKQIQNNFNNVQKCYEPVCKIIDQRWMGQLHRPLHA 504 Query: 1082 AGYILNPR-YFGKG-QAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSL-GE 912 AGY LNP+ +FG + D + G + + + SD R + QL+ + G L G Sbjct: 505 AGYYLNPQIHFGPNFKGNDIDIKNGLFSVISKLVSDAAERSKINSQLADFHFSRGPLFGS 564 Query: 911 EDAMDYRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNL 732 E A R +M P WWE +G TP+L+ AI+ILS S + + +W + N Sbjct: 565 EYAKKARAEMHPGQWWEMYGDYTPELKRFAIRILSLTCSSSGCERNWSAFEMVHTKKRNR 624 Query: 731 LGAERVEDLVFVRNNLRLHSKK 666 L +++ DLV+V N+RL K+ Sbjct: 625 LRQQKMNDLVYVMANMRLTRKE 646