BLASTX nr result
ID: Akebia23_contig00005145
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00005145
         (2371 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                   Score    E
Sequences producing significant alignments:                        (bits) Value

emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera]  838   0.0
ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854...  837   0.0
ref|XP_002527444.1| protein dimerization, putative [Ricinus comm...  803   0.0
ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citr...  771   0.0
ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615...  768   0.0
ref|XP_007014534.1| Uncharacterized protein TCM_039722 [Theobrom...  261   1e-66
ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prun...  243   2e-61
ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [A...  243   3e-61
ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao]...  236   4e-59
ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627...  229   3e-57
ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805...  229   4e-57
ref|XP_002443069.1| hypothetical protein SORBIDRAFT_08g007560 [S...  225   6e-56
ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307...  224   1e-55
ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660...  223   2e-55
ref|XP_002509591.1| DNA binding protein, putative [Ricinus commu...  222   5e-55
ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [A...  221   2e-54
ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...  219   3e-54
ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222...  219   3e-54
ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, part...
219 6e-54 ref|XP_006577689.1| PREDICTED: uncharacterized protein LOC102662... 218 1e-53 >emb|CAN70085.1| hypothetical protein VITISV_003006 [Vitis vinifera] Length = 635 Score = 838 bits (2164), Expect = 0.0 Identities = 405/597 (67%), Positives = 488/597 (81%) Frame = -2 Query: 2361 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2182 MP+ESDKWGWKHVSVFGGF+ +GTKRWKCNHCN+RYNGSYSRVRAHLLGFTGVGVKSCP Sbjct: 1 MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNIRYNGSYSRVRAHLLGFTGVGVKSCP 60 Query: 2181 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2002 AIDRSLREAF I EEERL RKKK+ SGK+ KRIR+SQ ++T V K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120 Query: 2001 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1822 FFYADGL+FNI+ SPYF +M KAIA+FGPGYEPP+ +KL D FL+KEKA+++KA++ VRE Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180 Query: 1821 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1642 SWP TGCTI C+++L T + NIFVSSPRGL+FL+ +DI GDG D++F +VL+ AI Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240 Query: 1641 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1462 M+V P NVLQ+I + G +S+ SLI SKF H+FWS CT+H I +LMEDIT+LDW+KP V Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300 Query: 1461 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1282 AK I++CIL DP+S KFAP+Y +V RI +LKQAL VV S Sbjct: 301 LCAKEIDECILTYQRSSLCVLTLESS----DPLSTKFAPSYCIVERIFELKQALLGVVVS 356 Query: 1281 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1102 EEW+QWKL EDV ++E A+LG++FW RA +LQ EPFVRLL +L++++SVMGDV+NW Sbjct: 357 EEWKQWKLTIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416 Query: 1101 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKGQAKDKTVMR 922 RVQALE V+SK +DD++L QLE+++E++W+MLFSPLHASGYILNP+YFGKGQ+KDKT+MR Sbjct: 417 RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476 Query: 921 
GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQ 742 GWKATLDRYESD RRVLREQLSSYWRL+GS GEEDA+DCRDKMDPVAWWENFG ETP Sbjct: 477 GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536 Query: 741 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 571 LQTLAIKILSQ+SSV+ +Q +W DN CQ AVN LG ERAEDLVFVRNNLRLHS++ Sbjct: 537 LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERAEDLVFVRNNLRLHSQR 593 >ref|XP_003632266.1| PREDICTED: uncharacterized protein LOC100854857 [Vitis vinifera] Length = 635 Score = 837 bits (2162), Expect = 0.0 Identities = 405/597 (67%), Positives = 487/597 (81%) Frame = -2 Query: 2361 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2182 MP+ESDKWGWKHVSVFGGF+ +GTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP Sbjct: 1 MPTESDKWGWKHVSVFGGFDKGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 60 Query: 2181 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2002 AIDRSLREAF I EEERL RKKK+ SGK+ KRIR+SQ ++T V K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLARKKKRTSGSGKTGKRIRTSQPSVTCVWKTIAKEDVDDIVAR 120 Query: 2001 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1822 FFYADGL+FNI+ SPYF +M KAIA+FGPGYEPP+ +KL D FL+KEKA+++KA++ VRE Sbjct: 121 FFYADGLDFNIVNSPYFLEMTKAIAAFGPGYEPPTTEKLSDLFLSKEKAKIEKAMALVRE 180 Query: 1821 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1642 SWP TGCTI C+++L T + NIFVSSPRGL+FL+ +DI GDG D++F +VL+ AI Sbjct: 181 SWPHTGCTILCVNRLCRTQGRYYTNIFVSSPRGLMFLKALDINDGDGMDNMFVDVLSDAI 240 Query: 1641 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1462 M+V P NVLQ+I + G +S+ SLI SKF H+FWS CT+H I +LMEDIT+LDW+KP V Sbjct: 241 MEVEPTNVLQIISNLGHASESFESLILSKFRHLFWSPCTSHSICVLMEDITKLDWIKPIV 300 Query: 1461 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1282 AK I++CIL DP+S KFAP+Y +V RI +LKQAL VV S Sbjct: 301 LCAKEIDECILTYQRSSLCVLTLESS----DPLSTKFAPSYCIVERIFELKQALLGVVVS 356 Query: 1281 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1102 EEW+QWKL EDV ++E A+LG++FW RA +LQ EPFVRLL +L++++SVMGDV+NW Sbjct: 
357 EEWKQWKLTIQEDVLNVETAILGDNFWSRACSLLQFFEPFVRLLTTLDIEKSVMGDVFNW 416 Query: 1101 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKGQAKDKTVMR 922 RVQALE V+SK +DD++L QLE+++E++W+MLFSPLHASGYILNP+YFGKGQ+KDKT+MR Sbjct: 417 RVQALEAVKSKGVDDILLNQLELLIESKWDMLFSPLHASGYILNPKYFGKGQSKDKTIMR 476 Query: 921 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQ 742 GWKATLDRYESD RRVLREQLSSYWRL+GS GEEDA+DCRDKMDPVAWWENFG ETP Sbjct: 477 GWKATLDRYESDSATRRVLREQLSSYWRLEGSFGEEDAVDCRDKMDPVAWWENFGFETPH 536 Query: 741 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 571 LQTLAIKILSQ+SSV+ +Q +W DN CQ AVN LG ER EDLVFVRNNLRLHS++ Sbjct: 537 LQTLAIKILSQVSSVSMYQETWQDNEFLCQTAVNGLGVERTEDLVFVRNNLRLHSQR 593 >ref|XP_002527444.1| protein dimerization, putative [Ricinus communis] gi|223533179|gb|EEF34936.1| protein dimerization, putative [Ricinus communis] Length = 633 Score = 803 bits (2075), Expect = 0.0 Identities = 392/597 (65%), Positives = 472/597 (79%) Frame = -2 Query: 2361 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2182 MPSESDKWGW+HVSVFGGF+ +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 2181 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2002 AIDRSLREAF I EEERL RKKKK +GK KR R SQ +I+ K KEDVDD+VAR Sbjct: 61 AIDRSLREAFQILEEERLVRKKKKNSANGKPGKRTRISQASIS--WKTITKEDVDDIVAR 118 Query: 2001 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1822 FFYADGLN +++ SPYFH+M KAI +FG GYE PS+DKL DSFL KEK R++K+++ +RE Sbjct: 119 FFYADGLNIDVVNSPYFHEMVKAIGAFGSGYELPSIDKLSDSFLGKEKGRIEKSLALLRE 178 Query: 1821 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1642 SWP TGCTI C+ +LDG + CF+INIFVSSPRGL+FL+ +D++ D D V L+ AI Sbjct: 179 SWPHTGCTILCVGRLDGAIGCFHINIFVSSPRGLIFLKAVDVDDCDEGDHVLAGALSDAI 238 Query: 1641 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1462 ++VGP+NVLQ+I H G + K S S I SKFPHIFWS CT+H I +LME+I EL+W+KP V Sbjct: 239 
LEVGPSNVLQIISHLGDACKSSESYILSKFPHIFWSPCTSHSILMLMEEIAELEWVKPIV 298 Query: 1461 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1282 A+ IEQCI+ D ISAKFAP+Y V RI +L+Q LQEVV S Sbjct: 299 LCARRIEQCIMTYQHATSCIFMQSPKESC-DLISAKFAPSYFFVQRIFELRQTLQEVVVS 357 Query: 1281 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1102 E QWK ++V SIE+A+LG+DFW ++HL+LQL EPF++LLG L++D+SV+G VY+W Sbjct: 358 E---QWKHSIGDNVESIESAILGDDFWSKSHLLLQLYEPFIKLLGLLDIDKSVIGAVYDW 414 Query: 1101 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKGQAKDKTVMR 922 RVQALE +RSK IDD +L QLEV++EN+W++LFSPLHA+GYILNPRY GK Q KDK+VMR Sbjct: 415 RVQALEALRSKAIDDDILNQLEVLIENKWDVLFSPLHATGYILNPRYIGKFQTKDKSVMR 474 Query: 921 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQ 742 GWKATL+RYE + ARRVLREQLSSYWRL+GSLG+EDA+DCRDKMDPVAWWENFG ETP Sbjct: 475 GWKATLERYEGESTARRVLREQLSSYWRLEGSLGDEDAVDCRDKMDPVAWWENFGFETPS 534 Query: 741 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 571 LQTLAIK+LSQ+SSV Q W N +CQEA N LG +R EDL+FVRNNLRLH +K Sbjct: 535 LQTLAIKVLSQVSSVALCQEIWQTNDFSCQEAANRLGVQRVEDLLFVRNNLRLHYQK 591 >ref|XP_006424350.1| hypothetical protein CICLE_v10028008mg [Citrus clementina] gi|557526284|gb|ESR37590.1| hypothetical protein CICLE_v10028008mg [Citrus clementina] Length = 636 Score = 771 bits (1991), Expect = 0.0 Identities = 381/599 (63%), Positives = 456/599 (76%) Frame = -2 Query: 2361 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2182 MPSESDKWGW+HVSVFGGFE +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 2181 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2002 AIDRS+RE F I EEER+ RKKK+ K KRIR+ Q +I S KA KEDVD++VAR Sbjct: 61 AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSIVS--KAISKEDVDEMVAR 118 Query: 2001 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1822 FFYA GLN N++ SPYF +M ++IA+FG GY+ PS++ L DSFL+KEK +++K ++ VRE Sbjct: 119 
FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178 Query: 1821 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1642 SWP TGCTI C+S LDG L CF IFVSSPRGL+FL+ +D++ D +++F VL+ AI Sbjct: 179 SWPHTGCTILCVSSLDGQLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238 Query: 1641 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1462 +DVGP NVLQ+I H G + K SL+ SKFPHIF S CT I + ME+I L+W+K V Sbjct: 239 LDVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298 Query: 1461 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1282 AK IEQ IL S D +S K AP+Y V RII+LKQ LQE V S Sbjct: 299 LCAKRIEQHILYYQHAYPCLFPHNLKESS-DQVSTKIAPSYCFVQRIIELKQVLQEAVVS 357 Query: 1281 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1102 EE++QWKL P D +E+A+LG+DFWG+AHL LQLCEPFVRLL + ++D+SVMG VY+W Sbjct: 358 EEFKQWKLSMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417 Query: 1101 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKGQAKDKTVMR 922 R QALE VR K ID L QLEV+ ENRW+ LFSPLHA+GYILNPRYFG+GQ KDKTVMR Sbjct: 418 RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477 Query: 921 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQ 742 GWK+TL+RYESD RR+LREQLSSYWRL+GSLGEEDA+D RDKM+PVAWWENFG E Sbjct: 478 GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537 Query: 741 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKKLV 565 LQTLAIK+LSQ+SSV Q W DN C+EA N G ER EDL+FVRNNLRLH+++ V Sbjct: 538 LQTLAIKVLSQVSSVAVCQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNV 596 >ref|XP_006484968.1| PREDICTED: uncharacterized protein LOC102615434 isoform X1 [Citrus sinensis] gi|568863036|ref|XP_006484969.1| PREDICTED: uncharacterized protein LOC102615434 isoform X2 [Citrus sinensis] Length = 636 Score = 768 bits (1984), Expect = 0.0 Identities = 379/599 (63%), Positives = 456/599 (76%) Frame = -2 Query: 2361 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2182 MPSESDKWGW+HVSVFGGFE +GTKRWKCNHCNLRYNGSYSRVRAHLLGF+GVGVKSCP Sbjct: 1 
MPSESDKWGWEHVSVFGGFERGSGTKRWKCNHCNLRYNGSYSRVRAHLLGFSGVGVKSCP 60 Query: 2181 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2002 AIDRS+RE F I EEER+ RKKK+ K KRIR+ Q +I S KA KEDVD++VAR Sbjct: 61 AIDRSMRETFQILEEERIARKKKRTSGIAKHGKRIRACQSSIVS--KAISKEDVDEMVAR 118 Query: 2001 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1822 FFYA GLN N++ SPYF +M ++IA+FG GY+ PS++ L DSFL+KEK +++K ++ VRE Sbjct: 119 FFYAAGLNVNVVNSPYFLEMVRSIAAFGHGYDLPSLENLSDSFLSKEKGKIEKFIASVRE 178 Query: 1821 SWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKAI 1642 SWP TGCTI C+S LDG L CF IFVSSPRGL+FL+ +D++ D +++F VL+ AI Sbjct: 179 SWPHTGCTILCVSSLDGRLGCFPTGIFVSSPRGLVFLKALDLDDTDEAENLFITVLSDAI 238 Query: 1641 MDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWMKPFV 1462 ++VGP NVLQ+I H G + K SL+ SKFPHIF S CT I + ME+I L+W+K V Sbjct: 239 LEVGPKNVLQIISHLGHACKSYESLVLSKFPHIFLSPCTLQSIHMFMEEIASLEWIKSTV 298 Query: 1461 SYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQEVVGS 1282 AK IEQ I+ S D +S K AP+Y V RII+LKQ LQE V S Sbjct: 299 LCAKRIEQHIMYYQHAYPCLFPHNLKESS-DQVSTKIAPSYCFVQRIIELKQVLQEAVVS 357 Query: 1281 EEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNW 1102 EE++QWKL P D +E+A+LG+DFWG+AHL LQLCEPFVRLL + ++D+SVMG VY+W Sbjct: 358 EEFKQWKLSMPGDHGIVESAILGDDFWGKAHLFLQLCEPFVRLLATFDIDKSVMGAVYDW 417 Query: 1101 RVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKGQAKDKTVMR 922 R QALE VR K ID L QLEV+ ENRW+ LFSPLHA+GYILNPRYFG+GQ KDKTVMR Sbjct: 418 RFQALEAVRMKGIDATALNQLEVLTENRWDALFSPLHAAGYILNPRYFGRGQNKDKTVMR 477 Query: 921 GWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQ 742 GWK+TL+RYESD RR+LREQLSSYWRL+GSLGEEDA+D RDKM+PVAWWENFG E Sbjct: 478 GWKSTLERYESDSATRRILREQLSSYWRLEGSLGEEDAVDFRDKMEPVAWWENFGFEISH 537 Query: 741 LQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKKLV 565 LQTLAIK+LSQ+SSV Q W DN C+EA N G ER EDL+FVRNNLRLH+++ V Sbjct: 538 LQTLAIKVLSQVSSVAICQEIWQDNDFPCREAANRSGVERPEDLIFVRNNLRLHNQRNV 596 >ref|XP_007014534.1| Uncharacterized protein TCM_039722 
[Theobroma cacao] gi|508784897|gb|EOY32153.1| Uncharacterized protein TCM_039722 [Theobroma cacao] Length = 381 Score = 261 bits (666), Expect = 1e-66 Identities = 127/213 (59%), Positives = 158/213 (74%) Frame = -2 Query: 1308 QALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDR 1129 +ALQ+VV SEEW+QWK +D+ IEA++LG++FW AH+MLQL +PF +LL L++D+ Sbjct: 146 KALQDVVVSEEWKQWKHSILKDILIIEASILGDEFWSNAHMMLQLFKPFAKLLAMLDIDK 205 Query: 1128 SVMGDVYNWRVQALEVVRSKRIDDMVLKQLEVVLENRWEMLFSPLHASGYILNPRYFGKG 949 SVMG +Y+WRVQALEVVRSK ID+ L QLEV++EN+W +LFS LHA+GYILNP YFGK Sbjct: 206 SVMGAIYDWRVQALEVVRSKEIDETALNQLEVLIENKWNVLFSLLHAAGYILNPGYFGK- 264 Query: 948 QAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWW 769 AR VLR+QLSSYWRL+GS GEEDA+DCRDKMD VAWW Sbjct: 265 -----------------------ARWVLRKQLSSYWRLEGSFGEEDALDCRDKMDLVAWW 301 Query: 768 ENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 670 ENFG ETP LQTLAIK+LSQ+S+++ Q W D Sbjct: 302 ENFGFETPHLQTLAIKVLSQVSTISMCQDIWQD 334 Score = 181 bits (459), Expect = 1e-42 Identities = 100/201 (49%), Positives = 117/201 (58%) Frame = -2 Query: 2361 MPSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCP 2182 M SE DKWGW+HV+VFG F+ +GTKRWKCNHCNLRYNGSYSRVRAHLL F+GVGVKSC Sbjct: 1 MASEFDKWGWEHVTVFGVFDRGSGTKRWKCNHCNLRYNGSYSRVRAHLLRFSGVGVKSCL 60 Query: 2181 AIDRSLREAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFGKEDVDDVVAR 2002 AI+R+LREAFHI EEERL R KK T G Sbjct: 61 AINRTLREAFHILEEERLAR--KKKRTFGSGKP--------------------------- 91 Query: 2001 FFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVRE 1822 +FG GYEPPS+DKL D FL+KEK R++K+++ VRE Sbjct: 92 -------------------------TFGCGYEPPSMDKLSDCFLSKEKGRIEKSITLVRE 126 Query: 1821 SWPLTGCTIFCLSQLDGTLSC 1759 SWP TG T+ C+ G L C Sbjct: 127 SWPHTGYTVLCV----GCLGC 143 >ref|XP_007214864.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica] gi|462411014|gb|EMJ16063.1| hypothetical protein PRUPE_ppa018860mg [Prunus persica] Length = 805 Score = 243 bits (621), Expect = 2e-61 Identities = 183/648 (28%), Positives = 297/648 
(45%), Gaps = 60/648 (9%) Frame = -2 Query: 2334 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRS-LRE 2158 WK+V G ++CN+C + GSY RV++HLL G GV SC + S L E Sbjct: 128 WKYVKKLEKDGKAGGNTSFQCNYCQKTFKGSYFRVKSHLLKLKGNGVASCTKVTNSHLME 187 Query: 2157 AFHIQEEERLTRKKKKI-----PTSGKSSKRIRSSQLAITS-------------VGKAFG 2032 + EE L K ++ PTS SS+ SS L ++S + KAF Sbjct: 188 MEKVVEEAELRVKMAQLRDVPLPTSNTSSQGGSSSGLGMSSNWCSDSKKRKGNPIEKAFN 247 Query: 2031 ---KEDVDDVVARFFYADGLNFNIIKSPYFHDMAK-AIASFGPGYEPPSVDKLLDSFLTK 1864 +E +D +AR FY GL+F ++P++ + + A + PGY+PP + L + L K Sbjct: 248 NNLREQLDGEIARMFYTGGLSFQFSRNPHYVNAFRIACSKTLPGYQPPGYNMLRTTLLQK 307 Query: 1863 EKARMDKAVSPVRESW------PLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTI 1702 EK +++ VS + W PL IN+ G +FL+ I Sbjct: 308 EKNNIEEWVSVCSDGWSDAQRRPL-------------------INVMAICESGPMFLKAI 348 Query: 1701 DIEKGDGEDDVF-TEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCT 1525 + E G+ +D F +L ++I ++GP NV+QV+ K +G ++ +KF HIFW+ C Sbjct: 349 NCE-GECKDKFFMANLLIESIREIGPQNVVQVVTDNAPVCKAAGHIVEAKFKHIFWTPCV 407 Query: 1524 AHCIQLLMEDI-----------TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXX 1378 H + L +++I + W+ S A I+ I+ Sbjct: 408 VHTLNLALKNICSPVPRNPEVYEQCSWISTISSDAWFIKNFIM-NHNMRLSMYNDHCKLK 466 Query: 1377 SIDPISAKFAPTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWG 1198 + +FA T M+ R ++KQ L+++V SE+W +K +++ +L FW Sbjct: 467 LLSVAETRFASTIVMLRRFKQVKQGLEQMVISEQWDIYKEDDVVKARTVKEKILDECFWE 526 Query: 1197 RAHLMLQLCEPFVRLLGSLNVDRSVMGDVYNWRVQALEVVRS-------KRIDD--MVLK 1045 +L P +L + D + +Y W +E V++ K++++ M Sbjct: 527 DIDYILNFTSPIYEMLRLSDTDMPCLHLIYEWWDSMIEKVKTIIYRKERKQLNEESMFFN 586 Query: 1044 QLEVVLENRWEMLFSPLHASGYILNPRYFGK----------GQAKDKTVMRGWKATLDRY 895 + +L +RW +PLH + LNP+Y+ K KD + R K ++R+ Sbjct: 587 VVHEILVDRWTKSSTPLHCFAHSLNPKYYCKEWLDMAHNRCPPHKDIEITRERKQCIERF 646 Query: 894 ESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKIL 715 S+ + RR + E+ +S+ D+M R M PV WW G+ TP+LQT+A+K+L Sbjct: 647 FSNEVERRAVNEEYASFSACIEDFSGMDSMKDRGFMAPVKWWVIHGASTPKLQTIALKLL 706 Query: 714 
SQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 571 SS + + +W N + ERAEDLVFV +NLRL S+K Sbjct: 707 GHPSSSSCCERNWSTYNFIHSIKRNKITPERAEDLVFVHSNLRLLSRK 754 >ref|XP_006841838.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda] gi|548843859|gb|ERN03513.1| hypothetical protein AMTR_s00003p00270420 [Amborella trichopoda] Length = 732 Score = 243 bits (620), Expect = 3e-61 Identities = 170/633 (26%), Positives = 282/633 (44%), Gaps = 45/633 (7%) Frame = -2 Query: 2334 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREA 2155 W ++ G T G +C C + GSY+RV++HLLG G GVK C ID Sbjct: 35 WAYMEKIGRCHTGGGNWMLRCVLCKAEFKGSYTRVKSHLLGKVGTGVKRCLGIDNETLAT 94 Query: 2154 FHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITSVGKAFG---------KEDVDDVVAR 2002 +E TRK + S ++ S + + A K+ +D ++AR Sbjct: 95 LLRLNDEGSTRKIRSSSRSSVPLLKVNSGSIGLKKRRGANDLVKLLDLAPKDVLDRMIAR 154 Query: 2001 FFYADGLNFNIIKSPYFHDMAK-AIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVSPVR 1825 FYA G++ N+I+SPYF DM + A + GY P+ D L S L EKA ++++V P R Sbjct: 155 CFYASGISLNLIRSPYFRDMIRYACENSLEGYVLPTFDNLRTSLLDAEKANIEQSVKPFR 214 Query: 1824 ESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTKA 1645 SW G ++ D T IN +S G +FL+ ID D + + Sbjct: 215 SSWGSRGVSLLTDGWTDTTAKRPLINFMAASDIGSIFLKAIDSSVEMMNTDYMKNLFLEM 274 Query: 1644 IMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD----- 1480 + +VGP +V+Q+I +++G + P+IFW+ C H + L +++I D Sbjct: 275 VAEVGPTSVVQIITDNSPICRVAGQRVEGMHPYIFWTPCVIHTLNLALKNICSPDDERKA 334 Query: 1479 -------WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRI 1321 W++ K I ++ + ++FA T +V RI Sbjct: 335 EKYLHCQWIRDLDRDVKMIRSFVV-DHNAVLTIYSQYPTLRLLSVTESRFASTVIIVKRI 393 Query: 1320 IKLKQALQEVVGSEEWRQWKLMYPEDVPS---IEAAVLGNDFWGRAHLMLQLCEPFVRLL 1150 ++K AL +V WK++ ED +++ ++ + +W + ++ EP + +L Sbjct: 394 KEVKPALCRMVVDS---YWKVLVEEDAEKARRVKSCLVDDLWWEKIEFLIAFTEPILAML 450 Query: 1149 GSLNVDRSVMGDVYNWRVQALEVVRS-------KRI---DDMVLKQLEVVLENRWEMLFS 1000 +++ D + +VY+ +E VR K I + + + +L W + Sbjct: 451 
RAIDTDEPTLHEVYDMWATMIEEVRGIIFRNEGKNIFLNESSFYEDIHRILVGSWNKSKT 510 Query: 999 PLHASGYILNPRYFGK---GQA-------KDKTVMRGWKATLDRYESDGMARRVLREQLS 850 PL + LNP+Y+ G+ KD+ V G R + + E+ Sbjct: 511 PLQCLAHSLNPKYYSDEWLGEVPSRLPPHKDREVSDGRNVCFARLFPAPSELQKVHEEFE 570 Query: 849 SYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 670 + G G D M R M P++WWENFG+ P+L LA ++LSQ SS + + +W Sbjct: 571 MFSMCKGHFGHWDVMSSRFSMSPISWWENFGAHVPRLAKLADRLLSQPSSSSCCERNWGT 630 Query: 669 NGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 571 + N L ++RAEDLV+V +NLRL S++ Sbjct: 631 FSLIKKIKQNRLASQRAEDLVYVHSNLRLLSRR 663 >ref|XP_007039961.1| HAT transposon superfamily [Theobroma cacao] gi|508777206|gb|EOY24462.1| HAT transposon superfamily [Theobroma cacao] Length = 674 Score = 236 bits (602), Expect = 4e-59 Identities = 165/607 (27%), Positives = 276/607 (45%), Gaps = 40/607 (6%) Frame = -2 Query: 2277 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLTRKKKKIPTS 2098 +CN+C+ ++G R++ HL + C + +R+ HIQ + KK+K P Sbjct: 23 RCNYCHREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRD--HIQTILN-SPKKQKTPKK 79 Query: 2097 GKSSKRIRSSQLAITSV------------------------------------GKAFGKE 2026 K K + + Q +S G+ +E Sbjct: 80 PKVDKAVANDQQNSSSASGGLHLNHGSSGQHGSTCPSLLFPRPSPSEQPAVDDGQKQKQE 139 Query: 2025 DVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMD 1846 D D +A FF+ + + F+ KS Y+ +M AIA G GY+ PS + L + L K K + Sbjct: 140 DADKKIAVFFFHNSIPFSAAKSMYYQEMVDAIAKCGVGYKAPSYENLRSTLLEKVKGDIH 199 Query: 1845 KAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVF 1666 R+ W TGCTI C S DG F I V+ P+G LFL+++D+ + + Sbjct: 200 DCYKKYRDEWKETGCTILCDSWSDGRTKSFVI-FSVTCPKGTLFLKSVDVSGHEDDASYL 258 Query: 1665 TEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITE 1486 E+L +++VG NV+QVI S +G L+ +K+ +FWS C ++CI ++EDI++ Sbjct: 259 FELLESVVLEVGLENVIQVITDTAASYVYAGRLLMAKYSSLFWSPCASYCINKMLEDISK 318 Query: 1485 LDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQ 1306 +W+ + AK I Q I + + P +F Y + II + Sbjct: 319 
QEWVGIVLEEAKSIVQYIYSHAWIVNMMRKFTGGRELMRPRITRFVANYLTLRSIIIQED 378 Query: 1305 ALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRS 1126 L+ + EW D +I++ + FW AH + + EP V++L ++ D Sbjct: 379 NLKHMFSHSEWLSSIYSRRSDAQAIKSLLYLERFWKSAHEAVSVSEPLVKILRIVDGDMP 438 Query: 1125 VMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYFG 955 MG +Y +A +++ K +++ + +++ + RW M L SPLHA+ LNP F Sbjct: 439 AMGYIYEGIERAKVAIKAYYKGLEEKYMPIWDII-DRRWNMQLHSPLHAAAAFLNPSIFY 497 Query: 954 KGQAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPV 778 K D + G++ + + + + + ++ Y G+LG + A+ R P Sbjct: 498 NPNFKIDLRMRNGFQEAMLKLATTDKDKIEITKEHPMYINAQGALGTDFAIMGRTLNAPG 557 Query: 777 AWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVR 598 WW ++G E P LQ +AI+ILSQ S + +W S + N + E+ DLVFV Sbjct: 558 DWWASYGYEIPTLQRVAIRILSQPCSSHWCRWNWSTFESIHTKKRNKVELEKFNDLVFVH 617 Query: 597 NNLRLHS 577 NL L + Sbjct: 618 CNLCLQA 624 >ref|XP_006477267.1| PREDICTED: uncharacterized protein LOC102627361 [Citrus sinensis] Length = 674 Score = 229 bits (585), Expect = 3e-57 Identities = 164/608 (26%), Positives = 275/608 (45%), Gaps = 41/608 (6%) Frame = -2 Query: 2277 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLTRKKKKIPTS 2098 +CN+C ++G R++ HL + C + +R+ HIQ + +K+K P Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSEVPDDVRD--HIQRILSIPKKQKN-PKR 79 Query: 2097 GKSSKRIRSSQLAITSVGKAFGK------------------------------------E 2026 K K + Q +S + + Sbjct: 80 PKVEKATANGQQNSSSASGGIHQNNRSSGQHGSSCPSLLFRHPSPSIQPIVDDTQKQRQD 139 Query: 2025 DVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMD 1846 D D +A FF+ + + F+ KS Y+ +M AIA G GY PS +KL + L K K +D Sbjct: 140 DTDKKIAVFFFHNSIPFSAAKSMYYQEMVNAIAECGVGYIAPSYEKLRSTLLEKVKVDID 199 Query: 1845 KAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVF 1666 RE W TGCTI C + D + V+ P+G LFL+++D+ G ED F Sbjct: 200 DCCKKYREEWKETGCTILCDNWSDERTKSLVV-FSVACPKGTLFLKSVDVS-GHEEDATF 257 Query: 1665 T-EVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDIT 1489 E+L ++DVG NV+QVI +G L+ +K+ +FWS 
C A+CI ++EDI+ Sbjct: 258 LFELLESVVLDVGVENVIQVITDSAACYVYAGRLLMTKYSSLFWSPCAAYCIDKMLEDIS 317 Query: 1488 ELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLK 1309 + +W+ + AK I + + I P +F Y + I+ + Sbjct: 318 KQEWVAMVLEEAKTITKYFYSHAWTLNMMRKLTGGRELIRPRITRFVANYLSLRSIVIHE 377 Query: 1308 QALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDR 1129 + L+ + EW D +I++ + + FW AH ++ + EP V++L ++ D Sbjct: 378 ENLKHMFSHSEWLSSIYSRRPDAQAIKSLLYLDRFWRSAHEVVSVSEPLVKILRIVDGDM 437 Query: 1128 SVMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYF 958 MG +Y +A +++ K +++ + +++ + RW M L SPLHA+ LNP F Sbjct: 438 PAMGYMYEGIERAKLAIQAYYKGVEEKYVPIWDII-DRRWNMQLHSPLHAAAAFLNPSIF 496 Query: 957 GKGQAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDP 781 K D + G++ + + + + + ++ Y G+LG + A+ R P Sbjct: 497 YNPNFKIDLRMRNGFQEAMIKLATADKDKIEITKEHPVYINAQGALGTDFAVLGRKLNAP 556 Query: 780 VAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFV 601 WW ++G E P LQ AI+ILSQ S ++ +W S + N + E+ DL+FV Sbjct: 557 GDWWASYGYEIPTLQRAAIRILSQPCSSYWYRWNWSTFESIHNKKRNKVEMEKFNDLLFV 616 Query: 600 RNNLRLHS 577 NLRL + Sbjct: 617 HCNLRLQA 624 >ref|XP_003538648.1| PREDICTED: uncharacterized protein LOC100805582 isoform X1 [Glycine max] gi|571487050|ref|XP_006590550.1| PREDICTED: uncharacterized protein LOC100805582 isoform X2 [Glycine max] Length = 675 Score = 229 bits (584), Expect = 4e-57 Identities = 155/607 (25%), Positives = 267/607 (43%), Gaps = 40/607 (6%) Frame = -2 Query: 2277 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLTRKKKKIPTS 2098 +CN+C ++G R++ HL + C + +R+ HIQ +K K Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQSILSAPKKPKTPKKQ 80 Query: 2097 GKSSKRIRSSQLAITSVGKAFG------------------------------------KE 2026 + + Q +S F ++ Sbjct: 81 KTDQATVANGQQNSSSASGGFHHNHGYSGQNGSACPSLLFPNPSPSAQPLEHDAQKQKQD 140 Query: 2025 DVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMD 1846 D D +A FF+ + + F+ KS Y+ +M A+A G GY+ PS +KL + L K KA + Sbjct: 141 
DADRKLAIFFFHNSIPFSAAKSIYYQEMVDAVAQCGVGYKAPSYEKLRSTLLEKVKADIH 200 Query: 1845 KAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVF 1666 R+ W TGCT+ C + DG + V+ P+G LFL+++D+ + + Sbjct: 201 SDYKKYRDEWKETGCTVLCDNWSDGRTGSLAV-FSVACPKGTLFLKSVDVSGHENDSTYL 259 Query: 1665 TEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITE 1486 E+L +++VG NV+QVI S +G L+ +++ +FWS C A+CI ++EDI Sbjct: 260 FELLESVVLEVGAENVVQVITDASASYVCAGRLLIARYSFLFWSPCVAYCIDKMLEDIGR 319 Query: 1485 LDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQ 1306 DW+ + AK I Q I + I P +F + + I+ + Sbjct: 320 QDWVGTVLEEAKTITQYIYSHAWILNIMRKFTGGKELIRPKITRFVTNFLSLKSIVMQED 379 Query: 1305 ALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRS 1126 ++ + EW D +I + + + FW AH + + EP V+ L ++ D Sbjct: 380 NIKHMFSHSEWLSSIYRRRPDAQAINSLLYSDRFWKYAHEAVSVSEPLVKCLRMVDGDMP 439 Query: 1125 VMGDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRY-F 958 MG VY +A +++ K I++ + +++ + RW M + S LHA+ LNP + Sbjct: 440 AMGYVYEGIERAKVAIKAYYKGIEEKYIPIWDII-DRRWNMQIHSSLHAAAAFLNPSISY 498 Query: 957 GKGQAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPV 778 KD + G++ + R + + ++L +Y G+LG + A+ R P Sbjct: 499 NPNFKKDLRMRNGFQEAMLRLAITDKDKMEITKELPTYINAQGALGTDFAVLGRTLNAPG 558 Query: 777 AWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVR 598 WW ++G E P LQ A++ILSQ S ++ +W S N + E+ +LVFV Sbjct: 559 DWWASYGYEIPTLQKAAVRILSQPCSSLWYRWNWSTFESIHNRKRNRVELEKFSELVFVH 618 Query: 597 NNLRLHS 577 +NL L + Sbjct: 619 SNLWLQT 625 >ref|XP_002443069.1| hypothetical protein SORBIDRAFT_08g007560 [Sorghum bicolor] gi|241943762|gb|EES16907.1| hypothetical protein SORBIDRAFT_08g007560 [Sorghum bicolor] Length = 713 Score = 225 bits (574), Expect = 6e-56 Identities = 164/633 (25%), Positives = 285/633 (45%), Gaps = 45/633 (7%) Frame = -2 Query: 2334 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSL--- 2164 W HV + G W+C + L Y GSYSR+++HLL +G G+K C A+D+ + Sbjct: 23 WNHVVLLEK-AAAGGNAVWRCKYYKLEYKGSYSRIKSHLLRISGGGIKICTAVDKFILAQ 81 Query: 
2163 ---REAFHIQEEERLTRKKKKIPTSG-KSSKRIRSSQLAITSVGKAFGKE---DVDDVVA 2005 A E ER K +P +S +R+ + +++ KAF E +D ++ Sbjct: 82 LKSEVAEAADEIERSKAKVIPLPVENVDASNSMRNKRQRSSALEKAFDMETRNQLDAIIG 141 Query: 2004 RFFYADGLNFNIIKSPYFHDMAKAIASFG-PGYEPPSVDKLLDSFLTKEKARMDKAVSPV 1828 R FY+ G++FNI ++PY+ + + AS GY PPS +KL + L +E+A ++ + + Sbjct: 142 RLFYSGGVSFNIARNPYYRESYRFAASHNLDGYVPPSYNKLRTTLLKQERAHVESLLDRM 201 Query: 1827 RESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVLTK 1648 + W G TI C + IN +FL+ ID + E L + Sbjct: 202 KSVWAEKGVTI-CSDGWSDSQRRPLINFIAVCKGKPMFLRAIDASGEEKTKFFIAEKLIQ 260 Query: 1647 AIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDIT------- 1489 + +VGP NV+Q+I + K +G ++ K+ +IFW+ C H + L +++I Sbjct: 261 VVEEVGPKNVVQIITDNAANCKGAGLIVQQKYDNIFWTPCIVHTLNLALKNICAAKLPRT 320 Query: 1488 --------ELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNM 1333 EL W+ A I+ I+ + +FA M Sbjct: 321 EEQEIVYDELHWITLVAGDANMIKNYIM-NHSMRLSMFNEFSKLKLLAVAETRFASVVVM 379 Query: 1332 VWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRL 1153 + R + +K+ALQ +V S+ W +K + +L + +W ++ +P + Sbjct: 380 LTRFLMVKRALQRMVISDAWESYKDDNAGTAKHVREKILCSKWWDNVQYIVDFTDPIYEM 439 Query: 1152 LGSLNVDRSVMGDVYN-W-----RVQALEVVRSKRIDD---MVLKQLEVVLENRWEMLFS 1000 L + DR + +Y W +V+ + + K+ +D ++ +L +RW + Sbjct: 440 LRMADTDRPCLHLIYEMWDTMIAKVKKVVYTKEKKNNDEQSTFFSTVQDILLDRWTKSNT 499 Query: 999 PLHASGYILNPRYF-------GKGQA---KDKTVMRGWKATLDRYESDGMARRVLREQLS 850 PL + LNPRY+ +G+ KD + ++ G ++++ S Sbjct: 500 PLICLAHSLNPRYYHEKWISENEGREPPHKDLEISVQRMKCFRKFFPVGKDLNQVKDEYS 559 Query: 849 SYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHD 670 + L + D++ R +DP+ WW N G P LQ LA+K+L+Q +S ++ + +W Sbjct: 560 RFATCSEELNDFDSIYDRWILDPLKWWANHGQSIPMLQKLALKLLNQPASSSSCERNWST 619 Query: 669 NGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 571 N L E AEDLVF+ NNLRL ++K Sbjct: 620 YSFVHSMLRNKLAPECAEDLVFIHNNLRLLARK 652 >ref|XP_004292297.1| PREDICTED: uncharacterized protein LOC101307174 [Fragaria vesca subsp. 
vesca] Length = 719 Score = 224 bits (571), Expect = 1e-55 Identities = 157/645 (24%), Positives = 286/645 (44%), Gaps = 57/645 (8%) Frame = -2 Query: 2334 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLR-- 2161 WK+V++ G + G + CN C + GS+SRV++HLL G GVK P I R Sbjct: 25 WKYVTITSGSDKSGGNVAFTCNFCGGKLTGSHSRVKSHLLRIKGTGVKIYPTITRDQTVE 84 Query: 2160 ---------EAFHIQEEERLTRKKKKIPTSGKSSKRIRSSQLAITS-------VGKAFGK 2029 + + + + ++ + SG S +R + + + KAF + Sbjct: 85 LQALLDHCDQQLNAKAQHKVALPPSSMTGSGISYFPLREREDEVKKRRGLSPQLSKAFRQ 144 Query: 2028 ED---VDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEK 1858 ED D VAR FY+ GL FN+ ++P + + + ++AS PGY PP + L + L EK Sbjct: 145 EDRRECDASVARLFYSSGLAFNVARNPNYRE-SYSLASKIPGYVPPGYNALRTTLLDNEK 203 Query: 1857 ARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGE 1678 +++ + P++++W TG ++ DG IN+ ++ G + L+ I+ E Sbjct: 204 RHIERTLLPIKKTWKETGVSLCSDGWTDGQKRPL-INMMAAAKDGAMMLKAINCEGVTKS 262 Query: 1677 DDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLME 1498 + +L ++I ++GP NV+QV+ S +G+++ PHIFW+ C H + L ++ Sbjct: 263 KEEIGRLLLESINEIGPENVVQVVTDNAPVSAAAGAIVEITHPHIFWTPCVVHTLNLALK 322 Query: 1497 D-------------ITELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISA 1357 D + EL W+ + I+ ++ + Sbjct: 323 DLLKAKSYLPGETVVEELGWLMEVYNDVWFIKNFVV-NHNMRLAMYHEHCALRLLQVAPT 381 Query: 1356 KFAPTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQ 1177 +FA + ++ R +K LQ++V S+ W +K ++ +L FW + ++ Sbjct: 382 RFASHFIVLKRFRDVKSGLQQMVISQRWDLYKEDDASKARVVKEMLLKEKFWEQIDFLIA 441 Query: 1176 LCEPFVRLLGSLNVDRSVMGDVYNWRVQALEVVRSKRIDD----MVLKQLEV-------- 1033 L P ++ ++DR + VY W +E V+ + ++ + +V Sbjct: 442 LMGPIYEMIRMSDMDRPCLHLVYEWWNSMIEKVKKAVFNPEFVHVITEHCDVTRFYDVVY 501 Query: 1032 -VLENRWEMLFSPLHASGYILNPRYFGKGQA----------KDKTVMRGWKATLDRYESD 886 +L RW +PLH + LNP+Y+ +D + + + D Sbjct: 502 PILTARWTKSCTPLHCLAHSLNPKYYSSQWLEEDPNRVPPHRDAELNNERRRCFQKLFPD 561 Query: 885 GMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQI 706 R + E+ + + G DA++ + +P+ WW ++G TP LQ+LA+K+L+Q 
Sbjct: 562 SQTRNKVMEEFARFSLNMGDFSSSDALENKFCFEPLTWWVSYGPSTPLLQSLALKLLNQP 621 Query: 705 SSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 571 S + + +W N L RA+DLV+V NLRL ++K Sbjct: 622 CSSSCCERNWSTYAFIQGLKRNKLQPRRAQDLVYVHTNLRLLARK 666 >ref|XP_006579099.1| PREDICTED: uncharacterized protein LOC102660479 [Glycine max] Length = 765 Score = 223 bits (569), Expect = 2e-55 Identities = 168/634 (26%), Positives = 279/634 (44%), Gaps = 46/634 (7%) Frame = -2 Query: 2334 WKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAI-DRSLRE 2158 W V++ G + W CN C SYSRV+AHLL G G+ +CP + D L Sbjct: 21 WSFVTIKEKIGDGGGNRLWSCNFCEKVVKSSYSRVKAHLLRICGSGIDTCPKVTDAYLVY 80 Query: 2157 AFHIQEEERLTRKKKKIPTSGKSS---------KRIRSSQLAITSVGKAFGKEDVDDV-- 2011 + EE K K +P KR +SS ++ AF ED + + Sbjct: 81 LRRVCEEAESILKSKNVPLPTDKRTPTPPTLPPKRRKSS-----NIESAFNIEDRNHLRA 135 Query: 2010 -VARFFYADGLNFNIIKSPYFHDMAKAIASFG-PGYEPPSVDKLLDSFLTKEKARMDKAV 1837 +AR FY+ L+F++ ++PYF A+ G+ PPS + L S L +E++ +++ + Sbjct: 136 EIARMFYSASLSFHLARNPYFVSSYSFAANCNLSGFLPPSYNALRTSLLQQERSYIERLL 195 Query: 1836 SPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEV 1657 P++ W L G T+ D + IN S G +FL+ ID K + ++ Sbjct: 196 QPIKSLWSLKGVTLVVDGWTDAQIRPL-INFMAISEEGPMFLKAIDGSKEYKDKHYMFDL 254 Query: 1656 LTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDI----- 1492 L I +VGP +V+QVI K +G LI +FPHIFW+ C H + L +++I Sbjct: 255 LKDVIKEVGPQSVVQVITDNAYVCKAAGLLIEVEFPHIFWTPCVVHTLNLGVKNICAAKN 314 Query: 1491 --------TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYN 1336 E W+ + A I+ I+ + +FA Sbjct: 315 VDGNENVFNEGGWIAEVIGDASFIKVFIMT-HSMRLAIFNEFSSLKLLSIAETRFASMIV 373 Query: 1335 MVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVR 1156 M+ R+ LK+ LQ +V S++W ++ ++ +L + +W + +L +P Sbjct: 374 MLKRLKLLKRCLQNMVISDQWNSYREDDVRKAAHVKELILNDIWWDKVDYILSFMDPIYS 433 Query: 1155 LLGSLNVDRSVMGDVYNWRVQALEVVRSK--RIDDMVLKQLEV-------VLENRWEMLF 1003 ++ + + S + VY +E V++ R D+++ ++ +L +RW Sbjct: 434 MIRICDTNASNLHLVYEMWDSMIEKVKTTIYRHDEVLENEVSTFFEVIHEILNSRWSKSC 493 Query: 
1002 SPLHASGYILNPRYFGKG----------QAKDKTVMRGWKATLDRYESDGMARRVLREQL 853 +PLH + LNPRY+ +D + L RY + R + E+ Sbjct: 494 NPLHCLAHSLNPRYYSDNWLNEVPNRVPPHRDDELSSQRNKCLKRYFPNVNVRTKVYEEF 553 Query: 852 SSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWH 673 S + G G D ++ R +D WW GS TP LQ +A+K+L Q S + + +W Sbjct: 554 SKFSSCAGDFGSFDIIEDRWALDSKTWWVMHGSSTPILQKVALKLLVQPCSSSCCERNWS 613 Query: 672 DNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSKK 571 N + ++A+DLVFV +NLRL S+K Sbjct: 614 TYSFIHSLKRNKMDPKKAKDLVFVHSNLRLLSRK 647 >ref|XP_002509591.1| DNA binding protein, putative [Ricinus communis] gi|223549490|gb|EEF50978.1| DNA binding protein, putative [Ricinus communis] Length = 670 Score = 222 bits (566), Expect = 5e-55 Identities = 160/605 (26%), Positives = 272/605 (44%), Gaps = 38/605 (6%) Frame = -2 Query: 2277 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQE-----EERLTRKKK 2113 +CN+CN ++G R++ HL + C + +R HIQ +++ T KK+ Sbjct: 23 RCNYCNREFSGGVYRMKFHLAQIKNKDIVPCAEVPDDVRN--HIQSILSTPKKQKTPKKQ 80 Query: 2112 KIP------------TSGKSSKRIRSSQLAITSVGKAFGK-----------------EDV 2020 K + G R S Q T F + + Sbjct: 81 KTDQAENGQDNSSSASGGVHPNRGSSGQHGSTCPSLLFSRPLPTSQPVVDDAQNEKQNNA 140 Query: 2019 DDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKA 1840 D +A FF+ + + F+ KS Y+ +M A+A G GY+ PS +KL S L K K + Sbjct: 141 DKRIAVFFFHNSIAFSAAKSIYYQEMFDAVAECGQGYKAPSFEKLRSSLLEKVKGDIHDW 200 Query: 1839 VSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTE 1660 R+ W TGCTI C DG I V+ P+G LFL+++DI + + + E Sbjct: 201 YRKYRDDWKETGCTILCDGWSDGRTKSV-IVFSVTCPKGTLFLKSVDISGHENDANYLFE 259 Query: 1659 VLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD 1480 +L +++VG NV+QVI S +G L+ +K+ +FWS C ++C+ ++EDI++ + Sbjct: 260 LLESILLEVGVENVIQVITDSTASYVYAGRLLMAKYSSLFWSPCASYCVNKMLEDISKQE 319 Query: 1479 WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQAL 1300 W+ + A I + I + I P ++ Y + I+ + L Sbjct: 320 WVGTVMEEANTITKYIYSHAWTLNMMRRFTGGRELIRPRITRYVSNYLSLRAIVIQEDNL 379 Query: 1299 
QEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVM 1120 + + EW D +++ + + FW AH + + EP +++L ++ D M Sbjct: 380 KHMFSHSEWLSSMHSRRPDAQIVKSFLSQDRFWKFAHEAVSISEPLIKILRIVDGDMPAM 439 Query: 1119 GDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYFGKG 949 G +Y +A +++ K I+D + E++ + RW + L SPLHA+ LNP F Sbjct: 440 GYIYEVLERAKVSIKAYYKGIEDKYMPIWEII-DRRWNIQLHSPLHAAAAFLNPSIFYNQ 498 Query: 948 QAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAW 772 K D + G++ + + + + + + ++ Y G+LG + A+ R P W Sbjct: 499 NFKIDLRMRNGFQEAMIKMATSDIDKIEITKEHPIYINGQGALGTDFAIMGRTLNSPGDW 558 Query: 771 WENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNN 592 W +G E P LQ +AI++LSQ S + +W S + N E+ DLVFV N Sbjct: 559 WAGYGYEIPTLQRVAIRLLSQPCSSHWCRWNWSTFESIHTKKRNKAELEKLNDLVFVHCN 618 Query: 591 LRLHS 577 L L + Sbjct: 619 LWLQA 623 >ref|XP_006857527.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda] gi|548861623|gb|ERN18994.1| hypothetical protein AMTR_s00061p00028660 [Amborella trichopoda] Length = 863 Score = 221 bits (562), Expect = 2e-54 Identities = 156/603 (25%), Positives = 265/603 (43%), Gaps = 36/603 (5%) Frame = -2 Query: 2277 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHI---QEEERLTRKKKKI 2107 +CN+C ++G R++ HL + C + +R+ ++ T KK KI Sbjct: 209 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCSDVPNDVRDLIQSVLNTPRKQKTPKKPKI 268 Query: 2106 PTSGKSSKRIRSSQ----LAITSVGKAFG-------------------------KEDVDD 2014 + S S+ L + S G+ +E+ D Sbjct: 269 EQTPNSPHNSSSASGGFHLNVGSSGQRGSTCPSLLFPHPSPSGQPILDDSQRQKQEEADK 328 Query: 2013 VVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKAVS 1834 +A FF+ + + F+ KS Y+H M AIA G GY PS D+L + L K K + + Sbjct: 329 KIALFFFHNSIPFSSSKSIYYHGMVDAIADCGVGYRAPSYDRLRTTLLEKVKVEITDSYK 388 Query: 1833 PVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTEVL 1654 R+ W +GCTI DG S F I V+ PRG LFL+++D + E+L Sbjct: 389 TYRDEWRESGCTIMSDGWTDGR-SKFLIVFSVACPRGTLFLKSVDASAHVDDAHYLFELL 447 Query: 1653 TKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELDWM 1474 +++VG ++QVI + 
+G L+ +K+P +FWS C ++CI ++EDI++ +W+ Sbjct: 448 ESVVLEVGLEYIVQVITDSAANYVYAGRLLTAKYPSLFWSPCASYCIDRMLEDISKQEWV 507 Query: 1473 KPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQALQE 1294 + A+ I + I + +F + + I+ + L+ Sbjct: 508 STVIEEARSITKYIYGHSWVLNLMKRFTGGKELLRSRITRFVTHFLSLRSIVIHEDNLKH 567 Query: 1293 VVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVMGD 1114 + EW D ++ + + + FW A ++ L EP +++L ++ D MG Sbjct: 568 MFSHTEWLSSLYSKKSDAQAVRSLIYLDRFWKSAQEVVNLSEPLIKVLRIVDGDMPAMGY 627 Query: 1113 VYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYFGKGQA 943 +Y +A +++ K +D + E++ + RW + L SPLHA+ LNP F Sbjct: 628 IYEGIERAKVAIKAYYKGSEDKYMPIWEII-DRRWNLQLHSPLHAAAAFLNPAIFYNPSF 686 Query: 942 K-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWE 766 K D + G+ + + + + L ++ Y G+LG + AM R P WW Sbjct: 687 KIDSKIRNGFHEAMMKMVLNDKDKMELTKETPMYINAHGALGNDFAMMARTLNTPGDWWA 746 Query: 765 NFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLR 586 +G E P LQ AI+ILSQ S + +W + + N L E+ DLV+V NLR Sbjct: 747 GYGYEVPVLQRAAIRILSQPCSSYWCRWNWGTFENVHTKKRNRLEQEKFNDLVYVHCNLR 806 Query: 585 LHS 577 + Sbjct: 807 FQA 809 >ref|XP_004159512.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101222344 [Cucumis sativus] Length = 673 Score = 219 bits (559), Expect = 3e-54 Identities = 159/604 (26%), Positives = 274/604 (45%), Gaps = 39/604 (6%) Frame = -2 Query: 2277 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLTRKKKKIP-- 2104 +CN+C ++G R++ HL + C + +R+ HIQ T KK+K P Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQGILS-TPKKQKAPKK 79 Query: 2103 --------TSGKSSKRIRSSQLAITSVGKAFG------------------------KEDV 2020 T+G+ S + S G+ K++ Sbjct: 80 PKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTFPCLSPSAQPPIDDAQKQKKDET 139 Query: 2019 DDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKA 1840 D VA FF+ + + F+ KS Y+ +M AIA +G GY+ PS +KL + L K K + + Sbjct: 140 DKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSS 199 Query: 1839 VSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTE 
1660 R+ W TGCTI C S DG F + I V+ +G LFL+++DI + + ++ Sbjct: 200 YKKHRDEWKETGCTILCDSWSDGQTKSFLV-ISVTCSKGTLFLKSVDISGHEDDATYLSD 258 Query: 1659 VLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD 1480 +L I++VG NV+Q+I S +G L+ +K+ +FWS C ++C+ ++EDI++++ Sbjct: 259 LLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIE 318 Query: 1479 WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQAL 1300 W+ + AK I + I + I P +F + + I+ L+ L Sbjct: 319 WVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNL 378 Query: 1299 QEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVM 1120 + + EW D +I + + + FW AH + +CEP +R+L ++ D M Sbjct: 379 KHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAM 438 Query: 1119 GDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYFGKG 949 G ++ +A +++ +D + E + + RW + L + LH + LNP F Sbjct: 439 GYIFEGIERAKVEIKTYYNGFEDKYMPIWETI-DRRWNLQLHTTLHTAAAFLNPSXFYNP 497 Query: 948 QAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAW 772 K D + G++ + + + + + + +Y G+LG + A+ R P W Sbjct: 498 NFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW 557 Query: 771 WENFGSETPQLQTLAIKILSQISSVTTFQG-SWHDNGSTCQEAVNLLGAERAEDLVFVRN 595 W +G E P LQ A++ILSQ S G +W + + + E+ DLVFV+ Sbjct: 558 WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQC 617 Query: 594 NLRL 583 NL L Sbjct: 618 NLWL 621 >ref|XP_004147940.1| PREDICTED: uncharacterized protein LOC101222344 [Cucumis sativus] Length = 673 Score = 219 bits (559), Expect = 3e-54 Identities = 159/604 (26%), Positives = 274/604 (45%), Gaps = 39/604 (6%) Frame = -2 Query: 2277 KCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAIDRSLREAFHIQEEERLTRKKKKIP-- 2104 +CN+C ++G R++ HL + C + +R+ HIQ T KK+K P Sbjct: 23 RCNYCQREFSGGVYRMKFHLAQIKNKDIVPCTEVPTDVRD--HIQGILS-TPKKQKAPKK 79 Query: 2103 --------TSGKSSKRIRSSQLAITSVGKAFG------------------------KEDV 2020 T+G+ S + S G+ K++ Sbjct: 80 PKVDMETATNGQQHSSSASGGIHHGSSGQNESNCPSTYPCLSPSAQPPIDDAQKQKKDET 139 Query: 2019 
DDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKLLDSFLTKEKARMDKA 1840 D VA FF+ + + F+ KS Y+ +M AIA +G GY+ PS +KL + L K K + + Sbjct: 140 DKKVAIFFFHNSIPFSAAKSLYYQEMVDAIAEYGGGYKAPSYEKLKSTLLDKVKGDIHSS 199 Query: 1839 VSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEKGDGEDDVFTE 1660 R+ W TGCTI C S DG F + I V+ +G LFL+++DI + + ++ Sbjct: 200 YKKHRDEWKETGCTILCDSWSDGQTKSFLV-ISVTCSKGTLFLKSVDISGHEDDATYLSD 258 Query: 1659 VLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQLLMEDITELD 1480 +L I++VG NV+Q+I S +G L+ +K+ +FWS C ++C+ ++EDI++++ Sbjct: 259 LLETIILEVGVENVVQIITDATASYVYAGRLLMTKYTSLFWSPCVSYCVNQMLEDISKIE 318 Query: 1479 WMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFAPTYNMVWRIIKLKQAL 1300 W+ + AK I + I + I P +F + + I+ L+ L Sbjct: 319 WVSAVLEEAKIITRYIYSHASILNTMRKFTGGKELIRPRITRFVTNFLSLRSIVILEDNL 378 Query: 1299 QEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCEPFVRLLGSLNVDRSVM 1120 + + EW D +I + + + FW AH + +CEP +R+L ++ D M Sbjct: 379 KHMFAHSEWLSSIYSRRPDAQAIISLLYLDRFWKDAHEAINICEPLIRILRIVDGDMPAM 438 Query: 1119 GDVYNWRVQALEVVRS--KRIDDMVLKQLEVVLENRWEM-LFSPLHASGYILNPRYFGKG 949 G ++ +A +++ +D + E + + RW + L + LH + LNP F Sbjct: 439 GYIFEGIERAKVEIKTYYNGFEDKYMPIWETI-DRRWNLQLHTTLHTAAAFLNPSVFYNP 497 Query: 948 QAK-DKTVMRGWKATLDRYESDGMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAW 772 K D + G++ + + + + + + +Y G+LG + A+ R P W Sbjct: 498 NFKIDLRIRNGFQEAMLKMATTDKDKMEITREHPAYVNGQGALGTDFAILGRTINAPGDW 557 Query: 771 WENFGSETPQLQTLAIKILSQISSVTTFQG-SWHDNGSTCQEAVNLLGAERAEDLVFVRN 595 W +G E P LQ A++ILSQ S G +W + + + E+ DLVFV+ Sbjct: 558 WSGYGYEIPTLQRAAVRILSQPCSSYGCSGWNWSTFETLHSKKHSRAEQEKLTDLVFVQC 617 Query: 594 NLRL 583 NL L Sbjct: 618 NLWL 621 >ref|XP_007161271.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris] gi|561034735|gb|ESW33265.1| hypothetical protein PHAVU_001G056200g, partial [Phaseolus vulgaris] Length = 702 Score = 219 bits (557), Expect = 6e-54 Identities = 167/629 (26%), Positives = 274/629 (43%), Gaps = 32/629 (5%) Frame = -2 Query: 2358 
PSESDKWGWKHVSVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPA 2179 P GWKH G + K+ KC++C+ +G R + HL G T + C + Sbjct: 17 PGNRTDVGWKH-----GIDINGNGKKVKCSYCSKTMSGGIFRFKHHLAG-TREDSEPCCS 70 Query: 2178 IDRSLREAFH--IQEEERLTRKKKKIP-------------------TSGKSSKRIRSS-Q 2065 + +R+ + E ++ + KK+K+ + GK R + Q Sbjct: 71 VPEEIRDLMIKIVAEAKQASLKKRKLNIIDEDQGCEGLEERQHIFGSKGKEKVGSRGAVQ 130 Query: 2064 LAITSVGKAFGKEDVDDVVARFFYADGLNFNIIKSPYFHDMAKAIASFGPGYEPPSVDKL 1885 I + K KE+VD VA FFY + FN+IK+P F M + I +G GY+PPS + Sbjct: 131 ATINQMMKKGYKEEVDAQVAEFFYTSAIPFNVIKNPAFTKMCEMIGKYGAGYKPPSYHDI 190 Query: 1884 LDSFLTKEKARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQT 1705 + L + + D + +E W TGCTI D N V+SP+G +F+ + Sbjct: 191 REKLLKQAIDKTDLVLQEYKEEWKKTGCTIMSDGWTDKKRRSI-CNFLVNSPKGTVFMYS 249 Query: 1704 IDIEKGDGEDDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCT 1525 +D D ++L + VG NV+QV+ + K +G L+ K H++W+ C Sbjct: 250 LDTSDISKTADKVFKMLDDVVELVGEENVVQVVTDNAANFKAAGELLMQKREHLYWTPCA 309 Query: 1524 AHCIQLLMEDI-TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSIDPISAKFA 1348 AHCI L ED +L + + + I I I P +FA Sbjct: 310 AHCIDLSFEDFEKKLKVHELTIKKGRKITTYIYGRSMLISMLKKFTKERDLIRPGVTRFA 369 Query: 1347 PTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAHLMLQLCE 1168 Y + + +LK +L + SEEW+ K ++ +E +L N FW L++ Sbjct: 370 TAYLTLGCLHELKASLLTMFSSEEWKTSKFGTSQEGKKVENMILDNRFWKNISTCLKVAA 429 Query: 1167 PFVRLLGSLNVD-RSVMGDVYNWRVQALEVVRS-----KRIDDMVLKQLEVVLENRWE-M 1009 P + +L ++ D + MG +Y +A E +++ K+ + V K +++ RW+ Sbjct: 430 PLMVVLRLVDSDAKPAMGFIYEEMDRAKEKIKNNFNHIKKSYEEVWK----IIDARWDNQ 485 Query: 1008 LFSPLHASGYILNPR--YFGKGQAKDKTVMRGWKATLDRYESDGMARRVLREQLSSYWRL 835 L PLHA+ Y LNP+ Y + ++ D V G ++ R D RR++ QL Y Sbjct: 486 LHRPLHAAAYYLNPQFHYEPEFRSDDPEVKEGLYTSMRRLVKDAAERRIINVQLVEYHFG 545 Query: 834 DGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQISSVTTFQGSWHDNGSTC 655 G+ +DA + R + P WWE FG TP+L+ Sbjct: 546 RGAFAMDDAKESRKTILPGEWWEMFGYRTPELKRR------------------------- 580 Query: 654 QEAVNLLGAERAEDLVFVRNNLRLHSKKL 568 N L ++ DL++V NL+L +K++ Sbjct: 581 
----NHLHQKKMNDLLYVMYNLKLSNKQI 605 >ref|XP_006577689.1| PREDICTED: uncharacterized protein LOC102662659 [Glycine max] Length = 847 Score = 218 bits (555), Expect = 1e-53 Identities = 171/644 (26%), Positives = 286/644 (44%), Gaps = 57/644 (8%) Frame = -2 Query: 2334 WKHV----SVFGGFETRTGTKRWKCNHCNLRYNGSYSRVRAHLLGFTGVGVKSCPAID-R 2170 W +V SV GG GT KCN C+ +NGSY+RVRAHLL TG GV+ C + Sbjct: 166 WTYVTKIKSVAGG-----GTYEIKCNICDFTFNGSYTRVRAHLLKMTGKGVRVCQKVTVA 220 Query: 2169 SLREAFHIQEE-----ERLTRKKKKIP-----------TSGKSSKRIRSSQLAITSVGKA 2038 L + I E ER K +P T G K+ ++S SV A Sbjct: 221 KLIDLKKIDNEATLRVERSKTKSVSLPPVSTQHQMDTNTLGVDPKKRKTS-----SVENA 275 Query: 2037 FG---KEDVDDVVARFFYADGLNFNIIKSPYFHD-MAKAIASFGPGYEPPSVDKLLDSFL 1870 F +E +D +AR FY+ GL F++ ++P++ A A + GY+PP +KL + L Sbjct: 276 FNLQARETLDHEIARMFYSSGLPFHLARNPHYRKAFAYAANNQISGYQPPGYNKLRITLL 335 Query: 1869 TKEKARMDKAVSPVRESWPLTGCTIFCLSQLDGTLSCFNINIFVSSPRGLLFLQTIDIEK 1690 E+ ++ + P++ +W G +I G IN V + G +FL+ ID Sbjct: 336 QNERRHVENLLQPIKNAWSQKGVSIVS-DGWSGPQRRSLINFMVVTESGPMFLKAIDCSN 394 Query: 1689 GDGEDDVFTEVLTKAIMDVGPANVLQVILHPGRSSKLSGSLIHSKFPHIFWSTCTAHCIQ 1510 + D + + + IM+VG +NV+Q++ K +G +I ++FP I+W+ C H + Sbjct: 395 EIKDKDFIAKHMREVIMEVGHSNVVQIVTDNAAVCKAAGLIIEAEFPSIYWTPCVVHTLN 454 Query: 1509 LLMEDI-------------TELDWMKPFVSYAKGIEQCILAXXXXXXXXXXXXXXXXSID 1369 L +++I E W+ A ++ +++ + Sbjct: 455 LALKNICAAKNTEKNNVAYEECSWITQIADDAMFVKNFVMS-HSMRLSIFNSFNSLKLLS 513 Query: 1368 PISAKFAPTYNMVWRIIKLKQALQEVVGSEEWRQWKLMYPEDVPSIEAAVLGNDFWGRAH 1189 +FA T M+ R +LK+ LQE+V S++W +K ++ +L + +W + Sbjct: 514 IAPTRFASTIVMLKRFKQLKKGLQEMVISDQWSSYKEDDVAKAKFVKDTLLDDKWWDKVD 573 Query: 1188 LMLQLCEPFVRLLGSLNVDRSVMGDVYNWRVQALEVVRS---------KRIDDMVLKQLE 1036 +L P +L + + S + VY +E V++ + + + Sbjct: 574 YILSFTSPIYDVLRRTDTEASSLHLVYEMWDSMIEKVKNAIYQYERNEESEGSTFYEVVH 633 Query: 1035 VVLENRWEMLFSPLHASGYILNPRYFGK----------GQAKDKTVMRGWKATLDRYESD 886 +L +RW +PLH + LNPRY+ +D + R R+ D Sbjct: 634 SILIDRWTKSSTPLHCLAHSLNPRYYSHEWLSEDSNRVPPHQDMELTRERLKCFKRFFLD 693 Query: 885 
GMARRVLREQLSSYWRLDGSLGEEDAMDCRDKMDPVAWWENFGSETPQLQTLAIKILSQI 706 RR + + +++ + D+++ R +MDP AWW G P LQ +A+K+L+Q Sbjct: 694 VDVRRKVNIEFANFSDGREGFDDLDSLNDRGQMDPKAWWLVHGINAPILQKIALKLLAQP 753 Query: 705 SSVTTFQGSWHDNGSTCQEAVNLLGAERAEDLVFVRNNLRLHSK 574 S + + +W N + RAEDLVFV +NLRL S+ Sbjct: 754 CSSSCCERNWSTYSFIHSLKRNKMTPHRAEDLVFVHSNLRLLSR 797
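Each hit above carries the same statistics fields (Score, Expect, Identities, Positives, Gaps, Frame). The following is a minimal sketch of pulling those fields out of one record, assuming the legacy BLAST 2.2.x text layout seen in this report. The names HSP_STATS and parse_hsp_stats are hypothetical helpers written for illustration and are not part of BLAST; newer BLAST+ output may order or punctuate these fields differently.

```python
import re

# Assumption: fields appear in the order used by this report
# (Score, Expect, Identities, Positives, optional Gaps, Frame).
HSP_STATS = re.compile(
    r"Score\s*=\s*(?P<bits>[\d.]+)\s*bits\s*\((?P<raw>\d+)\),\s*"
    r"Expect\s*=\s*(?P<evalue>[\d.eE+-]+)\s*"
    r"Identities\s*=\s*(?P<ident>\d+)/(?P<length>\d+)\s*\(\d+%\),\s*"
    r"Positives\s*=\s*(?P<pos>\d+)/\d+\s*\(\d+%\)"
    r"(?:,\s*Gaps\s*=\s*(?P<gaps>\d+)/\d+\s*\(\d+%\))?"
    r"\s*Frame\s*=\s*(?P<frame>[+-]?\d)"
)


def parse_hsp_stats(text):
    """Return the first HSP statistics block found in text as a dict, or None."""
    m = HSP_STATS.search(text)
    if m is None:
        return None
    evalue = m.group("evalue")
    # Legacy BLAST can print very small E-values as "e-100" (no leading digit).
    if evalue.startswith(("e", "E")):
        evalue = "1" + evalue
    ident = int(m.group("ident"))
    length = int(m.group("length"))
    return {
        "bits": float(m.group("bits")),
        "raw_score": int(m.group("raw")),
        "evalue": float(evalue),
        "identities": ident,
        "positives": int(m.group("pos")),
        # The Gaps field is treated as optional; 0 is used when it is absent.
        "gaps": int(m.group("gaps")) if m.group("gaps") else 0,
        "align_len": length,
        "frame": int(m.group("frame")),
        # Computed from the counts rather than read from the printed percent.
        "pct_identity": 100.0 * ident / length,
    }


record = ("Score = 224 bits (571), Expect = 1e-55 Identities = 157/645 (24%), "
          "Positives = 286/645 (44%), Gaps = 57/645 (8%) Frame = -2")
stats = parse_hsp_stats(record)
```

The sketch recomputes percent identity from the raw counts because the percentages printed in this report appear to be truncated rather than rounded (for example, Positives = 272/605 is shown as 44%).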