BLASTX nr result
ID: Mentha25_contig00026704
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Mentha25_contig00026704 (947 letters) Database: ./nr 38,876,450 sequences; 13,856,398,315 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value gb|EYU46397.1| hypothetical protein MIMGU_mgv1a003803mg [Mimulus... 359 8e-97 ref|XP_004247134.1| PREDICTED: uncharacterized protein LOC101247... 335 2e-89 emb|CBI37192.3| unnamed protein product [Vitis vinifera] 329 9e-88 ref|XP_002272156.1| PREDICTED: uncharacterized protein LOC100259... 329 9e-88 ref|XP_006355887.1| PREDICTED: uncharacterized protein LOC102585... 326 9e-87 emb|CAN77840.1| hypothetical protein VITISV_015561 [Vitis vinifera] 324 3e-86 ref|XP_006443731.1| hypothetical protein CICLE_v10019502mg [Citr... 310 4e-82 ref|XP_002526543.1| conserved hypothetical protein [Ricinus comm... 310 7e-82 ref|XP_007199743.1| hypothetical protein PRUPE_ppa003532mg [Prun... 302 1e-79 ref|XP_002320610.1| hypothetical protein POPTR_0014s18710g [Popu... 296 8e-78 ref|XP_007050164.1| Vacuolar protein sorting-associated protein ... 296 1e-77 ref|XP_006403160.1| hypothetical protein EUTSA_v10003172mg [Eutr... 293 9e-77 gb|EXB46011.1| hypothetical protein L484_015871 [Morus notabilis] 291 3e-76 ref|XP_006281615.1| hypothetical protein CARUB_v10027740mg [Caps... 288 2e-75 gb|AAC16751.1| Contains similarity to pre-mRNA processing protei... 287 5e-75 ref|NP_171905.1| uncharacterized protein [Arabidopsis thaliana] ... 287 5e-75 ref|NP_199208.1| uncharacterized protein [Arabidopsis thaliana] ... 285 1e-74 ref|XP_002863623.1| hypothetical protein ARALYDRAFT_494605 [Arab... 285 1e-74 ref|XP_002889490.1| hypothetical protein ARALYDRAFT_333729 [Arab... 285 2e-74 ref|XP_006304926.1| hypothetical protein CARUB_v10012003mg [Caps... 278 2e-72 >gb|EYU46397.1| hypothetical protein MIMGU_mgv1a003803mg [Mimulus guttatus] Length = 563 Score = 359 bits (922), Expect = 8e-97 Identities = 176/267 (65%), Positives = 209/267 (78%), Gaps = 2/267 (0%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFIW 330 MLGC C WSRVTDLIQP + FSLP+P+P WPQGSGFATS I LGELEVCQIT FEF+W Sbjct: 1 MLGCNCFVWSRVTDLIQPGSDYFSLPDPIPHWPQGSGFATSRIGLGELEVCQITRFEFVW 60 Query: 331 GSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHGS 510 G N S D K+ +SFYKP IPDGF CLGHYCQS++ PLRGFVLVARE T + + S Sbjct: 61 GCNLSTDGKRGVSFYKPISIPDGFFCLGHYCQSNENPLRGFVLVAREMT------TQNSS 114 Query: 511 GYQSP-LVNPVDYTLVWSSYDG-DDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLN 684 Y P L PVDY+L+WSS +G DDE+F+GCG FWLPQ P GYK+LGFVVT+KS KP L+ Sbjct: 115 CYNMPSLSKPVDYSLMWSSNEGDDDESFDGCGYFWLPQSPEGYKSLGFVVTNKSDKPDLD 174 Query: 685 EVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCS 864 EV+CVR+DLTD C+AY L++ + S+ ++PLTI N RPL+RG+Y RGVS GTF+C Sbjct: 175 EVKCVRSDLTDTCEAYGLIMDIHSKFSKIPLTIRNTRPLHRGMYGRGVSVGTFFCG-YSF 233 Query: 865 SSGEELLNIACLKNLDPNLHAMPNMDQ 945 SSGEE L+IACLKNLD +LHAMPN+DQ Sbjct: 234 SSGEE-LSIACLKNLDSDLHAMPNIDQ 259 >ref|XP_004247134.1| PREDICTED: uncharacterized protein LOC101247062 [Solanum lycopersicum] Length = 567 Score = 335 bits (858), Expect = 2e-89 Identities = 162/267 (60%), Positives = 200/267 (74%), Gaps = 2/267 (0%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFIW 330 M GC+C WS+++DL +P+ F LPEP+PQWPQG+GFA+ I LGELEV +I+ F+F+W Sbjct: 1 MFGCKCFRWSKISDLSSREPDTFLLPEPIPQWPQGTGFASGTIKLGELEVHKISKFDFVW 60 Query: 331 GSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHGS 510 G N S D K+ +SFYKP+ +PDGF LGHYCQS+KKPLRGFVLVARE VA+ E H S Sbjct: 61 GCNLSQDWKQGVSFYKPRGVPDGFFSLGHYCQSNKKPLRGFVLVARE--VAKPETGDHCS 118 Query: 511 GYQ--SPLVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLN 684 G L NP+DYTLVWSS DG +ENF+G G FWLPQPP GYKALGF+VT+K KP L Sbjct: 119 GNPCLPALQNPLDYTLVWSSNDGTEENFDGSGYFWLPQPPEGYKALGFIVTTKPVKPELG 178 Query: 685 EVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCS 864 EV+CVRADLTD C+ YRL++ S VL L IW+ RP +RG++ +G+S GTF+CS+ S Sbjct: 179 EVKCVRADLTDECETYRLIL-KTSSVLSEVLAIWSIRPRHRGMHGKGISIGTFFCSSYWS 237 Query: 865 SSGEELLNIACLKNLDPNLHAMPNMDQ 945 + E LNIACLKN D NL AMPN+DQ Sbjct: 238 TGQE--LNIACLKNFDTNLQAMPNLDQ 262 >emb|CBI37192.3| unnamed protein product [Vitis vinifera] Length = 625 Score = 329 bits (844), Expect = 9e-88 Identities = 152/266 (57%), Positives = 202/266 (75%), Gaps = 1/266 (0%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFIW 330 M GCQC WSR+ +L+ PD + FSLP P+P WPQG GFA+ +I+LGELEV QI+ FEF+W Sbjct: 1 MFGCQCFQWSRIAELLPPDTKTFSLPAPIPTWPQGQGFASGVINLGELEVFQISRFEFVW 60 Query: 331 GSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHGS 510 GSN S D KK ++FYKP IP+GF LGHYCQS+ +PL+GFVLVARE + E + + Sbjct: 61 GSNLSQDKKKGVTFYKPVGIPNGFFSLGHYCQSNDQPLQGFVLVAREVACSNPEVAQICN 120 Query: 511 GYQS-PLVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLNE 687 +S PL P+DYTL+WS DG +EN++ CG FWLPQPP GY+A+GFVVT+K +P L+E Sbjct: 121 LDKSPPLQKPLDYTLLWSPDDGSEENYDSCGYFWLPQPPEGYEAMGFVVTNKPDRPELDE 180 Query: 688 VRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCSS 867 VRCVRADLTD+C+ + L+ +S++ +VP +W+ RP +RG+ +G+ TGTF+CS+ + Sbjct: 181 VRCVRADLTDSCETHHLIFKTISKLSKVPFRVWSLRPCHRGMLGKGIPTGTFFCSSYW-N 239 Query: 868 SGEELLNIACLKNLDPNLHAMPNMDQ 945 GEE LNI CLKNL+P+LHAMPN+DQ Sbjct: 240 HGEE-LNIVCLKNLNPSLHAMPNLDQ 264 >ref|XP_002272156.1| PREDICTED: uncharacterized protein LOC100259944 [Vitis vinifera] Length = 569 Score = 329 bits (844), Expect = 9e-88 Identities = 152/266 (57%), Positives = 202/266 (75%), Gaps = 1/266 (0%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFIW 330 M GCQC WSR+ +L+ PD + FSLP P+P WPQG GFA+ +I+LGELEV QI+ FEF+W Sbjct: 1 MFGCQCFQWSRIAELLPPDTKTFSLPAPIPTWPQGQGFASGVINLGELEVFQISRFEFVW 60 Query: 331 GSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHGS 510 GSN S D KK ++FYKP IP+GF LGHYCQS+ +PL+GFVLVARE + E + + Sbjct: 61 GSNLSQDKKKGVTFYKPVGIPNGFFSLGHYCQSNDQPLQGFVLVAREVACSNPEVAQICN 120 Query: 511 GYQS-PLVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLNE 687 +S PL P+DYTL+WS DG +EN++ CG FWLPQPP GY+A+GFVVT+K +P L+E Sbjct: 121 LDKSPPLQKPLDYTLLWSPDDGSEENYDSCGYFWLPQPPEGYEAMGFVVTNKPDRPELDE 180 Query: 688 VRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCSS 867 VRCVRADLTD+C+ + L+ +S++ +VP +W+ RP +RG+ +G+ TGTF+CS+ + Sbjct: 181 VRCVRADLTDSCETHHLIFKTISKLSKVPFRVWSLRPCHRGMLGKGIPTGTFFCSSYW-N 239 Query: 868 SGEELLNIACLKNLDPNLHAMPNMDQ 945 GEE LNI CLKNL+P+LHAMPN+DQ Sbjct: 240 HGEE-LNIVCLKNLNPSLHAMPNLDQ 264 >ref|XP_006355887.1| PREDICTED: uncharacterized protein LOC102585636 [Solanum tuberosum] Length = 570 Score = 326 bits (835), Expect = 9e-87 Identities = 155/268 (57%), Positives = 195/268 (72%), Gaps = 3/268 (1%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQG---SGFATSMISLGELEVCQITNFE 321 M GC+C WS+++DL +P+ F LPEP+PQWPQG +GFA+ I LGELEV +I+ F+ Sbjct: 1 MFGCKCFRWSKISDLSSREPDTFLLPEPIPQWPQGCSGTGFASGTIKLGELEVHKISKFD 60 Query: 322 FIWGSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSA 501 F+WG N S D K+ +SFYKP+ +PDGF LGHYCQS+KKPLRGFVLVARE + E Sbjct: 61 FVWGCNLSQDRKQGVSFYKPRGVPDGFFSLGHYCQSNKKPLRGFVLVAREVAKPEAEDHC 120 Query: 502 HGSGYQSPLVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGL 681 G+ L NP+DYTLVWSS DG +ENF+G G FWLPQPP GYKALGF+VT+ KP L Sbjct: 121 SGNPCLPALQNPLDYTLVWSSNDGTEENFDGSGYFWLPQPPEGYKALGFIVTTNPVKPEL 180 Query: 682 NEVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLC 861 EV+CVR DLTD C+ YRL++ S VL L IW+ RP +RG++ +G+S GTF+CS+ Sbjct: 181 GEVKCVRGDLTDECETYRLIL-KTSSVLSEVLAIWSIRPRHRGMHGKGISVGTFFCSSYW 239 Query: 862 SSSGEELLNIACLKNLDPNLHAMPNMDQ 945 S+ E LNIACLKN D +L AMPN+DQ Sbjct: 240 STGQE--LNIACLKNFDTSLQAMPNLDQ 265 >emb|CAN77840.1| hypothetical protein VITISV_015561 [Vitis vinifera] Length = 569 Score = 324 bits (831), Expect = 3e-86 Identities = 150/266 (56%), Positives = 201/266 (75%), Gaps = 1/266 (0%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFIW 330 M GCQC WSR+ +L+ PD + FSLP P+P WPQG GFA+ +I+LGELEV QI+ FEF+W Sbjct: 1 MFGCQCFQWSRIAELLPPDTKTFSLPAPIPTWPQGQGFASGVINLGELEVFQISRFEFVW 60 Query: 331 GSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHGS 510 GSN S D KK ++F+KP IP+GF LG YCQS+ +PL+GFVLVARE + E + + Sbjct: 61 GSNLSQDKKKGVTFFKPVGIPNGFFSLGDYCQSNDQPLQGFVLVAREVACSNPEVAQICN 120 Query: 511 GYQS-PLVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLNE 687 +S PL P+DYTL+WS DG +EN++ CG FWLPQPP GY+A+GFVVT+K +P L+E Sbjct: 121 LDKSPPLQKPLDYTLLWSPDDGSEENYDSCGYFWLPQPPEGYEAMGFVVTNKPDRPELDE 180 Query: 688 VRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCSS 867 VRCVRADLTD+C+ + L+ +S++ +VP +W+ RP +RG+ +G+ TGTF+CS+ + Sbjct: 181 VRCVRADLTDSCETHHLIFKTISKLSKVPFRVWSLRPCHRGMLGKGIPTGTFFCSSYW-N 239 Query: 868 SGEELLNIACLKNLDPNLHAMPNMDQ 945 GEE LNI CLKNL+P+LHAMPN+DQ Sbjct: 240 HGEE-LNIVCLKNLNPSLHAMPNLDQ 264 >ref|XP_006443731.1| hypothetical protein CICLE_v10019502mg [Citrus clementina] gi|568851520|ref|XP_006479438.1| PREDICTED: uncharacterized protein LOC102613175 [Citrus sinensis] gi|557545993|gb|ESR56971.1| hypothetical protein CICLE_v10019502mg [Citrus clementina] Length = 569 Score = 310 bits (795), Expect = 4e-82 Identities = 146/266 (54%), Positives = 185/266 (69%), Gaps = 1/266 (0%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFIW 330 M GC+C W+ V ++ +P FSLP PLP WPQG GFA+ I+LGE+EVC+I+ F FIW Sbjct: 1 MFGCKCFYWNEVNNMSPTEPGTFSLPAPLPTWPQGQGFASGRINLGEIEVCRISRFNFIW 60 Query: 331 GSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHGS 510 N KKS +FY+P IPDGF LGHYCQ D +PLRGFVLVAR+ ++ E + + Sbjct: 61 SCNLLQSKKKSATFYEPAGIPDGFYSLGHYCQFDSRPLRGFVLVARDLASSEAEGAHTSN 120 Query: 511 GYQSP-LVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLNE 687 ++SP L P+DYTLVW S +G N+EGC FWLPQPP GYK++GF+VT +KP L+E Sbjct: 121 LFKSPALQKPLDYTLVWCSDEGGQGNYEGCAFFWLPQPPDGYKSMGFLVTKTPNKPELDE 180 Query: 688 VRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCSS 867 VRCVR DLTD C+ + L+ +S+ P ++W+ RP NRG+ RGVS GTF+CS+ S Sbjct: 181 VRCVRDDLTDKCEVHHLIFDAISKFSSSPFSVWSTRPCNRGMLGRGVSVGTFFCSSNWIS 240 Query: 868 SGEELLNIACLKNLDPNLHAMPNMDQ 945 E LNIACLKNLDP LHAMPN DQ Sbjct: 241 GQE--LNIACLKNLDPKLHAMPNCDQ 264 >ref|XP_002526543.1| conserved hypothetical protein [Ricinus communis] gi|223534104|gb|EEF35821.1| conserved hypothetical protein [Ricinus communis] Length = 571 Score = 310 bits (793), Expect = 7e-82 Identities = 142/269 (52%), Positives = 189/269 (70%), Gaps = 1/269 (0%) Frame = +1 Query: 142 MDNMLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFE 321 M + GC+C W R+ +L+ +P+ +SLP LP WP G GFA+ ISLGE+EV +I+ E Sbjct: 1 MFEIFGCKCFHWKRIDNLLPSEPDTYSLPASLPDWPPGQGFASGRISLGEIEVIKISRLE 60 Query: 322 FIWGSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSA 501 FIW D KK +SFYKP +PDGF+ LGH CQ + +PLR F+LVARE + + E + Sbjct: 61 FIWTCKLPQDEKKGVSFYKPAAVPDGFNSLGHQCQINNQPLRSFLLVAREVAITKTEAAI 120 Query: 502 HGSGYQSP-LVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPG 678 S SP L P+DY LVWSSY +DE+++GCG FWLPQPP GYK LG++VT+ KP Sbjct: 121 FSSPVNSPALRKPIDYILVWSSYSFNDESYDGCGFFWLPQPPDGYKPLGYLVTNNPDKPD 180 Query: 679 LNEVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTL 858 L+EVRCVRADLTD CQAYR ++ + S+ P +W+ RP +RG+ +GVS GTF+C + Sbjct: 181 LDEVRCVRADLTDGCQAYRPILNVYSKFSSFPFEVWSTRPSHRGMIGKGVSVGTFFCGSY 240 Query: 859 CSSSGEELLNIACLKNLDPNLHAMPNMDQ 945 C +SGEE LNIACL+N +P LH+MPN++Q Sbjct: 241 C-TSGEE-LNIACLRNANPELHSMPNLEQ 267 >ref|XP_007199743.1| hypothetical protein PRUPE_ppa003532mg [Prunus persica] gi|462395143|gb|EMJ00942.1| hypothetical protein PRUPE_ppa003532mg [Prunus persica] Length = 567 Score = 302 bits (774), Expect = 1e-79 Identities = 142/266 (53%), Positives = 184/266 (69%), Gaps = 1/266 (0%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFIW 330 M GC+C W +++DL P+PE FSLP+P+PQWP G GFA+ +SLGE+EV +I FEFIW Sbjct: 1 MFGCKCFYWKKLSDLFPPEPEPFSLPDPIPQWPPGEGFASGKVSLGEIEVFKINRFEFIW 60 Query: 331 GSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHGS 510 + D KK ++FYKP IPDGF +GHYCQS+ KPL GFVLV RE + + Sbjct: 61 TCSLPEDKKKCVTFYKPAGIPDGFHSIGHYCQSNDKPLHGFVLVVREADMPETADVLER- 119 Query: 511 GYQSP-LVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLNE 687 +SP L P+DYTLVWS DG++E + CG FWLPQPP GYKA+GF+VT+K KPGL+E Sbjct: 120 -VKSPALSKPLDYTLVWSPDDGNEEIYGACGYFWLPQPPEGYKAMGFLVTNKPDKPGLDE 178 Query: 688 VRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCSS 867 VRCVRADLTD C+ Y L++ ++ L +P +W RP +RG+ +GVS GTF+CS Sbjct: 179 VRCVRADLTDRCETYTLILNAITTSLNLPFQVWTTRPHHRGMMGKGVSVGTFFCSNDLGI 238 Query: 868 SGEELLNIACLKNLDPNLHAMPNMDQ 945 + L+I CLKNL+P L MPN+DQ Sbjct: 239 VKD--LHIRCLKNLNPKLSGMPNLDQ 262 >ref|XP_002320610.1| hypothetical protein POPTR_0014s18710g [Populus trichocarpa] gi|222861383|gb|EEE98925.1| hypothetical protein POPTR_0014s18710g [Populus trichocarpa] Length = 568 Score = 296 bits (758), Expect = 8e-78 Identities = 147/272 (54%), Positives = 185/272 (68%) Frame = +1 Query: 130 LNWMMDNMLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQI 309 ++W + C CL W + L+ P+PE FSLP PLP W QG GFA+ I+LG++E +I Sbjct: 1 MSWCEERNFWCNCLYWRKTAILLPPEPETFSLPSPLPDWSQGRGFASGRINLGKIEALKI 60 Query: 310 TNFEFIWGSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQH 489 + FEFIW SN D KK +SFYKP +P+GF LGHYCQ + KPL GFVLV RE VA Sbjct: 61 SRFEFIWSSNLLQDKKKGVSFYKPVGVPNGFYSLGHYCQFNNKPLWGFVLVVRE--VACF 118 Query: 490 EHSAHGSGYQSPLVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSS 669 E A S L+ P+DYTLVWSS D +E + GCG FWLPQPP GYK LGF+VT+ Sbjct: 119 EPEAANS---PTLLKPLDYTLVWSSDDESEEKYGGCGFFWLPQPPEGYKPLGFLVTNNPD 175 Query: 670 KPGLNEVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYC 849 KP L+EVRCVRADLTD C+ YRL++ S+ L +P+ + + RP +RG+ +GVS GTF+C Sbjct: 176 KPDLDEVRCVRADLTDECEPYRLLLESYSKFLNLPVRVSSTRPSHRGVLGKGVSVGTFFC 235 Query: 850 STLCSSSGEELLNIACLKNLDPNLHAMPNMDQ 945 S EE LNIACLKNL+ LHAMPN++Q Sbjct: 236 GYWTS---EEELNIACLKNLN-QLHAMPNLEQ 263 >ref|XP_007050164.1| Vacuolar protein sorting-associated protein YPR157W [Theobroma cacao] gi|508702425|gb|EOX94321.1| Vacuolar protein sorting-associated protein YPR157W [Theobroma cacao] Length = 568 Score = 296 bits (757), Expect = 1e-77 Identities = 145/267 (54%), Positives = 184/267 (68%), Gaps = 2/267 (0%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFIW 330 M GC+C W+++ L+ +PE FSLP PLPQWPQG GFA+ I+LGELEV +I+ FEFIW Sbjct: 1 MFGCKCFYWNKMDQLLPCEPETFSLPAPLPQWPQGQGFASGKINLGELEVVKISRFEFIW 60 Query: 331 GSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHGS 510 SN D KK ++FY+P IPDGF LGHYCQS+ +PLRG+VLVAREK +AH S Sbjct: 61 SSNLLRDKKKGVTFYEPVGIPDGFYSLGHYCQSNDQPLRGYVLVAREKPF--KSEAAHFS 118 Query: 511 GYQS--PLVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLN 684 S L P+D +LVWSS +E+ EGCG FWLPQPP GYK++G++VT+ KP L+ Sbjct: 119 ACVSSPALREPLDCSLVWSSNGRSEESLEGCGFFWLPQPPEGYKSMGYLVTNTPKKPKLD 178 Query: 685 EVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCS 864 +VRCVRADLTD C+ Y++V + P +W+ RP +RG+ RGVS GTF C + + Sbjct: 179 KVRCVRADLTDRCENYQVVHNGHMRFSEFPFQVWSTRPSHRGMLGRGVSVGTFSCGSFWT 238 Query: 865 SSGEELLNIACLKNLDPNLHAMPNMDQ 945 E L+IACLKN DP LHAMPN DQ Sbjct: 239 PGQE--LSIACLKNSDPTLHAMPNCDQ 263 >ref|XP_006403160.1| hypothetical protein EUTSA_v10003172mg [Eutrema salsugineum] gi|557104273|gb|ESQ44613.1| hypothetical protein EUTSA_v10003172mg [Eutrema salsugineum] Length = 564 Score = 293 bits (749), Expect = 9e-77 Identities = 146/270 (54%), Positives = 178/270 (65%), Gaps = 5/270 (1%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQ-PDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFI 327 M+GC+CL W+ + + +PE FSLP LPQWP G GFA+ I+LGELEV +IT F+F+ Sbjct: 1 MIGCKCLYWNNLREFPPLKEPETFSLPASLPQWPSGQGFASRRINLGELEVAEITTFDFV 60 Query: 328 WGSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHG 507 W D KS+SFYKP IP+ F CLGHYC SD LRGFVLVARE Sbjct: 61 WRYISPRDNNKSVSFYKPNNIPENFHCLGHYCHSDSNLLRGFVLVAREVVAKS------- 113 Query: 508 SGYQSPLVNPVDYTLVWSSYDGDDE-NFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLN 684 L P+DYTLVWSS D +E GCG FWLPQPP GYK +GFVVT+ SKP + Sbjct: 114 ------LAKPLDYTLVWSSNDLSEEPQRRGCGYFWLPQPPRGYKPMGFVVTTSPSKPESD 167 Query: 685 EVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCS 864 +VRCVRADLTD C+AYR+++ +S L VPL IW RP +RG++ RGVS GTF C T Sbjct: 168 QVRCVRADLTDECEAYRVIITAISDSLSVPLFIWKTRPSDRGMWGRGVSVGTFNCGTRSP 227 Query: 865 SSGEELL---NIACLKNLDPNLHAMPNMDQ 945 E+++ NIACLKN D +LHAMPN+DQ Sbjct: 228 EEEEDIVSINNIACLKNNDSSLHAMPNLDQ 257 >gb|EXB46011.1| hypothetical protein L484_015871 [Morus notabilis] Length = 571 Score = 291 bits (744), Expect = 3e-76 Identities = 146/265 (55%), Positives = 179/265 (67%), Gaps = 3/265 (1%) Frame = +1 Query: 160 CQCLAWSRVTDLIQPDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFIWGSN 339 C CL W++ TD +PE FSLP PLP+WPQG GFA+ IS+GELEV ++T FEFIW N Sbjct: 6 CNCLCWNKHTDFSLSEPETFSLPAPLPKWPQGKGFASGRISIGELEVFKVTRFEFIWCYN 65 Query: 340 PSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQH-EHSAHGSGY 516 S D K SFYKP IPDGF LGH+CQ + +PLRGF+LVARE VA H S H S Sbjct: 66 LSQDKNKGFSFYKPAEIPDGFHSLGHFCQPNNQPLRGFLLVARE--VASHMPESCHASNT 123 Query: 517 QS--PLVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLNEV 690 L P+DY LVWS D +E GCG FWLPQ P GYK +GF+VT+K KP L+EV Sbjct: 124 AKLPVLCEPLDYVLVWSPDDWSEEKCGGCGYFWLPQAPEGYKPVGFLVTNKPVKPRLDEV 183 Query: 691 RCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCSSS 870 RCVR+DL D C AYRL++ S+ + +W+ RP +RG+ +GV GTF+CS+ S+ Sbjct: 184 RCVRSDLMDECDAYRLLLTCNSRYMNFSFRVWSTRPRHRGVTGKGVPVGTFFCSS-SWSA 242 Query: 871 GEELLNIACLKNLDPNLHAMPNMDQ 945 GE+L I CLKNLDP L AMPN+DQ Sbjct: 243 GEDLC-IGCLKNLDPTLPAMPNLDQ 266 >ref|XP_006281615.1| hypothetical protein CARUB_v10027740mg [Capsella rubella] gi|482550319|gb|EOA14513.1| hypothetical protein CARUB_v10027740mg [Capsella rubella] Length = 565 Score = 288 bits (737), Expect = 2e-75 Identities = 140/268 (52%), Positives = 185/268 (69%), Gaps = 3/268 (1%) Frame = +1 Query: 151 MLGC-QCLAWSRVTDLIQ-PDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEF 324 M GC +CL W+ + +L PE FSLP +P WP G GF + I+LG++EV +IT+FEF Sbjct: 1 MFGCSRCLYWNNLRELPPLKTPETFSLPSSIPHWPSGQGFGSRRINLGDIEVAEITSFEF 60 Query: 325 IWGSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAH 504 +W D KKS+SFYKP +P+GF CLGHYC SD LRGF+LVAR+ + A Sbjct: 61 VWRYCSRRDSKKSVSFYKPDKLPEGFHCLGHYCHSDSHLLRGFLLVARQVKSLEFREPA- 119 Query: 505 GSGYQSPLVNPVDYTLVWSSYD-GDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGL 681 LV P+DYTLVWSS D ++ E CG FWLPQPP GYK +GF+VT+ +P L Sbjct: 120 -------LVQPLDYTLVWSSNDLSEERQSESCGYFWLPQPPQGYKPIGFLVTTSPVRPEL 172 Query: 682 NEVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLC 861 ++VRCVRADLTD C+A+++++ +S L+VPL IW RPL+RG++ +GVSTGTF+C+T Sbjct: 173 DQVRCVRADLTDQCEAHKVIITGISDSLQVPLFIWKTRPLDRGMWGKGVSTGTFFCTTQ- 231 Query: 862 SSSGEELLNIACLKNLDPNLHAMPNMDQ 945 S + L IACLKNLD +LHAMPN++Q Sbjct: 232 SPEEDHLRTIACLKNLDSSLHAMPNIEQ 259 >gb|AAC16751.1| Contains similarity to pre-mRNA processing protein PRP39 gb|L29224 from S. cerevisiae. ESTs gb|R64908 and gb|T88158, gb|N38703 and gb|AA651043 come from this gene [Arabidopsis thaliana] Length = 1345 Score = 287 bits (734), Expect = 5e-75 Identities = 150/273 (54%), Positives = 188/273 (68%), Gaps = 8/273 (2%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQ-PDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFI 327 MLG +CL W+ + DL DPE FSLP +P WP G GF + I+LG+L+V +IT+FEFI Sbjct: 774 MLGYKCLHWNNLIDLPPLKDPETFSLPSSIPHWPPGQGFGSGTINLGKLQVIKITDFEFI 833 Query: 328 WGSNPSMDVKKSISFYKPK-LIPDGFSCLGHYCQSDKKPLRGFVLVARE--KTVAQHEHS 498 W S + KK+ISFYKPK L+P F CLGHYCQSD PLRG+VL AR+ ++ Q E Sbjct: 834 WRYR-STEKKKNISFYKPKGLLPKDFHCLGHYCQSDSHPLRGYVLAARDLVDSLEQVEKP 892 Query: 499 AHGSGYQSPLVNPVDYTLVWSSYDGDDENFEG---CGCFWLPQPPPGYKALGFVVTSKSS 669 A LV PVD+TLVWSS D + CG FWLPQPP GY+++GFVVT S Sbjct: 893 A--------LVEPVDFTLVWSSNDSAENECSSKSECGYFWLPQPPEGYRSIGFVVTKTSV 944 Query: 670 KPGLNEVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYC 849 KP LNEVRCVRADLTD C+ + ++V VS+ L VPL IW RP +RG++ +GVS GTF+C Sbjct: 945 KPELNEVRCVRADLTDICEPHNVIVTAVSESLGVPLFIWRTRPSDRGMWGKGVSAGTFFC 1004 Query: 850 STLCSSSGEEL-LNIACLKNLDPNLHAMPNMDQ 945 T ++ E+L + IACLKNLD +LHAMPN+DQ Sbjct: 1005 RTRLVAAREDLGIGIACLKNLDLSLHAMPNVDQ 1037 >ref|NP_171905.1| uncharacterized protein [Arabidopsis thaliana] gi|63003838|gb|AAY25448.1| At1g04090 [Arabidopsis thaliana] gi|133778830|gb|ABO38755.1| At1g04090 [Arabidopsis thaliana] gi|332189534|gb|AEE27655.1| uncharacterized protein AT1G04090 [Arabidopsis thaliana] Length = 572 Score = 287 bits (734), Expect = 5e-75 Identities = 150/273 (54%), Positives = 188/273 (68%), Gaps = 8/273 (2%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQ-PDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFI 327 MLG +CL W+ + DL DPE FSLP +P WP G GF + I+LG+L+V +IT+FEFI Sbjct: 1 MLGYKCLHWNNLIDLPPLKDPETFSLPSSIPHWPPGQGFGSGTINLGKLQVIKITDFEFI 60 Query: 328 WGSNPSMDVKKSISFYKPK-LIPDGFSCLGHYCQSDKKPLRGFVLVARE--KTVAQHEHS 498 W S + KK+ISFYKPK L+P F CLGHYCQSD PLRG+VL AR+ ++ Q E Sbjct: 61 WRYR-STEKKKNISFYKPKGLLPKDFHCLGHYCQSDSHPLRGYVLAARDLVDSLEQVEKP 119 Query: 499 AHGSGYQSPLVNPVDYTLVWSSYDGDDENFEG---CGCFWLPQPPPGYKALGFVVTSKSS 669 A LV PVD+TLVWSS D + CG FWLPQPP GY+++GFVVT S Sbjct: 120 A--------LVEPVDFTLVWSSNDSAENECSSKSECGYFWLPQPPEGYRSIGFVVTKTSV 171 Query: 670 KPGLNEVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYC 849 KP LNEVRCVRADLTD C+ + ++V VS+ L VPL IW RP +RG++ +GVS GTF+C Sbjct: 172 KPELNEVRCVRADLTDICEPHNVIVTAVSESLGVPLFIWRTRPSDRGMWGKGVSAGTFFC 231 Query: 850 STLCSSSGEEL-LNIACLKNLDPNLHAMPNMDQ 945 T ++ E+L + IACLKNLD +LHAMPN+DQ Sbjct: 232 RTRLVAAREDLGIGIACLKNLDLSLHAMPNVDQ 264 >ref|NP_199208.1| uncharacterized protein [Arabidopsis thaliana] gi|9758554|dbj|BAB09055.1| unnamed protein product [Arabidopsis thaliana] gi|16648997|gb|AAL24350.1| Unknown protein [Arabidopsis thaliana] gi|20259924|gb|AAM13309.1| unknown protein [Arabidopsis thaliana] gi|332007653|gb|AED95036.1| uncharacterized protein AT5G43950 [Arabidopsis thaliana] Length = 566 Score = 285 bits (730), Expect = 1e-74 Identities = 138/267 (51%), Positives = 182/267 (68%), Gaps = 2/267 (0%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQ-PDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFI 327 M GC+CL W+ + + +PE FSLP LPQWP G GF I+LGELEV +IT+FEF+ Sbjct: 1 MFGCKCLYWNNLKEYPPLKEPETFSLPASLPQWPSGQGFGLGRINLGELEVAEITSFEFV 60 Query: 328 WGSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHG 507 W D KKS+SFYKP +P+ F CLGHYCQSD LRGF+LVAR+ + Sbjct: 61 WRYCSRRDNKKSVSFYKPDKLPEDFHCLGHYCQSDSHLLRGFLLVARQVNKSS------- 113 Query: 508 SGYQSPLVNPVDYTLVWSSYD-GDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLN 684 + LV P+DYTLVWSS D ++ E G FWLPQPP GYK +G++VT+ +KP L+ Sbjct: 114 ---EPALVQPLDYTLVWSSNDLSEERQSESYGYFWLPQPPQGYKPIGYLVTTSPAKPELD 170 Query: 685 EVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCS 864 +VRCVRADLTD C+A+++++ +S L +P+ IW RP +RG+ +GVSTGTF+C+T S Sbjct: 171 QVRCVRADLTDKCEAHKVIITAISDSLSIPMFIWKTRPSDRGMRGKGVSTGTFFCTTQ-S 229 Query: 865 SSGEELLNIACLKNLDPNLHAMPNMDQ 945 + L IACLKNLD +LHAMPN++Q Sbjct: 230 PEEDHLSTIACLKNLDSSLHAMPNIEQ 256 >ref|XP_002863623.1| hypothetical protein ARALYDRAFT_494605 [Arabidopsis lyrata subsp. lyrata] gi|297309458|gb|EFH39882.1| hypothetical protein ARALYDRAFT_494605 [Arabidopsis lyrata subsp. lyrata] Length = 562 Score = 285 bits (730), Expect = 1e-74 Identities = 138/266 (51%), Positives = 184/266 (69%), Gaps = 1/266 (0%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQ-PDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFI 327 M GC+CL W+ + +L +PE FSLP +PQWP G GF + I+LG+LE+ ++T+FEF+ Sbjct: 1 MFGCKCLYWNNLRELPPLKEPETFSLPASIPQWPSGQGFGSGRINLGDLELAEVTSFEFV 60 Query: 328 WGSNPSMDVKKSISFYKPKLIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAHG 507 W D KKS+SFYKP + + F CLGHYCQSD LRGF+LVAR+ + Sbjct: 61 WRYCSRRDNKKSVSFYKPDKLLEDFHCLGHYCQSDSHLLRGFLLVARQVNKSS------- 113 Query: 508 SGYQSPLVNPVDYTLVWSSYDGDDENFEGCGCFWLPQPPPGYKALGFVVTSKSSKPGLNE 687 + LV P+DYTLVWSS D +E+ G FWLPQPP GYK +GF+VT+ SKP L++ Sbjct: 114 ---EPALVQPLDYTLVWSSNDLSEESQSGY--FWLPQPPQGYKTIGFLVTTSPSKPELDQ 168 Query: 688 VRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTLCSS 867 VRCVRADLTD C+A+++++ +S L +PL IW RP +RG++ +GVSTGTF+C+T S Sbjct: 169 VRCVRADLTDKCEAHKVIITAISNSLSIPLFIWQTRPSDRGMWGKGVSTGTFFCTTQ-SP 227 Query: 868 SGEELLNIACLKNLDPNLHAMPNMDQ 945 + L IACLKNLD +LHAMPNM+Q Sbjct: 228 EEDHLSTIACLKNLDSSLHAMPNMEQ 253 >ref|XP_002889490.1| hypothetical protein ARALYDRAFT_333729 [Arabidopsis lyrata subsp. lyrata] gi|297335332|gb|EFH65749.1| hypothetical protein ARALYDRAFT_333729 [Arabidopsis lyrata subsp. lyrata] Length = 1328 Score = 285 bits (729), Expect = 2e-74 Identities = 149/274 (54%), Positives = 186/274 (67%), Gaps = 8/274 (2%) Frame = +1 Query: 148 NMLGCQCLAWSRVTDLIQ-PDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEF 324 NMLG +CL W+ + DL DPE FSLP +P WP G GF + I+LG+L+V +IT+FEF Sbjct: 756 NMLGYKCLHWNNLIDLPPLKDPETFSLPASIPHWPPGQGFGSGTINLGKLQVIKITDFEF 815 Query: 325 IWGSNPSMDVKKSISFYKPK-LIPDGFSCLGHYCQSDKKPLRGFVLVARE--KTVAQHEH 495 IW S + KSISFYKPK L P F CLGHYCQSD PLRG++L AR+ ++ Q E Sbjct: 816 IWRYR-STEKNKSISFYKPKGLFPKDFHCLGHYCQSDSHPLRGYLLAARDLVDSLEQEEK 874 Query: 496 SAHGSGYQSPLVNPVDYTLVWSSYDGDDENFEGC---GCFWLPQPPPGYKALGFVVTSKS 666 A LV PVD+TLVWSS D ++ G FWLPQPP GY+++GFVVT S Sbjct: 875 PA--------LVEPVDFTLVWSSNDSVEDECSSKSERGYFWLPQPPEGYRSIGFVVTKSS 926 Query: 667 SKPGLNEVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFY 846 KP LNEVRCVRADLTD C+ + ++V VS+ L VPL IW RP +RG++ RGVS GTF+ Sbjct: 927 VKPELNEVRCVRADLTDKCETHNVIVTAVSESLGVPLFIWRTRPSDRGMWGRGVSAGTFF 986 Query: 847 CSTLCSSSGEEL-LNIACLKNLDPNLHAMPNMDQ 945 C T + E++ + IACLKNLD N+HAMPN+DQ Sbjct: 987 CRTRLVPAREDIGIGIACLKNLDTNVHAMPNVDQ 1020 >ref|XP_006304926.1| hypothetical protein CARUB_v10012003mg [Capsella rubella] gi|482573637|gb|EOA37824.1| hypothetical protein CARUB_v10012003mg [Capsella rubella] Length = 571 Score = 278 bits (712), Expect = 2e-72 Identities = 145/270 (53%), Positives = 181/270 (67%), Gaps = 5/270 (1%) Frame = +1 Query: 151 MLGCQCLAWSRVTDLIQ-PDPELFSLPEPLPQWPQGSGFATSMISLGELEVCQITNFEFI 327 MLG +CL W+ VTDL DP FSLP +PQWP G GF + I+LG+LEV +ITNFEF+ Sbjct: 1 MLGYKCLHWNNVTDLPPLKDPGTFSLPSSIPQWPPGQGFGSGTINLGKLEVIKITNFEFV 60 Query: 328 WGSNPSMDVKKSISFYKPK-LIPDGFSCLGHYCQSDKKPLRGFVLVAREKTVAQHEHSAH 504 W S++ KS+SFYKP+ +P F CLGHYCQSD PLRG+VL AR+ + E Sbjct: 61 WRYR-SVEKNKSMSFYKPEGSLPKNFHCLGHYCQSDSHPLRGYVLAARDLVDSLEEEK-- 117 Query: 505 GSGYQSPLVNPVDYTLVWSSYDGDDENFEG--CGCFWLPQPPPGYKALGFVVTSKSSKPG 678 + LV PVD+ LVWS D +E + CG FW PQPP GY+A+GFVVT S+KP Sbjct: 118 ----KPALVEPVDFVLVWSLNDSAEERNDTNECGYFWFPQPPEGYRAIGFVVTKNSAKPE 173 Query: 679 LNEVRCVRADLTDACQAYRLVVGMVSQVLRVPLTIWNARPLNRGIYDRGVSTGTFYCSTL 858 LN VRCVRADLTD C+ ++++V VS+ L VPL IW RP +RG++ +GVS GTF C T Sbjct: 174 LNVVRCVRADLTDNCEIHKVIVTAVSESLCVPLFIWRTRPSDRGMWGKGVSAGTFVCRTR 233 Query: 859 CSSSGEEL-LNIACLKNLDPNLHAMPNMDQ 945 + E L L IACLKNL LHAMPN++Q Sbjct: 234 LLVARENLKLGIACLKNLASGLHAMPNVEQ 263