BLASTX nr result
ID: Paeonia23_contig00012996
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Paeonia23_contig00012996 (1916 letters)

Database: ./nr
38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

ref|XP_006478864.1| PREDICTED: uncharacterized protein At3g49140...   551   e-154
ref|XP_007034031.1| Pentatricopeptide repeat (PPR) superfamily p...   544   e-152
ref|XP_006478863.1| PREDICTED: uncharacterized protein At3g49140...   543   e-151
ref|XP_007223038.1| hypothetical protein PRUPE_ppa005374mg [Prun...   536   e-149
ref|XP_002302882.1| hypothetical protein POPTR_0002s23140g [Popu...   536   e-149
ref|XP_007034030.1| Pentatricopeptide repeat superfamily protein...   533   e-148
gb|EXB68700.1| hypothetical protein L484_024718 [Morus notabilis]     524   e-146
ref|XP_004296717.1| PREDICTED: uncharacterized protein At3g49140...   508   e-141
emb|CBI39163.3| unnamed protein product [Vitis vinifera]              506   e-140
ref|XP_002530542.1| conserved hypothetical protein [Ricinus comm...   504   e-140
ref|XP_004152092.1| PREDICTED: uncharacterized protein At3g49140...   486   e-134
ref|XP_004172166.1| PREDICTED: uncharacterized protein At3g49140...   477   e-132
ref|XP_002268699.1| PREDICTED: uncharacterized protein LOC100253...   469   e-129
ref|XP_004241392.1| PREDICTED: uncharacterized protein LOC101247...   468   e-129
ref|XP_002320457.2| hypothetical protein POPTR_0014s14980g [Popu...   458   e-126
ref|XP_003617306.1| hypothetical protein MTR_5g090150 [Medicago ...   451   e-124
ref|NP_567080.1| pentatricopeptide repeat-containing protein-lik...   447   e-123
ref|XP_006291096.1| hypothetical protein CARUB_v10017210mg [Caps...   445   e-122
ref|XP_006402701.1| hypothetical protein EUTSA_v10005960mg [Eutr...
442 e-121 ref|XP_002876496.1| hypothetical protein ARALYDRAFT_486397 [Arab... 442 e-121 >ref|XP_006478864.1| PREDICTED: uncharacterized protein At3g49140-like isoform X2 [Citrus sinensis] Length = 458 Score = 551 bits (1419), Expect = e-154 Identities = 294/453 (64%), Positives = 328/453 (72%), Gaps = 6/453 (1%) Frame = +2 Query: 212 AAASSMSLDGISCSTTYGATCAWAK-PPLDVRRVSDLVGMRCKHPFFGATQFHWLPTGHD 388 AAASS SL+GISCST+ G T W K P D R D +R +PFFG+T+F+WL TG D Sbjct: 8 AAASSFSLEGISCSTSNGITSTWIKNPSFDAHRAPDFPAIR--NPFFGSTKFNWLSTGRD 65 Query: 389 -CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTSPEIARTTVEANSS 565 CL NRGYHPLEELK KR+RD KLTS EIARTT EAN+S Sbjct: 66 LCLSKVSVAADYSDSVPDSSNYVNNRGYHPLEELKFHKRVRDTKLTSAEIARTTAEANNS 125 Query: 566 ALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRGASNPVNALIGMDV 745 +LLVFPGTVHCEPHE I+WAEF+YVIDD GDIFFEIFDD NIL D GA+N V A IGMD+ Sbjct: 126 SLLVFPGTVHCEPHEQISWAEFQYVIDDYGDIFFEIFDDENILHDPGANNLVTAFIGMDI 185 Query: 746 SIHENRRIASE--YNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGMPDTSSLVHPIYF 919 ++N+R+A+ YN Y E VD EVSD S DWGMPDTSS VHPIYF Sbjct: 186 PKYDNQRVAAGAGYNYSDFVTSDDIPLDEDYFEVVDSEVSDGSVDWGMPDTSSWVHPIYF 245 Query: 920 AKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDSDGYSSDLKDGE 1099 +KCLTKA+NME +KMDHPSN +SIVG LRP F DEESYLR FH EDSDG +SD +DGE Sbjct: 246 SKCLTKAVNMEYDRKMDHPSNGLSIVGYLRPAFADEESYLRRQFHSEDSDGDNSDWQDGE 305 Query: 1100 I--LXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDFLVHSTSAILER 1273 LYRLEIMRI+LFSVYGI+ VSLQDFQDAEPD LVHSTSAI+E Sbjct: 306 TPNFSSKNGRSNTGSTLYRLEIMRIELFSVYGIRSPVSLQDFQDAEPDILVHSTSAIIEH 365 Query: 1274 YSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTHRFPFKARATSE 1453 +S +G+RCN ALK LCKKKGL VE ANLIGVDSLGMDVRV SG EV+THRFPFK RATSE Sbjct: 366 FSLKGIRCNGALKALCKKKGLNVEEANLIGVDSLGMDVRVFSGVEVRTHRFPFKIRATSE 425 Query: 1454 VAAEKQIQQLLFPRSRRKKLNSRSDELKDPDYF 1552 VAAEKQIQQLLFPRSRRKKL S+ D LK+ D F Sbjct: 426 VAAEKQIQQLLFPRSRRKKLRSQRDVLKELDCF 458 >ref|XP_007034031.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Theobroma cacao] 
gi|508713060|gb|EOY04957.1| Pentatricopeptide repeat (PPR) superfamily protein isoform 2 [Theobroma cacao] Length = 467 Score = 544 bits (1401), Expect = e-152 Identities = 285/467 (61%), Positives = 330/467 (70%), Gaps = 17/467 (3%) Frame = +2 Query: 203 MPIAAASSMSL----------DGISCSTTYGATCAWAKPPLDVRRVSDLVGM--RCKHPF 346 M IAAASS S+ +G+ S +G +W K D R SDL G+ RC+ PF Sbjct: 1 MAIAAASSFSVGPSQCHLCQVEGVYYSPLHGVNSSWVKTTFDGCRTSDLSGVSFRCRSPF 60 Query: 347 FGATQFHWLPTGHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLT 523 FG+TQFHW GHD CL N+GYHPLEELKV KR+R+ KL+ Sbjct: 61 FGSTQFHWWSAGHDHCLSKVSVAADYSDSVPDSSSYARNQGYHPLEELKVLKRMRETKLS 120 Query: 524 SPEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDR 703 + E+ARTTVEANS+ALLVFPGTVH EPHE I+WAEF YVIDD GDIFFEIFDD NILQDR Sbjct: 121 AAEVARTTVEANSTALLVFPGTVHSEPHEQISWAEFHYVIDDYGDIFFEIFDDENILQDR 180 Query: 704 GASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGM 883 GASN VNALIGMD+ +HEN R+A EYN Y E +D E+S+ DWGM Sbjct: 181 GASNLVNALIGMDIPMHENNRVAGEYNISDIGNDDEIPFDDDYFEVMDSEMSEAPVDWGM 240 Query: 884 PDTSSL--VHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHP 1057 PDT++ VHPIYFAKCLTKA++ME +KMDHPSN VSIVG LRP F DEESYLR LFH Sbjct: 241 PDTATATWVHPIYFAKCLTKAVHMEHDRKMDHPSNGVSIVGCLRPAFYDEESYLRRLFHF 300 Query: 1058 EDSDGYSSDLKDGEILXXXXXXXXXXX--ILYRLEIMRIQLFSVYGIQCAVSLQDFQDAE 1231 ED+DGY+SD KDGE LYR+EIMR++LFS+YG+Q +SLQDFQDAE Sbjct: 301 EDNDGYTSDWKDGETSRSSSKYGGSKSDSTLYRMEIMRMELFSIYGVQSLISLQDFQDAE 360 Query: 1232 PDFLVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEV 1411 PD LVHSTSAILER+S+ G+RCNVALK LCKKKGLQ+E ANLIGVDSLG+DVR+ SG EV Sbjct: 361 PDVLVHSTSAILERFSQNGIRCNVALKALCKKKGLQIEGANLIGVDSLGIDVRIFSGVEV 420 Query: 1412 QTHRFPFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELKDPDYF 1552 +THRFPFK RA SE AAEKQI +LLFPRS RKK + D +DP F Sbjct: 421 RTHRFPFKVRAMSETAAEKQILKLLFPRSHRKKFRTDGDGFRDPASF 467 >ref|XP_006478863.1| PREDICTED: uncharacterized protein At3g49140-like isoform X1 [Citrus sinensis] Length = 468 Score = 543 bits (1398), Expect = e-151 Identities = 
294/463 (63%), Positives = 328/463 (70%), Gaps = 16/463 (3%) Frame = +2 Query: 212 AAASSMSL----------DGISCSTTYGATCAWAK-PPLDVRRVSDLVGMRCKHPFFGAT 358 AAASS SL +GISCST+ G T W K P D R D +R +PFFG+T Sbjct: 8 AAASSFSLGHSECHLCRAEGISCSTSNGITSTWIKNPSFDAHRAPDFPAIR--NPFFGST 65 Query: 359 QFHWLPTGHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTSPEI 535 +F+WL TG D CL NRGYHPLEELK KR+RD KLTS EI Sbjct: 66 KFNWLSTGRDLCLSKVSVAADYSDSVPDSSNYVNNRGYHPLEELKFHKRVRDTKLTSAEI 125 Query: 536 ARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRGASN 715 ARTT EAN+S+LLVFPGTVHCEPHE I+WAEF+YVIDD GDIFFEIFDD NIL D GA+N Sbjct: 126 ARTTAEANNSSLLVFPGTVHCEPHEQISWAEFQYVIDDYGDIFFEIFDDENILHDPGANN 185 Query: 716 PVNALIGMDVSIHENRRIASE--YNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGMPD 889 V A IGMD+ ++N+R+A+ YN Y E VD EVSD S DWGMPD Sbjct: 186 LVTAFIGMDIPKYDNQRVAAGAGYNYSDFVTSDDIPLDEDYFEVVDSEVSDGSVDWGMPD 245 Query: 890 TSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDSD 1069 TSS VHPIYF+KCLTKA+NME +KMDHPSN +SIVG LRP F DEESYLR FH EDSD Sbjct: 246 TSSWVHPIYFSKCLTKAVNMEYDRKMDHPSNGLSIVGYLRPAFADEESYLRRQFHSEDSD 305 Query: 1070 GYSSDLKDGEI--LXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDFL 1243 G +SD +DGE LYRLEIMRI+LFSVYGI+ VSLQDFQDAEPD L Sbjct: 306 GDNSDWQDGETPNFSSKNGRSNTGSTLYRLEIMRIELFSVYGIRSPVSLQDFQDAEPDIL 365 Query: 1244 VHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTHR 1423 VHSTSAI+E +S +G+RCN ALK LCKKKGL VE ANLIGVDSLGMDVRV SG EV+THR Sbjct: 366 VHSTSAIIEHFSLKGIRCNGALKALCKKKGLNVEEANLIGVDSLGMDVRVFSGVEVRTHR 425 Query: 1424 FPFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELKDPDYF 1552 FPFK RATSEVAAEKQIQQLLFPRSRRKKL S+ D LK+ D F Sbjct: 426 FPFKIRATSEVAAEKQIQQLLFPRSRRKKLRSQRDVLKELDCF 468 >ref|XP_007223038.1| hypothetical protein PRUPE_ppa005374mg [Prunus persica] gi|462419974|gb|EMJ24237.1| hypothetical protein PRUPE_ppa005374mg [Prunus persica] Length = 464 Score = 536 bits (1382), Expect = e-149 Identities = 278/465 (59%), Positives = 332/465 (71%), Gaps = 15/465 (3%) Frame = +2 Query: 203 
MPIAAASSMSL----------DGISCSTTYGATCAWAKPPLDVRRVSDLVGM--RCKHPF 346 M IAAA+S+ + + CSTT+G + +W KPP D RR DL G+ C++P Sbjct: 1 MAIAAATSLPFGSSHCHSCHAERVCCSTTHGISNSWMKPPSDGRRALDLPGVSFNCRNPL 60 Query: 347 FGATQFHWLPTGHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLT 523 G+TQFHWL GHD CL N+GYHPLEE+KV K +RD KLT Sbjct: 61 LGSTQFHWLSIGHDLCLSKVLVAADYSDSVPDSSSYITNQGYHPLEEVKVCKMVRDTKLT 120 Query: 524 SPEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDR 703 S EIARTTVEAN SALLVFPG +HCEPHE I+WA+FEYVIDD GD++FEIFDD N+L+D Sbjct: 121 SAEIARTTVEANCSALLVFPGKIHCEPHEQISWADFEYVIDDYGDLYFEIFDDANLLEDP 180 Query: 704 GASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGM 883 ASNPVNAL GMD+ +++ RIA E+N Y+E V+ EVSDV D WG+ Sbjct: 181 AASNPVNALFGMDIPTYDDGRIAGEFNILGGGNSDEIPFDDDYLEVVESEVSDVLD-WGL 239 Query: 884 PDTSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPED 1063 PDTSS +HPIYFAKCLTK IN+E KKMDHPSN VSI+G LRP F DEE Y+R LFH ED Sbjct: 240 PDTSSSIHPIYFAKCLTKVINIEYHKKMDHPSNGVSILGCLRPAFADEEFYVRRLFHYED 299 Query: 1064 SDGYSSDLKDGEILXXXXXXXXXXXI--LYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPD 1237 SDGY+SD KDG+ L LYRLEIMRI+LFSVYG+Q +SL+DFQDAEPD Sbjct: 300 SDGYNSDWKDGKSLSLSSKSDRIKTCSTLYRLEIMRIELFSVYGVQSTISLEDFQDAEPD 359 Query: 1238 FLVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQT 1417 LV++T I++R++E G+RC+VALK LCK+KGL VE A+LIGVDSLGMDVRV SG EVQT Sbjct: 360 VLVNATLEIVDRFNERGIRCDVALKALCKRKGLHVEGAHLIGVDSLGMDVRVFSGLEVQT 419 Query: 1418 HRFPFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELKDPDYF 1552 HRFPFK RATSEVAAEKQIQQLLFPRSRRKKL S+ + + + + Sbjct: 420 HRFPFKVRATSEVAAEKQIQQLLFPRSRRKKLKSQGNNFRGVELY 464 >ref|XP_002302882.1| hypothetical protein POPTR_0002s23140g [Populus trichocarpa] gi|222844608|gb|EEE82155.1| hypothetical protein POPTR_0002s23140g [Populus trichocarpa] Length = 469 Score = 536 bits (1381), Expect = e-149 Identities = 279/460 (60%), Positives = 325/460 (70%), Gaps = 13/460 (2%) Frame = +2 Query: 212 AAASSMSL----------DGISCSTTYGATCAWAKPPLDVRRVSDLVGMRCKHPFFGATQ 361 A ASS+SL D CST++G T +W K P+D R DL +R 
++PFFG+TQ Sbjct: 10 AIASSLSLGTSHCQLCQADAFCCSTSHGGTNSWNKSPIDSCRPCDLSSIRYRNPFFGSTQ 69 Query: 362 FHWLPTGHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTSPEIA 538 F W G + CL +RGYHPLEE+K+ KR R+ +LTS EIA Sbjct: 70 FQWSSVGRNLCLQKVSVAADYSDSVPDSSNYTSHRGYHPLEEVKLSKRTRETQLTSAEIA 129 Query: 539 RTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRGASNP 718 RTTVEAN+SALLVFPG+VHCEPH I+WAEF+Y+IDD GDIFFEIFD+ NILQDRGASNP Sbjct: 130 RTTVEANTSALLVFPGSVHCEPHGQISWAEFQYIIDDYGDIFFEIFDNSNILQDRGASNP 189 Query: 719 VNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGMPDTSS 898 VN LIGMD+ ++EN+++ +EYN Y E +D E S+V DWGMP TSS Sbjct: 190 VNVLIGMDIPMYENKKVVNEYNIFNVGSEDDIPFDEDYFEVMDSEDSEVPVDWGMPYTSS 249 Query: 899 LVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDSDGYS 1078 LVHPIYFAKC+TKAINME +KMDHPSN VSIVG LRP F DEE YLRT FH DSDGY+ Sbjct: 250 LVHPIYFAKCMTKAINMEYYRKMDHPSNGVSIVGCLRPAFSDEELYLRTSFHCGDSDGYN 309 Query: 1079 SDLKDGEIL--XXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDFLVHS 1252 SD KD EIL L+ LEIMRI+LFS+YG Q AVSLQDFQ+AEPD L HS Sbjct: 310 SDRKDTEILSFNSKSDVSSSGSTLHCLEIMRIELFSLYGSQSAVSLQDFQEAEPDVLAHS 369 Query: 1253 TSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTHRFPF 1432 T AILE +SE+G RCN+ALK LCKKKGL VERANLIGVDSLGMDVR+ SG E +THRFPF Sbjct: 370 TPAILEHFSEKGSRCNIALKALCKKKGLHVERANLIGVDSLGMDVRIFSGVEARTHRFPF 429 Query: 1433 KARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELKDPDYF 1552 K RAT + AA+KQI QLLFPR+RRKK + DEL D YF Sbjct: 430 KVRATCKTAAQKQIHQLLFPRARRKKFKTHEDELGDSSYF 469 >ref|XP_007034030.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] gi|508713059|gb|EOY04956.1| Pentatricopeptide repeat superfamily protein isoform 1 [Theobroma cacao] Length = 486 Score = 533 bits (1372), Expect = e-148 Identities = 285/486 (58%), Positives = 330/486 (67%), Gaps = 36/486 (7%) Frame = +2 Query: 203 MPIAAASSMSL----------DGISCSTTYGATCAWAKPPLDVRRVSDLVGM--RCKHPF 346 M IAAASS S+ +G+ S +G +W K D R SDL G+ RC+ PF Sbjct: 1 MAIAAASSFSVGPSQCHLCQVEGVYYSPLHGVNSSWVKTTFDGCRTSDLSGVSFRCRSPF 60 Query: 
347 FGATQFHWLPTGHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLT 523 FG+TQFHW GHD CL N+GYHPLEELKV KR+R+ KL+ Sbjct: 61 FGSTQFHWWSAGHDHCLSKVSVAADYSDSVPDSSSYARNQGYHPLEELKVLKRMRETKLS 120 Query: 524 SPEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDR 703 + E+ARTTVEANS+ALLVFPGTVH EPHE I+WAEF YVIDD GDIFFEIFDD NILQDR Sbjct: 121 AAEVARTTVEANSTALLVFPGTVHSEPHEQISWAEFHYVIDDYGDIFFEIFDDENILQDR 180 Query: 704 GASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGM 883 GASN VNALIGMD+ +HEN R+A EYN Y E +D E+S+ DWGM Sbjct: 181 GASNLVNALIGMDIPMHENNRVAGEYNISDIGNDDEIPFDDDYFEVMDSEMSEAPVDWGM 240 Query: 884 PDTSSL--VHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHP 1057 PDT++ VHPIYFAKCLTKA++ME +KMDHPSN VSIVG LRP F DEESYLR LFH Sbjct: 241 PDTATATWVHPIYFAKCLTKAVHMEHDRKMDHPSNGVSIVGCLRPAFYDEESYLRRLFHF 300 Query: 1058 EDSDGYSSDLKDGEI--LXXXXXXXXXXXILYRLEIMRIQLFSVYGIQC----------- 1198 ED+DGY+SD KDGE LYR+EIMR++LFS+YG+Q Sbjct: 301 EDNDGYTSDWKDGETSRSSSKYGGSKSDSTLYRMEIMRMELFSIYGVQAFLMKRIMEERL 360 Query: 1199 --------AVSLQDFQDAEPDFLVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERAN 1354 +SLQDFQDAEPD LVHSTSAILER+S+ G+RCNVALK LCKKKGLQ+E AN Sbjct: 361 SSCFLYLSLISLQDFQDAEPDVLVHSTSAILERFSQNGIRCNVALKALCKKKGLQIEGAN 420 Query: 1355 LIGVDSLGMDVRVSSGSEVQTHRFPFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDEL 1534 LIGVDSLG+DVR+ SG EV+THRFPFK RA SE AAEKQI +LLFPRS RKK + D Sbjct: 421 LIGVDSLGIDVRIFSGVEVRTHRFPFKVRAMSETAAEKQILKLLFPRSHRKKFRTDGDGF 480 Query: 1535 KDPDYF 1552 +DP F Sbjct: 481 RDPASF 486 >gb|EXB68700.1| hypothetical protein L484_024718 [Morus notabilis] Length = 459 Score = 524 bits (1349), Expect = e-146 Identities = 275/461 (59%), Positives = 324/461 (70%), Gaps = 14/461 (3%) Frame = +2 Query: 203 MPIAAASSMSL----------DGISCSTTYGATCAWAKPPLDVRRVSDLVGMRCKHPFFG 352 M + A SS+SL +GI C T+YG T + KPPLD R V G+RC+ PFF Sbjct: 1 MAVVAPSSLSLGQSHCHSCRGEGIYCLTSYGITNKFKKPPLDGRMVPHYAGIRCRSPFFS 60 Query: 353 ATQFHWLPTGHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTSP 529 ++QF WL G D CL N GYHPLEELKV K+ + LTS Sbjct: 61 
SSQFRWLSVGRDLCLWKVSVAADYSDSVPDSSNFMTNGGYHPLEELKVDKKNWETNLTSA 120 Query: 530 EIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRGA 709 EIART VEAN+SALL+FPGT+HCEPHE I+WAEF+YVIDD GDI+FE+ DD NIL+D A Sbjct: 121 EIARTAVEANNSALLIFPGTIHCEPHEQISWAEFQYVIDDYGDIYFEMLDDANILEDPSA 180 Query: 710 SNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXX-YVEFVDPEVSDVSDDWGMP 886 SNPVNALIGMD+ ++EN+R+A EYN Y E V+ EVS++ DWGMP Sbjct: 181 SNPVNALIGMDMPMYENKRVAGEYNISDNSGSIDEIPFDDDYFEVVESEVSEIPFDWGMP 240 Query: 887 DTSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDS 1066 S+L+HPIYFAKCLTK +NME +KMDHPSN VSI+G LRP F DEES++R LF ED Sbjct: 241 HASTLIHPIYFAKCLTKVVNMEYDRKMDHPSNGVSILGCLRPAFADEESHIRRLFCYEDG 300 Query: 1067 DGYSSDLKDGEILXXXXXXXXXXX--ILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDF 1240 DGY S+ DGE L LYRLEI+RI+LFS A+SLQDFQDAEPDF Sbjct: 301 DGYHSEWSDGETLSSNSRRDRGNSGSTLYRLEILRIELFS-----SAISLQDFQDAEPDF 355 Query: 1241 LVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTH 1420 LVHSTSAI+ER+SE+G+RC+VALK LCKKKGL VE A+LIGVDSLGMDVRVS GSEVQTH Sbjct: 356 LVHSTSAIVERFSEKGIRCDVALKALCKKKGLHVEGAHLIGVDSLGMDVRVSVGSEVQTH 415 Query: 1421 RFPFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELKDP 1543 RFPFK RATSE+AAEKQI+QL+FPR+RRKKL S L+DP Sbjct: 416 RFPFKVRATSEIAAEKQIRQLMFPRARRKKLRSHGTGLRDP 456 >ref|XP_004296717.1| PREDICTED: uncharacterized protein At3g49140-like [Fragaria vesca subsp. 
vesca] Length = 463 Score = 508 bits (1307), Expect = e-141 Identities = 266/460 (57%), Positives = 321/460 (69%), Gaps = 15/460 (3%) Frame = +2 Query: 203 MPIAAASSMSL----------DGISCSTTYGATCAWAKPPLDVRRVSDLVG--MRCKHPF 346 M IAAASS+ L +G+ CST +G + +W KP D RR D +G ++C++P Sbjct: 1 MAIAAASSLPLGSSHCHSCHTEGVCCSTKHGISNSWMKPHFDGRRSPDRLGVSLKCRNPL 60 Query: 347 FGATQFHWLPTGHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLT 523 G TQFHWL GH CL N+GYHPLEE+K K +RD KLT Sbjct: 61 VGPTQFHWLSIGHGLCLSKVFVAADFSDSAPESSSYMTNQGYHPLEEVKACKTVRDTKLT 120 Query: 524 SPEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDR 703 S EIARTTVEAN +ALLVFPG +H EPHE I+WAEF+YVIDD GD++FE+FDD NIL+D Sbjct: 121 SAEIARTTVEANDNALLVFPGKIHSEPHEQISWAEFQYVIDDYGDLYFELFDDANILEDP 180 Query: 704 GASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGM 883 ASNPVNAL GMD+ H N RI ++ Y+E V+PE DV D W + Sbjct: 181 TASNPVNALFGMDIPAHNNGRITGGFSILDDYNSDDMPFDDDYLEVVEPEAFDVLD-WEI 239 Query: 884 PDTSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPED 1063 PD S+++HPIYFAKCLTKAIN+ +KMDHPSN VSI+G L P F DEE Y+R LFH ED Sbjct: 240 PDASTVIHPIYFAKCLTKAINIRHDRKMDHPSNGVSILGCLIPAFADEEFYVRRLFHHED 299 Query: 1064 SDGYSSDLKDGE--ILXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPD 1237 SD Y SD KDG+ + LYRLEIMRI+LFSVYG+Q A+SLQDFQDAEPD Sbjct: 300 SD-YDSDEKDGKGVSISSKSDRSKTRSTLYRLEIMRIELFSVYGVQSAISLQDFQDAEPD 358 Query: 1238 FLVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQT 1417 FL++S S I+ER++E G+RC+VALK LCK+KGLQVE A+LIGVDSLGMDVRV SGSEVQT Sbjct: 359 FLINSISDIVERFNERGIRCDVALKALCKRKGLQVEGAHLIGVDSLGMDVRVFSGSEVQT 418 Query: 1418 HRFPFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELK 1537 HRFPF+ RA SE+ AEKQI+QLLFPRSRRKKL S+ + + Sbjct: 419 HRFPFRVRAKSELVAEKQIEQLLFPRSRRKKLRSQGNNFQ 458 >emb|CBI39163.3| unnamed protein product [Vitis vinifera] Length = 470 Score = 506 bits (1303), Expect = e-140 Identities = 274/464 (59%), Positives = 316/464 (68%), Gaps = 13/464 (2%) Frame = +2 Query: 167 F*SYRFSSMHPLMPIAAASSMSL----------DGISCSTTYGATCAWAKPPLDVRRVSD 316 F + SS+ 
P M IAA S SL +G CST+ A W + D R V + Sbjct: 5 FNRWTLSSLSPSMAIAAVPSFSLGLSYCHSCQGEGFCCSTSCRAISCWNRS-FDGRLVPN 63 Query: 317 LVGMRCKHPFFGATQFHWLPTGHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKV 493 L G R + FG+TQF WLP G D CL N+GYHPLEELK Sbjct: 64 LTGARKQ--IFGSTQFQWLPAGRDYCLSKVQVAADYSDSVPDSPKYMGNQGYHPLEELKE 121 Query: 494 RKRIRDIKLTSPEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEI 673 KRI++ +LT+ E ARTTVEAN SALL+ P VH EPH++I+WAEF+Y+IDD GDIFF+I Sbjct: 122 SKRIQEKRLTAAEAARTTVEANGSALLLLPRIVHSEPHDHISWAEFQYIIDDFGDIFFQI 181 Query: 674 FDDGNILQDRGASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPE 853 FDD NILQD GASNPVNALIGMD+S+++NRR+A EYN Y E D E Sbjct: 182 FDDQNILQDPGASNPVNALIGMDLSLYKNRRVAGEYNISESGSTDDISLDDDYFEVEDSE 241 Query: 854 VSDVSDDWGMPDTSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEES 1033 +SD+ DWG+PDTSSLVHPIYFAKCLTKA+NME K+MDHPSN +S+VG LRP FIDEE Sbjct: 242 MSDIPVDWGIPDTSSLVHPIYFAKCLTKAVNMEYNKEMDHPSNGISMVGCLRPAFIDEEP 301 Query: 1034 YLRTLFHPEDSDGYSSDLKDGEI--LXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVS 1207 YLR LF EDSDGY+SD KD EI YRLEIMRI+LFSVYGIQ +S Sbjct: 302 YLRRLFSCEDSDGYTSDWKDEEITGFSSKGDGHNPRSTFYRLEIMRIELFSVYGIQALIS 361 Query: 1208 LQDFQDAEPDFLVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDV 1387 LQDFQDAEPD LVHST AI+E ++E G NVALK LCKKKG VE ANLIGVDSLGMDV Sbjct: 362 LQDFQDAEPDVLVHSTKAIVEHFTENGTWFNVALKALCKKKGFHVEGANLIGVDSLGMDV 421 Query: 1388 RVSSGSEVQTHRFPFKARATSEVAAEKQIQQLLFPRSRRKKLNS 1519 RV +G E+QTHRF FK RATS AAEKQIQQLLFP SRRKK+ + Sbjct: 422 RVFTGVEIQTHRFSFKVRATSAAAAEKQIQQLLFPPSRRKKVQN 465 >ref|XP_002530542.1| conserved hypothetical protein [Ricinus communis] gi|223529904|gb|EEF31833.1| conserved hypothetical protein [Ricinus communis] Length = 409 Score = 504 bits (1297), Expect = e-140 Identities = 259/408 (63%), Positives = 296/408 (72%), Gaps = 3/408 (0%) Frame = +2 Query: 326 MRCKHPFFGATQFHWLPTGHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKR 502 MR ++P+FG+TQFHWL G+D CL ++ YHPLE++KV +R Sbjct: 1 MRYRYPYFGSTQFHWLTVGYDRCLWKASVAADYSDSVPDSSSYTSHQSYHPLEDVKVNRR 60 Query: 503 
IRDIKLTSPEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDD 682 IRD +L+S EIARTTVEANSSALLVFPGTVHCEPHE I+WAEF+YV+DD GDIFFEIFDD Sbjct: 61 IRDTQLSSAEIARTTVEANSSALLVFPGTVHCEPHEQISWAEFQYVVDDYGDIFFEIFDD 120 Query: 683 GNILQDRGASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSD 862 +ILQD GA+NP+NA IGMD+ ++EN+RIA+EYN Y E +D EVSD Sbjct: 121 ISILQDPGATNPMNAFIGMDIPMYENKRIANEYNVFDIGSTDDIPFDDDYFEVMDSEVSD 180 Query: 863 VSDDWGMPDTSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLR 1042 V DWGMPDTS+ VHPIYFAKCLTKA +MEC +KMDHPSN VSI+G LRP F DEESYLR Sbjct: 181 VPVDWGMPDTSTWVHPIYFAKCLTKATDMECDRKMDHPSNGVSILGCLRPAFADEESYLR 240 Query: 1043 TLFHPEDSDGYSSDLKDGEIL--XXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQD 1216 LFH +DSD Y+SD D EIL LYRLEIMRI+LFSVYG Q Sbjct: 241 RLFHCQDSDNYNSDWTDVEILSFSSKGDGSSRGSTLYRLEIMRIELFSVYGAQACTY--- 297 Query: 1217 FQDAEPDFLVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVS 1396 FQDAEPD LVHSTSAIL+ +S G+RCN ALK LCKKKGL VE ANLIG+DSLG+DVR Sbjct: 298 FQDAEPDVLVHSTSAILDHFSNNGIRCNAALKALCKKKGLHVEGANLIGIDSLGIDVRTF 357 Query: 1397 SGSEVQTHRFPFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELKD 1540 SG EVQT RFPFK RAT E AAEKQI QLLFP SRRKK S D L+D Sbjct: 358 SGVEVQTQRFPFKVRATCEAAAEKQIHQLLFPPSRRKKFRSHGDRLRD 405 >ref|XP_004152092.1| PREDICTED: uncharacterized protein At3g49140-like [Cucumis sativus] Length = 446 Score = 486 bits (1250), Expect = e-134 Identities = 249/452 (55%), Positives = 316/452 (69%), Gaps = 6/452 (1%) Frame = +2 Query: 203 MPIAAASSMSLDGISCSTTYGATCAWAKPPLDVRRVSDLVGMRC-KHPFFGATQFHWLPT 379 M IA ASS++ +G CS +Y T +W + DV C ++ FG+T+FHWL Sbjct: 1 MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDV----------CGRNKKFGSTEFHWLSK 50 Query: 380 GHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTSPEIARTTVEA 556 G D CL N+GYHPLE+LKV K +R+ +LT+ E+ART VE Sbjct: 51 GRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEV 110 Query: 557 NSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRGASNPVNALIG 736 NS+ALL+FPGTVH EPHE ++W EF+YV DD GD++FEIFD N+L+DR A NPVNALIG Sbjct: 111 
NSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIG 170 Query: 737 MDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGMPDTSSLVHPIY 916 MD+ ++E+RRI +Y+ Y+E V+ +++++ DWG+PD SS+VHP+Y Sbjct: 171 MDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPVY 230 Query: 917 FAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDSDGYSSDLK-- 1090 FAKCL K INME + M HPSN VSI+G LRP + DEESY+R LF+ E+S+GY+++ K Sbjct: 231 FAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESEGYNTEWKGL 290 Query: 1091 DGEI--LXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDFLVHSTSAI 1264 +GE L LYRLEIMRI+LFSVYG+Q VSLQDFQDAEPD L+HST+ I Sbjct: 291 EGETSNLESKIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEI 350 Query: 1265 LERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTHRFPFKARA 1444 LER++E+G++CN+ALK LCKK+GL VE A LIGVDSLGMDVRV G+EV+T RFPFK RA Sbjct: 351 LERFNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRA 410 Query: 1445 TSEVAAEKQIQQLLFPRSRRKKLNSRSDELKD 1540 TSE AAEKQIQQLLFPRSRRKKL S D L+D Sbjct: 411 TSEAAAEKQIQQLLFPRSRRKKLRSHGDGLRD 442 >ref|XP_004172166.1| PREDICTED: uncharacterized protein At3g49140-like [Cucumis sativus] Length = 437 Score = 477 bits (1228), Expect = e-132 Identities = 246/449 (54%), Positives = 312/449 (69%), Gaps = 3/449 (0%) Frame = +2 Query: 203 MPIAAASSMSLDGISCSTTYGATCAWAKPPLDVRRVSDLVGMRC-KHPFFGATQFHWLPT 379 M IA ASS++ +G CS +Y T +W + DV C ++ FG+T+FHWL Sbjct: 1 MAIAVASSLTFEGAPCSKSYAFTSSWNRSSFDV----------CGRNKKFGSTEFHWLSK 50 Query: 380 GHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTSPEIARTTVEA 556 G D CL N+GYHPLE+LKV K +R+ +LT+ E+ART VE Sbjct: 51 GRDLCLSKVSVAADYPDSVPDSSSYLTNKGYHPLEDLKVCKSVRNTELTAAEVARTAVEV 110 Query: 557 NSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRGASNPVNALIG 736 NS+ALL+FPGTVH EPHE ++W EF+YV DD GD++FEIFD N+L+DR A NPVNALIG Sbjct: 111 NSNALLLFPGTVHSEPHEQVSWDEFQYVTDDYGDLYFEIFDSVNMLEDRRAHNPVNALIG 170 Query: 737 MDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGMPDTSSLVHPIY 916 MD+ ++E+RRI +Y+ Y+E V+ +++++ DWG+PD SS+VHP+Y Sbjct: 171 
MDMQMYESRRIVGDYSDVDSGYGDVAPFDYDYIEVVEADLANIPVDWGVPDVSSMVHPVY 230 Query: 917 FAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDS-DGYSSDLKD 1093 FAKCL K INME + M HPSN VSI+G LRP + DEESY+R LF+ E+S +G +S+L+ Sbjct: 231 FAKCLKKVINMEYDRNMKHPSNGVSILGCLRPAYADEESYIRRLFYFEESLEGETSNLES 290 Query: 1094 GEILXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDFLVHSTSAILER 1273 LYRLEIMRI+LFSVYG+Q VSLQDFQDAEPD L+HST+ ILER Sbjct: 291 ------KIDRSSQRSTLYRLEIMRIELFSVYGVQSEVSLQDFQDAEPDILLHSTAEILER 344 Query: 1274 YSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTHRFPFKARATSE 1453 ++E+G++CN+ALK LCKK+GL VE A LIGVDSLGMDVRV G+EV+T RFPFK RATSE Sbjct: 345 FNEKGIKCNIALKALCKKRGLHVEDAILIGVDSLGMDVRVCVGTEVRTFRFPFKIRATSE 404 Query: 1454 VAAEKQIQQLLFPRSRRKKLNSRSDELKD 1540 AAEKQIQQLLFPRSRRKKL S D L+D Sbjct: 405 AAAEKQIQQLLFPRSRRKKLRSHGDGLRD 433 >ref|XP_002268699.1| PREDICTED: uncharacterized protein LOC100253226 [Vitis vinifera] Length = 518 Score = 469 bits (1207), Expect = e-129 Identities = 239/356 (67%), Positives = 273/356 (76%), Gaps = 2/356 (0%) Frame = +2 Query: 458 NRGYHPLEELKVRKRIRDIKLTSPEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEY 637 N+GYHPLEELK KRI++ +LT+ E ARTTVEAN SALL+ P VH EPH++I+WAEF+Y Sbjct: 158 NQGYHPLEELKESKRIQEKRLTAAEAARTTVEANGSALLLLPRIVHSEPHDHISWAEFQY 217 Query: 638 VIDDNGDIFFEIFDDGNILQDRGASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXX 817 +IDD GDIFF+IFDD NILQD GASNPVNALIGMD+S+++NRR+A EYN Sbjct: 218 IIDDFGDIFFQIFDDQNILQDPGASNPVNALIGMDLSLYKNRRVAGEYNISESGSTDDIS 277 Query: 818 XXXXYVEFVDPEVSDVSDDWGMPDTSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIV 997 Y E D E+SD+ DWG+PDTSSLVHPIYFAKCLTKA+NME K+MDHPSN +S+V Sbjct: 278 LDDDYFEVEDSEMSDIPVDWGIPDTSSLVHPIYFAKCLTKAVNMEYNKEMDHPSNGISMV 337 Query: 998 GRLRPVFIDEESYLRTLFHPEDSDGYSSDLKDGEI--LXXXXXXXXXXXILYRLEIMRIQ 1171 G LRP FIDEE YLR LF EDSDGY+SD KD EI YRLEIMRI+ Sbjct: 338 GCLRPAFIDEEPYLRRLFSCEDSDGYTSDWKDEEITGFSSKGDGHNPRSTFYRLEIMRIE 397 Query: 1172 LFSVYGIQCAVSLQDFQDAEPDFLVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERA 1351 LFSVYGIQ +SLQDFQDAEPD LVHST AI+E ++E G NVALK LCKKKG VE A Sbjct: 398 
LFSVYGIQALISLQDFQDAEPDVLVHSTKAIVEHFTENGTWFNVALKALCKKKGFHVEGA 457 Query: 1352 NLIGVDSLGMDVRVSSGSEVQTHRFPFKARATSEVAAEKQIQQLLFPRSRRKKLNS 1519 NLIGVDSLGMDVRV +G E+QTHRF FK RATS AAEKQIQQLLFP SRRKK+ + Sbjct: 458 NLIGVDSLGMDVRVFTGVEIQTHRFSFKVRATSAAAAEKQIQQLLFPPSRRKKVQN 513 >ref|XP_004241392.1| PREDICTED: uncharacterized protein LOC101247332 [Solanum lycopersicum] Length = 467 Score = 468 bits (1205), Expect = e-129 Identities = 249/442 (56%), Positives = 308/442 (69%), Gaps = 9/442 (2%) Frame = +2 Query: 236 DGISCSTTYGATCAWAKPPLDVRRVSDLVGM--RCKHPFFGATQFHWLPTGHDC-LXXXX 406 DG SCS + GA +W KP +V+ SD G+ R ++PFFGA Q HWL GH+ L Sbjct: 23 DGGSCSASLGAASSWIKPSYEVQIFSDHSGISFRTENPFFGAAQSHWLAVGHESSLSRIS 82 Query: 407 XXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTSPEIARTTVEANSSALLVFPG 586 N GYHPLE ++ ++R+RD +LT+ EIARTTVEAN++ALL+FPG Sbjct: 83 VAADYPDSVPDSPNYVRNSGYHPLEGMRDQRRVRDTELTAAEIARTTVEANNNALLIFPG 142 Query: 587 TVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRGASNPVNALIGMDVSIHENRR 766 TVHCEPHE ++WAEF+YVID+ GDIFFEI+DD NIL++R ASN VNALIGM+ S +E RR Sbjct: 143 TVHCEPHEQVSWAEFQYVIDEYGDIFFEIYDDKNILRNRDASNSVNALIGMEFSQYEKRR 202 Query: 767 IASEYNXXXXXXXXXXXXXXX-YVEFVDPEVSDVSDDWGMPDTSSLVHPIYFAKCLTKAI 943 + S + Y E E+ D DWGMPD+SS +HP+YFAKCLTKA+ Sbjct: 203 VESPDDINLAGDSVDDSNFFDDYFEGESSEMYDYQVDWGMPDSSSPLHPVYFAKCLTKAV 262 Query: 944 NMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPED-SDGYSSDLKDGEILXXXXX 1120 +M+ K MDHPSN +SI GRL+P F++EE Y+R LF ++ SDG + D KDGEIL Sbjct: 263 HMKHAKMMDHPSNGISIWGRLKPAFLEEEYYVRRLFSGDEVSDGSTLDWKDGEILSFSSR 322 Query: 1121 XXXXXXI--LYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDFLVHSTSAILERYSEEGMR 1294 + +YRLEIMR+ LFSVYG Q AV+L DF DAEPD LV+S AILE + ++G+R Sbjct: 323 YDKSRTLSSIYRLEIMRVDLFSVYGAQLAVNLYDFHDAEPDSLVYSAPAILEWFRQQGIR 382 Query: 1295 CNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTHRFPFKARATSEVAAEKQI 1474 C ALK LC+KKGL VERANLIGVDSLGMDVRV SG+EV THRFPFK RA SE+AAEKQI Sbjct: 383 CKYALKALCRKKGLHVERANLIGVDSLGMDVRVLSGTEVWTHRFPFKVRAHSEIAAEKQI 442 Query: 1475 QQLLFPRSRRKKLNS--RSDEL 1534 +QLLFPRSRRKK + RS +L Sbjct: 443 
RQLLFPRSRRKKFRTAERSGDL 464 >ref|XP_002320457.2| hypothetical protein POPTR_0014s14980g [Populus trichocarpa] gi|550324233|gb|EEE98772.2| hypothetical protein POPTR_0014s14980g [Populus trichocarpa] Length = 538 Score = 458 bits (1178), Expect = e-126 Identities = 261/501 (52%), Positives = 314/501 (62%), Gaps = 41/501 (8%) Frame = +2 Query: 161 LYF*SYRFSSMHPLMPIAAASSMSL---DGISCSTTYGATCAWAKPPLDVRRVSDLVGMR 331 L+ SY+ M + ++ AS L D I CST YG T W K P++ R DL +R Sbjct: 40 LHVISYKVFLMTFMFTVSGASHCQLSQADRICCSTPYGFTNGWIKSPINSCRSCDLSSIR 99 Query: 332 CKHPFFGATQFHWLPTGHD-CLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIR 508 ++PFFG+TQF W + CL ++GYHPLEE+K+ KR R Sbjct: 100 YRNPFFGSTQFQWSSVDRELCLLKVSVAADYSDSVPDSSNYTSHQGYHPLEEVKISKRTR 159 Query: 509 DIKLTSPEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGD------IFFE 670 + +LTS EIARTTVEAN+SALLVFPG+VHCEPH+ I+W EF+Y+ID+ G Sbjct: 160 ETQLTSAEIARTTVEANTSALLVFPGSVHCEPHKQISWTEFQYIIDEYGGKKKTAKTREA 219 Query: 671 IFDDGNILQDRGASNPV--NALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFV 844 + DG L+D G + V N LIGMD+ I+EN+++A+EY+ Y E + Sbjct: 220 MLRDG--LRDDGHATIVFKNVLIGMDIPIYENKKVANEYSIFNIGSEDDLPFDEDYFEAM 277 Query: 845 DPEVSDVSDDWGMPDTSSLVHPIYFAKCLTK---------------------------AI 943 D S+VS DWGMPDT SLVHPIYF+KC+TK AI Sbjct: 278 D---SEVSVDWGMPDTFSLVHPIYFSKCMTKWRNHICHTTMEMAWDWLSLYGLENSQMAI 334 Query: 944 NMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDSDGYSSDLKDGEILXXXXXX 1123 NME +KMDHPSN VSIVG LRP F DEESYLR FH EDSDG +SD KDGEIL Sbjct: 335 NMEYCRKMDHPSNGVSIVGCLRPSFADEESYLRRSFHCEDSDGCNSDWKDGEILSFSSKS 394 Query: 1124 XXXXX--ILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDFLVHSTSAILERYSEEGMRC 1297 L+RLEI+RI+LFS+YG Q VSLQDFQDAEPD L STSAILE +S +G RC Sbjct: 395 DGSSSGSTLHRLEILRIELFSLYGSQSVVSLQDFQDAEPDVLAPSTSAILEHFSGKGSRC 454 Query: 1298 NVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTHRFPFKARATSEVAAEKQIQ 1477 NVALK LCKKKGL VE ANL+G+DSLGMDVR+ G E +THRFPFK RAT EVAA+KQ+ Sbjct: 455 NVALKALCKKKGLHVEAANLVGIDSLGMDVRIFCGVEARTHRFPFKVRATCEVAAQKQMH 514 Query: 1478 QLLFPRSRRKKLNSRSDELKD 1540 QLLFPRSRRKK S DEL D Sbjct: 515 
QLLFPRSRRKKFRSHEDELGD 535 >ref|XP_003617306.1| hypothetical protein MTR_5g090150 [Medicago truncatula] gi|355518641|gb|AET00265.1| hypothetical protein MTR_5g090150 [Medicago truncatula] Length = 445 Score = 451 bits (1160), Expect = e-124 Identities = 237/448 (52%), Positives = 299/448 (66%) Frame = +2 Query: 200 LMPIAAASSMSLDGISCSTTYGATCAWAKPPLDVRRVSDLVGMRCKHPFFGATQFHWLPT 379 L+P A+ +GI T+YG TC K P+D RRV DL RCK PFFG+++F W T Sbjct: 6 LLPFAS------EGICYPTSYGITCNSIKFPIDGRRVHDLTSTRCKSPFFGSSRFFWQST 59 Query: 380 GHDCLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTSPEIARTTVEAN 559 GHD + +GYHPLEELKV + +L+S EIARTT+EAN Sbjct: 60 GHDFVSKIGVAADYSDSIPDSSSYMGKQGYHPLEELKVSNDLPPARLSSAEIARTTIEAN 119 Query: 560 SSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRGASNPVNALIGM 739 +ALLVFPG+VH EPHE I+WAEF+Y+IDD GD++FEIFDD N+L+DRGA NPVNALIGM Sbjct: 120 KNALLVFPGSVHSEPHEQISWAEFQYLIDDFGDLYFEIFDDVNLLEDRGAHNPVNALIGM 179 Query: 740 DVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGMPDTSSLVHPIYF 919 D+ +++NRR SEY+ Y+E + E S+ +WG+ D S+ VHPIYF Sbjct: 180 DIPMYDNRRPISEYDIFNGGITDEFPFDEDYIEVPEIEESNAPVNWGLSDNSNPVHPIYF 239 Query: 920 AKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDSDGYSSDLKDGE 1099 +KCL KA+N+E K+MDHPSN VSI+G LRP + DEESY+R ++H ED DGYSSD KD Sbjct: 240 SKCLEKAVNVEYDKRMDHPSNGVSILGYLRPAYADEESYIRMIYHTEDDDGYSSDWKD-F 298 Query: 1100 ILXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDFLVHSTSAILERYS 1279 ILY+LEI +I+L VYG Q +SL +FQDAEPD +V+STSAILER + Sbjct: 299 YSNSINDQRDANLILYKLEIEKIKLHCVYGSQSEISLLEFQDAEPDIIVYSTSAILERIN 358 Query: 1280 EEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTHRFPFKARATSEVA 1459 G + AL+ CKKKGL E A+LIGVD LG+DVRV SGSEV+THRF FK +A S Sbjct: 359 RNG---HDALQAFCKKKGLDAEEAHLIGVDHLGVDVRVLSGSEVKTHRFAFKVQANSGYM 415 Query: 1460 AEKQIQQLLFPRSRRKKLNSRSDELKDP 1543 AEKQI QLL+PRSRRK+ + L++P Sbjct: 416 AEKQIVQLLYPRSRRKR--NMQQSLRNP 441 >ref|NP_567080.1| pentatricopeptide repeat-containing protein-like protein [Arabidopsis thaliana] gi|15292859|gb|AAK92800.1| unknown protein [Arabidopsis thaliana] 
gi|20258901|gb|AAM14144.1| unknown protein [Arabidopsis thaliana] gi|332646380|gb|AEE79901.1| pentatricopeptide repeat-containing protein-like protein [Arabidopsis thaliana] Length = 459 Score = 447 bits (1151), Expect = e-123 Identities = 244/460 (53%), Positives = 302/460 (65%), Gaps = 14/460 (3%) Frame = +2 Query: 203 MPIAAASSMSLDGISCSTTY----GATCAWAKPPLDVRRVSD--------LVGMRCKHPF 346 M IAAASS SL C +Y ++ + + RV D ++ RCK PF Sbjct: 1 MVIAAASSFSLGPSHCHQSYTDEFSSSIPYKRTSNARNRVFDGCGSANLSVLSSRCKIPF 60 Query: 347 FGATQFHWLPTGHDCLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTS 526 FG+ FH GHD GYHPLE+LK KR+++ KL++ Sbjct: 61 FGSA-FHVSSGGHDLGLTKVSVAADYSDSVPDSSFY---GYHPLEDLKPSKRVQETKLSA 116 Query: 527 PEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRG 706 E+ARTTVEANSSA+LVFPG +HCEPH++ +W+EF+YVIDD GDIFFEI DD NIL+D G Sbjct: 117 SEVARTTVEANSSAVLVFPGAIHCEPHDHNSWSEFKYVIDDYGDIFFEIPDDENILEDPG 176 Query: 707 ASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGMP 886 ASNPV A GMDV +EN R EYN Y E +D E D+ DWGMP Sbjct: 177 ASNPVKAFFGMDVPRYENTRHHEEYNISDIGNLDQIIFDDHYFEIMDSEARDIPIDWGMP 236 Query: 887 DTSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDS 1066 DTS+ VHPIYFAK L+KAI+M+ +KMD+PSN VSI+G LRP F+DEESY+R LF ED Sbjct: 237 DTSNGVHPIYFAKHLSKAISMDYDRKMDYPSNGVSILGCLRPAFLDEESYIRRLFLSEDR 296 Query: 1067 DGYSSDLK--DGEILXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDF 1240 D YS +++ D I LYRLEI+ I+L S+YG + ++SLQDFQDAEPD Sbjct: 297 DDYSWEVQGDDNPITSSRRDENDMSSSLYRLEIVGIELLSLYGAESSISLQDFQDAEPDI 356 Query: 1241 LVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTH 1420 LVHSTSAI+ER++ G+ ++ALK LCKKKGL E ANLI VDSLGMDVRV +G++VQTH Sbjct: 357 LVHSTSAIIERFNNRGINSSIALKALCKKKGLHAEEANLISVDSLGMDVRVFAGAQVQTH 416 Query: 1421 RFPFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELKD 1540 RFPFK RAT+E+AAEK+I QLLFPRSRR+KL + LKD Sbjct: 417 RFPFKTRATTEMAAEKKIHQLLFPRSRRRKLKCHDESLKD 456 >ref|XP_006291096.1| hypothetical protein CARUB_v10017210mg [Capsella rubella] gi|482559803|gb|EOA23994.1| hypothetical protein 
CARUB_v10017210mg [Capsella rubella] Length = 457 Score = 445 bits (1144), Expect = e-122 Identities = 238/458 (51%), Positives = 298/458 (65%), Gaps = 12/458 (2%) Frame = +2 Query: 203 MPIAAASSMSLDGISCSTT----YGATCAWAKPPLDVRRVSD--------LVGMRCKHPF 346 M IAAASS SL C + + ++ + + RV D ++ RCK PF Sbjct: 1 MVIAAASSFSLGPSHCHQSHADEFSSSMPYKRNGSSRSRVFDGCASANLSVLSSRCKIPF 60 Query: 347 FGATQFHWLPTGHDCLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTS 526 FG+ FH GHD GYHPLE+LK KR+++ KL+ Sbjct: 61 FGSA-FHVSSGGHDLGLTKVSVAADYSDSVPDSSFY---GYHPLEDLKPSKRVQETKLSP 116 Query: 527 PEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRG 706 E+ARTTVEANSSA+L+FPG +HCEPH+ +W+EF+YVID+ GDIFFEI DD NIL+D Sbjct: 117 AEVARTTVEANSSAVLIFPGAIHCEPHDQTSWSEFKYVIDEYGDIFFEIPDDVNILEDPE 176 Query: 707 ASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGMP 886 ASNPV A GMDV +EN R+ EYN Y E +D E D+ DWGMP Sbjct: 177 ASNPVKAFFGMDVPRYENTRLHEEYNISDIGNLDQIIFDDHYFEIMDSEARDIPVDWGMP 236 Query: 887 DTSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDS 1066 DTS+ VHPIYFAK ++KAI+M+ +KMD+PSN VSI+G LRP F+DEESY+R LF ED Sbjct: 237 DTSNAVHPIYFAKHMSKAISMDYDRKMDYPSNGVSILGCLRPAFLDEESYIRRLFTSEDR 296 Query: 1067 DGYSSDLKDGEILXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDFLV 1246 D YS + +D LYRLEI+ I+L S+YG + ++SLQDFQDAEPD LV Sbjct: 297 DDYSWEAQDNPSTSLRRDEKDISSSLYRLEIVGIELLSLYGAESSISLQDFQDAEPDILV 356 Query: 1247 HSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTHRF 1426 HSTSAI+ER++ G+ ++ALK LCKKKGL E ANLI VDSLGMDVRV +G++VQTHRF Sbjct: 357 HSTSAIIERFNNRGINSSIALKALCKKKGLHAEEANLISVDSLGMDVRVFAGAQVQTHRF 416 Query: 1427 PFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELKD 1540 PFK RAT+E+AAEK+I QLLFPRSRR+KL S + L D Sbjct: 417 PFKTRATTEMAAEKKIHQLLFPRSRRRKLKSHDESLND 454 >ref|XP_006402701.1| hypothetical protein EUTSA_v10005960mg [Eutrema salsugineum] gi|557103800|gb|ESQ44154.1| hypothetical protein EUTSA_v10005960mg [Eutrema salsugineum] Length = 459 Score = 442 bits (1138), Expect = e-121 Identities = 241/460 (52%), Positives = 300/460 (65%), 
Gaps = 14/460 (3%) Frame = +2 Query: 203 MPIAAASSMSLDGI----SCSTTYGATCAWAKPPLDVRRVSD--------LVGMRCKHPF 346 M IAA SS SL S + + ++ + + RV D ++ RCK PF Sbjct: 1 MVIAATSSFSLGPSHYHQSYTDEFSSSMPYKRNGSARNRVFDGCGSANLSVLSSRCKIPF 60 Query: 347 FGATQFHWLPTGHDCLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTS 526 G+ FH GHD GYHPLEELK KR+++ KL++ Sbjct: 61 LGSA-FHVSTGGHDLGLTKVSVAADYSDSVPDSSFY---GYHPLEELKPSKRVQETKLSA 116 Query: 527 PEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRG 706 PE+ARTTVEANSSA+LVFPG +HCEPH+ +W+EF+YVIDD GDIFFEI DD NIL+D G Sbjct: 117 PEVARTTVEANSSAVLVFPGAIHCEPHDQNSWSEFKYVIDDYGDIFFEIPDDENILEDPG 176 Query: 707 ASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGMP 886 ASNPV A GMDV +EN R+ EYN Y E +D E D+ DWGMP Sbjct: 177 ASNPVKAFFGMDVPRYENARLHEEYNMSDIGNLDQIIFDDHYFEIMDSEARDIPVDWGMP 236 Query: 887 DTSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDS 1066 DTS+ VHPIYFAK L+KAI+++ +KMD+PSN VSI+G LRP F+DEESY+R LF ED Sbjct: 237 DTSNGVHPIYFAKHLSKAISVDYDRKMDYPSNGVSILGCLRPAFLDEESYIRRLFLSEDR 296 Query: 1067 DGYSSDLK--DGEILXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDF 1240 D YS D++ D LYRLEI+ I+L S+YG + ++SLQDFQDAEPD Sbjct: 297 DDYSWDVQGDDNPSTSSRREENDMSSSLYRLEIVGIELLSLYGAESSISLQDFQDAEPDI 356 Query: 1241 LVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTH 1420 LVHSTSAI+ER++ G+ ++ALK LCKKKGL E ANLI VDSLGMDVRV +G++VQTH Sbjct: 357 LVHSTSAIIERFNNRGINSSIALKALCKKKGLHAEEANLISVDSLGMDVRVFAGAQVQTH 416 Query: 1421 RFPFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELKD 1540 RFPFK RA +E+AAEK+I QLLFPRSRR+K+ S + LKD Sbjct: 417 RFPFKTRAMTEIAAEKKIHQLLFPRSRRRKMKSHEESLKD 456 >ref|XP_002876496.1| hypothetical protein ARALYDRAFT_486397 [Arabidopsis lyrata subsp. lyrata] gi|297322334|gb|EFH52755.1| hypothetical protein ARALYDRAFT_486397 [Arabidopsis lyrata subsp. 
lyrata] Length = 459 Score = 442 bits (1138), Expect = e-121 Identities = 242/460 (52%), Positives = 300/460 (65%), Gaps = 14/460 (3%) Frame = +2 Query: 203 MPIAAASSMSLDGISCSTTYG----ATCAWAKPPLDVRRVSD--------LVGMRCKHPF 346 M IAAASS SL C +Y ++ + + RV D ++ RCK PF Sbjct: 1 MVIAAASSFSLGASHCHQSYADEFSSSIPYKRNGNARNRVFDGCGSANLSVLSSRCKIPF 60 Query: 347 FGATQFHWLPTGHDCLXXXXXXXXXXXXXXXXXXXXXNRGYHPLEELKVRKRIRDIKLTS 526 FG+ FH GHD GYHPLE+LK KR+++ KL++ Sbjct: 61 FGSA-FHVSSGGHDLGLTKVSVAADYSDSVPDSSFY---GYHPLEDLKPSKRVQETKLSA 116 Query: 527 PEIARTTVEANSSALLVFPGTVHCEPHENITWAEFEYVIDDNGDIFFEIFDDGNILQDRG 706 E+ARTTVEANSSA+LVFPG +HCEPH++ +W+EF+YVIDD GDIFFEI DD NIL+D G Sbjct: 117 SEVARTTVEANSSAVLVFPGAIHCEPHDHNSWSEFKYVIDDYGDIFFEIPDDENILEDPG 176 Query: 707 ASNPVNALIGMDVSIHENRRIASEYNXXXXXXXXXXXXXXXYVEFVDPEVSDVSDDWGMP 886 ASNPV A GMDV +EN R EYN Y E +D E D+ DWGMP Sbjct: 177 ASNPVKAFFGMDVPRYENTRHHEEYNISDIGNLDQIIFDDHYFEIMDSEARDIPIDWGMP 236 Query: 887 DTSSLVHPIYFAKCLTKAINMECGKKMDHPSNSVSIVGRLRPVFIDEESYLRTLFHPEDS 1066 DTS+ VHPIYFAK L+KAI+M+ +KMD+PSN VSI+G LRP F+DEESY+R LF ED Sbjct: 237 DTSNGVHPIYFAKHLSKAISMDYDRKMDYPSNGVSILGCLRPAFLDEESYIRRLFLSEDR 296 Query: 1067 DGYSSDLK--DGEILXXXXXXXXXXXILYRLEIMRIQLFSVYGIQCAVSLQDFQDAEPDF 1240 D YS +++ D LYRLEI+ I+L S+YG + ++SLQDFQDAEPD Sbjct: 297 DDYSWEVQGDDNPNTSSRQDENDMSSSLYRLEIVGIELLSLYGAESSISLQDFQDAEPDI 356 Query: 1241 LVHSTSAILERYSEEGMRCNVALKGLCKKKGLQVERANLIGVDSLGMDVRVSSGSEVQTH 1420 LVHS SAI+ER++ G+ ++ALK LCKKKGL E ANLI VDSLGMDVRV +G++VQTH Sbjct: 357 LVHSMSAIIERFNNRGINSSIALKALCKKKGLHAEEANLISVDSLGMDVRVFAGAQVQTH 416 Query: 1421 RFPFKARATSEVAAEKQIQQLLFPRSRRKKLNSRSDELKD 1540 RFPFK RAT+E+AAEK+I QLLFPRSRR+KL S + L D Sbjct: 417 RFPFKTRATTEMAAEKKIHQLLFPRSRRRKLKSHDESLND 456