BLASTX nr result
ID: Rehmannia25_contig00010374
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia25_contig00010374
         (1277 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]   285   4e-74
ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2...   281   3e-73
emb|CBI24128.3| unnamed protein product [Vitis vinifera]              265   2e-68
gb|EOY14331.1| Eukaryotic aspartyl protease family protein, puta...   249   1e-63
gb|EOX93240.1| Eukaryotic aspartyl protease family protein, puta...   247   9e-63
gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea]       241   4e-61
gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]    231   5e-58
ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Caps...   231   6e-58
ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citr...   229   2e-57
ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arab...   228   4e-57
ref|NP_187876.2| aspartyl protease family protein [Arabidopsis t...   226   1e-56
gb|AAL49921.1| unknown protein [Arabidopsis thaliana]                 226   1e-56
ref|XP_006407304.1| hypothetical protein EUTSA_v10020732mg [Eutr...   226   2e-56
ref|XP_004293837.1| PREDICTED: aspartic proteinase nepenthesin-1...   224   6e-56
ref|XP_006429804.1| hypothetical protein CICLE_v10013820mg [Citr...   214   5e-53
ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative...   207   6e-51
gb|EMJ13046.1| hypothetical protein PRUPE_ppa004710mg [Prunus pe...   204   5e-50
gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indi...   198   5e-48
ref|XP_006859053.1| hypothetical protein AMTR_s00068p00192210 [A...
197 1e-47 gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays] 193 1e-46 >emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera] Length = 449 Score = 285 bits (728), Expect = 4e-74 Identities = 147/270 (54%), Positives = 190/270 (70%), Gaps = 10/270 (3%) Frame = +2 Query: 497 IKFELTHRRNGA------TQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEK 658 ++ EL HR + TQL+RL++L+HSD++R I K+ + G + RR+ +E Sbjct: 1 MRLELIHRHSPQVMGRPKTQLQRLKELVHSDSVRQLMILHKL--RGGQIP---RRKAKEV 55 Query: 659 NTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLT 838 + ++S R D + E+ M AADYGIGQYFV F++G+P+QK ML+ADTGSDLT Sbjct: 56 LS-----SSSGRGSDD---AIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLT 107 Query: 839 WMNCRYRCRGASCR----RNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCAS 1006 WM+C+Y CR +C R R +R+F A+ SSSF+ +PC + CKI+L +LFSL C + Sbjct: 108 WMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPT 167 Query: 1007 PMDPCAYDYRYSDGSAALGVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGV 1186 P+ PC YDYRYSDGS ALG F NETVT L GRK +LH+VL+GCSES +G+SFQ ADGV Sbjct: 168 PLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGV 227 Query: 1187 MGLGYSNYSFAVKAANKFGGKFSYCLVDHL 1276 MGLGYS YSFA+KAA KFGGKFSYCLVDHL Sbjct: 228 MGLGYSKYSFAIKAAEKFGGKFSYCLVDHL 257 >ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera] Length = 449 Score = 281 bits (720), Expect = 3e-73 Identities = 146/270 (54%), Positives = 189/270 (70%), Gaps = 10/270 (3%) Frame = +2 Query: 497 IKFELTHRRNGA------TQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEK 658 ++ EL HR + TQL+RL++L+HSD++R I K+ + G + RR+ +E Sbjct: 1 MRLELIHRHSPQVMGRPKTQLQRLKELVHSDSVRQLMILHKL--RGGQIP---RRKAKEV 55 Query: 659 NTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLT 838 + ++S R D + E+ M AADYGIGQY V F++G+P+QK ML+ADTGSDLT Sbjct: 56 LS-----SSSGRGSDD---AIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLT 107 Query: 839 WMNCRYRCRGASCR----RNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCAS 1006 WM+C+Y CR +C R R +R+F A+ SSSF+ +PC + CKI+L +LFSL C + Sbjct: 108 
WMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPT 167 Query: 1007 PMDPCAYDYRYSDGSAALGVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGV 1186 P+ PC YDYRYSDGS ALG F NETVT L GRK +LH+VL+GCSES +G+SFQ ADGV Sbjct: 168 PLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGV 227 Query: 1187 MGLGYSNYSFAVKAANKFGGKFSYCLVDHL 1276 MGLGYS YSFA+KAA KFGGKFSYCLVDHL Sbjct: 228 MGLGYSKYSFAIKAAEKFGGKFSYCLVDHL 257 >emb|CBI24128.3| unnamed protein product [Vitis vinifera] Length = 378 Score = 265 bits (678), Expect = 2e-68 Identities = 122/186 (65%), Positives = 146/186 (78%), Gaps = 4/186 (2%) Frame = +2 Query: 731 MRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCR----RNSRKR 898 M AADYGIGQY V F++G+P+QK ML+ADTGSDLTWM+C+Y CR +C R R + Sbjct: 1 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 60 Query: 899 RIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNE 1078 R+F A+ SSSF+ +PC + CKI+L +LFSL C +P+ PC YDYRYSDGS ALG F NE Sbjct: 61 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 120 Query: 1079 TVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSY 1258 TVT L GRK +LH+VL+GCSES +G+SFQ ADGVMGLGYS YSFA+KAA KFGGKFSY Sbjct: 121 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 180 Query: 1259 CLVDHL 1276 CLVDHL Sbjct: 181 CLVDHL 186 >gb|EOY14331.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] Length = 473 Score = 249 bits (637), Expect = 1e-63 Identities = 141/301 (46%), Positives = 182/301 (60%), Gaps = 20/301 (6%) Frame = +2 Query: 434 FLIIAVFIIINSAKLLEGHGGIKFELTHRR------NGATQLERLRQLLHSDTIRLRGIS 595 +LI +FI++ S +++ IK EL HR TQ ERL+ L+H D IR Sbjct: 4 WLIPLLFIVLPS--IVQAQDSIKLELLHRHAPQLHARPKTQHERLKDLVHHDFIR----- 56 Query: 596 EKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVR 775 + RRQ E P T +T K N + ++ + + D+GIGQY Sbjct: 57 ------------HNRRQAWET----PKTTTATASKT--NAAIQMPLSAGRDFGIGQYVTT 98 Query: 776 FRIGSPAQKLMLIADTGSDLTWMNCRYRC-RGASC---RRNSRKRRIFRADHSSSFRAVP 943 F++G+P+QK LI DTGSDLTW+NCRYRC RG +C R ++ 
R+FRA SSSFR +P Sbjct: 99 FKVGTPSQKFRLIVDTGSDLTWINCRYRCARGDNCTTQERGIKRGRVFRAHLSSSFRPIP 158 Query: 944 CSSSTCKIDLANLFSLARCASPMDPCAYDYR----------YSDGSAALGVFGNETVTFS 1093 C S CK++L NLFSL C +P+ PCAYDYR Y DGS A+GVF E+VT Sbjct: 159 CFSQMCKVELRNLFSLTICPTPLTPCAYDYRFNSLKLVLNRYIDGSDAMGVFAKESVTVG 218 Query: 1094 LTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSYCLVDH 1273 LTN R RLH VL+GCS+SS+GR+ + DGV+GL S YSF KAA ++GGKFSYCLVDH Sbjct: 219 LTNSRMARLHDVLIGCSDSSQGRTVKNVDGVLGLANSKYSFVTKAAERWGGKFSYCLVDH 278 Query: 1274 L 1276 L Sbjct: 279 L 279 >gb|EOX93240.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] Length = 478 Score = 247 bits (630), Expect = 9e-63 Identities = 132/302 (43%), Positives = 182/302 (60%), Gaps = 22/302 (7%) Frame = +2 Query: 437 LIIAVFIIINSAKLLEGHGG---------IKFELTHRRN-------GAT------QLERL 550 + +++F+ N + + H ++F+L HR + G T ER+ Sbjct: 8 VFLSLFLFFNHSFFFQAHASEAITPPNEKVRFKLIHRHSPELGEDHGTTLGPPTSTRERI 67 Query: 551 RQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSAELL 730 +QL+HSD RL IS+++ + + T+ S+ L EL Sbjct: 68 KQLVHSDNARLHTISQRLGPR--------------RMTFEMKMMGSSNL-------VELP 106 Query: 731 MRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSRKRRIFR 910 MRSAAD G GQYFV FR+GSP +K ++IADTGS LTWM C Y+C+ S R RIF Sbjct: 107 MRSAADIGTGQYFVSFRVGSPPKKFIMIADTGSSLTWMRCSYKCKNFSMDRTKLHERIFY 166 Query: 911 ADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNETVTF 1090 A+ S +F+ +PCSS CK++L+ FSLA C +PM PCAYDYRY+DG+ +G+FGN+TV Sbjct: 167 ANQSRTFKPIPCSSDVCKVELSQSFSLALCPTPMAPCAYDYRYADGTRVVGIFGNDTVKV 226 Query: 1091 SLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSYCLVD 1270 L+ G+K ++ V+VGCSE+ RG +F DGVMGLG+ +SFAVKAA +FG KFSYCLVD Sbjct: 227 RLSGGQKIKVTDVMVGCSEAIRG-NFHDIDGVMGLGFDQHSFAVKAAKEFGDKFSYCLVD 285 Query: 1271 HL 1276 HL Sbjct: 286 HL 287 >gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea] Length = 449 Score = 241 bits (616), Expect = 4e-61 Identities = 137/286 (47%), Positives = 172/286 (60%), Gaps = 1/286 (0%) Frame = +2 
Query: 422 QIGFFLIIAVFIIINSAKLLEGHGGIKFELTHRRNGATQLERLRQLLHSDTIRLRGISEK 601 +I +FL I +F NSA GIK +L HRR ++ R LL + G+ Sbjct: 3 KITYFLPIVLFFTANSA-------GIKLQLIHRRIKFSE----RSLLSG----VYGLQPM 47 Query: 602 VSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFR 781 N I R I+ + E+ M + AD GI QY V FR Sbjct: 48 SGNSNSRRNDRINRPIRFGGEIY----------------GEMPMYAGADLGIAQYLVAFR 91 Query: 782 IGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSRKRRIFRADHSSSFRAVPCSSSTC 961 +GSPAQ + LIADTGSDLTW C Y C G CRR+S R+F AD S+SF+ V CSS+TC Sbjct: 92 VGSPAQSVALIADTGSDLTWTKCSYGC-GGGCRRSSG--RLFDADRSTSFKTVECSSTTC 148 Query: 962 KIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNETVTFSLTNGR-KTRLHHVLVG 1138 +DLA FSL+RC+ P DPCAYDYRY+DGS+A G+F ETV L GR K RL +VL+G Sbjct: 149 TVDLAGAFSLSRCSPPSDPCAYDYRYADGSSAEGIFAGETVELKLAKGRGKARLQNVLIG 208 Query: 1139 CSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSYCLVDHL 1276 C+++ G SFQ +DGV+GLGYSN+SFA AA +FG KFSYCL+DHL Sbjct: 209 CTKNFSGSSFQTSDGVLGLGYSNFSFAHAAAARFGDKFSYCLLDHL 254 >gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis] Length = 464 Score = 231 bits (589), Expect = 5e-58 Identities = 106/190 (55%), Positives = 140/190 (73%), Gaps = 3/190 (1%) Frame = +2 Query: 716 SAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSRK 895 S + M + ADYG+G+YFV +G+P Q+ ML+ADTGSDLTWM+CR R + + Sbjct: 80 SIAMPMNAGADYGVGEYFVHVTVGTPGQRFMLVADTGSDLTWMHCRCGRRCGTHKGRLNN 139 Query: 896 RRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGN 1075 RR+F AD SSSF+ +PC S CK++LANLFSL++C +P+ PCAYDYRY +GS+A+G F N Sbjct: 140 RRVFHADRSSSFKTIPCLSEMCKVELANLFSLSKCPTPLTPCAYDYRYLEGSSAIGFFAN 199 Query: 1076 ETVTFSLTNGRKTRLHHVLVGCSESSRG---RSFQVADGVMGLGYSNYSFAVKAANKFGG 1246 ET++ L NG+K +L VLVGC+ES +G F+ ADGV+GLG+ N++F KAA FGG Sbjct: 200 ETISVRLANGKKRKLRDVLVGCTESVQGAEESGFKGADGVLGLGFGNHTFTRKAAQYFGG 259 Query: 1247 KFSYCLVDHL 1276 KFSYCLVDHL Sbjct: 260 KFSYCLVDHL 269 >ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Capsella rubella] gi|482566377|gb|EOA30566.1| hypothetical protein CARUB_v10013693mg 
[Capsella rubella] Length = 448 Score = 231 bits (588), Expect = 6e-58 Identities = 107/192 (55%), Positives = 137/192 (71%) Frame = +2 Query: 701 KDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCR 880 + Y ++ + S DYG QYF R+G+PA+K ++ DTGS+LTW+NC+YR RG Sbjct: 68 RKYKGGVKMPLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCKYRGRGKGRV 127 Query: 881 RNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAAL 1060 N RR+FRA+ S SFR V C + TCK+DL NLFSL+ C +P PC+YDYRY+DGSAA Sbjct: 128 EN---RRVFRAEESKSFRTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQ 184 Query: 1061 GVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKF 1240 G+F ETVT LTNGRK RLH +L+GCS S G+SF+ ADGV+GL +S++SF A + F Sbjct: 185 GIFAKETVTVGLTNGRKARLHGLLIGCSSSFSGQSFRGADGVLGLAFSDFSFTSTATSLF 244 Query: 1241 GGKFSYCLVDHL 1276 G KFSYCLVDHL Sbjct: 245 GAKFSYCLVDHL 256 >ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citrus clementina] gi|568881779|ref|XP_006493729.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Citrus sinensis] gi|557524190|gb|ESR35557.1| hypothetical protein CICLE_v10004908mg [Citrus clementina] Length = 470 Score = 229 bits (583), Expect = 2e-57 Identities = 127/305 (41%), Positives = 180/305 (59%), Gaps = 14/305 (4%) Frame = +2 Query: 404 MVTCRRQIGFFLIIAVFIIINSAKLLEGHGGIKFELTHRRNGA-------TQLERLRQLL 562 M+ RR I L+I II+ + ++ ++ EL HR + +++ER+++LL Sbjct: 2 MLKGRRPIFLVLVILFSNIIHFSSMVMVVA-VRMELIHRHSPKLNNMPMMSEVERMKELL 60 Query: 563 HSDTIRLRGISEKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSAELLMRSA 742 H+D IR N R++++ TN+ + E+ +++ Sbjct: 61 HNDIIRQ--------------NKRRGRRLRQ--------TNNNNNNGASGSAIEMPLQAG 98 Query: 743 ADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSR----KRRIFR 910 DYG G YFV ++G+P+QKL LI DTGS+ +W++CRY C G SC + +RR+F+ Sbjct: 99 RDYGTGMYFVEIKVGTPSQKLRLIVDTGSEFSWISCRYHC-GPSCTKKGTIAGSRRRVFK 157 Query: 911 ADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNETVTF 1090 AD SSSF+ +PCSS CK + A LFSL C +P PCAYDYRY+DGSAA G+FG E VT Sbjct: 158 
ADLSSSFKTIPCSSDMCKSEFARLFSLTFCPTPTSPCAYDYRYADGSAAKGIFGKERVTI 217 Query: 1091 SLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANK---FGGKFSYC 1261 L NG KTR+ V++GCS++ +G+ F ADGV+GL Y YSFA K N GKF+YC Sbjct: 218 GLENGGKTRIEEVVMGCSDTIQGQIFAEADGVLGLSYDKYSFAQKVTNGSTFARGKFAYC 277 Query: 1262 LVDHL 1276 LVDHL Sbjct: 278 LVDHL 282 >ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp. lyrata] gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp. lyrata] Length = 449 Score = 228 bits (581), Expect = 4e-57 Identities = 107/193 (55%), Positives = 137/193 (70%) Frame = +2 Query: 698 KKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASC 877 K+ + ++ + S DYG QYF R+G+PA+K ++ DTGS+LTW+NCRYR RG Sbjct: 66 KRKFKGGVKMDLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGK 125 Query: 878 RRNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAA 1057 +N RR+FRA+ S SF+ V C + TCK+DL NLFSL+ C +P PC+YDYRY+DGSAA Sbjct: 126 VKN---RRVFRAEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAA 182 Query: 1058 LGVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANK 1237 GVF ET+T LTNGRK RL +LVGCS S G+SFQ ADGV+GL +S++SF A + Sbjct: 183 QGVFAKETITVGLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSL 242 Query: 1238 FGGKFSYCLVDHL 1276 FG K SYCLVDHL Sbjct: 243 FGAKLSYCLVDHL 255 >ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana] gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis thaliana] gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana] Length = 461 Score = 226 bits (577), Expect = 1e-56 Identities = 106/193 (54%), Positives = 134/193 (69%) Frame = +2 Query: 698 KKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASC 877 K++ V ++ + S DYG QYF R+G+PA+K ++ DTGS+LTW+NCRYR RG Sbjct: 84 KRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKD- 142 Query: 878 RRNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAA 1057 RR+FRAD S SF+ V C + 
TCK+DL NLFSL C +P PC+YDYRY+DGSAA Sbjct: 143 -----NRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAA 197 Query: 1058 LGVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANK 1237 GVF ET+T LTNGR RL L+GCS S G+SFQ ADGV+GL +S++SF A + Sbjct: 198 QGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSL 257 Query: 1238 FGGKFSYCLVDHL 1276 +G KFSYCLVDHL Sbjct: 258 YGAKFSYCLVDHL 270 >gb|AAL49921.1| unknown protein [Arabidopsis thaliana] Length = 439 Score = 226 bits (577), Expect = 1e-56 Identities = 106/193 (54%), Positives = 134/193 (69%) Frame = +2 Query: 698 KKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASC 877 K++ V ++ + S DYG QYF R+G+PA+K ++ DTGS+LTW+NCRYR RG Sbjct: 62 KRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKD- 120 Query: 878 RRNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAA 1057 RR+FRAD S SF+ V C + TCK+DL NLFSL C +P PC+YDYRY+DGSAA Sbjct: 121 -----NRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAA 175 Query: 1058 LGVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANK 1237 GVF ET+T LTNGR RL L+GCS S G+SFQ ADGV+GL +S++SF A + Sbjct: 176 QGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSL 235 Query: 1238 FGGKFSYCLVDHL 1276 +G KFSYCLVDHL Sbjct: 236 YGAKFSYCLVDHL 248 >ref|XP_006407304.1| hypothetical protein EUTSA_v10020732mg [Eutrema salsugineum] gi|557108450|gb|ESQ48757.1| hypothetical protein EUTSA_v10020732mg [Eutrema salsugineum] Length = 444 Score = 226 bits (576), Expect = 2e-56 Identities = 106/186 (56%), Positives = 133/186 (71%) Frame = +2 Query: 719 AELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSRKR 898 A++ + S DYG QYF R+G+PA++ ++ DTGS+LTW+NCR+ +G R Sbjct: 72 AKMALGSGFDYGAAQYFAEVRVGTPAKRFRVVVDTGSELTWVNCRFHGKG------KENR 125 Query: 899 RIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNE 1078 R+FRA+ SSSFR V C + TCK+DL NLFSL+ C +P PC+YDYRY+DGSAA GVF E Sbjct: 126 RVFRAEESSSFRKVGCLTQTCKVDLMNLFSLSNCPTPSTPCSYDYRYADGSAAQGVFAKE 185 Query: 1079 
TVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSY 1258 T T LTNGRK +L +L+GCS S G SF+ ADGV+GL S+YSF KA N FGGKFSY Sbjct: 186 TFTVGLTNGRKAKLRGLLIGCSSSFSGDSFRGADGVLGLALSDYSFTSKATNIFGGKFSY 245 Query: 1259 CLVDHL 1276 CLVDHL Sbjct: 246 CLVDHL 251 >ref|XP_004293837.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Fragaria vesca subsp. vesca] Length = 482 Score = 224 bits (571), Expect = 6e-56 Identities = 117/273 (42%), Positives = 160/273 (58%), Gaps = 13/273 (4%) Frame = +2 Query: 497 IKFELTHRRN-----GATQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEKN 661 +K EL HR + TQLE + +L D IR + IS + ++ T +RR E Sbjct: 37 MKLELIHRHSLRVEMPKTQLELIEELQRHDVIRHQMISRRRQHHHHSIPTGLRRNALETA 96 Query: 662 TYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTW 841 S + + SA D+G GQYFV+ ++G+P+Q+ +LIADTGSDLTW Sbjct: 97 A-----------------SIAMPLSSAWDFGAGQYFVQIKVGTPSQRFLLIADTGSDLTW 139 Query: 842 MNCRYRCRGASC-----RRNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCAS 1006 M C+YRC C K+++FR SS+F+ +PCSS CK +L FS C + Sbjct: 140 MKCKYRCVADKCGLKRATMKKNKKKVFRPAQSSTFKIIPCSSEMCKFELE--FSRQECPT 197 Query: 1007 PMDPCAYDYRYSDGSAALGVFGNETVTFSLTNGRKTRLHHVLVGCSES---SRGRSFQVA 1177 P+ PC YDYRY++ S ALG F NETV LTNGR+ RL+ VL+GC+ES +G S + Sbjct: 198 PLSPCKYDYRYAESSGALGFFANETVRVPLTNGRRARLNDVLIGCTESIEGPKGASIRAG 257 Query: 1178 DGVMGLGYSNYSFAVKAANKFGGKFSYCLVDHL 1276 DG++GLG+ +SF KAA+ G KFSYCLVDH+ Sbjct: 258 DGILGLGFGKHSFVAKAASNLGDKFSYCLVDHM 290 >ref|XP_006429804.1| hypothetical protein CICLE_v10013820mg [Citrus clementina] gi|557531861|gb|ESR43044.1| hypothetical protein CICLE_v10013820mg [Citrus clementina] Length = 475 Score = 214 bits (546), Expect = 5e-53 Identities = 117/248 (47%), Positives = 157/248 (63%), Gaps = 3/248 (1%) Frame = +2 Query: 542 ERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSA 721 ER+RQL+ D R IS ++ ++ +I T+ N T N+ Sbjct: 65 ERIRQLIDGDIARQEMISRRLEDRRRRGRIRKASEISHHRTF-----NGTS-----NI-V 113 Query: 722 ELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRN--SRK 895 ++ +RS AD G+GQYFV FR+GSP 
QK +LIADTGSDLTWM+C ++ G +C ++ + Sbjct: 114 KIPLRSGADRGLGQYFVSFRVGSPPQKFVLIADTGSDLTWMHCNHK--GENCPKDGLTPP 171 Query: 896 RRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGN 1075 R+F+AD SS+F+ +PCSS TCK+DL + FSL+ C +P+ PCAYDY Y DGS G F N Sbjct: 172 NRMFQADASSTFKTIPCSSRTCKVDLQDTFSLSMCPTPVTPCAYDYSYFDGSKVRGFFAN 231 Query: 1076 ETVTF-SLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKF 1252 ETVT S+ +K RL V VGC++ + G +F ADGV+GLG+ SFA AA F KF Sbjct: 232 ETVTAGSIDRRKKVRLKEVTVGCTDWANG-NFHNADGVLGLGFGKNSFAATAAKLFDNKF 290 Query: 1253 SYCLVDHL 1276 SYCLVDHL Sbjct: 291 SYCLVDHL 298 >ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis] gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis] Length = 489 Score = 207 bits (528), Expect = 6e-51 Identities = 126/286 (44%), Positives = 168/286 (58%), Gaps = 4/286 (1%) Frame = +2 Query: 431 FFLIIAVFIIINSAKLLEGHGGIKFELTHRRNGATQLERLRQLLHSDTIRLRGISEKVSQ 610 FF + A F + +K + G+ FE+ H + +L+ + L RL G + + Sbjct: 22 FFQVDATFEFDDDSKN-NNNSGVWFEMFHMHS--PKLKSQSKFLGPPKSRLDGTRQLLQ- 77 Query: 611 KQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVS--AELLMRSAADYGIGQYFVRFRI 784 + N RRQ+ + TR +K + VS A++ + S AD G QYFV RI Sbjct: 78 -----SDNARRQM------ISSLRHGTR-RKAFEVSHTAQIPIHSGADSGQSQYFVSIRI 125 Query: 785 GSPA-QKLMLIADTGSDLTWMNCRYRCRGASCRR-NSRKRRIFRADHSSSFRAVPCSSST 958 G+P QK +L+ DTGSDLTWMNC Y C+ SC + N R+FRA+ SSSFR +PCSS Sbjct: 126 GTPRPQKFILVTDTGSDLTWMNCEYWCK--SCPKPNPHPGRVFRANDSSSFRTIPCSSDD 183 Query: 959 CKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNETVTFSLTNGRKTRLHHVLVG 1138 CKI+L + FSL C +P PC +DYRY +G A+GVF NETVT L + +K RL VL+G Sbjct: 184 CKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIRLFDVLIG 243 Query: 1139 CSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSYCLVDHL 1276 C+ES + DGVMGLGY +S A++ A FG KFSYCLVDHL Sbjct: 244 CTESF-NETNGFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHL 288 >gb|EMJ13046.1| hypothetical protein PRUPE_ppa004710mg [Prunus persica] Length = 495 Score = 204 bits (520), Expect = 5e-50 Identities 
= 110/253 (43%), Positives = 154/253 (60%), Gaps = 5/253 (1%) Frame = +2 Query: 533 TQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTR-LKKDY 709 TQ +++L D RL+ +++K Q + N +NSTR + Sbjct: 63 TQQALIQELHRHDVFRLQMMAQKRQQNGHDQGLNSSSS-----------SNSTRRMDMQT 111 Query: 710 NVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCR-RN 886 +S + M + DYGIGQY V+ ++G+PAQK +I TGSDLTW+ C C G SC R Sbjct: 112 RLSVTMPMNAGWDYGIGQYLVKLKLGTPAQKFTVIPSTGSDLTWVRCGSHC-GKSCGIRK 170 Query: 887 SR--KRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAAL 1060 R R+F D SS+F++V CSS C+ DLAN SL +C P+ PC YDY Y +GS+AL Sbjct: 171 GRIDHSRVFNTDRSSTFKSVTCSSKMCEFDLANFNSLNKCPRPLSPCRYDYSYVEGSSAL 230 Query: 1061 GVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGR-SFQVADGVMGLGYSNYSFAVKAANK 1237 G FG + V SL+NGR+ R+ VL+GC+ES G+ + + +DG++GLG+ YSF KAA K Sbjct: 231 GTFGTDIVRASLSNGRRNRMKDVLIGCTESIIGKGTAKGSDGILGLGFGKYSFTTKAALK 290 Query: 1238 FGGKFSYCLVDHL 1276 +GGK SYCL+DH+ Sbjct: 291 YGGKVSYCLLDHM 303 >gb|EAY75371.1| hypothetical protein OsI_03267 [Oryza sativa Indica Group] Length = 484 Score = 198 bits (503), Expect = 5e-48 Identities = 102/194 (52%), Positives = 131/194 (67%), Gaps = 12/194 (6%) Frame = +2 Query: 731 MRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCR-RNSRK---- 895 + S A G GQYFVRFR+G+PAQ +L+ADTGSDLTW+ C AS RN+ Sbjct: 76 LSSGAYTGTGQYFVRFRVGTPAQPFLLVADTGSDLTWVKCHRAAAAASASPRNASSLPAP 135 Query: 896 -----RRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAAL 1060 RR FR D S ++ +PCSS+TC+ L FSLA CA+P +PCAYDYRY DGSAA Sbjct: 136 APASPRRTFRPDKSRTWAPIPCSSATCRESLP--FSLAACATPANPCAYDYRYKDGSAAR 193 Query: 1061 GVFGNETVTFSLTN--GRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAAN 1234 G G ++ T +L+ RK +L V++GC+ S G+SF +DGV+ LGYSN SFA +AA+ Sbjct: 194 GTVGVDSATIALSGRAARKAKLRGVVLGCTTSYNGQSFLASDGVLSLGYSNISFASRAAS 253 Query: 1235 KFGGKFSYCLVDHL 1276 +FGG+FSYCLVDHL Sbjct: 254 RFGGRFSYCLVDHL 267 >ref|XP_006859053.1| hypothetical protein AMTR_s00068p00192210 [Amborella trichopoda] gi|548863165|gb|ERN20520.1| hypothetical protein 
AMTR_s00068p00192210 [Amborella trichopoda] Length = 500 Score = 197 bits (500), Expect = 1e-47 Identities = 120/276 (43%), Positives = 153/276 (55%), Gaps = 16/276 (5%) Frame = +2 Query: 497 IKFELTHRR---------NGA--TQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRR 643 IK L HR NGA ++L+ LR+LLH D +R + I Sbjct: 48 IKLHLLHRHGRELRGNPTNGAPPSKLDDLRELLHHDQLRKQMIH---------------- 91 Query: 644 QIQEKNTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADT 823 ++ R + V A + + S A G GQYFV+FR G+P Q L+L+ADT Sbjct: 92 -------------SALRGRSRGGVGAAMSISSGAFAGTGQYFVKFRAGTPPQNLLLVADT 138 Query: 824 GSDLTWMNCRYRCRGASCRRNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCA 1003 GSDLTWMNCR+R + R+FRA SSSF + CS+ +C FSL C Sbjct: 139 GSDLTWMNCRFRPKTRVFSPRINGTRVFRASSSSSFSPLLCSAPSCP---TLPFSLTACP 195 Query: 1004 SPMDPCAYDYRYSDGSAALGVFGNETVTFSLT--NGR---KTRLHHVLVGCSESSRGRSF 1168 + PC YDYRY DGS A G F NE+VT S NGR RL H+L+GCS++ +GRSF Sbjct: 196 TASTPCRYDYRYVDGSFARGFFANESVTLSAVKPNGRHDGNVRLRHLLIGCSDAFQGRSF 255 Query: 1169 QVADGVMGLGYSNYSFAVKAANKFGGKFSYCLVDHL 1276 + ADGV+GLG S SFAV+ + +F GKFSYCLVDHL Sbjct: 256 KEADGVLGLGQSAVSFAVQLSRRFDGKFSYCLVDHL 291 >gb|AFW79104.1| hypothetical protein ZEAMMB73_209101 [Zea mays] Length = 480 Score = 193 bits (491), Expect = 1e-46 Identities = 100/189 (52%), Positives = 128/189 (67%), Gaps = 7/189 (3%) Frame = +2 Query: 731 MRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSRKRRIFR 910 + S A G GQYFVRFR+G+PAQ +L+ADTGSDLTW+ +C GA RR+FR Sbjct: 101 LSSGAYTGTGQYFVRFRVGTPAQPFVLVADTGSDLTWV----KCSGAGDGTGDAPRRVFR 156 Query: 911 ADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNETVTF 1090 A S S+ + CSS TC + FSLA C+SP PCAYDYRY+DGSAA GV G ++ T Sbjct: 157 AAASRSWAPIACSSDTCTSYVP--FSLANCSSPASPCAYDYRYNDGSAARGVVGTDSATI 214 Query: 1091 SLTN-------GRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGK 1249 +L+ GR+ +L V++GC+ S G+SFQ +DGV+ LG SN SFA +AA +FGG+ Sbjct: 215 ALSGSESRDGGGRRAKLQGVVLGCTASYDGQSFQSSDGVLSLGNSNISFASRAAARFGGR 274 Query: 1250 FSYCLVDHL 1276 FSYCLVDHL Sbjct: 275 FSYCLVDHL 283