BLASTX nr result
ID: Rehmannia23_contig00005808
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia23_contig00005808
         (1762 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera]  542   e-151
ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2...  539   e-150
emb|CBI24128.3| unnamed protein product [Vitis vinifera]             523   e-145
gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea]      461   e-127
gb|EOY14331.1| Eukaryotic aspartyl protease family protein, puta...  456   e-125
gb|EOX93240.1| Eukaryotic aspartyl protease family protein, puta...  454   e-125
ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citr...  445   e-122
ref|XP_004293837.1| PREDICTED: aspartic proteinase nepenthesin-1...  424   e-116
gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis]   423   e-115
ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Caps...  409   e-111
ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arab...  408   e-111
ref|NP_187876.2| aspartyl protease family protein [Arabidopsis t...  404   e-110
gb|AAL49921.1| unknown protein [Arabidopsis thaliana]                404   e-110
ref|XP_006407304.1| hypothetical protein EUTSA_v10020732mg [Eutr...  404   e-110
gb|EMJ13046.1| hypothetical protein PRUPE_ppa004710mg [Prunus pe...  394   e-107
ref|XP_006429804.1| hypothetical protein CICLE_v10013820mg [Citr...  386   e-104
ref|XP_002518395.1| Aspartic proteinase Asp1 precursor, putative...  381   e-103
ref|XP_004488613.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...  364   7e-98
ref|XP_003532899.1| PREDICTED: protein ASPARTIC PROTEASE IN GUAR...
359 2e-96 ref|XP_006859053.1| hypothetical protein AMTR_s00068p00192210 [A... 353 2e-94 >emb|CAN65806.1| hypothetical protein VITISV_015630 [Vitis vinifera] Length = 449 Score = 542 bits (1396), Expect = e-151 Identities = 271/461 (58%), Positives = 337/461 (73%), Gaps = 10/461 (2%) Frame = +2 Query: 290 IKFELTHRRNGA------TQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEK 451 ++ EL HR + TQL+RL++L+HSD++R I K+ + G + RR+ +E Sbjct: 1 MRLELIHRHSPQVMGRPKTQLQRLKELVHSDSVRQLMILHKL--RGGQIP---RRKAKEV 55 Query: 452 NTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLT 631 + ++S R D + E+ M AADYGIGQYFV F++G+P+QK ML+ADTGSDLT Sbjct: 56 LS-----SSSGRGSDD---AIEVPMHPAADYGIGQYFVAFKVGTPSQKFMLVADTGSDLT 107 Query: 632 WMNCRYRCRGASCR----RNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCAS 799 WM+C+Y CR +C R R +R+F A+ SSSF+ +PC + CKI+L +LFSL C + Sbjct: 108 WMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPT 167 Query: 800 PMDPCAYDYRYSDGSAALGVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGV 979 P+ PC YDYRYSDGS ALG F NETVT L GRK +LH+VL+GCSES +G+SFQ ADGV Sbjct: 168 PLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGV 227 Query: 980 MGLGYSNYSFAVKAANKFGGKFSYCLVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELI 1159 MGLGYS YSFA+KAA KFGGKFSYCLVDHLS KN+S+YL FGS + N M YTEL+ Sbjct: 228 MGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELV 287 Query: 1160 LGVINPFYAVNIKGISIGGSMLDIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAAL 1339 LG++N FYAVN+ GISIGG+ML IP+E W++ LT LT+PAYQPVMAAL Sbjct: 288 LGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAAL 347 Query: 1340 RLSLVNFKNLNLDIGPVEYCFNSTGFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVK 1519 R+SL+ F+ + +DIGP+EYCFNSTGF ESLVPRLVFHF+DGA FEPPVKSYVI AA GV+ Sbjct: 348 RVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVR 407 Query: 1520 CLGFVPAAWPGASVIGNIMQQNYFWEFDLANGRLGFATSSC 1642 CLGFV AWPG SV+GNIMQQN+ WEFDL +LGFA SSC Sbjct: 408 CLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448 >ref|XP_002265771.2| PREDICTED: aspartic proteinase nepenthesin-2-like [Vitis vinifera] Length = 449 Score = 
539 bits (1388), Expect = e-150 Identities = 270/461 (58%), Positives = 336/461 (72%), Gaps = 10/461 (2%) Frame = +2 Query: 290 IKFELTHRRNGA------TQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEK 451 ++ EL HR + TQL+RL++L+HSD++R I K+ + G + RR+ +E Sbjct: 1 MRLELIHRHSPQVMGRPKTQLQRLKELVHSDSVRQLMILHKL--RGGQIP---RRKAKEV 55 Query: 452 NTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLT 631 + ++S R D + E+ M AADYGIGQY V F++G+P+QK ML+ADTGSDLT Sbjct: 56 LS-----SSSGRGSDD---AIEVPMHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLT 107 Query: 632 WMNCRYRCRGASCR----RNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCAS 799 WM+C+Y CR +C R R +R+F A+ SSSF+ +PC + CKI+L +LFSL C + Sbjct: 108 WMSCKYHCRSRNCSNRKARRIRHKRVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPT 167 Query: 800 PMDPCAYDYRYSDGSAALGVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGV 979 P+ PC YDYRYSDGS ALG F NETVT L GRK +LH+VL+GCSES +G+SFQ ADGV Sbjct: 168 PLTPCGYDYRYSDGSTALGFFANETVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGV 227 Query: 980 MGLGYSNYSFAVKAANKFGGKFSYCLVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELI 1159 MGLGYS YSFA+KAA KFGGKFSYCLVDHLS KN+S+YL FGS + N M YTEL+ Sbjct: 228 MGLGYSKYSFAIKAAEKFGGKFSYCLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELV 287 Query: 1160 LGVINPFYAVNIKGISIGGSMLDIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAAL 1339 LG++N FYAVN+ GISIGG+ML IP+E W++ LT LT+PAYQPVMAAL Sbjct: 288 LGMVNSFYAVNMMGISIGGAMLKIPSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAAL 347 Query: 1340 RLSLVNFKNLNLDIGPVEYCFNSTGFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVK 1519 R+SL+ F+ + +DIGP+EYCFNSTGF ESLVPRLVFHF+DGA FEPPVKSYVI AA GV+ Sbjct: 348 RVSLLKFRKVEMDIGPLEYCFNSTGFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVR 407 Query: 1520 CLGFVPAAWPGASVIGNIMQQNYFWEFDLANGRLGFATSSC 1642 CLGFV AWPG SV+GNIMQQN+ WEFDL +LGFA SSC Sbjct: 408 CLGFVSVAWPGTSVVGNIMQQNHLWEFDLGLKKLGFAPSSC 448 >emb|CBI24128.3| unnamed protein product [Vitis vinifera] Length = 378 Score = 523 bits (1346), Expect = e-145 Identities = 246/377 (65%), Positives = 293/377 (77%), Gaps = 4/377 (1%) Frame = +2 Query: 524 MRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCR----RNSRKR 691 M AADYGIGQY V 
F++G+P+QK ML+ADTGSDLTWM+C+Y CR +C R R + Sbjct: 1 MHPAADYGIGQYSVAFKVGTPSQKFMLVADTGSDLTWMSCKYHCRSRNCSNRKARRIRHK 60 Query: 692 RIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNE 871 R+F A+ SSSF+ +PC + CKI+L +LFSL C +P+ PC YDYRYSDGS ALG F NE Sbjct: 61 RVFHANLSSSFKTIPCLTDMCKIELMDLFSLTNCPTPLTPCGYDYRYSDGSTALGFFANE 120 Query: 872 TVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSY 1051 TVT L GRK +LH+VL+GCSES +G+SFQ ADGVMGLGYS YSFA+KAA KFGGKFSY Sbjct: 121 TVTVELKEGRKMKLHNVLIGCSESFQGQSFQAADGVMGLGYSKYSFAIKAAEKFGGKFSY 180 Query: 1052 CLVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAVNIKGISIGGSMLDI 1231 CLVDHLS KN+S+YL FGS + N M YTEL+LG++N FYAVN+ GISIGG+ML I Sbjct: 181 CLVDHLSHKNVSNYLTFGSSRSKEALLNNMTYTELVLGMVNSFYAVNMMGISIGGAMLKI 240 Query: 1232 PAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPVEYCFNST 1411 P+E W++ LT LT+PAYQPVMAALR+SL+ F+ + +DIGP+EYCFNST Sbjct: 241 PSEVWDVKGAGGTILDSGSSLTFLTEPAYQPVMAALRVSLLKFRKVEMDIGPLEYCFNST 300 Query: 1412 GFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGNIMQQNYF 1591 GF ESLVPRLVFHF+DGA FEPPVKSYVI AA GV+CLGFV AWPG SV+GNIMQQN+ Sbjct: 301 GFEESLVPRLVFHFADGAEFEPPVKSYVISAADGVRCLGFVSVAWPGTSVVGNIMQQNHL 360 Query: 1592 WEFDLANGRLGFATSSC 1642 WEFDL +LGFA SSC Sbjct: 361 WEFDLGLKKLGFAPSSC 377 >gb|EPS68033.1| hypothetical protein M569_06741 [Genlisea aurea] Length = 449 Score = 461 bits (1186), Expect = e-127 Identities = 252/480 (52%), Positives = 311/480 (64%), Gaps = 4/480 (0%) Frame = +2 Query: 215 QIGFFLIIAVFIIINSAKLLEGHGGIKFELTHRRNGATQLERLRQLLHSDTIRLRGISEK 394 +I +FL I +F NSA GIK +L HRR ++ R LL + G+ Sbjct: 3 KITYFLPIVLFFTANSA-------GIKLQLIHRRIKFSE----RSLLSG----VYGLQPM 47 Query: 395 VSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFR 574 N I R I+ + E+ M + AD GI QY V FR Sbjct: 48 SGNSNSRRNDRINRPIRFGGEIY----------------GEMPMYAGADLGIAQYLVAFR 91 Query: 575 IGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSRKRRIFRADHSSSFRAVPCSSSTC 754 +GSPAQ + LIADTGSDLTW C Y C G CRR+S R+F AD S+SF+ V CSS+TC Sbjct: 92 
VGSPAQSVALIADTGSDLTWTKCSYGC-GGGCRRSSG--RLFDADRSTSFKTVECSSTTC 148 Query: 755 KIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNETVTFSLTNGR-KTRLHHVLVG 931 +DLA FSL+RC+ P DPCAYDYRY+DGS+A G+F ETV L GR K RL +VL+G Sbjct: 149 TVDLAGAFSLSRCSPPSDPCAYDYRYADGSSAEGIFAGETVELKLAKGRGKARLQNVLIG 208 Query: 932 CSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSYCLVDHLSPKNISSYLIFGSH 1111 C+++ G SFQ +DGV+GLGYSN+SFA AA +FG KFSYCL+DHL+ KN SSY+ F S Sbjct: 209 CTKNFSGSSFQTSDGVLGLGYSNFSFAHAAAARFGDKFSYCLLDHLAAKNKSSYITFSSG 268 Query: 1112 KEVN--ISFNRMKYTELILGVINPFYAVNIKGISIGGSMLDIPAETWN-LXXXXXXXXXX 1282 + ++ IS ++YT+L+LGVI YAVN++GISIGGS L IP++TWN L Sbjct: 269 RSISASISAGPIRYTDLVLGVIGSNYAVNVRGISIGGSWLRIPSDTWNNLSGSGGVIIDS 328 Query: 1283 XXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPVEYCFNSTGFSESLVPRLVFHFSDG 1462 LT L PAY PV+AAL SL F + ++ IGP+E CFNSTGF ES+VP+L HF+ G Sbjct: 329 GSSLTALAPPAYAPVIAALNRSLARFGDPHVKIGPMECCFNSTGFHESVVPKLAIHFAGG 388 Query: 1463 ARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGNIMQQNYFWEFDLANGRLGFATSSC 1642 RFEPPVKSYVIDAAPGV CLGFV AA PG SVIGNI+QQN++WEFDL N RLGFA S C Sbjct: 389 TRFEPPVKSYVIDAAPGVVCLGFVQAASPGVSVIGNILQQNHWWEFDLGNRRLGFAASDC 448 >gb|EOY14331.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] Length = 473 Score = 456 bits (1172), Expect = e-125 Identities = 244/492 (49%), Positives = 313/492 (63%), Gaps = 20/492 (4%) Frame = +2 Query: 227 FLIIAVFIIINSAKLLEGHGGIKFELTHRR------NGATQLERLRQLLHSDTIRLRGIS 388 +LI +FI++ S +++ IK EL HR TQ ERL+ L+H D IR Sbjct: 4 WLIPLLFIVLPS--IVQAQDSIKLELLHRHAPQLHARPKTQHERLKDLVHHDFIR----- 56 Query: 389 EKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVR 568 + RRQ E P T +T K N + ++ + + D+GIGQY Sbjct: 57 ------------HNRRQAWET----PKTTTATASKT--NAAIQMPLSAGRDFGIGQYVTT 98 Query: 569 FRIGSPAQKLMLIADTGSDLTWMNCRYRC-RGASC---RRNSRKRRIFRADHSSSFRAVP 736 F++G+P+QK LI DTGSDLTW+NCRYRC RG +C R ++ R+FRA SSSFR +P Sbjct: 99 FKVGTPSQKFRLIVDTGSDLTWINCRYRCARGDNCTTQERGIKRGRVFRAHLSSSFRPIP 158 Query: 737 CSSSTCKIDLANLFSLARCASPMDPCAYDYR----------YSDGSAALGVFGNETVTFS 886 C S CK++L 
NLFSL C +P+ PCAYDYR Y DGS A+GVF E+VT Sbjct: 159 CFSQMCKVELRNLFSLTICPTPLTPCAYDYRFNSLKLVLNRYIDGSDAMGVFAKESVTVG 218 Query: 887 LTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSYCLVDH 1066 LTN R RLH VL+GCS+SS+GR+ + DGV+GL S YSF KAA ++GGKFSYCLVDH Sbjct: 219 LTNSRMARLHDVLIGCSDSSQGRTVKNVDGVLGLANSKYSFVTKAAERWGGKFSYCLVDH 278 Query: 1067 LSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAVNIKGISIGGSMLDIPAETW 1246 LS N S+YLIFG++ +YT L L +++ YAVN++GISIGG MLDIP + W Sbjct: 279 LSHINASNYLIFGANNNQLTVLGNTRYTRLELNLVSFSYAVNVQGISIGGKMLDIPLQVW 338 Query: 1247 NLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPVEYCFNSTGFSES 1426 + L+ LT PAYQPVMAA+++S+ + + L P+EYCFNSTGF E+ Sbjct: 339 DTRKGGGTILDSGTSLSFLTDPAYQPVMAAIKMSVSKYPQVKLHGVPMEYCFNSTGFDET 398 Query: 1427 LVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGNIMQQNYFWEFDL 1606 LVP+L+ HF+DGARFEP +SYVI AA GV+CLGF+PA +P SVIGNIMQQNY WEFDL Sbjct: 399 LVPKLIIHFADGARFEPHWRSYVISAADGVRCLGFLPARFPSVSVIGNIMQQNYLWEFDL 458 Query: 1607 ANGRLGFATSSC 1642 +L FA SSC Sbjct: 459 EGNKLRFAPSSC 470 >gb|EOX93240.1| Eukaryotic aspartyl protease family protein, putative [Theobroma cacao] Length = 478 Score = 454 bits (1168), Expect = e-125 Identities = 232/493 (47%), Positives = 314/493 (63%), Gaps = 22/493 (4%) Frame = +2 Query: 230 LIIAVFIIINSAKLLEGHGG---------IKFELTHRRN-------GAT------QLERL 343 + +++F+ N + + H ++F+L HR + G T ER+ Sbjct: 8 VFLSLFLFFNHSFFFQAHASEAITPPNEKVRFKLIHRHSPELGEDHGTTLGPPTSTRERI 67 Query: 344 RQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSAELL 523 +QL+HSD RL IS+++ + + T+ S+ L EL Sbjct: 68 KQLVHSDNARLHTISQRLGPR--------------RMTFEMKMMGSSNL-------VELP 106 Query: 524 MRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSRKRRIFR 703 MRSAAD G GQYFV FR+GSP +K ++IADTGS LTWM C Y+C+ S R RIF Sbjct: 107 MRSAADIGTGQYFVSFRVGSPPKKFIMIADTGSSLTWMRCSYKCKNFSMDRTKLHERIFY 166 Query: 704 ADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNETVTF 883 A+ S +F+ +PCSS CK++L+ FSLA C +PM PCAYDYRY+DG+ +G+FGN+TV Sbjct: 167 
ANQSRTFKPIPCSSDVCKVELSQSFSLALCPTPMAPCAYDYRYADGTRVVGIFGNDTVKV 226 Query: 884 SLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSYCLVD 1063 L+ G+K ++ V+VGCSE+ RG +F DGVMGLG+ +SFAVKAA +FG KFSYCLVD Sbjct: 227 RLSGGQKIKVTDVMVGCSEAIRG-NFHDIDGVMGLGFDQHSFAVKAAKEFGDKFSYCLVD 285 Query: 1064 HLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAVNIKGISIGGSMLDIPAET 1243 HLSP N+ ++L+FG + M++T+LILG++NP+YAVN+ GIS+ G MLDIP+ Sbjct: 286 HLSPSNLVNFLVFGG--VTSSPLPNMQFTQLILGIVNPYYAVNVSGISVNGKMLDIPSYI 343 Query: 1244 WNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPVEYCFNSTGFSE 1423 W++ LT L +P + V+AA + L FK L L++GP +YCF++ GF E Sbjct: 344 WDVKGDGGVIMDSGSSLTYLVKPLFDKVIAAFQAPLSKFKKLELNLGP-DYCFSAAGFEE 402 Query: 1424 SLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGNIMQQNYFWEFD 1603 SL+P+L FHF+DGA+ PPVKSYVIDA VKCLGF +WPG SVIGNI+QQN+ WEFD Sbjct: 403 SLMPKLAFHFADGAKLVPPVKSYVIDAEEAVKCLGFSSTSWPGPSVIGNILQQNHLWEFD 462 Query: 1604 LANGRLGFATSSC 1642 L N RLGFA SSC Sbjct: 463 LLNSRLGFAASSC 475 >ref|XP_006422317.1| hypothetical protein CICLE_v10004908mg [Citrus clementina] gi|568881779|ref|XP_006493729.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Citrus sinensis] gi|557524190|gb|ESR35557.1| hypothetical protein CICLE_v10004908mg [Citrus clementina] Length = 470 Score = 445 bits (1145), Expect = e-122 Identities = 239/496 (48%), Positives = 316/496 (63%), Gaps = 14/496 (2%) Frame = +2 Query: 197 MVTCRRQIGFFLIIAVFIIINSAKLLEGHGGIKFELTHRRNGA-------TQLERLRQLL 355 M+ RR I L+I II+ + ++ ++ EL HR + +++ER+++LL Sbjct: 2 MLKGRRPIFLVLVILFSNIIHFSSMVMVVA-VRMELIHRHSPKLNNMPMMSEVERMKELL 60 Query: 356 HSDTIRLRGISEKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSAELLMRSA 535 H+D IR N R++++ TN+ + E+ +++ Sbjct: 61 HNDIIRQ--------------NKRRGRRLRQ--------TNNNNNNGASGSAIEMPLQAG 98 Query: 536 ADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSR----KRRIFR 703 DYG G YFV ++G+P+QKL LI DTGS+ +W++CRY C G SC + +RR+F+ Sbjct: 99 RDYGTGMYFVEIKVGTPSQKLRLIVDTGSEFSWISCRYHC-GPSCTKKGTIAGSRRRVFK 157 Query: 704 
ADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNETVTF 883 AD SSSF+ +PCSS CK + A LFSL C +P PCAYDYRY+DGSAA G+FG E VT Sbjct: 158 ADLSSSFKTIPCSSDMCKSEFARLFSLTFCPTPTSPCAYDYRYADGSAAKGIFGKERVTI 217 Query: 884 SLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANK---FGGKFSYC 1054 L NG KTR+ V++GCS++ +G+ F ADGV+GL Y YSFA K N GKF+YC Sbjct: 218 GLENGGKTRIEEVVMGCSDTIQGQIFAEADGVLGLSYDKYSFAQKVTNGSTFARGKFAYC 277 Query: 1055 LVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAVNIKGISIGGSMLDIP 1234 LVDHLS KN+S+YLIFG +E RM+YT +LG+I P Y V++KGISIGG ML+IP Sbjct: 278 LVDHLSHKNVSNYLIFG--EESKRMRMRMRYT--LLGLIGPDYGVSVKGISIGGVMLNIP 333 Query: 1235 AETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPVEYCFNSTG 1414 ++ W+ LT L +PAY+PV+AAL +SL ++ L D P EYCFNSTG Sbjct: 334 SQVWDFNRGGGTAFDSGTTLTFLAEPAYKPVVAALEMSLSRYQRLKRD-APFEYCFNSTG 392 Query: 1415 FSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGNIMQQNYFW 1594 F ES VP+LVFHF+DGARFEP KSY+I A G++CLGFV A WPGAS IGNIMQQNYFW Sbjct: 393 FDESSVPKLVFHFADGARFEPHTKSYIIRVAHGIRCLGFVSATWPGASAIGNIMQQNYFW 452 Query: 1595 EFDLANGRLGFATSSC 1642 EFDL RLGFA S+C Sbjct: 453 EFDLLKDRLGFAPSTC 468 >ref|XP_004293837.1| PREDICTED: aspartic proteinase nepenthesin-1-like [Fragaria vesca subsp. 
vesca] Length = 482 Score = 424 bits (1089), Expect = e-116 Identities = 225/466 (48%), Positives = 285/466 (61%), Gaps = 15/466 (3%) Frame = +2 Query: 290 IKFELTHRRN-----GATQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEKN 454 +K EL HR + TQLE + +L D IR + IS + ++ T +RR E Sbjct: 37 MKLELIHRHSLRVEMPKTQLELIEELQRHDVIRHQMISRRRQHHHHSIPTGLRRNALETA 96 Query: 455 TYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTW 634 S + + SA D+G GQYFV+ ++G+P+Q+ +LIADTGSDLTW Sbjct: 97 A-----------------SIAMPLSSAWDFGAGQYFVQIKVGTPSQRFLLIADTGSDLTW 139 Query: 635 MNCRYRCRGASC-----RRNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCAS 799 M C+YRC C K+++FR SS+F+ +PCSS CK +L FS C + Sbjct: 140 MKCKYRCVADKCGLKRATMKKNKKKVFRPAQSSTFKIIPCSSEMCKFELE--FSRQECPT 197 Query: 800 PMDPCAYDYRYSDGSAALGVFGNETVTFSLTNGRKTRLHHVLVGCSES---SRGRSFQVA 970 P+ PC YDYRY++ S ALG F NETV LTNGR+ RL+ VL+GC+ES +G S + Sbjct: 198 PLSPCKYDYRYAESSGALGFFANETVRVPLTNGRRARLNDVLIGCTESIEGPKGASIRAG 257 Query: 971 DGVMGLGYSNYSFAVKAANKFGGKFSYCLVDHLSPKNISSYLIFGSHKEVNISFNRMKYT 1150 DG++GLG+ +SF KAA+ G KFSYCLVDH+S KN+SSYL FG + E +RM+YT Sbjct: 258 DGILGLGFGKHSFVAKAASNLGDKFSYCLVDHMSNKNVSSYLTFGRNAETAQQNSRMRYT 317 Query: 1151 ELILG--VINPFYAVNIKGISIGGSMLDIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQP 1324 +L LG I PFYAVN+ GIS G ML IP E WN LT LT PAY Sbjct: 318 KLALGGPKIGPFYAVNLVGISAGSKMLKIPNEVWNENLGGGTIVDSGTSLTFLTSPAYIH 377 Query: 1325 VMAALRLSLVNFKNLNLDIGPVEYCFNSTGFSESLVPRLVFHFSDGARFEPPVKSYVIDA 1504 VM L ++L +K + D E+CFNSTG+ +SLVPR HF+DGA+FEPPVKSYVID Sbjct: 378 VMDELTMALSKYKKIPSDA--FEFCFNSTGYDQSLVPRFAIHFADGAKFEPPVKSYVIDV 435 Query: 1505 APGVKCLGFVPAAWPGASVIGNIMQQNYFWEFDLANGRLGFATSSC 1642 A KCLGF A +PG VIGNIMQQNY WEFDL GRLG+A SSC Sbjct: 436 AIQTKCLGFQSAPFPGTIVIGNIMQQNYLWEFDLRGGRLGYAPSSC 481 >gb|EXB51212.1| Aspartic proteinase nepenthesin-1 [Morus notabilis] Length = 464 Score = 423 bits (1087), Expect = e-115 Identities = 209/384 (54%), Positives = 266/384 (69%), Gaps = 6/384 (1%) Frame = +2 Query: 509 SAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSRK 688 S + M + 
ADYG+G+YFV +G+P Q+ ML+ADTGSDLTWM+CR R + + Sbjct: 80 SIAMPMNAGADYGVGEYFVHVTVGTPGQRFMLVADTGSDLTWMHCRCGRRCGTHKGRLNN 139 Query: 689 RRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGN 868 RR+F AD SSSF+ +PC S CK++LANLFSL++C +P+ PCAYDYRY +GS+A+G F N Sbjct: 140 RRVFHADRSSSFKTIPCLSEMCKVELANLFSLSKCPTPLTPCAYDYRYLEGSSAIGFFAN 199 Query: 869 ETVTFSLTNGRKTRLHHVLVGCSESSRG---RSFQVADGVMGLGYSNYSFAVKAANKFGG 1039 ET++ L NG+K +L VLVGC+ES +G F+ ADGV+GLG+ N++F KAA FGG Sbjct: 200 ETISVRLANGKKRKLRDVLVGCTESVQGAEESGFKGADGVLGLGFGNHTFTRKAAQYFGG 259 Query: 1040 KFSYCLVDHLSPKNISSYLIFGSHKEVNIS-FNRMKYTELIL-GVINPFYAVNIKGISIG 1213 KFSYCLVDHLSPKN+S+Y+IFG K S + +++T+L+L G PFY VN+ GISIG Sbjct: 260 KFSYCLVDHLSPKNLSNYIIFGHDKADKASCSSSLQHTDLVLGGDYGPFYGVNLSGISIG 319 Query: 1214 GSMLDIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKN-LNLDIGPV 1390 G +L IP+ WN LT LT P Y PV + L F L GP Sbjct: 320 GVLLRIPSVAWNASLGGGAILESGTSLTFLTDPVYGPVTSELNKFTSRFGTLLPPGGGPF 379 Query: 1391 EYCFNSTGFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGN 1570 E+CFNSTG+ ES +P L HFS+GA FEPPVKSY++D AP KCLGFV A+WPG S+IGN Sbjct: 380 EFCFNSTGYDESKMPPLRIHFSNGAIFEPPVKSYILDIAPEKKCLGFVSASWPGTSIIGN 439 Query: 1571 IMQQNYFWEFDLANGRLGFATSSC 1642 IMQQN+ WEFDL N RLGFA S+C Sbjct: 440 IMQQNHLWEFDLENTRLGFAPSTC 463 >ref|XP_006297668.1| hypothetical protein CARUB_v10013693mg [Capsella rubella] gi|482566377|gb|EOA30566.1| hypothetical protein CARUB_v10013693mg [Capsella rubella] Length = 448 Score = 409 bits (1051), Expect = e-111 Identities = 203/384 (52%), Positives = 260/384 (67%), Gaps = 1/384 (0%) Frame = +2 Query: 494 KDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCR 673 + Y ++ + S DYG QYF R+G+PA+K ++ DTGS+LTW+NC+YR RG Sbjct: 68 RKYKGGVKMPLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCKYRGRGKGRV 127 Query: 674 RNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAAL 853 N RR+FRA+ S SFR V C + TCK+DL NLFSL+ C +P PC+YDYRY+DGSAA Sbjct: 128 EN---RRVFRAEESKSFRTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAAQ 184 Query: 854 
GVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKF 1033 G+F ETVT LTNGRK RLH +L+GCS S G+SF+ ADGV+GL +S++SF A + F Sbjct: 185 GIFAKETVTVGLTNGRKARLHGLLIGCSSSFSGQSFRGADGVLGLAFSDFSFTSTATSLF 244 Query: 1034 GGKFSYCLVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAVNIKGISIG 1213 G KFSYCLVDHLSPKN+S+YLIFGS + + T L L +I PFYA+++ GIS+G Sbjct: 245 GAKFSYCLVDHLSPKNVSNYLIFGSSSSATKNAPG-RTTPLDLTLIPPFYAISVIGISLG 303 Query: 1214 GSMLDIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPVE 1393 MLDIPA+ W+ LT L++ AY+PV+ L L + + + P+E Sbjct: 304 EDMLDIPAQVWDATTGGGTVLDSGTSLTLLSEAAYKPVVTGLARYLDELERVKPEGVPIE 363 Query: 1394 YCFNST-GFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGN 1570 YCF+ST GF+ES +P+L FH GARFEP KSY+ID APGVKCLGF+ A P +V+GN Sbjct: 364 YCFSSTSGFNESKLPQLTFHMKGGARFEPHRKSYLIDTAPGVKCLGFMSAGTPATNVVGN 423 Query: 1571 IMQQNYFWEFDLANGRLGFATSSC 1642 IMQQNY WEFDL L FA SSC Sbjct: 424 IMQQNYLWEFDLMASTLSFAPSSC 447 >ref|XP_002882786.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp. lyrata] gi|297328626|gb|EFH59045.1| hypothetical protein ARALYDRAFT_478632 [Arabidopsis lyrata subsp. 
lyrata] Length = 449 Score = 408 bits (1049), Expect = e-111 Identities = 206/386 (53%), Positives = 259/386 (67%), Gaps = 2/386 (0%) Frame = +2 Query: 491 KKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASC 670 K+ + ++ + S DYG QYF R+G+PA+K ++ DTGS+LTW+NCRYR RG Sbjct: 66 KRKFKGGVKMDLGSGIDYGTAQYFTEVRVGTPAKKFRVVVDTGSELTWVNCRYRGRGKGK 125 Query: 671 RRNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAA 850 +N RR+FRA+ S SF+ V C + TCK+DL NLFSL+ C +P PC+YDYRY+DGSAA Sbjct: 126 VKN---RRVFRAEESKSFKTVGCFTQTCKVDLMNLFSLSTCPTPSTPCSYDYRYADGSAA 182 Query: 851 LGVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANK 1030 GVF ET+T LTNGRK RL +LVGCS S G+SFQ ADGV+GL +S++SF A + Sbjct: 183 QGVFAKETITVGLTNGRKARLRGLLVGCSSSFSGQSFQGADGVLGLAFSDFSFTSTATSL 242 Query: 1031 FGGKFSYCLVDHLSPKNISSYLIFG-SHKEVNISFNRMKYTELILGVINPFYAVNIKGIS 1207 FG K SYCLVDHLS KNIS+YLIFG S + + T L L +I PFYA+NI GIS Sbjct: 243 FGAKLSYCLVDHLSNKNISNYLIFGYSSSSTSTKTAPGRTTPLDLTLIPPFYAINIIGIS 302 Query: 1208 IGGSMLDIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGP 1387 IG MLDIP + W+ LT L + AY+PV+ L LV K + + P Sbjct: 303 IGDDMLDIPTQVWDATTGGGTILDSGTSLTLLAEAAYKPVVTGLARYLVELKRVKPEGIP 362 Query: 1388 VEYCFNST-GFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVI 1564 +EYCF+ST GF+ES +P+L FH GARFEP KSY++DAAPGVKCLGF+ A P +V+ Sbjct: 363 IEYCFSSTSGFNESKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFMSAGTPATNVV 422 Query: 1565 GNIMQQNYFWEFDLANGRLGFATSSC 1642 GNIMQQNY WEFDL L FA S+C Sbjct: 423 GNIMQQNYLWEFDLMASTLSFAPSTC 448 >ref|NP_187876.2| aspartyl protease family protein [Arabidopsis thaliana] gi|11994412|dbj|BAB02414.1| chloroplast nucleoid DNA binding protein-like [Arabidopsis thaliana] gi|332641715|gb|AEE75236.1| aspartyl protease family protein [Arabidopsis thaliana] Length = 461 Score = 404 bits (1038), Expect = e-110 Identities = 203/385 (52%), Positives = 254/385 (65%), Gaps = 1/385 (0%) Frame = +2 Query: 491 KKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASC 670 K++ V ++ + S DYG QYF R+G+PA+K ++ DTGS+LTW+NCRYR 
RG Sbjct: 84 KRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKD- 142 Query: 671 RRNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAA 850 RR+FRAD S SF+ V C + TCK+DL NLFSL C +P PC+YDYRY+DGSAA Sbjct: 143 -----NRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAA 197 Query: 851 LGVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANK 1030 GVF ET+T LTNGR RL L+GCS S G+SFQ ADGV+GL +S++SF A + Sbjct: 198 QGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSL 257 Query: 1031 FGGKFSYCLVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAVNIKGISI 1210 +G KFSYCLVDHLS KN+S+YLIFGS + +F R T L L I PFYA+N+ GIS+ Sbjct: 258 YGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRT--TPLDLTRIPPFYAINVIGISL 315 Query: 1211 GGSMLDIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPV 1390 G MLDIP++ W+ LT L AY+ V+ L LV K + + P+ Sbjct: 316 GYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPI 375 Query: 1391 EYCFNST-GFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIG 1567 EYCF+ T GF+ S +P+L FH GARFEP KSY++DAAPGVKCLGFV A P +VIG Sbjct: 376 EYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIG 435 Query: 1568 NIMQQNYFWEFDLANGRLGFATSSC 1642 NIMQQNY WEFDL L FA S+C Sbjct: 436 NIMQQNYLWEFDLMASTLSFAPSAC 460 >gb|AAL49921.1| unknown protein [Arabidopsis thaliana] Length = 439 Score = 404 bits (1038), Expect = e-110 Identities = 203/385 (52%), Positives = 254/385 (65%), Gaps = 1/385 (0%) Frame = +2 Query: 491 KKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASC 670 K++ V ++ + S DYG QYF R+G+PA+K ++ DTGS+LTW+NCRYR RG Sbjct: 62 KRNSTVGVKMDLGSGIDYGTAQYFTEIRVGTPAKKFRVVVDTGSELTWVNCRYRARGKD- 120 Query: 671 RRNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAA 850 RR+FRAD S SF+ V C + TCK+DL NLFSL C +P PC+YDYRY+DGSAA Sbjct: 121 -----NRRVFRADESKSFKTVGCLTQTCKVDLMNLFSLTTCPTPSTPCSYDYRYADGSAA 175 Query: 851 LGVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANK 1030 GVF ET+T LTNGR RL L+GCS S G+SFQ ADGV+GL +S++SF A + Sbjct: 176 
QGVFAKETITVGLTNGRMARLPGHLIGCSSSFTGQSFQGADGVLGLAFSDFSFTSTATSL 235 Query: 1031 FGGKFSYCLVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAVNIKGISI 1210 +G KFSYCLVDHLS KN+S+YLIFGS + +F R T L L I PFYA+N+ GIS+ Sbjct: 236 YGAKFSYCLVDHLSNKNVSNYLIFGSSRSTKTAFRRT--TPLDLTRIPPFYAINVIGISL 293 Query: 1211 GGSMLDIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPV 1390 G MLDIP++ W+ LT L AY+ V+ L LV K + + P+ Sbjct: 294 GYDMLDIPSQVWDATSGGGTILDSGTSLTLLADAAYKQVVTGLARYLVELKRVKPEGVPI 353 Query: 1391 EYCFNST-GFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIG 1567 EYCF+ T GF+ S +P+L FH GARFEP KSY++DAAPGVKCLGFV A P +VIG Sbjct: 354 EYCFSFTSGFNVSKLPQLTFHLKGGARFEPHRKSYLVDAAPGVKCLGFVSAGTPATNVIG 413 Query: 1568 NIMQQNYFWEFDLANGRLGFATSSC 1642 NIMQQNY WEFDL L FA S+C Sbjct: 414 NIMQQNYLWEFDLMASTLSFAPSAC 438 >ref|XP_006407304.1| hypothetical protein EUTSA_v10020732mg [Eutrema salsugineum] gi|557108450|gb|ESQ48757.1| hypothetical protein EUTSA_v10020732mg [Eutrema salsugineum] Length = 444 Score = 404 bits (1037), Expect = e-110 Identities = 199/379 (52%), Positives = 253/379 (66%), Gaps = 1/379 (0%) Frame = +2 Query: 512 AELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRNSRKR 691 A++ + S DYG QYF R+G+PA++ ++ DTGS+LTW+NCR+ +G R Sbjct: 72 AKMALGSGFDYGAAQYFAEVRVGTPAKRFRVVVDTGSELTWVNCRFHGKG------KENR 125 Query: 692 RIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNE 871 R+FRA+ SSSFR V C + TCK+DL NLFSL+ C +P PC+YDYRY+DGSAA GVF E Sbjct: 126 RVFRAEESSSFRKVGCLTQTCKVDLMNLFSLSNCPTPSTPCSYDYRYADGSAAQGVFAKE 185 Query: 872 TVTFSLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSY 1051 T T LTNGRK +L +L+GCS S G SF+ ADGV+GL S+YSF KA N FGGKFSY Sbjct: 186 TFTVGLTNGRKAKLRGLLIGCSSSFSGDSFRGADGVLGLALSDYSFTSKATNIFGGKFSY 245 Query: 1052 CLVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAVNIKGISIGGSMLDI 1231 CLVDHLS KN+S+YL FGS + ++ T L L +I PFYA+NI GISIG MLDI Sbjct: 246 CLVDHLSNKNVSNYLTFGSSSSTTKTAASIRTTPLDLKLIPPFYAINIIGISIGDDMLDI 305 Query: 1232 PAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPVEYCFNST 1411 P + 
W+ LT L AY+ V++ L LV FK + + P+EYCF++T Sbjct: 306 PTQVWDATAGGGTILDSGTSLTFLADAAYKAVVSGLERYLVGFKRVKPEGVPIEYCFDTT 365 Query: 1412 -GFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGNIMQQNY 1588 GF+ES +P+L FHF GARFEP +SYV+D GV+CLGFV P +V+GNIMQQNY Sbjct: 366 SGFNESKLPQLTFHFKGGARFEPHRRSYVVDTLEGVRCLGFVSTGSPATNVVGNIMQQNY 425 Query: 1589 FWEFDLANGRLGFATSSCI 1645 WEFDL L FA S+C+ Sbjct: 426 LWEFDLVASTLSFAPSTCL 444 >gb|EMJ13046.1| hypothetical protein PRUPE_ppa004710mg [Prunus persica] Length = 495 Score = 394 bits (1013), Expect = e-107 Identities = 205/446 (45%), Positives = 284/446 (63%), Gaps = 7/446 (1%) Frame = +2 Query: 326 TQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTR-LKKDY 502 TQ +++L D RL+ +++K Q + N +NSTR + Sbjct: 63 TQQALIQELHRHDVFRLQMMAQKRQQNGHDQGLNSSSS-----------SNSTRRMDMQT 111 Query: 503 NVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCR-RN 679 +S + M + DYGIGQY V+ ++G+PAQK +I TGSDLTW+ C C G SC R Sbjct: 112 RLSVTMPMNAGWDYGIGQYLVKLKLGTPAQKFTVIPSTGSDLTWVRCGSHC-GKSCGIRK 170 Query: 680 SR--KRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAAL 853 R R+F D SS+F++V CSS C+ DLAN SL +C P+ PC YDY Y +GS+AL Sbjct: 171 GRIDHSRVFNTDRSSTFKSVTCSSKMCEFDLANFNSLNKCPRPLSPCRYDYSYVEGSSAL 230 Query: 854 GVFGNETVTFSLTNGRKTRLHHVLVGCSESSRGR-SFQVADGVMGLGYSNYSFAVKAANK 1030 G FG + V SL+NGR+ R+ VL+GC+ES G+ + + +DG++GLG+ YSF KAA K Sbjct: 231 GTFGTDIVRASLSNGRRNRMKDVLIGCTESIIGKGTAKGSDGILGLGFGKYSFTTKAALK 290 Query: 1031 FGGKFSYCLVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVIN--PFYAVNIKGI 1204 +GGK SYCL+DH+SPKN++SYL FG +K+ + +M+YT+L+ G N FY VN++GI Sbjct: 291 YGGKVSYCLLDHMSPKNVTSYLTFGDNKKAVLQ-GKMRYTQLVFGNPNKGSFYGVNLQGI 349 Query: 1205 SIGGSMLDIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIG 1384 S+GG ML+IP WN LT LT+PAY+PVM AL + L F+ L + Sbjct: 350 SVGGKMLNIPLHIWNPKLGGGALVDSGMSLTFLTKPAYKPVMTALTMPLTKFRRLRSEED 409 Query: 1385 PVEYCFNSTGFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVI 1564 ++CF+ G+ + LVP+LVFHF+ GA+F PPVKSYVID +PG+KC+G +P A GA +I Sbjct: 410 
DFDFCFDPRGYRDRLVPKLVFHFAGGAKFAPPVKSYVIDVSPGMKCIGILPLA-EGACII 468 Query: 1565 GNIMQQNYFWEFDLANGRLGFATSSC 1642 GNI+QQN+ WEF+L LGFA S+C Sbjct: 469 GNIIQQNHLWEFNLVRKTLGFAPSTC 494 >ref|XP_006429804.1| hypothetical protein CICLE_v10013820mg [Citrus clementina] gi|557531861|gb|ESR43044.1| hypothetical protein CICLE_v10013820mg [Citrus clementina] Length = 475 Score = 386 bits (991), Expect = e-104 Identities = 202/423 (47%), Positives = 271/423 (64%), Gaps = 3/423 (0%) Frame = +2 Query: 335 ERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSA 514 ER+RQL+ D R IS ++ ++ +I T+ N T N+ Sbjct: 65 ERIRQLIDGDIARQEMISRRLEDRRRRGRIRKASEISHHRTF-----NGTS-----NI-V 113 Query: 515 ELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSDLTWMNCRYRCRGASCRRN--SRK 688 ++ +RS AD G+GQYFV FR+GSP QK +LIADTGSDLTWM+C ++ G +C ++ + Sbjct: 114 KIPLRSGADRGLGQYFVSFRVGSPPQKFVLIADTGSDLTWMHCNHK--GENCPKDGLTPP 171 Query: 689 RRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGN 868 R+F+AD SS+F+ +PCSS TCK+DL + FSL+ C +P+ PCAYDY Y DGS G F N Sbjct: 172 NRMFQADASSTFKTIPCSSRTCKVDLQDTFSLSMCPTPVTPCAYDYSYFDGSKVRGFFAN 231 Query: 869 ETVTF-SLTNGRKTRLHHVLVGCSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKF 1045 ETVT S+ +K RL V VGC++ + G +F ADGV+GLG+ SFA AA F KF Sbjct: 232 ETVTAGSIDRRKKVRLKEVTVGCTDWANG-NFHNADGVLGLGFGKNSFAATAAKLFDNKF 290 Query: 1046 SYCLVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAVNIKGISIGGSML 1225 SYCLVDHLSP N +++L FG+ + +I M++T+LILG +NPFYAVN+ GISI G ML Sbjct: 291 SYCLVDHLSPSNFANFLNFGNTSKQHIQ--NMQHTQLILGELNPFYAVNVSGISIAGKML 348 Query: 1226 DIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPVEYCFN 1405 ++P E W++ LT L +PAY +AALR L +K L +GP+ +C+N Sbjct: 349 NVPPEMWHIHGAGGVILDSGTTLTFLGEPAYAAAVAALRAPLEKYKKLGHVLGPLRFCYN 408 Query: 1406 STGFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGNIMQQN 1585 F + VP+ V HF+DGA+F PP KSYVIDA GVKC+GF A WP +VIGNIMQQN Sbjct: 409 DPRFDMADVPQFVLHFADGAKFVPPKKSYVIDADVGVKCIGFASAGWPANTVIGNIMQQN 468 Query: 1586 YFW 1594 + W Sbjct: 469 HLW 471 >ref|XP_002518395.1| Aspartic proteinase Asp1 
precursor, putative [Ricinus communis] gi|223542240|gb|EEF43782.1| Aspartic proteinase Asp1 precursor, putative [Ricinus communis] Length = 489 Score = 381 bits (979), Expect = e-103 Identities = 216/484 (44%), Positives = 291/484 (60%), Gaps = 10/484 (2%) Frame = +2 Query: 224 FFLIIAVFIIINSAKLLEGHGGIKFELTHRRNGATQLERLRQLLHSDTIRLRGISEKVSQ 403 FF + A F + +K + G+ FE+ H + +L+ + L RL G + + Sbjct: 22 FFQVDATFEFDDDSKN-NNNSGVWFEMFHMHS--PKLKSQSKFLGPPKSRLDGTRQLLQ- 77 Query: 404 KQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVS--AELLMRSAADYGIGQYFVRFRI 577 + N RRQ+ + TR +K + VS A++ + S AD G QYFV RI Sbjct: 78 -----SDNARRQM------ISSLRHGTR-RKAFEVSHTAQIPIHSGADSGQSQYFVSIRI 125 Query: 578 GSPA-QKLMLIADTGSDLTWMNCRYRCRGASCRR-NSRKRRIFRADHSSSFRAVPCSSST 751 G+P QK +L+ DTGSDLTWMNC Y C+ SC + N R+FRA+ SSSFR +PCSS Sbjct: 126 GTPRPQKFILVTDTGSDLTWMNCEYWCK--SCPKPNPHPGRVFRANDSSSFRTIPCSSDD 183 Query: 752 CKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNETVTFSLTNGRKTRLHHVLVG 931 CKI+L + FSL C +P PC +DYRY +G A+GVF NETVT L + +K RL VL+G Sbjct: 184 CKIELQDYFSLTECPNPNAPCLFDYRYLNGPRAIGVFANETVTVGLNDHKKIRLFDVLIG 243 Query: 932 CSESSRGRSFQVADGVMGLGYSNYSFAVKAANKFGGKFSYCLVDHLSPKNISSYLIFGSH 1111 C+ES + DGVMGLGY +S A++ A FG KFSYCLVDHLS N ++L FG Sbjct: 244 CTESFNETN-GFPDGVMGLGYRKHSLALRLAEIFGNKFSYCLVDHLSSSNHKNFLSFGDI 302 Query: 1112 KEVNISFNRMKYTELILGVINPFYAVNIKGISIGGSMLDIPAETWNLXXXXXXXXXXXXX 1291 E+ + +M++TEL+LG IN FY VN+ GIS+GGSML I ++ WN+ Sbjct: 303 PEMKLP--KMQHTELLLGYINAFYPVNVSGISVGGSMLSISSDIWNVTGVGGMIVDSGTS 360 Query: 1292 LTGLTQPAYQPVMAALRLSLVNFKNLNLDIGPVE------YCFNSTGFSESLVPRLVFHF 1453 LT L AY V+ AL+ K + P+E +CF GF + VPRL+ HF Sbjct: 361 LTMLAGEAYDKVVDALKPIFDKHKK----VVPIELPELNNFCFEDKGFDRAAVPRLLIHF 416 Query: 1454 SDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGNIMQQNYFWEFDLANGRLGFAT 1633 +DGA F+PPVKSY+ID A G+KCLG + A +PG+S++GN+MQQN+ WE+DL G+LGF Sbjct: 417 ADGAIFKPPVKSYIIDVAEGIKCLGIIKADFPGSSILGNVMQQNHLWEYDLGRGKLGFGP 476 Query: 1634 SSCI 1645 SSCI Sbjct: 477 SSCI 480 >ref|XP_004488613.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 
1-like [Cicer arietinum] Length = 501 Score = 364 bits (934), Expect = 7e-98 Identities = 209/511 (40%), Positives = 295/511 (57%), Gaps = 38/511 (7%) Frame = +2 Query: 224 FFLIIAVFIIINSAKLLE-GHGGIKFELTHRRNGAT-----QLERLRQLLHSDTIRLRGI 385 F + I V IN + E + + EL HR + QLE ++ + D R Sbjct: 19 FLIQIIVVHSINEKEEEEVDNENMSLELVHRHDSRVIGDVDQLEAIKGSIQRDYFR---- 74 Query: 386 SEKVSQKQGNVNTNIRRQIQEKNTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFV 565 ++++Q +G +N N RR+ E + ++ M S DY +G+YFV Sbjct: 75 RQRMNQIKG-INQNHRRKDIETQQF------------------QMPMHSGRDYALGEYFV 115 Query: 566 RFRIGSPAQKLMLIADTGSDLTWMNC-------------RYRCRGASCRRNSRKRR---- 694 +IGSP Q L+ADTGS+ TW NC +++ +G S + K + Sbjct: 116 GVKIGSPGQSFWLVADTGSEFTWFNCKPRGKHLGNGGGHKHKHKGESKSKTKTKTKTTTA 175 Query: 695 -------------IFRADHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYS 835 +F S++F+ V CSS CK++L+NLFSL+ C +P DPC +D Y+ Sbjct: 176 RRKRGASNNPCYGVFCPHRSNTFQQVTCSSHKCKVELSNLFSLSYCPNPSDPCLFDISYA 235 Query: 836 DGSAALGVFGNETVTFSLTNGRKTRLHHVLVGCSESS-RGRSF-QVADGVMGLGYSNYSF 1009 DGS+A G FG +TVT +LTNG+K +LH++ +GC+++ G +F + G++GLGY+ SF Sbjct: 236 DGSSAKGFFGTDTVTVALTNGKKGKLHNLTIGCTQTMLNGVTFNENTGGILGLGYAKDSF 295 Query: 1010 AVKAANKFGGKFSYCLVDHLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAV 1189 KA+ ++G K SYCLVDHLS +N+SSYL FG+ K +S ++ TEL L +PFY V Sbjct: 296 VDKASLQYGAKLSYCLVDHLSHQNVSSYLTFGTPKVKLLS--EIRKTELFL--YSPFYGV 351 Query: 1190 NIKGISIGGSMLDIPAETWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNL 1369 ++ GIS+G ML IP + W+ L GL AY PV AL+ SL N K Sbjct: 352 HVLGISVGDQMLKIPHQVWDFNAEGGMIIDSGTTLAGLVLEAYDPVFEALKKSLTNVK-- 409 Query: 1370 NLDIGPVEYCFNSTGFSESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWP 1549 LDIG +++CF+S GF+E VPRLVFHF+ GARFEPP+KSY+ID P VKC+G VP Sbjct: 410 RLDIGVLDFCFDSEGFNERTVPRLVFHFAGGARFEPPIKSYIIDVEPKVKCIGIVPINGT 469 Query: 1550 GASVIGNIMQQNYFWEFDLANGRLGFATSSC 1642 GASVIGNIMQQ++ WEFDLA +GFA+S+C Sbjct: 470 GASVIGNIMQQDFLWEFDLAKNTVGFASSTC 500 >ref|XP_003532899.1| PREDICTED: protein ASPARTIC PROTEASE IN GUARD CELL 1-like [Glycine max] Length = 507 Score = 359 bits 
(921), Expect = 2e-96 Identities = 205/496 (41%), Positives = 280/496 (56%), Gaps = 45/496 (9%) Frame = +2 Query: 290 IKFELTHRRN--------GATQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRRQIQ 445 ++ EL HR + Q+E ++ ++ D +R ++++Q+ G N + RR+ Sbjct: 33 MRLELVHRHHERFSGGGGDVDQVEAVKGFVNRDGLR----RQRMNQRWGVSNYDRRRKGL 88 Query: 446 EKNTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADTGSD 625 E T E+ MR+ D +G+YF ++GSP Q+ L ADTGS+ Sbjct: 89 ETTT---------------TTEVEMPMRAGRDDALGEYFTEVKVGSPGQRFWLAADTGSE 133 Query: 626 LTWMNC--------------------------RYRCRGASCRRNSRKRR-------IFRA 706 TW NC R R + RR +K+ +F Sbjct: 134 FTWFNCVMRNATTTATTKKTRKNKTKKKHHHHSKRNRTRTTRRTKKKKAKSNPCKGVFCP 193 Query: 707 DHSSSFRAVPCSSSTCKIDLANLFSLARCASPMDPCAYDYRYSDGSAALGVFGNETVTFS 886 S SF+AV C+S CKIDL+ LFSL+ C P DPC YD Y+DGS+A G FG +T+T Sbjct: 194 HRSKSFQAVTCASQKCKIDLSQLFSLSLCPKPSDPCLYDISYADGSSAKGFFGTDTITVD 253 Query: 887 LTNGRKTRLHHVLVGCSES-SRGRSF-QVADGVMGLGYSNYSFAVKAANKFGGKFSYCLV 1060 L NG++ +L+++ +GC++S G +F + G++GLG++ SF KAA ++G KFSYCLV Sbjct: 254 LKNGKEGKLNNLTIGCTKSMENGVNFNEDTGGILGLGFAKDSFIDKAAYEYGAKFSYCLV 313 Query: 1061 DHLSPKNISSYLIFGSHKEVNISFNRMKYTELILGVINPFYAVNIKGISIGGSMLDIPAE 1240 DHLS +N+SSYL G H + +K TELIL PFY VN+ GISIGG ML IP + Sbjct: 314 DHLSHRNVSSYLTIGGHHNAKL-LGEIKRTELIL--FPPFYGVNVVGISIGGQMLKIPPQ 370 Query: 1241 TWNLXXXXXXXXXXXXXLTGLTQPAYQPVMAALRLSLVNFKNL-NLDIGPVEYCFNSTGF 1417 W+ LT L PAY+PV AL SL K + D G +++CF++ GF Sbjct: 371 VWDFNSQGGTLIDSGTTLTALLVPAYEPVFEALIKSLTKVKRVTGEDFGALDFCFDAEGF 430 Query: 1418 SESLVPRLVFHFSDGARFEPPVKSYVIDAAPGVKCLGFVPA-AWPGASVIGNIMQQNYFW 1594 +S+VPRLVFHF+ GARFEPPVKSY+ID AP VKC+G VP GASVIGNIMQQN+ W Sbjct: 431 DDSVVPRLVFHFAGGARFEPPVKSYIIDVAPLVKCIGIVPIDGIGGASVIGNIMQQNHLW 490 Query: 1595 EFDLANGRLGFATSSC 1642 EFDL+ +GFA S C Sbjct: 491 EFDLSTNTIGFAPSIC 506 >ref|XP_006859053.1| hypothetical protein AMTR_s00068p00192210 [Amborella trichopoda] gi|548863165|gb|ERN20520.1| hypothetical protein AMTR_s00068p00192210 [Amborella trichopoda] Length = 500 Score = 353 bits (905), Expect = 2e-94 
Identities = 207/484 (42%), Positives = 273/484 (56%), Gaps = 33/484 (6%) Frame = +2 Query: 290 IKFELTHRR---------NGA--TQLERLRQLLHSDTIRLRGISEKVSQKQGNVNTNIRR 436 IK L HR NGA ++L+ LR+LLH D +R + I Sbjct: 48 IKLHLLHRHGRELRGNPTNGAPPSKLDDLRELLHHDQLRKQMIH---------------- 91 Query: 437 QIQEKNTYFPDCTNSTRLKKDYNVSAELLMRSAADYGIGQYFVRFRIGSPAQKLMLIADT 616 ++ R + V A + + S A G GQYFV+FR G+P Q L+L+ADT Sbjct: 92 -------------SALRGRSRGGVGAAMSISSGAFAGTGQYFVKFRAGTPPQNLLLVADT 138 Query: 617 GSDLTWMNCRYRCRGASCRRNSRKRRIFRADHSSSFRAVPCSSSTCKIDLANLFSLARCA 796 GSDLTWMNCR+R + R+FRA SSSF + CS+ +C FSL C Sbjct: 139 GSDLTWMNCRFRPKTRVFSPRINGTRVFRASSSSSFSPLLCSAPSCP---TLPFSLTACP 195 Query: 797 SPMDPCAYDYRYSDGSAALGVFGNETVTFSLT--NGR---KTRLHHVLVGCSESSRGRSF 961 + PC YDYRY DGS A G F NE+VT S NGR RL H+L+GCS++ +GRSF Sbjct: 196 TASTPCRYDYRYVDGSFARGFFANESVTLSAVKPNGRHDGNVRLRHLLIGCSDAFQGRSF 255 Query: 962 QVADGVMGLGYSNYSFAVKAANKFGGKFSYCLVDHLSPKNISSYLIFGSHKEVNISFNRM 1141 + ADGV+GLG S SFAV+ + +F GKFSYCLVDHL+PKN +S+LIFG+ N S + Sbjct: 256 KEADGVLGLGQSAVSFAVQLSRRFDGKFSYCLVDHLAPKNHTSFLIFGNAPGANRSLSPK 315 Query: 1142 KY--TELILG-VINPFYAVNIKGISIGGSMLDIPAETWNLXXXXXXXXXXXXX---LTGL 1303 ++ T LIL + PFY V ++GIS+ G +++IP W + LT L Sbjct: 316 EFRRTPLILDQALQPFYGVKVRGISLDGKLVEIPDSVWMMNLTAQSGGVILDSGTTLTAL 375 Query: 1304 TQPAYQPVMAALRLSLVNFKNLNLDIGPVEYCFNSTGFS-----------ESLVPRLVFH 1450 +PAY+ V+ A + L + + L P ++CFNS+ E ++P++V+H Sbjct: 376 VEPAYEAVLTAFKEKLTGVRRVELS--PFDFCFNSSSSERGNSSEVEREREIVIPKMVWH 433 Query: 1451 FSDGARFEPPVKSYVIDAAPGVKCLGFVPAAWPGASVIGNIMQQNYFWEFDLANGRLGFA 1630 G RFEP +SYVID A GVKCLG AAWPG S IGNIMQQ+++WEFDL NG LGF Sbjct: 434 LGGGVRFEPRGESYVIDVAKGVKCLGIQGAAWPGFSTIGNIMQQSFYWEFDLKNGMLGFG 493 Query: 1631 TSSC 1642 SSC Sbjct: 494 RSSC 497