BLASTX nr result
ID: Mentha27_contig00020166
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Mentha27_contig00020166
         (2203 letters)

Database: ./nr
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                         (bits) Value

gb|EYU28892.1| hypothetical protein MIMGU_mgv1a006975mg [Mimulus...   255   7e-65
ref|XP_007045749.1| 18S pre-ribosomal assembly protein gar2-rela...   173   3e-40
ref|XP_007045750.1| 18S pre-ribosomal assembly protein gar2-rela...   167   2e-38
ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [...   161   1e-36
ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258...   158   1e-35
ref|XP_006395670.1| hypothetical protein EUTSA_v10004181mg [Eutr...   155   1e-34
gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thali...   155   1e-34
ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related...   155   1e-34
ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arab...   155   1e-34
ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258...   154   2e-34
gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thali...   151   1e-33
ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254...   147   2e-32
ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...   146   4e-32
ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like i...   145   8e-32
ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like i...   145   8e-32
ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Caps...   144   2e-31
ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like i...   144   2e-31
emb|CBI27399.3| unnamed protein product [Vitis vinifera]              139   4e-30
ref|XP_007222119.1| hypothetical protein PRUPE_ppa004630mg [Prun...
132 9e-28 ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217... 130 3e-27 >gb|EYU28892.1| hypothetical protein MIMGU_mgv1a006975mg [Mimulus guttatus] Length = 424 Score = 255 bits (651), Expect = 7e-65 Identities = 184/489 (37%), Positives = 259/489 (52%), Gaps = 26/489 (5%) Frame = +1 Query: 541 DDCSIDDENGKIDSPRKEGHNGFSHDLIMDNGIQDFPRMECKDGPAHLSSWDKDADSLST 720 DD S D E+G +DSP+KE + L++D HLS DKD DSLS+ Sbjct: 2 DDISEDSESGTVDSPKKEVEDV----LLID----------------HLSGSDKDLDSLSS 41 Query: 721 DSYDMEKEQGSPERESF-----DTVEDSADCNASD--SRDSGSPLFTDKNVLECGVPEFE 879 ++ D +KE+ + E+ ++ + S CN++ S+ + + LFTDKNVLECG+PEFE Sbjct: 42 NTCDKDKERQNLGHENMPCNGNESQDSSPPCNSASGLSQTTDANLFTDKNVLECGMPEFE 101 Query: 880 VCYRENDCQLLKDICIDEGSPEKDVNAIESGSSPLPPKEDPLLDADSFT---------AE 1032 V +E D Q++KDIC+DEG P+ ES K D L + + A Sbjct: 102 VFCKEIDYQIVKDICVDEGRPDNKDKITESCKDD---KSDGLFHQPTNSNHSEITITEAN 158 Query: 1033 PCGSKEGNDVIKLISQEEKLDSSLKNLFDKDSIKH-CEPENTVETSEACFDETLSPKDSL 1209 CG+KE ND + D+S FD+D+ K C+P +V+TSE ++ +DSL Sbjct: 159 QCGTKEEND------GKSPSDTS----FDEDTAKKDCDPAKSVQTSEITDNQE---EDSL 205 Query: 1210 ADRKLPIEDYGDQNSHGL------DEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTG-- 1365 K P+++ +NS DEG V DQ+L+E+ G Sbjct: 206 VGIKPPVQELVTRNSLRSFLYPLGDEGGVVTQPPDQILNEKPASRSSAATSSSAEAEGVE 265 Query: 1366 KDVQ-SSCLPYNSKVENEIITFNFSSPEGAAASNGTAEDIEEKCSENLPAATSNVEDDSH 1542 +DV+ SS + YNS+VE+ ITFNF S E+++ + S + + TSN D Sbjct: 266 EDVEASSSVLYNSEVESGTITFNFDST--------VTENMKPQDSVDSSSVTSNNID--- 314 Query: 1543 KQSPVXXXXXXXXXXXXXCANAPEASSEHKQANSDVISIDEAHSSEPNAPVVNQLQQDMG 1722 C + + + + NS+ + +A + Q++ + G Sbjct: 315 ------------------CVGSSKDREDENEKNSE-------QNEGSSAIISRQMKYEEG 349 Query: 1723 ETSFSAASMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDF 1902 ETSF+AAS++TYSGPIA+SGSLS RSDGS SG+SFAFP+LQSEWNSSPVRMAKADRR F Sbjct: 350 ETSFAAASLVTYSGPIAYSGSLSLRSDGSAASGRSFAFPILQSEWNSSPVRMAKADRRHF 409 Query: 1903 RKHKGWRSG 1929 RKHKGWRSG Sbjct: 410 RKHKGWRSG 418 >ref|XP_007045749.1| 18S pre-ribosomal assembly protein gar2-related, putative 
isoform 1 [Theobroma cacao] gi|508709684|gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 1 [Theobroma cacao] Length = 527 Score = 173 bits (438), Expect = 3e-40 Identities = 155/540 (28%), Positives = 237/540 (43%), Gaps = 48/540 (8%) Frame = +1 Query: 454 YSLNGSKMDANLAELEAKDRDDYSNVKVRDDCSIDDENGKIDSPRKEGHNGFSHDLIMDN 633 +S+ G K D+ A D + N D +D + KE NG HD+ ++ Sbjct: 11 HSITGHKSDSKPYSFLA-DTKPFEN----KDKPLDSTGLNAEGVVKENQNGVMHDIKGND 65 Query: 634 GIQDFPRMECKDGPAHLSSWDKDADSLSTDSYDMEKEQGSPERESFDTVEDSADCNASDS 813 G D P + + + D S+S + + E+E D V ++ + Sbjct: 66 GDSD-PSLYLDNTRGGWPALKLDC-SISVNDF-----ANGNEKEVRDFVTSNSPSLKNMD 118 Query: 814 RDSGSPLFTDKNVLECGVPEFEVCYRENDCQLLKDICIDEGSPEKDVNAIESGSS----- 978 S + DK+V+EC +PE VCY+E+ ++KDICIDEG P +D E+G Sbjct: 119 SFQNSVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEGVPTQDKFLFETGMDEKIDC 178 Query: 979 ---PLPPKEDPLLDADSFTAEPCGSKEGNDVIKLISQEEKLDSSLKNLFDKDSIKHCEPE 1149 P ++D L + + C ++ S +N KD C Sbjct: 179 NFLPSEKEQDSQLMTEKLETDMC-------------MQDVSMSPGENQSGKDIDNECGSN 225 Query: 1150 NTVETSEACFDETLSPKDSLADRKLPIE-DYGDQNSHGLDEGNKVMLLSDQVLDEETVXX 1326 V+T D +LS + + +++ +P + D D + +G+ + +++D V E Sbjct: 226 KKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKDLMLTRVVKGDAMKMVTDDVSKELFTLG 285 Query: 1327 XXXXXXXXXXXTGKDVQSSCLPYNSKVENEIITFNFSSPEGAAASNGTAEDIEEKCSEN- 1503 + + S C + +E + +F SS + +EE N Sbjct: 286 ELLSMSELSKVNSEAMSSDC--KSDGIEQQ--SFQSSSKKEVMVMPPLVSAVEESKDSNE 341 Query: 1504 -----LPAATSNVED-DSHKQSPVXXXXXXXXXXXXXCANA--PEASSEHK--------- 1632 +PA S E+ DS K + +++ E S ++K Sbjct: 342 EAIVSVPALVSATEELDSGKGEAILISPAQVSTSEESTSSSLVNEVSYDNKLETGSITFN 401 Query: 1633 -QANSDVISIDEAH---SSEP-------------NAPVVNQLQQDMGETSFSAASMIT-- 1755 +++ S DE H SEP + + N LQQ +GE+SFSAA ++T Sbjct: 402 LDSSAPTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQGIGESSFSAAGLVTGL 461 Query: 1756 --YSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSG 1929 YSGP+A+SGSLS RSD STTS +SFAFP+LQSEWN SPVRMAKADRR +RKHKGWR G Sbjct: 462 ISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMAKADRRHYRKHKGWRHG 521 
>ref|XP_007045750.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] gi|590698568|ref|XP_007045751.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] gi|590698571|ref|XP_007045752.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] gi|508709685|gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] gi|508709686|gb|EOY01583.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] gi|508709687|gb|EOY01584.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] Length = 470 Score = 167 bits (423), Expect = 2e-38 Identities = 134/439 (30%), Positives = 201/439 (45%), Gaps = 48/439 (10%) Frame = +1 Query: 757 ERESFDTVEDSADCNASDSRDSGSPLFTDKNVLECGVPEFEVCYRENDCQLLKDICIDEG 936 E+E D V ++ + S + DK+V+EC +PE VCY+E+ ++KDICIDEG Sbjct: 43 EKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTYHVVKDICIDEG 102 Query: 937 SPEKDVNAIESGSS--------PLPPKEDPLLDADSFTAEPCGSKEGNDVIKLISQEEKL 1092 P +D E+G P ++D L + + C ++ Sbjct: 103 VPTQDKFLFETGMDEKIDCNFLPSEKEQDSQLMTEKLETDMC-------------MQDVS 149 Query: 1093 DSSLKNLFDKDSIKHCEPENTVETSEACFDETLSPKDSLADRKLPIE-DYGDQNSHGLDE 1269 S +N KD C V+T D +LS + + +++ +P + D D + + Sbjct: 150 MSPGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKDLMLTRVVK 209 Query: 1270 GNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNSKVENEIITFNFSSPEG 1449 G+ + +++D V E + + S C + +E + +F SS + Sbjct: 210 GDAMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDC--KSDGIEQQ--SFQSSSKKE 265 Query: 1450 AAASNGTAEDIEEKCSEN------LPAATSNVED-DSHKQSPVXXXXXXXXXXXXXCANA 1608 +EE N +PA S E+ DS K + +++ Sbjct: 266 VMVMPPLVSAVEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEESTSSS 325 Query: 1609 --PEASSEHK----------QANSDVISIDEAH---SSEP-------------NAPVVNQ 1704 E S ++K +++ S DE H SEP + + N Sbjct: 326 LVNEVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNN 385 Query: 1705 
LQQDMGETSFSAASMIT----YSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPV 1872 LQQ +GE+SFSAA ++T YSGP+A+SGSLS RSD STTS +SFAFP+LQSEWN SPV Sbjct: 386 LQQGIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPV 445 Query: 1873 RMAKADRRDFRKHKGWRSG 1929 RMAKADRR +RKHKGWR G Sbjct: 446 RMAKADRRHYRKHKGWRHG 464 >ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum] Length = 586 Score = 161 bits (407), Expect = 1e-36 Identities = 165/600 (27%), Positives = 243/600 (40%), Gaps = 73/600 (12%) Frame = +1 Query: 349 AEQIMTMKENQNGTSFDSRSSGM-QTDTLTFGSKERYSLNGSKMDANLAELEAKDRDDYS 525 AE+ TM NQNG S S+G + D+L + + N + + KD +++ Sbjct: 33 AEEKPTMNGNQNGIL--SHSNGYKEADSLGIPVNDFGNTNVHDNKEDPLACDRKDGNEFW 90 Query: 526 NVKVRDDCSIDDENGKIDSPR-KEGHNGFSHDLIMDNGIQ--------DFPRMECKD--- 669 V DD D N +I + ++ HN DL NG D P E + Sbjct: 91 EVPELDDSIFFDNNNEIKASNVRDDHNV---DLSKINGDNRGGNPFACDIPSSETNEIVA 147 Query: 670 ---------GPAHLSSWDKDADSLSTDSYDMEKEQGSPERESF--------DTVE-DSAD 795 G +++ + + D+ D ++ PE ES +T++ DS Sbjct: 148 ASVTDDQNGGLSNIIHSKRGGNPFECDTKDRDQPWNIPEYESLGFLDDKENETIDSDSPF 207 Query: 796 CNASDSRDSGSPLFTDKNVLECGVPEFEVCYRENDCQLLKDICIDEGSPEKDVNAIESGS 975 + S+ DS ++DK V + +PE VCYREN+ ++KDIC+DEG P D IES Sbjct: 208 TSHSELFDSNKHFYSDKGVTDHELPELTVCYRENNFNMVKDICMDEGVPAVDKVLIESWK 267 Query: 976 SPLPPKEDPLLDADSFTAEPCGSKEGNDVIKLISQEEKL--------------------- 1092 P + + + S + I +SQ+ Sbjct: 268 DGQPSTSVSVDADEEQQSNTRKSVDMGSTIASVSQDSSFKDAKNIAVTHDTEIEATGAPV 327 Query: 1093 ----DSSLKNLFDKDSIKHCEPENTVETSEACFDETLSPKDSLADRKLPIEDYGDQNSHG 1260 + SL+N +KD+ K E+ + + S K S + + +E+ + S Sbjct: 328 PNGFNPSLENNANKDADKDSYLEDLLMIFGSKCTTNASEKPSSLNTVVRVEESNIKTS-- 385 Query: 1261 LDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNSKVENEIITFNFSS 1440 +G++ L DQV E+T+ +++ V I N + Sbjct: 386 --DGDQSTLQPDQVPSEQTLKSQTAVSASGQTNNKGNIKEG-------VGTSIFDVNLTK 436 Query: 1441 PEGAAASNGTAEDIEEKCSENLPAATSNVEDDSHKQSPVXXXXXXXXXXXXXCANAPEAS 1620 PE + G N+ +DSH P+A Sbjct: 437 PESTKTTEG---------------GVGNLPEDSHM---------------------PKAV 460 
Query: 1621 SEHKQANSDVISI----------DEAHSSEPNAPVVNQLQQDM--GETSFSAA-----SM 1749 S HK NSD S D AH + + Q GE SFSAA Sbjct: 461 SVHKNGNSDNNSASSQVPFANTADNAHQQHLESQNMANGQSHFADGEASFSAARGPISGS 520 Query: 1750 ITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSG 1929 ITYSGPI++SGS+S RS+ STTS +SFAFPVLQ+EWNSSPVRMAKA+RR K KGW+ G Sbjct: 521 ITYSGPISYSGSVSLRSESSTTSTRSFAFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQG 580 >ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258367 isoform 1 [Solanum lycopersicum] Length = 586 Score = 158 bits (399), Expect = 1e-35 Identities = 180/617 (29%), Positives = 247/617 (40%), Gaps = 90/617 (14%) Frame = +1 Query: 349 AEQIMTMKENQNGTSFDSRSSGMQTDTLTFGSKERYSLNGSKMDANLAELEAKDRDDYSN 528 AE+ TM NQNG S + D L F + + N + + KD + + Sbjct: 27 AEEKPTMNGNQNGILGHSNGY-KEADALGFPVNDFGNTNVHDNREDPLACDRKDGNKFWE 85 Query: 529 VKVRDDCSIDDENGKIDSPR-KEGHNGFSHDLIMDNGIQ--------DFPRMECKDGPAH 681 V DD D N +I + ++ HN DL NG D P E + A Sbjct: 86 VPELDDSIFFDNNDEIKASNVRDNHNV---DLSTINGDNRGGNPFACDIPSSETNEIVA- 141 Query: 682 LSSWDKDADSLST-------------DSYDMEKEQGSPERESFDTVEDSADCNASDSRDS 822 S D SLS D+ D + PE ES D ++D + ++ DS Sbjct: 142 ASVTDDQTGSLSNIIHTKRGGNPFECDTKDRNQPWNIPEYESLDFLDDKGN----ETIDS 197 Query: 823 GSPL-------------FTDKNVLECGVPEFEVCYRENDCQLLKDICIDEGSPEKDVNAI 963 SP ++DK V + + E VCYREN+ ++KDIC+DEG P D Sbjct: 198 DSPFTSHSELFENNKHFYSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLT 257 Query: 964 ESGSSPLPPKEDPLLDADSFTAEP---CGSKEGNDV---IKLISQEEKLDS--------- 1098 ES K+D L + S A+ +K+ D+ I +SQ+ + Sbjct: 258 ESW------KDDQLSTSVSVDADEEHQSNTKKSVDMGSSIATVSQDSSCEDAKNIAVTHG 311 Query: 1099 ----------------SLKNLFDKDSIKHCEPENTVET-SEACFDE----TLSPKDSLAD 1215 SL+N +KD+ K E+ + C S K S + Sbjct: 312 AEIEPTGAPIPNDFNPSLENKANKDADKDSYLEDLLMIFGSKCTTNGKTTNASEKPSSPN 371 Query: 1216 RKLPIEDYGDQNSHGLDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPY 1395 + +E+ + S +G++ L DQV ++T+ K Sbjct: 372 TVVRVEESNIKTS----DGDQSTLQPDQVPFDQTLKSQTAISAADESNNNKG-------- 419 Query: 1396 
NSK--VENEIITFNFSSPEGAAASNGTAEDIEEKCSENLPAATSNVEDDSHKQSPVXXXX 1569 NSK I FN + PE + G E N+ +DSHK Sbjct: 420 NSKEGAGTNIFDFNLTKPESTTTTEGGVE---------------NLPEDSHK-------- 456 Query: 1570 XXXXXXXXXCANAPEASSEHKQANSDVISI----------DEAHSSEPNAPVVNQLQQDM 1719 P+A S HK NSD IS D AH + + Q Sbjct: 457 -------------PKAVSVHKNGNSDNISASSQVPFANTADNAHQQHLESQNMANGQGHF 503 Query: 1720 --GETSFSAA-----SMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRM 1878 GE SFSAA ITYSGPI++SGSLS RS+ STTS +SFAFPVLQ+EWNSSPVRM Sbjct: 504 ADGEASFSAARGPISGSITYSGPISYSGSLSLRSESSTTSTRSFAFPVLQNEWNSSPVRM 563 Query: 1879 AKADRRDFRKHKGWRSG 1929 AKA+RR K KGW+ G Sbjct: 564 AKAERRRLSKQKGWKQG 580 >ref|XP_006395670.1| hypothetical protein EUTSA_v10004181mg [Eutrema salsugineum] gi|567142661|ref|XP_006395671.1| hypothetical protein EUTSA_v10004181mg [Eutrema salsugineum] gi|557092309|gb|ESQ32956.1| hypothetical protein EUTSA_v10004181mg [Eutrema salsugineum] gi|557092310|gb|ESQ32957.1| hypothetical protein EUTSA_v10004181mg [Eutrema salsugineum] Length = 458 Score = 155 bits (391), Expect = 1e-34 Identities = 135/444 (30%), Positives = 197/444 (44%), Gaps = 33/444 (7%) Frame = +1 Query: 691 WDKDADSLSTDSYDMEKEQGSPERESFDTVEDSADCNAS--DSRDSGSPLF-TDKNVLEC 861 WDK+ D +E+ E+ + D++ A+ DS + P+F DKNV C Sbjct: 60 WDKE-----NDGNTLERHSCGDSNEAVKKIPDNSHDVAAKRDSLEKLDPVFYMDKNVTAC 114 Query: 862 GVPEFEVCYRENDCQLLKDICIDEGSP--------EKDVNAIESGSSPLPPKEDPLLDAD 1017 +PE VCY+EN ++KDIC+DEG P EKD +++ S+ + + L++AD Sbjct: 115 DLPEIVVCYKENTYHVVKDICVDEGVPVQEKFLFGEKD--SVKCSSNSNKCESEDLMEAD 172 Query: 1018 SFTAEPCGSKEGNDVIKLISQEEKL----------DSSLKNLFDKDSIKHCEPENTVETS 1167 ++ SK D + E +SS + D + +C E+ T Sbjct: 173 KASSNLLESKSLEDRNSKLDDSELCNGTKTNRDVEESSREEFADAEGSSNCNQEHLTVTR 232 Query: 1168 EACFDETLSPKDSLADRKLPIEDYGDQNSHGLDEGNKVMLLSDQVLDEETVXXXXXXXXX 1347 EA SP + ++ E D+NS + ++S+ L Sbjct: 233 EA----KDSPTHGVNHSEISHEIESDENSKKHEVATSENVVSECCL-------------- 274 Query: 1348 XXXXTGKDVQSSCLPYNSKVENEIITFNFSSPEGAAASNGTAEDIEEKCSENLPAATSNV 1527 T D+ S + E + + N SS S +++E++ E P T 
Sbjct: 275 ----TLGDILSR------EDEQKHLNNNNSSNRREEHSPPLLQEMEKRSLETTPLETEEP 324 Query: 1528 EDDSHKQSPVXXXXXXXXXXXXXCANAPEASSEHKQANSDVISIDEAHSSEPNAPVVNQL 1707 + K S V + S E + +D + + +P V + Sbjct: 325 KQAEEKLSSV----------------STTTSQEPNKTCNDPERPETENQQQPKLRVEDSY 368 Query: 1708 QQD------MGETSFSAASM------ITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQS 1851 + D GETSFSA+ ITYSGPIAFSGSLS RSD STTSG+SFAFP+LQS Sbjct: 369 EDDKLFSSGFGETSFSASEPVSISGHITYSGPIAFSGSLSVRSDASTTSGRSFAFPILQS 428 Query: 1852 EWNSSPVRMAKADRRDFRKHKGWR 1923 EWNSSPVRMAKAD+ R+ KGWR Sbjct: 429 EWNSSPVRMAKADK---RRQKGWR 449 >gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thaliana] Length = 439 Score = 155 bits (391), Expect = 1e-34 Identities = 133/416 (31%), Positives = 182/416 (43%), Gaps = 27/416 (6%) Frame = +1 Query: 757 ERESFDTVEDSA-DCNAS-DSRDSGSPLF-TDKNVLECGVPEFEVCYRENDCQLLKDICI 927 E E+ V D++ DC+A+ DS + P+F DKNV C +PE VCY+EN ++KDIC+ Sbjct: 61 ENEAGKKVRDTSHDCDANVDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDICV 120 Query: 928 DEGSPEKDVNAIESGSSPLPPKEDPLLDADSFTAEPCGSKEGNDVIKLISQEE-----KL 1092 DEG P ++ S + L+ AD P +K D I + E K Sbjct: 121 DEGVPVQEKFLFGEKDSVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEFCNDHKT 180 Query: 1093 DSSLKNLFDKD------SIKHCEPENTVETSEACFDETLSPKDSLADRKL-PIEDYGDQN 1251 D ++ +D + + E+ + T E SP L+ ++ P E+ D+ Sbjct: 181 DRDVEESSGEDFADAEGTSSNYNQEHLIVTEEV----KASPTHGLSPSEIEPDENSKDEV 236 Query: 1252 SHGLDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNSKVENEIITFN 1431 + D +K L +L E D Q S + N Sbjct: 237 AISQDNDSKECLTLGDILSRE------------------DEQKS-----------LNQDN 267 Query: 1432 FSSPEGAAASNGTAEDIEEKCSENLPAATSNVEDDSHKQSPVXXXXXXXXXXXXXCANAP 1611 SS S +D E++ E T + + KQ + Sbjct: 268 ISSDSHEEQSPSQLQDKEKRSLETTAIETELEKTEEPKQGE----------EKLSSVSTT 317 Query: 1612 EASSEHKQANSDVISIDEAHSSEPNAPVVNQLQQD------MGETSFSAASM------IT 1755 + +K N E H + N V N + D GETSFSAA IT Sbjct: 318 TSQEPNKTCNEPEKPETENHHQQ-NCLVENSYEDDKFSSSRFGETSFSAADSVSISGHIT 376 Query: 1756 YSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWR 1923 YSGPIA+SGSLS RSD 
STTSG+SFAFP+LQSEWNSSPVRMAKAD+R R+ GWR Sbjct: 377 YSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWR 430 >ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana] gi|42570677|ref|NP_973412.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana] gi|79316683|ref|NP_001030966.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana] gi|186499149|ref|NP_001118260.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana] gi|330250656|gb|AEC05750.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana] gi|330250657|gb|AEC05751.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana] gi|330250658|gb|AEC05752.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana] gi|330250659|gb|AEC05753.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana] Length = 439 Score = 155 bits (391), Expect = 1e-34 Identities = 133/416 (31%), Positives = 182/416 (43%), Gaps = 27/416 (6%) Frame = +1 Query: 757 ERESFDTVEDSA-DCNAS-DSRDSGSPLF-TDKNVLECGVPEFEVCYRENDCQLLKDICI 927 E E+ V D++ DC+A+ DS + P+F DKNV C +PE VCY+EN ++KDIC+ Sbjct: 61 ENEAGKKVRDTSHDCDANVDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDICV 120 Query: 928 DEGSPEKDVNAIESGSSPLPPKEDPLLDADSFTAEPCGSKEGNDVIKLISQEE-----KL 1092 DEG P ++ S + L+ AD P +K D I + E K Sbjct: 121 DEGVPVQEKFLFGEKDSVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEFCNDHKT 180 Query: 1093 DSSLKNLFDKD------SIKHCEPENTVETSEACFDETLSPKDSLADRKL-PIEDYGDQN 1251 D ++ +D + + E+ + T E SP L+ ++ P E+ D+ Sbjct: 181 DRDVEESSGEDFADAEGTSSNYNQEHLIVTEEV----KASPTHGLSPSEIEPDENSKDEV 236 Query: 1252 SHGLDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNSKVENEIITFN 1431 + D +K L +L E D Q S + N Sbjct: 237 AISQDNDSKECLTLGDILSRE------------------DEQKS-----------LNQDN 267 Query: 1432 FSSPEGAAASNGTAEDIEEKCSENLPAATSNVEDDSHKQSPVXXXXXXXXXXXXXCANAP 1611 SS S +D E++ E T + + KQ + Sbjct: 268 
ISSDSHEEQSPSQLQDKEKRSLETTAIETELEKTEEPKQGE----------EKLSSVSTT 317 Query: 1612 EASSEHKQANSDVISIDEAHSSEPNAPVVNQLQQD------MGETSFSAASM------IT 1755 + +K N E H + N V N + D GETSFSAA IT Sbjct: 318 TSQEPNKTCNEPEKPETENHHQQ-NCLVENSYEDDKFSSSRFGETSFSAADSVSISGHIT 376 Query: 1756 YSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWR 1923 YSGPIA+SGSLS RSD STTSG+SFAFP+LQSEWNSSPVRMAKAD+R R+ GWR Sbjct: 377 YSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWR 430 >ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata] gi|297321067|gb|EFH51488.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata] Length = 435 Score = 155 bits (391), Expect = 1e-34 Identities = 142/431 (32%), Positives = 187/431 (43%), Gaps = 25/431 (5%) Frame = +1 Query: 706 DSLSTDSYDMEKEQGSPERESFDTVEDSADCNAS-DSRDSGSPLF-TDKNVLECGVPEFE 879 D+ S D +D E G R+ S DC+A+ DS D P+F DKNV C +PE Sbjct: 53 DTRSGDEWD--NEAGKKVRDI------SHDCDANVDSPDKKDPVFYMDKNVTACDLPEIV 104 Query: 880 VCYRENDCQLLKDICIDEGSPEKDVNAIESGSSPLPPKEDPLLDADSFTAEPCGSKEGND 1059 VCY+EN ++KDIC+DEG P ++ S + L AD P SK D Sbjct: 105 VCYKENTYHVVKDICVDEGVPVQEKFLFGEKDSVKSSSTEDLTKADKTNVNPSESKSAED 164 Query: 1060 VIKLISQEE-----KLD-----SSLKNLFDKDSIKHCEPENTVETSEACFDETLSPKDSL 1209 + E K D SS ++ D + E+ + T EA SP L Sbjct: 165 SNTKVDDSEFCNNCKTDRDVEESSREDFADAEGSSAYNQEHLIVTEEA----KASPSHGL 220 Query: 1210 ADRKL-PIEDYGDQNSHGLDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSC 1386 ++ P E+ D+ + + +K L +L E D Q S Sbjct: 221 NPSEIEPDENSNDEVAISSETDSKESLTLGDILSRE------------------DEQKS- 261 Query: 1387 LPYNSKVENEIITFNFSSPEGAAASNGTAEDIEEKCSENLPAATSNVEDDSHKQSPVXXX 1566 + N SS S +D E++ E AA + + + PV Sbjct: 262 ----------LNHGNISSDSHEEQSPSQLQDKEKRSLET--AAIETELEKTEEPKPVEEK 309 Query: 1567 XXXXXXXXXXCANAPEASSEHKQANSDVISIDEAHSSEPNAPVVNQLQQD------MGET 1728 A+ +K N E H + N+ V N + D GET Sbjct: 310 LPS--------ASTTTLQEPNKTCNDPEKPETENHHQQ-NSLVENSYEDDKLSSSRFGET 360 Query: 1729 SFSAASM------ITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKAD 1890 SFSAA 
ITYSGPIA+SGSLS RSD STTSG+SFAFP+LQSEWNSSPVRMAKAD Sbjct: 361 SFSAAESVSISGHITYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKAD 420 Query: 1891 RRDFRKHKGWR 1923 +R R+ GWR Sbjct: 421 KR--RQKGGWR 429 >ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258367 isoform 2 [Solanum lycopersicum] Length = 554 Score = 154 bits (388), Expect = 2e-34 Identities = 177/611 (28%), Positives = 243/611 (39%), Gaps = 90/611 (14%) Frame = +1 Query: 367 MKENQNGTSFDSRSSGMQTDTLTFGSKERYSLNGSKMDANLAELEAKDRDDYSNVKVRDD 546 M NQNG S + D L F + + N + + KD + + V DD Sbjct: 1 MNGNQNGILGHSNGY-KEADALGFPVNDFGNTNVHDNREDPLACDRKDGNKFWEVPELDD 59 Query: 547 CSIDDENGKIDSPR-KEGHNGFSHDLIMDNGIQ--------DFPRMECKDGPAHLSSWDK 699 D N +I + ++ HN DL NG D P E + A S D Sbjct: 60 SIFFDNNDEIKASNVRDNHNV---DLSTINGDNRGGNPFACDIPSSETNEIVA-ASVTDD 115 Query: 700 DADSLST-------------DSYDMEKEQGSPERESFDTVEDSADCNASDSRDSGSPL-- 834 SLS D+ D + PE ES D ++D + ++ DS SP Sbjct: 116 QTGSLSNIIHTKRGGNPFECDTKDRNQPWNIPEYESLDFLDDKGN----ETIDSDSPFTS 171 Query: 835 -----------FTDKNVLECGVPEFEVCYRENDCQLLKDICIDEGSPEKDVNAIESGSSP 981 ++DK V + + E VCYREN+ ++KDIC+DEG P D ES Sbjct: 172 HSELFENNKHFYSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLTESW--- 228 Query: 982 LPPKEDPLLDADSFTAEP---CGSKEGNDV---IKLISQEEKLDS--------------- 1098 K+D L + S A+ +K+ D+ I +SQ+ + Sbjct: 229 ---KDDQLSTSVSVDADEEHQSNTKKSVDMGSSIATVSQDSSCEDAKNIAVTHGAEIEPT 285 Query: 1099 ----------SLKNLFDKDSIKHCEPENTVET-SEACFDE----TLSPKDSLADRKLPIE 1233 SL+N +KD+ K E+ + C S K S + + +E Sbjct: 286 GAPIPNDFNPSLENKANKDADKDSYLEDLLMIFGSKCTTNGKTTNASEKPSSPNTVVRVE 345 Query: 1234 DYGDQNSHGLDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNSK--V 1407 + + S +G++ L DQV ++T+ K NSK Sbjct: 346 ESNIKTS----DGDQSTLQPDQVPFDQTLKSQTAISAADESNNNKG--------NSKEGA 393 Query: 1408 ENEIITFNFSSPEGAAASNGTAEDIEEKCSENLPAATSNVEDDSHKQSPVXXXXXXXXXX 1587 I FN + PE + G E N+ +DSHK Sbjct: 394 GTNIFDFNLTKPESTTTTEGGVE---------------NLPEDSHK-------------- 424 Query: 1588 XXXCANAPEASSEHKQANSDVISI----------DEAHSSEPNAPVVNQLQQDM--GETS 1731 P+A S HK 
NSD IS D AH + + Q GE S Sbjct: 425 -------PKAVSVHKNGNSDNISASSQVPFANTADNAHQQHLESQNMANGQGHFADGEAS 477 Query: 1732 FSAA-----SMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRR 1896 FSAA ITYSGPI++SGSLS RS+ STTS +SFAFPVLQ+EWNSSPVRMAKA+RR Sbjct: 478 FSAARGPISGSITYSGPISYSGSLSLRSESSTTSTRSFAFPVLQNEWNSSPVRMAKAERR 537 Query: 1897 DFRKHKGWRSG 1929 K KGW+ G Sbjct: 538 RLSKQKGWKQG 548 >gb|AAM77645.1|AF517846_1 hypothetical protein [Arabidopsis thaliana] gi|41059759|gb|AAR99354.1| hypothetical protein At2g03810 [Arabidopsis thaliana] Length = 439 Score = 151 bits (381), Expect = 1e-33 Identities = 131/416 (31%), Positives = 180/416 (43%), Gaps = 27/416 (6%) Frame = +1 Query: 757 ERESFDTVEDSA-DCNAS-DSRDSGSPLF-TDKNVLECGVPEFEVCYRENDCQLLKDICI 927 E E+ V D++ DC+A+ DS + P+F DKNV C +PE CY+EN ++KDIC+ Sbjct: 61 ENEAGKKVRDTSHDCDANVDSPEKKDPVFYMDKNVTACDLPEIVACYKENTYHIVKDICV 120 Query: 928 DEGSPEKDVNAIESGSSPLPPKEDPLLDADSFTAEPCGSKEGNDVIKLISQEE-----KL 1092 DE P ++ S + L+ AD P +K D I + E K Sbjct: 121 DESVPVQEKFLFGEKDSVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEFCNDHKT 180 Query: 1093 DSSLKNLFDKD------SIKHCEPENTVETSEACFDETLSPKDSLADRKL-PIEDYGDQN 1251 D ++ +D + + E+ + T E SP L+ ++ P E+ D+ Sbjct: 181 DRDVEESSGEDFADAEGTSSNYNQEHLIVTEEVX----ASPTHGLSPSEIEPDENSKDEV 236 Query: 1252 SHGLDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNSKVENEIITFN 1431 + D +K L +L E D Q S + N Sbjct: 237 AISQDNDSKECLTLGDILSRE------------------DEQKS-----------LNQDN 267 Query: 1432 FSSPEGAAASNGTAEDIEEKCSENLPAATSNVEDDSHKQSPVXXXXXXXXXXXXXCANAP 1611 SS S +D E++ E T + + KQ + Sbjct: 268 ISSDSHEEQSPSQLQDKEKRSLETTAIETELEKTEEPKQGE----------EKLSSVSTT 317 Query: 1612 EASSEHKQANSDVISIDEAHSSEPNAPVVNQLQQD------MGETSFSAASM------IT 1755 + +K N E H + N V N + D GETSFSAA IT Sbjct: 318 TSQEPNKTCNEPEKPETENHHQQ-NCLVENSYEDDKFSSSRFGETSFSAADSVSISGHIT 376 Query: 1756 YSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWR 1923 YSGPIA+SGSLS RSD STTSG+SFAFP+LQSEWNSSPVRMAKAD+R R+ GWR Sbjct: 377 YSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWR 430 
>ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254294 [Solanum lycopersicum] Length = 532 Score = 147 bits (372), Expect = 2e-32 Identities = 145/536 (27%), Positives = 240/536 (44%), Gaps = 49/536 (9%) Frame = +1 Query: 463 NGSKMDANLAELEAKDR-DDYSNVKVRDDCSIDDENGKIDSPRKEGHNGFSHDLIMDNGI 639 NG K D+ L KD D +D + + E + + ++ + F D+ N + Sbjct: 13 NGCK-DSKSLVLPTKDLLDSNGRDSTKDSLACEKEKNEFWNVQELDDSVFIEDISRSNKL 71 Query: 640 QDFP---RMECKDGPAHLSSWDKDADSLSTDSYDMEKEQGSP------------ERESFD 774 ++ + + + P+HL+S ++ + + D+ D + P ++E Sbjct: 72 ENRASPLKDDPDEAPSHLTSCKRNGNPFACDTADRDHPWSIPKFEDPIIVNFFDDKEKET 131 Query: 775 TVEDSADCNASDSRDSGSPLFTDKNVLECGVPEFEVCYRENDCQLLKDICIDEGSPEKDV 954 V + + S+ + + L+TDK VLE +PE +CY+END ++KDIC+DEG P D Sbjct: 132 VVSSTQFTSLSELFGADTHLYTDKGVLEFELPESTICYKENDYNIMKDICMDEGVPLMDK 191 Query: 955 NAIESGSSPLPPKEDPLLDADSFTAEPCGSKEGNDVIKLISQEEK---LDSSLKNLFDKD 1125 ES P D + + +P ++EG D + S E K ++S++K D Sbjct: 192 IVTESRKYDQP---DSSISLAADEHQPRITREGVDSELVSSGESKASSVESAVKISVDHH 248 Query: 1126 SIKHCE--------------PENTVETSE-----------ACFDETLSPKDSLADRKLPI 1230 + K E +N + +E D T++ ++++++ Sbjct: 249 TTKEDEGNKSLVPNGINPFLEDNMSKDAEKDPYLDVMKIFGSKDTTMAKPTNISEKESDS 308 Query: 1231 EDYGDQNS---HGLDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNS 1401 +++ + NS + N++ + + TV T S NS Sbjct: 309 QNFKESNSDADQSAQQANQMPTSVEAFNSQYTVSPADG--------TNNYGPGSNFSNNS 360 Query: 1402 KVENEIITFNFSSPEGAAASNGTAEDIEEKCSENLPAATSNVEDDS-HKQSPVXXXXXXX 1578 K ++ IT +F+ E A +S+ T D ++LP + +E S K Sbjct: 361 KSKSGAITCDFNLTELALSSSVTKSD------KHLPEQSHKLEAVSGQKDGSSDSFSAAT 414 Query: 1579 XXXXXXCANAPEASSEHKQANSDVISIDEAHSSEPNAPVVNQLQQDMGETSFSAAS-MIT 1755 ++ +S+ H +V +++E +SS V GE SF AS +I+ Sbjct: 415 QVHFANSVDSSNSSTIHADP-PNVANLEEKNSSSIPLGVHGHFAN--GEASFGPASGLIS 471 Query: 1756 YSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWR 1923 YSG IA SG++S RSD STTS +SFAFPVLQSEWNSSPVRMAKA+RR + KGWR Sbjct: 472 YSGHIAHSGNISLRSDSSTTSARSFAFPVLQSEWNSSPVRMAKAERRHY---KGWR 524 >ref|XP_002514588.1| conserved hypothetical protein 
[Ricinus communis] gi|223546192|gb|EEF47694.1| conserved hypothetical protein [Ricinus communis] Length = 488 Score = 146 bits (368), Expect = 4e-32 Identities = 129/429 (30%), Positives = 187/429 (43%), Gaps = 55/429 (12%) Frame = +1 Query: 808 DSRDSGSPLFTDKNVLECGVPEFEVCYRENDCQLLKDICIDEGSPEKDVNAIESGSSPLP 987 +S D S + DKNV+E +PE +CY+EN ++KDIC+DEG +P Sbjct: 104 ESFDKDSVFYIDKNVMEPELPELVLCYKENTYHVVKDICVDEG---------------VP 148 Query: 988 PKEDPLLDADSFTAEPCGSKEGNDVIKLISQEEK--LDSSLKNLFDKDSIKHCEPENTVE 1161 +E+ L D + C IK Q+E+ LD S + L D+ C+ + ++ Sbjct: 149 SQENFLFDTSVDQEKLCPYLIPEKDIKSEIQKERVDLDMSTQYLSKNDNSFKCDSKESMA 208 Query: 1162 TSEACFDETLSPKDSLADRKLPIEDYGDQNSHGLDEGNKVMLLSDQVLDEETVXXXXXXX 1341 +E D++ + I +Y + + L E +LL +V+ E Sbjct: 209 IAEI-------EDDAMEE----IANYTSKETFSLGE----LLLMPEVVAE---------- 243 Query: 1342 XXXXXXTGKDVQSSCLPYNSKVENEIITFNFSSPEGAAASNGTAEDIEEKCSENL---PA 1512 + S NS E E ++ S A+ E+ + + L PA Sbjct: 244 ----------LSHSKSLLNSTDEAEQLSIQRPSENIVLATASACEESKYATEQFLLVTPA 293 Query: 1513 ATSNVEDDSHKQSPVXXXXXXXXXXXXXCAN--------APEASSEH--------KQANS 1644 VE+ H+++ + + AP ++E K + Sbjct: 294 VDPLVEESGHEEAKLGTLTSDSSPKASDHGHDEVILASLAPSYATEEPENGAKAAKSPSH 353 Query: 1645 DVISIDEAHSSEPNA------------------------------PVVNQLQQDMGETSF 1734 + S+ + +SS P A P QLQ GE+SF Sbjct: 354 TLDSVSDLNSSAPTASGGEEGSQVGGSEHLESRNSSRHEDTSITEPFSGQLQYSHGESSF 413 Query: 1735 SAAS----MITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDF 1902 SAA +I+YSGPIA+SGSLS RSD STTS +SFAFP+LQSEWNSSPVRMAKADRR F Sbjct: 414 SAAGPLSGLISYSGPIAYSGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHF 473 Query: 1903 RKHKGWRSG 1929 RKH+ WR G Sbjct: 474 RKHRSWRQG 482 >ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Citrus sinensis] Length = 483 Score = 145 bits (366), Expect = 8e-32 Identities = 141/470 (30%), Positives = 207/470 (44%), Gaps = 65/470 (13%) Frame = +1 Query: 715 STDSYDMEKEQGSPERESFDTVEDSADCNASDSRDSGSPLF-TDKNVLECGVPEFEVCYR 891 ST D+ K+ ++ +E + + P+F DK+V EC +PE VCY+ Sbjct: 48 
STSLNDLAKDN----EKNVQDLESPNSHSCGEMESFREPVFYMDKSVTECELPELIVCYK 103 Query: 892 ENDCQLLKDICIDEGSPEKDVNAIESG-----SSPLPPKEDPLLDADSFTAEPCGSKEGN 1056 EN + KDICIDEG D ES S LPPKED +S E + N Sbjct: 104 ENTYHV-KDICIDEGVHSHDRILFESDVGKSVRSFLPPKED----RNSELLE----ESKN 154 Query: 1057 DVIKLISQEEKLDSSLKNLFDKDSIKHC----EPENTVETSEACFDETLSPKDSLADRKL 1224 VI + + L SS +N D+ + C E ++ + + C + L P + D Sbjct: 155 SVIPI---PDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPAGDVKD--- 208 Query: 1225 PIEDYGDQNSHGLDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNSK 1404 D ++N++ D K+ LL D +L V T + S + ++ Sbjct: 209 ---DATEENTN--DVSRKLFLLGD-LLSMHNVG------------TKNSLSKSAI--GNE 248 Query: 1405 VENEIITFNFSSPEGAAAS-----NGTAEDI-------------EEKCSEN------LPA 1512 ++ E +F SS + A A+ GTAE+I + C E L + Sbjct: 249 IDAEKESFQGSSAKAALANPEEANGGTAEEILTGADFVSASEESQNGCGEGISGNPTLVS 308 Query: 1513 ATSNVEDDSHKQSPVXXXXXXXXXXXXXCANAP-----------------EASSEHKQAN 1641 A+ D S + S + A +AS+ Sbjct: 309 ASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGASGK 368 Query: 1642 SDVISIDEAHSSE----------PNAPVVNQLQQDMGETSFSAA----SMITYSGPIAFS 1779 + + I ++ E P V +Q +GE+SFSAA S+I+YSGP+A+S Sbjct: 369 EEPLQIGDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSGPVAYS 428 Query: 1780 GSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSG 1929 GS+S RSD STTS +SFAFP+LQ+EW+ SPVRMAKADRR +RKHK W+ G Sbjct: 429 GSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHK-WKQG 477 >ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Citrus sinensis] gi|568861537|ref|XP_006484257.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Citrus sinensis] Length = 496 Score = 145 bits (366), Expect = 8e-32 Identities = 141/470 (30%), Positives = 207/470 (44%), Gaps = 65/470 (13%) Frame = +1 Query: 715 STDSYDMEKEQGSPERESFDTVEDSADCNASDSRDSGSPLF-TDKNVLECGVPEFEVCYR 891 ST D+ K+ ++ +E + + P+F DK+V EC +PE VCY+ Sbjct: 61 STSLNDLAKDN----EKNVQDLESPNSHSCGEMESFREPVFYMDKSVTECELPELIVCYK 116 Query: 892 ENDCQLLKDICIDEGSPEKDVNAIESG-----SSPLPPKEDPLLDADSFTAEPCGSKEGN 1056 EN + 
KDICIDEG D ES S LPPKED +S E + N Sbjct: 117 ENTYHV-KDICIDEGVHSHDRILFESDVGKSVRSFLPPKED----RNSELLE----ESKN 167 Query: 1057 DVIKLISQEEKLDSSLKNLFDKDSIKHC----EPENTVETSEACFDETLSPKDSLADRKL 1224 VI + + L SS +N D+ + C E ++ + + C + L P + D Sbjct: 168 SVIPI---PDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPAGDVKD--- 221 Query: 1225 PIEDYGDQNSHGLDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNSK 1404 D ++N++ D K+ LL D +L V T + S + ++ Sbjct: 222 ---DATEENTN--DVSRKLFLLGD-LLSMHNVG------------TKNSLSKSAI--GNE 261 Query: 1405 VENEIITFNFSSPEGAAAS-----NGTAEDI-------------EEKCSEN------LPA 1512 ++ E +F SS + A A+ GTAE+I + C E L + Sbjct: 262 IDAEKESFQGSSAKAALANPEEANGGTAEEILTGADFVSASEESQNGCGEGISGNPTLVS 321 Query: 1513 ATSNVEDDSHKQSPVXXXXXXXXXXXXXCANAP-----------------EASSEHKQAN 1641 A+ D S + S + A +AS+ Sbjct: 322 ASEKAHDKSEEASLASPDGVSALSESTKISTAEKSSYNSMVETGSITFDFDASAPGASGK 381 Query: 1642 SDVISIDEAHSSE----------PNAPVVNQLQQDMGETSFSAA----SMITYSGPIAFS 1779 + + I ++ E P V +Q +GE+SFSAA S+I+YSGP+A+S Sbjct: 382 EEPLQIGDSQRIETPGMSRLEDAPRQSVSSQFHSGLGESSFSAAGSLPSLISYSGPVAYS 441 Query: 1780 GSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSG 1929 GS+S RSD STTS +SFAFP+LQ+EW+ SPVRMAKADRR +RKHK W+ G Sbjct: 442 GSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKADRRHYRKHK-WKQG 490 >ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Capsella rubella] gi|482559818|gb|EOA24009.1| hypothetical protein CARUB_v10017222mg [Capsella rubella] Length = 455 Score = 144 bits (363), Expect = 2e-31 Identities = 146/457 (31%), Positives = 197/457 (43%), Gaps = 46/457 (10%) Frame = +1 Query: 691 WDKDADSLSTDSYDMEKEQGSPERESFDTVEDSADCNAS-DSRDSGSPLF-TDKNVLECG 864 WDK+ D + + G + T + S D A DS + +P+F DKNV C Sbjct: 60 WDKENDGNILEPHSC----GDADEAGKKTRDTSHDFVAKGDSPEKVNPVFYMDKNVTACD 115 Query: 865 VPEFEVCYRENDCQLLKDICIDEGSP--------EKD-----VNAIESGSSPL------- 984 +PE VCY+EN ++KDIC+DEG P EKD N+ GS L Sbjct: 116 LPEIVVCYKENSYHVVKDICVDEGVPVQEKFLFGEKDSVKSTTNSNHCGSVDLMKVDKTD 175 Query: 985 --PPKEDPLLDADSF---TAEPCGSKEGNDVIKLISQEEKLDSSLKNLFDKD-------- 1125 P 
+ L D++S ++E C K DV + S+E D+ + +D++ Sbjct: 176 VKPSETKSLEDSNSKVDDSSEVCNDKTVQDVEES-SREAFADAEGSSNYDQEHLIVTSPT 234 Query: 1126 -SIKHCEPENTVETSEACFDETLSPKDSLADRKLPIEDY----GDQNSHGLDEGNKVMLL 1290 ++K E VE+ E DE + + L + D Q S D GN+ L Sbjct: 235 LALKPSEISLEVESEEISKDEVVISSEDFLSESLTLGDILSREDKQKSLKNDNGNRPEEL 294 Query: 1291 SDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNSKVENEIITFNFSSPEGAAASNGT 1470 S E+ TG D + KVE T Sbjct: 295 SPPQHQEKE--------KRSLETTGLDTKLE------KVEEP----------------KT 324 Query: 1471 AEDIEEKCSENLPAATSNVEDDSHKQSPVXXXXXXXXXXXXXCANAPEASSEHKQANSDV 1650 AE ENL +A++ + +K C + + +E+ Q N V Sbjct: 325 AE-------ENLSSASTTTVQEPNKS----------------CNDLEKPETENHQQNRLV 361 Query: 1651 ISIDEAHSSEPNAPVVNQLQQDMGETSFSAASM------ITYSGPIAFSGSLSHRSDGST 1812 S ++ S GETSFSAA ITYSGPIA+SGSLS RSD ST Sbjct: 362 NSYEDDKLSSSR----------FGETSFSAAESVSISGHITYSGPIAYSGSLSVRSDAST 411 Query: 1813 TSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWR 1923 TSG+SFAFP+LQSEWNSSPVRMAKAD+R R+ GWR Sbjct: 412 TSGRSFAFPILQSEWNSSPVRMAKADKR--RQKGGWR 446 >ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Solanum tuberosum] gi|565395867|ref|XP_006363557.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Solanum tuberosum] gi|565395869|ref|XP_006363558.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Solanum tuberosum] gi|565395871|ref|XP_006363559.1| PREDICTED: dentin sialophosphoprotein-like isoform X4 [Solanum tuberosum] Length = 532 Score = 144 bits (362), Expect = 2e-31 Identities = 159/543 (29%), Positives = 235/543 (43%), Gaps = 56/543 (10%) Frame = +1 Query: 463 NGSKMDANLAELEAKDR-DDYSNVKVRDDCSIDDENGKIDSPRKEGHNGFSHDLIMDNGI 639 NG K D+ L KD D +D + + E + + ++ + F D+ N Sbjct: 14 NGCK-DSKSLVLPTKDLLDSNGRDGTKDSLACEKERNEFWNVQELDDSEFFEDISRSNK- 71 Query: 640 QDFPRMECKDGP----AHLSSWDKDADSLSTDSYDMEKEQGSPERES------FD----- 774 + KD P ++L+S ++ + + D+ D + P+ E FD Sbjct: 72 HEIRASPLKDDPIEALSNLTSCKRNGNPFACDTADRDHPWSIPKFEDPMIVNFFDDKEKE 131 Query: 775 TVEDSADCNA-SDSRDSGSPLFTDKNVLECGVPEFEVCYRENDCQLLKDICIDEGSPEKD 
951 TV SA + S+ + + L+TDK VLE +PE +CY EN+ ++KDIC+DEG P D Sbjct: 132 TVVSSAQFTSLSELFGTNTHLYTDKGVLEFKLPELTICYNENNYNIMKDICMDEGVPLMD 191 Query: 952 VNAIESGSSPLPPKEDPLLDADSFTAEPCGSKEGNDVIKLISQEEKLDSSLKNLFDKDSI 1131 ES P L + +P ++EG D +L+S E DSS++N K S+ Sbjct: 192 KIVTESRKYHQPDSSISLAVDEH---QPRNTREGVDS-ELVSSGESKDSSVENAV-KISV 246 Query: 1132 KHCEPENTVETSEACFDETLSPKDSLADRKLPIEDYGDQNSHGLDEGNKVMLLSDQVLDE 1311 H + +T ++L P + + Y D++S LD K+ D + Sbjct: 247 DHHTTKEDEDT------KSLGPNGINPFLEDNMSKYADKDS-SLDV-MKIFGSKDTTTAK 298 Query: 1312 ETVXXXXXXXXXXXXXTGKDVQSSCLPYNSKVENEIITFNFSSPEGAA------------ 1455 T + D + S L N ++ + FN + AA Sbjct: 299 ATNISENESDIQNLKESNSDAEQSALQAN-QIPTFVAAFNSQNTVSAADGTNNNGPGSNF 357 Query: 1456 ASNGTAEDIEEKCSENLPA-ATSNVEDDSHKQSPVXXXXXXXXXXXXXCANAPEASSEHK 1632 ++N +E C NL A S+ S K P ++ EA S K Sbjct: 358 SNNSKSESGAITCDFNLTELALSSSVAKSDKHLPEQ-------------SHKLEAVSSQK 404 Query: 1633 QANSDVIS----------IDEAHSS-EPNAPVVNQLQQDM--------------GETSFS 1737 +SD S +D +SS + P V L++ GE SF Sbjct: 405 DGSSDSFSAATQVHFANSVDSCNSSIHADPPNVANLEEKNSGSIPLGVHGHFANGEASFG 464 Query: 1738 AAS-MITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHK 1914 AS +I+YSG I SG++S RSD STTS +SFAFPVLQSEWNSSPVRMAKA+RR + K Sbjct: 465 PASGLISYSGHITHSGNISLRSDSSTTSARSFAFPVLQSEWNSSPVRMAKAERRHY---K 521 Query: 1915 GWR 1923 GWR Sbjct: 522 GWR 524 >emb|CBI27399.3| unnamed protein product [Vitis vinifera] Length = 435 Score = 139 bits (351), Expect = 4e-30 Identities = 141/481 (29%), Positives = 205/481 (42%), Gaps = 37/481 (7%) Frame = +1 Query: 598 HNGFSHDLIMDNGIQDFPRMECKDGPAHLSSWDKDADSLSTDSYDMEKEQGSPERESFDT 777 +N +S D + + + K L ++DAD L + ++ + ER+ Sbjct: 25 YNDYSLDTAVPKSGNEIVKENQKVISCDLKGHERDADPLDGE----DRFWNTSERDCSIN 80 Query: 778 VEDSADCNASDSRDS----------------GSPLFTDKNVLECGVPEFEVCYRENDCQL 909 V+D A+ ++ R+S + TDK+V + +P VC E+ Sbjct: 81 VDDIANACGNEVRNSVATCVVSSEKLESFEKDGDMCTDKSVTKHELP---VCCEESTYHA 137 Query: 910 LKDICIDEG--SPEKDVNAIESGSSP-------LPPKEDPLLDADSFTAE-----PCGSK 1047 +KDICIDEG SPEK + +E+G LPP D +D TA+ P G 
K Sbjct: 138 VKDICIDEGMLSPEKIL--VENGKEEHEGFCPFLPPDTDKNVDPTKETADKELPLPDGQK 195 Query: 1048 EG--NDVIKLISQEEKLDSSLKNLFDKDSIKHCEPENTVETSEACFDETLSPKDSLADRK 1221 ND K + QEE+ + + S + PE+ E + S +S Sbjct: 196 ASAENDCGKDLMQEEENYDARDKIISDTSEEKIVPEDIFLIPE--LSKANSMPESSEFNG 253 Query: 1222 LPIEDYGDQNSHGLDEGNKVMLLSDQVLDEETVXXXXXXXXXXXXXTGKDVQSSCLPYNS 1401 + IE QN +G L+S+ EE+ K+ + L YNS Sbjct: 254 MEIEHQCIQNPNGEAVLENPALVSEA---EES---------------DKNSFPNELSYNS 295 Query: 1402 KVENEIITFNFSSPEGAAASNGTAEDIEEKCSENLPAAT-SNVEDDSHKQSPVXXXXXXX 1578 K+E+ ITF+F S + S + C L + S +ED S Sbjct: 296 KLESGTITFDFGSSTTSMDSGREVSPQNDGCEPPLESQNLSKLEDGSE------------ 343 Query: 1579 XXXXXXCANAPEASSEHKQANSDVISIDEAHSSEPNAPVVNQLQQDMGETSFSAA----S 1746 + P Q+Q+ +GE+SFSAA + Sbjct: 344 -----------------------------------SLPFSGQIQRGLGESSFSAAGPSSA 368 Query: 1747 MITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRS 1926 +I+YSG I SG++S RSD STTS +SFAFPVLQ+EWNSSPVRMAKA+RR RKH+ WR Sbjct: 369 LISYSGQITHSGNISLRSDSSTTSTRSFAFPVLQTEWNSSPVRMAKAERRHLRKHRSWRR 428 Query: 1927 G 1929 G Sbjct: 429 G 429 >ref|XP_007222119.1| hypothetical protein PRUPE_ppa004630mg [Prunus persica] gi|462419055|gb|EMJ23318.1| hypothetical protein PRUPE_ppa004630mg [Prunus persica] Length = 499 Score = 132 bits (331), Expect = 9e-28 Identities = 130/488 (26%), Positives = 204/488 (41%), Gaps = 85/488 (17%) Frame = +1 Query: 712 LSTDSYDMEKEQGSPE--RESFDTVED-----SADCNASDSRDSGSPLFTDKNVLECGVP 870 +S S D E++ G + D V+D + ++ + S + DK+V+EC +P Sbjct: 8 VSCGSKDNEEDAGQVPYVKNDEDEVKDFVPPYTLSSEKLEALEKESDYYMDKSVMECELP 67 Query: 871 EFEVCYRENDCQLLKDICIDEGSPEKDVNAIESGSSP------LPPKEDP---LLDA--- 1014 E VCY+E+ C +KDICIDEG P +D N E+G L P ED LL+ Sbjct: 68 ELIVCYKESSCNTIKDICIDEGVPSQDKNRFETGVDEKECCTFLSPDEDQNKQLLEEQMD 127 Query: 1015 ------DSFTAE-----------PCGSK---EGNDVIKLISQEEKLDSSLKNLFDKDSIK 1134 D F + PC SK + D I ++ +++ S + F + + Sbjct: 128 IVVTLPDGFKSSAHDDLEKGFVIPCDSKGLTQIGDAIYYTQEKTEIEVSKEIFFPANVLP 187 Query: 1135 
---------HCEPENTVETSEACFDE--------------------TLSPKDSLADRKLP 1227 H + E++EA D +++ + S +++K Sbjct: 188 MQELGAGNAHSSKSSNEESTEAVQDTVQSSGEKVSEIAQTGSTAVVSVTEESSHSEKKAL 247 Query: 1228 IEDYGDQNSHGLDEGNKVMLLSDQV---LDEETVXXXXXXXXXXXXXTGKDVQSSCLPYN 1398 + + N H + N + + L + +V K ++ +P Sbjct: 248 VSAAEESNFHVDELSNNSKVENGSTTSGLSDTSVHVSTTRDACPDNDVHKHFETQTMPAG 307 Query: 1399 SKVEN--------EIITFNFSSPEGAAASNGTAEDIEEKCSENLPAATSNVEDDSHKQSP 1554 ++ EI+ P A G E E + L ++++ DD S Sbjct: 308 DDGDDNDDNMPDAEIVPSQVQ-PCSAPVVTGREECPENGVCQPLDTSSTSKVDDEIPHSV 366 Query: 1555 VXXXXXXXXXXXXXCANA--PEASSEHKQANSDVISIDEAHSSEPNAPVVNQLQQDMGET 1728 + + PE S+ + + +S A +Q+ GE+ Sbjct: 367 IVSSQVQHYSAPVTISREERPENGVWQCPETSNAFMVGDVNSDTQYASF--HVQRGFGES 424 Query: 1729 SFSAA----SMITYSGPIAFSGSLSHRSDGSTTSGKSFAFPVLQSEWNSSPVRMAKADRR 1896 SFSAA S++ SGP +SG++S RS+ STTS +SFAFPVLQSEWNSSPVRMAKADRR Sbjct: 425 SFSAAGHFSSLMNTSGP--YSGNVSLRSESSTTSTRSFAFPVLQSEWNSSPVRMAKADRR 482 Query: 1897 DFRKHKGW 1920 RKH+GW Sbjct: 483 HLRKHRGW 490 >ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217989 [Cucumis sativus] gi|449523672|ref|XP_004168847.1| PREDICTED: uncharacterized protein LOC101224727 [Cucumis sativus] Length = 431 Score = 130 bits (326), Expect = 3e-27 Identities = 115/402 (28%), Positives = 183/402 (45%), Gaps = 12/402 (2%) Frame = +1 Query: 760 RESFDTVEDS--ADCNASDSRDSGSPLFT--DKNVLECGVPEFEVCYRENDCQLLKDICI 927 R S D +D+ +A + + P F+ DK+V+EC + + VC +E + +KDICI Sbjct: 75 RSSTDVFDDNNAEGISAFGASSNMKPSFSYVDKSVMECQMSKTIVCDQEVNVNDVKDICI 134 Query: 928 DEGSPEKDVNAIESGSSPLPPKEDPLLDADSFTAEPCGSKEGNDVIKLISQEEKLDSSLK 1107 D+G + +S + K PL + D + ++V K I+ + K+ SL+ Sbjct: 135 DDGVASLENFFFKSTAEKSISKISPL-EEDRNEGSIKEKETSSEVSKFIADDRKV--SLE 191 Query: 1108 NLFDKDSIKHCEPENTVETSEACFDETLSPKDSLADRKLPIEDYGDQNSHGLD---EGNK 1278 + F D H + ++ + E ++ + L +KL Y ++ + G K Sbjct: 192 DHFAMDWTTHNDAKDLTQIEE---EKLNLSEPELLMQKLVKRSYSSESLDKIGLQISGEK 248 Query: 1279 VMLLSDQVLDEETVXXXXXXXXXXXXXTGKD-VQSSCLPYNSKVENEIITFNFSSPEGAA 1455 L + KD + + YN + EN I F+S A 
Sbjct: 249 TNLEDPSSASKSVDSCNDTPALDSAAEPPKDNIPAHPSGYNDEFENGSIALTFNSISPVA 308 Query: 1456 ASNGTAEDIEEKCSENLPAATSNVEDDSHKQSPVXXXXXXXXXXXXXCANAPEASSEHKQ 1635 NG E+ +E C + + V + E++ Sbjct: 309 --NG-GEERQECCGRSDSVIGTQVL----------------------------TNLEYRT 337 Query: 1636 ANSDVISIDEAHSSEPNAPVVNQLQQDMGETSFSA----ASMITYSGPIAFSGSLSHRSD 1803 ++S ++S H D+GE+SFSA AS++TYSGP+A+SGS+S RS+ Sbjct: 338 SDSRLLSSQNMH--------------DIGESSFSAVDPLASLVTYSGPVAYSGSISLRSE 383 Query: 1804 GSTTSGKSFAFPVLQSEWNSSPVRMAKADRRDFRKHKGWRSG 1929 STTS +SFAFP+LQSEWNSSPV+M KA+RR +RK++GWR G Sbjct: 384 SSTTSTRSFAFPILQSEWNSSPVKMVKAERRHYRKYRGWREG 425