BLASTX nr result
ID: Catharanthus23_contig00004536
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Query= Catharanthus23_contig00004536
         (2121 letters)

Database: ./nr
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done

                                                                      Score    E
Sequences producing significant alignments:                           (bits) Value

ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258...   171   8e-43
ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [...   175   6e-41
emb|CBI27399.3| unnamed protein product [Vitis vinifera]              173   3e-40
ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like i...   172   4e-40
ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254...   171   1e-39
ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258...   167   2e-38
gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, ...   149   4e-33
gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, ...   149   4e-33
ref|XP_002514588.1| conserved hypothetical protein [Ricinus comm...   142   4e-31
ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217...   140   3e-30
gb|EMJ23318.1| hypothetical protein PRUPE_ppa004630mg [Prunus pe...   134   2e-28
ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like i...   127   3e-26
ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like i...   127   3e-26
ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arab...   125   7e-26
ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Pop...   124   1e-25
ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Caps...   122   6e-25
gb|ESW25465.1| hypothetical protein PHAVU_003G038300g [Phaseolus...   119   5e-24
gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thali...   119   7e-24
ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related...
119 7e-24 ref|XP_002266889.2| PREDICTED: uncharacterized protein LOC100247... 118 1e-23 >ref|XP_004239018.1| PREDICTED: uncharacterized protein LOC101258367 isoform 1 [Solanum lycopersicum] Length = 586 Score = 171 bits (434), Expect(2) = 8e-43 Identities = 170/587 (28%), Positives = 248/587 (42%), Gaps = 88/587 (14%) Frame = +1 Query: 337 DTDKPKLSMMEDKIGIVCDSNAYGKETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDF 516 DT + K +M ++ GI+ SN Y KE D L F +D + N H + L +D + F Sbjct: 25 DTAEEKPTMNGNQNGILGHSNGY-KEADALGFPVNDFGNTNVHDNREDPLACDRKDGNKF 83 Query: 517 W-------------NSAVFKSSLLDD-----STRSNDNEPGGSP----VDHLNGFEIDAE 630 W N + S++ D+ ST + DN GG+P + EI A Sbjct: 84 WEVPELDDSIFFDNNDEIKASNVRDNHNVDLSTINGDNR-GGNPFACDIPSSETNEIVAA 142 Query: 631 S---------------------FLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSE 747 S F DT+D+N E + +D G E DS Sbjct: 143 SVTDDQTGSLSNIIHTKRGGNPFECDTKDRNQPWNIPE--YESLDFLDDKGNETIDSDSP 200 Query: 748 TSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESS 927 ++ FE++ ++DK V + EL + VCY+E+N VKDIC+DEGV +K+L ES Sbjct: 201 FTSHSELFENNKHFYSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLTESW 260 Query: 928 KDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXX 1107 KD+ + VS +DE+ T D+ + + SS E ++ A GAE I+ Sbjct: 261 KDDQLSTSVSVDADEEHQSNTKKSVDMGSSIATVSQDSSCEDAKN-IAVTHGAE-IEPTG 318 Query: 1108 XXXXXXXXXXXXXXA---CVSEELILQKALLECSKC------------------------ 1206 A + + ++ SKC Sbjct: 319 APIPNDFNPSLENKANKDADKDSYLEDLLMIFGSKCTTNGKTTNASEKPSSPNTVVRVEE 378 Query: 1207 ------DEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1368 D D+ + QPD+VP +++ A++ E + G NSK GT FD Sbjct: 379 SNIKTSDGDQSTLQPDQVPFDQTLKSQTAISAADESNNNKG-------NSKEGAGTNIFD 431 Query: 1369 FNSSKPNVSNSIDASAELTTG-----KAPETEDEKPSDRF-ASSPAQLVNNEGKIKENPS 1530 FN +KP + + + E KA SD ASS N +N Sbjct: 432 FNLTKPESTTTTEGGVENLPEDSHKPKAVSVHKNGNSDNISASSQVPFANTA----DNAH 487 Query: 1531 DRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTS-GPVSGLITYSGPIAYSGNVXX 1707 + L ++ +N + G+F+ DGE+SFS + GP+SG ITYSGPI+YSG++ Sbjct: 488 
QQHLESQNMANGQ----GHFA--------DGEASFSAARGPISGSITYSGPISYSGSLSL 535 Query: 1708 XXXXXXXXXXXFAFPILQNEWNSSPVRMEKAR-----KHRGWRHGLL 1833 FAFP+LQNEWNSSPVRM KA K +GW+ GLL Sbjct: 536 RSESSTTSTRSFAFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLL 582 Score = 31.6 bits (70), Expect(2) = 8e-43 Identities = 13/21 (61%), Positives = 17/21 (80%) Frame = +3 Query: 273 MFASQLLRILQTIPTDVDHET 335 MFASQLLR+L+T+P D E+ Sbjct: 1 MFASQLLRLLETLPADTSSES 21 >ref|XP_006348792.1| PREDICTED: dentin sialophosphoprotein-like [Solanum tuberosum] Length = 586 Score = 175 bits (444), Expect = 6e-41 Identities = 166/583 (28%), Positives = 251/583 (43%), Gaps = 84/583 (14%) Frame = +1 Query: 337 DTDKPKLSMMEDKIGIVCDSNAYGKETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDF 516 DT + K +M ++ GI+ SN Y KE D L +D + N H + L +D ++F Sbjct: 31 DTAEEKPTMNGNQNGILSHSNGY-KEADSLGIPVNDFGNTNVHDNKEDPLACDRKDGNEF 89 Query: 517 W-------------NSAVFKSSLLDDS----TRSNDNEPGGSP----------------- 594 W N+ + S++ DD ++ N + GG+P Sbjct: 90 WEVPELDDSIFFDNNNEIKASNVRDDHNVDLSKINGDNRGGNPFACDIPSSETNEIVAAS 149 Query: 595 -VDHLNG-------FEIDAESFLFDTRDKNGA-QITEEAAHSVMDGQTANGIEEESKDSE 747 D NG + F DT+D++ I E + +D + E E+ DS+ Sbjct: 150 VTDDQNGGLSNIIHSKRGGNPFECDTKDRDQPWNIPEYESLGFLDDK-----ENETIDSD 204 Query: 748 TSTVPHT--FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIE 921 + H+ F+S+ ++DK V + ELP+ VCY+E+N VKDIC+DEGV +K+LIE Sbjct: 205 SPFTSHSELFDSNKHFYSDKGVTDHELPELTVCYRENNFNMVKDICMDEGVPAVDKVLIE 264 Query: 922 SSKDEHAGSVVSQPSDEDRYGGT---------------------------TNDPDIEFF- 1017 S KD + VS +DE++ T T+D +IE Sbjct: 265 SWKDGQPSTSVSVDADEEQQSNTRKSVDMGSTIASVSQDSSFKDAKNIAVTHDTEIEATG 324 Query: 1018 --VPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELILQKALL 1191 VP+GF S + D + E D + ++ + ++++ + Sbjct: 325 APVPNGFNPSLENNANKDADKDSYLE--DLLMIFGSKCTTNASEKPSSLNTVVRVEESNI 382 Query: 1192 ECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDF 1371 + S D D+ + QPD+VP E L+S S Q+ G N K GT FD Sbjct: 383 KTS--DGDQSTLQPDQVPS-EQTLKSQTAVSASGQTNNKG-------NIKEGVGTSIFDV 432 Query: 1372 
NSSKPNVSNSIDASAELTTGKAPETEDEKPSDRFASSPAQLVNNEGKIK---ENPSDRKL 1542 N +KP + + + G PE + P NN + N +D Sbjct: 433 NLTKPESTKTTEGG----VGNLPE-DSHMPKAVSVHKNGNSDNNSASSQVPFANTADNAH 487 Query: 1543 LRKDASNDEIGNSGNFSVSSYLERGDGESSFSTS-GPVSGLITYSGPIAYSGNVXXXXXX 1719 + S + +F+ DGE+SFS + GP+SG ITYSGPI+YSG+V Sbjct: 488 QQHLESQNMANGQSHFA--------DGEASFSAARGPISGSITYSGPISYSGSVSLRSES 539 Query: 1720 XXXXXXXFAFPILQNEWNSSPVRMEKAR-----KHRGWRHGLL 1833 FAFP+LQNEWNSSPVRM KA K +GW+ G+L Sbjct: 540 STTSTRSFAFPVLQNEWNSSPVRMAKAERRRLSKQKGWKQGIL 582 >emb|CBI27399.3| unnamed protein product [Vitis vinifera] Length = 435 Score = 173 bits (438), Expect = 3e-40 Identities = 142/422 (33%), Positives = 197/422 (46%), Gaps = 12/422 (2%) Frame = +1 Query: 604 LNGFEIDAESFLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSETSTVP-----HT 768 L G E DA+ + R N T E S+ AN E ++S + V + Sbjct: 53 LKGHERDADPLDGEDRFWN----TSERDCSINVDDIANACGNEVRNSVATCVVSSEKLES 108 Query: 769 FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGS 948 FE D TDK+V + ELP VC +ES VKDIC+DEG+ KIL+E+ K+EH G Sbjct: 109 FEKDGDMCTDKSVTKHELP---VCCEESTYHAVKDICIDEGMLSPEKILVENGKEEHEGF 165 Query: 949 VVSQPSDEDRYGGTTNDP-DIEFFVPDGFKVSSPEHNRHDTASEFGAEKID-KXXXXXXX 1122 P D D+ T + D E +PDG K S+ D E E D + Sbjct: 166 CPFLPPDTDKNVDPTKETADKELPLPDGQKASAENDCGKDLMQE--EENYDARDKIISDT 223 Query: 1123 XXXXXXXXXACVSEELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSK 1302 + EL ++ E S+ + ++ Q + P E+VLE+ A+ +E+S Sbjct: 224 SEEKIVPEDIFLIPELSKANSMPESSEFNGMEIEHQCIQNPNGEAVLENPALVSEAEESD 283 Query: 1303 TDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDRFASS 1482 + + YNSK+E GTITFDF SS + S+D+ E++ P+ + +P Sbjct: 284 KNSFPNELSYNSKLESGTITFDFGSS----TTSMDSGREVS----PQNDGCEP------- 328 Query: 1483 PAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGL 1662 P + + L K E + S ++RG GESSFS +GP S L Sbjct: 329 --------------PLESQNLSKLEDGSE-----SLPFSGQIQRGLGESSFSAAGPSSAL 369 Query: 1663 ITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHG 1827 I+YSG I +SGN+ FAFP+LQ EWNSSPVRM KA RKHR WR G Sbjct: 
370 ISYSGQITHSGNISLRSDSSTTSTRSFAFPVLQTEWNSSPVRMAKAERRHLRKHRSWRRG 429 Query: 1828 LL 1833 +L Sbjct: 430 IL 431 >ref|XP_006363556.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Solanum tuberosum] gi|565395867|ref|XP_006363557.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Solanum tuberosum] gi|565395869|ref|XP_006363558.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Solanum tuberosum] gi|565395871|ref|XP_006363559.1| PREDICTED: dentin sialophosphoprotein-like isoform X4 [Solanum tuberosum] Length = 532 Score = 172 bits (437), Expect = 4e-40 Identities = 147/522 (28%), Positives = 228/522 (43%), Gaps = 47/522 (9%) Frame = +1 Query: 409 KETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDFWN-SAVFKSSLLDDSTRSNDNEPG 585 K++ L D L +NG G +SL ++ ++FWN + S +D +RSN +E Sbjct: 17 KDSKSLVLPTKDLLDSNGRDGTKDSLACE-KERNEFWNVQELDDSEFFEDISRSNKHEIR 75 Query: 586 GSPV--------DHLNGFEIDAESFLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKD 741 SP+ +L + + F DT D++ + D N +++ K+ Sbjct: 76 ASPLKDDPIEALSNLTSCKRNGNPFACDTADRDHPW----SIPKFEDPMIVNFFDDKEKE 131 Query: 742 SETSTVPHTFESDLKS-----FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEEN 906 + S+ T S+L +TDK V+E +LP+ +CY E+N +KDIC+DEGV + Sbjct: 132 TVVSSAQFTSLSELFGTNTHLYTDKGVLEFKLPELTICYNENNYNIMKDICMDEGVPLMD 191 Query: 907 KILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFF---------VPDGFKVSSPEHNR 1059 KI+ ES K S +S DE + T D E V + K+S H Sbjct: 192 KIVTESRKYHQPDSSISLAVDEHQPRNTREGVDSELVSSGESKDSSVENAVKISVDHHTT 251 Query: 1060 ------------------HDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSE-ELILQK 1182 D S++ + +SE E +Q Sbjct: 252 KEDEDTKSLGPNGINPFLEDNMSKYADKDSSLDVMKIFGSKDTTTAKATNISENESDIQN 311 Query: 1183 ALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTIT 1362 L+ S D ++ + Q +++P + S ++ + +GP SN NSK E G IT Sbjct: 312 --LKESNSDAEQSALQANQIPTFVAAFNSQNTVSAADGTNNNGPGSNFSNNSKSESGAIT 369 Query: 1363 FDFNSSKPNVSNSI---DASAELTTGKAPETEDEKPSDRFASSPAQLVNNEGKIKENPSD 1533 DFN ++ +S+S+ D + K +K + S A V+ + S Sbjct: 370 CDFNLTELALSSSVAKSDKHLPEQSHKLEAVSSQKDGSSDSFSAATQVHFANSVDSCNSS 429 Query: 1534 
RKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYSGNVXXXX 1713 + +N E NSG+ + + +GE+SF GP SGLI+YSG I +SGN+ Sbjct: 430 IHADPPNVANLEEKNSGSIPLGVHGHFANGEASF---GPASGLISYSGHITHSGNISLRS 486 Query: 1714 XXXXXXXXXFAFPILQNEWNSSPVRMEKA--RKHRGWRHGLL 1833 FAFP+LQ+EWNSSPVRM KA R ++GWR LL Sbjct: 487 DSSTTSARSFAFPVLQSEWNSSPVRMAKAERRHYKGWRQSLL 528 >ref|XP_004237062.1| PREDICTED: uncharacterized protein LOC101254294 [Solanum lycopersicum] Length = 532 Score = 171 bits (433), Expect = 1e-39 Identities = 152/529 (28%), Positives = 235/529 (44%), Gaps = 54/529 (10%) Frame = +1 Query: 409 KETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDFWNSAVFKSSL-LDDSTRSNDNEPG 585 K++ L D L +NG +SL +++++FWN S+ ++D +RSN E Sbjct: 16 KDSKSLVLPTKDLLDSNGRDSTKDSLACE-KEKNEFWNVQELDDSVFIEDISRSNKLENR 74 Query: 586 GSPV--------DHLNGFEIDAESFLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKD 741 SP+ HL + + F DT D++ + ++ N +++ K+ Sbjct: 75 ASPLKDDPDEAPSHLTSCKRNGNPFACDTADRDHPWSIPKFEDPII----VNFFDDKEKE 130 Query: 742 SETSTVPHT-----FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEEN 906 + S+ T F +D +TDK V+E ELP+ +CYKE++ +KDIC+DEGV + Sbjct: 131 TVVSSTQFTSLSELFGADTHLYTDKGVLEFELPESTICYKENDYNIMKDICMDEGVPLMD 190 Query: 907 KILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEH------NRHDT 1068 KI+ ES K + S +S +DE + T D E K SS E + H T Sbjct: 191 KIVTESRKYDQPDSSISLAADEHQPRITREGVDSELVSSGESKASSVESAVKISVDHHTT 250 Query: 1069 ASEFG--------------------AEKIDKXXXXXXXXXXXXXXXXACVSEELILQKAL 1188 + G AEK E Sbjct: 251 KEDEGNKSLVPNGINPFLEDNMSKDAEKDPYLDVMKIFGSKDTTMAKPTNISEKESDSQN 310 Query: 1189 LECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1368 + S D D+ + Q +++P S ++ + GP SN NSK + G IT D Sbjct: 311 FKESNSDADQSAQQANQMPTSVEAFNSQYTVSPADGTNNYGPGSNFSNNSKSKSGAITCD 370 Query: 1369 FNSSKPNVSNSIDASAELTTGKAPETE-----DEKPSDRF-ASSPAQLVN-----NEGKI 1515 FN ++ +S+S+ S + ++ + E + SD F A++ N N I Sbjct: 371 FNLTELALSSSVTKSDKHLPEQSHKLEAVSGQKDGSSDSFSAATQVHFANSVDSSNSSTI 430 Query: 1516 KENPSD-RKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYS 1692 +P + L K++S+ +G G+F+ +GE+SF GP 
SGLI+YSG IA+S Sbjct: 431 HADPPNVANLEEKNSSSIPLGVHGHFA--------NGEASF---GPASGLISYSGHIAHS 479 Query: 1693 GNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA--RKHRGWRHGLL 1833 GN+ FAFP+LQ+EWNSSPVRM KA R ++GWR LL Sbjct: 480 GNISLRSDSSTTSARSFAFPVLQSEWNSSPVRMAKAERRHYKGWRQSLL 528 >ref|XP_004239019.1| PREDICTED: uncharacterized protein LOC101258367 isoform 2 [Solanum lycopersicum] Length = 554 Score = 167 bits (422), Expect = 2e-38 Identities = 166/573 (28%), Positives = 240/573 (41%), Gaps = 88/573 (15%) Frame = +1 Query: 379 GIVCDSNAYGKETDRLSFSASDPLHANGHGGNSNSLVFGIQDEHDFW------------- 519 GI+ SN Y KE D L F +D + N H + L +D + FW Sbjct: 7 GILGHSNGY-KEADALGFPVNDFGNTNVHDNREDPLACDRKDGNKFWEVPELDDSIFFDN 65 Query: 520 NSAVFKSSLLDD-----STRSNDNEPGGSP----VDHLNGFEIDAES------------- 633 N + S++ D+ ST + DN GG+P + EI A S Sbjct: 66 NDEIKASNVRDNHNVDLSTINGDNR-GGNPFACDIPSSETNEIVAASVTDDQTGSLSNII 124 Query: 634 --------FLFDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSETSTVPHTFESDLKS 789 F DT+D+N E + +D G E DS ++ FE++ Sbjct: 125 HTKRGGNPFECDTKDRNQPWNIPE--YESLDFLDDKGNETIDSDSPFTSHSELFENNKHF 182 Query: 790 FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSD 969 ++DK V + EL + VCY+E+N VKDIC+DEGV +K+L ES KD+ + VS +D Sbjct: 183 YSDKGVTDHELSELTVCYRENNFNIVKDICMDEGVPAVDKVLTESWKDDQLSTSVSVDAD 242 Query: 970 EDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXX 1149 E+ T D+ + + SS E ++ A GAE I+ Sbjct: 243 EEHQSNTKKSVDMGSSIATVSQDSSCEDAKN-IAVTHGAE-IEPTGAPIPNDFNPSLENK 300 Query: 1150 A---CVSEELILQKALLECSKC------------------------------DEDKVSPQ 1230 A + + ++ SKC D D+ + Q Sbjct: 301 ANKDADKDSYLEDLLMIFGSKCTTNGKTTNASEKPSSPNTVVRVEESNIKTSDGDQSTLQ 360 Query: 1231 PDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDA 1410 PD+VP +++ A++ E + G NSK GT FDFN +KP + + + Sbjct: 361 PDQVPFDQTLKSQTAISAADESNNNKG-------NSKEGAGTNIFDFNLTKPESTTTTEG 413 Query: 1411 SAELTTG-----KAPETEDEKPSDRF-ASSPAQLVNNEGKIKENPSDRKLLRKDASNDEI 1572 E KA SD ASS N +N + L ++ +N + Sbjct: 414 GVENLPEDSHKPKAVSVHKNGNSDNISASSQVPFANTA----DNAHQQHLESQNMANGQ- 468 
Query: 1573 GNSGNFSVSSYLERGDGESSFSTS-GPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAF 1749 G+F+ DGE+SFS + GP+SG ITYSGPI+YSG++ FAF Sbjct: 469 ---GHFA--------DGEASFSAARGPISGSITYSGPISYSGSLSLRSESSTTSTRSFAF 517 Query: 1750 PILQNEWNSSPVRMEKAR-----KHRGWRHGLL 1833 P+LQNEWNSSPVRM KA K +GW+ GLL Sbjct: 518 PVLQNEWNSSPVRMAKAERRRLSKQKGWKQGLL 550 >gb|EOY01582.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] gi|508709686|gb|EOY01583.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] gi|508709687|gb|EOY01584.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 2 [Theobroma cacao] Length = 470 Score = 149 bits (377), Expect = 4e-33 Identities = 134/438 (30%), Positives = 189/438 (43%), Gaps = 57/438 (13%) Frame = +1 Query: 691 SVMDGQTANGIEEESKDSETSTVPHTFESDLKS----FTDKNVVECELPDFVVCYKESNT 858 S+ ANG E+E +D TS P D + DK+V+ECELP+ VVCYKES Sbjct: 32 SISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTY 91 Query: 859 PHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKV 1038 VKDIC+DEGV ++K L E+ DE E T + + + D Sbjct: 92 HVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQDSQLMTEKLETDMCMQDVSMS 151 Query: 1039 SSPEHNRHDTASEFGAEK-------IDKXXXXXXXXXXXXXXXXACVSEELILQKALL-E 1194 + D +E G+ K + C S++L+L + + + Sbjct: 152 PGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKDLMLTRVVKGD 211 Query: 1195 CSKCDEDKVSPQPDEVPCLESVLESLAV--AFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1368 K D VS + + L S+ E V S K+DG + +S + + Sbjct: 212 AMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSSSKKEVMVMPP 271 Query: 1369 FNSSKPNVSNSIDASAELTTG-----KAPETEDEKPSDRFASSPAQLVNNE-----GKIK 1518 S+ V S D++ E A E D + SPAQ+ +E + Sbjct: 272 LVSA---VEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEESTSSSLVN 328 Query: 1519 ENPSDRKL---------------LRKD-------------ASNDEIGNSGNFSVSSYLER 1614 E D KL KD S ++ + + S+S+ L++ Sbjct: 329 EVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQ 388 Query: 1615 GDGESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRME 1794 G GESSFS 
+G V+GLI+YSGP+AYSG++ FAFPILQ+EWN SPVRM Sbjct: 389 GIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMA 448 Query: 1795 KA-----RKHRGWRHGLL 1833 KA RKH+GWRHGLL Sbjct: 449 KADRRHYRKHKGWRHGLL 466 >gb|EOY01581.1| 18S pre-ribosomal assembly protein gar2-related, putative isoform 1 [Theobroma cacao] Length = 527 Score = 149 bits (377), Expect = 4e-33 Identities = 134/438 (30%), Positives = 189/438 (43%), Gaps = 57/438 (13%) Frame = +1 Query: 691 SVMDGQTANGIEEESKDSETSTVPHTFESDLKS----FTDKNVVECELPDFVVCYKESNT 858 S+ ANG E+E +D TS P D + DK+V+ECELP+ VVCYKES Sbjct: 89 SISVNDFANGNEKEVRDFVTSNSPSLKNMDSFQNSVFYLDKSVMECELPELVVCYKESTY 148 Query: 859 PHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKV 1038 VKDIC+DEGV ++K L E+ DE E T + + + D Sbjct: 149 HVVKDICIDEGVPTQDKFLFETGMDEKIDCNFLPSEKEQDSQLMTEKLETDMCMQDVSMS 208 Query: 1039 SSPEHNRHDTASEFGAEK-------IDKXXXXXXXXXXXXXXXXACVSEELILQKALL-E 1194 + D +E G+ K + C S++L+L + + + Sbjct: 209 PGENQSGKDIDNECGSNKKVDTDTCMQDVSLSLEKNESNKGIPNQCDSKDLMLTRVVKGD 268 Query: 1195 CSKCDEDKVSPQPDEVPCLESVLESLAV--AFTSEQSKTDGPVSNTCYNSKVEGGTITFD 1368 K D VS + + L S+ E V S K+DG + +S + + Sbjct: 269 AMKMVTDDVSKELFTLGELLSMSELSKVNSEAMSSDCKSDGIEQQSFQSSSKKEVMVMPP 328 Query: 1369 FNSSKPNVSNSIDASAELTTG-----KAPETEDEKPSDRFASSPAQLVNNE-----GKIK 1518 S+ V S D++ E A E D + SPAQ+ +E + Sbjct: 329 LVSA---VEESKDSNEEAIVSVPALVSATEELDSGKGEAILISPAQVSTSEESTSSSLVN 385 Query: 1519 ENPSDRKL---------------LRKD-------------ASNDEIGNSGNFSVSSYLER 1614 E D KL KD S ++ + + S+S+ L++ Sbjct: 386 EVSYDNKLETGSITFNLDSSAPTSSKDECHHNLDSEPLGTGSTPKLEVAADQSISNNLQQ 445 Query: 1615 GDGESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRME 1794 G GESSFS +G V+GLI+YSGP+AYSG++ FAFPILQ+EWN SPVRM Sbjct: 446 GIGESSFSAAGLVTGLISYSGPVAYSGSLSLRSDSSTTSTRSFAFPILQSEWNCSPVRMA 505 Query: 1795 KA-----RKHRGWRHGLL 1833 KA RKH+GWRHGLL Sbjct: 506 KADRRHYRKHKGWRHGLL 523 >ref|XP_002514588.1| conserved hypothetical protein [Ricinus communis] gi|223546192|gb|EEF47694.1| conserved hypothetical protein [Ricinus communis] 
Length = 488 Score = 142 bits (359), Expect = 4e-31 Identities = 123/413 (29%), Positives = 178/413 (43%), Gaps = 32/413 (7%) Frame = +1 Query: 691 SVMDGQTANGIEEESKDSE----TSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNT 858 S +D T + K+ E TS +F+ D + DKNV+E ELP+ V+CYKE+ Sbjct: 76 SKLDSCTGVNVSIHDKEEEVRNFTSLKIESFDKDSVFYIDKNVMEPELPELVLCYKENTY 135 Query: 859 PHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVP----- 1023 VKDICVDEGV + L ++S D+ P + + D++ Sbjct: 136 HVVKDICVDEGVPSQENFLFDTSVDQEKLCPYLIPEKDIKSEIQKERVDLDMSTQYLSKN 195 Query: 1024 -DGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELILQKALLECS 1200 + FK S E + E+I V+E L K+LL + Sbjct: 196 DNSFKCDSKESMAIAEIEDDAMEEIANYTSKETFSLGELLLMPEVVAE-LSHSKSLLNST 254 Query: 1201 KCDEDKVSPQPDEVPCLESVLESLAVAFTSEQ----SKTDGPVSNTCYNSKVEGGTITFD 1368 E +P E L + + +EQ + P+ + + + GT+T D Sbjct: 255 DEAEQLSIQRPSENIVLATASACEESKYATEQFLLVTPAVDPLVEESGHEEAKLGTLTSD 314 Query: 1369 FNSSKPNVSNSIDASAELT-------------TGKAPETEDEKPSDRFASSPAQLVNNEG 1509 + + + A L K+P + SD +S+P EG Sbjct: 315 SSPKASDHGHDEVILASLAPSYATEEPENGAKAAKSPSHTLDSVSDLNSSAPTASGGEEG 374 Query: 1510 KIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAY 1689 + S+ R + +++ + FS L+ GESSFS +GP+SGLI+YSGPIAY Sbjct: 375 S-QVGGSEHLESRNSSRHEDTSITEPFS--GQLQYSHGESSFSAAGPLSGLISYSGPIAY 431 Query: 1690 SGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1833 SG++ FAFPILQ+EWNSSPVRM KA RKHR WR GLL Sbjct: 432 SGSLSLRSDSSTTSTRSFAFPILQSEWNSSPVRMAKADRRHFRKHRSWRQGLL 484 >ref|XP_004144157.1| PREDICTED: uncharacterized protein LOC101217989 [Cucumis sativus] gi|449523672|ref|XP_004168847.1| PREDICTED: uncharacterized protein LOC101224727 [Cucumis sativus] Length = 431 Score = 140 bits (352), Expect = 3e-30 Identities = 121/418 (28%), Positives = 185/418 (44%), Gaps = 24/418 (5%) Frame = +1 Query: 652 DKNGAQITE--EAAHSVMDGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELP 825 D N IT+ ++ V D A GI S +S + +F S+ DK+V+EC++ Sbjct: 64 DGNSCMITKINRSSTDVFDDNNAEGI---SAFGASSNMKPSF-----SYVDKSVMECQMS 115 Query: 826 
DFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTND-- 999 +VC +E N VKDIC+D+GV +S+ ++ + P +EDR G+ + Sbjct: 116 KTIVCDQEVNVNDVKDICIDDGVASLENFFFKSTAEKSISKI--SPLEEDRNEGSIKEKE 173 Query: 1000 --PDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELI 1173 ++ F+ D KVS +H D + A+ + + +SE + Sbjct: 174 TSSEVSKFIADDRKVSLEDHFAMDWTTHNDAKDLTQIEEEKLN-----------LSEPEL 222 Query: 1174 LQKALLECSKCDE--DKVSPQ-----------PDEVPCLESVLESLAVAFTSEQSKTDGP 1314 L + L++ S E DK+ Q ++S ++ A+ +E K + P Sbjct: 223 LMQKLVKRSYSSESLDKIGLQISGEKTNLEDPSSASKSVDSCNDTPALDSAAEPPKDNIP 282 Query: 1315 VSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDRFASSPAQL 1494 + YN + E G+I FNS P + G E SD + Q+ Sbjct: 283 AHPSGYNDEFENGSIALTFNSISP-----------VANGGEERQECCGRSDSVIGT--QV 329 Query: 1495 VNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYS 1674 + N ++ SD +LL +D GESSFS P++ L+TYS Sbjct: 330 LTN---LEYRTSDSRLLSSQNMHD-----------------IGESSFSAVDPLASLVTYS 369 Query: 1675 GPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1833 GP+AYSG++ FAFPILQ+EWNSSPV+M KA RK+RGWR GLL Sbjct: 370 GPVAYSGSISLRSESSTTSTRSFAFPILQSEWNSSPVKMVKAERRHYRKYRGWREGLL 427 >gb|EMJ23318.1| hypothetical protein PRUPE_ppa004630mg [Prunus persica] Length = 499 Score = 134 bits (337), Expect = 2e-28 Identities = 134/474 (28%), Positives = 187/474 (39%), Gaps = 104/474 (21%) Frame = +1 Query: 724 EEESKD-----SETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDE 888 E+E KD + +S E + + DK+V+ECELP+ +VCYKES+ +KDIC+DE Sbjct: 29 EDEVKDFVPPYTLSSEKLEALEKESDYYMDKSVMECELPELIVCYKESSCNTIKDICIDE 88 Query: 889 GVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDT 1068 GV ++K E+ DE P ++ DI +PDGFK S+ HD Sbjct: 89 GVPSQDKNRFETGVDEKECCTFLSPDEDQNKQLLEEQMDIVVTLPDGFKSSA-----HDD 143 Query: 1069 ASEFGAEKIDKXXXXXXXXXXXXXXXXA--CVSEELILQKALL----------------- 1191 + D VS+E+ +L Sbjct: 144 LEKGFVIPCDSKGLTQIGDAIYYTQEKTEIEVSKEIFFPANVLPMQELGAGNAHSSKSSN 203 Query: 1192 -ECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGP------------VSNTCY 1332 E ++ +D V ++V + + V+ T E S ++ V 
Sbjct: 204 EESTEAVQDTVQSSGEKVSEIAQTGSTAVVSVTEESSHSEKKALVSAAEESNFHVDELSN 263 Query: 1333 NSKVEGGTITFDFNSSKPNVSNSIDA---------------------------------- 1410 NSKVE G+ T + + +VS + DA Sbjct: 264 NSKVENGSTTSGLSDTSVHVSTTRDACPDNDVHKHFETQTMPAGDDGDDNDDNMPDAEIV 323 Query: 1411 -------SAELTTGK--APE--------------TEDEKPSDRFASSPAQLVNNEGKI-- 1515 SA + TG+ PE +DE P SS Q + I Sbjct: 324 PSQVQPCSAPVVTGREECPENGVCQPLDTSSTSKVDDEIPHSVIVSSQVQHYSAPVTISR 383 Query: 1516 KENPSDRKLLRKDASND-EIG--NSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIA 1686 +E P + + SN +G NS S +++RG GESSFS +G S L+ SGP Sbjct: 384 EERPENGVWQCPETSNAFMVGDVNSDTQYASFHVQRGFGESSFSAAGHFSSLMNTSGP-- 441 Query: 1687 YSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1833 YSGNV FAFP+LQ+EWNSSPVRM KA RKHRGW H LL Sbjct: 442 YSGNVSLRSESSTTSTRSFAFPVLQSEWNSSPVRMAKADRRHLRKHRGWGHSLL 495 >ref|XP_006484258.1| PREDICTED: dentin sialophosphoprotein-like isoform X3 [Citrus sinensis] Length = 483 Score = 127 bits (318), Expect = 3e-26 Identities = 132/436 (30%), Positives = 170/436 (38%), Gaps = 88/436 (20%) Frame = +1 Query: 790 FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVH-----------------------E 900 + DK+V ECELP+ +VCYKE NT HVKDIC+DEGVH + Sbjct: 85 YMDKSVTECELPELIVCYKE-NTYHVKDICIDEGVHSHDRILFESDVGKSVRSFLPPKED 143 Query: 901 ENKILIESSK------------------DEH----AGSVVSQPSDED----------RYG 984 N L+E SK DEH GS SDED R Sbjct: 144 RNSELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPA 203 Query: 985 GTTNDPDIE----------FFVPDGFKV----SSPEHNRHDTASEFGAEKIDKXXXXXXX 1122 G D E F + D + + ++ +E AEK Sbjct: 204 GDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQGSSAKA 263 Query: 1123 XXXXXXXXXACVSEELILQKALLECSK-----CDEDKVSPQPDEVPCLESVLE-----SL 1272 +EE++ + S+ C E +S P V E + SL Sbjct: 264 ALANPEEANGGTAEEILTGADFVSASEESQNGCGEG-ISGNPTLVSASEKAHDKSEEASL 322 Query: 1273 A----VAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAP 1440 A V+ SE +K + YNS VE G+ITFDF++S P S + L G + Sbjct: 323 ASPDGVSALSESTKIS-TAEKSSYNSMVETGSITFDFDASAPGASGKEEP---LQIGDSQ 378 Query: 1441 
ETEDEKPSDRFASSPAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGD 1620 E S R +P Q SVSS G Sbjct: 379 RIETPGMS-RLEDAPRQ---------------------------------SVSSQFHSGL 404 Query: 1621 GESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA 1800 GESSFS +G + LI+YSGP+AYSG++ FAFPILQ EW+ SPVRM KA Sbjct: 405 GESSFSAAGSLPSLISYSGPVAYSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKA 464 Query: 1801 -----RKHRGWRHGLL 1833 RKH+ W+ GLL Sbjct: 465 DRRHYRKHK-WKQGLL 479 >ref|XP_006484256.1| PREDICTED: dentin sialophosphoprotein-like isoform X1 [Citrus sinensis] gi|568861537|ref|XP_006484257.1| PREDICTED: dentin sialophosphoprotein-like isoform X2 [Citrus sinensis] Length = 496 Score = 127 bits (318), Expect = 3e-26 Identities = 132/436 (30%), Positives = 170/436 (38%), Gaps = 88/436 (20%) Frame = +1 Query: 790 FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVH-----------------------E 900 + DK+V ECELP+ +VCYKE NT HVKDIC+DEGVH + Sbjct: 98 YMDKSVTECELPELIVCYKE-NTYHVKDICIDEGVHSHDRILFESDVGKSVRSFLPPKED 156 Query: 901 ENKILIESSK------------------DEH----AGSVVSQPSDED----------RYG 984 N L+E SK DEH GS SDED R Sbjct: 157 RNSELLEESKNSVIPIPDVLKSSAENYSDEHIVNRCGSSQESDSDEDIDDICDSKDLRPA 216 Query: 985 GTTNDPDIE----------FFVPDGFKV----SSPEHNRHDTASEFGAEKIDKXXXXXXX 1122 G D E F + D + + ++ +E AEK Sbjct: 217 GDVKDDATEENTNDVSRKLFLLGDLLSMHNVGTKNSLSKSAIGNEIDAEKESFQGSSAKA 276 Query: 1123 XXXXXXXXXACVSEELILQKALLECSK-----CDEDKVSPQPDEVPCLESVLE-----SL 1272 +EE++ + S+ C E +S P V E + SL Sbjct: 277 ALANPEEANGGTAEEILTGADFVSASEESQNGCGEG-ISGNPTLVSASEKAHDKSEEASL 335 Query: 1273 A----VAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAP 1440 A V+ SE +K + YNS VE G+ITFDF++S P S + L G + Sbjct: 336 ASPDGVSALSESTKIS-TAEKSSYNSMVETGSITFDFDASAPGASGKEEP---LQIGDSQ 391 Query: 1441 ETEDEKPSDRFASSPAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGD 1620 E S R +P Q SVSS G Sbjct: 392 RIETPGMS-RLEDAPRQ---------------------------------SVSSQFHSGL 417 Query: 1621 GESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA 1800 GESSFS +G + LI+YSGP+AYSG++ FAFPILQ EW+ SPVRM KA 
Sbjct: 418 GESSFSAAGSLPSLISYSGPVAYSGSISLRSDSSTTSTRSFAFPILQTEWDRSPVRMAKA 477 Query: 1801 -----RKHRGWRHGLL 1833 RKH+ W+ GLL Sbjct: 478 DRRHYRKHK-WKQGLL 492 >ref|XP_002875229.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata] gi|297321067|gb|EFH51488.1| hypothetical protein ARALYDRAFT_484289 [Arabidopsis lyrata subsp. lyrata] Length = 435 Score = 125 bits (314), Expect = 7e-26 Identities = 114/393 (29%), Positives = 168/393 (42%), Gaps = 15/393 (3%) Frame = +1 Query: 700 DGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDIC 879 D + + + S D + + V + D + DKNV C+LP+ VVCYKE+ VKDIC Sbjct: 61 DNEAGKKVRDISHDCDAN-VDSPDKKDPVFYMDKNVTACDLPEIVVCYKENTYHVVKDIC 119 Query: 880 VDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNR 1059 VDEGV + K L KD SV S +++ TN P K + + + Sbjct: 120 VDEGVPVQEKFLF-GEKD----SVKSSSTEDLTKADKTN------VNPSESKSAEDSNTK 168 Query: 1060 HDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELILQKALLECSKCDEDKVSPQ--- 1230 D + K D+ + ++E ++ +E K SP Sbjct: 169 VDDSEFCNNCKTDRDVEESSREDFADAEGSSAYNQEHLIVT--------EEAKASPSHGL 220 Query: 1231 -PDEVPCLESVLESLAVAFT--SEQSKTDGPV-SNTCYNSKVEGGTITFDFNSSKPNVSN 1398 P E+ E+ + +A++ S++S T G + S + G I+ D + + Sbjct: 221 NPSEIEPDENSNDEVAISSETDSKESLTLGDILSREDEQKSLNHGNISSDSHEEQSPSQL 280 Query: 1399 SIDASAELTTG----KAPETEDEKP-SDRFASSPAQLVNNEGKIKENPSDRKLLRKDASN 1563 L T + +TE+ KP ++ S+ + K +P + N Sbjct: 281 QDKEKRSLETAAIETELEKTEEPKPVEEKLPSASTTTLQEPNKTCNDPEKPETENHHQQN 340 Query: 1564 DEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYSGNVXXXXXXXXXXXXXF 1743 + NS S G+ S + S +SG ITYSGPIAYSG++ F Sbjct: 341 SLVENSYEDDKLSSSRFGETSFSAAESVSISGHITYSGPIAYSGSLSVRSDASTTSGRSF 400 Query: 1744 AFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1833 AFPILQ+EWNSSPVRM KA K R GWRH LL Sbjct: 401 AFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 433 >ref|XP_002311412.1| 18S pre-ribosomal assembly protein gar2 [Populus trichocarpa] gi|222851232|gb|EEE88779.1| 18S pre-ribosomal assembly protein gar2 [Populus trichocarpa] Length = 486 Score = 124 bits (312), Expect = 1e-25 Identities = 
122/421 (28%), Positives = 183/421 (43%), Gaps = 73/421 (17%) Frame = +1 Query: 790 FTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIES------------SKD 933 + DK+V+ E+P+ +VCYKE NT HVKDICVDEGV ++K L ++ S+ Sbjct: 74 YMDKSVMVREVPELIVCYKE-NTYHVKDICVDEGVPLQDKFLFDTDAHKKNMCEFLPSER 132 Query: 934 EHAGSVVSQPSDED-------RYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTA------- 1071 + +V + SD D + + D+ VPD S + ++HD + Sbjct: 133 DMNNEMVKEKSDLDMLIPEMLKSSSEKQNVDLHLPVPDVLISSEEKGSKHDLSLDCDPKH 192 Query: 1072 -------SEFGAEKI----DKXXXXXXXXXXXXXXXXAC------------VSEELIL-- 1176 ++G +K+ K C V ++ +L Sbjct: 193 LMPTEEVMDYGTKKVTDNASKEILSLRDLLSMSELGAKCTPANASYHNMDKVEQQSLLCP 252 Query: 1177 -QKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAF-TSEQSKTDGPVSNTCYNSKVEG 1350 + A+LE E+ S E ++ LES +A T + + +G +T + + Sbjct: 253 RENAILETDSASEE--SEHCGEETISDNGLESATLAIPTQDPAYQEGDHGHT--EAVLVS 308 Query: 1351 GTITFDFNSSKPN----VSNSIDASAELTTGKAPETEDEKPS-----------DRFASSP 1485 T+T S S+++D+ +E G EDE P D +S+P Sbjct: 309 PTLTSAAEESDSKETKLASHALDSFSE---GSTSRIEDELPYNSKTETRSISFDNDSSAP 365 Query: 1486 AQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLI 1665 A +N ++L + S E N+ S L+ DGESSFS+SGP+ GL Sbjct: 366 AASARES---PQNGESQRLGTRIVSRFEDPNAERLS-GGQLQYADGESSFSSSGPLFGLT 421 Query: 1666 TYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGL 1830 ++SGPIAYSG+V FAFPILQ+EWNSSP RM KA +K R W GL Sbjct: 422 SHSGPIAYSGSVSLRSDSSTTSTRSFAFPILQSEWNSSPARMAKADRRHFQKPRKWMQGL 481 Query: 1831 L 1833 L Sbjct: 482 L 482 >ref|XP_006291111.1| hypothetical protein CARUB_v10017222mg [Capsella rubella] gi|482559818|gb|EOA24009.1| hypothetical protein CARUB_v10017222mg [Capsella rubella] Length = 455 Score = 122 bits (306), Expect = 6e-25 Identities = 118/419 (28%), Positives = 178/419 (42%), Gaps = 21/419 (5%) Frame = +1 Query: 640 FDTRDKNGAQITEEAAHSVMDGQTANGIEEESKDSETSTVPHTFESDLKS------FTDK 801 +DTR +G + +E ++++ + +E K + ++ + D + DK Sbjct: 52 YDTR--SGDEWDKENDGNILEPHSCGDADEAGKKTRDTSHDFVAKGDSPEKVNPVFYMDK 109 Query: 802 
NVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILI-------ESSKDEHAGSVVSQ 960 NV C+LP+ VVCYKE++ VKDICVDEGV + K L ++ H GSV Sbjct: 110 NVTACDLPEIVVCYKENSYHVVKDICVDEGVPVQEKFLFGEKDSVKSTTNSNHCGSVDLM 169 Query: 961 PSDEDRYGGTTNDPDIEFFVPDGFKVSSPEHNRHDTASEFGAEKIDKXXXXXXXXXXXXX 1140 D+ D++ P K +++ D +SE +K + Sbjct: 170 KVDK---------TDVK---PSETKSLEDSNSKVDDSSEVCNDKTVQDVEESSREAFADA 217 Query: 1141 XXXACVSEELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVS 1320 + +E ++ + K E + + +E+ E V+ S F SE +S Sbjct: 218 EGSSNYDQEHLIVTSPTLALKPSEISLEVESEEISKDEVVISS--EDFLSESLTLGDILS 275 Query: 1321 NTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTG---KAPETEDEKPSDRFASSPAQ 1491 ++ S P S E TTG K + E+ K ++ SS + Sbjct: 276 REDKQKSLKNDNGNRPEELSPPQHQEKEKRSLE-TTGLDTKLEKVEEPKTAEENLSSAST 334 Query: 1492 LVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFST--SGPVSGLI 1665 E N ++ N + + + +SS GE+SFS S +SG I Sbjct: 335 TTVQEPNKSCNDLEKPETENHQQNRLVNSYEDDKLSS---SRFGETSFSAAESVSISGHI 391 Query: 1666 TYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1833 TYSGPIAYSG++ FAFPILQ+EWNSSPVRM KA K R GWRH LL Sbjct: 392 TYSGPIAYSGSLSVRSDASTTSGRSFAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 450 >gb|ESW25465.1| hypothetical protein PHAVU_003G038300g [Phaseolus vulgaris] Length = 430 Score = 119 bits (298), Expect = 5e-24 Identities = 122/424 (28%), Positives = 182/424 (42%), Gaps = 19/424 (4%) Frame = +1 Query: 619 IDAESFLFDTRDKNGAQITEEAAHSVMDGQT----ANGIEEESKDSETSTVP------HT 768 +D E+ ++T+ ++ + E +HS D ++ NGIE K S TS + + Sbjct: 70 VDCETNEYETKVRD---LVEPLSHSSKDIESFMKFPNGIESV-KRSPTSPISSPREGVES 125 Query: 769 FESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDICVDEGVHEENKILIESSKDEHAGS 948 + + F K V ECE P VCY ESN VKDIC+DEGV +++ I++ + DE A Sbjct: 126 LRNSVDVFMVKTVTECE-PHPEVCYNESNYHVVKDICIDEGVLKKDNIMVVNPVDEKAHD 184 Query: 949 VVSQPSDE--DRYGGTTNDPDIEFFVPDGFKVSSPEHNRH-DTASEFGAEKIDKXXXXXX 1119 S E ++ T+ + +G HN+H D + ++K Sbjct: 185 FFPFESYETKEKQKDNTSINVLSLTPTEGSDKVFANHNQHKDLMLTEVSGDVNKQTPSP- 243 Query: 1120 XXXXXXXXXXACVSEELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQS 1299 
            ++++LQ L E S +DK Q P L S+ E ++A ++S
Sbjct: 244  -------------GDKVLLQDLLTEDSASSDDK-GEQISIEPGLHSISEDPSMAAGEDES 289

Query: 1300 KTDGPVSNTCYNSKVEGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDRFAS 1479
            K D SK + +D SA GK E+ + S S
Sbjct: 290  KND-----------------------SKAPENAKVDPSAPADCGK----EECRQS---GS 319

Query: 1480 SPAQLVNNEGKIKENPSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSG 1659
            + + + E SD + + +S++ GESSFS GPVSG
Sbjct: 320  CKCDEIQHTSRPMEWKSDDQ-----------------AATSHIRHSLGESSFSAMGPVSG 362

Query: 1660 LITYSGPIAYSGNVXXXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA--RKHRG----WR 1821
            I+YSGP+ +SG++ FAFPI+Q+EWNSSPVRM KA R HR WR
Sbjct: 363  RISYSGPVPFSGSISLRSDSSTTSTRSFAFPIIQSEWNSSPVRMAKADRRHHRKQRCCWR 422

Query: 1822 HGLL 1833
            G L
Sbjct: 423  GGFL 426

>gb|AAX55105.1| hypothetical protein At2g03810 [Arabidopsis thaliana]
          Length = 439

 Score = 119 bits (297), Expect = 7e-24
 Identities = 120/394 (30%), Positives = 170/394 (43%), Gaps = 16/394 (4%)
 Frame = +1

Query: 700  DGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDIC 879
            + + + + S D + + V + D + DKNV C+LP+ VVCYKE+ VKDIC
Sbjct: 61   ENEAGKKVRDTSHDCDAN-VDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDIC 119

Query: 880  VDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTN-DPDIEFFVPDGF-KVSSPEH 1053
            VDEGV + K L KD SV S +++ TN +P D KV E
Sbjct: 120  VDEGVPVQEKFLF-GEKD----SVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEF 174

Query: 1054 -NRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELI------LQKALLECSKCDE 1212
            N H T + E + V+EE+ L + +E + +
Sbjct: 175  CNDHKTDRDV-EESSGEDFADAEGTSSNYNQEHLIVTEEVKASPTHGLSPSEIEPDENSK 233

Query: 1213 DKV--SPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKP 1386
            D+V S D CL +L + E + N +S E +
Sbjct: 234  DEVAISQDNDSKECL-----TLGDILSREDEQKSLNQDNISSDSHEEQSPSQLQDKEKRS 288

Query: 1387 NVSNSIDASAELTTGKAPETEDEKPSDRFASSPAQLVNNEGKIKENPSDRKLLRKDASND 1566
            + +I+ E T + P+ +EK S +++ +Q N E P +++ +
Sbjct: 289  LETTAIETELEKT--EEPKQGEEKLSS-VSTTTSQEPNKTCNEPEKPETENHHQQNCLVE 345

Query: 1567 DEIGNSGNFSVSSYLERGDGESSFSTSGPVS--GLITYSGPIAYSGNVXXXXXXXXXXXXX 1740
            FS S + GE+SFS + VS G ITYSGPIAYSG++
Sbjct: 346  NSYEDDKFSSSRF-----GETSFSAADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRS 400

Query: 1741 FAFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1833
            FAFPILQ+EWNSSPVRM KA K R GWRH LL
Sbjct: 401  FAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 434

>ref|NP_178475.2| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana]
 gi|42570677|ref|NP_973412.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana]
 gi|79316683|ref|NP_001030966.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana]
 gi|186499149|ref|NP_001118260.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana]
 gi|330250656|gb|AEC05750.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana]
 gi|330250657|gb|AEC05751.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana]
 gi|330250658|gb|AEC05752.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana]
 gi|330250659|gb|AEC05753.1| 18S pre-ribosomal assembly protein gar2-related protein [Arabidopsis thaliana]
          Length = 439

 Score = 119 bits (297), Expect = 7e-24
 Identities = 120/394 (30%), Positives = 170/394 (43%), Gaps = 16/394 (4%)
 Frame = +1

Query: 700  DGQTANGIEEESKDSETSTVPHTFESDLKSFTDKNVVECELPDFVVCYKESNTPHVKDIC 879
            + + + + S D + + V + D + DKNV C+LP+ VVCYKE+ VKDIC
Sbjct: 61   ENEAGKKVRDTSHDCDAN-VDSPEKKDPVFYMDKNVTACDLPEIVVCYKENTYHIVKDIC 119

Query: 880  VDEGVHEENKILIESSKDEHAGSVVSQPSDEDRYGGTTN-DPDIEFFVPDGF-KVSSPEH 1053
            VDEGV + K L KD SV S +++ TN +P D KV E
Sbjct: 120  VDEGVPVQEKFLF-GEKD----SVKSSSTEDLMKADKTNVNPSETKSAEDSISKVDDSEF 174

Query: 1054 -NRHDTASEFGAEKIDKXXXXXXXXXXXXXXXXACVSEELI------LQKALLECSKCDE 1212
            N H T + E + V+EE+ L + +E + +
Sbjct: 175  CNDHKTDRDV-EESSGEDFADAEGTSSNYNQEHLIVTEEVKASPTHGLSPSEIEPDENSK 233

Query: 1213 DKV--SPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKVEGGTITFDFNSSKP 1386
            D+V S D CL +L + E + N +S E +
Sbjct: 234  DEVAISQDNDSKECL-----TLGDILSREDEQKSLNQDNISSDSHEEQSPSQLQDKEKRS 288

Query: 1387 NVSNSIDASAELTTGKAPETEDEKPSDRFASSPAQLVNNEGKIKENPSDRKLLRKDASND 1566
            + +I+ E T + P+ +EK S +++ +Q N E P +++ +
Sbjct: 289  LETTAIETELEKT--EEPKQGEEKLSS-VSTTTSQEPNKTCNEPEKPETENHHQQNCLVE 345

Query: 1567 DEIGNSGNFSVSSYLERGDGESSFSTSGPVS--GLITYSGPIAYSGNVXXXXXXXXXXXXX 1740
            FS S + GE+SFS + VS G ITYSGPIAYSG++
Sbjct: 346  NSYEDDKFSSSRF-----GETSFSAADSVSISGHITYSGPIAYSGSLSVRSDASTTSGRS 400

Query: 1741 FAFPILQNEWNSSPVRMEKARKHR---GWRHGLL 1833
            FAFPILQ+EWNSSPVRM KA K R GWRH LL
Sbjct: 401  FAFPILQSEWNSSPVRMAKADKRRQKGGWRHTLL 434

>ref|XP_002266889.2| PREDICTED: uncharacterized protein LOC100247891 [Vitis vinifera]
          Length = 229

 Score = 118 bits (295), Expect = 1e-23
 Identities = 82/228 (35%), Positives = 116/228 (50%), Gaps = 5/228 (2%)
 Frame = +1

Query: 1165 ELILQKALLECSKCDEDKVSPQPDEVPCLESVLESLAVAFTSEQSKTDGPVSNTCYNSKV 1344
            EL ++ E S+ + ++ Q + P E+VLE+ A+ +E+S + + YNSK+
Sbjct: 32   ELSKANSMPESSEFNGMEIEHQCIQNPNGEAVLENPALVSEAEESDKNSFPNELSYNSKL 91

Query: 1345 EGGTITFDFNSSKPNVSNSIDASAELTTGKAPETEDEKPSDRFASSPAQLVNNEGKIKEN 1524
            E GTITFDF SS + S+D+ E++ P+ + +P
Sbjct: 92   ESGTITFDFGSS----TTSMDSGREVS----PQNDGCEP--------------------- 122

Query: 1525 PSDRKLLRKDASNDEIGNSGNFSVSSYLERGDGESSFSTSGPVSGLITYSGPIAYSGNVX 1704
            P + + L K E + S ++RG GESSFS +GP S LI+YSG I +SGN+
Sbjct: 123  PLESQNLSKLEDGSE-----SLPFSGQIQRGLGESSFSAAGPSSALISYSGQITHSGNIS 177

Query: 1705 XXXXXXXXXXXXFAFPILQNEWNSSPVRMEKA-----RKHRGWRHGLL 1833
            FAFP+LQ EWNSSPVRM KA RKHR WR G+L
Sbjct: 178  LRSDSSTTSTRSFAFPVLQTEWNSSPVRMAKAERRHLRKHRSWRRGIL 225