BLASTX nr result
ID: Jatropha_contig00043220
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00043220 (839 letters)

Database: NCBI-nr (updated 2014/02/11)
35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done

                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...  301  8e-86
gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus pe...  259  2e-72
emb|CBI17195.3| unnamed protein product [Vitis vinifera]  263  3e-72
gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, ...  263  7e-72
ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247...  261  2e-70
ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...  259  5e-70
ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300...  251  1e-69
ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253...  264  3e-68
gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus...  245  3e-66
ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507...  241  9e-66
ref|XP_003533102.1| PREDICTED: uncharacterized protein LOC100780...  237  2e-64
gb|EEF06007.2| hydroxyproline-rich glycoprotein [Populus trichoc...  246  5e-63
ref|XP_002321880.1| predicted protein [Populus trichocarpa]  246  5e-63
ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215...  231  7e-63
ref|XP_003629502.1| hypothetical protein MTR_8g078230 [Medicago ...  228  4e-61
gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot...  226  2e-59
ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein...  226  2e-59
gb|AAM65660.1| Contains similarity to RNA-binding protein from A...
226 2e-59 ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp.... 226 3e-59 gb|ESQ30058.1| hypothetical protein EUTSA_v10011382mg [Eutrema s... 223 5e-59 >ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis] gi|223537066|gb|EEF38701.1| conserved hypothetical protein [Ricinus communis] Length = 436 Score = 301 bits (771), Expect(2) = 8e-86 Identities = 146/166 (87%), Positives = 153/166 (92%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 EG+EEMS RVLPSPM DAYL+ALHTNYMIEFEPEYLMGEFDQNPDIDEKPP+PLRDVLEK Sbjct: 271 EGYEEMSGRVLPSPMEDAYLDALHTNYMIEFEPEYLMGEFDQNPDIDEKPPMPLRDVLEK 330 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 VKPFIMAY+GIQSQ ETMKNVPL KEIVDYYSGPDR+TAKKQ+EELERVA TI Sbjct: 331 VKPFIMAYEGIQSQEEWEAAVEETMKNVPLFKEIVDYYSGPDRITAKKQEEELERVANTI 390 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCYN 195 PASAPASVKRFADRAVLSLQSNPGWGFD+KCQFMDKL REVNQCYN Sbjct: 391 PASAPASVKRFADRAVLSLQSNPGWGFDKKCQFMDKLVREVNQCYN 436 Score = 43.5 bits (101), Expect(2) = 8e-86 Identities = 19/30 (63%), Positives = 24/30 (80%) Frame = -2 Query: 799 MEDSEPEFATGRFLGDNADGEKLAERVGVE 710 M+D + F +G FLGDNADGEKLA ++GVE Sbjct: 235 MDDVDEGFGSGLFLGDNADGEKLAGKIGVE 264 >gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica] Length = 428 Score = 259 bits (661), Expect(2) = 2e-72 Identities = 125/166 (75%), Positives = 142/166 (85%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 E FEEMS VLPSP+ DAY++A+HTN+MIE EPEYLMGEF++NPDIDEKPP+ LRD LEK Sbjct: 263 ERFEEMSSEVLPSPLDDAYVDAMHTNFMIECEPEYLMGEFNKNPDIDEKPPISLRDALEK 322 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 +KPF+MAY+ I+SQ ETM+ VPLLKEIVD+YSGPDRVTAKKQQEELERVAKT+ Sbjct: 323 MKPFLMAYENIESQEEWEEVVNETMERVPLLKEIVDHYSGPDRVTAKKQQEELERVAKTL 382 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCYN 195 PA P SVKRF DRAVLSLQSNPGWGFDRKCQFMDKL +V+Q YN Sbjct: 383 
PAKVPDSVKRFTDRAVLSLQSNPGWGFDRKCQFMDKLVAKVSQHYN 428 Score = 41.2 bits (95), Expect(2) = 2e-72 Identities = 17/29 (58%), Positives = 25/29 (86%) Frame = -2 Query: 796 EDSEPEFATGRFLGDNADGEKLAERVGVE 710 +DS+ +A+G +LGDNADGEKLA+++G E Sbjct: 228 KDSDGSYASGLYLGDNADGEKLAKKLGPE 256 >emb|CBI17195.3| unnamed protein product [Vitis vinifera] Length = 209 Score = 263 bits (671), Expect(2) = 3e-72 Identities = 127/168 (75%), Positives = 143/168 (85%) Frame = -1 Query: 701 QIDEGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDV 522 ++DE FEEMS RVLPSP+ DAYL+ALHTN +IEFEPEYLM EF NPDIDE PP+PLRD Sbjct: 41 KLDEAFEEMSGRVLPSPIEDAYLDALHTNCLIEFEPEYLMEEFGTNPDIDENPPIPLRDA 100 Query: 521 LEKVKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVA 342 LEK+KPF+M Y+GIQSQ ETM+NVP LKE+VDYYSGPDRVTAKKQQEELERVA Sbjct: 101 LEKMKPFLMQYEGIQSQEEWEEVMKETMENVPYLKELVDYYSGPDRVTAKKQQEELERVA 160 Query: 341 KTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 KT+P +AP SVKRF DRA+LSLQSNPGWGFD+KCQFMDKL EV+Q Y Sbjct: 161 KTLPETAPNSVKRFTDRAILSLQSNPGWGFDKKCQFMDKLVWEVSQHY 208 Score = 37.0 bits (84), Expect(2) = 3e-72 Identities = 13/28 (46%), Positives = 22/28 (78%) Frame = -2 Query: 793 DSEPEFATGRFLGDNADGEKLAERVGVE 710 D++ ++ G +LGDNAD EKL+ ++G+E Sbjct: 10 DAQDDYGAGLYLGDNADAEKLSNKIGLE 37 >gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 474 Score = 263 bits (672), Expect(2) = 7e-72 Identities = 130/165 (78%), Positives = 142/165 (86%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 EGFEEM RVLPSPM DAYL+ALHTN IEFEPEYLM EF NPDIDEKPP+PLRD LEK Sbjct: 309 EGFEEMGSRVLPSPMDDAYLDALHTNCSIEFEPEYLMEEFGTNPDIDEKPPMPLRDALEK 368 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 +KPF+MAY+GIQSQ ETM+ VPLL+EIVDYYSGPDRVTAKKQQEELERVAKTI Sbjct: 369 MKPFLMAYEGIQSQEEWEEVIKETMERVPLLQEIVDYYSGPDRVTAKKQQEELERVAKTI 428 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 P AP+SVK+FA+RAVLSLQSNPGWGFD+KCQFMDKL 
EV+Q Y Sbjct: 429 PERAPSSVKQFANRAVLSLQSNPGWGFDKKCQFMDKLVWEVSQQY 473 Score = 35.0 bits (79), Expect(2) = 7e-72 Identities = 15/30 (50%), Positives = 21/30 (70%) Frame = -2 Query: 799 MEDSEPEFATGRFLGDNADGEKLAERVGVE 710 ++DS A G +LGDNADGEK A+ +G + Sbjct: 273 VKDSGEGSADGLYLGDNADGEKFAQTIGAD 302 >ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED: uncharacterized protein LOC101247662 isoform 2 [Solanum lycopersicum] Length = 473 Score = 261 bits (667), Expect(2) = 2e-70 Identities = 126/165 (76%), Positives = 142/165 (86%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 EGFEEMS RVLPSPM DAYLEALHTN MIE EPEYLMG+F+ NPDIDE PP+PLRD LEK Sbjct: 308 EGFEEMSARVLPSPMDDAYLEALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEK 367 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 +KPF+MAY+GI+ Q ETM+ VPL+KEIVDYYSGPDRVTAK+QQ+ELERVAKT+ Sbjct: 368 MKPFLMAYEGIKDQEEWEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTL 427 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 P SAP SVKRF +RAVLSLQSNPGWGFD+KCQFMDK+ EV+Q Y Sbjct: 428 PESAPNSVKRFTERAVLSLQSNPGWGFDKKCQFMDKVVMEVSQHY 472 Score = 32.0 bits (71), Expect(2) = 2e-70 Identities = 14/29 (48%), Positives = 20/29 (68%) Frame = -2 Query: 796 EDSEPEFATGRFLGDNADGEKLAERVGVE 710 E + +G +LGD+ADGEKLA ++G E Sbjct: 273 ERGDGNLESGFYLGDDADGEKLAAKLGPE 301 >ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum] Length = 480 Score = 259 bits (661), Expect(2) = 5e-70 Identities = 124/165 (75%), Positives = 141/165 (85%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 EGFEEMS RVLPSPM DAY+EALHTN MIE EPEYLMG+F+ NPDIDE PP+PLRD LEK Sbjct: 315 EGFEEMSARVLPSPMDDAYIEALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEK 374 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 +KPF+MAY+GI+ Q ETM+ VPL+KEIVDYYSGPDRVTAK+QQ+ELERVAKT+ Sbjct: 375 
MKPFLMAYEGIKDQEEWEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTL 434 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 P SAP SVKRF +RAVLSLQSNPGWGFD+KCQFMDK+ E +Q Y Sbjct: 435 PESAPNSVKRFTERAVLSLQSNPGWGFDKKCQFMDKVVMEASQHY 479 Score = 33.1 bits (74), Expect(2) = 5e-70 Identities = 14/29 (48%), Positives = 21/29 (72%) Frame = -2 Query: 796 EDSEPEFATGRFLGDNADGEKLAERVGVE 710 E + +G +LGD+ADGEKLA+++G E Sbjct: 280 ERGDGSLESGFYLGDDADGEKLAQKLGPE 308 >ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300131 [Fragaria vesca subsp. vesca] Length = 464 Score = 251 bits (642), Expect(2) = 1e-69 Identities = 121/168 (72%), Positives = 143/168 (85%) Frame = -1 Query: 701 QIDEGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDV 522 Q+ E FE+MS VLPSP+ DAY++AL TN IEFEPEYLMGEF+QNPDIDE+PP+PLRD Sbjct: 296 QLTEAFEDMSTHVLPSPLDDAYVDALDTNCKIEFEPEYLMGEFNQNPDIDEEPPIPLRDA 355 Query: 521 LEKVKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVA 342 LEK+KPF+MAY+GIQSQ ETM+ VPLLK+IVD+YSGPDRVTAKKQ+EELERVA Sbjct: 356 LEKMKPFLMAYEGIQSQEEWEEAIKETMERVPLLKKIVDHYSGPDRVTAKKQREELERVA 415 Query: 341 KTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 KT+PA+ P SVK+F DRAVLSLQ NPGWGF RKCQFMDKL ++V++ Y Sbjct: 416 KTLPANVPDSVKQFTDRAVLSLQGNPGWGFHRKCQFMDKLTQKVSKHY 463 Score = 38.9 bits (89), Expect(2) = 1e-69 Identities = 17/28 (60%), Positives = 22/28 (78%) Frame = -2 Query: 793 DSEPEFATGRFLGDNADGEKLAERVGVE 710 D + A+G +LGDNADGEKLAE++G E Sbjct: 265 DEDGGIASGLYLGDNADGEKLAEKLGPE 292 >ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera] Length = 482 Score = 264 bits (674), Expect = 3e-68 Identities = 138/219 (63%), Positives = 160/219 (73%), Gaps = 5/219 (2%) Frame = -1 Query: 839 KGEMERKMGSERENGG-----FGARICYRSVLGRQC*W*EAGGKGWC*EYEQIDEGFEEM 675 +G +MG R GG +GA + LG + K + ++DE FEEM Sbjct: 267 RGRGRGRMGDRRGRGGDAQDDYGAGL----YLGDNADAEKLSNKIGLEKMSKLDEAFEEM 322 Query: 674 SERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEKVKPFIM 495 S RVLPSP+ DAYL+ALHTN +IEFEPEYLM 
EF NPDIDE PP+PLRD LEK+KPF+M Sbjct: 323 SGRVLPSPIEDAYLDALHTNCLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLM 382 Query: 494 AYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTIPASAPA 315 Y+GIQSQ ETM+NVP LKE+VDYYSGPDRVTAKKQQEELERVAKT+P +AP Sbjct: 383 QYEGIQSQEEWEEVMKETMENVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPN 442 Query: 314 SVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 SVKRF DRA+LSLQSNPGWGFD+KCQFMDKL EV+Q Y Sbjct: 443 SVKRFTDRAILSLQSNPGWGFDKKCQFMDKLVWEVSQHY 481 >gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris] Length = 532 Score = 245 bits (626), Expect(2) = 3e-66 Identities = 124/168 (73%), Positives = 141/168 (83%) Frame = -1 Query: 701 QIDEGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDV 522 Q+ EGFEEM+ RVLPSP+ D YL+AL NY IEFEPEYL+ EFD NPDIDEK P+PLRD Sbjct: 366 QLTEGFEEMAGRVLPSPLEDEYLDALDINYAIEFEPEYLV-EFD-NPDIDEKEPIPLRDA 423 Query: 521 LEKVKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVA 342 LEK+KPF+MAY+GIQSQ ETM VPLLKEIVD+YSGPDRVTAKKQQEELERVA Sbjct: 424 LEKMKPFLMAYEGIQSQEEWEEIMEETMAQVPLLKEIVDHYSGPDRVTAKKQQEELERVA 483 Query: 341 KTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 KT+P SAP+SVK+F +RAV+SLQSNPGWGFD+KC FMDKL EV+Q Y Sbjct: 484 KTLPESAPSSVKQFTNRAVVSLQSNPGWGFDKKCHFMDKLVWEVSQHY 531 Score = 33.9 bits (76), Expect(2) = 3e-66 Identities = 15/28 (53%), Positives = 21/28 (75%) Frame = -2 Query: 793 DSEPEFATGRFLGDNADGEKLAERVGVE 710 D+E G ++GD+ADGEKLA++VG E Sbjct: 335 DAEASDDIGPYVGDDADGEKLAKKVGPE 362 >ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum] Length = 504 Score = 241 bits (616), Expect(2) = 9e-66 Identities = 120/166 (72%), Positives = 137/166 (82%) Frame = -1 Query: 701 QIDEGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDV 522 Q EGFEEM RVLPSP+ D Y+EA N IEFEPEY+M EFD NPDIDEK P+PLRD Sbjct: 337 QFTEGFEEMISRVLPSPLEDEYVEAFDINCAIEFEPEYIM-EFDSNPDIDEKEPIPLRDA 395 Query: 521 LEKVKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVA 342 LEK+KPF+M Y+GIQSQ ETM+ 
VPLLK+IVD+YSGPDRVTAKKQQEELERVA Sbjct: 396 LEKMKPFLMNYEGIQSQEEWEAIMEETMERVPLLKKIVDHYSGPDRVTAKKQQEELERVA 455 Query: 341 KTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQ 204 KT+PASAP+SV +F +RAV+SLQSNPGWGFD+KCQFMDKL EV+Q Sbjct: 456 KTLPASAPSSVVQFTNRAVMSLQSNPGWGFDKKCQFMDKLVFEVSQ 501 Score = 36.2 bits (82), Expect(2) = 9e-66 Identities = 16/30 (53%), Positives = 23/30 (76%) Frame = -2 Query: 799 MEDSEPEFATGRFLGDNADGEKLAERVGVE 710 ++D+ A+G FLGD+ DGEKLA++VG E Sbjct: 304 IQDNARSNASGLFLGDDVDGEKLAKKVGPE 333 >ref|XP_003533102.1| PREDICTED: uncharacterized protein LOC100780900 [Glycine max] Length = 481 Score = 237 bits (605), Expect(2) = 2e-64 Identities = 118/168 (70%), Positives = 138/168 (82%) Frame = -1 Query: 701 QIDEGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDV 522 Q+ EGFEEM+ RVLPSP+ D +L+AL NY IEFEPEYL+ EFD NPDIDEK P+ LRD Sbjct: 315 QLTEGFEEMTSRVLPSPLEDEFLDALDINYAIEFEPEYLV-EFD-NPDIDEKEPISLRDA 372 Query: 521 LEKVKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVA 342 LEK KPF+M+Y+GIQSQ ETM VPLLK+I+D+YSGPDRVTAKKQQEELERVA Sbjct: 373 LEKAKPFLMSYEGIQSQEEWEEIMEETMARVPLLKKIIDHYSGPDRVTAKKQQEELERVA 432 Query: 341 KTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 KT+P S P+SVK+F +RAV+SLQSNPGWGFD+KC FMDKL EV+Q Y Sbjct: 433 KTLPGSVPSSVKQFTNRAVISLQSNPGWGFDKKCHFMDKLVWEVSQHY 480 Score = 36.2 bits (82), Expect(2) = 2e-64 Identities = 15/24 (62%), Positives = 20/24 (83%) Frame = -2 Query: 781 EFATGRFLGDNADGEKLAERVGVE 710 ++ATG + GD+ADGEKLA +VG E Sbjct: 288 DYATGLYAGDDADGEKLARKVGPE 311 >gb|EEF06007.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 466 Score = 246 bits (629), Expect = 5e-63 Identities = 121/165 (73%), Positives = 137/165 (83%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 E FEEMS RVLP P+ D Y++A TN EFEPEYLMGEFD+NPDIDEKPP+PLRD LEK Sbjct: 301 EAFEEMSGRVLPCPIEDEYVDAFDTNCSFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEK 360 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 VKPF+MAY GI++ ETMK+ 
PL+K+IVD YSGPDRV+ KKQ+EELERVAKTI Sbjct: 361 VKPFMMAYMGIKTHEEWEEIVEETMKDAPLMKKIVDSYSGPDRVSGKKQKEELERVAKTI 420 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 PASAP SVK FADRAVLSLQSNPGWGFD+KC FMDKLA+EV+Q Y Sbjct: 421 PASAPDSVKSFADRAVLSLQSNPGWGFDKKCMFMDKLAKEVSQHY 465 >ref|XP_002321880.1| predicted protein [Populus trichocarpa] Length = 466 Score = 246 bits (629), Expect = 5e-63 Identities = 121/165 (73%), Positives = 137/165 (83%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 E FEEMS RVLP P+ D Y++A TN EFEPEYLMGEFD+NPDIDEKPP+PLRD LEK Sbjct: 301 EAFEEMSGRVLPCPIEDEYVDAFDTNCSFEFEPEYLMGEFDKNPDIDEKPPMPLRDALEK 360 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 VKPF+MAY GI++ ETMK+ PL+K+IVD YSGPDRV+ KKQ+EELERVAKTI Sbjct: 361 VKPFMMAYMGIKTHEEWEEIVEETMKDAPLMKKIVDSYSGPDRVSGKKQKEELERVAKTI 420 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 PASAP SVK FADRAVLSLQSNPGWGFD+KC FMDKLA+EV+Q Y Sbjct: 421 PASAPDSVKSFADRAVLSLQSNPGWGFDKKCMFMDKLAKEVSQHY 465 >ref|XP_004147751.1| PREDICTED: uncharacterized protein LOC101215545 [Cucumis sativus] gi|449502143|ref|XP_004161555.1| PREDICTED: uncharacterized protein LOC101224016 [Cucumis sativus] Length = 478 Score = 231 bits (588), Expect(2) = 7e-63 Identities = 113/165 (68%), Positives = 132/165 (80%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 EGFEEMS RVLPSP+ D YL+ + TN+MIE EPEYLMG+F+ NPDIDE PP+PLRD LEK Sbjct: 313 EGFEEMSGRVLPSPLVDQYLDGMDTNFMIECEPEYLMGDFENNPDIDENPPIPLRDALEK 372 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 +KPF+MAY+ IQS ETM++VPLLKEIVD Y GPDRVTAK+QQ ELERVAKT+ Sbjct: 373 MKPFLMAYENIQSHEEWEEIVEETMQSVPLLKEIVDAYGGPDRVTAKEQQGELERVAKTL 432 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 P SAP SVK+F +R VLSLQSNPGWGFD+K Q MDKL ++ Y Sbjct: 433 PQSAPNSVKQFTNRVVLSLQSNPGWGFDKKWQLMDKLVEGFSKRY 477 Score = 37.4 bits (85), Expect(2) = 7e-63 Identities = 15/28 (53%), 
Positives = 21/28 (75%) Frame = -2 Query: 793 DSEPEFATGRFLGDNADGEKLAERVGVE 710 D E +A G +LG+N DGE+LA+R+G E Sbjct: 279 DKEDGYAAGLYLGNNEDGERLAKRIGTE 306 >ref|XP_003629502.1| hypothetical protein MTR_8g078230 [Medicago truncatula] gi|355523524|gb|AET03978.1| hypothetical protein MTR_8g078230 [Medicago truncatula] Length = 502 Score = 228 bits (582), Expect(2) = 4e-61 Identities = 115/167 (68%), Positives = 136/167 (81%) Frame = -1 Query: 704 EQIDEGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRD 525 +QI E +EE+ ERVLPSP+ D Y+EA+ N IEFEPEY + EFD NPDIDEK P+ LRD Sbjct: 335 DQITEAYEEIIERVLPSPLQDEYVEAMDINCAIEFEPEYAV-EFD-NPDIDEKEPIALRD 392 Query: 524 VLEKVKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERV 345 LEK+KPF+M Y+GI+SQ E M+ VPLLK+IVD+YSGPDRVTAKKQQEELERV Sbjct: 393 ALEKMKPFLMTYEGIRSQEEWEEVIEELMQRVPLLKKIVDHYSGPDRVTAKKQQEELERV 452 Query: 344 AKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQ 204 AKT+P SAP+SVK F +RAV+SLQSNPGWGFD+KCQFMDKL EV+Q Sbjct: 453 AKTLPTSAPSSVKEFTNRAVVSLQSNPGWGFDKKCQFMDKLVFEVSQ 499 Score = 33.9 bits (76), Expect(2) = 4e-61 Identities = 14/22 (63%), Positives = 19/22 (86%) Frame = -2 Query: 775 ATGRFLGDNADGEKLAERVGVE 710 A G ++GDNADGEKLA+++G E Sbjct: 311 ADGLYVGDNADGEKLAKKLGPE 332 >gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana gi|2129727 and contains RNA recognition PF|00076 domain. 
ESTs gb|H37317, gb|F14415, gb|AA651290 come from this gene [Arabidopsis thaliana] Length = 829 Score = 226 bits (577), Expect(2) = 2e-59 Identities = 110/165 (66%), Positives = 130/165 (78%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 EGFEE+ E+ LPS HDA ++A TN MIE EPEY+M +F NPDIDEKPP+ LR+ LEK Sbjct: 664 EGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEK 723 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 VKPFI+AY+GI+ Q E M PL+KEIVD+YSGPDRVTAKKQ EEL+R+A T+ Sbjct: 724 VKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDHYSGPDRVTAKKQNEELDRIATTL 783 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 PASAP SVKRFADRA L+L+SNPGWGFD+K QFMDKL EV+Q Y Sbjct: 784 PASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKLVLEVSQSY 828 Score = 30.4 bits (67), Expect(2) = 2e-59 Identities = 16/29 (55%), Positives = 20/29 (68%) Frame = -2 Query: 796 EDSEPEFATGRFLGDNADGEKLAERVGVE 710 E+ E E A F GD+ADGEK AE++G E Sbjct: 630 EEGEQE-AMRIFAGDSADGEKFAEKMGPE 657 >ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown protein; 43598-45751 [Arabidopsis thaliana] gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|110740318|dbj|BAF02054.1| hypothetical protein [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 523 Score = 226 bits (577), Expect(2) = 2e-59 Identities = 110/165 (66%), Positives = 130/165 (78%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 EGFEE+ E+ LPS HDA ++A TN MIE EPEY+M +F NPDIDEKPP+ LR+ LEK Sbjct: 358 EGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEK 417 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 VKPFI+AY+GI+ Q E M PL+KEIVD+YSGPDRVTAKKQ EEL+R+A T+ Sbjct: 418 
VKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDHYSGPDRVTAKKQNEELDRIATTL 477 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 PASAP SVKRFADRA L+L+SNPGWGFD+K QFMDKL EV+Q Y Sbjct: 478 PASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKLVLEVSQSY 522 Score = 30.4 bits (67), Expect(2) = 2e-59 Identities = 16/29 (55%), Positives = 20/29 (68%) Frame = -2 Query: 796 EDSEPEFATGRFLGDNADGEKLAERVGVE 710 E+ E E A F GD+ADGEK AE++G E Sbjct: 324 EEGEQE-AMRIFAGDSADGEKFAEKMGPE 351 >gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis thaliana gi|2129727 and contains RNA recognition PF|00076 domain [Arabidopsis thaliana] Length = 523 Score = 226 bits (577), Expect(2) = 2e-59 Identities = 110/165 (66%), Positives = 130/165 (78%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 EGFEE+ E+ LPS HDA ++A TN MIE EPEY+M +F NPDIDEKPP+ LR+ LEK Sbjct: 358 EGFEEICEKALPSTTHDAIIDAYDTNLMIECEPEYIMPDFGSNPDIDEKPPMSLRECLEK 417 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 VKPFI+AY+GI+ Q E M PL+KEIVD+YSGPDRVTAKKQ EEL+R+A T+ Sbjct: 418 VKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDHYSGPDRVTAKKQNEELDRIATTL 477 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 PASAP SVKRFADRA L+L+SNPGWGFD+K QFMDKL EV+Q Y Sbjct: 478 PASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKLVLEVSQSY 522 Score = 30.4 bits (67), Expect(2) = 2e-59 Identities = 16/29 (55%), Positives = 20/29 (68%) Frame = -2 Query: 796 EDSEPEFATGRFLGDNADGEKLAERVGVE 710 E+ E E A F GD+ADGEK AE++G E Sbjct: 324 EEGEQE-AMRIFAGDSADGEKFAEKMGPE 351 >ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297340299|gb|EFH70716.1| predicted protein [Arabidopsis lyrata subsp. 
lyrata] Length = 769 Score = 226 bits (575), Expect(2) = 3e-59 Identities = 110/165 (66%), Positives = 129/165 (78%) Frame = -1 Query: 692 EGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEK 513 EGFEE+ E+ LPS HDA ++A TN MIE EPEY+M +F NPDIDEKPP+ LR+ LEK Sbjct: 604 EGFEEVCEKALPSTTHDAIIDAYDTNLMIECEPEYIMADFGSNPDIDEKPPMSLRECLEK 663 Query: 512 VKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTI 333 VKPFI+AY+GI+ Q E M PL+KEIVD+YSGPDRVTAKKQ EEL+ +A TI Sbjct: 664 VKPFIVAYEGIKDQEEWEEAVNEAMAQAPLMKEIVDHYSGPDRVTAKKQNEELDSIATTI 723 Query: 332 PASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 PASAP SVKRFADRA L+L+SNPGWGFD+K QFMDKL EV+Q Y Sbjct: 724 PASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKLVLEVSQSY 768 Score = 30.0 bits (66), Expect(2) = 3e-59 Identities = 15/29 (51%), Positives = 20/29 (68%) Frame = -2 Query: 796 EDSEPEFATGRFLGDNADGEKLAERVGVE 710 E+ E E A F GD+ADGEK A+++G E Sbjct: 570 EEGEQE-AMSIFAGDSADGEKFAQKMGPE 597 >gb|ESQ30058.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum] Length = 531 Score = 223 bits (567), Expect(2) = 5e-59 Identities = 109/169 (64%), Positives = 130/169 (76%) Frame = -1 Query: 704 EQIDEGFEEMSERVLPSPMHDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRD 525 + + +G+E++ ER LPS +DA L+A TN MIE EPEYLM F NPDIDEKPP+ LR+ Sbjct: 362 KMLADGYEDICERALPSTANDAVLDAYETNLMIECEPEYLMPAFGSNPDIDEKPPMSLRE 421 Query: 524 VLEKVKPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERV 345 LEKVKPFI+AY+GI+ Q E M PL+KEIVD+YSGPDRVTAKKQ EEL+R+ Sbjct: 422 CLEKVKPFIVAYEGIKDQEEWEEAIDEVMAQAPLIKEIVDHYSGPDRVTAKKQNEELDRI 481 Query: 344 AKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQCY 198 A T+P SAP SVKRFADRA LSL+SNPGWGFD+K QFMDKL EV+Q Y Sbjct: 482 ATTVPKSAPDSVKRFADRAALSLKSNPGWGFDKKYQFMDKLVAEVSQSY 530 Score = 32.7 bits (73), Expect(2) = 5e-59 Identities = 14/30 (46%), Positives = 20/30 (66%) Frame = -2 Query: 799 MEDSEPEFATGRFLGDNADGEKLAERVGVE 710 ME+ + A F+GD+ADGEK A ++G E Sbjct: 330 MEEEAEQEAISTFVGDSADGEKFANKMGPE 359