BLASTX nr result
ID: Jatropha_contig00043194
seq
BLASTX 2.2.25 [Feb-01-2011] Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402. Query= Jatropha_contig00043194 (838 letters) Database: NCBI-nr (updated 2014/02/11) 35,149,712 sequences; 12,374,887,350 total letters Searching..................................................done Score E Sequences producing significant alignments: (bits) Value ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm... 195 1e-47 gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, ... 169 2e-46 ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253... 167 5e-45 ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247... 162 5e-45 emb|CBI17195.3| unnamed protein product [Vitis vinifera] 167 5e-45 ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu... 160 2e-44 gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus... 164 7e-41 ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507... 161 3e-40 gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus pe... 171 4e-40 gb|EEF06007.2| hydroxyproline-rich glycoprotein [Populus trichoc... 164 4e-38 ref|XP_002321880.1| predicted protein [Populus trichocarpa] 164 4e-38 ref|XP_003629502.1| hypothetical protein MTR_8g078230 [Medicago ... 159 5e-38 ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300... 162 1e-37 ref|XP_003533102.1| PREDICTED: uncharacterized protein LOC100780... 160 4e-37 gb|ESQ30058.1| hypothetical protein EUTSA_v10011382mg [Eutrema s... 158 2e-36 gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding prot... 158 2e-36 ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein... 158 2e-36 gb|AAM65660.1| Contains similarity to RNA-binding protein from A... 158 2e-36 ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp.... 157 5e-36 ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Caps... 156 7e-36 >ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis] gi|223537066|gb|EEF38701.1| conserved hypothetical protein [Ricinus communis] Length = 436 Score = 195 bits (496), Expect = 1e-47 Identities = 101/140 (72%), Positives = 111/140 (79%) Frame = -2 Query: 648 LNFEPRVFDGRV*SKSQIIDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMK 469 + FEP G ++ IDEK P+ L ++ KV PFIMAY+GIQSQ ETMK Sbjct: 299 IEFEPEYLMGEF-DQNPDIDEKPPMPLRDVL-EKVKPFIMAYEGIQSQEEWEAAVEETMK 356 Query: 468 NVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWG 289 NVPL KEIVDYYSGPDR+TAKKQ+EELERVA TIPASAPASVKRFADRAVLSLQSNPGWG Sbjct: 357 NVPLFKEIVDYYSGPDRITAKKQEEELERVANTIPASAPASVKRFADRAVLSLQSNPGWG 416 Query: 288 FDRKCQFMDKLAREVNQCYN 229 FD+KCQFMDKL REVNQCYN Sbjct: 417 FDKKCQFMDKLVREVNQCYN 436 >gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 474 Score = 169 bits (428), Expect(2) = 2e-46 Identities = 87/121 (71%), Positives = 99/121 (81%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L + K+ PF+MAY+GIQSQ ETM+ VPLL+EIVDYYSGPDRV Sbjct: 354 IDEKPPMPLRDAL-EKMKPFLMAYEGIQSQEEWEEVIKETMERVPLLQEIVDYYSGPDRV 412 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQQEELERVAKTIP AP+SVK+FA+RAVLSLQSNPGWGFD+KCQFMDKL EV+Q Sbjct: 413 TAKKQQEELERVAKTIPERAPSSVKQFANRAVLSLQSNPGWGFDKKCQFMDKLVWEVSQQ 472 Query: 234 Y 232 Y Sbjct: 473 Y 473 Score = 44.3 bits (103), Expect(2) = 2e-46 Identities = 38/91 (41%), Positives = 43/91 (47%), Gaps = 4/91 (4%) Frame = -3 Query: 836 GARIATGLVLGDNC*W*EAGGKGWLLR----NMNKID*RI*RNE*KECCPSPMQDAYLEA 669 G A GL LGDN A G+ + NMNK+ PSPM DAYL+A Sbjct: 277 GEGSADGLYLGDN-----ADGEKFAQTIGADNMNKLVEGF-EEMGSRVLPSPMDDAYLDA 330 Query: 668 LHTKLYD*ILSPEYLMGEFDQNPRLLMRKPP 576 LHT PEYLM EF NP + KPP Sbjct: 331 LHTNC-SIEFEPEYLMEEFGTNPD-IDEKPP 359 >ref|XP_002274822.2| PREDICTED: uncharacterized protein LOC100253300 [Vitis vinifera] Length = 482 Score = 167 bits (422), Expect(2) = 5e-45 Identities = 84/121 (69%), Positives = 96/121 (79%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDE P+ L + K+ PF+M Y+GIQSQ ETM+NVP LKE+VDYYSGPDRV Sbjct: 362 IDENPPIPLRDAL-EKMKPFLMQYEGIQSQEEWEEVMKETMENVPYLKELVDYYSGPDRV 420 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQQEELERVAKT+P +AP SVKRF DRA+LSLQSNPGWGFD+KCQFMDKL EV+Q Sbjct: 421 TAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWGFDKKCQFMDKLVWEVSQH 480 Query: 234 Y 232 Y Sbjct: 481 Y 481 Score = 41.6 bits (96), Expect(2) = 5e-45 Identities = 31/73 (42%), Positives = 38/73 (52%) Frame = -3 Query: 818 GLVLGDNC*W*EAGGKGWLLRNMNKID*RI*RNE*KECCPSPMQDAYLEALHTKLYD*IL 639 GL LGDN + K L M+K+D + PSP++DAYL+ALHT Sbjct: 291 GLYLGDNADAEKLSNKIGL-EKMSKLDEAFEEMSGR-VLPSPIEDAYLDALHTNCLI-EF 347 Query: 638 SPEYLMGEFDQNP 600 PEYLM EF NP Sbjct: 348 EPEYLMEEFGTNP 360 >ref|XP_004230134.1| PREDICTED: uncharacterized protein LOC101247662 isoform 1 [Solanum lycopersicum] gi|460368563|ref|XP_004230135.1| PREDICTED: uncharacterized protein LOC101247662 isoform 2 [Solanum lycopersicum] Length = 473 Score = 162 bits (409), Expect(2) = 5e-45 Identities = 81/121 (66%), Positives = 96/121 (79%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDE P+ L + K+ PF+MAY+GI+ Q ETM+ VPL+KEIVDYYSGPDRV Sbjct: 353 IDETPPIPLRDAL-EKMKPFLMAYEGIKDQEEWEEVIKETMETVPLMKEIVDYYSGPDRV 411 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAK+QQ+ELERVAKT+P SAP SVKRF +RAVLSLQSNPGWGFD+KCQFMDK+ EV+Q Sbjct: 412 TAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGWGFDKKCQFMDKVVMEVSQH 471 Query: 234 Y 232 Y Sbjct: 472 Y 472 Score = 46.6 bits (109), Expect(2) = 5e-45 Identities = 22/34 (64%), Positives = 25/34 (73%) Frame = -3 Query: 701 PSPMQDAYLEALHTKLYD*ILSPEYLMGEFDQNP 600 PSPM DAYLEALHT + PEYLMG+F+ NP Sbjct: 319 PSPMDDAYLEALHTNMMI-ECEPEYLMGDFESNP 351 >emb|CBI17195.3| unnamed protein product [Vitis vinifera] Length = 209 Score = 167 bits (422), Expect(2) = 5e-45 Identities = 84/121 (69%), Positives = 96/121 (79%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDE P+ L + K+ PF+M Y+GIQSQ ETM+NVP LKE+VDYYSGPDRV Sbjct: 89 IDENPPIPLRDAL-EKMKPFLMQYEGIQSQEEWEEVMKETMENVPYLKELVDYYSGPDRV 147 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQQEELERVAKT+P +AP SVKRF DRA+LSLQSNPGWGFD+KCQFMDKL EV+Q Sbjct: 148 TAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWGFDKKCQFMDKLVWEVSQH 207 Query: 234 Y 232 Y Sbjct: 208 Y 208 Score = 41.6 bits (96), Expect(2) = 5e-45 Identities = 31/73 (42%), Positives = 38/73 (52%) Frame = -3 Query: 818 GLVLGDNC*W*EAGGKGWLLRNMNKID*RI*RNE*KECCPSPMQDAYLEALHTKLYD*IL 639 GL LGDN + K L M+K+D + PSP++DAYL+ALHT Sbjct: 18 GLYLGDNADAEKLSNKIGL-EKMSKLDEAFEEMSGR-VLPSPIEDAYLDALHTNCLI-EF 74 Query: 638 SPEYLMGEFDQNP 600 PEYLM EF NP Sbjct: 75 EPEYLMEEFGTNP 87 >ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum] Length = 480 Score = 160 bits (405), Expect(2) = 2e-44 Identities = 80/121 (66%), Positives = 95/121 (78%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDE P+ L + K+ PF+MAY+GI+ Q ETM+ VPL+KEIVDYYSGPDRV Sbjct: 360 IDETPPIPLRDAL-EKMKPFLMAYEGIKDQEEWEEVIKETMETVPLMKEIVDYYSGPDRV 418 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAK+QQ+ELERVAKT+P SAP SVKRF +RAVLSLQSNPGWGFD+KCQFMDK+ E +Q Sbjct: 419 TAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQSNPGWGFDKKCQFMDKVVMEASQH 478 Query: 234 Y 232 Y Sbjct: 479 Y 479 Score = 45.8 bits (107), Expect(2) = 2e-44 Identities = 21/34 (61%), Positives = 25/34 (73%) Frame = -3 Query: 701 PSPMQDAYLEALHTKLYD*ILSPEYLMGEFDQNP 600 PSPM DAY+EALHT + PEYLMG+F+ NP Sbjct: 326 PSPMDDAYIEALHTNMMI-ECEPEYLMGDFESNP 358 >gb|ESW19411.1| hypothetical protein PHAVU_006G122700g [Phaseolus vulgaris] Length = 532 Score = 164 bits (416), Expect(2) = 7e-41 Identities = 84/121 (69%), Positives = 97/121 (80%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L + K+ PF+MAY+GIQSQ ETM VPLLKEIVD+YSGPDRV Sbjct: 412 IDEKEPIPLRDAL-EKMKPFLMAYEGIQSQEEWEEIMEETMAQVPLLKEIVDHYSGPDRV 470 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQQEELERVAKT+P SAP+SVK+F +RAV+SLQSNPGWGFD+KC FMDKL EV+Q Sbjct: 471 TAKKQQEELERVAKTLPESAPSSVKQFTNRAVVSLQSNPGWGFDKKCHFMDKLVWEVSQH 530 Query: 234 Y 232 Y Sbjct: 531 Y 531 Score = 30.0 bits (66), Expect(2) = 7e-41 Identities = 19/41 (46%), Positives = 26/41 (63%) Frame = -3 Query: 701 PSPMQDAYLEALHTKLYD*ILSPEYLMGEFDQNPRLLMRKP 579 PSP++D YL+AL Y PEYL+ EFD NP + ++P Sbjct: 380 PSPLEDEYLDALDIN-YAIEFEPEYLV-EFD-NPDIDEKEP 417 >ref|XP_004509236.1| PREDICTED: uncharacterized protein LOC101507965 [Cicer arietinum] Length = 504 Score = 161 bits (408), Expect(2) = 3e-40 Identities = 82/119 (68%), Positives = 97/119 (81%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L + K+ PF+M Y+GIQSQ ETM+ VPLLK+IVD+YSGPDRV Sbjct: 384 IDEKEPIPLRDAL-EKMKPFLMNYEGIQSQEEWEAIMEETMERVPLLKKIVDHYSGPDRV 442 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQ 238 TAKKQQEELERVAKT+PASAP+SV +F +RAV+SLQSNPGWGFD+KCQFMDKL EV+Q Sbjct: 443 TAKKQQEELERVAKTLPASAPSSVVQFTNRAVMSLQSNPGWGFDKKCQFMDKLVFEVSQ 501 Score = 31.2 bits (69), Expect(2) = 3e-40 Identities = 17/41 (41%), Positives = 24/41 (58%) Frame = -3 Query: 701 PSPMQDAYLEALHTKLYD*ILSPEYLMGEFDQNPRLLMRKP 579 PSP++D Y+EA PEY+M EFD NP + ++P Sbjct: 351 PSPLEDEYVEAFDINCAI-EFEPEYIM-EFDSNPDIDEKEP 389 >gb|EMJ14490.1| hypothetical protein PRUPE_ppa006080mg [Prunus persica] Length = 428 Score = 171 bits (432), Expect = 4e-40 Identities = 90/137 (65%), Positives = 103/137 (75%) Frame = -2 Query: 639 EPRVFDGRV*SKSQIIDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVP 460 EP G +K+ IDEK P+ L + K+ PF+MAY+ I+SQ ETM+ VP Sbjct: 294 EPEYLMGEF-NKNPDIDEKPPISLRDAL-EKMKPFLMAYENIESQEEWEEVVNETMERVP 351 Query: 459 LLKEIVDYYSGPDRVTAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDR 280 LLKEIVD+YSGPDRVTAKKQQEELERVAKT+PA P SVKRF DRAVLSLQSNPGWGFDR Sbjct: 352 LLKEIVDHYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSNPGWGFDR 411 Query: 279 KCQFMDKLAREVNQCYN 229 KCQFMDKL +V+Q YN Sbjct: 412 KCQFMDKLVAKVSQHYN 428 >gb|EEF06007.2| hydroxyproline-rich glycoprotein [Populus trichocarpa] Length = 466 Score = 164 bits (414), Expect = 4e-38 Identities = 88/137 (64%), Positives = 102/137 (74%) Frame = -2 Query: 642 FEPRVFDGRV*SKSQIIDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNV 463 FEP G K+ IDEK P+ L + KV PF+MAY GI++ ETMK+ Sbjct: 331 FEPEYLMGEF-DKNPDIDEKPPMPLRDAL-EKVKPFMMAYMGIKTHEEWEEIVEETMKDA 388 Query: 462 PLLKEIVDYYSGPDRVTAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFD 283 PL+K+IVD YSGPDRV+ KKQ+EELERVAKTIPASAP SVK FADRAVLSLQSNPGWGFD Sbjct: 389 PLMKKIVDSYSGPDRVSGKKQKEELERVAKTIPASAPDSVKSFADRAVLSLQSNPGWGFD 448 Query: 282 RKCQFMDKLAREVNQCY 232 +KC FMDKLA+EV+Q Y Sbjct: 449 KKCMFMDKLAKEVSQHY 465 >ref|XP_002321880.1| predicted protein [Populus trichocarpa] Length = 466 Score = 164 bits (414), Expect = 4e-38 Identities = 88/137 (64%), Positives = 102/137 (74%) Frame = -2 Query: 642 FEPRVFDGRV*SKSQIIDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNV 463 FEP G K+ IDEK P+ L + KV PF+MAY GI++ ETMK+ Sbjct: 331 FEPEYLMGEF-DKNPDIDEKPPMPLRDAL-EKVKPFMMAYMGIKTHEEWEEIVEETMKDA 388 Query: 462 PLLKEIVDYYSGPDRVTAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFD 283 PL+K+IVD YSGPDRV+ KKQ+EELERVAKTIPASAP SVK FADRAVLSLQSNPGWGFD Sbjct: 389 PLMKKIVDSYSGPDRVSGKKQKEELERVAKTIPASAPDSVKSFADRAVLSLQSNPGWGFD 448 Query: 282 RKCQFMDKLAREVNQCY 232 +KC FMDKLA+EV+Q Y Sbjct: 449 KKCMFMDKLAKEVSQHY 465 >ref|XP_003629502.1| hypothetical protein MTR_8g078230 [Medicago truncatula] gi|355523524|gb|AET03978.1| hypothetical protein MTR_8g078230 [Medicago truncatula] Length = 502 Score = 159 bits (402), Expect(2) = 5e-38 Identities = 80/119 (67%), Positives = 95/119 (79%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L + K+ PF+M Y+GI+SQ E M+ VPLLK+IVD+YSGPDRV Sbjct: 382 IDEKEPIALRDAL-EKMKPFLMTYEGIRSQEEWEEVIEELMQRVPLLKKIVDHYSGPDRV 440 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQ 238 TAKKQQEELERVAKT+P SAP+SVK F +RAV+SLQSNPGWGFD+KCQFMDKL EV+Q Sbjct: 441 TAKKQQEELERVAKTLPTSAPSSVKEFTNRAVVSLQSNPGWGFDKKCQFMDKLVFEVSQ 499 Score = 25.8 bits (55), Expect(2) = 5e-38 Identities = 17/41 (41%), Positives = 24/41 (58%) Frame = -3 Query: 701 PSPMQDAYLEALHTKLYD*ILSPEYLMGEFDQNPRLLMRKP 579 PSP+QD Y+EA+ PEY + EFD NP + ++P Sbjct: 350 PSPLQDEYVEAMDINCAI-EFEPEYAV-EFD-NPDIDEKEP 387 >ref|XP_004295550.1| PREDICTED: uncharacterized protein LOC101300131 [Fragaria vesca subsp. vesca] Length = 464 Score = 162 bits (410), Expect = 1e-37 Identities = 84/139 (60%), Positives = 105/139 (75%) Frame = -2 Query: 648 LNFEPRVFDGRV*SKSQIIDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMK 469 + FEP G +++ IDE+ P+ L + K+ PF+MAY+GIQSQ ETM+ Sbjct: 327 IEFEPEYLMGEF-NQNPDIDEEPPIPLRDAL-EKMKPFLMAYEGIQSQEEWEEAIKETME 384 Query: 468 NVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWG 289 VPLLK+IVD+YSGPDRVTAKKQ+EELERVAKT+PA+ P SVK+F DRAVLSLQ NPGWG Sbjct: 385 RVPLLKKIVDHYSGPDRVTAKKQREELERVAKTLPANVPDSVKQFTDRAVLSLQGNPGWG 444 Query: 288 FDRKCQFMDKLAREVNQCY 232 F RKCQFMDKL ++V++ Y Sbjct: 445 FHRKCQFMDKLTQKVSKHY 463 >ref|XP_003533102.1| PREDICTED: uncharacterized protein LOC100780900 [Glycine max] Length = 481 Score = 160 bits (406), Expect = 4e-37 Identities = 80/121 (66%), Positives = 95/121 (78%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L + K PF+M+Y+GIQSQ ETM VPLLK+I+D+YSGPDRV Sbjct: 361 IDEKEPISLRDAL-EKAKPFLMSYEGIQSQEEWEEIMEETMARVPLLKKIIDHYSGPDRV 419 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQQEELERVAKT+P S P+SVK+F +RAV+SLQSNPGWGFD+KC FMDKL EV+Q Sbjct: 420 TAKKQQEELERVAKTLPGSVPSSVKQFTNRAVISLQSNPGWGFDKKCHFMDKLVWEVSQH 479 Query: 234 Y 232 Y Sbjct: 480 Y 480 >gb|ESQ30058.1| hypothetical protein EUTSA_v10011382mg [Eutrema salsugineum] Length = 531 Score = 158 bits (400), Expect = 2e-36 Identities = 80/121 (66%), Positives = 92/121 (76%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L E C KV PFI+AY+GI+ Q E M PL+KEIVD+YSGPDRV Sbjct: 411 IDEKPPMSLRE-CLEKVKPFIVAYEGIKDQEEWEEAIDEVMAQAPLIKEIVDHYSGPDRV 469 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQ EEL+R+A T+P SAP SVKRFADRA LSL+SNPGWGFD+K QFMDKL EV+Q Sbjct: 470 TAKKQNEELDRIATTVPKSAPDSVKRFADRAALSLKSNPGWGFDKKYQFMDKLVAEVSQS 529 Query: 234 Y 232 Y Sbjct: 530 Y 530 >gb|AAF78422.1|AC018748_1 Contains similarity to RNA-binding protein from Arabidopsis thaliana gi|2129727 and contains RNA recognition PF|00076 domain. ESTs gb|H37317, gb|F14415, gb|AA651290 come from this gene [Arabidopsis thaliana] Length = 829 Score = 158 bits (400), Expect = 2e-36 Identities = 80/121 (66%), Positives = 93/121 (76%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L E C KV PFI+AY+GI+ Q E M PL+KEIVD+YSGPDRV Sbjct: 709 IDEKPPMSLRE-CLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDHYSGPDRV 767 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQ EEL+R+A T+PASAP SVKRFADRA L+L+SNPGWGFD+K QFMDKL EV+Q Sbjct: 768 TAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKLVLEVSQS 827 Query: 234 Y 232 Y Sbjct: 828 Y 828 >ref|NP_564639.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] gi|12324041|gb|AAG51990.1|AC024260_28 unknown protein; 43598-45751 [Arabidopsis thaliana] gi|16323139|gb|AAL15304.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|23506017|gb|AAN28868.1| At1g53640/F22G10.8 [Arabidopsis thaliana] gi|110740318|dbj|BAF02054.1| hypothetical protein [Arabidopsis thaliana] gi|332194854|gb|AEE32975.1| hydroxyproline-rich glycoprotein family protein [Arabidopsis thaliana] Length = 523 Score = 158 bits (400), Expect = 2e-36 Identities = 80/121 (66%), Positives = 93/121 (76%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L E C KV PFI+AY+GI+ Q E M PL+KEIVD+YSGPDRV Sbjct: 403 IDEKPPMSLRE-CLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDHYSGPDRV 461 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQ EEL+R+A T+PASAP SVKRFADRA L+L+SNPGWGFD+K QFMDKL EV+Q Sbjct: 462 TAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKLVLEVSQS 521 Query: 234 Y 232 Y Sbjct: 522 Y 522 >gb|AAM65660.1| Contains similarity to RNA-binding protein from Arabidopsis thaliana gi|2129727 and contains RNA recognition PF|00076 domain [Arabidopsis thaliana] Length = 523 Score = 158 bits (400), Expect = 2e-36 Identities = 80/121 (66%), Positives = 93/121 (76%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L E C KV PFI+AY+GI+ Q E M PL+KEIVD+YSGPDRV Sbjct: 403 IDEKPPMSLRE-CLEKVKPFIVAYEGIKDQEEWEEAINEAMTQAPLMKEIVDHYSGPDRV 461 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQ EEL+R+A T+PASAP SVKRFADRA L+L+SNPGWGFD+K QFMDKL EV+Q Sbjct: 462 TAKKQNEELDRIATTLPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKLVLEVSQS 521 Query: 234 Y 232 Y Sbjct: 522 Y 522 >ref|XP_002894457.1| predicted protein [Arabidopsis lyrata subsp. lyrata] gi|297340299|gb|EFH70716.1| predicted protein [Arabidopsis lyrata subsp. lyrata] Length = 769 Score = 157 bits (396), Expect = 5e-36 Identities = 80/121 (66%), Positives = 92/121 (76%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L E C KV PFI+AY+GI+ Q E M PL+KEIVD+YSGPDRV Sbjct: 649 IDEKPPMSLRE-CLEKVKPFIVAYEGIKDQEEWEEAVNEAMAQAPLMKEIVDHYSGPDRV 707 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQ EEL+ +A TIPASAP SVKRFADRA L+L+SNPGWGFD+K QFMDKL EV+Q Sbjct: 708 TAKKQNEELDSIATTIPASAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKLVLEVSQS 767 Query: 234 Y 232 Y Sbjct: 768 Y 768 >ref|XP_006307233.1| hypothetical protein CARUB_v10008838mg [Capsella rubella] gi|482575944|gb|EOA40131.1| hypothetical protein CARUB_v10008838mg [Capsella rubella] Length = 525 Score = 156 bits (395), Expect = 7e-36 Identities = 79/121 (65%), Positives = 92/121 (76%) Frame = -2 Query: 594 IDEKAPLYLSEMCWRKVNPFIMAYDGIQSQXXXXXXXXETMKNVPLLKEIVDYYSGPDRV 415 IDEK P+ L E C KV PFI+AY+GI+ Q E M PL+KEIVD+YSGPDRV Sbjct: 405 IDEKPPMSLRE-CLEKVKPFIVAYEGIKDQEEWEEAINEAMAQAPLMKEIVDHYSGPDRV 463 Query: 414 TAKKQQEELERVAKTIPASAPASVKRFADRAVLSLQSNPGWGFDRKCQFMDKLAREVNQC 235 TAKKQ EEL+R+A T+P SAP SVKRFADRA L+L+SNPGWGFD+K QFMDKL EV+Q Sbjct: 464 TAKKQNEELDRIATTLPKSAPDSVKRFADRAALTLKSNPGWGFDKKYQFMDKLVLEVSQS 523 Query: 234 Y 232 Y Sbjct: 524 Y 524