BLASTX nr result
ID: Forsythia22_contig00007765
seq
BLASTX 2.2.25 [Feb-01-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00007765 (2566 letters)

Database: ./nr
69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done

                                                                    Score    E
Sequences producing significant alignments:                        (bits)  Value

ref|XP_011086328.1| PREDICTED: uncharacterized protein LOC105168...  428   e-117
ref|XP_004230134.1| PREDICTED: la-related protein 1 [Solanum lyc...  376   e-101
ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...  370   3e-99
ref|XP_009796159.1| PREDICTED: la-related protein 1 [Nicotiana s...  367   3e-98
ref|XP_012841899.1| PREDICTED: la-related protein 1 [Erythranthe...  363   3e-97
ref|XP_009616263.1| PREDICTED: la-related protein 1 [Nicotiana t...  362   8e-97
emb|CDP13552.1| unnamed protein product [Coffea canephora]           360   4e-96
gb|KDO45643.1| hypothetical protein CISIN_1g009722mg [Citrus sin...  347   2e-92
ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family prot...  343   4e-91
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...  342   1e-90
ref|XP_010274926.1| PREDICTED: pro-resilin [Nelumbo nucifera]        338   1e-89
gb|KHG06267.1| FYVE, RhoGEF and PH domain-containing 2 [Gossypiu...  335   8e-89
ref|XP_002274822.2| PREDICTED: coilin [Vitis vinifera]               331   2e-87
ref|XP_012463685.1| PREDICTED: uncharacterized protein LOC105783...  331   2e-87
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...  330   4e-87
ref|XP_008225991.1| PREDICTED: la-related protein 1 [Prunus mume]    325   9e-86
ref|XP_012066680.1| PREDICTED: uncharacterized protein LOC105629...  325   1e-85
ref|XP_011621191.1| PREDICTED: uncharacterized protein LOC184282...
325 1e-85 gb|KDP42449.1| hypothetical protein JCGZ_00246 [Jatropha curcas] 325 1e-85 ref|XP_009340130.1| PREDICTED: collagen alpha-1(III) chain-like ... 315 9e-83 >ref|XP_011086328.1| PREDICTED: uncharacterized protein LOC105168091 [Sesamum indicum] Length = 484 Score = 428 bits (1101), Expect = e-117 Identities = 254/492 (51%), Positives = 296/492 (60%), Gaps = 13/492 (2%) Frame = -1 Query: 1669 MRRSLGKFPYPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNN-FQFNRIGNPEPE-NSI 1496 MRRSL K PYPV + FQF E E +S Sbjct: 1 MRRSLTKIPYPVISRPTATTSSISAAFSTSSSGGGGRGRGRASPFQFTVDSPTENEPDSA 60 Query: 1495 DESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVAR--GRGRGYGPINASPHPSSL 1322 +P G G GR K +NND+ A GRGRG+ P S PS Sbjct: 61 KHDDVSPVPHGHGRGRGKLLPSAPVIPSFSSFLNNDSRAPPLGRGRGFVP---SKSPSPP 117 Query: 1321 PKESEHAQQPPALKPNDRKSFSFIKGDT--HDDETSAVPTRPGNPRERSLPSDIIDILSG 1148 P+E + P+ K N + F+K + HD S VP R +E+ LP++I+++LSG Sbjct: 118 PQEESDSSGKPS-KANVKMPLLFVKDEEAQHDSAESEVPVR----KEKELPTEIVNVLSG 172 Query: 1147 AGRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAED-GAGDKSSPREKLSTEEKVKK 971 GRGKP P + +E+ ENRH+R RQ P A D A DKSSPRE+LS EEKVK+ Sbjct: 173 VGRGKPIKPPAVQSEKPKMENRHIRQRQQPNSAEAVASDVPALDKSSPREQLSQEEKVKR 232 Query: 970 AVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG------QDRYQDSDDELG 809 AVGILS D Y+DSDDE G Sbjct: 233 AVGILSRGDQEGERGGAGVRGGRGASAGRGRGRGRGRGRGRIGGRGRGDDMYEDSDDEAG 292 Query: 808 GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPE 629 GLYLGDPA+GEKLAQKLGPE+MNKL E FEE S+ VLP+PVDDAYLDALHTNL+IECEPE Sbjct: 293 GLYLGDPADGEKLAQKLGPESMNKLAEAFEEASNSVLPAPVDDAYLDALHTNLLIECEPE 352 Query: 628 YLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKI 449 YLME FG+NPDIDEKPPI LRDAL+K+KPF MAYEGIQSQ ETMK VPL+++I Sbjct: 353 YLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQEEWEEVMEETMKTVPLIKEI 412 Query: 448 VDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFM 269 VD+YSGPDRVTAKQQQQELERVAKT+P + +SVK F +RAVLSLQSNPGWGFDKKCQFM Sbjct: 413 VDHYSGPDRVTAKQQQQELERVAKTLPASAPSSVKRFTERAVLSLQSNPGWGFDKKCQFM 472 Query: 268 DKLVFEVSQQPK 233 DKLV EVSQQ K 
Sbjct: 473 DKLVMEVSQQYK 484 >ref|XP_004230134.1| PREDICTED: la-related protein 1 [Solanum lycopersicum] Length = 473 Score = 376 bits (965), Expect = e-101 Identities = 225/446 (50%), Positives = 260/446 (58%), Gaps = 9/446 (2%) Frame = -1 Query: 1543 NFQFNRIGNPEPENSIDES--PTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGR 1370 NF F+ G E+S ES P PS G G GR K V+N GR Sbjct: 46 NFGFSP-GKSASEDSKPESSTPATPSGTGHGRGRGKPLPSSPIVPSFHSFVDNPNTPAGR 104 Query: 1369 GRG-YGPINASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNP 1193 GRG GP + P P + QQ P KP F K + D S+ P Sbjct: 105 GRGGIGPFSPPPQP-------QQQQQQPLRKP-----IFFAKEEETTDSNSSSSNAPKPR 152 Query: 1192 RERSLPSDIIDILSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKS 1013 + +LPS +I +L+GAGRGKP +E+ ENRHLRPRQ A+ G S Sbjct: 153 DDSNLPSSVISVLTGAGRGKPLQTASSVSEKPKEENRHLRPRQQKV-----ADSGERASS 207 Query: 1012 SPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQDRY 833 P ++LS E+ VKKAVGILS Sbjct: 208 PPPQRLSREDAVKKAVGILSRSDDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGR 267 Query: 832 QDSDDELG------GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYL 671 D+E G G YLGD A+GEKLA KLGPE+MN L EGFEEMS+RVLPSP+DDAYL Sbjct: 268 GRRDEERGDGNLESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLPSPMDDAYL 327 Query: 670 DALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXX 491 +ALHTN+MIECEPEYLM F SNPDIDE PPI LRDAL+K+KPF MAYEGI+ Q Sbjct: 328 EALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEV 387 Query: 490 XXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQ 311 ETM+ VPLM++IVDYYSGPDRVTAKQQQQELERVAKT+PE+ SVK F +RAVLSLQ Sbjct: 388 IKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQ 447 Query: 310 SNPGWGFDKKCQFMDKLVFEVSQQPK 233 SNPGWGFDKKCQFMDK+V EVSQ K Sbjct: 448 SNPGWGFDKKCQFMDKVVMEVSQHYK 473 >ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum] Length = 480 Score = 370 bits (950), Expect = 3e-99 Identities = 222/450 (49%), Positives = 261/450 (58%), Gaps = 13/450 (2%) Frame = -1 Query: 1543 
NFQFNRIGNPEPENSIDES--PTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGR 1370 NF F+ G E+S ES PT PS G G GR K +V+N GR Sbjct: 46 NFGFSP-GKSASEDSKPESSTPTTPSGTGHGRGRGKPLPSSPIVPSFYSVVDNPNPPAGR 104 Query: 1369 GRG-YGPINASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNP 1193 GRG GP + P P ++ + QQ P KP F K + D S+ P Sbjct: 105 GRGGIGPFSPPPQP----QQQQQQQQQPLRKP-----IFFAKEEETADSNSSSSDAPTPR 155 Query: 1192 RERSLPSDIIDILSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKS 1013 + +L S +I +L+GAGRGKP +E+ ENRHLRPRQ A+ G S Sbjct: 156 DDSNLSSSVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQKV-----ADSGERASS 210 Query: 1012 SPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG---- 845 P ++LS E+ VKKAVGILS Sbjct: 211 PPPQRLSREDAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGR 270 Query: 844 ------QDRYQDSDDELGGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVD 683 +D + G YLGD A+GEKLAQKLGPE MN L EGFEEMS+RVLPSP+D Sbjct: 271 GRGRGRRDEERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMSARVLPSPMD 330 Query: 682 DAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXX 503 DAY++ALHTN+MIECEPEYLM F SNPDIDE PPI LRDAL+K+KPF MAYEGI+ Q Sbjct: 331 DAYIEALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEE 390 Query: 502 XXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAV 323 ETM+ VPLM++IVDYYSGPDRVTAKQQQQELERVAKT+PE+ SVK F +RAV Sbjct: 391 WEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAV 450 Query: 322 LSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233 LSLQSNPGWGFDKKCQFMDK+V E SQ K Sbjct: 451 LSLQSNPGWGFDKKCQFMDKVVMEASQHYK 480 >ref|XP_009796159.1| PREDICTED: la-related protein 1 [Nicotiana sylvestris] Length = 485 Score = 367 bits (942), Expect = 3e-98 Identities = 223/450 (49%), Positives = 265/450 (58%), Gaps = 13/450 (2%) Frame = -1 Query: 1543 NFQFNRIGNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVN----NDAVAR 1376 NF F+ G PE + P+ G G GR K V+ N Sbjct: 47 NFGFSP-GKPESKPESSPPTATPTGIGHGRGRGKPFPSSPILPSFSSFVDKPNPNPNPPA 105 Query: 1375 GRGRGYGPINASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGN 1196 GRGRG GP +P P+ +H QQP LK K F K + D + P Sbjct: 
106 GRGRG-GPGQFTPPQ---PQPQQHQQQPSPLK----KPIFFAKEEETSDSNPSSSDAPKQ 157 Query: 1195 PRERSLPSDIIDILSGAGRGKPKTQPVLP-TERSMAENRHLRPRQHPKPRVVNAEDGAGD 1019 + +L S + +LSGAGRGKP P +E+ ENRHLR RQ + +A+ G + Sbjct: 158 REDSNLASSLTSLLSGAGRGKPLQTASSPVSEKPKEENRHLRVRQQQQR--ADADSGKRE 215 Query: 1018 KSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQD 839 S P ++LS E+ VKKAVGILS G+ Sbjct: 216 SSPPPQRLSREDAVKKAVGILSRHSDGDGGGDGGGGRGVGGFGGRGGRGAMRGRGGRGRG 275 Query: 838 RY-------QDSDDEL-GGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVD 683 R ++ DD L G YLGD A+GEKLA KLGPETMN L E FEEMS+RVLPSP+D Sbjct: 276 RGRGYGRRDENEDDSLESGFYLGDNADGEKLANKLGPETMNTLAEAFEEMSARVLPSPMD 335 Query: 682 DAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXX 503 DAY++ALHTN+MIECEPEYLM F SNPDIDEKPPISLRDAL+K+KPF MAYEGI+ Q Sbjct: 336 DAYVEALHTNMMIECEPEYLMGDFESNPDIDEKPPISLRDALEKMKPFLMAYEGIKDQEE 395 Query: 502 XXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAV 323 ETM+ VPLM++IVDYYSGPDRVTAKQQQQELERVAKT+P++ SVK F +RAV Sbjct: 396 WEKVIEETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPQSAPNSVKRFTERAV 455 Query: 322 LSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233 LSLQSNPGWGFDKKCQFMDK V+EVSQ K Sbjct: 456 LSLQSNPGWGFDKKCQFMDKAVWEVSQHYK 485 >ref|XP_012841899.1| PREDICTED: la-related protein 1 [Erythranthe guttatus] gi|604328137|gb|EYU33805.1| hypothetical protein MIMGU_mgv1a005203mg [Erythranthe guttata] Length = 493 Score = 363 bits (933), Expect = 3e-97 Identities = 219/457 (47%), Positives = 269/457 (58%), Gaps = 21/457 (4%) Frame = -1 Query: 1540 FQFNRIGNPEPE---NSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVAR-G 1373 FQF +P+ + NS E T P G G GR +N G Sbjct: 43 FQFTVDASPDDQTDKNSKTEVETPPPSYGHGRGRGTPLPSSPVLPSFSSFLNESKPPPVG 102 Query: 1372 RGRGYGPINASPHPSSLP-KESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGN 1196 RGRG I ASP P P + SE + P KPN + F F+K + +++ A + + Sbjct: 103 RGRGVA-IPASPTPPPPPPRVSESPSEKPPPKPNVKLPFLFVKDE--EEQADAAESEVPS 159 Query: 1195 PRERSLPSDIIDILSGAGRGKPKTQPVLPT-ERSMAENRHLRPRQ-HPKPRVVNAEDGAG 1022 +E L SDI+ +LSGAGRGKP P E+ +ENRH+R R KP V + 
DGA Sbjct: 160 AQETLLRSDIVSVLSGAGRGKPGKPPTAAQPEKPQSENRHIRQRPPQGKPPVAVSSDGA- 218 Query: 1021 DKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQ 842 + P +LS EE VKKA ILS G+ Sbjct: 219 --APPAVQLSKEEMVKKAKEILSKGDEDGGVSRPEVRDNRDNRDNRGGGRGGRGERGRGR 276 Query: 841 --------------DRYQDSDDELGGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSR 704 DRY++SDDE L++GDPA+ EK+AQKLGP+ M +L EG +EMSSR Sbjct: 277 GRGRGRGRGRGRGDDRYEESDDESDALFIGDPADEEKVAQKLGPDVMAQLAEGIDEMSSR 336 Query: 703 VLPSPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYE 524 VLPSP DDAY+DA TNL IECEPEYLME FG+NPDIDEKPPI LRDAL+K+KPF M YE Sbjct: 337 VLPSPFDDAYMDAFETNLRIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMVYE 396 Query: 523 GIQSQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVK 344 GI+ Q ETMK+VPL+++IVD+YSGPDRVTAKQQ +ELERVAKT+P + ASVK Sbjct: 397 GIKDQEEWEKIIEETMKDVPLIKEIVDHYSGPDRVTAKQQNEELERVAKTLPASAPASVK 456 Query: 343 NFVDRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233 F +RA+LSLQSNPGWGFDKKCQFMDK++ EVSQ K Sbjct: 457 RFTERALLSLQSNPGWGFDKKCQFMDKVIMEVSQNYK 493 >ref|XP_009616263.1| PREDICTED: la-related protein 1 [Nicotiana tomentosiformis] Length = 488 Score = 362 bits (929), Expect = 8e-97 Identities = 219/449 (48%), Positives = 263/449 (58%), Gaps = 12/449 (2%) Frame = -1 Query: 1543 NFQFNRIGNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVN----NDAVAR 1376 NF F+ G PE + P G G GR K V+ N + Sbjct: 49 NFGFSP-GKPESKPESFPPTATPDGIGHGGGRGKPFPSSPILPSFSSFVDKPNPNPSPPA 107 Query: 1375 GRGRGYGPINASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGN 1196 GRGRG GP +P P+ + QQP LK K F K + D + P Sbjct: 108 GRGRG-GPGQFTPPQ---PQPQQQHQQPSPLK----KPIFFAKEEETSDSNPSSSDAPKQ 159 Query: 1195 PRERSLPSDIIDILSGAGRGKPKTQPVLP-TERSMAENRHLRPRQHPKPRVVNAEDGAGD 1019 + +L S + +LSGAGRGKP P +E+ ENRHLR RQ + + +A+ G Sbjct: 160 REDSNLASSLTSLLSGAGRGKPLQTASSPVSEKPKEENRHLRVRQQQQQQRADADSGKRA 219 Query: 1018 KSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQD 839 S P ++LS E+ VKKAVGILS Sbjct: 220 SSPPPQRLSREDAVKKAVGILSRHDDGDGDGDGGGRGVGGFRGRGGRGAMRGRGGRGRGR 279 Query: 838 
------RYQDSDDEL-GGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDD 680 R ++ +D L G YLGD A+GEKLA KLGPETMN L E FEEMS+RVLPSP+DD Sbjct: 280 GRGYGRREENENDSLESGFYLGDNADGEKLANKLGPETMNTLAEAFEEMSARVLPSPMDD 339 Query: 679 AYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXX 500 AY++ALHTN+MIECEPEYL+ F SNPDIDEKPPISLRDAL+K+KPF MAYEGI+ Q Sbjct: 340 AYVEALHTNMMIECEPEYLVGDFESNPDIDEKPPISLRDALEKMKPFLMAYEGIKDQEEW 399 Query: 499 XXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVL 320 ETM+ VPLM++IVDYYSGPDRVTAKQQQQELERVAKT+P++ SVK F +RAVL Sbjct: 400 EKVIEETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPQSAPNSVKRFTERAVL 459 Query: 319 SLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233 SLQSNPGWGFDKKCQFMDK+V+EVSQ K Sbjct: 460 SLQSNPGWGFDKKCQFMDKVVWEVSQHYK 488 >emb|CDP13552.1| unnamed protein product [Coffea canephora] Length = 499 Score = 360 bits (923), Expect = 4e-96 Identities = 206/428 (48%), Positives = 251/428 (58%), Gaps = 6/428 (1%) Frame = -1 Query: 1498 IDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPINASPHPSSLP 1319 +D++ +PG+G GR RG G G G + +P P+ P Sbjct: 99 VDKTTVPVPVPGRGQGRG----------------------RGIGAGLGAGHVTP-PTPAP 135 Query: 1318 KESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNPRERSLPSDIIDILSGAGR 1139 + + P D D+H + PT P NP + LPS I+ ILSGAGR Sbjct: 136 AQPSGPSRKPIFSAKDG---GVAPHDSHFPPPTQSPTVPRNPDDTHLPSSILTILSGAGR 192 Query: 1138 GKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPREKLSTEEKVKKAVGI 959 GK P ++ + ENRH+R RQ P P + ++ ++LS EE KKAVGI Sbjct: 193 GKAPRSPSPVPDKPIEENRHIRARQQP-PGATREDSSTNSAATSAQRLSPEEAAKKAVGI 251 Query: 958 LSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG----QDRYQDSDDE--LGGLYL 797 LS Y+D+DD+ GLYL Sbjct: 252 LSGGRGDTGRDEGARGGRGGGGGGGPRGQGDRGARFEDAGFEDTGYEDTDDDDSAAGLYL 311 Query: 796 GDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPEYLME 617 GD A+G+KL Q+LGP+ ++L EGFEEMSSRVLPSP DDAYLDALHTNL+IECEPEY+M Sbjct: 312 GDDADGDKLTQRLGPDIEDQLSEGFEEMSSRVLPSPEDDAYLDALHTNLLIECEPEYVMG 371 Query: 616 VFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKIVDYY 437 F NPDIDEKPPI LRDAL+K+KPF MAYEGIQSQ ETMK 
VPL+++IVDYY Sbjct: 372 NFDINPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQQEWEEAVEETMKKVPLLKEIVDYY 431 Query: 436 SGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFMDKLV 257 SGPDRVTAKQQQ+E+ERVAK +PE+ ASVK F +RAVLSLQSNPGWGFDKKCQFMDKLV Sbjct: 432 SGPDRVTAKQQQEEIERVAKALPESVPASVKRFTNRAVLSLQSNPGWGFDKKCQFMDKLV 491 Query: 256 FEVSQQPK 233 E+SQ K Sbjct: 492 SEISQHYK 499 >gb|KDO45643.1| hypothetical protein CISIN_1g009722mg [Citrus sinensis] Length = 527 Score = 347 bits (891), Expect = 2e-92 Identities = 215/446 (48%), Positives = 254/446 (56%), Gaps = 16/446 (3%) Frame = -1 Query: 1522 GNPEPENSIDESPTNPSLP--GQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPI 1349 G P E+ D SP P P G GHGR + AV G GRG + Sbjct: 106 GQPASESKPD-SPPQPQAPPSGSGHGRGQPSAAPSPSISSFSSFLT-AVKSGAGRGR--V 161 Query: 1348 NASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNPRERSLPSD 1169 + + P+ P+ +P PN E++ T+P P +LPS Sbjct: 162 SFASDPNESPRPDAQPAKPRTCTPN---------------ESATDSTQPSEP---NLPSS 203 Query: 1168 IIDILSGAGRGK----------PKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGD 1019 II L GAGRGK + Q P ENRH+R R P+PR A A + Sbjct: 204 IISTLPGAGRGKTAVTQQQQQQQQHQRQQPGPPPQEENRHIRARLQPQPRPEKAP--AAE 261 Query: 1018 KSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQD 839 S + KLS E+ VK A+ ILS G+ Sbjct: 262 TGSAQPKLSKEDAVKMAMKILSRGEEGEGEGISAGGPGRGRGMGRGRGRGRGRGQGRGRM 321 Query: 838 RYQ----DSDDELGGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYL 671 R Q D D GGLYLGD A+GEKLA+K+G E MN LVEGFEEMS RVLPSP++DAY+ Sbjct: 322 RRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYI 381 Query: 670 DALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXX 491 DALHTN MIE EPEYLME FG+NPDIDEKPPI LRDAL+K+KPF MAYEGIQSQ Sbjct: 382 DALHTNCMIEFEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQEEWEEA 441 Query: 490 XXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQ 311 E M+ VPL+++IVD+YSGPDRVTAKQQ +ELERVAKTIPE+ AS+K F +RAVLSLQ Sbjct: 442 VNEVMERVPLLKEIVDHYSGPDRVTAKQQGEELERVAKTIPESAPASIKRFANRAVLSLQ 501 Query: 310 SNPGWGFDKKCQFMDKLVFEVSQQPK 233 SNPGWGFDKKCQFMDKL +EVSQ 
K Sbjct: 502 SNPGWGFDKKCQFMDKLAWEVSQHYK 527 >ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] gi|508784903|gb|EOY32159.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1 [Theobroma cacao] Length = 474 Score = 343 bits (880), Expect = 4e-91 Identities = 207/410 (50%), Positives = 243/410 (59%), Gaps = 26/410 (6%) Frame = -1 Query: 1384 VARGRGRGYGPINASPHP-----------SSLPKESEHAQQPPALKPNDRKSFSFIKGDT 1238 V GRGRG GP+++ P P S + + + PP P K FIK Sbjct: 82 VGHGRGRG-GPLSSDPIPHPFSSFVSQTGSGRGRVTSESVPPPPPPPAQAKQPIFIKKKD 140 Query: 1237 HDDETSAVPT--RPGNPRERSLPSDI--IDILSGAGRGKPKTQPVLPTERSMAENRHLRP 1070 D+ S+ P E P +I + +LSGAGRGKP QP P R ENRH+R Sbjct: 141 EDETESSAKAAAEPIQSSEPIFPPNILPVSVLSGAGRGKPVKQPE-PASRRQEENRHIRV 199 Query: 1069 RQHPKPRVVNAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXX 890 Q + SP ++S EE KKA+GILS Sbjct: 200 AQ---------------QQSPSAQMSQEEATKKAMGILSRRSESGESGMVGRGGRASMGM 244 Query: 889 XXXXXXXXXXXXXXGQDR----------YQDSDD-ELGGLYLGDPAEGEKLAQKLGPETM 743 G+ R +DS + GLYLGD A+GEK AQ +G + M Sbjct: 245 GGGRGRGRGRGRGMGRGRGRRQGEDTRIVKDSGEGSADGLYLGDNADGEKFAQTIGADNM 304 Query: 742 NKLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRD 563 NKLVEGFEEM SRVLPSP+DDAYLDALHTN IE EPEYLME FG+NPDIDEKPP+ LRD Sbjct: 305 NKLVEGFEEMGSRVLPSPMDDAYLDALHTNCSIEFEPEYLMEEFGTNPDIDEKPPMPLRD 364 Query: 562 ALDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERV 383 AL+K+KPF MAYEGIQSQ ETM+ VPL+++IVDYYSGPDRVTAK+QQ+ELERV Sbjct: 365 ALEKMKPFLMAYEGIQSQEEWEEVIKETMERVPLLQEIVDYYSGPDRVTAKKQQEELERV 424 Query: 382 AKTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233 AKTIPE +SVK F +RAVLSLQSNPGWGFDKKCQFMDKLV+EVSQQ K Sbjct: 425 AKTIPERAPSSVKQFANRAVLSLQSNPGWGFDKKCQFMDKLVWEVSQQYK 474 >ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis] Length = 407 Score = 342 bits (876), Expect = 1e-90 Identities = 199/394 (50%), Positives = 238/394 (60%), Gaps = 14/394 (3%) Frame = -1 Query: 1372 
RGRGYGPINASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNP 1193 +G G G ++ + P+ P+ +P PN E++ T+P P Sbjct: 34 QGAGRGRVSFASDPNESPRPDAQPAKPRTCTPN---------------ESATDSTQPSEP 78 Query: 1192 RERSLPSDIIDILSGAGRGK----------PKTQPVLPTERSMAENRHLRPRQHPKPRVV 1043 +LPS II L GAGRGK + Q P ENRH+R R P+PR Sbjct: 79 ---NLPSSIISTLPGAGRGKTAVTQQQQQQQQHQRQQPGPPPQEENRHIRARLQPQPRPE 135 Query: 1042 NAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 863 A A + S + KLS E+ VK A+ +LS Sbjct: 136 KAP--AAETGSAQPKLSKEDAVKMAMKVLSRGEEGEGEGISAGGPGRGRGMGRGRGRGRG 193 Query: 862 XXXXXGQDRYQ----DSDDELGGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLP 695 G+ R Q D D GGLYLGD A+GEKLA+K+G E MN LVEGFEEMS RVLP Sbjct: 194 RGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVEGFEEMSGRVLP 253 Query: 694 SPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQ 515 SP++DAY+DALHTN MIE EPEYLME FG+NPDIDEKPPI LRDAL+K+KPF MAYEGIQ Sbjct: 254 SPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQ 313 Query: 514 SQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFV 335 SQ E M+ VPL+++IVD+YSGPDRVTAKQQ +ELERVAKTIPE+ AS+K F Sbjct: 314 SQEEWEEAVNEVMERVPLLKEIVDHYSGPDRVTAKQQGEELERVAKTIPESAPASIKRFA 373 Query: 334 DRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233 +RAVLSLQSNPGWGFDKKCQFMDKL +EVSQQ K Sbjct: 374 NRAVLSLQSNPGWGFDKKCQFMDKLAWEVSQQYK 407 >ref|XP_010274926.1| PREDICTED: pro-resilin [Nelumbo nucifera] Length = 482 Score = 338 bits (868), Expect = 1e-89 Identities = 212/438 (48%), Positives = 256/438 (58%), Gaps = 8/438 (1%) Frame = -1 Query: 1522 GNPEPENSIDESPTNPSLP-GQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPIN 1346 G P+ + D + LP G GHGR K V+ + GRG Sbjct: 63 GKPDTTGADDAEADDSFLPSGLGHGRGKPIPSTPILPSFSSWVSGMRPSAGRGGRSTQQQ 122 Query: 1345 ASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVP-TRPG-NPRERSLPS 1172 + HPS +P +P +K F + D T P + PG +P LPS Sbjct: 123 SDSHPS----------EPQDFQP--KKPIFFSREDPQGPLTQNPPISEPGRSPGGIVLPS 170 Query: 1171 DIIDILSGAGRGKPKTQPVLPTERSMAE-NRHLRPRQHPKPRVVNAEDGAGDKSSPRE-K 998 + L GAGRGKP + P+E S++E NRHLRPR+ G D++SP + Sbjct: 
171 SLSSGLPGAGRGKPPKPSLGPSETSVSEENRHLRPRRE------GVAVGLQDRTSPSPPR 224 Query: 997 LSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQD--RYQDS 824 LS E+ VKKAVGIL G+ R++D Sbjct: 225 LSREDAVKKAVGILRRGGDGMEEGGRGRGTRGRGGRGRGGRGVQGWRGRGGRSGGRFRDL 284 Query: 823 DDELG-GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTNLM 647 +D G GLYLGD A+GE+LA +LG E M+KLVE FEEMS VLPSP+DDAYLDA+HTN + Sbjct: 285 EDNYGTGLYLGDNADGERLANRLGTENMDKLVEAFEEMSYSVLPSPMDDAYLDAVHTNNL 344 Query: 646 IECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMKNV 467 IE EPEYLM F +NPDIDEKPPI LRDAL+KVKPF MAYEGIQSQ ETM+ + Sbjct: 345 IEYEPEYLMGDFETNPDIDEKPPIPLRDALEKVKPFLMAYEGIQSQEEWEEIMKETMEKL 404 Query: 466 PLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWGFD 287 P M++++D YSGPDRVT KQQQQELERVAKT+PEN +SVK F DRAVLSLQSNPGWGFD Sbjct: 405 PYMKELIDIYSGPDRVTGKQQQQELERVAKTLPENVPSSVKCFTDRAVLSLQSNPGWGFD 464 Query: 286 KKCQFMDKLVFEVSQQPK 233 KKCQFMDKLV+EVSQ K Sbjct: 465 KKCQFMDKLVWEVSQHYK 482 >gb|KHG06267.1| FYVE, RhoGEF and PH domain-containing 2 [Gossypium arboreum] Length = 486 Score = 335 bits (860), Expect = 8e-89 Identities = 203/440 (46%), Positives = 262/440 (59%), Gaps = 9/440 (2%) Frame = -1 Query: 1525 IGNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPIN 1346 +G P+ E++ +S + ++ G GHGR + GRGR N Sbjct: 66 LGKPDSEDTKRDSAESQAV-GSGHGRGRGIPLSSEPIIPSFSSFVSQNGSGRGR---VTN 121 Query: 1345 ASPHPSSLPKESEHAQQPPALKPNDRKSFSFI--KGDTHDDETSAVPTRPGNPRERSLP- 1175 S P+ P PP L P + K F+ + + D ++ +P ER+ Sbjct: 122 ESVRPTPPPPP------PPPLPPREAKQPIFVMKQDEIETDLSAKLPAESVQSSERTFSP 175 Query: 1174 -SDIIDILSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPREK 998 + + LSGAGRGKP QP P ++ ENRH+R +Q + + + P + Sbjct: 176 RTPSVASLSGAGRGKPVKQPG-PVLQTKEENRHIRLKQQQQQQQ--------QQQPPSPR 226 Query: 997 LSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQ----DRYQ 830 LS EE VKKA+GILS + + Sbjct: 227 LSKEEAVKKAMGILSRKSESDEREDMGRSGGRGRGRGRGRGARMGRGRGRREREDTGEEE 286 Query: 829 DSDDELGG-LYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTN 653 ++D EL 
LYLG+ A+GE+LA+ +G ++MNKLVEGFEE+SSRVLPSP+DDAYL+ALHTN Sbjct: 287 EADKELRDELYLGNNADGERLAETIGADSMNKLVEGFEEISSRVLPSPMDDAYLEALHTN 346 Query: 652 LMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMK 473 MIE EPEYLME FG+NPDIDEKPP+SLRDAL+KVKPF M+YEGI++Q ETM Sbjct: 347 FMIEFEPEYLMEEFGTNPDIDEKPPMSLRDALEKVKPFLMSYEGIENQEEWEEAIKETMD 406 Query: 472 NVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWG 293 VPL+++I+DYYSGPDRVTAK+QQ+ELERVAKTIP++ ASVK F +RAVL+LQSNPGWG Sbjct: 407 KVPLLQEIIDYYSGPDRVTAKKQQEELERVAKTIPKSVPASVKQFANRAVLTLQSNPGWG 466 Query: 292 FDKKCQFMDKLVFEVSQQPK 233 FDKKCQFMDKLV EVSQQ K Sbjct: 467 FDKKCQFMDKLVCEVSQQYK 486 >ref|XP_002274822.2| PREDICTED: coilin [Vitis vinifera] Length = 482 Score = 331 bits (849), Expect = 2e-87 Identities = 214/440 (48%), Positives = 253/440 (57%), Gaps = 20/440 (4%) Frame = -1 Query: 1492 ESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPINASPHPSSLPKE 1313 ES +P G GHGR K + + G GRG G + A P S+P Sbjct: 64 ESSESPFPLGLGHGRGKPPSQPSAPTLPSF---SSFASTGIGRGRGRLTAHP-TDSVP-- 117 Query: 1312 SEHAQQPPALKPNDRKSFSFIKGDTHDDET---SAVPTRPGNPRERSLPSDIIDILSG-A 1145 QQ P P +K F K D D S + T P P E +LP I+ LSG A Sbjct: 118 ----QQSPDFAP--KKPIFFSKEDAADSAPKPQSQLGTTP--PEENNLPVSILSALSGGA 169 Query: 1144 GRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPREKLSTEEKVKKAV 965 GRG+P Q P + ENRHLR + P R + AG P+ +LS EE VKKAV Sbjct: 170 GRGQPLKQTPAPPKE---ENRHLRQPRQPVFRSPQ-QPVAGP---PQPRLSREEAVKKAV 222 Query: 964 GILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQDRYQ--------------- 830 GILS G+ R + Sbjct: 223 GILSRGGDGGGDGDGDEGGRGRGFRGRGRGRGRGAQGWMGRGRGRGRGRGRMGDRRGRGG 282 Query: 829 DSDDELG-GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTN 653 D+ D+ G GLYLGD A+ EKL+ K+G E M+KL E FEEMS RVLPSP++DAYLDALHTN Sbjct: 283 DAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTN 342 Query: 652 LMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMK 473 +IE EPEYLME FG+NPDIDE PPI LRDAL+K+KPF M YEGIQSQ ETM+ Sbjct: 343 CLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETME 402 
Query: 472 NVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWG 293 NVP ++++VDYYSGPDRVTAK+QQ+ELERVAKT+PE SVK F DRA+LSLQSNPGWG Sbjct: 403 NVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWG 462 Query: 292 FDKKCQFMDKLVFEVSQQPK 233 FDKKCQFMDKLV+EVSQ K Sbjct: 463 FDKKCQFMDKLVWEVSQHYK 482 >ref|XP_012463685.1| PREDICTED: uncharacterized protein LOC105783052 isoform X1 [Gossypium raimondii] gi|763816483|gb|KJB83335.1| hypothetical protein B456_013G241700 [Gossypium raimondii] Length = 484 Score = 331 bits (848), Expect = 2e-87 Identities = 198/440 (45%), Positives = 255/440 (57%), Gaps = 9/440 (2%) Frame = -1 Query: 1525 IGNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPIN 1346 +G P+ E++ +S + ++ G GHGR + GRGR Sbjct: 66 LGKPDSEDTKRDSAESQAV-GLGHGRGRGIPFSSEPIIPSFSSFVSQNGSGRGR------ 118 Query: 1345 ASPHPSSLPKESEHAQQPPALKPNDRKSFSFI--KGDTHDDETSAVPTRPGNPRERSLPS 1172 + ES PP P + K F+ + + D ++ +P ER+ Sbjct: 119 -------VTNESVRQTPPPPPPPREAKQPIFVMKQDEIETDSSAKLPAESVQSSERTFSP 171 Query: 1171 DIIDI--LSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPREK 998 + LSGAGRGKP QP P ++ ENRH+R +Q + + + P + Sbjct: 172 STPSVASLSGAGRGKPVKQPE-PVLQTKEENRHIRLKQKQQ------QQQQQQQQPPSPR 224 Query: 997 LSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQDRYQDSDD 818 LS EE VKKA+ ILS + ++ Sbjct: 225 LSKEEAVKKAMCILSRKSGSDEREDMGRSGGRGRGRGRGRGAQMGRGRGRREGEDTREEE 284 Query: 817 EL-----GGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTN 653 E LYLG+ A+GE+LA+ +G ++MNKLVEGFEEMSSRVLPSP+DDAYL+ALHTN Sbjct: 285 EAVKELRDELYLGNNADGERLAETIGADSMNKLVEGFEEMSSRVLPSPMDDAYLEALHTN 344 Query: 652 LMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMK 473 MIE EPEYLME FG+NPDIDEKPP+SLRDAL+KVKPF M+YEGI++Q ETM Sbjct: 345 FMIEFEPEYLMEEFGTNPDIDEKPPMSLRDALEKVKPFLMSYEGIENQEEWEEAIKETMD 404 Query: 472 NVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWG 293 VPL+++I+DYYSGPDRVTAK+QQ+ELERVAKTIP++ ASVK F +RAVL+LQSNPGWG Sbjct: 405 KVPLLQEIIDYYSGPDRVTAKKQQEELERVAKTIPKSAPASVKQFANRAVLTLQSNPGWG 464 Query: 292 
FDKKCQFMDKLVFEVSQQPK 233 FDKKCQFMDKLV+EVSQQ K Sbjct: 465 FDKKCQFMDKLVWEVSQQYK 484 >ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis] gi|223537066|gb|EEF38701.1| conserved hypothetical protein [Ricinus communis] Length = 436 Score = 330 bits (846), Expect = 4e-87 Identities = 204/406 (50%), Positives = 234/406 (57%), Gaps = 28/406 (6%) Frame = -1 Query: 1375 GRGRGYGPI------NASPHPSSLPKESEHAQQPPALKPNDRKSF--------------- 1259 GRGRG P A P S H PP P R Sbjct: 42 GRGRGSNPNLFDFTGKAPAKPESSDVAKPHYPPPPPPPPPPRNGVGHGHGGGNPILPAFS 101 Query: 1258 SFI----KGDTHDDETSAVPTRPGNPRERS-LPSDIIDILSGAGRGKPKTQPVLPTERSM 1094 SF+ +G D +P + S LPS I LSG GRG+P +PV+PT + Sbjct: 102 SFVSSIGRGRAITDPEPGPSRQPTESQSDSVLPSTIHSSLSGFGRGEPD-KPVVPTPQVK 160 Query: 1093 AENRHLRPRQHPKPRVVNAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXX 914 ENRH+R R KP+ AE A + K+S EE VK+AV ILS Sbjct: 161 EENRHIRDRSRAKPKTEEAEVRA------KPKISREEAVKRAVSILSQGDTGEGMGRGRG 214 Query: 913 XXXXXXXXXXXXXXXXXXXXXXGQDRYQDSDDEL--GGLYLGDPAEGEKLAQKLGPETMN 740 + R D DE GL+LGD A+GEKLA K+G E MN Sbjct: 215 GGRGRGRGRGRGRLEQ-------RGRMMDDVDEGFGSGLFLGDNADGEKLAGKIGVENMN 267 Query: 739 KLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDA 560 KLVEG+EEMS RVLPSP++DAYLDALHTN MIE EPEYLM F NPDIDEKPP+ LRD Sbjct: 268 KLVEGYEEMSGRVLPSPMEDAYLDALHTNYMIEFEPEYLMGEFDQNPDIDEKPPMPLRDV 327 Query: 559 LDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVA 380 L+KVKPF MAYEGIQSQ ETMKNVPL ++IVDYYSGPDR+TAK+Q++ELERVA Sbjct: 328 LEKVKPFIMAYEGIQSQEEWEAAVEETMKNVPLFKEIVDYYSGPDRITAKKQEEELERVA 387 Query: 379 KTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQ 242 TIP + ASVK F DRAVLSLQSNPGWGFDKKCQFMDKLV EV+Q Sbjct: 388 NTIPASAPASVKRFADRAVLSLQSNPGWGFDKKCQFMDKLVREVNQ 433 >ref|XP_008225991.1| PREDICTED: la-related protein 1 [Prunus mume] Length = 460 Score = 325 bits (834), Expect = 9e-86 Identities = 207/441 (46%), Positives = 247/441 (56%), Gaps = 14/441 (3%) Frame = -1 Query: 1522 GNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPINA 1343 G P+ ++ + P PS PG 
GHGR K A+ G G G N Sbjct: 65 GQPDSDDPKPDPP--PSAPGLGHGRGKPLPTFSSFV--------SAIKPNSGTGRGQPN- 113 Query: 1342 SPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNPRERSLPSDII 1163 S+P ES + P A K F++GD D PT PG+ Sbjct: 114 --QVQSIP-ESRDSVAPDAGPSKPIKPIFFVRGDGSD------PTLPGS----------- 153 Query: 1162 DILSGAGRGKPK--TQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPR-EKLS 992 GRGKP T+P + + ENRH++ R P P ++ PR KL+ Sbjct: 154 ------GRGKPMNFTRPEVQVKE---ENRHIQARSEPDPDQ--------PRTRPRGPKLT 196 Query: 991 TEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQD--------- 839 EE VK+A+GIL+ + Sbjct: 197 REEAVKQALGILAQDGAEGDDVGGGGGGGRGRGRGRGMRGRGRGRGRGRGNFRMSERGDG 256 Query: 838 -RYQDSDDELG-GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDA 665 R +DSDD GLYLGD A+GEKLA+KLGPE MNKLVE FEEMSS VLPSP+DDAY+DA Sbjct: 257 RRGKDSDDSYASGLYLGDNADGEKLAKKLGPEIMNKLVESFEEMSSEVLPSPLDDAYVDA 316 Query: 664 LHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXX 485 +HTN MIECEPEYLM F NPDIDEKPPISLRDAL+K+KPF MAYE IQSQ Sbjct: 317 MHTNFMIECEPEYLMGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIQSQEEWEEVVN 376 Query: 484 ETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSN 305 ETM+ VPL+++IVD+YSGPDRVTAK+QQ+ELERVAKT+P SVK F DRAVLSLQSN Sbjct: 377 ETMERVPLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSN 436 Query: 304 PGWGFDKKCQFMDKLVFEVSQ 242 PGWGFD+KCQFMDKLV +VSQ Sbjct: 437 PGWGFDRKCQFMDKLVAKVSQ 457 >ref|XP_012066680.1| PREDICTED: uncharacterized protein LOC105629668 [Jatropha curcas] Length = 548 Score = 325 bits (833), Expect = 1e-85 Identities = 202/451 (44%), Positives = 248/451 (54%), Gaps = 18/451 (3%) Frame = -1 Query: 1540 FQFNRIGNPEPENSIDESPTNPSLP-GQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGR 1364 F F P+ +S ES P P G GHGR + + A GRGR Sbjct: 115 FDFTAPSKPDTNDSKTESSDRPQQPAGIGHGRGRPPILPAFSSLISS-LKTSISATGRGR 173 Query: 1363 GYGPINASP--------HPSSLP-----KESEHAQQPPALKPNDRKSFSFIKGDTHDDET 1223 G P P + P +E+ H + P ++P G D+ Sbjct: 174 GNLPSAVESGVGRGKLDKPVAAPTNQEDEENRHIRFRPTVEPG--------VGRGKPDKP 225 Query: 1222 
SAVPTRPGNPRERSL---PSDIIDILSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKP 1052
A T + R + P+ + G GRGK T++ ENRH+ R P+P
Sbjct: 226 VAASTNQEDEENRHIRFRPT----VEPGVGRGKTDKPVAASTQQIKEENRHIGARSTPRP 281

Query: 1051 RVVNAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXX 872
R V + G S + K+S EE ++AV IL
Sbjct: 282 RTVPSRKGL---ESDKPKVSLEEATRRAVSILEQGEDDGGGGIGRGRGSRVRGRGRGRGR 338

Query: 871 XXXXXXXXGQDRYQDSDDELG-GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLP 695
+ R +DS+ E GL+LGD A+GEKLA+++G E MNKL+EGFEEMS RVLP
Sbjct: 339 GRWDQ----RGRMEDSEPEFATGLFLGDNADGEKLAERVGVENMNKLIEGFEEMSERVLP 394

Query: 694 SPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQ 515
SP+ DAYL+ALHTN MIE EPEYLM F NPDIDEKPP+ LRD L+KVKPF MAY+GIQ
Sbjct: 395 SPMQDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEKVKPFIMAYDGIQ 454

Query: 514 SQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFV 335
SQ ETMKNVPL+++IVDYYSGPDRVTAK+QQ+ELERVAKTIP + ASVK F
Sbjct: 455 SQEEWEEVVEETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTIPASAPASVKRFA 514

Query: 334 DRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQ 242
DRAVLSLQSNPGWGFD+KCQFMDKL EV+Q
Sbjct: 515 DRAVLSLQSNPGWGFDRKCQFMDKLAREVNQ 545

>ref|XP_011621191.1| PREDICTED: uncharacterized protein LOC18428267 [Amborella trichopoda]
gi|769819022|ref|XP_011621192.1| PREDICTED: uncharacterized protein LOC18428267 [Amborella trichopoda]
Length = 471

Score = 325 bits (833), Expect = 1e-85
Identities = 193/414 (46%), Positives = 239/414 (57%), Gaps = 2/414 (0%)
Frame = -1

Query: 1468 PGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPINASPHPSSLPKESEHAQQPP 1289
PG GHGR + ++ GRGR P LP + +H+ P
Sbjct: 71 PGIGHGRGQPIQTTPILPSFAPWMSGPVPGTGRGRPSSP---------LPPQLDHS--PN 119

Query: 1288 ALKPNDRKSFSFIKGDTHDDETSAVPTRPGNPRERSLPSDIIDI-LSGAGRGKPKTQPVL 1112
+P RK F + + + V + P E LP I + G GRGKP T P+L
Sbjct: 120 QQEPPSRKPIFFKRDEIEGTDEGRVQAQNLPPTESPLPRSISPAPIEGFGRGKP-TSPLL 178

Query: 1111 PTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXX 932
ENRH+R R P R A G ++S KLS+EE V+ A ILS
Sbjct: 179 SHGIEEEENRHIRRRSPPPERAGQASRG---RASNERKLSSEEAVRNAKDILSRGEGRGG 235

Query: 931 XXXXXXXXXXXXXXXXXXXXXXXXXXXXGQDRYQDS-DDELGGLYLGDPAEGEKLAQKLG 755
RYQD +D+ GLYLGD A+GEKL ++LG
Sbjct: 236 GRGLRGGRGLRGGRGRGGVWAGRGRQGRGA--RYQDRREDDSVGLYLGDDADGEKLVKRLG 293

Query: 754 PETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPI 575
E +N++ E F+EMS RVLPSP+++AYLDALHTN +IE EPEY ME FG+NPDIDEKPPI
Sbjct: 294 EENVNQIFEAFDEMSGRVLPSPMEEAYLDALHTNCLIEFEPEYHMEEFGTNPDIDEKPPI 353

Query: 574 SLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQE 395
L DAL+K+KPF M YEGIQ+Q ETM VP ++++VD YSGPDRVTA+QQQQE
Sbjct: 354 PLCDALEKIKPFIMTYEGIQNQEEWEEVVKETMDKVPYLKELVDIYSGPDRVTARQQQQE 413

Query: 394 LERVAKTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233
LERVA T+PEN +SVKNF +RAVLSLQSNPGWG+DKKCQFMDKLV++VSQ K
Sbjct: 414 LERVASTLPENVPSSVKNFTNRAVLSLQSNPGWGWDKKCQFMDKLVWQVSQDYK 467

>gb|KDP42449.1| hypothetical protein JCGZ_00246 [Jatropha curcas]
Length = 485

Score = 325 bits (833), Expect = 1e-85
Identities = 202/451 (44%), Positives = 248/451 (54%), Gaps = 18/451 (3%)
Frame = -1

Query: 1540 FQFNRIGNPEPENSIDESPTNPSLP-GQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGR 1364
F F P+ +S ES P P G GHGR + + A GRGR
Sbjct: 52 FDFTAPSKPDTNDSKTESSDRPQQPAGIGHGRGRPPILPAFSSLISS-LKTSISATGRGR 110

Query: 1363 GYGPINASP--------HPSSLP-----KESEHAQQPPALKPNDRKSFSFIKGDTHDDET 1223
G P P + P +E+ H + P ++P G D+
Sbjct: 111 GNLPSAVESGVGRGKLDKPVAAPTNQEDEENRHIRFRPTVEPG--------VGRGKPDKP 162

Query: 1222 SAVPTRPGNPRERSL---PSDIIDILSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKP 1052
A T + R + P+ + G GRGK T++ ENRH+ R P+P
Sbjct: 163 VAASTNQEDEENRHIRFRPT----VEPGVGRGKTDKPVAASTQQIKEENRHIGARSTPRP 218

Query: 1051 RVVNAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXX 872
R V + G S + K+S EE ++AV IL
Sbjct: 219 RTVPSRKGL---ESDKPKVSLEEATRRAVSILEQGEDDGGGGIGRGRGSRVRGRGRGRGR 275

Query: 871 XXXXXXXXGQDRYQDSDDELG-GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLP 695
+ R +DS+ E GL+LGD A+GEKLA+++G E MNKL+EGFEEMS RVLP
Sbjct: 276 GRWDQ----RGRMEDSEPEFATGLFLGDNADGEKLAERVGVENMNKLIEGFEEMSERVLP 331

Query: 694 SPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQ 515
SP+ DAYL+ALHTN MIE EPEYLM F NPDIDEKPP+ LRD L+KVKPF MAY+GIQ
Sbjct: 332 SPMQDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEKVKPFIMAYDGIQ 391

Query: 514 SQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFV 335
SQ ETMKNVPL+++IVDYYSGPDRVTAK+QQ+ELERVAKTIP + ASVK F
Sbjct: 392 SQEEWEEVVEETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTIPASAPASVKRFA 451

Query: 334 DRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQ 242
DRAVLSLQSNPGWGFD+KCQFMDKL EV+Q
Sbjct: 452 DRAVLSLQSNPGWGFDRKCQFMDKLAREVNQ 482

>ref|XP_009340130.1| PREDICTED: collagen alpha-1(III) chain-like [Pyrus x bretschneideri]
Length = 521

Score = 315 bits (808), Expect = 9e-83
Identities = 207/478 (43%), Positives = 258/478 (53%), Gaps = 45/478 (9%)
Frame = -1

Query: 1531 NRIGNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGR---- 1364
NR+ N P S+PG GHGR K V + GR
Sbjct: 58 NRVPGQSDSNEPKSEPP-ASVPGIGHGRGKPLASSQPPSSFSSFVTSIRPDSAAGRVQPG 116

Query: 1363 -------GYGPINASPHPSSLPK----ESEHAQQPPAL-----KPNDRKSFSFIKGDTHD 1232
+ P+ + PS E PP KP + S ++
Sbjct: 117 QVQPGPKAHDPVASDAGPSKPAAPIFFRGEDGSDPPLPGGGRGKPMSQPSPCQVQPGPQA 176

Query: 1231 DE---TSAVPTRPGNP----RERSLPSDIIDI-LSGAGRGKPKTQP-------------V 1115
+ + A P++P P RE D +D+ L G GRGKP +QP V
Sbjct: 177 RDPVASDASPSKPATPFFFRRE-----DGLDLPLPGGGRGKPMSQPGPELLVKEVNRHFV 231

Query: 1114 LPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPR-EKLSTEEKVKKAVGILSXXXXX 938
P + ENRH++ R +D A ++++PR KL+ EE V KA+GIL
Sbjct: 232 APKSQIEKENRHIQARPD--------QDPAHNRTAPRGPKLTREEAVAKALGILQRDDAE 283

Query: 937 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQ--DRYQDSDDELG-GLYLGDPAEGEKLA 767
+ D+ +D D+ G GLYLGD A+GEKLA
Sbjct: 284 GGSGGGGDRGGGRGRGMRGRRGGRGRGRGDFRRSDKGKDLDEGKGSGLYLGDNADGEKLA 343

Query: 766 QKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDE 587
+ LGPE MNKLVEGFEEMSS VLPSP+D+A++DA+HTN MIECEPE+LME F NPDIDE
Sbjct: 344 KTLGPENMNKLVEGFEEMSSEVLPSPLDEAFVDAMHTNYMIECEPEFLMEDFSKNPDIDE 403

Query: 586 KPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQ 407
KPPISLRDAL+K+KPF MAYEGIQS E M+ VPL+++IVD+YSGPDRVTAK+
Sbjct: 404 KPPISLRDALEKMKPFLMAYEGIQSHEEWEEAVKEVMERVPLLKEIVDHYSGPDRVTAKK 463

Query: 406 QQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233
QQ+ELERVAKT+P SVK F DRAVLSLQSNPGWGFD+KCQFMDKLV +VS+ K
Sbjct: 464 QQEELERVAKTLPTKVPESVKRFTDRAVLSLQSNPGWGFDRKCQFMDKLVEKVSKHYK 521