BLASTX nr result

ID: Forsythia22_contig00007765 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Forsythia22_contig00007765
         (2566 letters)

Database: ./nr 
           69,698,275 sequences; 24,982,196,650 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_011086328.1| PREDICTED: uncharacterized protein LOC105168...   428   e-117
ref|XP_004230134.1| PREDICTED: la-related protein 1 [Solanum lyc...   376   e-101
ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanu...   370   3e-99
ref|XP_009796159.1| PREDICTED: la-related protein 1 [Nicotiana s...   367   3e-98
ref|XP_012841899.1| PREDICTED: la-related protein 1 [Erythranthe...   363   3e-97
ref|XP_009616263.1| PREDICTED: la-related protein 1 [Nicotiana t...   362   8e-97
emb|CDP13552.1| unnamed protein product [Coffea canephora]            360   4e-96
gb|KDO45643.1| hypothetical protein CISIN_1g009722mg [Citrus sin...   347   2e-92
ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family prot...   343   4e-91
ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1...   342   1e-90
ref|XP_010274926.1| PREDICTED: pro-resilin [Nelumbo nucifera]         338   1e-89
gb|KHG06267.1| FYVE, RhoGEF and PH domain-containing 2 [Gossypiu...   335   8e-89
ref|XP_002274822.2| PREDICTED: coilin [Vitis vinifera]                331   2e-87
ref|XP_012463685.1| PREDICTED: uncharacterized protein LOC105783...   331   2e-87
ref|XP_002523666.1| conserved hypothetical protein [Ricinus comm...   330   4e-87
ref|XP_008225991.1| PREDICTED: la-related protein 1 [Prunus mume]     325   9e-86
ref|XP_012066680.1| PREDICTED: uncharacterized protein LOC105629...   325   1e-85
ref|XP_011621191.1| PREDICTED: uncharacterized protein LOC184282...   325   1e-85
gb|KDP42449.1| hypothetical protein JCGZ_00246 [Jatropha curcas]      325   1e-85
ref|XP_009340130.1| PREDICTED: collagen alpha-1(III) chain-like ...   315   9e-83

>ref|XP_011086328.1| PREDICTED: uncharacterized protein LOC105168091 [Sesamum indicum]
          Length = 484

 Score =  428 bits (1101), Expect = e-117
 Identities = 254/492 (51%), Positives = 296/492 (60%), Gaps = 13/492 (2%)
 Frame = -1

Query: 1669 MRRSLGKFPYPVXXXXXXXXXXXXXXXXXXXXXXXXXXXXSNN-FQFNRIGNPEPE-NSI 1496
            MRRSL K PYPV                              + FQF      E E +S 
Sbjct: 1    MRRSLTKIPYPVISRPTATTSSISAAFSTSSSGGGGRGRGRASPFQFTVDSPTENEPDSA 60

Query: 1495 DESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVAR--GRGRGYGPINASPHPSSL 1322
                 +P   G G GR K              +NND+ A   GRGRG+ P   S  PS  
Sbjct: 61   KHDDVSPVPHGHGRGRGKLLPSAPVIPSFSSFLNNDSRAPPLGRGRGFVP---SKSPSPP 117

Query: 1321 PKESEHAQQPPALKPNDRKSFSFIKGDT--HDDETSAVPTRPGNPRERSLPSDIIDILSG 1148
            P+E   +   P+ K N +    F+K +   HD   S VP R    +E+ LP++I+++LSG
Sbjct: 118  PQEESDSSGKPS-KANVKMPLLFVKDEEAQHDSAESEVPVR----KEKELPTEIVNVLSG 172

Query: 1147 AGRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAED-GAGDKSSPREKLSTEEKVKK 971
             GRGKP   P + +E+   ENRH+R RQ P      A D  A DKSSPRE+LS EEKVK+
Sbjct: 173  VGRGKPIKPPAVQSEKPKMENRHIRQRQQPNSAEAVASDVPALDKSSPREQLSQEEKVKR 232

Query: 970  AVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG------QDRYQDSDDELG 809
            AVGILS                                           D Y+DSDDE G
Sbjct: 233  AVGILSRGDQEGERGGAGVRGGRGASAGRGRGRGRGRGRGRIGGRGRGDDMYEDSDDEAG 292

Query: 808  GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPE 629
            GLYLGDPA+GEKLAQKLGPE+MNKL E FEE S+ VLP+PVDDAYLDALHTNL+IECEPE
Sbjct: 293  GLYLGDPADGEKLAQKLGPESMNKLAEAFEEASNSVLPAPVDDAYLDALHTNLLIECEPE 352

Query: 628  YLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKI 449
            YLME FG+NPDIDEKPPI LRDAL+K+KPF MAYEGIQSQ        ETMK VPL+++I
Sbjct: 353  YLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQEEWEEVMEETMKTVPLIKEI 412

Query: 448  VDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFM 269
            VD+YSGPDRVTAKQQQQELERVAKT+P +  +SVK F +RAVLSLQSNPGWGFDKKCQFM
Sbjct: 413  VDHYSGPDRVTAKQQQQELERVAKTLPASAPSSVKRFTERAVLSLQSNPGWGFDKKCQFM 472

Query: 268  DKLVFEVSQQPK 233
            DKLV EVSQQ K
Sbjct: 473  DKLVMEVSQQYK 484


>ref|XP_004230134.1| PREDICTED: la-related protein 1 [Solanum lycopersicum]
          Length = 473

 Score =  376 bits (965), Expect = e-101
 Identities = 225/446 (50%), Positives = 260/446 (58%), Gaps = 9/446 (2%)
 Frame = -1

Query: 1543 NFQFNRIGNPEPENSIDES--PTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGR 1370
            NF F+  G    E+S  ES  P  PS  G G GR K              V+N     GR
Sbjct: 46   NFGFSP-GKSASEDSKPESSTPATPSGTGHGRGRGKPLPSSPIVPSFHSFVDNPNTPAGR 104

Query: 1369 GRG-YGPINASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNP 1193
            GRG  GP +  P P       +  QQ P  KP       F K +   D  S+    P   
Sbjct: 105  GRGGIGPFSPPPQP-------QQQQQQPLRKP-----IFFAKEEETTDSNSSSSNAPKPR 152

Query: 1192 RERSLPSDIIDILSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKS 1013
             + +LPS +I +L+GAGRGKP       +E+   ENRHLRPRQ        A+ G    S
Sbjct: 153  DDSNLPSSVISVLTGAGRGKPLQTASSVSEKPKEENRHLRPRQQKV-----ADSGERASS 207

Query: 1012 SPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQDRY 833
             P ++LS E+ VKKAVGILS                                        
Sbjct: 208  PPPQRLSREDAVKKAVGILSRSDDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGRGRGR 267

Query: 832  QDSDDELG------GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYL 671
               D+E G      G YLGD A+GEKLA KLGPE+MN L EGFEEMS+RVLPSP+DDAYL
Sbjct: 268  GRRDEERGDGNLESGFYLGDDADGEKLAAKLGPESMNTLAEGFEEMSARVLPSPMDDAYL 327

Query: 670  DALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXX 491
            +ALHTN+MIECEPEYLM  F SNPDIDE PPI LRDAL+K+KPF MAYEGI+ Q      
Sbjct: 328  EALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEEWEEV 387

Query: 490  XXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQ 311
              ETM+ VPLM++IVDYYSGPDRVTAKQQQQELERVAKT+PE+   SVK F +RAVLSLQ
Sbjct: 388  IKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAVLSLQ 447

Query: 310  SNPGWGFDKKCQFMDKLVFEVSQQPK 233
            SNPGWGFDKKCQFMDK+V EVSQ  K
Sbjct: 448  SNPGWGFDKKCQFMDKVVMEVSQHYK 473


>ref|XP_006347816.1| PREDICTED: la-related protein 1-like [Solanum tuberosum]
          Length = 480

 Score =  370 bits (950), Expect = 3e-99
 Identities = 222/450 (49%), Positives = 261/450 (58%), Gaps = 13/450 (2%)
 Frame = -1

Query: 1543 NFQFNRIGNPEPENSIDES--PTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGR 1370
            NF F+  G    E+S  ES  PT PS  G G GR K             +V+N     GR
Sbjct: 46   NFGFSP-GKSASEDSKPESSTPTTPSGTGHGRGRGKPLPSSPIVPSFYSVVDNPNPPAGR 104

Query: 1369 GRG-YGPINASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNP 1193
            GRG  GP +  P P    ++ +  QQ P  KP       F K +   D  S+    P   
Sbjct: 105  GRGGIGPFSPPPQP----QQQQQQQQQPLRKP-----IFFAKEEETADSNSSSSDAPTPR 155

Query: 1192 RERSLPSDIIDILSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKS 1013
             + +L S +I +L+GAGRGKP       +E+   ENRHLRPRQ        A+ G    S
Sbjct: 156  DDSNLSSSVISVLTGAGRGKPLQTASPVSEKPKEENRHLRPRQQKV-----ADSGERASS 210

Query: 1012 SPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG---- 845
             P ++LS E+ VKKAVGILS                                        
Sbjct: 211  PPPQRLSREDAVKKAVGILSRSDDGDGDGDVGGGRGMGGGFRGRGGRGAVRGRGGRGRGR 270

Query: 844  ------QDRYQDSDDELGGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVD 683
                  +D  +       G YLGD A+GEKLAQKLGPE MN L EGFEEMS+RVLPSP+D
Sbjct: 271  GRGRGRRDEERGDGSLESGFYLGDDADGEKLAQKLGPEGMNTLAEGFEEMSARVLPSPMD 330

Query: 682  DAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXX 503
            DAY++ALHTN+MIECEPEYLM  F SNPDIDE PPI LRDAL+K+KPF MAYEGI+ Q  
Sbjct: 331  DAYIEALHTNMMIECEPEYLMGDFESNPDIDETPPIPLRDALEKMKPFLMAYEGIKDQEE 390

Query: 502  XXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAV 323
                  ETM+ VPLM++IVDYYSGPDRVTAKQQQQELERVAKT+PE+   SVK F +RAV
Sbjct: 391  WEEVIKETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPESAPNSVKRFTERAV 450

Query: 322  LSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233
            LSLQSNPGWGFDKKCQFMDK+V E SQ  K
Sbjct: 451  LSLQSNPGWGFDKKCQFMDKVVMEASQHYK 480


>ref|XP_009796159.1| PREDICTED: la-related protein 1 [Nicotiana sylvestris]
          Length = 485

 Score =  367 bits (942), Expect = 3e-98
 Identities = 223/450 (49%), Positives = 265/450 (58%), Gaps = 13/450 (2%)
 Frame = -1

Query: 1543 NFQFNRIGNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVN----NDAVAR 1376
            NF F+  G PE +         P+  G G GR K              V+    N     
Sbjct: 47   NFGFSP-GKPESKPESSPPTATPTGIGHGRGRGKPFPSSPILPSFSSFVDKPNPNPNPPA 105

Query: 1375 GRGRGYGPINASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGN 1196
            GRGRG GP   +P     P+  +H QQP  LK    K   F K +   D   +    P  
Sbjct: 106  GRGRG-GPGQFTPPQ---PQPQQHQQQPSPLK----KPIFFAKEEETSDSNPSSSDAPKQ 157

Query: 1195 PRERSLPSDIIDILSGAGRGKPKTQPVLP-TERSMAENRHLRPRQHPKPRVVNAEDGAGD 1019
              + +L S +  +LSGAGRGKP      P +E+   ENRHLR RQ  +    +A+ G  +
Sbjct: 158  REDSNLASSLTSLLSGAGRGKPLQTASSPVSEKPKEENRHLRVRQQQQR--ADADSGKRE 215

Query: 1018 KSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQD 839
             S P ++LS E+ VKKAVGILS                                   G+ 
Sbjct: 216  SSPPPQRLSREDAVKKAVGILSRHSDGDGGGDGGGGRGVGGFGGRGGRGAMRGRGGRGRG 275

Query: 838  RY-------QDSDDEL-GGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVD 683
            R        ++ DD L  G YLGD A+GEKLA KLGPETMN L E FEEMS+RVLPSP+D
Sbjct: 276  RGRGYGRRDENEDDSLESGFYLGDNADGEKLANKLGPETMNTLAEAFEEMSARVLPSPMD 335

Query: 682  DAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXX 503
            DAY++ALHTN+MIECEPEYLM  F SNPDIDEKPPISLRDAL+K+KPF MAYEGI+ Q  
Sbjct: 336  DAYVEALHTNMMIECEPEYLMGDFESNPDIDEKPPISLRDALEKMKPFLMAYEGIKDQEE 395

Query: 502  XXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAV 323
                  ETM+ VPLM++IVDYYSGPDRVTAKQQQQELERVAKT+P++   SVK F +RAV
Sbjct: 396  WEKVIEETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPQSAPNSVKRFTERAV 455

Query: 322  LSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233
            LSLQSNPGWGFDKKCQFMDK V+EVSQ  K
Sbjct: 456  LSLQSNPGWGFDKKCQFMDKAVWEVSQHYK 485


>ref|XP_012841899.1| PREDICTED: la-related protein 1 [Erythranthe guttatus]
            gi|604328137|gb|EYU33805.1| hypothetical protein
            MIMGU_mgv1a005203mg [Erythranthe guttata]
          Length = 493

 Score =  363 bits (933), Expect = 3e-97
 Identities = 219/457 (47%), Positives = 269/457 (58%), Gaps = 21/457 (4%)
 Frame = -1

Query: 1540 FQFNRIGNPEPE---NSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVAR-G 1373
            FQF    +P+ +   NS  E  T P   G G GR                +N       G
Sbjct: 43   FQFTVDASPDDQTDKNSKTEVETPPPSYGHGRGRGTPLPSSPVLPSFSSFLNESKPPPVG 102

Query: 1372 RGRGYGPINASPHPSSLP-KESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGN 1196
            RGRG   I ASP P   P + SE   + P  KPN +  F F+K +  +++  A  +   +
Sbjct: 103  RGRGVA-IPASPTPPPPPPRVSESPSEKPPPKPNVKLPFLFVKDE--EEQADAAESEVPS 159

Query: 1195 PRERSLPSDIIDILSGAGRGKPKTQPVLPT-ERSMAENRHLRPRQ-HPKPRVVNAEDGAG 1022
             +E  L SDI+ +LSGAGRGKP   P     E+  +ENRH+R R    KP V  + DGA 
Sbjct: 160  AQETLLRSDIVSVLSGAGRGKPGKPPTAAQPEKPQSENRHIRQRPPQGKPPVAVSSDGA- 218

Query: 1021 DKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQ 842
              + P  +LS EE VKKA  ILS                                   G+
Sbjct: 219  --APPAVQLSKEEMVKKAKEILSKGDEDGGVSRPEVRDNRDNRDNRGGGRGGRGERGRGR 276

Query: 841  --------------DRYQDSDDELGGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSR 704
                          DRY++SDDE   L++GDPA+ EK+AQKLGP+ M +L EG +EMSSR
Sbjct: 277  GRGRGRGRGRGRGDDRYEESDDESDALFIGDPADEEKVAQKLGPDVMAQLAEGIDEMSSR 336

Query: 703  VLPSPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYE 524
            VLPSP DDAY+DA  TNL IECEPEYLME FG+NPDIDEKPPI LRDAL+K+KPF M YE
Sbjct: 337  VLPSPFDDAYMDAFETNLRIECEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMVYE 396

Query: 523  GIQSQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVK 344
            GI+ Q        ETMK+VPL+++IVD+YSGPDRVTAKQQ +ELERVAKT+P +  ASVK
Sbjct: 397  GIKDQEEWEKIIEETMKDVPLIKEIVDHYSGPDRVTAKQQNEELERVAKTLPASAPASVK 456

Query: 343  NFVDRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233
             F +RA+LSLQSNPGWGFDKKCQFMDK++ EVSQ  K
Sbjct: 457  RFTERALLSLQSNPGWGFDKKCQFMDKVIMEVSQNYK 493


>ref|XP_009616263.1| PREDICTED: la-related protein 1 [Nicotiana tomentosiformis]
          Length = 488

 Score =  362 bits (929), Expect = 8e-97
 Identities = 219/449 (48%), Positives = 263/449 (58%), Gaps = 12/449 (2%)
 Frame = -1

Query: 1543 NFQFNRIGNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVN----NDAVAR 1376
            NF F+  G PE +         P   G G GR K              V+    N +   
Sbjct: 49   NFGFSP-GKPESKPESFPPTATPDGIGHGGGRGKPFPSSPILPSFSSFVDKPNPNPSPPA 107

Query: 1375 GRGRGYGPINASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGN 1196
            GRGRG GP   +P     P+  +  QQP  LK    K   F K +   D   +    P  
Sbjct: 108  GRGRG-GPGQFTPPQ---PQPQQQHQQPSPLK----KPIFFAKEEETSDSNPSSSDAPKQ 159

Query: 1195 PRERSLPSDIIDILSGAGRGKPKTQPVLP-TERSMAENRHLRPRQHPKPRVVNAEDGAGD 1019
              + +L S +  +LSGAGRGKP      P +E+   ENRHLR RQ  + +  +A+ G   
Sbjct: 160  REDSNLASSLTSLLSGAGRGKPLQTASSPVSEKPKEENRHLRVRQQQQQQRADADSGKRA 219

Query: 1018 KSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQD 839
             S P ++LS E+ VKKAVGILS                                      
Sbjct: 220  SSPPPQRLSREDAVKKAVGILSRHDDGDGDGDGGGRGVGGFRGRGGRGAMRGRGGRGRGR 279

Query: 838  ------RYQDSDDEL-GGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDD 680
                  R ++ +D L  G YLGD A+GEKLA KLGPETMN L E FEEMS+RVLPSP+DD
Sbjct: 280  GRGYGRREENENDSLESGFYLGDNADGEKLANKLGPETMNTLAEAFEEMSARVLPSPMDD 339

Query: 679  AYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXX 500
            AY++ALHTN+MIECEPEYL+  F SNPDIDEKPPISLRDAL+K+KPF MAYEGI+ Q   
Sbjct: 340  AYVEALHTNMMIECEPEYLVGDFESNPDIDEKPPISLRDALEKMKPFLMAYEGIKDQEEW 399

Query: 499  XXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVL 320
                 ETM+ VPLM++IVDYYSGPDRVTAKQQQQELERVAKT+P++   SVK F +RAVL
Sbjct: 400  EKVIEETMETVPLMKEIVDYYSGPDRVTAKQQQQELERVAKTLPQSAPNSVKRFTERAVL 459

Query: 319  SLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233
            SLQSNPGWGFDKKCQFMDK+V+EVSQ  K
Sbjct: 460  SLQSNPGWGFDKKCQFMDKVVWEVSQHYK 488


>emb|CDP13552.1| unnamed protein product [Coffea canephora]
          Length = 499

 Score =  360 bits (923), Expect = 4e-96
 Identities = 206/428 (48%), Positives = 251/428 (58%), Gaps = 6/428 (1%)
 Frame = -1

Query: 1498 IDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPINASPHPSSLP 1319
            +D++     +PG+G GR                       RG G G G  + +P P+  P
Sbjct: 99   VDKTTVPVPVPGRGQGRG----------------------RGIGAGLGAGHVTP-PTPAP 135

Query: 1318 KESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNPRERSLPSDIIDILSGAGR 1139
             +     + P     D         D+H    +  PT P NP +  LPS I+ ILSGAGR
Sbjct: 136  AQPSGPSRKPIFSAKDG---GVAPHDSHFPPPTQSPTVPRNPDDTHLPSSILTILSGAGR 192

Query: 1138 GKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPREKLSTEEKVKKAVGI 959
            GK    P    ++ + ENRH+R RQ P P     +      ++  ++LS EE  KKAVGI
Sbjct: 193  GKAPRSPSPVPDKPIEENRHIRARQQP-PGATREDSSTNSAATSAQRLSPEEAAKKAVGI 251

Query: 958  LSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXG----QDRYQDSDDE--LGGLYL 797
            LS                                           Y+D+DD+    GLYL
Sbjct: 252  LSGGRGDTGRDEGARGGRGGGGGGGPRGQGDRGARFEDAGFEDTGYEDTDDDDSAAGLYL 311

Query: 796  GDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPEYLME 617
            GD A+G+KL Q+LGP+  ++L EGFEEMSSRVLPSP DDAYLDALHTNL+IECEPEY+M 
Sbjct: 312  GDDADGDKLTQRLGPDIEDQLSEGFEEMSSRVLPSPEDDAYLDALHTNLLIECEPEYVMG 371

Query: 616  VFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKIVDYY 437
             F  NPDIDEKPPI LRDAL+K+KPF MAYEGIQSQ        ETMK VPL+++IVDYY
Sbjct: 372  NFDINPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQQEWEEAVEETMKKVPLLKEIVDYY 431

Query: 436  SGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFMDKLV 257
            SGPDRVTAKQQQ+E+ERVAK +PE+  ASVK F +RAVLSLQSNPGWGFDKKCQFMDKLV
Sbjct: 432  SGPDRVTAKQQQEEIERVAKALPESVPASVKRFTNRAVLSLQSNPGWGFDKKCQFMDKLV 491

Query: 256  FEVSQQPK 233
             E+SQ  K
Sbjct: 492  SEISQHYK 499


>gb|KDO45643.1| hypothetical protein CISIN_1g009722mg [Citrus sinensis]
          Length = 527

 Score =  347 bits (891), Expect = 2e-92
 Identities = 215/446 (48%), Positives = 254/446 (56%), Gaps = 16/446 (3%)
 Frame = -1

Query: 1522 GNPEPENSIDESPTNPSLP--GQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPI 1349
            G P  E+  D SP  P  P  G GHGR +                  AV  G GRG   +
Sbjct: 106  GQPASESKPD-SPPQPQAPPSGSGHGRGQPSAAPSPSISSFSSFLT-AVKSGAGRGR--V 161

Query: 1348 NASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNPRERSLPSD 1169
            + +  P+  P+      +P    PN               E++   T+P  P   +LPS 
Sbjct: 162  SFASDPNESPRPDAQPAKPRTCTPN---------------ESATDSTQPSEP---NLPSS 203

Query: 1168 IIDILSGAGRGK----------PKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGD 1019
            II  L GAGRGK           + Q   P      ENRH+R R  P+PR   A   A +
Sbjct: 204  IISTLPGAGRGKTAVTQQQQQQQQHQRQQPGPPPQEENRHIRARLQPQPRPEKAP--AAE 261

Query: 1018 KSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQD 839
              S + KLS E+ VK A+ ILS                                   G+ 
Sbjct: 262  TGSAQPKLSKEDAVKMAMKILSRGEEGEGEGISAGGPGRGRGMGRGRGRGRGRGQGRGRM 321

Query: 838  RYQ----DSDDELGGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYL 671
            R Q    D D   GGLYLGD A+GEKLA+K+G E MN LVEGFEEMS RVLPSP++DAY+
Sbjct: 322  RRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVEGFEEMSGRVLPSPMEDAYI 381

Query: 670  DALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXX 491
            DALHTN MIE EPEYLME FG+NPDIDEKPPI LRDAL+K+KPF MAYEGIQSQ      
Sbjct: 382  DALHTNCMIEFEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQSQEEWEEA 441

Query: 490  XXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQ 311
              E M+ VPL+++IVD+YSGPDRVTAKQQ +ELERVAKTIPE+  AS+K F +RAVLSLQ
Sbjct: 442  VNEVMERVPLLKEIVDHYSGPDRVTAKQQGEELERVAKTIPESAPASIKRFANRAVLSLQ 501

Query: 310  SNPGWGFDKKCQFMDKLVFEVSQQPK 233
            SNPGWGFDKKCQFMDKL +EVSQ  K
Sbjct: 502  SNPGWGFDKKCQFMDKLAWEVSQHYK 527


>ref|XP_007014540.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
            [Theobroma cacao] gi|508784903|gb|EOY32159.1|
            Hydroxyproline-rich glycoprotein family protein, putative
            isoform 1 [Theobroma cacao]
          Length = 474

 Score =  343 bits (880), Expect = 4e-91
 Identities = 207/410 (50%), Positives = 243/410 (59%), Gaps = 26/410 (6%)
 Frame = -1

Query: 1384 VARGRGRGYGPINASPHP-----------SSLPKESEHAQQPPALKPNDRKSFSFIKGDT 1238
            V  GRGRG GP+++ P P           S   + +  +  PP   P   K   FIK   
Sbjct: 82   VGHGRGRG-GPLSSDPIPHPFSSFVSQTGSGRGRVTSESVPPPPPPPAQAKQPIFIKKKD 140

Query: 1237 HDDETSAVPT--RPGNPRERSLPSDI--IDILSGAGRGKPKTQPVLPTERSMAENRHLRP 1070
             D+  S+      P    E   P +I  + +LSGAGRGKP  QP  P  R   ENRH+R 
Sbjct: 141  EDETESSAKAAAEPIQSSEPIFPPNILPVSVLSGAGRGKPVKQPE-PASRRQEENRHIRV 199

Query: 1069 RQHPKPRVVNAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXX 890
             Q               + SP  ++S EE  KKA+GILS                     
Sbjct: 200  AQ---------------QQSPSAQMSQEEATKKAMGILSRRSESGESGMVGRGGRASMGM 244

Query: 889  XXXXXXXXXXXXXXGQDR----------YQDSDD-ELGGLYLGDPAEGEKLAQKLGPETM 743
                          G+ R           +DS +    GLYLGD A+GEK AQ +G + M
Sbjct: 245  GGGRGRGRGRGRGMGRGRGRRQGEDTRIVKDSGEGSADGLYLGDNADGEKFAQTIGADNM 304

Query: 742  NKLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRD 563
            NKLVEGFEEM SRVLPSP+DDAYLDALHTN  IE EPEYLME FG+NPDIDEKPP+ LRD
Sbjct: 305  NKLVEGFEEMGSRVLPSPMDDAYLDALHTNCSIEFEPEYLMEEFGTNPDIDEKPPMPLRD 364

Query: 562  ALDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERV 383
            AL+K+KPF MAYEGIQSQ        ETM+ VPL+++IVDYYSGPDRVTAK+QQ+ELERV
Sbjct: 365  ALEKMKPFLMAYEGIQSQEEWEEVIKETMERVPLLQEIVDYYSGPDRVTAKKQQEELERV 424

Query: 382  AKTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233
            AKTIPE   +SVK F +RAVLSLQSNPGWGFDKKCQFMDKLV+EVSQQ K
Sbjct: 425  AKTIPERAPSSVKQFANRAVLSLQSNPGWGFDKKCQFMDKLVWEVSQQYK 474


>ref|XP_006477961.1| PREDICTED: DDRGK domain-containing protein 1-like [Citrus sinensis]
          Length = 407

 Score =  342 bits (876), Expect = 1e-90
 Identities = 199/394 (50%), Positives = 238/394 (60%), Gaps = 14/394 (3%)
 Frame = -1

Query: 1372 RGRGYGPINASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNP 1193
            +G G G ++ +  P+  P+      +P    PN               E++   T+P  P
Sbjct: 34   QGAGRGRVSFASDPNESPRPDAQPAKPRTCTPN---------------ESATDSTQPSEP 78

Query: 1192 RERSLPSDIIDILSGAGRGK----------PKTQPVLPTERSMAENRHLRPRQHPKPRVV 1043
               +LPS II  L GAGRGK           + Q   P      ENRH+R R  P+PR  
Sbjct: 79   ---NLPSSIISTLPGAGRGKTAVTQQQQQQQQHQRQQPGPPPQEENRHIRARLQPQPRPE 135

Query: 1042 NAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 863
             A   A +  S + KLS E+ VK A+ +LS                              
Sbjct: 136  KAP--AAETGSAQPKLSKEDAVKMAMKVLSRGEEGEGEGISAGGPGRGRGMGRGRGRGRG 193

Query: 862  XXXXXGQDRYQ----DSDDELGGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLP 695
                 G+ R Q    D D   GGLYLGD A+GEKLA+K+G E MN LVEGFEEMS RVLP
Sbjct: 194  RGQGRGRMRRQEMEDDEDGRFGGLYLGDNADGEKLAEKVGAEKMNMLVEGFEEMSGRVLP 253

Query: 694  SPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQ 515
            SP++DAY+DALHTN MIE EPEYLME FG+NPDIDEKPPI LRDAL+K+KPF MAYEGIQ
Sbjct: 254  SPMEDAYIDALHTNCMIEFEPEYLMEEFGTNPDIDEKPPIPLRDALEKMKPFLMAYEGIQ 313

Query: 514  SQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFV 335
            SQ        E M+ VPL+++IVD+YSGPDRVTAKQQ +ELERVAKTIPE+  AS+K F 
Sbjct: 314  SQEEWEEAVNEVMERVPLLKEIVDHYSGPDRVTAKQQGEELERVAKTIPESAPASIKRFA 373

Query: 334  DRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233
            +RAVLSLQSNPGWGFDKKCQFMDKL +EVSQQ K
Sbjct: 374  NRAVLSLQSNPGWGFDKKCQFMDKLAWEVSQQYK 407


>ref|XP_010274926.1| PREDICTED: pro-resilin [Nelumbo nucifera]
          Length = 482

 Score =  338 bits (868), Expect = 1e-89
 Identities = 212/438 (48%), Positives = 256/438 (58%), Gaps = 8/438 (1%)
 Frame = -1

Query: 1522 GNPEPENSIDESPTNPSLP-GQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPIN 1346
            G P+   + D    +  LP G GHGR K              V+    + GRG       
Sbjct: 63   GKPDTTGADDAEADDSFLPSGLGHGRGKPIPSTPILPSFSSWVSGMRPSAGRGGRSTQQQ 122

Query: 1345 ASPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVP-TRPG-NPRERSLPS 1172
            +  HPS          +P   +P  +K   F + D     T   P + PG +P    LPS
Sbjct: 123  SDSHPS----------EPQDFQP--KKPIFFSREDPQGPLTQNPPISEPGRSPGGIVLPS 170

Query: 1171 DIIDILSGAGRGKPKTQPVLPTERSMAE-NRHLRPRQHPKPRVVNAEDGAGDKSSPRE-K 998
             +   L GAGRGKP    + P+E S++E NRHLRPR+           G  D++SP   +
Sbjct: 171  SLSSGLPGAGRGKPPKPSLGPSETSVSEENRHLRPRRE------GVAVGLQDRTSPSPPR 224

Query: 997  LSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQD--RYQDS 824
            LS E+ VKKAVGIL                                    G+   R++D 
Sbjct: 225  LSREDAVKKAVGILRRGGDGMEEGGRGRGTRGRGGRGRGGRGVQGWRGRGGRSGGRFRDL 284

Query: 823  DDELG-GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTNLM 647
            +D  G GLYLGD A+GE+LA +LG E M+KLVE FEEMS  VLPSP+DDAYLDA+HTN +
Sbjct: 285  EDNYGTGLYLGDNADGERLANRLGTENMDKLVEAFEEMSYSVLPSPMDDAYLDAVHTNNL 344

Query: 646  IECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMKNV 467
            IE EPEYLM  F +NPDIDEKPPI LRDAL+KVKPF MAYEGIQSQ        ETM+ +
Sbjct: 345  IEYEPEYLMGDFETNPDIDEKPPIPLRDALEKVKPFLMAYEGIQSQEEWEEIMKETMEKL 404

Query: 466  PLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWGFD 287
            P M++++D YSGPDRVT KQQQQELERVAKT+PEN  +SVK F DRAVLSLQSNPGWGFD
Sbjct: 405  PYMKELIDIYSGPDRVTGKQQQQELERVAKTLPENVPSSVKCFTDRAVLSLQSNPGWGFD 464

Query: 286  KKCQFMDKLVFEVSQQPK 233
            KKCQFMDKLV+EVSQ  K
Sbjct: 465  KKCQFMDKLVWEVSQHYK 482


>gb|KHG06267.1| FYVE, RhoGEF and PH domain-containing 2 [Gossypium arboreum]
          Length = 486

 Score =  335 bits (860), Expect = 8e-89
 Identities = 203/440 (46%), Positives = 262/440 (59%), Gaps = 9/440 (2%)
 Frame = -1

Query: 1525 IGNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPIN 1346
            +G P+ E++  +S  + ++ G GHGR +                      GRGR     N
Sbjct: 66   LGKPDSEDTKRDSAESQAV-GSGHGRGRGIPLSSEPIIPSFSSFVSQNGSGRGR---VTN 121

Query: 1345 ASPHPSSLPKESEHAQQPPALKPNDRKSFSFI--KGDTHDDETSAVPTRPGNPRERSLP- 1175
             S  P+  P        PP L P + K   F+  + +   D ++ +P       ER+   
Sbjct: 122  ESVRPTPPPPP------PPPLPPREAKQPIFVMKQDEIETDLSAKLPAESVQSSERTFSP 175

Query: 1174 -SDIIDILSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPREK 998
             +  +  LSGAGRGKP  QP  P  ++  ENRH+R +Q  + +          +  P  +
Sbjct: 176  RTPSVASLSGAGRGKPVKQPG-PVLQTKEENRHIRLKQQQQQQQ--------QQQPPSPR 226

Query: 997  LSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQ----DRYQ 830
            LS EE VKKA+GILS                                    +       +
Sbjct: 227  LSKEEAVKKAMGILSRKSESDEREDMGRSGGRGRGRGRGRGARMGRGRGRREREDTGEEE 286

Query: 829  DSDDELGG-LYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTN 653
            ++D EL   LYLG+ A+GE+LA+ +G ++MNKLVEGFEE+SSRVLPSP+DDAYL+ALHTN
Sbjct: 287  EADKELRDELYLGNNADGERLAETIGADSMNKLVEGFEEISSRVLPSPMDDAYLEALHTN 346

Query: 652  LMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMK 473
             MIE EPEYLME FG+NPDIDEKPP+SLRDAL+KVKPF M+YEGI++Q        ETM 
Sbjct: 347  FMIEFEPEYLMEEFGTNPDIDEKPPMSLRDALEKVKPFLMSYEGIENQEEWEEAIKETMD 406

Query: 472  NVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWG 293
             VPL+++I+DYYSGPDRVTAK+QQ+ELERVAKTIP++  ASVK F +RAVL+LQSNPGWG
Sbjct: 407  KVPLLQEIIDYYSGPDRVTAKKQQEELERVAKTIPKSVPASVKQFANRAVLTLQSNPGWG 466

Query: 292  FDKKCQFMDKLVFEVSQQPK 233
            FDKKCQFMDKLV EVSQQ K
Sbjct: 467  FDKKCQFMDKLVCEVSQQYK 486


>ref|XP_002274822.2| PREDICTED: coilin [Vitis vinifera]
          Length = 482

 Score =  331 bits (849), Expect = 2e-87
 Identities = 214/440 (48%), Positives = 253/440 (57%), Gaps = 20/440 (4%)
 Frame = -1

Query: 1492 ESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPINASPHPSSLPKE 1313
            ES  +P   G GHGR K                +   + G GRG G + A P   S+P  
Sbjct: 64   ESSESPFPLGLGHGRGKPPSQPSAPTLPSF---SSFASTGIGRGRGRLTAHP-TDSVP-- 117

Query: 1312 SEHAQQPPALKPNDRKSFSFIKGDTHDDET---SAVPTRPGNPRERSLPSDIIDILSG-A 1145
                QQ P   P  +K   F K D  D      S + T P  P E +LP  I+  LSG A
Sbjct: 118  ----QQSPDFAP--KKPIFFSKEDAADSAPKPQSQLGTTP--PEENNLPVSILSALSGGA 169

Query: 1144 GRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPREKLSTEEKVKKAV 965
            GRG+P  Q   P +    ENRHLR  + P  R    +  AG    P+ +LS EE VKKAV
Sbjct: 170  GRGQPLKQTPAPPKE---ENRHLRQPRQPVFRSPQ-QPVAGP---PQPRLSREEAVKKAV 222

Query: 964  GILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQDRYQ--------------- 830
            GILS                                   G+ R +               
Sbjct: 223  GILSRGGDGGGDGDGDEGGRGRGFRGRGRGRGRGAQGWMGRGRGRGRGRGRMGDRRGRGG 282

Query: 829  DSDDELG-GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTN 653
            D+ D+ G GLYLGD A+ EKL+ K+G E M+KL E FEEMS RVLPSP++DAYLDALHTN
Sbjct: 283  DAQDDYGAGLYLGDNADAEKLSNKIGLEKMSKLDEAFEEMSGRVLPSPIEDAYLDALHTN 342

Query: 652  LMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMK 473
             +IE EPEYLME FG+NPDIDE PPI LRDAL+K+KPF M YEGIQSQ        ETM+
Sbjct: 343  CLIEFEPEYLMEEFGTNPDIDENPPIPLRDALEKMKPFLMQYEGIQSQEEWEEVMKETME 402

Query: 472  NVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWG 293
            NVP ++++VDYYSGPDRVTAK+QQ+ELERVAKT+PE    SVK F DRA+LSLQSNPGWG
Sbjct: 403  NVPYLKELVDYYSGPDRVTAKKQQEELERVAKTLPETAPNSVKRFTDRAILSLQSNPGWG 462

Query: 292  FDKKCQFMDKLVFEVSQQPK 233
            FDKKCQFMDKLV+EVSQ  K
Sbjct: 463  FDKKCQFMDKLVWEVSQHYK 482


>ref|XP_012463685.1| PREDICTED: uncharacterized protein LOC105783052 isoform X1 [Gossypium
            raimondii] gi|763816483|gb|KJB83335.1| hypothetical
            protein B456_013G241700 [Gossypium raimondii]
          Length = 484

 Score =  331 bits (848), Expect = 2e-87
 Identities = 198/440 (45%), Positives = 255/440 (57%), Gaps = 9/440 (2%)
 Frame = -1

Query: 1525 IGNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPIN 1346
            +G P+ E++  +S  + ++ G GHGR +                      GRGR      
Sbjct: 66   LGKPDSEDTKRDSAESQAV-GLGHGRGRGIPFSSEPIIPSFSSFVSQNGSGRGR------ 118

Query: 1345 ASPHPSSLPKESEHAQQPPALKPNDRKSFSFI--KGDTHDDETSAVPTRPGNPRERSLPS 1172
                   +  ES     PP   P + K   F+  + +   D ++ +P       ER+   
Sbjct: 119  -------VTNESVRQTPPPPPPPREAKQPIFVMKQDEIETDSSAKLPAESVQSSERTFSP 171

Query: 1171 DIIDI--LSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPREK 998
                +  LSGAGRGKP  QP  P  ++  ENRH+R +Q  +      +     +  P  +
Sbjct: 172  STPSVASLSGAGRGKPVKQPE-PVLQTKEENRHIRLKQKQQ------QQQQQQQQPPSPR 224

Query: 997  LSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQDRYQDSDD 818
            LS EE VKKA+ ILS                                    +      ++
Sbjct: 225  LSKEEAVKKAMCILSRKSGSDEREDMGRSGGRGRGRGRGRGAQMGRGRGRREGEDTREEE 284

Query: 817  EL-----GGLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTN 653
            E        LYLG+ A+GE+LA+ +G ++MNKLVEGFEEMSSRVLPSP+DDAYL+ALHTN
Sbjct: 285  EAVKELRDELYLGNNADGERLAETIGADSMNKLVEGFEEMSSRVLPSPMDDAYLEALHTN 344

Query: 652  LMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMK 473
             MIE EPEYLME FG+NPDIDEKPP+SLRDAL+KVKPF M+YEGI++Q        ETM 
Sbjct: 345  FMIEFEPEYLMEEFGTNPDIDEKPPMSLRDALEKVKPFLMSYEGIENQEEWEEAIKETMD 404

Query: 472  NVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWG 293
             VPL+++I+DYYSGPDRVTAK+QQ+ELERVAKTIP++  ASVK F +RAVL+LQSNPGWG
Sbjct: 405  KVPLLQEIIDYYSGPDRVTAKKQQEELERVAKTIPKSAPASVKQFANRAVLTLQSNPGWG 464

Query: 292  FDKKCQFMDKLVFEVSQQPK 233
            FDKKCQFMDKLV+EVSQQ K
Sbjct: 465  FDKKCQFMDKLVWEVSQQYK 484


>ref|XP_002523666.1| conserved hypothetical protein [Ricinus communis]
            gi|223537066|gb|EEF38701.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 436

 Score =  330 bits (846), Expect = 4e-87
 Identities = 204/406 (50%), Positives = 234/406 (57%), Gaps = 28/406 (6%)
 Frame = -1

Query: 1375 GRGRGYGPI------NASPHPSSLPKESEHAQQPPALKPNDRKSF--------------- 1259
            GRGRG  P        A   P S      H   PP   P  R                  
Sbjct: 42   GRGRGSNPNLFDFTGKAPAKPESSDVAKPHYPPPPPPPPPPRNGVGHGHGGGNPILPAFS 101

Query: 1258 SFI----KGDTHDDETSAVPTRPGNPRERS-LPSDIIDILSGAGRGKPKTQPVLPTERSM 1094
            SF+    +G    D       +P   +  S LPS I   LSG GRG+P  +PV+PT +  
Sbjct: 102  SFVSSIGRGRAITDPEPGPSRQPTESQSDSVLPSTIHSSLSGFGRGEPD-KPVVPTPQVK 160

Query: 1093 AENRHLRPRQHPKPRVVNAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXX 914
             ENRH+R R   KP+   AE  A      + K+S EE VK+AV ILS             
Sbjct: 161  EENRHIRDRSRAKPKTEEAEVRA------KPKISREEAVKRAVSILSQGDTGEGMGRGRG 214

Query: 913  XXXXXXXXXXXXXXXXXXXXXXGQDRYQDSDDEL--GGLYLGDPAEGEKLAQKLGPETMN 740
                                   + R  D  DE    GL+LGD A+GEKLA K+G E MN
Sbjct: 215  GGRGRGRGRGRGRLEQ-------RGRMMDDVDEGFGSGLFLGDNADGEKLAGKIGVENMN 267

Query: 739  KLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDA 560
            KLVEG+EEMS RVLPSP++DAYLDALHTN MIE EPEYLM  F  NPDIDEKPP+ LRD 
Sbjct: 268  KLVEGYEEMSGRVLPSPMEDAYLDALHTNYMIEFEPEYLMGEFDQNPDIDEKPPMPLRDV 327

Query: 559  LDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVA 380
            L+KVKPF MAYEGIQSQ        ETMKNVPL ++IVDYYSGPDR+TAK+Q++ELERVA
Sbjct: 328  LEKVKPFIMAYEGIQSQEEWEAAVEETMKNVPLFKEIVDYYSGPDRITAKKQEEELERVA 387

Query: 379  KTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQ 242
             TIP +  ASVK F DRAVLSLQSNPGWGFDKKCQFMDKLV EV+Q
Sbjct: 388  NTIPASAPASVKRFADRAVLSLQSNPGWGFDKKCQFMDKLVREVNQ 433


>ref|XP_008225991.1| PREDICTED: la-related protein 1 [Prunus mume]
          Length = 460

 Score =  325 bits (834), Expect = 9e-86
 Identities = 207/441 (46%), Positives = 247/441 (56%), Gaps = 14/441 (3%)
 Frame = -1

Query: 1522 GNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPINA 1343
            G P+ ++   + P  PS PG GHGR K                  A+    G G G  N 
Sbjct: 65   GQPDSDDPKPDPP--PSAPGLGHGRGKPLPTFSSFV--------SAIKPNSGTGRGQPN- 113

Query: 1342 SPHPSSLPKESEHAQQPPALKPNDRKSFSFIKGDTHDDETSAVPTRPGNPRERSLPSDII 1163
                 S+P ES  +  P A      K   F++GD  D      PT PG+           
Sbjct: 114  --QVQSIP-ESRDSVAPDAGPSKPIKPIFFVRGDGSD------PTLPGS----------- 153

Query: 1162 DILSGAGRGKPK--TQPVLPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPR-EKLS 992
                  GRGKP   T+P +  +    ENRH++ R  P P           ++ PR  KL+
Sbjct: 154  ------GRGKPMNFTRPEVQVKE---ENRHIQARSEPDPDQ--------PRTRPRGPKLT 196

Query: 991  TEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQD--------- 839
             EE VK+A+GIL+                                     +         
Sbjct: 197  REEAVKQALGILAQDGAEGDDVGGGGGGGRGRGRGRGMRGRGRGRGRGRGNFRMSERGDG 256

Query: 838  -RYQDSDDELG-GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDA 665
             R +DSDD    GLYLGD A+GEKLA+KLGPE MNKLVE FEEMSS VLPSP+DDAY+DA
Sbjct: 257  RRGKDSDDSYASGLYLGDNADGEKLAKKLGPEIMNKLVESFEEMSSEVLPSPLDDAYVDA 316

Query: 664  LHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXX 485
            +HTN MIECEPEYLM  F  NPDIDEKPPISLRDAL+K+KPF MAYE IQSQ        
Sbjct: 317  MHTNFMIECEPEYLMGEFNKNPDIDEKPPISLRDALEKMKPFLMAYENIQSQEEWEEVVN 376

Query: 484  ETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFVDRAVLSLQSN 305
            ETM+ VPL+++IVD+YSGPDRVTAK+QQ+ELERVAKT+P     SVK F DRAVLSLQSN
Sbjct: 377  ETMERVPLLKEIVDHYSGPDRVTAKKQQEELERVAKTLPAKVPDSVKRFTDRAVLSLQSN 436

Query: 304  PGWGFDKKCQFMDKLVFEVSQ 242
            PGWGFD+KCQFMDKLV +VSQ
Sbjct: 437  PGWGFDRKCQFMDKLVAKVSQ 457


>ref|XP_012066680.1| PREDICTED: uncharacterized protein LOC105629668 [Jatropha curcas]
          Length = 548

 Score =  325 bits (833), Expect = 1e-85
 Identities = 202/451 (44%), Positives = 248/451 (54%), Gaps = 18/451 (3%)
 Frame = -1

Query: 1540 FQFNRIGNPEPENSIDESPTNPSLP-GQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGR 1364
            F F     P+  +S  ES   P  P G GHGR +              +     A GRGR
Sbjct: 115  FDFTAPSKPDTNDSKTESSDRPQQPAGIGHGRGRPPILPAFSSLISS-LKTSISATGRGR 173

Query: 1363 GYGPINASP--------HPSSLP-----KESEHAQQPPALKPNDRKSFSFIKGDTHDDET 1223
            G  P              P + P     +E+ H +  P ++P          G    D+ 
Sbjct: 174  GNLPSAVESGVGRGKLDKPVAAPTNQEDEENRHIRFRPTVEPG--------VGRGKPDKP 225

Query: 1222 SAVPTRPGNPRERSL---PSDIIDILSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKP 1052
             A  T   +   R +   P+    +  G GRGK        T++   ENRH+  R  P+P
Sbjct: 226  VAASTNQEDEENRHIRFRPT----VEPGVGRGKTDKPVAASTQQIKEENRHIGARSTPRP 281

Query: 1051 RVVNAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXX 872
            R V +  G     S + K+S EE  ++AV IL                            
Sbjct: 282  RTVPSRKGL---ESDKPKVSLEEATRRAVSILEQGEDDGGGGIGRGRGSRVRGRGRGRGR 338

Query: 871  XXXXXXXXGQDRYQDSDDELG-GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLP 695
                     + R +DS+ E   GL+LGD A+GEKLA+++G E MNKL+EGFEEMS RVLP
Sbjct: 339  GRWDQ----RGRMEDSEPEFATGLFLGDNADGEKLAERVGVENMNKLIEGFEEMSERVLP 394

Query: 694  SPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQ 515
            SP+ DAYL+ALHTN MIE EPEYLM  F  NPDIDEKPP+ LRD L+KVKPF MAY+GIQ
Sbjct: 395  SPMQDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEKVKPFIMAYDGIQ 454

Query: 514  SQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFV 335
            SQ        ETMKNVPL+++IVDYYSGPDRVTAK+QQ+ELERVAKTIP +  ASVK F 
Sbjct: 455  SQEEWEEVVEETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTIPASAPASVKRFA 514

Query: 334  DRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQ 242
            DRAVLSLQSNPGWGFD+KCQFMDKL  EV+Q
Sbjct: 515  DRAVLSLQSNPGWGFDRKCQFMDKLAREVNQ 545


>ref|XP_011621191.1| PREDICTED: uncharacterized protein LOC18428267 [Amborella trichopoda]
            gi|769819022|ref|XP_011621192.1| PREDICTED:
            uncharacterized protein LOC18428267 [Amborella
            trichopoda]
          Length = 471

 Score =  325 bits (833), Expect = 1e-85
 Identities = 193/414 (46%), Positives = 239/414 (57%), Gaps = 2/414 (0%)
 Frame = -1

Query: 1468 PGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGRGYGPINASPHPSSLPKESEHAQQPP 1289
            PG GHGR +              ++      GRGR   P         LP + +H+  P 
Sbjct: 71   PGIGHGRGQPIQTTPILPSFAPWMSGPVPGTGRGRPSSP---------LPPQLDHS--PN 119

Query: 1288 ALKPNDRKSFSFIKGDTHDDETSAVPTRPGNPRERSLPSDIIDI-LSGAGRGKPKTQPVL 1112
              +P  RK   F + +    +   V  +   P E  LP  I    + G GRGKP T P+L
Sbjct: 120  QQEPPSRKPIFFKRDEIEGTDEGRVQAQNLPPTESPLPRSISPAPIEGFGRGKP-TSPLL 178

Query: 1111 PTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXX 932
                   ENRH+R R  P  R   A  G   ++S   KLS+EE V+ A  ILS       
Sbjct: 179  SHGIEEEENRHIRRRSPPPERAGQASRG---RASNERKLSSEEAVRNAKDILSRGEGRGG 235

Query: 931  XXXXXXXXXXXXXXXXXXXXXXXXXXXXGQDRYQDS-DDELGGLYLGDPAEGEKLAQKLG 755
                                           RYQD  +D+  GLYLGD A+GEKL ++LG
Sbjct: 236  RGLRGGRGLRGGRGRGGVWAGRGRQGRGA--RYQDRREDDSVGLYLGDDADGEKLVKRLG 293

Query: 754  PETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPI 575
             E +N++ E F+EMS RVLPSP+++AYLDALHTN +IE EPEY ME FG+NPDIDEKPPI
Sbjct: 294  EENVNQIFEAFDEMSGRVLPSPMEEAYLDALHTNCLIEFEPEYHMEEFGTNPDIDEKPPI 353

Query: 574  SLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQE 395
             L DAL+K+KPF M YEGIQ+Q        ETM  VP ++++VD YSGPDRVTA+QQQQE
Sbjct: 354  PLCDALEKIKPFIMTYEGIQNQEEWEEVVKETMDKVPYLKELVDIYSGPDRVTARQQQQE 413

Query: 394  LERVAKTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233
            LERVA T+PEN  +SVKNF +RAVLSLQSNPGWG+DKKCQFMDKLV++VSQ  K
Sbjct: 414  LERVASTLPENVPSSVKNFTNRAVLSLQSNPGWGWDKKCQFMDKLVWQVSQDYK 467


>gb|KDP42449.1| hypothetical protein JCGZ_00246 [Jatropha curcas]
          Length = 485

 Score =  325 bits (833), Expect = 1e-85
 Identities = 202/451 (44%), Positives = 248/451 (54%), Gaps = 18/451 (3%)
 Frame = -1

Query: 1540 FQFNRIGNPEPENSIDESPTNPSLP-GQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGR 1364
            F F     P+  +S  ES   P  P G GHGR +              +     A GRGR
Sbjct: 52   FDFTAPSKPDTNDSKTESSDRPQQPAGIGHGRGRPPILPAFSSLISS-LKTSISATGRGR 110

Query: 1363 GYGPINASP--------HPSSLP-----KESEHAQQPPALKPNDRKSFSFIKGDTHDDET 1223
            G  P              P + P     +E+ H +  P ++P          G    D+ 
Sbjct: 111  GNLPSAVESGVGRGKLDKPVAAPTNQEDEENRHIRFRPTVEPG--------VGRGKPDKP 162

Query: 1222 SAVPTRPGNPRERSL---PSDIIDILSGAGRGKPKTQPVLPTERSMAENRHLRPRQHPKP 1052
             A  T   +   R +   P+    +  G GRGK        T++   ENRH+  R  P+P
Sbjct: 163  VAASTNQEDEENRHIRFRPT----VEPGVGRGKTDKPVAASTQQIKEENRHIGARSTPRP 218

Query: 1051 RVVNAEDGAGDKSSPREKLSTEEKVKKAVGILSXXXXXXXXXXXXXXXXXXXXXXXXXXX 872
            R V +  G     S + K+S EE  ++AV IL                            
Sbjct: 219  RTVPSRKGL---ESDKPKVSLEEATRRAVSILEQGEDDGGGGIGRGRGSRVRGRGRGRGR 275

Query: 871  XXXXXXXXGQDRYQDSDDELG-GLYLGDPAEGEKLAQKLGPETMNKLVEGFEEMSSRVLP 695
                     + R +DS+ E   GL+LGD A+GEKLA+++G E MNKL+EGFEEMS RVLP
Sbjct: 276  GRWDQ----RGRMEDSEPEFATGLFLGDNADGEKLAERVGVENMNKLIEGFEEMSERVLP 331

Query: 694  SPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDEKPPISLRDALDKVKPFFMAYEGIQ 515
            SP+ DAYL+ALHTN MIE EPEYLM  F  NPDIDEKPP+ LRD L+KVKPF MAY+GIQ
Sbjct: 332  SPMQDAYLEALHTNYMIEFEPEYLMGEFDQNPDIDEKPPLPLRDVLEKVKPFIMAYDGIQ 391

Query: 514  SQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQQQQELERVAKTIPENTSASVKNFV 335
            SQ        ETMKNVPL+++IVDYYSGPDRVTAK+QQ+ELERVAKTIP +  ASVK F 
Sbjct: 392  SQEEWEEVVEETMKNVPLLKEIVDYYSGPDRVTAKKQQEELERVAKTIPASAPASVKRFA 451

Query: 334  DRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQ 242
            DRAVLSLQSNPGWGFD+KCQFMDKL  EV+Q
Sbjct: 452  DRAVLSLQSNPGWGFDRKCQFMDKLAREVNQ 482


>ref|XP_009340130.1| PREDICTED: collagen alpha-1(III) chain-like [Pyrus x bretschneideri]
          Length = 521

 Score =  315 bits (808), Expect = 9e-83
 Identities = 207/478 (43%), Positives = 258/478 (53%), Gaps = 45/478 (9%)
 Frame = -1

Query: 1531 NRIGNPEPENSIDESPTNPSLPGQGHGRAKXXXXXXXXXXXXXLVNNDAVARGRGR---- 1364
            NR+      N     P   S+PG GHGR K              V +       GR    
Sbjct: 58   NRVPGQSDSNEPKSEPP-ASVPGIGHGRGKPLASSQPPSSFSSFVTSIRPDSAAGRVQPG 116

Query: 1363 -------GYGPINASPHPSSLPK----ESEHAQQPPAL-----KPNDRKSFSFIKGDTHD 1232
                    + P+ +   PS          E    PP       KP  + S   ++     
Sbjct: 117  QVQPGPKAHDPVASDAGPSKPAAPIFFRGEDGSDPPLPGGGRGKPMSQPSPCQVQPGPQA 176

Query: 1231 DE---TSAVPTRPGNP----RERSLPSDIIDI-LSGAGRGKPKTQP-------------V 1115
             +   + A P++P  P    RE     D +D+ L G GRGKP +QP             V
Sbjct: 177  RDPVASDASPSKPATPFFFRRE-----DGLDLPLPGGGRGKPMSQPGPELLVKEVNRHFV 231

Query: 1114 LPTERSMAENRHLRPRQHPKPRVVNAEDGAGDKSSPR-EKLSTEEKVKKAVGILSXXXXX 938
             P  +   ENRH++ R          +D A ++++PR  KL+ EE V KA+GIL      
Sbjct: 232  APKSQIEKENRHIQARPD--------QDPAHNRTAPRGPKLTREEAVAKALGILQRDDAE 283

Query: 937  XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXGQ--DRYQDSDDELG-GLYLGDPAEGEKLA 767
                                           +  D+ +D D+  G GLYLGD A+GEKLA
Sbjct: 284  GGSGGGGDRGGGRGRGMRGRRGGRGRGRGDFRRSDKGKDLDEGKGSGLYLGDNADGEKLA 343

Query: 766  QKLGPETMNKLVEGFEEMSSRVLPSPVDDAYLDALHTNLMIECEPEYLMEVFGSNPDIDE 587
            + LGPE MNKLVEGFEEMSS VLPSP+D+A++DA+HTN MIECEPE+LME F  NPDIDE
Sbjct: 344  KTLGPENMNKLVEGFEEMSSEVLPSPLDEAFVDAMHTNYMIECEPEFLMEDFSKNPDIDE 403

Query: 586  KPPISLRDALDKVKPFFMAYEGIQSQXXXXXXXXETMKNVPLMEKIVDYYSGPDRVTAKQ 407
            KPPISLRDAL+K+KPF MAYEGIQS         E M+ VPL+++IVD+YSGPDRVTAK+
Sbjct: 404  KPPISLRDALEKMKPFLMAYEGIQSHEEWEEAVKEVMERVPLLKEIVDHYSGPDRVTAKK 463

Query: 406  QQQELERVAKTIPENTSASVKNFVDRAVLSLQSNPGWGFDKKCQFMDKLVFEVSQQPK 233
            QQ+ELERVAKT+P     SVK F DRAVLSLQSNPGWGFD+KCQFMDKLV +VS+  K
Sbjct: 464  QQEELERVAKTLPTKVPESVKRFTDRAVLSLQSNPGWGFDRKCQFMDKLVEKVSKHYK 521

