BLASTX nr result

ID: Jatropha_contig00026128 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Jatropha_contig00026128
         (613 letters)

Database: NCBI-nr (updated 2014/02/11) 
           35,149,712 sequences; 12,374,887,350 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EEF00505.2| hypothetical protein POPTR_0010s00550g [Populus t...   177   3e-42
gb|ESR32449.1| hypothetical protein CICLE_v10004653mg [Citrus cl...   170   2e-40
ref|XP_002519338.1| conserved hypothetical protein [Ricinus comm...   165   8e-39
gb|EMJ24269.1| hypothetical protein PRUPE_ppa003741mg [Prunus pe...   137   3e-30
ref|XP_003547541.1| PREDICTED: uncharacterized protein LOC100820...   132   6e-29
ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306...   132   7e-29
ref|XP_003534989.1| PREDICTED: uncharacterized protein LOC100809...   131   1e-28
emb|CBI26022.3| unnamed protein product [Vitis vinifera]              131   2e-28
emb|CAN81150.1| hypothetical protein VITISV_020816 [Vitis vinifera]   131   2e-28
ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256...   129   8e-28
gb|ESW10567.1| hypothetical protein PHAVU_009G220500g [Phaseolus...   125   7e-27
gb|EOY06943.1| Hydroxyproline-rich glycoprotein family protein, ...   122   7e-26
gb|EOY06942.1| Hydroxyproline-rich glycoprotein family protein, ...   122   7e-26
gb|EOY06940.1| Hydroxyproline-rich glycoprotein family protein, ...   122   7e-26
gb|EOY06939.1| Hydroxyproline-rich glycoprotein family protein, ...   122   7e-26
gb|ESQ30731.1| hypothetical protein EUTSA_v10012201mg [Eutrema s...   113   3e-23
ref|XP_006307495.1| hypothetical protein CARUB_v10009119mg [Caps...   110   4e-22
ref|NP_564524.1| hydroxyproline-rich glycoprotein-like protein [...   109   5e-22
ref|XP_002891421.1| hypothetical protein ARALYDRAFT_473964 [Arab...   109   7e-22
ref|XP_003541329.1| PREDICTED: uncharacterized protein LOC100798...   108   1e-21

>gb|EEF00505.2| hypothetical protein POPTR_0010s00550g [Populus trichocarpa]
          Length = 547

 Score =  177 bits (448), Expect = 3e-42
 Identities = 111/187 (59%), Positives = 137/187 (73%), Gaps = 12/187 (6%)
 Frame = +1

Query: 88  HSTTPSRFRLNSKTKEAA----NGSL--SPAQEARAKSVPPDVKKDSKLKRSLM-LNKQK 246
           HSTTPSR R+N KT + A    NGS   SPA + RAKSVPPDVKKD+K+++SL+  NK K
Sbjct: 4   HSTTPSRHRVNFKTPKPAEVANNGSPVPSPANKTRAKSVPPDVKKDTKVRKSLVGNNKPK 63

Query: 247 SGEQLLGSQNGVKVVARSVNRPVVEQFAKPRR----LPQLDSSARRKEEGDKE-LQERLQ 411
           SGE ++GSQ+ V VV RSVNRP  EQFA+PRR    L  +++S R +EE  K+ L E+L+
Sbjct: 64  SGELVVGSQD-VTVVGRSVNRPGSEQFARPRRQRPVLDPINASRRNEEESYKKGLHEKLE 122

Query: 412 LNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQESNG 591
           L+E LI DLQS+VLALK ELDKA   N ELELQNKKL +DLAAAEAKV+A + R Q S G
Sbjct: 123 LSETLINDLQSEVLALKVELDKANGLNQELELQNKKLTEDLAAAEAKVSALNTRHQ-SVG 181

Query: 592 KQKSPIF 612
           + + P F
Sbjct: 182 EHQRPRF 188


>gb|ESR32449.1| hypothetical protein CICLE_v10004653mg [Citrus clementina]
          Length = 561

 Score =  170 bits (431), Expect = 2e-40
 Identities = 110/202 (54%), Positives = 139/202 (68%), Gaps = 20/202 (9%)
 Frame = +1

Query: 67  AKPTAMSHST---TPSRFRLNSKTKEAA------NG-SLSPAQEARAKSVPPDVKKD--S 210
           +K   MSHST   T SR R NSKT+E+       NG SLSP  +ARAKSVPPDVK +  S
Sbjct: 8   SKTNNMSHSTAATTTSRLRANSKTRESPKQEAGINGVSLSPELKARAKSVPPDVKTNNIS 67

Query: 211 KLKRSLMLNKQKSGEQLLGSQNG--VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEG 384
           K +R+L+LNK KS E  +GS     VKV  RS+NRPVVEQFA+PRR   +D++  + E+G
Sbjct: 68  KSRRALVLNKPKSAEGAVGSHKDDEVKVFGRSLNRPVVEQFARPRRQRIVDANPGKIEDG 127

Query: 385 -----DKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEA 549
                 KE +E+L+L+E L+KDLQS+V ALKAE  KAQS N ELE QNKKL +DL AAEA
Sbjct: 128 LMDKKKKEFEEKLRLSENLVKDLQSEVFALKAEFVKAQSLNAELEKQNKKLVEDLVAAEA 187

Query: 550 KVAAFHIRDQ-ESNGKQKSPIF 612
           K+A+   R+Q E+ G+ +SP F
Sbjct: 188 KIASLSSREQREAVGEYQSPKF 209


>ref|XP_002519338.1| conserved hypothetical protein [Ricinus communis]
           gi|223541653|gb|EEF43202.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 532

 Score =  165 bits (418), Expect = 8e-39
 Identities = 108/184 (58%), Positives = 131/184 (71%), Gaps = 7/184 (3%)
 Frame = +1

Query: 82  MSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLN-KQKSGEQ 258
           MS  TTPSRFRLNSK  +       PA++ RA+SVPPD KKD+KL+RS+++N K KS ++
Sbjct: 1   MSQPTTPSRFRLNSKAPKPE----PPAKKERAQSVPPDFKKDTKLRRSVLVNTKPKSRDE 56

Query: 259 LLGSQNGVKVVAR---SVNRPVVEQFAKPRRLPQLDSSARRKEEGDK-ELQERLQLNEIL 426
           LLGSQ  V  V     SVNRPV EQF+KPR       SAR+ EE  K EL ER++LN+ L
Sbjct: 57  LLGSQMEVARVVSPSLSVNRPVHEQFSKPRT----QRSARKIEEDTKKELLERIELNDNL 112

Query: 427 IKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF--HIRDQESNGKQK 600
           I+DL+SQVL+LKAELDKAQS N+ELE QNKKL QDLA+AEAKVAA   +    ES G  +
Sbjct: 113 IQDLKSQVLSLKAELDKAQSLNEELESQNKKLQQDLASAEAKVAAALNNTPLPESIGGYQ 172

Query: 601 SPIF 612
           SP F
Sbjct: 173 SPKF 176


>gb|EMJ24269.1| hypothetical protein PRUPE_ppa003741mg [Prunus persica]
          Length = 552

 Score =  137 bits (344), Expect = 3e-30
 Identities = 89/187 (47%), Positives = 116/187 (62%), Gaps = 10/187 (5%)
 Frame = +1

Query: 82  MSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNKQKSGEQL 261
           MS  T PS  R ++ +K   + S  P+   RAKS+          +RSL+LNK KSGE +
Sbjct: 20  MSQPTPPSYLRASASSKAKESPSPRPS---RAKSI----------RRSLLLNKPKSGELV 66

Query: 262 LGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEG----DKELQERLQL 414
           LGSQ        K V R  NR V EQFA+PR     D +++R EE     ++ELQERL +
Sbjct: 67  LGSQKSKELEETKAVGRPGNRQVAEQFARPRPQRPADPNSKRNEEDPHVKNRELQERLDM 126

Query: 415 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 591
           +E L  + Q++VLALKAELDKAQ  N EL+ QNK L + LAAAEAK+AAF  R+Q E+NG
Sbjct: 127 SESLTMNFQAEVLALKAELDKAQGLNVELQSQNKNLTEKLAAAEAKIAAFTTREQRETNG 186

Query: 592 KQKSPIF 612
           + +SP F
Sbjct: 187 EYQSPKF 193


>ref|XP_003547541.1| PREDICTED: uncharacterized protein LOC100820086 [Glycine max]
          Length = 576

 Score =  132 bits (333), Expect = 6e-29
 Identities = 87/194 (44%), Positives = 121/194 (62%), Gaps = 17/194 (8%)
 Frame = +1

Query: 82  MSHSTTPSRFRLNSKTKE-------AANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNK 240
           + +S TPSR RL SK +E         N  +      RAKSV P++K +S++K+ L+LNK
Sbjct: 27  IQNSLTPSRLRLPSKYREPPKTPPEVVNNGMVSTPLRRAKSVTPELKHNSRIKKGLVLNK 86

Query: 241 QKSGEQLLGS-QNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD----K 390
            K  E++LG+ Q G      KVV+R V    VEQF++PR      +  R KE+ D    K
Sbjct: 87  AKPNEEVLGTTQRGREVEEAKVVSRFVRPHAVEQFSRPRSGVGDFAFKRDKEDPDGKSKK 146

Query: 391 ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHI 570
           EL E+L+ +E LIK+LQS+VLALKAEL+K +  N ELE  N+KL +DLAAAEAKV +   
Sbjct: 147 ELMEKLEASESLIKNLQSEVLALKAELEKVKGLNVELESNNRKLTEDLAAAEAKVVSLS- 205

Query: 571 RDQESNGKQKSPIF 612
            +++ NG+ +SP F
Sbjct: 206 GNEKPNGEHQSPKF 219


>ref|XP_004297066.1| PREDICTED: uncharacterized protein LOC101306034 [Fragaria vesca
           subsp. vesca]
          Length = 560

 Score =  132 bits (332), Expect = 7e-29
 Identities = 91/190 (47%), Positives = 120/190 (63%), Gaps = 15/190 (7%)
 Frame = +1

Query: 88  HSTTPSRFRLNSKTKEAANGSLSPAQE-ARAKSVPPDVKKDS---KLKRSLMLNKQKSGE 255
           HST  S+ R +SK KE    S SP Q  +RAKSV PDV   S    ++R+L+ NK KSGE
Sbjct: 15  HSTNMSQLRASSKAKE----SQSPTQRPSRAKSVTPDVNHSSDSRSVRRALLQNKPKSGE 70

Query: 256 QLLGSQNG-----VKVVARSVNRPVVEQFAKPRRL-PQLDSSARRKEEGD----KELQER 405
            +LGSQ        KVV  S    VVEQFAKPRR  P ++++ +R E+      KE+QE+
Sbjct: 71  LVLGSQKSKDFEEFKVVGSSRKPQVVEQFAKPRRQRPVVEANCKRNEDDPHRNMKEMQEK 130

Query: 406 LQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-E 582
           ++++E +I  LQ++VL LK ELDK    N EL+ +NKKL+++L AAEAK+AA     Q E
Sbjct: 131 IEMSESMIMKLQAEVLGLKVELDKEHGLNLELQAKNKKLSENLTAAEAKIAALTTPQQRE 190

Query: 583 SNGKQKSPIF 612
           SNG Q SP F
Sbjct: 191 SNGYQ-SPKF 199


>ref|XP_003534989.1| PREDICTED: uncharacterized protein LOC100809254 [Glycine max]
          Length = 562

 Score =  131 bits (330), Expect = 1e-28
 Identities = 87/195 (44%), Positives = 119/195 (61%), Gaps = 18/195 (9%)
 Frame = +1

Query: 82  MSHSTTPSRFRLNSKTKE--------AANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLN 237
           + +S TPSR RL SK +E          N  +      RAKSV P++K +S++KR L+LN
Sbjct: 10  LQNSLTPSRLRLPSKYREPPKTPPEVVVNNVVVSTPSRRAKSVTPELKHNSRIKRGLVLN 69

Query: 238 KQKSGEQLLGS-QNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD---- 387
           K K  E+++G+ Q G      KVVAR V   VVEQFA+PR      +  R KE+ D    
Sbjct: 70  KAKPNEEVVGTTQRGREAEETKVVARFVRPHVVEQFARPRNGAGDFAFKRDKEDSDEKSK 129

Query: 388 KELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFH 567
           KEL E+L+ +E LIK+LQS+V ALKAEL+K +    ELE  N+KL +DLAAAE KV +  
Sbjct: 130 KELMEKLEASESLIKNLQSEVQALKAELEKVKGLKVELESHNRKLTEDLAAAEVKVVSLG 189

Query: 568 IRDQESNGKQKSPIF 612
             +++ NG+ +SP F
Sbjct: 190 -GNEKPNGEHQSPKF 203


>emb|CBI26022.3| unnamed protein product [Vitis vinifera]
          Length = 572

 Score =  131 bits (329), Expect = 2e-28
 Identities = 95/215 (44%), Positives = 120/215 (55%), Gaps = 16/215 (7%)
 Frame = +1

Query: 16  KEIDATNNRKGNINKPA-AKPTAMSHSTTPSRFRLNSKTKEAA-------NG--SLSPAQ 165
           + IDA +      N P   K T  SH   PS    +S +  +        NG  S SPA 
Sbjct: 12  RPIDALSQEAMKQNPPTPCKTTTPSHLRRPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAP 71

Query: 166 EARAKSVPPDVKKDSKLKRSLMLNKQKSGEQLLGSQNG-----VKVVARSVNRPVVEQFA 330
             RA+S P ++    K +RSL+LNK KSG+  LGSQ       VKV+ RS NRPVV+Q A
Sbjct: 72  RPRARSGPLEMNNSHKARRSLLLNKPKSGDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA 131

Query: 331 KPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQ 510
                P+  S     ++  KELQE+L L + LI +LQS+VL LKAELDKAQS N EL+  
Sbjct: 132 -----PRRPSEGPEPDDKTKELQEKLDLRQNLINNLQSEVLGLKAELDKAQSFNLELQSL 186

Query: 511 NKKLAQDLAAAEAKVAAFHIRDQ-ESNGKQKSPIF 612
           N KL +DLAAA AK+ A   R Q ES  + +SP F
Sbjct: 187 NAKLTEDLAAALAKITALTSRQQEESVTEYQSPKF 221


>emb|CAN81150.1| hypothetical protein VITISV_020816 [Vitis vinifera]
          Length = 348

 Score =  131 bits (329), Expect = 2e-28
 Identities = 95/215 (44%), Positives = 120/215 (55%), Gaps = 16/215 (7%)
 Frame = +1

Query: 16  KEIDATNNRKGNINKPA-AKPTAMSHSTTPSRFRLNSKTKEAA-------NG--SLSPAQ 165
           + IDA +      N P   K T  SH   PS    +S +  +        NG  S SPA 
Sbjct: 12  RPIDALSQEAMKQNPPTPCKTTTPSHLRRPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAP 71

Query: 166 EARAKSVPPDVKKDSKLKRSLMLNKQKSGEQLLGSQNG-----VKVVARSVNRPVVEQFA 330
             RA+S P ++    K +RSL+LNK KSG+  LGSQ       VKV+ RS NRPVV+Q A
Sbjct: 72  RPRARSGPLEMNNSHKARRSLLLNKPKSGDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA 131

Query: 331 KPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQ 510
                P+  S     ++  KELQE+L L + LI +LQS+VL LKAELDKAQS N EL+  
Sbjct: 132 -----PRRPSEGPEPDDKTKELQEKLDLRQNLINNLQSEVLGLKAELDKAQSFNLELQSL 186

Query: 511 NKKLAQDLAAAEAKVAAFHIRDQ-ESNGKQKSPIF 612
           N KL +DLAAA AK+ A   R Q ES  + +SP F
Sbjct: 187 NAKLTEDLAAALAKITALTSRQQEESVTEYQSPKF 221


>ref|XP_002275219.2| PREDICTED: uncharacterized protein LOC100256278 [Vitis vinifera]
          Length = 551

 Score =  129 bits (323), Expect = 8e-28
 Identities = 85/187 (45%), Positives = 113/187 (60%), Gaps = 6/187 (3%)
 Frame = +1

Query: 70  KPTAMSHSTTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNKQKS 249
           +P++ S S++ S  ++ +        S SPA   RA+S P ++    K +RSL+LNK KS
Sbjct: 19  RPSSSSSSSSSSSSKVKAVGVLNGVSSPSPAPRPRARSGPLEMNNSHKARRSLLLNKPKS 78

Query: 250 GEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQL 414
           G+  LGSQ       VKV+ RS NRPVV+Q A     P+  S     ++  KELQE+L L
Sbjct: 79  GDHALGSQKPRDAEEVKVMGRSRNRPVVDQLA-----PRRPSEGPEPDDKTKELQEKLDL 133

Query: 415 NEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQ-ESNG 591
            + LI +LQS+VL LKAELDKAQS N EL+  N KL +DLAAA AK+ A   R Q ES  
Sbjct: 134 RQNLINNLQSEVLGLKAELDKAQSFNLELQSLNAKLTEDLAAALAKITALTSRQQEESVT 193

Query: 592 KQKSPIF 612
           + +SP F
Sbjct: 194 EYQSPKF 200


>gb|ESW10567.1| hypothetical protein PHAVU_009G220500g [Phaseolus vulgaris]
           gi|561011661|gb|ESW10568.1| hypothetical protein
           PHAVU_009G220500g [Phaseolus vulgaris]
          Length = 584

 Score =  125 bits (315), Expect = 7e-27
 Identities = 85/195 (43%), Positives = 117/195 (60%), Gaps = 18/195 (9%)
 Frame = +1

Query: 82  MSHSTTPSRFRLNSKTKE-------AANGSLSPAQEARAKSVPPDVKKDSKLKRSLMLNK 240
           +  S TPSR RL SK +E         NG +S     RAKSV P++K  S++KR L+LNK
Sbjct: 28  LQSSLTPSRLRLPSKYREPPRTPPEVVNGVVSTPTR-RAKSVTPELKHASRIKRGLVLNK 86

Query: 241 QKSGEQLLGSQNG-----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGD----KE 393
            K  E+++G+  G      K V R +    VEQFA PR      +  R KEE D    KE
Sbjct: 87  AKPNEEVVGTHRGREAVEPKAVPRFMRPHAVEQFASPRSAVGDFAMKRDKEEPDGKSKKE 146

Query: 394 LQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAF--H 567
           L E+L+++E LI++LQS+VLALKAEL+K +  N ELE  N+KL +D+AAAE+KV +    
Sbjct: 147 LMEKLEVSESLIRNLQSEVLALKAELEKVKGLNVELESHNRKLTKDIAAAESKVMSLGGS 206

Query: 568 IRDQESNGKQKSPIF 612
            + +E  G+ +SP F
Sbjct: 207 EKMKEPIGEHQSPKF 221


>gb|EOY06943.1| Hydroxyproline-rich glycoprotein family protein, putative isoform
           5, partial [Theobroma cacao]
          Length = 458

 Score =  122 bits (306), Expect = 7e-26
 Identities = 92/200 (46%), Positives = 119/200 (59%), Gaps = 18/200 (9%)
 Frame = +1

Query: 58  KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 226 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390
           L+LNK KSG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 559 AFHIRD-----QESNGKQKS 603
           A   RD     +ESNG  +S
Sbjct: 184 ALASRDKVQLQRESNGDDQS 203


>gb|EOY06942.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 4
           [Theobroma cacao]
          Length = 565

 Score =  122 bits (306), Expect = 7e-26
 Identities = 92/200 (46%), Positives = 119/200 (59%), Gaps = 18/200 (9%)
 Frame = +1

Query: 58  KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 226 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390
           L+LNK KSG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 559 AFHIRD-----QESNGKQKS 603
           A   RD     +ESNG  +S
Sbjct: 184 ALASRDKVQLQRESNGDDQS 203


>gb|EOY06940.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 2
           [Theobroma cacao]
          Length = 561

 Score =  122 bits (306), Expect = 7e-26
 Identities = 92/200 (46%), Positives = 119/200 (59%), Gaps = 18/200 (9%)
 Frame = +1

Query: 58  KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 226 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390
           L+LNK KSG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 559 AFHIRD-----QESNGKQKS 603
           A   RD     +ESNG  +S
Sbjct: 184 ALASRDKVQLQRESNGDDQS 203


>gb|EOY06939.1| Hydroxyproline-rich glycoprotein family protein, putative isoform 1
           [Theobroma cacao]
          Length = 564

 Score =  122 bits (306), Expect = 7e-26
 Identities = 92/200 (46%), Positives = 119/200 (59%), Gaps = 18/200 (9%)
 Frame = +1

Query: 58  KPAA-KPTAMSH--STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDSK-LKRS 225
           KPAA K T MSH  STTPSR R+NSK         S   EAR ++  P VK  +K   +S
Sbjct: 19  KPAACKLTPMSHLQSTTPSRCRVNSKPINH-----SAKAEARPETATPHVKDSTKNSSKS 73

Query: 226 LMLNKQKSGEQLLGSQNGVKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDK----- 390
           L+LNK KSG+Q        +VV       VV+QFA+PRRL   +++  +K E  +     
Sbjct: 74  LLLNKPKSGDQ-------PQVVGSHHKGRVVDQFARPRRL---NANLTKKSEESRSAIEK 123

Query: 391 ----ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558
               EL+E+L  +E L+KDL++QVL LKAELD A+S N ELE  N+KL +DL AAEAK+A
Sbjct: 124 NNIDELREKLSCSEALVKDLRTQVLGLKAELDGARSLNMELESLNRKLNEDLVAAEAKIA 183

Query: 559 AFHIRD-----QESNGKQKS 603
           A   RD     +ESNG  +S
Sbjct: 184 ALASRDKVQLQRESNGDDQS 203


>gb|ESQ30731.1| hypothetical protein EUTSA_v10012201mg [Eutrema salsugineum]
          Length = 554

 Score =  113 bits (283), Expect = 3e-23
 Identities = 82/184 (44%), Positives = 107/184 (58%), Gaps = 13/184 (7%)
 Frame = +1

Query: 91  STTPSRFRLNSKTKEAANGSLSPAQEARAKSVPPDVKKDS----KLKRSLMLNKQKSGEQ 258
           STTPSR R       AAN   S     RA+      K       K +RS++L + KSGE+
Sbjct: 8   STTPSRVR-------AANSHYSVISRPRAQDDNGKPKSSGHDPGKNRRSILLKRAKSGEE 60

Query: 259 ---LLGSQNGVKVVARSVNRP-VVEQFAKPRRLPQLDSSARRKEEGDK-----ELQERLQ 411
              +L  Q      ARSVNRP VVEQF  PRR     S     EE +K     EL+E+L 
Sbjct: 61  ETAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKSEEMAAEEDEKRKKMEELEEKLV 115

Query: 412 LNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHIRDQESNG 591
            NE LIKDLQ+QV +LKAEL++A+SSN ELEL+N+KL+QDL +AEAK+++    D+ +  
Sbjct: 116 ANESLIKDLQAQVSSLKAELEEARSSNAELELKNRKLSQDLVSAEAKISSLSSNDKPAKE 175

Query: 592 KQKS 603
            Q +
Sbjct: 176 HQNT 179


>ref|XP_006307495.1| hypothetical protein CARUB_v10009119mg [Capsella rubella]
           gi|482576206|gb|EOA40393.1| hypothetical protein
           CARUB_v10009119mg [Capsella rubella]
          Length = 450

 Score =  110 bits (274), Expect = 4e-22
 Identities = 85/196 (43%), Positives = 112/196 (57%), Gaps = 25/196 (12%)
 Frame = +1

Query: 91  STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPDVKKD-SKLKRSLML 234
           STTPSR R       AAN   S     RA           K    DVK D +K +RS++L
Sbjct: 8   STTPSRVR-------AANSHYSVISRPRAQDDNGLAGGKPKHSNHDVKNDPAKNRRSILL 60

Query: 235 NKQKSGEQLLGSQNGVKVVARSVNRP-VVEQFAKPRR----------LPQLDSSARRKEE 381
            K KSG++   +   V   ARSVNRP VVEQF  PRR          +   +++A   E+
Sbjct: 61  RKAKSGDEETTAVL-VPQRARSVNRPAVVEQFGCPRRPISRKIEETVMSTAEAAAEEDEK 119

Query: 382 GDK--ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKV 555
             +  EL+E+L +NE LIKDLQ QVL LK EL++A+SSN ELEL N+KL+QDLA+AEAK+
Sbjct: 120 RKRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARSSNVELELNNRKLSQDLASAEAKI 179

Query: 556 AAFHIRDQESNGKQKS 603
           ++    D+ +   Q S
Sbjct: 180 SSLSSNDKPAKEHQNS 195


>ref|NP_564524.1| hydroxyproline-rich glycoprotein-like protein [Arabidopsis
           thaliana] gi|8778962|gb|AAD49768.2|AC007932_16 F11A17.16
           [Arabidopsis thaliana] gi|332194150|gb|AEE32271.1|
           hydroxyproline-rich glycoprotein-like protein
           [Arabidopsis thaliana]
          Length = 558

 Score =  109 bits (273), Expect = 5e-22
 Identities = 85/195 (43%), Positives = 110/195 (56%), Gaps = 24/195 (12%)
 Frame = +1

Query: 91  STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPDVKKDSKLKRSLMLN 237
           STTPSR R       AAN   S   + RA           KS   DVK D   +RS++L 
Sbjct: 8   STTPSRVR-------AANSHYSVISKPRAQDDNGLTGGKPKSSGYDVKNDPAKRRSILLK 60

Query: 238 KQKSGEQ---LLGSQNGVKVVARSVNRP-VVEQFAKPRRLPQLDS------SARRKEEGD 387
           + KS E+   +L  Q      ARSVNRP VVEQF  PRR     S      +A  ++E  
Sbjct: 61  RAKSAEEEMAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKSEETVMATAAAEDEKR 115

Query: 388 K---ELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVA 558
           K   EL+E+L +NE LIKDLQ QVL LK EL++A++SN ELEL N+KL+QDL +AEAK++
Sbjct: 116 KRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARNSNVELELNNRKLSQDLVSAEAKIS 175

Query: 559 AFHIRDQESNGKQKS 603
           +    D+ +   Q S
Sbjct: 176 SLSSNDKPAKEHQNS 190


>ref|XP_002891421.1| hypothetical protein ARALYDRAFT_473964 [Arabidopsis lyrata subsp.
           lyrata] gi|297337263|gb|EFH67680.1| hypothetical protein
           ARALYDRAFT_473964 [Arabidopsis lyrata subsp. lyrata]
          Length = 567

 Score =  109 bits (272), Expect = 7e-22
 Identities = 84/197 (42%), Positives = 112/197 (56%), Gaps = 26/197 (13%)
 Frame = +1

Query: 91  STTPSRFRLNSKTKEAANGSLSPAQEARA-----------KSVPPDVKKD-SKLKRSLML 234
           STTPSR R       AAN   S   + RA           KS   DVK D +K +RS++L
Sbjct: 8   STTPSRVR-------AANSHYSVISKPRAQDDNGLTGGKPKSSGHDVKNDPAKNRRSILL 60

Query: 235 NKQKSGEQ---LLGSQNGVKVVARSVNRP-VVEQFAKPRR----------LPQLDSSARR 372
            + K GE+   +L  Q      ARSVNRP VVEQF  PRR          +     +   
Sbjct: 61  KRAKYGEEETAVLAPQR-----ARSVNRPAVVEQFGCPRRPISRKTEESVMATAVVAEDE 115

Query: 373 KEEGDKELQERLQLNEILIKDLQSQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAK 552
           K +  +EL+E+L +NE LIKDLQ QVL LK EL++A++SN ELEL+NKKL+QDLA+AEAK
Sbjct: 116 KRKRMEELEEKLVVNESLIKDLQLQVLNLKTELEEARNSNAELELKNKKLSQDLASAEAK 175

Query: 553 VAAFHIRDQESNGKQKS 603
           +++    D+ +   Q +
Sbjct: 176 ISSLSSNDKPAKEHQNT 192


>ref|XP_003541329.1| PREDICTED: uncharacterized protein LOC100798183 [Glycine max]
          Length = 565

 Score =  108 bits (269), Expect = 1e-21
 Identities = 75/178 (42%), Positives = 109/178 (61%), Gaps = 7/178 (3%)
 Frame = +1

Query: 100 PSRFRLNSKTKEAANGSLS--PAQEARAKSVPPDVKKDSKLKRSLMLNKQKSGEQLLGSQ 273
           P R R +SK  ++    ++       RA+SVPPD+K  S+ KR +++NK K  E++LGSQ
Sbjct: 33  PPRLRASSKAPKSPPEVVNRESISSTRAESVPPDLKNVSRAKRGVVVNKPKLNEEVLGSQ 92

Query: 274 NG----VKVVARSVNRPVVEQFAKPRRLPQLDSSARRKEEGDKELQERLQLNEILIKDLQ 441
                 + +VAR   R  V  F   R+    DS  ++K+E    LQE+L+++E LIK LQ
Sbjct: 93  KAEEGKIVIVARPRRR--VGDFGS-RKSEDDDSHGKKKKE---LLQEKLEVSENLIKSLQ 146

Query: 442 SQVLALKAELDKAQSSNDELELQNKKLAQDLAAAEAKVAAFHI-RDQESNGKQKSPIF 612
           S+VLAL+ ELD+ +S N ELE QN KL Q+LAAAEAK++   I  + +  G+ +SP F
Sbjct: 147 SEVLALREELDRVKSLNVELESQNTKLTQNLAAAEAKISNVGIGNNGKPIGEHRSPKF 204


Top