BLASTX nr result

ID: Rehmannia26_contig00012514 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Rehmannia26_contig00012514
         (805 letters)

Database: ./nr 
           37,332,560 sequences; 13,225,080,153 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579...   151   3e-34
gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich g...   127   5e-27
ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295...   116   8e-24
gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich g...   112   2e-22
ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295...   111   3e-22
ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296...   109   1e-21
gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich g...   108   2e-21
ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293...   106   1e-20
gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao]   105   2e-20
ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303...   105   2e-20
ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296...   105   2e-20
gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]     103   5e-20
gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich g...   103   7e-20
emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]   103   7e-20
gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich g...   103   9e-20
ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295...   101   3e-19
ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302...   100   5e-19
emb|CBI22611.3| unnamed protein product [Vitis vinifera]              100   5e-19
gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]     100   8e-19
gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich g...   100   1e-18

>ref|XP_006343918.1| PREDICTED: uncharacterized protein LOC102579067 [Solanum tuberosum]
          Length = 197

 Score =  151 bits (381), Expect = 3e-34
 Identities = 82/199 (41%), Positives = 125/199 (62%), Gaps = 4/199 (2%)
 Frame = -3

Query: 752 QGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXIF 573
           Q YPLAPS I+PRSD E+ATN NFQS  Q +KKK    L                    F
Sbjct: 5   QKYPLAPSNIMPRSDAEFATN-NFQSNNQRRKKK----LRSTFLLTIFLTGIILLFCFTF 59

Query: 572 VRVRTPRVRLDDITVTSDANTGNVRFTGRVSVRNRNFGRYEFDSSLATIRSGN-TNVGQF 396
           +R+++P++R+++I +T+D + G + F+ +V +RNRNF RY +DS+L TI +   T +G+F
Sbjct: 60  LRIKSPKIRIENIRITNDGD-GRINFSAQVFLRNRNFWRYGYDSTLGTINTAEGTTIGRF 118

Query: 395 VIHDARARARSTRRIYVIADLRVPG---TNSSVLDLNVEARLRGKVRLVRVIRRNRSADM 225
           VI D   R RST+ IYV+ +  +P      S +L +  EA++RGKV++ RV R  ++ D+
Sbjct: 119 VIPDGEVRRRSTKTIYVMENFILPSRLNNTSGILPVISEAKIRGKVKVFRVFRWKKNVDL 178

Query: 224 NCTMRINLSTNVVQNLRCE 168
           +CTM INL+ + +Q+L C+
Sbjct: 179 SCTMSINLTISAIQDLDCQ 197


>gb|EOY24871.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 215

 Score =  127 bits (319), Expect = 5e-27
 Identities = 82/222 (36%), Positives = 127/222 (57%), Gaps = 18/222 (8%)
 Frame = -3

Query: 779 MAEKYQQQTQGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXX 600
           MAEK QQ    +PLAP+   PRSDEE A+    QS+E +K+KKR+K              
Sbjct: 1   MAEKDQQV---HPLAPANGHPRSDEESAS---LQSKE-LKRKKRIKYAVYIAAFAVFQTV 53

Query: 599 XXXXXXXIFVRVRTPRVRLDDITV-------TSDANTGNVRFTGRVSVRNRNFGRYEFDS 441
                    +RV+ P+VR+  +TV       T  A + N+RF  +V+V+N NFG Y+FD+
Sbjct: 54  VILIFALTVMRVKNPKVRIGKVTVETMETSNTEAAASFNLRFITQVTVKNTNFGHYKFDN 113

Query: 440 SLATIRSGNTNVGQFVIHDARARARSTRRIYVIADLRVPG-----------TNSSVLDLN 294
           +  +       VG+ +I  ARARARST+++ V  ++                +SSVL LN
Sbjct: 114 ATMSFLYDGVMVGEAIIPKARARARSTKKLDVTVEVNSSALTSTTTGLGSELSSSVLTLN 173

Query: 293 VEARLRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
            +A+L+GKV L++V+++ +S +MNCT+  N+ST  +Q+L+C+
Sbjct: 174 SQAKLKGKVELMKVMKKKKSPEMNCTLIFNVSTRSLQDLKCK 215


>ref|XP_004298842.1| PREDICTED: uncharacterized protein LOC101295333 [Fragaria vesca
           subsp. vesca]
          Length = 200

 Score =  116 bits (291), Expect = 8e-24
 Identities = 72/209 (34%), Positives = 113/209 (54%), Gaps = 5/209 (2%)
 Frame = -3

Query: 779 MAEKYQQQTQGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXX 600
           MA+K+QQ    YPLAPS    RSD E        S++++K+KKR+KC             
Sbjct: 1   MADKHQQV---YPLAPSNGYTRSDGESL------SEDELKRKKRIKCFAYIGIFIVFQMA 51

Query: 599 XXXXXXXIFVRVRTPRVRLDDIT----VTSDANTGNVRFTGRVSVRNRNFGRYEFDSSLA 432
                    ++V+TP+VRLD  +    VTS   + +  F  ++ V+N N+G Y+FD  + 
Sbjct: 52  VGAVFGLTVLKVKTPKVRLDTTSTLSGVTSSTTSFSSTFNTQIRVKNTNWGPYKFDEGVV 111

Query: 431 TIRSGNTNVGQFVIHDARARARSTRRIYVIADLRVPGTNSS-VLDLNVEARLRGKVRLVR 255
           T +   T VG F +   +A  R T++I     L     NSS  L L  EA+L GKV L+ 
Sbjct: 112 TFKYQGTPVGTFTVPKGKAGMRGTKKIDASVSLNTAALNSSGELTLTSEAKLTGKVTLMF 171

Query: 254 VIRRNRSADMNCTMRINLSTNVVQNLRCE 168
           ++++ +SA MNCT++I++S   V+++ C+
Sbjct: 172 IMKKKKSASMNCTIQIDVSGQTVKSVVCK 200


>gb|EOY13741.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 259

 Score =  112 bits (279), Expect = 2e-22
 Identities = 61/187 (32%), Positives = 105/187 (56%), Gaps = 17/187 (9%)
 Frame = -3

Query: 680 QSQEQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXIFVRVRTPRVRL-----DDITVTSDA 516
           +  +++K+KKRMKCL                     +R++ P+ R+     DD+T  + +
Sbjct: 11  EQSKELKRKKRMKCLAYVAAFVIFQTAIILVFALTVMRIKNPKFRIRSVLVDDLTFNNSS 70

Query: 515 NTGNVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNVGQFVIHD--ARARARSTRRIYVI 342
            + N++F  +V+V+N NFG Y+F++S  T     + VG+ ++    ARARARST+++ V 
Sbjct: 71  PSFNMKFIAQVTVKNTNFGHYKFENSTVTFAYKGSQVGEALVTKGRARARARSTKKMNVT 130

Query: 341 ADLRVPGT----------NSSVLDLNVEARLRGKVRLVRVIRRNRSADMNCTMRINLSTN 192
            DL   G           NS  L L  ++ L GKV L++VI++ +S +MNCTM +NL+  
Sbjct: 131 MDLNSNGVANDSDLGSDLNSGFLTLTSQSILNGKVHLMKVIKKKKSVEMNCTMTVNLAQK 190

Query: 191 VVQNLRC 171
           +V++++C
Sbjct: 191 LVRDIKC 197


>ref|XP_004300829.1| PREDICTED: uncharacterized protein LOC101295918 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  111 bits (277), Expect = 3e-22
 Identities = 72/217 (33%), Positives = 109/217 (50%), Gaps = 13/217 (5%)
 Frame = -3

Query: 779 MAEKYQQQTQGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXX 600
           MAEK Q+  Q YPLA      RSD E        S++++K+KKR+KC             
Sbjct: 1   MAEKSQKTHQTYPLASENGYTRSDGESL------SEDELKRKKRIKCFAYIGIFIVFQMA 54

Query: 599 XXXXXXXIFVRVRTPRVRLDDIT---VTSDANTGNVRFTGRVSVRNRNFGRYEFDSSLAT 429
                    ++V+TP+VRL   T   VTS   + +  F  ++ V+N N+G Y+FD  + T
Sbjct: 55  IGAVFGLTVLKVKTPKVRLGTSTLSDVTSSTTSFSSTFNTQIRVKNTNWGPYKFDQGVVT 114

Query: 428 IRSGNTNVGQFVIHDARARARSTRRIYVIADLRVPGTNSS----------VLDLNVEARL 279
                  VG  V+   +A  R T++I V   L      SS          VL L  EA+L
Sbjct: 115 FMYQGAPVGTVVVPKGKAGMRGTKKINVNVSLNTAALPSSSTLSSELSGGVLTLTSEAKL 174

Query: 278 RGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
            GKV L+ ++++ +SA MNCT++I++S   V++L C+
Sbjct: 175 TGKVELMLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>ref|XP_004300831.1| PREDICTED: uncharacterized protein LOC101296490 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  109 bits (272), Expect = 1e-21
 Identities = 75/219 (34%), Positives = 115/219 (52%), Gaps = 15/219 (6%)
 Frame = -3

Query: 779 MAEKYQQQTQGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXX 600
           MAEK     Q YPLAP+    RSD E     +  S++++K++KR K              
Sbjct: 1   MAEK---TNQAYPLAPANGYTRSDGE-----SLVSEDELKRQKRRKLFMYIGIFIVVQII 52

Query: 599 XXXXXXXIFVRVRTPRVRLDDITVTS-----DANTGNVRFTGRVSVRNRNFGRYEFDSSL 435
                    ++V+TP+VRL  I V S        + +  FT ++ V+N N+G Y+FD+S 
Sbjct: 53  VMTVFGLTVMKVKTPKVRLGGINVQSLNSVPATPSFDTSFTTQIRVKNTNWGPYKFDAST 112

Query: 434 ATIRSGNTNVGQFVIHDARARARSTRRIYVIADLRVP--------GT--NSSVLDLNVEA 285
           AT       VGQ  I  ++AR RST++I V   L           GT  NS +L L  +A
Sbjct: 113 ATFMYQGVAVGQVSIPKSKARMRSTKKISVSVILNTNALPSSSTIGTELNSGILTLTSQA 172

Query: 284 RLRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
           +L GKV L+ ++++ +SA M+CT+  +LST  V++L+C+
Sbjct: 173 KLTGKVELMLIMKKKKSATMDCTIAFDLSTKTVKSLQCK 211


>gb|EOY23369.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 201

 Score =  108 bits (270), Expect = 2e-21
 Identities = 59/192 (30%), Positives = 101/192 (52%), Gaps = 17/192 (8%)
 Frame = -3

Query: 692 NTNFQSQEQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXIFVRVRTPRVRLDDITVTSDAN 513
           N + +S  ++K+KKRMK                       +R++ P+ R+  ITV   A 
Sbjct: 10  NIDMESAAELKRKKRMKLFAYAAAFVVFQTIVILVFSLTVMRIKNPKFRVRSITVEDIAY 69

Query: 512 TG-------NVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNVGQFVIHDARARARSTRR 354
           T        N++F   V+V+N NFG ++FD++  +   G   VG+  +   RA+ARST++
Sbjct: 70  TSTPNPPSFNMKFNAEVAVKNTNFGHFKFDNTTISFDYGGVQVGEAFVAKGRAKARSTKK 129

Query: 353 IYVIADL---RVPGT-------NSSVLDLNVEARLRGKVRLVRVIRRNRSADMNCTMRIN 204
           + V  DL    +P         +S  L L    +L GKV L+++I++ +SA MNCTM +N
Sbjct: 130 MNVTVDLNSNNIPANSNLASDISSGFLTLTTHTKLSGKVHLMKLIKKKKSAQMNCTMTVN 189

Query: 203 LSTNVVQNLRCE 168
           L++  +Q+++C+
Sbjct: 190 LASRAIQDIKCQ 201


>ref|XP_004298133.1| PREDICTED: uncharacterized protein LOC101293877 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  106 bits (264), Expect = 1e-20
 Identities = 68/211 (32%), Positives = 105/211 (49%), Gaps = 16/211 (7%)
 Frame = -3

Query: 752 QGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXIF 573
           Q YPLAPS    RSD E        S++++K+KKR+KC                      
Sbjct: 7   QSYPLAPSNGYTRSDGESL------SEDELKRKKRIKCFAYIGIFIVFQIAVMTVFGLTI 60

Query: 572 VRVRTPRVRLD-----DITVTSDANTGNVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTN 408
           ++V+TP+VRL      D T +  A + +  F  ++ V+N N+G Y+FD  + T       
Sbjct: 61  MKVKTPKVRLGTSTLTDFTSSDTAPSFDTTFNTQIRVKNTNWGPYKFDQGVVTFMYQGMP 120

Query: 407 VGQFVIHDARARARSTRRIYVIADLRVPGTNSS-----------VLDLNVEARLRGKVRL 261
           VG  V+   +A  R T++I V   L      SS           VL L  EA+L GKV L
Sbjct: 121 VGTVVVPKGKAGMRGTKKINVNVRLNTAALPSSSSTLSTELSGGVLTLTSEAKLTGKVEL 180

Query: 260 VRVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
           + ++++ +SA MNCT++I++S   V++L C+
Sbjct: 181 MLIMKKKKSASMNCTIQIDVSGKTVKSLECK 211


>gb|EOY23370.1| Uncharacterized protein TCM_015287 [Theobroma cacao]
          Length = 214

 Score =  105 bits (262), Expect = 2e-20
 Identities = 69/210 (32%), Positives = 108/210 (51%), Gaps = 17/210 (8%)
 Frame = -3

Query: 746 YPLAPSTIV-PRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXIFV 570
           YPL P+     RSDEE          +++KKKKRMKCL                     +
Sbjct: 9   YPLVPAANGHERSDEESVA----AHSKELKKKKRMKCLLYIVLFAVFQTGIILLFALTVM 64

Query: 569 RVRTP--RVRLDDITV----TSDANTGNVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTN 408
           R+R P  RVR    T     T  + + +++   + +V+N NFG ++++  L T     T 
Sbjct: 65  RIRNPKFRVRSGSFTTFNVGTEASPSFDLQMNTQFTVKNTNFGHFKYEGGLVTFAYRGTP 124

Query: 407 VGQFVIHDARARARSTRRIYVIADLR---VPGTN-------SSVLDLNVEARLRGKVRLV 258
           VG+  I  ARARARST+++ V+ +L    +P TN       + VL L   ++L GK+ L+
Sbjct: 125 VGRATIQKARARARSTKKVDVVVELSSNGLPNTNELGRDISAGVLTLTSSSKLDGKIHLM 184

Query: 257 RVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
           +VI++ +S  MNCTM + + T  V+N+ C+
Sbjct: 185 KVIKKKKSTQMNCTMDVAIDTRTVRNIICK 214


>ref|XP_004309173.1| PREDICTED: uncharacterized protein LOC101303177 [Fragaria vesca
           subsp. vesca]
          Length = 213

 Score =  105 bits (261), Expect = 2e-20
 Identities = 65/216 (30%), Positives = 112/216 (51%), Gaps = 13/216 (6%)
 Frame = -3

Query: 779 MAEKYQQQTQGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXX 600
           MAE+ Q+     P A    + RSD E   ++   S  +++KKKR+KCL            
Sbjct: 1   MAERNQEAYPFAPYANGQAMARSDAE---SSRAHSDHELRKKKRIKCLIYIAVFAVFQII 57

Query: 599 XXXXXXXIFVRVRTPRVRLDDITVTSDANTGN-------VRFTGRVSVRNRNFGRYEFDS 441
                    +++++P+ R+  ITV  D  T N       + F   VSV+N NFGRY++D 
Sbjct: 58  VITVFALTVMKIKSPKFRIKSITV-QDLTTSNSANPSLSMSFVAEVSVKNPNFGRYKYDQ 116

Query: 440 SLATIRSGNTNVGQFVIHDARARARSTRRIYVIADLRVPGTN------SSVLDLNVEARL 279
           +  +     T VG  V+  A AR ++TR+  V   ++   +N      +  + L+  +++
Sbjct: 117 TSISFIYEGTQVGDAVVPKATARTKATRKEIVSGAVKTVNSNLASDISAGSVTLSTYSKI 176

Query: 278 RGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRC 171
            GKV L+ +I++ +SA+M CTM ++LS+  VQ+++C
Sbjct: 177 NGKVYLMNMIKKKKSAEMKCTMVVHLSSKQVQDIKC 212


>ref|XP_004300830.1| PREDICTED: uncharacterized protein LOC101296206 [Fragaria vesca
           subsp. vesca]
          Length = 211

 Score =  105 bits (261), Expect = 2e-20
 Identities = 70/220 (31%), Positives = 114/220 (51%), Gaps = 16/220 (7%)
 Frame = -3

Query: 779 MAEKYQQQTQGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXX 600
           MAEK  Q    YPLAP+    RSD E     +  S++++K++KR++              
Sbjct: 1   MAEKTHQ---AYPLAPANGYTRSDGE-----SLVSKDELKRRKRIRLFTYIGIFIVFQII 52

Query: 599 XXXXXXXIFVRVRTPRVRLDDITVTSDANTG------NVRFTGRVSVRNRNFGRYEFDSS 438
                    ++V+TP+VRL +I V  D N+       +  FT ++ V+N N+G Y+FD+S
Sbjct: 53  VMTVFGLTVMKVKTPKVRLGEINV-QDFNSVPATPSFDTTFTTQIRVKNTNWGPYKFDAS 111

Query: 437 LATIRSGNTNVGQFVIHDARARARSTRRIYVIADLRVPGT----------NSSVLDLNVE 288
             T       VGQ  +   +A  RST+++ V   L   G           NS VL LN +
Sbjct: 112 TVTFMYQGVAVGQVTVPKGKAGMRSTKKMNVEVSLNANGLPSSSNLGSELNSGVLTLNSQ 171

Query: 287 ARLRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
           A+L GKV L+ ++++ +S+ M+C +  +LST  V++L+C+
Sbjct: 172 AKLSGKVELMLIMKKKKSSTMDCMIGFDLSTKTVKSLQCK 211


>gb|EXC34336.1| hypothetical protein L484_006691 [Morus notabilis]
          Length = 212

 Score =  103 bits (258), Expect = 5e-20
 Identities = 73/220 (33%), Positives = 119/220 (54%), Gaps = 17/220 (7%)
 Frame = -3

Query: 779 MAEKYQQQTQGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXX 600
           MAE+YQQ    YPLAP+   PRSDEE   ++N  ++E +K++KR+K              
Sbjct: 1   MAERYQQV---YPLAPANGHPRSDEE---SSNLDAKE-LKRRKRIKLAIYAFIFTASQII 53

Query: 599 XXXXXXXIFVRVRTPRVRLDDITV--TSDANTGN-----VRFTGRVSVRNRNFGRYEFDS 441
                  + +RV++P++RL D     T + N+G+     + FT ++ V+N N+G Y+FD+
Sbjct: 54  VTLVFVLVVMRVKSPKLRLSDKFEFQTIETNSGSKPSFDISFTTQLRVKNTNWGPYKFDN 113

Query: 440 SLATIRSGNTNVGQFVIHDARARARSTRRIYVIADLRVP----GTN------SSVLDLNV 291
           + A        VGQ VI   +A  RST+++ V   L        TN        +L L  
Sbjct: 114 TTAAFAYEGETVGQVVIPKGKAGMRSTKKVPVSVSLSSSQLKNNTNLGSELSGGILTLRC 173

Query: 290 EARLRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRC 171
            A++ GKV+L+ ++++ +SA+MNCT+ I++    V NL+C
Sbjct: 174 TAKMTGKVKLMLIMKKKKSANMNCTINIHVKEKTV-NLKC 212


>gb|EOY24872.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 213

 Score =  103 bits (257), Expect = 7e-20
 Identities = 71/215 (33%), Positives = 105/215 (48%), Gaps = 16/215 (7%)
 Frame = -3

Query: 764 QQQTQGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXXXXXXXXX 585
           Q+  Q  PLAP    PRSD E+       SQ   +K+K  KCL                 
Sbjct: 2   QEDPQAKPLAPVEYYPRSDMEFGGIKPTASQ---RKEKSSKCLVYVLVGMVIQGAVLLIF 58

Query: 584 XXIFVRVRTPRVRLDDITV------TSDANTGNVRFTGRVSVRNRNFGRYEFDSSLATIR 423
             I +R RTP V +  +TV       S A + N+     V+V N NFG ++F+++  T+ 
Sbjct: 59  ASIVLRARTPDVEIVSVTVRNLKYGNSSAPSFNLTLVTEVTVENSNFGDFKFENTTGTVW 118

Query: 422 SGNTNVGQFVIHDARARARSTRRIYVIAD---LRVPGT-------NSSVLDLNVEARLRG 273
            G+  VG+  I   RA+AR+T R+ V  D   L +P T       +S +L+LN   +L G
Sbjct: 119 CGSVVVGKMKIPTGRAQARATERLNVSVDVSSLPLPDTKNVSCNISSGLLELNSHVKLSG 178

Query: 272 KVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
           KV ++  ++R R  +MNC M +NL+    Q+  CE
Sbjct: 179 KVSIMNFMKRRRHPEMNCFMTLNLTGQTKQDFPCE 213


>emb|CAN79447.1| hypothetical protein VITISV_037464 [Vitis vinifera]
          Length = 186

 Score =  103 bits (257), Expect = 7e-20
 Identities = 59/179 (32%), Positives = 103/179 (57%), Gaps = 13/179 (7%)
 Frame = -3

Query: 665 MKKKKRMKCLXXXXXXXXXXXXXXXXXXXIFVRVRTPRVRLDDITVTSD---ANTGNVRF 495
           +++KK +KCL                   + +++R P+VR+  I+V +     N+ ++  
Sbjct: 8   VRRKKSLKCLAYVAAFVVFQTGIILLFVLLVLKIRDPKVRIASISVENQHFSTNSFSMDL 67

Query: 494 TGRVSVRNRNFGRYEFDSSLATIRSGNTNVGQFVIHDARARARSTRRIYV---IADLRVP 324
             RV+V+N NFG ++FD+S ATI    T VG+  I  ARAR+RST+R  +   I+  +V 
Sbjct: 68  KARVTVKNTNFGHFKFDNSTATISYFGTAVGEATILKARARSRSTKRFNITVPISSSKVN 127

Query: 323 G-------TNSSVLDLNVEARLRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
                    NS VL+L+  A+L GK+ L ++ ++ +SA+M+CTM ++ +T+ ++NL C+
Sbjct: 128 NHRQLRRDLNSGVLNLSSTAKLSGKIHLFKIFKKKKSAEMSCTMELHTNTSSIENLSCK 186


>gb|EOY24870.1| Late embryogenesis abundant hydroxyproline-rich glycofamily
           protein, putative [Theobroma cacao]
          Length = 188

 Score =  103 bits (256), Expect = 9e-20
 Identities = 61/184 (33%), Positives = 98/184 (53%), Gaps = 16/184 (8%)
 Frame = -3

Query: 671 EQMKKKKRMKCLXXXXXXXXXXXXXXXXXXXIFVRVRTPRVRLDDITVTS-DANTG---- 507
           E+ K+ + MKC                      +R++TP  RL  +TV S + N      
Sbjct: 5   EKYKRMQNMKCYAYIIAGVVFQTIIILVFALTVMRIKTPSARLRSVTVQSLNYNASGVPH 64

Query: 506 -NVRFTGRVSVRNRNFGRYEFDSSLATIRSGNTNVGQFVIHDARARARSTRRIYVIADLR 330
            N+R    ++V+N+NFG + FD++ A +  G+  VG   I  +RARAR T+R+ V  D+ 
Sbjct: 65  FNMRLIMEIAVKNKNFGHFRFDNTTANVTFGSVMVGDGEIVKSRARARKTKRMNVTVDVS 124

Query: 329 VPGTN----------SSVLDLNVEARLRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQN 180
               +          S  L L   ARLRGKV L++++++ ++A+MNCTM +NL+++ VQ+
Sbjct: 125 SSAVSDEDELRTKLSSGTLTLTGVARLRGKVTLMKLMKKRKTAEMNCTMTVNLNSHAVQD 184

Query: 179 LRCE 168
           L CE
Sbjct: 185 LDCE 188


>ref|XP_004300828.1| PREDICTED: uncharacterized protein LOC101295630 [Fragaria vesca
           subsp. vesca]
          Length = 212

 Score =  101 bits (251), Expect = 3e-19
 Identities = 67/222 (30%), Positives = 112/222 (50%), Gaps = 18/222 (8%)
 Frame = -3

Query: 779 MAEKYQQ---QTQGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXX 609
           MAEK++      +GY         + DE+  T   FQS+E++K++KR+K           
Sbjct: 1   MAEKFKHALASVKGY-------ATKKDEQLPT---FQSEEELKRQKRIKLFTYIGIFIGF 50

Query: 608 XXXXXXXXXXIFVRVRTPRVRL-----DDITVTSDANTGNVRFTGRVSVRNRNFGRYEFD 444
                       ++V+TP+VRL      ++     + + +  F  ++ ++N N+G Y+FD
Sbjct: 51  QIIVMTVFGLTVMKVKTPKVRLGATNVQNLNFVPTSPSFDTTFATQIRIKNTNWGPYKFD 110

Query: 443 SSLATIRSGNTNVGQFVIHDARARARSTRRIYVIADLR---VPGTN-------SSVLDLN 294
           +  AT       VGQ     ++A  RST++I     L    +P T+       S VL L 
Sbjct: 111 AGTATFMYQGVAVGQVSFPKSKAGMRSTKKINAEVSLNSNEIPSTSNLGSELSSGVLTLT 170

Query: 293 VEARLRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
            EA+L GKV L+ ++++ +SA MNCTM+++LST  +Q L C+
Sbjct: 171 SEAKLTGKVELMLIMKKKKSATMNCTMKLDLSTKTIQALECK 212


>ref|XP_004309172.1| PREDICTED: uncharacterized protein LOC101302889 [Fragaria vesca
           subsp. vesca]
          Length = 222

 Score =  100 bits (250), Expect = 5e-19
 Identities = 66/223 (29%), Positives = 114/223 (51%), Gaps = 19/223 (8%)
 Frame = -3

Query: 779 MAE-KYQQQTQGYPLAPST-IVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXXXXXXXX 606
           MAE K    T  YPL PS     RSD+E A +    S E+++ KKRM+CL          
Sbjct: 1   MAENKEAAATSPYPLMPSAPSYMRSDQEAAASAP-PSAEELRHKKRMRCLLYVSIFAVFQ 59

Query: 605 XXXXXXXXXIFVRVRTPRVRLDDITVT-----SDANTG-NVRFTGRVSVRNRNFGRYEFD 444
                      +++++P+ R+   ++T     S +N   N+       V+N NFG +E++
Sbjct: 60  VVVITVFALTVMKIKSPKFRVRTASITGFEVGSASNPSFNLEMDVHFGVKNTNFGHFEYE 119

Query: 443 SSLATIRSGNTNVGQFVIHDARARARSTRRIYVIA-DLRVPGT----------NSSVLDL 297
             +      +  +GQ  + + R RARSTR++ V + DL   G           ++ ++ +
Sbjct: 120 DGIVVFTYRDVRIGQTNVEEERVRARSTRKVDVSSVDLTSRGLPANSRLGSDISTGIIPI 179

Query: 296 NVEARLRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
            + ++L GK+ L+++I++ +SA MNCTM + L+T  VQN+ C+
Sbjct: 180 TISSKLDGKIHLMKIIKKKKSAQMNCTMEVVLATKSVQNVVCK 222


>emb|CBI22611.3| unnamed protein product [Vitis vinifera]
          Length = 297

 Score =  100 bits (250), Expect = 5e-19
 Identities = 66/228 (28%), Positives = 112/228 (49%), Gaps = 17/228 (7%)
 Frame = -3

Query: 803 FIKKTYKTMAEKYQQQTQGYPLAPSTIVPRSDEEYATNTNFQSQEQMKKKKRMKCLXXXX 624
           F +   K MA+K QQ    +P+ P+    ++D E          E++++ K  + +    
Sbjct: 81  FSRAKAKKMAQKKQQV---HPIEPTGGPAKTDVE---------SEELRRMKCTRYIAYLS 128

Query: 623 XXXXXXXXXXXXXXXIFVRVRTPRVR-----LDDITVTSDANTG--NVRFTGRVSVRNRN 465
                            +R+R+P+ R     ++++  TSD  +   N+RF  +V+V+N N
Sbjct: 129 AFALFETIVIMVCVVTLMRIRSPKFRFRAVSIENLNYTSDTTSPSFNIRFNAKVAVKNTN 188

Query: 464 FGRYEFDSSLATIRSGNTNVGQFVIHDARARARSTRRIYVIADLR----------VPGTN 315
           FG ++F +S  T+     +VG   I  ARARARST+++ V  D+               N
Sbjct: 189 FGHFKFKNSTITLAYRGDHVGDAKISKARARARSTKKMNVTVDVTSNNVSSNSNLASDIN 248

Query: 314 SSVLDLNVEARLRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRC 171
           S  L L  + +L GKV L++V ++ +S  MNCT++INL   V+Q  +C
Sbjct: 249 SGFLTLTGQGKLNGKVHLMKVFKKKKSPQMNCTIKINLENKVIQEWKC 296


>gb|EXC05942.1| hypothetical protein L484_014210 [Morus notabilis]
          Length = 213

 Score =  100 bits (248), Expect = 8e-19
 Identities = 67/218 (30%), Positives = 115/218 (52%), Gaps = 14/218 (6%)
 Frame = -3

Query: 779 MAEKYQQQTQGYPLAPSTIVPRSDEEYATNTNFQSQ-EQMKKKKRMKCLXXXXXXXXXXX 603
           MAEK  Q     PL   T     D E A++ + +S  ++++ KKRM+ L           
Sbjct: 1   MAEKDDQ-----PLQAETY----DLESASSADHESNAKELQHKKRMRRLGGVTAIVVLLT 51

Query: 602 XXXXXXXXIFVRVRTPRVRL-----DDITVT-SDANTGNV--RFTGRVSVRNRNFGRYEF 447
                     +R++ P +R+     +D+T++ SD N+ ++  +F   + V+N NFG ++F
Sbjct: 52  VVILVFPQTVMRIKGPELRIRSVAIEDLTISNSDTNSPSLSMKFDSEIGVKNTNFGEFKF 111

Query: 446 DSSLATIRSGNTNVGQFVIHDARARARSTRRIYVIADLRVPGT-----NSSVLDLNVEAR 282
           D S  T     T VG   +   +A+ARST+++ V A++           S  L L  +++
Sbjct: 112 DESSITFVYKGTEVGDASVEKGKAKARSTKKMNVTAEVNANSNLANDVRSGFLTLTSQSK 171

Query: 281 LRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRCE 168
           L GKV L++VI++ ++A+MNCT+ INL   VVQ+ +C+
Sbjct: 172 LNGKVHLMKVIKKKKTAEMNCTITINLENKVVQDFKCK 209


>gb|EOY24868.1| Late embryogenesis abundant hydroxyproline-rich glycofamily protein
           [Theobroma cacao]
          Length = 185

 Score = 99.8 bits (247), Expect = 1e-18
 Identities = 62/181 (34%), Positives = 98/181 (54%), Gaps = 17/181 (9%)
 Frame = -3

Query: 659 KKKRMKCLXXXXXXXXXXXXXXXXXXXIFVRVRTPRVRLDDITVTSDANTGN-------V 501
           K+   KCL                     +R++ P+VR   +TV  + +TGN       +
Sbjct: 6   KRSNAKCLAYVAVFVVFQTAIILIFALTVMRIKNPKVRFGAVTV-ENFSTGNSSSPFFDM 64

Query: 500 RFTGRVSVRNRNFGRYEFDSSLATIRSGNTNVGQFVIHDARARARSTRRIYVIADLRVP- 324
           R   +V+V+N NFG +++++S   I  G   VG+  I  ARARAR T++  V  D+    
Sbjct: 65  RLMAQVTVKNTNFGHFKYENSSIRILYGGMPVGEATIVKARARARQTKKFDVTIDISSSK 124

Query: 323 -GTNSS--------VLDLNVEARLRGKVRLVRVIRRNRSADMNCTMRINLSTNVVQNLRC 171
             TNS+        VL L+ EA+L GKV L++VI++ +S++M+CTM IN+ T  VQ+L+C
Sbjct: 125 LSTNSNLGNDIASGVLPLSSEAKLSGKVHLMKVIKKKKSSEMSCTMGINIGTRTVQDLKC 184

Query: 170 E 168
           +
Sbjct: 185 K 185


Top