BLASTX nr result

ID: Akebia23_contig00015735 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00015735
         (961 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002280320.1| PREDICTED: uncharacterized protein LOC100260...   246   8e-63
ref|XP_007051635.1| Late embryogenesis abundant hydroxyproline-r...   230   6e-58
gb|EXC22514.1| hypothetical protein L484_003064 [Morus notabilis]     219   2e-54
ref|XP_002511957.1| conserved hypothetical protein [Ricinus comm...   209   1e-51
ref|XP_006444894.1| hypothetical protein CICLE_v10021837mg [Citr...   207   4e-51
ref|XP_002301395.2| hypothetical protein POPTR_0002s16890g [Popu...   200   8e-49
ref|XP_002320178.2| hypothetical protein POPTR_0014s09010g [Popu...   191   5e-46
ref|XP_002880232.1| hypothetical protein ARALYDRAFT_483780 [Arab...   174   4e-41
ref|XP_006397816.1| hypothetical protein EUTSA_v10001596mg [Eutr...   172   2e-40
ref|XP_004133831.1| PREDICTED: uncharacterized protein LOC101214...   171   3e-40
ref|XP_006577337.1| PREDICTED: uncharacterized protein LOC102670...   169   2e-39
ref|NP_182153.2| late embryogenesis abundant hydroxyproline-rich...   167   6e-39
dbj|BAC41966.1| unknown protein [Arabidopsis thaliana]                167   6e-39
gb|ABK28538.1| unknown [Arabidopsis thaliana]                         167   6e-39
ref|XP_006295826.1| hypothetical protein CARUB_v10024952mg [Caps...   167   8e-39
ref|XP_006577338.1| PREDICTED: uncharacterized protein LOC102670...   166   2e-38
gb|AAC62876.1| hypothetical protein [Arabidopsis thaliana] gi|22...   166   2e-38
ref|XP_004140888.1| PREDICTED: uncharacterized protein LOC101205...   164   6e-38
ref|XP_004160824.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   160   5e-37
ref|XP_007220159.1| hypothetical protein PRUPE_ppa018680mg [Prun...   159   1e-36

>ref|XP_002280320.1| PREDICTED: uncharacterized protein LOC100260268 [Vitis vinifera]
          Length = 246

 Score =  246 bits (629), Expect = 8e-63
 Identities = 127/247 (51%), Positives = 163/247 (65%), Gaps = 8/247 (3%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNAPVSLA-----RKPPIPRSFHPKPKRKSXXXXXXXXXXXX 285
           M EP  KP+LQKPPGYRDPNAPV L      RKP +P +F  K +R+S            
Sbjct: 1   MAEPP-KPVLQKPPGYRDPNAPVRLPPKPGLRKPILPSTFPVKRRRRSCCRIFCCFFCIF 59

Query: 286 XXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVRVQARNP 465
                    L G+ FYLWF PK P FH+QS+K  RFNVTVK D T++ SQT V+V+ RNP
Sbjct: 60  SFILIIILFLGGAFFYLWFNPKVPVFHLQSLKIQRFNVTVKSDATYIDSQTAVKVEVRNP 119

Query: 466 NEKITFYYDRIHVRMTA---VDDVDLGSGSIPAFNQGKKNTTVLKFGTQVKNMLIDDDVG 636
           N+KITF Y +  V +TA    D+ +LGSGS   F QGKK+TTV+K+    KN+L+ D+VG
Sbjct: 120 NDKITFRYGKTSVTLTAGLGEDETELGSGSSGEFTQGKKSTTVVKWTVHEKNVLVADEVG 179

Query: 637 TKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPKCTISL 816
            KLKAR+RS+ +++SV VRT+  LG+  WR+G V + I C  V+LKKLDG   PKCT+++
Sbjct: 180 AKLKARYRSRAMVVSVVVRTRVSLGVGGWRIGTVGMHISCGDVALKKLDGGDMPKCTVNV 239

Query: 817 LKWINIH 837
           LKWIN H
Sbjct: 240 LKWINFH 246


>ref|XP_007051635.1| Late embryogenesis abundant hydroxyproline-rich glycoprotein
           family, putative [Theobroma cacao]
           gi|508703896|gb|EOX95792.1| Late embryogenesis abundant
           hydroxyproline-rich glycoprotein family, putative
           [Theobroma cacao]
          Length = 280

 Score =  230 bits (587), Expect = 6e-58
 Identities = 112/244 (45%), Positives = 156/244 (63%), Gaps = 9/244 (3%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNAPVSLA------RKPPIPRSFHPKPKRKSXXXXXXXXXXX 282
           M EP  KP+LQKPPGY+DP+AP          RKP +P SFHPK +R             
Sbjct: 1   MPEPPLKPVLQKPPGYKDPSAPAVKPGFRPPPRKPVLPPSFHPKKRRGGCCRVCCCCFCI 60

Query: 283 XXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVRVQARN 462
                     + G++FYLWF+PK P FH+QS++ SRFNVT K DGT+L +QT  R++ +N
Sbjct: 61  FFLILILLLLICGAVFYLWFDPKLPGFHVQSVRISRFNVTNKPDGTYLDAQTTTRLEVKN 120

Query: 463 PNEKITFYYDRIHVRMTAV---DDVDLGSGSIPAFNQGKKNTTVLKFGTQVKNMLIDDDV 633
           PN K+T+YY    V ++     D+ +LG+ ++  F  GK+NTT LK  T+V N L+DD V
Sbjct: 121 PNAKMTYYYGNTEVDVSVGEGGDETELGTTTVHGFTMGKQNTTSLKVETKVINKLVDDGV 180

Query: 634 GTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPKCTIS 813
           GT+L+AR+RSK + +SVE RTK  LG+   ++G V V + C+G++LK+LDG   PKC I+
Sbjct: 181 GTRLQARYRSKSLRVSVEARTKIGLGVAGLKIGMVGVTVKCDGIALKRLDGGDMPKCVIN 240

Query: 814 LLKW 825
           +LKW
Sbjct: 241 MLKW 244


>gb|EXC22514.1| hypothetical protein L484_003064 [Morus notabilis]
          Length = 281

 Score =  219 bits (557), Expect = 2e-54
 Identities = 106/241 (43%), Positives = 150/241 (62%), Gaps = 7/241 (2%)
 Frame = +1

Query: 127 EPTRKPILQKPPGYRDPNAPVSLARKPP-----IPRSFHPKPKRKSXXXXXXXXXXXXXX 291
           +P + P LQKPPGYRDP AP     +PP     +P SFHP+ +R++              
Sbjct: 4   QPLKPPPLQKPPGYRDPAAPGKPVARPPQRKPVLPASFHPRKRRRNWCRTCCCFVFVFLL 63

Query: 292 XXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVRVQARNPNE 471
                 A+AG +FYLWFEPK P FH+QS++  +FNVTVK DGT+L + T+ R++ +NPN 
Sbjct: 64  LLTLAVAIAGGIFYLWFEPKLPVFHLQSLRIPQFNVTVKPDGTYLDAGTVTRIEVKNPNG 123

Query: 472 KITFYYDRIHVRMTAVDDVD--LGSGSIPAFNQGKKNTTVLKFGTQVKNMLIDDDVGTKL 645
           K+  YY   HV ++  +D D  LG   +  F QGK+NTT LK  T VKN L+DD +G +L
Sbjct: 124 KLELYYGGTHVEVSVGEDEDAELGRKDLEGFTQGKENTTSLKVETTVKNQLVDDGLGKRL 183

Query: 646 KARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPKCTISLLKW 825
           K+ ++SK++++ +E +T     +Q  ++G V V + C GVSLKKLD    PKC+I LLKW
Sbjct: 184 KSGYKSKDLVVKIEAKTSVGYIVQGVKIGTVEVGVLCGGVSLKKLDSGDMPKCSIDLLKW 243

Query: 826 I 828
           +
Sbjct: 244 V 244


>ref|XP_002511957.1| conserved hypothetical protein [Ricinus communis]
           gi|223549137|gb|EEF50626.1| conserved hypothetical
           protein [Ricinus communis]
          Length = 243

 Score =  209 bits (532), Expect = 1e-51
 Identities = 105/239 (43%), Positives = 147/239 (61%), Gaps = 6/239 (2%)
 Frame = +1

Query: 127 EPTRKPILQKPPGYRDPNAPVSLA----RKPPIPRSFHPKPKRKSXXXXXXXXXXXXXXX 294
           E   KPILQKPPG+RDP+ PV       RK  +P SF P+ +RK+               
Sbjct: 5   EQAMKPILQKPPGFRDPSKPVPRPPPPLRKAALPPSFQPRKRRKNYGGMCCRILVIISFT 64

Query: 295 XXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVRVQARNPNEK 474
                 + G +FYLWF+PK P FH+QS K S F VT K DGT+L + T+ RV+ RNPN K
Sbjct: 65  VLLILFILGGVFYLWFDPKLPVFHLQSFKISSFRVTTKPDGTYLNAATVARVEVRNPNSK 124

Query: 475 ITFYYDRIHVRMTAVDD--VDLGSGSIPAFNQGKKNTTVLKFGTQVKNMLIDDDVGTKLK 648
           +T+ Y    V+MT   D    LGS S+P F Q KKNTT  K    VKN LI+D VG++LK
Sbjct: 125 LTYRYSESQVQMTLGQDQGTQLGSMSLPGFLQDKKNTTSFKIQMSVKNELIEDGVGSRLK 184

Query: 649 ARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPKCTISLLKW 825
           ++F+S++++++V+V TK  + +Q   +G + V + C+G++LK++DG   PKC+I  LKW
Sbjct: 185 SQFKSRKLVVNVQVTTKVGVDVQGLEIGMLGVDVSCDGITLKQIDGDDMPKCSIHTLKW 243


>ref|XP_006444894.1| hypothetical protein CICLE_v10021837mg [Citrus clementina]
           gi|568876318|ref|XP_006491228.1| PREDICTED:
           uncharacterized protein LOC102608686 [Citrus sinensis]
           gi|557547156|gb|ESR58134.1| hypothetical protein
           CICLE_v10021837mg [Citrus clementina]
          Length = 251

 Score =  207 bits (528), Expect = 4e-51
 Identities = 111/253 (43%), Positives = 152/253 (60%), Gaps = 8/253 (3%)
 Frame = +1

Query: 103 TNPQFKMTEPTRKPILQKPPGYRDPNAP------VSLARKPPIPRSFHPKPKRKSXXXXX 264
           + P  ++     KPILQKPPGYRDPN P      +   RKPPIP SF  K +RKS     
Sbjct: 2   STPTPRIAGQQAKPILQKPPGYRDPNGPQARPRPIGPPRKPPIPPSFPAKRRRKSCCRVC 61

Query: 265 XXXXXXXXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIV 444
                           +AG+LFYLWF+PK P FH+QS  F  FNV+VK DGT+L + T+ 
Sbjct: 62  CCCFCFFIVLLIILIVIAGALFYLWFDPKLPVFHLQSFSFRHFNVSVKSDGTYLHAATLT 121

Query: 445 RVQARNPNEKITFYYDRIHVRMTAVDD--VDLGSGSIPAFNQGKKNTTVLKFGTQVKNML 618
           RV+ARNPN K+ +YY    V +TA  D  +DLG+GS+P F QG KN   LK  T+  + L
Sbjct: 122 RVEARNPNGKLRYYYGHTDVEVTAGKDKEIDLGTGSVPGFTQGTKNARSLKIETKT-DEL 180

Query: 619 IDDDVGTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPP 798
           ++D +G +L +  +SK+++++V V+T   + +Q  +   + V++ C G SLK LD    P
Sbjct: 181 VEDGMGPRLMSHHKSKDLVVNVVVKTTVAVIVQGRKTRPLAVKVTCGGQSLKALD--KMP 238

Query: 799 KCTISLLKWINIH 837
           KCTI  LKWI+IH
Sbjct: 239 KCTIHFLKWISIH 251


>ref|XP_002301395.2| hypothetical protein POPTR_0002s16890g [Populus trichocarpa]
           gi|550345181|gb|EEE80668.2| hypothetical protein
           POPTR_0002s16890g [Populus trichocarpa]
          Length = 244

 Score =  200 bits (508), Expect = 8e-49
 Identities = 102/238 (42%), Positives = 145/238 (60%), Gaps = 6/238 (2%)
 Frame = +1

Query: 139 KPILQKPPGYRDPN-----APVSLARKPPIPRSFHPKPKRKSXXXXXXXXXXXXXXXXXX 303
           KP+LQ+PPGY DPN     AP  L  K  +P SF P+ +R                    
Sbjct: 7   KPVLQRPPGYTDPNLQAKPAPRPLPTKALLPPSFEPRKRRSRHCRLCLCCLSLLLIIAIL 66

Query: 304 XXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVRVQARNPNEKITF 483
              +AG LFYLWF+PK P FH+QS KFS FN+T + DGT+LT++ + R++ RNPNE I +
Sbjct: 67  LMIIAGGLFYLWFDPKLPVFHLQSFKFSAFNITKRSDGTYLTAKMVARIEVRNPNENIIY 126

Query: 484 YYDRIHVRMTAVDD-VDLGSGSIPAFNQGKKNTTVLKFGTQVKNMLIDDDVGTKLKARFR 660
           ++    V  TA DD V+LGS ++P F QGKKNTT L+  T V N LI+D +G+K+  +F 
Sbjct: 127 HFGESKVETTAGDDEVNLGSTTLPEFTQGKKNTTSLEIETSVNNELIEDGIGSKILDQFT 186

Query: 661 SKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPKCTISLLKWINI 834
           SK++ + ++V+T   +G++  + G + V + C GV+LK+      P+C IS LKWI I
Sbjct: 187 SKKLKVDMDVKTSIGIGVEGVKTGLLGVEVVCGGVTLKE-TSTEMPRCIISTLKWIII 243


>ref|XP_002320178.2| hypothetical protein POPTR_0014s09010g [Populus trichocarpa]
           gi|550323805|gb|EEE98493.2| hypothetical protein
           POPTR_0014s09010g [Populus trichocarpa]
          Length = 246

 Score =  191 bits (484), Expect = 5e-46
 Identities = 96/245 (39%), Positives = 144/245 (58%), Gaps = 7/245 (2%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNAPVSLARKPP-----IPRSFHPKPKR-KSXXXXXXXXXXX 282
           M +   KP+L +PPGYRDPN PV  AR+P      +P SF P+ +R +            
Sbjct: 1   MADKPMKPVLPRPPGYRDPNHPVKSARRPLPTKTLVPHSFQPRKRRSRHWCRLCLCCLIL 60

Query: 283 XXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVRVQARN 462
                     ++G LFY+WFEPK P FH+QS KF  F+VT K DGT+L ++ + R++ RN
Sbjct: 61  LLIIVILLLIISGGLFYIWFEPKLPVFHLQSFKFPTFSVTKKSDGTYLKAKMVARIEVRN 120

Query: 463 PNEKITFYYDRIHVRMTAVDD-VDLGSGSIPAFNQGKKNTTVLKFGTQVKNMLIDDDVGT 639
           PNEKI +++    V  T  D+ V+LGS S+P F Q K N T LK  T V N LI+D +G+
Sbjct: 121 PNEKIIYHFGESKVETTTGDEEVNLGSTSLPKFTQEKGNATSLKTVTNVNNELIEDRIGS 180

Query: 640 KLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPKCTISLL 819
           K+  +F SK++ + + V+T+  +G+   + G +   + C GV+LK+ +    P+C + +L
Sbjct: 181 KILHQFTSKKLKVKMNVKTRVGIGVAGMKTGLLGAEVLCGGVTLKETESGEMPRCVMKIL 240

Query: 820 KWINI 834
           +WI I
Sbjct: 241 QWIII 245


>ref|XP_002880232.1| hypothetical protein ARALYDRAFT_483780 [Arabidopsis lyrata subsp.
           lyrata] gi|297326071|gb|EFH56491.1| hypothetical protein
           ARALYDRAFT_483780 [Arabidopsis lyrata subsp. lyrata]
          Length = 252

 Score =  174 bits (442), Expect = 4e-41
 Identities = 93/252 (36%), Positives = 142/252 (56%), Gaps = 14/252 (5%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNA------PVSLARKP-----PIPRSFHPKPKRKSXXXXXX 267
           M +   KP+LQKPPGYRDPN       P  ++++P     P+P S+ PK KR+S      
Sbjct: 1   MADYQMKPVLQKPPGYRDPNMSSPPPPPPPMSQQPMRKAVPMPTSYRPKKKRRSCCRFCC 60

Query: 268 XXXXXXXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVR 447
                          +  ++FYLWF+PK P F + S +   F +    DG  L++  + R
Sbjct: 61  CCICITLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLADDPDGASLSATAVAR 120

Query: 448 VQARNPNEKITFYYDRIHVRMTAV---DDVDLGSGSIPAFNQGKKNTTVLKFGTQVKNML 618
           V+ +NPN K+ FYY    V M+     D+  +G  ++  F QG KN+T +K  T VKN L
Sbjct: 121 VEMKNPNSKLVFYYGNTAVEMSVGSGNDETGMGETTVNGFRQGPKNSTSVKVETTVKNEL 180

Query: 619 IDDDVGTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPP 798
           ++  +  +L A+F+SK+++I+V  +TK  LG+   ++G + V + C GVSL KLD  S P
Sbjct: 181 VERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLAVNLRCGGVSLNKLDTDS-P 239

Query: 799 KCTISLLKWINI 834
           +CT++ LKW+NI
Sbjct: 240 QCTLNTLKWLNI 251


>ref|XP_006397816.1| hypothetical protein EUTSA_v10001596mg [Eutrema salsugineum]
           gi|557098889|gb|ESQ39269.1| hypothetical protein
           EUTSA_v10001596mg [Eutrema salsugineum]
          Length = 255

 Score =  172 bits (436), Expect = 2e-40
 Identities = 94/255 (36%), Positives = 139/255 (54%), Gaps = 17/255 (6%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNA------PVSLARKPP--------IPRSFHPKPKRKSXXX 258
           M +   +P+LQKPPGYRDPN       P  LA +P         +P SF PK KR+    
Sbjct: 1   MADYPMQPVLQKPPGYRDPNMATPPPPPPPLAARPQHPMRKTAGMPSSFRPKKKRRGCCR 60

Query: 259 XXXXXXXXXXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQT 438
                             +  ++FYLWF+PK P F + S +   F ++   DG  L++  
Sbjct: 61  FFCCCLCITLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLSDDPDGALLSATA 120

Query: 439 IVRVQARNPNEKITFYYDRIHVRMTAV---DDVDLGSGSIPAFNQGKKNTTVLKFGTQVK 609
           + RV+ RNPN K+ FYY    V M+     D+  +G  +I  F QG KN+T +K  T VK
Sbjct: 121 VARVEMRNPNTKLVFYYGNTDVEMSVRSGNDETGMGETTINGFRQGPKNSTSVKVETSVK 180

Query: 610 NMLIDDDVGTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGR 789
           + L++  +  +L A+F+SK+++I+V  +TK  LG+   ++G + V + C GVSL KLD  
Sbjct: 181 SQLVERGLAKRLAAKFQSKDLVINVVAKTKVGLGVSGIKIGMLAVNLRCGGVSLNKLDTD 240

Query: 790 SPPKCTISLLKWINI 834
           S PKC ++ LKW+NI
Sbjct: 241 S-PKCILNTLKWVNI 254


>ref|XP_004133831.1| PREDICTED: uncharacterized protein LOC101214208 [Cucumis sativus]
           gi|449478172|ref|XP_004155241.1| PREDICTED:
           uncharacterized LOC101214208 [Cucumis sativus]
          Length = 246

 Score =  171 bits (434), Expect = 3e-40
 Identities = 92/240 (38%), Positives = 133/240 (55%), Gaps = 8/240 (3%)
 Frame = +1

Query: 130 PTRKPILQKPPGYRDPNAPVSLARKPPIPRSFHPKP------KRKSXXXXXXXXXXXXXX 291
           P  KPILQKPPG++DPN       +PP  +   P P      KR+S              
Sbjct: 5   PPLKPILQKPPGFKDPNHIALPVPRPPARKLILPSPLSQKNKKRRSCWRRCCCFFCLLVL 64

Query: 292 XXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVRVQARNPNE 471
                    G + YLWFEPK P  H+QS + S+FNVT K DG++L ++TI R++ +NPN 
Sbjct: 65  ILIVAILAVGGVLYLWFEPKLPVVHLQSFRISKFNVTDKSDGSYLNAKTIGRIEIKNPNS 124

Query: 472 KITFYYDRIHVRMTAVDDV--DLGSGSIPAFNQGKKNTTVLKFGTQVKNMLIDDDVGTKL 645
           K++  Y  I V++ A +    +LGS  +P+F Q ++NTT LK  T V N  +DD  G  L
Sbjct: 125 KLSLNYGDIEVQIAAGEGTRTELGSMIVPSFIQSEENTTSLKIETMVSNETVDDGAGRNL 184

Query: 646 KARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPKCTISLLKW 825
            +  R+ E++++VE RTK    +   R+  V + + C  VSLK+LD  + PKC+I L +W
Sbjct: 185 NSGNRTGELVVNVEARTKIGFVVDGRRMPPVKIEVSCGSVSLKRLDRGNVPKCSIHLRRW 244


>ref|XP_006577337.1| PREDICTED: uncharacterized protein LOC102670360 isoform X1 [Glycine
           max]
          Length = 248

 Score =  169 bits (427), Expect = 2e-39
 Identities = 92/241 (38%), Positives = 137/241 (56%), Gaps = 3/241 (1%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNA--PVSLARKPPIPRSFHPKPKRKSXXXXXXXXXXXXXXX 294
           M EP  KPILQKPPGYRDP++  P    RKP +P SF PKPKR+S               
Sbjct: 1   MEEP--KPILQKPPGYRDPDSKPPPPPPRKPMLPPSFQPKPKRRSCCRICCCTFCLTILI 58

Query: 295 XXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVRVQARNPNEK 474
                 +A +LFYL ++P  P FH+ S +  + NVT   DG +L + T  RV+ +N + +
Sbjct: 59  LILVVVIAAALFYLIYDPSLPEFHLDSFRVPKLNVTEAADGAYLDADTSARVEVKNRSGR 118

Query: 475 ITFYYDRIHVRMTAVD-DVDLGSGSIPAFNQGKKNTTVLKFGTQVKNMLIDDDVGTKLKA 651
           +T+++ +    ++A + D++LGS  +  F   +K  T LK  T VK + +++    ++K+
Sbjct: 119 MTWHFSQSQFTVSAENGDLNLGSTKVAGFTVKEKGVTGLKAETSVKELALNERQRRRIKS 178

Query: 652 RFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPKCTISLLKWIN 831
              SK ++ SVEVRTK  LGL  W    + V I C  V++++L    PP C+I+LLKWI 
Sbjct: 179 AVESKALVPSVEVRTKTGLGLGGWNSPSISVTIVCGDVTMRQLQKGDPPLCSITLLKWIK 238

Query: 832 I 834
           I
Sbjct: 239 I 239


>ref|NP_182153.2| late embryogenesis abundant hydroxyproline-rich glycoprotein
           [Arabidopsis thaliana] gi|91806363|gb|ABE65909.1|
           unknown [Arabidopsis thaliana]
           gi|330255579|gb|AEC10673.1| late embryogenesis abundant
           hydroxyproline-rich glycoprotein [Arabidopsis thaliana]
          Length = 252

 Score =  167 bits (423), Expect = 6e-39
 Identities = 90/252 (35%), Positives = 138/252 (54%), Gaps = 14/252 (5%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNA------PVSLARKP-----PIPRSFHPKPKRKSXXXXXX 267
           M +    P+LQKPPGYRDPN       P  + ++P     P+P S+ PK KR+S      
Sbjct: 1   MADYQMNPVLQKPPGYRDPNMSSPPPPPPPIQQQPMRKAVPMPTSYRPKKKRRSCCRFCC 60

Query: 268 XXXXXXXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVR 447
                          +  ++FYLWF+PK P F + S +   F +    DG  L++  + R
Sbjct: 61  CCICITLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLADDPDGASLSATAVAR 120

Query: 448 VQARNPNEKITFYYDRIHVRMTAV---DDVDLGSGSIPAFNQGKKNTTVLKFGTQVKNML 618
           V+ +NPN K+ FYY    V ++     D+  +G  ++  F QG KN+T +K  T VKN L
Sbjct: 121 VEMKNPNSKLVFYYGNTAVDLSVGSGNDETGMGETTMNGFRQGPKNSTSVKVETTVKNQL 180

Query: 619 IDDDVGTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPP 798
           ++  +  +L A+F+SK+++I+V  +TK  LG+   ++G + V + C GVSL KLD  S P
Sbjct: 181 VERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLAVNLRCGGVSLNKLDTDS-P 239

Query: 799 KCTISLLKWINI 834
           KC ++ LKW+ I
Sbjct: 240 KCILNTLKWVTI 251


>dbj|BAC41966.1| unknown protein [Arabidopsis thaliana]
          Length = 252

 Score =  167 bits (423), Expect = 6e-39
 Identities = 90/252 (35%), Positives = 138/252 (54%), Gaps = 14/252 (5%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNA------PVSLARKP-----PIPRSFHPKPKRKSXXXXXX 267
           M +    P+LQKPPGYRDPN       P  + ++P     P+P S+ PK KR+S      
Sbjct: 1   MADYQMNPVLQKPPGYRDPNMSSPPPPPPPIQQQPMRKAVPMPTSYRPKKKRRSCCRFCC 60

Query: 268 XXXXXXXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVR 447
                          +  ++FYLWF+PK P F + S +   F +    DG  L++  + R
Sbjct: 61  CCICITLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLADDPDGASLSATAVAR 120

Query: 448 VQARNPNEKITFYYDRIHVRMTAV---DDVDLGSGSIPAFNQGKKNTTVLKFGTQVKNML 618
           V+ +NPN K+ FYY    V ++     D+  +G  ++  F QG KN+T +K  T VKN L
Sbjct: 121 VEMKNPNSKLVFYYGNTAVDLSVGSGNDETGMGETTMNGFRQGPKNSTSVKVETTVKNQL 180

Query: 619 IDDDVGTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPP 798
           ++  +  +L A+F+SK+++I+V  +TK  LG+   ++G + V + C GVSL KLD  S P
Sbjct: 181 VERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLTVNLRCGGVSLNKLDTDS-P 239

Query: 799 KCTISLLKWINI 834
           KC ++ LKW+ I
Sbjct: 240 KCILNTLKWVTI 251


>gb|ABK28538.1| unknown [Arabidopsis thaliana]
          Length = 253

 Score =  167 bits (423), Expect = 6e-39
 Identities = 90/252 (35%), Positives = 138/252 (54%), Gaps = 14/252 (5%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNA------PVSLARKP-----PIPRSFHPKPKRKSXXXXXX 267
           M +    P+LQKPPGYRDPN       P  + ++P     P+P S+ PK KR+S      
Sbjct: 1   MADYQMNPVLQKPPGYRDPNMSSPPPPPPPIQQQPMRKAVPMPTSYRPKKKRRSCCRFCC 60

Query: 268 XXXXXXXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVR 447
                          +  ++FYLWF+PK P F + S +   F +    DG  L++  + R
Sbjct: 61  CCICITLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLADDPDGASLSATAVAR 120

Query: 448 VQARNPNEKITFYYDRIHVRMTAV---DDVDLGSGSIPAFNQGKKNTTVLKFGTQVKNML 618
           V+ +NPN K+ FYY    V ++     D+  +G  ++  F QG KN+T +K  T VKN L
Sbjct: 121 VEMKNPNSKLVFYYGNTAVDLSVGSGNDETGMGETTMNGFRQGPKNSTSVKVETTVKNQL 180

Query: 619 IDDDVGTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPP 798
           ++  +  +L A+F+SK+++I+V  +TK  LG+   ++G + V + C GVSL KLD  S P
Sbjct: 181 VERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLAVNLRCGGVSLNKLDTDS-P 239

Query: 799 KCTISLLKWINI 834
           KC ++ LKW+ I
Sbjct: 240 KCILNTLKWVTI 251


>ref|XP_006295826.1| hypothetical protein CARUB_v10024952mg [Capsella rubella]
           gi|482564534|gb|EOA28724.1| hypothetical protein
           CARUB_v10024952mg [Capsella rubella]
          Length = 249

 Score =  167 bits (422), Expect = 8e-39
 Identities = 90/250 (36%), Positives = 133/250 (53%), Gaps = 15/250 (6%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNAPVSLARKPPI------------PRSFHPKPKRKSXXXXX 264
           M +   +P+LQKPPGYRDPN        PP+            P SF PK KR+S     
Sbjct: 1   MADYHMQPVLQKPPGYRDPNNATPPPPPPPVSQQPMRKASAAMPSSFRPKRKRRSCCRFC 60

Query: 265 XXXXXXXXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIV 444
                           +  ++FYLWF+PK P F + S +   F +    DG  L++  + 
Sbjct: 61  CCCVCISLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLADDPDGASLSATAVA 120

Query: 445 RVQARNPNEKITFYYDRIHVRM---TAVDDVDLGSGSIPAFNQGKKNTTVLKFGTQVKNM 615
           RV+ +NPN K+ FYY    V M   T  D+  +G+ ++  F QG KN+T +K  T VKN 
Sbjct: 121 RVEMKNPNSKLVFYYGNTDVEMSVGTGNDETGMGATTVNGFRQGPKNSTSVKVETTVKNQ 180

Query: 616 LIDDDVGTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSP 795
           L++  +  +L  +F+SK+++I+V  +TK  LG+   ++G + V + C GVSL KLD  S 
Sbjct: 181 LVERALAKRLATKFQSKDLVINVVAKTKVGLGVAGVKIGMLAVNLRCGGVSLNKLDTDS- 239

Query: 796 PKCTISLLKW 825
           PKC ++ LKW
Sbjct: 240 PKCILNTLKW 249


>ref|XP_006577338.1| PREDICTED: uncharacterized protein LOC102670360 isoform X2 [Glycine
           max]
          Length = 241

 Score =  166 bits (419), Expect = 2e-38
 Identities = 90/238 (37%), Positives = 135/238 (56%), Gaps = 3/238 (1%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNA--PVSLARKPPIPRSFHPKPKRKSXXXXXXXXXXXXXXX 294
           M EP  KPILQKPPGYRDP++  P    RKP +P SF PKPKR+S               
Sbjct: 1   MEEP--KPILQKPPGYRDPDSKPPPPPPRKPMLPPSFQPKPKRRSCCRICCCTFCLTILI 58

Query: 295 XXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVRVQARNPNEK 474
                 +A +LFYL ++P  P FH+ S +  + NVT   DG +L + T  RV+ +N + +
Sbjct: 59  LILVVVIAAALFYLIYDPSLPEFHLDSFRVPKLNVTEAADGAYLDADTSARVEVKNRSGR 118

Query: 475 ITFYYDRIHVRMTAVD-DVDLGSGSIPAFNQGKKNTTVLKFGTQVKNMLIDDDVGTKLKA 651
           +T+++ +    ++A + D++LGS  +  F   +K  T LK  T VK + +++    ++K+
Sbjct: 119 MTWHFSQSQFTVSAENGDLNLGSTKVAGFTVKEKGVTGLKAETSVKELALNERQRRRIKS 178

Query: 652 RFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPKCTISLLKW 825
              SK ++ SVEVRTK  LGL  W    + V I C  V++++L    PP C+I+LLKW
Sbjct: 179 AVESKALVPSVEVRTKTGLGLGGWNSPSISVTIVCGDVTMRQLQKGDPPLCSITLLKW 236


>gb|AAC62876.1| hypothetical protein [Arabidopsis thaliana]
           gi|227204111|dbj|BAH56908.1| AT2G46300 [Arabidopsis
           thaliana]
          Length = 254

 Score =  166 bits (419), Expect = 2e-38
 Identities = 90/252 (35%), Positives = 137/252 (54%), Gaps = 14/252 (5%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNA------PVSLARKP-----PIPRSFHPKPKRKSXXXXXX 267
           M +    P+LQKPPGYRDPN       P  + ++P     P+P S+ PK KR+S      
Sbjct: 1   MADYQMNPVLQKPPGYRDPNMSSPPPPPPPIQQQPMRKAVPMPTSYRPKKKRRSCCRFCC 60

Query: 268 XXXXXXXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVR 447
                          +  ++FYLWF+PK P F + S +   F +    DG  L++  + R
Sbjct: 61  CCICITLVLFIFLLLVGTAVFYLWFDPKLPTFSLASFRLDGFKLADDPDGASLSATAVAR 120

Query: 448 VQARNPNEKITFYYDRIHVRMTAV---DDVDLGSGSIPAFNQGKKNTTVLKFGTQVKNML 618
           V+ +NPN K+ FYY    V ++     D+  +G  ++  F QG KN+T +K  T VKN L
Sbjct: 121 VEMKNPNSKLVFYYGNTAVDLSVGSGNDETGMGETTMNGFRQGPKNSTSVKVETTVKNQL 180

Query: 619 IDDDVGTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPP 798
           ++  +  +L A+F+SK+++I+V  +TK  LG+   ++G + V + C GVSL KLD  S P
Sbjct: 181 VERGLAKRLAAKFQSKDLVINVVAKTKVGLGVGGIKIGMLAVNLRCGGVSLNKLDTDS-P 239

Query: 799 KCTISLLKWINI 834
           KC ++ LKW  I
Sbjct: 240 KCILNTLKWYKI 251


>ref|XP_004140888.1| PREDICTED: uncharacterized protein LOC101205096 [Cucumis sativus]
          Length = 253

 Score =  164 bits (414), Expect = 6e-38
 Identities = 89/252 (35%), Positives = 135/252 (53%), Gaps = 14/252 (5%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNA--------------PVSLARKPPIPRSFHPKPKRKSXXX 258
           M +   KP LQKPPGY+D N               P  L  KP  P S+ PK ++++   
Sbjct: 1   MADLPLKPPLQKPPGYKDHNTTATSSSSASTATHLPPPLRPKPRPPSSYKPKKRKRNCCR 60

Query: 259 XXXXXXXXXXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQT 438
                            ALA +LFYL ++PK P FH+ + + S F V+   DG+FL SQ 
Sbjct: 61  TCCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQV 120

Query: 439 IVRVQARNPNEKITFYYDRIHVRMTAVDDVDLGSGSIPAFNQGKKNTTVLKFGTQVKNML 618
            +RV+ +NPNEK++  Y +I   +T     + G   +  F QG+++TT +K    VKN +
Sbjct: 121 SIRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKM 180

Query: 619 IDDDVGTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPP 798
           +  + G +L ++F+SK + + VE  T+  + +Q W LG + V++ CE   LK +DG   P
Sbjct: 181 LAVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGGDMP 239

Query: 799 KCTISLLKWINI 834
            C I+LL+WINI
Sbjct: 240 TCNINLLRWINI 251


>ref|XP_004160824.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized LOC101205096
           [Cucumis sativus]
          Length = 252

 Score =  160 bits (406), Expect = 5e-37
 Identities = 87/251 (34%), Positives = 133/251 (52%), Gaps = 13/251 (5%)
 Frame = +1

Query: 121 MTEPTRKPILQKPPGYRDPNAPVSLARKPP-------------IPRSFHPKPKRKSXXXX 261
           M +   KP LQKPPGY+D N        PP              P S+ PK ++++    
Sbjct: 1   MADLPLKPPLQKPPGYKDHNTTAPPPPPPPPPPTFLHLSVPNLSPSSYKPKKRKRNCCRT 60

Query: 262 XXXXXXXXXXXXXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTI 441
                           ALA +LFYL ++PK P FH+ + + S F V+   DG+FL SQ  
Sbjct: 61  CCCIFCFLILFLIVVAALALALFYLLYDPKLPVFHLLAFRISSFKVSTTPDGSFLDSQVS 120

Query: 442 VRVQARNPNEKITFYYDRIHVRMTAVDDVDLGSGSIPAFNQGKKNTTVLKFGTQVKNMLI 621
           +RV+ +NPNEK++  Y +I   +T     + G   +  F QG+++TT +K    VKN ++
Sbjct: 121 IRVEFKNPNEKLSIKYGKIEYDVTVGQATEFGRRELAGFTQGRRSTTTVKAEAAVKNKML 180

Query: 622 DDDVGTKLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPK 801
             + G +L ++F+SK + + VE  T+  + +Q W LG + V++ CE   LK +DG   P 
Sbjct: 181 AVEDGGRLLSKFQSKALEVKVEAETEVGVVVQGWGLGPITVKLDCES-KLKNIDGGDMPT 239

Query: 802 CTISLLKWINI 834
           C I+LL+WINI
Sbjct: 240 CNINLLRWINI 250


>ref|XP_007220159.1| hypothetical protein PRUPE_ppa018680mg [Prunus persica]
           gi|462416621|gb|EMJ21358.1| hypothetical protein
           PRUPE_ppa018680mg [Prunus persica]
          Length = 254

 Score =  159 bits (403), Expect = 1e-36
 Identities = 85/245 (34%), Positives = 128/245 (52%), Gaps = 13/245 (5%)
 Frame = +1

Query: 139 KPILQKPPGYRDPNAPVSLARKPPIPR------SFHPKPKRK--SXXXXXXXXXXXXXXX 294
           KP+LQKPPGYR PN P      PP PR      +   K K++  S               
Sbjct: 8   KPVLQKPPGYRTPNYPAQPVPGPPPPRKPVYPPTLRQKQKKRGGSCCKICCCVFCAFLLI 67

Query: 295 XXXXXALAGSLFYLWFEPKFPNFHIQSIKFSRFNVTVKLDGTFLTSQTIVRVQARNPNEK 474
                ALAG +FYL F+P+ P F++ S +  +F+   K DGT L  Q +  V+ +NPN K
Sbjct: 68  VVILVALAGGIFYLLFDPRLPAFYLISFQIPKFDAVSKSDGTHLDVQAVTSVEVKNPNPK 127

Query: 475 ITFYYDRIHVRMTAVDD-----VDLGSGSIPAFNQGKKNTTVLKFGTQVKNMLIDDDVGT 639
           +  YY        ++ D     + +G+  +  F Q  +NTT +K  + V+N +++  VG 
Sbjct: 128 LDIYYSEGFEMSLSIGDENDGGLGIGTKEVKGFTQRHRNTTYVKVESGVRNKVVEQPVGK 187

Query: 640 KLKARFRSKEIMISVEVRTKFQLGLQWWRLGKVPVRIFCEGVSLKKLDGRSPPKCTISLL 819
           KL  +F+SKEI +++E +T+    +Q WR+G + + + C GV LK +D    PKCTI+  
Sbjct: 188 KLLGQFKSKEIKVALEGKTRVGYVIQGWRVGTMQINVLCGGVRLKNVDAGDMPKCTINAF 247

Query: 820 KWINI 834
           KW  I
Sbjct: 248 KWYAI 252

