BLASTX nr result

ID: Akebia25_contig00005313 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00005313
         (3122 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006859070.1| hypothetical protein AMTR_s00068p00200420 [A...   291   2e-75
ref|XP_006385862.1| hypothetical protein POPTR_0003s15620g [Popu...   248   2e-62
ref|XP_002303725.2| hypothetical protein POPTR_0003s15620g [Popu...   248   2e-62
ref|XP_007212594.1| hypothetical protein PRUPE_ppa016040mg [Prun...   247   3e-62
ref|XP_002299375.2| hypothetical protein POPTR_0001s12440g [Popu...   238   1e-59
ref|XP_007040960.1| Uncharacterized protein isoform 1 [Theobroma...   236   4e-59
ref|XP_004293834.1| PREDICTED: uncharacterized protein LOC101310...   226   7e-56
ref|XP_007040961.1| Uncharacterized protein isoform 2, partial [...   223   3e-55
ref|XP_006468614.1| PREDICTED: AAC-rich mRNA clone AAC11 protein...   223   6e-55
ref|XP_006448557.1| hypothetical protein CICLE_v10015606mg [Citr...   219   6e-54
gb|EXB56441.1| hypothetical protein L484_009867 [Morus notabilis]     219   8e-54
ref|XP_007040963.1| Uncharacterized protein isoform 4, partial [...   215   1e-52
ref|XP_006284024.1| hypothetical protein CARUB_v10005146mg [Caps...   210   3e-51
ref|NP_001043875.1| Os01g0680700 [Oryza sativa Japonica Group] g...   209   6e-51
ref|NP_567597.2| uncharacterized protein [Arabidopsis thaliana] ...   209   6e-51
ref|XP_002869941.1| hypothetical protein ARALYDRAFT_914630 [Arab...   204   2e-49
ref|XP_003569564.1| PREDICTED: uncharacterized protein LOC100838...   202   8e-49
ref|XP_006398083.1| hypothetical protein EUTSA_v10000929mg [Eutr...   201   2e-48
ref|XP_003628966.1| hypothetical protein MTR_8g070650 [Medicago ...   199   7e-48
ref|XP_006413885.1| hypothetical protein EUTSA_v10025632mg [Eutr...   197   3e-47

>ref|XP_006859070.1| hypothetical protein AMTR_s00068p00200420 [Amborella trichopoda]
            gi|548863182|gb|ERN20537.1| hypothetical protein
            AMTR_s00068p00200420 [Amborella trichopoda]
          Length = 380

 Score =  291 bits (744), Expect = 2e-75
 Identities = 190/407 (46%), Positives = 231/407 (56%), Gaps = 16/407 (3%)
 Frame = +1

Query: 1699 FLVMCNRFQRVSPDCLPLSNGRKSNVRTCKEED---DNGENGRIPNYSSTSSFEGKSVLR 1869
            FL  C R+QRVSPDCL LSNGRK ++R CKE+D    NG NG+I  Y+  +   G   +R
Sbjct: 7    FLHGC-RYQRVSPDCLHLSNGRKPSLRICKEDDIEGSNGNNGKIQTYNH-NPLNGFPRIR 64

Query: 1870 YXXXXXXXXXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVL 2049
                       DHN   +    SE+     NH N     N N           N GGD++
Sbjct: 65   ---TTPSSTSQDHNYAPS---VSETPQTENNHDNN----NNNNNVGKTHALENNMGGDII 114

Query: 2050 LQWGQNKRSRGSRAENRVMVMTDDQSAVQSRQVIKIQRRV---EKQQQGIMKXXXXXXXX 2220
            LQWGQNKRSRG R+ENRV+    D+S+ Q+RQ +KI RRV   EK Q             
Sbjct: 115  LQWGQNKRSRGFRSENRVL---GDESSTQARQAVKIPRRVVGPEKLQS------------ 159

Query: 2221 XXXXXXXXXXXXXXXYSRSGNPRSSPSIRET-TGSHLNRNLEDRS-------VXXXXXXX 2376
                           YSR+ N R    +RE  TGS + RNLE++S       +       
Sbjct: 160  -----HGAHQTQVNSYSRNTNLRPCTPVREPPTGSIIYRNLEEQSGSGHPKGINVFQSGT 214

Query: 2377 XXXXXXXXXXVARSSDKRSPDKTENK-VGSCSVVSTT-NMGEKQNGSSMAQQADLIMSHH 2550
                          ++KRSP+K +   V SC   STT N     + +S + + +    HH
Sbjct: 215  SNNSGGRFLQRVGDNNKRSPEKPDKAGVASCPPPSTTMNNNNSNHYNSSSPKNNNNNDHH 274

Query: 2551 VESTTPVPSDVVNGEKVNLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVD 2730
             E T       V  EK+NL++FEWP+IYISLSRKEKEDDFLA+KGTKLPQRPKKRAKNVD
Sbjct: 275  QEITVAEHEPCVAFEKLNLELFEWPKIYISLSRKEKEDDFLAIKGTKLPQRPKKRAKNVD 334

Query: 2731 KTLQYCFPGMWLSDLTRGRYEVREKKCVKKQKRRGLKGMESMESDSE 2871
            KTLQYCFPGMWLS+L RGRYEVREKKCVKK +RRGLKG+ESMESDSE
Sbjct: 335  KTLQYCFPGMWLSELGRGRYEVREKKCVKK-RRRGLKGLESMESDSE 380


>ref|XP_006385862.1| hypothetical protein POPTR_0003s15620g [Populus trichocarpa]
            gi|550343257|gb|ERP63659.1| hypothetical protein
            POPTR_0003s15620g [Populus trichocarpa]
          Length = 368

 Score =  248 bits (632), Expect = 2e-62
 Identities = 178/405 (43%), Positives = 216/405 (53%), Gaps = 20/405 (4%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNGRKSNVRTCKEEDDNGENGR-IPN-YSSTSSFEGKSVLRYXXXXXX 1890
            R+QRVSPDC+PLSNG+K N           ENGR IPN ++STS+      LR+      
Sbjct: 2    RYQRVSPDCVPLSNGKKPN---------GAENGRSIPNGFNSTSTNFDTKGLRFRSPSRN 52

Query: 1891 XXXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGG--DVLLQWGQ 2064
                 HNN T       SSP SEN+ N QTQ++ +           +RGG  DVLLQWGQ
Sbjct: 53   QDH--HNNSTT------SSPHSENNHN-QTQRHDSSPGPSP-----SRGGNGDVLLQWGQ 98

Query: 2065 NKRSRGSRAENRVMVMTDDQSAVQSRQVI-KIQRRVEKQQQGIMKXXXXXXXXXXXXXXX 2241
             KR+R SR+E R +   +  S+ Q+RQ I ++ RRV+ +                     
Sbjct: 99   KKRARVSRSEIRALA-DESSSSGQARQPINRVPRRVDNK------------FSPPTMPPP 145

Query: 2242 XXXXXXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSS 2421
                     S S + R     +E +G   +RNLE RS                     ++
Sbjct: 146  PPPPPPPKQSISTSIRGGNLKKENSGFLSHRNLEKRSGAGNGSPSRNSGGSSRVVSRSTA 205

Query: 2422 DKRSPDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSH--------------HVES 2559
             KRSP   EN         +    EK NGS +  QAD  M+                  +
Sbjct: 206  GKRSPPTPENIDRKMPSSRSAAKDEKPNGSLV--QADHQMNQVDSTRAKSEKEAGVTTSN 263

Query: 2560 TTPVPSDVVNGEKVNLD-MFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKT 2736
            T  VP     GEK N + + EWPRIYI+LSRKEKEDDF AMKGTKLPQRPKKRAKN+DK 
Sbjct: 264  TVSVPVVASGGEKANNNGVIEWPRIYIALSRKEKEDDFFAMKGTKLPQRPKKRAKNIDKA 323

Query: 2737 LQYCFPGMWLSDLTRGRYEVREKKCVKKQKRRGLKGMESMESDSE 2871
            LQYCFPGMWLSDLT+ RYEVREKKCVKKQKRRGLKGMESM+SDSE
Sbjct: 324  LQYCFPGMWLSDLTKSRYEVREKKCVKKQKRRGLKGMESMDSDSE 368


>ref|XP_002303725.2| hypothetical protein POPTR_0003s15620g [Populus trichocarpa]
            gi|550343256|gb|EEE78704.2| hypothetical protein
            POPTR_0003s15620g [Populus trichocarpa]
          Length = 369

 Score =  248 bits (632), Expect = 2e-62
 Identities = 178/405 (43%), Positives = 216/405 (53%), Gaps = 20/405 (4%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNGRKSNVRTCKEEDDNGENGR-IPN-YSSTSSFEGKSVLRYXXXXXX 1890
            R+QRVSPDC+PLSNG+K N           ENGR IPN ++STS+      LR+      
Sbjct: 3    RYQRVSPDCVPLSNGKKPN---------GAENGRSIPNGFNSTSTNFDTKGLRFRSPSRN 53

Query: 1891 XXXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGG--DVLLQWGQ 2064
                 HNN T       SSP SEN+ N QTQ++ +           +RGG  DVLLQWGQ
Sbjct: 54   QDH--HNNSTT------SSPHSENNHN-QTQRHDSSPGPSP-----SRGGNGDVLLQWGQ 99

Query: 2065 NKRSRGSRAENRVMVMTDDQSAVQSRQVI-KIQRRVEKQQQGIMKXXXXXXXXXXXXXXX 2241
             KR+R SR+E R +   +  S+ Q+RQ I ++ RRV+ +                     
Sbjct: 100  KKRARVSRSEIRALA-DESSSSGQARQPINRVPRRVDNK------------FSPPTMPPP 146

Query: 2242 XXXXXXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSS 2421
                     S S + R     +E +G   +RNLE RS                     ++
Sbjct: 147  PPPPPPPKQSISTSIRGGNLKKENSGFLSHRNLEKRSGAGNGSPSRNSGGSSRVVSRSTA 206

Query: 2422 DKRSPDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSH--------------HVES 2559
             KRSP   EN         +    EK NGS +  QAD  M+                  +
Sbjct: 207  GKRSPPTPENIDRKMPSSRSAAKDEKPNGSLV--QADHQMNQVDSTRAKSEKEAGVTTSN 264

Query: 2560 TTPVPSDVVNGEKVNLD-MFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKT 2736
            T  VP     GEK N + + EWPRIYI+LSRKEKEDDF AMKGTKLPQRPKKRAKN+DK 
Sbjct: 265  TVSVPVVASGGEKANNNGVIEWPRIYIALSRKEKEDDFFAMKGTKLPQRPKKRAKNIDKA 324

Query: 2737 LQYCFPGMWLSDLTRGRYEVREKKCVKKQKRRGLKGMESMESDSE 2871
            LQYCFPGMWLSDLT+ RYEVREKKCVKKQKRRGLKGMESM+SDSE
Sbjct: 325  LQYCFPGMWLSDLTKSRYEVREKKCVKKQKRRGLKGMESMDSDSE 369


>ref|XP_007212594.1| hypothetical protein PRUPE_ppa016040mg [Prunus persica]
            gi|462408459|gb|EMJ13793.1| hypothetical protein
            PRUPE_ppa016040mg [Prunus persica]
          Length = 369

 Score =  247 bits (630), Expect = 3e-62
 Identities = 171/398 (42%), Positives = 219/398 (55%), Gaps = 14/398 (3%)
 Frame = +1

Query: 1720 FQRVSPDCLPLSNGRKSNVRTCKEEDDNGENGRIPNYSSTSSFEGKSVLRYXXXXXXXXX 1899
            +QRVSPDC+PLSNG+K  +R   +ED   E       S+ S+ E     R+         
Sbjct: 5    YQRVSPDCVPLSNGKKPAMRAISKEDGLTET---LTTSTVSTLEPTKPFRFRSQPTTQDP 61

Query: 1900 XDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRG-GDVLLQWGQNKRS 2076
                   + F    +SP S+NH  + TQ+   ++         +RG GDVLLQWG  KRS
Sbjct: 62   TQ-----SQFGARPTSPNSDNHHRSPTQR---QDKSPSRTPSPSRGAGDVLLQWGHKKRS 113

Query: 2077 RGSRAENRVMVMTDDQSAVQSRQV-IKIQRRVEKQQQGIMKXXXXXXXXXXXXXXXXXXX 2253
            R SR E R     +  S+ Q+RQ  +K+QRR +                           
Sbjct: 114  RVSRTEIRAAT-DESSSSAQARQAGVKLQRRDKSMPP-------------------PPPP 153

Query: 2254 XXXXYSRSGNPRSSPSIRETTGSHL-NRNLEDRS-VXXXXXXXXXXXXXXXXXVARSS-D 2424
                 S + +  S+  +R+   + L +RNLEDRS V                 V+RS+  
Sbjct: 154  PPLSSSSATSSFSNGRLRKEASALLPSRNLEDRSAVVNGSPSRNPTGGSNSRAVSRSTVG 213

Query: 2425 KRSP--DKTENKVGSCSVVSTTNMGEKQNGSSM-------AQQADLIMSHHVESTTPVPS 2577
            KRSP  +K + K+  CS  S+    +K NG S+       A  A L  S  + +T    +
Sbjct: 214  KRSPPPEKNDRKLPPCSGRSSAK-DDKPNGPSVQVDRQHHADSASL-QSDQLAATANGAA 271

Query: 2578 DVVNGEKVNLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCFPG 2757
             V   +KVN ++ EWPRIYI+LSRKEKEDDFLAMKGTKLPQRPKKRAKN+D+TLQYCFPG
Sbjct: 272  PVAAADKVNYEVVEWPRIYIALSRKEKEDDFLAMKGTKLPQRPKKRAKNIDRTLQYCFPG 331

Query: 2758 MWLSDLTRGRYEVREKKCVKKQKRRGLKGMESMESDSE 2871
            MWLSDLTR RYEVREKKCVKKQKRRGLKGMES++SDSE
Sbjct: 332  MWLSDLTRNRYEVREKKCVKKQKRRGLKGMESVDSDSE 369


>ref|XP_002299375.2| hypothetical protein POPTR_0001s12440g [Populus trichocarpa]
            gi|550347094|gb|EEE84180.2| hypothetical protein
            POPTR_0001s12440g [Populus trichocarpa]
          Length = 338

 Score =  238 bits (607), Expect = 1e-59
 Identities = 169/389 (43%), Positives = 200/389 (51%), Gaps = 4/389 (1%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNGRKSNVRTCKEEDDNGENGR-IPN-YSSTSS-FEGKSVLRYXXXXX 1887
            R+QRVSPDC+PLSNG+K N           ENGR IPN +SSTS+ FE K+   +     
Sbjct: 3    RYQRVSPDCVPLSNGKKPN---------GVENGRSIPNGFSSTSTNFETKA---FRFRSP 50

Query: 1888 XXXXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQWGQN 2067
                  HNN T       S P S+N  N  TQ++                GDVLLQWGQ 
Sbjct: 51   SRNQDHHNNSTT------SPPHSDNSHN-HTQRHGTSPSPSPSRVG---NGDVLLQWGQK 100

Query: 2068 KRSRGSRAENRVMVMTDDQSAVQSRQVI-KIQRRVEKQQQGIMKXXXXXXXXXXXXXXXX 2244
            KR+R SR+E R     +  S+ Q+RQ I KI RRV+ +                      
Sbjct: 101  KRARVSRSEIRAFP-DESSSSGQARQPINKIPRRVDNKLS------------PSSMPPPP 147

Query: 2245 XXXXXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSSD 2424
                    S S N R     +E +G   +RNLE RS                     ++ 
Sbjct: 148  PPPSSQQQSTSTNTRGGNLKKENSGILSHRNLEKRSGAGNGSPSRNSGGSGKVVSRSTAG 207

Query: 2425 KRSPDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSHHVESTTPVPSDVVNGEKVN 2604
            KRSP   EN         +    EK NGS +      +  H                  N
Sbjct: 208  KRSPPTPENIDRKMPSSRSAAKDEKPNGSIV------VADHQTRQVN------------N 249

Query: 2605 LDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCFPGMWLSDLTRG 2784
             ++ EWPRIYI+LSRKEKEDDF AMKGTKLPQRPKKRAKN+DK LQYCFPGMWLSDLT+ 
Sbjct: 250  NEVIEWPRIYIALSRKEKEDDFFAMKGTKLPQRPKKRAKNIDKALQYCFPGMWLSDLTKS 309

Query: 2785 RYEVREKKCVKKQKRRGLKGMESMESDSE 2871
            RYEVREKKCVKKQKRRGLKGMESM+SDSE
Sbjct: 310  RYEVREKKCVKKQKRRGLKGMESMDSDSE 338


>ref|XP_007040960.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508704895|gb|EOX96791.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 386

 Score =  236 bits (603), Expect = 4e-59
 Identities = 163/422 (38%), Positives = 207/422 (49%), Gaps = 37/422 (8%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNGRKSNVRT--------CKEE------DDNGENGRIPNYSSTSSFEG 1854
            R+QRVSPDC PLS+ +K  ++         CKEE      + N ENGR  +    ++FEG
Sbjct: 3    RYQRVSPDCPPLSSAKKLGLKPTITTTSTMCKEEGGSCSNNSNIENGRCISKDIITAFEG 62

Query: 1855 KSVLRYXXXXXXXXXXDHNNFTAH---FIFSESSPQSENHANTQTQKNQNREXXXXXXXX 2025
               +RY           HN+  +H    + +  +P S   A  QT+ N + E        
Sbjct: 63   AKGVRYRPPSRTQDHHLHNSNLSHPSSGVGANGAPNSPPKAQAQTENNHHHEMPKRSETT 122

Query: 2026 XNRGGDVLLQWGQNKRSRGSRAENRVMVMTDDQSAVQSRQVI--KIQRRVEKQQQGIMKX 2199
                GDVLLQWGQ KR+R SR+E R +      S V  RQ I  K+ RRV          
Sbjct: 123  SPNRGDVLLQWGQKKRARVSRSEIRPLADDSSSSTVPGRQPIGNKVPRRV---------- 172

Query: 2200 XXXXXXXXXXXXXXXXXXXXXXYSRSGNPRSSPSIRETTGSHL------NRNLEDRSVXX 2361
                                  ++    P  +P       S L      +RNL++RS   
Sbjct: 173  ---------------------LHATMPPPPPAPPSNSARCSTLRNGLLSSRNLDERSAAA 211

Query: 2362 XXXXXXXXXXXXXXXVARSSDKRSP-----DKTENKVGSC-------SVVSTTNMGEKQN 2505
                               + K+SP     D+ +   GS        S V T  M +   
Sbjct: 212  SGSPSRNSGGTSRAASRAMAGKKSPPLETIDRKKLCAGSVKDGQQNGSAVQTDRMNQTDY 271

Query: 2506 GSSMAQQADLIMSHHVESTTPVPSDVVNGEKVNLDMFEWPRIYISLSRKEKEDDFLAMKG 2685
                +++A         +     S    GEKVN+++ EWPRIYISLSRKEKE+DFLAMKG
Sbjct: 272  APVQSERAG-------GAANSTASAAGVGEKVNVEVIEWPRIYISLSRKEKEEDFLAMKG 324

Query: 2686 TKLPQRPKKRAKNVDKTLQYCFPGMWLSDLTRGRYEVREKKCVKKQKRRGLKGMESMESD 2865
            TKLPQRPKKRAKNVD+TLQYCFPGMWLSDLT+ RYEVREKK  KKQKR+GLKGME +ESD
Sbjct: 325  TKLPQRPKKRAKNVDRTLQYCFPGMWLSDLTKSRYEVREKKSAKKQKRKGLKGMECVESD 384

Query: 2866 SE 2871
            SE
Sbjct: 385  SE 386


>ref|XP_004293834.1| PREDICTED: uncharacterized protein LOC101310966 [Fragaria vesca
            subsp. vesca]
          Length = 338

 Score =  226 bits (575), Expect = 7e-56
 Identities = 146/292 (50%), Positives = 179/292 (61%), Gaps = 13/292 (4%)
 Frame = +1

Query: 2035 GGDVLLQWGQNKRSRGSRAENRVMVMTDDQSAVQSRQVIKIQRRVEKQQQGIMKXXXXXX 2214
            GGDVLLQWGQ KRSR SR E RV+   +  S+ Q+RQ  K+QRR       +        
Sbjct: 65   GGDVLLQWGQRKRSRVSRTEIRVLA-DESSSSAQARQA-KVQRRAA-HAAAVAADKSMPP 121

Query: 2215 XXXXXXXXXXXXXXXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRSVXXXXXXXXXXXXX 2394
                             +S +G  R     +E +G   NRNLEDRS              
Sbjct: 122  PPPPPPPHPSSTTSTSSFS-NGRLR-----KEASGLLPNRNLEDRSAVVNGSPSRSTVVG 175

Query: 2395 XXXXVARS-SDKRSP--DKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSHHVESTT 2565
                 +RS + KRSP  +K+E K+ SCS  S  +  +K NGSS  +      ++HV+ST+
Sbjct: 176  NGRAASRSIAGKRSPPPEKSERKMPSCSGRSAKD--DKANGSSDHR------ANHVDSTS 227

Query: 2566 PVPSDVVNG----------EKVNLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKR 2715
             + S+ + G          EK+N ++ EWPRIY++LSRKEKEDDFLAMKGTKLPQRPKKR
Sbjct: 228  -LQSEQLAGAANHSAALAAEKLNHEVVEWPRIYLALSRKEKEDDFLAMKGTKLPQRPKKR 286

Query: 2716 AKNVDKTLQYCFPGMWLSDLTRGRYEVREKKCVKKQKRRGLKGMESMESDSE 2871
            AKNVD+TLQYCFPGMWLSDLTR RYEVREKKCVKKQKRRGLKGMES+ES+SE
Sbjct: 287  AKNVDRTLQYCFPGMWLSDLTRNRYEVREKKCVKKQKRRGLKGMESVESESE 338


>ref|XP_007040961.1| Uncharacterized protein isoform 2, partial [Theobroma cacao]
            gi|508704896|gb|EOX96792.1| Uncharacterized protein
            isoform 2, partial [Theobroma cacao]
          Length = 385

 Score =  223 bits (569), Expect = 3e-55
 Identities = 156/413 (37%), Positives = 199/413 (48%), Gaps = 37/413 (8%)
 Frame = +1

Query: 1720 FQRVSPDCLPLSNGRKSNVRT--------CKEE------DDNGENGRIPNYSSTSSFEGK 1857
            +QRVSPDC PLS+ +K  ++         CKEE      + N ENGR  +    ++FEG 
Sbjct: 11   YQRVSPDCPPLSSAKKLGLKPTITTTSTMCKEEGGSCSNNSNIENGRCISKDIITAFEGA 70

Query: 1858 SVLRYXXXXXXXXXXDHNNFTAH---FIFSESSPQSENHANTQTQKNQNREXXXXXXXXX 2028
              +RY           HN+  +H    + +  +P S   A  QT+ N + E         
Sbjct: 71   KGVRYRPPSRTQDHHLHNSNLSHPSSGVGANGAPNSPPKAQAQTENNHHHEMPKRSETTS 130

Query: 2029 NRGGDVLLQWGQNKRSRGSRAENRVMVMTDDQSAVQSRQVI--KIQRRVEKQQQGIMKXX 2202
               GDVLLQWGQ KR+R SR+E R +      S V  RQ I  K+ RRV           
Sbjct: 131  PNRGDVLLQWGQKKRARVSRSEIRPLADDSSSSTVPGRQPIGNKVPRRV----------- 179

Query: 2203 XXXXXXXXXXXXXXXXXXXXXYSRSGNPRSSPSIRETTGSHL------NRNLEDRSVXXX 2364
                                 ++    P  +P       S L      +RNL++RS    
Sbjct: 180  --------------------LHATMPPPPPAPPSNSARCSTLRNGLLSSRNLDERSAAAS 219

Query: 2365 XXXXXXXXXXXXXXVARSSDKRSP-----DKTENKVGSC-------SVVSTTNMGEKQNG 2508
                              + K+SP     D+ +   GS        S V T  M +    
Sbjct: 220  GSPSRNSGGTSRAASRAMAGKKSPPLETIDRKKLCAGSVKDGQQNGSAVQTDRMNQTDYA 279

Query: 2509 SSMAQQADLIMSHHVESTTPVPSDVVNGEKVNLDMFEWPRIYISLSRKEKEDDFLAMKGT 2688
               +++A         +     S    GEKVN+++ EWPRIYISLSRKEKE+DFLAMKGT
Sbjct: 280  PVQSERAG-------GAANSTASAAGVGEKVNVEVIEWPRIYISLSRKEKEEDFLAMKGT 332

Query: 2689 KLPQRPKKRAKNVDKTLQYCFPGMWLSDLTRGRYEVREKKCVKKQKRRGLKGM 2847
            KLPQRPKKRAKNVD+TLQYCFPGMWLSDLT+ RYEVREKK  KKQKR+GLKGM
Sbjct: 333  KLPQRPKKRAKNVDRTLQYCFPGMWLSDLTKSRYEVREKKSAKKQKRKGLKGM 385


>ref|XP_006468614.1| PREDICTED: AAC-rich mRNA clone AAC11 protein-like [Citrus sinensis]
          Length = 374

 Score =  223 bits (567), Expect = 6e-55
 Identities = 161/400 (40%), Positives = 192/400 (48%), Gaps = 15/400 (3%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSN-GRKSNVRTCKEEDDNGENGRIPNYSSTSSFEGKSVLRYXXXXXXX 1893
            R+QRVSPD  PLSN G KS+ R C+EE   GENG     ++T+  E     R        
Sbjct: 3    RYQRVSPDYNPLSNNGSKSSFR-CREE---GENGTKTVLTTTNGIESGFRFRSPSKPPPP 58

Query: 1894 XXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQ--WGQN 2067
                 NN T H+        S N+ N  T                 RGGDVLLQ  WG  
Sbjct: 59   PPSQENNNTNHY------NHSINNNNNTTTTASASPLSDHNKLSNGRGGDVLLQLQWGHK 112

Query: 2068 KRSRGSRAENRVMVMTDDQSAVQSRQVIKIQRRVEKQQQGIMKXXXXXXXXXXXXXXXXX 2247
            KR+R SR E R + + DD S+          RR                           
Sbjct: 113  KRARLSRTEIRSLSVNDDSSSSSP-----FHRRAGSP------------FIDKPPPPPPH 155

Query: 2248 XXXXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSS-D 2424
                   S S  P S+   ++++G   NRN+EDR                   V+RS+  
Sbjct: 156  FHPSNATSTSTRP-SNLRTKDSSGFINNRNIEDRPAAANGSPSRNAAGDSSRGVSRSAVG 214

Query: 2425 KRSPDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSHHVE-----------STTPV 2571
            KRSP  +E           T  G      S A    +                  S T V
Sbjct: 215  KRSPHSSEKLDKKIRKDENTTNGSITRADSAATTNPVRSEQETRAGAVAAGAAGTSNTVV 274

Query: 2572 PSDVVNGEKVNLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCF 2751
               V   +KVN ++ EWP+IY++LSRKEKEDDFLAMKGTKLP RPKKRAKN+D+TLQYCF
Sbjct: 275  SVSVSVAQKVNAEVIEWPKIYVALSRKEKEDDFLAMKGTKLPHRPKKRAKNIDRTLQYCF 334

Query: 2752 PGMWLSDLTRGRYEVREKKCVKKQKRRGLKGMESMESDSE 2871
            PGMWLSDLT+ RYEVREKK VKKQKRRGLKGMESMESDSE
Sbjct: 335  PGMWLSDLTKSRYEVREKKSVKKQKRRGLKGMESMESDSE 374


>ref|XP_006448557.1| hypothetical protein CICLE_v10015606mg [Citrus clementina]
            gi|557551168|gb|ESR61797.1| hypothetical protein
            CICLE_v10015606mg [Citrus clementina]
          Length = 380

 Score =  219 bits (558), Expect = 6e-54
 Identities = 159/402 (39%), Positives = 194/402 (48%), Gaps = 17/402 (4%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSN-GRKSNVRTCKEEDDNGENGRIPNYSSTSSFEGKSVLRYXXXXXXX 1893
            R+QRVSPD  PLSN G KS+ R C+EE   GENG     ++T+  E     R        
Sbjct: 3    RYQRVSPDYNPLSNNGSKSSFR-CREE---GENGTKTVLTTTNGIESGFRFRSPSKPPPP 58

Query: 1894 XXX--DHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQ--WG 2061
                   NN T H+  + S   + N+ N  T                 RGGDVLLQ  WG
Sbjct: 59   PPPPSQENNNTNHY--NHSINNNNNNNNNTTTTASASPLSDHNKLSNGRGGDVLLQLQWG 116

Query: 2062 QNKRSRGSRAENRVMVMTDDQSAVQSRQVIKIQRRVEKQQQGIMKXXXXXXXXXXXXXXX 2241
              KR+R SR E R + + DD S+          RR                         
Sbjct: 117  HKKRARLSRTEIRSLSVNDDSSSSSP-----FHRRAGSP------------FIDKPPPPP 159

Query: 2242 XXXXXXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSS 2421
                     S S  P S+   ++++G   NRN+EDR                   V+RS+
Sbjct: 160  PHSHPSNATSTSTRP-SNLRTKDSSGFINNRNIEDRPAAANGSPSRNAAGDSSRGVSRSA 218

Query: 2422 -DKRSPDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSHHVESTTPVPSDVVNG-- 2592
              KRSP  +E           T  G      S A    +       +     + V  G  
Sbjct: 219  VGKRSPHSSEKLDKKIRKDENTTNGSITRADSAATTTPVRSEQETRAGAGAVAAVAAGTS 278

Query: 2593 ---------EKVNLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQY 2745
                     +KVN ++ EWP+IY++LSRKEKEDDFLAMKGTKLP RPKKRAKN+D+TLQY
Sbjct: 279  NTVVSVSVAQKVNAEVIEWPKIYVALSRKEKEDDFLAMKGTKLPHRPKKRAKNIDRTLQY 338

Query: 2746 CFPGMWLSDLTRGRYEVREKKCVKKQKRRGLKGMESMESDSE 2871
            CFPGMWLSDLT+ RYEVREKK VKKQKRRGLKGMESMESDSE
Sbjct: 339  CFPGMWLSDLTKSRYEVREKKSVKKQKRRGLKGMESMESDSE 380


>gb|EXB56441.1| hypothetical protein L484_009867 [Morus notabilis]
          Length = 373

 Score =  219 bits (557), Expect = 8e-54
 Identities = 151/383 (39%), Positives = 198/383 (51%), Gaps = 14/383 (3%)
 Frame = +1

Query: 1714 NRFQRVSPDCLPLSNGRKSNVRTCKEEDDNGENGRIPNYSSTSSFEGKSVLRYXXXXXXX 1893
            +++QRVSPDCLPLSNG+K N          G    I   SS+SSFE +S           
Sbjct: 21   SQYQRVSPDCLPLSNGKKPN----------GVENAIT--SSSSSFEQQSKSFRFRSPSRT 68

Query: 1894 XXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQWGQNKR 2073
               DH++   H     S+  + N+ N     +             + GGD+LLQWG  KR
Sbjct: 69   TTQDHHHSNHHQ--HTSTFDNNNNNNNNNHFHHESSLSPSPSPSPSHGGDILLQWGHKKR 126

Query: 2074 SRGSRAENRVMVMTDDQSAVQSRQVIKIQRRVEKQQQGIMKXXXXXXXXXXXXXXXXXXX 2253
            SR SR E R +  TDD S+  S +  + Q+ ++ Q++ +                     
Sbjct: 127  SRVSRTEIRAL--TDDSSSSSSAKQQQPQQALKPQRRVV---------GPTTAMPPPPPP 175

Query: 2254 XXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRS-VXXXXXXXXXXXXXXXXXVARSSDKR 2430
                 S S N R+    ++++GSH  RNLEDRS V                  + +  KR
Sbjct: 176  PPPLLSSSSNGRAR---KDSSGSHPGRNLEDRSGVVNGSPSRNYAGNNRAASRSTAGGKR 232

Query: 2431 SPDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSHHVESTT-------------PV 2571
            SP   +N+  + S   + N  +K NGSS       + S+H +S +             P 
Sbjct: 233  SPQPEKNERKNFSSGRSAN--DKPNGSSTP-----VRSNHNDSASLRTEQEGGATHANPA 285

Query: 2572 PSDVVNGEKVNLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCF 2751
            P +    EKVN++M EWPRI+I+LSRKEKEDDFL MKGTKLPQRPKKRAKN+D+ LQYCF
Sbjct: 286  PKE----EKVNVEMMEWPRIHIALSRKEKEDDFLVMKGTKLPQRPKKRAKNIDRALQYCF 341

Query: 2752 PGMWLSDLTRGRYEVREKKCVKK 2820
            PGMWLSDLTR RYEVREKKCVKK
Sbjct: 342  PGMWLSDLTRNRYEVREKKCVKK 364


>ref|XP_007040963.1| Uncharacterized protein isoform 4, partial [Theobroma cacao]
            gi|508704898|gb|EOX96794.1| Uncharacterized protein
            isoform 4, partial [Theobroma cacao]
          Length = 347

 Score =  215 bits (547), Expect = 1e-52
 Identities = 147/382 (38%), Positives = 186/382 (48%), Gaps = 23/382 (6%)
 Frame = +1

Query: 1795 DDNGENGRIPNYSSTSSFEGKSVLRYXXXXXXXXXXDHNNFTAH---FIFSESSPQSENH 1965
            + N ENGR  +    ++FEG   +RY           HN+  +H    + +  +P S   
Sbjct: 4    NSNIENGRCISKDIITAFEGAKGVRYRPPSRTQDHHLHNSNLSHPSSGVGANGAPNSPPK 63

Query: 1966 ANTQTQKNQNREXXXXXXXXXNRGGDVLLQWGQNKRSRGSRAENRVMVMTDDQSAVQSRQ 2145
            A  QT+ N + E            GDVLLQWGQ KR+R SR+E R +      S V  RQ
Sbjct: 64   AQAQTENNHHHEMPKRSETTSPNRGDVLLQWGQKKRARVSRSEIRPLADDSSSSTVPGRQ 123

Query: 2146 VI--KIQRRVEKQQQGIMKXXXXXXXXXXXXXXXXXXXXXXXYSRSGNPRSSPSIRETTG 2319
             I  K+ RRV                                ++    P  +P       
Sbjct: 124  PIGNKVPRRV-------------------------------LHATMPPPPPAPPSNSARC 152

Query: 2320 SHL------NRNLEDRSVXXXXXXXXXXXXXXXXXVARSSDKRSP-----DKTENKVGSC 2466
            S L      +RNL++RS                      + K+SP     D+ +   GS 
Sbjct: 153  STLRNGLLSSRNLDERSAAASGSPSRNSGGTSRAASRAMAGKKSPPLETIDRKKLCAGSV 212

Query: 2467 -------SVVSTTNMGEKQNGSSMAQQADLIMSHHVESTTPVPSDVVNGEKVNLDMFEWP 2625
                   S V T  M +       +++A         +     S    GEKVN+++ EWP
Sbjct: 213  KDGQQNGSAVQTDRMNQTDYAPVQSERAG-------GAANSTASAAGVGEKVNVEVIEWP 265

Query: 2626 RIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCFPGMWLSDLTRGRYEVREK 2805
            RIYISLSRKEKE+DFLAMKGTKLPQRPKKRAKNVD+TLQYCFPGMWLSDLT+ RYEVREK
Sbjct: 266  RIYISLSRKEKEEDFLAMKGTKLPQRPKKRAKNVDRTLQYCFPGMWLSDLTKSRYEVREK 325

Query: 2806 KCVKKQKRRGLKGMESMESDSE 2871
            K  KKQKR+GLKGME +ESDSE
Sbjct: 326  KSAKKQKRKGLKGMECVESDSE 347


>ref|XP_006284024.1| hypothetical protein CARUB_v10005146mg [Capsella rubella]
            gi|482552729|gb|EOA16922.1| hypothetical protein
            CARUB_v10005146mg [Capsella rubella]
          Length = 352

 Score =  210 bits (535), Expect = 3e-51
 Identities = 152/395 (38%), Positives = 196/395 (49%), Gaps = 10/395 (2%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNG-RKSNVRTCKEEDDNGENGRIPNYSSTSSFEGKSVLRYXXXXXXX 1893
            R+QRVSPDCLPL+NG +K  +R       N +N      ++T+S   +            
Sbjct: 3    RYQRVSPDCLPLTNGSKKPYLRPSPSRSTNEDNTTTVITTTTTSIAARGF---------- 52

Query: 1894 XXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQWGQNKR 2073
               +           +  P+     + Q Q+ Q +E          RGGDVLLQWGQ KR
Sbjct: 53   ---NAGGSCTTTSSLDGVPKGFRFRSIQQQQQQQQEQDPSPS---RRGGDVLLQWGQRKR 106

Query: 2074 SRGSRAENRVMVMTDDQSAVQSRQVIKIQRRVEKQQQGIMKXXXXXXXXXXXXXXXXXXX 2253
            SR SRAE R  +  DD S+   +  I+  + V +     M                    
Sbjct: 107  SRASRAEIR-SITADDSSSSSGQGKIQPNKLVRRSVNPSMPPPPPAPPVFS--------- 156

Query: 2254 XXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSSDKRS 2433
                 SRS NPR+   + + +    +RNLEDRS                          S
Sbjct: 157  -----SRSTNPRNGFVVGKES-FFPSRNLEDRSANGSPSRNNINGRMMSRSGGSKRSPPS 210

Query: 2434 PDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSHHV-------ESTTPVPSDV-VN 2589
            PD+ E +       S+    ++QNG    QQ      HH        E+T     +V VN
Sbjct: 211  PDQIEKR-------SSVLRDQRQNGFDHQQQ-----QHHQHQRVNRSETTGQGHQEVEVN 258

Query: 2590 GEKVNLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCFPGMWLS 2769
            GE+      EWPRIYI+LSRKEKE+DFL MKGTKLP RP+KRAKN+DK LQ+CFPGMWLS
Sbjct: 259  GEREKATQ-EWPRIYIALSRKEKEEDFLVMKGTKLPHRPRKRAKNIDKALQFCFPGMWLS 317

Query: 2770 DLTRGRYEVREKKCVKKQ-KRRGLKGMESMESDSE 2871
            DLT+ RYEVREKK VKKQ KRRGLKGME++++DSE
Sbjct: 318  DLTKNRYEVREKKNVKKQPKRRGLKGMENLDTDSE 352


>ref|NP_001043875.1| Os01g0680700 [Oryza sativa Japonica Group]
            gi|56202291|dbj|BAD73750.1| unknown protein [Oryza sativa
            Japonica Group] gi|113533406|dbj|BAF05789.1| Os01g0680700
            [Oryza sativa Japonica Group]
            gi|215704813|dbj|BAG94841.1| unnamed protein product
            [Oryza sativa Japonica Group]
          Length = 384

 Score =  209 bits (532), Expect = 6e-51
 Identities = 152/410 (37%), Positives = 195/410 (47%), Gaps = 25/410 (6%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNG---------RKSNVRTCKEEDDN----GENGRIPNYSSTSSFEGK 1857
            R+QR+SPDCLPL+NG         RK   R+CK++D       ++ R+ +Y  +S  + K
Sbjct: 32   RYQRLSPDCLPLANGGGGGSGSVTRKPASRSCKDDDGGMAVAADSSRLSSYLPSSQLDSK 91

Query: 1858 SVLRYXXXXXXXXXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRG 2037
             +               ++  A +  +     + ++ +     + + +           G
Sbjct: 92   PL-------RARAPQPSSSSAAAWSPARDHAHAHHNHHHHHHPSDSSDTASPSSNGAGTG 144

Query: 2038 GDVLLQWGQNKRSRGSRAENRVMVMTDDQSAVQSRQVI----KIQRRVEKQQQGIMKXXX 2205
            GDVLLQWG NKRSR  R  +         S  Q RQ      KI RR     + +M    
Sbjct: 145  GDVLLQWGHNKRSRCRRDASSSANAAPSSS--QRRQTASAAGKILRRSSAPAEKLMPPPP 202

Query: 2206 XXXXXXXXXXXXXXXXXXXXYSRSGNPRSSPSIRETTGS--------HLNRNLEDRSVXX 2361
                                Y+R  N RS+ S    + +        H    +E+RS   
Sbjct: 203  PSTTTGS-------------YTRGSNLRSASSFPTRSAAAAAVGDAHHHRSAVEERS--- 246

Query: 2362 XXXXXXXXXXXXXXXVARSSDKRSPDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIM 2541
                                 KRSPDK        ++ +  +M  K N            
Sbjct: 247  -----------------GGGYKRSPDKAHKS----ALDAALHMDSKNNHHH--------- 276

Query: 2542 SHHVESTTPVPSDVVNGEKVNLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAK 2721
             HH +S+         GEK+  + FE PRIYISLSRKEKEDDFL MKGTKLPQRPKKRAK
Sbjct: 277  -HHHDSSVTANGGAGAGEKIGSERFELPRIYISLSRKEKEDDFLIMKGTKLPQRPKKRAK 335

Query: 2722 NVDKTLQYCFPGMWLSDLTRGRYEVREKKCVKKQKRRGLKGMESMESDSE 2871
            NVDKTLQY FPGMWLSDLTRGRYEVREKKCVKK +RRGLKGMESM+SDSE
Sbjct: 336  NVDKTLQYVFPGMWLSDLTRGRYEVREKKCVKK-RRRGLKGMESMDSDSE 384


>ref|NP_567597.2| uncharacterized protein [Arabidopsis thaliana]
            gi|20466570|gb|AAM20602.1| putative protein [Arabidopsis
            thaliana] gi|23198140|gb|AAN15597.1| putative protein
            [Arabidopsis thaliana] gi|332658901|gb|AEE84301.1|
            uncharacterized protein AT4G20300 [Arabidopsis thaliana]
          Length = 352

 Score =  209 bits (532), Expect = 6e-51
 Identities = 150/391 (38%), Positives = 192/391 (49%), Gaps = 6/391 (1%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNGRKSNVRTCKEEDDNGENGRIPNYSSTSSFEGKSVLRYXXXXXXXX 1896
            R+QRVSPDCLPL+NG K             E+       +T+S  G+             
Sbjct: 3    RYQRVSPDCLPLTNGGKKPYLRPSPSRATNEDTTTTTVITTTSIAGRGF------NGGSC 56

Query: 1897 XXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQWGQNKRS 2076
                NN ++     +  P+     +TQ Q+ Q             RGGDVLLQWGQ KRS
Sbjct: 57   TTTTNNTSS----LDGVPKGFRFRSTQQQQQQQ------DPSPSRRGGDVLLQWGQRKRS 106

Query: 2077 RGSRAENR----VMVMTDDQSAVQSRQVIKIQRRVEKQQQGIMKXXXXXXXXXXXXXXXX 2244
            R SRAE R    +    DD S+   +  I+  +   +     M                 
Sbjct: 107  RASRAEIRSTTTITTTADDSSSSSGQGKIQSNKPQRRSMNPSMPPPPPAPPIFS------ 160

Query: 2245 XXXXXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSSD 2424
                     RS NPR+   I + +    +RNLEDRS                        
Sbjct: 161  --------GRSTNPRNGFVIGKES-FFPSRNLEDRSANGSPSRNNINGRMISRSGGSKRS 211

Query: 2425 KRSPDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSHHVESTTPVPSDV-VNGEKV 2601
              SPD+ E +        ++   ++QNG    QQ    ++   EST     +V +NGE+ 
Sbjct: 212  PPSPDQIEKR--------SSVRDQRQNGFDHQQQQHQRVNRS-ESTAQGHQEVEINGERE 262

Query: 2602 NLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCFPGMWLSDLTR 2781
                 EWPRIYI+LSRKEKE+DFL MKGTKLP RP+KRAKN+DK LQ+CFPGMWLSDLT+
Sbjct: 263  KATQ-EWPRIYIALSRKEKEEDFLVMKGTKLPHRPRKRAKNIDKALQFCFPGMWLSDLTK 321

Query: 2782 GRYEVREKKCVKK-QKRRGLKGMESMESDSE 2871
             RYEVREKK VKK QKRRGLKGME+M++DSE
Sbjct: 322  NRYEVREKKNVKKQQKRRGLKGMENMDTDSE 352


>ref|XP_002869941.1| hypothetical protein ARALYDRAFT_914630 [Arabidopsis lyrata subsp.
            lyrata] gi|297315777|gb|EFH46200.1| hypothetical protein
            ARALYDRAFT_914630 [Arabidopsis lyrata subsp. lyrata]
          Length = 351

 Score =  204 bits (519), Expect = 2e-49
 Identities = 146/391 (37%), Positives = 187/391 (47%), Gaps = 6/391 (1%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNGRKSNVRTCKEEDDNGENGRIPNYSSTSSFEGKSVLRYXXXXXXXX 1896
            R+QRVSPDCLPL+NG K             E+       +T+S  G+             
Sbjct: 3    RYQRVSPDCLPLTNGGKKPYLRPSPSRATNEDTTTTTVITTTSIAGRGF----------- 51

Query: 1897 XXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQWGQNKRS 2076
                   T +    +  P+     +TQ Q+ Q+            RGGDVLLQWGQ KRS
Sbjct: 52   NGGSCTTTTNTSSLDGVPKGFRFRSTQQQQQQDPSPS-------RRGGDVLLQWGQRKRS 104

Query: 2077 RGSRAENRVMVMT---DDQSAVQSRQVIKIQRRVEKQQQGIMKXXXXXXXXXXXXXXXXX 2247
            R SRAE R    T   DD S+   +  I+  +   +     M                  
Sbjct: 105  RASRAEIRSTTTTTTADDSSSSSGQGKIQSSKLQRRSMNPSMPPPPPAPPIFS------- 157

Query: 2248 XXXXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSSDK 2427
                    RS NPR+   I + +    +RNLEDRS                         
Sbjct: 158  -------GRSTNPRNGFVIGKES-FFPSRNLEDRSANGSPSRNNINGRMISRSGGSKRSP 209

Query: 2428 RSPDKTENKVGSCSVVSTTNMGEKQNGSSMA--QQADLIMSHHVESTTPVPSDVVNGEKV 2601
             SPD+ E +        ++    +QNG      QQ    ++    +    P   +NGE+ 
Sbjct: 210  PSPDQIEKR--------SSVRDHRQNGFDHHHHQQQHQRVNRSESTAQGHPEVEINGERE 261

Query: 2602 NLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCFPGMWLSDLTR 2781
                 EWPRIYI+LSRKEKE+DFL MKGTKLP RP+KRAKN+DK LQ+CFPGMWLSDLT+
Sbjct: 262  KATQ-EWPRIYIALSRKEKEEDFLVMKGTKLPHRPRKRAKNIDKALQFCFPGMWLSDLTK 320

Query: 2782 GRYEVREKKCVKK-QKRRGLKGMESMESDSE 2871
             RYEVREKK VKK QKRRGLKGME++++DSE
Sbjct: 321  NRYEVREKKNVKKQQKRRGLKGMENLDTDSE 351


>ref|XP_003569564.1| PREDICTED: uncharacterized protein LOC100838590 [Brachypodium
            distachyon]
          Length = 363

 Score =  202 bits (514), Expect = 8e-49
 Identities = 153/396 (38%), Positives = 196/396 (49%), Gaps = 11/396 (2%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNG------RKSNVRTCKEEDDNGENG-RIPNYSSTSSFEGKSVLRYX 1875
            R+QR+SPDCLPL+NG      RK   R+  ++DD   +G R+ +Y + S  +  S     
Sbjct: 32   RYQRLSPDCLPLANGGGSGVARKPASRSSFKDDDAATDGSRLASYLAASQPDSSS----- 86

Query: 1876 XXXXXXXXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQ 2055
                          TA       SP + +HA+  +    +           +  GDVLLQ
Sbjct: 87   -KPARARAPPPQTATAR------SP-ARDHAHGDSSDTASPS---------SNAGDVLLQ 129

Query: 2056 WGQNKRSRGSR-AENRVMVMTDDQSAVQSRQVI-KIQRRVEKQQQGIMKXXXXXXXXXXX 2229
            WG NKRSR  R A +        Q  + S  V  KIQRR     + +M            
Sbjct: 130  WGHNKRSRCRRDASSSSSAAPSPQRRLSSGGVNGKIQRRASAPTEKLMPPPPAATAI--- 186

Query: 2230 XXXXXXXXXXXXYSRSGNPRSSPSI-RETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXX 2406
                         +R  N RS+ S      G   N +  +RSV                 
Sbjct: 187  -------------TRGSNLRSASSFPARAAGGDANHHGNNRSVEERSGG----------- 222

Query: 2407 VARSSDKRSPDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSHHVESTTPVPSDVV 2586
               +  ++SPDK  +K  +   +   N     +             HH  S +P+ ++  
Sbjct: 223  ---AQKRQSPDKAHSKAAAVDHMDPKNSNNHHHPY-----------HHHNSDSPLVANGG 268

Query: 2587 NGEKVN-LDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCFPGMW 2763
             GEK+  ++ FE PRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDK+LQ+ FPGMW
Sbjct: 269  GGEKLGAVERFELPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKSLQFVFPGMW 328

Query: 2764 LSDLTRGRYEVREKKCVKKQKRRGLKGMESMESDSE 2871
            LSDLTR RYEVREKKCVKK +RRGLKGMESM+SDSE
Sbjct: 329  LSDLTRSRYEVREKKCVKK-RRRGLKGMESMDSDSE 363


>ref|XP_006398083.1| hypothetical protein EUTSA_v10000929mg [Eutrema salsugineum]
            gi|557099172|gb|ESQ39536.1| hypothetical protein
            EUTSA_v10000929mg [Eutrema salsugineum]
          Length = 372

 Score =  201 bits (511), Expect = 2e-48
 Identities = 154/399 (38%), Positives = 205/399 (51%), Gaps = 14/399 (3%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNGRKSNVRTCKEED-DNGENGRIPNYSSTSSFEGKSVLRYXXXXXXX 1893
            R+QRVSPD LPL+N +K  +R       DNG        ++T++     V R+       
Sbjct: 3    RYQRVSPDYLPLTNTKKPYLRPSPSRSIDNGG-------TATTAAISTGVGRFNGTSTTI 55

Query: 1894 XXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQWGQNKR 2073
               + +     F F  +S  +   A  Q Q+                GGD LLQWGQ KR
Sbjct: 56   SSSNLDGVPKGFRFRSTSITT---ATQQQQEEDLSHDSTTNPSGSGGGGDGLLQWGQRKR 112

Query: 2074 SRGSRAENR-VMVMTDDQSAVQSRQVIKIQRRVEKQQQGIMKXXXXXXXXXXXXXXXXXX 2250
            SR SR E R V V   D S+  S Q +    R++++   ++                   
Sbjct: 113  SRASRTEIRSVSVAAADDSSSSSGQNLIQSNRIQRRSTNLIMPPPSLSSSPLCGGG---- 168

Query: 2251 XXXXXYSRSGNPRSSPSI-RETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSSD- 2424
                   RS NPRS   I +E++     R+LEDRSV                 V+RS++ 
Sbjct: 169  ------GRSTNPRSGFVIGKESSRFVPTRHLEDRSVTGSPSRNIGVSGGRM--VSRSANG 220

Query: 2425 --KRS---PDKTENKVGSCSVVSTTNMGEKQNGSSMAQQADLIMSHHVESTTPVPSDVV- 2586
              KRS   P+KTE +          +  ++QNG     Q      +  EST  + S++  
Sbjct: 221  GLKRSTPSPEKTETRSNG---KDHHHHHQRQNGLDNHHQR----MNRSESTAQIHSEIET 273

Query: 2587 -NGEKVN--LDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCFPG 2757
             NGEK    ++  EWPRIYI+LSRKEKE+DFLAMKGTKLP RP+KRAKN+DK LQ+CFPG
Sbjct: 274  NNGEKTTTQVEFKEWPRIYIALSRKEKEEDFLAMKGTKLPHRPRKRAKNIDKGLQFCFPG 333

Query: 2758 MWLSDLTRGRYEVREKKCVKK-QKRRGLKGMESMESDSE 2871
            M++SDL + RYEVREKK  KK QKRRGLKGME+++SDSE
Sbjct: 334  MYMSDLNKSRYEVREKKSAKKQQKRRGLKGMENLDSDSE 372


>ref|XP_003628966.1| hypothetical protein MTR_8g070650 [Medicago truncatula]
            gi|355522988|gb|AET03442.1| hypothetical protein
            MTR_8g070650 [Medicago truncatula]
          Length = 253

 Score =  199 bits (506), Expect = 7e-48
 Identities = 131/316 (41%), Positives = 173/316 (54%), Gaps = 5/316 (1%)
 Frame = +1

Query: 1939 ESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQWGQNKRSRGSRAENRVMVMTD 2118
            +SSPQ+ +H+N+    + +             GGDVLL+WGQ KRSR SR      ++ D
Sbjct: 5    DSSPQT-SHSNSNNSTSPS-----------GGGGDVLLKWGQRKRSRVSRT-----LIED 47

Query: 2119 DQSAVQSRQVIKIQRRVEKQQQGIMKXXXXXXXXXXXXXXXXXXXXXXXYSRSGNPRSSP 2298
              S+V + Q  K   +                                 +S +  P   P
Sbjct: 48   SSSSVHTNQRKKFPTK---------------------------------FSSASMPPPPP 74

Query: 2299 SIRETTGS----HLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSSDKRSPDKTENKVGSC 2466
             +  + G     ++ RNLED S                  +A+ +   S  +  NK   C
Sbjct: 75   LVSASNGRGRKHNIPRNLEDPS------EPSRMNQNVSRSIAQKNSTPSCMEKSNKRMPC 128

Query: 2467 SVVSTTNMGEKQNGSSMAQQADLIMSHHVESTTPVPSDVVNGEKVNLDMFEWPRIYISLS 2646
            S  S +   +K NGSS  Q  + + ++H ++         NGEKV++++ EWP+IYI+LS
Sbjct: 129  S--SGSAKCKKPNGSSTKQATEKLNNNHGDT---------NGEKVSVEVIEWPKIYIALS 177

Query: 2647 RKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCFPGMWLSDLTRGRYEVREKKCVKKQK 2826
            RKEKEDDFLAMKGTK+PQRPKKRAKN+DKTLQYCFPGMWLSDL++ RYEVREKK VKKQK
Sbjct: 178  RKEKEDDFLAMKGTKIPQRPKKRAKNIDKTLQYCFPGMWLSDLSKSRYEVREKKSVKKQK 237

Query: 2827 R-RGLKGMESMESDSE 2871
            R RGLKGMES+ESDSE
Sbjct: 238  RCRGLKGMESLESDSE 253


>ref|XP_006413885.1| hypothetical protein EUTSA_v10025632mg [Eutrema salsugineum]
            gi|557115055|gb|ESQ55338.1| hypothetical protein
            EUTSA_v10025632mg [Eutrema salsugineum]
          Length = 344

 Score =  197 bits (501), Expect = 3e-47
 Identities = 149/394 (37%), Positives = 191/394 (48%), Gaps = 9/394 (2%)
 Frame = +1

Query: 1717 RFQRVSPDCLPLSNG-RKSNVRTCKEEDDNGENGRIPNYSSTSSFEGKSVLRYXXXXXXX 1893
            R+QRVSPDCLPL+NG +K N+R            R  N  + + F G S           
Sbjct: 3    RYQRVSPDCLPLTNGGKKPNLRPSPS--------RASNEVARTEFNGGSCTT-------- 46

Query: 1894 XXXDHNNFTAHFIFSESSPQSENHANTQTQKNQNREXXXXXXXXXNRGGDVLLQWGQNKR 2073
                    T      +  P+     +TQ Q                 GGDVLLQWGQ KR
Sbjct: 47   --------TTTSSSLDGVPKGFRFRSTQQQDPSPSRRG---------GGDVLLQWGQRKR 89

Query: 2074 SRGSRAENRVMVMTDDQSAVQSRQVIKIQRRVEKQQQGIMKXXXXXXXXXXXXXXXXXXX 2253
            SR SRAE R      D S+  S Q             G M+                   
Sbjct: 90   SRISRAEIRSTTAAADDSSSSSGQ-------------GKMQSSKSLRRSVNPSMPPPAPP 136

Query: 2254 XXXXYSRSGNPRSSPSIRETTGSHLNRNLEDRSVXXXXXXXXXXXXXXXXXVARSSDKRS 2433
                  RS   R+     +     L+RNLEDRS                  V++ S   S
Sbjct: 137  PPVFSGRSAKVRNG--FVDGKEFLLSRNLEDRSANGSPSRNTNGRMVSRSAVSKRSPP-S 193

Query: 2434 PDKTENKVGSCSVVSTTNMGEKQNG---SSMAQQADLIMSHHVESTTP-VPSDVVNGEK- 2598
            PD+ E +    S+    +  ++QNG   + + Q   +  S       P +  D  +GE+ 
Sbjct: 194  PDQIEKR---SSIRDHHHHNQRQNGFDHNHLQQHQRVNRSESTAQAHPELERDNNSGERE 250

Query: 2599 --VNLDMFEWPRIYISLSRKEKEDDFLAMKGTKLPQRPKKRAKNVDKTLQYCFPGMWLSD 2772
               ++++ EWPRIYI+LSRKEKE+DFL MKGTKLP RP+KRAKN+DK+LQYCFPGMWLSD
Sbjct: 251  KATHVEVVEWPRIYIALSRKEKEEDFLVMKGTKLPHRPRKRAKNIDKSLQYCFPGMWLSD 310

Query: 2773 LTRGRYEVREKKCVKK-QKRRGLKGMESMESDSE 2871
            LT+ RYEVREKK VKK QKRRGLKGME++++DSE
Sbjct: 311  LTKNRYEVREKKNVKKQQKRRGLKGMENLDTDSE 344


Top