BLASTX nr result

ID: Akebia23_contig00006143 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia23_contig00006143
         (1321 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_006494894.1| PREDICTED: uncharacterized protein LOC102607...   329   2e-87
ref|XP_006424090.1| hypothetical protein CICLE_v10028702mg [Citr...   328   2e-87
ref|XP_006369380.1| hypothetical protein POPTR_0001s22420g [Popu...   328   3e-87
ref|XP_006494893.1| PREDICTED: uncharacterized protein LOC102607...   327   6e-87
ref|XP_002299772.1| hypothetical protein POPTR_0001s22420g [Popu...   325   3e-86
gb|ABK96612.1| unknown [Populus trichocarpa x Populus deltoides]      325   4e-86
ref|XP_004291672.1| PREDICTED: uncharacterized protein LOC101308...   324   6e-86
gb|EXB39659.1| hypothetical protein L484_017132 [Morus notabilis]     319   2e-84
ref|XP_004241335.1| PREDICTED: uncharacterized protein LOC101256...   317   7e-84
ref|NP_001242895.1| uncharacterized protein LOC100817151 [Glycin...   317   1e-83
ref|XP_007015668.1| Uncharacterized protein isoform 2 [Theobroma...   315   3e-83
ref|XP_007132498.1| hypothetical protein PHAVU_011G099200g [Phas...   313   1e-82
ref|XP_006361151.1| PREDICTED: uncharacterized protein LOC102595...   311   3e-82
ref|XP_004139076.1| PREDICTED: uncharacterized protein LOC101203...   305   2e-80
ref|XP_004154660.1| PREDICTED: uncharacterized protein LOC101228...   305   4e-80
ref|XP_003539850.1| PREDICTED: R3H and coiled-coil domain-contai...   303   9e-80
ref|XP_002513720.1| conserved hypothetical protein [Ricinus comm...   298   5e-78
ref|XP_007015667.1| Uncharacterized protein isoform 1 [Theobroma...   295   3e-77
ref|XP_006289362.1| hypothetical protein CARUB_v10002848mg [Caps...   281   3e-73
ref|XP_002874049.1| predicted protein [Arabidopsis lyrata subsp....   273   2e-70

>ref|XP_006494894.1| PREDICTED: uncharacterized protein LOC102607047 isoform X2 [Citrus
            sinensis]
          Length = 347

 Score =  329 bits (843), Expect = 2e-87
 Identities = 194/358 (54%), Positives = 237/358 (66%), Gaps = 15/358 (4%)
 Frame = +2

Query: 38   EAEDSWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPS-----DLQLASALTDLANL 202
            + E +WS+AVEDL++ G+ E AISLLES ISKLE +  S       +LQLASALT+LANL
Sbjct: 8    QEETNWSEAVEDLVEAGNTEAAISLLESTISKLEKIEQSQPTKESLNLQLASALTNLANL 67

Query: 203  YSSRGFSLKSDELRTRAFLIKQSSQSNQPIHPPLRDSEIVKKVDSVKNEVSTXXXXXXXX 382
            YSS GFSLKSD L +RAF I+ ++ +        +DS+   K D+ +N  ST        
Sbjct: 68   YSSNGFSLKSDHLLSRAFQIRDAAANIAK-----KDSKDTSK-DTARNSSSTDD------ 115

Query: 383  XXXXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGF--QTPKRRGRGAFLYMKNGLYS 556
                   WE +ADR  +ELLS Q   E+SKLSL DTG   Q PKRRGRG F Y KN LYS
Sbjct: 116  -------WEAMADRDPDELLSSQGLPEVSKLSLEDTGVKVQAPKRRGRGTFSYKKNELYS 168

Query: 557  DQQPNLTASDNSD--------DEEKTEIGNSRFGASHVLVLADFPPRTTTTELEKLFENF 712
            D Q + +  ++++         E KTE+ +S +G  HVLVLADF P T TT+LEKLFE+F
Sbjct: 169  DWQDDKSIVEDAEVDDDSSLSSESKTELRHSNYGTRHVLVLADFSPSTRTTDLEKLFEDF 228

Query: 713  RERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXXEPPY 892
            R+RGV IRW+NDT ALAVFRTP+IALEAR+ I   F +R+L E+            EPP 
Sbjct: 229  RDRGVSIRWINDTTALAVFRTPAIALEARNHIQLPFKMRILDEDDIILASVSPRDLEPPR 288

Query: 893  PRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWGADE 1066
             RP+TSARTAQRLIAQ MG KL  TTFGS EL+ QEEARRNRI TRQ LRD+AWG D+
Sbjct: 289  QRPQTSARTAQRLIAQSMGLKL-PTTFGSKELKNQEEARRNRIQTRQKLRDDAWGPDD 345


>ref|XP_006424090.1| hypothetical protein CICLE_v10028702mg [Citrus clementina]
            gi|557526024|gb|ESR37330.1| hypothetical protein
            CICLE_v10028702mg [Citrus clementina]
          Length = 364

 Score =  328 bits (842), Expect = 2e-87
 Identities = 195/362 (53%), Positives = 237/362 (65%), Gaps = 19/362 (5%)
 Frame = +2

Query: 38   EAEDSWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPS-----DLQLASALTDLANL 202
            + E +WS+AVEDL++ G+ E AISLLES ISKLE +  S       +LQLASALT+LANL
Sbjct: 8    QEETNWSEAVEDLVEAGNTEAAISLLESTISKLEKIQQSQPTKESLNLQLASALTNLANL 67

Query: 203  YSSRGFSLKSDELRTRAFLIKQSSQSNQPIHPPLRDSEIVKKVDSVKNEVSTXXXXXXXX 382
            YSS GFSLKSD L +RAF I+ ++ +        +DS+   K D+ +N  ST        
Sbjct: 68   YSSNGFSLKSDHLLSRAFQIRDAAANIAK-----KDSKDTSK-DTARNSSSTDVKSFGND 121

Query: 383  XXXXXXX----WETIADRPSNELLSPQLDAEISKLSLGDTGF--QTPKRRGRGAFLYMKN 544
                       WE +ADR  +ELLS Q   E+SKLSL DTG   Q PKRRGRG F Y KN
Sbjct: 122  KLPQDGSSDDDWEAMADRDPDELLSSQGLPEVSKLSLEDTGVKVQAPKRRGRGTFSYKKN 181

Query: 545  GLYSDQQPNLTASDNSD--------DEEKTEIGNSRFGASHVLVLADFPPRTTTTELEKL 700
             LYSD Q + +  ++++         E KTE+ +S +G  HVLVLADF P T TT+LEKL
Sbjct: 182  ELYSDWQDDKSIVEDAEVDDDSCLGSESKTELRHSNYGTRHVLVLADFSPSTRTTDLEKL 241

Query: 701  FENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXX 880
            FE+FR+RGV IRW+NDT ALAVFRTP+IALEAR+ I   F VR+L E+            
Sbjct: 242  FEDFRDRGVSIRWINDTTALAVFRTPAIALEARNHIQLPFKVRILDEDDIILASVSPRDL 301

Query: 881  EPPYPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWGA 1060
            EPP  RP+TSARTAQRLIAQ MG KL  TTFGS EL+ QEEARRNRI TRQ LRD+AWG 
Sbjct: 302  EPPRQRPQTSARTAQRLIAQSMGLKL-PTTFGSKELKNQEEARRNRIQTRQKLRDDAWGP 360

Query: 1061 DE 1066
            D+
Sbjct: 361  DD 362


>ref|XP_006369380.1| hypothetical protein POPTR_0001s22420g [Populus trichocarpa]
            gi|550347893|gb|ERP65949.1| hypothetical protein
            POPTR_0001s22420g [Populus trichocarpa]
          Length = 359

 Score =  328 bits (841), Expect = 3e-87
 Identities = 186/351 (52%), Positives = 233/351 (66%), Gaps = 9/351 (2%)
 Frame = +2

Query: 38   EAEDSWSQAVEDLIDGGDVEGAISLLESVISKLETLN-SSPSDLQLASALTDLANLYSSR 214
            ++  +WS+ VEDL+  GD EGAI+LLE+ +S+LETLN S  ++LQL SALT+LA LYSS+
Sbjct: 13   QSNQNWSETVEDLVTAGDTEGAITLLETEVSRLETLNPSEAANLQLVSALTELAKLYSSK 72

Query: 215  GFSLKSDELRTRAFLIKQSSQSNQPIHPPLRDSEIVKKVDSVKNE--VSTXXXXXXXXXX 388
             FSLKSDEL  RA  IKQ S  +  +    ++ EI  K ++V N+  +            
Sbjct: 73   HFSLKSDELLFRASFIKQRSSGD--VESVEKEDEI-SKCNAVSNDGHLEKSSNPRDDVSP 129

Query: 389  XXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQTPKRRGRGAFLYMKNGLYSDQQP 568
                 WE IAD   +ELLSPQ    +S + L D   QT KRRGRG F Y K+ LYSD+Q 
Sbjct: 130  CSDDDWEAIADHAPDELLSPQSLPSVSNICLEDAKVQTSKRRGRGPFTYKKHELYSDRQS 189

Query: 569  NLTASDNSDDEE------KTEIGNSRFGASHVLVLADFPPRTTTTELEKLFENFRERGVV 730
            + T  D+ DDE+       TE+ NS++G  HVLVLADFPP   TT+LEKLFE+F++RG V
Sbjct: 190  DATLVDDVDDEDLGRSTQNTELTNSKYGTHHVLVLADFPPSMRTTDLEKLFEDFKDRGFV 249

Query: 731  IRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXXEPPYPRPKTS 910
            IRW+NDT ALAVF+TPSIALEAR+ I  +FTVR+L  +            EPP  RPKTS
Sbjct: 250  IRWINDTAALAVFQTPSIALEARNHIQCSFTVRILDADDELMGSIPTKDLEPPRQRPKTS 309

Query: 911  ARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWGAD 1063
            ARTAQRLIA GMG KL   TFGS EL+ QEE R+NRIVTRQ ++D+AWG D
Sbjct: 310  ARTAQRLIAHGMGLKL-PMTFGSRELKNQEETRKNRIVTRQKMKDDAWGDD 359


>ref|XP_006494893.1| PREDICTED: uncharacterized protein LOC102607047 isoform X1 [Citrus
            sinensis]
          Length = 364

 Score =  327 bits (839), Expect = 6e-87
 Identities = 194/362 (53%), Positives = 237/362 (65%), Gaps = 19/362 (5%)
 Frame = +2

Query: 38   EAEDSWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPS-----DLQLASALTDLANL 202
            + E +WS+AVEDL++ G+ E AISLLES ISKLE +  S       +LQLASALT+LANL
Sbjct: 8    QEETNWSEAVEDLVEAGNTEAAISLLESTISKLEKIEQSQPTKESLNLQLASALTNLANL 67

Query: 203  YSSRGFSLKSDELRTRAFLIKQSSQSNQPIHPPLRDSEIVKKVDSVKNEVSTXXXXXXXX 382
            YSS GFSLKSD L +RAF I+ ++ +        +DS+   K D+ +N  ST        
Sbjct: 68   YSSNGFSLKSDHLLSRAFQIRDAAANIAK-----KDSKDTSK-DTARNSSSTDVKSFGND 121

Query: 383  XXXXXXX----WETIADRPSNELLSPQLDAEISKLSLGDTGF--QTPKRRGRGAFLYMKN 544
                       WE +ADR  +ELLS Q   E+SKLSL DTG   Q PKRRGRG F Y KN
Sbjct: 122  KLPQDGSSDDDWEAMADRDPDELLSSQGLPEVSKLSLEDTGVKVQAPKRRGRGTFSYKKN 181

Query: 545  GLYSDQQPNLTASDNSD--------DEEKTEIGNSRFGASHVLVLADFPPRTTTTELEKL 700
             LYSD Q + +  ++++         E KTE+ +S +G  HVLVLADF P T TT+LEKL
Sbjct: 182  ELYSDWQDDKSIVEDAEVDDDSSLSSESKTELRHSNYGTRHVLVLADFSPSTRTTDLEKL 241

Query: 701  FENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXX 880
            FE+FR+RGV IRW+NDT ALAVFRTP+IALEAR+ I   F +R+L E+            
Sbjct: 242  FEDFRDRGVSIRWINDTTALAVFRTPAIALEARNHIQLPFKMRILDEDDIILASVSPRDL 301

Query: 881  EPPYPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWGA 1060
            EPP  RP+TSARTAQRLIAQ MG KL  TTFGS EL+ QEEARRNRI TRQ LRD+AWG 
Sbjct: 302  EPPRQRPQTSARTAQRLIAQSMGLKL-PTTFGSKELKNQEEARRNRIQTRQKLRDDAWGP 360

Query: 1061 DE 1066
            D+
Sbjct: 361  DD 362


>ref|XP_002299772.1| hypothetical protein POPTR_0001s22420g [Populus trichocarpa]
            gi|222847030|gb|EEE84577.1| hypothetical protein
            POPTR_0001s22420g [Populus trichocarpa]
          Length = 370

 Score =  325 bits (833), Expect = 3e-86
 Identities = 186/364 (51%), Positives = 230/364 (63%), Gaps = 22/364 (6%)
 Frame = +2

Query: 38   EAEDSWSQAVEDLIDGGDVEGAISLLESVISKLETLN-SSPSDLQLASALTDLANLYSSR 214
            ++  +WS+ VEDL+  GD EGAI+LLE+ +S+LETLN S  ++LQL SALT+LA LYSS+
Sbjct: 13   QSNQNWSETVEDLVTAGDTEGAITLLETEVSRLETLNPSEAANLQLVSALTELAKLYSSK 72

Query: 215  GFSLKSDELRTRAFLIKQSSQSNQ---------------PIHPPLRDSEIVKKVDSVKNE 349
             FSLKSDEL  RA  IKQ S                     +    D +++  V  +KNE
Sbjct: 73   HFSLKSDELLFRASFIKQRSSGYSFFFFSRSVEKEDEISKCNAVSNDGKLISYVSLIKNE 132

Query: 350  VSTXXXXXXXXXXXXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQTPKRRGRGAF 529
                              WE IAD   +ELLSPQ    +S + L D   QT KRRGRG F
Sbjct: 133  -----NFMIEMLLWYATDWEAIADHAPDELLSPQSLPSVSNICLEDAKVQTSKRRGRGPF 187

Query: 530  LYMKNGLYSDQQPNLTASDNSDDEE------KTEIGNSRFGASHVLVLADFPPRTTTTEL 691
             Y K+ LYSD+Q + T  D+ DDE+       TE+ NS++G  HVLVLADFPP   TT+L
Sbjct: 188  TYKKHELYSDRQSDATLVDDVDDEDLGRSTQNTELTNSKYGTHHVLVLADFPPSMRTTDL 247

Query: 692  EKLFENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXX 871
            EKLFE+F++RG VIRW+NDT ALAVF+TPSIALEAR+ I  +FTVR+L  +         
Sbjct: 248  EKLFEDFKDRGFVIRWINDTAALAVFQTPSIALEARNHIQCSFTVRILDADDELMGSIPT 307

Query: 872  XXXEPPYPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEA 1051
               EPP  RPKTSARTAQRLIA GMG KL   TFGS EL+ QEE R+NRIVTRQ ++D+A
Sbjct: 308  KDLEPPRQRPKTSARTAQRLIAHGMGLKL-PMTFGSRELKNQEETRKNRIVTRQKMKDDA 366

Query: 1052 WGAD 1063
            WG D
Sbjct: 367  WGDD 370


>gb|ABK96612.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 359

 Score =  325 bits (832), Expect = 4e-86
 Identities = 185/351 (52%), Positives = 232/351 (66%), Gaps = 9/351 (2%)
 Frame = +2

Query: 38   EAEDSWSQAVEDLIDGGDVEGAISLLESVISKLETLN-SSPSDLQLASALTDLANLYSSR 214
            ++  +WS+ VEDL+   D EGAI+LLE+ +S+LETLN S  ++LQL SALT+LA LYSS+
Sbjct: 13   QSNQNWSETVEDLVTACDTEGAITLLETEVSRLETLNPSEAANLQLVSALTELAKLYSSK 72

Query: 215  GFSLKSDELRTRAFLIKQSSQSNQPIHPPLRDSEIVKKVDSVKNE--VSTXXXXXXXXXX 388
             FSLKSDEL  RA  IKQ S  +  +    ++ EI  K ++V N+  +            
Sbjct: 73   HFSLKSDELLFRASFIKQRSSGD--VESVEKEDEI-SKCNAVSNDGHLEKSSNPRDDVSP 129

Query: 389  XXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQTPKRRGRGAFLYMKNGLYSDQQP 568
                 WE IAD   +ELLSPQ    +S + L D   QT KRRGRG F Y K+ LYSD+Q 
Sbjct: 130  CSDDDWEAIADHAPDELLSPQSLPSVSNICLEDAKVQTSKRRGRGPFTYKKHELYSDRQS 189

Query: 569  NLTASDNSDDEE------KTEIGNSRFGASHVLVLADFPPRTTTTELEKLFENFRERGVV 730
            + T  D+ DDE+       TE+ NS++G  HVLVLADFPP   TT+LEKLFE+F++RG V
Sbjct: 190  DATLVDDVDDEDLGRSTQNTELTNSKYGTHHVLVLADFPPSMRTTDLEKLFEDFKDRGFV 249

Query: 731  IRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXXEPPYPRPKTS 910
            IRW+NDT ALAVF+TPSIALEAR+ I  +FTVR+L  +            EPP  RPKTS
Sbjct: 250  IRWINDTAALAVFQTPSIALEARNHIQCSFTVRILDADDELMGSIPTKDLEPPRQRPKTS 309

Query: 911  ARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWGAD 1063
            ARTAQRLIA GMG KL   TFGS EL+ QEE R+NRIVTRQ ++D+AWG D
Sbjct: 310  ARTAQRLIAHGMGLKL-PMTFGSRELKNQEETRKNRIVTRQKMKDDAWGDD 359


>ref|XP_004291672.1| PREDICTED: uncharacterized protein LOC101308047 [Fragaria vesca
            subsp. vesca]
          Length = 359

 Score =  324 bits (830), Expect = 6e-86
 Identities = 189/353 (53%), Positives = 228/353 (64%), Gaps = 12/353 (3%)
 Frame = +2

Query: 44   EDSWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPSDL-QLASALTDLANLYSSRGF 220
            ED+WS+AVEDL+  GD + AIS+LESVIS LE      S   +LASAL+DLA LYSS+GF
Sbjct: 6    EDNWSEAVEDLVTSGDTDAAISVLESVISNLENKGLPDSGPPELASALSDLAELYSSKGF 65

Query: 221  SLKSDELRTRAFLIK--QSSQSNQPIHPPLRDSEIVKKVDSVKNEVSTXXXXXXXXXXXX 394
            SLK+D+L++RA LIK   SS S   +    + S   K       E ST            
Sbjct: 66   SLKADDLQSRASLIKLRHSSSSTSGVATEKQSSMPGKHSTDGHLEKSTKSQDSSACNGAS 125

Query: 395  XXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQTPKRRGRGAFLYMKNGLYSDQQPNL 574
               WE IADR  +ELLS Q    +SKLSL DT  Q PKRRGRG F Y K+ LYSDQ  + 
Sbjct: 126  DDDWEAIADRTPDELLSSQSLPGVSKLSLEDTKVQAPKRRGRGTFAYKKHELYSDQLSSK 185

Query: 575  TASDNSDDEEKTEIGN---------SRFGASHVLVLADFPPRTTTTELEKLFENFRERGV 727
               DN   EE++E  N         S++G  H+LVLA FPP T T ELE LF++FR+ GV
Sbjct: 186  IVVDNDSLEEESECHNLEGGEETRNSKYGTRHILVLAGFPPSTRTMELENLFKDFRDHGV 245

Query: 728  VIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXXEPPYPRPKT 907
            VIRWVNDTVALAVF+TP+IALEAR+ I  + TVRVL+E+            EPP  RPKT
Sbjct: 246  VIRWVNDTVALAVFQTPAIALEARNHIQCSMTVRVLNEDDTLLSSISPKDLEPPRQRPKT 305

Query: 908  SARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWGADE 1066
            SARTAQRLIA GMG KL +T FGS +L++QE  RR+RIVTRQ L+D+AWG DE
Sbjct: 306  SARTAQRLIAHGMGLKLPSTAFGSRDLKEQENDRRSRIVTRQKLKDDAWGGDE 358


>gb|EXB39659.1| hypothetical protein L484_017132 [Morus notabilis]
          Length = 366

 Score =  319 bits (817), Expect = 2e-84
 Identities = 183/367 (49%), Positives = 232/367 (63%), Gaps = 27/367 (7%)
 Frame = +2

Query: 47   DSWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPSDLQLASALTDLANLYSSRGFSL 226
            ++WS++VEDL   GD + AISLLESVIS L     +PSD QL SALTDLANLYSS+GFSL
Sbjct: 12   NNWSESVEDLAAAGDADAAISLLESVISDL-----NPSDSQLPSALTDLANLYSSKGFSL 66

Query: 227  KSDELRTRAFLIKQSSQSNQPIHPPLRDSEI--------------------VKKVDSVKN 346
            K+D+L +RAFL++Q   S+  +   L++ +                     V+K   ++N
Sbjct: 67   KADQLHSRAFLLQQRRSSSGVLDEDLKEEKKKQGLSPNNSLPCDESSKDGNVEKSTKLQN 126

Query: 347  EVSTXXXXXXXXXXXXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQTPKRRGRGA 526
            + S                WE IADR  +ELLS Q    +S+LSL D+    PK+RGRG 
Sbjct: 127  DASPQNETLDDD-------WEAIADRTPDELLSSQCLPGVSELSLQDSKTNAPKQRGRGT 179

Query: 527  FLYMKNGLYSDQQPNLTASDNSDDEE-------KTEIGNSRFGASHVLVLADFPPRTTTT 685
            F Y K+ LYSD     T SD ++DE+        T++  S +G  HVL+LADFPP T T 
Sbjct: 180  FSYKKHELYSDHLSKKTVSDYTEDEDVGHDLESNTDVRKSIYGTRHVLILADFPPSTRTI 239

Query: 686  ELEKLFENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXX 865
            +LEKLF++FR+RGVVIRW+NDT ALAVFRTP IALEA + +   FTVR+L E        
Sbjct: 240  DLEKLFDDFRDRGVVIRWINDTTALAVFRTPPIALEASNRVSCPFTVRILDEADDLISSI 299

Query: 866  XXXXXEPPYPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRD 1045
                 EPP  RPKTSA TAQRLIAQGMG KL++ +FGS ELRKQE  RRNRIVTRQ L++
Sbjct: 300  QAKDLEPPRQRPKTSATTAQRLIAQGMGLKLTSASFGSRELRKQEGDRRNRIVTRQKLKE 359

Query: 1046 EAWGADE 1066
            +AWG D+
Sbjct: 360  DAWGGDD 366


>ref|XP_004241335.1| PREDICTED: uncharacterized protein LOC101256295 [Solanum
            lycopersicum]
          Length = 346

 Score =  317 bits (812), Expect = 7e-84
 Identities = 181/353 (51%), Positives = 234/353 (66%), Gaps = 9/353 (2%)
 Frame = +2

Query: 35   MEAEDSWSQAVEDLIDGGDVEGAISLLESVISKLE--TLNSSPSDLQLASALTDLANLYS 208
            M+++ +WS+ VEDL+D G+++GAISLLE +++KLE  + NSS S L L++AL +L+ LYS
Sbjct: 1    MDSDTNWSEKVEDLVDAGEIDGAISLLEELVAKLEYESQNSSNSQLPLSTALLELSKLYS 60

Query: 209  SRGFSLKSDELRTRAFLIKQSSQSNQPIHPPLRDSEIVKKVDSVKNEVSTXXXXXXXXXX 388
            ++G SL++D+ R++AFLIKQ  Q N+ ++     +      D+ K+  S           
Sbjct: 61   TQGLSLRADQTRSKAFLIKQQ-QENRDVNATKESTGDGISGDN-KDHASLQIDASQNDED 118

Query: 389  XXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQTPKRRGRGAFLYMKNGLYSDQQP 568
                 WE IADR  +ELLSPQ   E+SK+SL D+  Q PKRRGRG F Y K  LYSDQQ 
Sbjct: 119  DD---WEAIADRAPDELLSPQHLPEVSKISLQDSKVQAPKRRGRGTFSYQKQSLYSDQQS 175

Query: 569  NLTASDNSDDEE-------KTEIGNSRFGASHVLVLADFPPRTTTTELEKLFENFRERGV 727
            +  A D+ +DE         ++  N  +G  HVLVLADFPP T T +LEKL E F++  V
Sbjct: 176  DEPADDDIEDEAVSSTPEGSSDTKNLNYGTRHVLVLADFPPSTKTNDLEKLLEKFKD--V 233

Query: 728  VIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXXEPPYPRPKT 907
             IRWVNDTVALAVFRTP++ALEA +SIH  FTVRVL E             EPP  RP+T
Sbjct: 234  AIRWVNDTVALAVFRTPTLALEASNSIHCPFTVRVLCEEDELLNSIPPRDLEPPRRRPQT 293

Query: 908  SARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWGADE 1066
            SARTAQRLIAQ MG KL +T FGS E R+QEEAR+NRIV+RQ L+ +AWG DE
Sbjct: 294  SARTAQRLIAQSMGIKLPSTDFGSREYRRQEEARKNRIVSRQNLKHDAWGDDE 346


>ref|NP_001242895.1| uncharacterized protein LOC100817151 [Glycine max]
            gi|255642348|gb|ACU21438.1| unknown [Glycine max]
          Length = 366

 Score =  317 bits (811), Expect = 1e-83
 Identities = 189/362 (52%), Positives = 228/362 (62%), Gaps = 23/362 (6%)
 Frame = +2

Query: 47   DSWSQAVEDLIDGGDVEGAISLLESVISKLETLN--SSPSDLQLASALTDLANLYSSRGF 220
            ++WS+AVEDL+D GDVE AISLLESV+   ETLN   S S L LASAL+DLANLYSS+GF
Sbjct: 8    ENWSEAVEDLVDAGDVESAISLLESVV---ETLNPSDSASQLPLASALSDLANLYSSKGF 64

Query: 221  SLKSDELRTRAFLIKQSSQSNQPIHPPLRDSE---IVKKVD---------SVKNEVSTXX 364
            SLK+D L +RA ++KQ   SN P     ++S+    VK            SV+   +   
Sbjct: 65   SLKADHLHSRASVLKQLHHSNSPGEQVPKESKEDGAVKSTSVASRRAAEGSVEKRAAEFP 124

Query: 365  XXXXXXXXXXXXXWETIADRPSNELL---SPQLDAEISKLSLGDTGFQTPKRRGRGAFLY 535
                         WE IAD   +ELL   S    + IS L L +    TPKRRGRG F Y
Sbjct: 125  AQTSAGGGCSDEDWEAIADLEPDELLPTVSSDCSSGISNLKLENAKSGTPKRRGRGTFSY 184

Query: 536  MKNGLYSDQQPNLTASD------NSDDEEKTEIGNSRFGASHVLVLADFPPRTTTTELEK 697
             K  LYSDQ  + +  D      +   E+  ++ NS++G SHVLVLADF P T TTELEK
Sbjct: 185  EKKELYSDQLLDSSVVDVEQEETHRSSEDNKDVQNSKYGTSHVLVLADFSPSTRTTELEK 244

Query: 698  LFENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXX 877
            LFENF++RG+VIRWVNDTVALAVFRTP +ALEA +S+  +FT R+L E+           
Sbjct: 245  LFENFKDRGLVIRWVNDTVALAVFRTPPVALEALNSVRCSFTTRILDEDDTLLSSIKARD 304

Query: 878  XEPPYPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWG 1057
             EPP  RPKTSA+ AQRLIA  MG KLS+T  GS E RKQE+ARR RIVTRQ LRDEAWG
Sbjct: 305  LEPPRQRPKTSAQAAQRLIAHSMGLKLSSTGAGSREYRKQEDARRERIVTRQKLRDEAWG 364

Query: 1058 AD 1063
             D
Sbjct: 365  DD 366


>ref|XP_007015668.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508786031|gb|EOY33287.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 443

 Score =  315 bits (807), Expect = 3e-83
 Identities = 187/373 (50%), Positives = 240/373 (64%), Gaps = 20/373 (5%)
 Frame = +2

Query: 5    NRKDFPQDSIMEAEDSWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPSDLQLASAL 184
            N+K   +  + + + +WS+ VEDL+  GD +GAIS LE+++SKLET  SS  DLQLASAL
Sbjct: 72   NQKQKEKKKMEKGKANWSEEVEDLVTAGDTQGAISFLENLVSKLETTPSS-DDLQLASAL 130

Query: 185  TDLANLYSSRGFSLKSDELRTRAFLIKQSSQSNQPIHPPLRD----SEIVKKVDSVKNEV 352
            +DLA LYSS G+SLKSD+L +RA L+KQ + S+  +    +D    S  +  V    N+ 
Sbjct: 131  SDLAALYSSIGYSLKSDQLFSRASLLKQRAHSSSDVGLAKKDLKEDSLPLPNVSLAGNDK 190

Query: 353  S----------TXXXXXXXXXXXXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQT 502
                                       WE IADR  NELLS +    +S LSL D+  + 
Sbjct: 191  PFTHGNIEKGPMTGDDGEPSKLSSDDDWEAIADREPNELLSSEGLPGVSSLSLKDSKVEA 250

Query: 503  PKRRGRGAFLYMKNGLYSDQ-QPNLTASDNSDDEE-----KTEIGNSRFGASHVLVLADF 664
            PKRRGRG F Y K+ LYSDQ    + A+ ++++E+     + +   +++G  HVLVLADF
Sbjct: 251  PKRRGRGTFSYRKSELYSDQLSDGVFATKDTENEDVCIDSEIKTVETKYGTHHVLVLADF 310

Query: 665  PPRTTTTELEKLFENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSEN 844
             P T TT LEKLFE+FR+RGVVIRWVNDT ALAVF TPSIALEA + ++  FTVR+L E+
Sbjct: 311  SPSTRTTYLEKLFEDFRDRGVVIRWVNDTTALAVFCTPSIALEACNHVNCPFTVRILDED 370

Query: 845  XXXXXXXXXXXXEPPYPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIV 1024
                        EPP  RP+TSARTAQRLIAQGMG KLS++TFGS ELR QEEAR+NRIV
Sbjct: 371  DMLLGSISARDLEPPRQRPQTSARTAQRLIAQGMGLKLSSSTFGSRELRNQEEARKNRIV 430

Query: 1025 TRQILRDEAWGAD 1063
            TRQ L+D+AWG D
Sbjct: 431  TRQKLKDDAWGDD 443


>ref|XP_007132498.1| hypothetical protein PHAVU_011G099200g [Phaseolus vulgaris]
            gi|561005498|gb|ESW04492.1| hypothetical protein
            PHAVU_011G099200g [Phaseolus vulgaris]
          Length = 364

 Score =  313 bits (801), Expect = 1e-82
 Identities = 181/358 (50%), Positives = 231/358 (64%), Gaps = 19/358 (5%)
 Frame = +2

Query: 47   DSWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPSDLQLASALTDLANLYSSRGFSL 226
            ++WS+ VEDL+D GDVE AISLLESV+  L   +S+ S L LASAL+DLA+LYSS+GFSL
Sbjct: 8    ENWSETVEDLVDAGDVESAISLLESVVQTLNPSDSA-SQLPLASALSDLADLYSSKGFSL 66

Query: 227  KSDELRTRAFLIKQSSQSNQPIHPPLRDSE---IVK-------KVDSVKNEVSTXXXXXX 376
            K+D L++R+ ++KQ  +S+ P     ++S    +VK       + D    + +       
Sbjct: 67   KADHLQSRSSILKQLHRSSSPGEQVPKESNEDGVVKPTTFASRRSDGSVEKRAELTAQTS 126

Query: 377  XXXXXXXXXWETIADRPSNELL---SPQLDAEISKLSLGDTGFQTPKRRGRGAFLYMKNG 547
                     WE IADR  +ELL   S    +  S L L +    TPKRRGRG F Y K  
Sbjct: 127  AAGGSSEEDWEAIADREPDELLPTVSSDSTSGKSNLKLENAKSGTPKRRGRGTFSYEKQE 186

Query: 548  LYSDQQPNLTASD------NSDDEEKTEIGNSRFGASHVLVLADFPPRTTTTELEKLFEN 709
            LYSDQ  + + +D       S+ E+  ++ ++++G SHV+VLADF P T TTELEKLFE 
Sbjct: 187  LYSDQLFDSSVADVEQAETRSNSEDNRDVQSTKYGTSHVIVLADFSPSTRTTELEKLFEG 246

Query: 710  FRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXXEPP 889
            F++RG VIRWVNDTVALAVFRTPS+ALEA +S+  +FT R+L E+            EPP
Sbjct: 247  FKDRGFVIRWVNDTVALAVFRTPSVALEALNSVRCSFTTRILDEDDTLLTSIKARDLEPP 306

Query: 890  YPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWGAD 1063
              RPKTSA+ AQRLIA GMG KLS+T+ GSGE RKQE ARR RIVTRQ LRDEAWG D
Sbjct: 307  LQRPKTSAQAAQRLIAHGMGLKLSSTSVGSGEYRKQENARRERIVTRQKLRDEAWGED 364


>ref|XP_006361151.1| PREDICTED: uncharacterized protein LOC102595388 [Solanum tuberosum]
          Length = 354

 Score =  311 bits (798), Expect = 3e-82
 Identities = 179/356 (50%), Positives = 232/356 (65%), Gaps = 12/356 (3%)
 Frame = +2

Query: 35   MEAEDSWSQAVEDLIDGGDVEGAISLLESVISKLE--TLNSSPSDLQLASALTDLANLYS 208
            M+++ +WS+ VEDL+D G++  AISLLE +++KLE  + NSS S L+L++AL +L+ LYS
Sbjct: 1    MDSDTNWSEKVEDLVDAGEINEAISLLEELVAKLEFESQNSSNSQLRLSTALLELSKLYS 60

Query: 209  SRGFSLKSDELRTRAFLIKQSSQSNQPIHPPLR---DSEIVKKVDSVKNEVSTXXXXXXX 379
            ++G SL++D+ R++AFLIKQ  Q N+ ++       D     +V    N+          
Sbjct: 61   TQGLSLRADQTRSKAFLIKQQ-QENRNVNATKESTGDGISGSRVSQSDNK-DHASLQIYT 118

Query: 380  XXXXXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQTPKRRGRGAFLYMKNGLYSD 559
                    WE IADR  +ELLSPQ   E+SK+SL D+  Q PKRRGRG F Y K  LYSD
Sbjct: 119  SQNDEDDDWEAIADRAPDELLSPQHLPEVSKISLQDSKVQAPKRRGRGTFSYQKQSLYSD 178

Query: 560  QQPNLTASDNSDDEE-------KTEIGNSRFGASHVLVLADFPPRTTTTELEKLFENFRE 718
            QQ +  A D+ +DE         ++  N  +G  HVLVLADFPP T T +LEKL E F++
Sbjct: 179  QQSDEPAVDDIEDETVSGTPEGSSDTKNLNYGTRHVLVLADFPPSTKTNDLEKLLEKFKD 238

Query: 719  RGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXXEPPYPR 898
              V IRWVNDTVALAVFRTP++ALEA +SIH  FTVRVL E             EPP  R
Sbjct: 239  -DVAIRWVNDTVALAVFRTPALALEASNSIHCPFTVRVLCEENELLSSIPPRDLEPPRRR 297

Query: 899  PKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWGADE 1066
            P+TSARTAQRLIAQ MG KL  T FGS E R+QEEAR+NRIV+RQ L+++AWG D+
Sbjct: 298  PQTSARTAQRLIAQSMGIKLPCTDFGSREYRRQEEARKNRIVSRQNLKNDAWGDDD 353


>ref|XP_004139076.1| PREDICTED: uncharacterized protein LOC101203386 [Cucumis sativus]
          Length = 377

 Score =  305 bits (782), Expect = 2e-80
 Identities = 179/364 (49%), Positives = 231/364 (63%), Gaps = 25/364 (6%)
 Frame = +2

Query: 50   SWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPSDLQLASALTDLANLYSSRGFSLK 229
            +WS+ VEDL+  GD + AISLL+SV+S L+T  +S  D QLA+ALTDL+ LYSS+G SLK
Sbjct: 13   NWSETVEDLVTAGDTDAAISLLQSVVSDLQTSQNSNPDPQLAAALTDLSALYSSKGLSLK 72

Query: 230  SDELRTRAFLIKQSSQSNQP-----IHPPLRDSEIVKKVDSVKNEVS-----------TX 361
            +D++  +AFL+K  +Q + P     I    R S     + SV +E S           + 
Sbjct: 73   ADDIAAKAFLLKHQAQVSCPTGYGKIMNEDRTSPTTVSLSSV-DEASVGTGNLDRTRDSP 131

Query: 362  XXXXXXXXXXXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQTPKRRGRGAFLYMK 541
                          WE IADRP NELLS + + +  + S+ +   QTP+RRGRG F Y K
Sbjct: 132  DNAVSCSASLDDDDWEAIADRPPNELLSLESEPDKPEQSVKEMKAQTPRRRGRGTFSYNK 191

Query: 542  NGLYSDQQPNLTASDNSDDEEKT-------EIGNSRFGASHVLVLADFPPRTTTTELEKL 700
            + LYSD+  + + +D++++EE +       E+ ++++G  HVLVLADFPP T T +LE+L
Sbjct: 192  HELYSDKLSDSSTTDDTNEEESSHMIEGRRELKSAQYGTQHVLVLADFPPSTKTIDLERL 251

Query: 701  FENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXX 880
              NF   GVVIRWVNDTVALAVF+TPS ALE  + +   FT+R L EN            
Sbjct: 252  LGNFMNSGVVIRWVNDTVALAVFQTPSTALEVLNHVRCPFTLRQLDENDTLLSSIPPRDL 311

Query: 881  EPPYPRPKTSARTAQRLIAQGMGRKL--STTTFGSGELRKQEEARRNRIVTRQILRDEAW 1054
             PP  RPKTSARTAQRLIAQGMG KL  STT+FGS ELRKQEE RRNRIV+RQ LRDEAW
Sbjct: 312  VPPKQRPKTSARTAQRLIAQGMGLKLPNSTTSFGSKELRKQEEDRRNRIVSRQKLRDEAW 371

Query: 1055 GADE 1066
            G D+
Sbjct: 372  GDDD 375


>ref|XP_004154660.1| PREDICTED: uncharacterized protein LOC101228893 [Cucumis sativus]
          Length = 377

 Score =  305 bits (780), Expect = 4e-80
 Identities = 179/364 (49%), Positives = 230/364 (63%), Gaps = 25/364 (6%)
 Frame = +2

Query: 50   SWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPSDLQLASALTDLANLYSSRGFSLK 229
            +WS+ VEDL+  GD + AISLL+SV+S L+T  +S  D QLA+ALTDL+ LYSS+G SLK
Sbjct: 13   NWSETVEDLVTAGDTDAAISLLQSVVSDLQTSQNSNPDPQLAAALTDLSALYSSKGLSLK 72

Query: 230  SDELRTRAFLIKQSSQSNQP-----IHPPLRDSEIVKKVDSVKNEVS-----------TX 361
            +D++  +AFL+K  +Q + P     I    R S     + SV +E S           + 
Sbjct: 73   ADDIAAKAFLLKHQAQVSCPTGYGKIMNEDRTSPTTVSLSSV-DEASVGTGNLDRTRDSP 131

Query: 362  XXXXXXXXXXXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQTPKRRGRGAFLYMK 541
                          WE IADRP NELLS + + +  + S+ +   QTP+RRGRG F Y K
Sbjct: 132  DNAVSCSASLDDDDWEAIADRPPNELLSLESEPDKPEQSVKEMKAQTPRRRGRGTFSYNK 191

Query: 542  NGLYSDQQPNLTASDNSDDEEKT-------EIGNSRFGASHVLVLADFPPRTTTTELEKL 700
            + LYSD+  + +  D++++EE +       E+ ++++G  HVLVLADFPP T T +LE+L
Sbjct: 192  HELYSDKLSDSSTMDDTNEEESSHMIEGRRELKSAQYGTQHVLVLADFPPSTKTIDLERL 251

Query: 701  FENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXX 880
              NF   GVVIRWVNDTVALAVF+TPS ALE  + +   FT+R L EN            
Sbjct: 252  LGNFMNSGVVIRWVNDTVALAVFQTPSTALEVLNHVRCPFTLRQLDENDTLLSSIPPRDL 311

Query: 881  EPPYPRPKTSARTAQRLIAQGMGRKL--STTTFGSGELRKQEEARRNRIVTRQILRDEAW 1054
             PP  RPKTSARTAQRLIAQGMG KL  STT+FGS ELRKQEE RRNRIV+RQ LRDEAW
Sbjct: 312  VPPKQRPKTSARTAQRLIAQGMGLKLPNSTTSFGSKELRKQEEDRRNRIVSRQKLRDEAW 371

Query: 1055 GADE 1066
            G D+
Sbjct: 372  GDDD 375


>ref|XP_003539850.1| PREDICTED: R3H and coiled-coil domain-containing protein 1-like
            [Glycine max]
          Length = 357

 Score =  303 bits (777), Expect = 9e-80
 Identities = 185/356 (51%), Positives = 221/356 (62%), Gaps = 20/356 (5%)
 Frame = +2

Query: 50   SWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPSDLQLASALTDLANLYSSRGFSLK 229
            +WS++VEDL+D GDVE AISLLESV    ETLN S S    ASAL+DLANLYSSRGFSLK
Sbjct: 7    NWSESVEDLVDAGDVESAISLLESVA---ETLNPSDS----ASALSDLANLYSSRGFSLK 59

Query: 230  SDELRTRAFLIKQSSQSNQPIHPPLRDSE---IVKKVDSVKNEVSTXXXXXXXXXXXXXX 400
            +D L +RA L+KQ   SN P     ++S+   +VK         +               
Sbjct: 60   ADHLLSRASLLKQLHHSNTPAERVPKESKEDGVVKSTTVASRRAAEGSVEKRGEFPAQTS 119

Query: 401  X--------WETIADRPSNELL---SPQLDAEISKLSLGDTGFQTPKRRGRGAFLYMKNG 547
                     WE IAD   +ELL   S    + IS L L +    TPKRRGRG F Y K  
Sbjct: 120  AAGGSSDEDWEAIADLEPDELLPTVSWDCSSGISNLKLENAKSGTPKRRGRGTFSYEKKE 179

Query: 548  LYSDQQPNLTASDNSDDE------EKTEIGNSRFGASHVLVLADFPPRTTTTELEKLFEN 709
            LYSDQ  + +  D   +E      + T++  S++G  HVLVLADF P T TTELEKLFEN
Sbjct: 180  LYSDQLLDRSVVDVEREETPRSSEDNTDVQISKYGTGHVLVLADFSPSTRTTELEKLFEN 239

Query: 710  FRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXXXXXXXXXXEPP 889
            F++RG VIRWVNDTVALAVFRTP++ALEA +S+  +FT R+L E+            EPP
Sbjct: 240  FQDRGFVIRWVNDTVALAVFRTPAVALEALNSVRCSFTTRILDEDDTLLSSIKARDLEPP 299

Query: 890  YPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQILRDEAWG 1057
              RPKTSA+ AQRLIA GMG KLS+T  GS E RKQE+ARR RIVTRQ LRDEAWG
Sbjct: 300  RLRPKTSAQAAQRLIAHGMGLKLSSTGVGSREYRKQEDARRERIVTRQKLRDEAWG 355


>ref|XP_002513720.1| conserved hypothetical protein [Ricinus communis]
            gi|223547171|gb|EEF48667.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 383

 Score =  298 bits (762), Expect = 5e-78
 Identities = 175/370 (47%), Positives = 233/370 (62%), Gaps = 31/370 (8%)
 Frame = +2

Query: 50   SWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPS---DLQLASALTDLANLYSSRGF 220
            +WS+AVEDL+  GD+ GAISLLE+V+SKLE + SSPS   DLQLASAL +L+ LYS+  F
Sbjct: 14   NWSEAVEDLVTAGDINGAISLLETVVSKLEGI-SSPSETVDLQLASALDELSKLYSTNHF 72

Query: 221  SLKSDELRTRAFLIKQSSQSNQP------IHPPLRDSEIVKK---------------VDS 337
            SLKSDEL +RA L+K  +  ++P      +   +++  + K                ++ 
Sbjct: 73   SLKSDELLSRASLLKHRALHSRPSVNTDGLEKDVKEENVSKSNQLLCCKDPIADGSSMNG 132

Query: 338  VKNEVSTXXXXXXXXXXXXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQTPKRRG 517
               E  +               WE IADR  +ELLS      +S LSL DT  Q PKRRG
Sbjct: 133  HFEESLSPPDDASSCNGPSDDDWEAIADRAPSELLSSPGLPSVSNLSLEDTKVQGPKRRG 192

Query: 518  RGAFLYMKNGLYSDQQPNLTASDNSDDEEKTEIG-------NSRFGASHVLVLADFPPRT 676
            RG F Y +  LYSD+Q +++ S +++DE  ++         +S++G  HVLVLADFPP T
Sbjct: 193  RGTFSYNQEKLYSDRQSDVSFSGDTEDENLSKSKEQNMKPIHSKYGTRHVLVLADFPPST 252

Query: 677  TTTELEKLFENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSENXXXX 856
             T +LEKLF +F  RGVVIRWVNDT+ALAVF+TP+IALEA++ + F F V +L E+    
Sbjct: 253  RTIDLEKLFRDFTGRGVVIRWVNDTMALAVFQTPAIALEAQNHVQFPFKVHILDEDDIVL 312

Query: 857  XXXXXXXXEPPYPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIVTRQI 1036
                    EPP  RP+TS RTAQRLIAQGMG KL +T+FGS EL+ QEEAR+ RIV+RQ 
Sbjct: 313  SLIPVKDLEPPRRRPQTSTRTAQRLIAQGMGLKLPSTSFGSRELKNQEEARKIRIVSRQK 372

Query: 1037 LRDEAWGADE 1066
            + ++AWG D+
Sbjct: 373  MIEDAWGDDK 382


>ref|XP_007015667.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508786030|gb|EOY33286.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 617

 Score =  295 bits (755), Expect = 3e-77
 Identities = 177/360 (49%), Positives = 229/360 (63%), Gaps = 20/360 (5%)
 Frame = +2

Query: 5    NRKDFPQDSIMEAEDSWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPSDLQLASAL 184
            N+K   +  + + + +WS+ VEDL+  GD +GAIS LE+++SKLET  SS  DLQLASAL
Sbjct: 72   NQKQKEKKKMEKGKANWSEEVEDLVTAGDTQGAISFLENLVSKLETTPSS-DDLQLASAL 130

Query: 185  TDLANLYSSRGFSLKSDELRTRAFLIKQSSQSNQPIHPPLRD----SEIVKKVDSVKNEV 352
            +DLA LYSS G+SLKSD+L +RA L+KQ + S+  +    +D    S  +  V    N+ 
Sbjct: 131  SDLAALYSSIGYSLKSDQLFSRASLLKQRAHSSSDVGLAKKDLKEDSLPLPNVSLAGNDK 190

Query: 353  S----------TXXXXXXXXXXXXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGFQT 502
                                       WE IADR  NELLS +    +S LSL D+  + 
Sbjct: 191  PFTHGNIEKGPMTGDDGEPSKLSSDDDWEAIADREPNELLSSEGLPGVSSLSLKDSKVEA 250

Query: 503  PKRRGRGAFLYMKNGLYSDQ-QPNLTASDNSDDEE-----KTEIGNSRFGASHVLVLADF 664
            PKRRGRG F Y K+ LYSDQ    + A+ ++++E+     + +   +++G  HVLVLADF
Sbjct: 251  PKRRGRGTFSYRKSELYSDQLSDGVFATKDTENEDVCIDSEIKTVETKYGTHHVLVLADF 310

Query: 665  PPRTTTTELEKLFENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVLSEN 844
             P T TT LEKLFE+FR+RGVVIRWVNDT ALAVF TPSIALEA + ++  FTVR+L E+
Sbjct: 311  SPSTRTTYLEKLFEDFRDRGVVIRWVNDTTALAVFCTPSIALEACNHVNCPFTVRILDED 370

Query: 845  XXXXXXXXXXXXEPPYPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRNRIV 1024
                        EPP  RP+TSARTAQRLIAQGMG KLS++TFGS ELR QEEAR+NRI+
Sbjct: 371  DMLLGSISARDLEPPRQRPQTSARTAQRLIAQGMGLKLSSSTFGSRELRNQEEARKNRII 430


>ref|XP_006289362.1| hypothetical protein CARUB_v10002848mg [Capsella rubella]
            gi|482558068|gb|EOA22260.1| hypothetical protein
            CARUB_v10002848mg [Capsella rubella]
          Length = 378

 Score =  281 bits (720), Expect = 3e-73
 Identities = 170/376 (45%), Positives = 227/376 (60%), Gaps = 28/376 (7%)
 Frame = +2

Query: 20   PQDSIMEAEDSWSQAVEDLIDGGDVEGAISLLESVISKLETLNSSPS-------DLQLAS 178
            P +    +E +WS+ VEDL+  GDV  AIS L+S+++ L++   S S        LQLA+
Sbjct: 6    PSEGDKTSEPNWSERVEDLVAAGDVTAAISFLDSLVTNLQSRIGSSSAGERTEFGLQLAA 65

Query: 179  ALTDLANLYSSRGFSLKSDELRTRAFLIKQ--------SSQSNQPIHPP------LRDSE 316
            ALT LA+LYSS+G SLKSDELRTR+ LIKQ        SS+ +  +         L+   
Sbjct: 66   ALTQLADLYSSQGLSLKSDELRTRSSLIKQRALDCDLASSRGSGDVENQIIASNGLKSDS 125

Query: 317  IVKKVDSVKNEVSTXXXXXXXXXXXXXXXWETIADRPSNELLSPQLDAEISKLSLGDTGF 496
             V   D  K + +T               WE +AD   ++LL  +   EISKLS+ +   
Sbjct: 126  NVSPADGWKTKDTTKAVSNNDSSDDD---WEALADLEPSKLLPVEELPEISKLSVEEPKV 182

Query: 497  QTPKRRGRGAFLYMKNGLYSDQQPNLTASDNSDDEEKT-------EIGNSRFGASHVLVL 655
            Q PKRRGRG F Y ++ +YSD+  + +  D+S+D + +       E   S++G  HVLVL
Sbjct: 183  QGPKRRGRGTFTYNRDAMYSDRDFSESRFDDSEDNDTSHDSQKIDEALKSKYGTRHVLVL 242

Query: 656  ADFPPRTTTTELEKLFENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTVRVL 835
            A F P   TT+LEKLF++F++ G++IRWVNDT ALAVF+TPS ALEA + +  +FTVRVL
Sbjct: 243  AGFSPSLRTTDLEKLFKDFKDSGLIIRWVNDTTALAVFKTPSAALEACNHVQCSFTVRVL 302

Query: 836  SENXXXXXXXXXXXXEPPYPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEARRN 1015
             ++            EPP  RPKTSARTAQRLIA  MG KL T+ FGS ELR QE AR+N
Sbjct: 303  GDHDSLLGSISGKDLEPPSQRPKTSARTAQRLIAHSMGLKLPTSGFGSKELRDQEAARKN 362

Query: 1016 RIVTRQILRDEAWGAD 1063
            RIV+RQ  R++AWG D
Sbjct: 363  RIVSRQKQREDAWGDD 378


>ref|XP_002874049.1| predicted protein [Arabidopsis lyrata subsp. lyrata]
            gi|297319886|gb|EFH50308.1| predicted protein
            [Arabidopsis lyrata subsp. lyrata]
          Length = 384

 Score =  273 bits (697), Expect = 2e-70
 Identities = 170/379 (44%), Positives = 224/379 (59%), Gaps = 31/379 (8%)
 Frame = +2

Query: 20   PQDSIMEAEDSWSQAVEDLIDGGDVEGAISLLESVISKLET-LNSSPSD------LQLAS 178
            P +    +E +WS+ VEDL+  GDV  AIS LES+ + L++ L SS S       LQLA+
Sbjct: 6    PNEEGRISEPNWSERVEDLVAAGDVTAAISFLESLETNLQSRLGSSSSSERTEFVLQLAA 65

Query: 179  ALTDLANLYSSRGFSLKSDELRTRAFLIKQSSQS-------------NQPIHPP-LRDSE 316
            ALT LA+LYSS+G SLKSDELR R+ LIKQ +               NQ I    L+   
Sbjct: 66   ALTQLADLYSSQGLSLKSDELRIRSSLIKQRALDCDRASSRDSGDVENQSIASNGLKSDA 125

Query: 317  IVKKVDSVKNEV---STXXXXXXXXXXXXXXXWETIADRPSNELLSPQLDAEISKLSLGD 487
             V   D  K +    +                WE +AD   ++LL  +   EISKLS+ +
Sbjct: 126  NVSPADGYKGKTKDSTNVPSNNSAAHDSSDDDWEALADLEPSKLLPVEELPEISKLSVEE 185

Query: 488  TGFQTPKRRGRGAFLYMKNGLYSDQQPNLTASDNSDDE------EKTEIG-NSRFGASHV 646
               + PKRRGRG F Y ++ +YSD+  + +  D+S+D       EKT+    S++G  HV
Sbjct: 186  PKVEGPKRRGRGTFTYKRDAMYSDRDFSESRFDDSEDNDLSRDSEKTDESLKSKYGTRHV 245

Query: 647  LVLADFPPRTTTTELEKLFENFRERGVVIRWVNDTVALAVFRTPSIALEARDSIHFTFTV 826
            LVLADF P   T +LEKLF++F++ G +IRWVNDT ALAVF+TP+ ALEA + +  +FT+
Sbjct: 246  LVLADFSPSLRTADLEKLFKDFKDSGFIIRWVNDTTALAVFKTPAAALEACNHVQCSFTI 305

Query: 827  RVLSENXXXXXXXXXXXXEPPYPRPKTSARTAQRLIAQGMGRKLSTTTFGSGELRKQEEA 1006
            RVL ++            EPP  RPKTSARTAQRLIA  MG KL  + FGS ELR QE A
Sbjct: 306  RVLDDHDSLLGSISGKDLEPPSQRPKTSARTAQRLIAHSMGLKLPASGFGSKELRDQEAA 365

Query: 1007 RRNRIVTRQILRDEAWGAD 1063
            R+NRIV+RQ  R++AWG D
Sbjct: 366  RKNRIVSRQKQREDAWGDD 384

