BLASTX nr result

ID: Akebia25_contig00021889 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia25_contig00021889
         (1745 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247...   395   e-107
ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Popu...   384   e-104
ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593...   381   e-103
gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]     374   e-101
ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629...   365   5e-98
ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citr...   361   7e-97
ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [A...   360   1e-96
ref|XP_007023216.1| Uncharacterized protein isoform 1 [Theobroma...   354   8e-95
gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indi...   351   7e-94
ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781...   347   1e-92
ref|XP_007147543.1| hypothetical protein PHAVU_006G133500g [Phas...   346   2e-92
ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766...   343   1e-91
ref|XP_002519384.1| conserved hypothetical protein [Ricinus comm...   338   3e-90
gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japo...   330   1e-87
dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Grou...   326   2e-86
ref|XP_007023217.1| Uncharacterized protein isoform 2 [Theobroma...   322   3e-85
ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629...   315   5e-83
ref|XP_007023218.1| Uncharacterized protein isoform 3 [Theobroma...   308   4e-81
ref|XP_007023219.1| Uncharacterized protein isoform 4 [Theobroma...   289   3e-75
gb|EMT03969.1| hypothetical protein F775_22747 [Aegilops tauschii]    287   9e-75

>ref|XP_004230386.1| PREDICTED: uncharacterized protein LOC101247758 [Solanum
            lycopersicum]
          Length = 483

 Score =  395 bits (1014), Expect = e-107
 Identities = 234/467 (50%), Positives = 286/467 (61%), Gaps = 16/467 (3%)
 Frame = -1

Query: 1679 LKLELGDSY-SSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 1503
            L LE G+ Y +SFDLEKAVCSHGLFMMAPN WD  +KTL+RP                  
Sbjct: 17   LPLEDGNGYCASFDLEKAVCSHGLFMMAPNRWDTLSKTLERPLRLSENINDDDHEQSVLV 76

Query: 1502 XXXXXXXXXXXXXXSPLD--------QQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKN 1347
                            LD        Q+ LLGQV RM+RLS  +   +K F +I  EAK 
Sbjct: 77   QITQPSDYPHSLLLRVLDTDSLSTIHQRSLLGQVRRMVRLSVEENKRVKLFQEICGEAKE 136

Query: 1346 RGFGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQD 1167
            RGFGRVFRSPTLFEDMVKCMLLCNCQW RTL+MA ALCELQL L   S     +  +  D
Sbjct: 137  RGFGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPS-----SAASFPD 191

Query: 1166 P---NCLKPNT---EGFLPITPIGRELKRKRSMKKIPANLDCKFSENETKLEAETTNCHQ 1005
            P   N LK  T   E F P TP G+EL+++        NL  + +E E  ++ +      
Sbjct: 192  PDNQNQLKGVTSKSEHFTPRTPAGKELRKRAGAYGCSRNLLERLNEVEEIVDIDKPGV-- 249

Query: 1004 QTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSC- 828
                       +P+F +  E      K N CQ   +  +V   +  +   SE R  SS  
Sbjct: 250  ---------TVTPAFSVGEEVLQ---KSNLCQDTTEVWEVSVSAPLNPDPSEDRKLSSFN 297

Query: 827  RIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPS 648
            ++G+FPSPK+LASLD  FLAKRC LGYRA RII+LA+ I EG  Q+ +LEE  C+    S
Sbjct: 298  QLGNFPSPKQLASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLNELEEA-CSNPSLS 356

Query: 647  LYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDV 468
             YDK+A+QL EIDGFGPFTCANVLMC+G+Y VIP DSET+RHLK++H  +ST + VQRDV
Sbjct: 357  NYDKMAEQLREIDGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTSTIQNVQRDV 416

Query: 467  EKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327
            E +YGKYA F+FLAYWSE+W+FYE+ FGK SEMP   Y LITA+NMR
Sbjct: 417  ENIYGKYAPFQFLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMR 463


>ref|XP_002304112.2| hypothetical protein POPTR_0003s03710g [Populus trichocarpa]
            gi|550342350|gb|EEE79091.2| hypothetical protein
            POPTR_0003s03710g [Populus trichocarpa]
          Length = 489

 Score =  384 bits (985), Expect = e-104
 Identities = 228/492 (46%), Positives = 284/492 (57%), Gaps = 24/492 (4%)
 Frame = -1

Query: 1730 GRMDEEHHNPXXXXSCLLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXX 1551
            G+ +EE  +       + ++ LGD+  +F+LEKAVCSHGLFMM+PN WDP + T  RP  
Sbjct: 8    GKEEEEEES------VVFEIPLGDAAETFNLEKAVCSHGLFMMSPNHWDPLSLTFSRPLR 61

Query: 1550 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXS-----------PLDQQFLLGQVARMLRLS 1404
                                                      P  Q+ L+ QV RMLRLS
Sbjct: 62   LSLSDSDPQVSTPTTSLFVSISHPPHLPRSLSVRVYGTRCLSPKHQESLVAQVVRMLRLS 121

Query: 1403 ESDEMCIKEFHKIHPEAKNR-------GFG-RVFRSPTLFEDMVKCMLLCNCQWPRTLTM 1248
            E+DE   +EF KI   A          GFG RVFRSPTLFEDMVKC+LLCNCQWPRTL+M
Sbjct: 122  ETDERNAREFRKIAEAAAAEENNSWLTGFGGRVFRSPTLFEDMVKCILLCNCQWPRTLSM 181

Query: 1247 ARALCELQLNLK-SDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPA 1071
            ARALCELQ  L+   S  ++   V +   N        F+P T  G+E KR     K+  
Sbjct: 182  ARALCELQCELQCKSSGVFVAQAVNATVKNKCNDTAHNFIPNTSAGKESKRNIRASKVTK 241

Query: 1070 NLDCKFSENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNN 891
            NL  K  E ET LEA+     +  +  + +E      L S E D       SC   +  +
Sbjct: 242  NLASKIVETETLLEADANL--KTDSAHIGRET-----LESVEND-------SCARCSSRH 287

Query: 890  KVDACS----MSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIEL 723
              D+ +     S   +  G     C   +FPSP+ELA+LD  FLAKRC LGYRA RII+L
Sbjct: 288  GSDSWAPDSLQSQHGIQPGVNKMIC---NFPSPRELANLDESFLAKRCNLGYRAIRIIKL 344

Query: 722  ARSITEGRFQIEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPI 543
            A+SI EGR  + ++EE   N    S Y+KLA Q  +IDGFGPFTCANVLMCMGFY +IP 
Sbjct: 345  AQSIVEGRIPLREVEEDCANGASSSCYNKLADQFRQIDGFGPFTCANVLMCMGFYHIIPT 404

Query: 542  DSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPP 363
            DSET+RHLK++H   ST +TVQRDVE++YGKYA F+FLAYW+ELW+FYEK FGK SE+P 
Sbjct: 405  DSETVRHLKQVHAKKSTIQTVQRDVEEIYGKYAPFQFLAYWAELWHFYEKRFGKLSEIPT 464

Query: 362  PNYHLITASNMR 327
             +Y LITASNMR
Sbjct: 465  SDYKLITASNMR 476


>ref|XP_006358484.1| PREDICTED: uncharacterized protein LOC102593287 isoform X1 [Solanum
            tuberosum] gi|565385158|ref|XP_006358485.1| PREDICTED:
            uncharacterized protein LOC102593287 isoform X2 [Solanum
            tuberosum]
          Length = 485

 Score =  381 bits (978), Expect = e-103
 Identities = 228/472 (48%), Positives = 286/472 (60%), Gaps = 20/472 (4%)
 Frame = -1

Query: 1682 LLKLELGDS-----YSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXX 1518
            +++L LGD       ++FDLEKAVCSHGLFMMAPN WD  +KTL+RP             
Sbjct: 14   VVELPLGDGDGDGGCATFDLEKAVCSHGLFMMAPNRWDSLSKTLERPLHLSENINDDDHE 73

Query: 1517 XXXXXXXXXXXXXXXXXXXS--------PLDQQFLLGQVARMLRLSESDEMCIKEFHKIH 1362
                                         + Q+ LLGQV RM+RLS  +   +K+F +I 
Sbjct: 74   QSVLVQINQPSDSPHSLLLRVFGTASLSTIHQRSLLGQVRRMVRLSVEENKRVKQFQEIC 133

Query: 1361 PEAKNRGFGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTE 1182
             EAK+RG GRVFRSPTLFEDMVKCMLLCNCQW RTL+MA ALCELQL L   S     + 
Sbjct: 134  GEAKDRGLGRVFRSPTLFEDMVKCMLLCNCQWSRTLSMAEALCELQLELNCPS-----SA 188

Query: 1181 VASQDP---NCLKPNT---EGFLPITPIGRELKRKRSMKKIPANLDCKFSENETKLEAET 1020
             +  DP   N LK  T   E F P TP G+E +++         L  + +E E  ++   
Sbjct: 189  ASFPDPDNQNQLKGVTFKSEHFTPRTPAGKESRKRAGAYGCSRKLLERLTEVEEIID--- 245

Query: 1019 TNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRT 840
                      + K   + +   S  E+  K K N C+   +   V   +  +   SE R 
Sbjct: 246  ----------IGKPGVTVTPAFSVGEEVLK-KSNLCRDTTEVCDVGTSAPFNLDPSEDRK 294

Query: 839  DSSC-RIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCN 663
             SS  ++G+FPSPKELASLD  FLAKRC LGYRA RII+LA+ I EG  Q+++LEE  C+
Sbjct: 295  LSSFNQLGNFPSPKELASLDESFLAKRCGLGYRAGRIIKLAKGIVEGSIQLKELEEA-CS 353

Query: 662  REIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRT 483
                S YDK+A+QL EIDGFGPFTCANVLMC+G+Y VIP DSET+RHLK++H  +ST + 
Sbjct: 354  NPSLSDYDKMAEQLREIDGFGPFTCANVLMCLGYYHVIPTDSETIRHLKQVHARTSTIQN 413

Query: 482  VQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327
            VQRDVE +YGKYA F+FLAYWSE+W+FYE+ FGK SEMP   Y LITA+NMR
Sbjct: 414  VQRDVENIYGKYAPFQFLAYWSEVWHFYEERFGKLSEMPHSEYKLITAANMR 465


>gb|EXC47697.1| hypothetical protein L484_002408 [Morus notabilis]
          Length = 472

 Score =  374 bits (960), Expect = e-101
 Identities = 220/469 (46%), Positives = 282/469 (60%), Gaps = 18/469 (3%)
 Frame = -1

Query: 1679 LKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXX 1500
            L+L LGD+ ++F LE AVCSHGLFMMAPN WDP +KTL RP                   
Sbjct: 5    LELPLGDAAATFRLETAVCSHGLFMMAPNQWDPLSKTLLRPLRLTLHHHHWNPQQQQDDS 64

Query: 1499 XXXXXXXXXXXXXSPL-------------DQQFLLGQVARMLRLSESDEMCIKEFHKIHP 1359
                                         ++Q LL QV+RMLRLS+++E   +EF +++ 
Sbjct: 65   VMARISQPHDRLHCLRVLVHAGTRSLTSDNKQALLAQVSRMLRLSQTEERICREFSEVY- 123

Query: 1358 EAKNRGFGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEV 1179
                 G GRVFRSPTLFEDMVKC+LLCNCQWPRTL+MA+ALC+LQ  L+  S        
Sbjct: 124  -GCGSGLGRVFRSPTLFEDMVKCILLCNCQWPRTLSMAQALCDLQRELQLQS-------- 174

Query: 1178 ASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF-SENETKLEAETTNCH-- 1008
                   +   T  F+P TP G+E KRK    K    L  +F +++   LE+ + +    
Sbjct: 175  -------VPSKTVDFVPKTPAGKEPKRKVEKLKASTCLTSQFDAQSNEGLESHSNDLSID 227

Query: 1007 --QQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDS 834
              Q T    S +  SPS L+S   ++      +C+   ++  VD+ S+ +  +   R   
Sbjct: 228  ISQPTP---SAQNLSPSSLLSVPMENV-----TCE---ESYGVDSASLCNPQILRDREFE 276

Query: 833  SCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREI 654
                GDFP+P ELA LD  FLAKRCKLGYRA RI++LAR I EGR Q+ +LEE    R +
Sbjct: 277  GT--GDFPTPTELAKLDEKFLAKRCKLGYRAGRILKLARGIVEGRIQLRELEETCMERSL 334

Query: 653  PSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQR 474
             S Y KLA QL +IDGFGPFTCANVLMCMGFY VIP DSET+RHL+++HG +ST RT++R
Sbjct: 335  CS-YSKLAVQLRQIDGFGPFTCANVLMCMGFYHVIPSDSETIRHLQQVHGRNSTVRTIER 393

Query: 473  DVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327
            DV+++Y KY  F+FLAYWSELW+FYEK FGK SEMP   Y L TASNM+
Sbjct: 394  DVQQIYAKYEPFQFLAYWSELWHFYEKKFGKISEMPCSAYKLFTASNMK 442


>ref|XP_006470786.1| PREDICTED: uncharacterized protein LOC102629917 isoform X1 [Citrus
            sinensis]
          Length = 454

 Score =  365 bits (936), Expect = 5e-98
 Identities = 217/475 (45%), Positives = 278/475 (58%), Gaps = 24/475 (5%)
 Frame = -1

Query: 1682 LLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 1503
            LLKL L ++   F+LE AVCSHGLFMM+PN WDP +++L RP                  
Sbjct: 7    LLKLPLAET---FNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVD 63

Query: 1502 XXXXXXXXXXXXXXSPL--------------DQQFLLGQVARMLRLSESDEMCIKEFHKI 1365
                            +               Q  LL QV RMLRLSE+DE  ++EF +I
Sbjct: 64   VTICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRI 123

Query: 1364 HPE-AKNRG---------FGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNL 1215
              + A+  G          GRVFRSPTLFEDMVKCMLLCNCQWPRTL+MARALCELQ  L
Sbjct: 124  VRQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQWEL 183

Query: 1214 KSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETK 1035
            +                +C    +E F+P TP G+E KR++ + K+ + L  + +E++  
Sbjct: 184  Q----------------HCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKAS 227

Query: 1034 LEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETL 855
             E +  N        L +E   PSF  +  E D  G       LN+ +  D  S  D   
Sbjct: 228  SE-DYMNLKLDCAGVL-EENVQPSFPQNDIESDLHG-------LNELSTTDPPSARD--- 275

Query: 854  SEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEE 675
                     RIG+FPSP+ELA+LD  FLAKRC LGYRA RI++LAR I +G+ Q+ +LE+
Sbjct: 276  ---------RIGNFPSPRELANLDESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELED 326

Query: 674  LDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISS 495
            + CN    + Y KLA+QL +I+GFGPFT  NVL+C+GFY VIP DSET+RHLK++H  + 
Sbjct: 327  M-CNEASLTAYVKLAEQLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARNC 385

Query: 494  TNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNM 330
            T++TVQ   E +YGKYA F+FLAYWSELW+FYEK FGK SEMP  +Y LITASNM
Sbjct: 386  TSKTVQMIAESIYGKYAPFQFLAYWSELWHFYEKRFGKLSEMPYSDYKLITASNM 440


>ref|XP_006431360.1| hypothetical protein CICLE_v10001110mg [Citrus clementina]
            gi|557533482|gb|ESR44600.1| hypothetical protein
            CICLE_v10001110mg [Citrus clementina]
          Length = 454

 Score =  361 bits (926), Expect = 7e-97
 Identities = 214/475 (45%), Positives = 278/475 (58%), Gaps = 24/475 (5%)
 Frame = -1

Query: 1682 LLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 1503
            +LKL L ++   F+LE AVCSHGLFMM+PN WDP +++L RP                  
Sbjct: 7    VLKLPLAET---FNLEAAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVD 63

Query: 1502 XXXXXXXXXXXXXXSPL--------------DQQFLLGQVARMLRLSESDEMCIKEFHKI 1365
                            +               Q  LL QV RMLRLSE+DE  +++F +I
Sbjct: 64   VTICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVRDFKRI 123

Query: 1364 HPE-AKNRG---------FGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNL 1215
              + A+  G          GRVFRSPTLFEDMVKCMLLCNCQWPRTL MARALCELQ  L
Sbjct: 124  VRQVAQEEGEESQYMTDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLNMARALCELQWEL 183

Query: 1214 KSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETK 1035
            +                +C    +E F+P TP G+E KR++ + K+ + L  + +E++  
Sbjct: 184  Q----------------HCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKAS 227

Query: 1034 LEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETL 855
             E +  N     T  L +E   PSF  +  E D  G       LN+ +  D  S  D   
Sbjct: 228  SE-DDMNLKLDCTGAL-EENVQPSFPRNDIESDLHG-------LNELSTTDPPSACD--- 275

Query: 854  SEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEE 675
                     RIG+FPSP+ELA+LD  FLAKRC LGYRA RI++LA+ I +G+ Q+ +LE+
Sbjct: 276  ---------RIGNFPSPRELANLDESFLAKRCNLGYRAGRILKLAQGIVDGQIQLRELED 326

Query: 674  LDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISS 495
              CN    + Y+KLA+QL +I+GFGPFT  NVL+C+GFY VIP DSET+RHLK++H  + 
Sbjct: 327  T-CNEASLTTYNKLAEQLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARNC 385

Query: 494  TNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNM 330
            T++TVQ   E +YGKY+ F+FLAYWSELW+FYEK FGK SEMP  +Y LITASNM
Sbjct: 386  TSKTVQIIAESIYGKYSPFQFLAYWSELWHFYEKRFGKLSEMPYSDYKLITASNM 440


>ref|XP_006853038.1| hypothetical protein AMTR_s00038p00020700 [Amborella trichopoda]
            gi|548856677|gb|ERN14505.1| hypothetical protein
            AMTR_s00038p00020700 [Amborella trichopoda]
          Length = 458

 Score =  360 bits (924), Expect = 1e-96
 Identities = 208/447 (46%), Positives = 274/447 (61%), Gaps = 6/447 (1%)
 Frame = -1

Query: 1649 SFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1470
            SF+LEKAVCSHG FMMAPNLW  S++TLQRP                             
Sbjct: 17   SFELEKAVCSHGFFMMAPNLWFSSSQTLQRPLRLTDRSSVPVRITQLSLSSQKSLQILVL 76

Query: 1469 XXXSPL--DQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLFEDMV 1296
                    DQQ+LL QVARMLR+SE D++ + +FH+++P AK  GFGRVFRSPTLFEDMV
Sbjct: 77   GASKLYQHDQQYLLAQVARMLRISEEDDLKVNKFHEMYPVAKETGFGRVFRSPTLFEDMV 136

Query: 1295 KCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPI 1116
            K +LLCNCQW RTL+MARALCELQL L  +S +      +++D +  K  +    P+TP+
Sbjct: 137  KSILLCNCQWTRTLSMARALCELQLELNGNSLRQ-----SNKDTDFSK--SVNLSPVTPM 189

Query: 1115 GRELK--RKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEE 942
              E K  RK   + I  NL  KFSENET L A+ +          SK  P+    + + E
Sbjct: 190  QLEHKKRRKNPNQNIIMNLMTKFSENETHLAADESLRPIDLAKDFSKNSPT----MFSSE 245

Query: 941  DDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDS-SCRIGDFPSPKELASLDVDFLAK 765
            +   GK N  Q+     K+   ++ D  L E +T S     G+FP P+ELA+LD   L K
Sbjct: 246  EGRNGKLNYDQV--SEEKLGDGAILDNQLLENKTLSFFLEAGNFPCPEELANLDEKILEK 303

Query: 764  RCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCA 585
            RCK+G+R+ RI++LA+SI EG   + ++E L  +++ P   D L +QL+ I G GP+ C 
Sbjct: 304  RCKVGFRSKRIVKLAQSIVEGALDLGKIEVL--SQQDPIHLDGLMRQLLSIYGVGPYVCN 361

Query: 584  NVLMCMGFYQVIPIDSETLRHLKKIHGISS-TNRTVQRDVEKVYGKYAQFKFLAYWSELW 408
            NVLM MG YQ IP D+ETLRHLK+ H     T  T+Q+D+E++YGK+  F+FL YWSE+W
Sbjct: 362  NVLMSMGIYQRIPADTETLRHLKQFHARKQCTIGTIQKDIEEIYGKHEPFQFLVYWSEMW 421

Query: 407  NFYEKSFGKASEMPPPNYHLITASNMR 327
             FYEK FGK S+MPP +Y LITA NM+
Sbjct: 422  EFYEKRFGKLSQMPPSDYELITAHNMK 448


>ref|XP_007023216.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508778582|gb|EOY25838.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 467

 Score =  354 bits (908), Expect = 8e-95
 Identities = 220/482 (45%), Positives = 273/482 (56%), Gaps = 18/482 (3%)
 Frame = -1

Query: 1718 EEHHNPXXXXSCLLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPX 1554
            EE+ N     S L++L +G++ ++     F+LEKAVCSHGLFMMAPN WDP +++L RP 
Sbjct: 36   EENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPL 95

Query: 1553 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPLDQQF---LLGQVARMLRLSESDEMCI 1383
                                             L  Q    LL QV+RMLRLSE +E  +
Sbjct: 96   RLLDHHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKV 155

Query: 1382 KEFHKI----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALC 1233
            +EF KI    H E +      R F GRVFRSPTLFEDMVKC+LLCNCQ+ RTL+MA+ALC
Sbjct: 156  REFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALC 215

Query: 1232 ELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF 1053
            ELQ   +     + G   A  D          F+P TP G ELKRK  + K+   L+ KF
Sbjct: 216  ELQFETQRP---FSGVRAAEDD----------FIPKTPAGNELKRKLRVSKVSMRLEGKF 262

Query: 1052 SENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACS 873
                    AE    H ++    S+E   P           KG                  
Sbjct: 263  --------AEPRADHSKSDLQPSQELDEPHAY--------KG------------------ 288

Query: 872  MSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQ 693
                            +G FPSP+ELA+LD  FLAKRC LGYRA+RI++LA+ I +G  Q
Sbjct: 289  ----------------MGSFPSPEELANLDESFLAKRCNLGYRASRILKLAKGIVQGIIQ 332

Query: 692  IEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKK 513
            + QLEE  C     S Y+KLA+QL +IDGFGPFTCANVLMCMGFY VIP DSET+RHLK+
Sbjct: 333  LMQLEE-GCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQ 391

Query: 512  IHGISSTNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASN 333
            +H  SST +TV RDVE +Y KYA F+FLAYW+ELW++YE+ FGK SEMP   Y LITASN
Sbjct: 392  VHSKSSTMQTVGRDVEGIYAKYAPFQFLAYWAELWHYYEQRFGKLSEMPFCGYKLITASN 451

Query: 332  MR 327
            M+
Sbjct: 452  MK 453


>gb|EEC82605.1| hypothetical protein OsI_27177 [Oryza sativa Indica Group]
          Length = 463

 Score =  351 bits (900), Expect = 7e-94
 Identities = 206/461 (44%), Positives = 265/461 (57%), Gaps = 21/461 (4%)
 Frame = -1

Query: 1646 FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1467
            FDLE AVCSHGLFMMAPN WDP+++ L RP                              
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 1466 XXSP------LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLFE 1305
              +P       DQ  +L QV RMLRL E D     EF  +H  A+  GFGR+FRSPTLFE
Sbjct: 97   LGAPGDALSPPDQTSILEQVRRMLRLDEEDGRAAAEFQAMHAVAREAGFGRIFRSPTLFE 156

Query: 1304 DMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPI 1125
            DMVKC+LLCNCQW RTL+M+ ALCELQL L+S S                  +TE F   
Sbjct: 157  DMVKCILLCNCQWTRTLSMSTALCELQLELRSSS------------------STENFQSR 198

Query: 1124 TPIGRELKRKRSMKK-IPANLDCKFSEN------ETKLEAETTNCHQQTTCFLSKEKPSP 966
            TP  RE KRKRS K+ +   L+ KF+E+      +  L  +T N       F     PS 
Sbjct: 199  TPPIRECKRKRSNKRNVRVKLETKFNEDKLVCLEDPNLATDTANLQTYENSF---NLPSA 255

Query: 965  SFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELASL 786
            +           G  N+ ++  D++++    + +E   E      C  GDFP+P+ELA+L
Sbjct: 256  A----------SGTGNTSEVSLDHSEL---KLRNEPCLE-----DCG-GDFPTPEELANL 296

Query: 785  DVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEEL--------DCNREIPSLYDKLA 630
            D DFLAKRC LGYRA RI+ LARSI EG+  +++LEE+        +     PS YD+L 
Sbjct: 297  DEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLN 356

Query: 629  KQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGK 450
            ++L  I GFGPFT ANVLMCMGF+ +IP D+ET+RHLK+ H  +ST  +VQ++++ +YGK
Sbjct: 357  EELSTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQFHKRASTISSVQKELDNIYGK 416

Query: 449  YAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327
            YA F+FLAYW ELW FY K FGK S+M P NY L TAS ++
Sbjct: 417  YAPFQFLAYWCELWGFYNKQFGKISDMEPINYRLFTASKLK 457


>ref|XP_003534756.2| PREDICTED: uncharacterized protein LOC100781827 [Glycine max]
          Length = 443

 Score =  347 bits (890), Expect = 1e-92
 Identities = 199/446 (44%), Positives = 267/446 (59%), Gaps = 4/446 (0%)
 Frame = -1

Query: 1652 SSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1473
            S F LE+AVCSHGLFMM PN WDP +KTL RP                            
Sbjct: 22   SPFQLEQAVCSHGLFMMPPNHWDPLSKTLIRPLRSSPSSFLVSLSQHSQSLAVRVHATHA 81

Query: 1472 XXXXSPLDQQFLLGQVARMLRLSESDEMCIKEFHKIHP-EAKNRGF-GRVFRSPTLFEDM 1299
                 P  Q  +  QV+RMLR SE++E  ++EF  +H  +  NR F GRVFRSPTLFEDM
Sbjct: 82   LS---PQQQNHITAQVSRMLRFSEAEEKAVREFRSLHVVDHPNRSFSGRVFRSPTLFEDM 138

Query: 1298 VKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITP 1119
            VKC+LLCNCQWPRTL+MA+ALCELQL L++ S   +     S      K  +EGF+P TP
Sbjct: 139  VKCILLCNCQWPRTLSMAQALCELQLELQNGSPCTIAVSGNS------KGESEGFIPKTP 192

Query: 1118 IGRELKRKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEED 939
              +E +R +   K        F + + +L+      H      +     + + L++ +  
Sbjct: 193  ASKETRRNKVSTK------GMFCKKKLELDGNLQIDH------VVASSSTATTLLTTDNG 240

Query: 938  DSKGKRN--SCQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAK 765
            DS+  R+  SC   ++ N+  +                 R G+FPSP ELA+LD  FLAK
Sbjct: 241  DSEELRSHDSCHEFSNGNEYFS-----------------RTGNFPSPSELANLDESFLAK 283

Query: 764  RCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCA 585
            RC LGYRA  IIELAR+I EG+ Q+ QLEEL  +  + S Y +L  QL +I G+GPFT A
Sbjct: 284  RCGLGYRAGYIIELARAIVEGKIQLGQLEELSKDASL-SNYKQLDDQLKQIRGYGPFTRA 342

Query: 584  NVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQFKFLAYWSELWN 405
            NVLMC+G+Y VIP DSET+RHLK++H   +T++T++R++E++YGKY  ++FLA+WSE+W+
Sbjct: 343  NVLMCLGYYHVIPTDSETVRHLKQVHSRYTTSKTIERELEEIYGKYEPYQFLAFWSEVWD 402

Query: 404  FYEKSFGKASEMPPPNYHLITASNMR 327
            FYE  FGK +EM   +Y LITA NMR
Sbjct: 403  FYETRFGKLNEMHSSDYKLITACNMR 428


>ref|XP_007147543.1| hypothetical protein PHAVU_006G133500g [Phaseolus vulgaris]
            gi|561020766|gb|ESW19537.1| hypothetical protein
            PHAVU_006G133500g [Phaseolus vulgaris]
          Length = 474

 Score =  346 bits (888), Expect = 2e-92
 Identities = 200/454 (44%), Positives = 260/454 (57%), Gaps = 5/454 (1%)
 Frame = -1

Query: 1673 LELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXX 1494
            +EL      F L++AVCSHG FMMAPN WDP +KTL RP                     
Sbjct: 37   MELPSETEPFQLDQAVCSHGFFMMAPNHWDPLSKTLTRPLLLHNPSSSSSSSLLVSLSQR 96

Query: 1493 XXXXXXXXXXXS---PLDQQFLLGQVARMLRLSESDEMCIKEFHKIHP-EAKNRGFG-RV 1329
                           P  Q+ +  Q+ RMLRLSE++E  ++EF  +H  +  NR FG RV
Sbjct: 97   PQSLAVRVHSVHFISPQQQRHIKAQITRMLRLSEAEEKAVREFRSVHAADHPNRSFGGRV 156

Query: 1328 FRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKP 1149
            FRSPTLFEDMVKC+LLCNCQWPRTL+MA+ALCELQ  L++      G   A +     K 
Sbjct: 157  FRSPTLFEDMVKCILLCNCQWPRTLSMAQALCELQSGLQN------GLPCAVEGSGNPKV 210

Query: 1148 NTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCFLSKEKPS 969
              E F+P TP  +E +RK    K P        + E +LE E     Q    F S    S
Sbjct: 211  EAEEFVPKTPASKENRRK----KAPTKGVLLKKKLELELEMEVDGNLQMDHMFAS----S 262

Query: 968  PSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELAS 789
                +  + +  +   + CQ  N+    D                    G+FPSP ELA+
Sbjct: 263  SDTTLLGDLEVLRSDDSCCQFPNEGEYFD------------------HTGNFPSPIELAN 304

Query: 788  LDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPSLYDKLAKQLMEID 609
            L   FLAKRCKLGYRA  I+ELA+ I EG+ Q+EQLEEL  +  + S Y +L  QL  I 
Sbjct: 305  LSESFLAKRCKLGYRAGYILELAQGIVEGKIQLEQLEELSKDASL-SCYKQLGDQLKPIK 363

Query: 608  GFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQFKFL 429
            GFGPFT ANVLMC+G+Y VIP DSET+RHLK++H  +++++T++RD+E++YGKY  ++FL
Sbjct: 364  GFGPFTRANVLMCLGYYHVIPWDSETVRHLKQVHSKNTSSKTIERDLEEIYGKYEPYQFL 423

Query: 428  AYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327
            A+WSE+W+FYE  FGK +EM    Y  ITASNMR
Sbjct: 424  AFWSEIWDFYETRFGKMNEMHSSEYKRITASNMR 457


>ref|XP_004959865.1| PREDICTED: uncharacterized protein LOC101766322 [Setaria italica]
          Length = 461

 Score =  343 bits (881), Expect = 1e-91
 Identities = 203/459 (44%), Positives = 263/459 (57%), Gaps = 19/459 (4%)
 Frame = -1

Query: 1646 FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1467
            FDL  AVCSHGLFMMAPN WDP+ + L RP                              
Sbjct: 36   FDLAAAVCSHGLFMMAPNRWDPAARALVRPLRLASDRSASLLARVSAHPARPGTALLVAV 95

Query: 1466 XXSP----LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLFEDM 1299
              +     LD+ ++L QV RMLRLSE D   + EF  +H  A+  GFGR+FRSPTLFEDM
Sbjct: 96   EGADALSSLDRDYILEQVRRMLRLSEEDGAAVAEFQAMHAAAREEGFGRIFRSPTLFEDM 155

Query: 1298 VKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITP 1119
            VKC+LLCNCQW RTL+MA ALCE+QL LK  S                  + E F   TP
Sbjct: 156  VKCILLCNCQWTRTLSMATALCEIQLELKCSS------------------SVEDFQSRTP 197

Query: 1118 IGRELKRKRSMKK-IPANLDCKFSENETK---LEAETTN--CHQQTTCFLSKEKPSPSFL 957
              RE KRKRS ++ +   L+ +F+E++ +   + + T+N   H +T  +LS      S  
Sbjct: 198  PIRERKRKRSKRQSVRIKLETRFAEDKLEGPTIASGTSNDLTHPETNEYLSSLASVASET 257

Query: 956  ISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELASLDVD 777
             SA  D      NS   LN+   ++ C                 IGDFP+P+ELA+LD  
Sbjct: 258  GSAC-DSLPSLDNSELSLNNAPGLEDC-----------------IGDFPTPEELANLDEG 299

Query: 776  FLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIP---------SLYDKLAKQ 624
            FLAKRC LGYRA RI+ LAR + EG+  +++LEE+ C   +P         S  ++L K+
Sbjct: 300  FLAKRCNLGYRAKRIVMLARGVVEGKVCLQKLEEM-CRISVPAAEEVSTIESACERLNKE 358

Query: 623  LMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYA 444
            L  I GFGPFT ANVLMCMGF   IP D+ET+RHLK++H  +ST  +V ++++K+YGKYA
Sbjct: 359  LSAISGFGPFTRANVLMCMGFNHTIPADTETIRHLKQVHKRASTISSVHQELDKIYGKYA 418

Query: 443  QFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327
             F+FLAYW ELW FY K FGK  EM P NY L TAS+++
Sbjct: 419  PFQFLAYWFELWGFYNKQFGKICEMEPSNYRLFTASHLK 457


>ref|XP_002519384.1| conserved hypothetical protein [Ricinus communis]
            gi|223541451|gb|EEF43001.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 458

 Score =  338 bits (868), Expect = 3e-90
 Identities = 199/458 (43%), Positives = 255/458 (55%), Gaps = 12/458 (2%)
 Frame = -1

Query: 1664 GDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXX 1485
            G++  +FDLEK VCSHGLFM++PN WDP ++T  RP                        
Sbjct: 16   GEAADTFDLEKTVCSHGLFMLSPNHWDPLSRTFSRPLRLNDDTDNSLMVSISQHLSKSLL 75

Query: 1484 XXXXXXXXS-PLDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGF-------GRV 1329
                      P  Q+ LL Q+ RMLRLS+ DE   +EF KI    +           GRV
Sbjct: 76   VRVYGNRSLSPKHQESLLVQIVRMLRLSDMDEFNAREFRKIVSAFEGEECPLIGDFGGRV 135

Query: 1328 FRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKP 1149
             RSPTLFEDMVKC+LLCNCQW RTL+MA ALC+ Q+ L S S +              K 
Sbjct: 136  LRSPTLFEDMVKCILLCNCQWSRTLSMADALCKFQIELHSQSPQQ-------------KH 182

Query: 1148 NTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETKLEAETTNCHQQTTCFLSKEKPS 969
                F+P TP+ +E KRK  + K+P                E+ +     TC  + +   
Sbjct: 183  AFNHFIPNTPVKKEPKRKIRLSKVPT---------------ESMDLEAADTCLTTDDSQM 227

Query: 968  P-SFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSCR---IGDFPSPK 801
              S  ++  +D S     SCQ  N        + SD  +        C     G+FPSP+
Sbjct: 228  KISNSLNCVDDGSFDNLKSCQGSNTFYSTGPYATSD--IQSHLVTQHCAKKTTGNFPSPR 285

Query: 800  ELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPSLYDKLAKQL 621
            ELA+LD  FLAKRC LGYRA RII+LA+ I EGR  + + E++     + S Y KL  QL
Sbjct: 286  ELANLDERFLAKRCGLGYRAGRIIKLAQGIVEGRIPLREFEQVSNGGSL-STYSKLTDQL 344

Query: 620  MEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQ 441
             EI+GFGPFT ANVLMCMGFY VIP DSET+RH K++H  +ST +TVQ + E++Y K+A 
Sbjct: 345  REIEGFGPFTRANVLMCMGFYHVIPTDSETVRHFKQVHAKNSTIKTVQSEAEEIYRKFAP 404

Query: 440  FKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327
            F+FL YW+ELW+FYE+ FGK SEMP  NY LITASN+R
Sbjct: 405  FQFLVYWAELWHFYEQRFGKLSEMPCSNYKLITASNLR 442


>gb|EEE67737.1| hypothetical protein OsJ_25428 [Oryza sativa Japonica Group]
          Length = 442

 Score =  330 bits (846), Expect = 1e-87
 Identities = 196/454 (43%), Positives = 255/454 (56%), Gaps = 14/454 (3%)
 Frame = -1

Query: 1646 FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1467
            FDLE AVCSHGLFMMAPN WDP+++ L RP                              
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 1466 XXSP-------LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLF 1308
              +P       LDQ  +L QV RMLRL E D   + EF  +H  A+  GFGR+FRSPTLF
Sbjct: 97   LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 1307 EDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLP 1128
            EDM+KC+LLCNCQW RTL+M+ ALCELQL L+S S                  +TE F  
Sbjct: 157  EDMIKCILLCNCQWTRTLSMSTALCELQLELRSSS------------------STENFQS 198

Query: 1127 ITPIGRELKRKRSMKK-IPANLDCKFSEN------ETKLEAETTNCHQQTTCFLSKEKPS 969
             TP  RE KRKRS K+ +   L+ KF+E+      +  L   T N +  +    + E  +
Sbjct: 199  RTPPIRECKRKRSNKRNVRVKLETKFNEDKMVCLEDPNLATNTANENLFSLPSSANETGN 258

Query: 968  PSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELAS 789
             S  +S +  + K +   C        ++ C                  GDFP+P+ELA+
Sbjct: 259  TSE-VSLDHSELKLRYELC--------LEDCG-----------------GDFPTPEELAN 292

Query: 788  LDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEELDCNREIPSLYDKLAKQLMEID 609
            LD DFLAKRC LGYRA RI+ LARSI EG+  +++LEE+   R+I      L ++L  I 
Sbjct: 293  LDEDFLAKRCNLGYRARRIVMLARSIVEGKICLQKLEEI---RKI------LIEELSTIS 343

Query: 608  GFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQFKFL 429
            G  PF   NVLMCMGF+ +IP D+ET+RHLK+ H  +ST  +VQ++++ +YGKYA F+FL
Sbjct: 344  GIWPFHSCNVLMCMGFFHMIPADTETIRHLKQFHKRASTISSVQKELDNIYGKYAPFQFL 403

Query: 428  AYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327
            AYW ELW FY K FG  S+M P NY L TAS ++
Sbjct: 404  AYWCELWGFYNKQFGIISDMEPINYRLFTASKLK 437


>dbj|BAC15471.1| hypothetical protein [Oryza sativa Japonica Group]
            gi|50510134|dbj|BAD31099.1| hypothetical protein [Oryza
            sativa Japonica Group]
          Length = 501

 Score =  326 bits (835), Expect = 2e-86
 Identities = 202/504 (40%), Positives = 262/504 (51%), Gaps = 64/504 (12%)
 Frame = -1

Query: 1646 FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXXXXXXXXXXXXXX 1467
            FDLE AVCSHGLFMMAPN WDP+++ L RP                              
Sbjct: 37   FDLEAAVCSHGLFMMAPNRWDPASRALVRPLRLASDRAASVAVRVSRHPARPSDALLVSV 96

Query: 1466 XXSP-------LDQQFLLGQVARMLRLSESDEMCIKEFHKIHPEAKNRGFGRVFRSPTLF 1308
              +P       LDQ  +L QV RMLRL E D   + EF  +H  A+  GFGR+FRSPTLF
Sbjct: 97   LGAPDDDALSPLDQTSILEQVRRMLRLDEEDGRAVAEFQAMHAVAREVGFGRIFRSPTLF 156

Query: 1307 EDMVKCMLLCNCQ------------------------------------------WPRTL 1254
            EDM+KC+LLCNCQ                                          W RTL
Sbjct: 157  EDMIKCILLCNCQFSLPLPLPSLASTSMRNSDTNMSRYLGIAIFHLHSTVLFNCRWTRTL 216

Query: 1253 TMARALCELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKK-I 1077
            +M+ ALCELQL L+S S                  +TE F   TP  RE KRKRS K+ +
Sbjct: 217  SMSTALCELQLELRSSS------------------STENFQSRTPPIRECKRKRSNKRNV 258

Query: 1076 PANLDCKFSEN------ETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNS 915
               L+ KF+E+      +  L   T N +  +    + E  + S  +S +  + K +   
Sbjct: 259  RVKLETKFNEDKMVCLEDPNLATNTANENLFSLPSSANETGNTSE-VSLDHSELKLRYEL 317

Query: 914  CQLLNDNNKVDACSMSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANR 735
            C        ++ C                  GDFP+P+ELA+LD DFLAKRC LGYRA R
Sbjct: 318  C--------LEDCG-----------------GDFPTPEELANLDEDFLAKRCNLGYRARR 352

Query: 734  IIELARSITEGRFQIEQLEEL--------DCNREIPSLYDKLAKQLMEIDGFGPFTCANV 579
            I+ LARSI EG+  +++LEE+        +     PS YD+L ++L  I GFGPFT ANV
Sbjct: 353  IVMLARSIVEGKICLQKLEEIRKMSVPTVEGLSTTPSTYDRLNEELSTISGFGPFTRANV 412

Query: 578  LMCMGFYQVIPIDSETLRHLKKIHGISSTNRTVQRDVEKVYGKYAQFKFLAYWSELWNFY 399
            LMCMGF+ +IP D+ET+RHLK+ H  +ST  +VQ++++ +YGKYA F+FLAYW ELW FY
Sbjct: 413  LMCMGFFHMIPADTETIRHLKQFHKRASTISSVQKELDNIYGKYAPFQFLAYWCELWGFY 472

Query: 398  EKSFGKASEMPPPNYHLITASNMR 327
             K FG  S+M P NY L TAS ++
Sbjct: 473  NKQFGIISDMEPINYRLFTASKLK 496


>ref|XP_007023217.1| Uncharacterized protein isoform 2 [Theobroma cacao]
            gi|508778583|gb|EOY25839.1| Uncharacterized protein
            isoform 2 [Theobroma cacao]
          Length = 426

 Score =  322 bits (826), Expect = 3e-85
 Identities = 206/482 (42%), Positives = 257/482 (53%), Gaps = 18/482 (3%)
 Frame = -1

Query: 1718 EEHHNPXXXXSCLLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPX 1554
            EE+ N     S L++L +G++ ++     F+LEKAVCSHGLFMMAPN WDP +++L RP 
Sbjct: 21   EENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPL 80

Query: 1553 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPLDQQF---LLGQVARMLRLSESDEMCI 1383
                                             L  Q    LL QV+RMLRLSE +E  +
Sbjct: 81   RLLDHHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKV 140

Query: 1382 KEFHKI----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALC 1233
            +EF KI    H E +      R F GRVFRSPTLFEDMVKC+LLCNCQ            
Sbjct: 141  REFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQ------------ 188

Query: 1232 ELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF 1053
                              A++D          F+P TP G ELKRK  + K+   L+ KF
Sbjct: 189  ------------------AAEDD---------FIPKTPAGNELKRKLRVSKVSMRLEGKF 221

Query: 1052 SENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACS 873
            +E                                   D SK      Q L++ +      
Sbjct: 222  AEPRA--------------------------------DHSKSDLQPSQELDEPHAYKG-- 247

Query: 872  MSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQ 693
                            +G FPSP+ELA+LD  FLAKRC LGYRA+RI++LA+ I +G  Q
Sbjct: 248  ----------------MGSFPSPEELANLDESFLAKRCNLGYRASRILKLAKGIVQGIIQ 291

Query: 692  IEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKK 513
            + QLEE  C     S Y+KLA+QL +IDGFGPFTCANVLMCMGFY VIP DSET+RHLK+
Sbjct: 292  LMQLEE-GCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQ 350

Query: 512  IHGISSTNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASN 333
            +H  SST +TV RDVE +Y KYA F+FLAYW+ELW++YE+ FGK SEMP   Y LITASN
Sbjct: 351  VHSKSSTMQTVGRDVEGIYAKYAPFQFLAYWAELWHYYEQRFGKLSEMPFCGYKLITASN 410

Query: 332  MR 327
            M+
Sbjct: 411  MK 412


>ref|XP_006470787.1| PREDICTED: uncharacterized protein LOC102629917 isoform X2 [Citrus
            sinensis]
          Length = 409

 Score =  315 bits (806), Expect = 5e-83
 Identities = 193/444 (43%), Positives = 252/444 (56%), Gaps = 24/444 (5%)
 Frame = -1

Query: 1682 LLKLELGDSYSSFDLEKAVCSHGLFMMAPNLWDPSTKTLQRPXXXXXXXXXXXXXXXXXX 1503
            LLKL L ++   F+LE AVCSHGLFMM+PN WDP +++L RP                  
Sbjct: 7    LLKLPLAET---FNLETAVCSHGLFMMSPNRWDPLSRSLSRPLHLSNSLDNTDIPSVSVD 63

Query: 1502 XXXXXXXXXXXXXXSPL--------------DQQFLLGQVARMLRLSESDEMCIKEFHKI 1365
                            +               Q  LL QV RMLRLSE+DE  ++EF +I
Sbjct: 64   VTICQPQQDPHSLRIEVRNSASGSAPSLSQEQQDALLAQVKRMLRLSEADERNVREFKRI 123

Query: 1364 HPE-AKNRG---------FGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNL 1215
              + A+  G          GRVFRSPTLFEDMVKCMLLCNCQWPRTL+MARALCELQ  L
Sbjct: 124  VRQVAQEEGEETQYMEDFSGRVFRSPTLFEDMVKCMLLCNCQWPRTLSMARALCELQWEL 183

Query: 1214 KSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKFSENETK 1035
            +                +C    +E F+P TP G+E KR++ + K+ + L  + +E++  
Sbjct: 184  Q----------------HCSPSISEDFIPQTPAGKESKRRQKVSKVASKLTSRIAESKAS 227

Query: 1034 LEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETL 855
             E +  N        L +E   PSF  +  E D  G       LN+ +  D  S  D   
Sbjct: 228  SE-DYMNLKLDCAGVL-EENVQPSFPQNDIESDLHG-------LNELSTTDPPSARD--- 275

Query: 854  SEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEE 675
                     RIG+FPSP+ELA+LD  FLAKRC LGYRA RI++LAR I +G+ Q+ +LE+
Sbjct: 276  ---------RIGNFPSPRELANLDESFLAKRCNLGYRAGRILKLARGIVDGQIQLRELED 326

Query: 674  LDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISS 495
            + CN    + Y KLA+QL +I+GFGPFT  NVL+C+GFY VIP DSET+RHLK++H  + 
Sbjct: 327  M-CNEASLTAYVKLAEQLSQINGFGPFTRNNVLVCIGFYHVIPTDSETIRHLKQVHARNC 385

Query: 494  TNRTVQRDVEKVYGKYAQFKFLAY 423
            T++TVQ   E +YGKYA F+FLAY
Sbjct: 386  TSKTVQMIAESIYGKYAPFQFLAY 409


>ref|XP_007023218.1| Uncharacterized protein isoform 3 [Theobroma cacao]
            gi|508778584|gb|EOY25840.1| Uncharacterized protein
            isoform 3 [Theobroma cacao]
          Length = 421

 Score =  308 bits (790), Expect = 4e-81
 Identities = 199/450 (44%), Positives = 247/450 (54%), Gaps = 18/450 (4%)
 Frame = -1

Query: 1718 EEHHNPXXXXSCLLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPX 1554
            EE+ N     S L++L +G++ ++     F+LEKAVCSHGLFMMAPN WDP +++L RP 
Sbjct: 36   EENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPL 95

Query: 1553 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPLDQQF---LLGQVARMLRLSESDEMCI 1383
                                             L  Q    LL QV+RMLRLSE +E  +
Sbjct: 96   RLLDHHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKV 155

Query: 1382 KEFHKI----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALC 1233
            +EF KI    H E +      R F GRVFRSPTLFEDMVKC+LLCNCQ+ RTL+MA+ALC
Sbjct: 156  REFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALC 215

Query: 1232 ELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF 1053
            ELQ   +     + G   A  D          F+P TP G ELKRK  + K+   L+ KF
Sbjct: 216  ELQFETQRP---FSGVRAAEDD----------FIPKTPAGNELKRKLRVSKVSMRLEGKF 262

Query: 1052 SENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACS 873
                    AE    H ++    S+E   P           KG                  
Sbjct: 263  --------AEPRADHSKSDLQPSQELDEPHAY--------KG------------------ 288

Query: 872  MSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQ 693
                            +G FPSP+ELA+LD  FLAKRC LGYRA+RI++LA+ I +G  Q
Sbjct: 289  ----------------MGSFPSPEELANLDESFLAKRCNLGYRASRILKLAKGIVQGIIQ 332

Query: 692  IEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKK 513
            + QLEE  C     S Y+KLA+QL +IDGFGPFTCANVLMCMGFY VIP DSET+RHLK+
Sbjct: 333  LMQLEE-GCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQ 391

Query: 512  IHGISSTNRTVQRDVEKVYGKYAQFKFLAY 423
            +H  SST +TV RDVE +Y KYA F+FLAY
Sbjct: 392  VHSKSSTMQTVGRDVEGIYAKYAPFQFLAY 421


>ref|XP_007023219.1| Uncharacterized protein isoform 4 [Theobroma cacao]
            gi|508778585|gb|EOY25841.1| Uncharacterized protein
            isoform 4 [Theobroma cacao]
          Length = 406

 Score =  289 bits (739), Expect = 3e-75
 Identities = 190/436 (43%), Positives = 236/436 (54%), Gaps = 18/436 (4%)
 Frame = -1

Query: 1718 EEHHNPXXXXSCLLKLELGDSYSS-----FDLEKAVCSHGLFMMAPNLWDPSTKTLQRPX 1554
            EE+ N     S L++L +G++ ++     F+LEKAVCSHGLFMMAPN WDP +++L RP 
Sbjct: 21   EENGNSSSCCSVLIELPVGEAAAAEGAGPFNLEKAVCSHGLFMMAPNQWDPISRSLSRPL 80

Query: 1553 XXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXSPLDQQF---LLGQVARMLRLSESDEMCI 1383
                                             L  Q    LL QV+RMLRLSE +E  +
Sbjct: 81   RLLDHHSPPLTVQVRISQPTASTLHLRVYGTRCLSPQHRHSLLNQVSRMLRLSEEEESKV 140

Query: 1382 KEFHKI----HPEAKN-----RGF-GRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALC 1233
            +EF KI    H E +      R F GRVFRSPTLFEDMVKC+LLCNCQ+ RTL+MA+ALC
Sbjct: 141  REFRKIVEALHGEEEAAAECLRSFSGRVFRSPTLFEDMVKCILLCNCQFSRTLSMAKALC 200

Query: 1232 ELQLNLKSDSFKYLGTEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKKIPANLDCKF 1053
            ELQ   +     + G   A  D          F+P TP G ELKRK  + K+   L+ KF
Sbjct: 201  ELQFETQRP---FSGVRAAEDD----------FIPKTPAGNELKRKLRVSKVSMRLEGKF 247

Query: 1052 SENETKLEAETTNCHQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACS 873
                    AE    H ++    S+E   P           KG                  
Sbjct: 248  --------AEPRADHSKSDLQPSQELDEPHAY--------KG------------------ 273

Query: 872  MSDETLSEGRTDSSCRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQ 693
                            +G FPSP+ELA+LD  FLAKRC LGYRA+RI++LA+ I +G  Q
Sbjct: 274  ----------------MGSFPSPEELANLDESFLAKRCNLGYRASRILKLAKGIVQGIIQ 317

Query: 692  IEQLEELDCNREIPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKK 513
            + QLEE  C     S Y+KLA+QL +IDGFGPFTCANVLMCMGFY VIP DSET+RHLK+
Sbjct: 318  LMQLEE-GCKEISLSSYNKLAEQLRQIDGFGPFTCANVLMCMGFYHVIPADSETIRHLKQ 376

Query: 512  IHGISSTNRTVQRDVE 465
            +H  SST +TV RDVE
Sbjct: 377  VHSKSSTMQTVGRDVE 392


>gb|EMT03969.1| hypothetical protein F775_22747 [Aegilops tauschii]
          Length = 333

 Score =  287 bits (735), Expect = 9e-75
 Identities = 173/356 (48%), Positives = 216/356 (60%), Gaps = 9/356 (2%)
 Frame = -1

Query: 1367 IHPEAKNRGFGRVFRSPTLFEDMVKCMLLCNCQWPRTLTMARALCELQLNLKSDSFKYLG 1188
            +H  A+  GFGR+FRSPTLFEDMVKC+LLCNCQW RTL+MA ALCELQL LK  +     
Sbjct: 1    MHAAAREAGFGRIFRSPTLFEDMVKCILLCNCQWTRTLSMATALCELQLELKCSA----- 55

Query: 1187 TEVASQDPNCLKPNTEGFLPITPIGRELKRKRSMKK-IPANLDCKFSENETKLEAETTNC 1011
                          TE     TP  RE KRKRS  + +   L+ KF+E E  LE      
Sbjct: 56   -------------GTEDLQLRTPPIREHKRKRSKNQNVRVKLEKKFTELEC-LEDPRVET 101

Query: 1010 HQQTTCFLSKEKPSPSFLISAEEDDSKGKRNSCQLLNDNNKVDACSMSDETLSEGRTDSS 831
             Q T     +     S +I+  E D K   +  Q+  +   V     S E   EG     
Sbjct: 102  AQDT-----RVATGTSDVITHLEADEK-LASLPQVAPETGSVCQSFDSSELSLEG----- 150

Query: 830  CRIGDFPSPKELASLDVDFLAKRCKLGYRANRIIELARSITEGRFQIEQLEE-----LDC 666
            C IGDFP+P+ELA+LD DFLAKRC LGYRA RI+ LARSI EG+   + LEE     L  
Sbjct: 151  C-IGDFPTPEELANLDEDFLAKRCGLGYRAERIVLLARSIVEGKVCPQNLEEMQKMSLPA 209

Query: 665  NRE---IPSLYDKLAKQLMEIDGFGPFTCANVLMCMGFYQVIPIDSETLRHLKKIHGISS 495
              E   IPS Y++L  +L  I GFGPFT ANVLMCMGF+ +IP D+ET+RHLK+ H I+S
Sbjct: 210  TEELSTIPSTYERLNNELTTISGFGPFTRANVLMCMGFFHMIPADTETIRHLKQCHEIAS 269

Query: 494  TNRTVQRDVEKVYGKYAQFKFLAYWSELWNFYEKSFGKASEMPPPNYHLITASNMR 327
            T ++V  +++K+YG+YA F+FLAYW ELW FY+K FGK +EM P  Y L TAS ++
Sbjct: 270  TIKSVHMELDKIYGEYAPFQFLAYWFELWGFYDKQFGKITEMDPSTYRLFTASALK 325

