BLASTX nr result

ID: Akebia26_contig00014335

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia26_contig00014335
         (981 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_002278231.1| PREDICTED: uncharacterized protein LOC100253...   370   e-100
emb|CAN64033.1| hypothetical protein VITISV_028159 [Vitis vinifera]   362   1e-97
ref|XP_004308966.1| PREDICTED: uncharacterized protein LOC101308...   347   6e-93
ref|XP_006353411.1| PREDICTED: uncharacterized protein LOC102605...   329   9e-88
ref|XP_006435834.1| hypothetical protein CICLE_v10030811mg [Citr...   328   2e-87
ref|XP_004167177.1| PREDICTED: LOW QUALITY PROTEIN: uncharacteri...   328   3e-87
ref|XP_004138409.1| PREDICTED: uncharacterized protein LOC101220...   328   3e-87
gb|EXB25852.1| hypothetical protein L484_012278 [Morus notabilis]     325   2e-86
ref|XP_004240891.1| PREDICTED: uncharacterized protein LOC101264...   323   7e-86
gb|AET79247.1| hypothetical protein [Glycine max]                     323   7e-86
ref|XP_003552200.1| PREDICTED: uncharacterized protein LOC100790...   323   7e-86
ref|XP_007163589.1| hypothetical protein PHAVU_001G247100g [Phas...   322   2e-85
ref|XP_003538488.1| PREDICTED: uncharacterized protein LOC100810...   321   3e-85
ref|XP_007220241.1| hypothetical protein PRUPE_ppa001873mg [Prun...   320   4e-85
ref|XP_006828155.1| hypothetical protein AMTR_s00023p00088020 [A...   316   1e-83
gb|EPS71417.1| hypothetical protein M569_03329 [Genlisea aurea]       315   2e-83
ref|XP_002311251.1| hypothetical protein POPTR_0008s07430g [Popu...   312   2e-82
ref|XP_002876351.1| hypothetical protein ARALYDRAFT_486055 [Arab...   311   2e-82
dbj|BAE98930.1| hypothetical protein [Arabidopsis thaliana]           310   6e-82
emb|CAB87413.1| putative protein [Arabidopsis thaliana]               310   6e-82

>ref|XP_002278231.1| PREDICTED: uncharacterized protein LOC100253544 [Vitis vinifera]
          Length = 749

 Score =  370 bits (949), Expect = e-100
 Identities = 200/329 (60%), Positives = 238/329 (72%), Gaps = 11/329 (3%)
 Frame = -1

Query: 954 MKPQLHSYKPVSLCSLQINSSA---------HISQKFTSNSLNFRPISPENRTRVKFSVP 802
           MKP  H+++     S+ ++S++         HI + F + +  FR IS   R  V  +V 
Sbjct: 1   MKP--HTHRGFGSGSVFVSSASGRRFDLQGNHIRKAFFAGNHVFRRISTGKRGGVDVAVR 58

Query: 801 YCRXXXXXXXXXXETSRN--PGGTRREMLVTPILAIGSYSLRSMVARADEKGSETXXXXX 628
            CR                   GTRRE+LVTP LAIG+YSLRS+VARA+E G+E      
Sbjct: 59  CCRSPVEKREPSCSDRDRLFEVGTRREVLVTPFLAIGAYSLRSVVARAEE-GTEAVMPAA 117

Query: 627 XXXXXXXXXVEEKMKKEDVIISRVYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQ 448
                     E+KM  E+ I+SR+YDAT IGEP+A+GKDKR+VWEKL+NARIV+LGEAEQ
Sbjct: 118 ASGTVPAAA-EKKM--EEAIVSRIYDATVIGEPMALGKDKRKVWEKLMNARIVYLGEAEQ 174

Query: 447 VPVRDDKELELEIVKTLRNKCLEQERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYT 268
           VP+RDD+ELELEIVK LR +C E ER +SLA EAFPCNLQE LN YMD RIDGETLKSY 
Sbjct: 175 VPIRDDRELELEIVKKLRKRCAENERPLSLALEAFPCNLQEPLNQYMDYRIDGETLKSYA 234

Query: 267 SHWPPRRWEEYEPLLSYCRDSGVRLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGS 88
           SHWPP+RW+EYEPLLSYCRD+GVRLVACG PLEVLRTVQAEGIRG SKA+R+ Y PP GS
Sbjct: 235 SHWPPQRWQEYEPLLSYCRDNGVRLVACGTPLEVLRTVQAEGIRGLSKAERRKYAPPAGS 294

Query: 87  GFIAGFTSISRRSSIEKGSPNQSVPFGPS 1
           GFI+GFTSISR+SSI+  SPNQSVPFGPS
Sbjct: 295 GFISGFTSISRKSSIDTNSPNQSVPFGPS 323


>emb|CAN64033.1| hypothetical protein VITISV_028159 [Vitis vinifera]
          Length = 749

 Score =  362 bits (930), Expect = 1e-97
 Identities = 192/298 (64%), Positives = 222/298 (74%), Gaps = 2/298 (0%)
 Frame = -1

Query: 888 HISQKFTSNSLNFRPISPENRTRVKFSVPYCRXXXXXXXXXXETSRN--PGGTRREMLVT 715
           HI + F + +  FR IS   R  V  +V  CR                   GTRRE+LVT
Sbjct: 30  HIRKAFFAGNHVFRRISTGKRGGVDVAVRCCRSPVEKREXSCSDRXRLFEVGTRREVLVT 89

Query: 714 PILAIGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISRVYDATAIG 535
           P LAIG+YSLRS+VARA+E G+E                E+KM  E+ I+SR+YDAT IG
Sbjct: 90  PFLAIGAYSLRSVVARAEE-GTEAVMPAAASGTVPAAA-EKKM--EEAIVSRIYDATVIG 145

Query: 534 EPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLEQERSISLA 355
           EP+A+GKDKR+VWEKL+NARIV+LGEAEQVP+RDD+ELELEIVK LR +C E ER +SLA
Sbjct: 146 EPMALGKDKRKVWEKLMNARIVYLGEAEQVPIRDDRELELEIVKKLRKRCAENERPLSLA 205

Query: 354 FEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGVRLVACGAP 175
            EAFPCNLQE LN YMD RIDGETLKSY SHWP + W+EYEP LSYCRD+GVRLVACG P
Sbjct: 206 LEAFPCNLQEXLNQYMDYRIDGETLKSYASHWPXQXWQEYEPXLSYCRDNGVRLVACGTP 265

Query: 174 LEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQSVPFGPS 1
           LEVLRTVQAEGIRG SKA+R+ Y PP GSGFI+GFTSISR+SSI+  SPNQSVPFGPS
Sbjct: 266 LEVLRTVQAEGIRGLSKAERRKYAPPAGSGFISGFTSISRKSSIDTNSPNQSVPFGPS 323


>ref|XP_004308966.1| PREDICTED: uncharacterized protein LOC101308136 [Fragaria vesca
           subsp. vesca]
          Length = 745

 Score =  347 bits (889), Expect = 6e-93
 Identities = 169/246 (68%), Positives = 199/246 (80%)
 Frame = -1

Query: 738 TRREMLVTPILAIGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISR 559
           TRR+ L+ P LA+G++ L+S VA AD+  S                 EE  K+E+VI SR
Sbjct: 76  TRRQALLLPSLALGAWFLKSAVASADD--SPPPSAPSMTVPVPVPRAEELKKEEEVITSR 133

Query: 558 VYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLE 379
           +YDATAIGEP+AVGKDK +VWEK++NARIV+LGEAEQVP+RDDKELELEIV+ L  +CLE
Sbjct: 134 IYDATAIGEPMAVGKDKSKVWEKVMNARIVYLGEAEQVPIRDDKELELEIVRNLNKRCLE 193

Query: 378 QERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGV 199
            ER++SLA EAFPC+LQEQLN YM+K IDGE LKSYTSHWPP+RW+EYEPLLSYCRD+GV
Sbjct: 194 SERALSLALEAFPCDLQEQLNQYMNKSIDGEALKSYTSHWPPQRWQEYEPLLSYCRDNGV 253

Query: 198 RLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQS 19
           R+VACG PL VLRTVQAEGI G SKADRKMY PP GSGFI+GFTSI+RRS ++  SPNQ 
Sbjct: 254 RIVACGTPLAVLRTVQAEGIHGLSKADRKMYAPPAGSGFISGFTSIARRSPVDSNSPNQI 313

Query: 18  VPFGPS 1
           VPFGPS
Sbjct: 314 VPFGPS 319


>ref|XP_006353411.1| PREDICTED: uncharacterized protein LOC102605434 isoform X1 [Solanum
           tuberosum]
          Length = 727

 Score =  329 bits (844), Expect = 9e-88
 Identities = 161/251 (64%), Positives = 194/251 (77%)
 Frame = -1

Query: 753 RNPGGTRREMLVTPILAIGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKED 574
           +N   TRR +L+ P+L IG  +LRS +ARAD+K                   +  +K E+
Sbjct: 71  QNSSTTRRNVLLMPLLTIGVCALRSAIARADDKPPPESTPQPPVTTVEAPPPDPVVKAEE 130

Query: 573 VIISRVYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLR 394
           VI SR+YDAT IGEPLA+GKDK++VWEKL+NAR+V+LGEAEQVP +DDKE+ELEIVK LR
Sbjct: 131 VINSRIYDATVIGEPLALGKDKKKVWEKLMNARVVYLGEAEQVPTQDDKEVELEIVKNLR 190

Query: 393 NKCLEQERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYC 214
            +C E ERSISLA EAFP NLQEQLN Y+ KRIDGE+LKSY +HWP + W +YEPLL+YC
Sbjct: 191 KRCAEAERSISLALEAFPSNLQEQLNQYLAKRIDGESLKSYVAHWPTQYWHDYEPLLTYC 250

Query: 213 RDSGVRLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKG 34
           R++GVRLVACG PLEVLRTVQAEGIRG SKADRK Y PP GSGFI+GF+S+SRRSS++  
Sbjct: 251 RENGVRLVACGLPLEVLRTVQAEGIRGLSKADRKKYAPPAGSGFISGFSSMSRRSSVDVN 310

Query: 33  SPNQSVPFGPS 1
             NQ  PFGPS
Sbjct: 311 MLNQPTPFGPS 321


>ref|XP_006435834.1| hypothetical protein CICLE_v10030811mg [Citrus clementina]
           gi|568865759|ref|XP_006486238.1| PREDICTED:
           uncharacterized protein LOC102607971 [Citrus sinensis]
           gi|557538030|gb|ESR49074.1| hypothetical protein
           CICLE_v10030811mg [Citrus clementina]
          Length = 729

 Score =  328 bits (842), Expect = 2e-87
 Identities = 161/247 (65%), Positives = 196/247 (79%), Gaps = 1/247 (0%)
 Frame = -1

Query: 738 TRREMLVTPILAIG-SYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIIS 562
           +RR + ++P++A+G S  L+S  A ADE                     E +K E+V++S
Sbjct: 62  SRRHVFLSPLIAVGASILLQSATASADETQPSPPGQPTTSIMPQIP---ETVKAEEVVVS 118

Query: 561 RVYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCL 382
           R+YDAT IGEPLAVG DKR+VWEKL+NAR+V+LGEAEQVPVRDD+ELEL+IVK LR +C+
Sbjct: 119 RIYDATVIGEPLAVGMDKRKVWEKLMNARVVYLGEAEQVPVRDDRELELQIVKNLRKRCV 178

Query: 381 EQERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSG 202
           E ER+I+LA EAFP +LQ+QLN Y DKRIDGETLKSY SHWPP+RW+EYEPLLSYCRD+G
Sbjct: 179 ESERTITLALEAFPSDLQDQLNQYTDKRIDGETLKSYASHWPPQRWQEYEPLLSYCRDNG 238

Query: 201 VRLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQ 22
           V+L+ACG PL+VLRTVQAEGI G SKADRK+Y PP GSGFI+GFTSIS RSS++  S  Q
Sbjct: 239 VQLLACGTPLKVLRTVQAEGIHGLSKADRKLYAPPAGSGFISGFTSISHRSSVDMNSLTQ 298

Query: 21  SVPFGPS 1
           SVPFGPS
Sbjct: 299 SVPFGPS 305


>ref|XP_004167177.1| PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein
           LOC101230293 [Cucumis sativus]
          Length = 756

 Score =  328 bits (840), Expect = 3e-87
 Identities = 159/246 (64%), Positives = 193/246 (78%)
 Frame = -1

Query: 738 TRREMLVTPILAIGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISR 559
           TRR +L  P++ IG+  L+S V RA+EK SET                    +E+VI SR
Sbjct: 85  TRRAVLGVPLIVIGARFLQSAVVRAEEKSSETVTPVVEAVTSPSPSPIAPTAEEEVITSR 144

Query: 558 VYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLE 379
           +YDAT IGEPLAVGKDK +VWEK++NAR+V+LGEAEQVP+RDDKELELEIVK L+ +C E
Sbjct: 145 IYDATVIGEPLAVGKDKSKVWEKIMNARVVYLGEAEQVPIRDDKELELEIVKNLKRRCGE 204

Query: 378 QERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGV 199
            ER++SLA EAFP +LQEQLN Y+DK IDGETLKSYT+HWPP+RW+EYEPLLSYCR +GV
Sbjct: 205 SERTLSLALEAFPSDLQEQLNQYVDKTIDGETLKSYTAHWPPQRWQEYEPLLSYCRVNGV 264

Query: 198 RLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQS 19
           RL+ACG PL+VLR VQAEGIRG SKADRK++ PP GSGFI+GF +ISRR+S +  S  Q 
Sbjct: 265 RLIACGTPLKVLRIVQAEGIRGLSKADRKVFAPPAGSGFISGFAAISRRTSADLNSSYQP 324

Query: 18  VPFGPS 1
           +PFGPS
Sbjct: 325 IPFGPS 330


>ref|XP_004138409.1| PREDICTED: uncharacterized protein LOC101220818 [Cucumis sativus]
          Length = 756

 Score =  328 bits (840), Expect = 3e-87
 Identities = 159/246 (64%), Positives = 193/246 (78%)
 Frame = -1

Query: 738 TRREMLVTPILAIGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISR 559
           TRR +L  P++ IG+  L+S V RA+EK SET                    +E+VI SR
Sbjct: 85  TRRAVLGVPLIVIGARFLQSAVVRAEEKSSETVTPVVEAVTSPSPSPIAPTAEEEVITSR 144

Query: 558 VYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLE 379
           +YDAT IGEPLAVGKDK +VWEK++NAR+V+LGEAEQVP+RDDKELELEIVK L+ +C E
Sbjct: 145 IYDATVIGEPLAVGKDKSKVWEKIMNARVVYLGEAEQVPIRDDKELELEIVKNLKRRCGE 204

Query: 378 QERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGV 199
            ER++SLA EAFP +LQEQLN Y+DK IDGETLKSYT+HWPP+RW+EYEPLLSYCR +GV
Sbjct: 205 SERTLSLALEAFPSDLQEQLNQYVDKTIDGETLKSYTAHWPPQRWQEYEPLLSYCRVNGV 264

Query: 198 RLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQS 19
           RL+ACG PL+VLR VQAEGIRG SKADRK++ PP GSGFI+GF +ISRR+S +  S  Q 
Sbjct: 265 RLIACGTPLKVLRIVQAEGIRGLSKADRKVFAPPAGSGFISGFAAISRRTSADLNSSYQP 324

Query: 18  VPFGPS 1
           +PFGPS
Sbjct: 325 IPFGPS 330


>gb|EXB25852.1| hypothetical protein L484_012278 [Morus notabilis]
          Length = 744

 Score =  325 bits (833), Expect = 2e-86
 Identities = 159/251 (63%), Positives = 194/251 (77%)
 Frame = -1

Query: 753 RNPGGTRREMLVTPILAIGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKED 574
           R  G TRR+ L+   LA+G++ L+S  A  ++                    EEK KKED
Sbjct: 71  RIKGHTRRQALLASSLALGAWFLQSATASGEDA---PPPPQQQKVTEAVPRDEEKEKKED 127

Query: 573 VIISRVYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLR 394
            I SR+YDAT IGEP+A+GKDK +VWEK++NARIV+LGEAEQVP+ DDK+LELEIVK L+
Sbjct: 128 AITSRIYDATVIGEPMAIGKDKGKVWEKVMNARIVYLGEAEQVPIGDDKDLELEIVKNLK 187

Query: 393 NKCLEQERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYC 214
            +C E ER +SLA EAFP +LQ+QLN YMDK IDG+TLK YTS+WPP+RW+EYEPLLSYC
Sbjct: 188 KRCAEIERPMSLALEAFPSDLQDQLNQYMDKSIDGQTLKGYTSYWPPQRWQEYEPLLSYC 247

Query: 213 RDSGVRLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKG 34
           RD+GVRLVACG PL+VLRTVQAEG+ G SKADRK+YTPP GSGFI+GF++ISRRSS++  
Sbjct: 248 RDNGVRLVACGTPLKVLRTVQAEGVTGLSKADRKLYTPPAGSGFISGFSAISRRSSVDMN 307

Query: 33  SPNQSVPFGPS 1
            PNQ VPF PS
Sbjct: 308 YPNQFVPFSPS 318


>ref|XP_004240891.1| PREDICTED: uncharacterized protein LOC101264128 [Solanum
           lycopersicum]
          Length = 747

 Score =  323 bits (828), Expect = 7e-86
 Identities = 159/246 (64%), Positives = 189/246 (76%)
 Frame = -1

Query: 738 TRREMLVTPILAIGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISR 559
           TRR +L+ P+L IG  +LRS +ARAD+K                   +  +K E+VI SR
Sbjct: 76  TRRNVLLMPLLTIGVCALRSAIARADDKPPPESTPQPPVTTVEAPTPDPVVKAEEVINSR 135

Query: 558 VYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLE 379
           +YDAT IGEPLA+GKDK++VWEKL+NAR+V+LGEAEQVP +DDKE+ELEIVK LR +C E
Sbjct: 136 IYDATVIGEPLALGKDKKKVWEKLMNARVVYLGEAEQVPTQDDKEVELEIVKNLRKRCAE 195

Query: 378 QERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGV 199
            ER ISLA EAFP NLQEQLN Y+ KRIDGE+LKSY  HWP + W EYEPLL+YCR++GV
Sbjct: 196 AERPISLALEAFPSNLQEQLNQYLAKRIDGESLKSYVVHWPTQYWHEYEPLLTYCRENGV 255

Query: 198 RLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQS 19
           RLVACG PLEVLRTVQAEGIRG SKADRK Y PP GSGFI+GF+S+SRRS+ +    NQ 
Sbjct: 256 RLVACGLPLEVLRTVQAEGIRGLSKADRKKYAPPAGSGFISGFSSMSRRSAADVNMLNQP 315

Query: 18  VPFGPS 1
            PFGPS
Sbjct: 316 TPFGPS 321


>gb|AET79247.1| hypothetical protein [Glycine max]
          Length = 673

 Score =  323 bits (828), Expect = 7e-86
 Identities = 168/298 (56%), Positives = 209/298 (70%), Gaps = 12/298 (4%)
 Frame = -1

Query: 858 LNFRPISPENRTRVKFSVPYCRXXXXXXXXXXETSRNPGGT-----------RREMLVTP 712
           L FR +S   R RV  SV +              + NPGG+           RR +L+ P
Sbjct: 41  LEFRRVSTAKRRRVSLSVCHASRVT--------AASNPGGSDGDGDTRARSSRRGVLMAP 92

Query: 711 ILAIG-SYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISRVYDATAIG 535
            L  G S  L +  ARA+EK +E+               EE    E+VI SR+YDAT IG
Sbjct: 93  FLVAGASILLSAATARAEEKAAESPLASAPKPEEPPKKKEE----EEVITSRIYDATVIG 148

Query: 534 EPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLEQERSISLA 355
           EPLA+GK+K +VWEKL+NAR+V+LGEAEQVPVRDD+ELELEIVK L  +CLE+E+ +SLA
Sbjct: 149 EPLAIGKEKGKVWEKLMNARVVYLGEAEQVPVRDDRELELEIVKNLHRRCLEKEKLLSLA 208

Query: 354 FEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGVRLVACGAP 175
            E FP NLQE LN YMDK+IDG+TLKSYT HWPP+RW+EYEP+LSYCR++G+ LVACG P
Sbjct: 209 LEVFPANLQEPLNQYMDKKIDGDTLKSYTLHWPPQRWQEYEPILSYCRENGIHLVACGTP 268

Query: 174 LEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQSVPFGPS 1
           L++LRTVQAEGIRG +K +RK+Y PP GSGFI+GFTSISRRSS++  + N S+PFGPS
Sbjct: 269 LKILRTVQAEGIRGLTKDERKLYAPPAGSGFISGFTSISRRSSVD-STQNLSIPFGPS 325


>ref|XP_003552200.1| PREDICTED: uncharacterized protein LOC100790538 [Glycine max]
          Length = 747

 Score =  323 bits (828), Expect = 7e-86
 Identities = 168/298 (56%), Positives = 209/298 (70%), Gaps = 12/298 (4%)
 Frame = -1

Query: 858 LNFRPISPENRTRVKFSVPYCRXXXXXXXXXXETSRNPGGT-----------RREMLVTP 712
           L FR +S   R RV  SV +              + NPGG+           RR +L+ P
Sbjct: 41  LEFRRVSTAKRRRVSLSVCHASRVT--------AASNPGGSDGDGDTRARSSRRGVLMAP 92

Query: 711 ILAIG-SYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISRVYDATAIG 535
            L  G S  L +  ARA+EK +E+               EE    E+VI SR+YDAT IG
Sbjct: 93  FLVAGASILLSAATARAEEKAAESPLASAPKPEEPPKKKEE----EEVITSRIYDATVIG 148

Query: 534 EPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLEQERSISLA 355
           EPLA+GK+K +VWEKL+NAR+V+LGEAEQVPVRDD+ELELEIVK L  +CLE+E+ +SLA
Sbjct: 149 EPLAIGKEKGKVWEKLMNARVVYLGEAEQVPVRDDRELELEIVKNLHRRCLEKEKLLSLA 208

Query: 354 FEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGVRLVACGAP 175
            E FP NLQE LN YMDK+IDG+TLKSYT HWPP+RW+EYEP+LSYCR++G+ LVACG P
Sbjct: 209 LEVFPANLQEPLNQYMDKKIDGDTLKSYTLHWPPQRWQEYEPILSYCRENGIHLVACGTP 268

Query: 174 LEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQSVPFGPS 1
           L++LRTVQAEGIRG +K +RK+Y PP GSGFI+GFTSISRRSS++  + N S+PFGPS
Sbjct: 269 LKILRTVQAEGIRGLTKDERKLYAPPAGSGFISGFTSISRRSSVD-STQNLSIPFGPS 325


>ref|XP_007163589.1| hypothetical protein PHAVU_001G247100g [Phaseolus vulgaris]
           gi|561037053|gb|ESW35583.1| hypothetical protein
           PHAVU_001G247100g [Phaseolus vulgaris]
          Length = 742

 Score =  322 bits (824), Expect = 2e-85
 Identities = 165/294 (56%), Positives = 210/294 (71%), Gaps = 10/294 (3%)
 Frame = -1

Query: 852 FRPISPENRTRVKFSVPYCRXXXXXXXXXXETSRNPGGT----------RREMLVTPILA 703
           FR I    R RV  SV +              + NPGG+          RR +L+ P LA
Sbjct: 42  FRRIWTAKRRRVHLSVRHSTRVA--------AALNPGGSNGDESRPRSSRRGVLMAPFLA 93

Query: 702 IGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISRVYDATAIGEPLA 523
            G+  L + VARA++K +E                 +K ++E+VI SR+YDA  IGEPLA
Sbjct: 94  AGASILLTAVARAEDKAAEPAPTAPKLEET------KKKEEEEVITSRIYDAAVIGEPLA 147

Query: 522 VGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLEQERSISLAFEAF 343
           +GK+K +VWEKL+NAR+V+LGEAEQVPVRDD+ELELEIVK L  +C E+E+ +SLA EAF
Sbjct: 148 IGKEKGKVWEKLMNARVVYLGEAEQVPVRDDRELELEIVKNLHRRCSEKEKKLSLALEAF 207

Query: 342 PCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGVRLVACGAPLEVL 163
           P NLQE LN YMDK+IDG+TLKSYT HWPP+RW+EYEP+LSYCR++G+RLVACG PL++L
Sbjct: 208 PSNLQEPLNQYMDKKIDGDTLKSYTLHWPPQRWQEYEPILSYCRENGIRLVACGTPLKIL 267

Query: 162 RTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQSVPFGPS 1
           RTVQAEGIRG +K +RK+Y PP GSGF++GFTSISRRSS++  + N S+PFGPS
Sbjct: 268 RTVQAEGIRGLTKEERKLYAPPAGSGFVSGFTSISRRSSVD-STLNLSIPFGPS 320


>ref|XP_003538488.1| PREDICTED: uncharacterized protein LOC100810366 [Glycine max]
          Length = 748

 Score =  321 bits (822), Expect = 3e-85
 Identities = 165/304 (54%), Positives = 212/304 (69%), Gaps = 14/304 (4%)
 Frame = -1

Query: 870 TSNSLNFRPISPENRTRVKFSVPYCRXXXXXXXXXXETSRNPGGT-----------RREM 724
           ++  L FR +S   R  +  SV +              + NPGG+           RR +
Sbjct: 37  SAGGLEFRRVSTSKRRLINLSVRHASRVT--------AASNPGGSDGDGDTRARSCRRGV 88

Query: 723 LVTPILAIGSYSLRSMV---ARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISRVY 553
           L+TP L  G+  L S     ARADEK +E+                +K ++E+VI SR+Y
Sbjct: 89  LMTPFLVAGASILLSAATATARADEKAAESAPAPAAPEEPP-----KKKEEEEVITSRIY 143

Query: 552 DATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLEQE 373
           DAT IGEPLA+GK+K ++WEKL+NAR+V+LGEAEQVPVRDD+ELELEIVK L  +CL +E
Sbjct: 144 DATVIGEPLAIGKEKGKIWEKLMNARVVYLGEAEQVPVRDDRELELEIVKNLHRRCLVKE 203

Query: 372 RSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGVRL 193
           + +SLA E FP NLQE LN YMDK+IDG+TLKSYT HWPP+RW+EYEP+LSYC ++G+RL
Sbjct: 204 KRLSLALEVFPANLQEPLNQYMDKKIDGDTLKSYTLHWPPQRWQEYEPILSYCHENGIRL 263

Query: 192 VACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQSVP 13
           VACG PL++LRTVQAEGIRG +K +RK+Y PP GSGFI+GFTSISRRSS++  + N S+P
Sbjct: 264 VACGTPLKILRTVQAEGIRGLTKDERKLYAPPAGSGFISGFTSISRRSSVD-STQNLSIP 322

Query: 12  FGPS 1
           FGPS
Sbjct: 323 FGPS 326


>ref|XP_007220241.1| hypothetical protein PRUPE_ppa001873mg [Prunus persica]
           gi|462416703|gb|EMJ21440.1| hypothetical protein
           PRUPE_ppa001873mg [Prunus persica]
          Length = 750

 Score =  320 bits (821), Expect = 4e-85
 Identities = 158/251 (62%), Positives = 193/251 (76%)
 Frame = -1

Query: 753 RNPGGTRREMLVTPILAIGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKED 574
           R+P  TRR  L+ P LA+G++ L+S VA A++  S                      + D
Sbjct: 84  RSPLHTRRHALLAPSLALGAWFLKSTVASAEDAPSPPPSP----------------SQTD 127

Query: 573 VIISRVYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLR 394
            I SR+YDA+AIGEP+AVGKDK +VWEK++NARI++LGEAEQVP+RDDKELELEIVK L 
Sbjct: 128 AITSRIYDASAIGEPVAVGKDKSKVWEKVMNARILYLGEAEQVPIRDDKELELEIVKNLW 187

Query: 393 NKCLEQERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYC 214
            +CLE ER++SLA EAFP +LQ+QLN YM K IDG+ LKSYTSHWP +RW+EYEPLLSYC
Sbjct: 188 KRCLESERALSLALEAFPSDLQDQLNQYMKKSIDGDALKSYTSHWPSQRWQEYEPLLSYC 247

Query: 213 RDSGVRLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKG 34
           RD+GVRLVACG PL+VLRTVQ++GI G SKADRK Y PP GSGFI+GFTS +RR+ ++  
Sbjct: 248 RDNGVRLVACGTPLKVLRTVQSKGISGLSKADRKAYAPPAGSGFISGFTSSTRRTPVDSN 307

Query: 33  SPNQSVPFGPS 1
           SPNQSVPFGPS
Sbjct: 308 SPNQSVPFGPS 318


>ref|XP_006828155.1| hypothetical protein AMTR_s00023p00088020 [Amborella trichopoda]
           gi|548832802|gb|ERM95571.1| hypothetical protein
           AMTR_s00023p00088020 [Amborella trichopoda]
          Length = 753

 Score =  316 bits (809), Expect = 1e-83
 Identities = 163/250 (65%), Positives = 188/250 (75%), Gaps = 5/250 (2%)
 Frame = -1

Query: 735 RREMLVTPILAIGSYSLRSMVARADEK---GSETXXXXXXXXXXXXXXV--EEKMKKEDV 571
           +R+ ++ P+LA+G   L S V RA+E    GS T                  E  +KE+ 
Sbjct: 70  KRQAILKPLLAVGFCFLHSKV-RAEEAPLTGSATKQESDRKEAPLTGSTVKPESNRKEET 128

Query: 570 IISRVYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRN 391
           + SR+YDA  IGEPLA+GKDK RVW+KLLN+RIV+LGEAEQVPVRDDK+LELEIVK LRN
Sbjct: 129 LNSRIYDANVIGEPLALGKDKSRVWDKLLNSRIVYLGEAEQVPVRDDKDLELEIVKNLRN 188

Query: 390 KCLEQERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCR 211
           KC EQ+R ISLA EAFPC++QEQLN YM KRIDGETLKS+  HWPP RW EYEPLL YC 
Sbjct: 189 KCFEQQRPISLALEAFPCDIQEQLNQYMSKRIDGETLKSFVPHWPPERWPEYEPLLRYCC 248

Query: 210 DSGVRLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGS 31
           D+GVRLVACG PLEVLRTVQAEG+RG SKADR  Y PP G+GFI GFTSISRRS I+  S
Sbjct: 249 DNGVRLVACGTPLEVLRTVQAEGMRGLSKADRNKYAPPNGAGFINGFTSISRRSPIDMIS 308

Query: 30  PNQSVPFGPS 1
           PN S+ FGPS
Sbjct: 309 PNPSLSFGPS 318


>gb|EPS71417.1| hypothetical protein M569_03329 [Genlisea aurea]
          Length = 723

 Score =  315 bits (807), Expect = 2e-83
 Identities = 157/257 (61%), Positives = 189/257 (73%), Gaps = 10/257 (3%)
 Frame = -1

Query: 744 GGTRREMLVTPILAIGSYSLRSMVARADEK---------GSETXXXXXXXXXXXXXXVEE 592
           G  RR++L+TP LA G+Y LRS VARA+EK                              
Sbjct: 43  GCKRRDVLITPFLAAGAYVLRSAVARAEEKSLPEAVGLSAPTLQQHVVETSPTPSDAASP 102

Query: 591 KMKKEDVIISRVYDATAIGEPLAVGKDKR-RVWEKLLNARIVFLGEAEQVPVRDDKELEL 415
              KE+VI SR+YDAT IGEP+A+GKDKR +VW+KL+N+RIV+LGEAEQVPVRDDKELEL
Sbjct: 103 TTPKEEVINSRIYDATVIGEPMALGKDKRNKVWDKLMNSRIVYLGEAEQVPVRDDKELEL 162

Query: 414 EIVKTLRNKCLEQERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEY 235
           EIVK  + +C E ER ISLA EAFPC+LQEQLN +MD+RI+ ETLKS+  HWPP RW+EY
Sbjct: 163 EIVKNFKRRCTEDERQISLALEAFPCDLQEQLNQFMDQRINAETLKSFVGHWPPERWQEY 222

Query: 234 EPLLSYCRDSGVRLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISR 55
           EPLL+YCRD+ VRL+ACG PLEVLRTVQ+EG+RG SK D K Y PP GSGFI+GF+S+SR
Sbjct: 223 EPLLTYCRDNAVRLIACGVPLEVLRTVQSEGVRGLSKPDLKKYAPPAGSGFISGFSSMSR 282

Query: 54  RSSIEKGSPNQSVPFGP 4
           RSSI+    NQS  +GP
Sbjct: 283 RSSIDMNFSNQSASYGP 299


>ref|XP_002311251.1| hypothetical protein POPTR_0008s07430g [Populus trichocarpa]
           gi|222851071|gb|EEE88618.1| hypothetical protein
           POPTR_0008s07430g [Populus trichocarpa]
          Length = 726

 Score =  312 bits (799), Expect = 2e-82
 Identities = 155/246 (63%), Positives = 193/246 (78%), Gaps = 1/246 (0%)
 Frame = -1

Query: 735 RREMLVTPILAIGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISRV 556
           RR++L+TP+LA+G   L+S  ++A+    E                E + K E+VI SR+
Sbjct: 61  RRQVLLTPLLALGVSILQSAASKAEVANKEPDSPPPPPPPV-----EAEKKAEEVISSRI 115

Query: 555 YDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLEQ 376
           YDAT IGEP+AVGKDKR+VWEK++N RIV+LGEAEQVP++DDKELELEIVK L+ +C E+
Sbjct: 116 YDATVIGEPMAVGKDKRKVWEKIMNGRIVYLGEAEQVPIKDDKELELEIVKNLKKQCDER 175

Query: 375 ERSISLAFEAFPCNLQEQLNLYMDKR-IDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGV 199
           E+SISLA EAFPC+LQ  LN Y+DKR IDGETLK Y + WPP+ W E EPLLSYCRD+G+
Sbjct: 176 EKSISLAMEAFPCDLQRLLNEYLDKRWIDGETLKGYMTQWPPQGWRECEPLLSYCRDNGI 235

Query: 198 RLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQS 19
           R+VACG PL+VLRTVQAEGIRG SKADRK+Y PP G+GFI+GF+SISRRS+ +  +P QS
Sbjct: 236 RIVACGVPLKVLRTVQAEGIRGLSKADRKLYAPPAGTGFISGFSSISRRST-DMNAPKQS 294

Query: 18  VPFGPS 1
           VPFGPS
Sbjct: 295 VPFGPS 300


>ref|XP_002876351.1| hypothetical protein ARALYDRAFT_486055 [Arabidopsis lyrata subsp.
           lyrata] gi|297322189|gb|EFH52610.1| hypothetical protein
           ARALYDRAFT_486055 [Arabidopsis lyrata subsp. lyrata]
          Length = 744

 Score =  311 bits (798), Expect = 2e-82
 Identities = 155/261 (59%), Positives = 192/261 (73%), Gaps = 9/261 (3%)
 Frame = -1

Query: 756 SRNPGGTRREMLVTPILAIG-SYSLRSMVARADEKGS--------ETXXXXXXXXXXXXX 604
           SR     R  +L  P+L++  S  L+  V+ A E  S        E+             
Sbjct: 57  SRTAVSRRAFLLAPPLLSVAASLFLKPSVSLATEASSSATVTSPAESAAPPPPATATAPS 116

Query: 603 XVEEKMKKEDVIISRVYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKE 424
                + KE+ I SR+YDATAIGEP+A+GKDK++VWEKL+NAR+V+LGEAEQVP +DDKE
Sbjct: 117 PPPAPVNKEETITSRIYDATAIGEPMAMGKDKKKVWEKLMNARVVYLGEAEQVPTKDDKE 176

Query: 423 LELEIVKTLRNKCLEQERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRW 244
           LELEIV+ LR +CLE ER IS+A EAFP +LQ+QLN YMDKR+DGETLKSY +HWP +RW
Sbjct: 177 LELEIVRNLRKRCLESERQISVALEAFPLDLQDQLNQYMDKRMDGETLKSYVTHWPAQRW 236

Query: 243 EEYEPLLSYCRDSGVRLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTS 64
           +EYEPLLSYCRD+ VRL+ACG PL+VLRTVQAEGIRG SK++RK+YTPP GSGFI+GF+S
Sbjct: 237 QEYEPLLSYCRDNSVRLIACGTPLKVLRTVQAEGIRGLSKSERKLYTPPAGSGFISGFSS 296

Query: 63  ISRRSSIEKGSPNQSVPFGPS 1
            SRRS+ +   P Q VPFGPS
Sbjct: 297 FSRRSTFDMSLPTQIVPFGPS 317


>dbj|BAE98930.1| hypothetical protein [Arabidopsis thaliana]
          Length = 735

 Score =  310 bits (794), Expect = 6e-82
 Identities = 153/246 (62%), Positives = 186/246 (75%)
 Frame = -1

Query: 738 TRREMLVTPILAIGSYSLRSMVARADEKGSETXXXXXXXXXXXXXXVEEKMKKEDVIISR 559
           TRR +LV P L   + SL   ++ A    +ET                  ++KE+ I SR
Sbjct: 66  TRRAILVAPPLLAAAASLFLSISSA--ASAETSAESVALPPVATAPPPPPVEKEEAITSR 123

Query: 558 VYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEIVKTLRNKCLE 379
           +YDA+ +GEP+AVGKDK+RVWEKLLNARIV+LGEAEQVP RDDK LELEIV+ LR +C+E
Sbjct: 124 IYDASVLGEPMAVGKDKKRVWEKLLNARIVYLGEAEQVPTRDDKVLELEIVRNLRKRCIE 183

Query: 378 QERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEPLLSYCRDSGV 199
            +R +SLA EAFP +LQEQLN YMDKR+DGE LKSY SHWP +RW+EYEPLLSYCRD+GV
Sbjct: 184 SDRQLSLALEAFPLDLQEQLNQYMDKRMDGEVLKSYVSHWPVQRWQEYEPLLSYCRDNGV 243

Query: 198 RLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRSSIEKGSPNQS 19
           +L+ACG PL+VLRTVQAEGIRG S+++RK+YTPP GSGFI+GFTS SR SS+      Q 
Sbjct: 244 KLIACGTPLKVLRTVQAEGIRGLSESERKLYTPPAGSGFISGFTSFSRSSSLNMNPLTQI 303

Query: 18  VPFGPS 1
           VPFGPS
Sbjct: 304 VPFGPS 309


>emb|CAB87413.1| putative protein [Arabidopsis thaliana]
          Length = 755

 Score =  310 bits (794), Expect = 6e-82
 Identities = 142/196 (72%), Positives = 172/196 (87%)
 Frame = -1

Query: 588 MKKEDVIISRVYDATAIGEPLAVGKDKRRVWEKLLNARIVFLGEAEQVPVRDDKELELEI 409
           + KE+ I SR+YDATAIGEP+A+GKDK++VWEKLLNAR+V+LGEAEQVP +DDKELELEI
Sbjct: 123 VNKEETITSRIYDATAIGEPMAMGKDKKKVWEKLLNARVVYLGEAEQVPTKDDKELELEI 182

Query: 408 VKTLRNKCLEQERSISLAFEAFPCNLQEQLNLYMDKRIDGETLKSYTSHWPPRRWEEYEP 229
           V+ LR +C+E ER IS+A EAFP +LQ+QLN YMDKR+DGETLKSY +HWP +RW+EYEP
Sbjct: 183 VRNLRKRCVESERQISVALEAFPLDLQDQLNQYMDKRMDGETLKSYVTHWPAQRWQEYEP 242

Query: 228 LLSYCRDSGVRLVACGAPLEVLRTVQAEGIRGFSKADRKMYTPPVGSGFIAGFTSISRRS 49
           LLSYCRD+ VRL+ACG PL+VLRTVQAEGIRG SK++RK+YTPP GSGFI+GF+S SRRS
Sbjct: 243 LLSYCRDNSVRLIACGTPLKVLRTVQAEGIRGLSKSERKLYTPPAGSGFISGFSSFSRRS 302

Query: 48  SIEKGSPNQSVPFGPS 1
           + +   P Q VPFGPS
Sbjct: 303 TFDMSLPTQIVPFGPS 318

