BLASTX nr result

ID: Akebia22_contig00004195 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia22_contig00004195
         (2441 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

ref|XP_007211756.1| hypothetical protein PRUPE_ppa009484mg [Prun...   344   1e-91
ref|XP_002274647.1| PREDICTED: uncharacterized protein LOC100252...   339   3e-90
emb|CBI31835.3| unnamed protein product [Vitis vinifera]              337   2e-89
ref|XP_004140790.1| PREDICTED: uncharacterized protein LOC101219...   334   1e-88
ref|XP_004306725.1| PREDICTED: uncharacterized protein LOC101305...   332   5e-88
gb|EXC15938.1| hypothetical protein L484_015739 [Morus notabilis]     329   4e-87
ref|XP_002322447.2| hypothetical protein POPTR_0015s13640g [Popu...   327   2e-86
ref|XP_006490330.1| PREDICTED: uncharacterized protein LOC102620...   321   9e-85
ref|XP_002510838.1| conserved hypothetical protein [Ricinus comm...   311   1e-81
ref|XP_007038593.1| Uncharacterized protein isoform 1 [Theobroma...   310   2e-81
ref|XP_004234358.1| PREDICTED: uncharacterized protein LOC101265...   307   2e-80
ref|XP_006353363.1| PREDICTED: uncharacterized protein LOC102585...   305   5e-80
ref|XP_006353362.1| PREDICTED: uncharacterized protein LOC102585...   305   5e-80
ref|XP_006353360.1| PREDICTED: uncharacterized protein LOC102585...   305   5e-80
ref|XP_006353361.1| PREDICTED: uncharacterized protein LOC102585...   305   6e-80
ref|XP_006281716.1| hypothetical protein CARUB_v10027872mg [Caps...   295   5e-77
ref|XP_006599632.1| PREDICTED: uncharacterized protein LOC100817...   293   3e-76
ref|XP_003548220.1| PREDICTED: uncharacterized protein LOC100817...   293   3e-76
ref|XP_002865893.1| hypothetical protein ARALYDRAFT_495283 [Arab...   293   3e-76
ref|XP_007152339.1| hypothetical protein PHAVU_004G121600g [Phas...   293   3e-76

>ref|XP_007211756.1| hypothetical protein PRUPE_ppa009484mg [Prunus persica]
            gi|462407621|gb|EMJ12955.1| hypothetical protein
            PRUPE_ppa009484mg [Prunus persica]
          Length = 290

 Score =  344 bits (883), Expect = 1e-91
 Identities = 175/293 (59%), Positives = 222/293 (75%), Gaps = 2/293 (0%)
 Frame = +2

Query: 1226 ITSKPLIPLGFPFRFRAKTINNXXXXXXXXXXXXXXXXXXXLDDS-TETKQQQLNLSVLR 1402
            ++   LI L  P +FRA+                       LD+S + + + QLNLSVLR
Sbjct: 8    LSPNSLIQLKIPPKFRARNCRTNFSAVSAR-----------LDNSKSSSAEPQLNLSVLR 56

Query: 1403 FTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQLISEVLGLSLAAFSIAL 1582
            FTLGI GLDESYLPRWIGY FGSLL+LNHF GS SP+ TTPAQL +E LGLSLAAFSIAL
Sbjct: 57   FTLGIPGLDESYLPRWIGYGFGSLLILNHFAGSISPASTTPAQLRTEALGLSLAAFSIAL 116

Query: 1583 PYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWGTYVLLKNTNTISVLMAV 1762
            PY+G+FLKGATP+DQ ++P G  QIFV+S+++S+TQKEDLAW TY+LL+NTNTI+V++++
Sbjct: 117  PYLGRFLKGATPMDQTSIPRGCEQIFVISQNVSNTQKEDLAWATYILLRNTNTIAVIISI 176

Query: 1763 KDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYFPR-SDTGAWEMLPEGTR 1939
            ++ LCVRGYW++P D SK  +L  FE+Q++ IGLSD+K+TLY  +  D+G WEMLP+GTR
Sbjct: 177  RNELCVRGYWNIPDDVSKTNVLAWFEKQIESIGLSDVKETLYLSQIEDSGLWEMLPQGTR 236

Query: 1940 SLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDRAWIGAIANKFRG 2098
            SLLVQP+   L  S NE  K   GF++LASSM YAYSD+D+AWIGAIANKF+G
Sbjct: 237  SLLVQPIVQVLPSSDNEIQKS-EGFVMLASSMRYAYSDKDKAWIGAIANKFKG 288


>ref|XP_002274647.1| PREDICTED: uncharacterized protein LOC100252183 [Vitis vinifera]
          Length = 311

 Score =  339 bits (870), Expect = 3e-90
 Identities = 168/248 (67%), Positives = 204/248 (82%), Gaps = 1/248 (0%)
 Frame = +2

Query: 1358 STETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQLI 1537
            S   +QQQLNLSVLRFTLGI G DESYLPRWIGY FGS +LLNHF+GS   +  T AQL 
Sbjct: 66   SASNQQQQLNLSVLRFTLGIPGFDESYLPRWIGYGFGSFILLNHFVGSDL-NTITAAQLR 124

Query: 1538 SEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWGTY 1717
            +E LGL LAAFS+ LPY+GKFLKGA PVDQ TLPEG  QIFVM++++SD  KEDLAW TY
Sbjct: 125  TEALGLCLAAFSVVLPYLGKFLKGAAPVDQTTLPEGIEQIFVMTQNISDILKEDLAWATY 184

Query: 1718 VLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYFPR 1897
            +LL+NTNTI+VL++++ ALCVRGYW+ P D SKAR+LD  E+++++IGLSDLKDTLYFP+
Sbjct: 185  ILLRNTNTIAVLISIRGALCVRGYWNTPDDVSKARVLDWVEKEIEKIGLSDLKDTLYFPQ 244

Query: 1898 S-DTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDRAWIG 2074
            S D+G WEMLP+GT SLLVQPV   +     +  +KI GF+LLASSM+YAY+D+DRAWIG
Sbjct: 245  SADSGLWEMLPKGTCSLLVQPV-SQIPSQGTDEMEKIDGFVLLASSMNYAYTDKDRAWIG 303

Query: 2075 AIANKFRG 2098
            A+ANKFRG
Sbjct: 304  AVANKFRG 311


>emb|CBI31835.3| unnamed protein product [Vitis vinifera]
          Length = 313

 Score =  337 bits (864), Expect = 2e-89
 Identities = 167/247 (67%), Positives = 203/247 (82%), Gaps = 1/247 (0%)
 Frame = +2

Query: 1358 STETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQLI 1537
            S   +QQQLNLSVLRFTLGI G DESYLPRWIGY FGS +LLNHF+GS   +  T AQL 
Sbjct: 66   SASNQQQQLNLSVLRFTLGIPGFDESYLPRWIGYGFGSFILLNHFVGSDL-NTITAAQLR 124

Query: 1538 SEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWGTY 1717
            +E LGL LAAFS+ LPY+GKFLKGA PVDQ TLPEG  QIFVM++++SD  KEDLAW TY
Sbjct: 125  TEALGLCLAAFSVVLPYLGKFLKGAAPVDQTTLPEGIEQIFVMTQNISDILKEDLAWATY 184

Query: 1718 VLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYFPR 1897
            +LL+NTNTI+VL++++ ALCVRGYW+ P D SKAR+LD  E+++++IGLSDLKDTLYFP+
Sbjct: 185  ILLRNTNTIAVLISIRGALCVRGYWNTPDDVSKARVLDWVEKEIEKIGLSDLKDTLYFPQ 244

Query: 1898 S-DTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDRAWIG 2074
            S D+G WEMLP+GT SLLVQPV   +     +  +KI GF+LLASSM+YAY+D+DRAWIG
Sbjct: 245  SADSGLWEMLPKGTCSLLVQPV-SQIPSQGTDEMEKIDGFVLLASSMNYAYTDKDRAWIG 303

Query: 2075 AIANKFR 2095
            A+ANKFR
Sbjct: 304  AVANKFR 310


>ref|XP_004140790.1| PREDICTED: uncharacterized protein LOC101219803 [Cucumis sativus]
            gi|449530412|ref|XP_004172189.1| PREDICTED:
            uncharacterized LOC101219803 [Cucumis sativus]
          Length = 288

 Score =  334 bits (856), Expect = 1e-88
 Identities = 169/249 (67%), Positives = 201/249 (80%), Gaps = 1/249 (0%)
 Frame = +2

Query: 1349 LDDSTETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPA 1528
            LDDS  +  QQLNLSVLRFTLGI GLDESYLPRWIGY FGSLLLLNHF+GS S + TTPA
Sbjct: 35   LDDSKNSANQQLNLSVLRFTLGIPGLDESYLPRWIGYGFGSLLLLNHFVGSNSAALTTPA 94

Query: 1529 QLISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAW 1708
            QL +E LG+SLAAFSIALPY+GKFLKGA P  +  LPEG  QIF++S+ LSD  KED+AW
Sbjct: 95   QLRTEALGISLAAFSIALPYLGKFLKGALPSGEAILPEGTEQIFLLSQILSDNLKEDIAW 154

Query: 1709 GTYVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLY 1888
             TY+LL+NTN+ISVL+  + ALCVRGYW+ P D S A +L  FE Q+Q IGLS LKD +Y
Sbjct: 155  ATYILLRNTNSISVLIQTQGALCVRGYWNSPNDISSADLLAWFEEQLQSIGLSALKDAVY 214

Query: 1889 FPR-SDTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDRA 2065
            FP+ S++G W+MLP+GTRS+LVQPV   L  S NE  + +GGFILLASS+SYA+SD+DRA
Sbjct: 215  FPQISESGLWQMLPKGTRSVLVQPVVQNLKQSGNEV-QNMGGFILLASSLSYAFSDKDRA 273

Query: 2066 WIGAIANKF 2092
            WI A+ANKF
Sbjct: 274  WIRAVANKF 282


>ref|XP_004306725.1| PREDICTED: uncharacterized protein LOC101305879 [Fragaria vesca
            subsp. vesca]
          Length = 283

 Score =  332 bits (851), Expect = 5e-88
 Identities = 169/252 (67%), Positives = 208/252 (82%), Gaps = 2/252 (0%)
 Frame = +2

Query: 1349 LDDS-TETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTP 1525
            LD+S + +   QLNLSVLRFTLGI GLDESYLPRWIGY FGSLL+LNHF GS SP   T 
Sbjct: 35   LDNSKSNSTDPQLNLSVLRFTLGIPGLDESYLPRWIGYGFGSLLVLNHFAGSVSP---TL 91

Query: 1526 AQLISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLA 1705
             QL +E LG+SLAAFSIALPY+GKFLKGATP+DQ ++P+G  Q+F++SE  S+T+KEDLA
Sbjct: 92   PQLRTEALGVSLAAFSIALPYLGKFLKGATPMDQTSIPDGCEQMFLISETTSNTRKEDLA 151

Query: 1706 WGTYVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTL 1885
            W TY+LL+NTNTISV+++V++ LCVRGYWS+P+D SK  +LD FE+Q++ IGLS+LK+TL
Sbjct: 152  WATYILLRNTNTISVIISVQEELCVRGYWSIPEDVSKPNVLDWFEKQIKSIGLSNLKETL 211

Query: 1886 YFPR-SDTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDR 2062
            Y P+  D G WEMLP+GTRSLLVQPV   L  S +E ++KI GF+LLASSM YAYSD+DR
Sbjct: 212  YLPQIEDFGLWEMLPKGTRSLLVQPVMDVLHPSDSE-NEKIQGFVLLASSMRYAYSDKDR 270

Query: 2063 AWIGAIANKFRG 2098
            AW GA+A KFRG
Sbjct: 271  AWAGALAKKFRG 282


>gb|EXC15938.1| hypothetical protein L484_015739 [Morus notabilis]
          Length = 320

 Score =  329 bits (843), Expect = 4e-87
 Identities = 166/264 (62%), Positives = 212/264 (80%), Gaps = 12/264 (4%)
 Frame = +2

Query: 1349 LDDSTETKQQ--QLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTT 1522
            LDD++ + Q   QLNLSVLRFTLGI GLDESYLPRWIGY FGSLL+LNHF+GS S +  T
Sbjct: 36   LDDNSRSGQPNPQLNLSVLRFTLGIPGLDESYLPRWIGYGFGSLLVLNHFVGSNSVTDIT 95

Query: 1523 PAQLISEVLGLSLAAFSIALPYMGKFLK---------GATPVDQGTLPEGNRQIFVMSEH 1675
             AQL +E LGLSLAAFSI LPY+GKFLK         GATP+DQ T+PEG+ QIF++SE+
Sbjct: 96   SAQLRTEALGLSLAAFSIVLPYLGKFLKLYEDEKYLQGATPMDQTTIPEGSEQIFMLSEN 155

Query: 1676 LSDTQKEDLAWGTYVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQ 1855
            +S+T+KEDLAW TY+LL+NTNT++VL++++  LCVRGYW+ P D SK  +LD F RQ++Q
Sbjct: 156  VSNTEKEDLAWATYILLRNTNTMAVLISIQGELCVRGYWNTPTDVSKTDLLDWFGRQIEQ 215

Query: 1856 IGLSDLKDTLYFPR-SDTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASS 2032
             G+SD+KDTLYFP+ SD+G W++LP+GTRS+LVQPVP   D S ++T +   GFIL+AS+
Sbjct: 216  FGISDVKDTLYFPQISDSGLWDILPKGTRSVLVQPVPQVPD-SSDKTMETNQGFILVAST 274

Query: 2033 MSYAYSDRDRAWIGAIANKFRGLH 2104
            +SYAY+ +DRAWIGA+A KF  LH
Sbjct: 275  ISYAYNVKDRAWIGALAKKFADLH 298


>ref|XP_002322447.2| hypothetical protein POPTR_0015s13640g [Populus trichocarpa]
            gi|550322657|gb|EEF06574.2| hypothetical protein
            POPTR_0015s13640g [Populus trichocarpa]
          Length = 288

 Score =  327 bits (838), Expect = 2e-86
 Identities = 175/293 (59%), Positives = 210/293 (71%), Gaps = 1/293 (0%)
 Frame = +2

Query: 1223 SITSKPLIPLGFPFRFRAKTINNXXXXXXXXXXXXXXXXXXXLDDSTETKQQQLNLSVLR 1402
            S++  PLI L    +FRAK                        +  ++ +QQQLNLSVLR
Sbjct: 3    SLSIHPLIQLKTHHQFRAKKTRKSIAIHASSD-----------NPQSQRQQQQLNLSVLR 51

Query: 1403 FTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQLISEVLGLSLAAFSIAL 1582
            FT GI GLDESYLPRWIGY FGSLL+LNHF+GS     TT AQL +EVLGLSLAAFS AL
Sbjct: 52   FTFGIPGLDESYLPRWIGYGFGSLLILNHFLGSNPD--TTQAQLRTEVLGLSLAAFSAAL 109

Query: 1583 PYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWGTYVLLKNTNTISVLMAV 1762
            PY G+FLKGATPVDQGTLP+   QIF MS+++SD QKEDLAW TY+LL+NTNTI+VL+++
Sbjct: 110  PYFGRFLKGATPVDQGTLPQDAEQIFAMSQNISDAQKEDLAWATYILLRNTNTIAVLISI 169

Query: 1763 KDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYFPR-SDTGAWEMLPEGTR 1939
            +  LCVRGYW      SK  +LD F+ Q++ IGLSD+KDTLYFP+ +++  WEMLPEGTR
Sbjct: 170  QGELCVRGYWKTSDKMSKDEVLDWFKEQIENIGLSDVKDTLYFPQTTESEIWEMLPEGTR 229

Query: 1940 SLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDRAWIGAIANKFRG 2098
            SLLV+PV  A   S N+T     GFILLASS+ YAYSD+DRAWI A  NKFRG
Sbjct: 230  SLLVEPVLQATVQSGNKTENN-EGFILLASSIGYAYSDKDRAWIRATGNKFRG 281


>ref|XP_006490330.1| PREDICTED: uncharacterized protein LOC102620511 [Citrus sinensis]
          Length = 280

 Score =  321 bits (823), Expect = 9e-85
 Identities = 164/250 (65%), Positives = 201/250 (80%), Gaps = 1/250 (0%)
 Frame = +2

Query: 1355 DSTETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQL 1534
            D+++ ++QQLNLSVLRFT GI G DESYLPRWIGY FGSL++LNHF  S S    T AQL
Sbjct: 34   DNSQQEEQQLNLSVLRFTFGIPGFDESYLPRWIGYGFGSLIVLNHFAFSNS---VTSAQL 90

Query: 1535 ISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWGT 1714
             SEVLGLSLAAFS+ LPY+GKFLKGA+PV Q +LPE   QIFVMS+++SD  KE+LAW T
Sbjct: 91   RSEVLGLSLAAFSVTLPYLGKFLKGASPVSQKSLPESGEQIFVMSQNISDALKENLAWAT 150

Query: 1715 YVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYFP 1894
            YVLL+NTN+ISVL++++  LCVRGYW  P  ASK ++L+ FERQ++ IGLSDLKD+LYFP
Sbjct: 151  YVLLRNTNSISVLISIRGELCVRGYWQTPDGASKTQLLEWFERQIENIGLSDLKDSLYFP 210

Query: 1895 RS-DTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDRAWI 2071
            +S D G WEMLP+GT S+ VQPV  A + S  E  +KI GF+LLASSM+YAYS +DRAWI
Sbjct: 211  QSADAGQWEMLPKGTCSVFVQPVIQAPNPSAVEV-EKIEGFVLLASSMTYAYSHKDRAWI 269

Query: 2072 GAIANKFRGL 2101
             A++NKFR L
Sbjct: 270  KAVSNKFRDL 279


>ref|XP_002510838.1| conserved hypothetical protein [Ricinus communis]
            gi|223549953|gb|EEF51440.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 308

 Score =  311 bits (796), Expect = 1e-81
 Identities = 160/251 (63%), Positives = 199/251 (79%), Gaps = 3/251 (1%)
 Frame = +2

Query: 1349 LDDSTETKQQQ--LNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTT 1522
            +D+S   K QQ  LNLSVLRFT GI GLDESYLPRWIGY FGSLL+LNHF+GS S   T+
Sbjct: 32   IDNSQRKKDQQQDLNLSVLRFTFGIPGLDESYLPRWIGYGFGSLLVLNHFLGSNSV--TS 89

Query: 1523 PAQLISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDL 1702
              Q+ +E LGLSLAAFSIALPY G+FLKGATPVDQ  LP+G+ QIFVMSE++SDT KEDL
Sbjct: 90   LPQMRTEALGLSLAAFSIALPYFGRFLKGATPVDQTALPQGSEQIFVMSENVSDTLKEDL 149

Query: 1703 AWGTYVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDT 1882
            AW TYVLL+NTN+I+VL+ ++  LCVRGYW+ P + SKA+++D F+ +++ IGL DLKDT
Sbjct: 150  AWATYVLLRNTNSIAVLIYIQGELCVRGYWNTPDNISKAQVIDWFKGRIEDIGLFDLKDT 209

Query: 1883 LYFPR-SDTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRD 2059
            LYFP+ +++  WEMLP GTRSLLV+PV     L   +   +  GF+LLASS+  AY+D+D
Sbjct: 210  LYFPQTAESRLWEMLPRGTRSLLVEPVL----LQNAKEMDRAEGFVLLASSIDTAYTDKD 265

Query: 2060 RAWIGAIANKF 2092
            RAWIGA+ANKF
Sbjct: 266  RAWIGAVANKF 276


>ref|XP_007038593.1| Uncharacterized protein isoform 1 [Theobroma cacao]
            gi|508775838|gb|EOY23094.1| Uncharacterized protein
            isoform 1 [Theobroma cacao]
          Length = 285

 Score =  310 bits (795), Expect = 2e-81
 Identities = 161/245 (65%), Positives = 189/245 (77%), Gaps = 1/245 (0%)
 Frame = +2

Query: 1364 ETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQLISE 1543
            +  QQ+LNLSVLRFTLGI GLDESYLPRWIGY FGSLL+LNH  GS S    T AQL SE
Sbjct: 37   DNPQQRLNLSVLRFTLGIPGLDESYLPRWIGYGFGSLLILNHLFGSDS---VTAAQLRSE 93

Query: 1544 VLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWGTYVL 1723
             LG+SLAAFS+ LPY+GKFLKGATP+DQ TLPEG  QIFVMS+++S  QKEDLAW TYVL
Sbjct: 94   ALGISLAAFSVTLPYLGKFLKGATPIDQTTLPEGAEQIFVMSQNVSVAQKEDLAWATYVL 153

Query: 1724 LKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYFPRS- 1900
            L+NTNT SVL+     LCVRGYW++P    K  +LD F+  +++ GLSDL DTLYFP++ 
Sbjct: 154  LRNTNTTSVLILAGGELCVRGYWNVPDVVPKDNVLDWFKSNIEETGLSDLTDTLYFPQTG 213

Query: 1901 DTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDRAWIGAI 2080
            D   W+MLP+GTRS LVQPV    +LS NE    I GF+LLASSM YAYSD+DRAWI A+
Sbjct: 214  DAEFWKMLPQGTRSALVQPVLLDPNLSNNEMG-SIEGFVLLASSMRYAYSDKDRAWIRAV 272

Query: 2081 ANKFR 2095
            +NK R
Sbjct: 273  SNKLR 277


>ref|XP_004234358.1| PREDICTED: uncharacterized protein LOC101265513 [Solanum
            lycopersicum]
          Length = 300

 Score =  307 bits (786), Expect = 2e-80
 Identities = 154/249 (61%), Positives = 196/249 (78%), Gaps = 1/249 (0%)
 Frame = +2

Query: 1355 DSTETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQL 1534
            +ST  +QQQLNLSVLRFTLGI GLDESYLPR+IGY FG LL+LNHF+GS  PS  T AQL
Sbjct: 52   ESTNAQQQQLNLSVLRFTLGIPGLDESYLPRYIGYAFGFLLVLNHFLGS-DPSAITAAQL 110

Query: 1535 ISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWGT 1714
             +EVLGL LA FS+ +PY+GKFLKG+  V++  LPE   Q FV+SE++SD  KEDLAWGT
Sbjct: 111  RTEVLGLLLALFSVIVPYLGKFLKGSVLVEERNLPEDAEQAFVISENISDILKEDLAWGT 170

Query: 1715 YVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYFP 1894
            YVLL+NT++ISVL +++D +C RGYW  PKD  KA + D FE+Q+QQ GL DLK+TLYFP
Sbjct: 171  YVLLRNTSSISVLFSLQDTICARGYWRTPKDVLKAHLCDWFEKQIQQSGLHDLKETLYFP 230

Query: 1895 R-SDTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDRAWI 2071
            + SD+  W+MLP+GTRSLLVQP+  + + S +    K  GF+L+ASS SYAY+++DR+W+
Sbjct: 231  QVSDSEVWQMLPKGTRSLLVQPLIQS-ETSASSQEWKNNGFVLVASSNSYAYNNKDRSWV 289

Query: 2072 GAIANKFRG 2098
            GA+A KF G
Sbjct: 290  GAVAKKFGG 298


>ref|XP_006353363.1| PREDICTED: uncharacterized protein LOC102585395 isoform X4 [Solanum
            tuberosum]
          Length = 299

 Score =  305 bits (782), Expect = 5e-80
 Identities = 156/253 (61%), Positives = 201/253 (79%), Gaps = 3/253 (1%)
 Frame = +2

Query: 1349 LDDSTET--KQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTT 1522
            L++STE+  +QQQLNLSVLRFTLGI GLDESYLPR+IGY FG LL+LNHF+GS S + TT
Sbjct: 47   LENSTESNAQQQQLNLSVLRFTLGIPGLDESYLPRYIGYAFGFLLVLNHFLGSDSSAITT 106

Query: 1523 PAQLISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDL 1702
             AQL +EVLGL LA FS+ +PY+GKFLKG+  V++  LPE   Q FV++E++SD  KEDL
Sbjct: 107  -AQLRTEVLGLLLAVFSVIVPYLGKFLKGSVLVEERNLPEDAEQAFVITENISDILKEDL 165

Query: 1703 AWGTYVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDT 1882
            AWGTYVLL+NT++ISVL +++D +C RGYW  PKD  KA + D FE+Q+QQ GL DLK+T
Sbjct: 166  AWGTYVLLRNTSSISVLFSLQDTICARGYWRTPKDVLKAHLCDWFEKQIQQSGLHDLKET 225

Query: 1883 LYFPR-SDTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRD 2059
            LYFP+ SD+  WEMLP+GTRSLLVQP+  + + S +    K  GF+L+ASS SYAY+++D
Sbjct: 226  LYFPQVSDSEVWEMLPKGTRSLLVQPLIQS-ETSDSSQKWKNNGFVLVASSNSYAYNNKD 284

Query: 2060 RAWIGAIANKFRG 2098
            R+W+GA+A KF G
Sbjct: 285  RSWVGAVAKKFGG 297


>ref|XP_006353362.1| PREDICTED: uncharacterized protein LOC102585395 isoform X3 [Solanum
            tuberosum]
          Length = 304

 Score =  305 bits (782), Expect = 5e-80
 Identities = 156/253 (61%), Positives = 201/253 (79%), Gaps = 3/253 (1%)
 Frame = +2

Query: 1349 LDDSTET--KQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTT 1522
            L++STE+  +QQQLNLSVLRFTLGI GLDESYLPR+IGY FG LL+LNHF+GS S + TT
Sbjct: 47   LENSTESNAQQQQLNLSVLRFTLGIPGLDESYLPRYIGYAFGFLLVLNHFLGSDSSAITT 106

Query: 1523 PAQLISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDL 1702
             AQL +EVLGL LA FS+ +PY+GKFLKG+  V++  LPE   Q FV++E++SD  KEDL
Sbjct: 107  -AQLRTEVLGLLLAVFSVIVPYLGKFLKGSVLVEERNLPEDAEQAFVITENISDILKEDL 165

Query: 1703 AWGTYVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDT 1882
            AWGTYVLL+NT++ISVL +++D +C RGYW  PKD  KA + D FE+Q+QQ GL DLK+T
Sbjct: 166  AWGTYVLLRNTSSISVLFSLQDTICARGYWRTPKDVLKAHLCDWFEKQIQQSGLHDLKET 225

Query: 1883 LYFPR-SDTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRD 2059
            LYFP+ SD+  WEMLP+GTRSLLVQP+  + + S +    K  GF+L+ASS SYAY+++D
Sbjct: 226  LYFPQVSDSEVWEMLPKGTRSLLVQPLIQS-ETSDSSQKWKNNGFVLVASSNSYAYNNKD 284

Query: 2060 RAWIGAIANKFRG 2098
            R+W+GA+A KF G
Sbjct: 285  RSWVGAVAKKFGG 297


>ref|XP_006353360.1| PREDICTED: uncharacterized protein LOC102585395 isoform X1 [Solanum
            tuberosum]
          Length = 316

 Score =  305 bits (782), Expect = 5e-80
 Identities = 156/253 (61%), Positives = 201/253 (79%), Gaps = 3/253 (1%)
 Frame = +2

Query: 1349 LDDSTET--KQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTT 1522
            L++STE+  +QQQLNLSVLRFTLGI GLDESYLPR+IGY FG LL+LNHF+GS S + TT
Sbjct: 47   LENSTESNAQQQQLNLSVLRFTLGIPGLDESYLPRYIGYAFGFLLVLNHFLGSDSSAITT 106

Query: 1523 PAQLISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDL 1702
             AQL +EVLGL LA FS+ +PY+GKFLKG+  V++  LPE   Q FV++E++SD  KEDL
Sbjct: 107  -AQLRTEVLGLLLAVFSVIVPYLGKFLKGSVLVEERNLPEDAEQAFVITENISDILKEDL 165

Query: 1703 AWGTYVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDT 1882
            AWGTYVLL+NT++ISVL +++D +C RGYW  PKD  KA + D FE+Q+QQ GL DLK+T
Sbjct: 166  AWGTYVLLRNTSSISVLFSLQDTICARGYWRTPKDVLKAHLCDWFEKQIQQSGLHDLKET 225

Query: 1883 LYFPR-SDTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRD 2059
            LYFP+ SD+  WEMLP+GTRSLLVQP+  + + S +    K  GF+L+ASS SYAY+++D
Sbjct: 226  LYFPQVSDSEVWEMLPKGTRSLLVQPLIQS-ETSDSSQKWKNNGFVLVASSNSYAYNNKD 284

Query: 2060 RAWIGAIANKFRG 2098
            R+W+GA+A KF G
Sbjct: 285  RSWVGAVAKKFGG 297


>ref|XP_006353361.1| PREDICTED: uncharacterized protein LOC102585395 isoform X2 [Solanum
            tuberosum]
          Length = 310

 Score =  305 bits (781), Expect = 6e-80
 Identities = 157/255 (61%), Positives = 203/255 (79%), Gaps = 3/255 (1%)
 Frame = +2

Query: 1349 LDDSTET--KQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTT 1522
            L++STE+  +QQQLNLSVLRFTLGI GLDESYLPR+IGY FG LL+LNHF+GS S + TT
Sbjct: 47   LENSTESNAQQQQLNLSVLRFTLGIPGLDESYLPRYIGYAFGFLLVLNHFLGSDSSAITT 106

Query: 1523 PAQLISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDL 1702
             AQL +EVLGL LA FS+ +PY+GKFLKG+  V++  LPE   Q FV++E++SD  KEDL
Sbjct: 107  -AQLRTEVLGLLLAVFSVIVPYLGKFLKGSVLVEERNLPEDAEQAFVITENISDILKEDL 165

Query: 1703 AWGTYVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDT 1882
            AWGTYVLL+NT++ISVL +++D +C RGYW  PKD  KA + D FE+Q+QQ GL DLK+T
Sbjct: 166  AWGTYVLLRNTSSISVLFSLQDTICARGYWRTPKDVLKAHLCDWFEKQIQQSGLHDLKET 225

Query: 1883 LYFPR-SDTGAWEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRD 2059
            LYFP+ SD+  WEMLP+GTRSLLVQP+  + + S +    K  GF+L+ASS SYAY+++D
Sbjct: 226  LYFPQVSDSEVWEMLPKGTRSLLVQPLIQS-ETSDSSQKWKNNGFVLVASSNSYAYNNKD 284

Query: 2060 RAWIGAIANKFRGLH 2104
            R+W+GA+A KF G+H
Sbjct: 285  RSWVGAVAKKF-GVH 298


>ref|XP_006281716.1| hypothetical protein CARUB_v10027872mg [Capsella rubella]
            gi|482550420|gb|EOA14614.1| hypothetical protein
            CARUB_v10027872mg [Capsella rubella]
          Length = 279

 Score =  295 bits (756), Expect = 5e-77
 Identities = 152/254 (59%), Positives = 197/254 (77%), Gaps = 5/254 (1%)
 Frame = +2

Query: 1352 DDSTETK---QQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTT 1522
            +DS ++K   QQQLNLSVLRFT GI GLDESYLPRW+GY FGSLLLLNHF  S    P +
Sbjct: 35   NDSLQSKASDQQQLNLSVLRFTFGIPGLDESYLPRWLGYGFGSLLLLNHFSAS---GPVS 91

Query: 1523 PAQLISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDL 1702
              QL SE LG+SLAAFS+ALPY+GKFLKG+  V+Q TLPE   Q+FV+S ++ D+ KEDL
Sbjct: 92   EPQLRSEALGISLAAFSVALPYIGKFLKGSV-VEQRTLPEEGEQVFVISSNIGDSLKEDL 150

Query: 1703 AWGTYVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDT 1882
            AW TYVLL+NT+TI+VL++V+  LCVRGYW+ P   SKA++ D F+++V +IGL+D+K+T
Sbjct: 151  AWATYVLLRNTSTIAVLISVQGELCVRGYWNCPDQMSKAQLHDWFKKKVDEIGLADVKET 210

Query: 1883 LYFPRSDTGA--WEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDR 2056
            LYFP+ +  A  W++LP+GTRSL VQP+     +      KK+ GF+L+AS+  YAYSD+
Sbjct: 211  LYFPQYEGSAASWDILPDGTRSLFVQPL-----VQDINEPKKMDGFLLVASTAGYAYSDK 265

Query: 2057 DRAWIGAIANKFRG 2098
            DRAWIGA+A+KFRG
Sbjct: 266  DRAWIGAMADKFRG 279


>ref|XP_006599632.1| PREDICTED: uncharacterized protein LOC100817953 isoform X2 [Glycine
            max]
          Length = 289

 Score =  293 bits (750), Expect = 3e-76
 Identities = 153/249 (61%), Positives = 194/249 (77%), Gaps = 4/249 (1%)
 Frame = +2

Query: 1364 ETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQLISE 1543
            + +QQQLNLSVLRFTLGI GLDESYLPRWIGY FGSLLLLNHF+GS S +  TPAQL +E
Sbjct: 42   QQQQQQLNLSVLRFTLGIPGLDESYLPRWIGYGFGSLLLLNHFLGSDSAT-VTPAQLSTE 100

Query: 1544 VLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWGTYVL 1723
            VLGLSLA+FSI LPY+GKFLKGA PVD+ T+P+G  QIFVMS    D  KEDLAW +YVL
Sbjct: 101  VLGLSLASFSIVLPYLGKFLKGAQPVDEKTIPDGTEQIFVMSTDRVDGLKEDLAWASYVL 160

Query: 1724 LKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYFPR-S 1900
            L NTN I++L+ ++  +C RGYW++P D SK  +   F+++++  GL DLKDTLYFP+ +
Sbjct: 161  LCNTNAIAMLIFIQGEICARGYWNIPDDTSKEILPGWFKKKIENAGLYDLKDTLYFPQDA 220

Query: 1901 DTGAWEMLPEGTRSLLVQPVPGALDLSPNETS---KKIGGFILLASSMSYAYSDRDRAWI 2071
            D+   +++P GTR LL+QPV   L +S NE+    +K GGFILLAS+  YA+S++D+AWI
Sbjct: 221  DSEFQDLVPIGTRCLLIQPV---LQVS-NESDTGLQKPGGFILLASTTRYAFSNKDKAWI 276

Query: 2072 GAIANKFRG 2098
             A+ANKFRG
Sbjct: 277  AAVANKFRG 285


>ref|XP_003548220.1| PREDICTED: uncharacterized protein LOC100817953 isoform X1 [Glycine
            max]
          Length = 287

 Score =  293 bits (750), Expect = 3e-76
 Identities = 153/249 (61%), Positives = 194/249 (77%), Gaps = 4/249 (1%)
 Frame = +2

Query: 1364 ETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQLISE 1543
            + +QQQLNLSVLRFTLGI GLDESYLPRWIGY FGSLLLLNHF+GS S +  TPAQL +E
Sbjct: 42   QQQQQQLNLSVLRFTLGIPGLDESYLPRWIGYGFGSLLLLNHFLGSDSAT-VTPAQLSTE 100

Query: 1544 VLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWGTYVL 1723
            VLGLSLA+FSI LPY+GKFLKGA PVD+ T+P+G  QIFVMS    D  KEDLAW +YVL
Sbjct: 101  VLGLSLASFSIVLPYLGKFLKGAQPVDEKTIPDGTEQIFVMSTDRVDGLKEDLAWASYVL 160

Query: 1724 LKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYFPR-S 1900
            L NTN I++L+ ++  +C RGYW++P D SK  +   F+++++  GL DLKDTLYFP+ +
Sbjct: 161  LCNTNAIAMLIFIQGEICARGYWNIPDDTSKEILPGWFKKKIENAGLYDLKDTLYFPQDA 220

Query: 1901 DTGAWEMLPEGTRSLLVQPVPGALDLSPNETS---KKIGGFILLASSMSYAYSDRDRAWI 2071
            D+   +++P GTR LL+QPV   L +S NE+    +K GGFILLAS+  YA+S++D+AWI
Sbjct: 221  DSEFQDLVPIGTRCLLIQPV---LQVS-NESDTGLQKPGGFILLASTTRYAFSNKDKAWI 276

Query: 2072 GAIANKFRG 2098
             A+ANKFRG
Sbjct: 277  AAVANKFRG 285


>ref|XP_002865893.1| hypothetical protein ARALYDRAFT_495283 [Arabidopsis lyrata subsp.
            lyrata] gi|297311728|gb|EFH42152.1| hypothetical protein
            ARALYDRAFT_495283 [Arabidopsis lyrata subsp. lyrata]
          Length = 272

 Score =  293 bits (750), Expect = 3e-76
 Identities = 150/251 (59%), Positives = 194/251 (77%), Gaps = 2/251 (0%)
 Frame = +2

Query: 1352 DDSTETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQ 1531
            ++ +  +QQQLNLSVLRFT GI GLDESYLPRWIGY FGSLLLLNHF  S   +P + +Q
Sbjct: 31   ENDSPQQQQQLNLSVLRFTFGIPGLDESYLPRWIGYGFGSLLLLNHFSAS---APISESQ 87

Query: 1532 LISEVLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWG 1711
            + SE LGLSLA+FSIALPY+GKFLKG+  V+Q TLPE   QIFV+S ++ D+ KEDLAW 
Sbjct: 88   MRSEALGLSLASFSIALPYIGKFLKGSV-VEQRTLPEEGEQIFVISSNIGDSLKEDLAWA 146

Query: 1712 TYVLLKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYF 1891
            TYVLL+NT+TI+VL+ V+  LCVRGYW+ P   SKA++ D F+++V +IGL+D+KDTLYF
Sbjct: 147  TYVLLRNTSTIAVLILVQGELCVRGYWNCPDQMSKAQLHDWFKKKVDEIGLADVKDTLYF 206

Query: 1892 PRSDTGA--WEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDRA 2065
            P+    A  W++LP+GTRSL +QP+     +      +K+ GF+L+AS+  YAYSD+DRA
Sbjct: 207  PQYAGSALSWDILPDGTRSLFMQPL-----VQNISEPQKVNGFLLVASTAGYAYSDKDRA 261

Query: 2066 WIGAIANKFRG 2098
            WIGA+A KFRG
Sbjct: 262  WIGAMAEKFRG 272


>ref|XP_007152339.1| hypothetical protein PHAVU_004G121600g [Phaseolus vulgaris]
            gi|561025648|gb|ESW24333.1| hypothetical protein
            PHAVU_004G121600g [Phaseolus vulgaris]
          Length = 295

 Score =  293 bits (749), Expect = 3e-76
 Identities = 148/247 (59%), Positives = 194/247 (78%), Gaps = 2/247 (0%)
 Frame = +2

Query: 1364 ETKQQQLNLSVLRFTLGIRGLDESYLPRWIGYTFGSLLLLNHFIGSTSPSPTTPAQLISE 1543
            + +QQQLNLSVLRFTLGI GLDESYLPRWIGY FGSLLLLNHF+GS + +  TPAQL +E
Sbjct: 49   QQQQQQLNLSVLRFTLGIPGLDESYLPRWIGYGFGSLLLLNHFLGSDAAA-VTPAQLSTE 107

Query: 1544 VLGLSLAAFSIALPYMGKFLKGATPVDQGTLPEGNRQIFVMSEHLSDTQKEDLAWGTYVL 1723
            VLGLSLA+FSI LPY+GKFLKGA PVD+ T+P+G  QIFVMS  + +  KEDLAW +YVL
Sbjct: 108  VLGLSLASFSIVLPYLGKFLKGAQPVDEKTIPDGTEQIFVMSTEIGNGLKEDLAWASYVL 167

Query: 1724 LKNTNTISVLMAVKDALCVRGYWSLPKDASKARILDLFERQVQQIGLSDLKDTLYFPRSD 1903
            L+NTN I++L+ ++  +C RGYW++P D SK  +LD F++++++  L DLKDTLYFP+ D
Sbjct: 168  LRNTNAIAMLILIQGEICARGYWNIPADTSKEILLDWFKKKIEKAALYDLKDTLYFPQ-D 226

Query: 1904 TGA--WEMLPEGTRSLLVQPVPGALDLSPNETSKKIGGFILLASSMSYAYSDRDRAWIGA 2077
             G+   +++P GTR LL+QPV    + S +  S+   GFILLAS+  YA+S++D+AWI A
Sbjct: 227  AGSEFQDLVPIGTRCLLIQPVLHVSNES-DTGSQNPDGFILLASTTRYAFSNKDKAWIAA 285

Query: 2078 IANKFRG 2098
            +ANK+RG
Sbjct: 286  VANKYRG 292

