BLASTX nr result

ID: Akebia27_contig00025587 seq

BLASTX 2.2.25 [Feb-01-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.

Query= Akebia27_contig00025587
         (1296 letters)

Database: ./nr 
           38,876,450 sequences; 13,856,398,315 total letters

Searching..................................................done



                                                                 Score    E
Sequences producing significant alignments:                      (bits) Value

gb|EXB99429.1| hypothetical protein L484_016405 [Morus notabilis]     368   4e-99
ref|XP_007219207.1| hypothetical protein PRUPE_ppa017292mg [Prun...   360   1e-96
ref|XP_002530358.1| conserved hypothetical protein [Ricinus comm...   340   1e-90
ref|XP_006471160.1| PREDICTED: uncharacterized protein LOC102613...   328   2e-87
ref|XP_004145472.1| PREDICTED: uncharacterized protein LOC101205...   325   2e-86
emb|CAN64638.1| hypothetical protein VITISV_033929 [Vitis vinifera]   325   2e-86
ref|XP_006431682.1| hypothetical protein CICLE_v10000213mg [Citr...   324   6e-86
ref|XP_007026747.1| TATA box-binding protein-associated factor R...   319   2e-84
ref|XP_004301624.1| PREDICTED: uncharacterized protein LOC101305...   317   6e-84
ref|XP_006588648.1| PREDICTED: uncharacterized protein LOC100797...   316   1e-83
ref|XP_003600764.1| hypothetical protein MTR_3g069120 [Medicago ...   305   4e-80
ref|XP_002317716.1| hypothetical protein POPTR_0012s03820g [Popu...   303   1e-79
ref|XP_007132389.1| hypothetical protein PHAVU_011G090800g [Phas...   295   2e-77
ref|XP_006844152.1| hypothetical protein AMTR_s00006p00260920 [A...   288   3e-75
gb|EYU36397.1| hypothetical protein MIMGU_mgv1a020247mg [Mimulus...   284   5e-74
ref|XP_004231258.1| PREDICTED: uncharacterized protein LOC101260...   279   2e-72
ref|XP_006841229.1| hypothetical protein AMTR_s00135p00060200 [A...   271   3e-70
ref|XP_004166877.1| PREDICTED: uncharacterized LOC101205354 [Cuc...   258   3e-66
ref|XP_006649708.1| PREDICTED: uncharacterized protein LOC102721...   231   7e-58
gb|EAZ26218.1| hypothetical protein OsJ_10085 [Oryza sativa Japo...   231   7e-58
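
Note: the summary table above can be read programmatically. The following is a minimal sketch, not part of the BLAST output. It assumes each hit line has the layout "identifier, description, bit score, E-value" as seen above; the names HIT_LINE and parse_hit_line are hypothetical helpers introduced here for illustration.

```python
import re

# Assumed layout of one summary line:
#   <identifier> <description (may be truncated with "...")> <bit score> <E-value>
HIT_LINE = re.compile(
    r"^(?P<id>\S+)\s+(?P<desc>.*?)\s+(?P<bits>\d+(?:\.\d+)?)\s+(?P<evalue>\S+)\s*$"
)


def parse_hit_line(line):
    """Return a dict for one hit line, or None if the line is not a hit line."""
    m = HIT_LINE.match(line)
    if m is None:
        return None
    evalue = m.group("evalue")
    # Some legacy reports print very small E-values as "e-100"; make that parseable.
    if evalue.startswith("e"):
        evalue = "1" + evalue
    try:
        evalue = float(evalue)
    except ValueError:
        return None
    return {
        "id": m.group("id"),
        "description": m.group("desc"),
        "bits": float(m.group("bits")),
        "evalue": evalue,
    }


# Example using the first hit line of this report.
example = ("gb|EXB99429.1| hypothetical protein L484_016405 "
           "[Morus notabilis]     368   4e-99")
hit = parse_hit_line(example)
print(hit["id"], hit["bits"], hit["evalue"])
```

Header lines such as "Sequences producing significant alignments:" do not match the pattern and yield None, so the function can be applied to every line of the table.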

>gb|EXB99429.1| hypothetical protein L484_016405 [Morus notabilis]
          Length = 1000

 Score =  368 bits (944), Expect = 4e-99
 Identities = 195/400 (48%), Positives = 262/400 (65%), Gaps = 6/400 (1%)
 Frame = +3

Query: 108  SKDPFILPSVISSITADFDS---QSDETSPIVNNNLQILRCPNND-ILLFFPTGENSDFV 275
            S D   LPS  SSI + F     Q D  S   +N LQ+L CP  D  ++FFPTG+N++ V
Sbjct: 73   SDDSSQLPSTSSSIASVFGPHHYQDDVASAFSHNRLQLLHCPRTDKFIVFFPTGDNANQV 132

Query: 276  GFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSS-TVG 452
            GF+ LS+K S   + VD+    F  D G + +IL+I +  V DSG    ++ GNSS T+G
Sbjct: 133  GFMLLSIKNSCLDVRVDDNGEAFMVDCGSNHQILRISINPVVDSGSALLALGGNSSGTIG 192

Query: 453  FLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSS-VVHTCWNPHVPEESVVLLD 629
            +LLA T+YSV+W+ +E+  LG +L  P L  +GTK F +  +VH CW+PH+ EES++LL+
Sbjct: 193  YLLASTMYSVHWYVIEVKELGLNLH-PSLTCVGTKVFKTCCIVHACWSPHILEESIILLE 251

Query: 630  NGELFLFDLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWLSCEF 809
            +G LFLFDL++C     L    KGT++ VSW+D       ++N G         WLSCEF
Sbjct: 252  SGALFLFDLESCLKTNTLSPHFKGTRLKVSWDD-------SNNSGDLK------WLSCEF 298

Query: 810  SWHPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLDGFYF 989
            SWHPRI+IVA S+AVF++D R +   VS L  IEML +  SV N+RF+A  +AG DGF+F
Sbjct: 299  SWHPRILIVARSDAVFIVDLRLDLCNVSCLMKIEMLHMYASVENERFLALTRAGSDGFHF 358

Query: 990  TVATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESGFAI 1169
             +A+   L+L D+RKPLMPVLQW H L  P YI V+RL++LR    +D+Y+ ASESGF I
Sbjct: 359  ALASDSLLVLCDVRKPLMPVLQWVHRLAKPCYINVYRLADLRSNSSDDKYKKASESGFCI 418

Query: 1170 VLGSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAW 1289
            +LGSFWN EF+LFCYGP    P G++ S+   FCKSFYAW
Sbjct: 419  ILGSFWNSEFNLFCYGPLL-TPSGTIVSEATEFCKSFYAW 457
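
Note: the "Identities / Positives / Gaps" line of each alignment gives counts over the alignment length, followed by a whole-number percentage. The sketch below is not part of the BLAST output; it recomputes the exact percentages from the counts. The names FRACTION and parse_fractions are hypothetical. In this report the printed values are consistent with truncation rather than rounding (for example, 195/400 is 48.75% and is printed as 48%).

```python
import re

# Matches e.g. "Identities = 195/400 (48%)" anywhere in the statistics line.
FRACTION = re.compile(r"(Identities|Positives|Gaps) = (\d+)/(\d+) \((\d+)%\)")


def parse_fractions(line):
    """Return counts, alignment length, printed and exact percentage per field."""
    stats = {}
    for name, num, den, pct in FRACTION.findall(line):
        num, den = int(num), int(den)
        stats[name] = {
            "count": num,
            "length": den,
            "printed_pct": int(pct),
            "exact_pct": 100.0 * num / den,
        }
    return stats


# Example using the statistics line of the first alignment in this report.
line = " Identities = 195/400 (48%), Positives = 262/400 (65%), Gaps = 6/400 (1%)"
for name, s in parse_fractions(line).items():
    print(name, s["count"], s["length"], s["printed_pct"], s["exact_pct"])
```

Comparing int(exact_pct) with printed_pct is a quick consistency check when post-processing many alignments.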


>ref|XP_007219207.1| hypothetical protein PRUPE_ppa017292mg [Prunus persica]
            gi|462415669|gb|EMJ20406.1| hypothetical protein
            PRUPE_ppa017292mg [Prunus persica]
          Length = 925

 Score =  360 bits (923), Expect = 1e-96
 Identities = 204/420 (48%), Positives = 268/420 (63%), Gaps = 14/420 (3%)
 Frame = +3

Query: 78   HKALRNF--TRPSKDPFILPSVISSITA---DFDSQSDETSPIVNNNLQILRCPN-NDIL 239
            H +L  F  T PS D   LPS + S+ +       +SD +S ++ N L+ L+CP  N ++
Sbjct: 67   HLSLPRFLLTSPS-DSAPLPSSVPSVASFLGPHHPKSDVSSSLLYNRLEFLQCPQINTVV 125

Query: 240  LFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGF 419
            +FFPTGENSD VGF++L LKGS   + VD    VF +      RI +I V  +     GF
Sbjct: 126  VFFPTGENSDQVGFLQLVLKGSTFDVKVDENGGVFASRRWFSYRISRISVNPIP----GF 181

Query: 420  SSMSGNSS--TVGFLLACTLYSVNWFRVEISNLGSDLEKPI-LVNLGTKKFNSS-VVHTC 587
            SS+ GN S  T+G+LLA T+YSV+WF V++ + G + +  + LV+LG+K F +  VVH C
Sbjct: 182  SSLRGNGSCVTIGYLLASTMYSVHWFIVKVGDFGPNSDSRVSLVHLGSKIFKTCCVVHAC 241

Query: 588  WNPHVPEESVVLLDNGELFLFDLDA---CSGVEKLPVKLKGTKVGVSWE-DLGLVSNSNS 755
            W+PH+ EESVVLL+NG+LFLFDLD+            K  GT++ V W+ D G  S+ N 
Sbjct: 242  WSPHLLEESVVLLENGDLFLFDLDSRLKTPHTLNANFKFNGTRLKVPWDIDDGSGSSRNY 301

Query: 756  NPGTTSVEEKNGWLSCEFSWHPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSV 935
                        WLSCEFSWHPR++IVA S+AVFL+D R     VS L  IEML L   +
Sbjct: 302  R-----------WLSCEFSWHPRLLIVARSDAVFLVDLRAHECNVSCLMKIEMLHLYAFI 350

Query: 936  RNDRFIAFCKAGLDGFYFTVATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELR 1115
              ++F+   KAG D F+F +A+   L++ D+RKPLMPVLQWAHGLD P Y+ V RLSELR
Sbjct: 351  EKEQFLVLSKAGSDDFHFVLASDTLLVVCDVRKPLMPVLQWAHGLDKPSYVDVLRLSELR 410

Query: 1116 PALEEDEYRWASESGFAIVLGSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
                +D++ WAS+SGF I++GSFWN EFS+FCYGPS P P GSVASKI    KSFYAW L
Sbjct: 411  SQSRDDKFNWASDSGFCIIVGSFWNCEFSIFCYGPSLPAPIGSVASKIAELRKSFYAWEL 470


>ref|XP_002530358.1| conserved hypothetical protein [Ricinus communis]
            gi|223530105|gb|EEF32019.1| conserved hypothetical
            protein [Ricinus communis]
          Length = 912

 Score =  340 bits (871), Expect = 1e-90
 Identities = 191/392 (48%), Positives = 246/392 (62%), Gaps = 4/392 (1%)
 Frame = +3

Query: 132  SVISSITADFDSQSDETSP--IVNNNLQILRCPN-NDILLFFPTGENSDFVGFVKLSLKG 302
            S  SSIT+   SQ  + S   + +N LQ L CP+ N +++FF TG N D VGF+ LS+  
Sbjct: 83   STASSITSRLGSQFHDNSASLLAHNQLQFLNCPHDNSVIVFFSTGCNHDQVGFLLLSVND 142

Query: 303  SKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSSTVGFLLACTLYSV 482
             +   + D+   VF A+  L+ RI+KILV  V DSG  +   + +S  VG+LL  TL+SV
Sbjct: 143  KRLCAVGDSRGGVFVANKCLNQRIVKILVNPVVDSG--YFEGNASSKIVGYLLVYTLFSV 200

Query: 483  NWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPHVPEESVVLLDNGELFLFDLD 659
            +WF V+I  +    E+PIL ++G K F S S+V  CW+PH+ EESVVLL+NG LFLFDL+
Sbjct: 201  HWFCVKIGEIN---ERPILGHVGCKTFKSCSIVDACWSPHLIEESVVLLENGGLFLFDLN 257

Query: 660  ACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWLSCEFSWHPRIVIVA 839
            + S         +GTK+ V W+DLG   N               WL C+FSWHPRI+IVA
Sbjct: 258  SDSS----NAYFRGTKLKVLWDDLGKSKNFK-------------WLGCQFSWHPRILIVA 300

Query: 840  HSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLDGFYFTVATRYHLLL 1019
             S+AVFL+DWR++   V+ LANI+M  +   V N+RF+ F  A  D F F +A+   L L
Sbjct: 301  SSDAVFLVDWRYDEFKVTCLANIDMFGVYAPVENERFLTFSMAVSDHFQFVLASENMLAL 360

Query: 1020 LDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESGFAIVLGSFWNREF 1199
             D+RKPLMPVLQWAH LD P YI VFRLSELR       + WA+ SGF I+LGSFWN EF
Sbjct: 361  CDVRKPLMPVLQWAHALDRPCYIDVFRLSELRSNSRNSIHEWATTSGFGIILGSFWNCEF 420

Query: 1200 SLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
            SLFCYGP  P   GS+AS+I    KS YAW L
Sbjct: 421  SLFCYGPPLPGQQGSIASEISKISKSAYAWEL 452


>ref|XP_006471160.1| PREDICTED: uncharacterized protein LOC102613824 [Citrus sinensis]
          Length = 910

 Score =  328 bits (842), Expect = 2e-87
 Identities = 188/400 (47%), Positives = 247/400 (61%), Gaps = 9/400 (2%)
 Frame = +3

Query: 123  ILPSVISSITADFDSQSDETSPIVN------NNLQILRCP-NNDILLFFPTGENSDFVGF 281
            +LPS  +SI + FD       P  +      N L++L CP NN  + FFPTG+N+D +GF
Sbjct: 75   LLPSTSTSIASQFDDVGTHQHPNGSLSDQDYNRLRLLYCPLNNTAIAFFPTGDNNDQLGF 134

Query: 282  VKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSST-VGFL 458
            + +S KGS+  ++ D  D VFT  + L+ RI  ILV  V +    +S+  GNS   VG+L
Sbjct: 135  LVISAKGSRFDVLSDEDDAVFTVVNRLNGRIRGILVNPVEEF---YSAFQGNSLVNVGYL 191

Query: 459  LACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPHVPEESVVLLDNG 635
            LA T+YSV+WF V++S       KP++  LG K F + SVV  CW+PH+PEESVVLL +G
Sbjct: 192  LAFTMYSVHWFSVKVSKASESTIKPVVSYLGFKLFKTCSVVGACWSPHLPEESVVLLQSG 251

Query: 636  ELFLFDLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWLSCEFSW 815
            +LF+FD++            KG ++ VSW D  L S+ +             WL  EFSW
Sbjct: 252  DLFMFDVNGRES--------KGKRLRVSWTDDDLSSSQSC-----------AWLGVEFSW 292

Query: 816  HPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLDGFYFTV 995
            HP+I+IVA  +AVFL+D+R +   VS LA I+ML+L   V  + F AF KA  DGF+F +
Sbjct: 293  HPQILIVARMDAVFLVDFRCDDCNVSLLAKIDMLNLYAPVEKELFHAFSKADSDGFHFVL 352

Query: 996  ATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESGFAIVL 1175
            A+   L+L D+R+PLMPVLQWAHGLD P YI  FRLSELR    ++   WA+ESGF I+L
Sbjct: 353  ASDSLLVLCDVRRPLMPVLQWAHGLDKPSYIVSFRLSELRSNSRDNRLEWANESGFGIML 412

Query: 1176 GSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
            GSF N EFSLFCYGPS P   G  AS+I    KS YAW L
Sbjct: 413  GSFSNCEFSLFCYGPSLPGQGGPFASEISKIFKSLYAWEL 452


>ref|XP_004145472.1| PREDICTED: uncharacterized protein LOC101205354 [Cucumis sativus]
          Length = 907

 Score =  325 bits (834), Expect = 2e-86
 Identities = 184/397 (46%), Positives = 248/397 (62%), Gaps = 8/397 (2%)
 Frame = +3

Query: 123  ILPSVISSITADFDSQ---SDETSPIVNNNLQILRCPNND-ILLFFPTGENSDFVGFVKL 290
            ++PS  SS+ + F  Q   SD  S +  N LQ L CPN+  +++FFPTG NSD VGF+ +
Sbjct: 74   VVPSTSSSVASLFGEQQCCSDPPSVLRYNRLQCLPCPNSSSVVVFFPTGPNSDHVGFLVV 133

Query: 291  SLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSSTVGFLLACT 470
            S  GS   +  D  ++VF+ +  L+ +I  I V    +   GF  +  +   +GFLLA T
Sbjct: 134  SSNGSGLDVQSDCSNDVFSVESELNYQIFGIAV----NPNSGF--VDDSYEDIGFLLAYT 187

Query: 471  LYSVNWFRVEISNLGSDLEKPI-LVNLGTKKFNS-SVVHTCWNPHVPEESVVLLDNGELF 644
            +YSV WF V+   +GS  +  + LV++G+K F + SVVH CWNPH+ EESVVLL++G LF
Sbjct: 188  MYSVEWFIVKNHAIGSSCQPRVSLVHMGSKVFKTCSVVHACWNPHLSEESVVLLEDGSLF 247

Query: 645  LFDLDACSGVE--KLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWLSCEFSWH 818
            LFD++     +     V LKG K+ VSW+ L                +K  WLSCEFSWH
Sbjct: 248  LFDMEPLLKTKDYNANVNLKGIKLKVSWDGL-------------DCSKKVKWLSCEFSWH 294

Query: 819  PRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLDGFYFTVA 998
            PRI+IVA S+AVFL+D R     +S L  IE          ++F+AF KAG DGFYF++A
Sbjct: 295  PRILIVARSDAVFLVDLRENDCNISCLMKIETFPTYSLGEKEQFLAFSKAGSDGFYFSIA 354

Query: 999  TRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESGFAIVLG 1178
            + + LLL DIRKPL PVLQW HGLD+P Y+ VF LSELR +     Y+ ASESG+ IVLG
Sbjct: 355  SNHLLLLCDIRKPLSPVLQWTHGLDDPSYMNVFSLSELRSSPGNIMYKVASESGYCIVLG 414

Query: 1179 SFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAW 1289
            SFW+ EF++FCYGPS P    S++S+   + +SFYAW
Sbjct: 415  SFWSSEFNIFCYGPSPPGLDQSISSRSSKYFQSFYAW 451


>emb|CAN64638.1| hypothetical protein VITISV_033929 [Vitis vinifera]
          Length = 865

 Score =  325 bits (834), Expect = 2e-86
 Identities = 179/406 (44%), Positives = 244/406 (60%), Gaps = 7/406 (1%)
 Frame = +3

Query: 99   TRPSKDPFILPSVISSITADFDSQSDETSP------IVNNNLQILRCPNNDILLFFPTGE 260
            ++PS  P       +++T  F   S    P      ++++ L +LRCPN  +L  FPTG 
Sbjct: 25   SKPSLGPLFFNPSPNTLTPLFSKPSFSFPPHLPRSSLLHDRLHLLRCPNAAVLALFPTGV 84

Query: 261  NSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNS 440
            NSD +GF+ LS+K S   +  D   +VF +   L+ RI++IL   +     G+S  SGN 
Sbjct: 85   NSDQIGFLLLSVKDSCLDVRADRNGDVFVSKKRLNHRIVQILATPI-----GYS-FSGNP 138

Query: 441  STVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPHVPEESV 617
             +VG +LACT+YSV+WF V   N+ S+   P L+ LG K F S +VV  CW+PH+ EE +
Sbjct: 139  DSVGLVLACTMYSVHWFSVRNDNIDSE---PGLIYLGGKVFKSCAVVSACWSPHLSEECL 195

Query: 618  VLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWL 797
            VLL++GELFLFDLD C          KG ++ + W +     +               WL
Sbjct: 196  VLLESGELFLFDLDYCCSNSNF----KGNRLKIMWHNADCSGDGK-------------WL 238

Query: 798  SCEFSWHPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLD 977
             CEFSWHPRI+IVA S+AVFL+D RF+   VS LA I M  +   V  + FI+F  AG +
Sbjct: 239  GCEFSWHPRILIVARSDAVFLVDLRFDECSVSCLAKIGMPSVGELVHKEPFISFSMAGSN 298

Query: 978  GFYFTVATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASES 1157
            GF+FTVA+   L L DIR PL+PVLQW+HG+D P Y+ VF+LSELR   ++D+Y+ ASES
Sbjct: 299  GFHFTVASNSLLFLYDIRNPLIPVLQWSHGIDKPCYVRVFKLSELRSHSKDDKYKEASES 358

Query: 1158 GFAIVLGSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
             F I++GSFW  E  +FCYG S  +P GS A +I   CKS+YAW L
Sbjct: 359  AFCIIMGSFWKCECRMFCYGSSFQDPKGSTAYEISKLCKSYYAWEL 404


>ref|XP_006431682.1| hypothetical protein CICLE_v10000213mg [Citrus clementina]
            gi|557533804|gb|ESR44922.1| hypothetical protein
            CICLE_v10000213mg [Citrus clementina]
          Length = 910

 Score =  324 bits (830), Expect = 6e-86
 Identities = 185/400 (46%), Positives = 244/400 (61%), Gaps = 9/400 (2%)
 Frame = +3

Query: 123  ILPSVISSITADFDSQSDETSPIVN------NNLQILRCP-NNDILLFFPTGENSDFVGF 281
            +LPS  +SI + F        P  +      N L++L CP NN  + FFPTG+N+D +GF
Sbjct: 75   LLPSTSTSIASQFGDVGTHQHPDGSLSDQDYNRLRLLYCPLNNTAIAFFPTGDNNDQLGF 134

Query: 282  VKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSST-VGFL 458
            + +S KGS+  ++ D  D +F   + L+ RI  ILV  V +     S+  GNS   VG+L
Sbjct: 135  LVISAKGSRFDVLSDEDDAIFMVLNRLNGRIRGILVNPVEEFD---SAFQGNSLVNVGYL 191

Query: 459  LACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPHVPEESVVLLDNG 635
            LA T+YSV+WF V++S       KP++  LG K F + SVV  CW+PH+PEESVVLL +G
Sbjct: 192  LAFTMYSVHWFSVKVSKASESTTKPVVSYLGFKLFKTCSVVGACWSPHLPEESVVLLQSG 251

Query: 636  ELFLFDLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWLSCEFSW 815
            +LF+FD++A           KG ++ VSW D  L S+ +             WL  EFSW
Sbjct: 252  DLFMFDVNARES--------KGKRLRVSWTDDDLSSSQSC-----------AWLGVEFSW 292

Query: 816  HPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLDGFYFTV 995
            HPRI+IVA  +AVFL+D+R +   VS LA I+ML+L   V  + F  F K   DGF+F +
Sbjct: 293  HPRILIVARMDAVFLVDFRCDDCNVSLLAKIDMLNLYAPVEKELFHTFSKVDSDGFHFVL 352

Query: 996  ATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESGFAIVL 1175
            A+   L+L D+R+PLMPVLQWAHGLD P YI  FRLSELR    ++ + WA+ESGF I+L
Sbjct: 353  ASDSLLVLCDVRRPLMPVLQWAHGLDKPSYIDSFRLSELRSNSRDNRFEWANESGFGIIL 412

Query: 1176 GSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
            GSF N EFSLFCYGPS P   G  AS+I    KS YAW L
Sbjct: 413  GSFSNCEFSLFCYGPSVPGQGGPFASEISKIFKSLYAWEL 452


>ref|XP_007026747.1| TATA box-binding protein-associated factor RNA polymerase I subunit
            C, putative [Theobroma cacao] gi|508715352|gb|EOY07249.1|
            TATA box-binding protein-associated factor RNA polymerase
            I subunit C, putative [Theobroma cacao]
          Length = 910

 Score =  319 bits (817), Expect = 2e-84
 Identities = 181/404 (44%), Positives = 245/404 (60%), Gaps = 4/404 (0%)
 Frame = +3

Query: 96   FTRPSKDPFILPSVISSITA--DFDSQSDETSPIVNNNLQILRCPNNDI-LLFFPTGENS 266
            F   S  P+   S I+S      F   +  +S + +N L +L CP+ +I ++FF TG N 
Sbjct: 66   FLSTSSVPYSASSSIASRFGLESFYDDAASSSFLSHNRLHLLHCPDQNIAVVFFTTGANH 125

Query: 267  DFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSST 446
            D +GF  + ++ +  K + D   ++  + +  + +IL+ILV  V D    F   SG+S  
Sbjct: 126  DRIGFFAVHVQDNDFKFLGDRDGDILISHNHCNHKILRILVSPVDDDD--FEENSGDS-V 182

Query: 447  VGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKF-NSSVVHTCWNPHVPEESVVL 623
            VG+L+ACTLYSV+W+ V+        + P L  LG K F +SS+V  C++PH+P+ES+VL
Sbjct: 183  VGYLMACTLYSVHWYSVKFVKSS---KSPALDYLGCKLFKSSSIVSACFSPHLPQESMVL 239

Query: 624  LDNGELFLFDLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWLSC 803
            L+NG LF FDL++    +      KG K+ V W D             +S  E   WL  
Sbjct: 240  LENGALFFFDLESDVNCQIPNAYFKGNKLRVLWND-------------SSGSENYKWLGV 286

Query: 804  EFSWHPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLDGF 983
            EFSWHPRI+IVA S+AVFL+D R ++  V  LA +EML        D+F+AF +AG DGF
Sbjct: 287  EFSWHPRILIVARSDAVFLVDNRLDQCNVICLAKVEMLSPYTVDEEDQFLAFSRAGADGF 346

Query: 984  YFTVATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESGF 1163
             F +A+R  L+L D+RKP+MP+L+WAH LDNP YI VFRLSELR    +D Y WA+ESGF
Sbjct: 347  QFVLASRSLLVLCDVRKPMMPLLRWAHNLDNPCYIHVFRLSELRSQSRDDRYHWATESGF 406

Query: 1164 AIVLGSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
             I+LGSFWN EF LFCYGP SP   GS AS+I  FCK F AW L
Sbjct: 407  CIILGSFWNCEFRLFCYGP-SPASEGSTASEIAKFCKPFLAWDL 449


>ref|XP_004301624.1| PREDICTED: uncharacterized protein LOC101305856 [Fragaria vesca
            subsp. vesca]
          Length = 914

 Score =  317 bits (813), Expect = 6e-84
 Identities = 185/416 (44%), Positives = 253/416 (60%), Gaps = 10/416 (2%)
 Frame = +3

Query: 78   HKALRNF-TRPSKDPFILPSVISSITADFDSQSDETSPIVN---NNLQILRCPN-NDILL 242
            H +L  F +  S +   LPS  SSI A F       + +++   N L+ L+CP  N IL+
Sbjct: 60   HLSLPRFLSTSSPESAPLPSTSSSI-APFLGPHQYKNDLLSSFRNRLEFLQCPKTNTILI 118

Query: 243  FFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRI---LKILVISVTDSGL 413
            FFPTGENSD VG ++L LK S   + V           GL  R     +IL ISV     
Sbjct: 119  FFPTGENSDQVGLLELVLKDSTFDVKVG----------GLSTRCQFKYQILRISVNPLP- 167

Query: 414  GFSSMSGNSS-TVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSS-VVHTC 587
              S+++GN   T+G++LA T+YSV+WF V++ + GS+ +   LV +G + F +  VVH C
Sbjct: 168  SLSNLTGNGPVTIGYVLASTMYSVHWFIVKLGDFGSNSDSIRLVYVGDRVFKACCVVHAC 227

Query: 588  WNPHVPEESVVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGT 767
            W+PHVPEESVVLL+NG LFLFDL++           KGT++ V W++ G  S +      
Sbjct: 228  WSPHVPEESVVLLENGALFLFDLESRLRNTISNANFKGTRLKVLWDNNGYDSGNYR---- 283

Query: 768  TSVEEKNGWLSCEFSWHPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDR 947
                    WLSCEFSWHPR++IVA S+A+FL+D RF    ++ L NIE+L +   +  ++
Sbjct: 284  --------WLSCEFSWHPRVLIVARSDAIFLVDLRFNECSLTCLMNIELLHMYAPMEREQ 335

Query: 948  FIAFCKAGLDGFYFTVATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALE 1127
            F    K   D F+F +A+   LLL D+RKPLMPVLQWAH ++   Y+ VFRLSELR   +
Sbjct: 336  FCVLSKTSSDSFHFVLASDSLLLLCDVRKPLMPVLQWAHSINKASYVDVFRLSELRSHTK 395

Query: 1128 EDEYRWASESGFAIVLGSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
            ++ Y+W S+SGF I+LGSFWN +F++F YGPS P P GSVASK+    K FYAW L
Sbjct: 396  DNTYKWPSDSGFCIILGSFWNCDFNIFSYGPSLPMPLGSVASKLTELRKCFYAWEL 451


>ref|XP_006588648.1| PREDICTED: uncharacterized protein LOC100797045 isoform X1 [Glycine
            max] gi|571481421|ref|XP_006588649.1| PREDICTED:
            uncharacterized protein LOC100797045 isoform X2 [Glycine
            max]
          Length = 894

 Score =  316 bits (810), Expect = 1e-83
 Identities = 182/396 (45%), Positives = 244/396 (61%), Gaps = 5/396 (1%)
 Frame = +3

Query: 123  ILPSVISSITA--DFDSQSDETSPIVNNNLQILRCPNN-DILLFFPTGENSDFVGFVKLS 293
            ILPS  SS+ +   F +Q+D  S  + N L +L  PN  + ++FFPTG N D +GF  L+
Sbjct: 76   ILPSTASSVASLFSFPNQNDAASLFLRNRLHLLYYPNRPNAVVFFPTGANDDKLGFFILA 135

Query: 294  LKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSSTVGFLLACTL 473
            +K S+  I++D+  +VF A  G   RIL I V  V DSGL        S  +G+LLA  L
Sbjct: 136  VKDSRLDILLDSNGDVFRASTGSAHRILNISVNPVADSGL-----FNESHVIGYLLASAL 190

Query: 474  YSVNWFRVEISNLGSDLEKPILVNLGTKKFNSS-VVHTCWNPHVPEESVVLLDNGELFLF 650
            YSV+WF V+ +++   L++P +  LG K F +  VVH CW+PH+ EES+VLL+NG+LFLF
Sbjct: 191  YSVHWFAVKHNSV---LDRPSVFYLGGKTFKTCPVVHACWSPHILEESLVLLENGQLFLF 247

Query: 651  DLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWLSCEFSWHPRIV 830
            DL++    +      KGT++ V W DLG   N+              WLSCEFSWHPR+ 
Sbjct: 248  DLESH---DTTGAAFKGTRLKVPWNDLGFSVNNTV------------WLSCEFSWHPRVF 292

Query: 831  IVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLDGFYFTVATRYH 1010
            +VA S+AVFL+D+R +   VS L  IE L +     N+RF+A  + G D FYF VA+   
Sbjct: 293  VVARSDAVFLVDFRLKECSVSCLMKIETLRMYAPGGNERFLALSRVGPDDFYFAVASTSL 352

Query: 1011 LLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESGFAIVLGSFWN 1190
            LLL D+RKPL+PVLQW HG++ P +++V  LS LR    +D ++ ASESGF IVLGSFWN
Sbjct: 353  LLLCDMRKPLVPVLQWMHGIEGPCFMSVLSLSNLRSHSRDDAFKLASESGFCIVLGSFWN 412

Query: 1191 REFSLFCYGPSSPEPHGSVASKI-LNFCKSFYAWGL 1295
             EF++FCYG   P   GSV SKI  N C    AW L
Sbjct: 413  CEFNIFCYGSILPFRKGSVTSKINPNIC----AWEL 444


>ref|XP_003600764.1| hypothetical protein MTR_3g069120 [Medicago truncatula]
            gi|355489812|gb|AES71015.1| hypothetical protein
            MTR_3g069120 [Medicago truncatula]
          Length = 884

 Score =  305 bits (780), Expect = 4e-80
 Identities = 177/405 (43%), Positives = 239/405 (59%), Gaps = 9/405 (2%)
 Frame = +3

Query: 108  SKDPFILPSVISSITADFDS----QSDETSPIVNNNLQILRCPNND-ILLFFPTGENSDF 272
            + DP ILPS  S+I   FDS      D  S  ++N +Q+L+CPN    ++ FPTG N + 
Sbjct: 67   TSDPSILPSTASTIAHLFDSTPELDDDNVSHFLHNRIQLLKCPNTPKAVVIFPTGANDET 126

Query: 273  VGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSSTVG 452
            +GF  L +K S  +  +D   +VF A  G   RIL++ V  VT+      S   +S  +G
Sbjct: 127  IGFFMLGVKDSLLETRLDVKGDVFRASTGSSSRILRMSVNPVTED----DSEPDSSPVIG 182

Query: 453  FLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKK-FNSSVVHTCWNPHVPEESVVLLD 629
            ++LA + YSV WF V+  NL SD   P +  LG  K F  +VV  CW+PH+ EES+VLL+
Sbjct: 183  YVLASSRYSVCWFDVK-HNLSSD--SPSMSYLGRSKVFKEAVVRACWSPHILEESMVLLE 239

Query: 630  NGELFLFDLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWLSCEF 809
            +G+LFLFD+DA   ++      KGT++ V W D             ++  E   WLSCEF
Sbjct: 240  SGQLFLFDVDAQGSMKTF----KGTRLRVPWND-------------SACSENKAWLSCEF 282

Query: 810  SWHPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGL---DG 980
            SWHPRI+IVA  +AVFL+D+R     V+ L  IE L +     N+RF+A  + G    D 
Sbjct: 283  SWHPRILIVARYDAVFLVDFRSNECNVTCLLKIETLRMYAPDENERFLALSRVGTESPDN 342

Query: 981  FYFTVATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESG 1160
            FYFTV +R  L+L DIR PL PVLQW HG+D P Y+TV  LS LR   +ED ++ ASE G
Sbjct: 343  FYFTVTSRSLLVLCDIRNPLKPVLQWRHGIDEPCYMTVLSLSTLRSHSKEDTFQLASEMG 402

Query: 1161 FAIVLGSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
            F I+LGSFWN EF++FCYGP+S    GS+ S +     +F AW L
Sbjct: 403  FCIILGSFWNSEFNIFCYGPASFR-KGSITSTLSKINTTFCAWEL 446


>ref|XP_002317716.1| hypothetical protein POPTR_0012s03820g [Populus trichocarpa]
            gi|222858389|gb|EEE95936.1| hypothetical protein
            POPTR_0012s03820g [Populus trichocarpa]
          Length = 906

 Score =  303 bits (776), Expect = 1e-79
 Identities = 171/394 (43%), Positives = 236/394 (59%), Gaps = 8/394 (2%)
 Frame = +3

Query: 132  SVISSITADFDSQSDE-TSPIVN-NNLQILRCPNND-ILLFFPTGENSDFVGFVKLSLKG 302
            S  SSI   F  Q    +SP++  N LQ L+CP++D +++FF TG N D VGF+ LS+K 
Sbjct: 86   STASSIAFSFGPQDLHFSSPLLAYNRLQFLKCPHDDTVVVFFSTGTNLDRVGFLLLSVKD 145

Query: 303  SKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGN---SSTVGFLLACTL 473
                   D    +FTA   L  +I+++LV  + D     S ++GN   S + G+LL  T+
Sbjct: 146  KSLVATGDQKGGIFTASKSLGSKIVRVLVNPIEDD----SFLNGNYSFSGSFGYLLVYTM 201

Query: 474  YSVNWFRVEISNLGSDLEKPILVNLGTKKFNS-SVVHTCWNPHVPEESVVLLDNGELFLF 650
            YSVNWF V+ S     +++P+L  LG K F S  +   CW+P++  +SVVLL+NG LFLF
Sbjct: 202  YSVNWFCVKYSE---SMKRPVLSYLGCKNFKSCGIASACWSPYIKVQSVVLLENGTLFLF 258

Query: 651  DLDA-CSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWLSCEFSWHPRI 827
            DL+A CS +       +GTK+ VSW D G + +               WL CEFSWH R+
Sbjct: 259  DLEADCSDMY-----FRGTKLKVSWGDEGKLGDGK-------------WLGCEFSWHCRV 300

Query: 828  VIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLDGFYFTVATRY 1007
            +IVA S+AVF+IDW+     V+ LA I+M         +RF+A  +A  D  +F + +  
Sbjct: 301  LIVARSDAVFMIDWKCGGFDVTCLARIDMFSAYALSEKERFLAMSRAVSDSLHFVLVSET 360

Query: 1008 HLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESGFAIVLGSFW 1187
             L++ D+RKP++P+LQWAHGLD P +I VFRLS+LR    +D + WA+ SGF I+LGSFW
Sbjct: 361  MLVICDVRKPMIPLLQWAHGLDKPCFIDVFRLSDLRSNSRDDTHDWANSSGFGIILGSFW 420

Query: 1188 NREFSLFCYGPSSPEPHGSVASKILNFCKSFYAW 1289
            N EFSLFCYGPS P   GS A +I  F    YAW
Sbjct: 421  NCEFSLFCYGPSFPPRKGSFALEISKFSSCLYAW 454


>ref|XP_007132389.1| hypothetical protein PHAVU_011G090800g [Phaseolus vulgaris]
            gi|593199831|ref|XP_007132390.1| hypothetical protein
            PHAVU_011G090800g [Phaseolus vulgaris]
            gi|593199873|ref|XP_007132391.1| hypothetical protein
            PHAVU_011G090800g [Phaseolus vulgaris]
            gi|561005389|gb|ESW04383.1| hypothetical protein
            PHAVU_011G090800g [Phaseolus vulgaris]
            gi|561005390|gb|ESW04384.1| hypothetical protein
            PHAVU_011G090800g [Phaseolus vulgaris]
            gi|561005391|gb|ESW04385.1| hypothetical protein
            PHAVU_011G090800g [Phaseolus vulgaris]
          Length = 894

 Score =  295 bits (756), Expect = 2e-77
 Identities = 173/394 (43%), Positives = 225/394 (57%), Gaps = 6/394 (1%)
 Frame = +3

Query: 96   FTRPSKDPFILPSVISSITADFDS--QSDETSPIVNNNLQILRCPNNDI-LLFFPTGENS 266
            F   S  P ILPS  SSI + F S  Q+D   P ++N L +L  P+    LL FP G N 
Sbjct: 67   FLLSSHPPSILPSTASSIASLFSSTHQNDAAPPFLHNRLHLLTYPHRPYALLLFPAGSND 126

Query: 267  DFVGFVKLSLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGN--S 440
              + F  L  K S+    +D   +VF A  G   RIL I V  V D G   S    +  S
Sbjct: 127  HKLAFFTLRFKDSRFHTQLDTKGDVFYASTGSSHRILNISVNPVADFGFTGSDDEDDDAS 186

Query: 441  STVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSS-VVHTCWNPHVPEESV 617
              +G+LLA TLYSV+WF   ++     L++P +V LG K F +  V H CW+PH+ EESV
Sbjct: 187  RVIGYLLATTLYSVHWF---VARHNQILDRPSVVCLGDKMFKTCPVAHACWSPHILEESV 243

Query: 618  VLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWL 797
            VLL++G+LFLFDL+ C          KGT++ V W D                 E   WL
Sbjct: 244  VLLESGQLFLFDLECCGA----GAGFKGTRLKVPWIDSS---------------ESKVWL 284

Query: 798  SCEFSWHPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLD 977
            SCEFSWHPRI++VA S+AVFL+D R +   VS L  IE L +     N+RF+A  +A  D
Sbjct: 285  SCEFSWHPRILVVARSDAVFLVDLRLKDCSVSCLMKIETLRMYAPDENERFLAMARAAPD 344

Query: 978  GFYFTVATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASES 1157
             FYF V +   LLL D+RKPL+PVLQW HG++ P +++V  LS+LR    ED ++ ASE+
Sbjct: 345  NFYFAVVSSSVLLLCDVRKPLVPVLQWVHGIEGPSFMSVLSLSDLRSHSREDAFKLASET 404

Query: 1158 GFAIVLGSFWNREFSLFCYGPSSPEPHGSVASKI 1259
            GF I+LGS WN EF++FCYG   P    SV SKI
Sbjct: 405  GFCIMLGSIWNCEFNIFCYGNVLPFRKKSVTSKI 438


>ref|XP_006844152.1| hypothetical protein AMTR_s00006p00260920 [Amborella trichopoda]
            gi|548846551|gb|ERN05827.1| hypothetical protein
            AMTR_s00006p00260920 [Amborella trichopoda]
          Length = 929

 Score =  288 bits (738), Expect = 3e-75
 Identities = 174/409 (42%), Positives = 231/409 (56%), Gaps = 9/409 (2%)
 Frame = +3

Query: 96   FTRPSKDPFILPSVISSITADFDSQSDETSPIVNNNLQILRCPNNDILLFFPTGENSDFV 275
            F R S D FI   +I S T     +         N L +L C N + LL FP+GENSD +
Sbjct: 71   FYRRSDDDFIPFPLIFSTTKSAAGKHSSRH-FFGNPLHLLTCRNGEFLLLFPSGENSDRL 129

Query: 276  GFVKLSLKGSKPKIMVDNG-------DNVFTADHGLDCRILKILVISVTDSGLGFSSMSG 434
              V     G + +   DNG       D+VF        RI+++ VIS  D     SS   
Sbjct: 130  ACVV----GRRER---DNGGGFSLVKDSVFLLSPSFKNRIIRVSVISTADCAS--SSEVC 180

Query: 435  NSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSSVVHTCWNPHVPEES 614
            +  T GF+L C+ Y V+W RV + N       P+  NL +  F + V H CW+P++PEES
Sbjct: 181  DQFTEGFVLLCSHYEVHWLRVGVRN-----STPLSQNLASATFKNQVAHACWSPYLPEES 235

Query: 615  VVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGW 794
             VLL NGEL L+DL+ C GV+ LPVK KG  V    ++LG +          S E  N W
Sbjct: 236  AVLLVNGELRLYDLNYCVGVKNLPVKFKGELVS---KNLGSL---------ISRESDNDW 283

Query: 795  LSCEFSWHPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLI--HSVRNDRFIAFCKA 968
              CEF WHPR++IV    +V ++D+R ++  V+ LA IE+ D +  H + +DRF AFCKA
Sbjct: 284  FCCEFGWHPRVLIVTSKTSVLMVDFRDKKVKVTVLAKIELCDSVKHHFIESDRFQAFCKA 343

Query: 969  GLDGFYFTVATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWA 1148
              DGF F+VAT+Y+LLL D RKPL PVLQW H LD+ RYI ++RLS+LRP+      +W 
Sbjct: 344  SFDGFLFSVATKYYLLLFDTRKPLDPVLQWDHHLDHVRYINMYRLSDLRPS--NGTLKWV 401

Query: 1149 SESGFAIVLGSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
            S+SG+ I++GSF N EFSLFCYG   P P   +     +   S YAWGL
Sbjct: 402  SDSGYVILVGSFRNCEFSLFCYG---PHPIVDLKPGWTSDSGSLYAWGL 447


>gb|EYU36397.1| hypothetical protein MIMGU_mgv1a020247mg [Mimulus guttatus]
          Length = 935

 Score =  284 bits (727), Expect = 5e-74
 Identities = 168/382 (43%), Positives = 227/382 (59%), Gaps = 16/382 (4%)
 Frame = +3

Query: 198  NNLQILRCPN-NDILLFFPTGENSDFVGFVKLSLKGSKPKIMVDNGDNVF------TADH 356
            N+LQ+L+ P  N I++FFPTGENSD VGF  LSLK     +    G N+F      + +H
Sbjct: 101  NSLQLLQIPTKNLIVVFFPTGENSDHVGFSLLSLKEGNLNVR-SQGGNIFELVKEGSLNH 159

Query: 357  GLDCRILKILVISVTD---SGLGFSSMSGNSSTVGFLLACTLYSVNWFRVEISNLGSDLE 527
                RI ++LV  V D      G ++   N+ T GFL+ CT YSV W++V I+ L    E
Sbjct: 160  H---RITRLLVNPVDDFCGDVNGDNNKHDNAVTAGFLMVCTSYSVCWYKVGITTLRGQDE 216

Query: 528  KPILVN-LGTKKFN----SSVVHTCWNPHVPEESVVLLDNGELFLFDLDACSGVEKLPVK 692
              + V+ LG         + V   CW+PH+ EE +VLLDNG+L LFD+    G +   + 
Sbjct: 217  YSVCVDYLGCANIKMLRGNRVAGACWSPHLREECLVLLDNGDLLLFDVSYYYGEKAESIS 276

Query: 693  LKGTKVGVSWEDLGL-VSNSNSNPGTTSVEEKNGWLSCEFSWHPRIVIVAHSNAVFLIDW 869
            L      V  + + + ++N +      S  +   W  CEFSWHPRI I +HS+ VFL+D 
Sbjct: 277  LVRNNNNVVKKIMQVSLTNESGLEKEESGNDGRCWFECEFSWHPRIFIASHSDGVFLVDL 336

Query: 870  RFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLDGFYFTVATRYHLLLLDIRKPLMPV 1049
            R     +S L  +E L +    +ND F A  +AG DGF FTV+TRY LLL D+RKPL P+
Sbjct: 337  RSSVHNISCLLKLETLSM---GKNDEFCALSRAGSDGFTFTVSTRYLLLLFDVRKPLAPI 393

Query: 1050 LQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESGFAIVLGSFWNREFSLFCYGPSSP 1229
            L+WAH + NPRY+TVFRLSELR A  ++ Y+ A ESG+ +VLGSFW+ +FSLFCYGP   
Sbjct: 394  LRWAHDIRNPRYLTVFRLSELR-ANADNTYKLALESGYCVVLGSFWDSQFSLFCYGPDCR 452

Query: 1230 EPHGSVASKILNFCKSFYAWGL 1295
              + SV+S+I  FC   YAWGL
Sbjct: 453  SDNKSVSSEISKFCNLCYAWGL 474


>ref|XP_004231258.1| PREDICTED: uncharacterized protein LOC101260775 [Solanum
            lycopersicum]
          Length = 907

 Score =  279 bits (713), Expect = 2e-72
 Identities = 171/406 (42%), Positives = 231/406 (56%), Gaps = 15/406 (3%)
 Frame = +3

Query: 123  ILPSVISSITADFDSQSDETSPIVN-NNLQILRCPN-------NDILLFFPTGENSDFVG 278
            +L S  SSI  +F  Q  +T  I N N++Q L  PN       N I+   PTGEN D VG
Sbjct: 85   MLFSTASSIATEFSPQVSDT--IHNFNSIQFLPLPNFGENSKPNSIIGISPTGENYDQVG 142

Query: 279  FVKLSLKGSK-PKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSSTVGF 455
               L  + ++       NG ++   +H L+ RIL++LV  V++      S S +  T G+
Sbjct: 143  LFMLCSEDTQFVAKKFKNGTSILVHNHKLNFRILRLLVNPVSEID---DSCSSSCITFGY 199

Query: 456  LLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFN----SSVVHTCWNPHVPEESVVL 623
            LL CTLYSV+W+ V+I   G   E  +L  +G+   N      V H CW+PH+ EE VV+
Sbjct: 200  LLVCTLYSVHWYSVKIGVKGD--ENVMLDYVGSADRNLFKGGIVSHACWSPHLREECVVM 257

Query: 624  LDNGELFLFDLDACSGVEKLPVK--LKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWL 797
            L NGE+FLFD+ +C   +       L+G K+ V W+ L               +    W+
Sbjct: 258  LKNGEMFLFDMGSCGKSQAFCASDVLQGKKLQVLWDKL---------------DRDEHWV 302

Query: 798  SCEFSWHPRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLD 977
            +CEFSWHPRI+IVA+S  VFL+D R ++  V TL NIE    + S R DRFIA  +   D
Sbjct: 303  TCEFSWHPRILIVANSRTVFLVDLRSDKCKVCTLLNIEA---VSSGRTDRFIALSRVEAD 359

Query: 978  GFYFTVATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASES 1157
             F FT  +   LLL D+RKPLMP+LQW HGL+NP Y+TV RLS+LR    +D++ WA+ES
Sbjct: 360  VFCFTAVSGRSLLLCDVRKPLMPLLQWVHGLNNPAYVTVLRLSDLRRRTRDDKWAWATES 419

Query: 1158 GFAIVLGSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
            G  I++GSFW+ EF+LFCYGP     H    S+I    KS  AWGL
Sbjct: 420  GRCILVGSFWDCEFALFCYGPD--YNHSHKFSEIARLSKSVNAWGL 463


>ref|XP_006841229.1| hypothetical protein AMTR_s00135p00060200 [Amborella trichopoda]
            gi|548843145|gb|ERN02904.1| hypothetical protein
            AMTR_s00135p00060200 [Amborella trichopoda]
          Length = 397

 Score =  271 bits (694), Expect = 3e-70
 Identities = 156/351 (44%), Positives = 211/351 (60%), Gaps = 11/351 (3%)
 Frame = +3

Query: 198  NNLQILRCPNNDILLFFPTGENSDFVGFVKLSLKGSKPKIMVDNG-------DNVFTADH 356
            N L +L C N +IL+ FP+ ENSD +  V     G + +   DNG       D+VF    
Sbjct: 60   NPLHLLTCRNGEILILFPSRENSDRLACVV----GRRER---DNGGGFSLLKDSVFLLSP 112

Query: 357  GLDCRILKILVISVTDSGLGFSSMSG--NSSTVGFLLACTLYSVNWFRVEISNLGSDLEK 530
                RI+ + VIS  D    ++S S   +  T GF+L C+ Y V+W RV + N       
Sbjct: 113  SFKNRIIGVSVISTAD----YASCSEVCDQFTKGFVLLCSHYEVHWLRVGVRN-----ST 163

Query: 531  PILVNLGTKKFNSSVVHTCWNPHVPEESVVLLDNGELFLFDLDACSGVEKLPVKLKGTKV 710
            P+  NL +  F + V H CW+P++PEES VLL NGEL L+DL+ C GV+ LPVK KG  V
Sbjct: 164  PLSQNLASATFKNQVAHACWSPYLPEESAVLLVNGELRLYDLNYCVGVKNLPVKFKGELV 223

Query: 711  GVSWEDLGLVSNSNSNPGTTSVEEKNGWLSCEFSWHPRIVIVAHSNAVFLIDWRFERSIV 890
                ++LG V          S E  N W  CEF WHPR++IV     V ++D+R ++  V
Sbjct: 224  ---LKNLGSV---------ISRESDNDWFCCEFGWHPRVLIVTSKTTVLMVDFRDKKVKV 271

Query: 891  STLANIEMLDLI--HSVRNDRFIAFCKAGLDGFYFTVATRYHLLLLDIRKPLMPVLQWAH 1064
            + LA IE+ D +  H +++DRF AFCKA  DG  F+VAT+Y+LLL D RKPL PVLQW H
Sbjct: 272  TVLAKIELCDAVKHHFIKSDRFQAFCKASFDGSLFSVATKYYLLLFDTRKPLDPVLQWDH 331

Query: 1065 GLDNPRYITVFRLSELRPALEEDEYRWASESGFAIVLGSFWNREFSLFCYG 1217
             LD+  YI ++RLS+LRP+    + +WAS+SG+ I++GSF N EFSLFCYG
Sbjct: 332  HLDHVHYINMYRLSDLRPS--NGKLKWASDSGYVILVGSFRNCEFSLFCYG 380


>ref|XP_004166877.1| PREDICTED: uncharacterized LOC101205354 [Cucumis sativus]
          Length = 862

 Score =  258 bits (660), Expect = 3e-66
 Identities = 162/397 (40%), Positives = 220/397 (55%), Gaps = 8/397 (2%)
 Frame = +3

Query: 123  ILPSVISSITADFDSQ---SDETSPIVNNNLQILRCPNND-ILLFFPTGENSDFVGFVKL 290
            ++PS  SS+ + F  Q   SD  S +  N LQ L CPN+  +++FFPTG NSD VGF+ +
Sbjct: 69   VVPSTSSSVASLFGEQQCYSDPPSVLRYNRLQCLPCPNSSSVVVFFPTGPNSDHVGFLVV 128

Query: 291  SLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSMSGNSSTVGFLLACT 470
            S  GS   +  D  ++VF+ +  L+ +I  I V    +   GF  +  +   +GFLLA T
Sbjct: 129  SSNGSGLDVQSDCSNDVFSVESELNYQIFGIAV----NPNSGF--VDDSYEDIGFLLAYT 182

Query: 471  LYSVNWFRVEISNLGSDLEKPI-LVNLGTKKFNS-SVVHTCWNPHVPEESVVLLDNGELF 644
            +YSV WF V+   +GS  +  + LV++G+K F + SVVH CWNPH+ EESVVLL++G LF
Sbjct: 183  MYSVEWFIVKNHAIGSSCQPRVSLVHMGSKVFKTCSVVHACWNPHLSEESVVLLEDGSLF 242

Query: 645  LFDLDACSGVE--KLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKNGWLSCEFSWH 818
            LFD++     +     V LKG K+ VSW+ L       + P T S+ EK  +L       
Sbjct: 243  LFDMEPLLKTKDYNANVNLKGIKLKVSWDGLDCSKKIETFP-TYSLGEKEQFL------- 294

Query: 819  PRIVIVAHSNAVFLIDWRFERSIVSTLANIEMLDLIHSVRNDRFIAFCKAGLDGFYFTVA 998
                                                         AF KAG DGFYF++A
Sbjct: 295  ---------------------------------------------AFSKAGSDGFYFSIA 309

Query: 999  TRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDEYRWASESGFAIVLG 1178
            + + LLL DIRKPL PVLQW HGLD+P Y+ VF LSELR +     Y+ ASESG  IVLG
Sbjct: 310  SNHLLLLCDIRKPLSPVLQWTHGLDDPSYMNVFSLSELRSSPGNIMYKVASESGCCIVLG 369

Query: 1179 SFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAW 1289
            SFW+ EF++FCYGPS P    S++S+   + +SFYAW
Sbjct: 370  SFWSSEFNIFCYGPSPPGLDQSISSRSSKYFQSFYAW 406


>ref|XP_006649708.1| PREDICTED: uncharacterized protein LOC102721249 [Oryza brachyantha]
          Length = 874

 Score =  231 bits (588), Expect = 7e-58
 Identities = 155/413 (37%), Positives = 219/413 (53%), Gaps = 8/413 (1%)
 Frame = +3

Query: 81   KALRNFTRPSKDPFILPSVISSITADFDSQSDETSPIVNNNLQILR---CPNNDILLFFP 251
            + LR+F R S   FI  S +  ++    +      P  +N L +LR     +  ++LFFP
Sbjct: 62   RLLRHFVRSSS--FIPYSDLDPLSETVLAPPSPPLPAPSNLLAVLRRARSSSQSLVLFFP 119

Query: 252  TGENSDFVGFVKL-SLKGSKPKIMVDNGDNVFTADHGLDCRILKILVISVTDSGLGFSSM 428
            +GEN+D V +V L S+  S P       D      H       +I  ++VT     + S 
Sbjct: 120  SGENADQVSYVTLDSIANSAPLSASVQSDGFMHPRH-------RIQQLAVTACCPSWPSD 172

Query: 429  SGNSSTVGFLLACTLYSVNWFRVEISNLGSDLEKPILVNLGTKKFNSSVVHTCWNPHVPE 608
            SG+    GFLLA TLYSVNWF+VE  + GS    P LV    + F+ +VVH CW+ H+  
Sbjct: 173  SGDDLVEGFLLAATLYSVNWFKVESRSSGS----PALVPTAKQAFDVAVVHACWSKHLQS 228

Query: 609  ESVVLLDNGELFLFDLDACSGVEKLPVKLKGTKVGVSWEDLGLVSNSNSNPGTTSVEEKN 788
            E +VLL++GEL  FDLD   G +         KVG+  ED                 +  
Sbjct: 229  ECLVLLESGELCWFDLDTLRGGKM--------KVGLGCED-----------------DCR 263

Query: 789  GWLSCEFSWHPRIVIVAHSNAVFLIDWRF-ERSIVSTLANIEMLDLIHS---VRNDRFIA 956
             WLSCE+   P  VIVA++ A+FL+D R+ + S    LA + M  L  +   V+ + ++A
Sbjct: 264  VWLSCEYGAQPWTVIVANTKAIFLVDLRYGDHSEYKVLARVGMEGLFETEPFVKTECYLA 323

Query: 957  FCKAGLDGFYFTVATRYHLLLLDIRKPLMPVLQWAHGLDNPRYITVFRLSELRPALEEDE 1136
            FCKA  D    +V T  HL++LDIR+PL PVL W HGLDNP ++ +F+LSELRP+   +E
Sbjct: 324  FCKAPFDDLLISVVTERHLMVLDIRQPLTPVLTWQHGLDNPSHLAMFQLSELRPS---NE 380

Query: 1137 YRWASESGFAIVLGSFWNREFSLFCYGPSSPEPHGSVASKILNFCKSFYAWGL 1295
            + WAS  G AI++GS W+ +F+LF  G   P+  GS  +  L      YAW L
Sbjct: 381  HEWASNFGIAILVGSLWSTDFNLFYCG---PKEQGSTENAHL------YAWDL 424


>gb|EAZ26218.1| hypothetical protein OsJ_10085 [Oryza sativa Japonica Group]
          Length = 876

 Score =  231 bits (588), Expect = 7e-58
 Identities = 145/358 (40%), Positives = 197/358 (55%), Gaps = 9/358 (2%)
 Frame = +3

Query: 186  PIVNNNLQILRCPNND--ILLFFPTGENSDFVGFVKLS--LKGSKPKIMVDNGDNVFTAD 353
            P  +N L +LR P++   +++FFP+GEN++ V +V L      + P       D      
Sbjct: 98   PAPSNLLAVLRAPSSSRSLVVFFPSGENAEQVSYVTLDPVADPTTPLSHSVQSDGFMHPR 157

Query: 354  HGLDCRILKILVISVTDSGLGFSSMSGNSSTVGFLLACTLYSVNWFRVEISNLGSDLEKP 533
            H       +I  ++ T S   + S S +SS  GFLLA TLYSVNWF+VE    GS    P
Sbjct: 158  H-------RIQQLATTASWSSWPSRSRDSSIEGFLLAATLYSVNWFKVESRGSGS----P 206

Query: 534  ILVNLGTKKFNSSVVHTCWNPHVPEESVVLLDNGELFLFDLDACSGVEKLPVKLKGTKVG 713
             LV    + F+++VVH CW+ H+  E VVLL+NG+L  FDLD   G +         KVG
Sbjct: 207  ALVPAAKQAFDAAVVHACWSKHLQSECVVLLENGQLCWFDLDTRRGGKM--------KVG 258

Query: 714  V-SWEDLGLVSNSNSNPGTTSVEEKNGWLSCEFSWHPRIVIVAHSNAVFLIDWRF-ERSI 887
              S +DLG                   WLSCE+   P  VIVA + A+ L+D RF +   
Sbjct: 259  FGSKDDLG------------------DWLSCEYGAQPWTVIVASTAAILLVDMRFGDHGE 300

Query: 888  VSTLANIEMLDLIHS---VRNDRFIAFCKAGLDGFYFTVATRYHLLLLDIRKPLMPVLQW 1058
               LA + M  L  +   V+   ++AFCKA  D F  +V T  HL++ DIR+PL+PVL W
Sbjct: 301  YKVLARVGMEGLFETDPFVKTQCYLAFCKAPFDDFLISVVTERHLMVFDIRRPLIPVLAW 360

Query: 1059 AHGLDNPRYITVFRLSELRPALEEDEYRWASESGFAIVLGSFWNREFSLFCYGPSSPE 1232
             HGLDNP +I +FRLSELRP+    E+ WAS SGFAI++GS W+ EF+LF  GP   +
Sbjct: 361  QHGLDNPNHIAMFRLSELRPS---KEHEWASNSGFAILVGSLWSTEFNLFFCGPKEQD 415

