Use -O1 optimization level #682

jmert · 2020-09-21T15:09:49Z

Extracted from #681 so that this can have some more discussion and not slow down that PR. From the other PR:

[This] commit enables a lower optimization level for the entire module in Julia 1.5+. The motivation here is that a lot of functions do not type infer, so we might as well tell the compiler to not try too hard.

musm · 2020-09-21T17:43:33Z

I think Plots.jl is having to do something similar to this.

jmert · 2020-10-29T16:41:00Z

Just out of curiosity: rebased on master (307e7db), and running my timing script:

   master precompile:  2.272 ± 0.0294
   master pkg load:    0.624 ± 0.0097
   master pkg test:   47.815 ± 0.6639
optlevel1 precompile:  2.235 ± 0.0462
optlevel1 pkg load:    0.549 ± 0.0110
optlevel1 pkg test:   40.586 ± 0.4779

musm · 2020-10-29T16:53:12Z

I'm fine with merging this as it does help, as long as we add some comments on it that it's likely unnecessary if we manually fix invalidations in the future.

jmert · 2020-10-29T17:07:40Z

I'm not in a rush to merge this — was just rebasing some branches and thought I'd just leave a contextual breadcrumb.

Invalidations aren't the thing that this is trying to solve — the invalidations are the same on both master and this branch:

julia> using SnoopCompileCore

julia> invs = @snoopr include("test/runtests.jl")
[ Info: Precompiling HDF5 [f67ccb44-e63f-5c2f-98bd-6dc0ccc4ba2f]
HDF5 version 1.10.4
...

julia> using SnoopCompile

julia> invalidation_trees(invs)
2-element Vector{SnoopCompile.MethodInvalidations}:
 inserting datatype(::Type{foo_hdf5}) in Main at /home/justin/.julia/dev/HDF5/test/compound.jl:32 invalidated:
   mt_backedges: 1: signature Tuple{typeof(datatype), Type} triggered MethodInstance for d_create_external(::HDF5.File, ::String, ::GenericString, ::Type, ::Tuple{Int64, Int64}, ::Int64) (1 children)

 inserting names(x::Union{HDF5.Attributes, HDF5.File, HDF5.Group}) in HDF5 at deprecated.jl:70 invalidated:
   mt_backedges: 1: signature Tuple{typeof(names), Any} triggered MethodInstance for iterate(::Base.Generator{Vector{Any}, typeof(names)}, ::Int64) (3 children)
                 2: signature Tuple{typeof(names), Any} triggered MethodInstance for iterate(::Base.Generator{Vector{Any}, typeof(names)}) (4 children)
   10 mt_cache

It's just that compiler optimizations don't achieve much on poorly type-inferred results, so we can head that off by just telling the compiler to not try very hard at optimizing the generated code.

musm · 2020-10-29T17:13:54Z

Oh I see, thanks for the clarification. So we have a lot of poorly type-inferred results. It might be tricky to improve the situation on that front.

jmert · 2020-10-29T17:17:47Z

Yeah, its the very dynamic nature of operations like e.g. read(::Dataset) -> ??? that is going to be the limiting case.

I'm guessing something that will also help is using the other SnoopCompile features and seeing if/where some @nospecialize, ::Any type assertions, and/or other purposeful type-widening might just head off unnecessary inference and compiled specializations.

musm · 2020-12-07T21:42:40Z

For me timings have now improved, the tests timing difference is about ~ 1 s. And the differences in load and precompile are within 1/10 of a second.

jmert · 2020-12-09T20:00:28Z

This is what I get on master, a rebased version of this PR, and then a branch where I've additionally removed the deprecations file.

   master precompile:  2.378 ± 0.0274
   master pkg load:    0.562 ± 0.0123
   master pkg test:   52.051 ± 1.6150
optlevel1 precompile:  2.284 ± 0.0135
optlevel1 pkg load:    0.481 ± 0.0120
optlevel1 pkg test:   43.644 ± 0.7541
  no_deps precompile:  2.137 ± 0.0149
  no_deps pkg load:    0.482 ± 0.0092
  no_deps pkg test:   43.443 ± 0.5663

So package precompile and load times are very close now, but optlevel 1 still reduces the entire test suite time by ~8 seconds (~15% reduction from master).

mkitti

We should do some measurements, but I find it unlikely we are doing anything compute intensive here that warrants extensive optimization. I thus recommend we merge this.

musm · 2022-05-31T21:27:37Z

I'd also recommend testing/benchmarking

if isdefined(Base, :Experimental) && isdefined(Base.Experimental, Symbol("@max_methods"))
    @eval Base.Experimental.@max_methods 1
    @eval Base.Experimental.@optlevel 0
end

mkitti · 2022-05-31T21:41:52Z

Last CI run on master, julia-actions/julia-runtest@latest took 1m 58s for ubuntu-latest Julia 1.
On this pull request, julia-actions/julia-runtest@latest took 1m 46s for ubuntu-latest Julia 1.

mkitti · 2022-05-31T21:51:48Z

Looking further, it's a bit of a wash.

Platform	master	-O1 optimization
ubuntu-latest	1m 58 s	1m 46 s
macOS	1m 55s	2m 16s
windows x64	2m 15s	2m 2s
windows x86	2m 33s	2m 43 s

jmert mentioned this pull request Sep 21, 2020

Speed up (pre)compile and load times #681

Merged

jmert force-pushed the optlevel1 branch from c2c8130 to fcdd5e7 Compare October 7, 2020 18:45

Use -O1 optimization level

f58f9f2

jmert force-pushed the optlevel1 branch from fcdd5e7 to f58f9f2 Compare October 29, 2020 16:41

Merge branch 'master' into optlevel1

c7615af

mkitti requested a review from musm May 31, 2022 21:23

mkitti approved these changes May 31, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Use -O1 optimization level #682

Use -O1 optimization level #682

jmert commented Sep 21, 2020

musm commented Sep 21, 2020

jmert commented Oct 29, 2020

musm commented Oct 29, 2020

jmert commented Oct 29, 2020 •

edited

Loading

musm commented Oct 29, 2020

jmert commented Oct 29, 2020

musm commented Dec 7, 2020

jmert commented Dec 9, 2020 •

edited

Loading

mkitti left a comment

musm commented May 31, 2022

mkitti commented May 31, 2022

mkitti commented May 31, 2022

Use -O1 optimization level #682

Are you sure you want to change the base?

Use -O1 optimization level #682

Conversation

jmert commented Sep 21, 2020

musm commented Sep 21, 2020

jmert commented Oct 29, 2020

musm commented Oct 29, 2020

jmert commented Oct 29, 2020 • edited Loading

musm commented Oct 29, 2020

jmert commented Oct 29, 2020

musm commented Dec 7, 2020

jmert commented Dec 9, 2020 • edited Loading

mkitti left a comment

Choose a reason for hiding this comment

musm commented May 31, 2022

mkitti commented May 31, 2022

mkitti commented May 31, 2022

jmert commented Oct 29, 2020 •

edited

Loading

jmert commented Dec 9, 2020 •

edited

Loading