Optimise use of OpenGL #15

hex539 · 2019-12-26T21:04:36Z

Performance is OK when running as a standalone desktop app connected to HDMI. Still, things get a little bit choppy if casting the screen to another device at the same time, and this is one of the main use cases for the program.

On an i7-6770HQ iwth software rendering, the desktop app can do 4K60fps but the above flame graph shows that it spends a lot more time than it should rendering fonts and using OpenGL 1.1 immediate mode APIs.

Next steps are to:

Add a benchmark target for rendering N frames and then exiting.
Switch all of the glBegin/glEnd usage into VBOs.

hex539 · 2019-12-31T21:12:02Z

After optimising out the glEnable/glDisable calls, the GPU trace tool shows us the following time breakdown per 18ms frame, with and without particles enabled, respectively:

Calls highlighted in blue are glVertex2d (average time per call 147ns). If we go to the worst case for particles there are typically about 30k extra vertices to draw, which take about 9ms out of the 16ms budget (4.5ms from glVertex, 4.5ms from glColor).

Then we have about 5k other vertices being drawn with glVertex2d/glVertex2f and fewer glColor calls in 6ms, which typically we just barely meet.

So it makes sense to start by putting the particles in a vertex buffer and trace again to see if the impact is reduced enough by doing that.

No glDisable(GL_TEXTURE_2D) or glDisable(GL_BLEND) in the middle of rendering a frame. Improves performance- see issue #15

This is less expensive than making tens of thousands of calls per frame to glVertex2d(). Still less expensive would be moving to GPU simulation for particle effects, but I'm not sure if this is still worth it after taking into account the potential compatibility issues with old OpenGL versions on Windows systems. Improves performance. See issue #15

hex539 self-assigned this Dec 26, 2019

hex539 added a commit that referenced this issue Jan 2, 2020

Move all glEnable/glDisable calls to run at start

6e2ead0

No glDisable(GL_TEXTURE_2D) or glDisable(GL_BLEND) in the middle of rendering a frame. Improves performance- see issue #15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Optimise use of OpenGL #15

Optimise use of OpenGL #15

hex539 commented Dec 26, 2019

hex539 commented Dec 31, 2019

Optimise use of OpenGL #15

Optimise use of OpenGL #15

Comments

hex539 commented Dec 26, 2019

hex539 commented Dec 31, 2019