[Haskell-cafe] Estimating the time to garbage collect

Duncan Coutts duncan.coutts at worc.ox.ac.uk
Mon May 4 07:51:53 EDT 2009


On Fri, 2009-05-01 at 09:14 +0100, Neil Davies wrote:
> Hi
> 
> With the discussion on threads and priority, and given that (in  
> Stats.c) there are lots of useful pieces of information that the run  
> time system is collecting, some of which is already visible (like the  
> total amount of memory mutated) and it is easy to make other measures  
> available - it has raised this question in my mind:
> 
> Given that you have access to that information (the stuff that comes  
> out at the end of a run if you use +RTS -S) is it possible to estimate  
> the time a GC will take before asking for one?
> 
> Ignoring, at least for the moment, all the issues of paging, processor  
> cache occupancy etc, what are the complexity drivers for the time to GC?
> 
> I realise that it is going to depend on things like, volume of data  
> mutated, count of objects mutated, what fraction of them are live etc  
> - and even if it turns out that these things are very program specific  
> then I have a follow-on question - what properties do you need from  
> your program to be able to construct a viable estimate of GC time from  
> a past history of such garbage collections?

Would looking at statistics suffice? Treat it mostly as a black box.
Measure all the info you can before and after each GC and then use
statistical methods to look for correlations to see if any set of
variables predicts GC time.

Duncan



More information about the Haskell-Cafe mailing list