[Haskell-cafe] IO and State

Thu Nov 11 05:17:00 EST 2004

At 10:35 10/11/04 -0800, Iavor S. Diatchki wrote:
>Hello,
>Concurrency in Haskell (technically GHC) is much like concurrency in other 
>languages,
>in that you fork off threads and do stuff with them, etc. I think you are 
>perhaps thinking of another kind of concurrency, where different redexes in
>a term are evaluted simultaneously?

Indeed, I was.  I guess this begs the question of why one wants 
concurrency.  I can, offhand, see two reasons:

1. performance, especially in a multiprocessor environment.  In this case, 
simultaneous evaluation of redexes is the goal, without any visible change 
in program semantics.  (I see this as a possible way of achieving very high 
degrees of concurrency in an application, as individual concurrent 
activities can be very lightweight.)

2. coordination with multiple external processes, or between distinct 
computations.  In this case, I think all the concurrent activities would 
need to be (explicitly) in an IO monad.

I'm not sufficiently familiar with the specific interfaces used in your 
example to comment in detail, but when you say:

>In GHC the whole program stops when the main thread exits.
>So if the law I was talking about holds, this program should
>never terminate, as it will forever loop in 'reader'.
>However since there is a concurrent thread running that can modify
>the state, if 'reader' is interrupted between the two readSTRefs
>we will get different values for 'x' and 'y' and 'reader' will stop.
>I tried that in GHC and it stops after a little while, so the law does not 
>hold.

I think there may be a problem with the interaction between forking 
semantics.  and (non-)strictness in Haskell.  Essentially, (reader r) is a 
non-terminating computation.  If the main program requires the value of 
(reader r) to be fully evaluated, then I think the value of program itself 
must be undefined (_|_).  But if it never actually needs the value, it 
should be whatever the rest of the program does return.

If the state value is to be shared between the two branches of a fork, then 
I think it MUST be embedded in IO for the expected language semantics to be 
properly honoured.  IO, as I understand it, is *the* mechanism provided by 
Haskell that allows evaluation of an expression to depend on some activity 
that occurs outside that evaluation.

#g
--

>   Here is an example to illustrate what I was talking about
>(the point about stToIO making that one law unsafe)
>
> > import Control.Monad.ST
> > import Data.STRef
> > import Control.Concurrent
> > import Control.Monad(when)
>
> > reader     :: STRef r Int -> ST r ()
> > reader r    = do x <- readSTRef r
>                   y <- readSTRef r
>                   when (x == y) (reader r)
>
> > writer     :: Int -> STRef r Int -> ST r ()
> > writer n r  = do writeSTRef r n
>                   writer (n+1) r
>
> > main       :: IO ()
> > main        = do r <- stToIO (newSTRef 0)
>                   forkIO (stToIO (writer 1 r))
>                   stToIO (reader r)
>
>In GHC the whole program stops when the main thread exits.
>So if the law I was talking about holds, this program should
>never terminate, as it will forever loop in 'reader'.
>However since there is a concurrent thread running that can modify
>the state, if 'reader' is interrupted between the two readSTRefs
>we will get different values for 'x' and 'y' and 'reader' will stop.
>I tried that in GHC and it stops after a little while, so the law does not 
>hold.
>
>What is interesting is that I think the law does hold if
>we execute an ST computation using runST, so at least in principle
>GHC could perform this optimiziation in such situations.
>I think the law holds then, as I think no reference can escape to 
>concurrent threads,
>as if they did their region parameter would become "RealWorld", and so the 
>computation
>could not be runSTed.
>
>-Iavor
>
>
>
>
>
>
>
>
>
>
>
>
>
>Graham Klyne wrote:
>
>>At 10:38 08/11/04 -0800, Iavor S. Diatchki wrote:
>>
>>>... Now the above law already doesn't hold when all GHC extensions are used,
>>>as when concurrency is present we cannot assume that nobody modified the 
>>>state concurrently.
>>>As a result all pointers in GHC probably behave as if they were 
>>>"volatile", which is not very nice.
>>
>>
>>Eek!  I find this most worrisome.  And I'm not sure that I agree.
>>
>>I thought that part of the reason for having a monad that it was threaded 
>>in a useful way through the path of a *single* computation (expression 
>>evaluation).  If concurrent activities can change that, then I sense that 
>>they're breaking something quite fundamental in the way Haskell should work.
>>
>>e.g. in a sequence like:
>>
>>   v :: SomeMonad
>>   v = do { e1 ; e2 ; e3 }
>>
>>Then I think that exactly the monad created by e1 is passed to e2, and 
>>the result of e2 passed to e3, without any visible external interference 
>>under any circumstance.  Concurrency, as I understand it should apply to 
>>Haskell, would allow different elements of that computation to be 
>>performed in different threads/processes, but the overall result of the 
>>computation should not be changeable.  Put another way, the graph 
>>reduction model for evaluating a Haskell program should not change, just 
>>the mechanics actual processes (or processors) actually perform the 
>>reduction steps.
>>
>>Or am I really overlooking something here?
>>
>>#g
>>
>>
>>------------
>>Graham Klyne
>>For email:
>>http://www.ninebynine.org/#Contact

------------
Graham Klyne
For email:
http://www.ninebynine.org/#Contact