[Haskell-cafe] Maybe a, The Rationale

Sun May 11 22:20:32 EDT 2008

On 12 May 2008, at 1:52 am, Brandon S. Allbery KF8NH wrote:
> My real point was that in the C programming culture it was/is far  
> too common to use an in-band value; that is, one that could be  
> confused with or treated as a valid response:  null pointers,  
> stdio's EOF (= -1).

Here I must disagree.  I've hacked character-level I/O in lots of  
programming languages (the
last time I counted I'd used more than 100), and C was the first  
language I ever met that
made it easy, precisely BECAUSE the perfectly normal "there are no  
more characters" situation
was handled the same way as every other outcome.

>  This just causes problems because code is almost encouraged to  
> ignore the special cases.
>  For example, the ctype macros have to support being passed EOF.

So they do, but it is elementary to do so.  The only reason there is  
anything even
remotely unusual there is that the *same* functions are used in C for  
*character* input
and *byte* input.  I'll grant you that you probably don't want to  
process binary input
using quite the same quasi-FSA code that you want for characters.   
Since C uses NUL for
terminating strings, and since ASCII made it clear that NUL was never  
ever *supposed* to
appear in text, NUL would have been the perfect choice for character  
EOF, and in that
case there would never have been anything odd about having the ctype  
macros handle it.

I've been writing some Smalltalk recently, which uses a Pascal-like  
convention
	aStream atEnd			"test for EOF"
	  ifTrue: [self handleEOF]
	  ifFalse: [self handleCharacter: aStream next]
and the EOF tests clutter up the code inordinately and make it so  
painful that C
starts looking good again.

The C approach here has several benefits:
  - you can *postpone* checking for EOF until after you have checked for
    other things; since EOF is seldom or never what the code is mainly
    *about* this is good for clarity
  - if you want to know "is the next character one of these" you have  
only
    two cases to deal with at that point (yes and no), not three (yes,  
no,
    and you-idiot-you-forgot-to-test-for-EOF-first-and-testing-for-EOF- 
is-
    the-most-important-thing-anybody-could-be-interested-in-when- 
reading).

>  Maybe types force you to deal with it, while simultaneously  
> providing convenience functions to help you deal with it.

I readily grant that Maybe is a wonderful wonderful thing and I use it  
freely and
voluntarily.  BUT it should not dominate the code.

Consider Haskell's getChar and hGetChar.  They DON'T return Maybe  
Char; they
raise an exception at end of file.  You have to keep testing isEOF/ 
hIsEOF before
each character read, as if we had learned nothing since Pascal.
Arguably, maybeGetChar :: IO (Maybe Char) and hMaybeGetChar :: Handle - 
 > IO (Maybe Char)
would be a good idea, at least then one could easily set up some  
combinators to
deal with this nuisance.

>
>
> -- 
> brandon s. allbery [solaris,freebsd,perl,pugs,haskell] allbery at kf8nh.com
> system administrator [openafs,heimdal,too many hats] allbery at ece.cmu.edu
> electrical and computer engineering, carnegie mellon university     
> KF8NH
>
>
> _______________________________________________
> Haskell-Cafe mailing list
> Haskell-Cafe at haskell.org
> http://www.haskell.org/mailman/listinfo/haskell-cafe
>

--
"I don't want to discuss evidence." -- Richard Dawkins, in an
interview with Rupert Sheldrake.  (Fortean times 232, p55.)