Personal tools

UTF-8

From HaskellWiki

(Difference between revisions)
Jump to: navigation, search
(question - what about other string encodings?)
 
(One intermediate revision by one user not shown)
Line 3: Line 3:
 
The simplest solution seems to be to use the [http://hackage.haskell.org/cgi-bin/hackage-scripts/package/utf8-string utf8-string package] from Galois. It
 
The simplest solution seems to be to use the [http://hackage.haskell.org/cgi-bin/hackage-scripts/package/utf8-string utf8-string package] from Galois. It
 
provides a drop-in replacement for System.IO
 
provides a drop-in replacement for System.IO
  +
  +
''What about other string encodings?''
   
 
== Example ==
 
== Example ==
+
If we use a function from System.IO.UTF8, we should also hide the equivalent one from the Prelude. (Alternatively, we could import the UTF8 module qualified)
This example reverses every line in a file (saving the results in the file + ".rev")
 
 
If we use a function from <code>System.IO.UTF8</code>, we should also hide the equivalent one from the Prelude. (Alternatively, we could import the UTF8 module qualified)
 
   
 
<haskell>
 
<haskell>
 
> import System.IO.UTF8
 
> import System.IO.UTF8
 
> import Prelude hiding (readFile, writeFile)
 
> import Prelude hiding (readFile, writeFile)
  +
> import System.Environment (getArgs)
 
</haskell>
 
</haskell>
   
Line 19: Line 22:
 
> do args <- getArgs
 
> do args <- getArgs
 
> mapM_ reverseUTF8File args
 
> mapM_ reverseUTF8File args
+
 
> reverseUTF8File :: FilePath -> IO ()
 
> reverseUTF8File :: FilePath -> IO ()
 
> reverseUTF8File f =
 
> reverseUTF8File f =
> do f <- readFile f
+
> do c <- readFile f
> writeFile (f ++ ".rev) $ reverseLines f
+
> writeFile (f ++ ".rev") $ reverseLines c
 
> where
 
> where
 
> reverseLines = unlines . map reverse . lines
 
> reverseLines = unlines . map reverse . lines

Latest revision as of 02:22, 22 July 2008


The simplest solution seems to be to use the utf8-string package from Galois. It provides a drop-in replacement for System.IO

What about other string encodings?

[edit] Example

If we use a function from System.IO.UTF8, we should also hide the equivalent one from the Prelude. (Alternatively, we could import the UTF8 module qualified)

> import System.IO.UTF8
> import Prelude hiding (readFile, writeFile)
> import System.Environment (getArgs)

The readFile and writeFile functions are the same as before...

> main :: IO ()
> main =
>  do args <- getArgs
>     mapM_ reverseUTF8File args
 
> reverseUTF8File :: FilePath -> IO ()
> reverseUTF8File f =
>   do c <- readFile f
>      writeFile (f ++ ".rev") $ reverseLines c
>   where
>     reverseLines = unlines . map reverse . lines