<div>Hi Marc,</div>
<div> </div>
<div>Thanks for the pointer. Fortunately I don't have this problem. Probably 2GB of memory are enough to parse 20MB file even with this space leak. HXML still works better than the other libraries and has a nice API so I use it. The patch is useful but it is not applied. I also did some other changes in HXML. It seems like the library is not maintained. Should I package it and upload to Hackage?</div>
<div> </div>
<div>Regards,</div>
<div> Krasimir</div>
<div><br><br> </div>
<div class="gmail_quote">On Sat, Oct 25, 2008 at 2:14 AM, Marc A. Ziegert <span dir="ltr"><<a href="mailto:coeus@gmx.de">coeus@gmx.de</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="PADDING-LEFT: 1ex; MARGIN: 0px 0px 0px 0.8ex; BORDER-LEFT: #ccc 1px solid">there was a thread about xml parsing, one month ago.<br><<a href="http://thread.gmane.org/gmane.comp.lang.haskell.cafe/44708" target="_blank">http://thread.gmane.org/gmane.comp.lang.haskell.cafe/44708</a>><br>
well, i don't know much about xml, except what it looks like;<br>but i know about that interesting parsing problem behind it.<br>maybe Lev Walkin has fixed that in HXML. at least he wrote this patch...<br><<a href="http://article.gmane.org/gmane.comp.lang.haskell.cafe/45033" target="_blank">http://article.gmane.org/gmane.comp.lang.haskell.cafe/45033</a>><br>
<br>- marc<br><br><br>-----<br>sometimes i think, i should write a paper about it. but then... naah, i'm like haskell: non-strict.<br><br><br><br><br>Am Donnerstag, 23. Oktober 2008 schrieb Krasimir Angelov:<br>
<div>
<div></div>
<div class="Wj3C7c">> Hi,<br>><br>> Does some one have made performance tests on the different XML libraries for<br>> Haskell? I have a 20MB xml file that I want to read. I remember from my<br>> earlier experiments (years ago) that all libraries were too slow and were<br>
> consuming too much memory. I hoped that this situation had changed but maybe<br>> not. I looked at HaXML, libxml, HXML and HXT. HaXML eats a lot of memory and<br>> is still very slow. libxml is unfinished binding to the C library. Currently<br>
> it only allows to create documents. HXML seems to be very promising. It<br>> works fast and it doesn't eat memory. Unfortunately it is that it seems to<br>> be rather old. It uses its own Arrow and Tree libraries instead of the<br>
> standard libraries. I have not jumped into HXT yet because it seems to be<br>> very large library. Could someone recomend which one is the state of the<br>> art?<br>><br>> Best Regards,<br>> Krasimir<br>
><br></div></div></blockquote></div><br>