Difference between revisions of "Applications and libraries/Bioinformatics"

From HaskellWiki
Jump to navigation Jump to search
(+polyFARM)
(→‎The Biohaskell library and applications: ADPFusion is now redir to CHvS)
 
(7 intermediate revisions by 4 users not shown)
Line 1: Line 1:
  +
== The [http://www.biohaskell.org/ Biohaskell library] and applications ==
== Ketil Malde's Bioinformatics stuff ==
 
   
  +
There are [http://biohaskell.org/Libraries three main kinds of the libraries] within BioHaskell project:
;[http://www.ii.uib.no/~ketil/bioinformatics Ketil's bioinformatics pages]
 
:Some tools, implemented in Haskell, are available from this site
 
   
  +
* sequence and alignment processing -- see [http://malde.org/~ketil/biohaskell/ Ketil's bioinformatics] [http://www.ii.uib.no/~ketil/bioinformatics/ pages], and [http://blog.malde.org his blog]. In particular:
* xsact - a sequence clustering tool (currently in revision 1.5).
+
** xsact - a sequence clustering tool (currently in revision 1.5).
* xtract - a sequence assembly tool which uses an interesting algorithm based on de Bruijn graphs, but unfortunately is hard to tune to give optimal results. Unless you plan to fix it, use CAP3 instead.
+
** xtract - a sequence assembly tool which uses an interesting algorithm based on de Bruijn graphs, but unfortunately is hard to tune to give optimal results. Unless you plan to fix it, use CAP3 instead.
* rbr - a sequence masking tool, masking ESTs based on a statistical measure of word frequencies.
+
** rbr - a sequence masking tool, masking ESTs based on a statistical measure of word frequencies.
* rselect - a simple and efficient tool for extracting random sets of sequences
+
** rselect - a simple and efficient tool for extracting random sets of sequences
  +
** [http://hackage.haskell.org/package/blastxml blastxml] - parser for Blast XML data output.
  +
** [http://hackage.haskell.org/package/blastHTTP blastHTTP] - tool to query [http://blast.ncbi.nlm.nih.gov/ NCBI Blast service]
   
  +
* RNA secondary structure tools, some of them using [http://www.tbi.univie.ac.at/~choener/adpfusion/ ADPFusion framework].
* A bioinformatics library, currently containing bits and pieces needed for the above programs. The main focus at the moment is on nucleotide sequences.
 
   
  +
* Structural libraries:
  +
** [http://hackage.haskell.org/package/hPDB hPDB] - fastest parallel Protein Databank parser according to [http://www.biomedcentral.com/1756-0500/6/483/abstract this benchmark].
  +
** [http://hackage.haskell.org/package/parsestar parseSTAR] - parser for [http://www.bmrb.wisc.edu Biological Magnetic Resonance Bank] data.
  +
** [http://hackage.haskell.org/package/hTalos hTalos] - parser for [http://spin.niddk.nih.gov/NMRPipe/talos/ TALOS+] output.
   
 
== PolyFARM ==
 
== PolyFARM ==
   
;[http://www.aber.ac.uk/compsci/Research/bio/dss/polyfarm/ PolyFARM]
+
;[http://www.aber.ac.uk/en/cs/research/cb/dss/polyfarm/ PolyFARM]
 
:PolyFARM (Poly-machine First-order Association Rule Mining) is a data mining program which finds first order associations in Datalog data. It is based on the ideas from WARMR (L. Dehaspe and L. De Raedt (1997) Mining Association Rules in Multiple Relations. It is described in Clare, A. and King R.D. (2003) "Data mining the yeast genome in a lazy functional language."
 
:PolyFARM (Poly-machine First-order Association Rule Mining) is a data mining program which finds first order associations in Datalog data. It is based on the ideas from WARMR (L. Dehaspe and L. De Raedt (1997) Mining Association Rules in Multiple Relations. It is described in Clare, A. and King R.D. (2003) "Data mining the yeast genome in a lazy functional language."
  +
  +
== Other ==
  +
  +
* [http://people.cs.missouri.edu/~harrisonwl/ Bill Harrison]
  +
* [http://www.techfak.uni-bielefeld.de/ags/pi/ Robert Giegerich's group (AGPI) at Bielefeld University]
   
 
{{LibrariesPage}}
 
{{LibrariesPage}}

Latest revision as of 18:30, 6 December 2013

The Biohaskell library and applications

There are three main kinds of the libraries within BioHaskell project:

  • sequence and alignment processing -- see Ketil's bioinformatics pages, and his blog. In particular:
    • xsact - a sequence clustering tool (currently in revision 1.5).
    • xtract - a sequence assembly tool which uses an interesting algorithm based on de Bruijn graphs, but unfortunately is hard to tune to give optimal results. Unless you plan to fix it, use CAP3 instead.
    • rbr - a sequence masking tool, masking ESTs based on a statistical measure of word frequencies.
    • rselect - a simple and efficient tool for extracting random sets of sequences
    • blastxml - parser for Blast XML data output.
    • blastHTTP - tool to query NCBI Blast service

PolyFARM

PolyFARM
PolyFARM (Poly-machine First-order Association Rule Mining) is a data mining program which finds first order associations in Datalog data. It is based on the ideas from WARMR (L. Dehaspe and L. De Raedt (1997) Mining Association Rules in Multiple Relations. It is described in Clare, A. and King R.D. (2003) "Data mining the yeast genome in a lazy functional language."

Other

This page contains a list of libraries and tools in a certain category. For a comprehensive list of such pages, see Applications and libraries.