let strs = splitOn "," "bob,joe,nick"
Many interesting analysis techniques can be used on a large corpus of words to examine the structure of a sentence or the contents of a book.
let checksum = md5 file
To summarize an item into a small and typically fixed length value, we apply a hashing function to it. This chapter will cover the following recipes.
data Tree = Node v l r | Null
Everything from creating simple binary trees to practical applications such as Huffman trees are covered in this section.
type Graph = Table [Vertex]
A graph allows for representing network data such as social networks, biological gene relationship, and road topologies. Graphs are very common in data analysis and this chapter will cover some essential algorithms.
let (b, m) = linearRegression xs ys
This chapter contains recipes that answer questions about data deviation from the norm, existence of linear and quadratic trends, and probabilistic values of a network.
let clusters = kmeans points
Computer algorithms are becoming better and better at analyzing large data sets. As machines perform faster, so do their ability to detect interesting patterns in data.
a <- rpar task1
This chapter will cover parallel and concurrent design. Massive data analysis is a very real problem which this chapter will try to solve.
plot X11 Data2D [Color Red] [] pts
Visualizing data is important in all steps of data analysis. It is always useful to have an inutitive understanding so this chapter covers many ways to graph data.