Histogrammar

GitHub Gists

Informal repository of usable snippets. Gists can be updated more frequently than tutorials.

StackOverflow

Question-and-answer site for rapid help. Use the [histogrammar] tag.

Installation

How to get and install Histogrammar.

Tutorials

How to make plot X from data Y.

Specification

How every primitive should behave in every language.

Introduction

Histogrammar is a package for creating histograms, with implementations in Python and Scala. It provides multiple histogram types, supports numeric and categorical features, and works with NumPy arrays as well as pandas and Spark dataframes.

Once a histogram is filled, it’s easy to plot it, store it in JSON format (and retrieve it), or convert it to NumPy arrays for further analysis.

At its core, Histogrammar is a suite of data aggregation primitives designed for use in parallel processing. In the simplest case, you can use these primitives to compute histograms, but their generality allows much more. Please see the examples for details. Enjoy!
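To make the idea of a parallel-friendly aggregation primitive concrete, here is a minimal pure-Python sketch. It does not use Histogrammar and is not its API: the class `BinSketch`, its constructor arguments, and its `fill` method are hypothetical names chosen for illustration. The sketch assumes that a primitive needs only two operations, filling with one datum and merging two partial results, so that each data partition can be aggregated independently and the partial results combined afterwards.

```python
class BinSketch:
    """Hypothetical fixed-width histogram aggregator (illustration only,
    not Histogrammar's actual API)."""

    def __init__(self, num, low, high, quantity=lambda x: x):
        self.num, self.low, self.high = num, low, high
        self.quantity = quantity      # extracts the number to bin from a datum
        self.counts = [0] * num       # one counter per bin
        self.underflow = 0            # values below `low`
        self.overflow = 0             # values at or above `high`

    def fill(self, datum):
        """Add one datum to the aggregate."""
        value = self.quantity(datum)
        if value < self.low:
            self.underflow += 1
        elif value >= self.high:
            self.overflow += 1
        else:
            width = (self.high - self.low) / self.num
            # Clamp to guard against floating-point rounding at the top edge.
            index = min(int((value - self.low) / width), self.num - 1)
            self.counts[index] += 1

    def __add__(self, other):
        """Merge two partial aggregates that share the same binning."""
        if (self.num, self.low, self.high) != (other.num, other.low, other.high):
            raise ValueError("binning must match to merge")
        merged = BinSketch(self.num, self.low, self.high, self.quantity)
        merged.counts = [a + b for a, b in zip(self.counts, other.counts)]
        merged.underflow = self.underflow + other.underflow
        merged.overflow = self.overflow + other.overflow
        return merged


data = [0.5, 1.5, 1.7, 2.2, 3.9, -1.0, 4.0, 2.5]

# Aggregate each partition on its own, as separate workers would.
partials = []
for chunk in (data[:4], data[4:]):
    part = BinSketch(num=4, low=0.0, high=4.0)
    for x in chunk:
        part.fill(x)
    partials.append(part)

# Combine the partial results into one histogram.
combined = partials[0] + partials[1]

# Reference: the same data aggregated in a single pass.
single = BinSketch(num=4, low=0.0, high=4.0)
for x in data:
    single.fill(x)

print(combined.counts, combined.underflow, combined.overflow)
```

Because merging is plain element-wise addition, the combined result matches the single-pass result regardless of how the data is partitioned, which is the property that lets this kind of aggregation run in parallel.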

Reporting bugs

Report implementation errors on the GitHub Issues tab of the corresponding language version.

Report errors in this documentation to histogrammar-docs/issues.

For developers

  • Travis-CI for the current build state of all Histogrammar repositories.