Data munging or data wrangling is loosely the process of manually converting or mapping data from one “raw” form into another format that allows for more convenient consumption of the data with the help of semi-automated tools. This may include further munging, data visualization, data aggregation, training a statistical model, as well as many other potential uses. Data munging as a process typically follows a set of general steps which begin with extracting the data in a raw form from the data source, “munging” the raw data using algorithms (e.g. sorting) or parsing the data into predefined data structures, and finally depositing the resulting content into a data sink for storage and future use. Given the rapid growth of the internet such techniques will become increasingly important in the organization of the growing amounts of data available. (wikipedia)
Introduction The Comma Separated File format, or just CSV, especifies a data type store in a tabular manner. Each line represent a record, and each field is delimited by a comma or another separator character. This format is accepted by popular tools like Excel, LibreOffice and Google Docs, and by most of programming languages. In […]
Leading media organizations are using Datawrapper, a simple and free tool, to improve their storytelling this way. Datawrapper is an interactive chart app that can be embedded into stories such as this one, which shows the Nigerian government’s proposed budget in 10 charts. No related posts.
Open Refine (previously Google Refine) is a data cleaning software that uses your web browser as an interface. This means it will look like it runs on the internet but all your data remains on your machine and you do not need internet connection to work with it. No related posts.
No tutorials found.
Accessing useful data about the status of our planet’s environment is often a struggle. From with the variety of sources to the diversity of formats and tools out there, journalists interested in using data in their environmental reporting often spend more time finding and downloading the information than actually analyzing it. Over the past few […]