Abstract. The following paper describes in brief the data set related to our project "Hydrochemical assessment of Semarang Groundwater Quality". All of 58 samples were taken in 1992, 1993, 2003, 2006, and 2007 using well point data from several reports from Ministry of Energy and Min- eral Resources and independent consultants. We provided 20 parameters in each samples (sample id, coord X, coord Y, well depth, water level, water elevation, TDS, pH, EC, K, Ca, Na, Mg, Cl, SO4, HCO3, year, ion balance, screen location, and chemical facies). The chemical composi- tion were tested in the Water Quality Laboratory, Universitas Diponegoro using mas spectrofotometer method.
The statistical treatment for the dataset (available on Zenodo doi:10.5281/zenodo.57293) were described as follows: (1) data preparation in to csv file format, load it in to R environment; (2) data treatment, including: correlation matrix, cluster analysis using kmeans and hierarchical cluster analysis, and principal component analysis. For anal- ysis and visualizations, We used the following R packages: ggplot2, dplyr, factomineR, factoExtra, cluster, ggcorrplot, and ape.
How to cite. Irawan, D. E. and Putranto, T. T.: Hydrochemical assessment of Semarang area using multivariate statistics: A sample based dataset, Earth Syst. Sci. Data Discuss. [preprint], https://doi.org/10.5194/essd-2016-29, 2016.
This paper is written as a part of our project in analyzing groundwater system in Semarang (Indonesia) area based on the water quality. Semarang is highly dense populated, serves as the capital of Mid Java Province. The aquifer in this area is a combination between volcanic and alluvium system. All data has been clarified and validate on-field. We applied free statistical package for later analysis. The code is given in this paper. Hopefully we can gain constructive comments to improve our work.
This paper is written as a part of our project in analyzing groundwater system in Semarang...