A leaf area index data set acquired in Sahelian rangelands of Gourma in Mali over the 2005–2017 period

. The leaf area index of Sahelian rangelands and related variables such as the vegetation cover fraction, the fraction of absorbed photosynthetically active radiation and the clumping index were measured between 2005 and 2017 in the Gourma region of northern Mali. These variables, known as climate essential variables, were derived from the acquisition and the processing of hemispherical photographs taken along 1 km linear sampling transects for ﬁve contrasted canopies and one millet ﬁeld. The same sampling protocol was applied in a seasonally inundated Acacia open forest, along a 0.5 km transect, by taking photographs of the understorey and the tree canopy. These observations collected over more than a decade, in a remote and not very accessible region, provide a relevant and unique data set that can be used for a better understanding of the Sahelian vegetation response to the current rainfall changes. The collected data can also be used for satellite product evaluation and land surface model development and validation. This paper aims to present the ﬁeld work that was carried out during 13 successive rainy seasons, the measured vegetation variables, and the associated open database. Finally, a few examples of data use are shown. DOI of the referenced data set:


Introduction
The Global Climate Observing System, GCOS, identified the leaf area index, LAI, as one of the main terrestrial essential climate variables, ECVs, to be monitored from systematic long-term satellite and in situ measurements (GCOS, 2011;Bojinski et al., 2014). The LAI corresponds to one-half of the total green leaf area per unit horizontal ground surface area (Watson, 1947;Chen and Black, 1992). Similarly, the plant area index, PAI, corresponds here to the above-ground areal extent of green vegetation. LAI or PAI is related both to the fraction of absorbed photosynthetic active radiation (0.4-0.7 µm) by the green vegetation, fAPAR (Myneni and Williams, 1994;Fensholt et al., 2006), and to the canopy green cover fraction, fCover, i.e. the amount of green vegetation distributed in a horizontal plane (Carlson and Ripley, 1997). These ECVs drive the fundamental physiological processes at leaf and canopy levels such as photosynthesis and transpiration, as well as the energy and mass exchanges between the surface and the atmosphere (Sellers, 1985). The seasonal variation of the ECVs, i.e. changes in the vegetation phenology, plays an important role in the modulation of energy, water and gas exchanges (Jarlan et al., 2008;Boulain et al., 2009), surface properties such as surface albedo (Samain et al., 2008;Guichard et al., 2009), and evaporation-transpiration partitioning (Wang et al., 2014). As such, these ECVs are key variables required in most land surface and biogeochemical models (Running and Gower, 1991;Potter et al., 1993;Wang et al., 2007) or production efficiency models (Mougin et al., 1995;Running et al., 2004;Tracol et al., 2006;McCallum et al., 2009).
The Sahel is a vast ecoclimatic and biogeographic region extending south of the Sahara over 6000 km long and 400-600 km wide, from the Atlantic Ocean to the Red Sea (Le Houérou, 1989). The climate is arid to semi-arid tropical with a 1-to 4-month single rainy season centred in July-August, and controlled by the West African monsoon (WAM) (Nicholson et al., 1998;Lebel et al., 2003). Annual rainfall ranges between approximately 100 mm in the northern edge of the Sahel to approximately 600 mm at the southern edge with the humid Sudanian savannahs. The vegetation comprises a herbaceous layer almost exclusively composed of annual plants, among which grasses dominate, and scattered stands of shrubs and low trees . During the short rainy season, the grass layer develops rapidly following the first rains which occur between late May and mid-June, and the herbaceous production is achieved within a few weeks, peaking between late August and mid-September (Breman and Cissé, 1977;Penning de Vries and Djitèye, 1982). Afterwards, at the end of the rainy season in September, the herbaceous canopy suddenly dries out and, consequently, the amount of green vegetation decreases rapidly with plant wilting (Hiernaux and Le Houérou, 2006). Herbage production, and therefore LAI, fCover and fAPAR values are characterized by high spatial and inter-annual variations that are related to the variability of the WAM. After a humid period in the 1950s, the Sahel experienced 26 consecutive years of below average rainfall (Nicholson et al., 1998;Lebel et al., 2009) that was punctuated by major droughts in 1972-1973 and 1983-1984 with dramatic consequences on the natural resources and the population (Stige et al., 2006).
The Sahel is not a very accessible region and in situ measurements of LAI, fAPAR and fCover are scarce and were mainly performed in millet fields (Hanan and Bégué, 1995;Levy and Jarvis, 1999;Soegaard et al., 1999;Boulain et al., 2009), with a few data acquired in fallows (Boulain et al., 2009) and rangelands (Mougin et al., 2014). Since 1998, moderate spatial-resolution (typically 0.5 km to 1 km resolution) sensors such as the Moderate Resolution Imaging Resolution Spectroradiometer (MODIS), the SPOT-VEGETATION, and the PROBA-V (François et al., 2014) instruments have been used to provide continuous estimations of the ECV variables at a global scale Baret et al., 2013;Verger et al., 2015). However, the proposed products still need to be evaluated using reliable field estimates Sea et al., 2011;Yan et al., 2016a, b;Xu et al., 2018a, b). In particular, several studies have pointed out the need to sample ecosystems like the semi-arid rangelands characterized by small, locally heterogeneous, and highly variable seasonally LAI values at the scale of moderate satellite resolution (e.g. Baret et al., 2006;Morisette et al., 2006). In 2000, the Sahelian Gourma sites have been integrated into the site network used in the Validation of Land European Remote Sensing Instruments (VALERI) project (Baret et al., 2005; http: //www.avignon.ina.fr/valeri/, last access: 13 May 2019; Camacho et al., 2013) and the Committee on Earth Observation Satellites (CEOS) Land Product Validation (Morisette et al., 2006; http://lpvs.gsfc.nasa.gov/, last access: 13 May 2019) projects. Among the selected validation sites, the Gourma site stands out as the one with the lowest spatial heterogeneity at the moderate resolution scale . This is a major advantage for validation studies because of the ground geolocation uncertainties of satellite products (Garrigues et al., 2008). In 2010, the Gourma site was also selected as the Sahelian pilot site of West Africa for the international "Tropical Biomes in Transition" project (TROBIT) (http://www.geog.leeds.ac.uk/TROBIT/, last access: 13 May 2019).
The main objective of the current field monitoring is to document the seasonal, inter-annual and decadal variations of LAI and thus primary productivity in relation to the rainfall variability, at different spatio-temporal scales. More precisely, this monitoring aims to provide relevant data to investigate how the WAM and its spatio-temporal variability affect the vegetation cover in central Sahel. This data set can also be used in the interpretation of Sahelian surface processes that are mainly modulated by the vegetation cover such as the partition between latent and sensible energy fluxes (Tim- ouk et al., 2009), and between dust and biogenic chemical emissions from soils (Pierre et al., 2012;Delon et al., 2015).
In the following, we describe the experimental protocol used for the estimation of vegetation variables, the associated open database, and show a few examples of data use.

Site description
The vegetation sites are located in north-east Mali, in the Gourma region which stretches from the loop of the Niger River southward down to the border with Burkina Faso. Since 2011, due to persistent security problems in the region, the monitored sites have been restricted to the vicinity of the town of Hombori (15.3 • N, 1.5 • W) within the 50 km × 50 km so-called super-site ( Fig. 1) of the African Multidisciplinary Monsoon Analysis (AMMA) -Cycle Atmosphérique et Cycle Hydrologique (CATCH) observing system (Lebel et al., 2009(Lebel et al., , 2010Mougin et al., 2009a;Gaillardet et al., 2018;Galle et al., 2018). At the national meteorological station of Hombori, the mean annual rainfall was 373 mm over the 1926-2017 period, and the mean annual temperature was 30.2 • C (1950-2010).
The LAI measures were carried out at the sites previously installed in 1984 and monitored till 1994 by the International Livestock Centre for Africa (ILCA) and by the Institut d'Economie Rurale (IER, Bamako) (Hiernaux et al., 1984;Hiernaux et al., 2009a, b), and reactivated by the AMMA-CATCH observing system during the AMMA project (Re-delsperger et al., 2006). These 1 km × 1 km sites were chosen within large and relatively homogeneous areas to sample the main vegetation types and canopies encountered within the super-site: herbaceous canopies on sandy soils, seasonally inundated Acacia open forest and its understorey herb layer in a plain of dominantly clay loam soils, and a seasonally inundated herbaceous canopy in a clayed loamy plain. In addition, a few data were acquired at an erosion surface site and in a crop millet field, for comparison purposes. Eventually, a total of eight sites on various soils, were retained, corresponding to nine vegetation canopies (Tables S1-S2 and Figs. S1-S2 in the Supplement). The selected vegetation canopies are as follows: one millet field on sandy soil: Bilantao (no. 41, Bi).

Field sampling strategy and data description
Before the set-up of a seasonal and inter-annual survey, two preliminary field campaigns took place in 1999 and 2000, followed since 2005 by a systematic monitoring of eight 1 km × 1 km sites, during the wet season from June to September. The temporal variation of in situ LAI, λ o , fCover and fAPAR were estimated through the analysis of digital hemispherical photographs Mougin et al., 2014). These photographs were collected during the growing season of herbaceous canopies or over the whole year for the forest canopy (Fig. 2a, c) and regularly taken along one or two perpendicular 1 km sampling transects crossing the site, perpendicular to the main orientation of the vegetation pattern that is perpendicular to the dune and interdune succession, usually in the north-south or the east-west directions (Table S3). Along the 1 km transects, this allows spatial heterogeneity to be taken into account. The choice of the sampling lines is described in Hiernaux et al. (2009a). At the seasonally inundated forest site, the unique field transect was limited to 0.5 km due to the difficulties associated with the field work in such an environment. At each site, the starting and ending points of the transects were georeferenced using global positioning systems (Garmin Trex©) to with approximately 10 m horizontal positional accuracy. . Black zones on the (a) and (b) images correspond to masked areas. The processing of hemispherical images is restricted to 0-60 • zenith angles to avoid mixed pixels. In the case of the forest canopy, note that the process could classify either vegetation elements like green herbaceous leaves (b) or non-vegetation elements like sky pixels (d).
At each sampling date, 100 or 50 hemispherical photographs were acquired at the 1 km herbaceous or 0.5 km forest sites, respectively, that is a picture taken every 10 m. At the forest site, photographs were acquired both in the upward and downward directions to sample the forest canopy and the herbaceous understorey. When the forest floor was inundated, which can occur during the rainy season, only the herbaceous vegetation component above the water surface was considered.
All the pictures were taken using a Nikon Coolpix 8400 (8 Mp) equipped with the fisheye FC-E9, viewing vertically downward above the herbaceous vegetation canopy with a distance between the top of the canopy and the lens of approximately 40 cm, providing a sampling area of approximately 1 m 2 (Mougin et al., 2014). At the Acacia forest site, the camera was mounted on a levelled tripod, and orientated upward at a height of approximately 0.8 m above the forest floor, and maintained horizontally using a bubble level.
Prior to the field campaigns, the camera was calibrated using the method described in http://www.avignon.inra.fr/can_ eye (last access: 13 May 2019) to compute the optical centre of the camera-fisheye lens system (Demarez et al., 2008). The collected hemispherical pictures were analysed using the image processing software CAN-EYE V5.0 which is dedicated to the classification of green and non-green vegetation elements and to series of photographs . The CAN-EYE software provides LAI estimates from gap fraction measurements derived from the hemispherical photographs (Fig. 2b, d), and it has been proved to be efficient to derive the vegetation variables of herbaceous (Demarez et al., 2008;Mougin et al., 2014) and forest (Sea et al., 2011) canopies.
The main outputs of the CAN-EYE software are the LAI, the aggregation factor or clumping index λ o , the vegetation cover, fCover, and the daily fraction of integrated absorbed photosynthetically active radiation, fAPAR (Table 1).
Finally, the estimated mean vegetation variables at the 1 km scale are computed by averaging all the 100 or 50 measurements acquired along the sampling transect for the herbaceous and forest canopy, respectively. The performance of the combined use of hemispherical photographs, and the CAN-EYE software for estimating LAI, λ o , fAPAR and fCover at the 1 km transect scale has been analysed by Mougin et al. (2014) for a herbaceous canopy. These variables are estimated at a spatial scale which is compatible with remotesensing observations performed by moderate resolution sensors, such as MODIS, SPOT-VEGETATION and PROBA-V. In particular, the clumping index brings valuable information on the temporal variation of the heterogeneity of grass cover, and can be retrieved from remote-sensing observations (Pisek et al., 2013;He et al., 2016). The accuracy associated to the field LAI estimates is approximately ±17.3 % for the herbaceous canopies (Mougin et al., 2014). Following the same approach and by taking into account the errors of classification and spatial sampling, the accuracy for the Acacia forest PAI is approximately ±36.5 % (this study).
Whenever possible, hemispherical photographs were taken approximately every 10 d during the growing seasons for the herbaceous canopies, whereas at the Kelma forest site, the monitoring took place approximately every 10 d during the leafy period, i.e. from July to January, and otherwise every month during the dry season. An illustration of the high seasonal temporal variation of LAI for five monitored herbaceous canopies, including the seasonally inundated herbaceous canopy, Kelma herbs, is given in Fig. 3 for the 2017 growing season. The strong temporal dynamics of the herba- ceous cover justifies the 10 d resolution sampling to precisely characterize the seasonal vegetation cycle, particularly the maximum LAI, LAI max , which is usually observed towards the end of the rainy season. Afterwards, on sandy soils the herbaceous green canopy composed of annual plants, rapidly dried out. At the forest site, on clay soils temporarily flooded, the understorey did not dry as fast and is partly composed of a perennial grass Sporobolus helvolus. Large differences in LAI max values are observed among sites depending on soil fertility, soil water budget, intensity of grazing by livestock, and length of soil inundation in the clayed sites in depressions (no. 21 and no. 21b). In 2017, maximum LAI values, LAI max , ranged between 0.43 at the seasonally flooded Kelma site, and 1.58 at the Agoufou sandy site.
The magnitude of the inter-annual variability in LAI for herbaceous canopies observed over the 13-year monitoring period is quite high (see the example given in Fig. 4a), with temporal coefficients of variation higher than 40 % for the herbaceous sites, and up to approximately 100 % for the forest understorey (

Database and data availability
The data set presented in this paper is part of the AMMA-CATCH observatory database (http://www.amma-catch. org, last access: 13 May 2019) and is identified by the permanent identifier https://doi.org/10.17178/AMMA-CATCH.CE.Veg_Gh (AMMA-CATCH, 2005), which describes its content with standardized metadata (DataCite 1 ) and gives access instructions. Table 3 shows the data availability included in this data set: it covers the 2005-2017 period and is given for the 13 monitored transects. The dates of hemispherical photograph acquisitions are given in Table S4. Over the 13-year period, a total of 658 sampling transects have been monitored, and approximately 52 000 hemispherical images have been acquired, processed, and analysed. Associated data such as herbaceous aboveground masses, CO 2 , sensible and latent flux measurements, meteorological observations and soil moisture measurements collected at the Agoufou, Kelma and Eguerit sites can also be found in the AMMA-CATCH database.
The data are available online, free of charge, via the web AMMA-CATCH database portal: http://bd.amma-catch.org (last access: 13 May 2019) and without any barriers except a registration. Registration is only required to download the data but not to browse the content of the data sets (metadata including variables names, spatial extent and coordinates, temporal extent, contacts, etc.), or to retrieve their content in a ISO 19115 2 standard file, or to localize on the map the acquisition sites. To access data to download, once connected to the portal, search for the data set by identifying it by its doi suffix (CE.Veg_Gh) or title among all the data sets contained in the database, and unfold the tab describing it. The database architecture also allows to extract a subset of the data at the spatio-temporal or variable level. The data files are provided in two formats: ASCII csv and NetCDF.

Use of the data set
In addition to the documentation of the seasonal, inter-annual and decadal variations of vegetation variables in relation to rainfall variability, we present two examples of data use in validation exercises of remote-sensing products and surface models.

Validation of satellite products
Validation of existing LAI products is still a challenge (Xu et al., 2018a, b) and a homogeneous long-term LAI data set, based on an unchanged field protocol during more than a decade, is certainly of valuable interest with the objectives of validating satellite products (Fang et al., 2012). The spatial sampling along transects of 1 km, as performed during the present monitoring, has also been highly recommended for field-based validation of moderate resolution products (Baret et al., 2005;Sea et al., 2011). The temporal resolution of the sampling, i.e. 10 d, should allow LAI products to be accu-rately evaluated. Moreover, the corrections of the clumping effects performed using the estimated aggregation factor, enable another source of uncertainty to be considered, linked to the non-random green components distribution within the canopy (Chen et al., 1997;Kucharik et al., 1998).
Preliminary LAI and fCover 1 km estimates at the Timbadior site, with a different technique, have already been used in direct validation exercises using the SPOT-VEGETATION products within the frame of the VALERI project (Baret et al., 2005;Weiss et al., 2007;Fang et al., 2012), the MODIS products , and the Global LAnd Surface Satellite, GLASS, fCover products (Xiao et al., 2016).

Validation of land surface models
The LAI data set was also used to evaluate simulations by dynamic vegetation models, particularly those used in the AMMA Land Surface Models Intercomparison Exercise (ALMIP, Boone et al., 2009a, b;Grippa et al., 2017). This project was carried out to evaluate the capability of different state of the art land surface models to reproduce surface processes in West Africa. The second phase of ALMIP focused on the three mesoscale sites in Benin, Niger and Mali, instrumented by the AMMA-CATCH observatory (Galle et al., 2018) that provides specific data for forcing and evaluating model outputs. An example of the comparison between in situ LAI and simulated LAI by the Sahelian-Transpiration-Evaporation-Productivity model, STEP (Mougin et al., 1995;Pierre et al., 2016), for the ALMIP-2 runs is reported in Fig. 6. Both the seasonal and the inter-annual variability in LAI, including water stress events such as that happening in 2008, are well reproduced by STEP. However, the comparison with field data shows that the model response is a bit smoother than in the in situ data set. In the latter case, the MODIS LAI product is compared to the sum of the tree PAI, and the LAI of the understorey.

Conclusions
A 13-year consistent LAI database was built for the 2005-2017 period, in a not easily accessible and weakly sampled semi-arid region, in central Sahel. The data set includes additional essential climatic variables such as the vegetation cover fraction, the fraction of absorbed photosynthetically active radiation, and the clumping index. All these variables were estimated from digital hemispherical photographs taken along 0.5 or 1 km transects crossing different sites in the Gourma region in north-east Mali. The same processing software, CAN-EYE, was used for image classification, gap fraction analysis and LAI derivation providing a homogeneous Figure 6. Comparison between field LAI measurements (+) at the Agoufou site and simulated LAI (-) using the STEP model (2006)(2007)(2008) during the ALMIP-2 exercise (Boone et al., 2009a, b). and consistent data set. Among others, this data set can be used to validate moderate-resolution satellite products, to establish relationships between satellite vegetation indices and vegetation variables, to evaluate land surface models, and to interpret surface fluxes. More specifically, this data set can be useful to interpret the current greening trend that has been observed widely over the Sahel since the 1980s drought.
Author contributions. Data sets were collected by EM, MOD, NS, AM, VD and PH. Data sets were prepared by MOD and EM with the help of VC, who oversees the AMMA-CATCH database. All co-authors contributed to the criticizing of the individual data sets and contributed to the writing of the paper.
sets the conditions of the data use. For any publication using AMMA-CATCH data, depending on the contribution of the data to the scientific results obtained, data users should either propose co-authorship to the principal investigators or at least acknowledge their contribution. Many thanks are expressed to the numerous people who participated in the field data acquisition: Tarawetti Ag Mostafa from Hombori, Hamma Maïga from Agoufou, Youssouf Maïga (deceased) from Gao, Alexis Berg, Marie-Noëlle Mulhaupt, Téri Massa-Mabilo and Yann Tracol. We thank Anne-Marie Cousin for her help with the figures. The two reviewers and the topical editor who provided helpful suggestions and comments on earlier versions of the paper are gratefully acknowledged.
Review statement. This paper was edited by Falk Huettmann and reviewed by two anonymous referees.