139,99 €
This book, presented in three volumes, examines �environmental� disciplines in relation to major players in contemporary science: Big Data, artificial intelligence and cloud computing. Today, there is a real sense of urgency regarding the evolution of computer technology, the ever-increasing volume of data, threats to our climate and the sustainable development of our planet. As such, we need to reduce technology just as much as we need to bridge the global socio-economic gap between the North and South; between universal free access to data (open data) and free software (open source). In this book, we pay particular attention to certain environmental subjects, in order to enrich our understanding of cloud computing. These subjects are: erosion; urban air pollution and atmospheric pollution in Southeast Asia; melting permafrost (causing the accelerated release of soil organic carbon in the atmosphere); alert systems of environmental hazards (such as forest fires, prospective modeling of socio-spatial practices and land use); and web fountains of geographical data. Finally, this book asks the question: in order to find a pattern in the data, how do we move from a traditional computing model-based world to pure mathematical research? After thorough examination of this topic, we conclude that this goal is both transdisciplinary and achievable.
Sie lesen das E-Book in den Legimi-Apps auf:
Seitenzahl: 351
Veröffentlichungsjahr: 2020
Cover
Preface
PART 1: Integrated Analysis in Geography: The Way to Cloud Computing
Introduction to Part 1
1 Geographical Information and Landscape, Elements of Formalization
2 Sampling Strategies
2.1. References
3 Characterization of the Spatial Structure
4 Thematic Information Structures
5 From the Point to the Surface, How to Link Endogenous and Exogenous Data
5.1. References
6 Big Data in Geography
Conclusion to Part 1
PART 2: Basic Mathematical, Statistical and Computational Tools
7 An Introduction to Machine Learning
7.1. Predictive modeling: introduction
7.2. Bayesian modeling
7.3. Generative versus discriminative models
7.4. Classification
7.5. Evaluation metrics for classification evaluation
7.6. Cross-validation and over-fitting
7.7. References
8 Multivariate Data Analysis
8.1. Introduction
8.2. Principal component analysis
8.3. Multiple correspondence analysis
8.4. Clustering
8.5. References
9 Sensitivity Analysis
9.1. Generalities
9.2. Methods based on linear regression
9.3. Morris method
9.4. Methods based on variance analysis
9.5. Conclusion
9.6. References
10 Using R for Multivariate Analysis
10.1. Introduction
10.2. Principal component analysis
10.3. Multiple correspondence analysis
10.4. Clustering
10.5. References
PART 3: Computer Science
11 High Performance and Distributed Computing
11.1. High performance computing
11.2. Systems based on multi-core CPUs
12 Introduction to Distributed Computing
12.1. Introduction
12.2. References
13 Towards Cloud Computing
13.1. Introduction
13.2. Service model
13.3. Deployment model
13.4. Behind the hood, a technological overview
13.5. Conclusion
13.6. References
14 Web-Oriented Architecture – How to design a RESTFull API
14.1. Introduction
14.2. Web services
14.3. Web-Oriented Applications – Microservice applications
14.4. WSDL example
14.5. Conclusion
14.6. References
15 SCALA – Functional Programming
15.1. Introduction
15.2. Functional programming
15.3. Scala
15.4. Rational
15.5. Why immutability matters?
15.6. Conclusion
15.7. References
16 Spark and Machine Learning Library
16.1. Introduction
16.2. Spark
16.3. Spark machine learning library
16.4. Conclusion
17 Database for Cloud Computing
17.1. Introduction
17.2. From myGlsrdbms to NoSQL
17.3. NoSQL database storage paradigms
17.4. SQL versus NoSQL, the war will not take place
17.5. Example: a dive into MongoDB
17.6. Conclusion
17.7. References
18 WRF Performance Analysis and Scalability on Multicore High Performance Computing Systems
18.1. Introduction
18.2. The weather research and forecast model and experimental set-up
18.3. Architecture of multicore HPC system
18.4. Results
18.5. Conclusion
18.6. References
List of Authors
Index
Summary of Volume 2
Summary of Volume 3
End User License Agreement
Preface
Figure P.1. At the beginnig of AI, Dartmouth Summer Research Project, 1956. Sour...
Figure P.2. From freehand potatoid to the cloud icon. The first figure is a sche...
Figure P.3. The heart of TORUS, partnership between Asia and Europe. For a color...
Introduction to Part 1
Figure I.1. In the early days of the “Systemic Landscape” (modified from Brossar...
Chapter 1
Figure 1.1. Elements of formalization of the landscape system. For a color versi...
Chapter 2
Figure 2.1. Sampling the geographical area
Figure 2.2. Illustration of a survey based on a four-level survey (modified from...
Figure 2.3. Sampling the terrain, a probabilistic procedure (bottom) is not alwa...
Figure 2.4. Estimated abundance/dominance and minimum area according to Braun an...
Figure 2.5. Nesting Observation Scales and Sampling (modified from Rhoné 2015, i...
Figure 2.6. Main data types
Figure 2.7. Mapping associated with the field survey, the spatial dimension of i...
Figure 2.8. Map collections to find what makes sense in the geographical area. F...
Chapter 3
Figure 3.1. Texture differences and space co-occurrence matrices (from Whingham ...
Figure 3.2. The main texture indices calculated from space co-occurrence matrice...
Figure 3.3. Texture analysis applied to the biotope of Tetrao urogallus (Image S...
Figure 3.4. Texture analysis to identify areas at risk of HAT. For a color versi...
Figure 3.5. Quantifying the heterogeneity of a landscape (source: ESRI). For a c...
Figure 3.6. Segmentation of a panchromatic image, but which threshold value to u...
Chapter 4
Figure 4.1. Methodological chain of information processing
2
. For a color version...
Figure 4.2. Projection of individuals (red) and modalities (blue) on the first t...
Figure 4.3. Projection of classes in the factorial and geographical space (accor...
Chapter 5
Figure 5.1. Bayes formula seen as the superposition of two decision trees (sourc...
Figure 5.2. Occurrence frequency matrices of classes and categories, i.e. the co...
Figure 5.3. Flow chart of the landscape data analysis (modified from Brossard, 1...
Figure 5.4. Frequencies of appearance of the modalities of the variables for a c...
Figure 5.5. From the point to the surface or how to link endogenous and exogenou...
Figure 5.6. From the point to the surface or how to link endogenous and exogenou...
Figure 5.7. From the point to the surface or how to link endogenous and exogenou...
Figure 5.8. From the prototype to an operational chain – point, trace, order and...
Chapter 6
Figure 6.1. S-curve of Peter Norvig (modified from Norvig P., “Computer Science,...
Figure 6.2. King’s Bay and Lovén glacier, a place of experimentation. For a colo...
Figure 6.3. SPOT 5 take 5, only three images are available on one year. For a co...
Figure 6.4. Sequential tasks of a data framework. For a color version of this fi...
Figure 6.5. From the identification of services to their dynamic orchestration i...
Figure 6.6. Example of Google Earth Exchange environment. For a color version of...
Conclusion to Part 1
Figure C.1. The feeling of landscape (modified from Schuiten F. and Peeters B., ...
Figure C.2. The ideal of geoscience in the age of Big Data and cloud computing. ...
Figure C.3. Datajournalism or how journalists have identified the murderers with...
Chapter 7
Figure 7.1. Left: the margin For a decision boundary is the distance to the near...
Chapter 8
Figure 8.1. Different types of graphs for quantitative and qualitative variables...
Figure 8.2. Source: J.P. Fenelon
Figure 8.3. Illustration of the principal axes. For a color version of this figu...
Figure 8.4. Percentage of variance explained by the principal axes. For a color ...
Figure 8.5. The principal components of the countries on the first two axes. For...
Figure 8.6. Variable map on U
1
and U
2
Figure 8.7. Vector projections of the subspace generated by U
1
and U
2
. For a col...
Figure 8.8. Result of PCA with a new country
Figure 8.9. Projection of the data point cloud on axis 1 and axis 2. If two data...
Figure 8.10. Projection of the level cloud on axis 1 and axis 2. For a color ver...
Figure 8.11. Variable results on the X
1
X
2
plane for PCA (left) and MCA (right). ...
Figure 8.12. Biplot with data points and levels obtained with the dataset presen...
Figure 8.13. Percentage of inertia explained by the axes 8.4. Clustering
Figure 8.14. Three clusters for the countries. For a color version of this figur...
Figure 8.15. Illustration of the three distances. For a color version of this fi...
Figure 8.16. For a color version of this figure, see www.iste.co.uk/laffly/torus...
Figure 8.17. Variance decomposition. For a color version of this figure, see www...
Figure 8.18. Elbow rule for selecting the number of clusters. For a color versio...
Figure 8.19. Steps of the k-means algorithm
Figure 8.20. Influence of outliers on the clusters
Figure 8.21. Dendrogram. For a color version of this figure, see www.iste.co.uk/...
Figure 8.22. Graphics to determine the number of clusters for two different kind...
Chapter 9
Figure 9.1. Evolution of the ratio of the hypercube coverage by the hypersphere
Figure 9.2. Map of sensitivity analysis methods [IOO 15]
Figure 9.3. Effects of each variable
Figure 9.4. Effect according to variability. For a color version of this figure,...
Figure 9.5. Morris’ OAT method: point trajectory. For a color version of this fi...
Figure 9.6. Variable effect according to the couple (mean, standard deviation). ...
Figure 9.7. Graphical overview of the design. For a color version of this figure...
Figure 9.8. Graphical overview of each variable effect
Figure 9.9. Sobol’ method. For a color version of this figure, see www.iste.co.u...
Figure 9.10. Sobol” indices for the Ishigami function. For a color version of th...
Figure 9.11. Sobol’ indices experimental design example. For a color version of ...
Figure 9.12. Experimental design illustration. For a color version of this figur...
Chapter 10
Figure 10.1. RStudio Interface. For a color version of this figure, see www.iste...
Figure 10.2. French cities. For a color version of this figure, see www.iste.co....
Figure 10.3. Package installation. For a color version of this figure, see www.i...
Figure 10.4. Variable map for the principal plan Axes 1 and 2
Figure 10.5. Individual map for the principal plan Axes 1 and 2
Figure 10.6. Correlation circle (a) with or (b) without Biarritz
Figure 10.7. PCA results with or without Error: (a) and (b) are correlation circ...
Figure 10.8. Results of the multiple correspondence analysis. For a color versio...
Figure 10.9. R
2
vs number of clusters
Figure 10.10. Results of AHC: (a) dendrogram; (b) dissimilarity criterion vs. it...
Figure 10.11. Individual map for Axes 1 and 2 with three different colors for th...
Figure 10.12. Results for the clustering with the two first components. For a co...
Figure 10.13. MCA with CLIMAT and CLUSTER. For a color version of this figure, s...
Chapter 11
Figure 11.1. Evolution of processors developed over the last decades. For a colo...
Chapter 12
Figure 12.1. Schematic view of a system based on a distributed operating system ...
Figure 12.2. Schematic view of a system based on a middleware [VAN 07]
Figure 12.3. A schematic view of Beowulf cluster architecture [VAN 16]
Figure 12.4. Grid layers architecture [GRI 19]. For a color version of this figu...
Chapter 13
Figure 13.1. Client/Provider repartition according to aaS models. For a color ve...
Figure 13.2. Cloud taxonomy. For a color version of this figure, see www.iste.co...
Figure 13.3. Hardware of Cloud Torus project. For a color version of this figure...
Figure 13.4. Virtualization types. For a color version of this figure, see www.i...
Chapter 14
Figure 14.1. Global architecture of a SOAP System
Figure 14.2. REST level 0. For a color version of this figure, see www.iste.co.u...
Figure 14.3. REST level 1. For a color version of this figure, see www.iste.co.u...
Figure 14.4. REST level 2. For a color version of this figure, see www.iste.co.u...
Figure 14.5. REST level 3. For a color version of this figure, see www.iste.co.u...
Figure 14.6. Different kind of services. For a color version of this figure, see...
Figure 14.7. Scalability’s problem. For a color version of this figure, see www....
Chapter 15
Figure 15.3. Filter. For a color version of this figure, see www.iste.co.uk/laff...
Figure 15.4. Map. For a color version of this figure, see www.iste.co.uk/laffly/...
Figure 15.5. Fold. For a color version of this figure, see www.iste.co.uk/laffly...
Figure 15.6. Difference between type, value and expression
Chapter 17
Figure 17.1. CAP theorem and database distribution map. For a color version of t...
Chapter 18
Figure 18.1. WRF–ARW modeling system flow chart (source: Wang et al. 2016). For ...
Figure 18.2. Domain configuration for E1
Figure 18.3. Domain configuration for E2
Figure 18.4. Simulation speed measured for all combinations under E1. The lower ...
Figure 18.5. Simulation speed measured for all combinations under experiment E2....
Cover
Table of Contents
Begin Reading
v
iii
iv
xiii
xiv
xv
xvi
xvii
xviii
xix
xxi
xxii
xxiii
xxiv
1
2
3
4
5
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
153
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
179
180
181
182
183
184
185
186
187
188
189
191
192
193
194
195
196
197
198
199
200
201
202
203
204
205
206
207
208
209
210
211
212
213
214
215
216
217
218
219
220
221
222
223
224
225
226
227
228
229
230
231
232
233
234
235
236
237
238
239
240
241
242
243
245
246
247
248
249
250
251
252
253
254
255
256
257
258
259
260
261
262
263
264
265
266
267
268
269
270
271
272
273
274
275
276
277
278
279
280
281
282
283
284
285
286
287
288
289
291
292
293
294
295
296
297
298
299
301
302
303
304
305
306
307
308
309
310
311
312
313
314
315
316
317
Edited by
Dominique Laffly
First published 2020 in Great Britain and the United States by ISTE Ltd and John Wiley & Sons, Inc.
Apart from any fair dealing for the purposes of research or private study, or criticism or review, as permitted under the Copyright, Designs and Patents Act 1988, this publication may only be reproduced, stored or transmitted, in any form or by any means, with the prior permission in writing of the publishers, or in the case of reprographic reproduction in accordance with the terms and licenses issued by the CLA. Enquiries concerning reproduction outside these terms should be sent to the publishers at the undermentioned address:
ISTE Ltd27-37 St George’s RoadLondon SW19 4EUUK
www.iste.co.uk
John Wiley & Sons, Inc.111 River StreetHoboken, NJ 07030USA
www.wiley.com
© ISTE Ltd 2020
The rights of Dominique Laffly to be identified as the author of this work have been asserted by him in accordance with the Copyright, Designs and Patents Act 1988.
Library of Congress Control Number: 2019956841
British Library Cataloguing-in-Publication Data
A CIP record for this book is available from the British Library
ISBN 978-1-78630-599-2
Geography, Ecology, Urbanism, Geology and Climatology – in short, all environmental disciplines are inspired by the great paradigms of Science: they were first descriptive before evolving toward systemic and complexity. The methods followed the same evolution, from the inductive of the initial observations one approached the deductive of models of prediction based on learning. For example, the Bayesian is the preferred approach in this book (see Volume 1, Chapter 5), but random trees, neural networks, classifications and data reductions could all be developed. In the end, all the methods of artificial intelligence (IA) are ubiquitous today in the era of Big Data. We are not unaware, however, that, forged in Dartmouth in 1956 by John McCarthy, Marvin Minsky, Nathaniel Rochester and Claude Shannon, the term artificial intelligence is, after a long period of neglect at the heart of the future issues of the exploitation of massive data (just like the functional and logical languages that accompanied the theory: LISP, 1958, PROLOG, 1977 and SCALA, today – see Chapter 8).
All the environmental disciplines are confronted with this reality of massive data, with the rule of the 3+2Vs: Volume, Speed (from the French translation, “Vitesse”), Variety, Veracity, Value. Every five days – or even less – and only for the optical remote sensing data of the Sentinel 2a and 2b satellites, do we have a complete coverage of the Earth at a spatial resolution of 10 m for a dozen wavelengths. How do we integrate all this, how do we rethink the environmental disciplines where we must now consider at the pixel scale (10 m) an overall analysis of 510 million km2 or more than 5 billion pixels of which there are 1.53 billion for land only? And more important in fact, how do we validate automatic processes and accuracy of results?
Figure P.1.At the beginnig of AI, Dartmouth Summer Research Project, 1956.
Source: http://www.oezratty.net/wordpress/2017/semantique-intelligence-artificielle/
Including social network data, Internet of Things (IoT) and archive data, for many topics such as Smart Cities, it is not surprising that environmental disciplines are interested in cloud computing.
Before understanding the technique (why this shape, why a cloud?), it would seem that to represent a node of connection of a network, we have, as of the last 50 years, drawn a potatoid freehand, which, drawn took the form of a cloud. Figure P.2 gives a perfect illustration on the left, while on the right we see that the cloud is now the norm (screenshot offered by a search engine in relation to the keywords: Internet and network).
What is cloud computing? Let us remember that, even before the term was dedicated to it, cloud computing was based on networks (see Chapter 4), the Internet and this is: “since the 50s when users accessed, from their terminals, applications running on central systems” (Wikipedia). The cloud, as we understand it today, has evolved considerably since the 2000s; it consists of the mutualization of remote computing resources to store data and use services dynamically – to understand software – dedicated via browser interfaces.
Figure P.2.From freehand potatoid to the cloud icon. The first figure is a schematic illustration of a distributed SFPS switch. For a color version of this figure, see www.iste.co.uk/laffly/torus1.zip
This answers the needs of the environmental sciences overwhelmed by the massive data flows: everything is stored in the cloud, everything is processed in the cloud, even the results expected by the end-users recover them according to their needs. It is no wonder that, one after the other, Google and NASA offered in December 2016 – mid-term of TORUS! – cloud-based solutions for the management and processing of satellite data: Google Earth Engine and NASA Earth Exchange.
But how do you do it? Why is it preferable – or not – for HPC (High Performance Computing) and GRIDS? How do we evaluate “Cloud & High Scalability Computing” versus “Grid & High-Performance Computing”? What are the costs? How do you transfer the applications commonly used by environmental science to the cloud? What is the added value for environmental sciences? In short, how does it work?
All these questions and more are at the heart of the TORUS program developed to learn from each other, understand each other and communicate with a common language mastered: geoscience, computer science and information science; and the geosciences between them; computer science and information sciences. TORUS is not a research program. It is an action that aims to bring together too (often) remote scientific communities, in order to bridge the gap that now separates contemporary computing from environmental disciplines for the most part. One evolving at speeds that cannot be followed by others, one that is greedy for data that others provide, one that can offer technical solutions to scientific questioning that is being developed by others and so on.
TORUS is also the result of multiple scientific collaborations initiated in 2008–2010: between the geographer and the computer scientist, between France and Vietnam with an increasing diversity of specialties involved (e.g. remote sensing and image processing, mathematics and statistics, optimization and modeling, erosion and geochemistry, temporal dynamics and social surveys) all within various scientific and university structures (universities, engineering schools, research institutes – IRD, SFRI and IAE Vietnam, central administrations: the Midi-Pyrénées region and Son La district, France–Vietnam partnership) and between research and higher education through national and international PhDs.
Naturally, I would like to say, the Erasmus+ capacity building program of the European Union appeared to be a solution adapted to our project:
“The objectives of the Capacity Building projects are: to support the modernization, accessibility and internationalization of higher education in partner countries; improve the quality, relevance and governance of higher education in partner countries; strengthen the capacity of higher education institutions in partner countries and in the EU, in terms of international cooperation and the process of permanent modernization in particular; and to help them open up to society at large and to the world of work in order to reinforce the interdisciplinary and transdisciplinary nature of higher education, to improve the employability of university graduates, to give the European higher education more visibility and attractiveness in the world, foster the reciprocal development of human resources, promote a better understanding between the peoples and cultures of the EU and partner countries.”1
In 2015, TORUS – funded to the tune of 1 million euros for three years – was part of the projects selected in a pool of more than 575 applications and only 120 retentions. The partnership brings together (Figure P.3) the University of Toulouse 2 Jean Jaurès (coordinator – FR), the International School of Information Processing Sciences (EISTI – FR), the University of Ferrara in Italy, the Vrije University of Brussels, the National University from Vietnam to Hanoi, Nong Lam University in Ho Chi Minh City and two Thai institutions: Pathumthani’s Asian Institute of Technology (AIT) and Walaikak University in Nakhon Si Thammarat.
Figure P.3.The heart of TORUS, partnership between Asia and Europe. For a color version of this figure, see www.iste.co.uk/laffly/torus1.zip
With an equal share between Europe and Asia, 30 researchers, teachers-researchers and engineers are involved in learning from each other during these three years, which will be punctuated by eight workshops between France, Vietnam, Italy, Thailand and Belgium. Finally, after the installation of the two servers in Asia (Asian Institute of Technology – Thailand; and Vietnam National University Hanoi – Vietnam), more than 400 cores will fight in unison with TORUS to bring cloud computing closer to environmental sciences. More than 400 computer hearts beat in unison for TORUS, as well as those of Nathalie, Astrid, Eleonora, Ann, Imeshi, Thanh, Sukhuma, Janitra, Kim, Daniel, Yannick, Florent, Peio, Alex, Lucca, Stefano, Hichem, Hung(s), Thuy, Huy, Le Quoc, Kim Loi, Agustian, Hong, Sothea, Tongchai, Stephane, Simone, Marco, Mario, Trinh, Thiet, Massimiliano, Nikolaos, Minh Tu, Vincent and Dominique.
To all of you, a big thank you.
This book is divided into three volumes.
Volume 1 raises the problem of voluminous data in geosciences before presenting the main methods of analysis and computer solutions mobilized to meet them.
Volume 2 presents remote sensing, geographic information systems (GIS) and spatial data infrastructures (SDI) that are central to all disciplines that deal with geographic space.
Volume 3 is a collection of thematic application cases representative of the specificities of the teams involved in TORUS and which motivated their needs in terms of cloud computing.
Dominique LAFFLY
January 2020
1
http://www.agence-erasmus.fr/page/developpement-des-capacites
.
What is Geography? Literally “writing of the Earth”, the Larousse dictionary gives the following definition:
“Science which has for object the description and the explanation of the current aspect, natural and human, of the surface of the Earth.”
And Robert dictionary “Science that studies and describes the Earth, as a habitat of the human being and all living organisms.”
It is therefore a Science, one that has its roots in China, Egypt, the Inca Empire and Greece for thousands of years because “all societies have constructed an idea of their situation in the world and present a cosmogony that is at the same time, a great account of the origins” (ibid.). The map was always a central element to accompany the thought of the representation of the world, and to manage and act on the territory. All the thinkers and scientists of the time were geographers or at least were geographers at the same time as they were philosophers, anthropologists, mathematicians, biologists or astronomers – Herodotus, Eratosthenes, Qian, Polo, Ptolemy, Al Idrisi, Al-Khuwarizmi, Mercator, Cassini, Van Humboldt, Darwin. Today, perhaps, are we all still geographers? Maybe most of us do not know it or do not (especially) want to claim it. Hence the initial question – what is Geography? Geography is a Geoscience. It is one of the Geosciences that is interested in the interactions of human societies with the geographical space – “the environment”, one that does not go without the other to build the landscapes – visible manifestations of interacting forces. For Geography, thinking about space without the social is an aberration just as the social is thinking while denying space. The spatialization of information is at the heart of Geography; the map – to put it simply – is the bedrock of geographic synthesis: concepts, methods and information to answer the central question “why here and now but not elsewhere?”. It is not enough to superimpose tracks, calculate an index and color shapes to make this cartographic synthesis and get a “good” map. We will see that, like all Geosciences, today’s Geography with the concepts and methods they mobilize are confronted with having to integrate massive data – Big Data. For this, it must not only evolve in its own paradigms but also in the control of analytical methods – artificial intelligence – and computer techniques dedicated to massive data – cloud computing.
“Is it permissible to assimilate the world to what we have seen and experienced? The common claim, as reprehensible as the refusal to dream – at least (for want of something better) – on the future! First: the question of recognizing the order of what is in the elements that the senses (or the tools that arm the senses) grasp is perhaps the very one of Philosophy in all its nobility. It has been said in Latin […] that all knowledge begins with what is sensible in the objects of nature: ‘Omnis cognito initium habet a naturalibus… vel: a sensibilibus.’ Beyond this knowledge, there is only mystery; and the very revelation given by God is meditated on the example of what we have known by the natural play of reason. It is necessary here that the statistician, the surveyor and the sociologist are modest! In seeking what we have always had to look for, each generation cannot have done more than its share: the question remains.”1
Protean word, a little magic in the geographical discourse – as Jean-Claude Wieber (1984)2 liked to say – the landscape is dear to geographers although they have not truly defined a real status of the landscape within the discipline. Is it reasonable after all when a word has so many different meanings? The same author proposes that we refine the content:
“Is the use of the word Landscape in this case an abuse of language? Probably not completely. No one would think of denying relief a fundamental role in the differentiation of landscapes […] by the influence it exerts on the aptitudes of the soils and the adaptations made of them by people and vegetation. In the same way, the examination of the Roman cadastres […] of an ancient organization of space which one can sometimes perceive or guess [is called] ‘landscape analysis’. In these two cases, we study directly, by the measurement of the processes, or indirectly, through the resulting traces, how work sets of forces produce the Landscape.”
The geographical envelope would therefore be the place of expression for all the landscapes themselves considered as a whole that can be approached by the instrumentalization under the constraint of the data available to describe it; the consideration of the landscape is then partial and biased, and the information and the protocols of collection and analysis are at the heart of the analysis of the landscapes considered as a system. E. Schwarz (1988)3 gives a concise definition of systemic analysis that complements the Cartesian analytic approach:
“The systemic approach is a state of mind, a way of seeing the world […] looking for regularities (invariant), to identify structures, functions, processes, evolution, organization. [It] is characterized above all by taking into account the global nature of phenomena, their structure, their interactions, their organization and their own dynamics. […] The systemic brings together the theoretical, practical and methodological approaches to the study of what is recognized as too complex to be approached in a reductionist manner and which poses problems of borders, internal and external relations, structure, emerging laws or properties characterizing the system as such or problems of mode of observation, representation, modeling or simulation of a complex totality.”
Brossard and Wieber4 propose a conceptual diagram of a systemic definition of landscape (Figure I.1). Between production – the “physical” producing system – and consumption – the “social” user system – the landscape is expressed by what is visible and non-reducible – the “visible landscape” system – to one or the other previous subsystems. This specificity of the geographer to understand the landscape so as to make sense of space places it at the crossroads of multidisciplinary scientific paths:
“The specialists of other disciplines now know that ‘nature’ is never quite ‘natural’, or, conversely, that the analysis of social systems can no longer be considered detached from the environments in which they are located. Also, they very often want the intervention of geographers, in the field as in the processing of data provided by satellites; one cannot go without the other.”5
In fact, the satellite images mentioned by the author are not sufficient to describe landscapes. Other information is also available, their collection is essential, as is the methodological and technical mastery to ensure their analysis.
Figure I.1.In the early days of the “Systemic Landscape” (modified from Brossard and Wieber). For a color version of this figure, see www.iste.co.uk/laffly/torus1.zip
Thus chosen as a key concept, the landscape is an entry point for themes that have a practical impact. This concept is linked to an analysis method specific to the geographer and their needs to spatialize – in the sense of continuously covering the space. The landscape’s “signs” – information – allow for a quantitative approach that relies on the use of statistical and computer tools in search of the fundamental structures to, in a way, “replace the ‘visible complicated’ perceived landscapes by ‘the invisible simple’ spatial structure.”6
Introduction written by Dominique LAFFLY.
1
Benzécri J.-P., “In memoriam… Pierre Bourdieu – L’@nalyse des données : histoire, bilan, projets, …, perspective”,
Revue MODULAD
, no. 35, 2006.
2
Wieber J.-C., “Étude du paysage et/ou analyse écologique ?”,
Travaux de l’institut géographique de Reims
, nos 45–46, 1981.
3
Schwarz É. (ed.),
La révolution des systèmes. Une introduction à l’approche systémique
, Editions DelVal, 1988.
4
Brossard T. and Wieber J.-C., “Essai de formalisation systémique d’un mode d’approche du paysage”,
Bulletin de l’association des géographes français 468
, pp. 103–111, 1981.
5
Frémont A., “La télédétection spatiale et la géographie en France aujourd’hui”,
L’Espace géographique
, no. 3, pp. 285–287, 1984.
6
Perrin F.,
Les atomes : présentation et complément
, Gallimard, Paris, 1970.
“Using measures, observations and systems of knowledge inevitably means introducing the notion of representativity in various ways. It includes questions about sampling strategies, the nature of the data, their disaggregation and aggregation, the equations used to model, extrapolate or interpolate information (the same mathematical function can be used for one or the other of these methods)… Any reasoned approach in information analysis tries to integrate at best these different aspects of the measurement and their spatiotemporal representability.”1
The analysis of the landscape thus formulated implies four principles:
– the mastery of space and time and the direct implications on scales and themes;
– the semantic control of the content of information between the “
knowledge
” of the specialist and the “
reality
” of the landscapes;
– the mastery of the constitution of information put at the heart of the process;
– the mastery of methods and instrumentalization.
In geography, we are also confronted with the difficulty of linking thematically specialized (endogenous punctual information) specific descriptions with general ones (exogenous areal information), the only ones that are amenable to taking into account the spatial continuum (Figure 1.1). To do this, in the framework of the systemic formulation of the landscape and the mode of analysis related to it, we present a formalization based on four key elements: point, trace, order and inference.
Figure 1.1.Elements of formalization of the landscape system. For a color version of this figure, see www.iste.co.uk/laffly/torus1.zip
Point: the basic spatial unit of endogenous observations made in situ. It is the subject of a precise location (differential GNNS and/or geocoding of addresses) and a standard description. Surveys are conducted according to a cybernetic logic and a systematic protocol, so as to lend themselves to quantitative analyses that describe and parameterize information structures. Sampling strategies are based on thematic and spatial criteria. For example, for biogeographic facies surveys, stratified nonaligned systematic sampling is commonly used at two levels2: the first to define the overall sampling plan of the points to observe in the field and the second to stop the in situ observation strategy for each previously defined entity3. Here, we find the notion of integrated or holistic analysis.
Trace: this is the message or sign that reflects the links between the structures identified from the analysis of endogenous data and the exogenous information that will serve as a reference for spatialization. This element includes images of satellites and other geographical information, such as altitude, slope, orientation, age of surfaces, distance to objects and any information likely to describe the landscapes and available under the continuous blanket form of space. It is the extension, via the geographical coordinates, of the description of the point in the exogenous information base. Beyond the pixels of images that are ideally suited to our approach, it can nevertheless be generalized to socio-economic data identified by a reference administrative unit, i.e. the most detailed level available: IRIS4 in France, NUTS5 for GADM6. It is still necessary that these data exist and that they are validated, updated and accessible. The point data observed in situ will first be summarized (pivot table) by a reference administrative unit and then confronted with the potential identification of links, here, the trace.
Order: this essentially refers to the spatial structuring of data, the arrangement of landscape elements relative to each other that induces differentiated spatial constraints and practices. In image analysis, order refers to the notions of textures and texture mosaics and spatial autocorrelation, and opens the perspective of the frequency analysis of Fourier transforms and wavelets. From vector objects – typically reference administrative entities – the analysis of spatial structuring uses topological operators of graph theory: shape descriptors (perimeter, surface, width, length, etc.); contiguity; inclusion; neighborhood; connection of smaller distances and so on (see landscape ecology).
Inference: this is inference in the statistical sense of the term, i.e. the application of the rules developed in the previous steps to ensure the link between endogenous and exogenous information. It is an ergodic approach – “which makes it possible to statistically determine all the achievements of a random process from an isolated realization of this process” – based on probabilistic models, which makes it possible to restore the continuity of geographical space from partial knowledge. We think in particular of Bayesian probability models (Bayes, the way!) as well as the Metropolis–Hastings algorithm:
“It is today the whole field of MCMCs, the Monte-Carlo Markow-Chain, whose unreasonable effectiveness in physics, chemistry and biology […] has still not been explained. It is not a deterministic exploration, nor is it a completely random exploration; it is a random walk exploration. But deep down, it’s not new; it’s the same in life: by going a little randomly from one situation to another, we explore so many more possibilities, like a researcher who changes scientific continents with the passing of time.”7
From an operational point of view, the proposed formalization consists of measuring the degrees of connection between endogenous and exogenous information. When they are significant, we use them to generalize all of the space and all or part of the data observed punctually. It is important to distinguish now between the analysis methods we propose and the interpolation calculation procedures that also contribute to spatializing information. These last ones consist of filling the gaps in data of the same nature contained in the same grid of description as a phenomenon. For this we choose a calculation method inspired by the cases encountered, equation [1.1]
