Difference between revisions of "Category:Historical Documents Working Group"
m |
m |
||
Line 727: | Line 727: | ||
=== Cluster: Hydrology === | === Cluster: Hydrology === | ||
+ | |||
+ | see also | ||
+ | |||
+ | Inside this cluster are all hydrology related parameters: water levels, flood magnitudes, ... | ||
+ | |||
+ | *hydrology-cluster | ||
+ | |||
+ | Flood levels (magnitude) i.e. can be | ||
+ | 0: no flood | ||
+ | +1: small flood | ||
+ | +2: flood above average | ||
+ | +3: flood of the century | ||
+ | |||
+ | <poll> | ||
+ | Should the 'flood level' (indices) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | Flood extend i.e. can be | ||
+ | +1: regional | ||
+ | +2: supra-regional | ||
+ | |||
+ | <poll> | ||
+ | Should the 'flood extend' (indices) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'water level value' (measurement) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'water level unit' (measurement) be: | ||
+ | SI - meter | ||
+ | Centimeter | ||
+ | Feet | ||
+ | Any | ||
+ | other | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'tolerance of the water level' measurement be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | |||
+ | <poll> | ||
+ | Should the 'flood recurrence time' be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'flood recurrence unit' be: | ||
+ | SI - seconds | ||
+ | days | ||
+ | month | ||
+ | years | ||
+ | Any | ||
+ | other | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'tolerance of flood recurrence' measurement be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'discharge value' be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'discharge unit' be: | ||
+ | SI - cubic meter per second | ||
+ | liter per second | ||
+ | Any | ||
+ | other | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'tolerance of discharge' measurement be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | |||
+ | <poll> | ||
+ | Should the 'flow velocity' (measurement) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'flow velocity unit' be: | ||
+ | SI - meter per second | ||
+ | kilometer per hour | ||
+ | miles per hour | ||
+ | Any | ||
+ | other | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'tolerance of flow velocity' measurement be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | |||
+ | low water levels i.e. can be | ||
+ | 0: normal level | ||
+ | -1: low water level (limited use) | ||
+ | -2: very low water level (severely limited use) | ||
+ | -3: extremely low water level (no water) | ||
+ | |||
+ | <poll> | ||
+ | Should the 'low water level' (indices) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | Trend i.e. can be | ||
+ | -1: decreasing | ||
+ | 0: constant | ||
+ | +1: increasing | ||
+ | |||
+ | <poll> | ||
+ | Should the 'water level trend' (indices) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | Storm surge levels i.e. can be | ||
+ | 0: no storm surge | ||
+ | +1: average storm surge | ||
+ | +2: strong storm surge | ||
+ | +3: extreme storm surge | ||
+ | |||
+ | <poll> | ||
+ | Should the 'storm surge level' (indices) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | Following parameters are of type boolean | ||
+ | |||
+ | <poll> | ||
+ | Should the 'occurance of erosion' (boolean) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'occurance of flash flood' (boolean) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'occurance of sediment deposition' (boolean) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'occurance of large plains flooded' (boolean) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
+ | |||
+ | <poll> | ||
+ | Should the 'occurance of long standing water' (boolean) be: | ||
+ | Essential | ||
+ | Recommended | ||
+ | Desired | ||
+ | Dropped (as not needed) | ||
+ | </poll> | ||
=== Cluster: Clouds, Visibility === | === Cluster: Clouds, Visibility === |
Revision as of 12:24, 26 April 2019
Overview
In the Linked Earth context, a working group (WG) is a self-organized coalition of knowledgeable experts, whose activities are governed herewith. This page is dedicated to the discussion of data and metadata standards for historical documents, and aims to formulate a set of recommendations for such a standard.
Members of 'Historical Documents Working Group'
This working group contains only the following member.
Sources
Data is usually compiled from different historical sources. The LiPD data structure supports several Publication, that is normally used for referring to the publication describing the data. So this data cluster can be used for historical sources as well, in addition to the current publication describing the data. It is related to known standards as Dublin Core or BibTEX
- Source-ID for later reference
- Source Type (string), i.e. newspaper, book, ...
- Source Author
- Source Title
- Publication Year
- Publication Date
- Journal Title
- Publisher
- Url to Source (i.e. to PDF)
- DOI of Source (almost never exist for historical documents if not published as data compilation)
- Source
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
Scans, Pages
Each source can optionally have a bunch of images, that are the scans of the pages inside the source. Maybe this can be dropped for the LiPD format and only references to external images should be added to the quotes.
Quotes
Out of each sources, several quotes could be extracted by transcribing them. Related data would most probably go into "measurementTables".
- Quote-ID: For later reference
- Reference to source: maybe short form like "Author()Year" is adequate
- Page: String of the page(s) number where the quote is extracted from
- Scan: Optional link(s) to externally stored image of this page(s)
- Language: The language of the quote (or it's translation)
- Protolanguage: The language the quote was written originally
- Quote: The quote itself (UTF8)
- License: License of the quote (cc)
- DOI: DOI, the quote is published
- Quote
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
Events
Events, that are more like interpretations of the quote would go into "model part". Each events refers to
- a quote
- a position
- a time
and contains a climate related data. (to be defined)
Position
The position different than for the other archive types is not fixed. Usually a compilation of several sources refer to different scattered locations. The location usually can be named but might covers different scales (continent, country, area, city, street,...) and terrain types (city, river, sea, ...). It best corresponds to the Geospatial metadata of LiPD; see also http://schema.org/Place .
- Position
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
Time
The time derived for an event from historical documents is usually more precise than for other archive types. Often the uncertainty is just days or even hours. So it would be best to code the absolute date by gregorian calendar and a string of type ISO 8601. Optionally an absolute gregorian year (may be with floating point fraction instead of integer) can be calculated as well.
- Time
You are not entitled to view results of this poll.
The original sources often contain the information in calendar notation different than gregorian. Julian calendar was used in Europe in historical time, but outside Europe other calendars existed or still exist, likeweise in chinese or arabic documents.
You are not entitled to view results of this poll.
If so, which parameters for start and end should be considered?
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
It is also possible to extract the time related text from the quote and add it to a field 'phrase':
You are not entitled to view results of this poll.
Event itself
The event refers to the quote, maybe directly to the source, to the chronology/timing and the position/location. It also holds the coding of the phenomena, here it gets complicated - see next section.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
Phenomena/Coding
A lot of information can be found inside historical documents. Thereby the coding schema is complex. Some examples and their coding to illustrate this:
- The weather was extremely warm -> temperature index + very hot (+3)
- A lot of rain fell in December -> precipitation type + rain & precipitation amount + more than usual (+1)
- The water level was 13 feet -> water level + value:13.0 + tolerance:1.0 + unit:feet
- The drought lead to bad harvest of potatoes -> harvest + harvest amount: less than usual (-1) & potatoes / precipitation amount: less than usual (-1)
- The wheat harvest was little but the wine quality was good -> harvest + harvest amount: less than usual (-1) & wheat / harvest + harvest quality: better than usual (+1) & wine
+: one dimensional combination
&: two (or more) dimensional combinations
/: two different events (i.e. cause vs effect)
(Coding tree as used in tambora.org can be found here.)
To handle the innumerable amount of different parameters, it makes sense to group them into thematic clusters.
Cluster: Temperature
Inside this cluster are all temperature related parameters: Temperature measurements, descriptive temperature levels, freezing events, ...
- temperature-cluster
Levels i.e. can be
-3: very cold -2: cold -1: cool 0: normal temperature +1: warm +2: hot +3: very hot
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
Trend i.e. can be
-1: decreasing 0: constant +1: increasing
You are not entitled to view results of this poll.
Temperature value is a measured, numerical value with units Standards of other archive types can be used here
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
The following two parameters are of type boolean - they happen or not.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
Cluster: Precipitation
Inside this cluster are all precipitation related parameters: long- and short-term amounts, intensities, measurements, type of precipitation, ...
- precipitation
Levels of long-term precipitation (weeks... month) i.e. can be
-3: extremely dry -2: very dry -1: dry 0: normal precipitation +1: wet +2: very wet +3: extremely wet
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
Levels of short-term precipitation (hours ... days) i.e. can be
0: no precipitation +1: some precipitation +2: much precipitation +3: very much precipitation +4: extremely much precipitation
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
The amount of precipitation is a measured value for a given time frame (see timing)
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
The intensity of precipitation is a measured value, often a peal value
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
The precipitation type is a string i.e. rain, snow, hail, dew, sleet, ... It can be combined with the above quantitative parameters, but can also stand alone
You are not entitled to view results of this poll.
Levels of snow cover i.e. can be
0: no snow cover +1: thin snow cover +2: deep snow cover
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
The snow depth is static (accumulated), whereas the snow fall is per time-frame
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
Cluster: Hydrology
see also
Inside this cluster are all hydrology related parameters: water levels, flood magnitudes, ...
- hydrology-cluster
Flood levels (magnitude) i.e. can be
0: no flood +1: small flood +2: flood above average +3: flood of the century
You are not entitled to view results of this poll.
Flood extend i.e. can be
+1: regional +2: supra-regional
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
low water levels i.e. can be
0: normal level -1: low water level (limited use) -2: very low water level (severely limited use) -3: extremely low water level (no water)
You are not entitled to view results of this poll.
Trend i.e. can be
-1: decreasing 0: constant +1: increasing
You are not entitled to view results of this poll.
Storm surge levels i.e. can be
0: no storm surge +1: average storm surge +2: strong storm surge +3: extreme storm surge
You are not entitled to view results of this poll.
Following parameters are of type boolean
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
You are not entitled to view results of this poll.
Cluster: Clouds, Visibility
Cluster: Wind, Air Pressure
Cluster: Society
Cluster: Plant Phenology
Cluster: Animal Phenology
Cluster: Economy
(codeset_id,codeset_description,category,path,node_label,scale_label,scale_unit,value_label,value_index,average,variance,si_unit,si_average,si_variance )
Example
Rename LiPD file extension from zip to lpd
Description | Tambora-Files | LiPD-Files | Remarks |
---|---|---|---|
Flood Example | Media:flood_tambora_csv.zip | Media:Exp0000.tambora.2017.zip | |
To-Do | To-Do | To-Do | |
To-Do | To-Do | To-Do |
Media in category "Historical Documents Working Group"
The following 6 files are in this category, out of 6 total.