Category:Historical Documents Working Group

From Linked Earth Wiki
Revision as of 06:40, 25 April 2019 by KMicha (Talk | contribs)



Overview

In the Linked Earth context, a working group (WG) is a self-organized coalition of knowledgeable experts, whose activities are governed herewith. This page is dedicated to the discussion of data and metadata standards for historical documents, and aims to formulate a set of recommendations for such a standard.

Members of 'Historical Documents Working Group'

    This working group contains only the following member.


Sources

Data are usually compiled from several historical sources. The LiPD data structure supports multiple Publication entries, which normally refer to the publication describing the data. This data cluster can therefore also be used for historical sources, in addition to the current publication describing the data.


Scans, Pages

Each source can optionally have a set of images that are scans of the pages in the source. This could perhaps be dropped from the LiPD format, with only references to external images added to the quotes.

Quotes

From each source, several quotes can be extracted by transcription. The related data would most probably go into "measurementTables".

  • Quote-ID: Identifier for later reference
  • Reference to source: a short form such as "Author()Year" may be adequate
  • Page: String with the page number(s) from which the quote is extracted
  • Scan: Optional link(s) to externally stored image(s) of the page(s)
  • Language: The language of the quote (or of its translation)
  • Protolanguage: The language in which the quote was originally written
  • Quote: The quote itself (UTF-8)
  • License: License of the quote (cc)
  • DOI: DOI under which the quote is published
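
As a rough illustration of the field list above, a quote record could be sketched as a small Python data class. The class name `Quote`, the attribute names, and the choice of which fields are optional are assumptions made for this example only; they are not part of LiPD or of any agreed standard.

```python
from dataclasses import dataclass, field, asdict
from typing import List, Optional


@dataclass
class Quote:
    """One transcribed quote from a historical source (hypothetical layout)."""
    quote_id: str                      # Quote-ID, for later reference
    source_ref: str                    # short source reference, e.g. "Author()Year"
    page: str                          # page number(s) as a string
    language: str                      # language of the quote or its translation
    text: str                          # the quote itself (Python str is Unicode)
    scans: List[str] = field(default_factory=list)  # optional external image links
    protolanguage: Optional[str] = None             # original language, if different
    license: Optional[str] = None                   # e.g. a Creative Commons label
    doi: Optional[str] = None                       # DOI of the published quote

    def to_row(self) -> dict:
        """Return the record as a flat dict, e.g. for one table row."""
        return asdict(self)
```

A record created with only the required fields keeps an empty scan list and leaves the optional fields unset, so it can be completed later.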

Events

Events, which are interpretations of a quote rather than raw transcriptions, would go into the "model part". Each event refers to

  • a quote
  • a position
  • a time

and contains climate-related data (to be defined).
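
The relation between an event and its quote, position, and time can be sketched as follows. This is a minimal sketch under stated assumptions: the class `Event`, its attribute names, and the helper `missing_references` are hypothetical, and the climate-related payload is left as an open dictionary because its content is still to be defined.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Set


@dataclass
class Event:
    """An interpretation of a quote (hypothetical layout)."""
    event_id: str
    quote_id: str        # reference to a quote
    position_id: str     # reference to a position
    time: str            # event time, assumed here to be an ISO 8601 string
    climate_data: Dict[str, object] = field(default_factory=dict)  # to be defined


def missing_references(event: Event, quote_ids: Set[str],
                       position_ids: Set[str]) -> List[str]:
    """List the references of an event that do not resolve to known records."""
    problems = []
    if event.quote_id not in quote_ids:
        problems.append(f"unknown quote: {event.quote_id}")
    if event.position_id not in position_ids:
        problems.append(f"unknown position: {event.position_id}")
    return problems
```

Checking references in this way before export would catch events whose quote or position was dropped from the dataset.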

Position

Unlike for other archive types, the position is not fixed. A compilation of several sources usually refers to different, scattered locations. A location can usually be named, but it may cover different scales (continent, country, area, city, street, ...) and terrain types (city, river, sea, ...). It corresponds best to the Geospatial metadata of LiPD; see also http://schema.org/Place .
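
One possible way to represent such a named location is a GeoJSON Feature with a point geometry. The sketch below assumes this representation; the function name `position_feature` and the property names `name` and `locationType` are invented for illustration and are not taken from the LiPD specification. The only fixed convention used is the GeoJSON coordinate order (longitude, latitude, optional elevation).

```python
from typing import Optional


def position_feature(location_id: str, name: str, location_type: str,
                     latitude: float, longitude: float,
                     elevation: Optional[float] = None) -> dict:
    """Build a GeoJSON Feature for one named location (hypothetical property names)."""
    if not -90 <= latitude <= 90:
        raise ValueError(f"latitude out of range: {latitude}")
    if not -180 <= longitude <= 180:
        raise ValueError(f"longitude out of range: {longitude}")
    # GeoJSON orders coordinates as longitude, latitude, then optional elevation.
    coordinates = [longitude, latitude]
    if elevation is not None:
        coordinates.append(elevation)
    return {
        "type": "Feature",
        "id": location_id,
        "geometry": {"type": "Point", "coordinates": coordinates},
        "properties": {"name": name, "locationType": location_type},
    }
```

Locations that cover a larger scale, such as a country or a river, would need a polygon or line geometry instead of a point; that case is not covered here.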


  • Position
Should the 'Identifier of Location' (unique for dataset) be:
There was one vote since the poll was created on 06:40, 25 April 2019.

Should the 'Name of Location' be:
There was one vote since the poll was created on 06:35, 25 April 2019.

Should the 'Type of Location' be:
There was one vote since the poll was created on 06:36, 25 April 2019.

Should the 'Latitude of Location' be:
There was one vote since the poll was created on 06:36, 25 April 2019.

Should the 'Longitude of Location' be:
There was one vote since the poll was created on 06:36, 25 April 2019.

Should the 'Elevation of Location' be:
There were 0 votes since the poll was created on 06:36, 25 April 2019.

Should the '(JSON)-Geometry of Location' be:
There was one vote since the poll was created on 06:36, 25 April 2019.

Should the 'Reference to Geonames' by ID be:
There was one vote since the poll was created on 06:36, 25 April 2019.

Should the extracted corresponding quote piece = 'Phrase' be:
There was one vote since the poll was created on 06:36, 25 April 2019.


(source_id,quote_id,license_event,doi_event,codeset_id,codeset_description,category,path,node_label,scale_label,scale_unit,value_label,value_index,average,variance,si_unit,si_average,si_variance )
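
Read as a column list for event records, the field names above could serve as the header of a CSV export. The sketch below makes that assumption; the constant `EVENT_FIELDS` and the function `events_to_csv` are hypothetical names, and no meaning is assigned to the individual columns beyond their names.

```python
import csv
import io
from typing import Dict, Iterable

# Column names copied from the field list above; their semantics are not defined here.
EVENT_FIELDS = [
    "source_id", "quote_id", "license_event", "doi_event",
    "codeset_id", "codeset_description", "category", "path",
    "node_label", "scale_label", "scale_unit", "value_label",
    "value_index", "average", "variance",
    "si_unit", "si_average", "si_variance",
]


def events_to_csv(rows: Iterable[Dict[str, object]]) -> str:
    """Serialize event rows to CSV text; missing columns are left empty."""
    buffer = io.StringIO()
    writer = csv.DictWriter(buffer, fieldnames=EVENT_FIELDS, restval="")
    writer.writeheader()
    for row in rows:
        # DictWriter raises ValueError if a row contains an unknown column.
        writer.writerow(row)
    return buffer.getvalue()
```

Rejecting unknown columns keeps exported files aligned with the agreed field list while the list itself is still under discussion.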

Time

The time derived for an event from historical documents is usually more precise than for other archive types; often the uncertainty is only days or even hours. It would therefore be best to encode the absolute date in the Gregorian calendar as an ISO 8601 string. Optionally, an absolute Gregorian year (possibly with a floating-point fraction instead of an integer) can be calculated as well.
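
The two encodings mentioned above can be illustrated with Python's standard `datetime` module, which uses the proleptic Gregorian calendar. The helper name `fractional_year` is an assumption made for this sketch; it defines the fraction as the elapsed share of the calendar year, which is one of several possible conventions.

```python
from datetime import datetime


def fractional_year(moment: datetime) -> float:
    """Return the Gregorian year of `moment` with the elapsed fraction of that year."""
    start = datetime(moment.year, 1, 1)
    end = datetime(moment.year + 1, 1, 1)
    # Dividing two timedeltas yields a float; leap years are handled by `end - start`.
    return moment.year + (moment - start) / (end - start)


# Example: an event dated to noon on 2 July 1901.
event_time = datetime(1901, 7, 2, 12)
iso_string = event_time.isoformat()        # ISO 8601 string
year_value = fractional_year(event_time)   # year with floating-point fraction
```

The ISO 8601 string stays the primary value; the fractional year is derived from it and is convenient for plotting against other archive types.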

  • Time
How should the event time primarily be defined:
There was one vote since the poll was created on 00:48, 25 April 2019.

The original sources often give dates in a calendar other than the Gregorian one. The Julian calendar was used in Europe in historical times, and outside Europe other calendars existed or still exist, for example in Chinese or Arabic documents.
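
For the Julian case, a date can be converted to the Gregorian calendar by way of the Julian Day Number. The sketch below uses the widely known integer formula for Julian calendar dates and Python's proleptic Gregorian `date` ordinals; the function names are chosen for this example. It covers only the Julian calendar and only dates that fall within the range supported by `datetime.date`; other calendar systems need their own conversion rules.

```python
from datetime import date

# Julian Day Number of the day before proleptic Gregorian 0001-01-01 (ordinal 1).
_JDN_OFFSET = 1721425


def julian_calendar_to_jdn(year: int, month: int, day: int) -> int:
    """Julian Day Number of a date given in the Julian calendar."""
    a = (14 - month) // 12          # 1 for January and February, else 0
    y = year + 4800 - a
    m = month + 12 * a - 3          # March = 0, ..., February = 11
    return day + (153 * m + 2) // 5 + 365 * y + y // 4 - 32083


def julian_to_gregorian(year: int, month: int, day: int) -> date:
    """Convert a Julian calendar date to the corresponding Gregorian date."""
    return date.fromordinal(julian_calendar_to_jdn(year, month, day) - _JDN_OFFSET)
```

For example, `julian_to_gregorian(1582, 10, 5)` gives 15 October 1582 in the Gregorian calendar, the first day of the Gregorian reform.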

Should the time optionally be coded in 'Other Calendar Systems':
There was one vote since the poll was created on 06:37, 25 April 2019.

If so, which parameters for start and end should be considered?

Cycle:
There was one vote since the poll was created on 00:59, 25 April 2019.

Year:
There was one vote since the poll was created on 00:55, 25 April 2019.

Season / Solar Term:
There was one vote since the poll was created on 01:00, 25 April 2019.

Month:
There was one vote since the poll was created on 00:55, 25 April 2019.

Day:
There was one vote since the poll was created on 00:55, 25 April 2019.

Hour:
There was one vote since the poll was created on 01:00, 25 April 2019.

It is also possible to extract the time-related text from the quote and add it to a field 'phrase':

Phrase:
There was one vote since the poll was created on 01:05, 25 April 2019.


https://sweet.jpl.nasa.gov/

Example

Rename the LiPD file extension from zip to lpd.
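
The renaming step can be done by hand or scripted. The following is a minimal sketch using `pathlib`; the function name `rename_zip_to_lpd` is an assumption, and the function only changes the file name without inspecting the archive content.

```python
from pathlib import Path


def rename_zip_to_lpd(path: Path) -> Path:
    """Rename a downloaded '.zip' file so that it carries the '.lpd' extension."""
    if path.suffix.lower() != ".zip":
        raise ValueError(f"expected a .zip file, got: {path.name}")
    # with_suffix replaces only the last suffix, so 'a.b.zip' becomes 'a.b.lpd'.
    target = path.with_suffix(".lpd")
    path.rename(target)
    return target
```

Applied to a downloaded file named Exp0000.tambora.2017.zip, this would produce Exp0000.tambora.2017.lpd in the same directory.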

Description      Tambora-Files                  LiPD-Files                        Remarks
Flood Example    Media:flood_tambora_csv.zip    Media:Exp0000.tambora.2017.zip
To-Do            To-Do                          To-Do
To-Do            To-Do                          To-Do

Media in category "Historical Documents Working Group"

The following 6 files are in this category, out of 6 total.