How to construct and publish data on the web the right way

The process of publishing your data on the Web is multifaceted. To increase the value of your data sources, you want the data to be expressed using standard names (codes), schemas and serialization formats. This makes it easier for the consumers of your data to integrate it into their applications and with other data sources.

Names/codes for entities you describe

Whether your data is tabular or not, you are often using entities (or ids or codes representing them) as values. Such as artists in your collection of artworks.

People often mint IDs for the entities they are describing. This means they create new ID spaces and assigning new IDs to entities.

Instead of generating new IDs, consider using standard keys for entities, such as Cofactor IDs. Using standard keys (Cofactor IDs) in your dataset facilitates the adoption and increases the value of your dataset.

HTTP URLs in place of IDs

An alternative to IDs are publicly accessible HTTP URIs pointing at some location on the web. These have the following benefits:

  • They are globally unique by default, so they will never clash with others.
  • They can be dereferenced (resolved on the Web) and provide an easy way to access the description of each entity referenced.
  • They are naturally stable and persistent.

Each Cofactor ID is dereferenceable using its corresponding URI.

Schema

What are you representing in your data? Is it people? Events? Try to find an existing schema that meets your needs. Consider schema.org. You want your data schema to be widely understood and adopted. Reuse common vocabularies as much as possible.

If schema.org doesn't fulfill your needs, you can create and publish a custom vocabulary. Avoid defining a new vocabulary from scratch, but try to extend an existing vocabulary.

Here’s an example:

tri:reporting_year
rdf:type rdf:Property ;
rdfs:label "reporting year" ;
rdfs:comment "Indicates a year when a TRI Report was submitted to the U.S.
 EPA's Toxic Release Inventory system." ;
rdfs:domain tri:Report ;
rdfs:range rdfs:Literal .
Have feedback on this article? Let @g_11fhkb1_61 know.
Need help? Contact us.

Discover Eckher Semantic Web Browser: "http://xmlns.com/foaf/0.1/Person", "http://schema.org/Organization", "http://www.w3.org/2004/02/skos/core#definition", "http://www.wikidata.org/entity/Q1".

Discover English pronunciations: "croissant", "Murraya", "Khai", "Mersenne", "cicada", "foie gras", "Thailand", "axolotl", "cyan", "hemoptysis", "latke", "Siobhán".

Enter IPA characters using Eckher IPA Keyboard.

Navigate the Semantic Web and retrieve the structured data about entities published on the web using Eckher Semantic Web Browser.

Turn your phone into a compass using Eckher Compass.

Browse word pronunciations online using Eckher Dictionary.

Author, enrich, and query structured data using Eckher Database for RDF.

Create TeX-style mathematical formulas online with Eckher Math Editor.

Create knowledge graphs using Eckher RDF Graph Editor.

Send messages and make P2P calls using Eckher Messenger.

Build event-sourced systems using Eckher Database for Event Sourcing.

View PDB files online using Eckher Mol Viewer.

Listen to your text using Eckher Text to Speech.

View FASTA sequence alignments online with Eckher Sequence Alignment Viewer.

Convert Punycode-encoded internationalized domain names (IDNs) to Unicode and back with Eckher Punycode Converter.

Explore the human genome online with Eckher Genome Browser.

Edit text files online with Eckher Simple Text Editor.

Send test emails with Eckher SMTP Testing Tool.

Разбор слов по составу: "кормушка", "светить", "сбоку", "шиповник", "народ", "впервые", "свежесть", "издалека", "лесной".