Moles Atom Format

atom1.0.xsd

The Atom Syndication Format v1.0 is described fully in RFC 4287: http://tools.ietf.org/html/rfc4287

The schema, which is available in the ndgUtils egg as ndgUtils/xquery/atom1.0.xsd, is based on the schema from http://www.kbcafe.com/rss/atom.xsd.xml. NB, there is no officially released XSD schema. The Atom Syndication Format (ASF) specification includes a RELAX NG schema to describe the Atom syntax, but that schema is non-normative and is not expressive enough to capture some of the other validity requirements for Atom feeds or entries. The above XSD was generated by converting the RELAX NG schema to XSD with conversion tools, and it seems to work well.
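
As an illustration of a validity requirement that can be checked independently of any schema, RFC 4287 (section 4.1.2) requires an atom:entry to contain exactly one atom:id, atom:title and atom:updated element. The following is a minimal sketch of such a check using only the Python standard library. The function name check_entry and the sample entry are hypothetical and are not part of ndgUtils.

```python
import xml.etree.ElementTree as ET

ATOM_NS = "http://www.w3.org/2005/Atom"

# Child elements that RFC 4287 (section 4.1.2) requires exactly once per atom:entry.
REQUIRED_ONCE = ("id", "title", "updated")


def check_entry(entry_xml):
    """Return a list of problems found in an atom:entry document (empty if none).

    Hypothetical helper for illustration only; it is not part of ndgUtils.
    """
    root = ET.fromstring(entry_xml)
    if root.tag != "{%s}entry" % ATOM_NS:
        return ["root element is not atom:entry"]
    problems = []
    for name in REQUIRED_ONCE:
        count = len(root.findall("{%s}%s" % (ATOM_NS, name)))
        if count != 1:
            problems.append("expected exactly one atom:%s, found %d" % (name, count))
    return problems


# Made-up sample entry that deliberately omits atom:updated.
sample = """<entry xmlns="http://www.w3.org/2005/Atom">
   <id>tag:localhost:5000,2008-11-17:/view/example</id>
   <title>Example</title>
</entry>"""

print(check_entry(sample))
```

A check like this only supplements schema validation; it does not replace it.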

For the purposes of storing moles information, the schema has been extended using two further schemas:

moles2.0.xsd

This schema lifts elements from the previous moles2alpha format, to add data which doesn't naturally fit within the atom data structure. Specifically, it allows for the addition of the following data:

  • temporal range
  • responsible parties - with
    • name
    • role
    • uri
  • created date
  • metadata language
  • dataset language
  • abbreviation
  • providerID

NB, some of this data could, with the introduction of new vocab terms, be moved to the atom link elements.
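
The fields listed above can be read back from an entry with the Python standard library. The sketch below assumes the element names and the moles namespace URI used in the example entry further down this page. The function name read_moles_fields is hypothetical and is not part of ndgUtils, and the sample is a trimmed copy of that example.

```python
import xml.etree.ElementTree as ET

# Namespace URI as used in the example entry on this page.
NS = {"moles": "http://ndg.nerc.ac.uk/schema/moles2beta"}

# Single-valued children of moles:molesISO, as they appear in the example entry.
SIMPLE_FIELDS = ("datasetLanguage", "metadataLanguage", "abbreviation",
                 "providerID", "created")


def read_moles_fields(entry_xml):
    """Collect the moles extension fields of an entry into a plain dict.

    Hypothetical helper for illustration only.
    """
    root = ET.fromstring(entry_xml)
    # In the example entry, temporalRange is a direct child of atom:entry.
    fields = {"temporalRange": root.findtext("moles:temporalRange", namespaces=NS)}
    iso = root.find("moles:entity/moles:molesISO", NS)
    if iso is None:
        return fields
    for name in SIMPLE_FIELDS:
        fields[name] = iso.findtext("moles:" + name, namespaces=NS)
    fields["responsibleParties"] = [
        {part: party.findtext("moles:" + part, namespaces=NS)
         for part in ("name", "role", "uri")}
        for party in iso.findall("moles:responsibleParties/moles:responsibleParty", NS)
    ]
    return fields


sample = """<entry xmlns="http://www.w3.org/2005/Atom" xmlns:moles="http://ndg.nerc.ac.uk/schema/moles2beta">
   <moles:entity>
      <moles:molesISO>
         <moles:datasetLanguage>English</moles:datasetLanguage>
         <moles:metadataLanguage>English</moles:metadataLanguage>
         <moles:responsibleParties>
            <moles:responsibleParty>
               <moles:name>badc.nerc.ac.uk</moles:name>
               <moles:uri>http://badc.nerc.ac.uk</moles:uri>
               <moles:role>Data Curator</moles:role>
            </moles:responsibleParty>
         </moles:responsibleParties>
         <moles:abbreviation>ACSOE</moles:abbreviation>
         <moles:providerID>badc.nerc.ac.uk</moles:providerID>
         <moles:created>2006-11-03T00:00:00Z</moles:created>
      </moles:molesISO>
   </moles:entity>
   <moles:temporalRange>1996-04-02/1998-12-30</moles:temporalRange>
</entry>"""

fields = read_moles_fields(sample)
print(fields["abbreviation"], fields["temporalRange"])
```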

molesAtom1.0.xsd

This schema redefines the atom1.0.xsd schema, adding the moles2.0.xsd info as an 'entity' element and also adding a georss element to allow the specification of spatial data.
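
As a rough illustration of the georss extension, the sketch below appends a georss:where element containing a gml:Envelope to an atom:entry, using the namespace URIs and corner values from the example entry on this page. The helper name add_envelope is hypothetical. The corner strings are passed through unchanged, since the coordinate order expected by the schema is not stated here.

```python
import xml.etree.ElementTree as ET

# Namespace URIs as used in the example entry on this page.
ATOM_NS = "http://www.w3.org/2005/Atom"
GEORSS_NS = "http://www.georss.org/georss/10"
GML_NS = "http://www.opengis.net/gml"

# Register prefixes so the serialised output uses them instead of ns0, ns1, ...
ET.register_namespace("", ATOM_NS)
ET.register_namespace("georss", GEORSS_NS)
ET.register_namespace("gml", GML_NS)


def add_envelope(entry, lower_corner, upper_corner):
    """Append georss:where/gml:Envelope to an atom:entry element.

    Hypothetical helper for illustration only; corner values are written as given.
    """
    where = ET.SubElement(entry, "{%s}where" % GEORSS_NS)
    envelope = ET.SubElement(where, "{%s}Envelope" % GML_NS)
    ET.SubElement(envelope, "{%s}lowerCorner" % GML_NS).text = lower_corner
    ET.SubElement(envelope, "{%s}upperCorner" % GML_NS).text = upper_corner
    return where


entry = ET.Element("{%s}entry" % ATOM_NS)
add_envelope(entry, "-30.0 60.0", "10.0 30.0")
print(ET.tostring(entry, encoding="unicode"))
```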

Schema access

The schemas are included in the ndgUtils egg and can be retrieved using the ndgXqueries module, e.g.:

ndgXqueries().getSchema(ndgXqueries.ATOM_SCHEMA)

Also, if the existDBClient class (from the ndgUtils.models.existdbclient module) is instantiated with the keyword argument 'setUpDB=True', the schema will be added to the related eXist DB in the '/db/atoms' collection.

Example moles atom

An example data entity atom is as follows:

<entry xmlns="http://www.w3.org/2005/Atom" xmlns:georss="http://www.georss.org/georss/10" xmlns:gml="http://www.opengis.net/gml" xmlns:moles="http://ndg.nerc.ac.uk/schema/moles2beta">
   <id>tag:localhost:5000,2008-11-17:/view/badc.nerc.ac.uk__ATOM__dataent_ACSOE</id>
   <title>DATASET NOT IN USE - Atmospheric Chemistry studies in the Ocean Environment (ACSOE)</title>
   <link href="http://localhost:5000/view/badc.nerc.ac.uk__ATOM__dataent_ACSOE" rel="self"/>
   <link href="http://localhost:5000/view/badc.nerc.ac.uk__NDG-B1__dataent_ACSOE" rel="related"/>
   <link href="http://badc.nerc.ac.uk/browse/badc/acsoe/" rel="http://vocab.ndg.nerc.ac.uk/term/N021/2/1" title="datagran_ACSOE"/>
   <link href="/db/atoms/Published/data_entities/" rel="http://vocab.ndg.nerc.ac.uk/term/N041/3/2" title="Organisation details"/>
   <link href="http://badc.nerc.ac.uk/data/acsoe/" rel="http://vocab.ndg.nerc.ac.uk/term/N041/3/2" title="URI"/>
   <link href="http://ndgbeta.badc.rl.ac.uk/view/badc.nerc.ac.uk__NDG-B1__dataent_ACSOE" rel="http://vocab.ndg.nerc.ac.uk/term/N041/3/5" title="VIEW EXTENDED METADATA"/>
   <link href="http://badc.nerc.ac.uk/graphics/logos/small/acsoelogo.gif" rel="NOT YET SET UP/None/LOGO" title="Logo"/>
   <author>
      <name>badc.nerc.ac.uk</name>
      <email>badc.nerc.ac.uk</email>
   </author>
   <moles:entity>
      <moles:molesISO>
         <moles:datasetLanguage>English</moles:datasetLanguage>
         <moles:metadataLanguage>English</moles:metadataLanguage>
         <moles:responsibleParties>
            <moles:responsibleParty>
               <moles:name>calum byrom</moles:name>
               <moles:uri>www.tessella.com</moles:uri>
               <moles:role>Data Migrator</moles:role>
            </moles:responsibleParty>
            <moles:responsibleParty>
               <moles:name>badc.nerc.ac.uk</moles:name>
               <moles:uri>http://badc.nerc.ac.uk</moles:uri>
               <moles:role>Data Curator</moles:role>
            </moles:responsibleParty>
         </moles:responsibleParties>
         <moles:abbreviation>ACSOE</moles:abbreviation>
         <moles:providerID>badc.nerc.ac.uk</moles:providerID>
         <moles:created>2006-11-03T00:00:00Z</moles:created>
      </moles:molesISO>
   </moles:entity>
   <category term="NULL" scheme="http://vocab.ndg.nerc.ac.uk/term/P131" label="ATMOSPHERE"/>
   <category term="GCAT0001" scheme="http://vocab.ndg.nerc.ac.uk/term/P111" label="EARTH SCIENCE"/>
   <category term="0" scheme="general_units" label="NULL"/>
   <category term="NULL" scheme="http://vocab.ndg.nerc.ac.uk/term/null" label="CLOUDTOPTEMPERATURE"/>
   <category term="ATOM_TYPE" scheme="DE - NOT YET SET UP/None/DE" label="DE"/>
   <category term="ATOM_SUBTYPE" scheme="NOT YET SET UP/None/dgSimulation" label="dgSimulation"/>
   <summary>ACSOE was a 5-year NERC research programme on tropospheric chemistry coordinated by the University of East Anglia and involving research groups from a number of UK universities and research institutes. The programme consisted of coordinated measurement campaigns from ground stations, ships and aircraft commencing summer 1996, together with associated interpretation and modelling. Work on this dataset has been completed. The BADC is the primary archiving and distribution data centre. ACSOE data is now public.</summary>
   <content type="xhtml">The Atmospheric Chemistry Studies in the Oceanic Environment (ACSOE) programme was a 5-year NERC Community Research programme to investigate the physico-chemical processes in the layer of the atmosphere immediately above the oceans. The programme comprised a wide range of measurements coupled with modelling studies. Measurements were made &lt;ul&gt;&lt;li&gt; at remote field sites (Mace Head, Ireland; Weybourne, Norfolk; Tenerife); &lt;li&gt;from the NERC research vessels Challenger and Discovery; &lt;li&gt;aboard the Meteorological Research Flight C-130 and the Cranfield Jetstream aircrafts.&lt;/ul&gt; The aims of the programme were to understand the processes occuring in the remote marine atmosphere and to discover how these are modified by anthropogenic emissions. The ACSOE study area extends from the cold water regions south of Iceland, through the temperate eastern Atlantic and North Sea, to the Canary Islands in the sub-tropical Atlantic. The project had three consortia of UK institutes and universities, each of which focused on a different scientific topic: &lt;ul&gt; &lt;LI&gt; OXICOA (OXIdising Capacity of the Ocean Atmosphere) - &lt;I&gt; a study of oxidant, radical and related gas-phase chemistry in the clean and moderately polluted marine atmosphere. &lt;/I&gt; &lt;LI&gt; MAGE (Marine Aerosol and Gas Exchange) - &lt;I&gt; a study of aspects of air-sea exchange relevant to atmospheric chemistry and aerosol production. &lt;/I&gt; &lt;LI&gt; (ACE) Aerosol Characterisation Experiment - &lt;I&gt; a study of the processing of gases and aerosols through hill-cap clouds on Tenerife and sub-tropical marine subtropical stratocumuli.&lt;/I&gt;&lt;/ul&gt;</content>
   <updated>2007-03-30T16:03:58Z</updated>
   <published>2008-11-17T17:44:12Z</published>
   <moles:temporalRange>1996-04-02/1998-12-30</moles:temporalRange>
   <georss:where>
      <gml:Envelope>
         <gml:lowerCorner>-30.0 60.0</gml:lowerCorner>
         <gml:upperCorner>10.0 30.0</gml:upperCorner>
      </gml:Envelope>
   </georss:where>
</entry>
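
The example above uses vocab term URIs as link relation types, so one simple way to work with an entry is to group its links by their rel value. The following is a minimal sketch using the Python standard library. The function name links_by_rel is hypothetical, and the sample is a trimmed copy of the example entry.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

ATOM = "{http://www.w3.org/2005/Atom}"


def links_by_rel(entry_xml):
    """Map each link relation in an atom:entry to the list of its href values.

    Hypothetical helper for illustration only.
    """
    root = ET.fromstring(entry_xml)
    grouped = defaultdict(list)
    for link in root.findall(ATOM + "link"):
        # RFC 4287: a link without a rel attribute is interpreted as rel="alternate".
        grouped[link.get("rel", "alternate")].append(link.get("href"))
    return dict(grouped)


sample = """<entry xmlns="http://www.w3.org/2005/Atom">
   <link href="http://localhost:5000/view/badc.nerc.ac.uk__ATOM__dataent_ACSOE" rel="self"/>
   <link href="/db/atoms/Published/data_entities/" rel="http://vocab.ndg.nerc.ac.uk/term/N041/3/2" title="Organisation details"/>
   <link href="http://badc.nerc.ac.uk/data/acsoe/" rel="http://vocab.ndg.nerc.ac.uk/term/N041/3/2" title="URI"/>
</entry>"""

links = links_by_rel(sample)
for rel, hrefs in links.items():
    print(rel, hrefs)
```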

Content discussion

A discussion of the data migration from the original moles format to the molesAtom format is available at http://proj.badc.rl.ac.uk/badc/browser/ceda_software/granulator/trunk/src/molesGranulator/README.txt