ISO/IEC NWI xxx

 

 

Information Technology

 

Procedures for Achieving Content Consistency

In

ISO/IEC 11179 Metadata Registries

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

Working Paper

Draft 3.0

 

 

September 1999

 

Procedures for Achieving

Data Registry Content Consistency

 

Contents

 

-------------------------------------------------------------

 

 

Foreword

 

Introduction

 

1 Scope

 

2 References

 

3 Definitions

 

4 Component framework

5.0       REGISTER A DATA ELEMENT

5.1       General Procedures

5.1.1    Understanding the Data Element

5.1.2    Content Research

5.1.3    Definition and Permissible Values

5.1.4    Name and Identifiers

5.1.5    Other Metadata Attributes

5.1.6    Data Element Concept

5.1.7    Classification Attributes

5.1.8    Quality Control

5.2       International Standard with Enumerated Domain

5.2.1    Understanding the Data Element

5.2.2    Content Research

5.2.3    Definition and Permissible Values

5.2.4    Identify and Name the Data Element

5.2.5    Other Metadata Attributes

5.2.6    Data Element Concept

5.2.7    Classification

5.2.8    Quality Control

5.2.9    Other Codes and Names from ISO 3166

5.2.10  Summary of Attributes

5.3       International Standard with Non-Enumerated Domain

5.3.1    Understanding the Data Element

5.3.2    Content Research

5.3.3    Definition and Permissible Values

5.3.4    Identifying and Naming the Data Element

5.3.5    Other Metadata Attributes

5.3.6    Data Element Concept

5.3.7    Classification

5.3.8    Quality Control

5.3.9    Other Data Elements in ISO 6709

5.3.10  Summary of Metadata Attributes

5.4       Application Data Element

5.4.1    Understanding the Data Element

5.4.2    Content Research

5.4.3    Definition and Permissible Values

5.4.4    Identify and Name the Data Element

5.4.5    Other Metadata Attributes

5.4.6    Data Element Concept

5.4.7    Classification

5.4.8    Quality Control

5.4.9    Related Data Elements

5.4.10  Summary of Metadata Attributes

5.5       Register a Group of Data Elements

5.5.1    Information System Entity Group

5.5.2    Composite Data Element

5.5.3    Use Group

5.6       Linking of Data Elements

5.7       Registration of Associated Sources/Documents

 

 

 

6. Complex data

 

 

Annexes

 

A Bibliography

 

B Definitions of representation class terms

 

C Principles of managing shared data

 

D Data registry uses and users

 

E Conceptual and logical data models

 

F Table of Data Elements Attributes for Examples

 

G Top Down Approach to Data Element Registration

 

G.1      Biological Organisms

G.1.1   Data Element Concepts

G.1.2   Data Elements

G.1.3   Permissible Values

 

G.2      Biological Organism Types

G.2.1   Data Element Concepts

G.2.2   Data Elements

G.2.3   Permissible Values

 

G.3      Top Down Registration

 

 

Y Business Rules for Populating a Metadata Registry

 

Y.1      Data Element Definition

Y.1.1   Mandatory Rules

Y.1.1.1     Uniqueness

Y.1.1.2     Singular

Y.1.1.3     State the Concept; Not Only its Negative

Y.1.1.4     Descriptive Phrase or Sentence

Y.1.1.5     Contain Only Commonly Used Abbreviations

Y.1.1.6     No Embedded Definitions

Y.1.2   Guidelines for Definitions

Y.1.2.1     Essential Meaning of Concept

Y.1.2.2     Precise and Unambiguous

Y.1.2.3     Concise

Y.1.2.4     Stand Alone

Y.1.2.5     No Embedded Information

Y.1.2.6     Avoid Circular Reasoning

Y.1.2.7     Consistency for Related Definitions

Y.1.3   Data Element Definition Syntax

Y.1.4   Terms Commonly Used in Definitions

 

Y.2      Representational Attributes

Y.2.1   Permissible Values

Y.2.2   Value Domain

Y.2.3   Representational Terms

Y.2.4   Example

 

Y.3      Identifying and Naming a Data Element

Y.3.1   Name Context

Y.3.2   Establish a Naming Convention

Y.3.3   Example of a Naming Convention

Y.3.4   Formulating a Data Element Name

 

Y.4      Identification

Y.4.1   Data Element Identifier and Identifier

Y.4.2   Versioning

 

Y.5      Conceptual Relationships

Y.5.1   Data Element Concept

Y.5.2   Conceptual Domain

Y.5.3   Value Meanings

 

Y.6      Classification

 

Y.7      Quality Review

Y.7.1   Registration Status

Y.7.2   Administrative Status

 

Y.8      Reference Documents

 

 


 

 

 

 

Foreword

 

 

 

ISO (the International Organization for Standardization) and the IEC (the International Electrotechnical Commission) form the specialized system for worldwide standardization.  National bodies that are members of ISO or IEC participate in the development of International Standards through technical committees established by the respective organization to deal with particular fields of technical activity.  ISO and IEC technical committees collaborate in fields of mutual interest.  Other international organizations, governmental or non-governmental, in liaison with ISO and IEC, also take part in the work.

 

This document was prepared by ISO/IEC JTC 1/SC 32, Data Management and Interchange.


Introduction

 

 

The exchange of metadata between ISO/IEC 11179 metadata registries depends not only on registry software that conforms to the standard, but also on metadata contents that are compatible between registries. While the standard has provisions for data element specification and registration, there are pragmatic issues pertaining to populating the registries with content.  Based on the experiences of organizations that are implementing the standard, a technical report to explore content issues will help current and future users.

 

Well-formed data elements and their domains can be recorded in a metadata registry as "models" for potential reuse. Additional attributes may be required to record essential facts about how a data element is used in an application, e.g., for data quality, collection method, collection purpose, etc.

 

The proposed revision of ISO/IEC 11179, Part 3, models a data element (DE) and its associated components.  A data element consists of the data element concept plus its representation.  Some questions raised in the process of implementing registries concern this structure.  Creation of an application data element frequently requires additional qualification of the object class and/or property.  Does this creation of an application element always cause the creation of an application data element concept?  Does the qualified concept inherit meaning from the standard concept to which it is related, and is there an adequate place in the current scheme to store this relationship?  How are application DEC’s distinguished from other DEC’s or is there a need to make such a distinction?   These are examples of topics that might be explored in a document addressing content consistency among registry implementations.

 

Conceptualization and articulation of rules and relationships in the creation of object classes, properties, data element concepts and data elements are needed.  Explication of the various possible levels of data elements and data element concepts and their relationships would greatly assist in the creation of shareable, well-formed data.  Relationship and inheritance from the most abstract data element to the most concrete application data element needs to be specified.  Reuse of data value domains should be enabled and regularized. 

 


1 Scope

 

                                                                                               

1.1       Background

 

A registry is a tool for the management of shareable data; a comprehensive, authoritative source of reference information about data. It does not contain data itself, but it provides information on the definition, origin, source, and location of data. It supports the standard‑setting process by recording and disseminating data standards, which facilitates data sharing among organizations and users.  It provides links to documents that refer to data elements and to information systems where data elements are used.  When used in conjunction with an information database, the registry enables users to better understand the information obtained. 

 

This Technical Report is based on the American National Standard Institute (ANSI) X3.285:1999 Standard, Metamodel for the Management of Shareable Data.  The standard specifies the structure of a data registry in the form of a conceptual model.  The conceptual model is more abstract than a logical data model in that it does not consider how the data is represented in any particular way.  It is not intended to be a logical data model for a computer system, much less a physical model. 

 

A data registry contains the metadata that is necessary to clearly describe, inventory, analyze, and classify data.  It provides an understanding of the meaning, representation, and identification of a unit of data.  The ANSI X3.285 standard "outlines the information elements associated with a data element concept that need to be available for determining the meaning of a data element to be shared between systems.  The standard is a complement to the six-part International Organization for Standardization/International Electrotechnical Commission (ISO/IEC) 11179 standard that describes the organization of a data registry for managing the semantics of data elements in data systems."[1] 

 

1.2       Purpose

The purpose of this Technical Report is to describe business rules for the registration of data elements and their attributes in a registry.  This document is not a user’s guide for data entry, but a guide for conceptualizing a data element and its components for the purpose of consistently establishing good quality data elements. 

 

1.3 Scope


 

The scope of this document is limited to the essential components of a data element: the data element identifier, registry name, definition, and example; data concept; conceptual domain with its value meanings; and value domain with its permissible values.  This document is not concerned with the entry of detailed metadata for documents, standards, systems, groups, partners, and message sets. 


 

 

2 References

 

 

 

ISO/IEC DIS 11179-1, Information technology - Specification and standardization of data elements - Part 1: Framework for the specification and standardization of data elements

 

ISO/IEC DIS 11179-2, Information technology - Specification and standardization of data elements - Part 2: Classification for data elements

 

ISO/IEC 11179-3:1994, Information technology - Specification and standardization of data elements - Part 3: Basic attributes of data elements

 

ISO/IEC 11179-4:1995, Information technology - Specification and standardization of data elements - Part 4: Rules and guidelines for the formulation of data definitions

 

ISO/IEC 11179-5:1995, Information technology - Specification and standardization of data elements - Part 5: Naming and identification principles for data elements

 

ISO/IEC DIS 11179-6, Information technology - Specification and standardization of data elements - Part 6: Registration of data elements

 

ISO/IEC TR 15452, Information Technology - Specification of Data Value Domains

 


 

3 Definitions

 

For the purposes of this document, the following definitions apply.

 

 

3.1  attribute: A characteristic of an object or entity.

 

3.2  conceptual domain: A set of possible valid value meanings of a data element expressed without representation.

 

3.3  context: A designation or description of the application environment or discipline in which a name is applied or from which it originates.

 

 3.4  data element: A unit of data for which the identification, meaning, representation and permissible values are specified by means of a  set of attributes.

 

3.5  data element concept (DEC): A concept that can be represented in the form of a data element, described independently of any particular representation.

 

3.6 data element registry: An information resource that describes the meaning and representational form of data elements.

 

3.7  data element representation:  A data element component consisting of a value domain and representation class.

 

3.8  data identifier:  A language independent unique identifier of a data element within a registration authority.  An unambiguous name for an object within a given context.

 

3.9  data item:  An occurrence of a data element value.

 

3.10  data value: An element of a value domain.

 

3.11  data value domain:  A set of possible valid values of a data element expressed in a certain representation, for a data element having a value domain.

 

3.12  enumerated domain: A value domain that is specified by a list of all permissible values.

 

3.13  identifier: See data identifier.

 

3.14 international registration data identifier (IRDI): The unique and registered identifier of a data element.

 

3.15 metadata: Data that defines and describes other data.

 

3.16 name: The primary means of identification of objects and concepts for humans.

 

3.17 object class: A set of ideas, abstractions, or things in the real world that can be identified with explicit boundaries and meaning and whose properties and behavior follow the same rules.

 

3.18  permissible value (label):  An expression of a value meaning in a specific value domain.

 

3.19  property:  A peculiarity common to all members of an object class.

 

3.20  representation class:  A classification of types of representations.

 

3.21  structure set: A method of placing objects in context, revealing relationships to other objects.  Examples include Entity-Relationship Models, taxonomies, and ontologies.

 

3.22  value meaning:  A  valid value in a conceptual domain.

 

3.23  value meaning identifier (VMID): A label that uniquely identifies a value meaning.

 

 


4 Component framework

 

This clause presents a conceptual framework for structuring data elements and data element components in a registry.  Data elements are ideally the result of a process of development, involving several types of abstraction, producing a series of  "layers" related to each other by the method of abstraction used to produce one from the other.  Layers usually progress from the most general (conceptual) to the most specific (ultimately, the physical layer, although a metadata registry would not contain these).

 

One could use layers to structure development of a system using the Zachman Framework, for instance, with the highest levels of definition contained in the business view, and development progressing to the implemented system level.  The number and granularity of layers are driven by user requirements.  This clause will describe several (non-exhaustive) possible layers, none of which are intended to be mandatory for any particular implementation. 

 

The members of each layer are called data element components.  Components are envisioned as a set of building blocks that can be assembled into data elements.  Some components may also be members of a registry in their own right.

 

 

4.1 Abstraction types

 

Abstraction is a tool which has been well-developed by the object-oriented community. It is used as a way of focussing on parts of the model of interest to a particular process or function.  The term "abstraction" is used to refer both to the process and the results of the process.  Abstraction can be applied to the registry environment as a way to articulate the development of components and their relationships to each other.

 

Several methods can be used to achieve the decomposition of layers from the most abstract to the more concrete.  Starting with the most general conceptional notions and progressing to the data elements in applications, these layers can be labeled by the type or types of abstraction used to produce them from another level.

 

The three types of abstraction of most interest to data element development are: decomposition/aggregation, instantiation/classification, and specialization/generalization.

 

·       Decomposition/aggregation relates an item to its parts.  Decomposition may be described as "x is a part of y," or the part-of relationship.  The reverse, aggregation, shows that y may be composed of x among other items.

 

·       Instantiation/classification relates an item to a class of items.  This is described as the is-a relationship, "x is a(n instance of) y."  Classification reverses the relationship; y contains x as well as other items.

 

·       The third type is specialization/generalization.  This is a relationship between two classes, where all items in one (subclass) are also in the other (superclass).

 

 

 

4.2 Conformance

 

 

Layers of abstraction can be used to determine conformance of a registry implementation to a standard.  Specification of the member classes and abstraction types used to determine the layer members can be used to define conformance.  This will lead to improved chances for interoperability among registries.

 

 

4.3 Developing Layers of Abstraction

 

 

The process of deriving layers of abstraction for a registry can be described by a series of examples.  Some or all of these layers may be useful for any given registry.

 

Abstraction relationship types define the boundaries between layers.  Rules for conformance may be derived from both boundary abstraction and the relationships of the components of each layer.

 

A useful starting point is the set of real world things that the registry attempts to model.  These can be described by the phrases "concepts (things, beings, ideas…)," "things about them," "how they look," and "what they mean."  So, the first layer of abstraction is the translation of these phrases to model entities (figure 1).  Applying the abstraction process of specialization, the result is that concepts become object classes, things about concepts become properties, how they look becomes representation, and what they mean becomes the conceptual domain.  By this transformation, the amorphous content of superclasses of things in the real world become subclasses composed of entities of the model, subject to rules governing their behavior.

 

Of course, every model-based registry must include this layer.  This is the basic assumption of model building.

 

Within the model, other layers of abstraction can be applied to produce model entities of use to the developers and users.  For example, aggregation can be applied to the object class and property entities to produce the data element concept.  These can be related to conceptual domains (which contain sets of value meanings) to produce a potentially useful entity, the conceptual generic element (figure 2).

 

Conceptual generic elements consist of the attributes associated with their constituent components.  These serve to describe the object class property and its value meanings without any particular representation assigned. An example, using ISO 3166, would be to describe country identifier without specifying which one of the seven possible representations for names or codes for countries contained in ISO 3166 is preferred.

 

Consider representation.  It was mentioned earlier as if it was a model entity, but it does not exist as such in the model.  Representation is a combination of data value domain with its permissible values (if enumerated) or description (if not); representation class; and datatype, character set, and unit of quantity of the values in the value domain.  Therefore, it must be abstracted by aggregation if it is to be considered as a unit.

 

Combining a property with the representation components can create a useful construct.  A logical generic element such as "height measure in feet" can be used to record conformance criteria such as allowed range values.  A narrower construct, limiting the components to property and representation class, can be created to record generalized conformance criteria such as that "height measure" must only be used with units of measure with values of  "feet," "inches," "meters," "centimeters," etc.  These would potentially be combined with object classes to produce data elements such as "tree height measure" with a conformance criterion of "height >0<500" (figure 3).

 

Another useful object-oriented concept can be applied to allow inheritance of attribute values between layers.  This mechanism enables the process described in the last paragraph to be applied in many-to-one relationships: "height measure" can be applied to "telephone pole height measure" using the same conformance criterion as "tree height measure."

 

Other combinations of components can be created as the registry designer's discretion.  Documentation by attributes and relationships must be complete if registry content consistency is to be maintained.  Full use of generics promotes reuse of standardized data description.

 

 
Figure 1. Abstraction from the real world to the model.
Figure 2. Abstraction of a conceptual generic element.

 

 
 



Figure 3. Inheritance of component values.

 

 

 
 



REFERENCES (INFORMATIVE)

 

 

C                  Codes for the representation of names of countries and their subdivisions Part 1: Country codes, International Organization for Standards (ISO), ISO 3166:1997.

 

C                  Standard representation of latitude, longitude and altitude for geographic point locations, International Organization for Standards, ISO 6709, 1983-05-15.

 

C                  Information technology programming languages, their environments and system software interfaces language-independent datatypes, International Organization for Standards, ISO/IEC 11404, First edition 1996 12-15.

 

C                  Information technology specification of data value domains, ISO/IEC TR 15452, March 1999.

 


 

5.0       REGISTER A DATA ELEMENT

 

Registration of a data element in a data element registry requires that certain characteristics of the data element are recorded to clearly describe and define it.  These characteristics are stored as attributes of the data element.  A Registry can be used to record information about data elements ranging from carefully crafted data standards to those found in applications.  The amount and quality of metadata information available can vary from good, complete information to poor, incomplete information.  This document is intended to describe the population of a Registry with data elements for which good quality, consistent metadata can be created.  Part 3 of the ISO/IEC 11179 specifies attributes for recording information about a data element in a data registry.  This document gives examples that demonstrate the population of a data registry.  It includes attributes that are mandatory and fully defined by the metamodel, as well as those where the registration authority must establish its own profile of required attributes.

 

Many metadata registry practitioners find that using a bottom-up approach to registering a data element is most appropriate.  In many cases where a data element is submitted for registration, only limited information (e.g., a name, definition, and a set of permissible values) is provided.  Other attributes must be determined based on an understanding of the underlying data values and concepts that are implied by those facts.  These are most commonly registered by means of a bottom up registration procedure, where the basic metadata attributes about the data element (e.g., definition, name, and permissible values) are completed prior to defining the conceptual information about the data element.  A bottom-up approach might also be used where  the metadata registry is intended to serve as a distribution mechanism for metadata that describes the data in data products such as public data sets, query results, etc.  The examples provided in this report describe how to formulate attributes about a data element, based on a bottom-up procedure.  First a general procedure for registering data elements is described, followed by examples of registration of three types of data elements, including data elements from:

 

C                  An international standard with an enumerated domain.

C                  An international standard with a non-enumerated domain.

C                  An information system, where the application data element uses an enumerated domain. 

 

The registration procedures are presented in a logical order for analyzing and formulating attributes for a data element.  Annex F contains a table that concisely summarizes the information registered for each data element in the examples that follow. 

 

This report is intended to be used to help metadata registry practitioners to formulate the attributes that describe and define a data element.  Section 5.1 presents an overall approach to data element registration.  Sections 5.2, 5.3, and 5.4 should be consulted for more specific examples of registering the kinds of data elements described in international standards and in information systems.  Annex Y, which is based on ISO/IEC 11179, contains more detailed information and examples to assist the practitioner who is registering data elements.

 

A top-down approach is useful in many circumstances.  Although it requires more "up front" effort, top-down registration has the potential to produce more stable and uniform metadata.  An example of a top-down registration, where registration begins with identification of conceptual domains, is provided in informative Annex G with an example of registration of data elements about biological organisms.

 

5.1       General Procedures

 

Often only a limited amount of information is available about a data element that has been submitted for registration, e.g., the name and definition contained in a document or provided by the submitting organization and a set of permissible values, where  appropriate.  The general procedures that follow are intended to result in the registration of a complete, well-defined data element that meets the requirements of a particular registration authority. 

 

It should be noted that the metadata for some data elements in a registry will never be complete.  This is true of application data elements that are obtained from computer software, where very little information is known except the representational attributes (e.g., field length and datatype).  For these data elements, only the most basic attributes will be entered, and the data element's registration status will remain incomplete. 

 

5.1.1    Understanding the Data Element

 

The first step in the registration procedure is to gain an understanding of the data element.  What kind of data will be stored in this data element?  Is there a definition or description of the data values?  Were permissible values or examples of the data provided?  Will the data values be determined by an arithmetic or statistical procedure?  What will the data values look like; e.g., are they names or descriptions of things, numerals to be calculated, strings of characters and numbers that are identifiers?  Where documentation is inadequate to fully understand the data element, the practitioner must consult those who represent the source of the data element to obtain the necessary information. 

 

The result of this first step is an understanding of the semantic content of the data element. 

 

5.1.2    Content Research

 

Prior to formulating attributes towards registration of a new data element, the registrar should perform content research to determine whether a data element is described in an existing International or National standard, or whether a data element that has the potential for being reused exists in the registry or a federation of registries.  It is necessary to recognize that the registration practitioner must make value decisions when recording metadata into the metadata registry.  The practitioner will determine if a data element might be adapted to meet new requirements, or some attributes of an existing data element (e.g., value domain, data element concept, or conceptual domain) might be reused with the new data element.  Content research should include a search of conceptual domains, data element concepts, and value domains as well as data elements, to identify attributes that might be relevant to the data element to be registered.  If a standard data element exists that can be used as a model to meet the particular specifications for a new purpose, some of its attributes may be reused for registration of the new data element.

 

The result of this step is confirmation that a new data element is needed, or a decision to modify or reuse an existing data element. 

 

5.1.3    Definition and Permissible Values

 

The essential semantic content of a data element must be captured in a data element definition.  Part 4 of ISO/IEC 111179 describes rules and guidelines for formulating definitions.  Part 3 identifies the attributes for describing the domain of potentially valid (i.e., permissible) values.  The permissible values for a data element are defined as a value domain.  Examples are provided in Annex Y for formulating definitions, based on the rules and guidelines set forth in ISO/IEC 11179-4. Annex Y also contains detailed information about the attributes in value domains and examples of how those attributes are used for both enumerated (i.e., established through a list) and non-enumerated domains (i.e., specified through a formula, rule, procedure, or reference).

 

Different attributes are used depending upon whether the potentially valid values are enumerated or non-enumerated.  Each permissible value is associated with a valid value meaning that provides meaning to the permissible value, as described in Section 5.1.6.  Each permissible value is also entered in the registry with its begin date (i.e., the date when that permissible value became valid for that value meaning).  End dates will also be entered, when the permissible value for a value meaning becomes invalid. 

 

Value domains for non-enumerated domains must include a definition/description of the values that are possible valid values for the data element.  This report contains specific examples of registering data elements with enumerated domains (Sections 5.2 and 5.4) and with non-enumerated domains (Section 5.3). 

 

5.1.4    Name and Identifiers

 

Part 5 of ISO/IEC 11179 gives principles for naming and identification of data elements.  Each data element registered within a Registration Authority (RA), i.e.,  an organization authorized to register metadata, is unambiguously identified with a unique identifier.  Although the standard does not specify the format or content of the data element identifier (DI), the DI should carry no useful information about the data element, e.g., it might be a number assigned sequentially by an automated system.  If the attributes of a data element change, a new version of the data element is created and registered with a version identifier (VI). 

 

Since each RA establishes it's own identification scheme, the same DI might be used to identify a different data element in another metadata registry.  Therefore, a Registration Authority Identifier (RAI) must be established for unique identification of a data element.  Data elements registered under the provisions of ISO/IEC 11179 are assigned an international registration data identifier (IRDI), which is a composite of the RAI, the DI, and the VI.  Part 6 of ISO/IEC 11179 describes the requirements for a RA and the construction of a RAI.  The IRDI is discussed further in Part 6. 

 

Most people prefer to use names when talking about a data element, rather than a non-intelligible identifier.  Therefore, one or more names can be assigned to a data element, each associated with the context in which the name is used.  A name can be developed for a scientific discipline, an organization, a particular computer language, a database management system, or other purpose.  Each name is developed according to the naming convention for the particular name context.  The naming convention can vary from "whatever you want to call it" to a highly structured name.  ISO/IEC 11179, part 5 does not specify a mandatory naming convention, but does explain how to document one.  For this report, the data element names are based on a naming convention described in Annex Y.  Annex Y also expands on Part 5 of the standard by providing examples of the use of names and name contexts. 

 

5.1.5    Other Metadata Attributes

 

Other mandatory and optional data element attributes are described in Part 3 of ISO/IEC 11179.  In addition to the definitional attributes described in Section 5.1.3 and the identifying attributes described in Section 5.1.4, there are administrative, relational, classifying, and other miscellaneous attributes that serve to define and describe a data element. 

 

In addition to the mandatory attributes specified by Part 3 of the standard, a RA might establish a profile for a particular metadata registry, where some of the attributes described as optional in the standard are mandatory for that registry, some optional attributes are not included, and additional attributes might be identified to extend the registry. 

 

The attributes that relate data elements through data element concepts (Section 5.1,5), and those that classify data elements (Section 5.1.6) are described in subsequent sections of this report.  Many information sources do not provide information about the data element for these categories. Some administrative information is related to quality control, and is described in Section 5.1.8.  Annex Y includes detailed information about these metadata attributes. 

 

For the registration procedure described in this report, some administrative and miscellaneous attributes are recorded at this time, including:

 

C                  Submitting organization:  The submitting organization is the Office or organization that has submitted the data element for registration. 

 

C                  Data Steward:  The data steward is the individual who has been assigned by a submitting organization to be responsible for authorizing and maintaining one or more data elements. 

 

C                  Note:  A data element may have a "Note" or "Comment" that can be used to capture additional descriptive information about a data element, including usage, procedure, and other explanatory information that is not appropriate to include in the data element definition attribute.

 

C                  Example:  A data element shall be registered with an example, which must be one of the permissible values for enumerated value domains or must conform to the value domain description/definition and other value attributes for non-enumerated domains. 

 

C                  Origin.  A data element can be associated with any kind of source, including a document, standard, system, group, partner, or message set.  One source, as a minimum, must be associated with a data element to indicate the origin of information about the data element. 

 

5.1.6    Data Element Concept

 

At this stage in registering a data element, it is possible to specify conceptual information about the data element through the data element concept.  The data element concept can be thought of as an idea or perception about something, identified and described independently of any representation.  The data element concept may relate several data elements that record data about that concept with different representations, e.g., names and codes that represent provinces of Canada and share the same concept, which is "Canadian Province Identifier." 

 

The data element concept is singular (only one concept is represented).  It can be associated with many data elements, including other names and codes, and it does not include a representation class term in its name or definition.  The data element concept is associated with only one Conceptual Domain, as described in the following paragraph. 

 

Data element concepts are specified through a definition, an identifier, a name, and a conceptual domain, i.e., the meanings of the possible set of valid values for a data element, expressed without representation.  The conceptual domain, "Canadian Provinces", would include valid value meanings such as "The Canadian province of (Alberta,......., Yukon Territory)," where each value meaning would identify one Canadian province.  Each value meaning is entered in the registry, associated with its conceptual domain, with its begin date (i.e., the date when that value meaning became valid) and end (i.e., when the value meaning became invalid).  Permissible values are associated with value meanings, according to the representation defined by the value domain. 

 

Derivation of data element concepts and conceptual domains, including value meanings are described in detail in Annex Y.6.

 

5.1.7    Classification Attributes

 

The classification attributes are recorded, where appropriate, at this time.  Classification helps to add information not easily included in definitions, helps to organize the contents of a metadata registry, and helps to provide access by supporting more meaningful queries.  Part 2 of ISO/IEC 11179 describes general categories of classification; Part 5 describes three classified components: object class, property, and representation class.

 

A metadata registry might choose to classify data elements as groups, e.g., the group of data elements used in a mailing address, the group of data elements used to identify chemical substances, or the group of data elements that locate a point on the surface of the earth. 

Keywords might also be used to classify data elements, e.g., altitude, date, facility, industrial, and organization. 

 

5.1.8    Quality Control

 

Initially, only some of the attributes will be recorded for a newly registered data element.  Such a data element will be assigned the registration status of "incomplete."  When all of the mandatory data elements have been completed, but the quality of the metadata has not been verified, the registration status will be "recorded."  Through the quality review process, some data elements will be determined to be "certified," and some might become "standard."  The "standard" data element is the preferred data element to be used for data sharing, to ensure consistent representation and understanding of the data being communicated. 

 

Part 6 of ISO/IEC 11179 describes the registration process and the registration status assigned to a data element as the metadata are reviewed and quality is improved.  Many data elements might be entered into a data registry, but only a relatively small number of them might be assigned a "standard" registration status.  Annex Y describes the assignment of Registration and Administrative Status throughout the life cycle of a registered data element.  ISO/IEC 11179 Part 6 specifies the levels of registration status; the administrative statuses, however, are established for each registry by the RA. 

 

5.2       International Standard with Enumerated Domain

 

This section provides a specific example of the registration of a data element from an international standard, where the possible valid values are itemized.  The International Organization for Standardization (ISO) 3166-1:1997(E/F), Codes for the representation of names of countries and their subdivisions B Part 1: Country codes, is used as the source for this example.  ISO 3166:1997 is a complete revision of ISO 3166, which was first published in 1974.  The names of countries in the standard correspond to those given, in English and French, in the current "Terminology Bulletin B Country Names," issued by the United Nations Department of Conference Services, entitled "States Members of the United Nations, Members of the Specialized Agencies or Parties for the Statute of the International Court of Justice" and to those published in the "Standard Country or Area Codes for Statistical Use" established by the United Nations Statistics Division.  The full name is the formal title as notified by the country concerned to the UN Secretary General.

 

(ISO) 3166-1:1997(E/F) cancels and replaces the fourth edition (ISO 3166:1993) and comprises a consolidation of all changes to the lists of the fourth edition agreed to by the ISO 3166 Maintenance Agency: ISO 3166 Maintenance Agency Secretariat, c/o DIN Deutsches Institut für Normung e.V., Burggrafenstrasse 6, D-10787 Berlin, Germany. 

 

ISO 3166 includes the following domains: short country name in English, full (official) country name in English (not provided for all countries), 2-character alphabetic code, 3-character alphabetic code, 3-character numeric code, short country name in French, and full country name in French.

 

The following paragraphs are presented in the logical order for formulating attributes for a standard, enumerated data element, using the short English-language country name as the example.  The table in Section 5.2.10 contains all of the metadata attributes recorded for the enumerated data element from an international standard.

 

5.2.1    Understanding the Data Element

 

The data element to be registered is taken from an international standard, and it includes an authoritative conceptual domain of country identifiers for all of the countries of the world.  The short English-language name was selected for standardization because it has the most utility for information systems used by United States (U.S.) federal agencies as well as the private sector.  The short form of the English-language name is used by the U.S. Postal Service (USPS)for all outgoing international mail, in preference to any of the codes or full names that are included in the standard.  The name is also preferred by the USPS to any names that are used locally by a country to identify itself, e.g., Japan is recognized by the USPS in preference to Nihon, which is the country name commonly used by that country itself.  The short form of the name in English has been used in the development of ISO 3166 as the basis for assigning codes to avoid, wherever possible, any reflection of a country's political status. 

 

The English-language short name in the standard varies in length from four alphabetic characters (e.g., Peru) to 44 alphabetic characters (i.e., South Georgia and the South Sandwich Islands).  The names use the English language alphabet for their character set.

 

 

 

5.2.2    Content Research

 

Other standards that contain conceptual domains for country identification include U.S. Federal Information Processing Standards (FIPS), published by the U.S. Department of Commerce, Technology Administration, National Institute of Standards and Technology (NIST).  FIPS 10-4 is maintained by the Office of the Geographer and Global Issues, U.S. Department of State.  It is intended for use in activities by the Department of State and national defense programs, and can also be used for Federal interchanges of information with the non-Federal sector of the U.S.  FIPS 10-4, published in April 1995, reflects changes through May 6, 1993.  FIPS 104-1 implements an American National Standards Institute standard ANSI Z39.27-1984, and adopts, with qualifications, entities, names, and codes prescribed by ISO 3166.  FIPS 104-1 was last updated on May 12, 1986.  The maintenance organization is the National Bureau of Standards (now NIST) in coordination with the U.S. Department of State, the U.S. Board of Geographic Names, and the maintenance organization for ISO 3166.  There are no known plans to update either of the FIPS standards, and neither of these standards is recognized internationally.

 

An authoritative international source of value domains which has ongoing maintenance is a necessity for maintaining data values for the data elements identifying countries of the world.  Therefore, the ISO 3166:1997 is used as the origin of the data element for country name. 

 

5.2.3    Definition and Permissible Values

 

The definition and permissible values are the most important metadata attributes in uniquely describing a data element. 

 

5.2.3.1 Definition

 

Understanding that the essential meaning of this data element is to identify countries using a short name in the English-language, the data element definition can be formulated as "The short name of a country, represented in the English language."  This definition is formulated using the mandatory rules and guidelines established in ISO/IEC 11179-4.  The rules and guidelines from Part 4 are described with examples in Annex Y.2. The definition is singular, since any instance of the data element contains only one value.

 

5.2.3.2 Permissible Values

 

The permissible values for the data element are the short names in English, listed in ISO 3166 (e.g., Afghanistan, Albania, ......., Zimbabwe). Each permissible value is entered into the registry with the date when that permissible value was valid for that value domain (in this case the date is January 10, 1997, the same as the begin date for the value meaning).  There is no end date to enter at this time.

 

The scope of the permissible values for this data element includes the short English-language name for all countries.  A value domain is defined as the permissible values for a data element.  For this example, the value domain is described as "All short, English-language names of all countries."  Note that Part 3 of ISO/IEC 11179 does not require a description or definition for enumerated domains.  Some RA, however, prefer that all value domains be registered with a description/definition.  Record the other value domain attributes for this example at this time, including:

 

C                  Character Set:  The character set for Short English-Language Country Name is "English language."

C                  Domain Type:  Country names are a fixed list of countries, maintained by international standards; therefore, the domain type is "enumerated."

 

C                  Datatype:  The datatype for country name is "alphanumeric."

 

C                  Maximum and minimum field lengths:   Based on prior research (Section 5.2.1), the minimum length for values for the data element is known to be four.  The known maximum length for names in the current standard is 44.  The maximum field length, however, is set to 60, to accommodate any changes or additions to the domain of values.

 

C                  Format:  The format selected by the registration authority for this example is A(60) to accommodate the longest of the English-language short names. 

 

5.2.4    Identify and Name the Data Element                                                                     

 

Name do not identify a data element.  Identification requires a unique identifier, preferably one that does not contain information about the data element.  The name provides a designator so that users of the registry have terms by which they refer to the data element.

 

5.2.4.1 Identification

 

Assign a unique identifier to the data element for short English-language country name, as described in Annex Y for the identification of data elements. In the metadata registry for this example, a unique DI and VI (20903:1) are assigned by the computer at the time of registry.

 

5.2.4.2 Name Context and Naming Convention

 

ISO/IEC 11179 Part 5 describes the naming of data elements.  Annex Y gives examples of name contexts and naming conventions.  For this international standard data element, the name is assigned the context of "Registry," and it is derived based on the example naming convention provided in Annex Y and summarized as follows:

 

Scope:  The scope of this example naming convention is Registry Name.                 

 

$                Authority:  The authority for this example is the U.S. Environmental Protection Agency for its Environmental Data Registry.

 

$                Semantic Rules:  Names shall include an object and a property, where appropriate.  Qualifiers shall be used to differentiate between names that would otherwise be the same.  The representation class term shall always be included as the last term in the name.

 

$                Lexical Rules:  A data element name shall have a maximum of 100 alphanumeric  characters.  The language of the registry shall be English, and the character set ASCII. There are no controlled word lists.

 

$                Name Uniqueness:  Names shall be unique within a registration authority.

 

5.2.4.3 Name the Data Element

 

Using the above naming convention, the name is entered with the context of "Registry."  The convention specifies that the name should include the object "Country", to indicate the data values to be stored in the data element.  The name should also include the representation for the concept, in this example "Name."  For this particular example, it is necessary to qualify the name, since there are four value domains of country names in the ISO 3166 standard.  The qualifiers: "short" and "English-language" are appropriate to this example.  The name that has been formulated for this data element, therefore, is "Short English-Language Country Name." 

 

5.2.5    Other Metadata Attributes

 

Other metadata attributes that can be recorded at this time are:

 

C                  Select the example for this data element; it must be one of the permissible values in the value domain.  

 

Example:  China

 

C                  Identify the origin for this data element as the standard from which the permissible values are obtained. 

 

Origin:  ISO 3166-1:1997, Codes for the representation of names of countries and their subdivisions - Part 1: Country codes (Document)       

 

C                  Record any notes or comments that might provide additional information about the data element that is not included in the definition.

 

Note:   This data element is included in the EPA revised interim Facility Identification Standard.

 

C                  Enter the name of the submitting organization, which is the Office that submitted the data element for registration.

 

Submitting Organization: Office of Information Resources Management

 

C                  Record the name of the individual or organization assigned the responsibility for monitoring and maintaining the data element as the data steward.

 

Data Steward: Marian Cody

 

C                  Administrative metadata, such as Create Date and User Name are recorded or captured automatically by the system where applicable. 

 

5.2.6    Data Element Concept

 

Identification of the data element concept, as described in Section 5.1.6 is based on the data element name and definition, without the representation.  The concept represented by the data element "Short English-Language Country Name" is "Country Identifier," defined as "An identifier for a primary geopolitical entity of the world."  This concept can be represented by all seven of the names and codes included in ISO 3166. 

 

The conceptual domain is a collection of value meanings that provide meaning to the permissible values for a data element.  The conceptual domain that contains value meanings related to the identity of countries of the world is named "Countries of the World."  It is defined as "The primary geopolitical entities of the world."  The value meanings associated with this conceptual domain are defined as "The primary geopolitical entity of the world known as <country name>," where country name is one of the country names listed in ISO 3166.  Each value meaning is identified by its own value meaning identifier (VMID) and each is entered into the registry with the date when that value meaning was entered into the conceptual domain (in this case the date is January 10, 1997).  End dates will also be entered, when the value meaning becomes invalid (e.g., when a country name changes or the territory of a country changes to be combined with another country or to be subdivided into two or more other countries). 

 

5.2.7    Classification

 

This data element might be classified according to the following classification schemes:

 

·       Identify one or more keywords, where the keyword is a name or subject matter descriptor that will facilitate grouping like data elements for retrieval.

 

Keyword: Country.

 

·       Group Short, English-Language Country Name with similar data elements according to concept for translation or by general subject matter.

 

Conceptual group: Country Identifiers

            Subject group:       Geopolitical Entities.

 

 

 

·       Identify the class by which this data element is represented. 

 

Representation Class: Name

 

$                One or more real world objects that identify this data element can be identified at this time.

 

Object: Country

 

5.2.8    Quality Control

 

When all of the mandatory metadata attributes have been entered for this data element, it is assigned the Registration Status of "Recorded" and the administrative status of "In Quality Review."  Because the data element was identified by an international standard, and it is expected to be the preferred data element for representing country name within the example metadata registry, the registration status will be updated to AStandard@ with administrative status AFinal@ -after the necessary quality review has been completed. 

 

5.2.9    Other Codes and Names from ISO 3166

 

Other codes, official English names, and French names (both official and short) from ISO 3166 are registered with their individual value domains, representation, data element definitions, and data element names.  All of the data elements associated with ISO 3166 will share the same data element concept (i.e., Country Identifier, defined as "An identifier for a primary geopolitical entity of the world.") and the same conceptual domain (i.e., Countries of the World, defined as "The primary geopolitical entities of the world.").  All of the ISO 3166 data elements will share the same value meanings.  They will, however, have different sets of permissible values associated with the value meanings, depending upon the data element, its representation, and its value domain.

 

5.2.10  Summary of Attributes

 

The metadata attributes that have been assigned to this data element, the short, English-language country name identified by the ISO 3166:1997 standard, are summarized in the following table, and in the first column of the table in Annex F. 

 


 

             Data Element

Meta--            Example

model            

Attribute Name

 

 

ISO 3166

Enumerated,

Name

 

 

1.  Data Element Definition and Permissible Values

 

 

 

 

 

Data Element Definition Context

 

 

Registry

 

 

 

 

Data Element

Definition

 

 

The English-language short name of a country.

 

 

 

 

Permissible Values

 

 

All English-Language Short Country Names from ISO 3166, matched with value meanings.  (Afghanistan, Albania,......, Zimbabwe)

 

 

 

 

PV Begin Date

 

 

19971001

 

 

 

 

PV End Date

 

 

(Not Applicable)

 

 

 

 

 

Value Domain Definition

 

 

All English-language short  names of all countries.

 

 

 

 

Character Set

 

 

English language

 

 

 

 

Domain type

 

 

Enumerated

 

 

 

 

 

 

Determinant Type

 

 

(Not Applicable)

 

 

Range Limits

 

 

(Not Applicable)

 

 

 

 

Datatype

 

 

Alphanumeric

 

 

 

 

Minimum

 

 

4

 

 

 

 

Maximum

 

 

44

 

 

 

 

Format

 

 

A(60)

 

 

 

 

Unit of Measure

 

 

(Not Applicable)

 

 

 

 

Precision

 

 

(Not Applicable)

 

 

2.  Data Element Name and Identifier

 

 

 

 

 

 

 

Data Element Name Context

 

 

Registry

 

 

Data Element Name

 

 

Short English-Language Country Name

 

 

 

 

DE Identifier/ Version Number (DI:VI)

 

 

20903:1

 

 

3.  Other Metadata Attributes

 

 

 

 

 

Example

 

 

China

 

 

 

 

 

 

 

Origin

 

 

ISO 3166-1:1997, Codes for the representation of names of countries and their subdivisions B Part 1: Country codes (Document)

 

 

Note/Description

 

 

This data element is included in the EPA revised interim Facility Identification Standard.

 

 

Submitting organization

 

 

Office of Information Resources Management

 

 

Data Steward

 

 

Marion Cody

 

 

4.  Data Element Concept (DEC)

 

 

 

 

 

 

Data Element Concept Name

 

 

Country Identifier

 

 

 

 

Data Element Concept Definition

 

 

An identifier for a primary geopolitical entity of the world.

 

 

 

 

Conceptual Domain Name

 

 

Countries of the World

 

 

 

 

Conceptual Domain Definition

 

 

The primary geopolitical entities of the world.

 

 

 

 

 

 

Enumerated Value Meaning Text

 

 

The primary geopolitical entity known as <China>.

 

 

VM Begin Date

 

 

19971001

 

 

VM End Date

 

 

(Not Applicable)

 

 

Classification

 

 

 

 

 

 

 

 

Keyword

 

 

Country

 

 

 

 

Group

 

 

Country Identifiers, Geopolitical Entities

 

 

 

 

Representation Class

 

 

Name

 

 

 

 

Object

 

 

Country

 

 

Quality Control

 

 

 

 

 

Registration Status

 

 

Standard

 

 

 

 

Administrative Status

 

 

Final

 

 


5.3       International Standard with Non-Enumerated Domain

 

This section provides a specific example of the registration of a data element from an international standard, where the possible valid values are not enumerated, but must be determined by a procedure.  The International Organization for Standardization (ISO) 6709-1983 (E), Standard representation of latitude, longitude and altitude for geographic point locations, is used as the source for this example.  ISO 6709 was developed by ISO Technical Committee ISO/TC 97, Information processing systems, and was circulated to member bodies in November 1981.  Eighteen countries approved the standard, no member body expressed disapproval.  There is no known schedule for review and update of the standard.  ISO/TC 32 has been assigned as the maintenance authority for the standard; ISO/TC 211 has expressed an interest in assuming responsibility for its maintenance.  

 

The table in Section 5.3.10 contains all of the metadata attributes recorded for the non-enumerated data element from an international standard.

 

5.3.1    Understanding the Data Element

 

Latitude is a measure of the angular distance on a meridian north or south of the equator.  The standard provides for a variable format and more than one representation for recording the latitude measure (i.e., degrees and decimal degrees and sexagesimal [i.e., degrees, minutes, and seconds.  The standard also includes more than one representation and format for longitude, and a flexible format for altitude.  In addition, a standard format for data transfer is included in the standard. 

 

Although new technology and new tools (e.g. Global Positioning System [GPS]) and analytical and mapping software have caused some geographic information specialists to prefer the measurement of locational coordinates in degrees and decimal degrees, many organizations continue to measure latitude and longitude in degrees, minutes, and seconds.  Therefore, the RA of the metadata registry in this example, has determined a need to register a data element for latitude measured in degrees, minutes, and seconds.  According to the standard, the placement of the decimal point indicates the transition from degrees to sexagesimal measures.  Examples of data in the standard include sexagesimal latitudes that are measured to a range of one or two decimal places for seconds.  The standard, however, does not limit the precision, but requires only that the number of decimal places indicate the precision of the measurement.  The RA for this example requires that latitude be recorded up to 5 decimal positions, where it can be measured to that level of precision. 

 

Latitude values are measured in a range of 0 (on the equator) to 90 degrees.  Minutes and seconds each are measured in a range of 0 to 60.  Latitude values on or North of the equator are recorded as positive numbers; those South of the equator are negative.  Where latitude degrees are measured in single digit, they must be recorded with a preceding zero.  For data transfer, latitude measures must be preceded by the directional symbol (+ or -), and they must include decimal point, when the measurement includes decimal seconds.  Latitude always precedes longitude, which precedes altitude.  The latitude and longitude must be expressed in the same format style and to the same precision (indicated by the number of decimal positions).  There are no separators between the latitude, longitude, and altitude; the directional symbol serves as a separator for the data element values.   

 

5.3.2    Content Research

 

Part 11 of ISO 15046, Spatial referencing by coordinates, describes the minimum data required to define 1-, 2-, and 3-dimensional coordinate reference systems.  The coordinate reference system must be fully defined for a position to be unambiguous.  Knowledge of the reference system is necessary to determine if coordinate points are comparable.   The standard does not, however, provide information about representation of the coordinates.  ISO/TC 211/ WG 3, the workgroup that is currently revising ISO 15046, has expressed an interest in revising (ISO) 6709-1983 (E), Standard representation of latitude, longitude and altitude for geographic point locations.  Because of TC211=s interest in ISO 6709, and their current work on the closely related standard, ISO 15046, it seems likely that ISO 6709 will soon be reviewed and updated if needed.  Therefore, ISO 6709 seems appropriate to be identified as a standard data element for latitude measure where latitude is measured as sexagesimal  (i.e., in degrees, minutes, and seconds). 

 

A search of the metadata registry in our example reveals about 40 data elements related to latitude measure.  One, an EPA interim standard for latitude, measured in degrees and decimal degrees, is compliant with the ISO 6709 data element for degrees.  None of the other data elements has the potential for compliance with ISO 6709 for sexagesimal measure of latitude.  The other latitude data elements in the registry have been assigned the registration status of incomplete, and many data elements are qualified (e.g., latitude where a facility is located, latitude of a smoke stack).  For the purpose of this example, none have the potential for being modified to meet the requirements of the ISO 6709 standard for latitude, measured in degrees, minutes, and seconds.

 

Therefore, in this example, the ISO 6709 latitude, sexagesimal  measure, is selected for registration as a new data element. 

 

5.3.3    Definition and Permissible Values

 

5.3.3.1 Definition

 

The data element definition is formulated according to the rules and guidelines described in Annex Y, based on ISO/IEC 11179-4.  The rules require that a data element definition be unique within the registry, so the unit of measure has been included in the definition as "The sexagesimal measure of the angular distance on a meridian north or south of the equator."  Including the unit of measure in the definition distinguishes the data element from the EPA interim standard, defined simply as "The measure of the angular distance on a meridian north or south of the equator."  The definition is singular, because it refers to only one instance of the data value.  Note that ISO 6709 does not include a definition for latitude. 

 

5.3.3.2 Permissible Values

 

ISO 6709 is an international standard that does not list specific values that are valid for the data element; the measure of latitude is a non-enumerated domain.  There are no stored permissible values in a registry for non-enumerated domains.  The values that are permissible for the ISO 6709 sexagesimal latitude data element are those values that conform to the definition of the value domain and the attributes for datatype, format, unit of measure, and precision.  The value domain for sexagesimal latitude can be described as "All sexagesimal measures of the distance of an angle north or south of the equator."  By including the unit of measure in the definition, the value domain is distinguished from the value domain description for latitude measured in degrees.  The definition is plural, because it includes all possible measurements of latitude determined by this type of measurement. 

 

Latitude values that are measured as degrees, minutes, and seconds must conform to the format +/‑DDMMSS to +/-DDMMSS.SSSSS.  The precision of the value is indicated by the number of decimal places recorded. 

 

Other value domain attributes for this example include:

 

$                Character Set.  The character set for latitude measure is "English language."

 

$                Domain Type.  Non-enumerated.

 

$                Description/definition.  All sexagesimal measures of the distance of an angle north or south of the equator.

 

$                Datatype.  The datatype for latitude measure is "alphanumeric" to explicitly include the directional symbol and decimal point, where appropriate. 

 

$                Maximum and minimum field lengths.   The known minimum field length at this time is seven (+/- DDMMSS) where no decimal seconds are recorded.  The maximum field length is 13 (+/- DDMMSS.sssss), to accommodate up to five decimal places for seconds.

 

$                Format.  The format selected by the registration authority for this example is A(13) to accommodate the maximum number of decimal positions. 

 

$                Range for degrees is 0-90; for minutes is 0-60; for seconds is 0-60.

 

5.3.4    Identifying and Naming the Data Element

 

5.3.4.1 Identifiers

 

A unique identifier is required for the latitude data element.  For the RA in this example, the DI and VI (312345:1) are assigned automatically by the metadata registry software. 

 

5.3.4.2 Name Context and Naming Convention

 

For this ISO standard data element, the name is assigned with the context of Registry, using the naming convention described in the example in Annex Y, summarized as follows:

 

$                Scope:  The scope of this example naming convention is Registry Name.

 

$                Authority:  The authority for this example is the U.S. Environmental Protection Agency for its Environmental Data Registry.

 

$                Semantic Rules:  Names shall include an object and a property, where appropriate.  Qualifiers shall be used to differentiate between names that would otherwise be the same.  The representation class term shall always be included as the last term in the name.

 

$                Lexical Rules:  A data element name shall have a maximum of 100 alphanumeric  characters.  The language of the registry shall be English, and the character set ASCII. There are no controlled word lists.

 

$                Name Uniqueness:  Names shall be unique within a registration authority.

 

5.3.4.3 Name the Data Element

 

Using the above naming convention, the name is entered with the context of "Registry."  The convention specifies that the name should include the object "Latitude", to indicate the data values to be stored in the data element. Include the representation for the concept in the name; in this example "Measure."  There is no requirement in ISO/IEC 11179 Part 5 that data element names be unique in a registry.  However, the naming convention used in this example specifies that names must be unique within a registry.  It is advisable to use a qualifier in the data element name to differentiate between data elements that might otherwise have the same name. The name includes the object (latitude) and the representation (measure).  For this example, the name of the latitude data element will carry the qualifier "sexagesimal" as a discriminator.  The name that has been derived for the latitude data element is "Latitude Sexagesimal Measure."

 

5.3.5    Other Metadata Attributes

 

Other metadata attributes that can be recorded at this time are:

 

$                Provide an example of the data value that conforms to the description in the value domain, and to the datatype, format, and other value domain attributes for this data element.

 

Example: +674532 and +674531.85435

 

$                Record the origin of this data element as the standard where the data element was identified.

 

Origin: ISO 6709-1983 (E), Standard representation of latitude, longitude and altitude for geographic point locations. 

 

$                Record notes and comments that contain additional information about the data element that is not appropriate for the definition.        

 

Note: Latitude sexagesimal converts to latitude degrees by the following formula: seconds x 60 = decimal minutes, total minutes x 60 = decimal degrees.

 

$                List the Office that submitted the data element for registration as the submitting organization.   

 

Submitting Organization: Office of Information Resources Management

 

$                The organization or individual that has responsibility for maintaining and updating the data element is recorded as the data steward for that data element. 

 

Data Steward: Larry Fitzwater

 

$                Administrative metadata, such as Create Date and User Name are recorded or captured automatically by the system where applicable. 

 

5.3.6    Data Element Concept

 

The methodology to be used for deriving a data element concept is described in Section 5.1.6 and Annex Y of this document.  A data element concept is the data element without representation. We have indicated previously that latitude is a distance measure, where measure is its representation.  The data element concept for latitude measure is "Latitude Distance" with the definition "A measure of the angular distance of a point on the surface of the earth north or south of the equator."  Note that this concept definition incorporates the  term "measure," which is a representation term.  The concept of latitude, however, is the measure of a distance.  Therefore, it is appropriate in this instance to use the term measure when defining the concept. 

 

A conceptual domain is a collection of value meanings.  The collection must be identified with a name and a definition.  The latitude is one of the horizontal coordinates that fix a position on the surface of the earth either north or south of the equator.  For this example, the name of the conceptual domain for latitude measure is "Latitude Coordinates" with the definition "The coordinates that indicate the distance north or south of the equator for locations."

 

For non-enumerated domains, such as latitude measure, the value meanings are not explicitly identified.  The conceptual domain for the Latitude Distance data element concept is the perceived repository of all latitudes that mark positions on the earth with relation to the equator.  The value meanings could be defined as "The distance measure of a point north or south of the equator that is <value>."  No value meanings are stored in the registry. 

 

5.3.7    Classification

 

This data element might be classified according to the following classification schemes:

 

$                Identify one or more keywords, where the keyword is a name or subject matter descriptor that will facilitate grouping like data elements for retrieval.

 

Keyword: Latitude, Horizontal Coordinate, Spatial

 

$          Group Short, English-Language Country Name with similar data elements according to concept for translation or by general subject matter.

 

Subject group: Geographic Point Location.

 

$                Identify the class by which this data element is represented. 

 

Representation Class: Measure

 

$                One or more real world objects that identify this data element can be identified at this time.

 

Object: Latitude

 

5.3.8    Quality Control

 

When all of the mandatory metadata attributes have been entered for this data element, it is assigned the registration status of "Recorded" and the administrative status of "In Quality Review."  This data element was identified in an international standard, and so would soon be updated to reflect higher status of the data element.  The data element, however, would not be expected to be assigned the status of AStandard.@  The data element is not expected to be come the preferred representation for latitude measure, since geographic information specialists prefer that latitude and longitude be recorded in degrees and decimal degrees.  Therefore, after quality review has been completed, the data element will be assigned the registration status of ACertified@ with an administrative status of ANo further action.@

 

5.3.9    Other Data Elements in ISO 6709

 

ISO 6709 identifies five data elements: sexagesimal latitude, degrees latitude, sexagesimal longitude, degrees longitude, and altitude.  The different formats represented by the units of measure for latitude (i.e., degrees and sexagesimal) express representation (i.e., unit of measure).  The two latitude data elements from ANSI 6709 are translatable at the concept level, based on their unit of measure representations.  They share the same conceptual domain, because their implied value meanings are the same.  Likewise, the longitude data elements share a data element concept and a conceptual domain, and longitude data can be translated based on unit of measure conversions.  . 

 

Whereas the multiple data elements identified in ISO 3166 share the same data element concept and the same conceptual domain, the data elements identified in ISO do not share data element concepts and conceptual domains. All three concepts: latitude, longitude, and altitude, are distance measures.  Latitude, however, is a north/south measure with respect to the equator; longitude is an east/west measure with respect to the prime meridian; and altitude is a vertical measure with respect to a point of reference such as sea level.  Each has its own data element concept and its own conceptual domain. 

 

These data elements do share classification.  All can be classified as the group "Geographic Point Location" and as the representation class "Measure."

 


5.3.10  Summary of Metadata Attributes

 

The following table summarizes the metadata attributes assigned to latitude sexagesimal measure in the preceding paragraphs in Section 5.3.  The table in Annex F also contains this data in the second metadata column. 

 

 

             Data Element

Meta--            Example

model            

Attribute Name

 

 

ISO 6709

Non-enumerated,

Latitude

 

 

1.  Data Element Definition and Permissible Values

 

 

 

 

 

Data Element

Definition

 

 

The measure in degrees of the angular distance of a position on earth on a meridian north or south of the equator.

 

 

 

 

Permissible Values

 

 

Measures of Latitude in Degrees, Minutes, and Seconds

 

 

 

 

PV Begin Date

 

 

(Not Applicable)

 

 

 

 

PV End Date

 

 

(Not Applicable)

 

 

 

 

 

Value Domain Definition

 

 

All measures of the distance of an angle north or south of the equator measured in degrees, minutes, and seconds. 

 

 

 

 

Character Set

 

 

English language

 

 

 

 

Domain type

 

 

Non-enumerated

 

 

 

 

 

 

Determinant Type

 

 

Range

 

 

Range Limits

 

 

00-90

 

 

 

 

Datatype

 

 

Alphanumeric

 

 

 

 

Minimum

 

 

7

 

 

 

 

Maximum

 

 

13

 

 

 

 

Format

 

 

A(13)   +/-DDMMSS.SSSSS

 

 

 

 

Unit of Measure

 

 

Sexagesimal

 

 

 

 

Precision

 

 

Number of decimal places recorded.

 

 

2.  Data Element Name and Identifier

 

 

 

 

 

 

 

Data Element Name Context

 

 

Registry

 

 

Data Element Name

 

 

Latitude Sexagesimal Measure

 

 

 

 

DE Identifier/ Version Number (DI:VI)

 

 

312345:1

 

 

3.  Other Metadata Attributes

 

 

 

 

 

 

 

 

Example

 

 

+674532 and +674531.85435

 

 

 

 

 

 

 

 

Origin

 

 

ISO 6709-1983 (E), Standard representation of latitude, longitude and altitude for geographic point locations. 

 

 

 

 

Note/Description

 

 

Latitude on or north of the equator is preceded by a plus sign; south of the equator by a minus sign.

 

 

Submitting organization

 

 

Office of Information Resources Management

 

 

Data Steward

 

 

Larry Fitzwater

 

 

4.  Data Element Concept (DEC)

 

 

 

 

 

 

 

 

 

Data Element Concept Name

 

 

Latitude Distance

 

 

 

 

Data Element Concept Definition

 

 

A measure of the angular distance of a point on the surface of the earth north or south of the equator

 

 

 

 

Conceptual Domain Name

 

 

Latitude Coordinates

 

 

 

 

Conceptual Domain Definition

 

 

The coordinates that indicate the distance north or south of the equator for locations.

.

 

 

 

 

 

 

Enumerated Value Meaning Text

 

 

(Not Applicable)

 

 

VM Begin Date

 

 

(Not Applicable)

 

 

VM End Date

 

 

(Not Applicable)

 

 

5. Classification

 

 

 

 

 

 

 

 

Keyword

 

 

Horizontal Coordinate,

Latitude

 

 

 

 

Group

 

 

Geographic Point Locations

 

 

 

 

Representation Class

 

 

Measure

 

 

 

 

Object

 

 

Latitude

 

 

6. Quality Control

 

 

 

 

 

 

 

 

Registration Status

 

 

Recorded

 

 

 

 

Administrative Status

 

 

In quality review

 

 

5.4       Application Data Element