ISO/IEC NWI xxx
Information
Technology
Procedures for
Achieving Content Consistency
In
ISO/IEC 11179
Metadata Registries
Working Paper
Draft 3.0
September 1999
Procedures for Achieving
Data Registry Content Consistency
Contents
-------------------------------------------------------------
Foreword
Introduction
1 Scope
2 References
3 Definitions
4
Component framework
5.0 REGISTER A DATA ELEMENT
5.1 General Procedures
5.1.1 Understanding
the Data Element
5.1.2 Content
Research
5.1.3 Definition and Permissible Values
5.1.4 Name
and Identifiers
5.1.5
Other Metadata Attributes
5.1.6 Data
Element Concept
5.1.7 Classification
Attributes
5.1.8 Quality
Control
5.2 International Standard with Enumerated
Domain
5.2.1 Understanding
the Data Element
5.2.2 Content
Research
5.2.3 Definition
and Permissible Values
5.2.4 Identify
and Name the Data Element
5.2.5 Other
Metadata Attributes
5.2.6 Data
Element Concept
5.2.7 Classification
5.2.8 Quality
Control
5.2.9 Other
Codes and Names from ISO 3166
5.2.10 Summary
of Attributes
5.3 International Standard with
Non-Enumerated Domain
5.3.1 Understanding
the Data Element
5.3.2 Content
Research
5.3.3 Definition
and Permissible Values
5.3.4 Identifying
and Naming the Data Element
5.3.5 Other
Metadata Attributes
5.3.6 Data
Element Concept
5.3.7 Classification
5.3.8 Quality
Control
5.3.9 Other
Data Elements in ISO 6709
5.3.10 Summary
of Metadata Attributes
5.4 Application Data Element
5.4.1 Understanding
the Data Element
5.4.2 Content
Research
5.4.3 Definition
and Permissible Values
5.4.4 Identify
and Name the Data Element
5.4.5 Other
Metadata Attributes
5.4.6 Data
Element Concept
5.4.7 Classification
5.4.8 Quality
Control
5.4.9 Related
Data Elements
5.4.10 Summary
of Metadata Attributes
5.5 Register a Group of Data Elements
5.5.1 Information
System Entity Group
5.5.2 Composite
Data Element
5.5.3 Use
Group
5.6 Linking of Data Elements
5.7 Registration of Associated
Sources/Documents
6.
Complex data
Annexes
A Bibliography
B Definitions of
representation class terms
C Principles of managing
shared data
D Data registry uses and
users
E Conceptual and logical data
models
F Table of Data Elements Attributes for Examples
G Top Down Approach to Data Element Registration
G.1 Biological Organisms
G.1.1 Data Element Concepts
G.1.2 Data Elements
G.1.3 Permissible Values
G.2 Biological Organism Types
G.2.1 Data Element Concepts
G.2.2 Data Elements
G.2.3 Permissible Values
G.3 Top Down Registration
Y Business Rules for Populating a Metadata Registry
Y.1 Data Element Definition
Y.1.1 Mandatory Rules
Y.1.1.1 Uniqueness
Y.1.1.2 Singular
Y.1.1.3 State
the Concept; Not Only its Negative
Y.1.1.4 Descriptive Phrase or Sentence
Y.1.1.5 Contain
Only Commonly Used Abbreviations
Y.1.1.6 No
Embedded Definitions
Y.1.2 Guidelines for Definitions
Y.1.2.1 Essential
Meaning of Concept
Y.1.2.2 Precise
and Unambiguous
Y.1.2.3 Concise
Y.1.2.4 Stand
Alone
Y.1.2.5 No
Embedded Information
Y.1.2.6 Avoid
Circular Reasoning
Y.1.2.7 Consistency
for Related Definitions
Y.1.3 Data Element Definition Syntax
Y.1.4 Terms Commonly Used in Definitions
Y.2 Representational Attributes
Y.2.1 Permissible Values
Y.2.2 Value Domain
Y.2.3 Representational Terms
Y.2.4 Example
Y.3 Identifying and Naming a Data Element
Y.3.1 Name Context
Y.3.2 Establish a Naming Convention
Y.3.3 Example of a Naming Convention
Y.3.4 Formulating a Data Element Name
Y.4 Identification
Y.4.1 Data Element Identifier and Identifier
Y.4.2 Versioning
Y.5 Conceptual Relationships
Y.5.1 Data Element Concept
Y.5.2 Conceptual Domain
Y.5.3 Value Meanings
Y.6 Classification
Y.7 Quality Review
Y.7.1 Registration Status
Y.7.2 Administrative Status
Y.8 Reference Documents
Foreword
ISO
(the International Organization for Standardization) and the IEC (the International
Electrotechnical Commission) form the specialized system for worldwide
standardization. National bodies that
are members of ISO or IEC participate in the development of International
Standards through technical committees established by the respective
organization to deal with particular fields of technical activity. ISO and IEC technical committees collaborate
in fields of mutual interest. Other
international organizations, governmental or non-governmental, in liaison with
ISO and IEC, also take part in the work.
This
document was prepared by ISO/IEC JTC 1/SC 32, Data Management and Interchange.
Introduction
The
exchange of metadata between ISO/IEC 11179 metadata registries depends not only
on registry software that conforms to the standard, but also on metadata
contents that are compatible between registries. While the standard has
provisions for data element specification and registration, there are pragmatic
issues pertaining to populating the registries with content. Based on the experiences of organizations
that are implementing the standard, a technical report to explore content
issues will help current and future users.
Well-formed
data elements and their domains can be recorded in a metadata registry as
"models" for potential reuse. Additional attributes may be required
to record essential facts about how a data element is used in an application,
e.g., for data quality, collection method, collection purpose, etc.
The
proposed revision of ISO/IEC 11179, Part 3, models a data element (DE) and its
associated components. A data element
consists of the data element concept plus its representation. Some questions raised in the process of
implementing registries concern this structure. Creation of an application data element frequently requires
additional qualification of the object class and/or property. Does this creation of an application element
always cause the creation of an application data element concept? Does the qualified concept inherit meaning
from the standard concept to which it is related, and is there an adequate
place in the current scheme to store this relationship? How are application DEC’s distinguished from
other DEC’s or is there a need to make such a distinction? These are examples of topics that might be explored
in a document addressing content consistency among registry implementations.
Conceptualization
and articulation of rules and relationships in the creation of object classes,
properties, data element concepts and data elements are needed. Explication of the various possible levels
of data elements and data element concepts and their relationships would
greatly assist in the creation of shareable, well-formed data. Relationship and inheritance from the most
abstract data element to the most concrete application data element needs to be
specified. Reuse of data value domains
should be enabled and regularized.
1 Scope
1.1 Background
A registry is a tool for the management of shareable
data; a comprehensive, authoritative source of reference information about
data. It does not contain data itself, but it provides information on the
definition, origin, source, and location of data. It supports the standard‑setting
process by recording and disseminating data standards, which facilitates data
sharing among organizations and users.
It provides links to documents that refer to data elements and to
information systems where data elements are used. When used in conjunction with an information database, the
registry enables users to better understand the information obtained.
This
Technical Report is based on the American National Standard Institute (ANSI)
X3.285:1999 Standard, Metamodel for the
Management of Shareable Data. The
standard specifies the structure of a data registry in the form of a conceptual
model. The conceptual model is more
abstract than a logical data model in that it does not consider how the data is
represented in any particular way. It
is not intended to be a logical data model for a computer system, much less a
physical model.
A
data registry contains the metadata that is necessary to clearly describe,
inventory, analyze, and classify data.
It provides an understanding of the meaning, representation, and
identification of a unit of data. The
ANSI X3.285 standard "outlines the information elements associated with a
data element concept that need to be available for determining the meaning of a
data element to be shared between systems.
The standard is a complement to the six-part International Organization
for Standardization/International Electrotechnical Commission (ISO/IEC) 11179
standard that describes the organization of a data registry for managing the
semantics of data elements in data systems."[1]
1.2 Purpose
The
purpose of this Technical Report is to describe business rules for the
registration of data elements and their attributes in a registry. This document is not a user’s guide for data
entry, but a guide for conceptualizing a data element and its components for
the purpose of consistently establishing good quality data elements.
1.3 Scope
The scope of this document
is limited to the essential components of a data element: the data element
identifier, registry name, definition, and example; data concept; conceptual
domain with its value meanings; and value domain with its permissible
values. This document is not concerned
with the entry of detailed metadata for documents, standards, systems, groups,
partners, and message sets.
2 References
ISO/IEC
DIS 11179-1, Information technology - Specification
and standardization of data elements - Part 1: Framework for the specification
and standardization of data elements
ISO/IEC
DIS 11179-2, Information technology -
Specification and standardization of data elements - Part 2: Classification for
data elements
ISO/IEC
11179-3:1994, Information technology -
Specification and standardization of data elements - Part 3: Basic attributes
of data elements
ISO/IEC
11179-4:1995, Information technology -
Specification and standardization of data elements - Part 4: Rules and guidelines for the formulation of
data definitions
ISO/IEC
11179-5:1995, Information technology -
Specification and standardization of data elements - Part 5: Naming and identification principles for
data elements
ISO/IEC
DIS 11179-6, Information technology -
Specification and standardization of data elements - Part 6: Registration of
data elements
ISO/IEC
TR 15452, Information Technology -
Specification of Data Value Domains
3
Definitions
For the purposes of this document, the following
definitions apply.
3.1
attribute: A characteristic of an object or entity.
3.2 conceptual domain: A
set of possible valid value meanings of a data element expressed without
representation.
3.3
context: A designation or description of the application
environment or discipline in which a name is applied or from which it
originates.
3.4 data element: A
unit of data for which the identification, meaning, representation and
permissible values are specified by means of a
set of attributes.
3.5 data
element concept (DEC): A concept that can be represented in the form
of a data element, described independently of any particular representation.
3.6 data element registry: An
information resource that describes the meaning and representational form of
data elements.
3.7 data
element representation: A data
element component consisting of a value domain and representation class.
3.8 data
identifier: A language
independent unique identifier of a data element within a registration
authority. An unambiguous name for an
object within a given context.
3.9 data
item: An occurrence
of a data element value.
3.10 data
value: An element of a value domain.
3.11 data
value domain: A set of
possible valid values of a data element expressed in a certain representation,
for a data element having a value domain.
3.12
enumerated domain: A value domain that is specified by a list of
all permissible values.
3.13
identifier: See data identifier.
3.14 international registration data identifier
(IRDI): The unique and registered identifier of a data
element.
3.15 metadata: Data that defines and
describes other data.
3.16 name: The primary means of
identification of objects and concepts for humans.
3.17 object class: A
set of ideas, abstractions, or things in the real world that can be identified
with explicit boundaries and meaning and whose properties and behavior follow
the same rules.
3.18
permissible value (label): An expression of a value meaning in a specific value domain.
3.19
property: A peculiarity
common to all members of an object class.
3.20
representation class: A
classification of types of representations.
3.21
structure set: A method of placing objects in context,
revealing relationships to other objects.
Examples include Entity-Relationship Models, taxonomies, and ontologies.
3.22
value meaning: A valid value in a conceptual domain.
3.23
value meaning identifier (VMID): A label that uniquely
identifies a value meaning.
4 Component framework
This clause presents a conceptual framework for structuring data elements and data element components in a registry. Data elements are ideally the result of a process of development, involving several types of abstraction, producing a series of "layers" related to each other by the method of abstraction used to produce one from the other. Layers usually progress from the most general (conceptual) to the most specific (ultimately, the physical layer, although a metadata registry would not contain these).
One could use layers to structure development of a system using the Zachman Framework, for instance, with the highest levels of definition contained in the business view, and development progressing to the implemented system level. The number and granularity of layers are driven by user requirements. This clause will describe several (non-exhaustive) possible layers, none of which are intended to be mandatory for any particular implementation.
The
members of each layer are called data element components. Components are envisioned as a set of
building blocks that can be assembled into data elements. Some components may also be members of a
registry in their own right.
4.1 Abstraction types
Abstraction
is a tool which has been well-developed by the object-oriented community. It is
used as a way of focussing on parts of the model of interest to a particular
process or function. The term
"abstraction" is used to refer both to the process and the results of
the process. Abstraction can be applied
to the registry environment as a way to articulate the development of
components and their relationships to each other.
Several
methods can be used to achieve the decomposition of layers from the most
abstract to the more concrete. Starting
with the most general conceptional notions and progressing to the data elements
in applications, these layers can be labeled by the type or types of
abstraction used to produce them from another level.
The
three types of abstraction of most interest to data element development are: decomposition/aggregation,
instantiation/classification, and
specialization/generalization.
·
Decomposition/aggregation relates an item to its
parts. Decomposition may be described
as "x is a part of y," or the part-of
relationship. The reverse, aggregation,
shows that y may be composed of x among other items.
·
Instantiation/classification relates an item to a class
of items. This is described as the is-a relationship, "x is a(n
instance of) y." Classification
reverses the relationship; y contains x as well as other items.
·
The
third type is specialization/generalization. This is a relationship between two classes,
where all items in one (subclass) are also in the other (superclass).
4.2 Conformance
Layers
of abstraction can be used to determine conformance of a registry
implementation to a standard.
Specification of the member classes and abstraction types used to
determine the layer members can be used to define conformance. This will lead to improved chances for
interoperability among registries.
4.3 Developing Layers of
Abstraction
The
process of deriving layers of abstraction for a registry can be described by a
series of examples. Some or all of
these layers may be useful for any given registry.
Abstraction
relationship types define the boundaries between layers. Rules for conformance may be derived from
both boundary abstraction and the relationships of the components of each
layer.
A
useful starting point is the set of real world things that the registry
attempts to model. These can be
described by the phrases "concepts (things, beings, ideas…),"
"things about them," "how they look," and "what they
mean." So, the first layer of
abstraction is the translation of these phrases to model entities (figure
1). Applying the abstraction process of
specialization, the result is that concepts become object classes, things about concepts become properties, how they look becomes representation, and what they mean becomes the conceptual domain. By this
transformation, the amorphous content of superclasses of things in the real
world become subclasses composed of entities of the model, subject to rules
governing their behavior.
Of
course, every model-based registry must include this layer. This is the basic assumption of model
building.
Within
the model, other layers of abstraction can be applied to produce model entities
of use to the developers and users. For
example, aggregation can be applied to the object class and property entities
to produce the data element concept. These can be related to conceptual domains
(which contain sets of value meanings) to produce a potentially useful entity,
the conceptual generic element
(figure 2).
Conceptual
generic elements consist of the attributes associated with their constituent
components. These serve to describe the
object class property and its value meanings without any particular
representation assigned. An example, using ISO 3166, would be to describe country identifier without specifying
which one of the seven possible representations for names or codes for countries
contained in ISO 3166 is preferred.
Consider
representation. It was mentioned
earlier as if it was a model entity, but it does not exist as such in the
model. Representation is a combination
of data value domain with its permissible values (if enumerated) or description
(if not); representation class; and datatype, character set, and unit of
quantity of the values in the value domain.
Therefore, it must be abstracted by aggregation if it is to be considered
as a unit.
Combining
a property with the representation components can create a useful
construct. A logical generic element such as "height measure in feet"
can be used to record conformance criteria such as allowed range values. A narrower construct, limiting the
components to property and representation class, can be created to record
generalized conformance criteria such as that "height measure" must
only be used with units of measure with values of "feet," "inches," "meters,"
"centimeters," etc. These
would potentially be combined with object classes to produce data elements such
as "tree height measure" with a conformance criterion of "height
>0<500" (figure 3).
Another
useful object-oriented concept can be applied to allow inheritance of attribute
values between layers. This mechanism
enables the process described in the last paragraph to be applied in
many-to-one relationships: "height measure" can be applied to
"telephone pole height measure" using the same conformance criterion
as "tree height measure."
Other
combinations of components can be created as the registry designer's
discretion. Documentation by attributes
and relationships must be complete if registry content consistency is to be
maintained. Full use of generics
promotes reuse of standardized data description.
Figure
1. Abstraction from the real world to the model.

Figure 2. Abstraction of a conceptual generic element.

Figure 3. Inheritance of
component values.

REFERENCES
(INFORMATIVE)
C Codes for the representation of names of countries and their subdivisions Part 1: Country codes, International Organization for Standards (ISO), ISO 3166:1997.
C Standard representation of latitude, longitude and altitude for geographic point locations, International Organization for Standards, ISO 6709, 1983-05-15.
C Information technology programming languages, their environments and system software interfaces language-independent datatypes, International Organization for Standards, ISO/IEC 11404, First edition 1996 12-15.
C Information technology specification of data value domains, ISO/IEC TR 15452, March 1999.
5.0 REGISTER A DATA ELEMENT
Registration
of a data element in a data element registry requires that certain
characteristics of the data element are recorded to clearly describe and define
it. These characteristics are stored as
attributes of the data element. A
Registry can be used to record information about data elements ranging from
carefully crafted data standards to those found in applications. The amount and quality of metadata
information available can vary from good, complete information to poor,
incomplete information. This document
is intended to describe the population of a Registry with data elements for
which good quality, consistent metadata can be created. Part 3 of the ISO/IEC 11179 specifies
attributes for recording information about a data element in a data
registry. This document gives examples
that demonstrate the population of a data registry. It includes attributes that are mandatory and fully defined by
the metamodel, as well as those where the registration authority must establish
its own profile of required attributes.
Many
metadata registry practitioners find that using a bottom-up approach to
registering a data element is most appropriate. In many cases where a data element is submitted for registration,
only limited information (e.g., a name, definition, and a set of permissible
values) is provided. Other attributes
must be determined based on an understanding of the underlying data values and
concepts that are implied by those facts.
These are most commonly registered by means of a bottom up registration
procedure, where the basic metadata attributes about the data element (e.g.,
definition, name, and permissible values) are completed prior to defining the
conceptual information about the data element.
A bottom-up approach might also be used where the metadata registry is intended to serve as a distribution
mechanism for metadata that describes the data in data products such as public
data sets, query results, etc. The
examples provided in this report describe how to formulate attributes about a
data element, based on a bottom-up procedure.
First a general procedure for registering data elements is described,
followed by examples of registration of three types of data elements, including
data elements from:
C An international standard with an enumerated domain.
C An international standard with a non-enumerated domain.
C An information system, where the application data element uses an enumerated domain.
The
registration procedures are presented in a logical order for analyzing and
formulating attributes for a data element.
Annex F contains a table that concisely summarizes the information
registered for each data element in the examples that follow.
This
report is intended to be used to help metadata registry practitioners to
formulate the attributes that describe and define a data element. Section 5.1 presents an overall approach to
data element registration. Sections
5.2, 5.3, and 5.4 should be consulted for more specific examples of registering
the kinds of data elements described in international standards and in
information systems. Annex Y, which is
based on ISO/IEC 11179, contains more detailed information and examples to
assist the practitioner who is registering data elements.
A
top-down approach is useful in many circumstances. Although it requires more "up front" effort, top-down
registration has the potential to produce more stable and uniform metadata. An example of a top-down registration, where
registration begins with identification of conceptual domains, is provided in
informative Annex G with an example of registration of data elements about
biological organisms.
5.1 General Procedures
Often
only a limited amount of information is available about a data element that has
been submitted for registration, e.g., the name and definition contained in a
document or provided by the submitting organization and a set of permissible
values, where appropriate. The general procedures that follow are
intended to result in the registration of a complete, well-defined data element
that meets the requirements of a particular registration authority.
It
should be noted that the metadata for some data elements in a registry will
never be complete. This is true of
application data elements that are obtained from computer software, where very
little information is known except the representational attributes (e.g., field
length and datatype). For these data
elements, only the most basic attributes will be entered, and the data
element's registration status will remain incomplete.
5.1.1 Understanding the Data
Element
The first step in the
registration procedure is to gain an understanding of the data element. What kind of data will be stored in this
data element? Is there a definition or
description of the data values? Were
permissible values or examples of the data provided? Will the data values be determined by an arithmetic or
statistical procedure? What will the
data values look like; e.g., are they names or descriptions of things, numerals
to be calculated, strings of characters and numbers that are identifiers? Where documentation is inadequate to fully
understand the data element, the practitioner must consult those who represent
the source of the data element to obtain the necessary information.
The
result of this first step is an understanding of the semantic content of the
data element.
5.1.2 Content Research
Prior
to formulating attributes towards registration of a new data element, the
registrar should perform content research to determine whether a data element
is described in an existing International or National standard, or whether a
data element that has the potential for being reused exists in the registry or
a federation of registries. It is
necessary to recognize that the registration practitioner must make value
decisions when recording metadata into the metadata registry. The practitioner will determine if a data
element might be adapted to meet new requirements, or some attributes of an
existing data element (e.g., value domain, data element concept, or conceptual
domain) might be reused with the new data element. Content research should include a search of conceptual domains,
data element concepts, and value domains as well as data elements, to identify
attributes that might be relevant to the data element to be registered. If a standard data element exists that can
be used as a model to meet the particular specifications for a new purpose,
some of its attributes may be reused for registration of the new data element.
The
result of this step is confirmation that a new data element is needed, or a
decision to modify or reuse an existing data element.
5.1.3 Definition and Permissible Values
The
essential semantic content of a data element must be captured in a data element
definition. Part 4 of ISO/IEC 111179
describes rules and guidelines for formulating definitions. Part 3 identifies the attributes for
describing the domain of potentially valid (i.e., permissible) values. The permissible values for a data element
are defined as a value domain. Examples
are provided in Annex Y for formulating definitions, based on the rules and
guidelines set forth in ISO/IEC 11179-4. Annex Y also contains detailed
information about the attributes in value domains and examples of how those
attributes are used for both enumerated (i.e., established through a list) and
non-enumerated domains (i.e., specified through a formula, rule, procedure, or
reference).
Different
attributes are used depending upon whether the potentially valid values are
enumerated or non-enumerated. Each
permissible value is associated with a valid value meaning that provides
meaning to the permissible value, as described in Section 5.1.6. Each permissible value is also entered in
the registry with its begin date (i.e., the date when that permissible value
became valid for that value meaning).
End dates will also be entered, when the permissible value for a value
meaning becomes invalid.
Value
domains for non-enumerated domains must include a definition/description of the
values that are possible valid values for the data element. This report contains specific examples of
registering data elements with enumerated domains (Sections 5.2 and 5.4) and
with non-enumerated domains (Section 5.3).
5.1.4 Name and Identifiers
Part
5 of ISO/IEC 11179 gives principles for naming and identification of data
elements. Each data element registered
within a Registration Authority (RA), i.e.,
an organization authorized to register metadata, is unambiguously identified
with a unique identifier. Although the
standard does not specify the format or content of the data element identifier
(DI), the DI should carry no useful information about the data element, e.g.,
it might be a number assigned sequentially by an automated system. If the attributes of a data element change,
a new version of the data element is created and registered with a version
identifier (VI).
Since
each RA establishes it's own identification scheme, the same DI might be used
to identify a different data element in another metadata registry. Therefore, a Registration Authority
Identifier (RAI) must be established for unique identification of a data
element. Data elements registered under
the provisions of ISO/IEC 11179 are assigned an international registration data
identifier (IRDI), which is a composite of the RAI, the DI, and the VI. Part 6 of ISO/IEC 11179 describes the
requirements for a RA and the construction of a RAI. The IRDI is discussed further in Part 6.
Most
people prefer to use names when talking about a data element, rather than a
non-intelligible identifier. Therefore,
one or more names can be assigned to a data element, each associated with the
context in which the name is used. A
name can be developed for a scientific discipline, an organization, a
particular computer language, a database management system, or other
purpose. Each name is developed
according to the naming convention for the particular name context. The naming convention can vary from
"whatever you want to call it" to a highly structured name. ISO/IEC 11179, part 5 does not specify a
mandatory naming convention, but does explain how to document one. For this report, the data element names are
based on a naming convention described in Annex Y. Annex Y also expands on Part 5 of the standard by providing
examples of the use of names and name contexts.
5.1.5 Other Metadata Attributes
Other
mandatory and optional data element attributes are described in Part 3 of
ISO/IEC 11179. In addition to the
definitional attributes described in Section 5.1.3 and the identifying
attributes described in Section 5.1.4, there are administrative, relational,
classifying, and other miscellaneous attributes that serve to define and
describe a data element.
In
addition to the mandatory attributes specified by Part 3 of the standard, a RA
might establish a profile for a particular metadata registry, where some of the
attributes described as optional in the standard are mandatory for that
registry, some optional attributes are not included, and additional attributes
might be identified to extend the registry.
The
attributes that relate data elements through data element concepts (Section
5.1,5), and those that classify data elements (Section 5.1.6) are described in
subsequent sections of this report.
Many information sources do not provide information about the data
element for these categories. Some administrative information is related to
quality control, and is described in Section 5.1.8. Annex Y includes detailed information about these metadata
attributes.
For
the registration procedure described in this report, some administrative and
miscellaneous attributes are recorded at this time, including:
C Submitting organization: The submitting organization is the Office or organization that has submitted the data element for registration.
C Data Steward: The data steward is the individual who has been assigned by a submitting organization to be responsible for authorizing and maintaining one or more data elements.
C Note: A data element may have a "Note" or "Comment" that can be used to capture additional descriptive information about a data element, including usage, procedure, and other explanatory information that is not appropriate to include in the data element definition attribute.
C Example: A data element shall be registered with an example, which must be one of the permissible values for enumerated value domains or must conform to the value domain description/definition and other value attributes for non-enumerated domains.
C Origin. A data element can be associated with any kind of source, including a document, standard, system, group, partner, or message set. One source, as a minimum, must be associated with a data element to indicate the origin of information about the data element.
5.1.6 Data Element Concept
At
this stage in registering a data element, it is possible to specify conceptual
information about the data element through the data element concept. The data element concept can be thought of
as an idea or perception about something, identified and described
independently of any representation.
The data element concept may relate several data elements that record
data about that concept with different representations, e.g., names and codes
that represent provinces of Canada and share the same concept, which is
"Canadian Province Identifier."
The
data element concept is singular (only one concept is represented). It can be associated with many data
elements, including other names and codes, and it does not include a
representation class term in its name or definition. The data element concept is associated with only one Conceptual
Domain, as described in the following paragraph.
Data
element concepts are specified through a definition, an identifier, a name, and
a conceptual domain, i.e., the meanings of the possible set of valid values for
a data element, expressed without representation. The conceptual domain, "Canadian Provinces", would
include valid value meanings such as "The Canadian province of
(Alberta,......., Yukon Territory)," where each value meaning would
identify one Canadian province. Each
value meaning is entered in the registry, associated with its conceptual
domain, with its begin date (i.e., the date when that value meaning became
valid) and end (i.e., when the value meaning became invalid). Permissible values are associated with value
meanings, according to the representation defined by the value domain.
Derivation
of data element concepts and conceptual domains, including value meanings are
described in detail in Annex Y.6.
5.1.7 Classification Attributes
The classification
attributes are recorded, where appropriate, at this time. Classification helps to add information not
easily included in definitions, helps to organize the contents of a metadata
registry, and helps to provide access by supporting more meaningful
queries. Part 2 of ISO/IEC 11179
describes general categories of classification; Part 5 describes three
classified components: object class, property, and representation class.
A
metadata registry might choose to classify data elements as groups, e.g., the
group of data elements used in a mailing address, the group of data elements
used to identify chemical substances, or the group of data elements that locate
a point on the surface of the earth.
Keywords
might also be used to classify data elements, e.g., altitude, date, facility,
industrial, and organization.
5.1.8 Quality Control
Initially,
only some of the attributes will be recorded for a newly registered data
element. Such a data element will be
assigned the registration status of "incomplete." When all of the mandatory data elements have
been completed, but the quality of the metadata has not been verified, the
registration status will be "recorded." Through the quality review process, some data elements will be
determined to be "certified," and some might become
"standard." The
"standard" data element is the preferred data element to be used for
data sharing, to ensure consistent representation and understanding of the data
being communicated.
Part
6 of ISO/IEC 11179 describes the registration process and the registration
status assigned to a data element as the metadata are reviewed and quality is
improved. Many data elements might be
entered into a data registry, but only a relatively small number of them might
be assigned a "standard" registration status. Annex Y describes the assignment of
Registration and Administrative Status throughout the life cycle of a
registered data element. ISO/IEC 11179
Part 6 specifies the levels of registration status; the administrative
statuses, however, are established for each registry by the RA.
5.2 International Standard with Enumerated
Domain
This
section provides a specific example of the registration of a data element from
an international standard, where the possible valid values are itemized. The International Organization for
Standardization (ISO) 3166-1:1997(E/F), Codes for the representation of
names of countries and their subdivisions B Part 1: Country codes, is used
as the source for this example. ISO
3166:1997 is a complete revision of ISO 3166, which was first published in
1974. The names of countries in the
standard correspond to those given, in English and French, in the current
"Terminology Bulletin B Country Names," issued by the United Nations
Department of Conference Services, entitled "States Members of the United
Nations, Members of the Specialized Agencies or Parties for the Statute of the
International Court of Justice" and to those published in the
"Standard Country or Area Codes for Statistical Use" established by
the United Nations Statistics Division.
The full name is the formal title as notified by the country concerned
to the UN Secretary General.
(ISO)
3166-1:1997(E/F) cancels and replaces the fourth edition (ISO 3166:1993) and
comprises a consolidation of all changes to the lists of the fourth edition
agreed to by the ISO 3166 Maintenance Agency: ISO 3166 Maintenance Agency
Secretariat, c/o DIN Deutsches Institut für Normung e.V., Burggrafenstrasse 6,
D-10787 Berlin, Germany.
ISO
3166 includes the following domains: short country name in English, full
(official) country name in English (not provided for all countries),
2-character alphabetic code, 3-character alphabetic code, 3-character numeric
code, short country name in French, and full country name in French.
The
following paragraphs are presented in the logical order for formulating
attributes for a standard, enumerated data element, using the short
English-language country name as the example.
The table in Section 5.2.10 contains all of the metadata attributes
recorded for the enumerated data element from an international standard.
5.2.1 Understanding the Data Element
The
data element to be registered is taken from an international standard, and it
includes an authoritative conceptual domain of country identifiers for all of
the countries of the world. The short
English-language name was selected for standardization because it has the most
utility for information systems used by United States (U.S.) federal agencies
as well as the private sector. The
short form of the English-language name is used by the U.S. Postal Service
(USPS)for all outgoing international mail, in preference to any of the codes or
full names that are included in the standard.
The name is also preferred by the USPS to any names that are used
locally by a country to identify itself, e.g., Japan is recognized by the USPS
in preference to Nihon, which is the country name commonly used by that country
itself. The short form of the name in
English has been used in the development of ISO 3166 as the basis for assigning
codes to avoid, wherever possible, any reflection of a country's political
status.
The
English-language short name in the standard varies in length from four
alphabetic characters (e.g., Peru) to 44 alphabetic characters (i.e., South
Georgia and the South Sandwich Islands).
The names use the English language alphabet for their character set.
5.2.2 Content Research
Other standards that contain
conceptual domains for country identification include U.S. Federal Information
Processing Standards (FIPS), published by the U.S. Department of Commerce,
Technology Administration, National Institute of Standards and Technology
(NIST). FIPS 10-4 is maintained by the
Office of the Geographer and Global Issues, U.S. Department of State. It is intended for use in activities by the
Department of State and national defense programs, and can also be used for
Federal interchanges of information with the non-Federal sector of the
U.S. FIPS 10-4, published in April
1995, reflects changes through May 6, 1993.
FIPS 104-1 implements an American National Standards Institute standard
ANSI Z39.27-1984, and adopts, with qualifications, entities, names, and codes
prescribed by ISO 3166. FIPS 104-1 was
last updated on May 12, 1986. The
maintenance organization is the National Bureau of Standards (now NIST) in
coordination with the U.S. Department of State, the U.S. Board of Geographic
Names, and the maintenance organization for ISO 3166. There are no known plans to update either of the FIPS standards,
and neither of these standards is recognized internationally.
An
authoritative international source of value domains which has ongoing
maintenance is a necessity for maintaining data values for the data elements
identifying countries of the world.
Therefore, the ISO 3166:1997 is used as the origin of the data element
for country name.
5.2.3 Definition and Permissible Values
The
definition and permissible values are the most important metadata attributes in
uniquely describing a data element.
5.2.3.1 Definition
Understanding that the
essential meaning of this data element is to identify countries using a short
name in the English-language, the data element definition can be formulated as
"The short name of a country, represented in the English language." This definition is formulated using the
mandatory rules and guidelines established in ISO/IEC 11179-4. The rules and guidelines from Part 4 are
described with examples in Annex Y.2. The definition is singular, since any
instance of the data element contains only one value.
5.2.3.2 Permissible Values
The permissible values for
the data element are the short names in English, listed in ISO 3166 (e.g.,
Afghanistan, Albania, ......., Zimbabwe). Each permissible value is entered
into the registry with the date when that permissible value was valid for that
value domain (in this case the date is January 10, 1997, the same as the begin
date for the value meaning). There is
no end date to enter at this time.
The scope of the permissible
values for this data element includes the short English-language name for all
countries. A value domain is defined as
the permissible values for a data element.
For this example, the value domain is described as "All short,
English-language names of all countries."
Note that Part 3 of ISO/IEC 11179 does not require a description or
definition for enumerated domains. Some
RA, however, prefer that all value domains be registered with a
description/definition. Record the
other value domain attributes for this example at this time, including:
C Character Set: The character set for Short English-Language Country Name is "English language."
C Domain Type: Country names are a fixed list of countries, maintained by international standards; therefore, the domain type is "enumerated."
C Datatype: The datatype for country name is "alphanumeric."
C Maximum and minimum field lengths: Based on prior research (Section 5.2.1), the minimum length for values for the data element is known to be four. The known maximum length for names in the current standard is 44. The maximum field length, however, is set to 60, to accommodate any changes or additions to the domain of values.
C Format: The format selected by the registration authority for this example is A(60) to accommodate the longest of the English-language short names.
5.2.4 Identify and Name the Data Element
Name do not identify a data
element. Identification requires a
unique identifier, preferably one that does not contain information about the
data element. The name provides a
designator so that users of the registry have terms by which they refer to the
data element.
5.2.4.1 Identification
Assign a unique identifier
to the data element for short English-language country name, as described in
Annex Y for the identification of data elements. In the metadata registry for
this example, a unique DI and VI (20903:1) are assigned by the computer at the
time of registry.
5.2.4.2 Name Context and
Naming Convention
ISO/IEC 11179 Part 5
describes the naming of data elements.
Annex Y gives examples of name contexts and naming conventions. For this international standard data element,
the name is assigned the context of "Registry," and it is derived
based on the example naming convention provided in Annex Y and summarized as
follows:
Scope: The scope of this example naming convention is Registry Name.
$ Authority: The authority for this example is the U.S. Environmental Protection Agency for its Environmental Data Registry.
$ Semantic Rules: Names shall include an object and a property, where appropriate. Qualifiers shall be used to differentiate between names that would otherwise be the same. The representation class term shall always be included as the last term in the name.
$ Lexical Rules: A data element name shall have a maximum of 100 alphanumeric characters. The language of the registry shall be English, and the character set ASCII. There are no controlled word lists.
$ Name Uniqueness: Names shall be unique within a registration authority.
5.2.4.3 Name the Data
Element
Using the above naming
convention, the name is entered with the context of "Registry." The convention specifies that the name
should include the object "Country", to indicate the data values to
be stored in the data element. The name
should also include the representation for the concept, in this example
"Name." For this particular
example, it is necessary to qualify the name, since there are four value
domains of country names in the ISO 3166 standard. The qualifiers: "short" and
"English-language" are appropriate to this example. The name that has been formulated for this
data element, therefore, is "Short English-Language Country
Name."
5.2.5 Other Metadata Attributes
Other metadata attributes
that can be recorded at this time are:
C Select the example for this data element; it must be one of the permissible values in the value domain.
Example: China
C Identify the origin for this data element as the standard from which the permissible values are obtained.
Origin: ISO 3166-1:1997, Codes for the
representation of names of countries and their subdivisions - Part 1: Country
codes (Document)
C Record any notes or comments that might provide additional information about the data element that is not included in the definition.
Note: This data element is included in the EPA
revised interim Facility Identification Standard.
C Enter the name of the submitting organization, which is the Office that submitted the data element for registration.
Submitting Organization:
Office of Information Resources Management
C Record the name of the individual or organization assigned the responsibility for monitoring and maintaining the data element as the data steward.
Data Steward: Marian Cody
C Administrative metadata, such as Create Date and User Name are recorded or captured automatically by the system where applicable.
5.2.6 Data Element Concept
Identification of the data
element concept, as described in Section 5.1.6 is based on the data element
name and definition, without the representation. The concept represented by the data element "Short
English-Language Country Name" is "Country Identifier," defined
as "An identifier for a primary geopolitical entity of the world." This concept can be represented by all seven
of the names and codes included in ISO 3166.
The conceptual domain is a
collection of value meanings that provide meaning to the permissible values for
a data element. The conceptual domain
that contains value meanings related to the identity of countries of the world
is named "Countries of the World."
It is defined as "The primary geopolitical entities of the
world." The value meanings
associated with this conceptual domain are defined as "The primary
geopolitical entity of the world known as <country name>,"
where country name is one of the country names listed in ISO 3166. Each value meaning is identified by its own
value meaning identifier (VMID) and each is entered into the registry with the
date when that value meaning was entered into the conceptual domain (in this
case the date is January 10, 1997). End
dates will also be entered, when the value meaning becomes invalid (e.g., when
a country name changes or the territory of a country changes to be combined
with another country or to be subdivided into two or more other
countries).
5.2.7 Classification
This data element might be
classified according to the following classification schemes:
· Identify one or more
keywords, where the keyword is a name or subject matter descriptor that will
facilitate grouping like data elements for retrieval.
Keyword: Country.
· Group Short,
English-Language Country Name with similar data elements according to concept
for translation or by general subject matter.
Conceptual group: Country
Identifiers
Subject group:
Geopolitical Entities.
· Identify the class by which
this data element is represented.
Representation Class: Name
$ One or more real world objects that identify this data element can be identified at this time.
Object: Country
5.2.8 Quality Control
When all of the mandatory
metadata attributes have been entered for this data element, it is assigned the
Registration Status of "Recorded" and the administrative status of
"In Quality Review." Because
the data element was identified by an international standard, and it is
expected to be the preferred data element for representing country name within
the example metadata registry, the registration status will be updated to
AStandard@ with administrative status AFinal@ -after the necessary quality
review has been completed.
5.2.9 Other Codes and Names from ISO 3166
Other codes, official
English names, and French names (both official and short) from ISO 3166 are
registered with their individual value domains, representation, data element
definitions, and data element names.
All of the data elements associated with ISO 3166 will share the same
data element concept (i.e., Country Identifier, defined as "An identifier
for a primary geopolitical entity of the world.") and the same conceptual
domain (i.e., Countries of the World, defined as "The primary geopolitical
entities of the world."). All of
the ISO 3166 data elements will share the same value meanings. They will, however, have different sets of
permissible values associated with the value meanings, depending upon the data
element, its representation, and its value domain.
5.2.10 Summary of Attributes
The metadata attributes that
have been assigned to this data element, the short, English-language country
name identified by the ISO 3166:1997 standard, are summarized in the following
table, and in the first column of the table in Annex F.
|
Data Element Meta--
Example model
Attribute Name |
ISO 3166 Enumerated, Name |
||
|
1. Data
Element Definition and Permissible Values |
|||
|
|
Data Element Definition Context |
Registry |
|
|
|
Data Element Definition |
The English-language short name of a country. |
|
|
|
Permissible Values |
All English-Language Short Country Names from ISO
3166, matched with value meanings.
(Afghanistan, Albania,......, Zimbabwe) |
|
|
|
PV Begin Date |
19971001 |
|
|
|
PV End Date |
(Not Applicable) |
|
|
|
Value Domain Definition |
All English-language short names of all countries. |
|
|
|
Character Set |
English language |
|
|
|
Domain type |
Enumerated |
|
|
|
Determinant Type |
(Not Applicable) |
|
|
Range Limits |
(Not Applicable) |
||
|
|
Datatype |
Alphanumeric |
|
|
|
Minimum |
4 |
|
|
|
Maximum |
44 |
|
|
|
Format |
A(60) |
|
|
|
Unit of Measure |
(Not Applicable) |
|
|
|
Precision |
(Not Applicable) |
|
|
2. Data
Element Name and Identifier |
|||
|
|
Data Element Name Context |
Registry |
|
|
Data Element Name |
Short English-Language Country Name |
||
|
|
DE Identifier/ Version
Number (DI:VI) |
20903:1 |
|
|
3. Other
Metadata Attributes |
|||
|
|
Example |
China |
|
|
|
Origin |
ISO 3166-1:1997, Codes for the representation of
names of countries and their subdivisions B Part 1: Country codes (Document) |
|
|
Note/Description |
This data element is included in the EPA revised
interim Facility Identification Standard. |
||
|
Submitting organization |
Office of Information Resources Management |
||
|
Data Steward |
Marion Cody |
||
|
4. Data
Element Concept (DEC) |
|||
|
|
Data Element Concept Name |
Country Identifier |
|
|
|
Data Element Concept Definition |
An identifier for a primary geopolitical entity of
the world. |
|
|
|
Conceptual Domain Name |
Countries of the World |
|
|
|
Conceptual Domain Definition |
The primary geopolitical entities of the world. |
|
|
|
Enumerated Value Meaning Text |
The primary geopolitical entity known as <China>. |
|
|
VM Begin Date |
19971001 |
||
|
VM End Date |
(Not Applicable) |
||
|
Classification |
|
||
|
|
Keyword |
Country |
|
|
|
Group |
Country Identifiers, Geopolitical Entities |
|
|
|
Representation Class |
Name |
|
|
|
Object |
Country |
|
|
Quality Control |
|||
|
|
Registration Status |
Standard |
|
|
|
Administrative Status |
Final |
|
5.3 International Standard with Non-Enumerated Domain
This section provides a
specific example of the registration of a data element from an international
standard, where the possible valid values are not enumerated, but must be
determined by a procedure. The International
Organization for Standardization (ISO) 6709-1983 (E), Standard
representation of latitude, longitude and altitude for geographic point
locations, is used as the source for this example. ISO 6709 was developed by ISO Technical
Committee ISO/TC 97, Information processing systems, and was circulated to
member bodies in November 1981.
Eighteen countries approved the standard, no member body expressed
disapproval. There is no known schedule
for review and update of the standard.
ISO/TC 32 has been assigned as the maintenance authority for the
standard; ISO/TC 211 has expressed an interest in assuming responsibility for
its maintenance.
The table in Section 5.3.10
contains all of the metadata attributes recorded for the non-enumerated data
element from an international standard.
5.3.1 Understanding the Data Element
Latitude is a measure of the
angular distance on a meridian north or south of the equator. The standard provides for a variable format
and more than one representation for recording the latitude measure (i.e.,
degrees and decimal degrees and sexagesimal [i.e., degrees, minutes, and
seconds. The standard also includes
more than one representation and format for longitude, and a flexible format
for altitude. In addition, a standard
format for data transfer is included in the standard.
Although new technology and
new tools (e.g. Global Positioning System [GPS]) and analytical and mapping
software have caused some geographic information specialists to prefer the
measurement of locational coordinates in degrees and decimal degrees, many
organizations continue to measure latitude and longitude in degrees, minutes,
and seconds. Therefore, the RA of the
metadata registry in this example, has determined a need to register a data
element for latitude measured in degrees, minutes, and seconds. According to the standard, the placement of
the decimal point indicates the transition from degrees to sexagesimal
measures. Examples of data in the standard
include sexagesimal latitudes that are measured to a range of one or two
decimal places for seconds. The
standard, however, does not limit the precision, but requires only that the
number of decimal places indicate the precision of the measurement. The RA for this example requires that
latitude be recorded up to 5 decimal positions, where it can be measured to
that level of precision.
Latitude values are measured
in a range of 0 (on the equator) to 90 degrees. Minutes and seconds each are measured in a range of 0 to 60. Latitude values on or North of the equator
are recorded as positive numbers; those South of the equator are negative. Where latitude degrees are measured in
single digit, they must be recorded with a preceding zero. For data transfer, latitude measures must be
preceded by the directional symbol (+ or -), and they must include decimal
point, when the measurement includes decimal seconds. Latitude always precedes longitude, which precedes altitude. The latitude and longitude must be expressed
in the same format style and to the same precision (indicated by the number of
decimal positions). There are no
separators between the latitude, longitude, and altitude; the directional
symbol serves as a separator for the data element values.
5.3.2 Content Research
Part 11 of ISO 15046,
Spatial referencing by coordinates, describes the minimum data required to
define 1-, 2-, and 3-dimensional coordinate reference systems. The coordinate reference system must be
fully defined for a position to be unambiguous. Knowledge of the reference system is necessary to determine if
coordinate points are comparable. The
standard does not, however, provide information about representation of the
coordinates. ISO/TC 211/ WG 3, the
workgroup that is currently revising ISO 15046, has expressed an interest in
revising (ISO) 6709-1983 (E), Standard representation of latitude, longitude
and altitude for geographic point locations. Because of TC211=s interest in ISO 6709, and their current work
on the closely related standard, ISO 15046, it seems likely that ISO 6709 will
soon be reviewed and updated if needed.
Therefore, ISO 6709 seems appropriate to be identified as a standard data
element for latitude measure where latitude is measured as sexagesimal (i.e., in degrees, minutes, and
seconds).
A search of the metadata
registry in our example reveals about 40 data elements related to latitude
measure. One, an EPA interim standard
for latitude, measured in degrees and decimal degrees, is compliant with the
ISO 6709 data element for degrees. None
of the other data elements has the potential for compliance with ISO 6709 for
sexagesimal measure of latitude. The
other latitude data elements in the registry have been assigned the
registration status of incomplete, and many data elements are qualified (e.g.,
latitude where a facility is located, latitude of a smoke stack). For the purpose of this example, none have
the potential for being modified to meet the requirements of the ISO 6709
standard for latitude, measured in degrees, minutes, and seconds.
Therefore, in this example,
the ISO 6709 latitude, sexagesimal
measure, is selected for registration as a new data element.
5.3.3 Definition and Permissible Values
5.3.3.1 Definition
The data element definition
is formulated according to the rules and guidelines described in Annex Y, based
on ISO/IEC 11179-4. The rules require
that a data element definition be unique within the registry, so the unit of
measure has been included in the definition as "The sexagesimal measure of
the angular distance on a meridian north or south of the equator." Including the unit of measure in the
definition distinguishes the data element from the EPA interim standard,
defined simply as "The measure of the angular distance on a meridian north
or south of the equator." The
definition is singular, because it refers to only one instance of the data
value. Note that ISO 6709 does not
include a definition for latitude.
5.3.3.2 Permissible Values
ISO 6709 is an international
standard that does not list specific values that are valid for the data
element; the measure of latitude is a non-enumerated domain. There are no stored permissible values in a
registry for non-enumerated domains.
The values that are permissible for the ISO 6709 sexagesimal latitude
data element are those values that conform to the definition of the value domain
and the attributes for datatype, format, unit of measure, and precision. The value domain for sexagesimal latitude
can be described as "All sexagesimal measures of the distance of an angle
north or south of the equator." By
including the unit of measure in the definition, the value domain is
distinguished from the value domain description for latitude measured in
degrees. The definition is plural,
because it includes all possible measurements of latitude determined by this
type of measurement.
Latitude values that are
measured as degrees, minutes, and seconds must conform to the format +/‑DDMMSS
to +/-DDMMSS.SSSSS. The precision of
the value is indicated by the number of decimal places recorded.
Other value domain
attributes for this example include:
$ Character Set. The character set for latitude measure is "English language."
$ Domain Type. Non-enumerated.
$ Description/definition. All sexagesimal measures of the distance of an angle north or south of the equator.
$ Datatype. The datatype for latitude measure is "alphanumeric" to explicitly include the directional symbol and decimal point, where appropriate.
$ Maximum and minimum field lengths. The known minimum field length at this time is seven (+/- DDMMSS) where no decimal seconds are recorded. The maximum field length is 13 (+/- DDMMSS.sssss), to accommodate up to five decimal places for seconds.
$ Format. The format selected by the registration authority for this example is A(13) to accommodate the maximum number of decimal positions.
$ Range for degrees is 0-90; for minutes is 0-60; for seconds is 0-60.
5.3.4 Identifying and Naming the Data Element
5.3.4.1 Identifiers
A unique identifier is
required for the latitude data element.
For the RA in this example, the DI and VI (312345:1) are assigned
automatically by the metadata registry software.
5.3.4.2 Name Context and
Naming Convention
For this ISO standard data
element, the name is assigned with the context of Registry, using the naming
convention described in the example in Annex Y, summarized as follows:
$ Scope: The scope of this example naming convention is Registry Name.
$ Authority: The authority for this example is the U.S. Environmental Protection Agency for its Environmental Data Registry.
$ Semantic Rules: Names shall include an object and a property, where appropriate. Qualifiers shall be used to differentiate between names that would otherwise be the same. The representation class term shall always be included as the last term in the name.
$ Lexical Rules: A data element name shall have a maximum of 100 alphanumeric characters. The language of the registry shall be English, and the character set ASCII. There are no controlled word lists.
$ Name Uniqueness: Names shall be unique within a registration authority.
5.3.4.3 Name the Data
Element
Using the above naming
convention, the name is entered with the context of "Registry." The convention specifies that the name
should include the object "Latitude", to indicate the data values to
be stored in the data element. Include the representation for the concept in
the name; in this example "Measure."
There is no requirement in ISO/IEC 11179 Part 5 that data element names
be unique in a registry. However, the
naming convention used in this example specifies that names must be unique
within a registry. It is advisable to
use a qualifier in the data element name to differentiate between data elements
that might otherwise have the same name. The name includes the object
(latitude) and the representation (measure).
For this example, the name of the latitude data element will carry the
qualifier "sexagesimal" as a discriminator. The name that has been derived for the latitude data element is
"Latitude Sexagesimal Measure."
5.3.5 Other Metadata Attributes
Other metadata attributes
that can be recorded at this time are:
$ Provide an example of the data value that conforms to the description in the value domain, and to the datatype, format, and other value domain attributes for this data element.
Example: +674532 and
+674531.85435
$ Record the origin of this data element as the standard where the data element was identified.
Origin: ISO 6709-1983 (E), Standard
representation of latitude, longitude and altitude for geographic point
locations.
$ Record notes and comments that contain additional information about the data element that is not appropriate for the definition.
Note: Latitude sexagesimal
converts to latitude degrees by the following formula: seconds x 60 = decimal
minutes, total minutes x 60 = decimal degrees.
$ List the Office that submitted the data element for registration as the submitting organization.
Submitting Organization:
Office of Information Resources Management
$ The organization or individual that has responsibility for maintaining and updating the data element is recorded as the data steward for that data element.
Data Steward: Larry
Fitzwater
$ Administrative metadata, such as Create Date and User Name are recorded or captured automatically by the system where applicable.
5.3.6 Data Element Concept
The methodology to be used
for deriving a data element concept is described in Section 5.1.6 and Annex Y
of this document. A data element
concept is the data element without representation. We have indicated previously
that latitude is a distance measure, where measure is its representation. The data element concept for latitude
measure is "Latitude Distance" with the definition "A measure of
the angular distance of a point on the surface of the earth north or south of
the equator." Note that this
concept definition incorporates the
term "measure," which is a representation term. The concept of latitude, however, is the
measure of a distance. Therefore, it is
appropriate in this instance to use the term measure when defining the
concept.
A conceptual domain is a
collection of value meanings. The
collection must be identified with a name and a definition. The latitude is one of the horizontal
coordinates that fix a position on the surface of the earth either north or
south of the equator. For this example,
the name of the conceptual domain for latitude measure is "Latitude
Coordinates" with the definition "The coordinates that indicate the
distance north or south of the equator for locations."
For non-enumerated domains,
such as latitude measure, the value meanings are not explicitly
identified. The conceptual domain for
the Latitude Distance data element concept is the perceived repository of all
latitudes that mark positions on the earth with relation to the equator. The value meanings could be defined as
"The distance measure of a point north or south of the equator that is
<value>." No value meanings
are stored in the registry.
5.3.7 Classification
This data element might be
classified according to the following classification schemes:
$ Identify one or more keywords, where the keyword is a name or subject matter descriptor that will facilitate grouping like data elements for retrieval.
Keyword: Latitude,
Horizontal Coordinate, Spatial
$ Group Short, English-Language Country Name with similar
data elements according to concept for translation or by general subject
matter.
Subject group: Geographic
Point Location.
$ Identify the class by which this data element is represented.
Representation Class:
Measure
$ One or more real world objects that identify this data element can be identified at this time.
Object: Latitude
5.3.8 Quality Control
When all of the mandatory
metadata attributes have been entered for this data element, it is assigned the
registration status of "Recorded" and the administrative status of
"In Quality Review." This
data element was identified in an international standard, and so would soon be
updated to reflect higher status of the data element. The data element, however, would not be expected to be assigned
the status of AStandard.@ The data
element is not expected to be come the preferred representation for latitude
measure, since geographic information specialists prefer that latitude and
longitude be recorded in degrees and decimal degrees. Therefore, after quality review has been completed, the data
element will be assigned the registration status of ACertified@ with an
administrative status of ANo further action.@
5.3.9 Other Data Elements in ISO 6709
ISO 6709 identifies five
data elements: sexagesimal latitude, degrees latitude, sexagesimal longitude,
degrees longitude, and altitude. The
different formats represented by the units of measure for latitude (i.e.,
degrees and sexagesimal) express representation (i.e., unit of measure). The two latitude data elements from ANSI
6709 are translatable at the concept level, based on their unit of measure
representations. They share the same
conceptual domain, because their implied value meanings are the same. Likewise, the longitude data elements share
a data element concept and a conceptual domain, and longitude data can be
translated based on unit of measure conversions. .
Whereas the multiple data elements
identified in ISO 3166 share the same data element concept and the same
conceptual domain, the data elements identified in ISO do not share data
element concepts and conceptual domains. All three concepts: latitude,
longitude, and altitude, are distance measures. Latitude, however, is a north/south measure with respect to the
equator; longitude is an east/west measure with respect to the prime meridian;
and altitude is a vertical measure with respect to a point of reference such as
sea level. Each has its own data
element concept and its own conceptual domain.
These data elements do share
classification. All can be classified
as the group "Geographic Point Location" and as the representation
class "Measure."
5.3.10 Summary of Metadata Attributes
The following table
summarizes the metadata attributes assigned to latitude sexagesimal measure in
the preceding paragraphs in Section 5.3.
The table in Annex F also contains this data in the second metadata
column.
|
Data Element Meta--
Example model
Attribute Name |
ISO 6709 Non-enumerated, Latitude |
||
|
1. Data
Element Definition and Permissible Values |
|||
|
|
Data Element Definition |
The measure in degrees of the angular distance of a
position on earth on a meridian north or south of the equator. |
|
|
|
Permissible Values |
Measures of Latitude in Degrees, Minutes, and Seconds |
|
|
|
PV Begin Date |
(Not Applicable) |
|
|
|
PV End Date |
(Not Applicable) |
|
|
|
Value Domain Definition |
All measures of the distance of an angle north or
south of the equator measured in degrees, minutes, and seconds. |
|
|
|
Character Set |
English language |
|
|
|
Domain type |
Non-enumerated |
|
|
|
Determinant Type |
Range |
|
|
Range Limits |
00-90 |
||
|
|
Datatype |
Alphanumeric |
|
|
|
Minimum |
7 |
|
|
|
Maximum |
13 |
|
|
|
Format |
A(13)
+/-DDMMSS.SSSSS |
|
|
|
Unit of Measure |
Sexagesimal |
|
|
|
Precision |
Number of decimal places recorded. |
|
|
2. Data
Element Name and Identifier |
|||
|
|
Data Element Name Context |
Registry |
|
|
Data Element Name |
Latitude Sexagesimal Measure |
||
|
|
DE Identifier/ Version
Number (DI:VI) |
312345:1 |
|
|
3. Other
Metadata Attributes |
|
||
|
|
Example |
+674532 and +674531.85435 |
|
|
|
Origin |
ISO 6709-1983 (E), Standard representation of
latitude, longitude and altitude for geographic point locations. |
|
|
Note/Description |
Latitude on or north of the equator is preceded by a
plus sign; south of the equator by a minus sign. |
||
|
Submitting organization |
Office of Information Resources Management |
||
|
Data Steward |
Larry Fitzwater |
||
|
4. Data
Element Concept (DEC) |
|
||
|
|
Data Element Concept Name |
Latitude Distance |
|
|
|
Data Element Concept Definition |
A measure of the angular distance of a point on the
surface of the earth north or south of the equator |
|
|
|
Conceptual Domain Name |
Latitude Coordinates |
|
|
|
Conceptual Domain Definition |
The coordinates that indicate the distance north or
south of the equator for locations. . |
|
|
|
Enumerated Value Meaning Text |
(Not Applicable) |
|
|
VM Begin Date |
(Not Applicable) |
||
|
VM End Date |
(Not Applicable) |
||
|
5. Classification |
|
||
|
|
Keyword |
Horizontal Coordinate, Latitude |
|
|
|
Group |
Geographic Point Locations |
|
|
|
Representation Class |
Measure |
|
|
|
Object |
Latitude |
|
|
6. Quality Control |
|
||
|
|
Registration Status |
Recorded |
|
|
|
Administrative Status |
In quality review |
|
5.4 Application Data Element