ISO TC184/SC4/WG12 N15

Date: 2007-04-05

ISO/CD-TS 8000-110
Information quality — Part 110: Master data quality: Syntax, semantic encoding, and conformance to customer requirements

COPYRIGHT NOTICE:

This ISO document is a working draft or Committee Draft and is copyright protected by ISO. While the reproduction of working drafts or Committee Drafts in any form for use by Participants in the ISO standards development process is permitted without prior permission from ISO, neither this document nor any extract from it may be reproduced, stored or transmitted in any form for any other purpose without prior written permission from ISO.

Requests for permission to reproduce this document for the purposes of selling it should be addressed as shown below (via the ISO TC 184/SC4 Secretariat’s member body) or to ISO's member body in the country of the requester.

Copyright Manager
ANSI
11 West 42nd Street
New York, New York 10036
USA
phone: +1-212-642-4900
fax: +1-212-398-0023

Reproduction for sales purposes may be subject to royalty payments or a licensing agreement.

Violators may be prosecuted.

ABSTRACT:

This part of ISO 8000 specifies requirements on master data with regard to syntax, semantic encoding, and conformance to customer needs, for the purpose of achieving master data quality.

COMMENTS TO READER:

Recipients of this draft are invited to submit, with their comments, notification of any relevant patent rights of which they are aware and to provide supporting documentation.

This document has been reviewed using the internal review checklist (see WG12 N12), the project leader checklist (see WG12 N13), and the convener checklist (see WG12 N14), and is ready for this ballot cycle.

Project leader: Peter Benson
Address: ECCMA
2980 Linden St Ste E2
Bethlehem, PA 18017-3283
USA
Telephone: +1 610-861-5990
Facsimile: +1 610-861-5992
Electronic mail: peter.benson@eccma.org
Part editor: Gerald Radack
Address: Concurrent Technologies Corp.
441 Friendship Rd Ste 103
Harrisburg, PA 17111-1204
USA
Telephone: +1 610 449 3853
Facsimile: +1 610 449 0853
Electronic mail: radack@ctc.com

© ISO 2007

All rights reserved. Unless otherwise specified, no part of this publication may be reproduced or utilized in any form or by any means, electronic or mechanical, including photocopying and microfilm, without permission in writing from either ISO at the address below or ISO's member body in the country of the requester.

ISO copyright office
Case postale 56. CH-1211 Geneva 20
Tel. + 41 22 749 01 11
Fax + 41 22 734 10 79
E-mail copyright@iso.ch
Web www.iso.ch

Contents

1 Scope

2 Normative references

3 Terms and definitions

4 Abbreviations

5 Syntax

6 Semantic encoding

     6.1 General

     6.2 All data element values

     6.3 Measures

     6.4 Currency amounts

7 Conformance to customer needs

Annex A (normative) Document identification

Bibliography

Index

Figures

Figure 1 — Example of general data element value

Figure 2 — Example of measure data element value

Figure 3 — Example of currency data element value

Tables

Table 1 — Examples of published syntax specifications

Table 2 — Examples of published concept dictionaries

Table 3 — Examples of data requirements statements

Foreword

ISO (the International Organization for Standardization) is a worldwide federation of national standards bodies (ISO member bodies). The work of preparing International Standards is normally carried out through ISO technical committees. Each member body interested in a subject for which a technical committee has been established has the right to be represented on that committee. International organizations, governmental and non-governmental, in liaison with ISO, also take part in the work. ISO collaborates closely with the International Electrotechnical Commission (IEC) on all matters of electrotechnical standardization.

International Standards are drafted in accordance with the rules given in the ISO/IEC Directives, Part 2.

The main task of technical committees is to prepare International Standards. Draft International Standards adopted by the technical committees are circulated to the member bodies for voting. Publication as an International Standard requires approval by at least 75 % of the member bodies casting a vote.

In other circumstances, particularly when there is an urgent market requirement for such documents, a technical committee may decide to publish other types of normative document:

An ISO/PAS or ISO/TS is reviewed after three years in order to decide whether it will be confirmed for a further three years, revised to become an International Standard, or withdrawn. If the ISO/PAS or ISO/TS is confirmed, it is reviewed again after a further three years, at which time it must either be transformed into an International Standard or be withdrawn.

Attention is drawn to the possibility that some of the elements of this document may be the subject of patent rights. ISO shall not be held responsible for identifying any or all such patent rights.

ISO/CD-TS 8000-110 was prepared by Technical Committee ISO/TC 184, Industrial automation systems and integration, Subcommittee SC4, Industrial data.

A complete list of parts of ISO 8000 is available from the Internet:

http://www.tc184-sc4.org/titles/CMS_Titles.htm.

Introduction

Typical consequences of poor data quality include customer dissatisfaction and lost revenue as well as higher costs associated with additional time to reconcile data, loss of credibility in a system and associated compliance problems. Data quality is dependent both on the quality of the data capture process as well as the processes used to store and transfer data. Finally managing data quality is also about managing the understanding and the expectations of quality and this requires the ability to objectively measure the costs and benefits of data quality.

While data quality is most commonly associated with transactional data, it is important to keep in mind that business information, such as designs, processes, machine instructions and trading partner information, constitutes an ever-increasing proportion of a business's assets and an ever-increasing proportion of this is stored in electronic form. Managing data quality is therefore an essential element in protecting the value of electronic information.

While the bit is the fundamental building block of electronically stored data, data elements are the fundamental building block of electronically stored information. Data elements consist of labeled data values where the data label provides the meaning for the data value.

It follows that as data element values are the fundamental building blocks of electronically stored information, the quality of the data element values is a major determinant of the quality of the information and consequently the accuracy and reliability of the knowledge that can be derived from the information.

  1. As the data label is essential to resolving the meaning of a data element value, the ability to retrieve the definition of a data element through its data label is an intrinsic component of establishing data quality.
  2. The quality of the definitions in terms of clarity of meaning to all stakeholders is also a contributing factor to data quality.
  3. A data element value represents an observation at a specific time and therefore the ability to track the origin in time of data element values is an intrinsic component of establishing data quality.
  4. Information is created as a result of a process owned by an organization, and therefore the ability to track the organization is an intrinsic component of establishing data quality.
  5. Data element values may be created or validated by an organization other than the originating organization of the information, and therefore the ability to track the cataloguing organization is an intrinsic component of establishing data quality.
  6. NOTE The history of the origination, cataloguing and transfer of ownership of information is commonly referred to as the provenance.

  7. The syntax or arrangement of data element values determines the ease with which data can be integrated within and across organizations and is therefore considered to be an intrinsic component of establishing data quality.
  8. The ability of data to satisfy user's requirements is determined by the existence of a minimum specified set of data element values and is therefore an intrinsic component of establishing data quality.

An assertion of accuracy is only meaningful if it is accompanied by reference to the standard or benchmark against which the data element values were assessed along with the identification of the organization that performed the test and the date and time the test was performed.

Where data is a product of an organization, an assertion of reliability is an assertion of the data quality management capabilities of the organization providing the data.

This Technical Specification provides general requirements for data quality, independent of syntax. It may be used with any other standard that specifies a syntax for a data set.


ISO/CD TS 8000-110

Information quality — Part 110: Master data quality: Syntax, semantic encoding, and conformance to customer requirements

1 Scope

This part of ISO 8000 specifies, for the purpose of achieving master data quality, requirements on the content of transactions that are used to exchange of master data between organizations.

EXAMPLE 1 This part of ISO 8000 does not apply to a credit charge transaction sent from a merchant to a credit card company, because the data being sent is not master data.

EXAMPLE 2 This part of ISO 8000 applies to a catalogue transaction, in which a supplier sends a customer characteristic data describing an item that the customer is considering buying.

EXAMPLE 3 This part of ISO 8000 does not apply to the contents of a company's internal parts catalogue, because the data are not part of a transaction.

The following are within the scope of this part of ISO 8000:

The following are outside the scope of this part of ISO 8000:

2 Normative references

The following referenced documents are indispensable for the application of this document. For dated references, only the edition cited applies. For undated references, the latest edition of the referenced document (including any amendments) applies.

ISO/IEC 6523-1, Information technology — Structure for the identification of organizations and organization parts — Part 1: Identification of organization identification schemes.

ISO/IEC 8824-1, Information technology — Abstract Syntax Notation One (ASN.1): Specification of basic notation.

ISO 10303-1, Industrial automation systems and integration — Product data representation and exchange — Part 1: Overview and fundamental principles.

3 Terms and definitions

For the purposes of this part of ISO 8000, the terms and definitions given in part 2 of this Technical Specification apply.

4 Abbreviations

For the purposes of this part of ISO 8000, the following abbreviations apply:

EDIFACTElectronic data interchange for administration, commerce and transport
NRICSCNatural Resources Industry Content Standardization Council

5 Syntax

Each data set shall contain a reference to the specification of the syntax to which the data set complies. The reference shall be a globally unambiguous identifier for the specific version of the specification of the syntax that was used to encode the data set. The reference shall be resolvable to the specification of the syntax through a mechanism that is publicly available, free of charge. The specification of the syntax shall be publicly available.

NOTE 1 It is not required that the specification of the syntax be free of charge. It should be available at a reasonable cost.

NOTE 2 Examples of published syntax specifications are given in Table 1.

Table 1 — Examples of published syntax specifications

Designation Name
ISO 22745-40 Open technical dictionaries and their application to cataloguing — Catalogue representation
ISO 9735-1:2002 Electronic data interchange for administration, commerce and transport (EDIFACT) — Application level syntax rules
ISO 13584-25:2004 Industrial automation systems and integration — Parts library — Part 25: Logical resource: Logical model of supplier library with aggregate values and explicit content
ISO 15926-6 Industrial automation systems and integration — Integration of life-cycle data for process plants including oil and gas production facilities — Part 6: Scope and representation for additional reference data

6 Semantic encoding

6.1 General

Each data element value shall reference all concepts necessary to unambiguously define its meaning. Each reference shall be to a concept dictionary entry contained in a concept dictionary. The reference shall preserve the integrity of the recipient’s data in that the reference to the concept dictionary entry may be integrated with the recipient’s own data without the creation of a joint work.

NOTE 1 Concept dictionary identifiers are subject to copyright and the use of an identifier within a document creates a joint work in which the owner of the copyright in the identifier has equal rights to register and enforce the copyright in the work, regardless of how their shares in the work are divided. A dictionary in which the concept identifiers are in the public domain will protect the recipient from the creation of a joint work.

  1. The concept dictionary is freely and anonymously downloadable in its entirety from the Internet.
  2. The concept dictionary supports an interface for free and anonymous resolution of a concept identifier to the concept dictionary entry that contains the supporting terminology.

NOTE 2 Examples of published concept dictionaries are given in Table 2.

Table 2 — Examples of published concept dictionaries

Designation Name
ISO 13584-501 dictionary Reference Dictionary for Measuring Instruments
ISO 13584-511 Reference Dictionary for Fasteners
ISO 15926-4 Reference Data for Integration of Life-cycle Data for Process Plants Including Oil and Gas Production Facilities
eOTD ECCMA Open Technical Dictionary
ECE/TRADE/362 Trade Data Elements Directory
UNTDED 2005

NOTE 3 The inclusion of ISO 13584-501, ISO 13584-511 and ISO 15926-4 in the final published version of this Technical Specification is subject to confirmation that they will be compliant with this clause (i.e., free download or resolution).

6.2 All data element values

Each data element value shall have a data label that contains a reference to the data element concept that provides its basic meaning. The reference shall be in the form of a globally unambiguous identifier for a concept dictionary entry.

EXAMPLE A flat washer has a property "material document ASTM A682". (The material document is the specification, standard, or manufacturer's reference document that identifies the material.) The property is represented by a data element value consisting of the following components:

In a particular concept dictionary, the concept dictionary entry for "material document" has identifier "0161-1#02-033785#1" The reference from the data element value to the concept dictionary entry is shown in Figure 1.


Figure 1 — Example of general data element value

6.3 Measures

In addition to the requirements of clause 6.2, each data element value that is a measure shall satisfy the requirements given in this clause.

The unit measure shall be clearly and unambiguously specified. The unit of measure shall be given in one of the following ways:

The data element value shall contain a reference to the concept dictionary entry that provides the meaning of the unit of measure. The reference shall be in the form of a globally unambiguous identifier for a concept dictionary entry.

NOTE 1 A name of a unit of measure (e.g., "centimeter") or an abbreviation used in human-readable text (e.g., "cm") does not meet the requirements of this clause, since it is not an identifier for a concept dictionary entry.

If the measure is qualified, then the data element value shall contain a reference to the concept dictionary entry that provides the meaning of the qualifier. The reference shall be in the form of a globally unambiguous identifier for a concept dictionary entry.

NOTE 2 Examples of qualifiers are "nominal", "minimum", and "maximum".

EXAMPLE A flat washer has a property "bolt thread diameter for which designed 10 mm nominal". (The bolt thread diameter for which designed is the length of a straight line which passes through the center of the threaded portion of the bolt, and terminates at the circumference.) The property is represented by a data element value consisting of the following components:

In a particular concept dictionary, the concept dictionary entry for "bolt thread diameter for which designed" has identifier "0161-1#02-001648#1", the concept dictionary entry for "millimetre" has identifier "0161-1#05-000845#1", and the concept dictionary entry for "nominal" has identifier "0161-1#06-000001#1". The references from the data element value components to the concept dictionary entries are shown in Figure 2.


Figure 2 — Example of measure data element value

6.4 Currency amounts

In addition to the requirements of clause 6.2, each data element value that is a currency amount shall satisfy the requirements given in this clause.

The data element value shall contain a reference to the currency. The reference shall be in the form of a globally unambiguous identifier for a concept dictionary entry.

EXAMPLE A flat washer has a property "unit price 42.36 Euro". (The unit price is the price of a single item.) The property is represented by a data element value consisting of the following components:

In a particular concept dictionary, the concept dictionary entry for "unit price" has identifier "0161-1#02-033797#1", and the concept dictionary entry for "Euro" has identifier "0161-1#08-002455#1". The references from the data element value components to the concept dictionary entries are shown in Figure 3.


Figure 3 — Example of currency data element value

7 Conformance to customer needs

Each data set shall contain a reference to the data requirements statement to which the data set complies. The reference shall be a globally unambiguous identifier for the specific version of the data requirements statement that was used to encode the data set. The reference shall be resolvable to the data requirements statement through a mechanism that is publicly available, free of charge. The data requirements statement shall be publicly available.

NOTE 1 It is not required that the data requirements statement be free of charge. It should be available at a reasonable cost.

NOTE 2 Examples of published data requirements statements are given in Table 3.

Table 3 — Examples of data requirements statements

Designation Description Format specification
  NRICSC identification guide for machine bolts ISO 22745-30
FIIG A003B Item Identification Guide for Bolts and Screws DoD 4100.39-M Volume 3
ISO 13584-511 Reference Dictionary for Fasteners ISO 13584-25

NOTE 3 An ISO 13584-25 compliant dictionary contains both a concept dictionary and a data requirements statement.

Annex A
(normative)

Document identification

To provide for unambiguous identification of an information object in an open system, the object identifier

{ iso standard 8000 part (110) version (1) }

is assigned to this Technical Specification. The meaning of this value is defined in ISO 8824-1, and is described in ISO 10303-1.

Bibliography

[1] ISO 7372, Trade data interchange — Trade data elements directory. 3rd ed.

[2] ISO 9735-1, Electronic data interchange for administration, commerce and transport (EDIFACT) — Application level syntax rules. 2nd ed.

[3] ISO 13584-25, Industrial automation systems and integration — Parts library — Part 25: Logical resource: Logical model of supplier library with aggregate values and explicit content. 1st ed.

[4] ISO 13584-501, Industrial automation systems and integration — Parts library — Part 501: Reference dictionary for measuring instruments — Registration procedure. 1st ed.

[5] ISO 13584-511, Industrial automation systems and integration — Parts library — Part 511: Mechanical systems and components for general use — Reference dictionary for fasteners. 1st ed.

[6] ISO/CD-TS 15926-4—1), Industrial automation systems and integration — Integration of life-cycle data for process plants including oil and gas production facilities — Part 4: Initial reference data. 1st ed.

[7] ISO 15926-6—1), Industrial automation systems and integration — Integration of life-cycle data for process plants including oil and gas production facilities — Part 6: Scope and representation for additional reference data.

[8] ISO 22745-40—1), Industrial automation systems and integration — Open technical dictionaries and their application to master data — Catalogue representation. 1st ed.

Index

concept 6.1
concept dictionary 6.1
concept reference 6.1
currency 6.4
currency reference 6.4
data element concept reference 6.2
data label 6.2
data requirements statement reference 7
measure data element value 6.3
qualifier of measure 6.3
qualifier reference 6.3
resolution 5
semantic encoding 6.1
syntax reference 5
syntax specification 5
unit of measure 6.3
unit of measure reference 6.3

1) To be published

© ISO 2007 — All rights reserved