Ontology
An ontology defines formal concepts, their properties and relationships within a domain model. It provides a shared vocabulary for data integration, interoperability and semantic querying.
Classification
- ComplexityHigh
- Impact areaTechnical
- Decision typeArchitectural
- Organizational maturityAdvanced
Technical context
Principles & goals
Use cases & scenarios
Compromises
- Overly complex models that reduce adoption and maintainability.
- Incorrect or inconsistent term definitions cause inconsistencies.
- Lack of governance leads to divergent extensions and silos.
- Start with a lean core model and iterate.
- Involve domain experts early and continuously.
- Version and document changes clearly and traceably.
I/O & resources
- Domain terms, glossaries and requirement documents
- Sample data and source schemas
- Access to subject-matter experts for clarifications
- Machine-readable ontology models (e.g. OWL/RDF)
- Mapping tables and transformation rules
- Governance documents and versioning schemes
Description
Ontologies are formal, machine-readable models that precisely describe concepts, relations and rules of a domain. They harmonize data, enable semantic queries and improve interoperability across systems. Ontology engineering combines domain expertise with technical representations.
✔Benefits
- Improved interoperability through unified terms.
- Enables semantic queries and reasoning-based analyses.
- Supports consistent metric and data interpretation.
✖Limitations
- High initial effort for modelling and governance.
- Maintenance effort when domain requirements change.
- Not all data problems can be solved by ontologies alone.
Trade-offs
Metrics
- Domain coverage
Percentage of relevant terms covered by the ontology.
- Number of reused concepts
How many concepts are used across multiple systems or reports.
- Validation errors per release
Number of semantic inconsistencies or modelling errors after changes.
Examples & implementations
Schema.org (web ontology)
A widely used vocabulary for structuring web content and entities.
FOAF (person relationships)
A lightweight ontology model describing people and their relationships.
SNOMED CT (medical terminology)
An extensive standardized ontology for clinical terms and codings.
Implementation steps
Inventory domain terms and schemas.
Create an initial ontology model with core entities.
Implement mappings and validation rules.
Establish governance and release procedures.
⚠️ Technical debt & bottlenecks
Technical debt
- Unclear or non-versioned concept changes.
- Ad-hoc mappings without a central standard.
- Missing tests for ontology integrity after changes.
Known bottlenecks
Misuse examples
- Using ontology as a substitute for poor data cleansing.
- Overfitting the ontology to a specific tool format.
- Uncontrolled free extensions by many teams.
Typical traps
- Insufficient test cases for semantic validation.
- Misunderstandings between technical and domain stakeholders.
- Unconsidered performance requirements for reasoning.
Required skills
Architectural drivers
Constraints
- • Existing data quality and semantics must be acceptable.
- • Organisational agreement for shared vocabularies required.
- • Platforms must support RDF/OWL or mappings.