- Data Governance
- The set of policies, processes, roles, and standards that define how an organization manages data as a strategic asset.
- Data Owner
- The individual or team accountable for the accuracy, security, and appropriate use of a specific data domain or dataset.
- Data Steward
- A person responsible for the day-to-day management of a data domain β enforcing quality rules, resolving issues, and maintaining metadata.
- Data Classification
- A tiered labeling system that assigns sensitivity levels to data (e.g., public, internal, confidential, restricted) to determine handling and access rules.
- Data Lineage
- A traceable record of where data originated, how it has moved through systems, and how it has been transformed β used to diagnose quality issues and validate reports.
- Metadata
- Descriptive information about a dataset β its source, format, owner, update frequency, and definitions β that makes the data discoverable and interpretable.
- Master Data Management (MDM)
- A discipline that ensures a single, authoritative version of core business entities β customers, products, locations β is maintained and shared across systems.
- Data Quality Dimensions
- Standard criteria for evaluating data quality, typically including accuracy, completeness, consistency, timeliness, and validity.
- Access Control
- Rules that define which users or systems can read, write, or delete specific data, typically enforced through roles and permissions.
- Data Catalog
- A centralized inventory of an organization's data assets, including definitions, ownership, quality scores, and lineage β the searchable index of governed data.
- PII (Personally Identifiable Information)
- Any data that can identify a specific individual β such as name, email, Social Security number, or IP address β subject to privacy regulations.
- Data Lifecycle
- The full arc of a dataset from creation or ingestion through active use, archival, and eventual deletion or destruction.