- Data Pipeline
- An automated sequence of steps that ingests, transforms, and loads data from source systems into a destination β such as a data warehouse or analytics platform.
- ETL / ELT
- Extract, Transform, Load (or Extract, Load, Transform) β the process of moving and reshaping data from operational systems into analytical stores.
- Data Warehouse
- A centralized repository optimized for analytical queries, storing structured data from multiple source systems β common examples include Snowflake, BigQuery, and Redshift.
- Exempt vs. Non-Exempt Classification
- A US FLSA distinction determining overtime eligibility; data engineers typically qualify as exempt under the computer employee or highly compensated exemptions if they meet salary and duties tests.
- IP Assignment
- A clause transferring ownership of code, models, documentation, and other work product created by the employee to the employer during the employment relationship.
- At-Will Employment
- Employment that either party may end at any time for any lawful reason without advance notice β the default standard in most US states.
- Data Governance
- The set of policies, standards, and processes that define how data is collected, stored, accessed, and used across an organization.
- SLA (Service Level Agreement)
- A documented commitment to pipeline uptime, latency thresholds, or data freshness targets that the data engineer is responsible for meeting.
- Confidential Information
- Non-public business information β including data architecture, customer data, proprietary algorithms, and financial data β that the employee is prohibited from disclosing.
- Probationary Period
- A defined initial employment period β typically 30 to 90 days β during which performance is evaluated and termination formalities may be reduced.