1
Identify the parties and describe the licensed data
Enter the full legal names of the licensor and licensee, and attach a Schedule A describing the dataset precisely β format, volume, update frequency, delivery method, and any version or snapshot date.
π‘ A vague description like 'company data' creates disputes. Reference the specific dataset name, file format (e.g., CSV, API feed, SQL database), and the date range of records covered.
2
Define the permitted use and field of use
Write out specifically what the licensee may do with the data β internal analytics, resale to end users, training statistical models β and explicitly list prohibited uses. Include any geographic or industry-sector restrictions.
π‘ If you are unsure whether a use case should be permitted, prohibit it explicitly and address it in a separate addendum with its own fee. Ambiguous permissions create licensing disputes.
3
Choose the license type and exclusivity
Decide whether the license is exclusive (only this licensee may use the data in this field of use) or non-exclusive (the licensor can license the same data to others). Price and duration should reflect the exclusivity decision.
π‘ Exclusive licenses in a narrow field of use command a significant premium β typically 3β5Γ the non-exclusive fee β and should include minimum revenue or usage commitments from the licensee.
4
Set the fee structure and audit mechanism
Enter the license fee amount, payment frequency, and whether fees are flat, tiered, or usage-based. Include the audit-rights clause with a specified notice period and frequency cap.
π‘ For usage-based fees, require the licensee to provide a monthly usage report. Self-reported usage without an audit right is practically unenforceable.
5
Address IP ownership of derivative works
Decide whether derivative works β reports, models, or products built using the licensed data β belong to the licensee, the licensor, or are jointly owned. Document this clearly in the IP clause.
π‘ If the licensee will build AI or ML models using the data, address model ownership explicitly. Courts in most jurisdictions have not yet settled whether training a model on licensed data creates a derivative work.
6
Specify confidentiality and security requirements
Reference a named security standard (ISO 27001, SOC 2 Type II, or equivalent) rather than generic 'reasonable measures.' List which internal roles qualify as Authorized Users and whether contractors are included.
π‘ Require the licensee to notify you within 72 hours of any suspected unauthorized access β this mirrors GDPR breach notification windows and creates a consistent compliance posture.
7
Set the term, renewal, and data-return obligations
Enter the start date, initial term, and auto-renewal mechanics. Include a data-return or destruction clause requiring written certification within 30 days of termination.
π‘ If the data is delivered incrementally over time (e.g., a daily data feed), specify whether the licensee may retain historical snapshots after termination or must delete all accumulated data.
8
Execute before data access is granted
Both parties must sign the agreement before any data is transferred or API credentials are issued. Retroactive agreements are harder to enforce and create a gap in IP protection.
π‘ Use a click-wrap acceptance mechanism for self-serve data products β a signed PDF is better for negotiated enterprise licenses. Never deliver data access credentials in the same email as an unsigned draft.