Data assets

Transfer Pricing and Valuation Benchmark Data Assets Built from Official Sources

DataAlchemist provides structured benchmark data assets for transfer pricing, IP valuation, and advisory work.

Each asset is built from official source materials, normalized into comparable fields, enriched with context, and linked back to the evidence behind the benchmark.

Coverage

Public filings define the available universe. DataAlchemist's edge is completeness, currency, and substance: how source records are maintained, structured, enriched, and made usable for benchmarking.

Source Base

DataAlchemist works from the public-source universe available to the market: SEC-filed agreements, franchise disclosure materials, public-company filings, regulatory records, patents, and other official sources across jurisdictions. SEC EDGAR is the primary source base, but each benchmark record is built from broader international public evidence around the transaction.

Span & Currency

Coverage extends across the SEC electronic-filing era, with critical volume building from the mid-1990s mandate through the current filing record. Source collections are refreshed on a continuing basis, so benchmark records reflect newly disclosed agreements, filings, and supporting evidence as they become available.

Breadth & Depth

Records are structured within the agreement and connected beyond it. Depth comes from payment mechanics, rights, clauses, asset-level classification, contextualized descriptions, and agreement-level FAR and DEMPE indicators. Breadth comes from external conduct evidence and market context drawn from corporate filings, patents, regulatory materials, and market disclosures where available.

Evidence Architecture

A benchmark you can't trace can't be defended.

Source-Linked Assets

Agreements, filings, patents, and market communications are used to construct records that preserve the relationship between the benchmark and the evidence behind it.

The purpose is practical: help professionals identify comparables, understand their context, and support analysis with records suitable for review.

Edgar AI · Production Layer

The engine that produces the asset

Every benchmark record is produced, not merely collected and extracted. Edgar is DataAlchemist’s in-house, domain-trained AI model, embedded in a multi-stage production process that constructs each record before it is searched. It structures the agreement, classifies the asset by economic use, links evidence around the transaction, and prepares the record for domain-specialist review.

Most AI in professional data markets is applied at the interface: faster ways to search, screen, and select records after the data asset has already been built. Edgar is applied at production, where the benchmark record itself is constructed. A search layer, however capable, can only retrieve the structure, context, and substance already present in the data asset. Each DataAlchemist record may appear as a single row, but it is the output of a layered, multi-flow production system applied across the corpus.

01 · Construction

Production, not retrieval

Edgar supports the construction of benchmark records before they are searched.

02 · Structure

Substance by default

Asset-level classification, payment structure, FAR, DEMPE indicators, and conduct evidence are built into the record.

03 · Review

Validated, not auto-generated

A domain specialist confirms each record before it enters a benchmark set.

Edgar is the asset behind the assets.

Data Architecture

Benchmark Asset Families

Structured for transfer pricing, valuation, and advisory workflows.

Royalty Rates

License-agreement benchmarks built from filings, patents, and regulatory materials. Records include normalized royalty bases, licensed rights, IP type, exclusivity, payment structures, and asset-level industry classification.

Service Fees

Independent-party service fee benchmarks organized by service type, fee type, payment structure, and recipient industry. Used for service fee benchmarking, benefits tests, cost-plus support, management fees, and procurement arrangements.

Lease Rates

Lease and rental benchmarks drawn from real estate, equipment, facility, and vehicle disclosures. Structured by asset type, lessee industry, lease term, and payment frequency to support intercompany leases, shared facilities, and cost allocations.

Contracts Search

Clause-first search across publicly filed agreements and amendments. Built for research where the answer doesn't fit into a standard benchmark table: unusual fee structures, payment mechanics, rights language, and change-of-control provisions.

Evidence Packs

Packs that connect selected benchmarks directly to the materials behind them: agreements, clauses, patents, and filings. Designed to let users move from a record to its source documents, supporting audit review, expert reports, and dispute files.
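
For readers who think in data structures, the record-to-source link described above can be pictured roughly as follows. This is a minimal sketch, not DataAlchemist's schema: the names `SourceRef`, `BenchmarkRecord`, `sources_of_kind`, and `is_traceable`, and the rule that a traceable record needs at least one agreement attached, are all assumptions made for illustration.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass(frozen=True)
class SourceRef:
    """Hypothetical pointer from a benchmark record to one piece of evidence."""
    kind: str      # e.g. "agreement", "clause", "patent", "filing"
    locator: str   # placeholder identifier for the source document


@dataclass
class BenchmarkRecord:
    """Hypothetical benchmark record that carries its own evidence links."""
    record_id: str
    description: str
    sources: List[SourceRef] = field(default_factory=list)

    def sources_of_kind(self, kind: str) -> List[SourceRef]:
        # Return only the evidence of one type, in the order it was attached.
        return [s for s in self.sources if s.kind == kind]

    def is_traceable(self) -> bool:
        # Assumption: "traceable" means at least one underlying agreement is linked.
        return any(s.kind == "agreement" for s in self.sources)
```

In this sketch, a record with no agreement attached reports `is_traceable()` as false, so it could be held back from an evidence pack until its source is linked.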

Data Infrastructure

Sourcing & Curation

01

Source Coverage

We work from SEC EDGAR filings, USPTO and EPO patent records, court and regulatory filings, and public-company disclosures across multiple jurisdictions. The objective is to preserve the evidentiary chain, not just accumulate documents.

02

Normalization

Terms are normalized to common fields and units, and descriptions are enriched where source language is thin, so that comparables align across agreements, industries, and jurisdictions.
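
As a rough illustration of what normalizing to common fields and units can involve, the sketch below maps royalty terms written in different forms onto a single percentage field and a common base label. It is a simplified example under stated assumptions: the `NormalizedRate` type, the `BASE_ALIASES` table, and the parsing rules are hypothetical and do not describe DataAlchemist's actual fields or production process.

```python
import re
from dataclasses import dataclass
from typing import Optional


@dataclass
class NormalizedRate:
    rate_pct: float   # royalty rate expressed as a percentage of the base
    base: str         # common label for the royalty base


# Hypothetical mapping from source wording to a common base label.
BASE_ALIASES = {
    "net sales": "net_sales",
    "net revenues": "net_sales",
    "net revenue": "net_sales",
    "gross sales": "gross_sales",
    "gross revenue": "gross_sales",
}


def normalize_rate(text: str) -> Optional[NormalizedRate]:
    """Parse phrases such as '5% of Net Sales' or '250 basis points of gross revenue'."""
    lowered = text.lower()
    match = re.search(r"(\d+(?:\.\d+)?)\s*(%|percent|basis points|bps)", lowered)
    if match is None:
        return None  # no recognizable rate, e.g. a flat fee
    value = float(match.group(1))
    unit = match.group(2)
    # Basis points are hundredths of a percent; other units are already percent.
    rate_pct = value / 100.0 if unit in ("basis points", "bps") else value
    # Check longer aliases first so the most specific wording wins.
    for alias in sorted(BASE_ALIASES, key=len, reverse=True):
        if alias in lowered:
            return NormalizedRate(rate_pct=rate_pct, base=BASE_ALIASES[alias])
    return NormalizedRate(rate_pct=rate_pct, base="unspecified")
```

With both terms reduced to the same unit and base label, "5% of Net Sales" and "500 basis points of net revenues" land in the same comparable field instead of reading as different arrangements.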

03

Analytical Structure

Each asset is structured according to the analysis it supports. The discipline is the same across all data assets: structured fields, source-linked records, contextual enrichment, and a comparability-oriented structure.

Guiding Principles

Clarity Over Volume

DataAlchemist prioritizes traceable, normalized, source-linked benchmark records over undifferentiated document volume.

01

True Comparables

A large dataset produces weak analysis if records are misclassified. We emphasize cleaner classification and fewer false comparables.

02

Contextual Breadth

We do not reduce transactions to a handful of extracted fields. Each record carries the wider context around the transaction, drawn from evidence beyond the agreement itself.

03

Defensibility

For professional benchmarking work, the important question is not whether a record exists, but whether it can be understood, compared, and defended.

Request a Data Assets Walkthrough

Review the benchmark data assets behind DataAlchemist: royalty rates, service fees, lease benchmarks, contracts, and evidence packs.
