Transfer Pricing & IP Valuation Benchmark Data Assets

Evidence Architecture

A benchmark you can't trace can't be defended.

Source-Linked Assets

Agreements, filings, patents, and market communications are used to construct records that preserve the relationship between the benchmark and the evidence behind it.

The purpose is practical: help professionals identify comparables, understand their context, and support analysis with records suitable for review.

Edgar AI Production Layer

The engine that produces the asset

Every benchmark record is produced, not merely collected and extracted. Edgar is DataAlchemist’s in-house, domain-trained AI model, embedded in a multi-stage production process that constructs each record before it is searched. It structures the agreement, classifies the asset by economic use, links evidence around the transaction, and prepares the record for domain-specialist review.

Most AI in professional data markets is applied at the interface: faster ways to search, screen, and select records after the data asset has already been built. Edgar is applied at production, where the benchmark record itself is constructed. A search layer, however capable, can only retrieve the structure, context, and substance already present in the data asset. Each DataAlchemist record may appear as a single row, but it is the output of a layered, multi-flow production system applied across the corpus.

01 · Construction

Production, not retrieval

Edgar supports the construction of benchmark records before they are searched.

02 · Structure

Substance by default

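Asset-level classification, payment structure, FAR (functions, assets, and risks), DEMPE (development, enhancement, maintenance, protection, and exploitation) indicators, and conduct evidence are built into the record.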

03 · Review

Validated, not auto-generated

A domain specialist confirms each record before it enters a benchmark set.

Edgar is the asset behind the assets.

Data Architecture

Benchmark Asset Families

Structured for transfer pricing, valuation, and advisory workflows.

Royalty Rates

License-agreement benchmarks built from filings, patents, and regulatory materials. Records include normalized royalty bases, licensed rights, IP type, exclusivity, payment structures, and asset-level industry classification.

Service Fees

Independent-party service fee benchmarks organized by service type, fee type, payment structure, and recipient industry. Used for service fee benchmarking, benefits tests, cost-plus support, management fees, and procurement arrangements.

Lease Rates

Lease and rental benchmarks drawn from real estate, equipment, facility, and vehicle disclosures. Structured by asset type, lessee industry, lease term, and payment frequency to support intercompany leases, shared facilities, and cost allocations.

Contracts Search

Clause-first search across publicly filed agreements and amendments. Built for research where the answer doesn't fit into a standard benchmark table: unusual fee structures, payment mechanics, rights language, and change-of-control provisions.

Evidence Packs

Evidence packs connect selected benchmarks directly to the materials behind them: agreements, clauses, patents, and filings. They let users move from a record to its source documents, supporting audit review, expert reports, and dispute files.
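As an illustration only, the record fields and source links described above could be modeled along these lines. This is a minimal sketch, not DataAlchemist's actual schema: the class names, field names, `build_evidence_pack` helper, and all example values are hypothetical placeholders.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class SourceDocument:
    """One piece of evidence behind a record (hypothetical shape)."""
    kind: str       # e.g. "agreement", "clause", "patent", "filing"
    reference: str  # free-text locator; placeholder values only


@dataclass
class RoyaltyRateRecord:
    """Hypothetical royalty-rate record that carries its own evidence."""
    record_id: str
    ip_type: str
    licensed_rights: str
    exclusive: bool
    royalty_base: str
    royalty_rate_pct: float
    payment_structure: str
    industry: str
    sources: list[SourceDocument] = field(default_factory=list)


def build_evidence_pack(records):
    """Group each selected record's sources by document kind.

    A record with no linked source is rejected, because an untraceable
    benchmark is exactly what the pack is meant to rule out.
    """
    pack = {}
    for rec in records:
        if not rec.sources:
            raise ValueError(f"record {rec.record_id} has no linked sources")
        by_kind = {}
        for src in rec.sources:
            by_kind.setdefault(src.kind, []).append(src.reference)
        pack[rec.record_id] = by_kind
    return pack


# Placeholder data for illustration; not a real agreement or patent.
example = RoyaltyRateRecord(
    record_id="example-001",
    ip_type="patent",
    licensed_rights="make, use, sell",
    exclusive=True,
    royalty_base="net_sales",
    royalty_rate_pct=5.0,
    payment_structure="running royalty",
    industry="example industry",
    sources=[
        SourceDocument("agreement", "placeholder agreement reference"),
        SourceDocument("patent", "placeholder patent reference"),
    ],
)

pack = build_evidence_pack([example])
```

The design choice worth noting is that the sources travel with the record rather than living in a separate lookup, so a benchmark cannot be selected without its evidence being available.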

Data Infrastructure

Sourcing & Curation

01

Source Coverage

We work from SEC EDGAR filings, USPTO and EPO patent records, court and regulatory filings, and public-company disclosures across multiple jurisdictions. The objective is to preserve the evidentiary chain, not just accumulate documents.

02

Normalization

Terms are normalized to common fields and units, and descriptions are enriched where source language is thin, so comparables align across agreements, industries, and jurisdictions.

03

Analytical Structure

Each asset is structured according to the analysis it supports. The same discipline applies across all data: structured fields, source-linked records, contextual enrichment, and a comparability-oriented structure.
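To make the normalization step concrete, here is a minimal sketch of mapping raw agreement wording to common fields and units. It assumes rates arrive as short strings in percent or basis points. The synonym table, the function names, and the `"unclassified"` fallback are hypothetical and do not describe the production rules.

```python
import re

# Hypothetical synonym table: raw wording -> canonical royalty base.
BASE_SYNONYMS = {
    "net sales": "net_sales",
    "net revenue": "net_sales",
    "net revenues": "net_sales",
    "gross sales": "gross_sales",
    "gross revenue": "gross_sales",
    "gross profit": "gross_profit",
}

# Matches a number followed by a unit, e.g. "5%", "7.5 percent", "250 bps".
_RATE_PATTERN = re.compile(
    r"^\s*(\d+(?:\.\d+)?)\s*(%|percent|bps|basis points)\s*$",
    re.IGNORECASE,
)


def normalize_rate(raw: str) -> float:
    """Convert a rate expression to a common unit (percent)."""
    match = _RATE_PATTERN.match(raw)
    if match is None:
        # Fail loudly instead of guessing at an unfamiliar expression.
        raise ValueError(f"unrecognized rate expression: {raw!r}")
    value = float(match.group(1))
    unit = match.group(2).lower()
    if unit in ("bps", "basis points"):
        return value / 100.0  # 100 basis points = 1 percent
    return value


def normalize_base(raw: str) -> str:
    """Map free-text royalty base wording to a canonical label."""
    key = " ".join(raw.lower().split())  # lowercase, collapse whitespace
    return BASE_SYNONYMS.get(key, "unclassified")
```

In this sketch, unknown base wording falls back to `"unclassified"` so that a record is flagged for review and is not silently forced into the nearest category.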

Guiding Principles

Clarity Over Volume

DataAlchemist prioritizes traceable, normalized, source-linked benchmark records over undifferentiated document volume.

01

True Comparables

A large dataset produces weak analysis if records are misclassified. We emphasize cleaner classification and fewer false comparables.

02

Contextual Breadth

We do not reduce transactions to a handful of extracted fields. Each record carries the wider context around the transaction, drawn from evidence beyond the agreement itself.

03

Defensibility

For professional benchmarking work, the important question is not whether a record exists, but whether it can be understood, compared, and defended.

Transfer Pricing and Valuation Benchmark Data Assets Built from Official Sources